Query 015021
Match_columns 414
No_of_seqs 97 out of 116
Neff 5.0
Searched_HMMs 46136
Date Fri Mar 29 02:20:04 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/015021.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/015021hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF14655 RAB3GAP2_N: Rab3 GTPa 100.0 2E-100 5E-105 782.6 38.0 365 28-405 1-405 (415)
2 KOG2727 Rab3 GTPase-activating 100.0 1.1E-90 2.3E-95 739.9 20.1 381 1-405 1-413 (1244)
3 cd00200 WD40 WD40 domain, foun 97.9 0.018 4E-07 51.1 25.7 81 305-404 175-255 (289)
4 KOG0273 Beta-transducin family 97.8 0.00026 5.7E-09 74.2 12.2 53 295-347 440-492 (524)
5 KOG0271 Notchless-like WD40 re 97.4 0.0039 8.5E-08 64.6 14.8 79 303-402 363-443 (480)
6 KOG0263 Transcription initiati 96.9 0.016 3.4E-07 64.0 13.8 83 298-399 568-650 (707)
7 KOG0266 WD40 repeat-containing 96.7 0.011 2.3E-07 62.2 10.2 88 299-405 195-283 (456)
8 PTZ00421 coronin; Provisional 96.6 0.014 3.1E-07 62.3 11.0 88 299-405 117-205 (493)
9 KOG1539 WD repeat protein [Gen 96.6 0.066 1.4E-06 60.0 16.1 286 77-403 203-528 (910)
10 PLN00181 protein SPA1-RELATED; 96.5 1.2 2.6E-05 49.9 25.9 93 33-129 487-585 (793)
11 KOG0266 WD40 repeat-containing 96.5 0.014 3E-07 61.3 10.2 85 299-402 238-322 (456)
12 KOG0299 U3 snoRNP-associated p 96.5 0.0039 8.5E-08 65.4 6.0 82 305-405 200-281 (479)
13 KOG0319 WD40-repeat-containing 96.2 0.41 8.8E-06 53.3 19.2 86 299-405 497-584 (775)
14 cd00200 WD40 WD40 domain, foun 96.2 0.034 7.5E-07 49.4 9.3 82 305-405 133-214 (289)
15 KOG0649 WD40 repeat protein [G 96.0 0.035 7.6E-07 55.0 8.8 82 309-409 116-197 (325)
16 KOG0263 Transcription initiati 95.9 0.021 4.5E-07 63.1 7.8 88 299-405 527-614 (707)
17 KOG0271 Notchless-like WD40 re 95.9 0.026 5.7E-07 58.6 7.9 96 297-406 147-243 (480)
18 PF12341 DUF3639: Protein of u 95.8 0.014 3.1E-07 38.9 3.6 27 76-106 1-27 (27)
19 KOG0291 WD40-repeat-containing 95.8 1.7 3.6E-05 49.0 21.3 112 34-160 101-221 (893)
20 KOG0639 Transducin-like enhanc 95.0 0.027 5.8E-07 60.3 4.5 77 308-404 510-587 (705)
21 KOG0265 U5 snRNP-specific prot 94.8 0.097 2.1E-06 53.0 7.6 79 310-406 93-171 (338)
22 KOG2727 Rab3 GTPase-activating 94.7 0.039 8.5E-07 62.5 4.9 84 140-223 150-243 (1244)
23 PTZ00421 coronin; Provisional 94.5 0.26 5.6E-06 52.9 10.5 81 303-402 164-249 (493)
24 KOG0315 G-protein beta subunit 94.5 0.28 6.1E-06 48.9 9.8 87 299-404 207-294 (311)
25 KOG1446 Histone H3 (Lys4) meth 94.2 0.26 5.7E-06 49.9 9.2 82 308-405 188-269 (311)
26 KOG0279 G protein beta subunit 94.1 0.29 6.3E-06 49.3 9.2 88 301-405 99-187 (315)
27 KOG0316 Conserved WD40 repeat- 94.1 0.18 4E-06 49.9 7.6 84 297-399 49-132 (307)
28 KOG1273 WD40 repeat protein [G 93.9 0.18 4E-06 51.6 7.5 87 301-412 18-104 (405)
29 TIGR03866 PQQ_ABC_repeats PQQ- 93.8 0.54 1.2E-05 43.7 10.1 76 310-405 209-286 (300)
30 TIGR03866 PQQ_ABC_repeats PQQ- 93.8 0.49 1.1E-05 44.0 9.8 75 311-404 34-109 (300)
31 PTZ00420 coronin; Provisional 93.8 0.47 1E-05 52.0 10.9 87 299-405 117-204 (568)
32 KOG0295 WD40 repeat-containing 93.7 5.8 0.00013 41.5 17.8 92 309-406 237-330 (406)
33 PF00400 WD40: WD domain, G-be 93.0 0.18 4E-06 33.9 4.1 36 300-335 4-39 (39)
34 PTZ00420 coronin; Provisional 93.0 0.99 2.2E-05 49.5 11.8 50 308-357 168-220 (568)
35 PLN00181 protein SPA1-RELATED; 92.7 0.66 1.4E-05 51.9 10.3 79 309-405 534-613 (793)
36 PF14727 PHTB1_N: PTHB1 N-term 91.6 1.3 2.9E-05 46.8 10.4 70 80-154 23-102 (418)
37 KOG0308 Conserved WD40 repeat- 91.2 0.73 1.6E-05 50.9 8.0 80 307-405 171-250 (735)
38 KOG1274 WD40 repeat protein [G 91.0 0.7 1.5E-05 52.5 7.9 108 9-127 442-570 (933)
39 KOG0272 U4/U6 small nuclear ri 90.6 0.93 2E-05 47.8 7.9 76 307-403 303-380 (459)
40 KOG0284 Polyadenylation factor 90.6 0.44 9.5E-06 50.1 5.4 79 309-408 182-262 (464)
41 PF02239 Cytochrom_D1: Cytochr 89.9 1.9 4E-05 44.6 9.4 50 303-352 32-81 (369)
42 KOG0646 WD40 repeat protein [G 89.8 3.6 7.7E-05 44.0 11.4 203 88-346 91-316 (476)
43 KOG0279 G protein beta subunit 88.9 2.2 4.9E-05 43.1 8.8 78 307-403 148-227 (315)
44 PF15492 Nbas_N: Neuroblastoma 88.8 0.63 1.4E-05 46.7 4.8 43 304-346 226-268 (282)
45 KOG0277 Peroxisomal targeting 88.5 1.5 3.2E-05 44.1 7.1 88 297-402 94-183 (311)
46 PF15492 Nbas_N: Neuroblastoma 88.2 2.5 5.4E-05 42.6 8.6 76 35-113 3-78 (282)
47 KOG0282 mRNA splicing factor [ 88.1 1.2 2.6E-05 47.7 6.6 67 310-396 435-503 (503)
48 KOG0319 WD40-repeat-containing 88.1 2.2 4.7E-05 47.8 8.8 82 305-405 461-542 (775)
49 KOG0283 WD40 repeat-containing 87.6 3.1 6.7E-05 46.7 9.7 69 88-164 461-541 (712)
50 KOG0285 Pleiotropic regulator 86.8 4.7 0.0001 42.2 9.8 50 301-350 145-194 (460)
51 KOG0318 WD40 repeat stress pro 86.8 3.3 7.1E-05 45.0 9.0 40 303-342 483-522 (603)
52 KOG1407 WD40 repeat protein [F 86.7 0.58 1.3E-05 46.9 3.2 47 299-345 181-227 (313)
53 KOG0291 WD40-repeat-containing 86.3 12 0.00026 42.5 13.1 88 35-131 313-404 (893)
54 PF08662 eIF2A: Eukaryotic tra 86.3 5.6 0.00012 37.1 9.4 75 310-403 103-183 (194)
55 PF08662 eIF2A: Eukaryotic tra 86.2 5.3 0.00011 37.3 9.2 76 309-405 61-140 (194)
56 KOG0288 WD40 repeat protein Ti 86.0 1.5 3.2E-05 46.3 5.8 42 310-351 390-431 (459)
57 TIGR02658 TTQ_MADH_Hv methylam 85.8 4.5 9.7E-05 41.9 9.2 67 318-404 11-91 (352)
58 KOG0281 Beta-TrCP (transducin 84.9 2.6 5.6E-05 44.0 6.8 98 308-405 236-355 (499)
59 PF08596 Lgl_C: Lethal giant l 84.8 4.8 0.0001 42.2 9.0 64 43-106 99-171 (395)
60 PF12894 Apc4_WD40: Anaphase-p 84.5 1.9 4E-05 32.1 4.2 36 76-113 11-46 (47)
61 KOG0296 Angio-associated migra 84.2 12 0.00026 39.2 11.3 98 303-403 102-225 (399)
62 KOG0275 Conserved WD40 repeat- 84.1 1.2 2.5E-05 46.0 4.0 81 306-405 305-385 (508)
63 KOG4640 Anaphase-promoting com 83.8 2.8 6.1E-05 46.3 6.9 84 34-126 25-112 (665)
64 KOG0276 Vesicle coat complex C 83.8 16 0.00035 40.7 12.6 211 77-411 56-269 (794)
65 KOG0306 WD40-repeat-containing 83.5 2.8 6.1E-05 47.2 6.8 89 299-406 99-187 (888)
66 KOG0318 WD40 repeat stress pro 83.1 8 0.00017 42.2 9.8 88 302-405 185-272 (603)
67 KOG1912 WD40 repeat protein [G 82.6 2.2 4.8E-05 48.3 5.7 66 320-403 80-148 (1062)
68 KOG0307 Vesicle coat complex C 82.2 9.8 0.00021 44.5 10.7 76 57-134 47-131 (1049)
69 KOG0316 Conserved WD40 repeat- 82.0 3.8 8.2E-05 40.9 6.4 77 310-403 186-262 (307)
70 KOG0282 mRNA splicing factor [ 81.8 1.2 2.5E-05 47.7 3.1 85 301-404 208-294 (503)
71 COG2319 FOG: WD40 repeat [Gene 81.6 18 0.0004 33.2 10.6 86 302-405 150-236 (466)
72 KOG2110 Uncharacterized conser 80.2 12 0.00026 39.2 9.6 82 308-413 130-214 (391)
73 KOG0293 WD40 repeat-containing 80.2 6.5 0.00014 41.8 7.8 261 15-405 214-477 (519)
74 KOG0646 WD40 repeat protein [G 80.1 3.4 7.4E-05 44.1 5.8 71 308-398 82-153 (476)
75 KOG0286 G-protein beta subunit 79.7 9.2 0.0002 39.1 8.4 52 299-350 178-230 (343)
76 KOG2321 WD40 repeat protein [G 79.5 12 0.00027 41.3 9.8 112 15-129 154-281 (703)
77 KOG3914 WD repeat protein WDR4 79.3 5.9 0.00013 41.6 7.1 76 310-405 154-230 (390)
78 KOG0286 G-protein beta subunit 79.1 13 0.00027 38.2 9.1 76 309-404 99-180 (343)
79 KOG1408 WD40 repeat protein [F 79.0 2.7 5.8E-05 47.3 4.8 83 301-399 453-535 (1080)
80 KOG0285 Pleiotropic regulator 78.8 11 0.00023 39.7 8.7 84 310-412 238-323 (460)
81 KOG1538 Uncharacterized conser 78.7 5.9 0.00013 44.5 7.2 40 309-350 14-54 (1081)
82 KOG0280 Uncharacterized conser 78.5 11 0.00024 38.5 8.6 77 309-403 123-202 (339)
83 KOG1240 Protein kinase contain 78.3 8.6 0.00019 45.6 8.7 81 310-405 1198-1280(1431)
84 KOG1188 WD40 repeat protein [G 78.1 3.7 7.9E-05 42.6 5.1 66 320-404 41-108 (376)
85 KOG0289 mRNA splicing factor [ 78.0 13 0.00029 39.7 9.3 79 308-404 304-383 (506)
86 KOG1446 Histone H3 (Lys4) meth 78.0 4.2 9E-05 41.5 5.4 39 311-349 236-274 (311)
87 KOG0264 Nucleosome remodeling 77.9 10 0.00022 40.3 8.4 46 306-351 271-318 (422)
88 KOG0281 Beta-TrCP (transducin 77.7 1.9 4E-05 45.0 2.9 79 301-402 354-432 (499)
89 KOG0306 WD40-repeat-containing 76.7 12 0.00026 42.5 8.9 115 33-156 24-147 (888)
90 PF03178 CPSF_A: CPSF A subuni 76.2 23 0.0005 35.0 10.2 52 75-127 22-96 (321)
91 smart00320 WD40 WD40 repeats. 75.9 5.3 0.00011 23.7 3.7 33 303-335 8-40 (40)
92 KOG1063 RNA polymerase II elon 75.2 19 0.0004 40.6 9.8 56 76-131 664-735 (764)
93 PF14779 BBS1: Ciliary BBSome 75.0 6.3 0.00014 39.3 5.7 53 77-129 177-234 (257)
94 KOG0310 Conserved WD40 repeat- 74.4 6.3 0.00014 42.3 5.9 78 307-402 68-145 (487)
95 PRK01742 tolB translocation pr 74.0 10 0.00022 39.4 7.3 44 311-355 336-379 (429)
96 PF00780 CNH: CNH domain; Int 74.0 25 0.00054 33.5 9.5 68 38-111 102-169 (275)
97 KOG0273 Beta-transducin family 74.0 17 0.00038 39.2 8.9 55 75-131 234-288 (524)
98 KOG0265 U5 snRNP-specific prot 73.8 22 0.00049 36.5 9.3 86 299-403 39-125 (338)
99 KOG1539 WD repeat protein [Gen 73.7 15 0.00033 42.0 8.8 58 303-362 572-629 (910)
100 KOG0275 Conserved WD40 repeat- 73.3 4.9 0.00011 41.6 4.6 87 306-403 212-298 (508)
101 KOG0296 Angio-associated migra 72.6 9.6 0.00021 39.9 6.5 53 77-129 284-337 (399)
102 KOG2048 WD40 repeat protein [G 72.6 23 0.00049 39.7 9.7 80 307-404 154-239 (691)
103 KOG0274 Cdc4 and related F-box 72.5 12 0.00027 40.8 7.7 78 306-405 330-408 (537)
104 KOG1963 WD40 repeat protein [G 71.4 12 0.00025 42.7 7.3 78 310-402 208-285 (792)
105 COG2319 FOG: WD40 repeat [Gene 70.7 36 0.00077 31.3 9.3 80 303-402 194-275 (466)
106 KOG4328 WD40 protein [Function 69.9 1.7E+02 0.0037 31.8 17.2 80 76-164 186-274 (498)
107 KOG0267 Microtubule severing p 69.7 6.1 0.00013 44.5 4.6 83 302-403 149-231 (825)
108 KOG0294 WD40 repeat-containing 69.4 34 0.00073 35.5 9.4 50 82-131 211-261 (362)
109 KOG1517 Guanine nucleotide bin 67.9 34 0.00073 40.6 10.0 57 75-132 1207-1269(1387)
110 KOG1274 WD40 repeat protein [G 67.8 6 0.00013 45.4 4.2 46 303-348 134-179 (933)
111 PF12894 Apc4_WD40: Anaphase-p 66.2 15 0.00032 27.3 4.7 30 310-340 14-43 (47)
112 KOG2695 WD40 repeat protein [G 66.2 12 0.00026 39.2 5.6 74 315-405 307-383 (425)
113 TIGR02658 TTQ_MADH_Hv methylam 64.5 13 0.00028 38.6 5.6 39 307-345 298-338 (352)
114 KOG0264 Nucleosome remodeling 64.2 25 0.00054 37.4 7.6 82 299-398 308-404 (422)
115 KOG1034 Transcriptional repres 63.4 13 0.00028 38.7 5.2 69 319-405 105-173 (385)
116 KOG0771 Prolactin regulatory e 63.0 12 0.00026 39.4 5.1 49 308-359 282-330 (398)
117 PF10168 Nup88: Nuclear pore c 62.9 67 0.0014 36.6 11.2 78 30-107 85-178 (717)
118 KOG0294 WD40 repeat-containing 62.8 37 0.0008 35.2 8.3 84 300-402 76-161 (362)
119 KOG0278 Serine/threonine kinas 62.8 64 0.0014 32.8 9.7 87 34-129 189-277 (334)
120 PRK03629 tolB translocation pr 61.8 32 0.00069 35.9 8.0 48 308-355 243-294 (429)
121 PRK04922 tolB translocation pr 61.2 37 0.0008 35.3 8.3 39 310-348 294-336 (433)
122 TIGR02276 beta_rpt_yvtn 40-res 61.1 22 0.00047 24.1 4.6 29 317-345 1-30 (42)
123 KOG0771 Prolactin regulatory e 60.8 65 0.0014 34.2 9.9 108 19-132 221-337 (398)
124 KOG0308 Conserved WD40 repeat- 60.4 21 0.00044 40.0 6.4 98 298-403 108-206 (735)
125 PRK01742 tolB translocation pr 59.7 41 0.00088 35.0 8.3 39 309-347 249-290 (429)
126 PF00400 WD40: WD domain, G-be 58.9 18 0.00038 24.0 3.8 29 76-106 11-39 (39)
127 KOG0315 G-protein beta subunit 58.4 19 0.00042 36.3 5.3 54 303-356 254-307 (311)
128 KOG0313 Microtubule binding pr 57.9 26 0.00057 37.0 6.4 76 308-401 301-379 (423)
129 PF06977 SdiA-regulated: SdiA- 57.7 82 0.0018 31.1 9.6 48 302-350 16-66 (248)
130 KOG0305 Anaphase promoting com 57.4 32 0.0007 37.3 7.2 86 303-405 297-383 (484)
131 KOG1408 WD40 repeat protein [F 57.3 49 0.0011 37.9 8.6 82 311-408 600-681 (1080)
132 PRK01029 tolB translocation pr 57.2 60 0.0013 34.1 9.1 46 303-348 322-370 (428)
133 KOG0295 WD40 repeat-containing 56.9 52 0.0011 34.7 8.3 69 318-405 303-371 (406)
134 KOG0274 Cdc4 and related F-box 56.6 84 0.0018 34.5 10.4 48 303-350 245-292 (537)
135 PF02239 Cytochrom_D1: Cytochr 56.0 20 0.00044 37.0 5.3 39 308-346 78-117 (369)
136 PF02897 Peptidase_S9_N: Proly 55.9 27 0.0006 35.5 6.3 49 308-356 124-178 (414)
137 PRK03629 tolB translocation pr 55.8 65 0.0014 33.6 9.1 47 309-355 332-381 (429)
138 KOG0269 WD40 repeat-containing 55.7 12 0.00026 42.4 3.8 67 319-404 100-170 (839)
139 COG4946 Uncharacterized protei 54.9 17 0.00037 39.5 4.6 38 309-346 403-440 (668)
140 KOG2048 WD40 repeat protein [G 53.7 16 0.00036 40.7 4.3 32 310-341 478-509 (691)
141 PRK04922 tolB translocation pr 53.4 69 0.0015 33.3 8.8 46 310-355 338-386 (433)
142 KOG0641 WD40 repeat protein [G 53.2 22 0.00048 35.5 4.7 47 302-348 223-272 (350)
143 KOG0292 Vesicle coat complex C 53.0 41 0.0009 39.2 7.4 86 301-405 45-130 (1202)
144 PRK11028 6-phosphogluconolacto 51.9 84 0.0018 30.8 8.7 30 310-339 128-158 (330)
145 KOG3881 Uncharacterized conser 51.8 29 0.00063 36.7 5.6 49 77-133 203-251 (412)
146 KOG0288 WD40 repeat protein Ti 51.6 15 0.00031 39.2 3.4 85 302-405 214-298 (459)
147 KOG0272 U4/U6 small nuclear ri 51.1 52 0.0011 35.2 7.3 84 303-404 341-424 (459)
148 PRK11028 6-phosphogluconolacto 49.6 1.1E+02 0.0024 30.0 9.1 28 311-338 83-111 (330)
149 KOG2096 WD40 repeat protein [G 49.3 29 0.00063 36.2 5.0 67 32-106 334-400 (420)
150 KOG2919 Guanine nucleotide-bin 49.2 49 0.0011 34.6 6.6 97 298-410 241-340 (406)
151 PF10313 DUF2415: Uncharacteri 49.0 51 0.0011 24.3 5.0 29 311-339 4-35 (43)
152 PF14781 BBS2_N: Ciliary BBSom 48.6 71 0.0015 29.1 6.9 73 76-159 47-124 (136)
153 KOG0640 mRNA cleavage stimulat 48.6 29 0.00062 36.1 4.8 103 299-406 163-299 (430)
154 PRK02889 tolB translocation pr 48.0 72 0.0016 33.2 7.9 40 305-344 193-235 (427)
155 PF01403 Sema: Sema domain; I 47.2 81 0.0018 32.9 8.2 52 78-129 365-431 (433)
156 PRK04792 tolB translocation pr 47.1 1.2E+02 0.0026 31.9 9.5 47 309-355 307-357 (448)
157 PF04841 Vps16_N: Vps16, N-ter 47.0 2.3E+02 0.005 29.7 11.5 72 29-111 29-112 (410)
158 TIGR02800 propeller_TolB tol-p 46.6 1E+02 0.0022 31.1 8.6 43 302-344 184-229 (417)
159 KOG1407 WD40 repeat protein [F 46.4 39 0.00084 34.3 5.3 83 303-405 16-102 (313)
160 PF04841 Vps16_N: Vps16, N-ter 46.3 65 0.0014 33.7 7.3 57 88-154 48-105 (410)
161 KOG1538 Uncharacterized conser 45.3 2.2E+02 0.0047 32.7 11.2 49 310-360 459-507 (1081)
162 KOG0301 Phospholipase A2-activ 43.8 84 0.0018 35.6 7.8 77 301-398 173-249 (745)
163 KOG0647 mRNA export protein (c 43.2 1.5E+02 0.0033 30.8 9.0 47 310-358 75-124 (347)
164 PF13464 DUF4115: Domain of un 42.7 74 0.0016 25.2 5.6 55 96-160 5-60 (77)
165 KOG0284 Polyadenylation factor 41.7 37 0.00079 36.3 4.5 92 302-412 133-226 (464)
166 KOG0289 mRNA splicing factor [ 41.7 1.1E+02 0.0024 33.1 8.0 78 309-405 349-426 (506)
167 PRK00178 tolB translocation pr 41.5 1.7E+02 0.0037 30.0 9.4 38 310-347 245-285 (430)
168 KOG0310 Conserved WD40 repeat- 41.4 1.6E+02 0.0035 32.0 9.3 86 300-403 103-190 (487)
169 PF04762 IKI3: IKI3 family; I 41.4 3.5E+02 0.0076 31.8 12.8 165 4-212 240-434 (928)
170 KOG4190 Uncharacterized conser 41.3 18 0.00039 40.0 2.3 51 309-362 878-928 (1034)
171 KOG1009 Chromatin assembly com 41.0 1E+02 0.0022 33.0 7.6 99 301-405 59-160 (434)
172 PRK02889 tolB translocation pr 40.6 54 0.0012 34.1 5.7 40 311-350 331-373 (427)
173 KOG2445 Nuclear pore complex c 40.0 1.2E+02 0.0026 31.5 7.8 34 76-110 223-258 (361)
174 KOG2394 WD40 protein DMR-N9 [G 39.9 74 0.0016 35.2 6.5 78 309-405 292-369 (636)
175 PF00780 CNH: CNH domain; Int 39.7 74 0.0016 30.3 6.1 61 91-161 8-68 (275)
176 KOG1645 RING-finger-containing 39.6 64 0.0014 34.5 5.9 80 302-401 188-269 (463)
177 PF11768 DUF3312: Protein of u 39.3 1E+02 0.0022 34.1 7.5 73 308-401 260-332 (545)
178 KOG2110 Uncharacterized conser 39.2 66 0.0014 33.9 5.8 60 299-358 165-226 (391)
179 KOG0639 Transducin-like enhanc 38.9 32 0.00069 37.8 3.6 49 79-129 595-643 (705)
180 KOG2114 Vacuolar assembly/sort 38.5 69 0.0015 37.1 6.3 55 77-132 125-184 (933)
181 PF06200 tify: tify domain; I 38.2 57 0.0012 23.2 3.7 24 138-161 3-26 (36)
182 KOG0293 WD40 repeat-containing 37.8 1.2E+02 0.0026 32.8 7.5 85 302-405 219-306 (519)
183 PRK04792 tolB translocation pr 37.4 1.2E+02 0.0026 31.9 7.7 37 311-347 265-304 (448)
184 KOG0650 WD40 repeat nucleolar 37.2 61 0.0013 36.3 5.5 43 303-345 396-438 (733)
185 KOG1036 Mitotic spindle checkp 35.7 1.3E+02 0.0027 31.2 7.1 41 310-350 235-275 (323)
186 PRK05137 tolB translocation pr 35.5 1.9E+02 0.004 30.1 8.7 81 303-402 197-282 (435)
187 TIGR03300 assembly_YfgL outer 34.5 1.5E+02 0.0033 29.6 7.6 64 319-403 105-168 (377)
188 PF07433 DUF1513: Protein of u 34.4 72 0.0016 32.7 5.2 40 306-345 215-255 (305)
189 KOG0321 WD40 repeat-containing 34.3 81 0.0018 35.5 5.9 85 314-412 58-148 (720)
190 KOG3881 Uncharacterized conser 34.2 1.2E+02 0.0027 32.1 7.0 76 307-401 247-323 (412)
191 KOG0283 WD40 repeat-containing 33.6 1.3E+02 0.0028 34.3 7.4 77 308-401 502-579 (712)
192 KOG4497 Uncharacterized conser 33.3 33 0.00071 36.0 2.6 32 313-345 14-45 (447)
193 COG3041 Uncharacterized protei 32.5 15 0.00032 31.3 -0.0 10 344-353 53-62 (91)
194 KOG4283 Transcription-coupled 32.0 1.6E+02 0.0034 30.7 7.1 64 319-400 158-221 (397)
195 KOG0300 WD40 repeat-containing 31.9 1.5E+02 0.0034 31.0 7.1 85 297-401 304-389 (481)
196 PLN02919 haloacid dehalogenase 31.9 2.1E+02 0.0046 34.0 9.2 82 311-402 807-892 (1057)
197 KOG0305 Anaphase promoting com 31.8 1.1E+02 0.0024 33.4 6.4 78 307-404 217-294 (484)
198 KOG0278 Serine/threonine kinas 31.6 2.1E+02 0.0045 29.3 7.7 80 304-402 221-301 (334)
199 KOG1273 WD40 repeat protein [G 31.4 4.4E+02 0.0096 27.8 10.2 41 303-343 61-101 (405)
200 KOG2111 Uncharacterized conser 30.5 3.7E+02 0.008 28.1 9.4 79 303-399 177-257 (346)
201 KOG1188 WD40 repeat protein [G 30.1 50 0.0011 34.5 3.3 39 91-129 41-80 (376)
202 KOG0267 Microtubule severing p 29.5 76 0.0016 36.2 4.7 58 302-359 65-124 (825)
203 PF06433 Me-amine-dh_H: Methyl 29.2 2.9E+02 0.0062 28.9 8.6 57 330-404 270-326 (342)
204 KOG1334 WD40 repeat protein [G 28.8 47 0.001 36.2 2.9 69 316-403 403-471 (559)
205 KOG0277 Peroxisomal targeting 28.8 2.2E+02 0.0047 29.1 7.3 79 303-397 4-90 (311)
206 PF10282 Lactonase: Lactonase, 28.7 2.7E+02 0.0059 28.0 8.3 80 33-115 248-332 (345)
207 TIGR02800 propeller_TolB tol-p 28.4 1.2E+02 0.0025 30.6 5.7 34 308-341 322-358 (417)
208 KOG0322 G-protein beta subunit 28.3 45 0.00098 33.9 2.6 40 297-336 283-322 (323)
209 PF13360 PQQ_2: PQQ-like domai 27.9 81 0.0017 28.8 4.1 28 315-342 208-235 (238)
210 PF14783 BBS2_Mid: Ciliary BBS 27.7 2E+02 0.0042 25.4 6.1 46 79-126 2-49 (111)
211 PRK00178 tolB translocation pr 27.3 3.1E+02 0.0066 28.1 8.6 41 311-351 334-377 (430)
212 KOG2096 WD40 repeat protein [G 26.8 1E+02 0.0023 32.3 4.9 42 297-338 74-117 (420)
213 PF12234 Rav1p_C: RAVE protein 26.7 6.9E+02 0.015 28.3 11.5 93 34-132 34-128 (631)
214 PF01011 PQQ: PQQ enzyme repea 26.6 94 0.002 21.3 3.3 18 325-342 6-23 (38)
215 PF13360 PQQ_2: PQQ-like domai 26.4 3.5E+02 0.0075 24.6 8.0 85 55-155 6-92 (238)
216 KOG2114 Vacuolar assembly/sort 26.3 1.1E+02 0.0024 35.6 5.4 46 313-359 29-74 (933)
217 KOG0302 Ribosome Assembly prot 26.1 2.1E+02 0.0045 30.6 6.9 74 306-400 256-334 (440)
218 PF01436 NHL: NHL repeat; Int 26.1 1.2E+02 0.0025 19.7 3.4 20 323-342 7-26 (28)
219 COG3391 Uncharacterized conser 26.0 1.4E+02 0.003 30.8 5.7 45 311-355 163-214 (381)
220 PLN02919 haloacid dehalogenase 25.9 2.8E+02 0.006 33.0 8.8 36 310-345 742-778 (1057)
221 KOG1064 RAVE (regulator of V-A 25.8 61 0.0013 40.7 3.4 67 308-403 2337-2403(2439)
222 PF11396 DUF2874: Protein of u 24.9 2.9E+02 0.0063 20.4 6.2 38 75-112 21-61 (61)
223 PF02393 US22: US22 like; Int 24.7 1.2E+02 0.0025 25.8 4.2 28 317-344 79-106 (125)
224 KOG2139 WD40 repeat protein [G 24.4 1.4E+02 0.0031 31.7 5.4 72 33-107 349-430 (445)
225 KOG4460 Nuclear pore complex, 24.4 3E+02 0.0065 30.9 7.9 40 314-354 470-509 (741)
226 TIGR03300 assembly_YfgL outer 23.8 2.7E+02 0.0059 27.8 7.3 49 63-118 85-134 (377)
227 PF10411 DsbC_N: Disulfide bon 23.3 1.6E+02 0.0035 22.3 4.3 37 76-114 11-47 (57)
228 KOG0299 U3 snoRNP-associated p 23.1 3.1E+02 0.0067 29.9 7.6 88 301-405 136-239 (479)
229 PF14408 Actino_peptide: Ribos 22.7 97 0.0021 24.4 2.9 22 312-333 5-26 (59)
230 KOG0973 Histone transcription 22.5 1.1E+02 0.0023 36.0 4.5 48 303-350 125-172 (942)
231 KOG0321 WD40 repeat-containing 22.3 2.3E+02 0.0049 32.1 6.6 98 303-403 140-253 (720)
232 smart00564 PQQ beta-propeller 21.9 1.5E+02 0.0033 18.9 3.4 21 322-342 9-29 (33)
233 KOG1523 Actin-related protein 21.9 2.9E+02 0.0063 29.0 6.9 84 24-117 204-289 (361)
234 KOG1912 WD40 repeat protein [G 21.7 71 0.0015 36.9 2.8 40 91-130 438-478 (1062)
235 KOG0270 WD40 repeat-containing 21.5 2.9E+02 0.0063 29.9 7.0 54 79-133 332-387 (463)
236 PRK04043 tolB translocation pr 21.4 1.6E+02 0.0035 30.9 5.3 43 313-355 324-375 (419)
237 KOG4378 Nuclear protein COP1 [ 21.2 4.3E+02 0.0092 29.4 8.3 81 302-403 116-200 (673)
238 PF11768 DUF3312: Protein of u 21.2 3E+02 0.0065 30.6 7.3 33 88-120 309-341 (545)
239 KOG1272 WD40-repeat-containing 20.9 1E+02 0.0022 33.6 3.6 36 303-338 289-324 (545)
240 PF07893 DUF1668: Protein of u 20.9 1.7E+02 0.0037 29.9 5.2 38 305-344 64-101 (342)
241 KOG1517 Guanine nucleotide bin 20.8 1E+02 0.0022 36.9 3.8 69 34-107 1309-1380(1387)
242 KOG0647 mRNA export protein (c 20.5 1.4E+02 0.0031 30.9 4.4 52 74-126 25-79 (347)
243 KOG0772 Uncharacterized conser 20.3 1.2E+02 0.0025 33.6 3.9 77 310-402 320-398 (641)
244 KOG0322 G-protein beta subunit 20.3 1.8E+02 0.0038 29.9 4.9 69 310-397 254-322 (323)
No 1
>PF14655 RAB3GAP2_N: Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=100.00 E-value=2.1e-100 Score=782.62 Aligned_cols=365 Identities=31% Similarity=0.452 Sum_probs=310.8
Q ss_pred cccCCCeeeeccCcceeeeeecceEEE--EeecCCC------CCceeecccCCCCCCCeEEEEEeEEe---------CCc
Q 015021 28 WLVNDPNLLCALDMHTIALANRYQTVI--INWADPE------GLVAKIRPELSPIASEYITAIEWLVF---------EEM 90 (414)
Q Consensus 28 WL~~~~~~~~sp~~~~la~A~~~~~v~--~~w~~~~------~~~v~~~g~l~~~~~e~ITs~~~lpl---------~dw 90 (414)
||+| |++++||+||+||+|+++|+|| .+|+.++ .+.+.|+|+|+.+++|+|||++|||| +||
T Consensus 1 WL~~-~~isls~~~d~laiA~~~r~vil~~~w~~~~~~~~~~~~~~~~~g~l~~~~~e~ITsi~clpl~s~~~s~~~~dw 79 (415)
T PF14655_consen 1 WLQD-CSISLSPDGDLLAIARGQRLVILTSKWDSSRKGENENTYSISWSGPLDDEPGECITSILCLPLSSQKRSTGGPDW 79 (415)
T ss_pred Cccc-ceEEecCCCCEEEEEcCCEEEEEEeeccccccCCCCCeEEEEeeeeccCCCCCEEEEEEEEEeecccccCCCCCc
Confidence 9999 9999999999999999999999 5895533 24489999999977799999999999 899
Q ss_pred EEEEEeccccEEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEeChhHHHHHHHHHHhc
Q 015021 91 RALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFDGSEIQKMLQRWFQDS 170 (414)
Q Consensus 91 ~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~IdG~sL~~~Lr~c~~~~ 170 (414)
+||||||++||||||||+|+|||+|+||++||++||||+++++..++...|||+|+||++||+|||++|+++|++|++|+
T Consensus 80 ~~I~VG~ssG~vrfyte~G~LL~~Q~~h~~pV~~ik~~~~~~~~~~~~~~eel~ily~~~v~~Idg~sL~~~L~~~~~~~ 159 (415)
T PF14655_consen 80 TCIAVGTSSGYVRFYTENGVLLLSQLLHEEPVLKIKCRSTKIPRHPGDSSEELSILYPSAVVIIDGFSLFSVLRACRNQV 159 (415)
T ss_pred EEEEEEecccEEEEEeccchHHHHHhcCccceEEEEecccCCCCCCcccccEEEEEECCEEEEEecHHHHHHHHHHHHhh
Confidence 99999999999999999999999999999999999999999866666666999999999999999999999999999999
Q ss_pred cccccCCCCccCCCcccccccCCccceecccCCCCCceeeEEeCCCCCCchhhh--------------ccccceeEEEeC
Q 015021 171 NSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAITGLMPPPLMEVQ--------------SSQRYFCAVTIG 236 (414)
Q Consensus 171 ~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~~G~~~p~~~d~~--------------s~~~~~~~i~vG 236 (414)
+++.|+.+ + .+.+++|+||||+|++++.|+|++++|+++|++||++ +...++++|++|
T Consensus 160 ~~~~~~~~-------~-~~~~~~L~ykKw~l~~~~~i~D~~~~G~~~~~~fd~l~~aS~~gf~a~~~~s~~~~~~~i~~G 231 (415)
T PF14655_consen 160 ARGAASGS-------D-SPAPPPLSYKKWNLQSQDTINDAAICGPMPPSTFDHLVTASIGGFNAKYRSSPPRMSRYITVG 231 (415)
T ss_pred hhhhhccc-------c-cCCCCccceeEecCCCCCcEeeEEEecCCCCcHHHHHHhhhcccccceeecCCcceEEEEEec
Confidence 99865321 1 3347889999999999999999999999999999984 335778999999
Q ss_pred CCceeEEEEeccCCCcchhhhhhhhhhH-HHHHHhhhhhhhhhcC--CCCCC-CC-----CCCCcccccCCCCccccCCC
Q 015021 237 EDSVISAFRLSEDRSRSLVGAILSKVVP-ATFSTISSLSKMIWRS--EQSPK-KS-----EPKPQSFARASPLTCLKDHP 307 (414)
Q Consensus 237 ~~P~la~y~~~~~~~~s~~~al~S~va~-av~S~v~S~ak~~W~~--~~~~~-~~-----e~kp~~~~~a~pl~~l~D~~ 307 (414)
++||+|||+.+|+.+++ ++++|+. +|.|+++++++.+|++ .++++ +. ++++++.++..+.++|+|.+
T Consensus 232 ~~P~v~f~~~~e~~s~~----~ls~va~~aVas~l~sav~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~l~~r~~l~D~~ 307 (415)
T PF14655_consen 232 SSPFVSFYYASEGSSQP----LLSDVASSAVASKLTSAVSGWLGWGSWRSEQQPQEKQPPEPKPEPAAPLPMRFGLPDSK 307 (415)
T ss_pred CCceEEEEEccCCCCcc----cHHHHHHHHHHHHHHhhhHhhcccCCCCCccccccccccccCcCCCcccceEEeeccCC
Confidence 99999999888877765 5667777 8888888876634432 22221 11 22333333333336799999
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP 387 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap 387 (414)
|++++|++||+|+|||+||++|||+|+|+++++++|||||||||||+|+++.+..+...+..+....+.+++||||||||
T Consensus 308 R~~~~i~~sP~~~laA~tDslGRV~LiD~~~~~vvrmWKGYRdAqc~wi~~~~~~~~~~~~~~~~~~~~~~~l~LvIyap 387 (415)
T PF14655_consen 308 REGESICLSPSGRLAAVTDSLGRVLLIDVARGIVVRMWKGYRDAQCGWIEVPEEGDRDRSNSNSPKSSSRFALFLVIYAP 387 (415)
T ss_pred ceEEEEEECCCCCEEEEEcCCCcEEEEECCCChhhhhhccCccceEEEEEeecccccccccccccCCCCcceEEEEEEec
Confidence 99999999999999999999999999999999999999999999999999998876432333444445678999999999
Q ss_pred CCCeEEEeeCCCCcceee
Q 015021 388 RKGIIEHLAGMADENWTT 405 (414)
Q Consensus 388 rRg~lEvW~~~~g~~v~~ 405 (414)
|||+||||+||||+||++
T Consensus 388 rRg~lEvW~~~~g~Rv~a 405 (415)
T PF14655_consen 388 RRGILEVWSMRQGPRVAA 405 (415)
T ss_pred cCCeEEEEecCCCCEEEE
Confidence 999999999999999954
No 2
>KOG2727 consensus Rab3 GTPase-activating protein, non-catalytic subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=1.1e-90 Score=739.94 Aligned_cols=381 Identities=44% Similarity=0.626 Sum_probs=331.8
Q ss_pred CCCCcccceeeeeeecccccccCCCCCcccCCCeeeeccCcceeeeeecceEEE--EeecCCCCC---ceeecccCCCCC
Q 015021 1 MSKRTHTTEVGSIACTDLSDLGAGKEGWLVNDPNLLCALDMHTIALANRYQTVI--INWADPEGL---VAKIRPELSPIA 75 (414)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~g~~~~~WL~~~~~~~~sp~~~~la~A~~~~~v~--~~w~~~~~~---~v~~~g~l~~~~ 75 (414)
|.+|.|.+|.||+.|+|+.+.|+++ ||++ |+++++|++|++++|+++|||| .+|.+.+++ .|.|+|+|++++
T Consensus 1 ~w~~w~~~e~g~~e~ee~t~lg~~~--WL~~-cnl~l~s~~d~~~~A~e~rfvfL~~~Wk~pd~p~~~~Vgw~g~l~dpe 77 (1244)
T KOG2727|consen 1 MWKRWHLTELGCIECEELTELGAGK--WLLV-CNLNLLSALDSHSLALENRFVFLIVNWKDPDAPVYKRVGWRGDLSDPE 77 (1244)
T ss_pred CcccccccccCchhhhhhhcccccc--hHHh-cCcccCcchHHHHHHhhcceEEEEecCCCCCCCceEEEEeccccCCcc
Confidence 7789999999999999999999988 9999 9999999999999999999999 679999866 499999999988
Q ss_pred CCeEEEEEeEEe--------CCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEe
Q 015021 76 SEYITAIEWLVF--------EEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVM 147 (414)
Q Consensus 76 ~e~ITs~~~lpl--------~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily 147 (414)
+|+|||++|+|| +|||||||||++|||+||||+|.|||+|.+|++||++||||+++. +++++|+|
T Consensus 78 ~e~ITa~~clpl~n~s~dgr~dwtcVavGt~sGyV~FYTe~Gvllf~Q~~~edPVl~lk~r~~k~-------d~~l~i~y 150 (1244)
T KOG2727|consen 78 AESITAIECLPLDNVSHDGRVDWTCVAVGTSSGYVLFYTETGVLLFKQIVHEDPVLKLKVRGTKK-------DLMLEISY 150 (1244)
T ss_pred cceeeeeeeeeccccccccccceeEEEEecccceEEEEecccHHHHHHHhccCccceEEEEEeec-------CcEEEEee
Confidence 999999999999 799999999999999999999999999999999999999999976 89999999
Q ss_pred CCeEEEEeChhHHHHHHHHHHhccccccCCCCccCCCcccccccCCccceeccc-CC-CCCceeeEEeCCCCCCchhhh-
Q 015021 148 PGVLARFDGSEIQKMLQRWFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNV-SK-YGPCADAAITGLMPPPLMEVQ- 224 (414)
Q Consensus 148 ~~~iv~IdG~sL~~~Lr~c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l-~~-~~~i~Daa~~G~~~p~~~d~~- 224 (414)
|.++|+|+|++|++.|++|+++++++. ..+ +.....++++||||.+ ++ ...|.|+++.+...||++|+.
T Consensus 151 p~~~~~I~g~sl~~~L~ncq~~Vqkaa------~Ek--nsn~~~~~~~~qk~~l~qdi~~~I~hai~~~~~~ppt~Dq~v 222 (1244)
T KOG2727|consen 151 PEICIVIPGVSLRFDLSNCQSMVQKAA------QEK--NSNFWDQKNRKQKAELTQDIYQRIPHAIWNVNKNPPTVDQTV 222 (1244)
T ss_pred cceEEEECCchhhhhHHHHHHHHHHHH------Hhc--cCCcCCccchhhhhhcccchhhccchheeecccCCccHHHhh
Confidence 999999999999999999999998873 111 1122457899999999 44 446777777666788989982
Q ss_pred --------------ccccceeEEEeCCCceeEEEEeccCCCcchhhhhhhhhhHHHHHHhhhhhhhhhcCCC--CCCCCC
Q 015021 225 --------------SSQRYFCAVTIGEDSVISAFRLSEDRSRSLVGAILSKVVPATFSTISSLSKMIWRSEQ--SPKKSE 288 (414)
Q Consensus 225 --------------s~~~~~~~i~vG~~P~la~y~~~~~~~~s~~~al~S~va~av~S~v~S~ak~~W~~~~--~~~~~e 288 (414)
|-.+|++++++|++||++||+++++.+.+++++++.+|++++.|-+.|....+|+++. +..+++
T Consensus 223 tas~~~gy~a~~k~SpPrySq~vt~ge~pf~gFf~a~eg~~~~llg~Vak~v~s~A~sn~asg~fgi~~ep~~sp~~kp~ 302 (1244)
T KOG2727|consen 223 TASMPPGYLALQKPSPPRYSQAVTIGEDPFIGFFRASEGRGRSLLGAVAKKVVSAAASNIASGSFGIWREPDQSPKRKPE 302 (1244)
T ss_pred hcccCchhhhhccCCCcceeeeEEecCCceeeeeeeccccccccHHHHHHHhhhhhhhhhhhheeeccCCCCcChhhcCC
Confidence 2247889999999999999999999999877777777777665544443332444443 446677
Q ss_pred CCCcccccCCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccc
Q 015021 289 PKPQSFARASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSS 368 (414)
Q Consensus 289 ~kp~~~~~a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~ 368 (414)
|||++.++++.+.+|+|.+|+|++|++||+|+|||+||++|||+|+|+++++|||||||||||||+|+++.+..++.
T Consensus 303 pk~~saapa~~R~~i~D~~R~ge~lslSP~gtlAAVTD~lgRVlLlDta~~ivvr~wKGYRDAsc~fv~vkek~~~s--- 379 (1244)
T KOG2727|consen 303 PKTQSAAPASSRTCIKDYPRKGEKLSLSPSGTLAAVTDSLGRVLLLDTAALIVVRLWKGYRDASCVFVEVKEKKGKS--- 379 (1244)
T ss_pred CCCCcccccceeeccccCccccceeeeCCCccEEEEecccCcEEEEehhhhhHHHHhcccccceeEEEEcccccCCC---
Confidence 78877665666678999999999999999999999999999999999999999999999999999999988765432
Q ss_pred cccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 369 AYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 369 ~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
++.+.++||||||||||||+||||.||+|++|.+
T Consensus 380 ---~~~~sRvAlFLvIyAPRrgiLEVW~~q~gpRV~A 413 (1244)
T KOG2727|consen 380 ---EPVKSRVALFLVIYAPRRGILEVWQMQTGPRVLA 413 (1244)
T ss_pred ---ccCcCceeEEEEEecccccHHHHHHhccCCeEEE
Confidence 5677889999999999999999999999999976
No 3
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.85 E-value=0.018 Score=51.12 Aligned_cols=81 Identities=17% Similarity=0.138 Sum_probs=58.7
Q ss_pred CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021 305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI 384 (414)
Q Consensus 305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI 384 (414)
........+..+|+++.++++...|.|.++|+.++..++.++++.+ .+....-.+ . . .+++
T Consensus 175 ~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~----~i~~~~~~~----------~--~---~~~~ 235 (289)
T cd00200 175 GHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN----GVNSVAFSP----------D--G---YLLA 235 (289)
T ss_pred cCccccceEEECCCcCEEEEecCCCcEEEEECCCCceecchhhcCC----ceEEEEEcC----------C--C---cEEE
Confidence 3344677899999998777777799999999999999999987765 232221111 0 0 2555
Q ss_pred EcCCCCeEEEeeCCCCccee
Q 015021 385 HAPRKGIIEHLAGMADENWT 404 (414)
Q Consensus 385 yaprRg~lEvW~~~~g~~v~ 404 (414)
.+..+|.|.+|++++++.+.
T Consensus 236 ~~~~~~~i~i~~~~~~~~~~ 255 (289)
T cd00200 236 SGSEDGTIRVWDLRTGECVQ 255 (289)
T ss_pred EEcCCCcEEEEEcCCceeEE
Confidence 56679999999999877653
No 4
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=97.76 E-value=0.00026 Score=74.25 Aligned_cols=53 Identities=21% Similarity=0.318 Sum_probs=47.0
Q ss_pred ccCCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecc
Q 015021 295 ARASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKG 347 (414)
Q Consensus 295 ~~a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKG 347 (414)
+...|++.|.+++--.-++.-||+|+|.|..+..|+|.++++.++.+++-.+|
T Consensus 440 ~~gv~i~~f~kH~~pVysvafS~~g~ylAsGs~dg~V~iws~~~~~l~~s~~~ 492 (524)
T KOG0273|consen 440 ESGVPIHTLMKHQEPVYSVAFSPNGRYLASGSLDGCVHIWSTKTGKLVKSYQG 492 (524)
T ss_pred cCCceeEeeccCCCceEEEEecCCCcEEEecCCCCeeEeccccchheeEeecC
Confidence 33557778999999999999999999999999999999999999998876655
No 5
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=97.42 E-value=0.0039 Score=64.56 Aligned_cols=79 Identities=15% Similarity=0.192 Sum_probs=67.6
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEEecccccccccccCCCCCcceE
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
..-+......+..|||++|+|.+.-+--|=|+|-.+|+-+-.++|+=+| ||+|.. | .
T Consensus 363 mtgHq~lVn~V~fSPd~r~IASaSFDkSVkLW~g~tGk~lasfRGHv~~VYqvawsa-----D------------s---- 421 (480)
T KOG0271|consen 363 MTGHQALVNHVSFSPDGRYIASASFDKSVKLWDGRTGKFLASFRGHVAAVYQVAWSA-----D------------S---- 421 (480)
T ss_pred hhchhhheeeEEECCCccEEEEeecccceeeeeCCCcchhhhhhhccceeEEEEecc-----C------------c----
Confidence 4445667889999999999999999999999999999999999999987 788863 1 1
Q ss_pred EEEEEcCCCCeEEEeeCCCCcc
Q 015021 381 CLAIHAPRKGIIEHLAGMADEN 402 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~g~~ 402 (414)
=|++...++..|+||++++-+.
T Consensus 422 RLlVS~SkDsTLKvw~V~tkKl 443 (480)
T KOG0271|consen 422 RLLVSGSKDSTLKVWDVRTKKL 443 (480)
T ss_pred cEEEEcCCCceEEEEEeeeeee
Confidence 1888999999999999998654
No 6
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=96.89 E-value=0.016 Score=64.01 Aligned_cols=83 Identities=17% Similarity=0.146 Sum_probs=65.1
Q ss_pred CCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021 298 SPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD 377 (414)
Q Consensus 298 ~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~ 377 (414)
...+.|.-+++.+++++.||+|++.|..|..|+|++||+.+|..+...||+++.- .=+...- + +
T Consensus 568 ~~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~Ht~ti-~SlsFS~--d-g------------ 631 (707)
T KOG0263|consen 568 NSVRIFTGHKGPVTALAFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGHTGTI-YSLSFSR--D-G------------ 631 (707)
T ss_pred cEEEEecCCCCceEEEEEcCCCceEeecccCCcEEEEEcCCCcchhhhhcccCce-eEEEEec--C-C------------
Confidence 3345688899999999999999999999999999999999999999999996632 2222211 1 0
Q ss_pred ceEEEEEEcCCCCeEEEeeCCC
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMA 399 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~ 399 (414)
-..+-+.-+..|.+||+..
T Consensus 632 ---~vLasgg~DnsV~lWD~~~ 650 (707)
T KOG0263|consen 632 ---NVLASGGADNSVRLWDLTK 650 (707)
T ss_pred ---CEEEecCCCCeEEEEEchh
Confidence 1445677789999999865
No 7
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.66 E-value=0.011 Score=62.19 Aligned_cols=88 Identities=16% Similarity=0.156 Sum_probs=73.6
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEc-CcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDT-QALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD 377 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~-~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~ 377 (414)
.+..+.++.+.+..++.+|++++++.+..++.|-++|+ ..+..+|..|||.+ ++....=.+++
T Consensus 195 ~~~~l~~h~~~v~~~~fs~d~~~l~s~s~D~tiriwd~~~~~~~~~~l~gH~~----~v~~~~f~p~g------------ 258 (456)
T KOG0266|consen 195 LLRELSGHTRGVSDVAFSPDGSYLLSGSDDKTLRIWDLKDDGRNLKTLKGHST----YVTSVAFSPDG------------ 258 (456)
T ss_pred hhccccccccceeeeEECCCCcEEEEecCCceEEEeeccCCCeEEEEecCCCC----ceEEEEecCCC------------
Confidence 34557889999999999999999999999999999999 77799999999999 66444322211
Q ss_pred ceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
-|++.+-.++.+.||++++|+++-+
T Consensus 259 ---~~i~Sgs~D~tvriWd~~~~~~~~~ 283 (456)
T KOG0266|consen 259 ---NLLVSGSDDGTVRIWDVRTGECVRK 283 (456)
T ss_pred ---CEEEEecCCCcEEEEeccCCeEEEe
Confidence 2889999999999999999988743
No 8
>PTZ00421 coronin; Provisional
Probab=96.62 E-value=0.014 Score=62.30 Aligned_cols=88 Identities=15% Similarity=0.126 Sum_probs=67.9
Q ss_pred CCccccCCCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021 299 PLTCLKDHPRKGERLTLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD 377 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~ 377 (414)
|+..|..+.+.+..|..+|++ +++|++...|.|.|+|+.++..++.++|+.+ .|....-.+++
T Consensus 117 ~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h~~----~V~sla~spdG------------ 180 (493)
T PTZ00421 117 PIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSD----QITSLEWNLDG------------ 180 (493)
T ss_pred ceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEEcCCCC----ceEEEEEECCC------------
Confidence 444577778889999999986 6888888899999999999999999999987 33322111111
Q ss_pred ceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
-+++.+..+|.|.||++++|+.+.+
T Consensus 181 ---~lLatgs~Dg~IrIwD~rsg~~v~t 205 (493)
T PTZ00421 181 ---SLLCTTSKDKKLNIIDPRDGTIVSS 205 (493)
T ss_pred ---CEEEEecCCCEEEEEECCCCcEEEE
Confidence 1567788999999999999987743
No 9
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=96.61 E-value=0.066 Score=59.99 Aligned_cols=286 Identities=18% Similarity=0.162 Sum_probs=142.2
Q ss_pred CeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCc-cceeEEEEecccCCCCCCCCCCeEEEEeCCe-E--E
Q 015021 77 EYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHP-GRILKLRVRGSRRDLTQDTAEEEVCVVMPGV-L--A 152 (414)
Q Consensus 77 e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~-~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~-i--v 152 (414)
.+||+++=-|.-| |||||+-+|-|.+|...-+.++.-.=|+ ++|.+|.+|+-.. .-+.+--+.+ + -
T Consensus 203 s~IT~ieqsPaLD--VVaiG~~~G~ViifNlK~dkil~sFk~d~g~VtslSFrtDG~--------p~las~~~~G~m~~w 272 (910)
T KOG1539|consen 203 SRITAIEQSPALD--VVAIGLENGTVIIFNLKFDKILMSFKQDWGRVTSLSFRTDGN--------PLLASGRSNGDMAFW 272 (910)
T ss_pred cceeEeccCCcce--EEEEeccCceEEEEEcccCcEEEEEEccccceeEEEeccCCC--------eeEEeccCCceEEEE
Confidence 5688887666544 8999999999999996544444445565 9999999998553 2244444422 2 3
Q ss_pred EEeChhHHHHHHHHH-HhccccccCCCCccCCCcccccccCCccceecccCCCCCc--eeeEEeCCCCCCchhh-hcccc
Q 015021 153 RFDGSEIQKMLQRWF-QDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPC--ADAAITGLMPPPLMEV-QSSQR 228 (414)
Q Consensus 153 ~IdG~sL~~~Lr~c~-~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i--~Daa~~G~~~p~~~d~-~s~~~ 228 (414)
-+|+-.|..++++.. ..+... .-..+.+=+-...+.=+.+.|-++..+-. -==.=-|...||.+=- +-.++
T Consensus 273 DLe~kkl~~v~~nah~~sv~~~-----~fl~~epVl~ta~~DnSlk~~vfD~~dg~pR~LR~R~GHs~Pp~~irfy~~~g 347 (910)
T KOG1539|consen 273 DLEKKKLINVTRNAHYGSVTGA-----TFLPGEPVLVTAGADNSLKVWVFDSGDGVPRLLRSRGGHSAPPSCIRFYGSQG 347 (910)
T ss_pred EcCCCeeeeeeeccccCCcccc-----eecCCCceEeeccCCCceeEEEeeCCCCcchheeeccCCCCCchheeeeccCc
Confidence 466777777777532 000000 00000000000112235577777643321 0001123334444322 11233
Q ss_pred ceeEEEeCCCceeEEEEeccCC-CcchhhhhhhhhhHHH--HHH-------hhhhh----h-hhhcCCCC---C------
Q 015021 229 YFCAVTIGEDSVISAFRLSEDR-SRSLVGAILSKVVPAT--FST-------ISSLS----K-MIWRSEQS---P------ 284 (414)
Q Consensus 229 ~~~~i~vG~~P~la~y~~~~~~-~~s~~~al~S~va~av--~S~-------v~S~a----k-~~W~~~~~---~------ 284 (414)
++ ++.+|.++.+=.|-..-++ ++++..-...+-++.+ .+. +..|+ + .-|..-.. .
T Consensus 348 ~~-ilsa~~Drt~r~fs~~~e~~~~~l~~~~~~~~~kk~~~~~~~~~k~p~i~~fa~~~~RE~~W~Nv~~~h~~~~~~~t 426 (910)
T KOG1539|consen 348 HF-ILSAKQDRTLRSFSVISESQSQELGQLHNKKRAKKVNVFSTEKLKLPPIVEFAFENAREKEWDNVITAHKGKRSAYT 426 (910)
T ss_pred EE-EEecccCcchhhhhhhHHHHhHhhcccccccccccccccchhhhcCCcceeeecccchhhhhcceeEEecCcceEEE
Confidence 33 6777777755444222111 1111110000111111 000 00010 0 01200000 0
Q ss_pred -----CCC-CC--CCcccccCCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEE
Q 015021 285 -----KKS-EP--KPQSFARASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFM 356 (414)
Q Consensus 285 -----~~~-e~--kp~~~~~a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi 356 (414)
+.. +. +|.+ +.-..-...++++|++|+++.+.-+-|-|-+++.+.|+...-+= -+-|.= -
T Consensus 427 W~~~n~~~G~~~L~~~~---------~~~~~~~~~av~vs~CGNF~~IG~S~G~Id~fNmQSGi~r~sf~-~~~ah~--~ 494 (910)
T KOG1539|consen 427 WNFRNKTSGRHVLDPKR---------FKKDDINATAVCVSFCGNFVFIGYSKGTIDRFNMQSGIHRKSFG-DSPAHK--G 494 (910)
T ss_pred EeccCcccccEEecCcc---------ccccCcceEEEEEeccCceEEEeccCCeEEEEEcccCeeecccc-cCcccc--C
Confidence 000 00 1111 22234678999999999999999999999999999998765441 000000 0
Q ss_pred EEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021 357 EMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENW 403 (414)
Q Consensus 357 ~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v 403 (414)
++.... . +..=-++|-+...|+|..|+...+..+
T Consensus 495 ~V~gla-----------~--D~~n~~~vsa~~~Gilkfw~f~~k~l~ 528 (910)
T KOG1539|consen 495 EVTGLA-----------V--DGTNRLLVSAGADGILKFWDFKKKVLK 528 (910)
T ss_pred ceeEEE-----------e--cCCCceEEEccCcceEEEEecCCccee
Confidence 000000 0 111127899999999999999887744
No 10
>PLN00181 protein SPA1-RELATED; Provisional
Probab=96.55 E-value=1.2 Score=49.90 Aligned_cols=93 Identities=18% Similarity=0.191 Sum_probs=62.1
Q ss_pred CeeeeccCcceeeeeecceEEEEeecCCCC--Cceeec-c--cCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-
Q 015021 33 PNLLCALDMHTIALANRYQTVIINWADPEG--LVAKIR-P--ELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD- 106 (414)
Q Consensus 33 ~~~~~sp~~~~la~A~~~~~v~~~w~~~~~--~~v~~~-g--~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt- 106 (414)
+.++.+|+|++||.|-...-|- -|+.... ...... + .+. ....|+++.|-|.. ...++.|-.+|.|++|+
T Consensus 487 ~~i~fs~dg~~latgg~D~~I~-iwd~~~~~~~~~~~~~~~~~~~--~~~~v~~l~~~~~~-~~~las~~~Dg~v~lWd~ 562 (793)
T PLN00181 487 CAIGFDRDGEFFATAGVNKKIK-IFECESIIKDGRDIHYPVVELA--SRSKLSGICWNSYI-KSQVASSNFEGVVQVWDV 562 (793)
T ss_pred EEEEECCCCCEEEEEeCCCEEE-EEECCcccccccccccceEEec--ccCceeeEEeccCC-CCEEEEEeCCCeEEEEEC
Confidence 4577889999988887765443 2433210 000000 0 111 13568888888763 45688888999999999
Q ss_pred cCCcEEeecccCccceeEEEEec
Q 015021 107 LKGDLVHRQLIHPGRILKLRVRG 129 (414)
Q Consensus 107 e~G~LL~sQ~lh~~pV~~ik~r~ 129 (414)
++|.++..+.-|..+|..+.+..
T Consensus 563 ~~~~~~~~~~~H~~~V~~l~~~p 585 (793)
T PLN00181 563 ARSQLVTEMKEHEKRVWSIDYSS 585 (793)
T ss_pred CCCeEEEEecCCCCCEEEEEEcC
Confidence 57888888788999999999753
No 11
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.55 E-value=0.014 Score=61.30 Aligned_cols=85 Identities=24% Similarity=0.217 Sum_probs=68.4
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY 378 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~ 378 (414)
.+..|..+.--..+++.+|.|++.+..+.+|-|-++|+.++..++++||+.+ .|....-..+ .
T Consensus 238 ~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~----~is~~~f~~d-------------~ 300 (456)
T KOG0266|consen 238 NLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKGHSD----GISGLAFSPD-------------G 300 (456)
T ss_pred EEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeeccCC----ceEEEEECCC-------------C
Confidence 4456888888889999999999999999999999999999999999999999 5544322211 1
Q ss_pred eEEEEEEcCCCCeEEEeeCCCCcc
Q 015021 379 CLCLAIHAPRKGIIEHLAGMADEN 402 (414)
Q Consensus 379 ~l~LvIyaprRg~lEvW~~~~g~~ 402 (414)
. +| .-+-.+|.|.|||+.+|++
T Consensus 301 ~-~l-~s~s~d~~i~vwd~~~~~~ 322 (456)
T KOG0266|consen 301 N-LL-VSASYDGTIRVWDLETGSK 322 (456)
T ss_pred C-EE-EEcCCCccEEEEECCCCce
Confidence 1 23 3336699999999999994
No 12
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=96.54 E-value=0.0039 Score=65.42 Aligned_cols=82 Identities=18% Similarity=0.113 Sum_probs=64.0
Q ss_pred CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021 305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI 384 (414)
Q Consensus 305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI 384 (414)
.+.++..++++||+|+|+|..+-.-.|.++|..++.-+..|||+|++=.+=.- . +..- =.+
T Consensus 200 ~h~keil~~avS~Dgkylatgg~d~~v~Iw~~~t~ehv~~~~ghr~~V~~L~f-r---------------~gt~---~ly 260 (479)
T KOG0299|consen 200 GHVKEILTLAVSSDGKYLATGGRDRHVQIWDCDTLEHVKVFKGHRGAVSSLAF-R---------------KGTS---ELY 260 (479)
T ss_pred cccceeEEEEEcCCCcEEEecCCCceEEEecCcccchhhcccccccceeeeee-e---------------cCcc---cee
Confidence 67789999999999999999999999999999999999999999995433221 1 1000 114
Q ss_pred EcCCCCeEEEeeCCCCcceee
Q 015021 385 HAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 385 yaprRg~lEvW~~~~g~~v~~ 405 (414)
-+..++.+.||++++-..|.|
T Consensus 261 s~s~Drsvkvw~~~~~s~vet 281 (479)
T KOG0299|consen 261 SASADRSVKVWSIDQLSYVET 281 (479)
T ss_pred eeecCCceEEEehhHhHHHHH
Confidence 566788899999988776654
No 13
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.23 E-value=0.41 Score=53.27 Aligned_cols=86 Identities=20% Similarity=0.189 Sum_probs=74.6
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccce--eeEEEEEecccccccccccCCCCC
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDAS--CVFMEMLVNKDAATSSAYYAPVKS 376 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq--~~wi~~~~~~~~~~~~~~~~~~k~ 376 (414)
.+..|.-++|..-++..||.-++.|++..++.|=++.+.+...+.-+.|++.|= +.|+.- ++
T Consensus 497 l~~vLsGH~RGvw~V~Fs~~dq~laT~SgD~TvKIW~is~fSClkT~eGH~~aVlra~F~~~---------------~~- 560 (775)
T KOG0319|consen 497 LLGVLSGHTRGVWCVSFSKNDQLLATCSGDKTVKIWSISTFSCLKTFEGHTSAVLRASFIRN---------------GK- 560 (775)
T ss_pred EEEEeeCCccceEEEEeccccceeEeccCCceEEEEEeccceeeeeecCccceeEeeeeeeC---------------Cc-
Confidence 345799999999999999999999999999999999999999999999999973 445530 12
Q ss_pred cceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 377 DYCLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 377 ~~~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
-.|.+-.+|+|++|++++++|+.|
T Consensus 561 -----qliS~~adGliKlWnikt~eC~~t 584 (775)
T KOG0319|consen 561 -----QLISAGADGLIKLWNIKTNECEMT 584 (775)
T ss_pred -----EEEeccCCCcEEEEeccchhhhhh
Confidence 458999999999999999999865
No 14
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.21 E-value=0.034 Score=49.36 Aligned_cols=82 Identities=17% Similarity=0.137 Sum_probs=58.7
Q ss_pred CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021 305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI 384 (414)
Q Consensus 305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI 384 (414)
........+..+|+++++++++..|.|.++|+.++..++.++++.+ .+....-.++ + . .++
T Consensus 133 ~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~----~i~~~~~~~~---------~---~---~l~ 193 (289)
T cd00200 133 GHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTG----EVNSVAFSPD---------G---E---KLL 193 (289)
T ss_pred cCCCcEEEEEEcCcCCEEEEEcCCCcEEEEEccccccceeEecCcc----ccceEEECCC---------c---C---EEE
Confidence 3445678899999999999998899999999999999999987664 2221111110 0 0 233
Q ss_pred EcCCCCeEEEeeCCCCcceee
Q 015021 385 HAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 385 yaprRg~lEvW~~~~g~~v~~ 405 (414)
.+-.+|.|.+|+++.++.+.+
T Consensus 194 ~~~~~~~i~i~d~~~~~~~~~ 214 (289)
T cd00200 194 SSSSDGTIKLWDLSTGKCLGT 214 (289)
T ss_pred EecCCCcEEEEECCCCceecc
Confidence 444499999999998877643
No 15
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=95.99 E-value=0.035 Score=55.03 Aligned_cols=82 Identities=16% Similarity=0.237 Sum_probs=62.9
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCC
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPR 388 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyapr 388 (414)
++.++.++|..+-+..+-.+|-++-+|+.+|.+-|.+||+-| .+......+.. --+....-
T Consensus 116 eINam~ldP~enSi~~AgGD~~~y~~dlE~G~i~r~~rGHtD----YvH~vv~R~~~---------------~qilsG~E 176 (325)
T KOG0649|consen 116 EINAMWLDPSENSILFAGGDGVIYQVDLEDGRIQREYRGHTD----YVHSVVGRNAN---------------GQILSGAE 176 (325)
T ss_pred ccceeEeccCCCcEEEecCCeEEEEEEecCCEEEEEEcCCcc----eeeeeeecccC---------------cceeecCC
Confidence 567899999976444444889999999999999999999999 77665432211 12355677
Q ss_pred CCeEEEeeCCCCcceeeccce
Q 015021 389 KGIIEHLAGMADENWTTSLNY 409 (414)
Q Consensus 389 Rg~lEvW~~~~g~~v~~~~~~ 409 (414)
+|.+.||+++++++|-+---|
T Consensus 177 DGtvRvWd~kt~k~v~~ie~y 197 (325)
T KOG0649|consen 177 DGTVRVWDTKTQKHVSMIEPY 197 (325)
T ss_pred CccEEEEeccccceeEEeccc
Confidence 999999999999999664433
No 16
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=95.94 E-value=0.021 Score=63.08 Aligned_cols=88 Identities=18% Similarity=0.212 Sum_probs=69.6
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY 378 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~ 378 (414)
|++.|-.+--...++..-|+..|.|..-++-+|=+||+.+|..||++-||+. =|.+..-.+.| +
T Consensus 527 PlRifaghlsDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G~~VRiF~GH~~----~V~al~~Sp~G-----------r- 590 (707)
T KOG0263|consen 527 PLRIFAGHLSDVDCVSFHPNSNYVATGSSDRTVRLWDVSTGNSVRIFTGHKG----PVTALAFSPCG-----------R- 590 (707)
T ss_pred chhhhcccccccceEEECCcccccccCCCCceEEEEEcCCCcEEEEecCCCC----ceEEEEEcCCC-----------c-
Confidence 5666666667778899999999999999999999999999999999999888 34443322222 1
Q ss_pred eEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 379 CLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 379 ~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
||+ -+--.|+|.+||+.+|+.|.+
T Consensus 591 --~La-Sg~ed~~I~iWDl~~~~~v~~ 614 (707)
T KOG0263|consen 591 --YLA-SGDEDGLIKIWDLANGSLVKQ 614 (707)
T ss_pred --eEe-ecccCCcEEEEEcCCCcchhh
Confidence 444 566799999999999988753
No 17
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=95.90 E-value=0.026 Score=58.60 Aligned_cols=96 Identities=19% Similarity=0.165 Sum_probs=76.1
Q ss_pred CCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcc-eeEeeecccccceeeEEEEEecccccccccccCCCC
Q 015021 297 ASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQAL-VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVK 375 (414)
Q Consensus 297 a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~-~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k 375 (414)
.+|++...-++.-..+|+.||||+++|..--.|-|.|+|-.+| ++.|-.+|++- ||....=.+- |..+.
T Consensus 147 eTp~~t~KgH~~WVlcvawsPDgk~iASG~~dg~I~lwdpktg~~~g~~l~gH~K----~It~Lawep~-----hl~p~- 216 (480)
T KOG0271|consen 147 ETPLFTCKGHKNWVLCVAWSPDGKKIASGSKDGSIRLWDPKTGQQIGRALRGHKK----WITALAWEPL-----HLVPP- 216 (480)
T ss_pred CCcceeecCCccEEEEEEECCCcchhhccccCCeEEEecCCCCCcccccccCccc----ceeEEeeccc-----ccCCC-
Confidence 3577778888889999999999999999999999999998776 57799999999 9987643221 11111
Q ss_pred CcceEEEEEEcCCCCeEEEeeCCCCcceeec
Q 015021 376 SDYCLCLAIHAPRKGIIEHLAGMADENWTTS 406 (414)
Q Consensus 376 ~~~~l~LvIyaprRg~lEvW~~~~g~~v~~~ 406 (414)
.. ++-.+.++|.+.||++..|.++-+.
T Consensus 217 --~r--~las~skDg~vrIWd~~~~~~~~~l 243 (480)
T KOG0271|consen 217 --CR--RLASSSKDGSVRIWDTKLGTCVRTL 243 (480)
T ss_pred --cc--ceecccCCCCEEEEEccCceEEEEe
Confidence 11 5567889999999999999998553
No 18
>PF12341 DUF3639: Protein of unknown function (DUF3639) ; InterPro: IPR022100 This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important.
Probab=95.79 E-value=0.014 Score=38.86 Aligned_cols=27 Identities=37% Similarity=0.455 Sum_probs=20.7
Q ss_pred CCeEEEEEeEEeCCcEEEEEeccccEEEEEe
Q 015021 76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYD 106 (414)
Q Consensus 76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt 106 (414)
+|.|+++. + ....|+|+||.+|||+||
T Consensus 1 gE~i~aia---~-g~~~vavaTS~~~lRifs 27 (27)
T PF12341_consen 1 GEEIEAIA---A-GDSWVAVATSAGYLRIFS 27 (27)
T ss_pred CceEEEEE---c-cCCEEEEEeCCCeEEecC
Confidence 46677664 3 334799999999999997
No 19
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=95.78 E-value=1.7 Score=48.95 Aligned_cols=112 Identities=14% Similarity=0.146 Sum_probs=72.4
Q ss_pred eeeeccCcceeeeeecceEEEEeecCCC----CCc--eeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEec
Q 015021 34 NLLCALDMHTIALANRYQTVIINWADPE----GLV--AKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYDL 107 (414)
Q Consensus 34 ~~~~sp~~~~la~A~~~~~v~~~w~~~~----~~~--v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte 107 (414)
.|..||+|..+|++.+..+-| |..++ ++| +--+-.+.. -+.||++.|. .|..++++|-.+=.+|+|..
T Consensus 101 ~i~fSPng~~fav~~gn~lqi--w~~P~~~~~~~~pFvl~r~~~g~--fddi~si~Ws--~DSr~l~~gsrD~s~rl~~v 174 (893)
T KOG0291|consen 101 AIKFSPNGKFFAVGCGNLLQI--WHAPGEIKNEFNPFVLHRTYLGH--FDDITSIDWS--DDSRLLVTGSRDLSARLFGV 174 (893)
T ss_pred eEEECCCCcEEEEEecceeEE--EecCcchhcccCcceEeeeecCC--ccceeEEEec--cCCceEEeccccceEEEEEe
Confidence 467899999999999997776 22222 233 112223322 3568988886 68899999999999999996
Q ss_pred CCcEE---eecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEeChhHH
Q 015021 108 KGDLV---HRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFDGSEIQ 160 (414)
Q Consensus 108 ~G~LL---~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~IdG~sL~ 160 (414)
++..= +--.=|.++|+.--+.. .+.+++.+..++.+..=..++.
T Consensus 175 ~~~k~~~~~~l~gHkd~VvacfF~~---------~~~~l~tvskdG~l~~W~~~~~ 221 (893)
T KOG0291|consen 175 DGNKNLFTYALNGHKDYVVACFFGA---------NSLDLYTVSKDGALFVWTCDLR 221 (893)
T ss_pred ccccccceEeccCCCcceEEEEecc---------CcceEEEEecCceEEEEEecCC
Confidence 54432 12234567777654322 1566888888875544444443
No 20
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=94.99 E-value=0.027 Score=60.27 Aligned_cols=77 Identities=18% Similarity=0.219 Sum_probs=57.8
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc-ceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD-ASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd-Aq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
--+..++++||-+++-.+=+.|-|.+||+.+-.+||-+|||-| |.| |.+ .+++ + -.--.
T Consensus 510 paCyALa~spDakvcFsccsdGnI~vwDLhnq~~VrqfqGhtDGasc--Idi---s~dG------------t---klWTG 569 (705)
T KOG0639|consen 510 PACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASC--IDI---SKDG------------T---KLWTG 569 (705)
T ss_pred hhhhhhhcCCccceeeeeccCCcEEEEEcccceeeecccCCCCCcee--EEe---cCCC------------c---eeecC
Confidence 4466789999999888888899999999999999999999999 222 111 1111 0 11224
Q ss_pred CCCCeEEEeeCCCCccee
Q 015021 387 PRKGIIEHLAGMADENWT 404 (414)
Q Consensus 387 prRg~lEvW~~~~g~~v~ 404 (414)
..+..|..||+|.|+++.
T Consensus 570 GlDntvRcWDlregrqlq 587 (705)
T KOG0639|consen 570 GLDNTVRCWDLREGRQLQ 587 (705)
T ss_pred CCccceeehhhhhhhhhh
Confidence 558899999999999874
No 21
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=94.81 E-value=0.097 Score=53.01 Aligned_cols=79 Identities=18% Similarity=0.034 Sum_probs=65.5
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK 389 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR 389 (414)
+-.+.-.++++-+.++-.+-||..+|+++|..+|=.||+-+ ++....+ .++.+.||.-+.-+
T Consensus 93 VM~l~~~~d~s~i~S~gtDk~v~~wD~~tG~~~rk~k~h~~----~vNs~~p--------------~rrg~~lv~SgsdD 154 (338)
T KOG0265|consen 93 VMELHGMRDGSHILSCGTDKTVRGWDAETGKRIRKHKGHTS----FVNSLDP--------------SRRGPQLVCSGSDD 154 (338)
T ss_pred eEeeeeccCCCEEEEecCCceEEEEecccceeeehhccccc----eeeecCc--------------cccCCeEEEecCCC
Confidence 33456678888888888889999999999999999999999 8877643 23566899999999
Q ss_pred CeEEEeeCCCCcceeec
Q 015021 390 GIIEHLAGMADENWTTS 406 (414)
Q Consensus 390 g~lEvW~~~~g~~v~~~ 406 (414)
|.+.+||||..+.+-|-
T Consensus 155 ~t~kl~D~R~k~~~~t~ 171 (338)
T KOG0265|consen 155 GTLKLWDIRKKEAIKTF 171 (338)
T ss_pred ceEEEEeecccchhhcc
Confidence 99999999977776553
No 22
>KOG2727 consensus Rab3 GTPase-activating protein, non-catalytic subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=94.67 E-value=0.039 Score=62.46 Aligned_cols=84 Identities=45% Similarity=0.833 Sum_probs=72.5
Q ss_pred CCeEEEEeCCeEEEEeChhHHHHHHHHHHhccccccCCCCccCCCcccccccCCccceecccCCCCCceeeEEeCCCC--
Q 015021 140 EEEVCVVMPGVLARFDGSEIQKMLQRWFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAITGLMP-- 217 (414)
Q Consensus 140 ~eel~Ily~~~iv~IdG~sL~~~Lr~c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~~G~~~-- 217 (414)
-++.+++-|+...+.++...+.+.+.+..+.++..|.++-..++..........|++..|+.-+..++.|..+.+++.
T Consensus 150 yp~~~~~I~g~sl~~~L~ncq~~Vqkaa~Eknsn~~~~~~~~qk~~l~qdi~~~I~hai~~~~~~ppt~Dq~vtas~~~g 229 (1244)
T KOG2727|consen 150 YPEICIVIPGVSLRFDLSNCQSMVQKAAQEKNSNFWDQKNRKQKAELTQDIYQRIPHAIWNVNKNPPTVDQTVTASMPPG 229 (1244)
T ss_pred ecceEEEECCchhhhhHHHHHHHHHHHHHhccCCcCCccchhhhhhcccchhhccchheeecccCCccHHHhhhcccCch
Confidence 456899999999999999999999999999999999999988887665556788999999999888889988887664
Q ss_pred --------CCchhh
Q 015021 218 --------PPLMEV 223 (414)
Q Consensus 218 --------p~~~d~ 223 (414)
||.+++
T Consensus 230 y~a~~k~SpPrySq 243 (1244)
T KOG2727|consen 230 YLALQKPSPPRYSQ 243 (1244)
T ss_pred hhhhccCCCcceee
Confidence 677777
No 23
>PTZ00421 coronin; Provisional
Probab=94.50 E-value=0.26 Score=52.86 Aligned_cols=81 Identities=16% Similarity=0.182 Sum_probs=59.4
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc---eeeEEEEEecccccccccccCCCCCcce
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA---SCVFMEMLVNKDAATSSAYYAPVKSDYC 379 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA---q~~wi~~~~~~~~~~~~~~~~~~k~~~~ 379 (414)
+..+...+.+|+.+|+|+++|++...|.|-++|+.++..++-++|+..+ .|.|.. + ..
T Consensus 164 l~~h~~~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~~~w~~-----~--------------~~ 224 (493)
T PTZ00421 164 IKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAK-----R--------------KD 224 (493)
T ss_pred EcCCCCceEEEEEECCCCEEEEecCCCEEEEEECCCCcEEEEEecCCCCcceEEEEcC-----C--------------CC
Confidence 4445566889999999999999999999999999999999999988764 355542 0 01
Q ss_pred EEEEEE--cCCCCeEEEeeCCCCcc
Q 015021 380 LCLAIH--APRKGIIEHLAGMADEN 402 (414)
Q Consensus 380 l~LvIy--aprRg~lEvW~~~~g~~ 402 (414)
+++..- .-.+|.|.+|++++..+
T Consensus 225 ~ivt~G~s~s~Dr~VklWDlr~~~~ 249 (493)
T PTZ00421 225 LIITLGCSKSQQRQIMLWDTRKMAS 249 (493)
T ss_pred eEEEEecCCCCCCeEEEEeCCCCCC
Confidence 122221 13478899999987653
No 24
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=94.47 E-value=0.28 Score=48.89 Aligned_cols=87 Identities=15% Similarity=0.095 Sum_probs=64.4
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcc-eeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQAL-VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD 377 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~-~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~ 377 (414)
|+.-|.-..|++.++-+||++++.|++.+.-.|.++.+.+. ..=+..+|+-. |+.--.=.. ...
T Consensus 207 P~~k~~ah~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~~kle~~l~gh~r----WvWdc~FS~-----------dg~ 271 (311)
T KOG0315|consen 207 PVHKFQAHNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDFFKLELVLTGHQR----WVWDCAFSA-----------DGE 271 (311)
T ss_pred EhhheecccceEEEEEECCCCcEEEeecCCceEEEEecCCceeeEEEeecCCc----eEEeeeecc-----------Ccc
Confidence 44447788899999999999999999999999999999987 33345555543 876332111 111
Q ss_pred ceEEEEEEcCCCCeEEEeeCCCCccee
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMADENWT 404 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~g~~v~ 404 (414)
|||- |..|+...+|++.+|+.|.
T Consensus 272 ---YlvT-assd~~~rlW~~~~~k~v~ 294 (311)
T KOG0315|consen 272 ---YLVT-ASSDHTARLWDLSAGKEVR 294 (311)
T ss_pred ---EEEe-cCCCCceeecccccCceee
Confidence 6655 5556999999999999874
No 25
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=94.23 E-value=0.26 Score=49.89 Aligned_cols=82 Identities=20% Similarity=0.246 Sum_probs=63.2
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP 387 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap 387 (414)
-+-..|.-||+|++..++++.+-++|+|.-+|.++.=+-||+.++--=+++.=..| ++ +|+-.-
T Consensus 188 ~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ftPd----------s~------Fvl~gs 251 (311)
T KOG1446|consen 188 AEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKSTFSGYPNAGNLPLSATFTPD----------SK------FVLSGS 251 (311)
T ss_pred cceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEeeeEeeccCCCCcceeEEECCC----------Cc------EEEEec
Confidence 34577899999999999999999999999999999999999887611122211111 11 556666
Q ss_pred CCCeEEEeeCCCCcceee
Q 015021 388 RKGIIEHLAGMADENWTT 405 (414)
Q Consensus 388 rRg~lEvW~~~~g~~v~~ 405 (414)
-+|.|-||++.+|++|+.
T Consensus 252 ~dg~i~vw~~~tg~~v~~ 269 (311)
T KOG1446|consen 252 DDGTIHVWNLETGKKVAV 269 (311)
T ss_pred CCCcEEEEEcCCCcEeeE
Confidence 789999999999999964
No 26
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=94.12 E-value=0.29 Score=49.26 Aligned_cols=88 Identities=11% Similarity=0.056 Sum_probs=74.1
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccc-ccceeeEEEEEecccccccccccCCCCCcce
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGY-RDASCVFMEMLVNKDAATSSAYYAPVKSDYC 379 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGy-RdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~ 379 (414)
.-|.-+.+...+++++|+++-++....+--|.|+|+-..+...|=.+. || |+....-.+ ++..
T Consensus 99 ~~f~GH~~dVlsva~s~dn~qivSGSrDkTiklwnt~g~ck~t~~~~~~~~----WVscvrfsP----------~~~~-- 162 (315)
T KOG0279|consen 99 RRFVGHTKDVLSVAFSTDNRQIVSGSRDKTIKLWNTLGVCKYTIHEDSHRE----WVSCVRFSP----------NESN-- 162 (315)
T ss_pred EEEEecCCceEEEEecCCCceeecCCCcceeeeeeecccEEEEEecCCCcC----cEEEEEEcC----------CCCC--
Confidence 347788899999999999999999999999999999999999999998 99 998764322 2111
Q ss_pred EEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 380 LCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 380 l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
..++-+.-++.|+||++++.+.-++
T Consensus 163 -p~Ivs~s~DktvKvWnl~~~~l~~~ 187 (315)
T KOG0279|consen 163 -PIIVSASWDKTVKVWNLRNCQLRTT 187 (315)
T ss_pred -cEEEEccCCceEEEEccCCcchhhc
Confidence 3778999999999999999887654
No 27
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=94.10 E-value=0.18 Score=49.88 Aligned_cols=84 Identities=12% Similarity=0.031 Sum_probs=64.8
Q ss_pred CCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCC
Q 015021 297 ASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKS 376 (414)
Q Consensus 297 a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~ 376 (414)
.+.+....-..++...+.++.|+.-+|..-..--|.+||+++|.++|=|.|+ +||+-=+...|+.
T Consensus 49 g~liktYsghG~EVlD~~~s~Dnskf~s~GgDk~v~vwDV~TGkv~Rr~rgH-~aqVNtV~fNees-------------- 113 (307)
T KOG0316|consen 49 GALIKTYSGHGHEVLDAALSSDNSKFASCGGDKAVQVWDVNTGKVDRRFRGH-LAQVNTVRFNEES-------------- 113 (307)
T ss_pred cceeeeecCCCceeeeccccccccccccCCCCceEEEEEcccCeeeeecccc-cceeeEEEecCcc--------------
Confidence 3455668888999999999999977777777788999999999999999998 5677666654432
Q ss_pred cceEEEEEEcCCCCeEEEeeCCC
Q 015021 377 DYCLCLAIHAPRKGIIEHLAGMA 399 (414)
Q Consensus 377 ~~~l~LvIyaprRg~lEvW~~~~ 399 (414)
-.|..+.-+..+.+|+.|.
T Consensus 114 ----SVv~SgsfD~s~r~wDCRS 132 (307)
T KOG0316|consen 114 ----SVVASGSFDSSVRLWDCRS 132 (307)
T ss_pred ----eEEEeccccceeEEEEccc
Confidence 1455566677777777664
No 28
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=93.94 E-value=0.18 Score=51.55 Aligned_cols=87 Identities=16% Similarity=0.163 Sum_probs=66.5
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
++-.|..- .+++..+|+|.+.|+.=..|||.++|..+-.+-||.-||-.+=+.=-. ..++ +
T Consensus 18 ~~tld~~~-a~~~~Fs~~G~~lAvGc~nG~vvI~D~~T~~iar~lsaH~~pi~sl~W----S~dg-----------r--- 78 (405)
T KOG1273|consen 18 THTLDNPL-AECCQFSRWGDYLAVGCANGRVVIYDFDTFRIARMLSAHVRPITSLCW----SRDG-----------R--- 78 (405)
T ss_pred ceeccCCc-cceEEeccCcceeeeeccCCcEEEEEccccchhhhhhccccceeEEEe----cCCC-----------C---
Confidence 34344444 899999999999999999999999999999999999887664332211 1111 1
Q ss_pred EEEEEcCCCCeEEEeeCCCCcceeeccceeee
Q 015021 381 CLAIHAPRKGIIEHLAGMADENWTTSLNYSVR 412 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~g~~v~~~~~~~~~ 412 (414)
+..-+.|+..+..|++..|-++ |.+|
T Consensus 79 -~LltsS~D~si~lwDl~~gs~l-----~rir 104 (405)
T KOG1273|consen 79 -KLLTSSRDWSIKLWDLLKGSPL-----KRIR 104 (405)
T ss_pred -EeeeecCCceeEEEeccCCCce-----eEEE
Confidence 4567899999999999999877 6665
No 29
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=93.84 E-value=0.54 Score=43.74 Aligned_cols=76 Identities=16% Similarity=0.138 Sum_probs=50.0
Q ss_pred eeEEEECCCCCEEEEE-cCCCcEEEEEcCcceeEeeec-ccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021 310 GERLTLSPSGSLAAIT-DSLGRILLLDTQALVVVRLWK-GYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP 387 (414)
Q Consensus 310 ~~~i~~sP~~~laa~t-D~lGRV~L~D~~~~~vvrmwK-GyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap 387 (414)
...|..+|+++++.++ ...++|.++|+.++.+++... |..--.+.| .+. ++ +|++-+.
T Consensus 209 ~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~~~~~~~~~~~------~~~---------g~-----~l~~~~~ 268 (300)
T TIGR03866 209 PVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVLDYLLVGQRVWQLAF------TPD---------EK-----YLLTTNG 268 (300)
T ss_pred ccceEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEEeCCCcceEEE------CCC---------CC-----EEEEEcC
Confidence 3468899999875543 456799999999999887643 321111111 111 11 3444456
Q ss_pred CCCeEEEeeCCCCcceee
Q 015021 388 RKGIIEHLAGMADENWTT 405 (414)
Q Consensus 388 rRg~lEvW~~~~g~~v~~ 405 (414)
..|.|.||++++|+++.+
T Consensus 269 ~~~~i~v~d~~~~~~~~~ 286 (300)
T TIGR03866 269 VSNDVSVIDVAALKVIKS 286 (300)
T ss_pred CCCeEEEEECCCCcEEEE
Confidence 789999999999998754
No 30
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=93.82 E-value=0.49 Score=44.02 Aligned_cols=75 Identities=15% Similarity=0.127 Sum_probs=51.0
Q ss_pred eEEEECCCCCEE-EEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021 311 ERLTLSPSGSLA-AITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK 389 (414)
Q Consensus 311 ~~i~~sP~~~la-a~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR 389 (414)
..+.++|+|+.+ ++....++|.++|+.++.+++.+....+. ..+.+. ++ + + +|++-....
T Consensus 34 ~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~~--~~~~~~---~~---------g--~---~l~~~~~~~ 94 (300)
T TIGR03866 34 RGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPDP--ELFALH---PN---------G--K---ILYIANEDD 94 (300)
T ss_pred CceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCCCc--cEEEEC---CC---------C--C---EEEEEcCCC
Confidence 468999999865 55677899999999999998877653331 122111 10 1 1 355555677
Q ss_pred CeEEEeeCCCCccee
Q 015021 390 GIIEHLAGMADENWT 404 (414)
Q Consensus 390 g~lEvW~~~~g~~v~ 404 (414)
+.|.+|++++++.+.
T Consensus 95 ~~l~~~d~~~~~~~~ 109 (300)
T TIGR03866 95 NLVTVIDIETRKVLA 109 (300)
T ss_pred CeEEEEECCCCeEEe
Confidence 899999999877653
No 31
>PTZ00420 coronin; Provisional
Probab=93.79 E-value=0.47 Score=51.96 Aligned_cols=87 Identities=10% Similarity=-0.059 Sum_probs=58.6
Q ss_pred CCccccCCCceeeEEEECCCCCE-EEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSL-AAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD 377 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~l-aa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~ 377 (414)
|+..+..+.+.+..|+.+|++.. +|++...|.|.|+|+.++..+..++.. + .+.-+... .++
T Consensus 117 p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~~~~~i~~~-~-~V~Slsws---pdG------------ 179 (568)
T PTZ00420 117 PQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQINMP-K-KLSSLKWN---IKG------------ 179 (568)
T ss_pred ceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEEEECCCCcEEEEEecC-C-cEEEEEEC---CCC------------
Confidence 33346667788999999999964 566778899999999999877766532 1 22111111 111
Q ss_pred ceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
. +++-+-++|.|.||++++|+.+.+
T Consensus 180 -~--lLat~s~D~~IrIwD~Rsg~~i~t 204 (568)
T PTZ00420 180 -N--LLSGTCVGKHMHIIDPRKQEIASS 204 (568)
T ss_pred -C--EEEEEecCCEEEEEECCCCcEEEE
Confidence 1 334456788999999999887743
No 32
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=93.72 E-value=5.8 Score=41.48 Aligned_cols=92 Identities=7% Similarity=-0.102 Sum_probs=61.2
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
-...|.+.-||++.|..++.=+|-++=++++....+..++--. -.+|..-.+..+ .....+.. ..+.+| .-+
T Consensus 237 wvr~v~v~~DGti~As~s~dqtl~vW~~~t~~~k~~lR~hEh~vEci~wap~~~~~~----i~~at~~~-~~~~~l-~s~ 310 (406)
T KOG0295|consen 237 WVRMVRVNQDGTIIASCSNDQTLRVWVVATKQCKAELREHEHPVECIAWAPESSYPS----ISEATGST-NGGQVL-GSG 310 (406)
T ss_pred hEEEEEecCCeeEEEecCCCceEEEEEeccchhhhhhhccccceEEEEecccccCcc----hhhccCCC-CCccEE-Eee
Confidence 5677889999999999999999999999999666665554331 123443222111 00011111 122244 568
Q ss_pred CCCCeEEEeeCCCCcceeec
Q 015021 387 PRKGIIEHLAGMADENWTTS 406 (414)
Q Consensus 387 prRg~lEvW~~~~g~~v~~~ 406 (414)
.|+++|++|+|++|.|+-|-
T Consensus 311 SrDktIk~wdv~tg~cL~tL 330 (406)
T KOG0295|consen 311 SRDKTIKIWDVSTGMCLFTL 330 (406)
T ss_pred cccceEEEEeccCCeEEEEE
Confidence 89999999999999998663
No 33
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=92.98 E-value=0.18 Score=33.93 Aligned_cols=36 Identities=25% Similarity=0.389 Sum_probs=32.7
Q ss_pred CccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEE
Q 015021 300 LTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLD 335 (414)
Q Consensus 300 l~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D 335 (414)
+..|......+.+|+.+|+++++|++...|.|.++|
T Consensus 4 ~~~~~~h~~~i~~i~~~~~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 4 VRTFRGHSSSINSIAWSPDGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EEEEESSSSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred EEEEcCCCCcEEEEEEecccccceeeCCCCEEEEEC
Confidence 456788889999999999999999999999999987
No 34
>PTZ00420 coronin; Provisional
Probab=92.96 E-value=0.99 Score=49.47 Aligned_cols=50 Identities=12% Similarity=0.143 Sum_probs=42.4
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc---eeeEEE
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA---SCVFME 357 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA---q~~wi~ 357 (414)
..+.+++.+|+|+++|++-..|.|.|+|+.++.+++.++||... .+-|+.
T Consensus 168 ~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~g~~~s~~v~~~ 220 (568)
T PTZ00420 168 KKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHDGGKNTKNIWID 220 (568)
T ss_pred CcEEEEEECCCCCEEEEEecCCEEEEEECCCCcEEEEEecccCCceeEEEEee
Confidence 35789999999999999888899999999999999999998763 444653
No 35
>PLN00181 protein SPA1-RELATED; Provisional
Probab=92.73 E-value=0.66 Score=51.92 Aligned_cols=79 Identities=14% Similarity=0.024 Sum_probs=57.9
Q ss_pred eeeEEEECCC-CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021 309 KGERLTLSPS-GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP 387 (414)
Q Consensus 309 ~~~~i~~sP~-~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap 387 (414)
...+++.+|. ++++|+++..|.|.|||+.++..++.++|+.+. |....-.+ . +. -+++-+.
T Consensus 534 ~v~~l~~~~~~~~~las~~~Dg~v~lWd~~~~~~~~~~~~H~~~----V~~l~~~p----------~--~~--~~L~Sgs 595 (793)
T PLN00181 534 KLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKR----VWSIDYSS----------A--DP--TLLASGS 595 (793)
T ss_pred ceeeEEeccCCCCEEEEEeCCCeEEEEECCCCeEEEEecCCCCC----EEEEEEcC----------C--CC--CEEEEEc
Confidence 4567777765 689999999999999999999999999999873 22211100 0 00 1456667
Q ss_pred CCCeEEEeeCCCCcceee
Q 015021 388 RKGIIEHLAGMADENWTT 405 (414)
Q Consensus 388 rRg~lEvW~~~~g~~v~~ 405 (414)
.+|.|.+|++++++++.+
T Consensus 596 ~Dg~v~iWd~~~~~~~~~ 613 (793)
T PLN00181 596 DDGSVKLWSINQGVSIGT 613 (793)
T ss_pred CCCEEEEEECCCCcEEEE
Confidence 799999999999887643
No 36
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=91.63 E-value=1.3 Score=46.82 Aligned_cols=70 Identities=19% Similarity=0.301 Sum_probs=50.1
Q ss_pred EEEEeEEe----CCcEEEEEeccccEEEEEecCC------cEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCC
Q 015021 80 TAIEWLVF----EEMRALAVGTSRGYFLVYDLKG------DLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPG 149 (414)
Q Consensus 80 Ts~~~lpl----~dw~~I~VG~ssG~vrfyte~G------~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~ 149 (414)
-++++-++ .+.-.|+||--+|++|+|..++ +||+...+ ..||++|.+..=.. ......|.||.|.
T Consensus 23 ~~l~v~~~~~~~~~~d~IivGS~~G~LrIy~P~~~~~~~~~lllE~~l-~~PILqv~~G~F~s----~~~~~~LaVLhP~ 97 (418)
T PF14727_consen 23 GSLCVGNLDNSPSGSDKIIVGSYSGILRIYDPSGNEFQPEDLLLETQL-KDPILQVECGKFVS----GSEDLQLAVLHPR 97 (418)
T ss_pred ceEEEEcccCCCCCccEEEEeccccEEEEEccCCCCCCCccEEEEEec-CCcEEEEEeccccC----CCCcceEEEecCC
Confidence 34545555 2446999999999999999533 47777666 58999999876321 1235689999999
Q ss_pred eEEEE
Q 015021 150 VLARF 154 (414)
Q Consensus 150 ~iv~I 154 (414)
.++..
T Consensus 98 kl~vY 102 (418)
T PF14727_consen 98 KLSVY 102 (418)
T ss_pred EEEEE
Confidence 86543
No 37
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=91.16 E-value=0.73 Score=50.87 Aligned_cols=80 Identities=15% Similarity=0.150 Sum_probs=66.7
Q ss_pred CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
+-.+-+++.-|+|..++..+.-+-+.|+|-.++.=+-=.+|||| -+.+....+++ . =++.+
T Consensus 171 k~siYSLA~N~t~t~ivsGgtek~lr~wDprt~~kimkLrGHTd----NVr~ll~~dDG-----------t----~~ls~ 231 (735)
T KOG0308|consen 171 KDSIYSLAMNQTGTIIVSGGTEKDLRLWDPRTCKKIMKLRGHTD----NVRVLLVNDDG-----------T----RLLSA 231 (735)
T ss_pred ccceeeeecCCcceEEEecCcccceEEeccccccceeeeecccc----ceEEEEEcCCC-----------C----eEeec
Confidence 34467889999999999999999999999998875555669999 88777655544 1 24789
Q ss_pred CCCCeEEEeeCCCCcceee
Q 015021 387 PRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 387 prRg~lEvW~~~~g~~v~~ 405 (414)
..+|.|.+|++.+-+|++|
T Consensus 232 sSDgtIrlWdLgqQrCl~T 250 (735)
T KOG0308|consen 232 SSDGTIRLWDLGQQRCLAT 250 (735)
T ss_pred CCCceEEeeeccccceeee
Confidence 9999999999999999998
No 38
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=91.03 E-value=0.7 Score=52.53 Aligned_cols=108 Identities=19% Similarity=0.131 Sum_probs=65.7
Q ss_pred eeeeeeecccc-----------cccCCCCCcccCCCeee---eccCcceeee-eecceEEE------EeecCCCCCceee
Q 015021 9 EVGSIACTDLS-----------DLGAGKEGWLVNDPNLL---CALDMHTIAL-ANRYQTVI------INWADPEGLVAKI 67 (414)
Q Consensus 9 ~~~~~~~~~~~-----------~~g~~~~~WL~~~~~~~---~sp~~~~la~-A~~~~~v~------~~w~~~~~~~v~~ 67 (414)
++|||-|-+-+ +.|.-+..=+.|.-.+- +|-.|-++|- +-++|... ..|++. -.|
T Consensus 442 ~vGiI~t~~~e~~~ssIdVeFHD~sihr~~H~~d~~~y~lA~ls~~g~llAsp~s~sk~~sil~~~h~~w~s~----seW 517 (933)
T KOG1274|consen 442 EVGIIRTVVNEANDSSIDVEFHDTSIHRAYHFSDLFGYELADLSEKGTLLASPESESKLGSILYRAHFSWDSH----SEW 517 (933)
T ss_pred ccceEEEEeccCcCceEEEEEeccCccceeeeeccccceeeeccccceEEecccccCCcceEEEEcccCcccc----cce
Confidence 57888853322 23433555554422222 3334433332 33444444 236553 357
Q ss_pred cccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEE
Q 015021 68 RPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRV 127 (414)
Q Consensus 68 ~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~ 127 (414)
.=.|.. +|.|++|++-+. .|+|+|+.||||+||..|.-..-- =|+.||+...+
T Consensus 518 tm~lP~--~E~~~~V~~t~~----~Vav~TS~~~lRvFt~gGvq~~I~-t~~gP~vtaag 570 (933)
T KOG1274|consen 518 TMILPL--QESIEAVAATSG----WVAVATSLGYLRVFTIGGVQREIF-TLPGPVVTAAG 570 (933)
T ss_pred eeecCC--CCceeEEEccCc----EEEEEeccceEEEEEecceeeeEe-ecccceEEeec
Confidence 767744 488999977544 799999999999999999754332 45789998873
No 39
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=90.63 E-value=0.93 Score=47.84 Aligned_cols=76 Identities=17% Similarity=0.147 Sum_probs=58.9
Q ss_pred CceeeEEEECCCCCEEEEE--cCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021 307 PRKGERLTLSPSGSLAAIT--DSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI 384 (414)
Q Consensus 307 ~R~~~~i~~sP~~~laa~t--D~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI 384 (414)
.+...+|+--|||.++++. |++||| +|+.+|.-|-...||-+.=++ +.. .+. -|.+.
T Consensus 303 s~~v~~iaf~~DGSL~~tGGlD~~~Rv--WDlRtgr~im~L~gH~k~I~~-V~f---sPN---------------Gy~lA 361 (459)
T KOG0272|consen 303 SKGVFSIAFQPDGSLAATGGLDSLGRV--WDLRTGRCIMFLAGHIKEILS-VAF---SPN---------------GYHLA 361 (459)
T ss_pred ccccceeEecCCCceeeccCccchhhe--eecccCcEEEEecccccceee-EeE---CCC---------------ceEEe
Confidence 4667789999999999994 999996 899999999999998772111 111 111 17888
Q ss_pred EcCCCCeEEEeeCCCCcce
Q 015021 385 HAPRKGIIEHLAGMADENW 403 (414)
Q Consensus 385 yaprRg~lEvW~~~~g~~v 403 (414)
-...+++..||+++--..+
T Consensus 362 Tgs~Dnt~kVWDLR~r~~l 380 (459)
T KOG0272|consen 362 TGSSDNTCKVWDLRMRSEL 380 (459)
T ss_pred ecCCCCcEEEeeecccccc
Confidence 8899999999999876665
No 40
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=90.55 E-value=0.44 Score=50.09 Aligned_cols=79 Identities=16% Similarity=0.120 Sum_probs=62.4
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccc--cceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYR--DASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyR--dAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
.+..++.||+-..++.+..+|+|.++|.....-=|+..|+= =-.|.|.. .| -|+.-+
T Consensus 182 aIRdlafSpnDskF~t~SdDg~ikiWdf~~~kee~vL~GHgwdVksvdWHP----------------~k-----gLiasg 240 (464)
T KOG0284|consen 182 AIRDLAFSPNDSKFLTCSDDGTIKIWDFRMPKEERVLRGHGWDVKSVDWHP----------------TK-----GLIASG 240 (464)
T ss_pred hhheeccCCCCceeEEecCCCeEEEEeccCCchhheeccCCCCcceeccCC----------------cc-----ceeEEc
Confidence 45678999988888888889999999999988888887762 11223332 12 488999
Q ss_pred CCCCeEEEeeCCCCcceeeccc
Q 015021 387 PRKGIIEHLAGMADENWTTSLN 408 (414)
Q Consensus 387 prRg~lEvW~~~~g~~v~~~~~ 408 (414)
.++..|++||.++|+|++|...
T Consensus 241 skDnlVKlWDprSg~cl~tlh~ 262 (464)
T KOG0284|consen 241 SKDNLVKLWDPRSGSCLATLHG 262 (464)
T ss_pred cCCceeEeecCCCcchhhhhhh
Confidence 9999999999999999998753
No 41
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=89.89 E-value=1.9 Score=44.56 Aligned_cols=50 Identities=18% Similarity=0.180 Sum_probs=36.5
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccce
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDAS 352 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq 352 (414)
++-...--..+..+|||+++-++...|.|.+||+.++.+++-.|--.++.
T Consensus 32 i~~~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~~~ 81 (369)
T PF02239_consen 32 IPTGGAPHAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGNPR 81 (369)
T ss_dssp EE-STTEEEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSEEE
T ss_pred EcCCCCceeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCCcc
Confidence 33333333456789999999999889999999999999999888776653
No 42
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=89.83 E-value=3.6 Score=43.97 Aligned_cols=203 Identities=18% Similarity=0.187 Sum_probs=114.4
Q ss_pred CCcEEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCe-EEEEeChhHHHHHHH
Q 015021 88 EEMRALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGV-LARFDGSEIQKMLQR 165 (414)
Q Consensus 88 ~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~-iv~IdG~sL~~~Lr~ 165 (414)
|+-.+++.|+-+|-+.... -+|.||-.-+-|.-+|.+|++-.-.. -+.=--.++ |..-+-.+|.+.
T Consensus 91 n~G~~l~ag~i~g~lYlWelssG~LL~v~~aHYQ~ITcL~fs~dgs---------~iiTgskDg~V~vW~l~~lv~a--- 158 (476)
T KOG0646|consen 91 NLGYFLLAGTISGNLYLWELSSGILLNVLSAHYQSITCLKFSDDGS---------HIITGSKDGAVLVWLLTDLVSA--- 158 (476)
T ss_pred CCceEEEeecccCcEEEEEeccccHHHHHHhhccceeEEEEeCCCc---------EEEecCCCccEEEEEEEeeccc---
Confidence 6678999999999999999 59999988899999999999654322 111111111 211111222211
Q ss_pred HHHhccccccCCCCccCCCcccccccCCccceecccCCCCCceeeEE-eCCCCCCchhhhccccceeEEEeCCCceeEEE
Q 015021 166 WFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAI-TGLMPPPLMEVQSSQRYFCAVTIGEDSVISAF 244 (414)
Q Consensus 166 c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~-~G~~~p~~~d~~s~~~~~~~i~vG~~P~la~y 244 (414)
+ +...+.|+.+|+= ..-+|.|.-| .|... .++++++.+-.+-.|
T Consensus 159 --------------------~--~~~~~~p~~~f~~-HtlsITDl~ig~Gg~~------------~rl~TaS~D~t~k~w 203 (476)
T KOG0646|consen 159 --------------------D--NDHSVKPLHIFSD-HTLSITDLQIGSGGTN------------ARLYTASEDRTIKLW 203 (476)
T ss_pred --------------------c--cCCCccceeeecc-CcceeEEEEecCCCcc------------ceEEEecCCceEEEE
Confidence 1 1124466667752 3347888877 33211 125677777666666
Q ss_pred EeccCCCcchhhhhhhhhhHHHHHHhhhhh--------------hhhhcCC--C-CCCCCCCCCccc-ccCCCCccccCC
Q 015021 245 RLSEDRSRSLVGAILSKVVPATFSTISSLS--------------KMIWRSE--Q-SPKKSEPKPQSF-ARASPLTCLKDH 306 (414)
Q Consensus 245 ~~~~~~~~s~~~al~S~va~av~S~v~S~a--------------k~~W~~~--~-~~~~~e~kp~~~-~~a~pl~~l~D~ 306 (414)
-++.+. + +-.++.-+ .+++++ +..|-.+ + +.+..-.+.+.. +--+....|.-+
T Consensus 204 dlS~g~---L----Llti~fp~--si~av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh 274 (476)
T KOG0646|consen 204 DLSLGV---L----LLTITFPS--SIKAVALDPAERVVYIGTEEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGH 274 (476)
T ss_pred Eeccce---e----eEEEecCC--cceeEEEcccccEEEecCCcceEEeeehhcCCcccccccccccccccceeeeeccc
Confidence 655442 1 11110000 011110 0011000 0 111110111110 112233446666
Q ss_pred Cc--eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeec
Q 015021 307 PR--KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWK 346 (414)
Q Consensus 307 ~R--~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwK 346 (414)
.. .+.+++++-||++.+..|-.|.|.++|+...++||.--
T Consensus 275 ~~~~~ITcLais~DgtlLlSGd~dg~VcvWdi~S~Q~iRtl~ 316 (476)
T KOG0646|consen 275 ENESAITCLAISTDGTLLLSGDEDGKVCVWDIYSKQCIRTLQ 316 (476)
T ss_pred cCCcceeEEEEecCccEEEeeCCCCCEEEEecchHHHHHHHh
Confidence 55 79999999999999999999999999999999999643
No 43
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=88.95 E-value=2.2 Score=43.10 Aligned_cols=78 Identities=8% Similarity=0.053 Sum_probs=66.6
Q ss_pred CceeeEEEECCC--CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021 307 PRKGERLTLSPS--GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI 384 (414)
Q Consensus 307 ~R~~~~i~~sP~--~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI 384 (414)
+--..++..+|+ +-+++.+-.++-|=++|+.+-++..++=|+-. -+...+..++| .|.+
T Consensus 148 ~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh~~----~v~t~~vSpDG---------------slca 208 (315)
T KOG0279|consen 148 REWVSCVRFSPNESNPIIVSASWDKTVKVWNLRNCQLRTTFIGHSG----YVNTVTVSPDG---------------SLCA 208 (315)
T ss_pred cCcEEEEEEcCCCCCcEEEEccCCceEEEEccCCcchhhccccccc----cEEEEEECCCC---------------CEEe
Confidence 566899999999 57888899999999999999999999999766 66666555444 3789
Q ss_pred EcCCCCeEEEeeCCCCcce
Q 015021 385 HAPRKGIIEHLAGMADENW 403 (414)
Q Consensus 385 yaprRg~lEvW~~~~g~~v 403 (414)
|..++|.+-+|++..|+++
T Consensus 209 sGgkdg~~~LwdL~~~k~l 227 (315)
T KOG0279|consen 209 SGGKDGEAMLWDLNEGKNL 227 (315)
T ss_pred cCCCCceEEEEEccCCcee
Confidence 9999999999999999998
No 44
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=88.75 E-value=0.63 Score=46.74 Aligned_cols=43 Identities=30% Similarity=0.428 Sum_probs=39.5
Q ss_pred cCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeec
Q 015021 304 KDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWK 346 (414)
Q Consensus 304 ~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwK 346 (414)
.+.+-.+-++.+||+|++.|+...-|+|.|+++-+....|-||
T Consensus 226 ~~~~d~i~kmSlSPdg~~La~ih~sG~lsLW~iPsL~~~~~W~ 268 (282)
T PF15492_consen 226 GQEQDGIFKMSLSPDGSLLACIHFSGSLSLWEIPSLRLQRSWK 268 (282)
T ss_pred ccCCCceEEEEECCCCCEEEEEEcCCeEEEEecCcchhhcccc
Confidence 4556678899999999999999999999999999999999996
No 45
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.54 E-value=1.5 Score=44.05 Aligned_cols=88 Identities=15% Similarity=0.109 Sum_probs=67.3
Q ss_pred CCCCccccCCCceeeEEEECCCCC-EEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCC
Q 015021 297 ASPLTCLKDHPRKGERLTLSPSGS-LAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVK 375 (414)
Q Consensus 297 a~pl~~l~D~~R~~~~i~~sP~~~-laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k 375 (414)
+.|+.++.+++|++.++--.+..+ -..++.=+|.|=|+|-.+..-|+-++|+++.=.+=. ..+.
T Consensus 94 s~Pi~~~kEH~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~Sv~Tf~gh~~~Iy~a~--~sp~------------- 158 (311)
T KOG0277|consen 94 SKPIHKFKEHKREVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPNSVQTFNGHNSCIYQAA--FSPH------------- 158 (311)
T ss_pred CcchhHHHhhhhheEEeccccccceeEEeeccCCceEeecCCCCcceEeecCCccEEEEEe--cCCC-------------
Confidence 458899999999999999999875 445567789999999999999999999998422111 1000
Q ss_pred CcceEEEEEEcCCCCeEEEeeCCC-Ccc
Q 015021 376 SDYCLCLAIHAPRKGIIEHLAGMA-DEN 402 (414)
Q Consensus 376 ~~~~l~LvIyaprRg~lEvW~~~~-g~~ 402 (414)
.. =|+-++.-+|.+.+|+++. |+.
T Consensus 159 -~~--nlfas~Sgd~~l~lwdvr~~gk~ 183 (311)
T KOG0277|consen 159 -IP--NLFASASGDGTLRLWDVRSPGKF 183 (311)
T ss_pred -CC--CeEEEccCCceEEEEEecCCCce
Confidence 01 2778999999999999886 443
No 46
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=88.25 E-value=2.5 Score=42.57 Aligned_cols=76 Identities=18% Similarity=0.152 Sum_probs=49.0
Q ss_pred eeeccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEe
Q 015021 35 LLCALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVH 113 (414)
Q Consensus 35 ~~~sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~ 113 (414)
++++.+|.+||+-+++.+-|-.=.++...-+.=|..-.+...| =.=++| -+|.+.+|.-.++|+|+||+..|..||
T Consensus 3 ~~~~~~Gk~lAi~qd~~iEiRsa~Ddf~si~~kcqVpkD~~PQ-WRkl~W--SpD~tlLa~a~S~G~i~vfdl~g~~lf 78 (282)
T PF15492_consen 3 LALSSDGKLLAILQDQCIEIRSAKDDFSSIIGKCQVPKDPNPQ-WRKLAW--SPDCTLLAYAESTGTIRVFDLMGSELF 78 (282)
T ss_pred eeecCCCcEEEEEeccEEEEEeccCCchheeEEEecCCCCCch-heEEEE--CCCCcEEEEEcCCCeEEEEecccceeE
Confidence 6789999999999999988832222211111112211111112 111222 189999999999999999999998888
No 47
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=88.14 E-value=1.2 Score=47.67 Aligned_cols=67 Identities=18% Similarity=0.220 Sum_probs=51.9
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP 387 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap 387 (414)
...+..||||++++..|+.|+|..+|-.+-.++.-||+|-.. +..|+... +. -||-+.
T Consensus 435 s~~v~fSpDG~~l~SGdsdG~v~~wdwkt~kl~~~lkah~~~ci~v~wHP~e-------------~S-------kvat~~ 494 (503)
T KOG0282|consen 435 SCQVDFSPDGRTLCSGDSDGKVNFWDWKTTKLVSKLKAHDQPCIGVDWHPVE-------------PS-------KVATCG 494 (503)
T ss_pred eeeEEEcCCCCeEEeecCCccEEEeechhhhhhhccccCCcceEEEEecCCC-------------cc-------eeEecc
Confidence 345778999999999999999999999999999999999331 23454321 12 346667
Q ss_pred CCCeEEEee
Q 015021 388 RKGIIEHLA 396 (414)
Q Consensus 388 rRg~lEvW~ 396 (414)
=.|.|.||+
T Consensus 495 w~G~Ikiwd 503 (503)
T KOG0282|consen 495 WDGLIKIWD 503 (503)
T ss_pred cCceeEecC
Confidence 789999995
No 48
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=88.07 E-value=2.2 Score=47.79 Aligned_cols=82 Identities=13% Similarity=0.133 Sum_probs=64.3
Q ss_pred CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021 305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI 384 (414)
Q Consensus 305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI 384 (414)
-+...+.+++++|+.+|+|+..-+--.=++|+.++.++-+.+|||.- -|-.-+... + -++-
T Consensus 461 aHdKdIN~Vaia~ndkLiAT~SqDktaKiW~le~~~l~~vLsGH~RG--vw~V~Fs~~--------------d---q~la 521 (775)
T KOG0319|consen 461 AHDKDINCVAIAPNDKLIATGSQDKTAKIWDLEQLRLLGVLSGHTRG--VWCVSFSKN--------------D---QLLA 521 (775)
T ss_pred hhcccccceEecCCCceEEecccccceeeecccCceEEEEeeCCccc--eEEEEeccc--------------c---ceeE
Confidence 34577899999999999999877777778888899999999999982 133222111 1 2666
Q ss_pred EcCCCCeEEEeeCCCCcceee
Q 015021 385 HAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 385 yaprRg~lEvW~~~~g~~v~~ 405 (414)
-+.-+..|+||++.++.|+-|
T Consensus 522 T~SgD~TvKIW~is~fSClkT 542 (775)
T KOG0319|consen 522 TCSGDKTVKIWSISTFSCLKT 542 (775)
T ss_pred eccCCceEEEEEeccceeeee
Confidence 778899999999999999866
No 49
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=87.56 E-value=3.1 Score=46.71 Aligned_cols=69 Identities=23% Similarity=0.370 Sum_probs=49.1
Q ss_pred CCcEEEEEeccccEEEEEecCCcEEeecc-cCcc--------ceeEEEEecccCCCCCCCCCCeEEEEeCCe-EEEEeC-
Q 015021 88 EEMRALAVGTSRGYFLVYDLKGDLVHRQL-IHPG--------RILKLRVRGSRRDLTQDTAEEEVCVVMPGV-LARFDG- 156 (414)
Q Consensus 88 ~dw~~I~VG~ssG~vrfyte~G~LL~sQ~-lh~~--------pV~~ik~r~~~~~~~~~~~~eel~Ily~~~-iv~IdG- 156 (414)
||-...+||+=.|+.|||+-.|..|..|. ++.. .|..|.+.... .++|.|--.+- |=++||
T Consensus 461 PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~--------~~~vLVTSnDSrIRI~d~~ 532 (712)
T KOG0283|consen 461 PDGKGAVIGTFNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGD--------PDEVLVTSNDSRIRIYDGR 532 (712)
T ss_pred cCCceEEEEEeccEEEEEEccCCeEEEeeeEeeccCccccCceeeeeEecCCC--------CCeEEEecCCCceEEEecc
Confidence 66778999999999999999999888873 3222 45555544433 33788888886 778899
Q ss_pred -hhHHHHHH
Q 015021 157 -SEIQKMLQ 164 (414)
Q Consensus 157 -~sL~~~Lr 164 (414)
-+|-.-++
T Consensus 533 ~~~lv~KfK 541 (712)
T KOG0283|consen 533 DKDLVHKFK 541 (712)
T ss_pred chhhhhhhc
Confidence 66654443
No 50
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=86.82 E-value=4.7 Score=42.21 Aligned_cols=50 Identities=12% Similarity=0.062 Sum_probs=44.3
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD 350 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd 350 (414)
+.+.-+-.-+.+|.++|.+.|+|....++-|=++|+++|+..--.-|+-.
T Consensus 145 rVi~gHlgWVr~vavdP~n~wf~tgs~DrtikIwDlatg~LkltltGhi~ 194 (460)
T KOG0285|consen 145 RVISGHLGWVRSVAVDPGNEWFATGSADRTIKIWDLATGQLKLTLTGHIE 194 (460)
T ss_pred hhhhhccceEEEEeeCCCceeEEecCCCceeEEEEcccCeEEEeecchhh
Confidence 45777788899999999999999999999999999999999887777643
No 51
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=86.76 E-value=3.3 Score=45.01 Aligned_cols=40 Identities=25% Similarity=0.343 Sum_probs=37.3
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeE
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVV 342 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vv 342 (414)
+.+++-....|+.||++.++|++|..+.|+|+|+++..+-
T Consensus 483 ~~~h~a~iT~vaySpd~~yla~~Da~rkvv~yd~~s~~~~ 522 (603)
T KOG0318|consen 483 LLEHRAAITDVAYSPDGAYLAAGDASRKVVLYDVASREVK 522 (603)
T ss_pred eecccCCceEEEECCCCcEEEEeccCCcEEEEEcccCcee
Confidence 6777888999999999999999999999999999999983
No 52
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=86.67 E-value=0.58 Score=46.93 Aligned_cols=47 Identities=23% Similarity=0.377 Sum_probs=40.6
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW 345 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw 345 (414)
|+.+|.-++-.+..|..+|+|+++|+.-.+.-|-|+|+..++.+|+.
T Consensus 181 pv~si~AH~snCicI~f~p~GryfA~GsADAlvSLWD~~ELiC~R~i 227 (313)
T KOG1407|consen 181 PVQSIKAHPSNCICIEFDPDGRYFATGSADALVSLWDVDELICERCI 227 (313)
T ss_pred cccccccCCcceEEEEECCCCceEeeccccceeeccChhHhhhheee
Confidence 44456666667889999999999999888889999999999999987
No 53
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=86.29 E-value=12 Score=42.52 Aligned_cols=88 Identities=16% Similarity=0.215 Sum_probs=63.9
Q ss_pred eeeccCcceeeeeecc--eEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEe-CCcEEEEEeccccEEEEEec-CCc
Q 015021 35 LLCALDMHTIALANRY--QTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVF-EEMRALAVGTSRGYFLVYDL-KGD 110 (414)
Q Consensus 35 ~~~sp~~~~la~A~~~--~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl-~dw~~I~VG~ssG~vrfyte-~G~ 110 (414)
+++..+||.||++... .++|=.|-+. .+-++=-|-. .+||++ .. ||-..||-|-.+|-|++|.. +|-
T Consensus 313 ~~~N~tGDWiA~g~~klgQLlVweWqsE-sYVlKQQgH~-----~~i~~l---~YSpDgq~iaTG~eDgKVKvWn~~Sgf 383 (893)
T KOG0291|consen 313 VSFNSTGDWIAFGCSKLGQLLVWEWQSE-SYVLKQQGHS-----DRITSL---AYSPDGQLIATGAEDGKVKVWNTQSGF 383 (893)
T ss_pred EEecccCCEEEEcCCccceEEEEEeecc-ceeeeccccc-----cceeeE---EECCCCcEEEeccCCCcEEEEeccCce
Confidence 4567889999999988 7777778554 2223333333 335544 44 78899999999999999995 777
Q ss_pred EEeecccCccceeEEEEeccc
Q 015021 111 LVHRQLIHPGRILKLRVRGSR 131 (414)
Q Consensus 111 LL~sQ~lh~~pV~~ik~r~~~ 131 (414)
++..=-=|.+-|..+++--..
T Consensus 384 C~vTFteHts~Vt~v~f~~~g 404 (893)
T KOG0291|consen 384 CFVTFTEHTSGVTAVQFTARG 404 (893)
T ss_pred EEEEeccCCCceEEEEEEecC
Confidence 777666678888888865544
No 54
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=86.28 E-value=5.6 Score=37.12 Aligned_cols=75 Identities=15% Similarity=0.244 Sum_probs=50.6
Q ss_pred eeEEEECCCCCEEEEE--cCC-CcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 310 GERLTLSPSGSLAAIT--DSL-GRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 310 ~~~i~~sP~~~laa~t--D~l-GRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
...|.-||+|+++|++ +++ |.|-++|+.+...+.-.+-..-..|.|-. ++ ++ +.-..-+
T Consensus 103 ~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~~~~i~~~~~~~~t~~~WsP------dG-----------r~-~~ta~t~ 164 (194)
T PF08662_consen 103 RNTISWSPDGRFLVLAGFGNLNGDLEFWDVRKKKKISTFEHSDATDVEWSP------DG-----------RY-LATATTS 164 (194)
T ss_pred ceEEEECCCCCEEEEEEccCCCcEEEEEECCCCEEeeccccCcEEEEEEcC------CC-----------CE-EEEEEec
Confidence 3579999999998886 344 99999999999998877655433444431 11 11 1222334
Q ss_pred CC---CCeEEEeeCCCCcce
Q 015021 387 PR---KGIIEHLAGMADENW 403 (414)
Q Consensus 387 pr---Rg~lEvW~~~~g~~v 403 (414)
|| +.-+.||+. +|+++
T Consensus 165 ~r~~~dng~~Iw~~-~G~~l 183 (194)
T PF08662_consen 165 PRLRVDNGFKIWSF-QGRLL 183 (194)
T ss_pred cceeccccEEEEEe-cCeEe
Confidence 54 366899998 58888
No 55
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=86.21 E-value=5.3 Score=37.29 Aligned_cols=76 Identities=16% Similarity=0.163 Sum_probs=48.6
Q ss_pred eeeEEEECCCCCE-EEEE-cCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE--
Q 015021 309 KGERLTLSPSGSL-AAIT-DSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI-- 384 (414)
Q Consensus 309 ~~~~i~~sP~~~l-aa~t-D~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI-- 384 (414)
.+..++=+|+|+. |+++ +.-++|.|+|+..-.+...=++.++ .+.|=. . + . +|++
T Consensus 61 ~I~~~~WsP~g~~favi~g~~~~~v~lyd~~~~~i~~~~~~~~n-~i~wsP------~---------G---~--~l~~~g 119 (194)
T PF08662_consen 61 PIHDVAWSPNGNEFAVIYGSMPAKVTLYDVKGKKIFSFGTQPRN-TISWSP------D---------G---R--FLVLAG 119 (194)
T ss_pred ceEEEEECcCCCEEEEEEccCCcccEEEcCcccEeEeecCCCce-EEEECC------C---------C---C--EEEEEE
Confidence 4889999999964 4553 5567999999973333333344444 233321 1 1 1 3444
Q ss_pred EcCCCCeEEEeeCCCCcceee
Q 015021 385 HAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 385 yaprRg~lEvW~~~~g~~v~~ 405 (414)
+.-..|.|++|+++..++|.+
T Consensus 120 ~~n~~G~l~~wd~~~~~~i~~ 140 (194)
T PF08662_consen 120 FGNLNGDLEFWDVRKKKKIST 140 (194)
T ss_pred ccCCCcEEEEEECCCCEEeec
Confidence 344568899999999998865
No 56
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=85.97 E-value=1.5 Score=46.32 Aligned_cols=42 Identities=17% Similarity=0.186 Sum_probs=39.8
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA 351 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA 351 (414)
-.+++.||++.|+|+.-..|+|.+|++.+|.+..+.|+-+..
T Consensus 390 wtrvvfSpd~~YvaAGS~dgsv~iW~v~tgKlE~~l~~s~s~ 431 (459)
T KOG0288|consen 390 WTRVVFSPDGSYVAAGSADGSVYIWSVFTGKLEKVLSLSTSN 431 (459)
T ss_pred cceeEECCCCceeeeccCCCcEEEEEccCceEEEEeccCCCC
Confidence 677899999999999999999999999999999999998886
No 57
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=85.78 E-value=4.5 Score=41.95 Aligned_cols=67 Identities=13% Similarity=0.074 Sum_probs=51.0
Q ss_pred CCCEEEEEcCC-----CcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC-----
Q 015021 318 SGSLAAITDSL-----GRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP----- 387 (414)
Q Consensus 318 ~~~laa~tD~l-----GRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap----- 387 (414)
+++++-++|.- |||.+||..++.++.|+..-+..+. + +. .| .. +|.+=.+
T Consensus 11 ~~~~v~V~d~~~~~~~~~v~ViD~~~~~v~g~i~~G~~P~~--~-~s--pD-------------g~--~lyva~~~~~R~ 70 (352)
T TIGR02658 11 DARRVYVLDPGHFAATTQVYTIDGEAGRVLGMTDGGFLPNP--V-VA--SD-------------GS--FFAHASTVYSRI 70 (352)
T ss_pred CCCEEEEECCcccccCceEEEEECCCCEEEEEEEccCCCce--e-EC--CC-------------CC--EEEEEecccccc
Confidence 56889999987 9999999999999999976665553 2 11 11 11 4556666
Q ss_pred ----CCCeEEEeeCCCCccee
Q 015021 388 ----RKGIIEHLAGMADENWT 404 (414)
Q Consensus 388 ----rRg~lEvW~~~~g~~v~ 404 (414)
|.+.|+|||+++++.+.
T Consensus 71 ~~G~~~d~V~v~D~~t~~~~~ 91 (352)
T TIGR02658 71 ARGKRTDYVEVIDPQTHLPIA 91 (352)
T ss_pred ccCCCCCEEEEEECccCcEEe
Confidence 88999999999998874
No 58
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=84.91 E-value=2.6 Score=44.00 Aligned_cols=98 Identities=14% Similarity=0.012 Sum_probs=68.9
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEE-----EEEeccccc-------ccc-------
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFM-----EMLVNKDAA-------TSS------- 368 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi-----~~~~~~~~~-------~~~------- 368 (414)
-.|..+|+..+.++.+...++..|.++|+++|.++...=|+-||-.+-. .+.+..|.. +.+
T Consensus 236 HtGSVLCLqyd~rviisGSSDsTvrvWDv~tge~l~tlihHceaVLhlrf~ng~mvtcSkDrsiaVWdm~sps~it~rrV 315 (499)
T KOG0281|consen 236 HTGSVLCLQYDERVIVSGSSDSTVRVWDVNTGEPLNTLIHHCEAVLHLRFSNGYMVTCSKDRSIAVWDMASPTDITLRRV 315 (499)
T ss_pred CCCcEEeeeccceEEEecCCCceEEEEeccCCchhhHHhhhcceeEEEEEeCCEEEEecCCceeEEEeccCchHHHHHHH
Confidence 3477899999999999989999999999999999999999998866532 222222211 000
Q ss_pred ---cccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 369 ---AYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 369 ---~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
........++-==+++-|.-+..|+||++.+|+.|-|
T Consensus 316 LvGHrAaVNvVdfd~kyIVsASgDRTikvW~~st~efvRt 355 (499)
T KOG0281|consen 316 LVGHRAAVNVVDFDDKYIVSASGDRTIKVWSTSTCEFVRT 355 (499)
T ss_pred HhhhhhheeeeccccceEEEecCCceEEEEeccceeeehh
Confidence 0111111111112778999999999999999999965
No 59
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=84.83 E-value=4.8 Score=42.20 Aligned_cols=64 Identities=31% Similarity=0.435 Sum_probs=37.5
Q ss_pred eeeeeecc-eEEEEeecCCC-CCceeecc-cCCCCCCCeEEEEEeEEe---CCc---EEEEEeccccEEEEEe
Q 015021 43 TIALANRY-QTVIINWADPE-GLVAKIRP-ELSPIASEYITAIEWLVF---EEM---RALAVGTSRGYFLVYD 106 (414)
Q Consensus 43 ~la~A~~~-~~v~~~w~~~~-~~~v~~~g-~l~~~~~e~ITs~~~lpl---~dw---~~I~VG~ssG~vrfyt 106 (414)
.+||||++ .++|.--.... .++...+. -++...++.||+++|-.+ .|- .|+.|||+.|.|..|.
T Consensus 99 Fvaigy~~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~ySSi~L~vGTn~G~v~~fk 171 (395)
T PF08596_consen 99 FVAIGYESGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYSSICLLVGTNSGNVLTFK 171 (395)
T ss_dssp EEEEEETTSEEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSEEEEEEEEETTSEEEEEE
T ss_pred EEEEEecCCcEEEEECCCCeEEeeccccccccccccccCeeEEEEEEEecCCCcccceEEEEEeCCCCEEEEE
Confidence 45788865 45555442222 12222222 133345689999999877 333 8999999999999996
No 60
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=84.47 E-value=1.9 Score=32.06 Aligned_cols=36 Identities=25% Similarity=0.478 Sum_probs=30.2
Q ss_pred CCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEe
Q 015021 76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVH 113 (414)
Q Consensus 76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~ 113 (414)
.++|+++.|=|-.| .||+|+.+|.|.+|-.+|..+.
T Consensus 11 ~~~v~~~~w~P~md--LiA~~t~~g~v~v~Rl~~qriw 46 (47)
T PF12894_consen 11 PSRVSCMSWCPTMD--LIALGTEDGEVLVYRLNWQRIW 46 (47)
T ss_pred CCcEEEEEECCCCC--EEEEEECCCeEEEEECCCcCcc
Confidence 46688888877655 7999999999999999998765
No 61
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=84.16 E-value=12 Score=39.17 Aligned_cols=98 Identities=14% Similarity=0.051 Sum_probs=66.5
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecc-------ccc-------cc-
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNK-------DAA-------TS- 367 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~-------~~~-------~~- 367 (414)
+..++-....+.-|-+|.|.|+.|-.|.|++++..++...+..- |--..+.|+.-.... +++ .+
T Consensus 102 ltgHKDSVt~~~FshdgtlLATGdmsG~v~v~~~stg~~~~~~~-~e~~dieWl~WHp~a~illAG~~DGsvWmw~ip~~ 180 (399)
T KOG0296|consen 102 LTGHKDSVTCCSFSHDGTLLATGDMSGKVLVFKVSTGGEQWKLD-QEVEDIEWLKWHPRAHILLAGSTDGSVWMWQIPSQ 180 (399)
T ss_pred ecCCCCceEEEEEccCceEEEecCCCccEEEEEcccCceEEEee-cccCceEEEEecccccEEEeecCCCcEEEEECCCc
Confidence 56677778999999999999999999999999999998877554 455566676554321 111 00
Q ss_pred ---------ccccCCCC--CcceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021 368 ---------SAYYAPVK--SDYCLCLAIHAPRKGIIEHLAGMADENW 403 (414)
Q Consensus 368 ---------~~~~~~~k--~~~~l~LvIyaprRg~lEvW~~~~g~~v 403 (414)
+.+-..++ ++.-..+.-|- +|+|.+|+.++|+.+
T Consensus 181 ~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~--dgti~~Wn~ktg~p~ 225 (399)
T KOG0296|consen 181 ALCKVMSGHNSPCTCGEFIPDGKRILTGYD--DGTIIVWNPKTGQPL 225 (399)
T ss_pred ceeeEecCCCCCcccccccCCCceEEEEec--CceEEEEecCCCcee
Confidence 01101111 11112455665 999999999999865
No 62
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=84.08 E-value=1.2 Score=46.04 Aligned_cols=81 Identities=15% Similarity=0.160 Sum_probs=62.4
Q ss_pred CCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021 306 HPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH 385 (414)
Q Consensus 306 ~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy 385 (414)
+.....++..|-|+.-+..+..+-.|-+--+..|..++-++|+-. ++.-.--++++ .-+|.
T Consensus 305 HtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsS----yvn~a~ft~dG---------------~~iis 365 (508)
T KOG0275|consen 305 HTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKCLKEFRGHSS----YVNEATFTDDG---------------HHIIS 365 (508)
T ss_pred hccCeeEEEEccCcchhhcccccceEEEeccccchhHHHhcCccc----cccceEEcCCC---------------CeEEE
Confidence 345577889999997666666666677778899999999999988 66422222222 25799
Q ss_pred cCCCCeEEEeeCCCCcceee
Q 015021 386 APRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 386 aprRg~lEvW~~~~g~~v~~ 405 (414)
|..+|.++||++++++|+.|
T Consensus 366 aSsDgtvkvW~~KtteC~~T 385 (508)
T KOG0275|consen 366 ASSDGTVKVWHGKTTECLST 385 (508)
T ss_pred ecCCccEEEecCcchhhhhh
Confidence 99999999999999999987
No 63
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=83.83 E-value=2.8 Score=46.33 Aligned_cols=84 Identities=23% Similarity=0.208 Sum_probs=57.5
Q ss_pred eeeeccCcceeeeeecc-eEEEEeecCCCCCceeecccCCCCCCCeEE-EEEeEEeCCcEEEEEeccccEEEEEe-cCCc
Q 015021 34 NLLCALDMHTIALANRY-QTVIINWADPEGLVAKIRPELSPIASEYIT-AIEWLVFEEMRALAVGTSRGYFLVYD-LKGD 110 (414)
Q Consensus 34 ~~~~sp~~~~la~A~~~-~~v~~~w~~~~~~~v~~~g~l~~~~~e~IT-s~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~ 110 (414)
.+-..|--|+||.+.+. .+++-+...+. -| +++ +++|.+| +++|= +|-..+||||.+|-+++.+ |+|.
T Consensus 25 ~~ewnP~~dLiA~~t~~gelli~R~n~qR----lw--tip-~p~~~v~~sL~W~--~DGkllaVg~kdG~I~L~Dve~~~ 95 (665)
T KOG4640|consen 25 RIEWNPKMDLIATRTEKGELLIHRLNWQR----LW--TIP-IPGENVTASLCWR--PDGKLLAVGFKDGTIRLHDVEKGG 95 (665)
T ss_pred EEEEcCccchhheeccCCcEEEEEeccce----eE--ecc-CCCCccceeeeec--CCCCEEEEEecCCeEEEEEccCCC
Confidence 34567889999987776 46664322221 01 122 2477787 66665 4789999999999999999 8999
Q ss_pred EEeecccC-ccceeEEE
Q 015021 111 LVHRQLIH-PGRILKLR 126 (414)
Q Consensus 111 LL~sQ~lh-~~pV~~ik 126 (414)
.|.+..+. +++|.++-
T Consensus 96 ~l~~~~~s~e~~is~~~ 112 (665)
T KOG4640|consen 96 RLVSFLFSVETDISKGI 112 (665)
T ss_pred ceeccccccccchheee
Confidence 99887665 34555544
No 64
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=83.76 E-value=16 Score=40.74 Aligned_cols=211 Identities=17% Similarity=0.237 Sum_probs=134.7
Q ss_pred CeEEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEe
Q 015021 77 EYITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFD 155 (414)
Q Consensus 77 e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~Id 155 (414)
.-|.+.-.|+=.+| |++|-++++||+|. .+|+.+..=--|++-|.+|.+..+.+. ||-.
T Consensus 56 ~PvRa~kfiaRknW--iv~GsDD~~IrVfnynt~ekV~~FeAH~DyIR~iavHPt~P~-----------vLts------- 115 (794)
T KOG0276|consen 56 VPVRAAKFIARKNW--IVTGSDDMQIRVFNYNTGEKVKTFEAHSDYIRSIAVHPTLPY-----------VLTS------- 115 (794)
T ss_pred cchhhheeeeccce--EEEecCCceEEEEecccceeeEEeeccccceeeeeecCCCCe-----------EEec-------
Confidence 45777777777888 78999999999999 799999888889999999998776651 1110
Q ss_pred ChhHHHHHHHHHHhccccccCCCCccCCCcccccccCCccceecccCCCCCceeeEEeCCCCCCchhhhccccceeEEEe
Q 015021 156 GSEIQKMLQRWFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAITGLMPPPLMEVQSSQRYFCAVTI 235 (414)
Q Consensus 156 G~sL~~~Lr~c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~~G~~~p~~~d~~s~~~~~~~i~v 235 (414)
+. .+-.|.|+-+..=.|.- .| .+|..| |..
T Consensus 116 ----------------------------SD-------Dm~iKlW~we~~wa~~q-tf------------eGH~Hy--VMq 145 (794)
T KOG0276|consen 116 ----------------------------SD-------DMTIKLWDWENEWACEQ-TF------------EGHEHY--VMQ 145 (794)
T ss_pred ----------------------------CC-------ccEEEEeeccCceeeee-EE------------cCcceE--EEE
Confidence 01 15567777765322211 11 123333 321
Q ss_pred CCCceeEEEEeccCCCcchhhhhhhhhhHHHHHHhhhhhhhhhcCCCCCCCCCCCCcccccCCCCccccCCCceeeEEEE
Q 015021 236 GEDSVISAFRLSEDRSRSLVGAILSKVVPATFSTISSLSKMIWRSEQSPKKSEPKPQSFARASPLTCLKDHPRKGERLTL 315 (414)
Q Consensus 236 G~~P~la~y~~~~~~~~s~~~al~S~va~av~S~v~S~ak~~W~~~~~~~~~e~kp~~~~~a~pl~~l~D~~R~~~~i~~ 315 (414)
+ .|+.....+ |.++-+. .-.| .|.=. . ..|-..|.-+.|...+|.-
T Consensus 146 -----v-~fnPkD~nt--FaS~sLD-----------rTVK-VWslg--------s------~~~nfTl~gHekGVN~Vdy 191 (794)
T KOG0276|consen 146 -----V-AFNPKDPNT--FASASLD-----------RTVK-VWSLG--------S------PHPNFTLEGHEKGVNCVDY 191 (794)
T ss_pred -----E-EecCCCccc--eeeeecc-----------ccEE-EEEcC--------C------CCCceeeeccccCcceEEe
Confidence 2 233332222 2221111 1123 55211 0 1233557777888888888
Q ss_pred CCCC--CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEE
Q 015021 316 SPSG--SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIE 393 (414)
Q Consensus 316 sP~~--~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lE 393 (414)
=|.| -+....-.+--|-++|-++...|+-.+||-. -+....- |.-|=++|-..-+|.+.
T Consensus 192 y~~gdkpylIsgaDD~tiKvWDyQtk~CV~TLeGHt~----Nvs~v~f---------------hp~lpiiisgsEDGTvr 252 (794)
T KOG0276|consen 192 YTGGDKPYLISGADDLTIKVWDYQTKSCVQTLEGHTN----NVSFVFF---------------HPELPIIISGSEDGTVR 252 (794)
T ss_pred ccCCCcceEEecCCCceEEEeecchHHHHHHhhcccc----cceEEEe---------------cCCCcEEEEecCCccEE
Confidence 7777 3666654455788999999999999999977 3333211 22334899999999999
Q ss_pred EeeCCCCcceeeccceee
Q 015021 394 HLAGMADENWTTSLNYSV 411 (414)
Q Consensus 394 vW~~~~g~~v~~~~~~~~ 411 (414)
||+..+-+.. ..|||..
T Consensus 253 iWhs~Ty~lE-~tLn~gl 269 (794)
T KOG0276|consen 253 IWNSKTYKLE-KTLNYGL 269 (794)
T ss_pred EecCcceehh-hhhhcCC
Confidence 9999998766 6688864
No 65
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=83.50 E-value=2.8 Score=47.22 Aligned_cols=89 Identities=12% Similarity=0.104 Sum_probs=72.5
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY 378 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~ 378 (414)
++-.|+-+++.+..+..+-.|...|..--+|-|+|+|+..-..+-=.+||-|+=.+..-.. . +
T Consensus 99 ~~~tfngHK~AVt~l~fd~~G~rlaSGskDt~IIvwDlV~E~Gl~rL~GHkd~iT~~~F~~--~------------~--- 161 (888)
T KOG0306|consen 99 ILITFNGHKAAVTTLKFDKIGTRLASGSKDTDIIVWDLVGEEGLFRLRGHKDSITQALFLN--G------------D--- 161 (888)
T ss_pred eeeeecccccceEEEEEcccCceEeecCCCccEEEEEeccceeeEEeecchHHHhHHhccC--C------------C---
Confidence 4445888999999999999999999999999999999999999999999999644333221 0 1
Q ss_pred eEEEEEEcCCCCeEEEeeCCCCcceeec
Q 015021 379 CLCLAIHAPRKGIIEHLAGMADENWTTS 406 (414)
Q Consensus 379 ~l~LvIyaprRg~lEvW~~~~g~~v~~~ 406 (414)
-++|-..+++.|++|++.+-+|+-|-
T Consensus 162 --~~lvS~sKDs~iK~WdL~tqhCf~Th 187 (888)
T KOG0306|consen 162 --SFLVSVSKDSMIKFWDLETQHCFETH 187 (888)
T ss_pred --eEEEEeccCceEEEEecccceeeeEE
Confidence 27788899999999999998888553
No 66
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=83.07 E-value=8 Score=42.16 Aligned_cols=88 Identities=20% Similarity=0.139 Sum_probs=62.2
Q ss_pred cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEE
Q 015021 302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLC 381 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~ 381 (414)
++.++.|=..++--+|||.++|.+-++|.|+|+|=.+|..+-...+ -+|.=+=|-...=.+ +.-.|
T Consensus 185 s~r~HskFV~~VRysPDG~~Fat~gsDgki~iyDGktge~vg~l~~-~~aHkGsIfalsWsP-------------Ds~~~ 250 (603)
T KOG0318|consen 185 SFREHSKFVNCVRYSPDGSRFATAGSDGKIYIYDGKTGEKVGELED-SDAHKGSIFALSWSP-------------DSTQF 250 (603)
T ss_pred cccccccceeeEEECCCCCeEEEecCCccEEEEcCCCccEEEEecC-CCCccccEEEEEECC-------------CCceE
Confidence 4667778889999999999999999999999999999999988865 333332222221111 11114
Q ss_pred EEEEcCCCCeEEEeeCCCCcceee
Q 015021 382 LAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 382 LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
|- +.-+-.++||++-+..++.|
T Consensus 251 ~T--~SaDkt~KIWdVs~~slv~t 272 (603)
T KOG0318|consen 251 LT--VSADKTIKIWDVSTNSLVST 272 (603)
T ss_pred EE--ecCCceEEEEEeeccceEEE
Confidence 33 34467889999888877755
No 67
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=82.61 E-value=2.2 Score=48.28 Aligned_cols=66 Identities=24% Similarity=0.313 Sum_probs=46.3
Q ss_pred CEEEEEcCCCcEEEEEcCcceeEeeecccc-c--ceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEee
Q 015021 320 SLAAITDSLGRILLLDTQALVVVRLWKGYR-D--ASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLA 396 (414)
Q Consensus 320 ~laa~tD~lGRV~L~D~~~~~vvrmwKGyR-d--Aq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~ 396 (414)
-+.|++|-.|||+|+|...+.++- |=-.- | -++.|++.... .+-+.|+||.| ..|-.|+
T Consensus 80 lliAsaD~~GrIil~d~~~~s~~~-~l~~~~~~~qdl~W~~~rd~---------------Srd~LlaIh~s--s~lvLwn 141 (1062)
T KOG1912|consen 80 LLIASADISGRIILVDFVLASVIN-WLSHSNDSVQDLCWVPARDD---------------SRDVLLAIHGS--STLVLWN 141 (1062)
T ss_pred eeEEeccccCcEEEEEehhhhhhh-hhcCCCcchhheeeeeccCc---------------chheeEEecCC--cEEEEEE
Confidence 478899999999999999876543 31111 1 23455554321 12457999998 6899999
Q ss_pred CCCCcce
Q 015021 397 GMADENW 403 (414)
Q Consensus 397 ~~~g~~v 403 (414)
+.+|+++
T Consensus 142 tdtG~k~ 148 (1062)
T KOG1912|consen 142 TDTGEKF 148 (1062)
T ss_pred ccCCcee
Confidence 9999987
No 68
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=82.15 E-value=9.8 Score=44.46 Aligned_cols=76 Identities=20% Similarity=0.268 Sum_probs=58.1
Q ss_pred ecCCCCCceeecccCCCCCCCeEEEEEeEEeCCc--EEEEEeccccEEEEEecCC-------cEEeecccCccceeEEEE
Q 015021 57 WADPEGLVAKIRPELSPIASEYITAIEWLVFEEM--RALAVGTSRGYFLVYDLKG-------DLVHRQLIHPGRILKLRV 127 (414)
Q Consensus 57 w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw--~~I~VG~ssG~vrfyte~G-------~LL~sQ~lh~~pV~~ik~ 127 (414)
|..++.++++-+|.++. .+|-.-+.|=++.+. =+||=|+.+|.|-||+..- .+|..+.-|.++|+.|.+
T Consensus 47 d~~~~~~dlk~~~s~~s--~~rF~kL~W~~~g~~~~GlIaGG~edG~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDf 124 (1049)
T KOG0307|consen 47 DFSDESSDLKPVGSLQS--SNRFNKLAWGSYGSHSHGLIAGGLEDGNIVLYDPASIIANASEEVLATKSKHTGPVLGLDF 124 (1049)
T ss_pred cccCccccccccccccc--cccceeeeecccCCCccceeeccccCCceEEecchhhccCcchHHHhhhcccCCceeeeec
Confidence 44444556677787766 588888888888444 4899999999999999544 467777899999999998
Q ss_pred ecccCCC
Q 015021 128 RGSRRDL 134 (414)
Q Consensus 128 r~~~~~~ 134 (414)
..-+.++
T Consensus 125 N~~q~nl 131 (1049)
T KOG0307|consen 125 NPFQGNL 131 (1049)
T ss_pred cccCCce
Confidence 7766543
No 69
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=81.96 E-value=3.8 Score=40.90 Aligned_cols=77 Identities=14% Similarity=0.172 Sum_probs=57.8
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK 389 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR 389 (414)
+.+++.||+++-..++-..+-+=|+|-.+|.++..+||+-+-+..==-... +++ --|+-..-+
T Consensus 186 it~vs~s~d~nc~La~~l~stlrLlDk~tGklL~sYkGhkn~eykldc~l~--------------qsd---thV~sgSED 248 (307)
T KOG0316|consen 186 ITSVSFSKDGNCSLASSLDSTLRLLDKETGKLLKSYKGHKNMEYKLDCCLN--------------QSD---THVFSGSED 248 (307)
T ss_pred ceeEEecCCCCEEEEeeccceeeecccchhHHHHHhcccccceeeeeeeec--------------ccc---eeEEeccCC
Confidence 567899999986666666678889999999999999999884432111110 111 267888999
Q ss_pred CeEEEeeCCCCcce
Q 015021 390 GIIEHLAGMADENW 403 (414)
Q Consensus 390 g~lEvW~~~~g~~v 403 (414)
|.+-+|++-.++.+
T Consensus 249 G~Vy~wdLvd~~~~ 262 (307)
T KOG0316|consen 249 GKVYFWDLVDETQI 262 (307)
T ss_pred ceEEEEEeccceee
Confidence 99999999988876
No 70
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=81.84 E-value=1.2 Score=47.75 Aligned_cols=85 Identities=14% Similarity=0.077 Sum_probs=66.5
Q ss_pred ccccCCCceeeEEEECC-CCCEEEEEcCCCcEEEEEcCc-ceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021 301 TCLKDHPRKGERLTLSP-SGSLAAITDSLGRILLLDTQA-LVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY 378 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP-~~~laa~tD~lGRV~L~D~~~-~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~ 378 (414)
+.+.+++..+.+|...| .+.|+...-.+|.|.|+|+-. +..+|-+.||+.+= .+.. . .+ ..
T Consensus 208 ~~~~gH~kgvsai~~fp~~~hLlLS~gmD~~vklW~vy~~~~~lrtf~gH~k~V------rd~~--~--------s~-~g 270 (503)
T KOG0282|consen 208 HNLSGHTKGVSAIQWFPKKGHLLLSGGMDGLVKLWNVYDDRRCLRTFKGHRKPV------RDAS--F--------NN-CG 270 (503)
T ss_pred eeccCCccccchhhhccceeeEEEecCCCceEEEEEEecCcceehhhhcchhhh------hhhh--c--------cc-cC
Confidence 45889988899999999 889999999999999999998 99999999998731 1100 0 00 00
Q ss_pred eEEEEEEcCCCCeEEEeeCCCCccee
Q 015021 379 CLCLAIHAPRKGIIEHLAGMADENWT 404 (414)
Q Consensus 379 ~l~LvIyaprRg~lEvW~~~~g~~v~ 404 (414)
.-..-+.-++.|.+||+.+|+++.
T Consensus 271 --~~fLS~sfD~~lKlwDtETG~~~~ 294 (503)
T KOG0282|consen 271 --TSFLSASFDRFLKLWDTETGQVLS 294 (503)
T ss_pred --CeeeeeecceeeeeeccccceEEE
Confidence 122457779999999999999883
No 71
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=81.61 E-value=18 Score=33.25 Aligned_cols=86 Identities=22% Similarity=0.197 Sum_probs=61.5
Q ss_pred cccCCCceeeEEEECCCCCEEEEEcC-CCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 302 CLKDHPRKGERLTLSPSGSLAAITDS-LGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~laa~tD~-lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
.+....-.+..+..+|++++.+.... .|.+.++|+..+..++..+|+.+ .+....-.+.+ +
T Consensus 150 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~v~~~~~~~~~-----------~--- 211 (466)
T COG2319 150 TLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTD----PVSSLAFSPDG-----------G--- 211 (466)
T ss_pred EEecCcccEEEEEECCCCCEEEecCCCCCceEEEEcCCCceEEeeccCCC----ceEEEEEcCCc-----------c---
Confidence 35555555669999999987777775 99999999999999999999555 33332211111 1
Q ss_pred EEEEEcCCCCeEEEeeCCCCcceee
Q 015021 381 CLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
++++-...+|.+.+|+.+.++.+.+
T Consensus 212 ~~~~~~~~d~~i~~wd~~~~~~~~~ 236 (466)
T COG2319 212 LLIASGSSDGTIRLWDLSTGKLLRS 236 (466)
T ss_pred eEEEEecCCCcEEEEECCCCcEEee
Confidence 3444468889999999888887754
No 72
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=80.22 E-value=12 Score=39.16 Aligned_cols=82 Identities=20% Similarity=0.220 Sum_probs=56.7
Q ss_pred ceeeEEEECCCCCEEEE--EcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021 308 RKGERLTLSPSGSLAAI--TDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH 385 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~--tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy 385 (414)
+....+++++.+.|.|. +-+.|.|+|+|+.+.+.+-+...|++ =+.+..-..+| -|.--
T Consensus 130 ~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~----~lAalafs~~G---------------~llAT 190 (391)
T KOG2110|consen 130 KGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKG----PLAALAFSPDG---------------TLLAT 190 (391)
T ss_pred cceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEEEecCC----ceeEEEECCCC---------------CEEEE
Confidence 34445555555678887 34589999999999999998877777 34443322222 14445
Q ss_pred cCCCCe-EEEeeCCCCcceeeccceeeec
Q 015021 386 APRKGI-IEHLAGMADENWTTSLNYSVRQ 413 (414)
Q Consensus 386 aprRg~-lEvW~~~~g~~v~~~~~~~~~~ 413 (414)
|.-+|. |.|+++.+|+++ |..|.
T Consensus 191 ASeKGTVIRVf~v~~G~kl-----~eFRR 214 (391)
T KOG2110|consen 191 ASEKGTVIRVFSVPEGQKL-----YEFRR 214 (391)
T ss_pred eccCceEEEEEEcCCccEe-----eeeeC
Confidence 666664 579999999999 88874
No 73
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=80.18 E-value=6.5 Score=41.84 Aligned_cols=261 Identities=19% Similarity=0.215 Sum_probs=156.6
Q ss_pred ecccccccCC-CCCcccCCCeeeeccCcceeeeeecceEEEE-eecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEE
Q 015021 15 CTDLSDLGAG-KEGWLVNDPNLLCALDMHTIALANRYQTVII-NWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRA 92 (414)
Q Consensus 15 ~~~~~~~g~~-~~~WL~~~~~~~~sp~~~~la~A~~~~~v~~-~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~ 92 (414)
|.|-..+-.- +|.|+ +..|+.|.+||-|-...-++. .-..+.+ ++..-+|-- -.+-|..|.|=| +|-..
T Consensus 214 ~qt~qil~~htdEVWf-----l~FS~nGkyLAsaSkD~Taiiw~v~~d~~--~kl~~tlvg-h~~~V~yi~wSP-DdryL 284 (519)
T KOG0293|consen 214 SQTWQILQDHTDEVWF-----LQFSHNGKYLASASKDSTAIIWIVVYDVH--FKLKKTLVG-HSQPVSYIMWSP-DDRYL 284 (519)
T ss_pred chhhhhHhhCCCcEEE-----EEEcCCCeeEeeccCCceEEEEEEecCcc--eeeeeeeec-ccCceEEEEECC-CCCeE
Confidence 4555555555 88888 347899999999887755552 2222222 444444411 135688888877 46789
Q ss_pred EEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEeChhHHHHHHHHHHhcc
Q 015021 93 LAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFDGSEIQKMLQRWFQDSN 171 (414)
Q Consensus 93 I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~IdG~sL~~~Lr~c~~~~~ 171 (414)
+++|++.- ++..+ .+|++.+.=- | .-=.++. =+.=||++.-+|-|..= |.|
T Consensus 285 laCg~~e~-~~lwDv~tgd~~~~y~-~-~~~~S~~----------------sc~W~pDg~~~V~Gs~d----r~i----- 336 (519)
T KOG0293|consen 285 LACGFDEV-LSLWDVDTGDLRHLYP-S-GLGFSVS----------------SCAWCPDGFRFVTGSPD----RTI----- 336 (519)
T ss_pred EecCchHh-eeeccCCcchhhhhcc-c-CcCCCcc----------------eeEEccCCceeEecCCC----CcE-----
Confidence 99999887 77777 5777654210 0 0011122 23334444333333221 111
Q ss_pred ccccCCCCccCCCcccccccCCccceecccCCCCCceeeEEeCCCCCCchhhhccccceeEEEeCCCceeEEEEeccCCC
Q 015021 172 SNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAITGLMPPPLMEVQSSQRYFCAVTIGEDSVISAFRLSEDRS 251 (414)
Q Consensus 172 ~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~~G~~~p~~~d~~s~~~~~~~i~vG~~P~la~y~~~~~~~ 251 (414)
..|+ ..++ + ..+|.-.+.-.+.|.+++ .++.+ +++++.+|-+..|..+.-..
T Consensus 337 -~~wd-------lDgn------~-~~~W~gvr~~~v~dlait------------~Dgk~-vl~v~~d~~i~l~~~e~~~d 388 (519)
T KOG0293|consen 337 -IMWD-------LDGN------I-LGNWEGVRDPKVHDLAIT------------YDGKY-VLLVTVDKKIRLYNREARVD 388 (519)
T ss_pred -EEec-------CCcc------h-hhcccccccceeEEEEEc------------CCCcE-EEEEecccceeeechhhhhh
Confidence 1121 1111 1 344655544456666664 23444 68888888887775432110
Q ss_pred cchhhhhhhhhhHHHHHHhhhhhhhhhcCCCCCCCCCCCCcccccCCCCccccCCCceeeEEEECCCCCEEEEEcCCCcE
Q 015021 252 RSLVGAILSKVVPATFSTISSLSKMIWRSEQSPKKSEPKPQSFARASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRI 331 (414)
Q Consensus 252 ~s~~~al~S~va~av~S~v~S~ak~~W~~~~~~~~~e~kp~~~~~a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV 331 (414)
. .+....-.+.++++|-+|+||.+.=..--|
T Consensus 389 r-------------------------------------------------~lise~~~its~~iS~d~k~~LvnL~~qei 419 (519)
T KOG0293|consen 389 R-------------------------------------------------GLISEEQPITSFSISKDGKLALVNLQDQEI 419 (519)
T ss_pred h-------------------------------------------------ccccccCceeEEEEcCCCcEEEEEcccCee
Confidence 0 011122335788999999999998777789
Q ss_pred EEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 332 LLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 332 ~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
-|+|++...++|-++|++--.+-=-.++...+ + -+|-...-++-|-||+-.+|+.+++
T Consensus 420 ~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg~~-------------~---~fiaSGSED~kvyIWhr~sgkll~~ 477 (519)
T KOG0293|consen 420 HLWDLEENKLVRKYFGHKQGHFIIRSCFGGGN-------------D---KFIASGSEDSKVYIWHRISGKLLAV 477 (519)
T ss_pred EEeecchhhHHHHhhcccccceEEEeccCCCC-------------c---ceEEecCCCceEEEEEccCCceeEe
Confidence 99999999999999999975443223331111 1 1777888899999999999999875
No 74
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=80.12 E-value=3.4 Score=44.10 Aligned_cols=71 Identities=14% Similarity=0.157 Sum_probs=54.3
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecc-cccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKG-YRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKG-yRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
-....|+.+|+|.|.+..---|.++||.+.+|.++.+|-+ |++ |....-.+++ -.+|-+
T Consensus 82 g~v~al~s~n~G~~l~ag~i~g~lYlWelssG~LL~v~~aHYQ~-----ITcL~fs~dg---------------s~iiTg 141 (476)
T KOG0646|consen 82 GPVHALASSNLGYFLLAGTISGNLYLWELSSGILLNVLSAHYQS-----ITCLKFSDDG---------------SHIITG 141 (476)
T ss_pred cceeeeecCCCceEEEeecccCcEEEEEeccccHHHHHHhhccc-----eeEEEEeCCC---------------cEEEec
Confidence 4467788999999999988899999999999999999954 444 3333222222 156899
Q ss_pred CCCCeEEEeeCC
Q 015021 387 PRKGIIEHLAGM 398 (414)
Q Consensus 387 prRg~lEvW~~~ 398 (414)
.++|.|-||.+-
T Consensus 142 skDg~V~vW~l~ 153 (476)
T KOG0646|consen 142 SKDGAVLVWLLT 153 (476)
T ss_pred CCCccEEEEEEE
Confidence 999999999863
No 75
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=79.65 E-value=9.2 Score=39.13 Aligned_cols=52 Identities=19% Similarity=0.211 Sum_probs=46.2
Q ss_pred CCccccCCCceeeEEEECC-CCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021 299 PLTCLKDHPRKGERLTLSP-SGSLAAITDSLGRILLLDTQALVVVRLWKGYRD 350 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP-~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd 350 (414)
....|.++.-.+-+|.++| ++++++..--++--.|+|+..+..++++-|+-.
T Consensus 178 ~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~c~qtF~ghes 230 (343)
T KOG0286|consen 178 QTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQCVQTFEGHES 230 (343)
T ss_pred EEEEecCCcccEEEEecCCCCCCeEEecccccceeeeeccCcceeEeeccccc
Confidence 3356889999999999999 889999988888999999999999999999864
No 76
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=79.48 E-value=12 Score=41.27 Aligned_cols=112 Identities=21% Similarity=0.199 Sum_probs=64.6
Q ss_pred ecccccccCCCCCcccC-----CCeeeeccCcceeeeeecc-eEEEEeecCCCCCceeecccCCC-------CCCCeEEE
Q 015021 15 CTDLSDLGAGKEGWLVN-----DPNLLCALDMHTIALANRY-QTVIINWADPEGLVAKIRPELSP-------IASEYITA 81 (414)
Q Consensus 15 ~~~~~~~g~~~~~WL~~-----~~~~~~sp~~~~la~A~~~-~~v~~~w~~~~~~~v~~~g~l~~-------~~~e~ITs 81 (414)
..|+=.|-+-...||+- .+.-+++....+=.||.|. -=+|--|++-.... +|.|+. +.+..--+
T Consensus 154 g~evYRlNLEqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~ksr---v~~l~~~~~v~s~pg~~~~~s 230 (703)
T KOG2321|consen 154 GSEVYRLNLEQGRFLNPFETDSGELNVVSINEEHGLLACGTEDGVVEFWDPRDKSR---VGTLDAASSVNSHPGGDAAPS 230 (703)
T ss_pred CcceEEEEccccccccccccccccceeeeecCccceEEecccCceEEEecchhhhh---heeeecccccCCCccccccCc
Confidence 34444444555667753 1333333333333344444 33445687765322 122211 01111222
Q ss_pred EEeEEeCC-cEEEEEeccccEEEEEec--CCcEEeecccCccceeEEEEec
Q 015021 82 IEWLVFEE-MRALAVGTSRGYFLVYDL--KGDLVHRQLIHPGRILKLRVRG 129 (414)
Q Consensus 82 ~~~lpl~d-w~~I~VG~ssG~vrfyte--~G~LL~sQ~lh~~pV~~ik~r~ 129 (414)
+..|.|.| -.=++|||++|+|.||+. .--||...+.++.||.+|....
T Consensus 231 vTal~F~d~gL~~aVGts~G~v~iyDLRa~~pl~~kdh~~e~pi~~l~~~~ 281 (703)
T KOG2321|consen 231 VTALKFRDDGLHVAVGTSTGSVLIYDLRASKPLLVKDHGYELPIKKLDWQD 281 (703)
T ss_pred ceEEEecCCceeEEeeccCCcEEEEEcccCCceeecccCCccceeeecccc
Confidence 33346655 688999999999999995 6778888899999999999743
No 77
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=79.26 E-value=5.9 Score=41.57 Aligned_cols=76 Identities=14% Similarity=0.169 Sum_probs=62.3
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee-cccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCC
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW-KGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPR 388 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw-KGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyapr 388 (414)
...|++|||.+.+..+|.++.|=+....+--+|.=| =||++ |+...+..+. ++.+.+.-
T Consensus 154 l~dVavS~D~~~IitaDRDEkIRvs~ypa~f~IesfclGH~e----FVS~isl~~~----------------~~LlS~sG 213 (390)
T KOG3914|consen 154 LLDVAVSPDDQFIITADRDEKIRVSRYPATFVIESFCLGHKE----FVSTISLTDN----------------YLLLSGSG 213 (390)
T ss_pred hheeeecCCCCEEEEecCCceEEEEecCcccchhhhccccHh----heeeeeeccC----------------ceeeecCC
Confidence 346899999999999999999999888888777754 57888 8776654331 46789999
Q ss_pred CCeEEEeeCCCCcceee
Q 015021 389 KGIIEHLAGMADENWTT 405 (414)
Q Consensus 389 Rg~lEvW~~~~g~~v~~ 405 (414)
++.|.+|+..+|++..|
T Consensus 214 D~tlr~Wd~~sgk~L~t 230 (390)
T KOG3914|consen 214 DKTLRLWDITSGKLLDT 230 (390)
T ss_pred CCcEEEEecccCCcccc
Confidence 99999999999999854
No 78
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=79.07 E-value=13 Score=38.17 Aligned_cols=76 Identities=12% Similarity=-0.004 Sum_probs=57.5
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcc------eeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEE
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQAL------VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCL 382 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~------~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~L 382 (414)
-+-..+.+|+|+++|+.-..--..++++.+. .+.|+.+||+. ++..-.-.+++ .
T Consensus 99 WVMtCA~sPSg~~VAcGGLdN~Csiy~ls~~d~~g~~~v~r~l~gHtg----ylScC~f~dD~----------------~ 158 (343)
T KOG0286|consen 99 WVMTCAYSPSGNFVACGGLDNKCSIYPLSTRDAEGNVRVSRELAGHTG----YLSCCRFLDDN----------------H 158 (343)
T ss_pred eEEEEEECCCCCeEEecCcCceeEEEecccccccccceeeeeecCccc----eeEEEEEcCCC----------------c
Confidence 4567899999999999755556777788766 78899999998 77665433322 3
Q ss_pred EEEcCCCCeEEEeeCCCCccee
Q 015021 383 AIHAPRKGIIEHLAGMADENWT 404 (414)
Q Consensus 383 vIyaprRg~lEvW~~~~g~~v~ 404 (414)
+|-+.-+...-+|++.+|+++.
T Consensus 159 ilT~SGD~TCalWDie~g~~~~ 180 (343)
T KOG0286|consen 159 ILTGSGDMTCALWDIETGQQTQ 180 (343)
T ss_pred eEecCCCceEEEEEcccceEEE
Confidence 4556667788899999999874
No 79
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=79.02 E-value=2.7 Score=47.35 Aligned_cols=83 Identities=23% Similarity=0.126 Sum_probs=58.5
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
.++.|++-.+.+|++||+|+..|..|..|-+=++|++.....- ++++.|.+-- ...+ .+....+
T Consensus 453 ~s~~d~r~G~R~~~vSp~gqhLAsGDr~GnlrVy~Lq~l~~~~-----------~~eAHesEil---cLey--S~p~~~~ 516 (1080)
T KOG1408|consen 453 VSTCDSRFGFRALAVSPDGQHLASGDRGGNLRVYDLQELEYTC-----------FMEAHESEIL---CLEY--SFPVLTN 516 (1080)
T ss_pred hhhcCcccceEEEEECCCcceecccCccCceEEEEehhhhhhh-----------heecccceeE---EEee--cCchhhh
Confidence 4689999999999999999999999999999999999887544 4444432200 0000 0111112
Q ss_pred EEEEEcCCCCeEEEeeCCC
Q 015021 381 CLAIHAPRKGIIEHLAGMA 399 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~ 399 (414)
-|.-.|.|+.+|-|+|+..
T Consensus 517 kLLASasrdRlIHV~Dv~r 535 (1080)
T KOG1408|consen 517 KLLASASRDRLIHVYDVKR 535 (1080)
T ss_pred HhhhhccCCceEEEEeccc
Confidence 3667888899999998764
No 80
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=78.78 E-value=11 Score=39.70 Aligned_cols=84 Identities=17% Similarity=0.089 Sum_probs=63.6
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK 389 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR 389 (414)
..++.+-|+-..++..-++--+=++|+.+..-|.+.+|+|..-+.= .+.+. + --||-...+
T Consensus 238 V~~L~lhPTldvl~t~grDst~RvWDiRtr~~V~~l~GH~~~V~~V-~~~~~-d-----------------pqvit~S~D 298 (460)
T KOG0285|consen 238 VYCLDLHPTLDVLVTGGRDSTIRVWDIRTRASVHVLSGHTNPVASV-MCQPT-D-----------------PQVITGSHD 298 (460)
T ss_pred eEEEeccccceeEEecCCcceEEEeeecccceEEEecCCCCcceeE-EeecC-C-----------------CceEEecCC
Confidence 5668888999888887777777789999999999999999954332 22111 1 046888999
Q ss_pred CeEEEeeCCCCcceeeccc--eeee
Q 015021 390 GIIEHLAGMADENWTTSLN--YSVR 412 (414)
Q Consensus 390 g~lEvW~~~~g~~v~~~~~--~~~~ 412 (414)
+.|.+||++.|+-..|-.| -|||
T Consensus 299 ~tvrlWDl~agkt~~tlt~hkksvr 323 (460)
T KOG0285|consen 299 STVRLWDLRAGKTMITLTHHKKSVR 323 (460)
T ss_pred ceEEEeeeccCceeEeeecccceee
Confidence 9999999999998776543 4555
No 81
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=78.70 E-value=5.9 Score=44.47 Aligned_cols=40 Identities=28% Similarity=0.425 Sum_probs=34.9
Q ss_pred eeeEEEECCCC-CEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021 309 KGERLTLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKGYRD 350 (414)
Q Consensus 309 ~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd 350 (414)
.+..|+-.||| +|+.++|+ |++++|+..|..+.-.||+.|
T Consensus 14 ci~d~afkPDGsqL~lAAg~--rlliyD~ndG~llqtLKgHKD 54 (1081)
T KOG1538|consen 14 CINDIAFKPDGTQLILAAGS--RLLVYDTSDGTLLQPLKGHKD 54 (1081)
T ss_pred chheeEECCCCceEEEecCC--EEEEEeCCCcccccccccccc
Confidence 45568999999 56666665 999999999999999999999
No 82
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=78.52 E-value=11 Score=38.53 Aligned_cols=77 Identities=18% Similarity=0.215 Sum_probs=58.9
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEe--eecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVR--LWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvr--mwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
...++-.+|.+.-++++|+-|++..++-..+.+-. -||++- ++ +|+-.....+. =||.+.
T Consensus 123 ~~lslD~~~~~~~i~vs~s~G~~~~v~~t~~~le~vq~wk~He-~E-~Wta~f~~~~p----------------nlvytG 184 (339)
T KOG0280|consen 123 EALSLDISTSGTKIFVSDSRGSISGVYETEMVLEKVQTWKVHE-FE-AWTAKFSDKEP----------------NLVYTG 184 (339)
T ss_pred eeeEEEeeccCceEEEEcCCCcEEEEecceeeeeecccccccc-ee-eeeeecccCCC----------------ceEEec
Confidence 56778888999999999999999999988888877 899873 23 36644432211 288899
Q ss_pred CCCCeEEEeeCC-CCcce
Q 015021 387 PRKGIIEHLAGM-ADENW 403 (414)
Q Consensus 387 prRg~lEvW~~~-~g~~v 403 (414)
.-+|.|-.|++| .++.|
T Consensus 185 gDD~~l~~~D~R~p~~~i 202 (339)
T KOG0280|consen 185 GDDGSLSCWDIRIPKTFI 202 (339)
T ss_pred CCCceEEEEEecCCccee
Confidence 999999999999 45544
No 83
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=78.35 E-value=8.6 Score=45.64 Aligned_cols=81 Identities=14% Similarity=0.121 Sum_probs=56.8
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC--
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP-- 387 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap-- 387 (414)
+.+|++||+++|+++.-+-|-+.|||+.-+..++-|+===+|+.-=+.+..-.. +. .-.|.+.
T Consensus 1198 vTSi~idp~~~WlviGts~G~l~lWDLRF~~~i~sw~~P~~~~i~~v~~~~~~~-----------~~----S~~vs~~~~ 1262 (1431)
T KOG1240|consen 1198 VTSIVIDPWCNWLVIGTSRGQLVLWDLRFRVPILSWEHPARAPIRHVWLCPTYP-----------QE----SVSVSAGSS 1262 (1431)
T ss_pred eeEEEecCCceEEEEecCCceEEEEEeecCceeecccCcccCCcceEEeeccCC-----------CC----ceEEEeccc
Confidence 689999999999999999999999999999999999743333333222211100 00 1223333
Q ss_pred CCCeEEEeeCCCCcceee
Q 015021 388 RKGIIEHLAGMADENWTT 405 (414)
Q Consensus 388 rRg~lEvW~~~~g~~v~~ 405 (414)
-.+-|++|+|-+|.+=++
T Consensus 1263 ~~nevs~wn~~~g~~~~v 1280 (1431)
T KOG1240|consen 1263 SNNEVSTWNMETGLRQTV 1280 (1431)
T ss_pred CCCceeeeecccCcceEE
Confidence 368899999999987643
No 84
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=78.13 E-value=3.7 Score=42.57 Aligned_cols=66 Identities=15% Similarity=0.125 Sum_probs=53.7
Q ss_pred CEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeC
Q 015021 320 SLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAG 397 (414)
Q Consensus 320 ~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~ 397 (414)
+.+|+.=+-|-|=|+|..+|..++.+|||-++ +++|+... ....|+-+.-+|.|.+|++
T Consensus 41 ~~vav~lSngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~d-------------------s~h~v~s~ssDG~Vr~wD~ 101 (376)
T KOG1188|consen 41 TAVAVSLSNGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCD-------------------SPHGVISCSSDGTVRLWDI 101 (376)
T ss_pred eeEEEEecCCeEEEEeccchhhhheecCCCCcccceEEecCC-------------------CCCeeEEeccCCeEEEEEe
Confidence 67788888999999999999999999999875 44444310 1248899999999999999
Q ss_pred CCCccee
Q 015021 398 MADENWT 404 (414)
Q Consensus 398 ~~g~~v~ 404 (414)
|.-.+++
T Consensus 102 Rs~~e~a 108 (376)
T KOG1188|consen 102 RSQAESA 108 (376)
T ss_pred ecchhhh
Confidence 9877764
No 85
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=78.03 E-value=13 Score=39.72 Aligned_cols=79 Identities=13% Similarity=0.139 Sum_probs=56.2
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc-ceEEEEEEc
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD-YCLCLAIHA 386 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~-~~l~LvIya 386 (414)
-....+.+-|+|.|+..+++.|....-|..+|..+-+---- -.+|+-- ....| -.|.|.-|.
T Consensus 304 ~~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~-~s~v~~t----------------s~~fHpDgLifgtgt 366 (506)
T KOG0289|consen 304 EPVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDE-TSDVEYT----------------SAAFHPDGLIFGTGT 366 (506)
T ss_pred ccceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEeec-cccceeE----------------EeeEcCCceEEeccC
Confidence 34567889999999999999999999999999776543110 0111111 11111 256788888
Q ss_pred CCCCeEEEeeCCCCccee
Q 015021 387 PRKGIIEHLAGMADENWT 404 (414)
Q Consensus 387 prRg~lEvW~~~~g~~v~ 404 (414)
| +|.|+||++..+..++
T Consensus 367 ~-d~~vkiwdlks~~~~a 383 (506)
T KOG0289|consen 367 P-DGVVKIWDLKSQTNVA 383 (506)
T ss_pred C-CceEEEEEcCCccccc
Confidence 8 9999999999988664
No 86
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=77.96 E-value=4.2 Score=41.49 Aligned_cols=39 Identities=15% Similarity=0.180 Sum_probs=36.6
Q ss_pred eEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccc
Q 015021 311 ERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYR 349 (414)
Q Consensus 311 ~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyR 349 (414)
..-+-+|+++++.+++..|||.++++.+|..+..|+|-+
T Consensus 236 ~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~~v~~~~~~~ 274 (311)
T KOG1446|consen 236 LSATFTPDSKFVLSGSDDGTIHVWNLETGKKVAVLRGPN 274 (311)
T ss_pred eeEEECCCCcEEEEecCCCcEEEEEcCCCcEeeEecCCC
Confidence 577889999999999999999999999999999999973
No 87
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=77.86 E-value=10 Score=40.29 Aligned_cols=46 Identities=17% Similarity=0.247 Sum_probs=38.7
Q ss_pred CCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcce-eEeeecccccc
Q 015021 306 HPRKGERLTLSPSG-SLAAITDSLGRILLLDTQALV-VVRLWKGYRDA 351 (414)
Q Consensus 306 ~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~-vvrmwKGyRdA 351 (414)
+.+...++...|.+ .++|+.++.|+|.|+|+.++. .+.-..|++|+
T Consensus 271 h~~~vn~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~~~lh~~e~H~de 318 (422)
T KOG0264|consen 271 HSAEVNCVAFNPFNEFILATGSADKTVALWDLRNLNKPLHTFEGHEDE 318 (422)
T ss_pred cCCceeEEEeCCCCCceEEeccCCCcEEEeechhcccCceeccCCCcc
Confidence 34677889999998 677778999999999999875 57788999996
No 88
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=77.67 E-value=1.9 Score=45.01 Aligned_cols=79 Identities=19% Similarity=0.170 Sum_probs=59.2
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
+.|.-++|.+-. +--.|++++...++-.|=|+|+..|..+||..|+-| -+.+.--.+ |
T Consensus 354 Rtl~gHkRGIAC--lQYr~rlvVSGSSDntIRlwdi~~G~cLRvLeGHEe----LvRciRFd~-----------k----- 411 (499)
T KOG0281|consen 354 RTLNGHKRGIAC--LQYRDRLVVSGSSDNTIRLWDIECGACLRVLEGHEE----LVRCIRFDN-----------K----- 411 (499)
T ss_pred hhhhccccccee--hhccCeEEEecCCCceEEEEeccccHHHHHHhchHH----hhhheeecC-----------c-----
Confidence 457777777654 446789999988889999999999999999999988 343321111 1
Q ss_pred EEEEEcCCCCeEEEeeCCCCcc
Q 015021 381 CLAIHAPRKGIIEHLAGMADEN 402 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~g~~ 402 (414)
-++....+|.|+||+++.+..
T Consensus 412 -rIVSGaYDGkikvWdl~aald 432 (499)
T KOG0281|consen 412 -RIVSGAYDGKIKVWDLQAALD 432 (499)
T ss_pred -eeeeccccceEEEEecccccC
Confidence 235566799999999998753
No 89
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=76.71 E-value=12 Score=42.49 Aligned_cols=115 Identities=16% Similarity=0.188 Sum_probs=69.3
Q ss_pred CeeeeccC--cceeee-eecceEEE-EeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-c
Q 015021 33 PNLLCALD--MHTIAL-ANRYQTVI-INWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-L 107 (414)
Q Consensus 33 ~~~~~sp~--~~~la~-A~~~~~v~-~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e 107 (414)
++....-. |.++|+ |.++.++- ++-..-+ ..+.- +. .-..||++. +.+|---+||||.+|.|++|+ +
T Consensus 24 ~~~~~~~~~~Gr~va~~a~E~vn~WdlRtge~~-~~l~~---~~--~k~evt~l~--~~~d~l~lAVGYaDGsVqif~~~ 95 (888)
T KOG0306|consen 24 INFVVKRSGKGRAVAVSALEQVNIWDLRTGEIE-KKLIL---LK--KKAEVTCLR--SSDDILLLAVGYADGSVQIFSLE 95 (888)
T ss_pred eeEEEeecCCCcEEEEeccccEeEEeeecchhh-hhhhh---hc--ccceEEEee--ccCCcceEEEEecCceEEeeccC
Confidence 44444444 888888 77776654 3321000 00100 00 112355542 337778899999999999999 6
Q ss_pred CCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeC----CeEEEEeC
Q 015021 108 KGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMP----GVLARFDG 156 (414)
Q Consensus 108 ~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~----~~iv~IdG 156 (414)
++..++.=-.|...|.-|++-.-...+ -..+.+.-+|||. .+..++.|
T Consensus 96 s~~~~~tfngHK~AVt~l~fd~~G~rl-aSGskDt~IIvwDlV~E~Gl~rL~G 147 (888)
T KOG0306|consen 96 SEEILITFNGHKAAVTTLKFDKIGTRL-ASGSKDTDIIVWDLVGEEGLFRLRG 147 (888)
T ss_pred CCceeeeecccccceEEEEEcccCceE-eecCCCccEEEEEeccceeeEEeec
Confidence 888888777799999988876543211 1112344455554 45677777
No 90
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=76.16 E-value=23 Score=35.04 Aligned_cols=52 Identities=21% Similarity=0.381 Sum_probs=38.5
Q ss_pred CCCeEEEEEeEEeCC-----cEEEEEecc---------c-cEEEEEecCC--------cEEeecccCccceeEEEE
Q 015021 75 ASEYITAIEWLVFEE-----MRALAVGTS---------R-GYFLVYDLKG--------DLVHRQLIHPGRILKLRV 127 (414)
Q Consensus 75 ~~e~ITs~~~lpl~d-----w~~I~VG~s---------s-G~vrfyte~G--------~LL~sQ~lh~~pV~~ik~ 127 (414)
++|.++|++-+.+.+ ..+|+|||. + |.+.+|+-.- .++++.- .+.||..|..
T Consensus 22 ~~E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i~~~~-~~g~V~ai~~ 96 (321)
T PF03178_consen 22 PNEHVTSLCSVKLKGDSTGKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLIHSTE-VKGPVTAICS 96 (321)
T ss_dssp TTEEEEEEEEEEETTS---SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEEEEEE-ESS-EEEEEE
T ss_pred CCceEEEEEEEEEcCccccccCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEEEEEe-ecCcceEhhh
Confidence 479999999999964 799999997 3 9999999544 2333333 3678888873
No 91
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=75.89 E-value=5.3 Score=23.72 Aligned_cols=33 Identities=33% Similarity=0.434 Sum_probs=26.4
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEE
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLD 335 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D 335 (414)
+.........+...|.++++++....|.|.++|
T Consensus 8 ~~~~~~~i~~~~~~~~~~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 8 LKGHTGPVTSVAFSPDGKYLASASDDGTIKLWD 40 (40)
T ss_pred EEecCCceeEEEECCCCCEEEEecCCCeEEEcC
Confidence 445556788999999999999988889887765
No 92
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=75.23 E-value=19 Score=40.56 Aligned_cols=56 Identities=21% Similarity=0.230 Sum_probs=41.7
Q ss_pred CCeEEEEEeEEe--CCc-EEEEEeccccEEEEEecC-----CcEE-------eec-ccCccceeEEEEeccc
Q 015021 76 SEYITAIEWLVF--EEM-RALAVGTSRGYFLVYDLK-----GDLV-------HRQ-LIHPGRILKLRVRGSR 131 (414)
Q Consensus 76 ~e~ITs~~~lpl--~dw-~~I~VG~ssG~vrfyte~-----G~LL-------~sQ-~lh~~pV~~ik~r~~~ 131 (414)
++.+|++.++|+ ++- -||+||+..|-+.+|+-. ++.. +.+ +-|.+.|.+|+.|...
T Consensus 664 ~~aVTAv~~~~~~~~e~~~~vavGle~GeI~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~aV~rl~w~p~~ 735 (764)
T KOG1063|consen 664 SLAVTAVAYLPVDHNEKGDVVAVGLEKGEIVLWRRKREHRQVTVGTFNLDTRLCATIGPDSAVNRLLWRPTC 735 (764)
T ss_pred CCceeeEEeeccccccccceEEEEecccEEEEEecccccccccceeeeeccccccccChHHhhheeEecccc
Confidence 688999999999 333 499999999999999932 2221 222 4567899999988653
No 93
>PF14779 BBS1: Ciliary BBSome complex subunit 1
Probab=74.96 E-value=6.3 Score=39.34 Aligned_cols=53 Identities=21% Similarity=0.310 Sum_probs=40.3
Q ss_pred CeEEEEEeEEe-----CCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEec
Q 015021 77 EYITAIEWLVF-----EEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRG 129 (414)
Q Consensus 77 e~ITs~~~lpl-----~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~ 129 (414)
..||++..|.- ....|++|||-+|.|.+.+.++--++.|.-=+++-..|.+-.
T Consensus 177 t~ITcm~tikk~~~d~~a~scLViGTE~~~i~iLd~~af~il~~~~lpsvPv~i~~~G 234 (257)
T PF14779_consen 177 TVITCMATIKKSSADEDAVSCLVIGTESGEIYILDPQAFTILKQVQLPSVPVFISVSG 234 (257)
T ss_pred ceeEEeeeecccccCCCCcceEEEEecCCeEEEECchhheeEEEEecCCCceEEEEEe
Confidence 46888888887 345999999999999999999999998864444333555443
No 94
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=74.44 E-value=6.3 Score=42.31 Aligned_cols=78 Identities=10% Similarity=0.086 Sum_probs=59.8
Q ss_pred CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
++..-+++..-||+|+|+.|.-|-|=++|..+..++|..+++-. -...+.... . ..-+...+
T Consensus 68 k~~v~s~~fR~DG~LlaaGD~sG~V~vfD~k~r~iLR~~~ah~a-pv~~~~f~~-------------~----d~t~l~s~ 129 (487)
T KOG0310|consen 68 KDVVYSVDFRSDGRLLAAGDESGHVKVFDMKSRVILRQLYAHQA-PVHVTKFSP-------------Q----DNTMLVSG 129 (487)
T ss_pred ccceeEEEeecCCeEEEccCCcCcEEEeccccHHHHHHHhhccC-ceeEEEecc-------------c----CCeEEEec
Confidence 46678889999999999999999999999999999999988753 222222211 1 11355778
Q ss_pred CCCCeEEEeeCCCCcc
Q 015021 387 PRKGIIEHLAGMADEN 402 (414)
Q Consensus 387 prRg~lEvW~~~~g~~ 402 (414)
.-++++.+|++-++.-
T Consensus 130 sDd~v~k~~d~s~a~v 145 (487)
T KOG0310|consen 130 SDDKVVKYWDLSTAYV 145 (487)
T ss_pred CCCceEEEEEcCCcEE
Confidence 8889999999988763
No 95
>PRK01742 tolB translocation protein TolB; Provisional
Probab=74.02 E-value=10 Score=39.39 Aligned_cols=44 Identities=14% Similarity=0.106 Sum_probs=32.6
Q ss_pred eEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeE
Q 015021 311 ERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVF 355 (414)
Q Consensus 311 ~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~w 355 (414)
.....||+|++++.+.. .+++++|+.+|...++.+++.+-.+.|
T Consensus 336 ~~~~~SpDG~~ia~~~~-~~i~~~Dl~~g~~~~lt~~~~~~~~~~ 379 (429)
T PRK01742 336 YSAQISADGKTLVMING-DNVVKQDLTSGSTEVLSSTFLDESPSI 379 (429)
T ss_pred CCccCCCCCCEEEEEcC-CCEEEEECCCCCeEEecCCCCCCCceE
Confidence 35778999998877755 568889999998877777765533333
No 96
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=73.96 E-value=25 Score=33.52 Aligned_cols=68 Identities=25% Similarity=0.450 Sum_probs=46.9
Q ss_pred ccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcE
Q 015021 38 ALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDL 111 (414)
Q Consensus 38 sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~L 111 (414)
......|++|.+.++.+.+|......-.....++.- .+.++++.|+ ...|+||+.++|..+=-++|..
T Consensus 102 ~~~~~~L~va~kk~i~i~~~~~~~~~f~~~~ke~~l--p~~~~~i~~~----~~~i~v~~~~~f~~idl~~~~~ 169 (275)
T PF00780_consen 102 HEGSRRLCVAVKKKILIYEWNDPRNSFSKLLKEISL--PDPPSSIAFL----GNKICVGTSKGFYLIDLNTGSP 169 (275)
T ss_pred cccceEEEEEECCEEEEEEEECCcccccceeEEEEc--CCCcEEEEEe----CCEEEEEeCCceEEEecCCCCc
Confidence 566778899999999999998753211022233433 2668888888 4588899999987765565654
No 97
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=73.95 E-value=17 Score=39.16 Aligned_cols=55 Identities=22% Similarity=0.433 Sum_probs=46.1
Q ss_pred CCCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEeccc
Q 015021 75 ASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSR 131 (414)
Q Consensus 75 ~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~ 131 (414)
.+.-||++.|= .|-+.+|.|+-+|++|+|+.+|.|+.+--+|..||.+||-+...
T Consensus 234 ~nkdVT~L~Wn--~~G~~LatG~~~G~~riw~~~G~l~~tl~~HkgPI~slKWnk~G 288 (524)
T KOG0273|consen 234 SNKDVTSLDWN--NDGTLLATGSEDGEARIWNKDGNLISTLGQHKGPIFSLKWNKKG 288 (524)
T ss_pred ccCCcceEEec--CCCCeEEEeecCcEEEEEecCchhhhhhhccCCceEEEEEcCCC
Confidence 34778888442 23479999999999999999999999999999999999966543
No 98
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=73.78 E-value=22 Score=36.48 Aligned_cols=86 Identities=19% Similarity=0.152 Sum_probs=60.7
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcce-eEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALV-VVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD 377 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~-vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~ 377 (414)
|+--++-++-++..+..+|+|.++|..-.+--|+|+++-..+ =.-..|||-.|=+.=--+. + +.
T Consensus 39 p~m~l~gh~geI~~~~F~P~gs~~aSgG~Dr~I~LWnv~gdceN~~~lkgHsgAVM~l~~~~---d-----------~s- 103 (338)
T KOG0265|consen 39 PIMLLPGHKGEIYTIKFHPDGSCFASGGSDRAIVLWNVYGDCENFWVLKGHSGAVMELHGMR---D-----------GS- 103 (338)
T ss_pred hhhhcCCCcceEEEEEECCCCCeEeecCCcceEEEEeccccccceeeeccccceeEeeeecc---C-----------CC-
Confidence 333477888999999999999999999888889998854332 2234567777644322211 1 11
Q ss_pred ceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMADENW 403 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~g~~v 403 (414)
-+.-+.-+-.|-+||+++|+++
T Consensus 104 ----~i~S~gtDk~v~~wD~~tG~~~ 125 (338)
T KOG0265|consen 104 ----HILSCGTDKTVRGWDAETGKRI 125 (338)
T ss_pred ----EEEEecCCceEEEEecccceee
Confidence 3456777889999999999987
No 99
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=73.69 E-value=15 Score=41.96 Aligned_cols=58 Identities=17% Similarity=0.183 Sum_probs=47.6
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecc
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNK 362 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~ 362 (414)
|.-+..++...+.||+|+|.++++.++.|=++|+-+|..|-+. .=|.-|-.+...++.
T Consensus 572 f~gh~nritd~~FS~DgrWlisasmD~tIr~wDlpt~~lID~~--~vd~~~~sls~SPng 629 (910)
T KOG1539|consen 572 FWGHGNRITDMTFSPDGRWLISASMDSTIRTWDLPTGTLIDGL--LVDSPCTSLSFSPNG 629 (910)
T ss_pred hhccccceeeeEeCCCCcEEEEeecCCcEEEEeccCcceeeeE--ecCCcceeeEECCCC
Confidence 4555677889999999999999999999999999999999887 346666666655443
No 100
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=73.30 E-value=4.9 Score=41.65 Aligned_cols=87 Identities=16% Similarity=0.020 Sum_probs=58.9
Q ss_pred CCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021 306 HPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH 385 (414)
Q Consensus 306 ~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy 385 (414)
-+-+.++-..||||+|.+...-+|-|=+||-.+|.+-+=.|= .||=.|+.+.+....-.- .+ +. =.+--
T Consensus 212 ~KSh~EcA~FSPDgqyLvsgSvDGFiEVWny~~GKlrKDLkY--QAqd~fMMmd~aVlci~F-----SR--Ds--EMlAs 280 (508)
T KOG0275|consen 212 QKSHVECARFSPDGQYLVSGSVDGFIEVWNYTTGKLRKDLKY--QAQDNFMMMDDAVLCISF-----SR--DS--EMLAS 280 (508)
T ss_pred cccchhheeeCCCCceEeeccccceeeeehhccchhhhhhhh--hhhcceeecccceEEEee-----cc--cH--HHhhc
Confidence 355788999999999999999999999999999987665542 355566655422100000 00 00 01123
Q ss_pred cCCCCeEEEeeCCCCcce
Q 015021 386 APRKGIIEHLAGMADENW 403 (414)
Q Consensus 386 aprRg~lEvW~~~~g~~v 403 (414)
...+|-|+||.+++|.|+
T Consensus 281 GsqDGkIKvWri~tG~Cl 298 (508)
T KOG0275|consen 281 GSQDGKIKVWRIETGQCL 298 (508)
T ss_pred cCcCCcEEEEEEecchHH
Confidence 456899999999999987
No 101
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=72.65 E-value=9.6 Score=39.89 Aligned_cols=53 Identities=21% Similarity=0.283 Sum_probs=45.0
Q ss_pred CeEEEEEeEEe-CCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEec
Q 015021 77 EYITAIEWLVF-EEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRG 129 (414)
Q Consensus 77 e~ITs~~~lpl-~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~ 129 (414)
+.+-|++.+|+ .....-|+|.-+|-|.+|+-.-..|+.++-|+++|.+|+.-.
T Consensus 284 ~~~esve~~~~ss~lpL~A~G~vdG~i~iyD~a~~~~R~~c~he~~V~~l~w~~ 337 (399)
T KOG0296|consen 284 ELDESVESIPSSSKLPLAACGSVDGTIAIYDLAASTLRHICEHEDGVTKLKWLN 337 (399)
T ss_pred hhhhhhhhcccccccchhhcccccceEEEEecccchhheeccCCCceEEEEEcC
Confidence 55666666676 445678999999999999999999999999999999999655
No 102
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=72.59 E-value=23 Score=39.68 Aligned_cols=80 Identities=14% Similarity=0.142 Sum_probs=63.4
Q ss_pred CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEe-----eecc-cccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVR-----LWKG-YRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvr-----mwKG-yRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
+-+..+|.-.|++.-+|..-.+|+|.++|+..+.-+. +-+- -|+.-|.|-......
T Consensus 154 ~sRvLslsw~~~~~~i~~Gs~Dg~Iriwd~~~~~t~~~~~~~~d~l~k~~~~iVWSv~~Lrd------------------ 215 (691)
T KOG2048|consen 154 KSRVLSLSWNPTGTKIAGGSIDGVIRIWDVKSGQTLHIITMQLDRLSKREPTIVWSVLFLRD------------------ 215 (691)
T ss_pred cceEEEEEecCCccEEEecccCceEEEEEcCCCceEEEeeecccccccCCceEEEEEEEeec------------------
Confidence 4557788888999988888899999999999999887 4443 348889997665422
Q ss_pred EEEEEcCCCCeEEEeeCCCCccee
Q 015021 381 CLAIHAPRKGIIEHLAGMADENWT 404 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~g~~v~ 404 (414)
-.+|-.=.+|.|.+||-++|-.+.
T Consensus 216 ~tI~sgDS~G~V~FWd~~~gTLiq 239 (691)
T KOG2048|consen 216 STIASGDSAGTVTFWDSIFGTLIQ 239 (691)
T ss_pred CcEEEecCCceEEEEcccCcchhh
Confidence 156788889999999999998763
No 103
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=72.47 E-value=12 Score=40.80 Aligned_cols=78 Identities=12% Similarity=0.041 Sum_probs=57.2
Q ss_pred CCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021 306 HPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH 385 (414)
Q Consensus 306 ~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy 385 (414)
+.-.+..+.++ +.+.++.-..|.|.++|+.++..++-.+|+-. |+....... . -.++-
T Consensus 330 h~~~V~~v~~~--~~~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~----~V~sl~~~~-----------~-----~~~~S 387 (537)
T KOG0274|consen 330 HTGPVNCVQLD--EPLLVSGSYDGTVKVWDPRTGKCLKSLSGHTG----RVYSLIVDS-----------E-----NRLLS 387 (537)
T ss_pred ccccEEEEEec--CCEEEEEecCceEEEEEhhhceeeeeecCCcc----eEEEEEecC-----------c-----ceEEe
Confidence 55556677777 66666655566999999999999999999777 666542211 0 14466
Q ss_pred cCCCCeEEEeeCCCC-cceee
Q 015021 386 APRKGIIEHLAGMAD-ENWTT 405 (414)
Q Consensus 386 aprRg~lEvW~~~~g-~~v~~ 405 (414)
..-++.|++||+.++ +|+.|
T Consensus 388 gs~D~~IkvWdl~~~~~c~~t 408 (537)
T KOG0274|consen 388 GSLDTTIKVWDLRTKRKCIHT 408 (537)
T ss_pred eeeccceEeecCCchhhhhhh
Confidence 777899999999999 88744
No 104
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=71.38 E-value=12 Score=42.69 Aligned_cols=78 Identities=21% Similarity=0.099 Sum_probs=45.0
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK 389 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR 389 (414)
.+...+||+++++|+.|+.|||+++---.- -=+-+----+.|.-........ ..+.+ .+..+.+-
T Consensus 208 ~t~~~~spn~~~~Aa~d~dGrI~vw~d~~~----~~~~~t~t~lHWH~~~V~~L~f---------S~~G~--~LlSGG~E 272 (792)
T KOG1963|consen 208 ITCVALSPNERYLAAGDSDGRILVWRDFGS----SDDSETCTLLHWHHDEVNSLSF---------SSDGA--YLLSGGRE 272 (792)
T ss_pred ceeEEeccccceEEEeccCCcEEEEecccc----ccccccceEEEecccccceeEE---------ecCCc--eEeecccc
Confidence 678999999999999999999876532220 0000001112366211111100 00111 34567778
Q ss_pred CeEEEeeCCCCcc
Q 015021 390 GIIEHLAGMADEN 402 (414)
Q Consensus 390 g~lEvW~~~~g~~ 402 (414)
|.|-+|.+.++++
T Consensus 273 ~VLv~Wq~~T~~k 285 (792)
T KOG1963|consen 273 GVLVLWQLETGKK 285 (792)
T ss_pred eEEEEEeecCCCc
Confidence 9999999988764
No 105
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=70.66 E-value=36 Score=31.34 Aligned_cols=80 Identities=24% Similarity=0.255 Sum_probs=57.7
Q ss_pred ccCCCceeeEEEECCCCC-EEEEEcCCCcEEEEEcCcceeEe-eecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 303 LKDHPRKGERLTLSPSGS-LAAITDSLGRILLLDTQALVVVR-LWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~-laa~tD~lGRV~L~D~~~~~vvr-mwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
+.........+..+|+++ +++.....|.|.++|...+..++ .+.|+.+-. +..... . .
T Consensus 194 ~~~~~~~v~~~~~~~~~~~~~~~~~~d~~i~~wd~~~~~~~~~~~~~~~~~~---~~~~~~------------~--~--- 253 (466)
T COG2319 194 LAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV---VSSFSP------------D--G--- 253 (466)
T ss_pred eccCCCceEEEEEcCCcceEEEEecCCCcEEEEECCCCcEEeeecCCCCcce---eEeECC------------C--C---
Confidence 444678888899999998 44445999999999999999999 788888842 111111 0 0
Q ss_pred EEEEEcCCCCeEEEeeCCCCcc
Q 015021 381 CLAIHAPRKGIIEHLAGMADEN 402 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~g~~ 402 (414)
-+++-....+.+.+|+++....
T Consensus 254 ~~~~~~~~d~~~~~~~~~~~~~ 275 (466)
T COG2319 254 SLLASGSSDGTIRLWDLRSSSS 275 (466)
T ss_pred CEEEEecCCCcEEEeeecCCCc
Confidence 1333778889999999987765
No 106
>KOG4328 consensus WD40 protein [Function unknown]
Probab=69.86 E-value=1.7e+02 Score=31.80 Aligned_cols=80 Identities=19% Similarity=0.340 Sum_probs=57.2
Q ss_pred CCeEEEEEeEEeCCcEEEEEeccccEEEEEec-------CCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeC
Q 015021 76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYDL-------KGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMP 148 (414)
Q Consensus 76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte-------~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~ 148 (414)
.++||++.+.|-..-..|++|=..|.|-|..- .|..+| .-|-.||-.|+|.-.-. +.-++--|.
T Consensus 186 ~~Rit~l~fHPt~~~~lva~GdK~G~VG~Wn~~~~~~d~d~v~~f--~~hs~~Vs~l~F~P~n~-------s~i~ssSyD 256 (498)
T KOG4328|consen 186 DRRITSLAFHPTENRKLVAVGDKGGQVGLWNFGTQEKDKDGVYLF--TPHSGPVSGLKFSPANT-------SQIYSSSYD 256 (498)
T ss_pred ccceEEEEecccCcceEEEEccCCCcEEEEecCCCCCccCceEEe--ccCCccccceEecCCCh-------hheeeeccC
Confidence 58999999999999899999999999999986 344544 35668999999765433 444566666
Q ss_pred CeE--EEEeChhHHHHHH
Q 015021 149 GVL--ARFDGSEIQKMLQ 164 (414)
Q Consensus 149 ~~i--v~IdG~sL~~~Lr 164 (414)
+.| .-+.+.-+..+|+
T Consensus 257 GtiR~~D~~~~i~e~v~s 274 (498)
T KOG4328|consen 257 GTIRLQDFEGNISEEVLS 274 (498)
T ss_pred ceeeeeeecchhhHHHhh
Confidence 665 3344444444433
No 107
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=69.71 E-value=6.1 Score=44.46 Aligned_cols=83 Identities=18% Similarity=0.104 Sum_probs=59.3
Q ss_pred cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEE
Q 015021 302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLC 381 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~ 381 (414)
.+.-+.|....+.++|+|+|.|..-..--|=++|+.+|.++.-+|++-. +..=++.. ..=|
T Consensus 149 ~~~s~~~vv~~l~lsP~Gr~v~~g~ed~tvki~d~~agk~~~ef~~~e~-~v~sle~h------------------p~e~ 209 (825)
T KOG0267|consen 149 TYKSHTRVVDVLRLSPDGRWVASGGEDNTVKIWDLTAGKLSKEFKSHEG-KVQSLEFH------------------PLEV 209 (825)
T ss_pred eecCCcceeEEEeecCCCceeeccCCcceeeeecccccccccccccccc-cccccccC------------------chhh
Confidence 3455778899999999999999965577888899999999988887543 22222211 1114
Q ss_pred EEEEcCCCCeEEEeeCCCCcce
Q 015021 382 LAIHAPRKGIIEHLAGMADENW 403 (414)
Q Consensus 382 LvIyaprRg~lEvW~~~~g~~v 403 (414)
|.=-...++++.+|++.+.+-|
T Consensus 210 Lla~Gs~d~tv~f~dletfe~I 231 (825)
T KOG0267|consen 210 LLAPGSSDRTVRFWDLETFEVI 231 (825)
T ss_pred hhccCCCCceeeeeccceeEEe
Confidence 4455567889999999977766
No 108
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=69.35 E-value=34 Score=35.48 Aligned_cols=50 Identities=18% Similarity=0.210 Sum_probs=42.6
Q ss_pred EEeEEeCCcEEEEEeccccEEEEEecC-CcEEeecccCccceeEEEEeccc
Q 015021 82 IEWLVFEEMRALAVGTSRGYFLVYDLK-GDLVHRQLIHPGRILKLRVRGSR 131 (414)
Q Consensus 82 ~~~lpl~dw~~I~VG~ssG~vrfyte~-G~LL~sQ~lh~~pV~~ik~r~~~ 131 (414)
+.|+++.+-..++||-+++.+++++.+ +.-++.-.-|+.+|..|..-+.+
T Consensus 211 ~l~~~~l~~~~L~vG~d~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~~~~ 261 (362)
T KOG0294|consen 211 ILCATFLDGSELLVGGDNEWISLKDTDSDTPLTEFLAHENRVKDIASYTNP 261 (362)
T ss_pred ceeeeecCCceEEEecCCceEEEeccCCCccceeeecchhheeeeEEEecC
Confidence 456777788899999999999999975 88888889999999999966654
No 109
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=67.94 E-value=34 Score=40.57 Aligned_cols=57 Identities=23% Similarity=0.179 Sum_probs=40.4
Q ss_pred CCCeEEEEEeEEeCCcEEEEEeccccEEEEEec------CCcEEeecccCccceeEEEEecccC
Q 015021 75 ASEYITAIEWLVFEEMRALAVGTSRGYFLVYDL------KGDLVHRQLIHPGRILKLRVRGSRR 132 (414)
Q Consensus 75 ~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte------~G~LL~sQ~lh~~pV~~ik~r~~~~ 132 (414)
.+.+|||+.+=- ..-..||+||.+|.||+|+. .+....+|.=..+||+.+++|....
T Consensus 1207 s~t~vTaLS~~~-~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~ 1269 (1387)
T KOG1517|consen 1207 SSTLVTALSADL-VHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGL 1269 (1387)
T ss_pred CCccceeecccc-cCCceEEEeecCCceEEeecccCCccccceeecccCCcccceeEEeecCCC
Confidence 356777775522 23579999999999999994 3455555554445699999988664
No 110
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=67.85 E-value=6 Score=45.37 Aligned_cols=46 Identities=22% Similarity=0.410 Sum_probs=40.4
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccc
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGY 348 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGy 348 (414)
|.-++-....|..+|.+.+.|+++-.|-|.+||++.+.+..+|+|-
T Consensus 134 lrgh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v 179 (933)
T KOG1274|consen 134 LRGHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGV 179 (933)
T ss_pred ecccCCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccC
Confidence 3334455678999999999999999999999999999999999985
No 111
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=66.24 E-value=15 Score=27.27 Aligned_cols=30 Identities=17% Similarity=0.222 Sum_probs=27.0
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcce
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALV 340 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~ 340 (414)
+..++-||+..|.|++...|.|++..+ +++
T Consensus 14 v~~~~w~P~mdLiA~~t~~g~v~v~Rl-~~q 43 (47)
T PF12894_consen 14 VSCMSWCPTMDLIALGTEDGEVLVYRL-NWQ 43 (47)
T ss_pred EEEEEECCCCCEEEEEECCCeEEEEEC-CCc
Confidence 678999999999999999999999998 444
No 112
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=66.16 E-value=12 Score=39.16 Aligned_cols=74 Identities=16% Similarity=0.112 Sum_probs=52.8
Q ss_pred ECCCCCEEEEEcCCCcEEEEEcCccee---EeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCe
Q 015021 315 LSPSGSLAAITDSLGRILLLDTQALVV---VRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGI 391 (414)
Q Consensus 315 ~sP~~~laa~tD~lGRV~L~D~~~~~v---vrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~ 391 (414)
+..++++..++|-.|.|-|+|+..-.- |+.+.||=+- -+-+++.-+.+. -.|.-+..+=.
T Consensus 307 Lq~s~q~LmaS~M~gkikLyD~R~~K~~~~V~qYeGHvN~-~a~l~~~v~~ee----------------g~I~s~GdDcy 369 (425)
T KOG2695|consen 307 LQFSQQKLMASDMTGKIKLYDLRATKCKKSVMQYEGHVNL-SAYLPAHVKEEE----------------GSIFSVGDDCY 369 (425)
T ss_pred hccccceEeeccCcCceeEeeehhhhcccceeeeeccccc-cccccccccccc----------------ceEEEccCeeE
Confidence 345788999999999999999987777 8888888651 112222111110 14455788888
Q ss_pred EEEeeCCCCcceee
Q 015021 392 IEHLAGMADENWTT 405 (414)
Q Consensus 392 lEvW~~~~g~~v~~ 405 (414)
..||++++|+...|
T Consensus 370 tRiWsl~~ghLl~t 383 (425)
T KOG2695|consen 370 TRIWSLDSGHLLCT 383 (425)
T ss_pred EEEEecccCceeec
Confidence 99999999999866
No 113
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=64.55 E-value=13 Score=38.57 Aligned_cols=39 Identities=8% Similarity=0.042 Sum_probs=34.3
Q ss_pred CceeeEEEECCCCC-EEEEEc-CCCcEEEEEcCcceeEeee
Q 015021 307 PRKGERLTLSPSGS-LAAITD-SLGRILLLDTQALVVVRLW 345 (414)
Q Consensus 307 ~R~~~~i~~sP~~~-laa~tD-~lGRV~L~D~~~~~vvrmw 345 (414)
.+....|+++|+|+ ++.+++ .-|.|.+||+.++.+++--
T Consensus 298 G~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t~k~i~~i 338 (352)
T TIGR02658 298 GHEIDSINVSQDAKPLLYALSTGDKTLYIFDAETGKELSSV 338 (352)
T ss_pred CCceeeEEECCCCCeEEEEeCCCCCcEEEEECcCCeEEeee
Confidence 46788999999999 888887 6788999999999999866
No 114
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=64.15 E-value=25 Score=37.45 Aligned_cols=82 Identities=15% Similarity=0.180 Sum_probs=57.9
Q ss_pred CCccccCCCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCccee--------------EeeecccccceeeEEEEEeccc
Q 015021 299 PLTCLKDHPRKGERLTLSPSG-SLAAITDSLGRILLLDTQALVV--------------VRLWKGYRDASCVFMEMLVNKD 363 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~v--------------vrmwKGyRdAq~~wi~~~~~~~ 363 (414)
|++.+...+-+...+.-||.. +..|.+-..||++++|+.+-.- +-|=-|+|.-=+-|-
T Consensus 308 ~lh~~e~H~dev~~V~WSPh~etvLASSg~D~rl~vWDls~ig~eq~~eda~dgppEllF~HgGH~~kV~Dfs------- 380 (422)
T KOG0264|consen 308 PLHTFEGHEDEVFQVEWSPHNETVLASSGTDRRLNVWDLSRIGEEQSPEDAEDGPPELLFIHGGHTAKVSDFS------- 380 (422)
T ss_pred CceeccCCCcceEEEEeCCCCCceeEecccCCcEEEEeccccccccChhhhccCCcceeEEecCccccccccc-------
Confidence 667899999999999999999 7888889999999999976332 234444444210000
Q ss_pred ccccccccCCCCCcceEEEEEEcCCCCeEEEeeCC
Q 015021 364 AATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGM 398 (414)
Q Consensus 364 ~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~ 398 (414)
-.|.. . ..+-.+.-+++|.||.|-
T Consensus 381 -------Wnp~e--P--W~I~SvaeDN~LqIW~~s 404 (422)
T KOG0264|consen 381 -------WNPNE--P--WTIASVAEDNILQIWQMA 404 (422)
T ss_pred -------CCCCC--C--eEEEEecCCceEEEeecc
Confidence 00111 1 467788889999999986
No 115
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=63.41 E-value=13 Score=38.66 Aligned_cols=69 Identities=14% Similarity=0.141 Sum_probs=51.6
Q ss_pred CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCC
Q 015021 319 GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGM 398 (414)
Q Consensus 319 ~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~ 398 (414)
+-+.|++-.+|-|-+||+.++....=..|+=+| |. | .+.-|. ..-|++-+..+..|.+|++|
T Consensus 105 ~p~la~~G~~GvIrVid~~~~~~~~~~~ghG~s----IN--e--------ik~~p~----~~qlvls~SkD~svRlwnI~ 166 (385)
T KOG1034|consen 105 NPFLAAGGYLGVIRVIDVVSGQCSKNYRGHGGS----IN--E--------IKFHPD----RPQLVLSASKDHSVRLWNIQ 166 (385)
T ss_pred CeeEEeecceeEEEEEecchhhhccceeccCcc----ch--h--------hhcCCC----CCcEEEEecCCceEEEEecc
Confidence 457788889999999999999998777666552 11 0 111112 22499999999999999999
Q ss_pred CCcceee
Q 015021 399 ADENWTT 405 (414)
Q Consensus 399 ~g~~v~~ 405 (414)
++.||+-
T Consensus 167 ~~~Cv~V 173 (385)
T KOG1034|consen 167 TDVCVAV 173 (385)
T ss_pred CCeEEEE
Confidence 9999964
No 116
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=62.98 E-value=12 Score=39.42 Aligned_cols=49 Identities=20% Similarity=0.307 Sum_probs=42.2
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEE
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEML 359 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~ 359 (414)
..+.++.||++|+++|+.-..|-|.++|+.+++.+..- +.|.-.+|.-.
T Consensus 282 ~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq~~~~v---k~aH~~~VT~l 330 (398)
T KOG0771|consen 282 KSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQRLQYV---KEAHLGFVTGL 330 (398)
T ss_pred CcceeEEEcCCCcEEEEeccCCcEEEEEeceeeeeEee---hhhheeeeeeE
Confidence 35789999999999999999999999999999999887 57777777543
No 117
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=62.89 E-value=67 Score=36.56 Aligned_cols=78 Identities=19% Similarity=0.242 Sum_probs=53.7
Q ss_pred cCCCeeeeccCcceeeeeecceEEE--E--eecCCC-----CCceeecc------cCCCCCCCeEEEEEeEEe-CCcEEE
Q 015021 30 VNDPNLLCALDMHTIALANRYQTVI--I--NWADPE-----GLVAKIRP------ELSPIASEYITAIEWLVF-EEMRAL 93 (414)
Q Consensus 30 ~~~~~~~~sp~~~~la~A~~~~~v~--~--~w~~~~-----~~~v~~~g------~l~~~~~e~ITs~~~lpl-~dw~~I 93 (414)
++-..|.++|+|++||++-..-++| + +|...+ ...+.++- ......+-.|.-+.|=|. .+...+
T Consensus 85 f~v~~i~~n~~g~~lal~G~~~v~V~~LP~r~g~~~~~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s~~~~~l 164 (717)
T PF10168_consen 85 FEVHQISLNPTGSLLALVGPRGVVVLELPRRWGKNGEFEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWSESDSHL 164 (717)
T ss_pred eeEEEEEECCCCCEEEEEcCCcEEEEEeccccCccccccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCCCCCCeE
Confidence 3335788999999999876664444 3 486554 12355542 112212457889999999 356788
Q ss_pred EEeccccEEEEEec
Q 015021 94 AVGTSRGYFLVYDL 107 (414)
Q Consensus 94 ~VG~ssG~vrfyte 107 (414)
+|=|+++.+|+|+.
T Consensus 165 ~vLtsdn~lR~y~~ 178 (717)
T PF10168_consen 165 VVLTSDNTLRLYDI 178 (717)
T ss_pred EEEecCCEEEEEec
Confidence 99999999999997
No 118
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=62.80 E-value=37 Score=35.20 Aligned_cols=84 Identities=14% Similarity=0.118 Sum_probs=65.4
Q ss_pred CccccCCCceeeEEEECCCCC--EEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021 300 LTCLKDHPRKGERLTLSPSGS--LAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD 377 (414)
Q Consensus 300 l~~l~D~~R~~~~i~~sP~~~--laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~ 377 (414)
+..|.++...+..+..+|+-. +...+-..|-|+++|+..=..+.-.|++-.+ +..|.+... +|
T Consensus 76 lg~ll~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~W~~~~slK~H~~~-Vt~lsiHPS------------~K-- 140 (362)
T KOG0294|consen 76 LGILLSHAGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGSWELLKSLKAHKGQ-VTDLSIHPS------------GK-- 140 (362)
T ss_pred hcceeccccceEEEEecCCcchhheeeecCCCcEEEEEcCCeEEeeeecccccc-cceeEecCC------------Cc--
Confidence 345667777888888888873 4444445699999999999999999999876 777776532 22
Q ss_pred ceEEEEEEcCCCCeEEEeeCCCCcc
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMADEN 402 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~g~~ 402 (414)
|++-..+++.|..|++-+|+.
T Consensus 141 ----LALsVg~D~~lr~WNLV~Gr~ 161 (362)
T KOG0294|consen 141 ----LALSVGGDQVLRTWNLVRGRV 161 (362)
T ss_pred ----eEEEEcCCceeeeehhhcCcc
Confidence 778888999999999998875
No 119
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=62.80 E-value=64 Score=32.77 Aligned_cols=87 Identities=11% Similarity=0.024 Sum_probs=57.5
Q ss_pred eeeeccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEE
Q 015021 34 NLLCALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLV 112 (414)
Q Consensus 34 ~~~~sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL 112 (414)
++-+|++|++|.+|+++-+.|. +...=+.++-. + -...|+|..-= |+--+.+.|=.++|++-|+ .+|+-+
T Consensus 189 SlEvs~dG~ilTia~gssV~Fw--daksf~~lKs~-k----~P~nV~SASL~--P~k~~fVaGged~~~~kfDy~TgeEi 259 (334)
T KOG0278|consen 189 SLEVSQDGRILTIAYGSSVKFW--DAKSFGLLKSY-K----MPCNVESASLH--PKKEFFVAGGEDFKVYKFDYNTGEEI 259 (334)
T ss_pred ceeeccCCCEEEEecCceeEEe--ccccccceeec-c----Ccccccccccc--CCCceEEecCcceEEEEEeccCCcee
Confidence 4558999999999999988772 22110001100 0 01333333111 4557889999999999999 789888
Q ss_pred eec-ccCccceeEEEEec
Q 015021 113 HRQ-LIHPGRILKLRVRG 129 (414)
Q Consensus 113 ~sQ-~lh~~pV~~ik~r~ 129 (414)
=+- -=|+.||..+|+-.
T Consensus 260 ~~~nkgh~gpVhcVrFSP 277 (334)
T KOG0278|consen 260 GSYNKGHFGPVHCVRFSP 277 (334)
T ss_pred eecccCCCCceEEEEECC
Confidence 762 45789999999754
No 120
>PRK03629 tolB translocation protein TolB; Provisional
Probab=61.82 E-value=32 Score=35.89 Aligned_cols=48 Identities=19% Similarity=0.324 Sum_probs=34.3
Q ss_pred ceeeEEEECCCCCEEEEE-cCCC--cEEEEEcCcceeEeeeccc-ccceeeE
Q 015021 308 RKGERLTLSPSGSLAAIT-DSLG--RILLLDTQALVVVRLWKGY-RDASCVF 355 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~t-D~lG--RV~L~D~~~~~vvrmwKGy-RdAq~~w 355 (414)
+.......||+|+.+|.+ +..| +|.++|+.++.+.++..+. .+....|
T Consensus 243 ~~~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~~~~~~~~~~w 294 (429)
T PRK03629 243 RHNGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDGRSNNTEPTW 294 (429)
T ss_pred CCcCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccCCCCCcCceEE
Confidence 334568899999866654 5555 6999999999998888763 3344444
No 121
>PRK04922 tolB translocation protein TolB; Provisional
Probab=61.21 E-value=37 Score=35.27 Aligned_cols=39 Identities=26% Similarity=0.278 Sum_probs=28.8
Q ss_pred eeEEEECCCCCEEE-EEcCCCc--EEEEEcCcceeEeee-ccc
Q 015021 310 GERLTLSPSGSLAA-ITDSLGR--ILLLDTQALVVVRLW-KGY 348 (414)
Q Consensus 310 ~~~i~~sP~~~laa-~tD~lGR--V~L~D~~~~~vvrmw-KGy 348 (414)
....+.+|+|++++ ++|..|+ |+++|+.++..-|+- .|.
T Consensus 294 ~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g~ 336 (433)
T PRK04922 294 DTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQGN 336 (433)
T ss_pred ccceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCCC
Confidence 34578999997555 5677775 999999998887763 443
No 122
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=61.14 E-value=22 Score=24.07 Aligned_cols=29 Identities=21% Similarity=0.327 Sum_probs=22.0
Q ss_pred CCCCEEEEEc-CCCcEEEEEcCcceeEeee
Q 015021 317 PSGSLAAITD-SLGRILLLDTQALVVVRLW 345 (414)
Q Consensus 317 P~~~laa~tD-~lGRV~L~D~~~~~vvrmw 345 (414)
|+|+.+-+++ ..+.|.++|+.++.+++-.
T Consensus 1 pd~~~lyv~~~~~~~v~~id~~~~~~~~~i 30 (42)
T TIGR02276 1 PDGTKLYVTNSGSNTVSVIDTATNKVIATI 30 (42)
T ss_pred CCCCEEEEEeCCCCEEEEEECCCCeEEEEE
Confidence 6787555554 5889999999998877654
No 123
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=60.78 E-value=65 Score=34.18 Aligned_cols=108 Identities=18% Similarity=0.209 Sum_probs=69.1
Q ss_pred ccccCCCCCcccCCCeeeeccCcceeeeeecc---eEEE----EeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcE
Q 015021 19 SDLGAGKEGWLVNDPNLLCALDMHTIALANRY---QTVI----INWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMR 91 (414)
Q Consensus 19 ~~~g~~~~~WL~~~~~~~~sp~~~~la~A~~~---~~v~----~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~ 91 (414)
...++-+.+|+...|.+...++++.|-+|..+ +.|. .-|..+ .-+.++..+.. .|-|||+. +. +|-.
T Consensus 221 a~~t~~~k~~~~~~cRF~~d~~~~~l~laa~~~~~~~v~~~~~~~w~~~--~~l~~~~~~~~--~~siSsl~-VS-~dGk 294 (398)
T KOG0771|consen 221 ARKTPFSKDEMFSSCRFSVDNAQETLRLAASQFPGGGVRLCDISLWSGS--NFLRLRKKIKR--FKSISSLA-VS-DDGK 294 (398)
T ss_pred hhcCCcccchhhhhceecccCCCceEEEEEecCCCCceeEEEeeeeccc--cccchhhhhhc--cCcceeEE-Ec-CCCc
Confidence 34444566777777999998888777665544 3333 224332 11334444422 35666652 22 4678
Q ss_pred EEEEeccccEEEEEec-C-CcEEeecccCccceeEEEEecccC
Q 015021 92 ALAVGTSRGYFLVYDL-K-GDLVHRQLIHPGRILKLRVRGSRR 132 (414)
Q Consensus 92 ~I~VG~ssG~vrfyte-~-G~LL~sQ~lh~~pV~~ik~r~~~~ 132 (414)
++++|+.+|-|.+|.. + -.+.+.-..|..-|.+|.+-.-.+
T Consensus 295 f~AlGT~dGsVai~~~~~lq~~~~vk~aH~~~VT~ltF~Pdsr 337 (398)
T KOG0771|consen 295 FLALGTMDGSVAIYDAKSLQRLQYVKEAHLGFVTGLTFSPDSR 337 (398)
T ss_pred EEEEeccCCcEEEEEeceeeeeEeehhhheeeeeeEEEcCCcC
Confidence 9999999999999993 2 344455577888999999877553
No 124
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=60.42 E-value=21 Score=40.03 Aligned_cols=98 Identities=11% Similarity=0.043 Sum_probs=63.9
Q ss_pred CCCccccCCCceeeEEEE-CCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCC
Q 015021 298 SPLTCLKDHPRKGERLTL-SPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKS 376 (414)
Q Consensus 298 ~pl~~l~D~~R~~~~i~~-sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~ 376 (414)
..+.+|-+++-=+.+|+. .+++.++|..-..++|.|+|+.+|.. +.- |.+-.+++..-. .+.-..-|.-.-.
T Consensus 108 ~c~stir~H~DYVkcla~~ak~~~lvaSgGLD~~IflWDin~~~~-~l~-----~s~n~~t~~sl~-sG~k~siYSLA~N 180 (735)
T KOG0308|consen 108 FCMSTIRTHKDYVKCLAYIAKNNELVASGGLDRKIFLWDINTGTA-TLV-----ASFNNVTVNSLG-SGPKDSIYSLAMN 180 (735)
T ss_pred hhHhhhhcccchheeeeecccCceeEEecCCCccEEEEEccCcch-hhh-----hhccccccccCC-CCCccceeeeecC
Confidence 455678899888999999 89999999999999999999999965 000 111122211100 0100000000011
Q ss_pred cceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021 377 DYCLCLAIHAPRKGIIEHLAGMADENW 403 (414)
Q Consensus 377 ~~~l~LvIyaprRg~lEvW~~~~g~~v 403 (414)
+ -.++++-..--+.|.+|+.++++++
T Consensus 181 ~-t~t~ivsGgtek~lr~wDprt~~ki 206 (735)
T KOG0308|consen 181 Q-TGTIIVSGGTEKDLRLWDPRTCKKI 206 (735)
T ss_pred C-cceEEEecCcccceEEeccccccce
Confidence 1 2267888888999999999999998
No 125
>PRK01742 tolB translocation protein TolB; Provisional
Probab=59.67 E-value=41 Score=34.95 Aligned_cols=39 Identities=23% Similarity=0.309 Sum_probs=28.7
Q ss_pred eeeEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEeeecc
Q 015021 309 KGERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRLWKG 347 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrmwKG 347 (414)
....+..+|+|++++. .+..| +|+++|+.++.+.++-.+
T Consensus 249 ~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~ 290 (429)
T PRK01742 249 HNGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTSG 290 (429)
T ss_pred ccCceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeeccC
Confidence 3446899999976665 45556 678899999888777654
No 126
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=58.90 E-value=18 Score=23.96 Aligned_cols=29 Identities=38% Similarity=0.478 Sum_probs=23.7
Q ss_pred CCeEEEEEeEEeCCcEEEEEeccccEEEEEe
Q 015021 76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYD 106 (414)
Q Consensus 76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt 106 (414)
.+.|+++.|-|- -..++.|-.+|.|++|+
T Consensus 11 ~~~i~~i~~~~~--~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 11 SSSINSIAWSPD--GNFLASGSSDGTIRVWD 39 (39)
T ss_dssp SSSEEEEEEETT--SSEEEEEETTSEEEEEE
T ss_pred CCcEEEEEEecc--cccceeeCCCCEEEEEC
Confidence 466888888765 56889999999999985
No 127
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=58.44 E-value=19 Score=36.27 Aligned_cols=54 Identities=24% Similarity=0.181 Sum_probs=48.2
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEE
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFM 356 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi 356 (414)
|.-+.|-.=..+.|-+|+|++++.+.+-+=|+|+.++..+|.|-|++-|-+.|-
T Consensus 254 l~gh~rWvWdc~FS~dg~YlvTassd~~~rlW~~~~~k~v~qy~gh~K~~vc~~ 307 (311)
T KOG0315|consen 254 LTGHQRWVWDCAFSADGEYLVTASSDHTARLWDLSAGKEVRQYQGHHKAAVCVA 307 (311)
T ss_pred eecCCceEEeeeeccCccEEEecCCCCceeecccccCceeeecCCcccccEEEE
Confidence 555668888889999999999999999999999999999999999999877664
No 128
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=57.94 E-value=26 Score=36.98 Aligned_cols=76 Identities=14% Similarity=0.100 Sum_probs=59.9
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcc---eeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQAL---VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI 384 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~---~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI 384 (414)
....+|..+|..+|+|+.-+.--+-|+|-.++ .+..-+-||-. |++...-.+ . --|+.+
T Consensus 301 ksl~~i~~~~~~~Ll~~gssdr~irl~DPR~~~gs~v~~s~~gH~n----wVssvkwsp----------~----~~~~~~ 362 (423)
T KOG0313|consen 301 KSLNCISYSPLSKLLASGSSDRHIRLWDPRTGDGSVVSQSLIGHKN----WVSSVKWSP----------T----NEFQLV 362 (423)
T ss_pred cceeEeecccccceeeecCCCCceeecCCCCCCCceeEEeeecchh----hhhheecCC----------C----CceEEE
Confidence 34567899999999999999999999998765 45568889988 998653211 1 117889
Q ss_pred EcCCCCeEEEeeCCCCc
Q 015021 385 HAPRKGIIEHLAGMADE 401 (414)
Q Consensus 385 yaprRg~lEvW~~~~g~ 401 (414)
.+..+|.+.+||+|.-.
T Consensus 363 S~S~D~t~klWDvRS~k 379 (423)
T KOG0313|consen 363 SGSYDNTVKLWDVRSTK 379 (423)
T ss_pred EEecCCeEEEEEeccCC
Confidence 99999999999999755
No 129
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=57.74 E-value=82 Score=31.10 Aligned_cols=48 Identities=31% Similarity=0.369 Sum_probs=34.0
Q ss_pred cccCCCceeeEEEECCC-CCEEEEEcCCCcEEEEEcCcceeEeee--ccccc
Q 015021 302 CLKDHPRKGERLTLSPS-GSLAAITDSLGRILLLDTQALVVVRLW--KGYRD 350 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~-~~laa~tD~lGRV~L~D~~~~~vvrmw--KGyRd 350 (414)
.|++..++...|+.+|+ ++|+|++|.-+.|+-+|+. |.++|-+ .|..|
T Consensus 16 ~l~g~~~e~SGLTy~pd~~tLfaV~d~~~~i~els~~-G~vlr~i~l~g~~D 66 (248)
T PF06977_consen 16 PLPGILDELSGLTYNPDTGTLFAVQDEPGEIYELSLD-GKVLRRIPLDGFGD 66 (248)
T ss_dssp E-TT--S-EEEEEEETTTTEEEEEETTTTEEEEEETT---EEEEEE-SS-SS
T ss_pred ECCCccCCccccEEcCCCCeEEEEECCCCEEEEEcCC-CCEEEEEeCCCCCC
Confidence 36677778999999997 5899999999999999985 7788755 45444
No 130
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=57.40 E-value=32 Score=37.32 Aligned_cols=86 Identities=14% Similarity=0.039 Sum_probs=59.2
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEE
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCL 382 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~L 382 (414)
+.-++.+...+..+|++++.|...++.+|.++|...-..+.-..+++.|==+....+=.. . .|
T Consensus 297 ~~~H~qeVCgLkws~d~~~lASGgnDN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q~----------------~-lL 359 (484)
T KOG0305|consen 297 LQGHRQEVCGLKWSPDGNQLASGGNDNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQS----------------G-LL 359 (484)
T ss_pred hhcccceeeeeEECCCCCeeccCCCccceEeccCCCccccEEEeccceeeeEeeeCCCcc----------------C-ce
Confidence 666778888999999999999999999999999966666665666776643332221000 0 12
Q ss_pred EEEc-CCCCeEEEeeCCCCcceee
Q 015021 383 AIHA-PRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 383 vIya-prRg~lEvW~~~~g~~v~~ 405 (414)
++=. -.++.|.+|++.+|.++.+
T Consensus 360 AsGGGs~D~~i~fwn~~~g~~i~~ 383 (484)
T KOG0305|consen 360 ATGGGSADRCIKFWNTNTGARIDS 383 (484)
T ss_pred EEcCCCcccEEEEEEcCCCcEecc
Confidence 2211 2367888999988888743
No 131
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=57.34 E-value=49 Score=37.88 Aligned_cols=82 Identities=15% Similarity=0.193 Sum_probs=60.9
Q ss_pred eEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCC
Q 015021 311 ERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKG 390 (414)
Q Consensus 311 ~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg 390 (414)
-.+.++|+-++.++.=-+-.|=++|+.+|.-++.+||-||-+=.-|-+..+ +-+.||+-.-. +.
T Consensus 600 YDm~Vdp~~k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~lD---------------PSgiY~atScs-dk 663 (1080)
T KOG1408|consen 600 YDMAVDPTSKLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVILD---------------PSGIYLATSCS-DK 663 (1080)
T ss_pred EEeeeCCCcceEEEEecccceEEEeccccceeeeecccccCCCceEEEEEC---------------CCccEEEEeec-CC
Confidence 346899999999998777788899999999999999999954444444322 11335554444 56
Q ss_pred eEEEeeCCCCcceeeccc
Q 015021 391 IIEHLAGMADENWTTSLN 408 (414)
Q Consensus 391 ~lEvW~~~~g~~v~~~~~ 408 (414)
.|-++|.-.|+|||+-..
T Consensus 664 tl~~~Df~sgEcvA~m~G 681 (1080)
T KOG1408|consen 664 TLCFVDFVSGECVAQMTG 681 (1080)
T ss_pred ceEEEEeccchhhhhhcC
Confidence 788899999999987543
No 132
>PRK01029 tolB translocation protein TolB; Provisional
Probab=57.16 E-value=60 Score=34.07 Aligned_cols=46 Identities=20% Similarity=0.099 Sum_probs=34.0
Q ss_pred ccCCCceeeEEEECCCCCEEEE-EcCC--CcEEEEEcCcceeEeeeccc
Q 015021 303 LKDHPRKGERLTLSPSGSLAAI-TDSL--GRILLLDTQALVVVRLWKGY 348 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~-tD~l--GRV~L~D~~~~~vvrmwKGy 348 (414)
+.+..+.......||+|+++|. ++.. .+|.++|+.++..-++..+.
T Consensus 322 lt~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~~~ 370 (428)
T PRK01029 322 LTKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQLTTSP 370 (428)
T ss_pred eccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEccCCC
Confidence 3334445567889999986665 4443 47999999999998888774
No 133
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=56.91 E-value=52 Score=34.68 Aligned_cols=69 Identities=9% Similarity=0.016 Sum_probs=54.3
Q ss_pred CCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeC
Q 015021 318 SGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAG 397 (414)
Q Consensus 318 ~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~ 397 (414)
.+++.+....++-|=.+|+.+|..+--.+|+=+ |+.-..=.+ .+| || +..--++.|.||++
T Consensus 303 ~~~~l~s~SrDktIk~wdv~tg~cL~tL~ghdn----wVr~~af~p---------~Gk-----yi-~ScaDDktlrvwdl 363 (406)
T KOG0295|consen 303 GGQVLGSGSRDKTIKIWDVSTGMCLFTLVGHDN----WVRGVAFSP---------GGK-----YI-LSCADDKTLRVWDL 363 (406)
T ss_pred CccEEEeecccceEEEEeccCCeEEEEEecccc----eeeeeEEcC---------CCe-----EE-EEEecCCcEEEEEe
Confidence 457888888999999999999999999999999 996443211 122 43 55667999999999
Q ss_pred CCCcceee
Q 015021 398 MADENWTT 405 (414)
Q Consensus 398 ~~g~~v~~ 405 (414)
++++|.-|
T Consensus 364 ~~~~cmk~ 371 (406)
T KOG0295|consen 364 KNLQCMKT 371 (406)
T ss_pred ccceeeec
Confidence 99998744
No 134
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=56.58 E-value=84 Score=34.49 Aligned_cols=48 Identities=13% Similarity=-0.009 Sum_probs=37.6
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD 350 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd 350 (414)
|..+.-..-.+++.-.+.+....-.+..+-++|+.+|..++...|+-+
T Consensus 245 l~GH~g~V~~l~~~~~~~~lvsgS~D~t~rvWd~~sg~C~~~l~gh~s 292 (537)
T KOG0274|consen 245 LVGHFGGVWGLAFPSGGDKLVSGSTDKTERVWDCSTGECTHSLQGHTS 292 (537)
T ss_pred ccCCCCCceeEEEecCCCEEEEEecCCcEEeEecCCCcEEEEecCCCc
Confidence 555556666777766567777777788999999999999999998766
No 135
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=55.99 E-value=20 Score=36.99 Aligned_cols=39 Identities=18% Similarity=0.376 Sum_probs=29.7
Q ss_pred ceeeEEEECCCCCEEEEEc-CCCcEEEEEcCcceeEeeec
Q 015021 308 RKGERLTLSPSGSLAAITD-SLGRILLLDTQALVVVRLWK 346 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD-~lGRV~L~D~~~~~vvrmwK 346 (414)
.....|++||+|++++++. .-+.|.++|..++.+++.-.
T Consensus 78 ~~~~~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~ 117 (369)
T PF02239_consen 78 GNPRGIAVSPDGKYVYVANYEPGTVSVIDAETLEPVKTIP 117 (369)
T ss_dssp SEEEEEEE--TTTEEEEEEEETTEEEEEETTT--EEEEEE
T ss_pred CCcceEEEcCCCCEEEEEecCCCceeEeccccccceeecc
Confidence 5678899999999998874 67999999999999998653
No 136
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=55.95 E-value=27 Score=35.54 Aligned_cols=49 Identities=14% Similarity=0.118 Sum_probs=36.4
Q ss_pred ceeeEEEECCCCCEEEEE-cCCCc----EEEEEcCcceeEe-eecccccceeeEE
Q 015021 308 RKGERLTLSPSGSLAAIT-DSLGR----ILLLDTQALVVVR-LWKGYRDASCVFM 356 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~t-D~lGR----V~L~D~~~~~vvr-mwKGyRdAq~~wi 356 (414)
.......+||+|+++|.+ |..|. +.++|+.+|..+. ...+.+-..+.|.
T Consensus 124 ~~~~~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl~tg~~l~d~i~~~~~~~~~W~ 178 (414)
T PF02897_consen 124 VSLGGFSVSPDGKRLAYSLSDGGSEWYTLRVFDLETGKFLPDGIENPKFSSVSWS 178 (414)
T ss_dssp EEEEEEEETTTSSEEEEEEEETTSSEEEEEEEETTTTEEEEEEEEEEESEEEEEC
T ss_pred EEeeeeeECCCCCEEEEEecCCCCceEEEEEEECCCCcCcCCcccccccceEEEe
Confidence 455678999999988875 77776 9999999998876 4344444346665
No 137
>PRK03629 tolB translocation protein TolB; Provisional
Probab=55.79 E-value=65 Score=33.62 Aligned_cols=47 Identities=15% Similarity=0.126 Sum_probs=33.5
Q ss_pred eeeEEEECCCCCEEEEEc---CCCcEEEEEcCcceeEeeecccccceeeE
Q 015021 309 KGERLTLSPSGSLAAITD---SLGRILLLDTQALVVVRLWKGYRDASCVF 355 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD---~lGRV~L~D~~~~~vvrmwKGyRdAq~~w 355 (414)
.......||+|++++.+. ....|.++|+.++.+.++-.++.+....|
T Consensus 332 ~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt~~~~~~~p~~ 381 (429)
T PRK03629 332 QNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLTDTFLDETPSI 381 (429)
T ss_pred CccCEEECCCCCEEEEEEccCCCceEEEEECCCCCeEEeCCCCCCCCceE
Confidence 345688999998777642 33569999999998877776665544444
No 138
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=55.74 E-value=12 Score=42.39 Aligned_cols=67 Identities=13% Similarity=0.127 Sum_probs=45.8
Q ss_pred CCEEEEEcCCCcEEEEEcCc---ceeEeeeccc-ccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEE
Q 015021 319 GSLAAITDSLGRILLLDTQA---LVVVRLWKGY-RDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEH 394 (414)
Q Consensus 319 ~~laa~tD~lGRV~L~D~~~---~~vvrmwKGy-RdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEv 394 (414)
.+|+|++.+.|-|.++|+.. -..+.+++-| |.|.+-=+..+ --.++|-+.++|.|++
T Consensus 100 ~NlIAT~s~nG~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~t-------------------ep~iliSGSQDg~vK~ 160 (839)
T KOG0269|consen 100 SNLIATCSTNGVISVWDLNKSIRNKLLTVFNEHERSANKLDFHST-------------------EPNILISGSQDGTVKC 160 (839)
T ss_pred hhhheeecCCCcEEEEecCccccchhhhHhhhhccceeeeeeccC-------------------CccEEEecCCCceEEE
Confidence 47899999999999999998 3334444444 22221111110 1158999999999999
Q ss_pred eeCCCCccee
Q 015021 395 LAGMADENWT 404 (414)
Q Consensus 395 W~~~~g~~v~ 404 (414)
||+|--.-+-
T Consensus 161 ~DlR~~~S~~ 170 (839)
T KOG0269|consen 161 WDLRSKKSKS 170 (839)
T ss_pred Eeeecccccc
Confidence 9999866553
No 139
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=54.94 E-value=17 Score=39.53 Aligned_cols=38 Identities=18% Similarity=0.198 Sum_probs=32.9
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeec
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWK 346 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwK 346 (414)
.++++.+||+|++++++..-+-++++|+.+|.+--+=|
T Consensus 403 ~I~av~vs~dGK~~vvaNdr~el~vididngnv~~idk 440 (668)
T COG4946 403 NIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVRLIDK 440 (668)
T ss_pred ceEEEEEcCCCcEEEEEcCceEEEEEEecCCCeeEecc
Confidence 46789999999999999999999999999998754433
No 140
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=53.67 E-value=16 Score=40.73 Aligned_cols=32 Identities=25% Similarity=0.387 Sum_probs=29.5
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCccee
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVV 341 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~v 341 (414)
+.+|.+||+|+|+|+.+..|-|.++|+.+++-
T Consensus 478 I~~l~~SsdG~yiaa~~t~g~I~v~nl~~~~~ 509 (691)
T KOG2048|consen 478 ISRLVVSSDGNYIAAISTRGQIFVYNLETLES 509 (691)
T ss_pred ceeEEEcCCCCEEEEEeccceEEEEEccccee
Confidence 67899999999999999999999999998864
No 141
>PRK04922 tolB translocation protein TolB; Provisional
Probab=53.39 E-value=69 Score=33.28 Aligned_cols=46 Identities=33% Similarity=0.406 Sum_probs=33.3
Q ss_pred eeEEEECCCCCEEEEEcCC-C--cEEEEEcCcceeEeeecccccceeeE
Q 015021 310 GERLTLSPSGSLAAITDSL-G--RILLLDTQALVVVRLWKGYRDASCVF 355 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~l-G--RV~L~D~~~~~vvrmwKGyRdAq~~w 355 (414)
...++.||+|+++|.+... | +|.++|+.++...++-.|..+....|
T Consensus 338 ~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt~~~~~~~p~~ 386 (433)
T PRK04922 338 NARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTLTPGSLDESPSF 386 (433)
T ss_pred ccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeEECCCCCCCCCceE
Confidence 3468999999888775332 2 69999999998877766655544444
No 142
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=53.24 E-value=22 Score=35.48 Aligned_cols=47 Identities=17% Similarity=0.195 Sum_probs=38.1
Q ss_pred cccCCC---ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccc
Q 015021 302 CLKDHP---RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGY 348 (414)
Q Consensus 302 ~l~D~~---R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGy 348 (414)
.+.|.. --..+++++|+|+|.|..-..---+|+|+..+.+|.-+--+
T Consensus 223 ~~~~~glessavaav~vdpsgrll~sg~~dssc~lydirg~r~iq~f~ph 272 (350)
T KOG0641|consen 223 DFHDGGLESSAVAAVAVDPSGRLLASGHADSSCMLYDIRGGRMIQRFHPH 272 (350)
T ss_pred cccCCCcccceeEEEEECCCcceeeeccCCCceEEEEeeCCceeeeeCCC
Confidence 355555 44678999999999999877788999999999998876544
No 143
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=53.04 E-value=41 Score=39.20 Aligned_cols=86 Identities=9% Similarity=0.029 Sum_probs=71.6
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
+-|.++..-...|..-|++-+++..-.+=.|=+|+..+..++--.+||=| +|.+..= |.-.
T Consensus 45 ~rFdeHdGpVRgv~FH~~qplFVSGGDDykIkVWnYk~rrclftL~GHlD----YVRt~~F---------------Hhey 105 (1202)
T KOG0292|consen 45 DRFDEHDGPVRGVDFHPTQPLFVSGGDDYKIKVWNYKTRRCLFTLLGHLD----YVRTVFF---------------HHEY 105 (1202)
T ss_pred hhhhccCCccceeeecCCCCeEEecCCccEEEEEecccceehhhhccccc----eeEEeec---------------cCCC
Confidence 34777888888999999999999987788899999999999999999999 8876521 1111
Q ss_pred EEEEEcCCCCeEEEeeCCCCcceee
Q 015021 381 CLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
=-+|.|.-+-+|.||++|++++|++
T Consensus 106 PWIlSASDDQTIrIWNwqsr~~iav 130 (1202)
T KOG0292|consen 106 PWILSASDDQTIRIWNWQSRKCIAV 130 (1202)
T ss_pred ceEEEccCCCeEEEEeccCCceEEE
Confidence 2678999999999999999999975
No 144
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=51.85 E-value=84 Score=30.83 Aligned_cols=30 Identities=13% Similarity=0.061 Sum_probs=23.9
Q ss_pred eeEEEECCCCCEEEEEcC-CCcEEEEEcCcc
Q 015021 310 GERLTLSPSGSLAAITDS-LGRILLLDTQAL 339 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~-lGRV~L~D~~~~ 339 (414)
...++++|+|+++.++|. .++|.++|+.+.
T Consensus 128 ~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~ 158 (330)
T PRK11028 128 CHSANIDPDNRTLWVPCLKEDRIRLFTLSDD 158 (330)
T ss_pred ccEeEeCCCCCEEEEeeCCCCEEEEEEECCC
Confidence 456789999988877654 599999999873
No 145
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=51.79 E-value=29 Score=36.66 Aligned_cols=49 Identities=18% Similarity=0.131 Sum_probs=38.2
Q ss_pred CeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEecccCC
Q 015021 77 EYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRD 133 (414)
Q Consensus 77 e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~ 133 (414)
=+||++-+||=.+..-+|-+|-.|+||+|+.+ -|. -||.++.++.+++.
T Consensus 203 vW~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~-----~qR---RPV~~fd~~E~~is 251 (412)
T KOG3881|consen 203 VWITDIRFLEGSPNYKFATITRYHQVRLYDTR-----HQR---RPVAQFDFLENPIS 251 (412)
T ss_pred eeeccceecCCCCCceEEEEecceeEEEecCc-----ccC---cceeEeccccCcce
Confidence 45666666655557899999999999999973 233 79999999998853
No 146
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=51.57 E-value=15 Score=39.17 Aligned_cols=85 Identities=11% Similarity=0.017 Sum_probs=61.0
Q ss_pred cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEE
Q 015021 302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLC 381 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~ 381 (414)
+|.-.--.+++|..+++|+....+-+.+-+-+||+..+....-..||-| ++.+..-+. .+.+
T Consensus 214 tLaGs~g~it~~d~d~~~~~~iAas~d~~~r~Wnvd~~r~~~TLsGHtd------kVt~ak~~~----------~~~~-- 275 (459)
T KOG0288|consen 214 TLAGSLGNITSIDFDSDNKHVIAASNDKNLRLWNVDSLRLRHTLSGHTD------KVTAAKFKL----------SHSR-- 275 (459)
T ss_pred hhhccCCCcceeeecCCCceEEeecCCCceeeeeccchhhhhhhccccc------ceeeehhhc----------cccc--
Confidence 4666667788999999999888888899999999999999999999998 333322110 1111
Q ss_pred EEEEcCCCCeEEEeeCCCCcceee
Q 015021 382 LAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 382 LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
||-+.++..|+.|+||.+.|.-|
T Consensus 276 -vVsgs~DRtiK~WDl~k~~C~kt 298 (459)
T KOG0288|consen 276 -VVSGSADRTIKLWDLQKAYCSKT 298 (459)
T ss_pred -eeeccccchhhhhhhhhhheecc
Confidence 66666666666666666666544
No 147
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=51.09 E-value=52 Score=35.21 Aligned_cols=84 Identities=14% Similarity=0.120 Sum_probs=63.8
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEE
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCL 382 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~L 382 (414)
|.-+-+++.++.-||+|...|+.-+++-+=++|+....-+..--+|++ =+ ++..++|.. . ++
T Consensus 341 L~gH~k~I~~V~fsPNGy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~n----lV----------S~Vk~~p~~---g-~f 402 (459)
T KOG0272|consen 341 LAGHIKEILSVAFSPNGYHLATGSSDNTCKVWDLRMRSELYTIPAHSN----LV----------SQVKYSPQE---G-YF 402 (459)
T ss_pred ecccccceeeEeECCCceEEeecCCCCcEEEeeecccccceecccccc----hh----------hheEecccC---C-eE
Confidence 555678899999999999999988888888999988888777777776 11 122333311 1 45
Q ss_pred EEEcCCCCeEEEeeCCCCccee
Q 015021 383 AIHAPRKGIIEHLAGMADENWT 404 (414)
Q Consensus 383 vIyaprRg~lEvW~~~~g~~v~ 404 (414)
..-+..++.+.||+.+++.++-
T Consensus 403 L~TasyD~t~kiWs~~~~~~~k 424 (459)
T KOG0272|consen 403 LVTASYDNTVKIWSTRTWSPLK 424 (459)
T ss_pred EEEcccCcceeeecCCCcccch
Confidence 6788999999999999988763
No 148
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=49.61 E-value=1.1e+02 Score=30.02 Aligned_cols=28 Identities=4% Similarity=0.016 Sum_probs=22.8
Q ss_pred eEEEECCCCCEEEEE-cCCCcEEEEEcCc
Q 015021 311 ERLTLSPSGSLAAIT-DSLGRILLLDTQA 338 (414)
Q Consensus 311 ~~i~~sP~~~laa~t-D~lGRV~L~D~~~ 338 (414)
..|+++|+|+++.++ ...|+|.++|+.+
T Consensus 83 ~~i~~~~~g~~l~v~~~~~~~v~v~~~~~ 111 (330)
T PRK11028 83 THISTDHQGRFLFSASYNANCVSVSPLDK 111 (330)
T ss_pred eEEEECCCCCEEEEEEcCCCeEEEEEECC
Confidence 479999999866664 5689999999974
No 149
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=49.34 E-value=29 Score=36.17 Aligned_cols=67 Identities=19% Similarity=0.143 Sum_probs=40.9
Q ss_pred CCeeeeccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe
Q 015021 32 DPNLLCALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD 106 (414)
Q Consensus 32 ~~~~~~sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt 106 (414)
+|-++|||.|+.||++.++-+=+.+ .+.++. -+++..+-+++||+|.|-|- +-..+-+| +-|||++.
T Consensus 334 p~RL~lsP~g~~lA~s~gs~l~~~~---se~g~~--~~~~e~~h~~~Is~is~~~~-g~~~atcG--dr~vrv~~ 400 (420)
T KOG2096|consen 334 PVRLELSPSGDSLAVSFGSDLKVFA---SEDGKD--YPELEDIHSTTISSISYSSD-GKYIATCG--DRYVRVIR 400 (420)
T ss_pred ceEEEeCCCCcEEEeecCCceEEEE---cccCcc--chhHHHhhcCceeeEEecCC-CcEEeeec--ceeeeeec
Confidence 4789999999999999999764432 221111 12333344678888877552 22344444 45666654
No 150
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=49.21 E-value=49 Score=34.58 Aligned_cols=97 Identities=18% Similarity=0.115 Sum_probs=61.9
Q ss_pred CCCccccCCCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcc-eeEeeecccccceeeEEEEEecccccccccccCCCC
Q 015021 298 SPLTCLKDHPRKGERLTLSPSG-SLAAITDSLGRILLLDTQAL-VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVK 375 (414)
Q Consensus 298 ~pl~~l~D~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~-~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k 375 (414)
.|+..|-..+..++.++-+|+| +|...+--.-.|+.+|+..- .+|--..++++-.=+=|...-+. +
T Consensus 241 ~pl~llggh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~~~~pv~~L~rhv~~TNQRI~FDld~------------~ 308 (406)
T KOG2919|consen 241 RPLQLLGGHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRYSRDPVYALERHVGDTNQRILFDLDP------------K 308 (406)
T ss_pred CceeeecccCCCeeeEEeccCcCeecccccCCCeEEEEeehhccchhhhhhhhccCccceEEEecCC------------C
Confidence 4565566777788999999999 67777888889999999753 33333333333212222221111 1
Q ss_pred CcceEEEEEEcCCCCeEEEeeCCC-Ccceeecccee
Q 015021 376 SDYCLCLAIHAPRKGIIEHLAGMA-DENWTTSLNYS 410 (414)
Q Consensus 376 ~~~~l~LvIyaprRg~lEvW~~~~-g~~v~~~~~~~ 410 (414)
.. +| .-.-.||.|.||+++. |+-+-.-.|||
T Consensus 309 ~~---~L-asG~tdG~V~vwdlk~~gn~~sv~~~~s 340 (406)
T KOG2919|consen 309 GE---IL-ASGDTDGSVRVWDLKDLGNEVSVTGNYS 340 (406)
T ss_pred Cc---ee-eccCCCccEEEEecCCCCCccccccccc
Confidence 11 22 3345799999999999 88776666665
No 151
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=48.97 E-value=51 Score=24.35 Aligned_cols=29 Identities=28% Similarity=0.421 Sum_probs=23.8
Q ss_pred eEEEECCCC---CEEEEEcCCCcEEEEEcCcc
Q 015021 311 ERLTLSPSG---SLAAITDSLGRILLLDTQAL 339 (414)
Q Consensus 311 ~~i~~sP~~---~laa~tD~lGRV~L~D~~~~ 339 (414)
.++.-||.. .|.|.+.--|||-++|+.++
T Consensus 4 R~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~~ 35 (43)
T PF10313_consen 4 RCCKFSPEPGGNDLLAWAEHQGRVHIVDTRSN 35 (43)
T ss_pred EEEEeCCCCCcccEEEEEccCCeEEEEEcccC
Confidence 456677754 59999999999999999864
No 152
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=48.59 E-value=71 Score=29.14 Aligned_cols=73 Identities=19% Similarity=0.292 Sum_probs=50.0
Q ss_pred CCeEEEEEeEEe---CCcEEEEEeccccEEEEEe--cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCe
Q 015021 76 SEYITAIEWLVF---EEMRALAVGTSRGYFLVYD--LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGV 150 (414)
Q Consensus 76 ~e~ITs~~~lpl---~dw~~I~VG~ssG~vrfyt--e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~ 150 (414)
++.|||+..=|| .+.-+++||+.+. |..|+ ++-++.++++ ++-|..|-+-+... ..+.|.| -+.
T Consensus 47 n~~italaaG~l~~~~~~D~LliGt~t~-llaYDV~~N~d~Fyke~--~DGvn~i~~g~~~~------~~~~l~i--vGG 115 (136)
T PF14781_consen 47 NQEITALAAGRLKPDDGRDCLLIGTQTS-LLAYDVENNSDLFYKEV--PDGVNAIVIGKLGD------IPSPLVI--VGG 115 (136)
T ss_pred CCceEEEEEEecCCCCCcCEEEEeccce-EEEEEcccCchhhhhhC--ccceeEEEEEecCC------CCCcEEE--ECc
Confidence 688999999999 3557999999885 66787 4777777766 47777777664432 1333444 455
Q ss_pred EEEEeChhH
Q 015021 151 LARFDGSEI 159 (414)
Q Consensus 151 iv~IdG~sL 159 (414)
=|.|-|++-
T Consensus 116 ncsi~Gfd~ 124 (136)
T PF14781_consen 116 NCSIQGFDY 124 (136)
T ss_pred eEEEEEeCC
Confidence 566666653
No 153
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=48.59 E-value=29 Score=36.11 Aligned_cols=103 Identities=19% Similarity=0.270 Sum_probs=75.1
Q ss_pred CC-ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccccee----eEEEEEe----ccccc----
Q 015021 299 PL-TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASC----VFMEMLV----NKDAA---- 365 (414)
Q Consensus 299 pl-~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~----~wi~~~~----~~~~~---- 365 (414)
|+ +.|+|+--+...+..-|..+..+...+++-|=|+|...-.+=|-+|=+-|++- .|...-+ .++..
T Consensus 163 PvIRTlYDH~devn~l~FHPre~ILiS~srD~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHPsGefllvgTdHp~~rl 242 (430)
T KOG0640|consen 163 PVIRTLYDHVDEVNDLDFHPRETILISGSRDNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHPSGEFLLVGTDHPTLRL 242 (430)
T ss_pred ceEeehhhccCcccceeecchhheEEeccCCCeEEEEecccHHHHHHHHHhhccceeeeEeecCCCceEEEecCCCceeE
Confidence 44 67999999999999999999999999999999999999999999988888652 2221100 00100
Q ss_pred -------------ccc--------cccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcceeec
Q 015021 366 -------------TSS--------AYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWTTS 406 (414)
Q Consensus 366 -------------~~~--------~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~~~ 406 (414)
.+. .++.+.- -|.|-+.++|.|++|+--.++||.|-
T Consensus 243 Ydv~T~QcfvsanPd~qht~ai~~V~Ys~t~-----~lYvTaSkDG~IklwDGVS~rCv~t~ 299 (430)
T KOG0640|consen 243 YDVNTYQCFVSANPDDQHTGAITQVRYSSTG-----SLYVTASKDGAIKLWDGVSNRCVRTI 299 (430)
T ss_pred EeccceeEeeecCcccccccceeEEEecCCc-----cEEEEeccCCcEEeeccccHHHHHHH
Confidence 000 1122111 26688999999999999999999763
No 154
>PRK02889 tolB translocation protein TolB; Provisional
Probab=48.04 E-value=72 Score=33.15 Aligned_cols=40 Identities=15% Similarity=0.027 Sum_probs=29.5
Q ss_pred CCCceeeEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEee
Q 015021 305 DHPRKGERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRL 344 (414)
Q Consensus 305 D~~R~~~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrm 344 (414)
..+....+...||+|+.+|. ++..| .|.++|+.+|...++
T Consensus 193 ~~~~~v~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~~l 235 (427)
T PRK02889 193 SSPEPIISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVV 235 (427)
T ss_pred cCCCCcccceEcCCCCEEEEEEccCCCcEEEEEECCCCCEEEe
Confidence 34455668899999976665 44444 599999999987776
No 155
>PF01403 Sema: Sema domain; InterPro: IPR001627 The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in a hepatocyte growth factor receptor, in SEX protein [] and in viral proteins. CD100 (also called SEMA4D) is associated with PTPase and serine kinase activity. CD100 increases PMA, CD3 and CD2 induced T cell proliferation, increases CD45 induced T cell adhesion, induces B cell homotypic adhesion and down-regulates B cell expression of CD23. The Sema domain is characterised by a conserved set of cysteine residues, which form four disulphide bonds to stabilise the structure. The Sema domain fold is a variation of the beta propeller topology, with seven blades radially arranged around a central axis. Each blade contains a four- stranded (strands A to D) antiparallel beta sheet. The inner strand of each blade (A) lines the channel at the centre of the propeller, with strands B and C of the same repeat radiating outward, and strand D of the next repeat forming the outer edge of the blade. The large size of the Sema domain is not due to a single inserted domain but results from the presence of additional secondary structure elements inserted in most of the blades. The Sema domain uses a 'loop and hook' system to close the circle between the first and the last blades. The blades are constructed sequentially with an N-terminal beta- strand closing the circle by providing the outermost strand (D) of the seventh (C-terminal) blade. The beta-propeller is further stabilised by an extension of the N terminus, providing an additional, fifth beta-strand on the outer edge of blade 6 [, , ]. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0005515 protein binding; PDB: 3NVX_A 3NVQ_A 3OL2_A 1OLZ_B 3OKT_A 3AL9_B 3OKY_A 3AL8_B 3NVN_A 3OKW_A ....
Probab=47.20 E-value=81 Score=32.95 Aligned_cols=52 Identities=15% Similarity=0.101 Sum_probs=36.2
Q ss_pred eEEEEEeE--EeCC--cEEEEEeccccEEE-EEe--cCC-------cEEeecccCcc-ceeEEEEec
Q 015021 78 YITAIEWL--VFEE--MRALAVGTSRGYFL-VYD--LKG-------DLVHRQLIHPG-RILKLRVRG 129 (414)
Q Consensus 78 ~ITs~~~l--pl~d--w~~I~VG~ssG~vr-fyt--e~G-------~LL~sQ~lh~~-pV~~ik~r~ 129 (414)
+.|+|..- ...| .+++-+||++|.|. ..- .++ .+.-.|.+++. ||+.+++..
T Consensus 365 ~~T~i~v~~v~~~~~~~tV~flGT~~G~l~K~v~~~~~~~~~~~~~~iee~~~~~~~~pI~~~~l~~ 431 (433)
T PF01403_consen 365 RLTSIAVDRVQVENGSYTVAFLGTDDGRLHKKVVLSNSSSGHYESYIIEEIQVFPDSEPIQSMKLSP 431 (433)
T ss_dssp -EEEEEEEEEEETTTCEEEEEEEETTSEEEEEEEESSSSTCT-EEEEEEEEE-STSC-EEEEEEEET
T ss_pred eeeEEEEEEEecCCCcEEEEEEecCCceEEEEEEecCCCCcccccEEEEEEeecCCCCceEEEEecc
Confidence 78999876 6666 89999999999999 332 222 22334577764 999998754
No 156
>PRK04792 tolB translocation protein TolB; Provisional
Probab=47.07 E-value=1.2e+02 Score=31.94 Aligned_cols=47 Identities=17% Similarity=0.257 Sum_probs=33.1
Q ss_pred eeeEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEee-ecccccceeeE
Q 015021 309 KGERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRL-WKGYRDASCVF 355 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrm-wKGyRdAq~~w 355 (414)
.....+.+|+|++++. +|..| .|+++|+.++.+.|+ +.|..+....|
T Consensus 307 ~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g~~~~~~~~ 357 (448)
T PRK04792 307 IDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTFEGEQNLGGSI 357 (448)
T ss_pred CccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEecCCCCCcCeeE
Confidence 3456788999976655 56555 588999999998887 56665543333
No 157
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=46.98 E-value=2.3e+02 Score=29.67 Aligned_cols=72 Identities=22% Similarity=0.226 Sum_probs=45.0
Q ss_pred ccCCCeeeeccCcceeeeeecc-eEEEE--------e-ecCCCC--CceeecccCCCCCCCeEEEEEeEEeCCcEEEEEe
Q 015021 29 LVNDPNLLCALDMHTIALANRY-QTVII--------N-WADPEG--LVAKIRPELSPIASEYITAIEWLVFEEMRALAVG 96 (414)
Q Consensus 29 L~~~~~~~~sp~~~~la~A~~~-~~v~~--------~-w~~~~~--~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG 96 (414)
|.+ ..++++|.|-.||+.+.. |++-. . ...+++ ..+.|.+ ++|-.+.|.. ---++|=
T Consensus 29 l~~-~~va~a~~gGpIAi~~d~~k~~~~~~~~p~~I~iys~sG~ll~~i~w~~-------~~iv~~~wt~---~e~LvvV 97 (410)
T PF04841_consen 29 LSD-YIVAVAPYGGPIAIIRDESKLVPVGSAKPNSIQIYSSSGKLLSSIPWDS-------GRIVGMGWTD---DEELVVV 97 (410)
T ss_pred ccc-eeEEEcCCCceEEEEecCcccccccCCCCcEEEEECCCCCEeEEEEECC-------CCEEEEEECC---CCeEEEE
Confidence 445 889999999999999888 44432 0 111110 1133322 3455555533 2234455
Q ss_pred ccccEEEEEecCCcE
Q 015021 97 TSRGYFLVYDLKGDL 111 (414)
Q Consensus 97 ~ssG~vrfyte~G~L 111 (414)
+++|.+++|+..|+.
T Consensus 98 ~~dG~v~vy~~~G~~ 112 (410)
T PF04841_consen 98 QSDGTVRVYDLFGEF 112 (410)
T ss_pred EcCCEEEEEeCCCce
Confidence 799999999999998
No 158
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=46.65 E-value=1e+02 Score=31.07 Aligned_cols=43 Identities=14% Similarity=0.030 Sum_probs=31.5
Q ss_pred cccCCCceeeEEEECCCCCEEEEEcC---CCcEEEEEcCcceeEee
Q 015021 302 CLKDHPRKGERLTLSPSGSLAAITDS---LGRILLLDTQALVVVRL 344 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~laa~tD~---lGRV~L~D~~~~~vvrm 344 (414)
.|.+..+...+...||+|+++|.+.. ..+|.++|+.++...++
T Consensus 184 ~l~~~~~~~~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~~~ 229 (417)
T TIGR02800 184 TITRSREPILSPAWSPDGQKLAYVSFESGKPEIYVQDLATGQREKV 229 (417)
T ss_pred EeecCCCceecccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEEEe
Confidence 35566666778889999987777533 24799999999866554
No 159
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=46.35 E-value=39 Score=34.33 Aligned_cols=83 Identities=10% Similarity=-0.067 Sum_probs=57.4
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc----eeeEEEEEecccccccccccCCCCCcc
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA----SCVFMEMLVNKDAATSSAYYAPVKSDY 378 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA----q~~wi~~~~~~~~~~~~~~~~~~k~~~ 378 (414)
+....-+..+|+-.-+|+..|.....+.|.++++..+..+.-|-+++.- |..|-+. +.
T Consensus 16 ~~~~~~~v~Sv~wn~~g~~lasgs~dktv~v~n~e~~r~~~~~~~~gh~~svdql~w~~~------------------~~ 77 (313)
T KOG1407|consen 16 LQGHVQKVHSVAWNCDGTKLASGSFDKTVSVWNLERDRFRKELVYRGHTDSVDQLCWDPK------------------HP 77 (313)
T ss_pred hhhhhhcceEEEEcccCceeeecccCCceEEEEecchhhhhhhcccCCCcchhhheeCCC------------------CC
Confidence 4455667889999999999999888888888888888555555444432 4444431 11
Q ss_pred eEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 379 CLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 379 ~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
-|| .-|.-.-.+.+|+.+.|++++.
T Consensus 78 d~~--atas~dk~ir~wd~r~~k~~~~ 102 (313)
T KOG1407|consen 78 DLF--ATASGDKTIRIWDIRSGKCTAR 102 (313)
T ss_pred cce--EEecCCceEEEEEeccCcEEEE
Confidence 223 3444567899999999999865
No 160
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=46.28 E-value=65 Score=33.74 Aligned_cols=57 Identities=25% Similarity=0.403 Sum_probs=39.4
Q ss_pred CCcEEEEEecccc-EEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEE
Q 015021 88 EEMRALAVGTSRG-YFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARF 154 (414)
Q Consensus 88 ~dw~~I~VG~ssG-~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~I 154 (414)
.+..++.+|...- +|++||-+|.+|-+-.+..++|+.+- +-. .|+|.||..++.+++
T Consensus 48 d~~k~~~~~~~~p~~I~iys~sG~ll~~i~w~~~~iv~~~---wt~-------~e~LvvV~~dG~v~v 105 (410)
T PF04841_consen 48 DESKLVPVGSAKPNSIQIYSSSGKLLSSIPWDSGRIVGMG---WTD-------DEELVVVQSDGTVRV 105 (410)
T ss_pred cCcccccccCCCCcEEEEECCCCCEeEEEEECCCCEEEEE---ECC-------CCeEEEEEcCCEEEE
Confidence 4444555555544 69999999999877555555555554 322 788999999987655
No 161
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=45.28 E-value=2.2e+02 Score=32.74 Aligned_cols=49 Identities=14% Similarity=0.077 Sum_probs=37.6
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEe
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLV 360 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~ 360 (414)
+..|.++-.-+..|+.|.-||..+.|+.++.++-.=-|.-. +.|=...|
T Consensus 459 vrClDINA~R~kLAvVDD~~~c~v~DI~t~elL~QEpg~nS--V~wNT~~E 507 (1081)
T KOG1538|consen 459 VRCLDINASRKKLAVVDDNDTCLVYDIDTKELLFQEPGANS--VAWNTQCE 507 (1081)
T ss_pred eEEeeccCCcceEEEEccCCeEEEEEccCCceEeecCCCce--EEeecccc
Confidence 45667777778889999999999999999999987666544 55654443
No 162
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=43.83 E-value=84 Score=35.61 Aligned_cols=77 Identities=13% Similarity=0.059 Sum_probs=54.0
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
..|.-+.-....+++=|++.++ .++|+|-|.++|+ +|.+++-.-|++. ++-...... ..
T Consensus 173 ~tf~gHtD~VRgL~vl~~~~fl-ScsNDg~Ir~w~~-~ge~l~~~~ghtn----~vYsis~~~----------~~----- 231 (745)
T KOG0301|consen 173 KTFSGHTDCVRGLAVLDDSHFL-SCSNDGSIRLWDL-DGEVLLEMHGHTN----FVYSISMAL----------SD----- 231 (745)
T ss_pred hhhccchhheeeeEEecCCCeE-eecCCceEEEEec-cCceeeeeeccce----EEEEEEecC----------CC-----
Confidence 4456666677788888887654 5689999999999 6666666679998 664433111 00
Q ss_pred EEEEEcCCCCeEEEeeCC
Q 015021 381 CLAIHAPRKGIIEHLAGM 398 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~ 398 (414)
-+++-..-++.|+||...
T Consensus 232 ~~Ivs~gEDrtlriW~~~ 249 (745)
T KOG0301|consen 232 GLIVSTGEDRTLRIWKKD 249 (745)
T ss_pred CeEEEecCCceEEEeecC
Confidence 267778888888888765
No 163
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=43.23 E-value=1.5e+02 Score=30.76 Aligned_cols=47 Identities=19% Similarity=0.094 Sum_probs=36.0
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc---eeeEEEE
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA---SCVFMEM 358 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA---q~~wi~~ 358 (414)
...++-+-+|+.+.+.+..|-+-|+|++++++..+= --|| .|.||+.
T Consensus 75 vL~v~WsddgskVf~g~~Dk~~k~wDL~S~Q~~~v~--~Hd~pvkt~~wv~~ 124 (347)
T KOG0647|consen 75 VLDVCWSDDGSKVFSGGCDKQAKLWDLASGQVSQVA--AHDAPVKTCHWVPG 124 (347)
T ss_pred eEEEEEccCCceEEeeccCCceEEEEccCCCeeeee--ecccceeEEEEecC
Confidence 345777888999999999999999999999887652 2222 4677754
No 164
>PF13464 DUF4115: Domain of unknown function (DUF4115)
Probab=42.73 E-value=74 Score=25.23 Aligned_cols=55 Identities=13% Similarity=0.264 Sum_probs=40.2
Q ss_pred eccccEEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeE-EEEeChhHH
Q 015021 96 GTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVL-ARFDGSEIQ 160 (414)
Q Consensus 96 G~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~i-v~IdG~sL~ 160 (414)
....-+|++.+.+|..+++.++++.-...+.... .=+|.+=.++++ +.+||.++-
T Consensus 5 a~~~sWv~V~d~dG~~~~~~~l~~G~~~~~~~~~----------~~~i~iGna~~v~v~~nG~~~~ 60 (77)
T PF13464_consen 5 ATGDSWVEVTDADGKVLFSGTLKAGETKTFEGKE----------PFRIRIGNAGAVEVTVNGKPVD 60 (77)
T ss_pred EeCCeEEEEEeCCCcEeeeeeeCCCcEEEEeCCC----------CEEEEEeCCCcEEEEECCEECC
Confidence 3456789999999999999999988887774221 222555666665 888888773
No 165
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=41.71 E-value=37 Score=36.26 Aligned_cols=92 Identities=14% Similarity=0.141 Sum_probs=63.5
Q ss_pred cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEE
Q 015021 302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLC 381 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~ 381 (414)
++..++--+..+..|++|+|+...|..|-|=.++.+ ...|.|.+++-++++-=+-... .| .| |
T Consensus 133 ilQaHDs~Vr~m~ws~~g~wmiSgD~gG~iKyWqpn-mnnVk~~~ahh~eaIRdlafSp-nD----------sk-----F 195 (464)
T KOG0284|consen 133 ILQAHDSPVRTMKWSHNGTWMISGDKGGMIKYWQPN-MNNVKIIQAHHAEAIRDLAFSP-ND----------SK-----F 195 (464)
T ss_pred HhhhhcccceeEEEccCCCEEEEcCCCceEEecccc-hhhhHHhhHhhhhhhheeccCC-CC----------ce-----e
Confidence 456666677889999999999999999999999975 5667788777765543332211 11 11 2
Q ss_pred EEEEcCCCCeEEEeeCCCCcc--eeeccceeee
Q 015021 382 LAIHAPRKGIIEHLAGMADEN--WTTSLNYSVR 412 (414)
Q Consensus 382 LvIyaprRg~lEvW~~~~g~~--v~~~~~~~~~ 412 (414)
.-+.-+|+|+||+....+. +-+.-+|-||
T Consensus 196 --~t~SdDg~ikiWdf~~~kee~vL~GHgwdVk 226 (464)
T KOG0284|consen 196 --LTCSDDGTIKIWDFRMPKEERVLRGHGWDVK 226 (464)
T ss_pred --EEecCCCeEEEEeccCCchhheeccCCCCcc
Confidence 4567799999999887543 4444445444
No 166
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=41.65 E-value=1.1e+02 Score=33.08 Aligned_cols=78 Identities=18% Similarity=0.206 Sum_probs=57.9
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCC
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPR 388 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyapr 388 (414)
...++++-|||-+++..-..|-|=+||+..+..+-=+-|+- +.+.=|..-|+ .| ||+. +--
T Consensus 349 ~~ts~~fHpDgLifgtgt~d~~vkiwdlks~~~~a~Fpght-~~vk~i~FsEN---------------GY--~Lat-~ad 409 (506)
T KOG0289|consen 349 EYTSAAFHPDGLIFGTGTPDGVVKIWDLKSQTNVAKFPGHT-GPVKAISFSEN---------------GY--WLAT-AAD 409 (506)
T ss_pred eeEEeeEcCCceEEeccCCCceEEEEEcCCccccccCCCCC-CceeEEEeccC---------------ce--EEEE-Eec
Confidence 46778899999999999999999999999999888888843 34444443322 12 5555 445
Q ss_pred CCeEEEeeCCCCcceee
Q 015021 389 KGIIEHLAGMADENWTT 405 (414)
Q Consensus 389 Rg~lEvW~~~~g~~v~~ 405 (414)
+|.|.+||+|-.+..-|
T Consensus 410 d~~V~lwDLRKl~n~kt 426 (506)
T KOG0289|consen 410 DGSVKLWDLRKLKNFKT 426 (506)
T ss_pred CCeEEEEEehhhcccce
Confidence 88899999998776544
No 167
>PRK00178 tolB translocation protein TolB; Provisional
Probab=41.48 E-value=1.7e+02 Score=29.99 Aligned_cols=38 Identities=24% Similarity=0.269 Sum_probs=28.5
Q ss_pred eeEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEeeecc
Q 015021 310 GERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRLWKG 347 (414)
Q Consensus 310 ~~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrmwKG 347 (414)
......||+|+.++. .+..| +|+++|+.++.+.++.++
T Consensus 245 ~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~ 285 (430)
T PRK00178 245 NGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQLSRVTNH 285 (430)
T ss_pred cCCeEECCCCCEEEEEEccCCCceEEEEECCCCCeEEcccC
Confidence 345788999976664 55555 799999999998887653
No 168
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=41.44 E-value=1.6e+02 Score=32.00 Aligned_cols=86 Identities=12% Similarity=0.033 Sum_probs=62.8
Q ss_pred CccccCCCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021 300 LTCLKDHPRKGERLTLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY 378 (414)
Q Consensus 300 l~~l~D~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~ 378 (414)
|+.++-+.-....+--+|++ +.++.+|..+-+-++|+.+..+.--.-|+-| .+.+-.-.+ .
T Consensus 103 LR~~~ah~apv~~~~f~~~d~t~l~s~sDd~v~k~~d~s~a~v~~~l~~htD----YVR~g~~~~----------~---- 164 (487)
T KOG0310|consen 103 LRQLYAHQAPVHVTKFSPQDNTMLVSGSDDKVVKYWDLSTAYVQAELSGHTD----YVRCGDISP----------A---- 164 (487)
T ss_pred HHHHhhccCceeEEEecccCCeEEEecCCCceEEEEEcCCcEEEEEecCCcc----eeEeecccc----------C----
Confidence 34566677777888899988 5666678888888899999987656778888 555432211 1
Q ss_pred eEEEEEEcCCCCeEEEeeCCCC-cce
Q 015021 379 CLCLAIHAPRKGIIEHLAGMAD-ENW 403 (414)
Q Consensus 379 ~l~LvIyaprRg~lEvW~~~~g-~~v 403 (414)
+=.+|.-...+|.|.+|++|.- +.+
T Consensus 165 ~~hivvtGsYDg~vrl~DtR~~~~~v 190 (487)
T KOG0310|consen 165 NDHIVVTGSYDGKVRLWDTRSLTSRV 190 (487)
T ss_pred CCeEEEecCCCceEEEEEeccCCcee
Confidence 1148899999999999999986 555
No 169
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=41.35 E-value=3.5e+02 Score=31.83 Aligned_cols=165 Identities=16% Similarity=0.148 Sum_probs=0.0
Q ss_pred CcccceeeeeeecccccccCC-CCCcccCCCeeeeccCcceeeeeec----ceEEEEeecCCCCCceeecccCCCCCCCe
Q 015021 4 RTHTTEVGSIACTDLSDLGAG-KEGWLVNDPNLLCALDMHTIALANR----YQTVIINWADPEGLVAKIRPELSPIASEY 78 (414)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~g~~-~~~WL~~~~~~~~sp~~~~la~A~~----~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ 78 (414)
|.+.+| |-+-+..+.--|+. .=.|= |.|.+||-+++ ..+||-.-+.-.++..+.+- ++ +.+.
T Consensus 240 RVy~Re-G~L~stSE~v~gLe~~l~Wr---------PsG~lIA~~q~~~~~~~VvFfErNGLrhgeF~l~~--~~-~~~~ 306 (928)
T PF04762_consen 240 RVYSRE-GELQSTSEPVDGLEGALSWR---------PSGNLIASSQRLPDRHDVVFFERNGLRHGEFTLRF--DP-EEEK 306 (928)
T ss_pred EEECCC-ceEEeccccCCCccCCccCC---------CCCCEEEEEEEcCCCcEEEEEecCCcEeeeEecCC--CC-CCce
Q ss_pred EEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeC---------
Q 015021 79 ITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMP--------- 148 (414)
Q Consensus 79 ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~--------- 148 (414)
|..+.|= .|.+++||-+.+. |++|| -|-.-=++|-++...-..+..=.+-+ +..-.|.|+..
T Consensus 307 v~~l~Wn--~ds~iLAv~~~~~-vqLWt~~NYHWYLKqei~~~~~~~~~~~~Wdp-----e~p~~L~v~t~~g~~~~~~~ 378 (928)
T PF04762_consen 307 VIELAWN--SDSEILAVWLEDR-VQLWTRSNYHWYLKQEIRFSSSESVNFVKWDP-----EKPLRLHVLTSNGQYEIYDF 378 (928)
T ss_pred eeEEEEC--CCCCEEEEEecCC-ceEEEeeCCEEEEEEEEEccCCCCCCceEECC-----CCCCEEEEEecCCcEEEEEE
Q ss_pred ---------------CeEEEEeChhHHHHHHHHHHhccccccCCCCccCCCcccccccCCccceecccCCCCCceeeEE
Q 015021 149 ---------------GVLARFDGSEIQKMLQRWFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAI 212 (414)
Q Consensus 149 ---------------~~iv~IdG~sL~~~Lr~c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~ 212 (414)
+.++.|||..| .--.-.....|||+.+..-.++. +|+|++|
T Consensus 379 ~~~v~~s~~~~~~D~g~vaVIDG~~l---------------------llTpf~~a~VPPPMs~~~l~~~~--~v~~vaf 434 (928)
T PF04762_consen 379 AWDVSRSPGSSPNDNGTVAVIDGNKL---------------------LLTPFRRAVVPPPMSSYELELPS--PVNDVAF 434 (928)
T ss_pred EEEEEecCCCCccCceEEEEEeCCeE---------------------EEecccccCCCchHhceEEcCCC--CcEEEEE
No 170
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=41.31 E-value=18 Score=40.04 Aligned_cols=51 Identities=22% Similarity=0.372 Sum_probs=43.5
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecc
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNK 362 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~ 362 (414)
-...|++.|.|+|.|++=+-|-|..+|..+|.+|.-| |--+|.-+|.-.+.
T Consensus 878 ~~R~iaVa~~GN~lAa~LSnGci~~LDaR~G~vINsw---rpmecdllqlaaps 928 (1034)
T KOG4190|consen 878 LTRAIAVADKGNKLAAALSNGCIAILDARNGKVINSW---RPMECDLLQLAAPS 928 (1034)
T ss_pred heeEEEeccCcchhhHHhcCCcEEEEecCCCceeccC---CcccchhhhhcCch
Confidence 3467999999999999999999999999999999988 67778777765544
No 171
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=41.04 E-value=1e+02 Score=32.97 Aligned_cols=99 Identities=17% Similarity=0.211 Sum_probs=58.1
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc-eee--EEEEEecccccccccccCCCCCc
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA-SCV--FMEMLVNKDAATSSAYYAPVKSD 377 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA-q~~--wi~~~~~~~~~~~~~~~~~~k~~ 377 (414)
..|.-+.|....|--+|+|.+.|..+..|-|+|+=... +|..-.-+.| +.. |+......-.. ..+....=..+
T Consensus 59 s~Ls~H~~aVN~vRf~p~gelLASg~D~g~v~lWk~~~---~~~~~~d~e~~~~ke~w~v~k~lr~h~-~diydL~Ws~d 134 (434)
T KOG1009|consen 59 SSLSRHTRAVNVVRFSPDGELLASGGDGGEVFLWKQGD---VRIFDADTEADLNKEKWVVKKVLRGHR-DDIYDLAWSPD 134 (434)
T ss_pred ecccCCcceeEEEEEcCCcCeeeecCCCceEEEEEecC---cCCccccchhhhCccceEEEEEecccc-cchhhhhccCC
Confidence 46888899999999999999999999999999987663 1211000000 001 22211100000 00000000011
Q ss_pred ceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
. -+++.+.-+..+-+|++..|+.+++
T Consensus 135 -~-~~l~s~s~dns~~l~Dv~~G~l~~~ 160 (434)
T KOG1009|consen 135 -S-NFLVSGSVDNSVRLWDVHAGQLLAI 160 (434)
T ss_pred -C-ceeeeeeccceEEEEEeccceeEee
Confidence 1 2556777799999999999998865
No 172
>PRK02889 tolB translocation protein TolB; Provisional
Probab=40.62 E-value=54 Score=34.06 Aligned_cols=40 Identities=28% Similarity=0.363 Sum_probs=30.1
Q ss_pred eEEEECCCCCEEEEEcCCC---cEEEEEcCcceeEeeeccccc
Q 015021 311 ERLTLSPSGSLAAITDSLG---RILLLDTQALVVVRLWKGYRD 350 (414)
Q Consensus 311 ~~i~~sP~~~laa~tD~lG---RV~L~D~~~~~vvrmwKGyRd 350 (414)
.....||+|+++|.+...| +|.++|+.++...++-.+..+
T Consensus 331 ~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt~~~~~ 373 (427)
T PRK02889 331 TSPRISPDGKLLAYISRVGGAFKLYVQDLATGQVTALTDTTRD 373 (427)
T ss_pred CceEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEccCCCCc
Confidence 4578999999888765444 699999999987776555433
No 173
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=40.03 E-value=1.2e+02 Score=31.53 Aligned_cols=34 Identities=26% Similarity=0.432 Sum_probs=29.6
Q ss_pred CCeEEEEEeEEe--CCcEEEEEeccccEEEEEecCCc
Q 015021 76 SEYITAIEWLVF--EEMRALAVGTSRGYFLVYDLKGD 110 (414)
Q Consensus 76 ~e~ITs~~~lpl--~dw~~I~VG~ssG~vrfyte~G~ 110 (414)
.+-|++|.|-|- .+...|||++-+| ||+|...+.
T Consensus 223 ~dpI~di~wAPn~Gr~y~~lAvA~kDg-v~I~~v~~~ 258 (361)
T KOG2445|consen 223 TDPIRDISWAPNIGRSYHLLAVATKDG-VRIFKVKVA 258 (361)
T ss_pred CCcceeeeeccccCCceeeEEEeecCc-EEEEEEeec
Confidence 577999999998 5668999999999 999997653
No 174
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=39.88 E-value=74 Score=35.20 Aligned_cols=78 Identities=13% Similarity=0.156 Sum_probs=59.4
Q ss_pred eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCC
Q 015021 309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPR 388 (414)
Q Consensus 309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyapr 388 (414)
-+..++.||||++.|+..-+|-+=++|-.+..++-+.|.|=.+ .+-+.=..| +| ++.-...
T Consensus 292 ~in~f~FS~DG~~LA~VSqDGfLRvF~fdt~eLlg~mkSYFGG---LLCvcWSPD----------GK------yIvtGGE 352 (636)
T KOG2394|consen 292 SINEFAFSPDGKYLATVSQDGFLRIFDFDTQELLGVMKSYFGG---LLCVCWSPD----------GK------YIVTGGE 352 (636)
T ss_pred cccceeEcCCCceEEEEecCceEEEeeccHHHHHHHHHhhccc---eEEEEEcCC----------cc------EEEecCC
Confidence 6778999999999999988898999999999999999999663 222221112 23 3444667
Q ss_pred CCeEEEeeCCCCcceee
Q 015021 389 KGIIEHLAGMADENWTT 405 (414)
Q Consensus 389 Rg~lEvW~~~~g~~v~~ 405 (414)
+-+|-||++--++-||-
T Consensus 353 DDLVtVwSf~erRVVAR 369 (636)
T KOG2394|consen 353 DDLVTVWSFEERRVVAR 369 (636)
T ss_pred cceEEEEEeccceEEEe
Confidence 89999999998887763
No 175
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=39.73 E-value=74 Score=30.26 Aligned_cols=61 Identities=26% Similarity=0.352 Sum_probs=43.5
Q ss_pred EEEEEeccccEEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEeChhHHH
Q 015021 91 RALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFDGSEIQK 161 (414)
Q Consensus 91 ~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~IdG~sL~~ 161 (414)
.-++||+++| |.+|..+..--..+.++..+|.+|.+-.. -+-|.++..+.+...|-.+|..
T Consensus 8 ~~L~vGt~~G-l~~~~~~~~~~~~~i~~~~~I~ql~vl~~---------~~~llvLsd~~l~~~~L~~l~~ 68 (275)
T PF00780_consen 8 DRLLVGTEDG-LYVYDLSDPSKPTRILKLSSITQLSVLPE---------LNLLLVLSDGQLYVYDLDSLEP 68 (275)
T ss_pred CEEEEEECCC-EEEEEecCCccceeEeecceEEEEEEecc---------cCEEEEEcCCccEEEEchhhcc
Confidence 5789999999 88888755555555666667888885532 3347888887888788777753
No 176
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=39.64 E-value=64 Score=34.54 Aligned_cols=80 Identities=18% Similarity=0.185 Sum_probs=60.9
Q ss_pred cccCCCceeeEEEECCCCC-EEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 302 CLKDHPRKGERLTLSPSGS-LAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~-laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
-++-..+-+..|++||.++ |+......-.|=++|+.+..++.++-.|+- -|--..+..+ .
T Consensus 188 ~lp~~g~~IrdlafSp~~~GLl~~asl~nkiki~dlet~~~vssy~a~~~---~wSC~wDlde------------~---- 248 (463)
T KOG1645|consen 188 ILPGEGSFIRDLAFSPFNEGLLGLASLGNKIKIMDLETSCVVSSYIAYNQ---IWSCCWDLDE------------R---- 248 (463)
T ss_pred cccccchhhhhhccCccccceeeeeccCceEEEEecccceeeeheeccCC---ceeeeeccCC------------c----
Confidence 3677788899999999997 777777777888999999999999988843 3443333221 1
Q ss_pred EEEEEcC-CCCeEEEeeCCCCc
Q 015021 381 CLAIHAP-RKGIIEHLAGMADE 401 (414)
Q Consensus 381 ~LvIyap-rRg~lEvW~~~~g~ 401 (414)
=+|||. .||.|=|+||++-+
T Consensus 249 -h~IYaGl~nG~VlvyD~R~~~ 269 (463)
T KOG1645|consen 249 -HVIYAGLQNGMVLVYDMRQPE 269 (463)
T ss_pred -ceeEEeccCceEEEEEccCCC
Confidence 248886 57999999999854
No 177
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=39.33 E-value=1e+02 Score=34.12 Aligned_cols=73 Identities=15% Similarity=0.094 Sum_probs=55.1
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP 387 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap 387 (414)
-...+.+.+|+..++++.=.+|-|+|+|..++.....=-..+=..++|.. ..+ +++-+.
T Consensus 260 s~v~~ca~sp~E~kLvlGC~DgSiiLyD~~~~~t~~~ka~~~P~~iaWHp-------------------~ga--i~~V~s 318 (545)
T PF11768_consen 260 SQVICCARSPSEDKLVLGCEDGSIILYDTTRGVTLLAKAEFIPTLIAWHP-------------------DGA--IFVVGS 318 (545)
T ss_pred CcceEEecCcccceEEEEecCCeEEEEEcCCCeeeeeeecccceEEEEcC-------------------CCc--EEEEEc
Confidence 35667888999998888888999999999999887664445555555552 112 567788
Q ss_pred CCCeEEEeeCCCCc
Q 015021 388 RKGIIEHLAGMADE 401 (414)
Q Consensus 388 rRg~lEvW~~~~g~ 401 (414)
.||.|-+||+-...
T Consensus 319 ~qGelQ~FD~ALsp 332 (545)
T PF11768_consen 319 EQGELQCFDMALSP 332 (545)
T ss_pred CCceEEEEEeecCc
Confidence 89999999997654
No 178
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=39.17 E-value=66 Score=33.91 Aligned_cols=60 Identities=18% Similarity=0.316 Sum_probs=47.3
Q ss_pred CCccccCCCceeeEEEECCCCCEEEEEcCCCcEE-EEEcCcceeEeee-cccccceeeEEEE
Q 015021 299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRIL-LLDTQALVVVRLW-KGYRDASCVFMEM 358 (414)
Q Consensus 299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~-L~D~~~~~vvrmw-KGyRdAq~~wi~~ 358 (414)
|...+.-++-....|+.+|+|++.|++..-|+|| ++++..|+-+.-+ +|++=+++.=+-.
T Consensus 165 ~v~~I~aH~~~lAalafs~~G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~F 226 (391)
T KOG2110|consen 165 PVNTINAHKGPLAALAFSPDGTLLATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSF 226 (391)
T ss_pred eeeEEEecCCceeEEEECCCCCEEEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEE
Confidence 3345677777889999999999999998889887 7899999988866 5888666554443
No 179
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=38.88 E-value=32 Score=37.76 Aligned_cols=49 Identities=18% Similarity=0.172 Sum_probs=39.2
Q ss_pred EEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEec
Q 015021 79 ITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRG 129 (414)
Q Consensus 79 ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~ 129 (414)
|-|+-+-|-.|| ++||+.+++|-+-+.+|--=+--.+|++=|++||+-.
T Consensus 595 IfSLg~cP~~dW--lavGMens~vevlh~skp~kyqlhlheScVLSlKFa~ 643 (705)
T KOG0639|consen 595 IFSLGYCPTGDW--LAVGMENSNVEVLHTSKPEKYQLHLHESCVLSLKFAY 643 (705)
T ss_pred heecccCCCccc--eeeecccCcEEEEecCCccceeecccccEEEEEEecc
Confidence 333334444778 8999999999999998888777789999999999654
No 180
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=38.50 E-value=69 Score=37.13 Aligned_cols=55 Identities=15% Similarity=0.218 Sum_probs=37.1
Q ss_pred CeEEEEEeEEeCCcEEEEEeccccEEEEEe-----cCCcEEeecccCccceeEEEEecccC
Q 015021 77 EYITAIEWLVFEEMRALAVGTSRGYFLVYD-----LKGDLVHRQLIHPGRILKLRVRGSRR 132 (414)
Q Consensus 77 e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-----e~G~LL~sQ~lh~~pV~~ik~r~~~~ 132 (414)
+.++++..+. .|-.+|||||.+|.|.+|- +.|...--+.--.+||..|-+|..+.
T Consensus 125 ~~p~s~l~Vs-~~l~~Iv~Gf~nG~V~~~~GDi~RDrgsr~~~~~~~~~pITgL~~~~d~~ 184 (933)
T KOG2114|consen 125 PSPASSLAVS-EDLKTIVCGFTNGLVICYKGDILRDRGSRQDYSHRGKEPITGLALRSDGK 184 (933)
T ss_pred CCcceEEEEE-ccccEEEEEecCcEEEEEcCcchhccccceeeeccCCCCceeeEEecCCc
Confidence 3334443333 3578999999999999995 45553333333357999999988764
No 181
>PF06200 tify: tify domain; InterPro: IPR010399 The tify domain is a 36-amino acid domain only found among Embryophyta (land plants). It has been named after the most conserved amino acid pattern (TIF[F/Y]XG) it contains, but was previously known as the Zim domain. As the use of uppercase characters (TIFY) might imply that the domain is fully conserved across proteins, a lowercase lettering has been chosen in an attempt to highlight the reality of its natural variability. Based on the domain architecture, tify domain containing proteins can be classified into two groups. Group I is formed by proteins possessing a CCT (CONSTANS, CO-like, and TOC1) domain and a GATA-type zinc finger in addition to the tify domain. Group II contains proteins characterised by the tify domain but lacking a GATA-type zinc finger. Tify domain containing proteins might be involved in developmental processes and some of them have features that are characteristic for transcription factors: a nuclear localisation and the presence of a putative DNA-binding domain []. Some proteins known to contain a tify domain include: Arabidopsis thaliana Zinc-finger protein expressed in Inflorescence Meristem (ZIM), a putative transcription factor involved in inflorescence and flower development [, ]. A. thaliana ZIM-like proteins (ZML) []. A. thaliana PEAPOD1 and PEAPOD2 (PPD1 and PPD2) [].
Probab=38.23 E-value=57 Score=23.20 Aligned_cols=24 Identities=17% Similarity=0.302 Sum_probs=19.3
Q ss_pred CCCCeEEEEeCCeEEEEeChhHHH
Q 015021 138 TAEEEVCVVMPGVLARFDGSEIQK 161 (414)
Q Consensus 138 ~~~eel~Ily~~~iv~IdG~sL~~ 161 (414)
.+...|+|.|.+.++.+|+..-..
T Consensus 3 ~~~~qLTIfY~G~V~Vfd~v~~~K 26 (36)
T PF06200_consen 3 PETAQLTIFYGGQVCVFDDVPPDK 26 (36)
T ss_pred CCCCcEEEEECCEEEEeCCCCHHH
Confidence 346679999999999999876543
No 182
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=37.77 E-value=1.2e+02 Score=32.76 Aligned_cols=85 Identities=16% Similarity=0.085 Sum_probs=56.4
Q ss_pred cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeec---ccccceeeEEEEEecccccccccccCCCCCcc
Q 015021 302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWK---GYRDASCVFMEMLVNKDAATSSAYYAPVKSDY 378 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwK---GyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~ 378 (414)
.|.|...++=-++-||+|+|.|.+...-..+++++....=+.+.| |+-+ -+.+|.-. |. ++
T Consensus 219 il~~htdEVWfl~FS~nGkyLAsaSkD~Taiiw~v~~d~~~kl~~tlvgh~~-~V~yi~wS-------------PD--dr 282 (519)
T KOG0293|consen 219 ILQDHTDEVWFLQFSHNGKYLASASKDSTAIIWIVVYDVHFKLKKTLVGHSQ-PVSYIMWS-------------PD--DR 282 (519)
T ss_pred hHhhCCCcEEEEEEcCCCeeEeeccCCceEEEEEEecCcceeeeeeeecccC-ceEEEEEC-------------CC--CC
Confidence 589999999999999999999998888888888776655454443 3322 12222111 00 11
Q ss_pred eEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021 379 CLCLAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 379 ~l~LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
+|+-- |-.-.+..|+.++|.+..+
T Consensus 283 --yLlaC-g~~e~~~lwDv~tgd~~~~ 306 (519)
T KOG0293|consen 283 --YLLAC-GFDEVLSLWDVDTGDLRHL 306 (519)
T ss_pred --eEEec-CchHheeeccCCcchhhhh
Confidence 34333 3355689999999998754
No 183
>PRK04792 tolB translocation protein TolB; Provisional
Probab=37.39 E-value=1.2e+02 Score=31.91 Aligned_cols=37 Identities=22% Similarity=0.330 Sum_probs=28.6
Q ss_pred eEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEeeecc
Q 015021 311 ERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRLWKG 347 (414)
Q Consensus 311 ~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrmwKG 347 (414)
.....||+|+.+|. .+..| +|+++|+.++.+.++-.+
T Consensus 265 ~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~~~lt~~ 304 (448)
T PRK04792 265 GAPRFSPDGKKLALVLSKDGQPEIYVVDIATKALTRITRH 304 (448)
T ss_pred CCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCeEECccC
Confidence 46789999975554 55556 499999999998887654
No 184
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=37.19 E-value=61 Score=36.26 Aligned_cols=43 Identities=19% Similarity=0.236 Sum_probs=39.1
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW 345 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw 345 (414)
+.-+.-...+|+++|.|.|+|..-.+|-|=++.+++|..||-|
T Consensus 396 yrGHtg~Vr~iSvdp~G~wlasGsdDGtvriWEi~TgRcvr~~ 438 (733)
T KOG0650|consen 396 YRGHTGLVRSISVDPSGEWLASGSDDGTVRIWEIATGRCVRTV 438 (733)
T ss_pred EeccCCeEEEEEecCCcceeeecCCCCcEEEEEeecceEEEEE
Confidence 5666777889999999999999999999999999999999976
No 185
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=35.67 E-value=1.3e+02 Score=31.20 Aligned_cols=41 Identities=15% Similarity=0.102 Sum_probs=37.5
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD 350 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd 350 (414)
..+|+.+|-.+.+|+.+++|-|..||+.+..=++.+-+|-+
T Consensus 235 VNai~Fhp~~~tfaTgGsDG~V~~Wd~~~rKrl~q~~~~~~ 275 (323)
T KOG1036|consen 235 VNAIAFHPIHGTFATGGSDGIVNIWDLFNRKRLKQLAKYET 275 (323)
T ss_pred eceeEeccccceEEecCCCceEEEccCcchhhhhhccCCCC
Confidence 46899999999999999999999999999999999999943
No 186
>PRK05137 tolB translocation protein TolB; Provisional
Probab=35.54 E-value=1.9e+02 Score=30.05 Aligned_cols=81 Identities=12% Similarity=0.033 Sum_probs=0.0
Q ss_pred ccCCCceeeEEEECCCCCEEEEEc---CCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcce
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITD---SLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYC 379 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD---~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~ 379 (414)
+..+.+.......||+|+.+|.+. ..+.|+++|+.+|.. +....+...-...-..++..
T Consensus 197 lt~~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~-~~l~~~~g~~~~~~~SPDG~----------------- 258 (435)
T PRK05137 197 LTDGSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQR-ELVGNFPGMTFAPRFSPDGR----------------- 258 (435)
T ss_pred EecCCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcE-EEeecCCCcccCcEECCCCC-----------------
Q ss_pred EEEEEEcCCCCeEEEe--eCCCCcc
Q 015021 380 LCLAIHAPRKGIIEHL--AGMADEN 402 (414)
Q Consensus 380 l~LvIyaprRg~lEvW--~~~~g~~ 402 (414)
.|++-..++|..+|| ++.+|+.
T Consensus 259 -~la~~~~~~g~~~Iy~~d~~~~~~ 282 (435)
T PRK05137 259 -KVVMSLSQGGNTDIYTMDLRSGTT 282 (435)
T ss_pred -EEEEEEecCCCceEEEEECCCCce
No 187
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=34.46 E-value=1.5e+02 Score=29.64 Aligned_cols=64 Identities=17% Similarity=0.342 Sum_probs=37.2
Q ss_pred CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCC
Q 015021 319 GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGM 398 (414)
Q Consensus 319 ~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~ 398 (414)
+..+.+.+..|+|+.+|..+|.++ |+---+.++.- .|.-... .++.....|.|-.|+.+
T Consensus 105 ~~~v~v~~~~g~l~ald~~tG~~~--W~~~~~~~~~~----------------~p~v~~~---~v~v~~~~g~l~a~d~~ 163 (377)
T TIGR03300 105 GGLVFVGTEKGEVIALDAEDGKEL--WRAKLSSEVLS----------------PPLVANG---LVVVRTNDGRLTALDAA 163 (377)
T ss_pred CCEEEEEcCCCEEEEEECCCCcEe--eeeccCceeec----------------CCEEECC---EEEEECCCCeEEEEEcC
Confidence 455556667799999999999984 54222221110 0000011 23344567888888888
Q ss_pred CCcce
Q 015021 399 ADENW 403 (414)
Q Consensus 399 ~g~~v 403 (414)
+|+.+
T Consensus 164 tG~~~ 168 (377)
T TIGR03300 164 TGERL 168 (377)
T ss_pred CCcee
Confidence 88764
No 188
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=34.44 E-value=72 Score=32.73 Aligned_cols=40 Identities=15% Similarity=0.116 Sum_probs=33.3
Q ss_pred CCceeeEEEECCCCCEEEE-EcCCCcEEEEEcCcceeEeee
Q 015021 306 HPRKGERLTLSPSGSLAAI-TDSLGRILLLDTQALVVVRLW 345 (414)
Q Consensus 306 ~~R~~~~i~~sP~~~laa~-tD~lGRV~L~D~~~~~vvrmw 345 (414)
.+.-+.+|+.+++|.++|+ +=.-|+|.++|..+|.++..-
T Consensus 215 l~~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~tg~~~~~~ 255 (305)
T PF07433_consen 215 LNGYIGSIAADRDGRLIAVTSPRGGRVAVWDAATGRLLGSV 255 (305)
T ss_pred hCCceEEEEEeCCCCEEEEECCCCCEEEEEECCCCCEeecc
Confidence 3466899999999986655 689999999999999987643
No 189
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=34.27 E-value=81 Score=35.47 Aligned_cols=85 Identities=24% Similarity=0.242 Sum_probs=52.4
Q ss_pred EECCCC-CEEEEEcCCCcEEEEEcCcceeEeeecc-cccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCe
Q 015021 314 TLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKG-YRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGI 391 (414)
Q Consensus 314 ~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKG-yRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~ 391 (414)
+-+|+. .++|++|..|-|.|+|+..-.-- .=|- .++-++.|=-+.+.. + .+++ -+.|-|.-+..
T Consensus 58 s~~~n~eHiLavadE~G~i~l~dt~~~~fr-~ee~~lk~~~aH~nAifDl~--w------apge-----~~lVsasGDsT 123 (720)
T KOG0321|consen 58 SAAPNKEHILAVADEDGGIILFDTKSIVFR-LEERQLKKPLAHKNAIFDLK--W------APGE-----SLLVSASGDST 123 (720)
T ss_pred cCCCCccceEEEecCCCceeeecchhhhcc-hhhhhhcccccccceeEeec--c------CCCc-----eeEEEccCCce
Confidence 344555 68899999999999999764432 1000 112122221111110 0 1122 46788888999
Q ss_pred EEEeeCCCCcceeeccc----eeee
Q 015021 392 IEHLAGMADENWTTSLN----YSVR 412 (414)
Q Consensus 392 lEvW~~~~g~~v~~~~~----~~~~ 412 (414)
+.+|++.+++++-+++| |||+
T Consensus 124 ~r~Wdvk~s~l~G~~~~~GH~~Svk 148 (720)
T KOG0321|consen 124 IRPWDVKTSRLVGGRLNLGHTGSVK 148 (720)
T ss_pred eeeeeeccceeecceeecccccccc
Confidence 99999999999988665 5554
No 190
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=34.24 E-value=1.2e+02 Score=32.14 Aligned_cols=76 Identities=12% Similarity=0.097 Sum_probs=54.8
Q ss_pred CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEee-ecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021 307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRL-WKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH 385 (414)
Q Consensus 307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrm-wKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy 385 (414)
---..++.+.|+|+++-++|..|-+-.+|+.++..+-. +||+-.+- .=|.+. |. + =++-.
T Consensus 247 E~~is~~~l~p~gn~Iy~gn~~g~l~~FD~r~~kl~g~~~kg~tGsi-rsih~h-------------p~--~---~~las 307 (412)
T KOG3881|consen 247 ENPISSTGLTPSGNFIYTGNTKGQLAKFDLRGGKLLGCGLKGITGSI-RSIHCH-------------PT--H---PVLAS 307 (412)
T ss_pred cCcceeeeecCCCcEEEEecccchhheecccCceeeccccCCccCCc-ceEEEc-------------CC--C---ceEEe
Confidence 44567899999999999999999999999999998877 88765411 111111 11 1 14445
Q ss_pred cCCCCeEEEeeCCCCc
Q 015021 386 APRKGIIEHLAGMADE 401 (414)
Q Consensus 386 aprRg~lEvW~~~~g~ 401 (414)
...+..|.|+++++-+
T Consensus 308 ~GLDRyvRIhD~ktrk 323 (412)
T KOG3881|consen 308 CGLDRYVRIHDIKTRK 323 (412)
T ss_pred eccceeEEEeecccch
Confidence 6678889999998833
No 191
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=33.59 E-value=1.3e+02 Score=34.34 Aligned_cols=77 Identities=17% Similarity=0.196 Sum_probs=55.6
Q ss_pred ceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 308 RKGERLTLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 308 R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
|+++.++-.|.. .-+.||-++-||=++|+....++.-+||+|.++= |..+.- ..-+| -+|.+
T Consensus 502 ~rITG~Q~~p~~~~~vLVTSnDSrIRI~d~~~~~lv~KfKG~~n~~S---Q~~Asf--------s~Dgk------~IVs~ 564 (712)
T KOG0283|consen 502 KRITGLQFFPGDPDEVLVTSNDSRIRIYDGRDKDLVHKFKGFRNTSS---QISASF--------SSDGK------HIVSA 564 (712)
T ss_pred ceeeeeEecCCCCCeEEEecCCCceEEEeccchhhhhhhcccccCCc---ceeeeE--------ccCCC------EEEEe
Confidence 467778888776 3577788888999999999999999999999543 111100 01122 35778
Q ss_pred CCCCeEEEeeCCCCc
Q 015021 387 PRKGIIEHLAGMADE 401 (414)
Q Consensus 387 prRg~lEvW~~~~g~ 401 (414)
..+.-|-||+++.-+
T Consensus 565 seDs~VYiW~~~~~~ 579 (712)
T KOG0283|consen 565 SEDSWVYIWKNDSFN 579 (712)
T ss_pred ecCceEEEEeCCCCc
Confidence 899999999986544
No 192
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=33.30 E-value=33 Score=35.96 Aligned_cols=32 Identities=25% Similarity=0.501 Sum_probs=27.2
Q ss_pred EEECCCCCEEEEEcCCCcEEEEEcCcceeEeee
Q 015021 313 LTLSPSGSLAAITDSLGRILLLDTQALVVVRLW 345 (414)
Q Consensus 313 i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw 345 (414)
.+.||+|+|+|+...- |+++=|+.+.++++++
T Consensus 14 c~fSp~g~yiAs~~~y-rlviRd~~tlq~~qlf 45 (447)
T KOG4497|consen 14 CSFSPCGNYIASLSRY-RLVIRDSETLQLHQLF 45 (447)
T ss_pred eeECCCCCeeeeeeee-EEEEeccchhhHHHHH
Confidence 4789999999986444 9999999999998865
No 193
>COG3041 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.49 E-value=15 Score=31.34 Aligned_cols=10 Identities=60% Similarity=1.212 Sum_probs=8.3
Q ss_pred eeccccccee
Q 015021 344 LWKGYRDASC 353 (414)
Q Consensus 344 mwKGyRdAq~ 353 (414)
=|||||||.+
T Consensus 53 ~wkg~RdCHi 62 (91)
T COG3041 53 DWKGYRDCHI 62 (91)
T ss_pred Cccchhhccc
Confidence 3999999765
No 194
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=31.97 E-value=1.6e+02 Score=30.71 Aligned_cols=64 Identities=16% Similarity=0.053 Sum_probs=46.1
Q ss_pred CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCC
Q 015021 319 GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGM 398 (414)
Q Consensus 319 ~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~ 398 (414)
..|+|+.-..-.|=|-|++.|..-+...|+||- +-..+=.+ +.. |..--+.-+|.+..|++|
T Consensus 158 HcLiA~gtr~~~VrLCDi~SGs~sH~LsGHr~~----vlaV~Wsp-----------~~e---~vLatgsaDg~irlWDiR 219 (397)
T KOG4283|consen 158 HCLIAAGTRDVQVRLCDIASGSFSHTLSGHRDG----VLAVEWSP-----------SSE---WVLATGSADGAIRLWDIR 219 (397)
T ss_pred ceEEEEecCCCcEEEEeccCCcceeeeccccCc----eEEEEecc-----------Cce---eEEEecCCCceEEEEEee
Confidence 457788766778999999999999999999993 22222111 111 444567789999999998
Q ss_pred CC
Q 015021 399 AD 400 (414)
Q Consensus 399 ~g 400 (414)
.-
T Consensus 220 ra 221 (397)
T KOG4283|consen 220 RA 221 (397)
T ss_pred cc
Confidence 63
No 195
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=31.89 E-value=1.5e+02 Score=31.02 Aligned_cols=85 Identities=13% Similarity=0.112 Sum_probs=60.1
Q ss_pred CCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcc-eeEeeecccccceeeEEEEEecccccccccccCCCC
Q 015021 297 ASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQAL-VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVK 375 (414)
Q Consensus 297 a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~-~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k 375 (414)
..+++.|.-+.-+-...+--|+.+|++.+..+-..-|+|.... ..+-++.|+-|---.=+- +.+
T Consensus 304 ge~v~~LtGHd~ELtHcstHptQrLVvTsSrDtTFRLWDFReaI~sV~VFQGHtdtVTS~vF---~~d------------ 368 (481)
T KOG0300|consen 304 GEVVNILTGHDSELTHCSTHPTQRLVVTSSRDTTFRLWDFREAIQSVAVFQGHTDTVTSVVF---NTD------------ 368 (481)
T ss_pred CceeccccCcchhccccccCCcceEEEEeccCceeEeccchhhcceeeeecccccceeEEEE---ecC------------
Confidence 3455566667777888899999999999988888888998643 356788998882211111 001
Q ss_pred CcceEEEEEEcCCCCeEEEeeCCCCc
Q 015021 376 SDYCLCLAIHAPRKGIIEHLAGMADE 401 (414)
Q Consensus 376 ~~~~l~LvIyaprRg~lEvW~~~~g~ 401 (414)
.-|+...-+..|+||+++|-+
T Consensus 369 -----d~vVSgSDDrTvKvWdLrNMR 389 (481)
T KOG0300|consen 369 -----DRVVSGSDDRTVKVWDLRNMR 389 (481)
T ss_pred -----CceeecCCCceEEEeeecccc
Confidence 245777778999999999854
No 196
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=31.86 E-value=2.1e+02 Score=33.95 Aligned_cols=82 Identities=16% Similarity=0.124 Sum_probs=47.8
Q ss_pred eEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee----cccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 311 ERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW----KGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 311 ~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw----KGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
..|+++|+|++.++--..+||-+||..++.+..+- .||+|-. .....- +.+.......+.. |.|--
T Consensus 807 ~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG~----~~~a~l----~~P~GIavd~dG~--lyVaD 876 (1057)
T PLN02919 807 LGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGK----ALKAQL----SEPAGLALGENGR--LFVAD 876 (1057)
T ss_pred ceeeEeCCCcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCCc----cccccc----CCceEEEEeCCCC--EEEEE
Confidence 47899999987666556789999999998877654 2343310 000000 0000000000111 44556
Q ss_pred CCCCeEEEeeCCCCcc
Q 015021 387 PRKGIIEHLAGMADEN 402 (414)
Q Consensus 387 prRg~lEvW~~~~g~~ 402 (414)
-..+.|.+|++.+++.
T Consensus 877 t~Nn~Irvid~~~~~~ 892 (1057)
T PLN02919 877 TNNSLIRYLDLNKGEA 892 (1057)
T ss_pred CCCCEEEEEECCCCcc
Confidence 6778999999998864
No 197
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=31.83 E-value=1.1e+02 Score=33.35 Aligned_cols=78 Identities=18% Similarity=0.161 Sum_probs=57.1
Q ss_pred CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021 307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA 386 (414)
Q Consensus 307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya 386 (414)
.-...++.-+|+|+++|+.=..|.|.|+|..+..-+|-.+|.=.+.++=+. = +. ..++-.
T Consensus 217 ~~~vtSv~ws~~G~~LavG~~~g~v~iwD~~~~k~~~~~~~~h~~rvg~la---W-------------~~----~~lssG 276 (484)
T KOG0305|consen 217 EELVTSVKWSPDGSHLAVGTSDGTVQIWDVKEQKKTRTLRGSHASRVGSLA---W-------------NS----SVLSSG 276 (484)
T ss_pred CCceEEEEECCCCCEEEEeecCCeEEEEehhhccccccccCCcCceeEEEe---c-------------cC----ceEEEe
Confidence 667899999999999999999999999999999999999884222222111 0 00 245666
Q ss_pred CCCCeEEEeeCCCCccee
Q 015021 387 PRKGIIEHLAGMADENWT 404 (414)
Q Consensus 387 prRg~lEvW~~~~g~~v~ 404 (414)
-|.|.|-+++++..+.+.
T Consensus 277 sr~~~I~~~dvR~~~~~~ 294 (484)
T KOG0305|consen 277 SRDGKILNHDVRISQHVV 294 (484)
T ss_pred cCCCcEEEEEEecchhhh
Confidence 677777777777766553
No 198
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=31.61 E-value=2.1e+02 Score=29.26 Aligned_cols=80 Identities=13% Similarity=0.081 Sum_probs=58.0
Q ss_pred cCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee-cccccceeeEEEEEecccccccccccCCCCCcceEEE
Q 015021 304 KDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW-KGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCL 382 (414)
Q Consensus 304 ~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw-KGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~L 382 (414)
.|.+-.+++-.|.|.....++.-.++.+.-+|-.+|.=+-.+ ||+-.+ +.+..-.+++ -+
T Consensus 221 ~k~P~nV~SASL~P~k~~fVaGged~~~~kfDy~TgeEi~~~nkgh~gp----VhcVrFSPdG---------------E~ 281 (334)
T KOG0278|consen 221 YKMPCNVESASLHPKKEFFVAGGEDFKVYKFDYNTGEEIGSYNKGHFGP----VHCVRFSPDG---------------EL 281 (334)
T ss_pred ccCccccccccccCCCceEEecCcceEEEEEeccCCceeeecccCCCCc----eEEEEECCCC---------------ce
Confidence 355677889999999999999999999999999999988886 877652 2111111111 12
Q ss_pred EEEcCCCCeEEEeeCCCCcc
Q 015021 383 AIHAPRKGIIEHLAGMADEN 402 (414)
Q Consensus 383 vIyaprRg~lEvW~~~~g~~ 402 (414)
.-....+|.|.+|.+-.|+-
T Consensus 282 yAsGSEDGTirlWQt~~~~~ 301 (334)
T KOG0278|consen 282 YASGSEDGTIRLWQTTPGKT 301 (334)
T ss_pred eeccCCCceEEEEEecCCCc
Confidence 23456789999999877653
No 199
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=31.39 E-value=4.4e+02 Score=27.75 Aligned_cols=41 Identities=22% Similarity=0.089 Sum_probs=36.0
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEe
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVR 343 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvr 343 (414)
|.-+-|-+.+|+=||+|++..++..+--|.|+|+..|.+++
T Consensus 61 lsaH~~pi~sl~WS~dgr~LltsS~D~si~lwDl~~gs~l~ 101 (405)
T KOG1273|consen 61 LSAHVRPITSLCWSRDGRKLLTSSRDWSIKLWDLLKGSPLK 101 (405)
T ss_pred hhccccceeEEEecCCCCEeeeecCCceeEEEeccCCCcee
Confidence 44556788999999999999999888999999999998876
No 200
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=30.53 E-value=3.7e+02 Score=28.13 Aligned_cols=79 Identities=19% Similarity=0.299 Sum_probs=57.2
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEE-EEEcCcceeEeee-cccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRIL-LLDTQALVVVRLW-KGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~-L~D~~~~~vvrmw-KGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
++-+.-.+..++|.=+|.++|++..-|.+| ++|+.+|..+.-. +|+-.|.+.=|....+ ..
T Consensus 177 I~AH~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~-----------------~s 239 (346)
T KOG2111|consen 177 INAHDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSPN-----------------SS 239 (346)
T ss_pred EEcccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCCC-----------------cc
Confidence 666667788999999999999999999876 8899999988754 5777777666654211 11
Q ss_pred EEEEEcCCCCeEEEeeCCC
Q 015021 381 CLAIHAPRKGIIEHLAGMA 399 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~ 399 (414)
+|+. +..+|.|-|+.++-
T Consensus 240 ~Lav-sSdKgTlHiF~l~~ 257 (346)
T KOG2111|consen 240 WLAV-SSDKGTLHIFSLRD 257 (346)
T ss_pred EEEE-EcCCCeEEEEEeec
Confidence 3443 44577777777664
No 201
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=30.10 E-value=50 Score=34.55 Aligned_cols=39 Identities=28% Similarity=0.306 Sum_probs=31.4
Q ss_pred EEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEec
Q 015021 91 RALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRG 129 (414)
Q Consensus 91 ~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~ 129 (414)
+.||||+|+|.||+|+ ++|.+|-.=--|+.-+..+|+-+
T Consensus 41 ~~vav~lSngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~ 80 (376)
T KOG1188|consen 41 TAVAVSLSNGSVRLYDKGTGQLLEEFKGPPATTNGVRFIS 80 (376)
T ss_pred eeEEEEecCCeEEEEeccchhhhheecCCCCcccceEEec
Confidence 7899999999999999 57777765566777777777655
No 202
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=29.53 E-value=76 Score=36.22 Aligned_cols=58 Identities=28% Similarity=0.230 Sum_probs=48.0
Q ss_pred cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEE
Q 015021 302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEML 359 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~ 359 (414)
+|.-+.--+++|.+++...|+|..-..|-|=++|+..+.++|-.-|+|+- .+.|....
T Consensus 65 S~~~hespIeSl~f~~~E~LlaagsasgtiK~wDleeAk~vrtLtgh~~~~~sv~f~P~~ 124 (825)
T KOG0267|consen 65 SLTGHESPIESLTFDTSERLLAAGSASGTIKVWDLEEAKIVRTLTGHLLNITSVDFHPYG 124 (825)
T ss_pred eeeccCCcceeeecCcchhhhcccccCCceeeeehhhhhhhhhhhccccCcceeeeccce
Confidence 45666667899999999999999999999999999999999999999874 33444443
No 203
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=29.15 E-value=2.9e+02 Score=28.94 Aligned_cols=57 Identities=9% Similarity=-0.023 Sum_probs=36.1
Q ss_pred cEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCCCCccee
Q 015021 330 RILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWT 404 (414)
Q Consensus 330 RV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~ 404 (414)
-|-++|+.++..|+-|+.=.. .-=|++.... | . .|+.-..-.|.|.||+..+|+.+.
T Consensus 270 eVWv~D~~t~krv~Ri~l~~~--~~Si~Vsqd~------------~--P--~L~~~~~~~~~l~v~D~~tGk~~~ 326 (342)
T PF06433_consen 270 EVWVYDLKTHKRVARIPLEHP--IDSIAVSQDD------------K--P--LLYALSAGDGTLDVYDAATGKLVR 326 (342)
T ss_dssp EEEEEETTTTEEEEEEEEEEE--ESEEEEESSS------------S-----EEEEEETTTTEEEEEETTT--EEE
T ss_pred EEEEEECCCCeEEEEEeCCCc--cceEEEccCC------------C--c--EEEEEcCCCCeEEEEeCcCCcEEe
Confidence 488999999999998864222 1124443211 1 1 344445568999999999999884
No 204
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=28.83 E-value=47 Score=36.22 Aligned_cols=69 Identities=14% Similarity=0.189 Sum_probs=48.7
Q ss_pred CCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEe
Q 015021 316 SPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHL 395 (414)
Q Consensus 316 sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW 395 (414)
-|...|++.....|+|.+||=.++.+||..||=|. =+.+.++ |..+=+.--..-..-|+||
T Consensus 403 GPrsEyVvSGSDCGhIFiW~K~t~eii~~MegDr~----VVNCLEp---------------HP~~PvLAsSGid~DVKIW 463 (559)
T KOG1334|consen 403 GPRSEYVVSGSDCGHIFIWDKKTGEIIRFMEGDRH----VVNCLEP---------------HPHLPVLASSGIDHDVKIW 463 (559)
T ss_pred cCccceEEecCccceEEEEecchhHHHHHhhcccc----eEeccCC---------------CCCCchhhccCCccceeee
Confidence 46668999989999999999999999999999998 3444432 1111122233456778888
Q ss_pred eCCCCcce
Q 015021 396 AGMADENW 403 (414)
Q Consensus 396 ~~~~g~~v 403 (414)
+-+.-++.
T Consensus 464 TP~~~er~ 471 (559)
T KOG1334|consen 464 TPLTAERA 471 (559)
T ss_pred cCCccccc
Confidence 87655544
No 205
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=28.82 E-value=2.2e+02 Score=29.12 Aligned_cols=79 Identities=18% Similarity=0.189 Sum_probs=50.1
Q ss_pred ccCCCceeeEEEECCC--CCEEEEEc------CCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCC
Q 015021 303 LKDHPRKGERLTLSPS--GSLAAITD------SLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPV 374 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~--~~laa~tD------~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~ 374 (414)
+.-..|+|.++.-||= ++||++|- .-||+.++|+..+.=|+...-|-=+.+-|=..-.+.
T Consensus 4 ~~tpgf~GysvqfSPf~~nrLavAt~q~yGl~G~G~L~ile~~~~~gi~e~~s~d~~D~LfdV~Wse~------------ 71 (311)
T KOG0277|consen 4 HTTPGFHGYSVQFSPFVENRLAVATAQHYGLAGNGRLFILEVTDPKGIQECQSYDTEDGLFDVAWSEN------------ 71 (311)
T ss_pred eecCCcccceeEecccccchhheeehhhcccccCceEEEEecCCCCCeEEEEeeecccceeEeeecCC------------
Confidence 3345699999999996 57777753 468999999986665665544433322221111110
Q ss_pred CCcceEEEEEEcCCCCeEEEeeC
Q 015021 375 KSDYCLCLAIHAPRKGIIEHLAG 397 (414)
Q Consensus 375 k~~~~l~LvIyaprRg~lEvW~~ 397 (414)
+=-.+|-|--+|.|.+|++
T Consensus 72 ----~e~~~~~a~GDGSLrl~d~ 90 (311)
T KOG0277|consen 72 ----HENQVIAASGDGSLRLFDL 90 (311)
T ss_pred ----CcceEEEEecCceEEEecc
Confidence 0025677888899999985
No 206
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=28.72 E-value=2.7e+02 Score=27.97 Aligned_cols=80 Identities=9% Similarity=0.098 Sum_probs=48.2
Q ss_pred CeeeeccCcceeeeeecc--eEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe---c
Q 015021 33 PNLLCALDMHTIALANRY--QTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD---L 107 (414)
Q Consensus 33 ~~~~~sp~~~~la~A~~~--~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt---e 107 (414)
..|.+||+|+.|-++++. .+.+.+.+.+ .+.++..+.+.. +++.-..+..=| .+-.+++.+-.++.|.+|. +
T Consensus 248 ~~i~ispdg~~lyvsnr~~~sI~vf~~d~~-~g~l~~~~~~~~-~G~~Pr~~~~s~-~g~~l~Va~~~s~~v~vf~~d~~ 324 (345)
T PF10282_consen 248 AEIAISPDGRFLYVSNRGSNSISVFDLDPA-TGTLTLVQTVPT-GGKFPRHFAFSP-DGRYLYVANQDSNTVSVFDIDPD 324 (345)
T ss_dssp EEEEE-TTSSEEEEEECTTTEEEEEEECTT-TTTEEEEEEEEE-SSSSEEEEEE-T-TSSEEEEEETTTTEEEEEEEETT
T ss_pred eeEEEecCCCEEEEEeccCCEEEEEEEecC-CCceEEEEEEeC-CCCCccEEEEeC-CCCEEEEEecCCCeEEEEEEeCC
Confidence 458899999999988876 5666666443 233444444433 233333333222 2345666778899999996 4
Q ss_pred CCcEEeec
Q 015021 108 KGDLVHRQ 115 (414)
Q Consensus 108 ~G~LL~sQ 115 (414)
+|.|-...
T Consensus 325 tG~l~~~~ 332 (345)
T PF10282_consen 325 TGKLTPVG 332 (345)
T ss_dssp TTEEEEEE
T ss_pred CCcEEEec
Confidence 77776655
No 207
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=28.43 E-value=1.2e+02 Score=30.62 Aligned_cols=34 Identities=24% Similarity=0.262 Sum_probs=26.8
Q ss_pred ceeeEEEECCCCCEEEEEcCCC---cEEEEEcCccee
Q 015021 308 RKGERLTLSPSGSLAAITDSLG---RILLLDTQALVV 341 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lG---RV~L~D~~~~~v 341 (414)
........+|+|++++.++..+ +|.++|+.++..
T Consensus 322 ~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~ 358 (417)
T TIGR02800 322 GYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGE 358 (417)
T ss_pred CCccCeEECCCCCEEEEEEccCCceEEEEEeCCCCCe
Confidence 3445778999999888876665 999999988654
No 208
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=28.32 E-value=45 Score=33.94 Aligned_cols=40 Identities=30% Similarity=0.356 Sum_probs=35.2
Q ss_pred CCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEc
Q 015021 297 ASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDT 336 (414)
Q Consensus 297 a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~ 336 (414)
..||..|.-++-.+.+++-+|+..+.|.+.-++||-|+++
T Consensus 283 l~pLAVLkyHsagvn~vAfspd~~lmAaaskD~rISLWkL 322 (323)
T KOG0322|consen 283 LNPLAVLKYHSAGVNAVAFSPDCELMAAASKDARISLWKL 322 (323)
T ss_pred CCchhhhhhhhcceeEEEeCCCCchhhhccCCceEEeeec
Confidence 4567778888888999999999999999999999999986
No 209
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=27.91 E-value=81 Score=28.79 Aligned_cols=28 Identities=36% Similarity=0.338 Sum_probs=22.0
Q ss_pred ECCCCCEEEEEcCCCcEEEEEcCcceeE
Q 015021 315 LSPSGSLAAITDSLGRILLLDTQALVVV 342 (414)
Q Consensus 315 ~sP~~~laa~tD~lGRV~L~D~~~~~vv 342 (414)
+.+.+..+.+++..|+|+.+|+.+|.++
T Consensus 208 ~~~~~~~l~~~~~~~~l~~~d~~tG~~~ 235 (238)
T PF13360_consen 208 PSVDGGTLYVTSSDGRLYALDLKTGKVV 235 (238)
T ss_dssp EECCCTEEEEEETTTEEEEEETTTTEEE
T ss_pred ceeeCCEEEEEeCCCEEEEEECCCCCEE
Confidence 3456666666668999999999999875
No 210
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=27.66 E-value=2e+02 Score=25.40 Aligned_cols=46 Identities=17% Similarity=0.230 Sum_probs=34.3
Q ss_pred EEEEEeEEeCCc--EEEEEeccccEEEEEecCCcEEeecccCccceeEEE
Q 015021 79 ITAIEWLVFEEM--RALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLR 126 (414)
Q Consensus 79 ITs~~~lpl~dw--~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik 126 (414)
|||++...+.+. .=++||..+..+|+|... .++.++=-.+.|..|.
T Consensus 2 V~al~~~d~d~dg~~eLlvGs~D~~IRvf~~~--e~~~Ei~e~~~v~~L~ 49 (111)
T PF14783_consen 2 VTALCLFDFDGDGENELLVGSDDFEIRVFKGD--EIVAEITETDKVTSLC 49 (111)
T ss_pred eeEEEEEecCCCCcceEEEecCCcEEEEEeCC--cEEEEEecccceEEEE
Confidence 788888888443 889999999999999853 4455544456666665
No 211
>PRK00178 tolB translocation protein TolB; Provisional
Probab=27.27 E-value=3.1e+02 Score=28.15 Aligned_cols=41 Identities=24% Similarity=0.149 Sum_probs=29.4
Q ss_pred eEEEECCCCCEEEEEcC-CC--cEEEEEcCcceeEeeecccccc
Q 015021 311 ERLTLSPSGSLAAITDS-LG--RILLLDTQALVVVRLWKGYRDA 351 (414)
Q Consensus 311 ~~i~~sP~~~laa~tD~-lG--RV~L~D~~~~~vvrmwKGyRdA 351 (414)
.....||+|++++.+.. .| +|.++|+.++..-++-.+..|.
T Consensus 334 ~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~lt~~~~~~ 377 (430)
T PRK00178 334 ARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRILTDTSLDE 377 (430)
T ss_pred cceEECCCCCEEEEEEccCCceEEEEEECCCCCEEEccCCCCCC
Confidence 45688999988887643 23 5899999998876665554443
No 212
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=26.82 E-value=1e+02 Score=32.25 Aligned_cols=42 Identities=26% Similarity=0.356 Sum_probs=35.7
Q ss_pred CCCC--ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCc
Q 015021 297 ASPL--TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQA 338 (414)
Q Consensus 297 a~pl--~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~ 338 (414)
..|| ..|.-++..+..++.|-+|++.|+.-.+|-|.|+|+..
T Consensus 74 ~Hpl~~~~LKgH~~~vt~~~FsSdGK~lat~~~Dr~Ir~w~~~D 117 (420)
T KOG2096|consen 74 VHPLNVSVLKGHKKEVTDVAFSSDGKKLATISGDRSIRLWDVRD 117 (420)
T ss_pred ccchhhhhhhccCCceeeeEEcCCCceeEEEeCCceEEEEecch
Confidence 3455 35889999999999999999999988899999998754
No 213
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=26.71 E-value=6.9e+02 Score=28.33 Aligned_cols=93 Identities=16% Similarity=0.209 Sum_probs=57.4
Q ss_pred eeeeccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCc-EEEEEeccccEEEEEec-CCcE
Q 015021 34 NLLCALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEM-RALAVGTSRGYFLVYDL-KGDL 111 (414)
Q Consensus 34 ~~~~sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw-~~I~VG~ssG~vrfyte-~G~L 111 (414)
.+..|..+.. |+-.+++--++=|+..+ +.++..-.++ +.+.|..+-|...+|- .+++|||. .+|.+|+. .-+.
T Consensus 34 li~gss~~k~-a~V~~~~~~LtIWD~~~-~~lE~~~~f~--~~~~I~dLDWtst~d~qsiLaVGf~-~~v~l~~Q~R~dy 108 (631)
T PF12234_consen 34 LISGSSIKKI-AVVDSSRSELTIWDTRS-GVLEYEESFS--EDDPIRDLDWTSTPDGQSILAVGFP-HHVLLYTQLRYDY 108 (631)
T ss_pred eEeecccCcE-EEEECCCCEEEEEEcCC-cEEEEeeeec--CCCceeeceeeecCCCCEEEEEEcC-cEEEEEEccchhh
Confidence 3445554444 55565555555677664 2244444442 3688999999999775 88999996 47777775 2222
Q ss_pred EeecccCccceeEEEEecccC
Q 015021 112 VHRQLIHPGRILKLRVRGSRR 132 (414)
Q Consensus 112 L~sQ~lh~~pV~~ik~r~~~~ 132 (414)
+-... .-.||.+|.++...+
T Consensus 109 ~~~~p-~w~~i~~i~i~~~T~ 128 (631)
T PF12234_consen 109 TNKGP-SWAPIRKIDISSHTP 128 (631)
T ss_pred hcCCc-ccceeEEEEeecCCC
Confidence 21111 125899999987553
No 214
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=26.64 E-value=94 Score=21.30 Aligned_cols=18 Identities=22% Similarity=0.228 Sum_probs=16.2
Q ss_pred EcCCCcEEEEEcCcceeE
Q 015021 325 TDSLGRILLLDTQALVVV 342 (414)
Q Consensus 325 tD~lGRV~L~D~~~~~vv 342 (414)
....|+|.-+|..+|.++
T Consensus 6 ~~~~g~l~AlD~~TG~~~ 23 (38)
T PF01011_consen 6 GTPDGYLYALDAKTGKVL 23 (38)
T ss_dssp ETTTSEEEEEETTTTSEE
T ss_pred eCCCCEEEEEECCCCCEE
Confidence 388999999999999986
No 215
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=26.37 E-value=3.5e+02 Score=24.57 Aligned_cols=85 Identities=18% Similarity=0.096 Sum_probs=51.0
Q ss_pred EeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCC
Q 015021 55 INWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRD 133 (414)
Q Consensus 55 ~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~ 133 (414)
..|+.. .++..|+-.+.+ .++...+.++++.-.+.++..+|.|..|+ .+|.++.+.-+ ++++... +
T Consensus 6 ~~~d~~-tG~~~W~~~~~~----~~~~~~~~~~~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~-~~~~~~~-----~-- 72 (238)
T PF13360_consen 6 SALDPR-TGKELWSYDLGP----GIGGPVATAVPDGGRVYVASGDGNLYALDAKTGKVLWRFDL-PGPISGA-----P-- 72 (238)
T ss_dssp EEEETT-TTEEEEEEECSS----SCSSEEETEEEETTEEEEEETTSEEEEEETTTSEEEEEEEC-SSCGGSG-----E--
T ss_pred EEEECC-CCCEEEEEECCC----CCCCccceEEEeCCEEEEEcCCCEEEEEECCCCCEEEEeec-cccccce-----e--
Confidence 344443 466888887733 14444444554444566668999999999 59999988877 4443322 0
Q ss_pred CCCCCCCCeEEEEeCCe-EEEEe
Q 015021 134 LTQDTAEEEVCVVMPGV-LARFD 155 (414)
Q Consensus 134 ~~~~~~~eel~Ily~~~-iv~Id 155 (414)
......++|...+. +..+|
T Consensus 73 ---~~~~~~v~v~~~~~~l~~~d 92 (238)
T PF13360_consen 73 ---VVDGGRVYVGTSDGSLYALD 92 (238)
T ss_dssp ---EEETTEEEEEETTSEEEEEE
T ss_pred ---eecccccccccceeeeEecc
Confidence 11145566666543 44444
No 216
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=26.27 E-value=1.1e+02 Score=35.57 Aligned_cols=46 Identities=22% Similarity=0.419 Sum_probs=36.9
Q ss_pred EEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEE
Q 015021 313 LTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEML 359 (414)
Q Consensus 313 i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~ 359 (414)
=+.||++..+|+.++.|||..+|- +-..+|=||.|-.-++.++...
T Consensus 29 sc~~s~~~~vvigt~~G~V~~Ln~-s~~~~~~fqa~~~siv~~L~~~ 74 (933)
T KOG2114|consen 29 SCCSSSTGSVVIGTADGRVVILNS-SFQLIRGFQAYEQSIVQFLYIL 74 (933)
T ss_pred eEEcCCCceEEEeeccccEEEecc-cceeeehheecchhhhhHhhcc
Confidence 356888899999999999999874 4456699999988777776654
No 217
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=26.12 E-value=2.1e+02 Score=30.61 Aligned_cols=74 Identities=18% Similarity=0.113 Sum_probs=52.8
Q ss_pred CCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcc----eeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021 306 HPRKGERLTLSPSG-SLAAITDSLGRILLLDTQAL----VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL 380 (414)
Q Consensus 306 ~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~----~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l 380 (414)
+.+..+.|+=||+. ..+|.+.-.|-|=++|+..+ ++.+ |.+-. =+.|.. + ..+.-
T Consensus 256 H~~SVEDLqWSptE~~vfaScS~DgsIrIWDiRs~~~~~~~~~--kAh~s----DVNVIS----W---------nr~~~- 315 (440)
T KOG0302|consen 256 HTKSVEDLQWSPTEDGVFASCSCDGSIRIWDIRSGPKKAAVST--KAHNS----DVNVIS----W---------NRREP- 315 (440)
T ss_pred cccchhhhccCCccCceEEeeecCceEEEEEecCCCccceeEe--eccCC----ceeeEE----c---------cCCcc-
Confidence 77889999999998 68888899999999999988 3333 44332 233321 1 01112
Q ss_pred EEEEEcCCCCeEEEeeCCCC
Q 015021 381 CLAIHAPRKGIIEHLAGMAD 400 (414)
Q Consensus 381 ~LvIyaprRg~lEvW~~~~g 400 (414)
|.-..--.|.+.||++|+.
T Consensus 316 -lLasG~DdGt~~iwDLR~~ 334 (440)
T KOG0302|consen 316 -LLASGGDDGTLSIWDLRQF 334 (440)
T ss_pred -eeeecCCCceEEEEEhhhc
Confidence 5567788999999999983
No 218
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=26.05 E-value=1.2e+02 Score=19.70 Aligned_cols=20 Identities=30% Similarity=0.278 Sum_probs=14.2
Q ss_pred EEEcCCCcEEEEEcCcceeE
Q 015021 323 AITDSLGRILLLDTQALVVV 342 (414)
Q Consensus 323 a~tD~lGRV~L~D~~~~~vv 342 (414)
.+.|+.|+|++-|..+..|.
T Consensus 7 vav~~~g~i~VaD~~n~rV~ 26 (28)
T PF01436_consen 7 VAVDSDGNIYVADSGNHRVQ 26 (28)
T ss_dssp EEEETTSEEEEEECCCTEEE
T ss_pred EEEeCCCCEEEEECCCCEEE
Confidence 34568888888887776654
No 219
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=25.95 E-value=1.4e+02 Score=30.83 Aligned_cols=45 Identities=22% Similarity=0.309 Sum_probs=38.3
Q ss_pred eEEEECCCCCEEEEEc-CCCcEEEEEcCcceeEe------eecccccceeeE
Q 015021 311 ERLTLSPSGSLAAITD-SLGRILLLDTQALVVVR------LWKGYRDASCVF 355 (414)
Q Consensus 311 ~~i~~sP~~~laa~tD-~lGRV~L~D~~~~~vvr------mwKGyRdAq~~w 355 (414)
..+.++|+|+.+.++| ..++|.++|..+..+.| +=.|.+.+...+
T Consensus 163 ~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~v~~~~~~~~~~~~~~P~~i~v 214 (381)
T COG3391 163 TGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVVRGSVGSLVGVGTGPAGIAV 214 (381)
T ss_pred ceEEECCCCCeEEEEecCCCeEEEEeCCCcceeccccccccccCCCCceEEE
Confidence 7899999999777776 89999999999999997 666777766665
No 220
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=25.93 E-value=2.8e+02 Score=33.03 Aligned_cols=36 Identities=28% Similarity=0.115 Sum_probs=27.5
Q ss_pred eeEEEECCCCCEEEEEcCC-CcEEEEEcCcceeEeee
Q 015021 310 GERLTLSPSGSLAAITDSL-GRILLLDTQALVVVRLW 345 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~l-GRV~L~D~~~~~vvrmw 345 (414)
-..|+++|+|+.+.++|.. +||..+|+.++.+.-+.
T Consensus 742 P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~ 778 (1057)
T PLN02919 742 PSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLA 778 (1057)
T ss_pred ccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEE
Confidence 3459999999866777765 89999999987654333
No 221
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=25.78 E-value=61 Score=40.67 Aligned_cols=67 Identities=12% Similarity=0.066 Sum_probs=54.6
Q ss_pred ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021 308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP 387 (414)
Q Consensus 308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap 387 (414)
-..+.|+-.|..++....-..|-|.|+|+.+.+.++-|+. |. .. -...-.+
T Consensus 2337 ~gaT~l~~~P~~qllisggr~G~v~l~D~rqrql~h~~~~-------~~-~~---------------------~~f~~~s 2387 (2439)
T KOG1064|consen 2337 GGATVLAYAPKHQLLISGGRKGEVCLFDIRQRQLRHTFQA-------LD-TR---------------------EYFVTGS 2387 (2439)
T ss_pred CCceEEEEcCcceEEEecCCcCcEEEeehHHHHHHHHhhh-------hh-hh---------------------heeeccC
Confidence 3467889999999999999999999999999999999977 22 00 1446678
Q ss_pred CCCeEEEeeCCCCcce
Q 015021 388 RKGIIEHLAGMADENW 403 (414)
Q Consensus 388 rRg~lEvW~~~~g~~v 403 (414)
.||.+.||++-.-..+
T Consensus 2388 s~g~ikIw~~s~~~ll 2403 (2439)
T KOG1064|consen 2388 SEGNIKIWRLSEFGLL 2403 (2439)
T ss_pred cccceEEEEccccchh
Confidence 8999999999776555
No 222
>PF11396 DUF2874: Protein of unknown function (DUF2874); InterPro: IPR021533 This bacterial family of proteins are probable periplasmic proteins with unknown function. There are between one and four copies of this domain per sequence. ; PDB: 3DUE_A 3U1W_B 3DB7_A 4DSD_A 3ELG_A.
Probab=24.87 E-value=2.9e+02 Score=20.40 Aligned_cols=38 Identities=16% Similarity=0.170 Sum_probs=32.2
Q ss_pred CCCeEEEEEeEEeCCcEEEEEecc---ccEEEEEecCCcEE
Q 015021 75 ASEYITAIEWLVFEEMRALAVGTS---RGYFLVYDLKGDLV 112 (414)
Q Consensus 75 ~~e~ITs~~~lpl~dw~~I~VG~s---sG~vrfyte~G~LL 112 (414)
++..|+.+.-.-.++....-|-+. .++-..|+.+|.+|
T Consensus 21 p~~~i~~v~~~~~~~~~~Y~v~l~~~~~~~~v~fd~~G~~l 61 (61)
T PF11396_consen 21 PGAKIKEVEKETDPGGKYYEVELKKGGNEYEVYFDANGNWL 61 (61)
T ss_dssp TTSEEEEEEEEEETTEEEEEEEETETTTSEEEEEETTS-EE
T ss_pred CCCeEEEEEEEEcCCCCEEEEEEEEeCCeEEEEEcCCCCCC
Confidence 467888888888888888889999 99999999999986
No 223
>PF02393 US22: US22 like; InterPro: IPR003360 Herpesviruses are large and complex DNA viruses, widely found in nature. Human cytomegalovirus (HCMV), an important human pathogen, defines the betaherpesvirus family. Mouse cytomegalovirus (MCMV) and rat cytomegalovirus serve as biological model systems for HCMV. HCMV, MCMV, and rat CMV display the largest genomes among the herpesviruses and are essentially co-linear over the central 180 kb of the 230-kb genomes. Betaherpesviruses, which include the CMVs as well as human herpesviruses 6 and 7, differ from alpha- and gammaherpesviruses by the presence of additional gene families such as the US22 gene family, which are mainly clustered at the ends of the genome. The US22 family was first described in HCMV. This gene family comprises 12 members in both HCMV and MCMV and 11 in rat CMV []. Members of the US22 gene family are characterised by stretches of hydrophobic and charged residues as well as up to four conserved sequence motifs which are specific for betaherpesviruses. Motif I differs between the HCMV US and UL family members []. Motifs I and II have consensus sequences, while motifs III and IV are less well defined but have stretches of non-polar residues [, ]. Members of this gene family are widely divergent in function and their involvement in viral replication []. This entry contains US22 family members from the Cytomegalovirus, Muromegalovirus and the Roseolovirus taxonomic groups. The name sake of this family US22 is an early nuclear protein that is secreted from cells []. The US22 family may have a role in virus replication and pathogenesis [].
Probab=24.69 E-value=1.2e+02 Score=25.78 Aligned_cols=28 Identities=21% Similarity=0.286 Sum_probs=23.9
Q ss_pred CCCCEEEEEcCCCcEEEEEcCcceeEee
Q 015021 317 PSGSLAAITDSLGRILLLDTQALVVVRL 344 (414)
Q Consensus 317 P~~~laa~tD~lGRV~L~D~~~~~vvrm 344 (414)
+.-+..+..|..|||+.+|..+..+.|+
T Consensus 79 ~~~~~vvl~~~~G~Vy~yd~~~~~l~~l 106 (125)
T PF02393_consen 79 FRDRLVVLVGESGRVYAYDPEDDRLYRL 106 (125)
T ss_pred ccceEEEEEeCCCeEEEEEcCCCEEEEE
Confidence 4557888899999999999999887765
No 224
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=24.38 E-value=1.4e+02 Score=31.65 Aligned_cols=72 Identities=18% Similarity=0.117 Sum_probs=51.0
Q ss_pred CeeeeccCcceeeeeecc-------eEEEEeecCCCCCc--eeecccCCCCCCCeEEEEEeEEe-CCcEEEEEeccccEE
Q 015021 33 PNLLCALDMHTIALANRY-------QTVIINWADPEGLV--AKIRPELSPIASEYITAIEWLVF-EEMRALAVGTSRGYF 102 (414)
Q Consensus 33 ~~~~~sp~~~~la~A~~~-------~~v~~~w~~~~~~~--v~~~g~l~~~~~e~ITs~~~lpl-~dw~~I~VG~ssG~v 102 (414)
-.++..|.|++||+-++. |.+|++|+...-.. ++-||.+.. |.=..|..=|+ .+-.++-|+-++|++
T Consensus 349 q~lawDpsGeyLav~fKg~~~v~~~k~~i~~fdtr~sp~vels~cg~i~g---e~P~~IsF~pl~n~g~lLsiaWsTGri 425 (445)
T KOG2139|consen 349 QCLAWDPSGEYLAVIFKGQSFVLLCKLHISRFDTRKSPPVELSYCGMIGG---EYPAYISFGPLKNEGRLLSIAWSTGRI 425 (445)
T ss_pred ceeeECCCCCEEEEEEcCCchhhhhhhhhhhhcccccCceEEEecccccC---CCCceEEeeecccCCcEEEEEeccCce
Confidence 357888999999987765 44557776664333 456788744 33444555555 677899999999999
Q ss_pred EEEec
Q 015021 103 LVYDL 107 (414)
Q Consensus 103 rfyte 107 (414)
.-|..
T Consensus 426 q~ypl 430 (445)
T KOG2139|consen 426 QRYPL 430 (445)
T ss_pred Eeeee
Confidence 99874
No 225
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.38 E-value=3e+02 Score=30.86 Aligned_cols=40 Identities=13% Similarity=0.170 Sum_probs=30.6
Q ss_pred EECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceee
Q 015021 314 TLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCV 354 (414)
Q Consensus 314 ~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~ 354 (414)
+++|=-.++...|.+|.-|+.=+..|++|- ||=--++|..
T Consensus 470 ~V~pi~Gf~~L~d~~G~~IV~vLsSGecI~-w~Ll~~~h~~ 509 (741)
T KOG4460|consen 470 QVAPIRGFWILPDILGPTIVCILSSGECII-WPLLSTVHPA 509 (741)
T ss_pred ccccccceeeccccCCceEEEEecCCcEEE-Eeeecccccc
Confidence 788888888899999976666677887775 8776666543
No 226
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=23.81 E-value=2.7e+02 Score=27.84 Aligned_cols=49 Identities=20% Similarity=0.188 Sum_probs=30.6
Q ss_pred CceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEEeecccC
Q 015021 63 LVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLVHRQLIH 118 (414)
Q Consensus 63 ~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh 118 (414)
+++.|+-.+.. .+.+ -|..+-.-+.||..+|.|..++ ++|+++.+..+.
T Consensus 85 G~~~W~~~~~~----~~~~---~p~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~ 134 (377)
T TIGR03300 85 GKRLWRVDLDE----RLSG---GVGADGGLVFVGTEKGEVIALDAEDGKELWRAKLS 134 (377)
T ss_pred CcEeeeecCCC----Cccc---ceEEcCCEEEEEcCCCEEEEEECCCCcEeeeeccC
Confidence 45777666533 1221 1333344566888899999888 489988776543
No 227
>PF10411 DsbC_N: Disulfide bond isomerase protein N-terminus; InterPro: IPR018950 This is the N-terminal domain of the disulphide bond isomerase DsbC. The whole molecule is V-shaped, where each arm is a DsbC monomer of two domains linked by a hinge; and the N-termini of each monomer join to form the dimer interface at the base of the V, so are vital for dimerisation []. DsbC is required for disulphide bond formation and functions as a disulphide bond isomerase during oxidative protein-folding in bacterial periplasm. It also has chaperone activity []. ; PDB: 1EEJ_B 2IYJ_A 1TJD_A 1JZD_B 1JZO_A 1G0T_B 1T3B_A.
Probab=23.34 E-value=1.6e+02 Score=22.33 Aligned_cols=37 Identities=11% Similarity=0.117 Sum_probs=26.5
Q ss_pred CCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEee
Q 015021 76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHR 114 (414)
Q Consensus 76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~s 114 (414)
+=.|+++...|+++..=|.+ .+|-+.+.|++|.-|+.
T Consensus 11 ~~~v~~v~~spi~GlyeV~~--~~~~i~Y~~~dg~yli~ 47 (57)
T PF10411_consen 11 GLKVESVSPSPIPGLYEVVL--KGGGILYVDEDGRYLIQ 47 (57)
T ss_dssp T-TCEEEEE-SSTTEEEEEE---TTEEEEEETTSSEEEE
T ss_pred CCceeEEEcCCCCCeEEEEE--CCCeEEEEcCCCCEEEE
Confidence 55799999999999755555 66778778899987764
No 228
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=23.09 E-value=3.1e+02 Score=29.86 Aligned_cols=88 Identities=15% Similarity=0.011 Sum_probs=60.4
Q ss_pred ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeE-eeecc-------cccceeeEEEEEecccccccccccC
Q 015021 301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVV-RLWKG-------YRDASCVFMEMLVNKDAATSSAYYA 372 (414)
Q Consensus 301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vv-rmwKG-------yRdAq~~wi~~~~~~~~~~~~~~~~ 372 (414)
..+--+++...+++++|++++.-.++..|-|+=+|+.+|.-. .+|+- |+--+..| . .
T Consensus 136 ~~~~~H~~s~~~vals~d~~~~fsask~g~i~kw~v~tgk~~~~i~~~~ev~k~~~~~~k~~r-~--------------~ 200 (479)
T KOG0299|consen 136 RVIGKHQLSVTSVALSPDDKRVFSASKDGTILKWDVLTGKKDRYIIERDEVLKSHGNPLKESR-K--------------G 200 (479)
T ss_pred eeeccccCcceEEEeeccccceeecCCCcceeeeehhcCcccccccccchhhhhccCCCCccc-c--------------c
Confidence 346677888999999999999999999999999999999988 46653 22212222 0 0
Q ss_pred CCCCcceEE--------EEEEcCCCCeEEEeeCCCCcceee
Q 015021 373 PVKSDYCLC--------LAIHAPRKGIIEHLAGMADENWTT 405 (414)
Q Consensus 373 ~~k~~~~l~--------LvIyaprRg~lEvW~~~~g~~v~~ 405 (414)
+.+ .-|. .+....++..|-||+.++++-|-+
T Consensus 201 h~k--eil~~avS~Dgkylatgg~d~~v~Iw~~~t~ehv~~ 239 (479)
T KOG0299|consen 201 HVK--EILTLAVSSDGKYLATGGRDRHVQIWDCDTLEHVKV 239 (479)
T ss_pred ccc--eeEEEEEcCCCcEEEecCCCceEEEecCcccchhhc
Confidence 000 0112 224567778888999999887743
No 229
>PF14408 Actino_peptide: Ribosomally synthesized peptide in actinomycetes
Probab=22.65 E-value=97 Score=24.45 Aligned_cols=22 Identities=23% Similarity=0.382 Sum_probs=19.1
Q ss_pred EEEECCCCCEEEEEcCCCcEEE
Q 015021 312 RLTLSPSGSLAAITDSLGRILL 333 (414)
Q Consensus 312 ~i~~sP~~~laa~tD~lGRV~L 333 (414)
++.++|.-++++..|+.|+++-
T Consensus 5 ~~~lDP~TQ~~v~~D~~G~~ve 26 (59)
T PF14408_consen 5 RVVLDPDTQTGVYVDRDGPVVE 26 (59)
T ss_pred eeeECCCceeeEEEcCCCCcee
Confidence 5679999999999999998754
No 230
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=22.55 E-value=1.1e+02 Score=36.02 Aligned_cols=48 Identities=15% Similarity=0.317 Sum_probs=43.9
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD 350 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd 350 (414)
|--++-.+..++=||++.|.|......+|+++|..+...+++.+||-.
T Consensus 125 l~~H~~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~tF~~~~vl~~H~s 172 (942)
T KOG0973|consen 125 LRGHDSDVLDVNWSPDDSLLVSVSLDNSVIIWNAKTFELLKVLRGHQS 172 (942)
T ss_pred EecCCCccceeccCCCccEEEEecccceEEEEccccceeeeeeecccc
Confidence 555778889999999999999999999999999999999999999865
No 231
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=22.27 E-value=2.3e+02 Score=32.15 Aligned_cols=98 Identities=12% Similarity=-0.022 Sum_probs=56.0
Q ss_pred ccCCCceeeEEEECCCCC-EEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEec-cccc--ccccc--cCCCC-
Q 015021 303 LKDHPRKGERLTLSPSGS-LAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVN-KDAA--TSSAY--YAPVK- 375 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~-laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~-~~~~--~~~~~--~~~~k- 375 (414)
+..+.+..-++|..|++. .++..-.+|-|+|+|+..-. |=+=+|+..|.-...... .+.. ....+ ..+.+
T Consensus 140 ~~GH~~SvkS~cf~~~n~~vF~tGgRDg~illWD~R~n~---~d~~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s~t 216 (720)
T KOG0321|consen 140 NLGHTGSVKSECFMPTNPAVFCTGGRDGEILLWDCRCNG---VDALEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAASNT 216 (720)
T ss_pred ecccccccchhhhccCCCcceeeccCCCcEEEEEEeccc---hhhHHHHhhhhhccccCCCCCCchhhccccccccccCc
Confidence 667788889999999996 55556899999999987665 222344444433332221 0000 00000 00000
Q ss_pred --C--cceEE----EEEEcCC-CCeEEEeeCCCCcce
Q 015021 376 --S--DYCLC----LAIHAPR-KGIIEHLAGMADENW 403 (414)
Q Consensus 376 --~--~~~l~----LvIyapr-Rg~lEvW~~~~g~~v 403 (414)
+ ..-+| +++-|.- +|.|+|||++.-.+.
T Consensus 217 i~ssvTvv~fkDe~tlaSaga~D~~iKVWDLRk~~~~ 253 (720)
T KOG0321|consen 217 IFSSVTVVLFKDESTLASAGAADSTIKVWDLRKNYTA 253 (720)
T ss_pred eeeeeEEEEEeccceeeeccCCCcceEEEeecccccc
Confidence 0 01111 3556665 999999999976554
No 232
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=21.95 E-value=1.5e+02 Score=18.91 Aligned_cols=21 Identities=14% Similarity=0.254 Sum_probs=16.7
Q ss_pred EEEEcCCCcEEEEEcCcceeE
Q 015021 322 AAITDSLGRILLLDTQALVVV 342 (414)
Q Consensus 322 aa~tD~lGRV~L~D~~~~~vv 342 (414)
+.+++..|++.-+|..+|.++
T Consensus 9 v~~~~~~g~l~a~d~~~G~~~ 29 (33)
T smart00564 9 VYVGSTDGTLYALDAKTGEIL 29 (33)
T ss_pred EEEEcCCCEEEEEEcccCcEE
Confidence 444566799999999999875
No 233
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=21.90 E-value=2.9e+02 Score=28.95 Aligned_cols=84 Identities=18% Similarity=0.343 Sum_probs=46.2
Q ss_pred CCCCcccCCCeeeeccCcceee-eeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEE
Q 015021 24 GKEGWLVNDPNLLCALDMHTIA-LANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYF 102 (414)
Q Consensus 24 ~~~~WL~~~~~~~~sp~~~~la-~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~v 102 (414)
...+|..+ +..||+|..|| +++.+-.-+..=....+-..+.... ----.+++|+ .+-.+|+.|++.+=+
T Consensus 204 ~~ggwvh~---v~fs~sG~~lawv~Hds~v~~~da~~p~~~v~~~~~~-----~lP~ls~~~i--se~~vv~ag~~c~P~ 273 (361)
T KOG1523|consen 204 SSGGWVHG---VLFSPSGNRLAWVGHDSTVSFVDAAGPSERVQSVATA-----QLPLLSVSWI--SENSVVAAGYDCGPV 273 (361)
T ss_pred cCCCceee---eEeCCCCCEeeEecCCCceEEeecCCCchhccchhhc-----cCCceeeEee--cCCceeecCCCCCce
Confidence 45678877 77788888877 4555554443211110000111110 0112333333 577899999997776
Q ss_pred EEEe-cCCcEEeeccc
Q 015021 103 LVYD-LKGDLVHRQLI 117 (414)
Q Consensus 103 rfyt-e~G~LL~sQ~l 117 (414)
.|-+ +.|-|.|.-.+
T Consensus 274 lf~~~~~~~l~~~~~l 289 (361)
T KOG1523|consen 274 LFVTDEEGGLSFARRL 289 (361)
T ss_pred EEEeccccceeeehhc
Confidence 6654 77766665443
No 234
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=21.69 E-value=71 Score=36.91 Aligned_cols=40 Identities=28% Similarity=0.330 Sum_probs=33.9
Q ss_pred EEEEEeccccEEEEEecCCcEEeec-ccCccceeEEEEecc
Q 015021 91 RALAVGTSRGYFLVYDLKGDLVHRQ-LIHPGRILKLRVRGS 130 (414)
Q Consensus 91 ~~I~VG~ssG~vrfyte~G~LL~sQ-~lh~~pV~~ik~r~~ 130 (414)
-.+||||++|.|-+|+..-.+++.. ..|.+-|..|+--+.
T Consensus 438 pLvAvGT~sGTV~vvdvst~~v~~~fsvht~~VkgleW~g~ 478 (1062)
T KOG1912|consen 438 PLVAVGTNSGTVDVVDVSTNAVAASFSVHTSLVKGLEWLGN 478 (1062)
T ss_pred eeEEeecCCceEEEEEecchhhhhhhcccccceeeeeeccc
Confidence 4689999999999999988888877 689999988875553
No 235
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=21.46 E-value=2.9e+02 Score=29.93 Aligned_cols=54 Identities=22% Similarity=0.275 Sum_probs=42.1
Q ss_pred EEEEEeEEeCCcEEEEEeccccEEEEEe--cCCcEEeecccCccceeEEEEecccCC
Q 015021 79 ITAIEWLVFEEMRALAVGTSRGYFLVYD--LKGDLVHRQLIHPGRILKLRVRGSRRD 133 (414)
Q Consensus 79 ITs~~~lpl~dw~~I~VG~ssG~vrfyt--e~G~LL~sQ~lh~~pV~~ik~r~~~~~ 133 (414)
|--+.|=|.... .++||+.+|.|+-|+ ..|..++.-.-|+++|-.|-+-...++
T Consensus 332 VEkv~w~~~se~-~f~~~tddG~v~~~D~R~~~~~vwt~~AHd~~ISgl~~n~~~p~ 387 (463)
T KOG0270|consen 332 VEKVAWDPHSEN-SFFVSTDDGTVYYFDIRNPGKPVWTLKAHDDEISGLSVNIQTPG 387 (463)
T ss_pred eEEEEecCCCce-eEEEecCCceEEeeecCCCCCceeEEEeccCCcceEEecCCCCc
Confidence 444556665444 678999999999999 478899999999999999987665554
No 236
>PRK04043 tolB translocation protein TolB; Provisional
Probab=21.40 E-value=1.6e+02 Score=30.89 Aligned_cols=43 Identities=14% Similarity=0.040 Sum_probs=30.9
Q ss_pred EEECCCCCEEEEEcCC---------CcEEEEEcCcceeEeeecccccceeeE
Q 015021 313 LTLSPSGSLAAITDSL---------GRILLLDTQALVVVRLWKGYRDASCVF 355 (414)
Q Consensus 313 i~~sP~~~laa~tD~l---------GRV~L~D~~~~~vvrmwKGyRdAq~~w 355 (414)
...||+|+++|.+-.- .+|.++|+.++..-++-.+..+....|
T Consensus 324 ~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~LT~~~~~~~p~~ 375 (419)
T PRK04043 324 SSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRLTANGVNQFPRF 375 (419)
T ss_pred ceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEECCCCCCcCCeEE
Confidence 3789999866664221 489999999999888877755544333
No 237
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=21.22 E-value=4.3e+02 Score=29.44 Aligned_cols=81 Identities=21% Similarity=0.179 Sum_probs=48.4
Q ss_pred cccCCCceeeEEEECCCC--CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccc--cccCCCCCc
Q 015021 302 CLKDHPRKGERLTLSPSG--SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSS--AYYAPVKSD 377 (414)
Q Consensus 302 ~l~D~~R~~~~i~~sP~~--~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~--~~~~~~k~~ 377 (414)
.|.|++-.+. ++.-++ .|+|.....|.|+|..+.++.=---+|= +.+.+. ..+.+.|.
T Consensus 116 ~lkdh~stvt--~v~YN~~DeyiAsvs~gGdiiih~~~t~~~tt~f~~---------------~sgqsvRll~ys~skr- 177 (673)
T KOG4378|consen 116 FLKDHQSTVT--YVDYNNTDEYIASVSDGGDIIIHGTKTKQKTTTFTI---------------DSGQSVRLLRYSPSKR- 177 (673)
T ss_pred hccCCcceeE--EEEecCCcceeEEeccCCcEEEEecccCccccceec---------------CCCCeEEEeecccccc-
Confidence 4778774444 445444 5777777789999999887643222210 111000 12333342
Q ss_pred ceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021 378 YCLCLAIHAPRKGIIEHLAGMADENW 403 (414)
Q Consensus 378 ~~l~LvIyaprRg~lEvW~~~~g~~v 403 (414)
||..-|.-+|+|-+||++--..+
T Consensus 178 ---~lL~~asd~G~VtlwDv~g~sp~ 200 (673)
T KOG4378|consen 178 ---FLLSIASDKGAVTLWDVQGMSPI 200 (673)
T ss_pred ---eeeEeeccCCeEEEEeccCCCcc
Confidence 57777889999999999864433
No 238
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=21.22 E-value=3e+02 Score=30.58 Aligned_cols=33 Identities=18% Similarity=0.343 Sum_probs=27.9
Q ss_pred CCcEEEEEeccccEEEEEecCCcEEeecccCcc
Q 015021 88 EEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPG 120 (414)
Q Consensus 88 ~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~ 120 (414)
||..+++||-..|-+..|+-.=.-+-.|++.++
T Consensus 309 p~gai~~V~s~qGelQ~FD~ALspi~~qLlsEd 341 (545)
T PF11768_consen 309 PDGAIFVVGSEQGELQCFDMALSPIKMQLLSED 341 (545)
T ss_pred CCCcEEEEEcCCceEEEEEeecCccceeecccc
Confidence 788899999999999999987777777777654
No 239
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=20.91 E-value=1e+02 Score=33.61 Aligned_cols=36 Identities=14% Similarity=0.153 Sum_probs=29.9
Q ss_pred ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCc
Q 015021 303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQA 338 (414)
Q Consensus 303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~ 338 (414)
+.-++--..+|+++|+|+|.|+|-.+-+|=+||+.+
T Consensus 289 iLcH~g~V~siAv~~~G~YMaTtG~Dr~~kIWDlR~ 324 (545)
T KOG1272|consen 289 ILCHRGPVSSIAVDRGGRYMATTGLDRKVKIWDLRN 324 (545)
T ss_pred HHhcCCCcceEEECCCCcEEeecccccceeEeeecc
Confidence 444556678999999999999999999999999743
No 240
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=20.88 E-value=1.7e+02 Score=29.90 Aligned_cols=38 Identities=29% Similarity=0.324 Sum_probs=30.4
Q ss_pred CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEee
Q 015021 305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRL 344 (414)
Q Consensus 305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrm 344 (414)
...+....+.+ .|+.+.++|..||.+|+|+.+..+..+
T Consensus 64 ~~~~~~~F~al--~gskIv~~d~~~~t~vyDt~t~av~~~ 101 (342)
T PF07893_consen 64 RGPWSMDFFAL--HGSKIVAVDQSGRTLVYDTDTRAVATG 101 (342)
T ss_pred CCCceeEEEEe--cCCeEEEEcCCCCeEEEECCCCeEecc
Confidence 44455566666 788899999999999999999988843
No 241
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=20.78 E-value=1e+02 Score=36.87 Aligned_cols=69 Identities=19% Similarity=0.218 Sum_probs=47.0
Q ss_pred eeeeccCcceeeeeecceEEEEeecCCCC--CceeecccCCCCCCCeEEEEEeEEe-CCcEEEEEeccccEEEEEec
Q 015021 34 NLLCALDMHTIALANRYQTVIINWADPEG--LVAKIRPELSPIASEYITAIEWLVF-EEMRALAVGTSRGYFLVYDL 107 (414)
Q Consensus 34 ~~~~sp~~~~la~A~~~~~v~~~w~~~~~--~~v~~~g~l~~~~~e~ITs~~~lpl-~dw~~I~VG~ssG~vrfyte 107 (414)
.+.+-+.+++||-+..+.+.| |+.+++ ..+.-.+-. -+++|.+..||-| |=+--+|+|+.+-+|.+|+.
T Consensus 1309 al~VH~hapiiAsGs~q~ikI--y~~~G~~l~~~k~n~~F---~~q~~gs~scL~FHP~~~llAaG~~Ds~V~iYs~ 1380 (1387)
T KOG1517|consen 1309 ALTVHEHAPIIASGSAQLIKI--YSLSGEQLNIIKYNPGF---MGQRIGSVSCLAFHPHRLLLAAGSADSTVSIYSC 1380 (1387)
T ss_pred eeeeccCCCeeeecCcceEEE--EecChhhhcccccCccc---ccCcCCCcceeeecchhHhhhhccCCceEEEeec
Confidence 355667777777666666555 333331 112212211 2688999999999 77889999999999999994
No 242
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=20.52 E-value=1.4e+02 Score=30.89 Aligned_cols=52 Identities=25% Similarity=0.414 Sum_probs=40.9
Q ss_pred CCCCeEEEEEeEEeCCcEEEEEeccccEEEEEecC--CcEEe-ecccCccceeEEE
Q 015021 74 IASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLK--GDLVH-RQLIHPGRILKLR 126 (414)
Q Consensus 74 ~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~--G~LL~-sQ~lh~~pV~~ik 126 (414)
.+++.|++++.=| -.+..++.|-=+|.||.|..+ |.+.= .|.=|+.||+-+.
T Consensus 25 pP~DsIS~l~FSP-~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~ 79 (347)
T KOG0647|consen 25 PPEDSISALAFSP-QADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVC 79 (347)
T ss_pred CcccchheeEecc-ccCceEEecccCCceEEEEEecCCcccchhhhccCCCeEEEE
Confidence 3578999999988 577799999999999999953 44442 4566788998876
No 243
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=20.33 E-value=1.2e+02 Score=33.64 Aligned_cols=77 Identities=14% Similarity=-0.010 Sum_probs=46.4
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceee--EEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCV--FMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP 387 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~--wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap 387 (414)
.++.+-+|+|+|+|..=..|-|=.||. .+.-+|.===||||.-. =|....=..+ ++ +| ..-.
T Consensus 320 ~tsC~~nrdg~~iAagc~DGSIQ~W~~-~~~~v~p~~~vk~AH~~g~~Itsi~FS~d---------g~-----~L-lSRg 383 (641)
T KOG0772|consen 320 VTSCAWNRDGKLIAAGCLDGSIQIWDK-GSRTVRPVMKVKDAHLPGQDITSISFSYD---------GN-----YL-LSRG 383 (641)
T ss_pred ceeeecCCCcchhhhcccCCceeeeec-CCcccccceEeeeccCCCCceeEEEeccc---------cc-----hh-hhcc
Confidence 567788999999998888899999998 44444533335565433 1111100000 01 11 1223
Q ss_pred CCCeEEEeeCCCCcc
Q 015021 388 RKGIIEHLAGMADEN 402 (414)
Q Consensus 388 rRg~lEvW~~~~g~~ 402 (414)
-++.|+||++++.++
T Consensus 384 ~D~tLKvWDLrq~kk 398 (641)
T KOG0772|consen 384 FDDTLKVWDLRQFKK 398 (641)
T ss_pred CCCceeeeecccccc
Confidence 368999999999765
No 244
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=20.33 E-value=1.8e+02 Score=29.87 Aligned_cols=69 Identities=16% Similarity=0.082 Sum_probs=51.5
Q ss_pred eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021 310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK 389 (414)
Q Consensus 310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR 389 (414)
...+.+=||++.+|++-=+|||=++.-.++..+-+.|=|++ -|.=+-... +. =|.--|..+
T Consensus 254 v~gvrIRpD~KIlATAGWD~RiRVyswrtl~pLAVLkyHsa-gvn~vAfsp--d~----------------~lmAaaskD 314 (323)
T KOG0322|consen 254 VSGVRIRPDGKILATAGWDHRIRVYSWRTLNPLAVLKYHSA-GVNAVAFSP--DC----------------ELMAAASKD 314 (323)
T ss_pred ccceEEccCCcEEeecccCCcEEEEEeccCCchhhhhhhhc-ceeEEEeCC--CC----------------chhhhccCC
Confidence 45588899999999999999999999999999999999995 332222111 00 144456778
Q ss_pred CeEEEeeC
Q 015021 390 GIIEHLAG 397 (414)
Q Consensus 390 g~lEvW~~ 397 (414)
+.|-+|.+
T Consensus 315 ~rISLWkL 322 (323)
T KOG0322|consen 315 ARISLWKL 322 (323)
T ss_pred ceEEeeec
Confidence 88888875
Done!