Query 001090
Match_columns 1159
No_of_seqs 339 out of 784
Neff 3.8
Searched_HMMs 46136
Date Thu Mar 28 15:33:11 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001090.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001090hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0644 Uncharacterized conser 100.0 6E-121 1E-125 1054.2 33.2 696 1-1149 412-1113(1113)
2 PF06507 Auxin_resp: Auxin res 99.8 2.5E-20 5.5E-25 171.5 8.9 77 935-1012 6-83 (83)
3 cd05529 Bromo_WDR9_I_like Brom 99.8 3.8E-19 8.2E-24 174.6 12.6 124 1022-1148 3-127 (128)
4 cd05497 Bromo_Brdt_I_like Brom 99.7 7.9E-18 1.7E-22 160.8 9.7 80 1070-1149 26-107 (107)
5 cd05495 Bromo_cbp_like Bromodo 99.7 1.1E-17 2.4E-22 159.9 10.5 104 1043-1149 1-106 (108)
6 cd05509 Bromo_gcn5_like Bromod 99.7 4E-17 8.7E-22 153.1 9.9 80 1070-1149 22-101 (101)
7 cd05504 Bromo_Acf1_like Bromod 99.7 4E-17 8.7E-22 157.8 10.1 82 1070-1151 33-114 (115)
8 cd05508 Bromo_RACK7 Bromodomai 99.7 3.7E-17 8E-22 154.6 9.6 77 1070-1146 23-99 (99)
9 cd05507 Bromo_brd8_like Bromod 99.7 4.6E-17 1E-21 154.6 10.2 100 1046-1149 4-103 (104)
10 cd05496 Bromo_WDR9_II Bromodom 99.7 5.1E-17 1.1E-21 158.3 9.7 81 1070-1150 26-107 (119)
11 cd05516 Bromo_SNF2L2 Bromodoma 99.7 7.8E-17 1.7E-21 153.8 10.4 80 1070-1149 28-107 (107)
12 cd05510 Bromo_SPT7_like Bromod 99.7 8E-17 1.7E-21 155.4 10.3 82 1070-1151 29-111 (112)
13 cd05505 Bromo_WSTF_like Bromod 99.7 5E-17 1.1E-21 153.1 8.7 77 1070-1146 21-97 (97)
14 cd05517 Bromo_polybromo_II Bro 99.7 8.7E-17 1.9E-21 152.8 9.7 76 1070-1145 27-102 (103)
15 cd05524 Bromo_polybromo_I Brom 99.7 9.1E-17 2E-21 154.9 9.6 105 1045-1151 5-110 (113)
16 cd05515 Bromo_polybromo_V Brom 99.7 1.5E-16 3.3E-21 151.3 10.3 78 1070-1147 27-104 (105)
17 cd05501 Bromo_SP100C_like Brom 99.7 7.7E-17 1.7E-21 153.6 7.7 74 1076-1150 27-100 (102)
18 cd05502 Bromo_tif1_like Bromod 99.7 2.7E-16 5.8E-21 150.1 11.2 103 1043-1150 2-107 (109)
19 cd05500 Bromo_BDF1_2_I Bromodo 99.7 2.2E-16 4.7E-21 149.3 10.4 101 1042-1146 1-103 (103)
20 cd05520 Bromo_polybromo_III Br 99.7 1.5E-16 3.2E-21 151.2 9.2 76 1070-1145 27-102 (103)
21 cd05503 Bromo_BAZ2A_B_like Bro 99.7 1.5E-16 3.2E-21 149.1 8.7 77 1070-1146 21-97 (97)
22 cd05518 Bromo_polybromo_IV Bro 99.7 1.9E-16 4.1E-21 150.6 9.2 76 1070-1145 27-102 (103)
23 cd05519 Bromo_SNF2 Bromodomain 99.7 2.8E-16 6.1E-21 148.6 9.8 99 1046-1145 4-102 (103)
24 cd05511 Bromo_TFIID Bromodomai 99.7 1.8E-16 3.9E-21 152.6 8.6 81 1070-1150 21-101 (112)
25 smart00297 BROMO bromo domain. 99.7 4.4E-16 9.5E-21 145.2 10.6 81 1069-1149 27-107 (107)
26 cd05528 Bromo_AAA Bromodomain; 99.6 5.8E-16 1.3E-20 149.3 10.7 103 1046-1152 4-110 (112)
27 cd05521 Bromo_Rsc1_2_I Bromodo 99.6 5.6E-16 1.2E-20 148.3 9.5 76 1070-1147 28-103 (106)
28 cd05525 Bromo_ASH1 Bromodomain 99.6 5.8E-16 1.3E-20 148.1 9.2 76 1070-1145 29-104 (106)
29 cd05513 Bromo_brd7_like Bromod 99.6 5.4E-16 1.2E-20 146.5 8.3 74 1071-1144 23-96 (98)
30 cd05499 Bromo_BDF1_2_II Bromod 99.6 5E-16 1.1E-20 146.4 7.4 76 1071-1146 25-102 (102)
31 cd05512 Bromo_brd1_like Bromod 99.6 9.3E-16 2E-20 144.7 8.7 73 1070-1142 22-94 (98)
32 KOG1474 Transcription initiati 99.6 1.1E-15 2.3E-20 183.3 8.2 102 1045-1150 222-325 (640)
33 cd05498 Bromo_Brdt_II_like Bro 99.6 1.6E-15 3.4E-20 142.7 7.5 70 1077-1146 33-102 (102)
34 cd05522 Bromo_Rsc1_2_II Bromod 99.6 3.1E-15 6.7E-20 142.3 9.4 75 1070-1144 28-102 (104)
35 cd05506 Bromo_plant1 Bromodoma 99.6 1.6E-15 3.4E-20 141.8 7.3 70 1077-1146 30-99 (99)
36 cd05492 Bromo_ZMYND11 Bromodom 99.6 5.1E-15 1.1E-19 142.7 7.9 74 1077-1150 34-107 (109)
37 cd04369 Bromodomain Bromodomai 99.5 2.6E-14 5.7E-19 128.4 7.3 77 1070-1146 23-99 (99)
38 PF00439 Bromodomain: Bromodom 99.5 3.7E-14 8E-19 127.0 6.8 68 1070-1137 17-84 (84)
39 cd05526 Bromo_polybromo_VI Bro 99.4 7.9E-13 1.7E-17 127.9 9.7 78 1070-1149 30-107 (110)
40 COG5076 Transcription factor i 99.4 7.4E-13 1.6E-17 149.7 8.9 81 1071-1151 170-250 (371)
41 KOG1245 Chromatin remodeling c 98.9 1.2E-09 2.5E-14 140.3 7.1 79 1070-1149 1322-1400(1404)
42 cd05491 Bromo_TBP7_like Bromod 98.9 1.4E-09 3E-14 106.8 4.3 41 1088-1128 63-103 (119)
43 KOG0263 Transcription initiati 98.8 1.1E-08 2.4E-13 122.5 8.2 85 1-85 551-641 (707)
44 KOG1472 Histone acetyltransfer 98.6 1.8E-08 3.9E-13 121.8 3.5 80 1071-1150 628-707 (720)
45 KOG0273 Beta-transducin family 98.5 7.6E-08 1.6E-12 110.9 5.8 63 1-63 426-488 (524)
46 KOG0955 PHD finger protein BR1 98.3 4.2E-07 9.2E-12 114.0 5.7 86 1065-1150 580-666 (1051)
47 KOG0266 WD40 repeat-containing 98.3 9.3E-07 2E-11 102.8 7.2 60 1-60 262-321 (456)
48 KOG0266 WD40 repeat-containing 98.2 1.6E-06 3.4E-11 100.9 7.4 80 1-80 219-305 (456)
49 PTZ00421 coronin; Provisional 98.2 1.8E-06 3.8E-11 102.1 7.2 60 1-60 142-201 (493)
50 KOG1827 Chromatin remodeling c 98.1 6.5E-06 1.4E-10 98.9 8.7 108 1039-1146 46-155 (629)
51 KOG0272 U4/U6 small nuclear ri 98.1 3.4E-06 7.3E-11 96.7 5.8 60 1-60 319-378 (459)
52 KOG0279 G protein beta subunit 98.1 4.4E-06 9.6E-11 92.1 6.1 63 1-63 79-141 (315)
53 KOG0386 Chromatin remodeling c 98.1 3.3E-06 7.3E-11 104.4 5.2 105 1045-1150 1027-1131(1157)
54 KOG0295 WD40 repeat-containing 98.0 8.2E-06 1.8E-10 92.4 7.3 70 1-70 308-378 (406)
55 KOG0282 mRNA splicing factor [ 98.0 3.6E-06 7.8E-11 97.6 3.6 75 1-75 231-311 (503)
56 KOG1445 Tumor-specific antigen 98.0 5.2E-06 1.1E-10 98.5 4.4 80 1-80 694-781 (1012)
57 KOG0008 Transcription initiati 98.0 5.8E-06 1.3E-10 104.3 4.5 71 1076-1146 1409-1479(1563)
58 PF00400 WD40: WD domain, G-be 97.8 3E-05 6.4E-10 59.9 5.2 39 17-55 1-39 (39)
59 PLN00181 protein SPA1-RELATED; 97.8 2.6E-05 5.7E-10 96.1 7.5 60 1-60 549-609 (793)
60 KOG0279 G protein beta subunit 97.8 1.9E-05 4E-10 87.4 5.3 60 1-60 166-225 (315)
61 KOG0286 G-protein beta subunit 97.8 2.8E-05 6.1E-10 86.5 6.5 60 1-60 160-220 (343)
62 PTZ00420 coronin; Provisional 97.8 3.6E-05 7.9E-10 92.7 7.4 59 1-60 142-200 (568)
63 KOG2111 Uncharacterized conser 97.8 4.3E-05 9.3E-10 85.7 6.7 59 1-59 197-258 (346)
64 KOG0271 Notchless-like WD40 re 97.7 4.4E-05 9.4E-10 87.2 5.5 60 1-60 131-190 (480)
65 KOG0286 G-protein beta subunit 97.7 4.6E-05 9.9E-10 84.8 5.5 60 1-60 203-262 (343)
66 PTZ00421 coronin; Provisional 97.6 7.7E-05 1.7E-09 88.5 6.8 60 1-60 92-159 (493)
67 KOG0772 Uncharacterized conser 97.6 4.5E-05 9.8E-10 89.4 4.3 80 1-80 285-381 (641)
68 cd00200 WD40 WD40 domain, foun 97.6 0.00015 3.2E-09 71.8 6.9 59 1-59 25-83 (289)
69 cd00200 WD40 WD40 domain, foun 97.6 0.00014 3.1E-09 71.8 6.9 60 1-60 67-126 (289)
70 KOG0282 mRNA splicing factor [ 97.6 8.8E-05 1.9E-09 86.5 5.7 59 1-60 274-333 (503)
71 TIGR03866 PQQ_ABC_repeats PQQ- 97.5 0.00022 4.9E-09 74.0 8.0 59 1-60 5-64 (300)
72 KOG0271 Notchless-like WD40 re 97.5 6.9E-05 1.5E-09 85.6 4.4 77 1-77 383-461 (480)
73 cd05494 Bromodomain_1 Bromodom 97.5 5.5E-05 1.2E-09 74.1 2.6 79 1045-1127 3-90 (114)
74 KOG0647 mRNA export protein (c 97.5 0.00013 2.8E-09 81.7 5.3 59 2-60 45-105 (347)
75 PTZ00420 coronin; Provisional 97.5 0.00019 4.1E-09 86.7 7.1 60 1-60 91-159 (568)
76 KOG0263 Transcription initiati 97.5 0.00011 2.3E-09 89.3 4.8 58 1-58 593-650 (707)
77 KOG0278 Serine/threonine kinas 97.4 0.00023 4.9E-09 78.3 6.3 61 2-62 241-302 (334)
78 KOG0315 G-protein beta subunit 97.4 0.00024 5.2E-09 78.1 6.2 60 1-60 231-291 (311)
79 KOG0319 WD40-repeat-containing 97.4 0.00021 4.6E-09 86.6 6.3 61 1-61 521-581 (775)
80 KOG1472 Histone acetyltransfer 97.4 0.00012 2.5E-09 89.7 4.1 61 1074-1134 311-371 (720)
81 KOG0277 Peroxisomal targeting 97.3 0.00028 6.1E-09 77.7 6.0 59 1-59 121-180 (311)
82 PF08662 eIF2A: Eukaryotic tra 97.3 0.00028 6.2E-09 73.8 5.8 64 6-72 124-193 (194)
83 KOG1474 Transcription initiati 97.3 7E-05 1.5E-09 91.4 1.4 75 1078-1152 23-97 (640)
84 KOG0275 Conserved WD40 repeat- 97.3 0.00014 3E-09 81.8 3.0 80 1-80 279-365 (508)
85 KOG0771 Prolactin regulatory e 97.3 0.00025 5.4E-09 81.6 5.1 99 1-107 160-263 (398)
86 KOG0303 Actin-binding protein 97.3 0.00047 1E-08 79.4 6.6 59 1-60 148-206 (472)
87 KOG0318 WD40 repeat stress pro 97.2 0.00049 1.1E-08 81.3 6.6 60 1-60 206-268 (603)
88 KOG0291 WD40-repeat-containing 97.2 0.00041 8.9E-09 84.4 5.9 57 1-57 494-550 (893)
89 KOG0315 G-protein beta subunit 97.2 0.0005 1.1E-08 75.7 5.8 84 1-85 14-106 (311)
90 KOG0645 WD40 repeat protein [G 97.2 0.00072 1.6E-08 75.0 7.0 55 1-55 166-223 (312)
91 PLN00181 protein SPA1-RELATED; 97.2 0.00055 1.2E-08 84.8 6.8 59 1-59 676-740 (793)
92 KOG1034 Transcriptional repres 97.1 0.00063 1.4E-08 77.0 6.2 58 1-58 152-212 (385)
93 KOG0293 WD40 repeat-containing 97.1 0.00037 7.9E-09 80.5 4.4 58 1-58 456-514 (519)
94 KOG0275 Conserved WD40 repeat- 97.1 0.00039 8.4E-09 78.4 4.4 62 1-62 322-383 (508)
95 KOG0272 U4/U6 small nuclear ri 97.1 0.00019 4.2E-09 82.8 2.1 92 1-92 235-335 (459)
96 KOG0273 Beta-transducin family 97.1 0.00037 7.9E-09 81.4 4.2 58 1-59 251-308 (524)
97 KOG1407 WD40 repeat protein [F 97.1 0.00099 2.1E-08 73.8 7.0 57 1-58 81-137 (313)
98 KOG1446 Histone H3 (Lys4) meth 97.1 0.0011 2.3E-08 74.6 7.3 95 1-97 116-224 (311)
99 KOG0301 Phospholipase A2-activ 97.1 0.00087 1.9E-08 81.1 6.7 56 1-57 194-249 (745)
100 KOG1828 IRF-2-binding protein 97.0 8.1E-05 1.8E-09 85.1 -2.2 72 1073-1144 43-114 (418)
101 KOG1828 IRF-2-binding protein 97.0 0.00033 7.2E-09 80.3 2.5 65 1075-1140 234-298 (418)
102 KOG1273 WD40 repeat protein [G 97.0 0.00064 1.4E-08 76.7 4.6 60 1-60 39-98 (405)
103 KOG0316 Conserved WD40 repeat- 97.0 0.0008 1.7E-08 73.7 4.9 60 1-60 33-92 (307)
104 TIGR03866 PQQ_ABC_repeats PQQ- 97.0 0.0018 3.9E-08 67.4 7.2 57 4-61 226-283 (300)
105 KOG0296 Angio-associated migra 96.9 0.0014 3.1E-08 74.9 6.6 60 1-60 80-139 (399)
106 KOG0008 Transcription initiati 96.9 0.00093 2E-08 85.3 5.6 76 1068-1143 1280-1355(1563)
107 KOG0281 Beta-TrCP (transducin 96.9 0.00034 7.3E-09 79.5 1.6 58 1-60 374-431 (499)
108 KOG0293 WD40 repeat-containing 96.9 0.00064 1.4E-08 78.6 3.6 61 1-61 285-346 (519)
109 KOG2110 Uncharacterized conser 96.9 0.0017 3.8E-08 74.3 6.9 60 1-60 189-251 (391)
110 KOG1332 Vesicle coat complex C 96.9 0.00087 1.9E-08 73.7 4.3 73 1-74 74-150 (299)
111 KOG0265 U5 snRNP-specific prot 96.9 0.0012 2.5E-08 74.2 5.3 142 1-142 106-284 (338)
112 KOG0646 WD40 repeat protein [G 96.9 0.0015 3.3E-08 76.3 6.2 62 1-62 192-312 (476)
113 KOG0291 WD40-repeat-containing 96.8 0.0016 3.4E-08 79.6 6.3 87 1-92 366-455 (893)
114 KOG0289 mRNA splicing factor [ 96.8 0.0023 5E-08 74.5 7.0 82 2-85 364-445 (506)
115 KOG0302 Ribosome Assembly prot 96.7 0.0025 5.3E-08 73.3 6.3 58 1-59 274-334 (440)
116 KOG2096 WD40 repeat protein [G 96.7 0.0035 7.5E-08 71.1 7.2 111 1-112 203-335 (420)
117 KOG0295 WD40 repeat-containing 96.7 0.0026 5.5E-08 72.9 6.2 60 1-60 209-268 (406)
118 KOG0319 WD40-repeat-containing 96.6 0.0032 6.8E-08 76.9 6.2 64 1-64 479-542 (775)
119 KOG0264 Nucleosome remodeling 96.6 0.0046 1E-07 72.0 7.3 72 1-72 289-368 (422)
120 KOG0265 U5 snRNP-specific prot 96.6 0.0043 9.3E-08 69.8 6.8 61 1-61 63-124 (338)
121 KOG0316 Conserved WD40 repeat- 96.5 0.0052 1.1E-07 67.6 7.0 60 1-60 75-134 (307)
122 KOG0300 WD40 repeat-containing 96.5 0.0019 4E-08 73.0 3.6 57 2-58 165-221 (481)
123 KOG0973 Histone transcription 96.4 0.0039 8.5E-08 78.3 5.9 58 1-58 145-202 (942)
124 KOG0296 Angio-associated migra 96.4 0.0051 1.1E-07 70.5 6.3 60 2-61 165-224 (399)
125 KOG0269 WD40 repeat-containing 96.4 0.0029 6.4E-08 77.5 4.5 59 1-59 150-209 (839)
126 KOG0289 mRNA splicing factor [ 96.4 0.0083 1.8E-07 70.1 7.9 91 1-94 319-411 (506)
127 KOG0640 mRNA cleavage stimulat 96.4 0.0019 4E-08 72.9 2.6 62 1-62 277-340 (430)
128 KOG0318 WD40 repeat stress pro 96.4 0.0061 1.3E-07 72.4 6.9 59 2-60 460-520 (603)
129 KOG0650 WD40 repeat nucleolar 96.4 0.0056 1.2E-07 73.6 6.5 72 1-73 416-489 (733)
130 KOG0308 Conserved WD40 repeat- 96.3 0.0054 1.2E-07 74.2 6.1 62 1-62 187-248 (735)
131 KOG0283 WD40 repeat-containing 96.3 0.0053 1.2E-07 75.5 5.8 59 1-60 384-443 (712)
132 KOG0288 WD40 repeat protein Ti 96.2 0.0072 1.6E-07 70.2 6.3 60 1-60 316-420 (459)
133 KOG4283 Transcription-coupled 96.2 0.0045 9.8E-08 69.6 4.6 60 1-60 205-279 (397)
134 KOG0274 Cdc4 and related F-box 96.2 0.0048 1E-07 74.5 5.1 58 1-59 345-402 (537)
135 smart00320 WD40 WD40 repeats. 96.2 0.015 3.2E-07 40.4 5.4 40 16-55 1-40 (40)
136 KOG0310 Conserved WD40 repeat- 96.1 0.0084 1.8E-07 70.6 6.3 106 1-111 170-289 (487)
137 KOG0285 Pleiotropic regulator 96.1 0.0096 2.1E-07 68.4 6.2 60 1-60 167-226 (460)
138 COG2319 FOG: WD40 repeat [Gene 96.0 0.017 3.6E-07 59.3 7.3 59 2-60 172-232 (466)
139 KOG1539 WD repeat protein [Gen 96.0 0.0051 1.1E-07 76.0 4.0 58 3-60 552-609 (910)
140 KOG0270 WD40 repeat-containing 96.0 0.0091 2E-07 69.8 5.6 57 1-57 260-317 (463)
141 KOG0639 Transducin-like enhanc 96.0 0.0064 1.4E-07 71.9 4.3 77 1-77 525-602 (705)
142 KOG0302 Ribosome Assembly prot 96.0 0.012 2.5E-07 68.0 6.3 59 1-59 318-380 (440)
143 KOG0294 WD40 repeat-containing 96.0 0.01 2.2E-07 67.4 5.5 60 1-60 57-118 (362)
144 KOG0267 Microtubule severing p 95.9 0.003 6.4E-08 77.0 1.5 56 2-57 87-142 (825)
145 KOG0645 WD40 repeat protein [G 95.9 0.015 3.3E-07 64.9 6.7 60 1-60 77-138 (312)
146 PF08662 eIF2A: Eukaryotic tra 95.9 0.014 3.1E-07 61.3 6.3 55 3-60 79-136 (194)
147 KOG2048 WD40 repeat protein [G 95.9 0.016 3.4E-07 70.5 7.4 97 1-97 84-191 (691)
148 KOG1274 WD40 repeat protein [G 95.9 0.007 1.5E-07 75.4 4.4 58 1-58 154-263 (933)
149 KOG0267 Microtubule severing p 95.9 0.011 2.4E-07 72.3 5.9 60 1-60 128-187 (825)
150 KOG1446 Histone H3 (Lys4) meth 95.9 0.018 3.9E-07 65.1 7.1 56 5-60 207-265 (311)
151 KOG0285 Pleiotropic regulator 95.8 0.015 3.2E-07 66.9 6.2 61 1-61 251-311 (460)
152 COG2319 FOG: WD40 repeat [Gene 95.8 0.031 6.6E-07 57.5 7.8 56 3-58 130-187 (466)
153 KOG0284 Polyadenylation factor 95.8 0.0089 1.9E-07 69.4 4.3 57 1-57 238-294 (464)
154 KOG0281 Beta-TrCP (transducin 95.8 0.0081 1.8E-07 68.7 3.9 81 1-87 211-291 (499)
155 KOG1009 Chromatin assembly com 95.7 0.014 3E-07 67.8 5.6 61 1-61 81-157 (434)
156 KOG1009 Chromatin assembly com 95.7 0.011 2.4E-07 68.6 4.7 58 1-58 30-96 (434)
157 KOG0283 WD40 repeat-containing 95.6 0.014 2.9E-07 72.1 5.4 56 1-57 426-481 (712)
158 PRK01742 tolB translocation pr 95.6 0.024 5.1E-07 65.8 7.1 76 4-80 181-264 (429)
159 KOG0292 Vesicle coat complex C 95.6 0.02 4.3E-07 71.5 6.4 58 1-58 109-166 (1202)
160 PF11768 DUF3312: Protein of u 95.5 0.022 4.7E-07 68.7 6.4 57 2-60 276-332 (545)
161 KOG0276 Vesicle coat complex C 95.5 0.033 7.1E-07 67.6 7.8 80 1-80 113-201 (794)
162 KOG0290 Conserved WD40 repeat- 95.5 0.024 5.1E-07 64.0 6.2 58 2-59 168-229 (364)
163 KOG1036 Mitotic spindle checkp 95.5 0.0095 2.1E-07 67.2 3.2 48 1-48 248-295 (323)
164 KOG0274 Cdc4 and related F-box 95.5 0.023 5E-07 68.8 6.7 59 2-62 306-364 (537)
165 KOG0973 Histone transcription 95.5 0.018 3.9E-07 72.7 5.9 59 1-59 85-161 (942)
166 KOG0300 WD40 repeat-containing 95.5 0.011 2.4E-07 67.1 3.6 77 1-78 288-372 (481)
167 KOG0292 Vesicle coat complex C 95.5 0.0099 2.2E-07 74.0 3.4 80 1-80 25-110 (1202)
168 KOG0321 WD40 repeat-containing 95.5 0.012 2.6E-07 71.2 4.0 58 1-58 116-176 (720)
169 KOG0278 Serine/threonine kinas 95.4 0.027 5.8E-07 62.6 6.2 54 1-56 159-212 (334)
170 KOG0308 Conserved WD40 repeat- 95.4 0.021 4.5E-07 69.4 5.7 58 1-58 229-286 (735)
171 KOG0268 Sof1-like rRNA process 95.4 0.016 3.4E-07 66.8 4.4 212 1-223 83-338 (433)
172 KOG2110 Uncharacterized conser 95.4 0.025 5.5E-07 65.2 6.0 77 6-82 152-237 (391)
173 KOG0269 WD40 repeat-containing 95.3 0.014 3E-07 71.9 4.1 59 2-60 194-253 (839)
174 KOG1407 WD40 repeat protein [F 95.2 0.017 3.7E-07 64.4 3.8 68 1-68 205-272 (313)
175 KOG0644 Uncharacterized conser 95.2 0.012 2.5E-07 73.0 2.8 70 1075-1145 91-190 (1113)
176 PF02239 Cytochrom_D1: Cytochr 95.0 0.046 1E-06 63.1 7.0 91 4-94 13-112 (369)
177 KOG0643 Translation initiation 95.0 0.021 4.6E-07 63.8 3.9 62 2-63 164-226 (327)
178 KOG0305 Anaphase promoting com 94.9 0.04 8.6E-07 66.1 6.2 54 6-59 410-463 (484)
179 KOG0268 Sof1-like rRNA process 94.9 0.028 6.2E-07 64.8 4.6 80 1-80 204-332 (433)
180 KOG0294 WD40 repeat-containing 94.7 0.05 1.1E-06 62.0 5.7 60 1-60 101-160 (362)
181 KOG0313 Microtubule binding pr 94.6 0.047 1E-06 63.3 5.3 54 1-55 362-416 (423)
182 KOG2048 WD40 repeat protein [G 94.4 0.076 1.6E-06 64.9 6.7 59 1-60 170-236 (691)
183 KOG0646 WD40 repeat protein [G 94.3 0.035 7.5E-07 65.5 3.6 54 5-58 101-154 (476)
184 KOG0276 Vesicle coat complex C 94.3 0.027 5.9E-07 68.2 2.8 58 1-58 201-258 (794)
185 KOG1408 WD40 repeat protein [F 94.2 0.037 8E-07 67.9 3.7 56 1-56 657-712 (1080)
186 PF02239 Cytochrom_D1: Cytochr 94.1 0.074 1.6E-06 61.5 5.9 61 3-64 54-115 (369)
187 KOG2139 WD40 repeat protein [G 94.1 0.064 1.4E-06 62.0 5.2 54 2-55 212-266 (445)
188 KOG4378 Nuclear protein COP1 [ 94.1 0.055 1.2E-06 64.4 4.7 68 2-69 182-251 (673)
189 KOG0307 Vesicle coat complex C 94.1 0.049 1.1E-06 69.4 4.6 57 2-58 85-148 (1049)
190 KOG2055 WD40 repeat protein [G 94.1 0.082 1.8E-06 62.5 6.1 104 1-104 229-347 (514)
191 KOG4328 WD40 protein [Function 94.0 0.053 1.2E-06 63.9 4.4 56 1-56 339-398 (498)
192 KOG0264 Nucleosome remodeling 93.9 0.079 1.7E-06 62.2 5.5 58 1-58 244-304 (422)
193 COG5076 Transcription factor i 93.8 0.019 4.2E-07 66.1 0.5 76 1074-1149 288-363 (371)
194 KOG0313 Microtubule binding pr 93.7 0.1 2.2E-06 60.6 6.0 59 1-60 119-180 (423)
195 KOG1408 WD40 repeat protein [F 93.7 0.11 2.3E-06 64.1 6.3 80 1-80 612-700 (1080)
196 KOG1332 Vesicle coat complex C 93.7 0.14 3.1E-06 57.0 6.6 60 1-60 27-91 (299)
197 KOG4378 Nuclear protein COP1 [ 93.6 0.081 1.8E-06 63.0 4.9 59 1-60 225-283 (673)
198 KOG2055 WD40 repeat protein [G 93.6 0.11 2.4E-06 61.5 6.0 57 2-58 361-418 (514)
199 KOG2394 WD40 protein DMR-N9 [G 93.5 0.039 8.6E-07 66.0 2.2 55 2-56 307-361 (636)
200 PRK05137 tolB translocation pr 93.5 0.18 3.8E-06 58.7 7.5 73 6-79 181-261 (435)
201 PRK05137 tolB translocation pr 93.4 0.18 4E-06 58.5 7.5 74 6-80 225-306 (435)
202 cd05493 Bromo_ALL-1 Bromodomai 93.4 0.11 2.4E-06 52.9 4.9 64 1089-1152 59-122 (131)
203 KOG0310 Conserved WD40 repeat- 93.4 0.13 2.9E-06 61.0 6.2 89 1-90 127-225 (487)
204 KOG0305 Anaphase promoting com 93.4 0.16 3.4E-06 61.2 6.9 56 5-60 195-250 (484)
205 KOG2321 WD40 repeat protein [G 93.3 0.14 3E-06 62.1 6.2 59 2-60 192-261 (703)
206 KOG0284 Polyadenylation factor 93.1 0.093 2E-06 61.4 4.4 79 1-79 196-280 (464)
207 KOG0299 U3 snoRNP-associated p 92.9 0.16 3.4E-06 60.2 5.8 63 1-63 158-238 (479)
208 KOG0306 WD40-repeat-containing 92.8 0.16 3.4E-06 63.0 5.9 89 5-93 85-182 (888)
209 KOG1587 Cytoplasmic dynein int 92.8 0.068 1.5E-06 65.1 2.8 84 1-85 415-509 (555)
210 KOG0647 mRNA export protein (c 92.4 0.18 3.8E-06 57.5 5.1 59 1-60 88-148 (347)
211 PRK04922 tolB translocation pr 92.3 0.19 4.2E-06 58.4 5.6 73 6-79 183-263 (433)
212 KOG0288 WD40 repeat protein Ti 92.3 0.066 1.4E-06 62.6 1.7 62 1-62 191-254 (459)
213 KOG1445 Tumor-specific antigen 92.2 0.13 2.9E-06 62.6 4.1 56 3-58 146-201 (1012)
214 PRK04922 tolB translocation pr 92.1 0.25 5.3E-06 57.6 6.2 56 4-60 267-327 (433)
215 PRK03629 tolB translocation pr 91.8 0.41 9E-06 56.0 7.6 54 6-60 222-278 (429)
216 PRK00178 tolB translocation pr 91.8 0.48 1.1E-05 54.6 8.0 71 8-79 180-258 (430)
217 KOG1036 Mitotic spindle checkp 91.6 0.21 4.5E-06 56.9 4.5 57 2-60 30-86 (323)
218 KOG2106 Uncharacterized conser 91.5 0.39 8.4E-06 57.7 6.9 38 27-64 447-485 (626)
219 PRK11028 6-phosphogluconolacto 91.4 0.35 7.6E-06 53.5 6.1 55 2-57 7-65 (330)
220 KOG1063 RNA polymerase II elon 91.4 0.34 7.4E-06 59.8 6.4 73 8-83 553-625 (764)
221 KOG2445 Nuclear pore complex c 91.4 0.31 6.8E-06 55.7 5.7 56 1-56 77-143 (361)
222 KOG0306 WD40-repeat-containing 91.3 0.19 4.2E-06 62.3 4.3 58 1-58 608-665 (888)
223 KOG4547 WD40 repeat-containing 91.2 0.36 7.8E-06 58.5 6.3 57 2-60 119-175 (541)
224 KOG0649 WD40 repeat protein [G 91.2 0.41 8.8E-06 53.6 6.2 61 1-61 130-190 (325)
225 TIGR02800 propeller_TolB tol-p 91.2 0.47 1E-05 53.8 6.9 73 6-79 169-249 (417)
226 KOG0642 Cell-cycle nuclear pro 91.1 0.25 5.4E-06 59.7 4.8 61 1-61 360-430 (577)
227 TIGR02800 propeller_TolB tol-p 91.0 0.55 1.2E-05 53.3 7.3 53 7-60 214-269 (417)
228 KOG0299 U3 snoRNP-associated p 91.0 0.22 4.8E-06 59.0 4.2 57 1-57 218-274 (479)
229 KOG0641 WD40 repeat protein [G 91.0 0.48 1E-05 52.4 6.4 60 1-60 198-264 (350)
230 PRK03629 tolB translocation pr 91.0 0.39 8.5E-06 56.2 6.3 52 8-60 268-322 (429)
231 PRK00178 tolB translocation pr 90.9 0.58 1.3E-05 54.0 7.5 53 7-60 223-278 (430)
232 PRK02889 tolB translocation pr 90.7 0.45 9.8E-06 55.5 6.4 73 6-79 175-255 (427)
233 PRK11028 6-phosphogluconolacto 90.6 0.36 7.8E-06 53.4 5.3 54 3-57 144-205 (330)
234 KOG2394 WD40 protein DMR-N9 [G 90.6 0.23 5.1E-06 59.8 3.9 36 25-60 288-323 (636)
235 KOG2096 WD40 repeat protein [G 90.5 0.26 5.6E-06 56.6 4.0 58 2-59 245-310 (420)
236 TIGR02658 TTQ_MADH_Hv methylam 90.4 0.71 1.5E-05 53.8 7.5 57 7-64 279-337 (352)
237 KOG1539 WD repeat protein [Gen 90.3 0.6 1.3E-05 58.7 7.2 58 3-60 220-278 (910)
238 KOG1524 WD40 repeat-containing 90.2 0.39 8.5E-06 58.0 5.3 52 1-53 202-282 (737)
239 KOG0643 Translation initiation 90.1 0.5 1.1E-05 53.4 5.7 62 1-62 26-87 (327)
240 KOG2919 Guanine nucleotide-bin 90.1 0.32 6.9E-06 56.0 4.3 182 1-192 127-323 (406)
241 PRK02889 tolB translocation pr 90.1 0.92 2E-05 53.0 8.2 73 7-80 220-300 (427)
242 KOG0772 Uncharacterized conser 89.9 0.48 1E-05 57.1 5.7 79 1-80 333-426 (641)
243 KOG0277 Peroxisomal targeting 89.6 0.61 1.3E-05 52.4 5.9 57 1-58 164-222 (311)
244 KOG1240 Protein kinase contain 89.4 0.66 1.4E-05 60.4 6.7 86 1-91 1065-1163(1431)
245 KOG3914 WD repeat protein WDR4 88.9 0.35 7.5E-06 56.6 3.6 54 2-56 127-180 (390)
246 KOG4227 WD40 repeat protein [G 88.9 0.99 2.1E-05 53.0 7.1 58 1-58 121-180 (609)
247 KOG1310 WD40 repeat protein [G 88.9 0.72 1.6E-05 56.0 6.2 58 1-58 66-126 (758)
248 PF15492 Nbas_N: Neuroblastoma 88.7 0.38 8.3E-06 54.3 3.7 48 28-76 44-91 (282)
249 KOG1587 Cytoplasmic dynein int 88.7 0.35 7.5E-06 59.2 3.6 58 2-59 459-518 (555)
250 PRK01742 tolB translocation pr 88.7 0.56 1.2E-05 54.7 5.1 52 4-56 267-321 (429)
251 KOG0301 Phospholipase A2-activ 88.5 0.4 8.6E-06 59.2 3.8 52 2-56 117-168 (745)
252 KOG2445 Nuclear pore complex c 88.0 0.56 1.2E-05 53.8 4.4 54 1-55 243-316 (361)
253 KOG1273 WD40 repeat protein [G 87.9 1.2 2.6E-05 51.4 6.8 60 1-60 81-186 (405)
254 PRK04792 tolB translocation pr 87.9 1.1 2.3E-05 53.0 6.8 52 8-60 243-297 (448)
255 PF10313 DUF2415: Uncharacteri 87.8 0.88 1.9E-05 38.6 4.3 31 28-58 1-34 (43)
256 KOG1274 WD40 repeat protein [G 87.6 0.94 2E-05 57.5 6.3 59 2-60 113-171 (933)
257 smart00333 TUDOR Tudor domain. 87.3 1.4 3.1E-05 37.3 5.4 52 967-1033 2-53 (57)
258 TIGR02658 TTQ_MADH_Hv methylam 87.3 1.9 4.2E-05 50.3 8.2 88 7-96 27-142 (352)
259 KOG0974 WD-repeat protein WDR6 87.3 0.63 1.4E-05 59.5 4.6 56 1-58 191-247 (967)
260 KOG4497 Uncharacterized conser 87.3 1.2 2.7E-05 51.6 6.5 137 3-142 67-228 (447)
261 PF07676 PD40: WD40-like Beta 86.7 1.4 2.9E-05 34.8 4.7 33 22-54 3-38 (39)
262 smart00743 Agenet Tudor-like d 86.6 1.7 3.6E-05 37.7 5.6 38 967-1017 2-39 (61)
263 PRK04043 tolB translocation pr 86.5 2.4 5.2E-05 50.1 8.6 74 6-80 212-293 (419)
264 KOG0307 Vesicle coat complex C 86.4 0.62 1.3E-05 60.0 3.9 60 1-60 270-330 (1049)
265 PRK04792 tolB translocation pr 85.4 1.4 3.1E-05 52.0 6.1 56 4-60 281-341 (448)
266 KOG1334 WD40 repeat protein [G 85.3 0.52 1.1E-05 56.5 2.4 56 1-56 158-216 (559)
267 KOG1188 WD40 repeat protein [G 85.1 1.5 3.2E-05 51.0 5.7 105 5-111 48-154 (376)
268 KOG2314 Translation initiation 84.9 1.3 2.9E-05 53.9 5.5 51 7-58 282-335 (698)
269 KOG0640 mRNA cleavage stimulat 84.8 0.74 1.6E-05 52.9 3.2 69 1-69 188-259 (430)
270 KOG0974 WD-repeat protein WDR6 84.7 1.3 2.7E-05 56.9 5.4 60 2-62 150-210 (967)
271 KOG3881 Uncharacterized conser 84.6 1.1 2.4E-05 52.5 4.6 59 2-60 221-280 (412)
272 COG4946 Uncharacterized protei 84.5 2.7 5.8E-05 50.8 7.6 57 3-60 377-434 (668)
273 KOG0642 Cell-cycle nuclear pro 83.9 1.1 2.5E-05 54.4 4.4 57 1-57 310-374 (577)
274 KOG4547 WD40 repeat-containing 83.8 1.2 2.7E-05 54.1 4.6 52 5-56 162-218 (541)
275 KOG1524 WD40 repeat-containing 83.6 0.68 1.5E-05 56.1 2.4 52 3-55 81-132 (737)
276 KOG1063 RNA polymerase II elon 83.3 1.4 3.1E-05 54.6 4.9 90 1-90 588-687 (764)
277 PRK01029 tolB translocation pr 83.2 2.3 5.1E-05 50.1 6.6 53 7-60 351-406 (428)
278 KOG1034 Transcriptional repres 82.7 1.3 2.7E-05 51.4 3.9 59 2-60 110-169 (385)
279 KOG1272 WD40-repeat-containing 82.5 0.89 1.9E-05 54.4 2.8 56 3-58 269-324 (545)
280 PF00930 DPPIV_N: Dipeptidyl p 82.5 1.3 2.7E-05 50.5 3.9 53 5-59 21-73 (353)
281 KOG1538 Uncharacterized conser 82.3 1.7 3.7E-05 53.8 5.1 51 5-55 31-81 (1081)
282 KOG1645 RING-finger-containing 82.2 2.4 5.3E-05 50.2 6.0 58 2-60 211-269 (463)
283 PRK04043 tolB translocation pr 82.2 2.6 5.7E-05 49.7 6.5 55 5-60 255-312 (419)
284 PF15492 Nbas_N: Neuroblastoma 81.8 3.3 7.1E-05 47.2 6.7 51 27-77 229-280 (282)
285 PF04053 Coatomer_WDAD: Coatom 81.3 2.7 5.8E-05 50.4 6.2 48 6-56 125-172 (443)
286 KOG0649 WD40 repeat protein [G 81.2 2.1 4.5E-05 48.2 4.8 24 1-24 172-195 (325)
287 KOG4283 Transcription-coupled 80.5 3 6.6E-05 47.9 5.9 57 3-59 164-221 (397)
288 KOG4328 WD40 protein [Function 80.1 1.4 3.1E-05 52.6 3.3 60 1-60 251-312 (498)
289 KOG3914 WD repeat protein WDR4 79.5 0.91 2E-05 53.3 1.5 24 1-24 209-232 (390)
290 PRK01029 tolB translocation pr 78.8 5.7 0.00012 47.0 7.7 56 5-60 301-362 (428)
291 KOG0641 WD40 repeat protein [G 78.8 4.7 0.0001 45.0 6.5 56 1-56 247-302 (350)
292 KOG2106 Uncharacterized conser 78.7 4 8.7E-05 49.6 6.4 54 1-55 261-314 (626)
293 KOG2315 Predicted translation 78.1 2.7 5.8E-05 51.3 4.8 40 6-47 335-374 (566)
294 PF06003 SMN: Survival motor n 77.8 2.4 5.2E-05 47.7 4.1 56 965-1033 66-121 (264)
295 PF05641 Agenet: Agenet domain 77.4 5.4 0.00012 35.9 5.5 41 968-1018 1-41 (68)
296 KOG1963 WD40 repeat protein [G 76.7 3.8 8.1E-05 52.0 5.6 54 1-56 267-321 (792)
297 KOG0639 Transducin-like enhanc 76.6 1.9 4.1E-05 52.2 3.0 56 2-58 568-623 (705)
298 KOG1963 WD40 repeat protein [G 75.1 4.8 0.0001 51.1 6.0 60 1-60 221-284 (792)
299 KOG0321 WD40 repeat-containing 74.9 5.4 0.00012 49.6 6.2 67 1-72 233-313 (720)
300 KOG1188 WD40 repeat protein [G 74.3 3.7 8.1E-05 47.9 4.4 55 4-58 140-197 (376)
301 KOG4714 Nucleoporin [Nuclear s 74.3 1.6 3.4E-05 49.5 1.4 56 2-57 197-254 (319)
302 KOG1007 WD repeat protein TSSC 74.2 4.9 0.00011 46.2 5.3 56 5-60 191-248 (370)
303 KOG1007 WD repeat protein TSSC 73.7 6.2 0.00013 45.5 5.9 59 1-59 231-291 (370)
304 PF00400 WD40: WD domain, G-be 73.5 1.4 3.1E-05 33.8 0.7 13 1-13 27-39 (39)
305 KOG0303 Actin-binding protein 72.6 4.9 0.00011 47.7 4.9 94 1-94 98-207 (472)
306 KOG1354 Serine/threonine prote 71.0 11 0.00024 44.4 7.1 119 7-147 293-428 (433)
307 COG0823 TolB Periplasmic compo 70.7 8.2 0.00018 46.1 6.4 73 7-80 218-298 (425)
308 KOG3881 Uncharacterized conser 70.2 7.6 0.00016 46.0 5.7 57 2-60 166-237 (412)
309 COG3386 Gluconolactonase [Carb 70.1 14 0.00031 42.5 7.8 49 10-58 144-194 (307)
310 PF09465 LBR_tudor: Lamin-B re 69.8 10 0.00022 34.0 5.1 42 965-1018 3-44 (55)
311 PF13360 PQQ_2: PQQ-like domai 69.4 8.1 0.00018 40.1 5.3 57 5-62 1-59 (238)
312 KOG0280 Uncharacterized conser 67.5 12 0.00026 43.2 6.4 64 1-66 182-251 (339)
313 KOG4532 WD40-like repeat conta 66.8 10 0.00022 43.5 5.6 74 2-75 220-302 (344)
314 PF12894 Apc4_WD40: Anaphase-p 66.2 11 0.00023 32.4 4.4 30 27-56 11-40 (47)
315 KOG0650 WD40 repeat nucleolar 64.5 3.5 7.5E-05 50.9 1.6 130 6-144 586-726 (733)
316 KOG4640 Anaphase-promoting com 64.4 16 0.00034 45.7 7.0 34 31-64 66-99 (665)
317 TIGR03300 assembly_YfgL outer 64.1 18 0.00038 41.2 7.1 56 3-62 247-302 (377)
318 PLN02919 haloacid dehalogenase 62.8 16 0.00035 48.4 7.2 58 3-60 821-891 (1057)
319 KOG4227 WD40 repeat protein [G 62.7 12 0.00026 44.5 5.4 62 1-62 72-140 (609)
320 PF07433 DUF1513: Protein of u 62.6 34 0.00073 39.8 8.9 95 9-104 30-140 (305)
321 KOG3621 WD40 repeat-containing 62.3 14 0.00031 46.6 6.2 56 3-58 94-155 (726)
322 PF07569 Hira: TUP1-like enhan 61.2 19 0.00041 39.4 6.3 74 5-79 30-117 (219)
323 PLN02919 haloacid dehalogenase 61.0 19 0.00041 47.7 7.4 92 3-95 758-893 (1057)
324 PF13360 PQQ_2: PQQ-like domai 59.3 13 0.00029 38.5 4.7 58 3-62 42-99 (238)
325 KOG2079 Vacuolar assembly/sort 58.8 16 0.00034 48.1 5.9 53 5-57 150-205 (1206)
326 PF10282 Lactonase: Lactonase, 57.5 23 0.00049 40.4 6.5 53 5-57 265-322 (345)
327 KOG1272 WD40-repeat-containing 57.3 10 0.00022 45.9 3.7 51 1-52 309-359 (545)
328 KOG1064 RAVE (regulator of V-A 56.8 11 0.00023 51.8 4.2 94 1-98 2224-2329(2439)
329 KOG1310 WD40 repeat protein [G 56.4 15 0.00032 45.4 4.9 56 1-56 640-695 (758)
330 KOG2695 WD40 repeat protein [G 55.5 19 0.00041 42.5 5.3 63 1-63 315-382 (425)
331 COG4246 Uncharacterized protei 54.0 17 0.00037 41.6 4.6 66 27-121 73-138 (340)
332 KOG4849 mRNA cleavage factor I 53.5 8.7 0.00019 45.0 2.3 72 89-175 293-368 (498)
333 KOG3617 WD40 and TPR repeat-co 53.2 24 0.00053 45.6 6.1 52 7-58 81-132 (1416)
334 PF14655 RAB3GAP2_N: Rab3 GTPa 52.8 24 0.00053 42.4 5.9 38 25-62 305-342 (415)
335 KOG1912 WD40 repeat protein [G 51.9 18 0.00038 46.3 4.7 62 1-62 83-148 (1062)
336 PRK02888 nitrous-oxide reducta 51.8 28 0.00061 43.9 6.4 54 6-59 295-353 (635)
337 PF08450 SGL: SMP-30/Gluconola 48.7 43 0.00094 35.8 6.5 50 7-58 115-165 (246)
338 PF08450 SGL: SMP-30/Gluconola 48.4 56 0.0012 35.0 7.3 108 7-115 60-201 (246)
339 KOG0732 AAA+-type ATPase conta 48.4 8.5 0.00018 50.6 1.4 76 1079-1154 550-635 (1080)
340 cd04508 TUDOR Tudor domains ar 44.0 37 0.00079 27.8 4.0 34 971-1017 1-34 (48)
341 PF02897 Peptidase_S9_N: Proly 44.0 51 0.0011 38.1 6.6 57 3-61 146-214 (414)
342 KOG1523 Actin-related protein 43.0 46 0.001 39.1 5.9 144 20-183 3-162 (361)
343 KOG2002 TPR-containing nuclear 41.4 19 0.00042 46.9 2.9 50 289-344 940-989 (1018)
344 COG4946 Uncharacterized protei 40.4 73 0.0016 39.3 7.1 76 5-80 421-505 (668)
345 PRK13616 lipoprotein LpqB; Pro 40.2 36 0.00077 42.6 4.9 45 7-55 430-474 (591)
346 PF04841 Vps16_N: Vps16, N-ter 39.6 68 0.0015 38.1 6.9 34 27-60 216-249 (410)
347 KOG2315 Predicted translation 39.6 95 0.0021 38.7 8.0 77 8-85 252-333 (566)
348 KOG4190 Uncharacterized conser 38.6 33 0.00072 42.5 4.1 57 1-59 751-814 (1034)
349 KOG1409 Uncharacterized conser 38.3 26 0.00056 41.4 3.0 59 1-59 213-272 (404)
350 KOG0771 Prolactin regulatory e 38.0 33 0.00072 41.0 3.9 42 25-66 279-320 (398)
351 TIGR03300 assembly_YfgL outer 37.0 69 0.0015 36.5 6.2 58 3-62 71-128 (377)
352 KOG2066 Vacuolar assembly/sort 36.8 43 0.00093 43.1 4.8 53 1-55 87-144 (846)
353 PF07433 DUF1513: Protein of u 36.0 43 0.00094 38.9 4.4 43 4-46 74-117 (305)
354 PF04762 IKI3: IKI3 family; I 35.2 61 0.0013 42.7 6.0 29 27-55 120-148 (928)
355 KOG1240 Protein kinase contain 34.8 38 0.00083 45.4 4.0 54 7-60 1173-1228(1431)
356 KOG0270 WD40 repeat-containing 34.5 59 0.0013 39.5 5.2 59 1-59 303-362 (463)
357 KOG2111 Uncharacterized conser 34.3 94 0.002 36.7 6.6 76 6-81 158-244 (346)
358 KOG4649 PQQ (pyrrolo-quinoline 33.8 64 0.0014 37.3 5.1 56 3-58 69-124 (354)
359 PRK11138 outer membrane biogen 33.4 1.1E+02 0.0023 35.6 7.0 56 3-62 262-317 (394)
360 KOG1523 Actin-related protein 32.9 62 0.0013 38.1 4.9 60 1-60 71-133 (361)
361 COG3391 Uncharacterized conser 32.8 1E+02 0.0022 36.1 6.8 54 5-59 138-192 (381)
362 KOG1064 RAVE (regulator of V-A 32.5 22 0.00048 49.1 1.5 65 1-75 2352-2420(2439)
363 COG5170 CDC55 Serine/threonine 30.7 40 0.00088 39.5 3.0 57 3-60 240-312 (460)
364 PF03178 CPSF_A: CPSF A subuni 30.7 1.3E+02 0.0029 33.7 7.1 72 6-77 139-228 (321)
365 COG0823 TolB Periplasmic compo 30.2 53 0.0012 39.5 4.0 48 8-56 263-311 (425)
366 KOG2727 Rab3 GTPase-activating 30.2 41 0.00089 44.0 3.2 29 32-60 326-354 (1244)
367 TIGR02276 beta_rpt_yvtn 40-res 30.0 52 0.0011 25.7 2.7 25 37-61 1-26 (42)
368 KOG4497 Uncharacterized conser 29.0 77 0.0017 37.6 4.8 99 8-115 192-292 (447)
369 KOG0322 G-protein beta subunit 28.9 88 0.0019 36.2 5.2 74 1-74 221-304 (323)
370 KOG2919 Guanine nucleotide-bin 28.6 95 0.0021 36.8 5.4 57 3-59 226-283 (406)
371 KOG3026 Splicing factor SPF30 28.6 47 0.001 37.6 2.9 29 967-999 90-118 (262)
372 PF11715 Nup160: Nucleoporin N 28.3 37 0.00081 41.1 2.4 22 2-23 235-256 (547)
373 PF03022 MRJP: Major royal jel 28.2 1.5E+02 0.0033 33.7 7.0 54 8-61 35-100 (287)
374 PF06977 SdiA-regulated: SdiA- 27.3 54 0.0012 36.8 3.2 55 30-87 24-79 (248)
375 PF08940 DUF1918: Domain of un 27.3 1.3E+02 0.0029 27.5 4.9 41 969-1018 4-44 (58)
376 KOG2130 Phosphatidylserine-spe 27.2 1.1E+02 0.0025 36.0 5.7 96 43-142 130-270 (407)
377 PF00930 DPPIV_N: Dipeptidyl p 26.9 60 0.0013 37.2 3.6 55 5-59 61-136 (353)
378 KOG1538 Uncharacterized conser 26.9 1.4E+02 0.0031 38.2 6.8 98 11-108 431-540 (1081)
379 PF05764 YL1: YL1 nuclear prot 25.8 1E+02 0.0023 34.4 5.0 29 171-199 25-53 (240)
380 PRK11138 outer membrane biogen 25.1 2.1E+02 0.0047 33.1 7.6 63 3-65 75-146 (394)
381 KOG0290 Conserved WD40 repeat- 24.4 1.1E+02 0.0025 35.8 5.0 52 7-58 266-319 (364)
382 KOG2066 Vacuolar assembly/sort 24.3 97 0.0021 40.2 4.9 38 38-75 82-124 (846)
383 PF15390 DUF4613: Domain of un 23.8 1.3E+02 0.0028 38.2 5.6 32 24-55 152-184 (671)
384 KOG1517 Guanine nucleotide bin 23.3 1.4E+02 0.003 40.2 6.0 72 3-76 1227-1304(1387)
385 KOG1517 Guanine nucleotide bin 23.3 1.2E+02 0.0025 40.8 5.3 57 1-58 1273-1334(1387)
386 PF14583 Pectate_lyase22: Olig 22.8 1.4E+02 0.003 36.0 5.6 51 6-56 319-380 (386)
387 KOG2079 Vacuolar assembly/sort 22.3 55 0.0012 43.5 2.4 57 5-63 107-166 (1206)
388 PF03088 Str_synth: Strictosid 22.2 1.1E+02 0.0024 29.6 3.9 43 2-45 32-74 (89)
389 PF11768 DUF3312: Protein of u 21.7 1.6E+02 0.0035 36.9 5.9 72 9-80 238-316 (545)
390 KOG0280 Uncharacterized conser 21.7 60 0.0013 37.9 2.2 38 1-40 227-266 (339)
391 PF10282 Lactonase: Lactonase, 21.6 2.9E+02 0.0062 31.7 7.7 53 5-58 60-118 (345)
392 PF01731 Arylesterase: Arylest 21.5 2.5E+02 0.0054 27.1 6.0 46 7-55 36-82 (86)
393 KOG2321 WD40 repeat protein [G 20.8 71 0.0015 40.1 2.7 48 8-56 295-342 (703)
394 KOG2041 WD40 repeat protein [G 20.6 85 0.0018 40.4 3.3 55 2-56 88-144 (1189)
395 COG5354 Uncharacterized protei 20.5 1E+02 0.0022 38.3 3.9 51 5-56 106-159 (561)
396 cd00028 B_lectin Bulb-type man 20.5 4.4E+02 0.0095 25.7 7.6 75 6-86 30-108 (116)
397 KOG4532 WD40-like repeat conta 20.2 2.1E+02 0.0046 33.4 6.0 58 3-60 134-191 (344)
No 1
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=100.00 E-value=6.3e-121 Score=1054.15 Aligned_cols=696 Identities=49% Similarity=0.761 Sum_probs=518.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCcccee
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQ 80 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLir 80 (1159)
|||||||.++||||+.|.+|++|..++++++|||||+|||+||++|++|+|+|+|+|++++++.++|+|||++||||||+
T Consensus 412 msag~dgst~iwdi~eg~pik~y~~gh~kl~d~kFSqdgts~~lsd~hgql~i~g~gqs~s~k~ak~dqffl~dyrplir 491 (1113)
T KOG0644|consen 412 MSAGYDGSTIIWDIWEGIPIKHYFIGHGKLVDGKFSQDGTSIALSDDHGQLYILGTGQSKSQKKAKYDQFFLGDYRPLIR 491 (1113)
T ss_pred hhccCCCceEeeecccCCcceeeecccceeeccccCCCCceEecCCCCCceEEeccCCCccccccccceEeecCcccccc
Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCCCccccccCcCCcccCCCCCcccCCCCCCCchhHHhhhhhhhcccCccccCCccccccCCCCcCCCccccccccCcc
Q 001090 81 DTYGNVLDQETQLAPHRRNLQDPLCDSAMIPYPEPYQTMYQQRRLGALGIEWRPSSLKLAVGPDFSLDQGYQLQPLADLD 160 (1159)
Q Consensus 81 D~~g~vlDe~Tq~~PHL~~~~~~L~D~~~~PYp~~~Q~~~~~rrl~~l~~e~~p~~~~~~vg~~~~~~~~~~~~p~~~~~ 160 (1159)
|.||||+|++||++||+ +||++|||.+|+|||.+||++||+||++|+ +|+|+||
T Consensus 492 dTn~~vldqeTq~~ph~-~~p~~l~ds~~iphpe~yqtl~q~rr~sa~---------------------dh~mp~l---- 545 (1113)
T KOG0644|consen 492 DTNGYVLDQETQLAPHR-NPPDFLCDSDMIPHPEPYQTLFQGRRLSAL---------------------DHLMPPL---- 545 (1113)
T ss_pred cccchhhhhHhhhcccc-CCCCceeccCCCcCCchhhhcccccccchh---------------------hhcCCcc----
Confidence 99999999999999994 779999999999999999999999999988 7888888
Q ss_pred cccCCCccccccccCCcccccccCCCCCcccccccccccccccccCCCCCCCCCCcCCcCCCCCCCccchhhhhhccccc
Q 001090 161 VMIDPLPEFIDVMDWEPENEVQSDDNDSEYNVAEEYSTEEKGSLSSTSSGDSECSAEDSEDGENPMDGLRRSKRKKQKAE 240 (1159)
Q Consensus 161 ~~~~~~~e~~d~~~wepe~ev~sdd~dseyn~~ee~ss~e~~~~~~~ss~~~~~s~~~s~~~~~~~~~lrrs~rkk~k~~ 240 (1159)
||+|+|+||||.+|+||+||||||..+|++
T Consensus 546 -------~fia~~~~E~e~~Vls~~nds~~~~~~es~------------------------------------------- 575 (1113)
T KOG0644|consen 546 -------EFIATMDWEPEEQVLSDQNDSEYNRSPESL------------------------------------------- 575 (1113)
T ss_pred -------cchhhhccccchhhhhhcccccccCCcccc-------------------------------------------
Confidence 999999999999999999999999966511
Q ss_pred eeeeeccccccccccccccCCCcCCCCccccccCCCcccccccCCCccccchHHHHHHHHhhhhhhccCCCCCCCCCCCC
Q 001090 241 VEIMTSSGRRVKRRILDESEGNAAFGNKRTRKSGNRQKSSRRKSSTSKSLRPQRAAARNARSFFSKITGASTDGEDVDGS 320 (1159)
Q Consensus 241 ~~~~tssgrrv~krnlde~~~~~~~~~~r~rk~~~~~k~s~~kssk~k~~rpqr~aarnal~~fski~~~~~~~ed~d~~ 320 (1159)
+|.+|. .+.-+|+++.+.|.|-+++|++++|+.||+|||||||+++||||+|++.+.+.
T Consensus 576 ----------------seldg~-isq~krt~n~r~g~k~s~~~hg~s~ss~~~R~~~Rn~~~~isk~sg~~~~~~s---- 634 (1113)
T KOG0644|consen 576 ----------------SELDGM-ISQLKRTQNQRMGAKQSKRKHGLSKSSRPPRAAARNASSDISKISGISLDRKS---- 634 (1113)
T ss_pred ----------------hhhhhh-hhhhhhhhccccccchhhccCCCCccccCCcccccccchhhccCCCccccccc----
Confidence 888888 88899999999999999999999999999999999999999999999998763
Q ss_pred CCCCCcccccccccccchhhhhhhhhhhhhhcccCcccccccccccCCCCCcccccccCCceEEEEeeccCCCccccccc
Q 001090 321 EGELSESESDLQDSYIESEESGRSLLNEQRKHSKGKGISLDDSEDVTKLDTPESHVNAGIRRLVLKLPVRDSNKHELQER 400 (1159)
Q Consensus 321 ~~~~s~s~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~e~~~n~~~~rlvl~~~~rd~~~~~~~~~ 400 (1159)
++.+..+|+.....-+..+.+.+++ +.+.++++.+ ..+.
T Consensus 635 ----g~~e~~~q~~~~~p~eq~~~~~~~~--------------------------------~wR~e~~vce-----~leg 673 (1113)
T KOG0644|consen 635 ----GQVEGVTQMHQNAPSEQIATLRTLQ--------------------------------AWRREVVVCE-----ELEG 673 (1113)
T ss_pred ----cCCccchhhccCCCHHHHhhhhHHH--------------------------------HhhcccCccc-----cccc
Confidence 4455555555333333333322111 1111111111 0111
Q ss_pred cccccccccceeccCchhhhccccCCCCcccccCCCCCCCCcccccccccCCCCcccccccccccCcccCCceeeccccc
Q 001090 401 TSDKCNQLVSVIGTSSEAHQEATEGNGNRVSYVGNNCSSVDANCGLMERRGRGQFDKLEDYLNLSNGYKDGKIRWGGVRA 480 (1159)
Q Consensus 401 ~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~g~v~~ 480 (1159)
++++.+.- .+.. ..+-. ..+||+++.
T Consensus 674 t~pe~ed~--------------------------------------------~~~~-dtdev-~rng~kngR-------- 699 (1113)
T KOG0644|consen 674 TFPELEDE--------------------------------------------AMAV-DTDEV-ERNGYKNGR-------- 699 (1113)
T ss_pred cCcccccc--------------------------------------------cccc-ccchh-hhcCCCccc--------
Confidence 11100000 0000 00000 011222211
Q ss_pred ccccccccccCCCCCCCCCCCcccCcccchhhcccCCCCCCCCCCCCCcccccccCCCCCCCccccccccccCCCCCCcc
Q 001090 481 RSSKRLKIGEMMPLDANNGSGIHLDDDKEKESEVNGHVKPEKDGIDISCGEEITNCGDNTDEVPLKNVKNLSGENNDVYS 560 (1159)
Q Consensus 481 ~~~k~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 560 (1159)
.- ++++.++ .|+
T Consensus 700 -Kr-k~~l~~~-------~Sn----------------------------------------------------------- 711 (1113)
T KOG0644|consen 700 -KR-KLELRHP-------LSN----------------------------------------------------------- 711 (1113)
T ss_pred -cc-chhccCc-------ccc-----------------------------------------------------------
Confidence 00 0111100 000
Q ss_pred CCcccccccCCccccccCCCCccccCCCCCCCCCCCCCCCCCCCCcccccccCCceeeeeecccccccCccccCCCCCcc
Q 001090 561 GDASCKEQQSGFSELNYYDESKCVNTTDEDTTPYPNHLQNGTIQPSELKEILTPVSTKLRIRSKRILRDADVENQNNGCD 640 (1159)
Q Consensus 561 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 640 (1159)
.+ ..++.- .. .|.+++....+.|+. +.
T Consensus 712 --------ks----------------------------d~vtlV--~~---------d~~~~~~~~s~aP~~------~l 738 (1113)
T KOG0644|consen 712 --------KS----------------------------DSVTLV--SQ---------DLADQSTCVSRAPNG------GL 738 (1113)
T ss_pred --------cc----------------------------ccchhh--hh---------hhhcccceeeeCCcc------cc
Confidence 00 000000 00 011222222232321 10
Q ss_pred cccCCCCCCCCCCCCcccccCCCCCcccCCCC--CCcccccccccCCCCCCCCCCCccCCCCCCCCCCCCcchhhHHHhh
Q 001090 641 ALHGSSLDIKPNSLPEVLESDGTNRTSSDRGA--DGSQRLNAQIDSTSEHDPLGSHSHSHDPLGSHSHSHSRKMFNVVYR 718 (1159)
Q Consensus 641 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~v~r 718 (1159)
..-++.+ +..+...++||+.++.....+.. ||..+..+++..+. ++..+.... |.+..++ ||
T Consensus 739 d~lNDgf--sD~kiD~t~D~se~~gs~~Eis~Rndrr~eS~~~esds~--------SSa~sv~l~--dgsi~~~----ts 802 (1113)
T KOG0644|consen 739 DSLNDGF--SDVKIDLTLDCSEDSGSGEEISLRNDRRSESNPEESDSL--------SSAYSVWLA--DGSINLQ----TS 802 (1113)
T ss_pred ccccccc--ccccccchhhhhhcCCcccceeeecchhhccCcccccch--------hhhhheecc--cCCcccc----cc
Confidence 1011111 22334566677777666655554 55555544433333 122223333 3333333 32
Q ss_pred c-ccccccCCCCCCCCCCCCCccccCCCCCCCCCCCCCccccccccccccccCCCcccccchhccCCCCCccccCCCCCC
Q 001090 719 R-SKTNRDRTNSEGDGGGVGESTLNANNNNFHESATDGSRRTRSMGLKTTTCDPDNVSSNLRLEQHNQPEDMYSGHNRST 797 (1159)
Q Consensus 719 r-~~~~~~~~~~~~~~~~~~e~~~~~n~~~~~~~~~~~~~~~~s~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 797 (1159)
| +|- .|.+ .+
T Consensus 803 rrgRi--~k~~-------------------------------------------------------------------~s 813 (1113)
T KOG0644|consen 803 RRGRI--TKFC-------------------------------------------------------------------SS 813 (1113)
T ss_pred ccccc--cccC-------------------------------------------------------------------cc
Confidence 2 222 2222 12
Q ss_pred CCCCCCc--ccccCCCcccccccccccccccccCCCCCcccccCccccccccccccccccCCCCCCCCCCCCeEEEechh
Q 001090 798 SRCQLPH--EEWGSSSKMTVGLRSTRNRRTSYLFCDSSPIDRRKTHQSLRKGSWLMLSTHEEGSRYIPQLGDEVVYLRQG 875 (1159)
Q Consensus 798 ~~~~~~~--ee~~s~s~~~~~~rs~~~r~~~~~~p~~~~~~~r~~~~~~~~~sWL~~~~~~~~spYVPQ~GDEVVYfRQG 875 (1159)
+...|+. ++|.++.+. ..+.|++.. -...+..-..+|+++.+++||++++|+++|+|||||||||+|||||
T Consensus 814 t~a~Lt~e~~~~p~~~~~-----~s~s~s~~~--~~p~~~~~~~~~~~~~~~sWltls~hee~~ryipQmgDEViyfrQg 886 (1113)
T KOG0644|consen 814 TEAELTTELSSPPCSQDK-----GSGSKSHKR--STPAPLENMKPQQLFEKPSWLTLSIHEEGCRYIPQMGDEVIYFRQG 886 (1113)
T ss_pred chhhhhhcccCCCccccc-----cccccCccc--CCCccccccchhhhcCCceeEEeehhhcCCcccccccceeehhhhh
Confidence 2222222 334444444 223333332 2233555566789999999999999999999999999999999999
Q ss_pred HHHHHHHhccCCCCCccccc-CCCcceeeEEEEEEEEEecCCCCccceeeEEeeecCCCCCCCCeEEEEcCCCCCCCceE
Q 001090 876 HQEYINYSGSREVGPWITVK-GNIRAVEFCKVESLEYATASGSGDSCCKMTLKFIDPTSSVSNMTFRLTLPEVTGFPDFL 954 (1159)
Q Consensus 876 HeeYLeavr~~~~~PW~~lk-~~LR~~E~CKVvgIkYeI~Pps~tscCkLtL~fLdpaSg~tg~~FsV~Y~p~~d~pDFL 954 (1159)
||+||++++.++. +|++.+ .++-++|.|||+.|.|..+|+++++||+|+|+||||++.+..++|+|+||++.++||||
T Consensus 887 hqeyl~~~~~n~~-~~~~~~p~~~~~v~~~kv~kl~~~~y~~~~~s~c~m~l~~idp~s~~~~k~F~ltlpdlv~fpDFl 965 (1113)
T KOG0644|consen 887 HQEYLEAVRLNNI-ELNNKEPWNKMAVEICKVEKLVYITYPGSGDSCCKMKLAVIDPASKLMDKSFKLTLPDLVTFPDFL 965 (1113)
T ss_pred hHHHHhhhhhccc-cccccCcccccchhhheeeeeeeeeccCCCcchheeeeeeecchhhhhhccceeecccccCcchhh
Confidence 9999999987644 444332 34778999999999999999999999999999999999999999999999999999999
Q ss_pred eehhhHHHHhccCCCCCCeEEEeeccCCCCCCcEEEEEEEEecCCCCCCCCCCcceeEEEEcCCCCcccccCCCccccCC
Q 001090 955 VERTRFDAAIQRNWTCRDKCKVWWKNESDEDGSWWDGRVLSVKPKSSEFPDSPWERYTVQYKTEPTETHLHSPWELFDSD 1034 (1159)
Q Consensus 955 Vlr~rYdkAm~rnWsvGDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp~wPdSpWe~L~VrWDn~~~E~eRVSPWEIEpid 1034 (1159)
|++++|++|++|||+.||+|++||++++.++|+||+|+|.+++|++|+||+|||+||.|+||++ |++.+||||+++++
T Consensus 966 V~rsrYd~AiQrnW~~~d~crvwwrda~~e~g~WWeG~ils~~pksp~fpdSpwery~v~~~~~--e~~~~spwe~~~i~ 1043 (1113)
T KOG0644|consen 966 VERSRYDAAIQRNWTCRDKCRVWWRDAGEEDGAWWEGRILSVKPKSPDFPDSPWERYIVRYDNT--ETELHSPWEMEPIP 1043 (1113)
T ss_pred hhhhhHHHHHhhccccccceeEEEccCCCcCCceeeeeeeeccCCCCCCCCCcceeEEEEecCC--cccccCccccCCCc
Confidence 9999999999999999999999999988889999999999999999999999999999999999 99999999999752
Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHhhccchhhhhhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHH
Q 001090 1035 TQWEQPRIDDDNRNKLLSAFAKLEQSANRVQDQYGVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDI 1114 (1159)
Q Consensus 1035 ~p~e~P~id~E~r~kLlsal~klees~~r~~DryGI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DV 1114 (1159)
+| |.+.++.|..|+.|+.||||+|||++++|.+||
T Consensus 1044 ---------de------------------------------------~~~~fpvpls~evi~~rlEn~yYrs~e~~~hdv 1078 (1113)
T KOG0644|consen 1044 ---------DE------------------------------------VDNRFPVPLSLEVIRSRLENNYYRSQEALRHDV 1078 (1113)
T ss_pred ---------cc------------------------------------cCCCCCCcccHHHHHHHHHhhhhhhhHhhhcch
Confidence 11 556789999999999999999999999999999
Q ss_pred HHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1115 AVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1115 rLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
..|++||++|++-+..+.-++..|..||.+.|.++
T Consensus 1079 s~mlsnae~~fg~~~~~~~ki~~l~~~~~~Tlpk~ 1113 (1113)
T KOG0644|consen 1079 SVMLSNAETFFGRNKNVAIKISFLSPWFDRTLPKL 1113 (1113)
T ss_pred hhhhcccceeecccccHHHHhhhcchhhhhhccCC
Confidence 99999999999999999999999999999998764
No 2
>PF06507 Auxin_resp: Auxin response factor; InterPro: IPR010525 This pattern represents a conserved region of auxin-responsive transcription factors. The plant hormone auxin (indole-3-acetic acid) can regulate the gene expression of several families, including Aux/IAA, GH3 and SAUR families. Two related families of proteins, Aux/IAA proteins (IPR003311 from INTERPRO) and the auxin response factors (ARF), are key regulators of auxin-modulated gene expression []. There are multiple ARF proteins, some of which activate, while others repress transcription. ARF proteins bind to auxin-responsive cis-acting promoter elements (AuxREs) using an N-terminal DNA-binding domain. It is thought that Aux/IAA proteins activate transcription by modifying ARF activity through the C-terminal protein-protein interaction domains (IPR011525 from INTERPRO) found in both Aux/IAA and ARF proteins. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0009725 response to hormone stimulus, 0005634 nucleus
Probab=99.82 E-value=2.5e-20 Score=171.49 Aligned_cols=77 Identities=18% Similarity=0.485 Sum_probs=73.4
Q ss_pred CCCCeEEEEcCCCCCCCceEeehhhHHHHhccCCCCCCeEEEeeccCCCCCCcEEEEEEEEecCCCC-CCCCCCcceeE
Q 001090 935 VSNMTFRLTLPEVTGFPDFLVERTRFDAAIQRNWTCRDKCKVWWKNESDEDGSWWDGRVLSVKPKSS-EFPDSPWERYT 1012 (1159)
Q Consensus 935 ~tg~~FsV~Y~p~~d~pDFLVlr~rYdkAm~rnWsvGDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp-~wPdSpWe~L~ 1012 (1159)
.++..|.|+|+|+++.+||||++++|++|++.+|++||||+|.||+||. ..++|.|||++|.+.+| +||+|+|+||+
T Consensus 6 ~~~~~F~V~Y~PRa~~sEFVV~~~k~~~al~~~~~~GmRfkM~fE~eds-~~~~~~GtI~~v~~~dp~~w~~S~WR~Lq 83 (83)
T PF06507_consen 6 ATGSPFEVFYYPRASPSEFVVPASKYDKALNHPWSVGMRFKMRFETEDS-SERRWQGTIVGVSDLDPIRWPGSKWRMLQ 83 (83)
T ss_pred hcCCeEEEEECCCCCCcceEEEHHHHHHHhcCCCCCCcEEEEEeccCCC-ccceeeeEEeEeeccCCCCCCCCCcccCc
Confidence 4789999999999999999999999999999999999999999999986 67788999999999999 99999999985
No 3
>cd05529 Bromo_WDR9_I_like Bromodomain; WDR9 repeat I_like subfamily. WDR9 is a human gene located in the Down Syndrome critical region-2 of chromosome 21. It encodes for a nuclear protein containing WD40 repeats and two bromodomains, which may function as a transcriptional regulator involved in chromatin remodeling and play a role in embryonic development. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.80 E-value=3.8e-19 Score=174.61 Aligned_cols=124 Identities=44% Similarity=0.713 Sum_probs=105.5
Q ss_pred ccccCCCccccCCCCCCCCCCCHHHHHHHHHHHHHHHHhhccchhhhhhhhhcccc-ccCcccccCCCCCCHHHHHHHHH
Q 001090 1022 THLHSPWELFDSDTQWEQPRIDDDNRNKLLSAFAKLEQSANRVQDQYGVQKLKQVS-QKTNFTNRFPVPLSLDVIQSRLE 1100 (1159)
Q Consensus 1022 ~eRVSPWEIEpid~p~e~P~id~E~r~kLlsal~klees~~r~~DryGI~~L~~~~-e~PdY~~iIk~PMDLsTIr~RLe 1100 (1159)
....++|++...+ |..|++.++.+++++..+.++.....+.. ..-+..++... .+|+|+++|++||||+||++||+
T Consensus 3 ~~~~~~~~~~~~~--~~~~~~~~~~~~~i~~~l~~l~~~~~~~~-~~~F~~pv~~~~~~p~Y~~iI~~PmdL~tI~~kl~ 79 (128)
T cd05529 3 NPLSSEWELFDPG--WEQPHIRDEERERLISGLDKLLLSLQLEI-AEYFEYPVDLRAWYPDYWNRVPVPMDLETIRSRLE 79 (128)
T ss_pred CCCCCchhccccc--CcCCCCCHHHHHHHHHHHHHHHhcccCcc-cccccCCCCccccCCcHHHHcCCCCCHHHHHHHHh
Confidence 3457999997654 88899999999999999999864211111 01134555666 89999999999999999999999
Q ss_pred cCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhc
Q 001090 1101 NNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSS 1148 (1159)
Q Consensus 1101 N~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~Lkk 1148 (1159)
+++|+++++|..||+||+.||++||+++|.++++|+.|+++|.+.|+.
T Consensus 80 ~~~Y~s~~~f~~Dv~Li~~Na~~yN~~~s~i~~~A~~l~~~~~~~l~~ 127 (128)
T cd05529 80 NRYYRSLEALRHDVRLILSNAETFNEPNSEIAKKAKRLSDWLLRILSS 127 (128)
T ss_pred cCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHhcc
Confidence 999999999999999999999999999999999999999999999875
No 4
>cd05497 Bromo_Brdt_I_like Bromodomain, Brdt_like subfamily, repeat I. Human Brdt is a testis-specific member of the BET subfamily of bromodomain proteins; the first bromodomain in Brdt has been shown to be essential for male germ cell differentiation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.73 E-value=7.9e-18 Score=160.82 Aligned_cols=80 Identities=24% Similarity=0.395 Sum_probs=73.8
Q ss_pred hhhhcccc--ccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHh
Q 001090 1070 VQKLKQVS--QKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLS 1147 (1159)
Q Consensus 1070 I~~L~~~~--e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~Lk 1147 (1159)
+.++++.. ++|+||++|++||||+||++||++++|+++++|..||+|||.||.+||+++|+|+.+|..|+++|+..|+
T Consensus 26 F~~PVd~~~~~~pdY~~iIk~PmDL~tI~~kL~~~~Y~s~~ef~~D~~li~~Na~~yN~~~s~i~~~A~~l~~~f~~~l~ 105 (107)
T cd05497 26 FQQPVDAVKLNLPDYHKIIKTPMDLGTIKKRLENNYYWSASECIQDFNTMFTNCYIYNKPGDDVVLMAQTLEKLFLQKLA 105 (107)
T ss_pred ccCCCCcccccCCcHHHHHcCcccHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH
Confidence 34555443 6999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred cC
Q 001090 1148 SL 1149 (1159)
Q Consensus 1148 kL 1149 (1159)
.+
T Consensus 106 ~~ 107 (107)
T cd05497 106 QM 107 (107)
T ss_pred cC
Confidence 64
No 5
>cd05495 Bromo_cbp_like Bromodomain, cbp_like subfamily. Cbp (CREB binding protein or CREBBP) is an acetyltransferase acting on histone, which gives a specific tag for transcriptional activation and also acetylates non-histone proteins. CREBBP binds specifically to phosphorylated CREB protein and augments the activity of phosphorylated CREB to activate transcription of cAMP-responsive genes. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.73 E-value=1.1e-17 Score=159.93 Aligned_cols=104 Identities=14% Similarity=0.272 Sum_probs=87.1
Q ss_pred CHHHHHHHHHHHHHHHHhhccchhhhhhhhhccc--cccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHh
Q 001090 1043 DDDNRNKLLSAFAKLEQSANRVQDQYGVQKLKQV--SQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSN 1120 (1159)
Q Consensus 1043 d~E~r~kLlsal~klees~~r~~DryGI~~L~~~--~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sN 1120 (1159)
+++.+++++..+.++...-. ...-+...+.. .++|+||++|++||||+||++||+++.|.++.+|..||+|||.|
T Consensus 1 ~~~l~~~~~~il~~l~~~~~---~s~~F~~PV~~~~~~~pdY~~iIk~PmDL~tI~~kL~~~~Y~s~~ef~~D~~li~~N 77 (108)
T cd05495 1 PEELRQALMPTLEKLYKQDP---ESLPFRQPVDPKLLGIPDYFDIVKNPMDLSTIRRKLDTGQYQDPWQYVDDVWLMFDN 77 (108)
T ss_pred CHHHHHHHHHHHHHHHHcCc---ccchhcCCCCccccCCCcHHHHhCCCCCHHHHHHHHhcCCCCCHHHHHHHHHHHHHH
Confidence 35778888888888754200 01113344433 36899999999999999999999999999999999999999999
Q ss_pred hhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1121 AESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1121 A~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
|.+||+++|.++++|..|+++|++.|+.+
T Consensus 78 a~~yN~~~s~i~~~a~~l~~~F~~~~~~~ 106 (108)
T cd05495 78 AWLYNRKTSRVYKYCTKLAEVFEQEIDPV 106 (108)
T ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999998764
No 6
>cd05509 Bromo_gcn5_like Bromodomain; Gcn5_like subfamily. Gcn5p is a histone acetyltransferase (HAT) which mediates acetylation of histones at lysine residues; such acetylation is generally correlated with the activation of transcription. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.70 E-value=4e-17 Score=153.15 Aligned_cols=80 Identities=21% Similarity=0.416 Sum_probs=75.5
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
+..++....+|+|+.+|++||||.||++||+++.|+++++|..||+|||+||.+||+++|.++++|..|+++|++.|++|
T Consensus 22 F~~pv~~~~~p~Y~~~I~~PmdL~tI~~kl~~~~Y~s~~~f~~Dv~li~~Na~~yN~~~s~~~~~a~~l~~~f~~~~~~~ 101 (101)
T cd05509 22 FLEPVDKEEAPDYYDVIKKPMDLSTMEEKLENGYYVTLEEFVADLKLIFDNCRLYNGPDTEYYKCANKLEKFFWKKLKEL 101 (101)
T ss_pred hcCCCChhhcCCHHHHhcCCCCHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHhhC
Confidence 45566667799999999999999999999999999999999999999999999999999999999999999999999875
No 7
>cd05504 Bromo_Acf1_like Bromodomain; Acf1_like or BAZ1A_like subfamily. Bromo adjacent to zinc finger 1A (BAZ1A) was identified as a novel human bromodomain gene by cDNA library screening. The Drosophila homologue, Acf1, is part of the CHRAC (chromatin accessibility complex) and regulates ISWI-induced nucleosome remodeling. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.70 E-value=4e-17 Score=157.79 Aligned_cols=82 Identities=18% Similarity=0.237 Sum_probs=77.1
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
+..++...++|+||++|+.||||+||++||+++.|+++++|..||+|||+||.+||+++|+++.+|..|+.+|...++++
T Consensus 33 F~~pvd~~~~pdY~~vI~~PmDL~tI~~kL~~~~Y~s~~~f~~Dv~LI~~Na~~yN~~~s~i~~~A~~l~~~f~~~~~~~ 112 (115)
T cd05504 33 FLRPVSKIEVPDYYDIIKKPMDLGTIKEKLNMGEYKLAEEFLSDIQLVFSNCFLYNPEHTSVYKAGTRLQRFFIKRCRKL 112 (115)
T ss_pred hcCCCCccccccHHHHhcCcccHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHh
Confidence 45566667899999999999999999999999999999999999999999999999999999999999999999999987
Q ss_pred CC
Q 001090 1150 KA 1151 (1159)
Q Consensus 1150 ~~ 1151 (1159)
..
T Consensus 113 ~~ 114 (115)
T cd05504 113 GL 114 (115)
T ss_pred CC
Confidence 64
No 8
>cd05508 Bromo_RACK7 Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) was identified as a potential tumor suppressor genes, it shares domain architecture with BS69/ZMYND11; both have been implicated in the regulation of cellular proliferation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.70 E-value=3.7e-17 Score=154.62 Aligned_cols=77 Identities=18% Similarity=0.302 Sum_probs=71.9
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
+.+++....+|+|+.+|++||||+||++||+++.|+++++|..||+|||+||.+||+++|.++.+|+.|.+.|...|
T Consensus 23 F~~PV~~~~~pdY~~iIk~PmDL~tI~~kl~~~~Y~s~~ef~~Dv~LI~~Na~~YN~~~s~i~~~A~~l~~~~~~e~ 99 (99)
T cd05508 23 FLKPVDLEQFPDYAQYVFKPMDLSTLEKNVRKKAYGSTDAFLADAKWILHNAIIYNGGDHKLTQAAKAIVKICEQEM 99 (99)
T ss_pred hcCCCChhhCCCHHHHcCCCCCHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHhhC
Confidence 45666667899999999999999999999999999999999999999999999999999999999999999998653
No 9
>cd05507 Bromo_brd8_like Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with the thyroid hormone receptor in a ligand-dependent fashion and enhances thyroid hormone-dependent activation from thyroid response elements. Brd8 is thought to be a nuclear receptor coactivator. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.70 E-value=4.6e-17 Score=154.56 Aligned_cols=100 Identities=17% Similarity=0.228 Sum_probs=84.7
Q ss_pred HHHHHHHHHHHHHHhhccchhhhhhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhcc
Q 001090 1046 NRNKLLSAFAKLEQSANRVQDQYGVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYF 1125 (1159)
Q Consensus 1046 ~r~kLlsal~klees~~r~~DryGI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN 1125 (1159)
.+..++..+..+... ....-+.+++....+|+|+++|+.||||+||++||+++.|+++++|..||+|||+||.+||
T Consensus 4 ~~~~~~~il~~l~~~----~~a~~F~~pV~~~~~p~Y~~iIk~PmDL~tI~~kl~~~~Y~s~~ef~~D~~li~~Na~~yN 79 (104)
T cd05507 4 WKKAILLVYRTLASH----RYASVFLKPVTEDIAPGYHSVVYRPMDLSTIKKNIENGTIRSTAEFQRDVLLMFQNAIMYN 79 (104)
T ss_pred HHHHHHHHHHHHHcC----CCCHhhcCCCCccccCCHHHHhCCCcCHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHC
Confidence 456677777776431 1111245556666899999999999999999999999999999999999999999999999
Q ss_pred CCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1126 GRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1126 ~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
++++.++.+|+.|++.|.+.|..+
T Consensus 80 ~~~s~v~~~A~~l~~~~~~~~~~~ 103 (104)
T cd05507 80 SSDHDVYLMAVEMQREVMSQIQQL 103 (104)
T ss_pred CCCCHHHHHHHHHHHHHHHHhhcc
Confidence 999999999999999999998764
No 10
>cd05496 Bromo_WDR9_II Bromodomain; WDR9 repeat II_like subfamily. WDR9 is a human gene located in the Down Syndrome critical region-2 of chromosome 21. It encodes for a nuclear protein containing WD40 repeats and two bromodomains, which may function as a transcriptional regulator involved in chromatin remodeling and play a role in embryonic development. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.69 E-value=5.1e-17 Score=158.33 Aligned_cols=81 Identities=16% Similarity=0.313 Sum_probs=75.7
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCC-CCHHHHHHHHHHHHHHHHHhc
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGR-NTDLSTKIKRLSDLVTRTLSS 1148 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p-~S~I~k~Ak~L~d~F~~~Lkk 1148 (1159)
+..++....+|+|+++|++||||+||++||++++|.++++|..||+|||.||++||++ +|.|+.+|..|+.+|++.+++
T Consensus 26 F~~PVd~~~~pdY~~iIk~PmDL~tIk~kL~~~~Y~~~~ef~~D~~lif~Na~~yN~~~~s~i~~~a~~L~~~F~~~~~~ 105 (119)
T cd05496 26 FRQPVDLLKYPDYRDIIDTPMDLGTVKETLFGGNYDDPMEFAKDVRLIFSNSKSYTPNKRSRIYSMTLRLSALFEEHIKK 105 (119)
T ss_pred ccCCCChhhcCcHHHHhCCcccHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH
Confidence 4566666789999999999999999999999999999999999999999999999985 999999999999999999988
Q ss_pred CC
Q 001090 1149 LK 1150 (1159)
Q Consensus 1149 L~ 1150 (1159)
|.
T Consensus 106 l~ 107 (119)
T cd05496 106 II 107 (119)
T ss_pred HH
Confidence 74
No 11
>cd05516 Bromo_SNF2L2 Bromodomain, SNF2L2-like subfamily, specific to animals. SNF2L2 (SNF2-alpha) or SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A member 2 is a global transcriptional activator, which cooperates with nuclear hormone receptors to boost transcriptional activation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.69 E-value=7.8e-17 Score=153.84 Aligned_cols=80 Identities=16% Similarity=0.303 Sum_probs=75.5
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
+..++....+|+||.+|+.||||+||++||+++.|.++++|..||+|||.||++||+++|.|+.+|..|+++|...++++
T Consensus 28 F~~~p~~~~~pdYy~iI~~Pmdl~tI~~kl~~~~Y~s~~ef~~D~~li~~Na~~yN~~~s~i~~~a~~l~~~f~~~~~~~ 107 (107)
T cd05516 28 FIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYRSLEDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVFKSARQKI 107 (107)
T ss_pred hhcCCCcccCCCHHHHcCCCCCHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHhcC
Confidence 34566777899999999999999999999999999999999999999999999999999999999999999999998864
No 12
>cd05510 Bromo_SPT7_like Bromodomain; SPT7_like subfamily. SPT7 is a yeast protein that functions as a component of the transcription regulatory histone acetylation (HAT) complexes SAGA, SALSA, and SLIK. SAGA is involved in the RNA polymerase II-dependent transcriptional regulation of about 10% of all yeast genes. The SPT7 bromodomain has been shown to weakly interact with acetylated histone H3, but not H4. The human representative of this subfamily is cat eye syndrome critical region protein 2 (CECR2). Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.69 E-value=8e-17 Score=155.36 Aligned_cols=82 Identities=12% Similarity=0.203 Sum_probs=76.0
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCC-HHHHHHHHHHHHHHHHHhc
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNT-DLSTKIKRLSDLVTRTLSS 1148 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S-~I~k~Ak~L~d~F~~~Lkk 1148 (1159)
+.+++...++|+||++|+.||||+||++||+++.|.++++|..||+|||+||.+||++++ +++++|..|++.|...|..
T Consensus 29 F~~pv~~~~~pdY~~iIk~PmdL~tI~~kl~~~~Y~s~~ef~~D~~Li~~N~~~yN~~~s~~~~~~A~~l~~~~~~~~~~ 108 (112)
T cd05510 29 FLTKVSKREAPDYYDIIKKPMDLGTMLKKLKNLQYKSKAEFVDDLNLIWKNCLLYNSDPSHPLRRHANFMKKKAEHLLKL 108 (112)
T ss_pred hhcCCChhhcCCHHHHhcCccCHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH
Confidence 456667778999999999999999999999999999999999999999999999999866 7889999999999999998
Q ss_pred CCC
Q 001090 1149 LKA 1151 (1159)
Q Consensus 1149 L~~ 1151 (1159)
|+.
T Consensus 109 ~~~ 111 (112)
T cd05510 109 IPD 111 (112)
T ss_pred CCC
Confidence 864
No 13
>cd05505 Bromo_WSTF_like Bromodomain; Williams syndrome transcription factor-like subfamily (WSTF-like). The Williams-Beuren syndrome deletion transcript 9 is a putative transcriptional regulator. WSTF was found to play a role in vitamin D-mediated transcription as part of two chromatin remodeling complexes, WINAC and WICH. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.69 E-value=5e-17 Score=153.06 Aligned_cols=77 Identities=14% Similarity=0.346 Sum_probs=72.1
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
+..++....+|+|+++|++||||+||++||+++.|.++++|..||+|||.||.+||+++|.|+.+|..|+++|.+.|
T Consensus 21 F~~pv~~~~~pdY~~iIk~PmDL~tI~~kl~~~~Y~s~~ef~~D~~li~~Na~~yN~~~s~i~~~a~~le~~f~~~~ 97 (97)
T cd05505 21 FREPVTADEAEDYKKVITNPMDLQTMQTKCSCGSYSSVQEFLDDMKLVFSNAEKYYENGSYVLSCMRKTEQCCVNLL 97 (97)
T ss_pred ccCCCChhhcccHHHHcCCcCCHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 45556667899999999999999999999999999999999999999999999999999999999999999998764
No 14
>cd05517 Bromo_polybromo_II Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
Probab=99.68 E-value=8.7e-17 Score=152.79 Aligned_cols=76 Identities=20% Similarity=0.433 Sum_probs=72.2
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRT 1145 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~ 1145 (1159)
+..+....++|+||++|++||||+||++||+++.|.++++|..||+|||.||.+||+++|.|+.+|..|+++|...
T Consensus 27 F~~lp~~~~~pdYy~vI~~PmdL~tI~~kl~~~~Y~s~~~f~~D~~lm~~Na~~yN~~~s~i~~~A~~l~~~f~~~ 102 (103)
T cd05517 27 FQKLPSKVLYPDYYAVIKEPIDLKTIAQRIQSGYYKSIEDMEKDLDLMVKNAKTFNEPGSQVYKDANAIKKIFTAK 102 (103)
T ss_pred HhcCCCCCCCCCHHHHcCCCcCHHHHHHHHCcCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHhc
Confidence 4567788899999999999999999999999999999999999999999999999999999999999999999753
No 15
>cd05524 Bromo_polybromo_I Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
Probab=99.68 E-value=9.1e-17 Score=154.93 Aligned_cols=105 Identities=13% Similarity=0.294 Sum_probs=86.7
Q ss_pred HHHHHHHHHHHHHHHhhcc-chhhhhhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhh
Q 001090 1045 DNRNKLLSAFAKLEQSANR-VQDQYGVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAES 1123 (1159)
Q Consensus 1045 E~r~kLlsal~klees~~r-~~DryGI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~t 1123 (1159)
+.+..|+..+.+......+ ... -+..++....+|+||++|++||||+||++||+++.|.++++|..||+|||+||.+
T Consensus 5 ~~c~~il~~l~~~~~~~g~~l~~--~F~~~p~~~~~PdYy~iI~~Pmdl~tI~~kl~~~~Y~s~~~f~~D~~lm~~Na~~ 82 (113)
T cd05524 5 AVCQELYDTIRNYKSEDGRILCE--SFIRVPKRRNEPEYYEVVSNPIDLLKIQQKLKTEEYDDVDDLTADFELLINNAKA 82 (113)
T ss_pred HHHHHHHHHHHhhcccCCCchhH--HHhcCCCcccCCCHHHHhCCccCHHHHHHHhCcCCCCCHHHHHHHHHHHHHHHHH
Confidence 4456666666654332111 111 1345677788999999999999999999999999999999999999999999999
Q ss_pred ccCCCCHHHHHHHHHHHHHHHHHhcCCC
Q 001090 1124 YFGRNTDLSTKIKRLSDLVTRTLSSLKA 1151 (1159)
Q Consensus 1124 FN~p~S~I~k~Ak~L~d~F~~~LkkL~~ 1151 (1159)
||+++|.++++|..|+++|...++++..
T Consensus 83 yN~~~s~~~~~A~~L~~~f~~~~~~~~~ 110 (113)
T cd05524 83 YYKPDSPEHKDACKLWELFLSARNEVLS 110 (113)
T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHhhc
Confidence 9999999999999999999999988653
No 16
>cd05515 Bromo_polybromo_V Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
Probab=99.68 E-value=1.5e-16 Score=151.28 Aligned_cols=78 Identities=21% Similarity=0.390 Sum_probs=73.8
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHh
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLS 1147 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~Lk 1147 (1159)
+..++...++|+||++|++||||+||++||+++.|.++++|..||+|||.||.+||+++|.|+..|..|+++|.++.+
T Consensus 27 F~~~p~~~~~pdYy~iIk~PmdL~tI~~kl~~~~Y~s~~ef~~D~~l~~~Na~~yN~~~s~i~~~A~~L~~~~~~~~~ 104 (105)
T cd05515 27 FMRLPSKSEYPDYYDVIKKPIDMEKIRSKIEGNQYQSLDDMVSDFVLMFDNACKYNEPDSQIYKDALTLQKVLLETKR 104 (105)
T ss_pred hccCCCcccCCcHHHHcCCCcCHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHc
Confidence 456777889999999999999999999999999999999999999999999999999999999999999999998754
No 17
>cd05501 Bromo_SP100C_like Bromodomain, SP100C_like subfamily. The SP100C protein is a splice variant of SP100, a major component of PML-SP100 nuclear bodies (NBs), which are poorly understood. It is covalently modified by SUMO-1 and may play a role in processes at the chromatin level. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.67 E-value=7.7e-17 Score=153.60 Aligned_cols=74 Identities=14% Similarity=0.301 Sum_probs=70.8
Q ss_pred ccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcCC
Q 001090 1076 VSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSLK 1150 (1159)
Q Consensus 1076 ~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL~ 1150 (1159)
...+|+|+++|++||||+||++||.++.|.++++|+.||+|||.||++||+++ +++.+|..|+++|++.++.+.
T Consensus 27 p~~~pdY~~iIk~PMDL~tI~~kL~~~~Y~s~~ef~~D~~Lif~N~~~yN~~~-~~~~~a~~L~~~Fek~~~~~f 100 (102)
T cd05501 27 PYYIRDYCQGIKEPMWLNKVKERLNERVYHTVEGFVRDMRLIFHNHKLFYKDD-DFGQVGITLEKKFEKNFKEVF 100 (102)
T ss_pred CCCCCchHHHcCCCCCHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHcCCC-HHHHHHHHHHHHHHHHHHHHh
Confidence 45789999999999999999999999999999999999999999999999999 999999999999999998754
No 18
>cd05502 Bromo_tif1_like Bromodomain; tif1_like subfamily. Tif1 (transcription intermediary factor 1) is a member of the tripartite motif (TRIM) protein family, which is characterized by a particular domain architecture. It functions by recruiting coactivators and/or corepressors to modulate transcription. Vertebrate Tif1-gamma, also labeled E3 ubiquitin-protein ligase TRIM33, plays a role in the control of hematopoiesis. Its homologue in Xenopus laevis, Ectodermin, has been shown to function in germ-layer specification and control of cell growth during embryogenesis. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.67 E-value=2.7e-16 Score=150.08 Aligned_cols=103 Identities=15% Similarity=0.229 Sum_probs=85.5
Q ss_pred CHHHHHHHHHHHHHHHHhhccchhhhhhhhhccccccCcccccCCCCCCHHHHHHHHHc---CCCCCHHHHHHHHHHHHH
Q 001090 1043 DDDNRNKLLSAFAKLEQSANRVQDQYGVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLEN---NYYRGLEAVKHDIAVMLS 1119 (1159)
Q Consensus 1043 d~E~r~kLlsal~klees~~r~~DryGI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN---~~YRsleaf~~DVrLI~s 1119 (1159)
+++++.++...+.++... .. ..-+..++.. .+|+|+.+|++||||+||++||++ +.|.++++|..||+|||+
T Consensus 2 ~~~~~~~c~~il~~l~~~--~~--s~~F~~pv~~-~~p~Y~~iI~~PmdL~tI~~kL~~~~~~~Y~s~~~f~~D~~li~~ 76 (109)
T cd05502 2 SPIDQRKCERLLLELYCH--EL--SLPFHEPVSP-SVPNYYKIIKTPMDLSLIRKKLQPKSPQHYSSPEEFVADVRLMFK 76 (109)
T ss_pred CHHHHHHHHHHHHHHHhC--CC--ChhhcCCCCC-CCCCHHHHCCCCccHHHHHHHHhcCCCCCCCCHHHHHHHHHHHHH
Confidence 456666776666665431 11 1113445555 799999999999999999999999 599999999999999999
Q ss_pred hhhhccCCCCHHHHHHHHHHHHHHHHHhcCC
Q 001090 1120 NAESYFGRNTDLSTKIKRLSDLVTRTLSSLK 1150 (1159)
Q Consensus 1120 NA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL~ 1150 (1159)
||..||+++|.++.+|..|+++|++.++.+-
T Consensus 77 Na~~yN~~~s~i~~~a~~l~~~f~~~~~~~~ 107 (109)
T cd05502 77 NCYKFNEEDSEVAQAGKELELFFEEQLKEIL 107 (109)
T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHC
Confidence 9999999999999999999999999998863
No 19
>cd05500 Bromo_BDF1_2_I Bromodomain. BDF1/BDF2 like subfamily, restricted to fungi, repeat I. BDF1 and BDF2 are yeast transcription factors involved in the expression of a wide range of genes, including snRNAs; they are required for sporulation and DNA repair and protect histone H4 from deacetylation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.67 E-value=2.2e-16 Score=149.34 Aligned_cols=101 Identities=21% Similarity=0.322 Sum_probs=84.8
Q ss_pred CCHHHHHHHHHHHHHHHHhhccchhhhhhhhhccc--cccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHH
Q 001090 1042 IDDDNRNKLLSAFAKLEQSANRVQDQYGVQKLKQV--SQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLS 1119 (1159)
Q Consensus 1042 id~E~r~kLlsal~klees~~r~~DryGI~~L~~~--~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~s 1119 (1159)
+++++++.+++.+.++... ....-+...++. ..+|+|+++|++||||+||++||+++.|.++++|..||+|||+
T Consensus 1 ~t~~~~~~~~~ii~~l~~~----~~a~~F~~pv~~~~~~~p~Y~~~I~~P~dL~tI~~kl~~~~Y~s~~~f~~D~~li~~ 76 (103)
T cd05500 1 MTKHQHKFLLSSIRSLKRL----KDARPFLVPVDPVKLNIPHYPTIIKKPMDLGTIERKLKSNVYTSVEEFTADFNLMVD 76 (103)
T ss_pred CCHHHHHHHHHHHHHHHcC----CCChhhcCCCCcccccCCCHHHHhcCCCCHHHHHHHHhcCCCCCHHHHHHHHHHHHH
Confidence 3567888888888887542 111113344332 3789999999999999999999999999999999999999999
Q ss_pred hhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1120 NAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1120 NA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
||.+||+++|.++.+|+.|+..|++.|
T Consensus 77 Na~~yN~~~s~~~~~A~~l~~~fe~~~ 103 (103)
T cd05500 77 NCLTFNGPEHPVSQMGKRLQAAFEKHL 103 (103)
T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 999999999999999999999999864
No 20
>cd05520 Bromo_polybromo_III Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
Probab=99.67 E-value=1.5e-16 Score=151.24 Aligned_cols=76 Identities=25% Similarity=0.437 Sum_probs=72.0
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRT 1145 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~ 1145 (1159)
+.+++....+|+||.+|++||||+||++||+++.|+++++|..||+|||.||.+||+++|.++.+|..|+.+|+.+
T Consensus 27 F~~~p~~~~~PdYy~iI~~PmdL~tI~~kl~~~~Y~s~~~f~~D~~lm~~Na~~yN~~~s~i~~~A~~L~~~f~~~ 102 (103)
T cd05520 27 FLKLPSKRKYPDYYQEIKNPISLQQIRTKLKNGEYETLEELEADLNLMFENAKRYNVPNSRIYKDAEKLQKLMQAK 102 (103)
T ss_pred hhcCCCcccCCCHHHHcCCCcCHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHh
Confidence 4566777889999999999999999999999999999999999999999999999999999999999999999864
No 21
>cd05503 Bromo_BAZ2A_B_like Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B (BAZ2B) were identified as a novel human bromodomain gene by cDNA library screening. BAZ2A is also known as Tip5 (Transcription termination factor I-interacting protein 5) and hWALp3. The proteins may play roles in transcriptional regulation. Human Tip5 is part of a complex termed NoRC (nucleolar remodeling complex), which induces nucleosome sliding and may play a role in the regulation of the rDNA locus. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.67 E-value=1.5e-16 Score=149.15 Aligned_cols=77 Identities=13% Similarity=0.333 Sum_probs=71.6
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
+..++....+|+|+++|+.||||+||++||+++.|+++++|..|++|||+||..||+++|.++++|..|+++|+..+
T Consensus 21 F~~pv~~~~~p~Y~~iIk~PmdL~tI~~kl~~~~Y~s~~ef~~D~~li~~Na~~yN~~~s~i~~~a~~l~~~f~~~~ 97 (97)
T cd05503 21 FLEPVNTKLVPGYRKIIKKPMDFSTIREKLESGQYKTLEEFAEDVRLVFDNCETFNEDDSEVGRAGHNMRKFFEKRW 97 (97)
T ss_pred hcCCCCccccCCHHHHhCCCCCHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 34555666799999999999999999999999999999999999999999999999999999999999999998764
No 22
>cd05518 Bromo_polybromo_IV Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
Probab=99.66 E-value=1.9e-16 Score=150.62 Aligned_cols=76 Identities=16% Similarity=0.288 Sum_probs=71.6
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRT 1145 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~ 1145 (1159)
+..++...++|+||.+|+.||||+||++||+++.|.++++|..||+|||.||.+||+++|+|+.+|..|+++|..+
T Consensus 27 F~~~p~~~~~pdYy~iIk~Pmdl~tI~~kl~~~~Y~s~~ef~~D~~li~~Na~~yN~~~s~i~~~A~~le~~~~~~ 102 (103)
T cd05518 27 FMEKPSKKDYPDYYKIILEPIDLKTIEHNIRNDKYATEEELMDDFKLMFRNARHYNEEGSQVYEDANILEKVLKEK 102 (103)
T ss_pred HhcCCCcccCccHHHHcCCCcCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHhc
Confidence 3566777889999999999999999999999999999999999999999999999999999999999999999763
No 23
>cd05519 Bromo_SNF2 Bromodomain, SNF2-like subfamily, specific to fungi. SNF2 is a yeast protein involved in transcriptional activation, it is the catalytic component of the SWI/SNF ATP-dependent chromatin remodeling complex. The protein is essential for the regulation of gene expression (both positive and negative) of a large number of genes. The SWI/SNF complex changes chromatin structure by altering DNA-histone contacts within the nucleosome, which results in a re-positioning of the nucleosome and facilitates or represses the binding of gene-specific transcription factors. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.66 E-value=2.8e-16 Score=148.58 Aligned_cols=99 Identities=17% Similarity=0.238 Sum_probs=80.3
Q ss_pred HHHHHHHHHHHHHHhhccchhhhhhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhcc
Q 001090 1046 NRNKLLSAFAKLEQSANRVQDQYGVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYF 1125 (1159)
Q Consensus 1046 ~r~kLlsal~klees~~r~~DryGI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN 1125 (1159)
.+..|+.++........+... .-+..++....+|+||++|++||||+||++||+++.|.++.+|..|++|||.||++||
T Consensus 4 ~~~~i~~~v~~~~~~~~~~~~-~~F~~~p~~~~~pdYy~iIk~Pmdl~~I~~kl~~~~Y~s~~~f~~D~~li~~Na~~yn 82 (103)
T cd05519 4 AMLEIYDAVLNCEDETGRKLS-ELFLEKPSKKLYPDYYVIIKRPIALDQIKRRIEGRAYKSLEEFLEDFHLMFANARTYN 82 (103)
T ss_pred HHHHHHHHHHHhcCcCCCchh-HHhcCCCCCCCCcCHHHHcCCCcCHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHHC
Confidence 345555555554321111110 1135667777899999999999999999999999999999999999999999999999
Q ss_pred CCCCHHHHHHHHHHHHHHHH
Q 001090 1126 GRNTDLSTKIKRLSDLVTRT 1145 (1159)
Q Consensus 1126 ~p~S~I~k~Ak~L~d~F~~~ 1145 (1159)
+++|.++.+|..|++.|...
T Consensus 83 ~~~s~i~~~A~~l~~~f~~~ 102 (103)
T cd05519 83 QEGSIVYEDAVEMEKAFKKK 102 (103)
T ss_pred CCCCHHHHHHHHHHHHHHHh
Confidence 99999999999999999875
No 24
>cd05511 Bromo_TFIID Bromodomain, TFIID-like subfamily. Human TAFII250 (or TAF250) is the largest subunit of TFIID, a large multi-domain complex, which initiates the assembly of the transcription machinery. TAFII250 contains two bromodomains that specifically bind to acetylated histone H4. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.66 E-value=1.8e-16 Score=152.57 Aligned_cols=81 Identities=17% Similarity=0.334 Sum_probs=76.2
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
+..++....+|+||.+|+.||||+||++||+++.|+++++|+.||+|||+||.+||++++.++++|+.|.++|.+.|+.+
T Consensus 21 F~~pv~~~~~p~Y~~~I~~PmdL~tI~~kl~~~~Y~s~~ef~~Dv~li~~Na~~yN~~~s~i~~~A~~l~~~~~~~~~~~ 100 (112)
T cd05511 21 FHTPVNKKKVPDYYKIIKRPMDLQTIRKKISKHKYQSREEFLEDIELIVDNSVLYNGPDSVYTKKAKEMLELAEELLAER 100 (112)
T ss_pred hcCCCChhhcccHHHHhcCCCCHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHh
Confidence 45556666799999999999999999999999999999999999999999999999999999999999999999999887
Q ss_pred C
Q 001090 1150 K 1150 (1159)
Q Consensus 1150 ~ 1150 (1159)
+
T Consensus 101 ~ 101 (112)
T cd05511 101 E 101 (112)
T ss_pred H
Confidence 5
No 25
>smart00297 BROMO bromo domain.
Probab=99.66 E-value=4.4e-16 Score=145.21 Aligned_cols=81 Identities=25% Similarity=0.460 Sum_probs=75.6
Q ss_pred hhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhc
Q 001090 1069 GVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSS 1148 (1159)
Q Consensus 1069 GI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~Lkk 1148 (1159)
.+..++....+|+|+.+|+.||||.||++||+++.|+++++|..||++|+.||.+||++++.++.+|+.|.+.|.+.|+.
T Consensus 27 ~F~~~~~~~~~p~Y~~~i~~P~dl~~I~~kl~~~~Y~s~~ef~~D~~li~~Na~~~n~~~s~~~~~a~~l~~~f~~~~~~ 106 (107)
T smart00297 27 PFLKPVDRKEAPDYYDIIKKPMDLSTIKKKLENGKYSSVEEFVADVQLMFSNAKTYNGPDSEVYKDAKKLEKFFEKKLRE 106 (107)
T ss_pred hhccCCChhhccCHHHHhcCCCCHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHhh
Confidence 35666667779999999999999999999999999999999999999999999999999999999999999999999875
Q ss_pred C
Q 001090 1149 L 1149 (1159)
Q Consensus 1149 L 1149 (1159)
+
T Consensus 107 ~ 107 (107)
T smart00297 107 L 107 (107)
T ss_pred C
Confidence 3
No 26
>cd05528 Bromo_AAA Bromodomain; sub-family co-occurring with AAA domains. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine. The structure(2DKW) in this alignment is an uncharacterized protein predicted from analysis of cDNA clones from human fetal liver
Probab=99.65 E-value=5.8e-16 Score=149.34 Aligned_cols=103 Identities=17% Similarity=0.238 Sum_probs=85.8
Q ss_pred HHHHHHHHHHHHHHhhccchhhhhhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhcc
Q 001090 1046 NRNKLLSAFAKLEQSANRVQDQYGVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYF 1125 (1159)
Q Consensus 1046 ~r~kLlsal~klees~~r~~DryGI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN 1125 (1159)
.|-.|...+.++... ....-+.++++..++|+||++|++||||+||++||++++|.++++|..||+|||.||..||
T Consensus 4 lr~~L~~il~~l~~~----~~~~~F~~pv~~~~~pdY~~vI~~PmdL~tI~~kl~~~~Y~s~~ef~~Dv~li~~Na~~yN 79 (112)
T cd05528 4 LRLFLRDVLKRLASD----KRFNAFTKPVDEEEVPDYYEIIKQPMDLQTILQKLDTHQYLTAKDFLKDIDLIVTNALEYN 79 (112)
T ss_pred HHHHHHHHHHHHHhC----CCchhhcCCCCccccCcHHHHHcCCCCHHHHHHHHcCCCcCCHHHHHHHHHHHHHHHHHHC
Confidence 344555556655431 1112245667777899999999999999999999999999999999999999999999999
Q ss_pred CCC----CHHHHHHHHHHHHHHHHHhcCCCC
Q 001090 1126 GRN----TDLSTKIKRLSDLVTRTLSSLKAP 1152 (1159)
Q Consensus 1126 ~p~----S~I~k~Ak~L~d~F~~~LkkL~~~ 1152 (1159)
+++ +.|+.+|..|++.|...+.++.++
T Consensus 80 ~~~s~~~s~i~~~A~~L~~~~~~~~~~~~~~ 110 (112)
T cd05528 80 PDRDPADKLIRSRACELRDEVHAMIEAELDP 110 (112)
T ss_pred CCCCccccHHHHHHHHHHHHHHHHHHhcCCC
Confidence 995 699999999999999999987665
No 27
>cd05521 Bromo_Rsc1_2_I Bromodomain, repeat I in Rsc1/2_like subfamily, specific to fungi. Rsc1 and Rsc2 are components of the RSC complex (remodeling the structure of chromatin), are essential for transcriptional control, and have a specific domain architecture including two bromodomains. The RSC complex has also been linked to homologous recombination and nonhomologous end-joining repair of DNA double strand breaks. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.64 E-value=5.6e-16 Score=148.28 Aligned_cols=76 Identities=18% Similarity=0.264 Sum_probs=71.6
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHh
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLS 1147 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~Lk 1147 (1159)
+..+.....+|+||++|++||||+||++||++ |.++++|..|++|||+||++||+++|++++.|..|+++|...+.
T Consensus 28 F~~lp~~~~~pdYy~iI~~PmdL~tI~~kl~~--Y~s~~ef~~D~~li~~Na~~yN~~~s~i~~~A~~le~~~~~~~~ 103 (106)
T cd05521 28 FNVLPLRKDYPDYYKIIKNPLSLNTVKKRLPH--YTNAQEFVNDLAQIPWNARLYNTKGSVIYKYALILEKYINDVII 103 (106)
T ss_pred hhcCCccccCccHHHHhcCCCCHHHHHHHHHc--CCCHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHhhc
Confidence 45667778999999999999999999999999 99999999999999999999999999999999999999998764
No 28
>cd05525 Bromo_ASH1 Bromodomain; ASH1_like sub-family. ASH1 (absent, small, or homeotic 1) is a member of the trithorax-group in Drosophila melanogaster, an epigenetic transcriptional regulator of HOX genes. Drosophila ASH1 has been shown to methylate specific lysines in histones H3 and H4. Mammalian ASH1 has been shown to methylate histone H3. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.64 E-value=5.8e-16 Score=148.09 Aligned_cols=76 Identities=24% Similarity=0.486 Sum_probs=72.4
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRT 1145 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~ 1145 (1159)
+.+++....+|+||++|++||||+||++||+++.|.++++|..|++|||+||.+||+++|.|+++|..|+++|++.
T Consensus 29 F~~lp~k~~~pdYy~~I~~P~dL~tI~~kl~~~~Y~s~~ef~~D~~l~f~Na~~yn~~~S~i~~~A~~L~~~f~~~ 104 (106)
T cd05525 29 FINLPSKKKNPDYYERITDPVDLSTIEKQILTGYYKTPEAFDSDMLKVFRNAEKYYGRKSPIGRDVCRLRKAYYQA 104 (106)
T ss_pred hccCCCcccCCchhhhCCCCcCHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHc
Confidence 4667778899999999999999999999999999999999999999999999999999999999999999999864
No 29
>cd05513 Bromo_brd7_like Bromodomain, brd7_like subgroup. The BRD7 gene encodes a nuclear protein that has been shown to inhibit cell growth and the progression of the cell cycle by regulating cell-cycle genes at the transcriptional level. BRD7 has been identified as a gene involved in nasopharyngeal carcinoma. The protein interacts with acetylated histone H3 via its bromodomain. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.63 E-value=5.4e-16 Score=146.50 Aligned_cols=74 Identities=19% Similarity=0.260 Sum_probs=67.6
Q ss_pred hhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHH
Q 001090 1071 QKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTR 1144 (1159)
Q Consensus 1071 ~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~ 1144 (1159)
...+....+|+|+++|++||||+||++||+++.|.++++|..||+|||.||.+||+++|.++++|..|...-..
T Consensus 23 ~~PV~~~~~pdY~~vIk~PmDL~tI~~kl~~~~Y~s~~~f~~D~~li~~Na~~yN~~~s~~~~~A~~L~~~~~~ 96 (98)
T cd05513 23 AFPVTDFIAPGYSSIIKHPMDFSTMKEKIKNNDYQSIEEFKDDFKLMCENAMKYNKPDTIYYKAAKKLLHSGMK 96 (98)
T ss_pred cCcCCccccccHHHHHcCccCHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHhhhh
Confidence 44555667999999999999999999999999999999999999999999999999999999999999765443
No 30
>cd05499 Bromo_BDF1_2_II Bromodomain. BDF1/BDF2 like subfamily, restricted to fungi, repeat II. BDF1 and BDF2 are yeast transcription factors involved in the expression of a wide range of genes, including snRNAs; they are required for sporulation and DNA repair and protect histone H4 from deacetylation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.63 E-value=5e-16 Score=146.39 Aligned_cols=76 Identities=16% Similarity=0.293 Sum_probs=70.7
Q ss_pred hhhcccc--ccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1071 QKLKQVS--QKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1071 ~~L~~~~--e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
..+++.. .+|+|+++|++||||+||++||+++.|+++++|..|++|||.||.+||+++|.++.+|..|+++|++.|
T Consensus 25 ~~pvd~~~~~~pdY~~~I~~P~dL~~I~~kl~~~~Y~s~~ef~~D~~li~~N~~~yn~~~s~~~~~a~~l~~~fe~~~ 102 (102)
T cd05499 25 LDPVDPVALNIPNYFSIIKKPMDLGTISKKLQNGQYQSAKEFERDVRLIFKNCYTFNPEGTDVYMMGHQLEEVFNDKW 102 (102)
T ss_pred cCCCCccccCCCCHHHHhcCCCCHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 3444444 899999999999999999999999999999999999999999999999999999999999999999864
No 31
>cd05512 Bromo_brd1_like Bromodomain; brd1_like subfamily. BRD1 is a mammalian gene which encodes for a nuclear protein assumed to be a transcriptional regulator. BRD1 has been implicated with brain development and susceptibility to schizophrenia and bipolar affective disorder. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.62 E-value=9.3e-16 Score=144.67 Aligned_cols=73 Identities=21% Similarity=0.399 Sum_probs=67.8
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLV 1142 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F 1142 (1159)
+.+++....+|+|+++|+.||||+||++||+++.|.++++|..||+|||.||.+||+++|.+++.|..|++.-
T Consensus 22 F~~pVd~~~~pdY~~iIk~PmDL~tI~~kl~~~~Y~s~~ef~~D~~li~~Na~~yN~~~s~~~~~A~~l~~~~ 94 (98)
T cd05512 22 FSEPVDLSEVPDYLDHIKQPMDFSTMRKKLESQRYRTLEDFEADFNLIINNCLAYNAKDTIFYRAAVRLRDQG 94 (98)
T ss_pred hcCCCCccccCCHHHHhcCCcCHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHhh
Confidence 3555666789999999999999999999999999999999999999999999999999999999999998753
No 32
>KOG1474 consensus Transcription initiation factor TFIID, subunit BDF1 and related bromodomain proteins [Transcription]
Probab=99.60 E-value=1.1e-15 Score=183.34 Aligned_cols=102 Identities=18% Similarity=0.336 Sum_probs=83.7
Q ss_pred HHHHHHHHHHHHHHHhhccchhhhhhhhhccc--cccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhh
Q 001090 1045 DNRNKLLSAFAKLEQSANRVQDQYGVQKLKQV--SQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAE 1122 (1159)
Q Consensus 1045 E~r~kLlsal~klees~~r~~DryGI~~L~~~--~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~ 1122 (1159)
.+....++.|.+|.. +++. +-+...+++ -+.||||+||++||||+|||.||++++|.++.+|..||||||.||+
T Consensus 222 ~~lk~C~~iLk~l~~--~k~a--wpF~~PVD~v~LgLpDY~~IIK~PMDLgTIK~kL~~~~Y~~~~eF~~DVRL~F~Ncm 297 (640)
T KOG1474|consen 222 ELLKQCLSILKRLMK--HKHA--WPFNEPVDVVKLGLPDYHDIIKHPMDLGTIKKKLEKGEYKSAEEFAADVRLTFDNCM 297 (640)
T ss_pred HHHHHHHHHHHHHHh--ccCC--CCcCCCcCHHhcCCcchhhhcCCCccHHHHHhhhcccccCCHHHHHHHHHHHHHHHH
Confidence 444445555555543 2211 334445443 3689999999999999999999999999999999999999999999
Q ss_pred hccCCCCHHHHHHHHHHHHHHHHHhcCC
Q 001090 1123 SYFGRNTDLSTKIKRLSDLVTRTLSSLK 1150 (1159)
Q Consensus 1123 tFN~p~S~I~k~Ak~L~d~F~~~LkkL~ 1150 (1159)
+||.+++++|.+|+.|+++|+.+|+.++
T Consensus 298 ~YNp~g~dV~~Ma~~L~~~Fe~rw~~~~ 325 (640)
T KOG1474|consen 298 TYNPEGSDVYAMAKKLQEVFEERWASMP 325 (640)
T ss_pred hcCCCCCHHHHHHHHHHHHHHHHHhhcc
Confidence 9999999999999999999999999966
No 33
>cd05498 Bromo_Brdt_II_like Bromodomain, Brdt_like subfamily, repeat II. Human Brdt is a testis-specific member of the BET subfamily of bromodomain proteins; the first bromodomain in Brdt has been shown to be essential for male germ cell differentiation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.60 E-value=1.6e-15 Score=142.71 Aligned_cols=70 Identities=19% Similarity=0.362 Sum_probs=67.4
Q ss_pred cccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1077 SQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1077 ~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
.++|+|+++|++||||+||++||+++.|+++++|..||+|||+||..||+++|.++.+|..|++.|++.+
T Consensus 33 ~~~p~Y~~~I~~Pmdl~~I~~kl~~~~Y~s~~ef~~D~~li~~Na~~yn~~~s~i~~~a~~l~~~fe~~~ 102 (102)
T cd05498 33 LGLHDYHDIIKHPMDLSTIKKKLDNREYADAQEFAADVRLMFSNCYKYNPPDHPVHAMARKLQDVFEDRW 102 (102)
T ss_pred cCCCcHHHHccCCCcHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 3689999999999999999999999999999999999999999999999999999999999999998763
No 34
>cd05522 Bromo_Rsc1_2_II Bromodomain, repeat II in Rsc1/2_like subfamily, specific to fungi. Rsc1 and Rsc2 are components of the RSC complex (remodeling the structure of chromatin), are essential for transcriptional control, and have a specific domain architecture including two bromodomains. The RSC complex has also been linked to homologous recombination and nonhomologous end-joining repair of DNA double strand breaks. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.59 E-value=3.1e-15 Score=142.28 Aligned_cols=75 Identities=19% Similarity=0.353 Sum_probs=71.4
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTR 1144 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~ 1144 (1159)
+..++....+|+||++|++||||+||++||+++.|.++.+|..|++|||.||.+||++++.++.+|..|++.|..
T Consensus 28 F~~~p~~~~~pdYy~~I~~Pmdl~tI~~kl~~~~Y~s~~~f~~D~~li~~Na~~yn~~~s~i~~~A~~l~~~f~~ 102 (104)
T cd05522 28 FEKLPDKAREPEYYQEISNPISLDDIKKKVKRRKYKSFDQFLNDLNLMFENAKLYNENDSQEYKDAVLLEKEARL 102 (104)
T ss_pred HhcCCCccccCcHHHHhCCCcCHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH
Confidence 456677778999999999999999999999999999999999999999999999999999999999999999986
No 35
>cd05506 Bromo_plant1 Bromodomain, uncharacterized subfamily specific to plants. Might function as a global transcription factor. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.59 E-value=1.6e-15 Score=141.80 Aligned_cols=70 Identities=20% Similarity=0.326 Sum_probs=67.3
Q ss_pred cccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1077 SQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1077 ~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
.++|+|+.+|++||||+||++||+++.|.++++|..|+++||.||.+||+++|+++.+|..|+..|+.++
T Consensus 30 ~~~p~Y~~~I~~P~dl~tI~~kL~~~~Y~s~~ef~~D~~li~~Na~~yn~~~s~i~~~a~~l~~~fe~~w 99 (99)
T cd05506 30 LGLPDYFDIIKKPMDLGTVKKKLEKGEYSSPEEFAADVRLTFANAMRYNPPGNDVHTMAKELLKIFETRW 99 (99)
T ss_pred cCCCCHHHHHcCCCCHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 3689999999999999999999999999999999999999999999999999999999999999998753
No 36
>cd05492 Bromo_ZMYND11 Bromodomain; ZMYND11_like sub-family. ZMYND11 or BS69 is a ubiquitously expressed nuclear protein that has been shown to associate with chromatin. It interacts with chromatin remodeling factors and might play a role in chromatin remodeling and gene expression. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=99.56 E-value=5.1e-15 Score=142.66 Aligned_cols=74 Identities=19% Similarity=0.297 Sum_probs=69.7
Q ss_pred cccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcCC
Q 001090 1077 SQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSLK 1150 (1159)
Q Consensus 1077 ~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL~ 1150 (1159)
..+|+|+.+|+.||||+||++||+++.|+++++|..||+|||+||.+||+++|.++.+|+.|-.-....|..|.
T Consensus 34 ~~~pdY~~iIk~PmDL~tI~~kl~~~~Y~s~~ef~~Dv~LI~~N~~~yNg~~s~~~~~A~~l~~d~~~el~Ei~ 107 (109)
T cd05492 34 TKLPKRRRLIHTHLDVADIQEKINSEKYTSLEEFKADALLLLHNTAIFHGADSEQYDAARWLYRDTCHDLRELR 107 (109)
T ss_pred ccCCCHHHHhCCCCcHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHh
Confidence 36899999999999999999999999999999999999999999999999999999999999998888776653
No 37
>cd04369 Bromodomain Bromodomain. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine.
Probab=99.51 E-value=2.6e-14 Score=128.41 Aligned_cols=77 Identities=23% Similarity=0.456 Sum_probs=71.2
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
+..++....+|+|+.+|++||||.+|++||++++|.++.+|..|+++|+.||..||++++.++.+|..|...|...+
T Consensus 23 F~~~~~~~~~~~Y~~~i~~P~~l~~I~~kl~~~~Y~s~~~f~~D~~li~~Na~~~n~~~~~~~~~a~~l~~~~~~~~ 99 (99)
T cd04369 23 FLEPVDPKEAPDYYEVIKNPMDLSTIKKKLKNGEYKSLEEFEADVRLIFSNAKTYNGPGSPIYKDAKKLEKLFEKLL 99 (99)
T ss_pred HhcCCChhcCCCHHHHHhCcccHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 34555566799999999999999999999999999999999999999999999999999999999999999998753
No 38
>PF00439 Bromodomain: Bromodomain; InterPro: IPR001487 Bromodomains are found in a variety of mammalian, invertebrate and yeast DNA-binding proteins []. Bromodomains can interact with acetylated lysine []. In some proteins, the classical bromodomain has diverged to such an extent that parts of the region are either missing or contain an insertion (e.g., mammalian protein HRX, Caenorhabditis elegans hypothetical protein ZK783.4, yeast protein YTA7). The bromodomain may occur as a single copy, or in duplicate. The precise function of the domain is unclear, but it may be involved in protein-protein interactions and may play a role in assembly or activity of multi-component complexes involved in transcriptional activation [].; GO: 0005515 protein binding; PDB: 3P1C_A 4A9K_B 3SVH_A 3P1E_B 3P1F_A 1JSP_B 2L85_A 3P1D_B 3DWY_B 2D82_A ....
Probab=99.49 E-value=3.7e-14 Score=127.03 Aligned_cols=68 Identities=19% Similarity=0.397 Sum_probs=61.3
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHH
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKR 1137 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~ 1137 (1159)
+.+++....+|+|+++|+.||||++|++||+++.|+++++|..||++||.||..||+++|+++.+|++
T Consensus 17 F~~~~~~~~~p~y~~~i~~P~dL~~I~~kl~~~~Y~s~~~f~~Dv~~i~~Na~~yn~~~s~~~~~A~~ 84 (84)
T PF00439_consen 17 FSKPVDPKEYPDYYEIIKNPMDLSTIRKKLENGKYKSIEEFEADVRLIFQNARRYNPPDSPIYKAAEK 84 (84)
T ss_dssp GSSSTHTTTSTTHHHHSSSS--HHHHHHHHHTTSSSSHHHHHHHHHHHHHHHHHHSCTTSHHHHHHHH
T ss_pred hcCCCChhhCCCHHHHHhhccchhhhhHHhhccchhhHHHHHHHHHHHHHHHHHHCCCcCHHHHHhcC
Confidence 44555677899999999999999999999999999999999999999999999999999999999974
No 39
>cd05526 Bromo_polybromo_VI Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
Probab=99.41 E-value=7.9e-13 Score=127.90 Aligned_cols=78 Identities=18% Similarity=0.344 Sum_probs=70.6
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
+.+|.. ..|+|+.+|+.||||.+|++||+++.|+++++|..|+.+||.||++||+++|.|++.|..|+.+|.+.-.+|
T Consensus 30 f~~LP~--~~~~~~~~ik~Pi~l~~Ik~ki~~~~Y~~ld~~~~D~~lmf~NAr~yN~~~S~iy~dA~eLq~~f~~~rd~~ 107 (110)
T cd05526 30 LAELPE--LAVDGVGPKKIPLTLDIIKRNVDKGRYRRLDKFQEDMFEVLERARRLSRTDSEIYEDAVELQQFFIKIRDEL 107 (110)
T ss_pred HHHCCC--cccCchhhhcCCccHHHHHHHHHcCCcCcHHHHHHHHHHHHHHHHHhCcccCHHHHHHHHHHHHHHHHHHHH
Confidence 455555 467888999999999999999999999999999999999999999999999999999999999999775543
No 40
>COG5076 Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription]
Probab=99.38 E-value=7.4e-13 Score=149.68 Aligned_cols=81 Identities=20% Similarity=0.380 Sum_probs=76.5
Q ss_pred hhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcCC
Q 001090 1071 QKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSLK 1150 (1159)
Q Consensus 1071 ~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL~ 1150 (1159)
..+.....+|+||++|+.||||.||+++|+++.|+++++|+.|++|||.||.+||+|++.++.+|+.|++.|...|+.++
T Consensus 170 ~~~p~k~~~PdYy~iIk~Pm~L~~i~kkl~~~~Y~s~eef~~D~~lM~~N~~~yN~~~s~v~~~a~~l~~~~~~~i~~~~ 249 (371)
T COG5076 170 LGLPSKREYPDYYEIIKSPMDLLTIQKKLKNGRYKSFEEFVSDLNLMFDNCKLYNGPDSSVYVDAKELEKYFLKLIEEIP 249 (371)
T ss_pred ccCCccccCCChheeecchhhHHHHHHHHHhhhhhhHHHHHHHHHHHHHhhhhccCCCcchhhhhHHHHHHHHHHHHhcc
Confidence 45566778999999999999999999999999999999999999999999999999999999999999999999999776
Q ss_pred C
Q 001090 1151 A 1151 (1159)
Q Consensus 1151 ~ 1151 (1159)
.
T Consensus 250 ~ 250 (371)
T COG5076 250 E 250 (371)
T ss_pred c
Confidence 3
No 41
>KOG1245 consensus Chromatin remodeling complex WSTF-ISWI, large subunit (contains heterochromatin localization, PHD and BROMO domains) [Chromatin structure and dynamics]
Probab=98.92 E-value=1.2e-09 Score=140.31 Aligned_cols=79 Identities=15% Similarity=0.288 Sum_probs=73.7
Q ss_pred hhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1070 VQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1070 I~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
+.++++...+|+||.||++||||+||+.||.-+.|.++++|..||+|||.||.+||++ |.|+++.-.|..+|...+...
T Consensus 1322 FlepVn~~~vp~Y~~IIk~Pmdl~tir~k~~~~~Y~~~eef~~Di~lvf~Nc~~yN~~-s~i~~ag~~l~~ff~~~~~~~ 1400 (1404)
T KOG1245|consen 1322 FLEPVNPKEVPDYYDIIKKPMDLSTIREKLSKGIYPSPEEFATDIELVFDNCETYNED-SEIGRAGTCLRRFFHKRWRKK 1400 (1404)
T ss_pred hhccCChhhcccHHHHhcChhHHHHHHHHHhcccCCCHHHHHHHHHHHHHHHHHhccc-hhhhhhcchHHHHHHHHHHhh
Confidence 4566778899999999999999999999999999999999999999999999999999 999999999999999866543
No 42
>cd05491 Bromo_TBP7_like Bromodomain; TBP7_like subfamily, limited to fungi. TBP7, or TAT-binding protein homolog 7, is a yeast protein of unknown function that contains AAA-superfamily ATP-ase domains and a bromodomain. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine.
Probab=98.88 E-value=1.4e-09 Score=106.75 Aligned_cols=41 Identities=29% Similarity=0.433 Sum_probs=39.7
Q ss_pred CCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCC
Q 001090 1088 VPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRN 1128 (1159)
Q Consensus 1088 ~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~ 1128 (1159)
+||||+||++||.|++|.++++|++||+|||.||++||.++
T Consensus 63 y~MDL~tIe~RL~ng~Y~tp~~F~~DiklI~~Nc~~ynd~d 103 (119)
T cd05491 63 YNMDLDTIEERLWNGYYATPKDFLKDIKRIVRDAKTIGDRE 103 (119)
T ss_pred eccCHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHhCCHH
Confidence 69999999999999999999999999999999999999875
No 43
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=98.77 E-value=1.1e-08 Score=122.53 Aligned_cols=85 Identities=24% Similarity=0.318 Sum_probs=73.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCC------ccccccccC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDA------KYDQFFLGD 74 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~------p~EQFF~tD 74 (1159)
++||.|.+|++|||.+|.++|+|.||.+.|-.++|||+|++||+|+++|-|.||+++.|...+.. .+-.=|+-|
T Consensus 551 aTGSsD~tVRlWDv~~G~~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~Ht~ti~SlsFS~d 630 (707)
T KOG0263|consen 551 ATGSSDRTVRLWDVSTGNSVRIFTGHKGPVTALAFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGHTGTIYSLSFSRD 630 (707)
T ss_pred ccCCCCceEEEEEcCCCcEEEEecCCCCceEEEEEcCCCceEeecccCCcEEEEEcCCCcchhhhhcccCceeEEEEecC
Confidence 58999999999999999999999999999999999999999999999999999999998755432 233457778
Q ss_pred CccceecCCCC
Q 001090 75 YRPLVQDTYGN 85 (1159)
Q Consensus 75 YrpLirD~~g~ 85 (1159)
..-|+-+..++
T Consensus 631 g~vLasgg~Dn 641 (707)
T KOG0263|consen 631 GNVLASGGADN 641 (707)
T ss_pred CCEEEecCCCC
Confidence 87777766555
No 44
>KOG1472 consensus Histone acetyltransferase SAGA/ADA, catalytic subunit PCAF/GCN5 and related proteins [Chromatin structure and dynamics; Transcription]
Probab=98.61 E-value=1.8e-08 Score=121.79 Aligned_cols=80 Identities=21% Similarity=0.335 Sum_probs=74.9
Q ss_pred hhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcCC
Q 001090 1071 QKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSLK 1150 (1159)
Q Consensus 1071 ~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL~ 1150 (1159)
.+.+...++|+||.+|.+||||.||+.||.+++|..++.|+.|+.+|+.||+.||++++.+++.|..|..+|...+..+.
T Consensus 628 ~~Pv~~~e~pdyy~~I~~pmDl~tM~~~l~~~~y~~~~~f~ad~~~vf~ncr~yn~~~~~y~k~~~~le~~~~~k~~~~i 707 (720)
T KOG1472|consen 628 LKPVNKKEVPDYYDVIKHPMDLRTMQNRLKDNQYTEVELFMADVVRVFANCRMYNGSDTQYYKCAQALEKFFLFKLNELI 707 (720)
T ss_pred cCccccccCCcHHHHhcccccHHHHhhhccccchhhHHHHHHHHHHHHhhhhccCCccchheecccchhhhhcchhhhhh
Confidence 45566788999999999999999999999999999999999999999999999999999999999999999998887653
No 45
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=98.54 E-value=7.6e-08 Score=110.85 Aligned_cols=63 Identities=27% Similarity=0.410 Sum_probs=59.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCcc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQK 63 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~ 63 (1159)
+||+.|++|.+|||+.|+||.+|..|..+|++++|||||.++|+|+-+|.|+||.+-+++.++
T Consensus 426 ~sas~dstV~lwdv~~gv~i~~f~kH~~pVysvafS~~g~ylAsGs~dg~V~iws~~~~~l~~ 488 (524)
T KOG0273|consen 426 ASASFDSTVKLWDVESGVPIHTLMKHQEPVYSVAFSPNGRYLASGSLDGCVHIWSTKTGKLVK 488 (524)
T ss_pred EEeecCCeEEEEEccCCceeEeeccCCCceEEEEecCCCcEEEecCCCCeeEeccccchheeE
Confidence 589999999999999999999999999999999999999999999999999999988876554
No 46
>KOG0955 consensus PHD finger protein BR140/LIN-49 [General function prediction only]
Probab=98.34 E-value=4.2e-07 Score=113.99 Aligned_cols=86 Identities=21% Similarity=0.444 Sum_probs=76.8
Q ss_pred hhhhhh-hhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHH
Q 001090 1065 QDQYGV-QKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVT 1143 (1159)
Q Consensus 1065 ~DryGI-~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~ 1143 (1159)
.|.+|| ...+...++|+|+.+|+.||||.||+.+|+++.|+++++|..|+.||+.||..||..++.+++.|..+++...
T Consensus 580 kD~~gif~~pvd~~e~pdy~~iik~pmd~~t~~~kl~s~~y~tle~ieed~~l~~~nc~~yn~~dtv~~r~av~~~e~~~ 659 (1051)
T KOG0955|consen 580 KDSYGIFAEPVDPSELPDYIDIIKKPMDFFTMRLKLESGAYSTLEPIEEDVNLIVSNCMEYNAKDTVYYRAAVRLRELIK 659 (1051)
T ss_pred ccccCceeeccChhhcccHHHHhcCccchhhhhhhccccchhhhhHHHHhHhHhHhHHHHhhccCeehHhhhHHHHhhhh
Confidence 345665 3345667899999999999999999999999999999999999999999999999999999999999999998
Q ss_pred HHHhcCC
Q 001090 1144 RTLSSLK 1150 (1159)
Q Consensus 1144 ~~LkkL~ 1150 (1159)
..+.+.+
T Consensus 660 ~~~~~ar 666 (1051)
T KOG0955|consen 660 KDFRNAR 666 (1051)
T ss_pred hHHHhcc
Confidence 8776654
No 47
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=98.30 E-value=9.3e-07 Score=102.78 Aligned_cols=60 Identities=23% Similarity=0.396 Sum_probs=58.4
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||+|++|+|||+.+|+++++|.+|...|..++|+|||..|++++.+|.|.||++..+.
T Consensus 262 ~Sgs~D~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d~~~l~s~s~d~~i~vwd~~~~~ 321 (456)
T KOG0266|consen 262 VSGSDDGTVRIWDVRTGECVRKLKGHSDGISGLAFSPDGNLLVSASYDGTIRVWDLETGS 321 (456)
T ss_pred EEecCCCcEEEEeccCCeEEEeeeccCCceEEEEECCCCCEEEEcCCCccEEEEECCCCc
Confidence 589999999999999999999999999999999999999999999999999999999887
No 48
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=98.25 E-value=1.6e-06 Score=100.91 Aligned_cols=80 Identities=25% Similarity=0.358 Sum_probs=66.7
Q ss_pred CCcccCCcEEEEEc-cCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCcccc------cccc
Q 001090 1 MSAGYDGKTIVWDI-WEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQ------FFLG 73 (1159)
Q Consensus 1 lSAG~DG~I~lWDI-~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQ------FF~t 73 (1159)
+||+-|++|+|||+ ..|.++++|.+|...|++++|+|+|+.|++|+.+|.+.||++..++..+..+..+ -|..
T Consensus 219 ~s~s~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~~is~~~f~~ 298 (456)
T KOG0266|consen 219 LSGSDDKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKGHSDGISGLAFSP 298 (456)
T ss_pred EEecCCceEEEeeccCCCeEEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeeccCCceEEEEECC
Confidence 58999999999999 7779999999999999999999999999999999999999999977554444322 3555
Q ss_pred CCcccee
Q 001090 74 DYRPLVQ 80 (1159)
Q Consensus 74 DYrpLir 80 (1159)
|.+.|+-
T Consensus 299 d~~~l~s 305 (456)
T KOG0266|consen 299 DGNLLVS 305 (456)
T ss_pred CCCEEEE
Confidence 5554443
No 49
>PTZ00421 coronin; Provisional
Probab=98.22 E-value=1.8e-06 Score=102.15 Aligned_cols=60 Identities=22% Similarity=0.362 Sum_probs=56.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||+||+|+|||+.+|.++..|.+|...|.+++|+|||..||++..+|+|.||++..+.
T Consensus 142 aSgs~DgtVrIWDl~tg~~~~~l~~h~~~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~ 201 (493)
T PTZ00421 142 ASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGT 201 (493)
T ss_pred EEEeCCCEEEEEECCCCeEEEEEcCCCCceEEEEEECCCCEEEEecCCCEEEEEECCCCc
Confidence 479999999999999999999999999999999999999999999999999999987764
No 50
>KOG1827 consensus Chromatin remodeling complex RSC, subunit RSC1/Polybromo and related proteins [Chromatin structure and dynamics; Transcription]
Probab=98.11 E-value=6.5e-06 Score=98.95 Aligned_cols=108 Identities=19% Similarity=0.251 Sum_probs=84.9
Q ss_pred CCCCCHHHHHHHHHHHHHHHHhhc-cchhhh-hhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHH
Q 001090 1039 QPRIDDDNRNKLLSAFAKLEQSAN-RVQDQY-GVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAV 1116 (1159)
Q Consensus 1039 ~P~id~E~r~kLlsal~klees~~-r~~Dry-GI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrL 1116 (1159)
.|.|++....++...|..+-.-.. ...+.+ -+.+|......|.||.+|..||.|..|++|+..+.|++.+.|..|+.|
T Consensus 46 ~p~i~~~~~~~f~~il~~~~~~~d~~gk~~~d~feklp~~~~~p~yy~~i~~pisl~~ik~kv~k~~y~~~~~f~~D~~l 125 (629)
T KOG1827|consen 46 SPVIDPPLIPKFKTILASLLDLKDDEGKQLFDKFEKLPSRKEFPEYYYVIQQPISLDQIKRKVKKGRYKRLSFFQLDFLL 125 (629)
T ss_pred ccccChHHHHHHHHHHHHHHhhccccCcccchhHhhccccccCCCcceeecCcccHHHHHHHHHhcccccHHHHHHHHHH
Confidence 455566555555555544321100 001111 246777888899999999999999999999999999999999999999
Q ss_pred HHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1117 MLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1117 I~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
|++||++||.++|.++++|..|+..|....
T Consensus 126 m~ena~~~n~~ds~~~~~s~~l~~~~~~~~ 155 (629)
T KOG1827|consen 126 MTENARLYNRPDSLIYKDSGELEKYFISLE 155 (629)
T ss_pred HHHHHHHhcCcchhhhhhhhhhhcchhhhh
Confidence 999999999999999999999999998754
No 51
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=98.10 E-value=3.4e-06 Score=96.72 Aligned_cols=60 Identities=18% Similarity=0.259 Sum_probs=57.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||.|..-+|||+.+|.+|..|.+|-..|++++|||.|.+||+|..+++..||++-...
T Consensus 319 ~tGGlD~~~RvWDlRtgr~im~L~gH~k~I~~V~fsPNGy~lATgs~Dnt~kVWDLR~r~ 378 (459)
T KOG0272|consen 319 ATGGLDSLGRVWDLRTGRCIMFLAGHIKEILSVAFSPNGYHLATGSSDNTCKVWDLRMRS 378 (459)
T ss_pred eccCccchhheeecccCcEEEEecccccceeeEeECCCceEEeecCCCCcEEEeeecccc
Confidence 589999999999999999999999999999999999999999999999999999997654
No 52
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=98.08 E-value=4.4e-06 Score=92.13 Aligned_cols=63 Identities=22% Similarity=0.277 Sum_probs=58.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCcc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQK 63 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~ 63 (1159)
+|||+||+++|||+.+|++.+.|.||...|..++||||-++|+++..+-+|.+|++=....|.
T Consensus 79 lS~swD~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qivSGSrDkTiklwnt~g~ck~t 141 (315)
T KOG0279|consen 79 LSASWDGTLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQIVSGSRDKTIKLWNTLGVCKYT 141 (315)
T ss_pred EeccccceEEEEEecCCcEEEEEEecCCceEEEEecCCCceeecCCCcceeeeeeecccEEEE
Confidence 689999999999999999999999999999999999999999999999999999985444443
No 53
>KOG0386 consensus Chromatin remodeling complex SWI/SNF, component SWI2 and related ATPases (DNA/RNA helicase superfamily) [Chromatin structure and dynamics; Transcription]
Probab=98.06 E-value=3.3e-06 Score=104.38 Aligned_cols=105 Identities=20% Similarity=0.307 Sum_probs=85.3
Q ss_pred HHHHHHHHHHHHHHHhhccchhhhhhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhc
Q 001090 1045 DNRNKLLSAFAKLEQSANRVQDQYGVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESY 1124 (1159)
Q Consensus 1045 E~r~kLlsal~klees~~r~~DryGI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tF 1124 (1159)
....+++....+..++..|.. .-++..+.....+|+||.+|+.||++..|++++++..|.++.++..|+.+++.||++|
T Consensus 1027 ~~~~~i~~~~~~~~~~~~r~~-~~~~~~~~s~k~~~d~~~~i~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~na~~~ 1105 (1157)
T KOG0386|consen 1027 KQALKIASTSIKYKDSAGREL-SEVFLKLPSRKEYPDYYEIIKKPVAIDKIKKRIENHKYNSLKELEKDFMLLFNNARTY 1105 (1157)
T ss_pred HHHHHHHHHHHhccccccccc-chhcccCcccccccchHHHhcchhhHHHHhhhccccccchHHHHHHHHHhhcchhhhh
Confidence 344555555544433222211 1234677778899999999999999999999999999999999999999999999999
Q ss_pred cCCCCHHHHHHHHHHHHHHHHHhcCC
Q 001090 1125 FGRNTDLSTKIKRLSDLVTRTLSSLK 1150 (1159)
Q Consensus 1125 N~p~S~I~k~Ak~L~d~F~~~LkkL~ 1150 (1159)
|+.+|.++..|..|+.+|......+.
T Consensus 1106 ~~egs~~y~d~~~l~~~~~~~~~~~~ 1131 (1157)
T KOG0386|consen 1106 NEEGSRVYEDAIVLQSVFKSARQEIS 1131 (1157)
T ss_pred ccCCceechhHHHHHHHHhhhHHHHh
Confidence 99999999999999999988776665
No 54
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.04 E-value=8.2e-06 Score=92.41 Aligned_cols=70 Identities=20% Similarity=0.232 Sum_probs=62.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCC-ccccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDA-KYDQF 70 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~-p~EQF 70 (1159)
.|++.|++|++|||.+|.||-.+.+|...|.+++|||-|++|++.-++++|.+|++-.+...+.. +++.|
T Consensus 308 ~s~SrDktIk~wdv~tg~cL~tL~ghdnwVr~~af~p~Gkyi~ScaDDktlrvwdl~~~~cmk~~~ah~hf 378 (406)
T KOG0295|consen 308 GSGSRDKTIKIWDVSTGMCLFTLVGHDNWVRGVAFSPGGKYILSCADDKTLRVWDLKNLQCMKTLEAHEHF 378 (406)
T ss_pred EeecccceEEEEeccCCeEEEEEecccceeeeeEEcCCCeEEEEEecCCcEEEEEeccceeeeccCCCcce
Confidence 37899999999999999999999999999999999999999999999999999999887766544 45556
No 55
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=98.00 E-value=3.6e-06 Score=97.63 Aligned_cols=75 Identities=21% Similarity=0.318 Sum_probs=65.9
Q ss_pred CCcccCCcEEEEEccC-CeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc-----cCCccccccccC
Q 001090 1 MSAGYDGKTIVWDIWE-GIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ-----KDAKYDQFFLGD 74 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~-G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~-----~~~p~EQFF~tD 74 (1159)
||||-||+|.||++.. |.|||+|.+|.-+|.|++||.+|+.|.++.-++.|.+||+.+|... ..+|+-+=||.|
T Consensus 231 LS~gmD~~vklW~vy~~~~~lrtf~gH~k~Vrd~~~s~~g~~fLS~sfD~~lKlwDtETG~~~~~f~~~~~~~cvkf~pd 310 (503)
T KOG0282|consen 231 LSGGMDGLVKLWNVYDDRRCLRTFKGHRKPVRDASFNNCGTSFLSASFDRFLKLWDTETGQVLSRFHLDKVPTCVKFHPD 310 (503)
T ss_pred EecCCCceEEEEEEecCcceehhhhcchhhhhhhhccccCCeeeeeecceeeeeeccccceEEEEEecCCCceeeecCCC
Confidence 6999999999999999 9999999999999999999999999999999999999999998622 234555656655
Q ss_pred C
Q 001090 75 Y 75 (1159)
Q Consensus 75 Y 75 (1159)
.
T Consensus 311 ~ 311 (503)
T KOG0282|consen 311 N 311 (503)
T ss_pred C
Confidence 5
No 56
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=97.97 E-value=5.2e-06 Score=98.45 Aligned_cols=80 Identities=19% Similarity=0.218 Sum_probs=65.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC--ccC-Cccc-----cccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES--QKD-AKYD-----QFFL 72 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~--~~~-~p~E-----QFF~ 72 (1159)
++|+||-+|.|||+.++..--.|.+|...||++||||||+.+|+.--+|+|.+|.--+++. |+. .|.+ -.|.
T Consensus 694 a~asyd~Ti~lWDl~~~~~~~~l~gHtdqIf~~AWSpdGr~~AtVcKDg~~rVy~Prs~e~pv~Eg~gpvgtRgARi~wa 773 (1012)
T KOG1445|consen 694 AVASYDSTIELWDLANAKLYSRLVGHTDQIFGIAWSPDGRRIATVCKDGTLRVYEPRSREQPVYEGKGPVGTRGARILWA 773 (1012)
T ss_pred hhhhccceeeeeehhhhhhhheeccCcCceeEEEECCCCcceeeeecCceEEEeCCCCCCCccccCCCCccCcceeEEEE
Confidence 5899999999999999998888999999999999999999999999999999999877652 221 2322 1566
Q ss_pred cCCcccee
Q 001090 73 GDYRPLVQ 80 (1159)
Q Consensus 73 tDYrpLir 80 (1159)
-|.|-+|-
T Consensus 774 cdgr~viv 781 (1012)
T KOG1445|consen 774 CDGRIVIV 781 (1012)
T ss_pred ecCcEEEE
Confidence 66666554
No 57
>KOG0008 consensus Transcription initiation factor TFIID, subunit TAF1 [Transcription]
Probab=97.95 E-value=5.8e-06 Score=104.28 Aligned_cols=71 Identities=14% Similarity=0.348 Sum_probs=64.9
Q ss_pred ccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHH
Q 001090 1076 VSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTL 1146 (1159)
Q Consensus 1076 ~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~L 1146 (1159)
....|+||.+|+.||||.||.+.+..+.|.+.++|..||++|+.||..||++++.+...|+.+-++.+..|
T Consensus 1409 ~k~~~~yy~kik~pmdl~~i~~n~~~~~y~s~~e~l~dv~~i~~n~~~~ng~e~~y~~k~~k~~ev~~~~~ 1479 (1563)
T KOG0008|consen 1409 KKRVPDYYKKIKNPMDLETILKNIPPHKYDSRSEFLDDVNLIYVNSVEYNGAESAYTKKARKIGEVGLANL 1479 (1563)
T ss_pred hhhchHHHHHhcChhhHHHHhhcCCccccccHHHHhhhhHhhcccceeecCccccccHHHHHHHHHHHHHH
Confidence 34589999999999999999999999999999999999999999999999999999999998777665443
No 58
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=97.85 E-value=3e-05 Score=59.88 Aligned_cols=39 Identities=18% Similarity=0.297 Sum_probs=37.7
Q ss_pred CeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 17 GIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 17 G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
|+++++|.+|...|.+++|+|+|.+||++..+|+|.||+
T Consensus 1 g~~~~~~~~h~~~i~~i~~~~~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 1 GKCVRTFRGHSSSINSIAWSPDGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EEEEEEEESSSSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred CeEEEEEcCCCCcEEEEEEecccccceeeCCCCEEEEEC
Confidence 689999999999999999999999999999999999996
No 59
>PLN00181 protein SPA1-RELATED; Provisional
Probab=97.84 E-value=2.6e-05 Score=96.15 Aligned_cols=60 Identities=20% Similarity=0.316 Sum_probs=56.4
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+||++||+|+|||+.+|..+..|.+|...|++++|+| ||..||++..+|+|.||++..+.
T Consensus 549 as~~~Dg~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~ 609 (793)
T PLN00181 549 ASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGV 609 (793)
T ss_pred EEEeCCCeEEEEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCc
Confidence 4789999999999999999999999999999999997 89999999999999999998764
No 60
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=97.83 E-value=1.9e-05 Score=87.38 Aligned_cols=60 Identities=18% Similarity=0.256 Sum_probs=57.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||+|++|+|||+.+=+....|.||.+-|-.++|||||...|+|-.+|.+++|++-.++
T Consensus 166 vs~s~DktvKvWnl~~~~l~~~~~gh~~~v~t~~vSpDGslcasGgkdg~~~LwdL~~~k 225 (315)
T KOG0279|consen 166 VSASWDKTVKVWNLRNCQLRTTFIGHSGYVNTVTVSPDGSLCASGGKDGEAMLWDLNEGK 225 (315)
T ss_pred EEccCCceEEEEccCCcchhhccccccccEEEEEECCCCCEEecCCCCceEEEEEccCCc
Confidence 589999999999999999989999999999999999999999999999999999998886
No 61
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=97.82 E-value=2.8e-05 Score=86.47 Aligned_cols=60 Identities=17% Similarity=0.296 Sum_probs=57.4
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+||+-|.+..||||++|+.+..|.||.+.|+.+.++| ||.+||++.-+++-.||++-.+.
T Consensus 160 lT~SGD~TCalWDie~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~ 220 (343)
T KOG0286|consen 160 LTGSGDMTCALWDIETGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQ 220 (343)
T ss_pred EecCCCceEEEEEcccceEEEEecCCcccEEEEecCCCCCCeEEecccccceeeeeccCcc
Confidence 5789999999999999999999999999999999999 99999999999999999997764
No 62
>PTZ00420 coronin; Provisional
Probab=97.79 E-value=3.6e-05 Score=92.75 Aligned_cols=59 Identities=19% Similarity=0.193 Sum_probs=53.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||.||+|+|||+.+|..+..+. ++..|..++|+|||..||++..+|+|.||++-++.
T Consensus 142 aSgS~DgtIrIWDl~tg~~~~~i~-~~~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~ 200 (568)
T PTZ00420 142 CSSGFDSFVNIWDIENEKRAFQIN-MPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQE 200 (568)
T ss_pred EEEeCCCeEEEEECCCCcEEEEEe-cCCcEEEEEECCCCCEEEEEecCCEEEEEECCCCc
Confidence 478999999999999999887775 56789999999999999999999999999987664
No 63
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=97.76 E-value=4.3e-05 Score=85.70 Aligned_cols=59 Identities=22% Similarity=0.307 Sum_probs=53.4
Q ss_pred CCcccCCc-EEEEEccCCeeeEEeecC--cceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGK-TIVWDIWEGIPIRIYEIS--RFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~-I~lWDI~~G~~Ir~f~~g--~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+|||.-|| |+|||-.+|+.|.-|..| +..|+.++||||+..||++.+.|+|+||++-..
T Consensus 197 ATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~~s~LavsSdKgTlHiF~l~~~ 258 (346)
T KOG2111|consen 197 ATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSPNSSWLAVSSDKGTLHIFSLRDT 258 (346)
T ss_pred EEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCCCccEEEEEcCCCeEEEEEeecC
Confidence 47888887 579999999999999876 789999999999999999999999999998553
No 64
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=97.69 E-value=4.4e-05 Score=87.20 Aligned_cols=60 Identities=18% Similarity=0.328 Sum_probs=56.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||-|-+++|||+.+-.++.+..+|..-|+.++|||||+.||+|--+|.|.||+--+|+
T Consensus 131 ~tGsGD~TvR~WD~~TeTp~~t~KgH~~WVlcvawsPDgk~iASG~~dg~I~lwdpktg~ 190 (480)
T KOG0271|consen 131 VTGSGDTTVRLWDLDTETPLFTCKGHKNWVLCVAWSPDGKKIASGSKDGSIRLWDPKTGQ 190 (480)
T ss_pred EecCCCceEEeeccCCCCcceeecCCccEEEEEEECCCcchhhccccCCeEEEecCCCCC
Confidence 589999999999999999999999999999999999999999999999999999955544
No 65
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=97.69 E-value=4.6e-05 Score=84.83 Aligned_cols=60 Identities=18% Similarity=0.364 Sum_probs=57.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||.|+..+|||+..|.+++.|.+|...|-.++|-|+|.-||+|.++|+..+|++-...
T Consensus 203 vSg~cD~~aklWD~R~~~c~qtF~ghesDINsv~ffP~G~afatGSDD~tcRlyDlRaD~ 262 (343)
T KOG0286|consen 203 VSGGCDKSAKLWDVRSGQCVQTFEGHESDINSVRFFPSGDAFATGSDDATCRLYDLRADQ 262 (343)
T ss_pred EecccccceeeeeccCcceeEeecccccccceEEEccCCCeeeecCCCceeEEEeecCCc
Confidence 489999999999999999999999999999999999999999999999999999997754
No 66
>PTZ00421 coronin; Provisional
Probab=97.63 E-value=7.7e-05 Score=88.55 Aligned_cols=60 Identities=18% Similarity=0.230 Sum_probs=53.9
Q ss_pred CCcccCCcEEEEEccCC-------eeeEEeecCcceeeeeeeCCCC-CEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEG-------IPIRIYEISRFRLVDGKFSPDG-ASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G-------~~Ir~f~~g~~~i~D~~FSPDG-t~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||.||+|+||||.++ .++..|.+|...|..++|+|++ ..||++..+|+|.||++..+.
T Consensus 92 aSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~ 159 (493)
T PTZ00421 92 FTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGK 159 (493)
T ss_pred EEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCe
Confidence 58999999999999876 4678888999999999999996 689999999999999998764
No 67
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=97.61 E-value=4.5e-05 Score=89.43 Aligned_cols=80 Identities=23% Similarity=0.329 Sum_probs=63.0
Q ss_pred CCcccCCcEEEEEccCCe-eeEEee-----cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC-----ccCCccc-
Q 001090 1 MSAGYDGKTIVWDIWEGI-PIRIYE-----ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES-----QKDAKYD- 68 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~-~Ir~f~-----~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~-----~~~~p~E- 68 (1159)
||+|+||+++||||.+-+ .+.+|- +++..+-.|+|+|||+.||+|=.+|.|-||++|+--. .+++-..
T Consensus 285 lT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nrdg~~iAagc~DGSIQ~W~~~~~~v~p~~~vk~AH~~g 364 (641)
T KOG0772|consen 285 LTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNRDGKLIAAGCLDGSIQIWDKGSRTVRPVMKVKDAHLPG 364 (641)
T ss_pred EEecCCCcEEEEecCCchhheeEEeeccCCCcccCceeeecCCCcchhhhcccCCceeeeecCCcccccceEeeeccCCC
Confidence 689999999999999887 666774 4588999999999999999999999999999976321 1222211
Q ss_pred -----cccccCCcccee
Q 001090 69 -----QFFLGDYRPLVQ 80 (1159)
Q Consensus 69 -----QFF~tDYrpLir 80 (1159)
--|+.|.+-|+-
T Consensus 365 ~~Itsi~FS~dg~~LlS 381 (641)
T KOG0772|consen 365 QDITSISFSYDGNYLLS 381 (641)
T ss_pred CceeEEEeccccchhhh
Confidence 158888877664
No 68
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.58 E-value=0.00015 Score=71.76 Aligned_cols=59 Identities=20% Similarity=0.250 Sum_probs=54.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
++|+.||.|.|||+.++..+..+..+...+..+.|+|+|..|+++...|.|.||++..+
T Consensus 25 ~~~~~~g~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~~~~~~ 83 (289)
T cd00200 25 ATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETG 83 (289)
T ss_pred EEeecCcEEEEEEeeCCCcEEEEecCCcceeEEEECCCCCEEEEEcCCCeEEEEEcCcc
Confidence 36788999999999999999999998899999999999999999999999999998765
No 69
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.58 E-value=0.00014 Score=71.80 Aligned_cols=60 Identities=22% Similarity=0.288 Sum_probs=55.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
++++.||.|.|||+.++..++.|..+...|..++|+|++..|+++..+|.|.+|++..+.
T Consensus 67 ~~~~~~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 126 (289)
T cd00200 67 ASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK 126 (289)
T ss_pred EEEcCCCeEEEEEcCcccceEEEeccCCcEEEEEEcCCCCEEEEecCCCeEEEEECCCcE
Confidence 367889999999999999999999888899999999999999999999999999997544
No 70
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=97.56 E-value=8.8e-05 Score=86.52 Aligned_cols=59 Identities=19% Similarity=0.347 Sum_probs=54.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCC-CEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDG-ASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDG-t~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
||||+|+.|.|||++||+++..|..+. .++.++|-||+ ..|.+|-++|+|..||+-+++
T Consensus 274 LS~sfD~~lKlwDtETG~~~~~f~~~~-~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~k 333 (503)
T KOG0282|consen 274 LSASFDRFLKLWDTETGQVLSRFHLDK-VPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGK 333 (503)
T ss_pred eeeecceeeeeeccccceEEEEEecCC-CceeeecCCCCCcEEEEecCCCcEEEEeccchH
Confidence 699999999999999999999998654 57889999999 889999999999999998886
No 71
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=97.54 E-value=0.00022 Score=73.99 Aligned_cols=59 Identities=20% Similarity=0.264 Sum_probs=50.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEE-EEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASI-ILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~-a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|++.||+|.+||+.+|.+++.|.++. .+..++|+|||..+ +++..+|.|.+|++..++
T Consensus 5 ~s~~~d~~v~~~d~~t~~~~~~~~~~~-~~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~ 64 (300)
T TIGR03866 5 VSNEKDNTISVIDTATLEVTRTFPVGQ-RPRGITLSKDGKLLYVCASDSDTIQVIDLATGE 64 (300)
T ss_pred EEecCCCEEEEEECCCCceEEEEECCC-CCCceEECCCCCEEEEEECCCCeEEEEECCCCc
Confidence 478999999999999999999997653 46789999999987 567789999999998765
No 72
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=97.53 E-value=6.9e-05 Score=85.62 Aligned_cols=77 Identities=18% Similarity=0.279 Sum_probs=70.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCc--cccccccCCcc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAK--YDQFFLGDYRP 77 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p--~EQFF~tDYrp 77 (1159)
+||+.|..|+|||-.+|+.|.+|.||-.+||-+|||-|.+.|+++..+-+|.||++-+.+..-..| .++-|.=|+.|
T Consensus 383 ASaSFDkSVkLW~g~tGk~lasfRGHv~~VYqvawsaDsRLlVS~SkDsTLKvw~V~tkKl~~DLpGh~DEVf~vDwsp 461 (480)
T KOG0271|consen 383 ASASFDKSVKLWDGRTGKFLASFRGHVAAVYQVAWSADSRLLVSGSKDSTLKVWDVRTKKLKQDLPGHADEVFAVDWSP 461 (480)
T ss_pred EEeecccceeeeeCCCcchhhhhhhccceeEEEEeccCccEEEEcCCCceEEEEEeeeeeecccCCCCCceEEEEEecC
Confidence 589999999999999999999999999999999999999999999999999999998887665666 56777777775
No 73
>cd05494 Bromodomain_1 Bromodomain; uncharacterized subfamily. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine.
Probab=97.49 E-value=5.5e-05 Score=74.08 Aligned_cols=79 Identities=6% Similarity=0.028 Sum_probs=54.2
Q ss_pred HHHHHHHHHHHHHHHhhccchhhhhhhhhccc--cccCcccccCCCCCCHHHHHHHHHcC-------CCCCHHHHHHHHH
Q 001090 1045 DNRNKLLSAFAKLEQSANRVQDQYGVQKLKQV--SQKTNFTNRFPVPLSLDVIQSRLENN-------YYRGLEAVKHDIA 1115 (1159)
Q Consensus 1045 E~r~kLlsal~klees~~r~~DryGI~~L~~~--~e~PdY~~iIk~PMDLsTIr~RLeN~-------~YRsleaf~~DVr 1115 (1159)
+..+.++..+..+.... ...-+...+.. ..+|+|+++|++||||+||+++|.+. +|.--+....++.
T Consensus 3 e~~~~~l~~l~~~~~~~----~~~pF~~PVd~~~~~~pdY~~iIK~PMDL~ti~~kl~~~~~~~~~~~~~~~~~~~~~~~ 78 (114)
T cd05494 3 EALERVLRELKRHRRNE----DAWPFLEPVNPPRRGAPDYRDVIKRPMSFGTKVNNIVETGARDLEDLQIVQEDPADKQI 78 (114)
T ss_pred HHHHHHHHHHHHhhhCC----CCCCcCCCCCchhcCCCChhhhcCCCCChHHHHHHHHcccccccccccccccccccccc
Confidence 44566666666654210 11113444555 67999999999999999999999996 5555566667777
Q ss_pred HHHHhhhhccCC
Q 001090 1116 VMLSNAESYFGR 1127 (1159)
Q Consensus 1116 LI~sNA~tFN~p 1127 (1159)
+++.||-.+|..
T Consensus 79 ~~~~~~~~~~~~ 90 (114)
T cd05494 79 DDEGRRSPSNIY 90 (114)
T ss_pred ccccccCccccc
Confidence 777777777654
No 74
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=97.47 E-value=0.00013 Score=81.67 Aligned_cols=59 Identities=17% Similarity=0.331 Sum_probs=51.9
Q ss_pred CcccCCcEEEEEccC-CeeeE-EeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWE-GIPIR-IYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~-G~~Ir-~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
-+|+||+|++|+|+. |..+- ....+.++|+|++||-||+.+.+++.+|++.||++.++.
T Consensus 45 A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~WsddgskVf~g~~Dk~~k~wDL~S~Q 105 (347)
T KOG0647|consen 45 AGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVCWSDDGSKVFSGGCDKQAKLWDLASGQ 105 (347)
T ss_pred ecccCCceEEEEEecCCcccchhhhccCCCeEEEEEccCCceEEeeccCCceEEEEccCCC
Confidence 478999999999998 66443 344678999999999999999999999999999998885
No 75
>PTZ00420 coronin; Provisional
Probab=97.46 E-value=0.00019 Score=86.75 Aligned_cols=60 Identities=18% Similarity=0.102 Sum_probs=51.8
Q ss_pred CCcccCCcEEEEEccCCe--------eeEEeecCcceeeeeeeCCCCCEE-EEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGI--------PIRIYEISRFRLVDGKFSPDGASI-ILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~--------~Ir~f~~g~~~i~D~~FSPDGt~~-a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||.||+|+|||+.++. ++..|.+|...|..++|+|+|..+ |++..+|+|.||++..+.
T Consensus 91 ASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~ 159 (568)
T PTZ00420 91 ASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEK 159 (568)
T ss_pred EEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEEEECCCCc
Confidence 589999999999998752 455778888999999999999875 678889999999998765
No 76
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=97.45 E-value=0.00011 Score=89.32 Aligned_cols=58 Identities=17% Similarity=0.247 Sum_probs=55.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+||+.||.|.||||.+|.++..|.+|.+.|+++-||+||..||++..+-+|.+||+-.
T Consensus 593 aSg~ed~~I~iWDl~~~~~v~~l~~Ht~ti~SlsFS~dg~vLasgg~DnsV~lWD~~~ 650 (707)
T KOG0263|consen 593 ASGDEDGLIKIWDLANGSLVKQLKGHTGTIYSLSFSRDGNVLASGGADNSVRLWDLTK 650 (707)
T ss_pred eecccCCcEEEEEcCCCcchhhhhcccCceeEEEEecCCCEEEecCCCCeEEEEEchh
Confidence 5899999999999999999999999999999999999999999999999999998754
No 77
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=97.42 E-value=0.00023 Score=78.30 Aligned_cols=61 Identities=25% Similarity=0.262 Sum_probs=57.6
Q ss_pred CcccCCcEEEEEccCCeeeEEe-ecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIY-EISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f-~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
.||.|+.++.+|..+|+-|-.| .||+++|..++|||||..-|+|..+|+|.||-++.++.+
T Consensus 241 aGged~~~~kfDy~TgeEi~~~nkgh~gpVhcVrFSPdGE~yAsGSEDGTirlWQt~~~~~~ 302 (334)
T KOG0278|consen 241 AGGEDFKVYKFDYNTGEEIGSYNKGHFGPVHCVRFSPDGELYASGSEDGTIRLWQTTPGKTY 302 (334)
T ss_pred ecCcceEEEEEeccCCceeeecccCCCCceEEEEECCCCceeeccCCCceEEEEEecCCCch
Confidence 4899999999999999999999 689999999999999999999999999999999988755
No 78
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=97.40 E-value=0.00024 Score=78.07 Aligned_cols=60 Identities=13% Similarity=0.164 Sum_probs=54.1
Q ss_pred CCcccCCcEEEEEccCC-eeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEG-IPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G-~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+++|.|.++.||.+++= +.-+.+.+|+.-++||+||-||++|+++.++++..||++..++
T Consensus 231 at~ssdktv~iwn~~~~~kle~~l~gh~rWvWdc~FS~dg~YlvTassd~~~rlW~~~~~k 291 (311)
T KOG0315|consen 231 ATCSSDKTVKIWNTDDFFKLELVLTGHQRWVWDCAFSADGEYLVTASSDHTARLWDLSAGK 291 (311)
T ss_pred EeecCCceEEEEecCCceeeEEEeecCCceEEeeeeccCccEEEecCCCCceeecccccCc
Confidence 47899999999999987 4445677899999999999999999999999999999998876
No 79
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.40 E-value=0.00021 Score=86.55 Aligned_cols=61 Identities=16% Similarity=0.255 Sum_probs=58.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES 61 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~ 61 (1159)
||+|-|+||+||-|.++.|+++|.||.++|+-+.|-.+|++|++++++|-|.||.+-+++.
T Consensus 521 aT~SgD~TvKIW~is~fSClkT~eGH~~aVlra~F~~~~~qliS~~adGliKlWnikt~eC 581 (775)
T KOG0319|consen 521 ATCSGDKTVKIWSISTFSCLKTFEGHTSAVLRASFIRNGKQLISAGADGLIKLWNIKTNEC 581 (775)
T ss_pred EeccCCceEEEEEeccceeeeeecCccceeEeeeeeeCCcEEEeccCCCcEEEEeccchhh
Confidence 6899999999999999999999999999999999999999999999999999999977653
No 80
>KOG1472 consensus Histone acetyltransferase SAGA/ADA, catalytic subunit PCAF/GCN5 and related proteins [Chromatin structure and dynamics; Transcription]
Probab=97.39 E-value=0.00012 Score=89.68 Aligned_cols=61 Identities=16% Similarity=0.266 Sum_probs=54.8
Q ss_pred ccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHH
Q 001090 1074 KQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTK 1134 (1159)
Q Consensus 1074 ~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~ 1134 (1159)
+....+|+||.+|+.||||.|+.++|.+..|.+.+.|+.|+.+||.||++||...+..+..
T Consensus 311 vs~~~a~~y~~i~k~pmdl~t~~~k~~~~~y~~~~~fv~d~~~~~~n~~~~n~ee~~~~~~ 371 (720)
T KOG1472|consen 311 VSKEDAPNYYQIIKAPMDLSTELKKLKSGPYCSKEEFVNDLMLIWRNCEKYNSEESHGLIE 371 (720)
T ss_pred CChhhCcchHHhhhcchHHHHHHHHhccccccchhHHHHHHHHHHhcchhhccccchhhhh
Confidence 4556799999999999999999999999999999999999999999999999976554433
No 81
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.35 E-value=0.00028 Score=77.68 Aligned_cols=59 Identities=15% Similarity=0.281 Sum_probs=55.4
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+++|+||+|+|||-..+.-|++|.||..-|+..+||| ++..||++..+|++.||++-..
T Consensus 121 ltsSWD~TiKLW~~~r~~Sv~Tf~gh~~~Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~ 180 (311)
T KOG0277|consen 121 LTSSWDGTIKLWDPNRPNSVQTFNGHNSCIYQAAFSPHIPNLFASASGDGTLRLWDVRSP 180 (311)
T ss_pred EeeccCCceEeecCCCCcceEeecCCccEEEEEecCCCCCCeEEEccCCceEEEEEecCC
Confidence 4679999999999999999999999999999999999 6889999999999999998664
No 82
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=97.34 E-value=0.00028 Score=73.83 Aligned_cols=64 Identities=20% Similarity=0.367 Sum_probs=50.0
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecC------ceeEEEEecCCCCCccCCccccccc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDD------VGQLYILNTGQGESQKDAKYDQFFL 72 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~------~G~l~Ifg~G~g~~~~~~p~EQFF~ 72 (1159)
.|.|.|||+.+.++|..+.. ..+.+++|||||++||++.. +..+.||.+- |......+.++||+
T Consensus 124 ~G~l~~wd~~~~~~i~~~~~--~~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~-G~~l~~~~~~~l~~ 193 (194)
T PF08662_consen 124 NGDLEFWDVRKKKKISTFEH--SDATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ-GRLLYKKPFDELYQ 193 (194)
T ss_pred CcEEEEEECCCCEEeecccc--CcEEEEEEcCCCCEEEEEEeccceeccccEEEEEec-CeEeEecchhhhhC
Confidence 47799999999999977753 34689999999999998875 5778899984 55555566666653
No 83
>KOG1474 consensus Transcription initiation factor TFIID, subunit BDF1 and related bromodomain proteins [Transcription]
Probab=97.33 E-value=7e-05 Score=91.43 Aligned_cols=75 Identities=21% Similarity=0.348 Sum_probs=69.9
Q ss_pred ccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcCCCC
Q 001090 1078 QKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSLKAP 1152 (1159)
Q Consensus 1078 e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL~~~ 1152 (1159)
..|+|+.+|+.|||++||..||++++|........|+..+|+||.+||.+.-.|+.++..++++|...+..++.+
T Consensus 23 ~~~~~~~~~~~~~d~~~~~~~~e~n~~~~~~~~~~~f~~~~sn~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 97 (640)
T KOG1474|consen 23 NLPAYYEIIKRPMDIGTIEKRVENNYYFSASECIADFKTKFSNCYLFNDSGDDVVRMKQSLEKLFPKKLRSMPSD 97 (640)
T ss_pred cchhhhcccCCCCCchhhhhhhccCccccHhhhhhhccccccchhcccCCccchhhccccchhhccccccccccc
Confidence 368999999999999999999999999999999999999999999999999999999999999998877776643
No 84
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=97.30 E-value=0.00014 Score=81.83 Aligned_cols=80 Identities=23% Similarity=0.290 Sum_probs=69.4
Q ss_pred CCcccCCcEEEEEccCCeeeEEee-cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCc------ccccccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE-ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAK------YDQFFLG 73 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~-~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p------~EQFF~t 73 (1159)
+||+-||+|.||-|++|.|||.|. .|.-.|-.+.||.|+.+|.++.-+-++.|-|+-+|+.+|... .|--|..
T Consensus 279 AsGsqDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsSyvn~a~ft~ 358 (508)
T KOG0275|consen 279 ASGSQDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKCLKEFRGHSSYVNEATFTD 358 (508)
T ss_pred hccCcCCcEEEEEEecchHHHHhhhhhccCeeEEEEccCcchhhcccccceEEEeccccchhHHHhcCccccccceEEcC
Confidence 589999999999999999999999 566789999999999999999999999999999998776543 2456777
Q ss_pred CCcccee
Q 001090 74 DYRPLVQ 80 (1159)
Q Consensus 74 DYrpLir 80 (1159)
|.+-+|.
T Consensus 359 dG~~iis 365 (508)
T KOG0275|consen 359 DGHHIIS 365 (508)
T ss_pred CCCeEEE
Confidence 7776665
No 85
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.29 E-value=0.00025 Score=81.65 Aligned_cols=99 Identities=21% Similarity=0.233 Sum_probs=74.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC-CccCCccccccccCCc-cc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE-SQKDAKYDQFFLGDYR-PL 78 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~-~~~~~p~EQFF~tDYr-pL 78 (1159)
+++|.||+++||++-+=..+..+..++..|-|+.|||||..||+.... ...||++-+|. .+..+| |.+||+ ..
T Consensus 160 atgg~dg~lRv~~~Ps~~t~l~e~~~~~eV~DL~FS~dgk~lasig~d-~~~VW~~~~g~~~a~~t~----~~k~~~~~~ 234 (398)
T KOG0771|consen 160 ATGGTDGTLRVWEWPSMLTILEEIAHHAEVKDLDFSPDGKFLASIGAD-SARVWSVNTGAALARKTP----FSKDEMFSS 234 (398)
T ss_pred eeccccceEEEEecCcchhhhhhHhhcCccccceeCCCCcEEEEecCC-ceEEEEeccCchhhhcCC----cccchhhhh
Confidence 368999999999999999988888999999999999999999988888 99999998885 233344 778887 23
Q ss_pred ---eecCCCCccccccCcCCcccCCCCCcccC
Q 001090 79 ---VQDTYGNVLDQETQLAPHRRNLQDPLCDS 107 (1159)
Q Consensus 79 ---irD~~g~vlDe~Tq~~PHL~~~~~~L~D~ 107 (1159)
..|..+-+|=-.+++-|+=+- .+||.
T Consensus 235 cRF~~d~~~~~l~laa~~~~~~~v---~~~~~ 263 (398)
T KOG0771|consen 235 CRFSVDNAQETLRLAASQFPGGGV---RLCDI 263 (398)
T ss_pred ceecccCCCceEEEEEecCCCCce---eEEEe
Confidence 334444444445555555432 34554
No 86
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=97.25 E-value=0.00047 Score=79.35 Aligned_cols=59 Identities=17% Similarity=0.343 Sum_probs=55.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||+|.+|+|||+.||+.+-.+. |+..|+++.|+-||.+|+++-.+-+|.||+-..++
T Consensus 148 lsag~Dn~v~iWnv~tgeali~l~-hpd~i~S~sfn~dGs~l~TtckDKkvRv~dpr~~~ 206 (472)
T KOG0303|consen 148 LSAGSDNTVSIWNVGTGEALITLD-HPDMVYSMSFNRDGSLLCTTCKDKKVRVIDPRRGT 206 (472)
T ss_pred hhccCCceEEEEeccCCceeeecC-CCCeEEEEEeccCCceeeeecccceeEEEcCCCCc
Confidence 699999999999999999887766 99999999999999999999999999999987776
No 87
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=97.23 E-value=0.00049 Score=81.27 Aligned_cols=60 Identities=20% Similarity=0.279 Sum_probs=56.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEee---cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE---ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~---~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|+|.||+|+|+|=.+|+.+..|. +|.|.||.+.||||+++|+++..+-++.||++-..+
T Consensus 206 at~gsDgki~iyDGktge~vg~l~~~~aHkGsIfalsWsPDs~~~~T~SaDkt~KIWdVs~~s 268 (603)
T KOG0318|consen 206 ATAGSDGKIYIYDGKTGEKVGELEDSDAHKGSIFALSWSPDSTQFLTVSADKTIKIWDVSTNS 268 (603)
T ss_pred EEecCCccEEEEcCCCccEEEEecCCCCccccEEEEEECCCCceEEEecCCceEEEEEeeccc
Confidence 589999999999999999999998 789999999999999999999999999999987764
No 88
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.21 E-value=0.00041 Score=84.40 Aligned_cols=57 Identities=21% Similarity=0.346 Sum_probs=46.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G 57 (1159)
+|||||.+|+||||..-....-=......+++++|+|||.-||++..+|+|.||+.-
T Consensus 494 aS~SWDkTVRiW~if~s~~~vEtl~i~sdvl~vsfrPdG~elaVaTldgqItf~d~~ 550 (893)
T KOG0291|consen 494 ASGSWDKTVRIWDIFSSSGTVETLEIRSDVLAVSFRPDGKELAVATLDGQITFFDIK 550 (893)
T ss_pred EeccccceEEEEEeeccCceeeeEeeccceeEEEEcCCCCeEEEEEecceEEEEEhh
Confidence 589999999999996553222222346689999999999999999999999999973
No 89
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=97.19 E-value=0.0005 Score=75.67 Aligned_cols=84 Identities=19% Similarity=0.275 Sum_probs=70.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCcccc--------ccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQ--------FFL 72 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQ--------FF~ 72 (1159)
+|||||-+|++|-..+|+|++++.-..+.|--+..+||+..||++- .-||.|||+-++.....+-+|- =|+
T Consensus 14 vsA~YDhTIRfWqa~tG~C~rTiqh~dsqVNrLeiTpdk~~LAaa~-~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~ 92 (311)
T KOG0315|consen 14 VSAGYDHTIRFWQALTGICSRTIQHPDSQVNRLEITPDKKDLAAAG-NQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQ 92 (311)
T ss_pred EeccCcceeeeeehhcCeEEEEEecCccceeeEEEcCCcchhhhcc-CCeeEEEEccCCCCCceeEEeccCCceEEEEEe
Confidence 5899999999999999999999998899999999999999999875 4589999999887554444443 499
Q ss_pred cCCccceec-CCCC
Q 001090 73 GDYRPLVQD-TYGN 85 (1159)
Q Consensus 73 tDYrpLirD-~~g~ 85 (1159)
-|.|.+..- +.|-
T Consensus 93 ~dgrWMyTgseDgt 106 (311)
T KOG0315|consen 93 CDGRWMYTGSEDGT 106 (311)
T ss_pred ecCeEEEecCCCce
Confidence 999988773 4443
No 90
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=97.19 E-value=0.00072 Score=75.03 Aligned_cols=55 Identities=16% Similarity=0.194 Sum_probs=51.5
Q ss_pred CCcccCCcEEEEEcc-CC--eeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 1 MSAGYDGKTIVWDIW-EG--IPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~-~G--~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
+|+|||.+|++|+-. .+ +|+.++.++...|+.++|.|.|..|++++++|++.||.
T Consensus 166 ~S~SYDnTIk~~~~~~dddW~c~~tl~g~~~TVW~~~F~~~G~rl~s~sdD~tv~Iw~ 223 (312)
T KOG0645|consen 166 FSCSYDNTIKVYRDEDDDDWECVQTLDGHENTVWSLAFDNIGSRLVSCSDDGTVSIWR 223 (312)
T ss_pred EEeccCCeEEEEeecCCCCeeEEEEecCccceEEEEEecCCCceEEEecCCcceEeee
Confidence 589999999999988 44 48999999999999999999999999999999999998
No 91
>PLN00181 protein SPA1-RELATED; Provisional
Probab=97.18 E-value=0.00055 Score=84.77 Aligned_cols=59 Identities=12% Similarity=0.160 Sum_probs=53.2
Q ss_pred CCcccCCcEEEEEccC------CeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWE------GIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~------G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+|||.||+|+|||+.. +.+++.|.+|...+..++|+|+|.+||++..+|+|+||+....
T Consensus 676 vs~s~D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs~D~~v~iw~~~~~ 740 (793)
T PLN00181 676 VSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGSETNEVFVYHKAFP 740 (793)
T ss_pred EEEECCCEEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEeCCCEEEEEECCCC
Confidence 4899999999999985 4689999999999999999999999999999999999996543
No 92
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=97.15 E-value=0.00063 Score=76.98 Aligned_cols=58 Identities=21% Similarity=0.272 Sum_probs=54.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEee---cCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE---ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~---~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
||||-|-.|+||||+++.||.+|. ||+.-|+++-||+||..||++--+=+|.+|.+-.
T Consensus 152 ls~SkD~svRlwnI~~~~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmDhslk~W~l~~ 212 (385)
T KOG1034|consen 152 LSASKDHSVRLWNIQTDVCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMDHSLKLWRLNV 212 (385)
T ss_pred EEecCCceEEEEeccCCeEEEEecccccccCcEEEEEEcCCCCeeeccCCcceEEEEecCh
Confidence 589999999999999999999996 5789999999999999999999999999999864
No 93
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=97.14 E-value=0.00037 Score=80.50 Aligned_cols=58 Identities=17% Similarity=0.240 Sum_probs=54.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
||||.|++|.||+-.+|.+|.++.||...|--++|.| |=.+||++.++|+|.|||.+.
T Consensus 456 aSGSED~kvyIWhr~sgkll~~LsGHs~~vNcVswNP~~p~m~ASasDDgtIRIWg~~~ 514 (519)
T KOG0293|consen 456 ASGSEDSKVYIWHRISGKLLAVLSGHSKTVNCVSWNPADPEMFASASDDGTIRIWGPSD 514 (519)
T ss_pred EecCCCceEEEEEccCCceeEeecCCcceeeEEecCCCCHHHhhccCCCCeEEEecCCc
Confidence 6999999999999999999999999999999999999 557899999999999999764
No 94
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=97.14 E-value=0.00039 Score=78.35 Aligned_cols=62 Identities=18% Similarity=0.277 Sum_probs=58.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
||||.|-+|+|--+..|++++-|.||..-|-++.|++||.+|+++.++|++.+|..-+++..
T Consensus 322 LS~sfD~tvRiHGlKSGK~LKEfrGHsSyvn~a~ft~dG~~iisaSsDgtvkvW~~KtteC~ 383 (508)
T KOG0275|consen 322 LSASFDQTVRIHGLKSGKCLKEFRGHSSYVNEATFTDDGHHIISASSDGTVKVWHGKTTECL 383 (508)
T ss_pred hcccccceEEEeccccchhHHHhcCccccccceEEcCCCCeEEEecCCccEEEecCcchhhh
Confidence 69999999999999999999999999999999999999999999999999999997766543
No 95
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=97.13 E-value=0.00019 Score=82.79 Aligned_cols=92 Identities=18% Similarity=0.220 Sum_probs=75.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc---cCC---ccccccccC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ---KDA---KYDQFFLGD 74 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~---~~~---p~EQFF~tD 74 (1159)
+||++||+|+||++.+-.+|.-|.||...|-.++|.|+|++|+++.-+-++.|||+-++.-. .-- -+.--|+.|
T Consensus 235 at~s~Dgtvklw~~~~e~~l~~l~gH~~RVs~VafHPsG~~L~TasfD~tWRlWD~~tk~ElL~QEGHs~~v~~iaf~~D 314 (459)
T KOG0272|consen 235 ATASADGTVKLWKLSQETPLQDLEGHLARVSRVAFHPSGKFLGTASFDSTWRLWDLETKSELLLQEGHSKGVFSIAFQPD 314 (459)
T ss_pred eeeccCCceeeeccCCCcchhhhhcchhhheeeeecCCCceeeecccccchhhcccccchhhHhhcccccccceeEecCC
Confidence 58999999999999999999999999999999999999999999999999999999887522 211 233458888
Q ss_pred Ccccee---cCCCCccccccC
Q 001090 75 YRPLVQ---DTYGNVLDQETQ 92 (1159)
Q Consensus 75 YrpLir---D~~g~vlDe~Tq 92 (1159)
.--+.. |+-|-|-|--|.
T Consensus 315 GSL~~tGGlD~~~RvWDlRtg 335 (459)
T KOG0272|consen 315 GSLAATGGLDSLGRVWDLRTG 335 (459)
T ss_pred CceeeccCccchhheeecccC
Confidence 854443 666777776554
No 96
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=97.12 E-value=0.00037 Score=81.43 Aligned_cols=58 Identities=17% Similarity=0.313 Sum_probs=54.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
++|++||.++||+. .|.++.+|..|.++||.+||+-+|++|++++.+|++.||+.-+|
T Consensus 251 atG~~~G~~riw~~-~G~l~~tl~~HkgPI~slKWnk~G~yilS~~vD~ttilwd~~~g 308 (524)
T KOG0273|consen 251 ATGSEDGEARIWNK-DGNLISTLGQHKGPIFSLKWNKKGTYILSGGVDGTTILWDAHTG 308 (524)
T ss_pred EEeecCcEEEEEec-CchhhhhhhccCCceEEEEEcCCCCEEEeccCCccEEEEeccCc
Confidence 47999999999997 78899999999999999999999999999999999999997554
No 97
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=97.10 E-value=0.00099 Score=73.76 Aligned_cols=57 Identities=21% Similarity=0.310 Sum_probs=48.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
|+|+-|.+|++||+.+|+|+.......+-|+ +.|||||.++|+++.+-.|.+++.-.
T Consensus 81 atas~dk~ir~wd~r~~k~~~~i~~~~eni~-i~wsp~g~~~~~~~kdD~it~id~r~ 137 (313)
T KOG1407|consen 81 ATASGDKTIRIWDIRSGKCTARIETKGENIN-ITWSPDGEYIAVGNKDDRITFIDART 137 (313)
T ss_pred EEecCCceEEEEEeccCcEEEEeeccCcceE-EEEcCCCCEEEEecCcccEEEEEecc
Confidence 5789999999999999999987765444443 68999999999999999999999654
No 98
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=97.10 E-value=0.0011 Score=74.57 Aligned_cols=95 Identities=17% Similarity=0.213 Sum_probs=68.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC---CC--CccCC--cc----cc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ---GE--SQKDA--KY----DQ 69 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~---g~--~~~~~--p~----EQ 69 (1159)
||||.|++|+|||+..-.|.-.+.-..-+ =+||.|.|..||++-..+.|.||++-+ |. .+... .. +.
T Consensus 116 lS~S~D~tvrLWDlR~~~cqg~l~~~~~p--i~AfDp~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l 193 (311)
T KOG1446|consen 116 LSSSLDKTVRLWDLRVKKCQGLLNLSGRP--IAAFDPEGLIFALANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTDL 193 (311)
T ss_pred EecccCCeEEeeEecCCCCceEEecCCCc--ceeECCCCcEEEEecCCCeEEEEEecccCCCCceeEccCCCCccceeee
Confidence 69999999999999977776555433222 268999999999999999999999844 21 22211 11 12
Q ss_pred ccccCCcc-ceecCCC--CccccccCcCCcc
Q 001090 70 FFLGDYRP-LVQDTYG--NVLDQETQLAPHR 97 (1159)
Q Consensus 70 FF~tDYrp-LirD~~g--~vlDe~Tq~~PHL 97 (1159)
=|++|.+- ||-..++ ++||+=+.+.++-
T Consensus 194 ~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~t 224 (311)
T KOG1446|consen 194 EFSPDGKSILLSTNASFIYLLDAFDGTVKST 224 (311)
T ss_pred EEcCCCCEEEEEeCCCcEEEEEccCCcEeee
Confidence 59999994 4556666 7999988886554
No 99
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=97.07 E-value=0.00087 Score=81.08 Aligned_cols=56 Identities=16% Similarity=0.217 Sum_probs=54.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G 57 (1159)
+|+|+||.|++||+ +|++|+.+.+|..-||.+..++++..|+++-++|+|.||..+
T Consensus 194 lScsNDg~Ir~w~~-~ge~l~~~~ghtn~vYsis~~~~~~~Ivs~gEDrtlriW~~~ 249 (745)
T KOG0301|consen 194 LSCSNDGSIRLWDL-DGEVLLEMHGHTNFVYSISMALSDGLIVSTGEDRTLRIWKKD 249 (745)
T ss_pred EeecCCceEEEEec-cCceeeeeeccceEEEEEEecCCCCeEEEecCCceEEEeecC
Confidence 68999999999999 999999999999999999999999999999999999999876
No 100
>KOG1828 consensus IRF-2-binding protein CELTIX-1, contains BROMO domain [Transcription]
Probab=97.02 E-value=8.1e-05 Score=85.14 Aligned_cols=72 Identities=18% Similarity=0.125 Sum_probs=65.5
Q ss_pred hccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHH
Q 001090 1073 LKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTR 1144 (1159)
Q Consensus 1073 L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~ 1144 (1159)
+..-.-+|+|.+||+.|||+.||+.+++-++|-++.++..|.++|..||..||.+++.+...|++|..+-..
T Consensus 43 plt~~map~y~~iis~Pmd~~t~r~kidd~~yl~L~~m~~d~kl~~~na~~yn~~~Tv~~~aaKrL~~v~~~ 114 (418)
T KOG1828|consen 43 PLTDKMAPNYLEIISEPMDRITKRSKIDDTRYLVLSQMEFDRKLPDGNATLYNLHPTVPIVAAKRLCPVRLG 114 (418)
T ss_pred ccchhhccchHhhhhcccccccccccCCCccceechhhhhhhcccccchhhhhcCCccccccccccchhhcc
Confidence 334445899999999999999999999999999999999999999999999999999999999999876543
No 101
>KOG1828 consensus IRF-2-binding protein CELTIX-1, contains BROMO domain [Transcription]
Probab=97.01 E-value=0.00033 Score=80.35 Aligned_cols=65 Identities=8% Similarity=0.008 Sum_probs=60.6
Q ss_pred cccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHH
Q 001090 1075 QVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSD 1140 (1159)
Q Consensus 1075 ~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d 1140 (1159)
....+|.|..+|++|+|.+|++.+..+++|.| -+|..|+.||+.||.+||++...+++.|+.+..
T Consensus 234 tas~aP~YSm~Ik~~~~~~Tygdk~~andy~S-~~f~~D~kl~~l~amT~gehsk~yyelank~lh 298 (418)
T KOG1828|consen 234 TASFAPGYSMTITEVEPPGTYGDKSSANDYES-LSFTQDRKLIALKAVTNGEHSKSYYELANKQLH 298 (418)
T ss_pred hhhhcccccccccccCCCcchhhhhhhhhhhh-hhhhcccchhhHHHHhcCCcchHHHHHHHhhhh
Confidence 34568999999999999999999999999999 999999999999999999999999999988765
No 102
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=97.01 E-value=0.00064 Score=76.66 Aligned_cols=60 Identities=13% Similarity=0.144 Sum_probs=56.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+.|..||.|.|||+.|-.+-|+|.+|--+|-++||||||+.|.++..+-.+.+|++-.|.
T Consensus 39 AvGc~nG~vvI~D~~T~~iar~lsaH~~pi~sl~WS~dgr~LltsS~D~si~lwDl~~gs 98 (405)
T KOG1273|consen 39 AVGCANGRVVIYDFDTFRIARMLSAHVRPITSLCWSRDGRKLLTSSRDWSIKLWDLLKGS 98 (405)
T ss_pred eeeccCCcEEEEEccccchhhhhhccccceeEEEecCCCCEeeeecCCceeEEEeccCCC
Confidence 468899999999999999999999999999999999999999999999999999987775
No 103
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.98 E-value=0.0008 Score=73.70 Aligned_cols=60 Identities=17% Similarity=0.361 Sum_probs=58.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|++|.|.+|+||+.-+|.+|+.|.+|.-.|+|++-|-|...||++-.+-.+.+|++.+|+
T Consensus 33 ltcGsdrtvrLWNp~rg~liktYsghG~EVlD~~~s~Dnskf~s~GgDk~v~vwDV~TGk 92 (307)
T KOG0316|consen 33 LTCGSDRTVRLWNPLRGALIKTYSGHGHEVLDAALSSDNSKFASCGGDKAVQVWDVNTGK 92 (307)
T ss_pred EEcCCCceEEeecccccceeeeecCCCceeeeccccccccccccCCCCceEEEEEcccCe
Confidence 589999999999999999999999999999999999999999999999999999999886
No 104
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=96.97 E-value=0.0018 Score=67.37 Aligned_cols=57 Identities=12% Similarity=0.149 Sum_probs=47.5
Q ss_pred ccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec-CceeEEEEecCCCCC
Q 001090 4 GYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD-DVGQLYILNTGQGES 61 (1159)
Q Consensus 4 G~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~~G~l~Ifg~G~g~~ 61 (1159)
+.++.|.+||+.+|+.++.+..+ ..+++++|+|||.+|+++. ..|.|.||++..++.
T Consensus 226 ~~~~~i~v~d~~~~~~~~~~~~~-~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~~~~ 283 (300)
T TIGR03866 226 GPANRVAVVDAKTYEVLDYLLVG-QRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAALKV 283 (300)
T ss_pred CCCCeEEEEECCCCcEEEEEEeC-CCcceEEECCCCCEEEEEcCCCCeEEEEECCCCcE
Confidence 44678999999999998776543 3688999999999999875 589999999988763
No 105
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=96.94 E-value=0.0014 Score=74.93 Aligned_cols=60 Identities=22% Similarity=0.316 Sum_probs=56.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||-|-.-+|||+.+|..+-...+|.-.|..+.||-||++||+||-.|+|.||..-++.
T Consensus 80 aTGGgDD~AflW~~~~ge~~~eltgHKDSVt~~~FshdgtlLATGdmsG~v~v~~~stg~ 139 (399)
T KOG0296|consen 80 ATGGGDDLAFLWDISTGEFAGELTGHKDSVTCCSFSHDGTLLATGDMSGKVLVFKVSTGG 139 (399)
T ss_pred EecCCCceEEEEEccCCcceeEecCCCCceEEEEEccCceEEEecCCCccEEEEEcccCc
Confidence 589999999999999999887778999999999999999999999999999999987765
No 106
>KOG0008 consensus Transcription initiation factor TFIID, subunit TAF1 [Transcription]
Probab=96.93 E-value=0.00093 Score=85.34 Aligned_cols=76 Identities=14% Similarity=0.296 Sum_probs=68.1
Q ss_pred hhhhhhccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHH
Q 001090 1068 YGVQKLKQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVT 1143 (1159)
Q Consensus 1068 yGI~~L~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~ 1143 (1159)
+.+...++.+++++||.||..||||.|+|+.+..+-|-.-+.|..|+.+|++|..+||+|.+.+...|..+-...-
T Consensus 1280 ~~f~~Pv~~k~v~dyy~vi~~P~~lq~~kk~v~kr~y~~r~~fle~~~~~~~ns~~yng~~~~~t~~~q~mls~~~ 1355 (1563)
T KOG0008|consen 1280 YPFPTPVNAKEVKDYYRVITPPMDLQTQKKLVRKRLYESREHFLEELPLIVSNSTKYNGPLASLTRQQQSMLSLCF 1355 (1563)
T ss_pred cCCCCccchhhccchhhccCCCcchHHHHHHHHHHHHHHHHHHHHHhHHHhhchhhhcCchHHHHHHHHHHHHHHH
Confidence 3455667778899999999999999999999999999999999999999999999999999999999987655543
No 107
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=96.93 E-value=0.00034 Score=79.46 Aligned_cols=58 Identities=21% Similarity=0.250 Sum_probs=52.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+||+.|.+|+||||+.|.|||+++||.--|..+.| |-..|++|--+|+|.||++-.+.
T Consensus 374 VSGSSDntIRlwdi~~G~cLRvLeGHEeLvRciRF--d~krIVSGaYDGkikvWdl~aal 431 (499)
T KOG0281|consen 374 VSGSSDNTIRLWDIECGACLRVLEGHEELVRCIRF--DNKRIVSGAYDGKIKVWDLQAAL 431 (499)
T ss_pred EecCCCceEEEEeccccHHHHHHhchHHhhhheee--cCceeeeccccceEEEEeccccc
Confidence 48899999999999999999999999988999999 88999999999999999997765
No 108
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=96.91 E-value=0.00064 Score=78.59 Aligned_cols=61 Identities=16% Similarity=0.354 Sum_probs=55.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecC-cceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEIS-RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES 61 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g-~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~ 61 (1159)
+++|.|--++|||+.+|.+.+.|..+ ...+-+|+|-|||+.|++|..++++.+||+.-...
T Consensus 285 laCg~~e~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~pDg~~~V~Gs~dr~i~~wdlDgn~~ 346 (519)
T KOG0293|consen 285 LACGFDEVLSLWDVDTGDLRHLYPSGLGFSVSSCAWCPDGFRFVTGSPDRTIIMWDLDGNIL 346 (519)
T ss_pred EecCchHheeeccCCcchhhhhcccCcCCCcceeEEccCCceeEecCCCCcEEEecCCcchh
Confidence 47888999999999999999999866 67899999999999999999999999999965543
No 109
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=96.90 E-value=0.0017 Score=74.29 Aligned_cols=60 Identities=18% Similarity=0.247 Sum_probs=53.7
Q ss_pred CCcccCCcE-EEEEccCCeeeEEeecC--cceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKT-IVWDIWEGIPIRIYEIS--RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I-~lWDI~~G~~Ir~f~~g--~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||.-|+| +|+.|.+|..+.-|..| ...|+.++||||+++|+++.+.++|+||-++...
T Consensus 189 ATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~TeTVHiFKL~~~~ 251 (391)
T KOG2110|consen 189 ATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTETVHIFKLEKVS 251 (391)
T ss_pred EEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEecCCCeEEEEEecccc
Confidence 478888875 69999999999999877 7889999999999999999999999999988764
No 110
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.89 E-value=0.00087 Score=73.67 Aligned_cols=73 Identities=18% Similarity=0.215 Sum_probs=58.6
Q ss_pred CCcccCCcEEEEEccCCe--eeEEeecCcceeeeeeeCCC--CCEEEEecCceeEEEEecCCCCCccCCccccccccC
Q 001090 1 MSAGYDGKTIVWDIWEGI--PIRIYEISRFRLVDGKFSPD--GASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGD 74 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~--~Ir~f~~g~~~i~D~~FSPD--Gt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tD 74 (1159)
+|++|||+||||--++|. .+..|..|...|-+++|.|- |.+|||+.++|.|.||.+-.. -...++..+|=|.=
T Consensus 74 AScsYDgkVIiWke~~g~w~k~~e~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~~~~~-g~w~t~ki~~aH~~ 150 (299)
T KOG1332|consen 74 ASCSYDGKVIIWKEENGRWTKAYEHAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSS-GGWTTSKIVFAHEI 150 (299)
T ss_pred eEeecCceEEEEecCCCchhhhhhhhhhcccceeecccccccceEEEEeeCCCcEEEEEEcCC-CCccchhhhhcccc
Confidence 589999999999999997 34445688999999999997 678999999999999998654 23445666665543
No 111
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=96.88 E-value=0.0012 Score=74.17 Aligned_cols=142 Identities=23% Similarity=0.283 Sum_probs=87.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEE-EecCceeEEEEecCCCCCccCC--ccccc---cccC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASII-LSDDVGQLYILNTGQGESQKDA--KYDQF---FLGD 74 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a-~tD~~G~l~Ifg~G~g~~~~~~--p~EQF---F~tD 74 (1159)
+|+|.|.+|+.||+++|++++.|.+|..-|-.++=+.=|..|+ ++.++|++.+||+-+....+.. +|+|+ |.-+
T Consensus 106 ~S~gtDk~v~~wD~~tG~~~rk~k~h~~~vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~k~~~~t~~~kyqltAv~f~d~ 185 (338)
T KOG0265|consen 106 LSCGTDKTVRGWDAETGKRIRKHKGHTSFVNSLDPSRRGPQLVCSGSDDGTLKLWDIRKKEAIKTFENKYQLTAVGFKDT 185 (338)
T ss_pred EEecCCceEEEEecccceeeehhccccceeeecCccccCCeEEEecCCCceEEEEeecccchhhccccceeEEEEEeccc
Confidence 5899999999999999999999999877666666555565554 8899999999999876654433 56663 3333
Q ss_pred Cccce----------ecC---CC-Ccc----ccccCcCC------cccC-CCCCcccCCCCCCCch------hHHhhhhh
Q 001090 75 YRPLV----------QDT---YG-NVL----DQETQLAP------HRRN-LQDPLCDSAMIPYPEP------YQTMYQQR 123 (1159)
Q Consensus 75 YrpLi----------rD~---~g-~vl----De~Tq~~P------HL~~-~~~~L~D~~~~PYp~~------~Q~~~~~r 123 (1159)
--..+ .|- .+ |++ |.-|.+.- -|-| |-..|+=.+.-||.+. ||...-..
T Consensus 186 s~qv~sggIdn~ikvWd~r~~d~~~~lsGh~DtIt~lsls~~gs~llsnsMd~tvrvwd~rp~~p~~R~v~if~g~~hnf 265 (338)
T KOG0265|consen 186 SDQVISGGIDNDIKVWDLRKNDGLYTLSGHADTITGLSLSRYGSFLLSNSMDNTVRVWDVRPFAPSQRCVKIFQGHIHNF 265 (338)
T ss_pred ccceeeccccCceeeeccccCcceEEeecccCceeeEEeccCCCccccccccceEEEEEecccCCCCceEEEeecchhhh
Confidence 22221 221 11 111 22222110 0001 1123333456677763 34444455
Q ss_pred hhcccCccccCCccccccC
Q 001090 124 RLGALGIEWRPSSLKLAVG 142 (1159)
Q Consensus 124 rl~~l~~e~~p~~~~~~vg 142 (1159)
...-|+.-|-|-+-+++.|
T Consensus 266 eknlL~cswsp~~~~i~ag 284 (338)
T KOG0265|consen 266 EKNLLKCSWSPNGTKITAG 284 (338)
T ss_pred hhhcceeeccCCCCccccc
Confidence 6677888888887666666
No 112
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=96.87 E-value=0.0015 Score=76.30 Aligned_cols=62 Identities=16% Similarity=0.234 Sum_probs=52.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEee--------------------------------------------------------
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE-------------------------------------------------------- 24 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~-------------------------------------------------------- 24 (1159)
+|||.|.+++||||..|+.|.++.
T Consensus 192 ~TaS~D~t~k~wdlS~g~LLlti~fp~si~av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~ 271 (476)
T KOG0646|consen 192 YTASEDRTIKLWDLSLGVLLLTITFPSSIKAVALDPAERVVYIGTEEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVL 271 (476)
T ss_pred EEecCCceEEEEEeccceeeEEEecCCcceeEEEcccccEEEecCCcceEEeeehhcCCcccccccccccccccceeeee
Confidence 489999999999999999887521
Q ss_pred -cCcc--eeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 25 -ISRF--RLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 25 -~g~~--~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
||.+ +|-.++.|=||+.|++||.+|++.||++++....
T Consensus 272 ~Gh~~~~~ITcLais~DgtlLlSGd~dg~VcvWdi~S~Q~i 312 (476)
T KOG0646|consen 272 VGHENESAITCLAISTDGTLLLSGDEDGKVCVWDIYSKQCI 312 (476)
T ss_pred ccccCCcceeEEEEecCccEEEeeCCCCCEEEEecchHHHH
Confidence 2234 8999999999999999999999999999987543
No 113
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.85 E-value=0.0016 Score=79.63 Aligned_cols=87 Identities=17% Similarity=0.229 Sum_probs=68.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccC--Cc-cccccccCCcc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKD--AK-YDQFFLGDYRP 77 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~--~p-~EQFF~tDYrp 77 (1159)
+||+.||+|+|||+.+|-|+.+|..|...|--++|+-+|..|.++.-+|++..||+--..-|.. .| +-|| .-
T Consensus 366 aTG~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~~~g~~llssSLDGtVRAwDlkRYrNfRTft~P~p~Qf-----sc 440 (893)
T KOG0291|consen 366 ATGAEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGTVRAWDLKRYRNFRTFTSPEPIQF-----SC 440 (893)
T ss_pred EeccCCCcEEEEeccCceEEEEeccCCCceEEEEEEecCCEEEEeecCCeEEeeeecccceeeeecCCCceee-----eE
Confidence 5899999999999999999999999999999999999999999999999999999876554432 22 3454 13
Q ss_pred ceecCCCCccccccC
Q 001090 78 LVQDTYGNVLDQETQ 92 (1159)
Q Consensus 78 LirD~~g~vlDe~Tq 92 (1159)
|--|..|-+|=+.+|
T Consensus 441 vavD~sGelV~AG~~ 455 (893)
T KOG0291|consen 441 VAVDPSGELVCAGAQ 455 (893)
T ss_pred EEEcCCCCEEEeecc
Confidence 444555554444444
No 114
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=96.81 E-value=0.0023 Score=74.54 Aligned_cols=82 Identities=17% Similarity=0.207 Sum_probs=67.8
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCccceec
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQD 81 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLirD 81 (1159)
+|.-||.|.|||+.++.-+..|-+|.++|..++||-+|-+||++-++|.+.+||+-.-+-++..+-+-|| .-..|.-|
T Consensus 364 tgt~d~~vkiwdlks~~~~a~Fpght~~vk~i~FsENGY~Lat~add~~V~lwDLRKl~n~kt~~l~~~~--~v~s~~fD 441 (506)
T KOG0289|consen 364 TGTPDGVVKIWDLKSQTNVAKFPGHTGPVKAISFSENGYWLATAADDGSVKLWDLRKLKNFKTIQLDEKK--EVNSLSFD 441 (506)
T ss_pred ccCCCceEEEEEcCCccccccCCCCCCceeEEEeccCceEEEEEecCCeEEEEEehhhcccceeeccccc--cceeEEEc
Confidence 5778999999999999999999999999999999999999999999999999999876655555444454 33345555
Q ss_pred CCCC
Q 001090 82 TYGN 85 (1159)
Q Consensus 82 ~~g~ 85 (1159)
..|-
T Consensus 442 ~SGt 445 (506)
T KOG0289|consen 442 QSGT 445 (506)
T ss_pred CCCC
Confidence 5554
No 115
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=96.72 E-value=0.0025 Score=73.34 Aligned_cols=58 Identities=21% Similarity=0.235 Sum_probs=50.4
Q ss_pred CCcccCCcEEEEEccCCe---eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGI---PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~---~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+|+|.||.|+||||..|. +| .+..|.+.|-=+-|+.+=.+||+|+++|.|.||++-+-
T Consensus 274 aScS~DgsIrIWDiRs~~~~~~~-~~kAh~sDVNVISWnr~~~lLasG~DdGt~~iwDLR~~ 334 (440)
T KOG0302|consen 274 ASCSCDGSIRIWDIRSGPKKAAV-STKAHNSDVNVISWNRREPLLASGGDDGTLSIWDLRQF 334 (440)
T ss_pred EeeecCceEEEEEecCCCcccee-EeeccCCceeeEEccCCcceeeecCCCceEEEEEhhhc
Confidence 589999999999999993 33 22577888999999999999999999999999998664
No 116
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=96.70 E-value=0.0035 Score=71.07 Aligned_cols=111 Identities=23% Similarity=0.231 Sum_probs=80.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC--CccCCc------ccc---
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE--SQKDAK------YDQ--- 69 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~--~~~~~p------~EQ--- 69 (1159)
|||+.|-+|+|||+. |+.|......+...++++.||||+.||++-=.--+.+|..=.++ .++.+. .-|
T Consensus 203 msas~dt~i~lw~lk-Gq~L~~idtnq~~n~~aavSP~GRFia~~gFTpDVkVwE~~f~kdG~fqev~rvf~LkGH~saV 281 (420)
T KOG2096|consen 203 MSASLDTKICLWDLK-GQLLQSIDTNQSSNYDAAVSPDGRFIAVSGFTPDVKVWEPIFTKDGTFQEVKRVFSLKGHQSAV 281 (420)
T ss_pred EEecCCCcEEEEecC-CceeeeeccccccccceeeCCCCcEEEEecCCCCceEEEEEeccCcchhhhhhhheeccchhhe
Confidence 799999999999998 99999999999999999999999999998877777777763221 222111 112
Q ss_pred ---ccccCCcccee-cCCCC-------ccccccCcCCcccCCCCCcccCCCCCC
Q 001090 70 ---FFLGDYRPLVQ-DTYGN-------VLDQETQLAPHRRNLQDPLCDSAMIPY 112 (1159)
Q Consensus 70 ---FF~tDYrpLir-D~~g~-------vlDe~Tq~~PHL~~~~~~L~D~~~~PY 112 (1159)
-|+++.+.++. ...|- |-=+..|.|--|..-|-|||++.++|-
T Consensus 282 ~~~aFsn~S~r~vtvSkDG~wriwdtdVrY~~~qDpk~Lk~g~~pl~aag~~p~ 335 (420)
T KOG2096|consen 282 LAAAFSNSSTRAVTVSKDGKWRIWDTDVRYEAGQDPKILKEGSAPLHAAGSEPV 335 (420)
T ss_pred eeeeeCCCcceeEEEecCCcEEEeeccceEecCCCchHhhcCCcchhhcCCCce
Confidence 48888776654 22221 222445555555544569999999997
No 117
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=96.70 E-value=0.0026 Score=72.91 Aligned_cols=60 Identities=13% Similarity=0.169 Sum_probs=57.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|+|+|.+|+.||+.+|-|+++|.+++--|.-++-.-||+.||+++.+-+|.+|=++.+.
T Consensus 209 lS~srD~tik~We~~tg~cv~t~~~h~ewvr~v~v~~DGti~As~s~dqtl~vW~~~t~~ 268 (406)
T KOG0295|consen 209 LSCSRDNTIKAWECDTGYCVKTFPGHSEWVRMVRVNQDGTIIASCSNDQTLRVWVVATKQ 268 (406)
T ss_pred eecccccceeEEecccceeEEeccCchHhEEEEEecCCeeEEEecCCCceEEEEEeccch
Confidence 699999999999999999999999999999999999999999999999999999998873
No 118
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.57 E-value=0.0032 Score=76.87 Aligned_cols=64 Identities=11% Similarity=0.057 Sum_probs=59.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKD 64 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~ 64 (1159)
+|||+|.+..||+++.++.+.+|.||.-.||++.|||.-+.+|++..+++|.||.+...+..+.
T Consensus 479 AT~SqDktaKiW~le~~~l~~vLsGH~RGvw~V~Fs~~dq~laT~SgD~TvKIW~is~fSClkT 542 (775)
T KOG0319|consen 479 ATGSQDKTAKIWDLEQLRLLGVLSGHTRGVWCVSFSKNDQLLATCSGDKTVKIWSISTFSCLKT 542 (775)
T ss_pred EecccccceeeecccCceEEEEeeCCccceEEEEeccccceeEeccCCceEEEEEeccceeeee
Confidence 5899999999999999999999999999999999999999999999999999999987765543
No 119
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=96.57 E-value=0.0046 Score=72.03 Aligned_cols=72 Identities=17% Similarity=0.305 Sum_probs=59.4
Q ss_pred CCcccCCcEEEEEccCCe-eeEEeecCcceeeeeeeCCCC-CEEEEecCceeEEEEecCCC-CC-----ccCCccccccc
Q 001090 1 MSAGYDGKTIVWDIWEGI-PIRIYEISRFRLVDGKFSPDG-ASIILSDDVGQLYILNTGQG-ES-----QKDAKYDQFFL 72 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSPDG-t~~a~tD~~G~l~Ifg~G~g-~~-----~~~~p~EQFF~ 72 (1159)
+|||.||+|.|||+.+=. ++.+|.+|...|+-+-|||.- +.||++-.+|.|.||++..- +. .+..|.|.+|.
T Consensus 289 AT~S~D~tV~LwDlRnL~~~lh~~e~H~dev~~V~WSPh~etvLASSg~D~rl~vWDls~ig~eq~~eda~dgppEllF~ 368 (422)
T KOG0264|consen 289 ATGSADKTVALWDLRNLNKPLHTFEGHEDEVFQVEWSPHNETVLASSGTDRRLNVWDLSRIGEEQSPEDAEDGPPELLFI 368 (422)
T ss_pred EeccCCCcEEEeechhcccCceeccCCCcceEEEEeCCCCCceeEecccCCcEEEEeccccccccChhhhccCCcceeEE
Confidence 589999999999998866 889999999999999999986 55678889999999999762 22 24457776553
No 120
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=96.57 E-value=0.0043 Score=69.82 Aligned_cols=61 Identities=13% Similarity=0.223 Sum_probs=52.8
Q ss_pred CCcccCCcEEEEEccCCe-eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGI-PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES 61 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~ 61 (1159)
+|||+|..|+||.+..-+ -.-+..+|.++|.++.|.+||.+|.++..+-+|..||..+|+.
T Consensus 63 aSgG~Dr~I~LWnv~gdceN~~~lkgHsgAVM~l~~~~d~s~i~S~gtDk~v~~wD~~tG~~ 124 (338)
T KOG0265|consen 63 ASGGSDRAIVLWNVYGDCENFWVLKGHSGAVMELHGMRDGSHILSCGTDKTVRGWDAETGKR 124 (338)
T ss_pred eecCCcceEEEEeccccccceeeeccccceeEeeeeccCCCEEEEecCCceEEEEeccccee
Confidence 689999999999986444 2234458999999999999999999999999999999998873
No 121
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.54 E-value=0.0052 Score=67.57 Aligned_cols=60 Identities=15% Similarity=0.184 Sum_probs=57.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|+|-|..|.+|||.+|+.+|.|.+|-++|-.++|--+..-++++.=+-++.+|++-+..
T Consensus 75 ~s~GgDk~v~vwDV~TGkv~Rr~rgH~aqVNtV~fNeesSVv~SgsfD~s~r~wDCRS~s 134 (307)
T KOG0316|consen 75 ASCGGDKAVQVWDVNTGKVDRRFRGHLAQVNTVRFNEESSVVASGSFDSSVRLWDCRSRS 134 (307)
T ss_pred ccCCCCceEEEEEcccCeeeeecccccceeeEEEecCcceEEEeccccceeEEEEcccCC
Confidence 489999999999999999999999999999999999999999999999999999987654
No 122
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=96.52 E-value=0.0019 Score=73.01 Aligned_cols=57 Identities=16% Similarity=0.195 Sum_probs=53.2
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
||+.|-+-+||-|++|.||..|.||-|.|-.++|+|.|..++++..+|+-+||-...
T Consensus 165 tASADhTA~iWs~Esg~CL~~Y~GH~GSVNsikfh~s~~L~lTaSGD~taHIW~~av 221 (481)
T KOG0300|consen 165 TASADHTARIWSLESGACLATYTGHTGSVNSIKFHNSGLLLLTASGDETAHIWKAAV 221 (481)
T ss_pred ecccccceeEEeeccccceeeecccccceeeEEeccccceEEEccCCcchHHHHHhh
Confidence 789999999999999999999999999999999999999999999999888887443
No 123
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=96.42 E-value=0.0039 Score=78.35 Aligned_cols=58 Identities=16% Similarity=0.211 Sum_probs=55.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+|+|.|++|+|||..+.++++++.+|++.|--+.|-|=|++||+-.++.+|.||.++.
T Consensus 145 vS~s~DnsViiwn~~tF~~~~vl~~H~s~VKGvs~DP~Gky~ASqsdDrtikvwrt~d 202 (942)
T KOG0973|consen 145 VSVSLDNSVIIWNAKTFELLKVLRGHQSLVKGVSWDPIGKYFASQSDDRTLKVWRTSD 202 (942)
T ss_pred EEecccceEEEEccccceeeeeeecccccccceEECCccCeeeeecCCceEEEEEccc
Confidence 4889999999999999999999999999999999999999999999999999999765
No 124
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=96.42 E-value=0.0051 Score=70.54 Aligned_cols=60 Identities=23% Similarity=0.402 Sum_probs=56.6
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES 61 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~ 61 (1159)
-|+.||.|..|-|-++...++|.||...+-.++|.|||..++++.++|+|.+|+.-+|..
T Consensus 165 AG~~DGsvWmw~ip~~~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~dgti~~Wn~ktg~p 224 (399)
T KOG0296|consen 165 AGSTDGSVWMWQIPSQALCKVMSGHNSPCTCGEFIPDGKRILTGYDDGTIIVWNPKTGQP 224 (399)
T ss_pred eecCCCcEEEEECCCcceeeEecCCCCCcccccccCCCceEEEEecCceEEEEecCCCce
Confidence 478899999999999999999999999999999999999999999999999999988853
No 125
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=96.40 E-value=0.0029 Score=77.48 Aligned_cols=59 Identities=25% Similarity=0.366 Sum_probs=55.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+|||-||+|.+||+..-.-+.+|.+..-+|.|++||| +|..||++++.|.|.+||+-+-
T Consensus 150 iSGSQDg~vK~~DlR~~~S~~t~~~nSESiRDV~fsp~~~~~F~s~~dsG~lqlWDlRqp 209 (839)
T KOG0269|consen 150 ISGSQDGTVKCWDLRSKKSKSTFRSNSESIRDVKFSPGYGNKFASIHDSGYLQLWDLRQP 209 (839)
T ss_pred EecCCCceEEEEeeecccccccccccchhhhceeeccCCCceEEEecCCceEEEeeccCc
Confidence 5899999999999999999999999888999999999 7999999999999999998654
No 126
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=96.39 E-value=0.0083 Score=70.12 Aligned_cols=91 Identities=19% Similarity=0.190 Sum_probs=67.4
Q ss_pred CCcccCCcEEEEEccCCeeeEEee--cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE--ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPL 78 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~--~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpL 78 (1159)
+||+.||+.++-|+.+|.+|-+-. +..+.+-.++|.|||..|+++..+|.|-||++-++.-....|.. ..--+.+
T Consensus 319 lsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHpDgLifgtgt~d~~vkiwdlks~~~~a~Fpgh---t~~vk~i 395 (506)
T KOG0289|consen 319 LSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHPDGLIFGTGTPDGVVKIWDLKSQTNVAKFPGH---TGPVKAI 395 (506)
T ss_pred EEecCCceEEEEEccCCcEEEEEeeccccceeEEeeEcCCceEEeccCCCceEEEEEcCCccccccCCCC---CCceeEE
Confidence 589999999999999999887765 46788999999999999999999999999999776533333321 1112233
Q ss_pred eecCCCCccccccCcC
Q 001090 79 VQDTYGNVLDQETQLA 94 (1159)
Q Consensus 79 irD~~g~vlDe~Tq~~ 94 (1159)
--.+|||.|=..+...
T Consensus 396 ~FsENGY~Lat~add~ 411 (506)
T KOG0289|consen 396 SFSENGYWLATAADDG 411 (506)
T ss_pred EeccCceEEEEEecCC
Confidence 3357777555444444
No 127
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=96.39 E-value=0.0019 Score=72.93 Aligned_cols=62 Identities=18% Similarity=0.261 Sum_probs=55.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecC--cceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEIS--RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g--~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
+||++||.|+|||=..++||++|... ...|-++.|+-+|++|.++.-+-.+.+|.+++|...
T Consensus 277 vTaSkDG~IklwDGVS~rCv~t~~~AH~gsevcSa~Ftkn~kyiLsSG~DS~vkLWEi~t~R~l 340 (430)
T KOG0640|consen 277 VTASKDGAIKLWDGVSNRCVRTIGNAHGGSEVCSAVFTKNGKYILSSGKDSTVKLWEISTGRML 340 (430)
T ss_pred EEeccCCcEEeeccccHHHHHHHHhhcCCceeeeEEEccCCeEEeecCCcceeeeeeecCCceE
Confidence 47999999999999999999999853 457999999999999999999999999999888743
No 128
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=96.38 E-value=0.0061 Score=72.41 Aligned_cols=59 Identities=24% Similarity=0.309 Sum_probs=51.9
Q ss_pred CcccCCcEEEEEccCCeee--EEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPI--RIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~I--r~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
-||.||+|.|+-|..++.. .+...|.++|-+++|||||++||++|..+++.+|++.+.+
T Consensus 460 VGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~Da~rkvv~yd~~s~~ 520 (603)
T KOG0318|consen 460 VGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAGDASRKVVLYDVASRE 520 (603)
T ss_pred EecccceEEEEEecCCcccceeeeecccCCceEEEECCCCcEEEEeccCCcEEEEEcccCc
Confidence 3799999999999987742 3456799999999999999999999999999999987765
No 129
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=96.37 E-value=0.0056 Score=73.57 Aligned_cols=72 Identities=15% Similarity=0.208 Sum_probs=53.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCE-EEEecCceeEEEEecCCCCCccCCcc-cccccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGAS-IILSDDVGQLYILNTGQGESQKDAKY-DQFFLG 73 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~-~a~tD~~G~l~Ifg~G~g~~~~~~p~-EQFF~t 73 (1159)
+|||.||+|+||-|.||.||+++.-.. .|..++|.|.+.. |.++-..+.++|+..+.|.....-+. |.||..
T Consensus 416 asGsdDGtvriWEi~TgRcvr~~~~d~-~I~~vaw~P~~~~~vLAvA~~~~~~ivnp~~G~~~e~~~t~ell~~~ 489 (733)
T KOG0650|consen 416 ASGSDDGTVRIWEIATGRCVRTVQFDS-EIRSVAWNPLSDLCVLAVAVGECVLIVNPIFGDRLEVGPTKELLASA 489 (733)
T ss_pred eecCCCCcEEEEEeecceEEEEEeecc-eeEEEEecCCCCceeEEEEecCceEEeCccccchhhhcchhhhhhcC
Confidence 589999999999999999999987433 8999999999864 33444445599999988865444443 334443
No 130
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.33 E-value=0.0054 Score=74.23 Aligned_cols=62 Identities=16% Similarity=0.182 Sum_probs=58.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
++||.-+-|+|||=.+++.|..+.||.--|.++-.++||+.++++.++|+|.+|++|+...+
T Consensus 187 vsGgtek~lr~wDprt~~kimkLrGHTdNVr~ll~~dDGt~~ls~sSDgtIrlWdLgqQrCl 248 (735)
T KOG0308|consen 187 VSGGTEKDLRLWDPRTCKKIMKLRGHTDNVRVLLVNDDGTRLLSASSDGTIRLWDLGQQRCL 248 (735)
T ss_pred EecCcccceEEeccccccceeeeeccccceEEEEEcCCCCeEeecCCCceEEeeecccccee
Confidence 57999999999999999999999999999999999999999999999999999999997643
No 131
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=96.26 E-value=0.0053 Score=75.51 Aligned_cols=59 Identities=14% Similarity=0.220 Sum_probs=53.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
||||-|.||+||++.+-.||++|. |..=|-.++|.| |-.+|++|.-+|++.||++=..+
T Consensus 384 LSSSMDKTVRLWh~~~~~CL~~F~-HndfVTcVaFnPvDDryFiSGSLD~KvRiWsI~d~~ 443 (712)
T KOG0283|consen 384 LSSSMDKTVRLWHPGRKECLKVFS-HNDFVTCVAFNPVDDRYFISGSLDGKVRLWSISDKK 443 (712)
T ss_pred EeccccccEEeecCCCcceeeEEe-cCCeeEEEEecccCCCcEeecccccceEEeecCcCe
Confidence 699999999999999999999996 677799999999 89999999999999999975443
No 132
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=96.24 E-value=0.0072 Score=70.24 Aligned_cols=60 Identities=27% Similarity=0.415 Sum_probs=47.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecC---------------------------------------------cceeeeeee
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEIS---------------------------------------------RFRLVDGKF 35 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g---------------------------------------------~~~i~D~~F 35 (1159)
+|+=.|++|++||+.++.+++.-..| ...+--+.|
T Consensus 316 ~SgH~DkkvRfwD~Rs~~~~~sv~~gg~vtSl~ls~~g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~asDwtrvvf 395 (459)
T KOG0288|consen 316 ISGHFDKKVRFWDIRSADKTRSVPLGGRVTSLDLSMDGLELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCASDWTRVVF 395 (459)
T ss_pred eecccccceEEEeccCCceeeEeecCcceeeEeeccCCeEEeeecCCCceeeeecccccEEEEeeccccccccccceeEE
Confidence 46667999999998888877753210 123556789
Q ss_pred CCCCCEEEEecCceeEEEEecCCCC
Q 001090 36 SPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 36 SPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
||||.++|+|..+|.|+||.+-.++
T Consensus 396 Spd~~YvaAGS~dgsv~iW~v~tgK 420 (459)
T KOG0288|consen 396 SPDGSYVAAGSADGSVYIWSVFTGK 420 (459)
T ss_pred CCCCceeeeccCCCcEEEEEccCce
Confidence 9999999999999999999997775
No 133
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=96.24 E-value=0.0045 Score=69.64 Aligned_cols=60 Identities=15% Similarity=0.245 Sum_probs=53.2
Q ss_pred CCcccCCcEEEEEccCC-eeeEEeecC--------------cceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEG-IPIRIYEIS--------------RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G-~~Ir~f~~g--------------~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||.||.|+||||.+- -|++++.+| .++|+.+||+.||.++++..-+-++.+|....|+
T Consensus 205 atgsaDg~irlWDiRrasgcf~~lD~hn~k~~p~~~~n~ah~gkvngla~tSd~~~l~~~gtd~r~r~wn~~~G~ 279 (397)
T KOG4283|consen 205 ATGSADGAIRLWDIRRASGCFRVLDQHNTKRPPILKTNTAHYGKVNGLAWTSDARYLASCGTDDRIRVWNMESGR 279 (397)
T ss_pred EecCCCceEEEEEeecccceeEEeecccCccCccccccccccceeeeeeecccchhhhhccCccceEEeecccCc
Confidence 48999999999999988 488888744 4889999999999999999999999999987776
No 134
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=96.22 E-value=0.0048 Score=74.47 Aligned_cols=58 Identities=16% Similarity=0.208 Sum_probs=53.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+||+|||+|.|||+.+|+|++++.+|.+.|+++.|.+- .++++|.-+++|.+|++...
T Consensus 345 vsgs~d~~v~VW~~~~~~cl~sl~gH~~~V~sl~~~~~-~~~~Sgs~D~~IkvWdl~~~ 402 (537)
T KOG0274|consen 345 VSGSYDGTVKVWDPRTGKCLKSLSGHTGRVYSLIVDSE-NRLLSGSLDTTIKVWDLRTK 402 (537)
T ss_pred EEEecCceEEEEEhhhceeeeeecCCcceEEEEEecCc-ceEEeeeeccceEeecCCch
Confidence 48999999999999999999999999999999977554 88998888899999999876
No 135
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=96.17 E-value=0.015 Score=40.39 Aligned_cols=40 Identities=20% Similarity=0.303 Sum_probs=36.2
Q ss_pred CCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 16 EGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 16 ~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
++.++..|..+...|..++|.|++..|+++..+|++.+|+
T Consensus 1 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASASDDGTIKLWD 40 (40)
T ss_pred CcEEEEEEEecCCceeEEEECCCCCEEEEecCCCeEEEcC
Confidence 3678888888888999999999999999999999999984
No 136
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.14 E-value=0.0084 Score=70.64 Aligned_cols=106 Identities=18% Similarity=0.224 Sum_probs=78.3
Q ss_pred CCcccCCcEEEEEccCCe-eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCcccc----------
Q 001090 1 MSAGYDGKTIVWDIWEGI-PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQ---------- 69 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQ---------- 69 (1159)
+||||||+|+|||+..-. .+..| .|.++|-++.|=|-|..||++-. -.+.+||+=.|.. .++.+
T Consensus 170 vtGsYDg~vrl~DtR~~~~~v~el-nhg~pVe~vl~lpsgs~iasAgG-n~vkVWDl~~G~q---ll~~~~~H~KtVTcL 244 (487)
T KOG0310|consen 170 VTGSYDGKVRLWDTRSLTSRVVEL-NHGCPVESVLALPSGSLIASAGG-NSVKVWDLTTGGQ---LLTSMFNHNKTVTCL 244 (487)
T ss_pred EecCCCceEEEEEeccCCceeEEe-cCCCceeeEEEcCCCCEEEEcCC-CeEEEEEecCCce---ehhhhhcccceEEEE
Confidence 589999999999998874 44333 46678999999999999997643 3688999865431 22222
Q ss_pred ccccCCcccee---cCCCCccccccCcCCcccCCCCCcccCCCCC
Q 001090 70 FFLGDYRPLVQ---DTYGNVLDQETQLAPHRRNLQDPLCDSAMIP 111 (1159)
Q Consensus 70 FF~tDYrpLir---D~~g~vlDe~Tq~~PHL~~~~~~L~D~~~~P 111 (1159)
=|.+|..-|+- |.+=.|-|--+-..-|-+.+|+++++..-.|
T Consensus 245 ~l~s~~~rLlS~sLD~~VKVfd~t~~Kvv~s~~~~~pvLsiavs~ 289 (487)
T KOG0310|consen 245 RLASDSTRLLSGSLDRHVKVFDTTNYKVVHSWKYPGPVLSIAVSP 289 (487)
T ss_pred EeecCCceEeecccccceEEEEccceEEEEeeecccceeeEEecC
Confidence 26677777776 6666678866667788888888888876666
No 137
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=96.08 E-value=0.0096 Score=68.39 Aligned_cols=60 Identities=10% Similarity=0.029 Sum_probs=56.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+||+.|++|.||||.+|+...++.||-..|..++||+=-.+|-++-+++++.-||+...+
T Consensus 167 ~tgs~DrtikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~gedk~VKCwDLe~nk 226 (460)
T KOG0285|consen 167 ATGSADRTIKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSAGEDKQVKCWDLEYNK 226 (460)
T ss_pred EecCCCceeEEEEcccCeEEEeecchhheeeeeeecccCceEEEecCCCeeEEEechhhh
Confidence 489999999999999999999999999999999999999999999999999999987654
No 138
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=96.04 E-value=0.017 Score=59.33 Aligned_cols=59 Identities=25% Similarity=0.446 Sum_probs=50.5
Q ss_pred Ccc-cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCC-EEEEecCceeEEEEecCCCC
Q 001090 2 SAG-YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGA-SIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG-~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt-~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+++ .||+|.+||+.++..+..+.++...|..++|+|+|. .++++..+|.|.+|++..+.
T Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~d~~i~~wd~~~~~ 232 (466)
T COG2319 172 SGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGK 232 (466)
T ss_pred ecCCCCCceEEEEcCCCceEEeeccCCCceEEEEEcCCcceEEEEecCCCcEEEEECCCCc
Confidence 345 499999999999999999999999999999999999 44455999999999766443
No 139
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=96.04 E-value=0.0051 Score=75.99 Aligned_cols=58 Identities=24% Similarity=0.170 Sum_probs=54.4
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|..|=.|+|.|+.+-+.+|.|.||...|-|++|||||+.||+++-+|+|.+||+-++.
T Consensus 552 ~~ddf~I~vvD~~t~kvvR~f~gh~nritd~~FS~DgrWlisasmD~tIr~wDlpt~~ 609 (910)
T KOG1539|consen 552 ALDDFSIRVVDVVTRKVVREFWGHGNRITDMTFSPDGRWLISASMDSTIRTWDLPTGT 609 (910)
T ss_pred hcCceeEEEEEchhhhhhHHhhccccceeeeEeCCCCcEEEEeecCCcEEEEeccCcc
Confidence 5567789999999999999999999999999999999999999999999999987764
No 140
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=95.99 E-value=0.0091 Score=69.81 Aligned_cols=57 Identities=14% Similarity=0.198 Sum_probs=52.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCC-CCEEEEecCceeEEEEecC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPD-GASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPD-Gt~~a~tD~~G~l~Ifg~G 57 (1159)
+|||.|.+|+||||.+|+|-+++.-+...|-.++|.|- ++.|.+|.-+|++-|+++-
T Consensus 260 aSgsaD~TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R 317 (463)
T KOG0270|consen 260 ASGSADKTVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCR 317 (463)
T ss_pred EecCCCceEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEeccccceEEeeecc
Confidence 59999999999999999999999988889999999995 6777799999999999986
No 141
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=95.97 E-value=0.0064 Score=71.93 Aligned_cols=77 Identities=21% Similarity=0.242 Sum_probs=68.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC-CccCCccccccccCCcc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE-SQKDAKYDQFFLGDYRP 77 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~-~~~~~p~EQFF~tDYrp 77 (1159)
+|+..||+|.||||.+-..+|.|.||...+..+-.|+||+.|=+|--+-++.-||+-.+. .+..-..-|.|+--|-|
T Consensus 525 FsccsdGnI~vwDLhnq~~VrqfqGhtDGascIdis~dGtklWTGGlDntvRcWDlregrqlqqhdF~SQIfSLg~cP 602 (705)
T KOG0639|consen 525 FSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISKDGTKLWTGGLDNTVRCWDLREGRQLQQHDFSSQIFSLGYCP 602 (705)
T ss_pred eeeccCCcEEEEEcccceeeecccCCCCCceeEEecCCCceeecCCCccceeehhhhhhhhhhhhhhhhhheecccCC
Confidence 467889999999999999999999999999999999999999999999999999998886 44455667888877765
No 142
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=95.97 E-value=0.012 Score=68.02 Aligned_cols=59 Identities=22% Similarity=0.445 Sum_probs=52.0
Q ss_pred CCcccCCcEEEEEcc---CCeeeEEeecCcceeeeeeeCCC-CCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIW---EGIPIRIYEISRFRLVDGKFSPD-GASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~---~G~~Ir~f~~g~~~i~D~~FSPD-Gt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+|||.||++.||||. .|.+|..|..|..+|..+.|+|- -..||++.++-+|.||++..-
T Consensus 318 asG~DdGt~~iwDLR~~~~~~pVA~fk~Hk~pItsieW~p~e~s~iaasg~D~QitiWDlsvE 380 (440)
T KOG0302|consen 318 ASGGDDGTLSIWDLRQFKSGQPVATFKYHKAPITSIEWHPHEDSVIAASGEDNQITIWDLSVE 380 (440)
T ss_pred eecCCCceEEEEEhhhccCCCcceeEEeccCCeeEEEeccccCceEEeccCCCcEEEEEeecc
Confidence 589999999999986 47799999999999999999996 456678889999999998654
No 143
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=95.95 E-value=0.01 Score=67.38 Aligned_cols=60 Identities=20% Similarity=0.242 Sum_probs=56.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCC--EEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGA--SIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt--~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||.|-+|.|+|+.+-..+..+.-|.+.|-.++|.|+-. +|+++.++|+|.||++|+=+
T Consensus 57 aSGssDetI~IYDm~k~~qlg~ll~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~W~ 118 (362)
T KOG0294|consen 57 ASGSSDETIHIYDMRKRKQLGILLSHAGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGSWE 118 (362)
T ss_pred eccCCCCcEEEEeccchhhhcceeccccceEEEEecCCcchhheeeecCCCcEEEEEcCCeE
Confidence 5899999999999999999999999999999999999998 99999999999999999854
No 144
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=95.94 E-value=0.003 Score=77.02 Aligned_cols=56 Identities=20% Similarity=0.267 Sum_probs=53.2
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G 57 (1159)
.|..||+|+||||++++.+|.+.+|...+..+.|+|=|.++|.+..++-|-||++-
T Consensus 87 agsasgtiK~wDleeAk~vrtLtgh~~~~~sv~f~P~~~~~a~gStdtd~~iwD~R 142 (825)
T KOG0267|consen 87 AGSASGTIKVWDLEEAKIVRTLTGHLLNITSVDFHPYGEFFASGSTDTDLKIWDIR 142 (825)
T ss_pred ccccCCceeeeehhhhhhhhhhhccccCcceeeeccceEEeccccccccceehhhh
Confidence 57889999999999999999999999999999999999999999999999999974
No 145
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=95.92 E-value=0.015 Score=64.91 Aligned_cols=60 Identities=15% Similarity=0.261 Sum_probs=55.3
Q ss_pred CCcccCCcEEEEEccCCe--eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGI--PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~--~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|||+|+++.||-=..|+ |+.+++||...|..++||++|.+||++..+-.++||.+-.+.
T Consensus 77 a~aSFD~t~~Iw~k~~~efecv~~lEGHEnEVK~Vaws~sG~~LATCSRDKSVWiWe~dedd 138 (312)
T KOG0645|consen 77 ASASFDATVVIWKKEDGEFECVATLEGHENEVKCVAWSASGNYLATCSRDKSVWIWEIDEDD 138 (312)
T ss_pred EEeeccceEEEeecCCCceeEEeeeeccccceeEEEEcCCCCEEEEeeCCCeEEEEEecCCC
Confidence 489999999999988886 999999999999999999999999999999999999986443
No 146
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=95.92 E-value=0.014 Score=61.30 Aligned_cols=55 Identities=20% Similarity=0.281 Sum_probs=44.2
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec---CceeEEEEecCCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD---DVGQLYILNTGQGE 60 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD---~~G~l~Ifg~G~g~ 60 (1159)
|..+++|.|||+. |++|..|. ...+-.++|||+|++||++. ..|.|.||++-..+
T Consensus 79 g~~~~~v~lyd~~-~~~i~~~~--~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~~~ 136 (194)
T PF08662_consen 79 GSMPAKVTLYDVK-GKKIFSFG--TQPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVRKKK 136 (194)
T ss_pred ccCCcccEEEcCc-ccEeEeec--CCCceEEEECCCCCEEEEEEccCCCcEEEEEECCCCE
Confidence 4567899999996 88887775 45566899999999999874 45899999987544
No 147
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=95.92 E-value=0.016 Score=70.52 Aligned_cols=97 Identities=15% Similarity=0.200 Sum_probs=82.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC-CccCCcccc-------ccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE-SQKDAKYDQ-------FFL 72 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~-~~~~~p~EQ-------FF~ 72 (1159)
+|+|-+|.|+-||+.+|++++.++.-.++||.++.+|-++.++.+-++|.|++|..|.++ .|+....-| =|+
T Consensus 84 FS~g~sg~i~EwDl~~lk~~~~~d~~gg~IWsiai~p~~~~l~IgcddGvl~~~s~~p~~I~~~r~l~rq~sRvLslsw~ 163 (691)
T KOG2048|consen 84 FSSGLSGSITEWDLHTLKQKYNIDSNGGAIWSIAINPENTILAIGCDDGVLYDFSIGPDKITYKRSLMRQKSRVLSLSWN 163 (691)
T ss_pred EeecCCceEEEEecccCceeEEecCCCcceeEEEeCCccceEEeecCCceEEEEecCCceEEEEeecccccceEEEEEec
Confidence 589999999999999999999999999999999999999999999999999999999998 677666655 366
Q ss_pred cCCccceecCCCC---ccccccCcCCcc
Q 001090 73 GDYRPLVQDTYGN---VLDQETQLAPHR 97 (1159)
Q Consensus 73 tDYrpLirD~~g~---vlDe~Tq~~PHL 97 (1159)
+|-.-|+--..++ +-|..+++.-|+
T Consensus 164 ~~~~~i~~Gs~Dg~Iriwd~~~~~t~~~ 191 (691)
T KOG2048|consen 164 PTGTKIAGGSIDGVIRIWDVKSGQTLHI 191 (691)
T ss_pred CCccEEEecccCceEEEEEcCCCceEEE
Confidence 6666566654445 558888888885
No 148
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=95.89 E-value=0.007 Score=75.36 Aligned_cols=58 Identities=28% Similarity=0.456 Sum_probs=45.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeec----------------------C------------------------------cc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEI----------------------S------------------------------RF 28 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~----------------------g------------------------------~~ 28 (1159)
++++.||+|.||||++|++++++.+ | +.
T Consensus 154 Avss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~Vkvy~r~~we~~f~Lr~~~~ss 233 (933)
T KOG1274|consen 154 AVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNTVKVYSRKGWELQFKLRDKLSSS 233 (933)
T ss_pred EEEecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCCeEEEEccCCceeheeeccccccc
Confidence 4678899999999999998775320 0 23
Q ss_pred eeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 29 RLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 29 ~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
.+-++.|||.|++||+++-.|+|.||++.+
T Consensus 234 ~~~~~~wsPnG~YiAAs~~~g~I~vWnv~t 263 (933)
T KOG1274|consen 234 KFSDLQWSPNGKYIAASTLDGQILVWNVDT 263 (933)
T ss_pred ceEEEEEcCCCcEEeeeccCCcEEEEeccc
Confidence 377888888888888888888888888664
No 149
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=95.87 E-value=0.011 Score=72.29 Aligned_cols=60 Identities=20% Similarity=0.239 Sum_probs=55.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|+++.|+.+.|||+..--|+..|.+|...|--++|+|||+.+|++.++-.+-||+.-.|+
T Consensus 128 a~gStdtd~~iwD~Rk~Gc~~~~~s~~~vv~~l~lsP~Gr~v~~g~ed~tvki~d~~agk 187 (825)
T KOG0267|consen 128 ASGSTDTDLKIWDIRKKGCSHTYKSHTRVVDVLRLSPDGRWVASGGEDNTVKIWDLTAGK 187 (825)
T ss_pred ccccccccceehhhhccCceeeecCCcceeEEEeecCCCceeeccCCcceeeeecccccc
Confidence 689999999999999777889999988888888999999999999999999999987776
No 150
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=95.85 E-value=0.018 Score=65.06 Aligned_cols=56 Identities=16% Similarity=0.258 Sum_probs=48.9
Q ss_pred cCCcEEEEEccCCeeeEEeecCccee---eeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRL---VDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i---~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
-.+.+.|-|-.+|..+..|.+.+.+- .+++|+|||+.|.+|.++|+|+||++.+|.
T Consensus 207 ~~s~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~ 265 (311)
T KOG1446|consen 207 NASFIYLLDAFDGTVKSTFSGYPNAGNLPLSATFTPDSKFVLSGSDDGTIHVWNLETGK 265 (311)
T ss_pred CCCcEEEEEccCCcEeeeEeeccCCCCcceeEEECCCCcEEEEecCCCcEEEEEcCCCc
Confidence 35678889999999999998765443 899999999999999999999999997765
No 151
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=95.80 E-value=0.015 Score=66.90 Aligned_cols=61 Identities=15% Similarity=0.291 Sum_probs=57.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES 61 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~ 61 (1159)
+|||-|.+++||||.+-..|.++.||...|.++.|-|---+++++..+++|.+||+-.|++
T Consensus 251 ~t~grDst~RvWDiRtr~~V~~l~GH~~~V~~V~~~~~dpqvit~S~D~tvrlWDl~agkt 311 (460)
T KOG0285|consen 251 VTGGRDSTIRVWDIRTRASVHVLSGHTNPVASVMCQPTDPQVITGSHDSTVRLWDLRAGKT 311 (460)
T ss_pred EecCCcceEEEeeecccceEEEecCCCCcceeEEeecCCCceEEecCCceEEEeeeccCce
Confidence 5899999999999999999999999999999999999889999999999999999977764
No 152
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=95.77 E-value=0.031 Score=57.46 Aligned_cols=56 Identities=25% Similarity=0.398 Sum_probs=51.6
Q ss_pred cccCCcEEEEEccC-CeeeEEeecCcceeeeeeeCCCCCEEEEecC-ceeEEEEecCC
Q 001090 3 AGYDGKTIVWDIWE-GIPIRIYEISRFRLVDGKFSPDGASIILSDD-VGQLYILNTGQ 58 (1159)
Q Consensus 3 AG~DG~I~lWDI~~-G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~-~G~l~Ifg~G~ 58 (1159)
++.|+.|.+||+.. +..+..+.++...|.+++|+|+|.+++++.. +|.+.+|++..
T Consensus 130 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (466)
T COG2319 130 SSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRT 187 (466)
T ss_pred CCCCccEEEEEecCCCeEEEEEecCcccEEEEEECCCCCEEEecCCCCCceEEEEcCC
Confidence 34599999999999 9999999999999999999999999999986 99999999876
No 153
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=95.76 E-value=0.0089 Score=69.44 Aligned_cols=57 Identities=16% Similarity=0.261 Sum_probs=55.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G 57 (1159)
+|||-|..|.|||-.+|.||.++.+|..-|..++|+|+|..|+++..+-.+-+||+-
T Consensus 238 asgskDnlVKlWDprSg~cl~tlh~HKntVl~~~f~~n~N~Llt~skD~~~kv~DiR 294 (464)
T KOG0284|consen 238 ASGSKDNLVKLWDPRSGSCLATLHGHKNTVLAVKFNPNGNWLLTGSKDQSCKVFDIR 294 (464)
T ss_pred EEccCCceeEeecCCCcchhhhhhhccceEEEEEEcCCCCeeEEccCCceEEEEehh
Confidence 589999999999999999999999999999999999999999999999999999986
No 154
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=95.76 E-value=0.0081 Score=68.72 Aligned_cols=81 Identities=12% Similarity=0.112 Sum_probs=66.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCcccee
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQ 80 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLir 80 (1159)
+||.-|.+|.|||..+-.|++++.||.|.|+.+.| |-+.|++|.++-++.||++-.|+..+. .|-|+|----+|
T Consensus 211 VSGlrDnTikiWD~n~~~c~~~L~GHtGSVLCLqy--d~rviisGSSDsTvrvWDv~tge~l~t----lihHceaVLhlr 284 (499)
T KOG0281|consen 211 VSGLRDNTIKIWDKNSLECLKILTGHTGSVLCLQY--DERVIVSGSSDSTVRVWDVNTGEPLNT----LIHHCEAVLHLR 284 (499)
T ss_pred hcccccCceEEeccccHHHHHhhhcCCCcEEeeec--cceEEEecCCCceEEEEeccCCchhhH----HhhhcceeEEEE
Confidence 58999999999999999999999999999999999 778999999999999999999985542 344555544445
Q ss_pred cCCCCcc
Q 001090 81 DTYGNVL 87 (1159)
Q Consensus 81 D~~g~vl 87 (1159)
=.||+.|
T Consensus 285 f~ng~mv 291 (499)
T KOG0281|consen 285 FSNGYMV 291 (499)
T ss_pred EeCCEEE
Confidence 4555543
No 155
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=95.73 E-value=0.014 Score=67.83 Aligned_cols=61 Identities=16% Similarity=0.260 Sum_probs=51.5
Q ss_pred CCcccCCcEEEEEcc--------C--------CeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC
Q 001090 1 MSAGYDGKTIVWDIW--------E--------GIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES 61 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~--------~--------G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~ 61 (1159)
+|||.+|.|+||=.. + =...++..+|.-.|+|++|||||..++++.-+-.+++|+++.|..
T Consensus 81 ASg~D~g~v~lWk~~~~~~~~~d~e~~~~ke~w~v~k~lr~h~~diydL~Ws~d~~~l~s~s~dns~~l~Dv~~G~l 157 (434)
T KOG1009|consen 81 ASGGDGGEVFLWKQGDVRIFDADTEADLNKEKWVVKKVLRGHRDDIYDLAWSPDSNFLVSGSVDNSVRLWDVHAGQL 157 (434)
T ss_pred eecCCCceEEEEEecCcCCccccchhhhCccceEEEEEecccccchhhhhccCCCceeeeeeccceEEEEEecccee
Confidence 589999999999776 2 234445667788999999999999999999999999999988863
No 156
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=95.69 E-value=0.011 Score=68.56 Aligned_cols=58 Identities=22% Similarity=0.379 Sum_probs=49.9
Q ss_pred CCcccCCcEEEEEccCCeeeE---------EeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIR---------IYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir---------~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+|||-|.+|+||=|.++..=. ++.+|+.+|--+.|+|+|-.||++.+.|.++||=.+.
T Consensus 30 aT~G~D~~iriW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN~vRf~p~gelLASg~D~g~v~lWk~~~ 96 (434)
T KOG1009|consen 30 ATAGGDKDIRIWKVNRSEPGGGDMKVEYLSSLSRHTRAVNVVRFSPDGELLASGGDGGEVFLWKQGD 96 (434)
T ss_pred ecccCccceeeeeeeecCCCCCceeEEEeecccCCcceeEEEEEcCCcCeeeecCCCceEEEEEecC
Confidence 589999999999998876322 3447899999999999999999999999999996553
No 157
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=95.64 E-value=0.014 Score=72.13 Aligned_cols=56 Identities=23% Similarity=0.380 Sum_probs=49.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G 57 (1159)
+||+-||+|+||+|..-+++--++- +.-|-.+||+|||+..++|.-.|...+|.+-
T Consensus 426 iSGSLD~KvRiWsI~d~~Vv~W~Dl-~~lITAvcy~PdGk~avIGt~~G~C~fY~t~ 481 (712)
T KOG0283|consen 426 ISGSLDGKVRLWSISDKKVVDWNDL-RDLITAVCYSPDGKGAVIGTFNGYCRFYDTE 481 (712)
T ss_pred eecccccceEEeecCcCeeEeehhh-hhhheeEEeccCCceEEEEEeccEEEEEEcc
Confidence 6999999999999988887744443 3789999999999999999999999999874
No 158
>PRK01742 tolB translocation protein TolB; Provisional
Probab=95.62 E-value=0.024 Score=65.81 Aligned_cols=76 Identities=20% Similarity=0.094 Sum_probs=53.6
Q ss_pred ccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec---CceeEEEEecCCCCCc--cCCcc---ccccccCC
Q 001090 4 GYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD---DVGQLYILNTGQGESQ--KDAKY---DQFFLGDY 75 (1159)
Q Consensus 4 G~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD---~~G~l~Ifg~G~g~~~--~~~p~---EQFF~tDY 75 (1159)
.++..|.|||...+. .+.+.++...+.+.+|||||+.||.+. ...+|++|++..+... ...+. .--|+.|.
T Consensus 181 ~~~~~i~i~d~dg~~-~~~lt~~~~~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~~~~g~~~~~~wSPDG 259 (429)
T PRK01742 181 SQPYEVRVADYDGFN-QFIVNRSSQPLMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKVVASFRGHNGAPAFSPDG 259 (429)
T ss_pred CceEEEEEECCCCCC-ceEeccCCCccccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEEEecCCCccCceeECCCC
Confidence 357899999985444 666777888999999999999999763 3458999998766421 11111 12477787
Q ss_pred cccee
Q 001090 76 RPLVQ 80 (1159)
Q Consensus 76 rpLir 80 (1159)
+-|+.
T Consensus 260 ~~La~ 264 (429)
T PRK01742 260 SRLAF 264 (429)
T ss_pred CEEEE
Confidence 76654
No 159
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.57 E-value=0.02 Score=71.49 Aligned_cols=58 Identities=14% Similarity=0.177 Sum_probs=55.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+|||.|-||+||+++++.||.++.||.--|-.+.|.|---.|+++.-+-++.+||++.
T Consensus 109 lSASDDQTIrIWNwqsr~~iavltGHnHYVMcAqFhptEDlIVSaSLDQTVRVWDisG 166 (1202)
T KOG0292|consen 109 LSASDDQTIRIWNWQSRKCIAVLTGHNHYVMCAQFHPTEDLIVSASLDQTVRVWDISG 166 (1202)
T ss_pred EEccCCCeEEEEeccCCceEEEEecCceEEEeeccCCccceEEEecccceEEEEeecc
Confidence 6999999999999999999999999999999999999999999999999999999864
No 160
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=95.51 E-value=0.022 Score=68.68 Aligned_cols=57 Identities=21% Similarity=0.373 Sum_probs=50.6
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
-|.-||.|+|||..+|.-. +....+....++|.|||..|+++++.|.|-+||++-+.
T Consensus 276 lGC~DgSiiLyD~~~~~t~--~~ka~~~P~~iaWHp~gai~~V~s~qGelQ~FD~ALsp 332 (545)
T PF11768_consen 276 LGCEDGSIILYDTTRGVTL--LAKAEFIPTLIAWHPDGAIFVVGSEQGELQCFDMALSP 332 (545)
T ss_pred EEecCCeEEEEEcCCCeee--eeeecccceEEEEcCCCcEEEEEcCCceEEEEEeecCc
Confidence 3678999999999999654 55677889999999999999999999999999998764
No 161
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.51 E-value=0.033 Score=67.61 Aligned_cols=80 Identities=13% Similarity=0.214 Sum_probs=65.6
Q ss_pred CCcccCCcEEEEEccCCe-eeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCCCCCccC-------Ccccccc
Q 001090 1 MSAGYDGKTIVWDIWEGI-PIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQGESQKD-------AKYDQFF 71 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~g~~~~~-------~p~EQFF 71 (1159)
||+|.|-+|.+||.+.+- |.++|+||.--|-+++|-| |-.+||++.-++++.+|.+|+...-.. +-+=-||
T Consensus 113 LtsSDDm~iKlW~we~~wa~~qtfeGH~HyVMqv~fnPkD~ntFaS~sLDrTVKVWslgs~~~nfTl~gHekGVN~Vdyy 192 (794)
T KOG0276|consen 113 LTSSDDMTIKLWDWENEWACEQTFEGHEHYVMQVAFNPKDPNTFASASLDRTVKVWSLGSPHPNFTLEGHEKGVNCVDYY 192 (794)
T ss_pred EecCCccEEEEeeccCceeeeeEEcCcceEEEEEEecCCCccceeeeeccccEEEEEcCCCCCceeeeccccCcceEEec
Confidence 689999999999999998 8889999999999999999 678999999999999999999753221 1223377
Q ss_pred ccCCcccee
Q 001090 72 LGDYRPLVQ 80 (1159)
Q Consensus 72 ~tDYrpLir 80 (1159)
+.+-.|-+-
T Consensus 193 ~~gdkpylI 201 (794)
T KOG0276|consen 193 TGGDKPYLI 201 (794)
T ss_pred cCCCcceEE
Confidence 777665444
No 162
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=95.51 E-value=0.024 Score=64.03 Aligned_cols=58 Identities=14% Similarity=0.229 Sum_probs=49.7
Q ss_pred CcccCCcEEEEEccCCee--eEE-eecCcceeeeeeeCCCCC-EEEEecCceeEEEEecCCC
Q 001090 2 SAGYDGKTIVWDIWEGIP--IRI-YEISRFRLVDGKFSPDGA-SIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~--Ir~-f~~g~~~i~D~~FSPDGt-~~a~tD~~G~l~Ifg~G~g 59 (1159)
++|-|-+-.||||++|.. +++ ++.|.-.|+|++|+-+|+ .||++..+|.+.||++-.-
T Consensus 168 tSSiDTTCTiWdie~~~~~~vkTQLIAHDKEV~DIaf~~~s~~~FASvgaDGSvRmFDLR~l 229 (364)
T KOG0290|consen 168 TSSIDTTCTIWDIETGVSGTVKTQLIAHDKEVYDIAFLKGSRDVFASVGADGSVRMFDLRSL 229 (364)
T ss_pred eecccCeEEEEEEeeccccceeeEEEecCcceeEEEeccCccceEEEecCCCcEEEEEeccc
Confidence 568899999999999965 454 567888999999999886 6889999999999999764
No 163
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=95.50 E-value=0.0095 Score=67.25 Aligned_cols=48 Identities=19% Similarity=0.391 Sum_probs=45.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDV 48 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~ 48 (1159)
+|||.||.|.+||+.+-+.|+.|..-...|..++||-||..||.++++
T Consensus 248 aTgGsDG~V~~Wd~~~rKrl~q~~~~~~SI~slsfs~dG~~LAia~sy 295 (323)
T KOG1036|consen 248 ATGGSDGIVNIWDLFNRKRLKQLAKYETSISSLSFSMDGSLLAIASSY 295 (323)
T ss_pred EecCCCceEEEccCcchhhhhhccCCCCceEEEEeccCCCeEEEEech
Confidence 589999999999999999999999889999999999999999988764
No 164
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=95.50 E-value=0.023 Score=68.78 Aligned_cols=59 Identities=14% Similarity=0.200 Sum_probs=53.2
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
|||.|.+|++|||.+|.+++++.+|.++|.++... |..|++|..+|.|.+|++-.++..
T Consensus 306 sgs~D~tVkVW~v~n~~~l~l~~~h~~~V~~v~~~--~~~lvsgs~d~~v~VW~~~~~~cl 364 (537)
T KOG0274|consen 306 SGSRDNTVKVWDVTNGACLNLLRGHTGPVNCVQLD--EPLLVSGSYDGTVKVWDPRTGKCL 364 (537)
T ss_pred eccCCceEEEEeccCcceEEEeccccccEEEEEec--CCEEEEEecCceEEEEEhhhceee
Confidence 57899999999999999999999999999999997 999999999999999999876643
No 165
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=95.49 E-value=0.018 Score=72.73 Aligned_cols=59 Identities=20% Similarity=0.262 Sum_probs=52.0
Q ss_pred CCcccCCcEEEEEccC-------Ce-----------eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWE-------GI-----------PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~-------G~-----------~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+|||.|+.|.||+-.. |. ++.+|.+|.+.|.|++|||||.+||++.-++++.||+.=+.
T Consensus 85 AsGSDD~~v~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~tF 161 (942)
T KOG0973|consen 85 ASGSDDRLVMIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVNWSPDDSLLVSVSLDNSVIIWNAKTF 161 (942)
T ss_pred eeccCcceEEEeeecccCCcccccccccccccceeeEEEEEecCCCccceeccCCCccEEEEecccceEEEEccccc
Confidence 5899999999999984 22 67788899999999999999999999999999999985443
No 166
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=95.49 E-value=0.011 Score=67.06 Aligned_cols=77 Identities=18% Similarity=0.215 Sum_probs=63.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCC--------ccccccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDA--------KYDQFFL 72 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~--------p~EQFF~ 72 (1159)
++|++|.+-+||||++|+++.++.||...+--|+-.|.-+.++++..+-++.+|+|--+ ....+ -.---|.
T Consensus 288 vTaSWDRTAnlwDVEtge~v~~LtGHd~ELtHcstHptQrLVvTsSrDtTFRLWDFRea-I~sV~VFQGHtdtVTS~vF~ 366 (481)
T KOG0300|consen 288 VTASWDRTANLWDVETGEVVNILTGHDSELTHCSTHPTQRLVVTSSRDTTFRLWDFREA-IQSVAVFQGHTDTVTSVVFN 366 (481)
T ss_pred eeeeccccceeeeeccCceeccccCcchhccccccCCcceEEEEeccCceeEeccchhh-cceeeeecccccceeEEEEe
Confidence 47999999999999999999999999999999999999999999999999999998632 11111 1122688
Q ss_pred cCCccc
Q 001090 73 GDYRPL 78 (1159)
Q Consensus 73 tDYrpL 78 (1159)
+||+-+
T Consensus 367 ~dd~vV 372 (481)
T KOG0300|consen 367 TDDRVV 372 (481)
T ss_pred cCCcee
Confidence 888744
No 167
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.46 E-value=0.0099 Score=73.97 Aligned_cols=80 Identities=16% Similarity=0.226 Sum_probs=67.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCc----c--ccccccC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAK----Y--DQFFLGD 74 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p----~--EQFF~tD 74 (1159)
|++=|.|.|.|||-.=|++|..|..|.|+|.-++|.|++-.|++|.++=+|.+|.+-.....-... | --|||..
T Consensus 25 LtslHsG~IQlWDYRM~tli~rFdeHdGpVRgv~FH~~qplFVSGGDDykIkVWnYk~rrclftL~GHlDYVRt~~FHhe 104 (1202)
T KOG0292|consen 25 LTSLHSGVIQLWDYRMGTLIDRFDEHDGPVRGVDFHPTQPLFVSGGDDYKIKVWNYKTRRCLFTLLGHLDYVRTVFFHHE 104 (1202)
T ss_pred EEeecCceeeeehhhhhhHHhhhhccCCccceeeecCCCCeEEecCCccEEEEEecccceehhhhccccceeEEeeccCC
Confidence 466789999999999999999999999999999999999999999999999999987765322111 2 2499999
Q ss_pred Ccccee
Q 001090 75 YRPLVQ 80 (1159)
Q Consensus 75 YrpLir 80 (1159)
|-.+|-
T Consensus 105 yPWIlS 110 (1202)
T KOG0292|consen 105 YPWILS 110 (1202)
T ss_pred CceEEE
Confidence 965554
No 168
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=95.45 E-value=0.012 Score=71.20 Aligned_cols=58 Identities=16% Similarity=0.115 Sum_probs=52.9
Q ss_pred CCcccCCcEEEEEccCCeeeEE--eecCcceeeeeeeCCCC-CEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRI--YEISRFRLVDGKFSPDG-ASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~--f~~g~~~i~D~~FSPDG-t~~a~tD~~G~l~Ifg~G~ 58 (1159)
+||+-|.+|++|||.+++++.. |.||.+.|.+.||+|+- ..||+|-.+|.|+||++-+
T Consensus 116 VsasGDsT~r~Wdvk~s~l~G~~~~~GH~~SvkS~cf~~~n~~vF~tGgRDg~illWD~R~ 176 (720)
T KOG0321|consen 116 VSASGDSTIRPWDVKTSRLVGGRLNLGHTGSVKSECFMPTNPAVFCTGGRDGEILLWDCRC 176 (720)
T ss_pred EEccCCceeeeeeeccceeecceeecccccccchhhhccCCCcceeeccCCCcEEEEEEec
Confidence 5789999999999999999887 99999999999999976 5688999999999999844
No 169
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=95.41 E-value=0.027 Score=62.62 Aligned_cols=54 Identities=17% Similarity=0.300 Sum_probs=38.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
||+.+|++|+|||+.+|+.++.++. ...|-++-.|+||..|..++. +.|.+|+.
T Consensus 159 LSSadd~tVRLWD~rTgt~v~sL~~-~s~VtSlEvs~dG~ilTia~g-ssV~Fwda 212 (334)
T KOG0278|consen 159 LSSADDKTVRLWDHRTGTEVQSLEF-NSPVTSLEVSQDGRILTIAYG-SSVKFWDA 212 (334)
T ss_pred EeeccCCceEEEEeccCcEEEEEec-CCCCcceeeccCCCEEEEecC-ceeEEecc
Confidence 5889999999999999999987653 224556666777776655543 34555553
No 170
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.39 E-value=0.021 Score=69.39 Aligned_cols=58 Identities=19% Similarity=0.275 Sum_probs=54.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+|||.||+|+|||+-.-.||.+|.-|.-.||.+.-+|+=++|.+||.+|.|+.-++-.
T Consensus 229 ls~sSDgtIrlWdLgqQrCl~T~~vH~e~VWaL~~~~sf~~vYsG~rd~~i~~Tdl~n 286 (735)
T KOG0308|consen 229 LSASSDGTIRLWDLGQQRCLATYIVHKEGVWALQSSPSFTHVYSGGRDGNIYRTDLRN 286 (735)
T ss_pred eecCCCceEEeeeccccceeeeEEeccCceEEEeeCCCcceEEecCCCCcEEecccCC
Confidence 6899999999999999999999999988899999999999999999999999988765
No 171
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=95.38 E-value=0.016 Score=66.76 Aligned_cols=212 Identities=17% Similarity=0.171 Sum_probs=112.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC---------CCc--cCCcccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG---------ESQ--KDAKYDQ 69 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g---------~~~--~~~p~EQ 69 (1159)
+||++||.|+|||+.+-+|++.|..|.|.|..+++.- +.|.++.++-+|..|++... .-+ .....+-
T Consensus 83 aSGs~DG~VkiWnlsqR~~~~~f~AH~G~V~Gi~v~~--~~~~tvgdDKtvK~wk~~~~p~~tilg~s~~~gIdh~~~~~ 160 (433)
T KOG0268|consen 83 ASGSCDGEVKIWNLSQRECIRTFKAHEGLVRGICVTQ--TSFFTVGDDKTVKQWKIDGPPLHTILGKSVYLGIDHHRKNS 160 (433)
T ss_pred hccccCceEEEEehhhhhhhheeecccCceeeEEecc--cceEEecCCcceeeeeccCCcceeeeccccccccccccccc
Confidence 6899999999999999999999999999999999988 55665556667777764321 000 0000111
Q ss_pred ccccCCc-cceecCCCC-ccccccCcCCcccC--CCCCcccC------CCC------CCCchhHHhhhhhhhcccCcccc
Q 001090 70 FFLGDYR-PLVQDTYGN-VLDQETQLAPHRRN--LQDPLCDS------AMI------PYPEPYQTMYQQRRLGALGIEWR 133 (1159)
Q Consensus 70 FF~tDYr-pLirD~~g~-vlDe~Tq~~PHL~~--~~~~L~D~------~~~------PYp~~~Q~~~~~rrl~~l~~e~~ 133 (1159)
-|.|=.. -=|.|.+.. +|-.-+=-++.+-. .-|.+.+. ++. =-..|++..+.+=| +.+|-|.
T Consensus 161 ~FaTcGe~i~IWD~~R~~Pv~smswG~Dti~svkfNpvETsILas~~sDrsIvLyD~R~~~Pl~KVi~~mR--TN~Iswn 238 (433)
T KOG0268|consen 161 VFATCGEQIDIWDEQRDNPVSSMSWGADSISSVKFNPVETSILASCASDRSIVLYDLRQASPLKKVILTMR--TNTICWN 238 (433)
T ss_pred cccccCceeeecccccCCccceeecCCCceeEEecCCCcchheeeeccCCceEEEecccCCccceeeeecc--ccceecC
Confidence 2222221 112232111 11111111121100 00111111 111 11234666666554 4488899
Q ss_pred CCccccccC-CCCcCCCccccccccCcccccCCCcccccc------ccCCcc-cccccCCCC---Cccccccccccc---
Q 001090 134 PSSLKLAVG-PDFSLDQGYQLQPLADLDVMIDPLPEFIDV------MDWEPE-NEVQSDDND---SEYNVAEEYSTE--- 199 (1159)
Q Consensus 134 p~~~~~~vg-~~~~~~~~~~~~p~~~~~~~~~~~~e~~d~------~~wepe-~ev~sdd~d---seyn~~ee~ss~--- 199 (1159)
|--.+|.+| .|.++- . -|.--|-+||--..|. ++.-|- .|++|---| --|++.+-.|-+
T Consensus 239 PeafnF~~a~ED~nlY-~------~DmR~l~~p~~v~~dhvsAV~dVdfsptG~EfvsgsyDksIRIf~~~~~~SRdiYh 311 (433)
T KOG0268|consen 239 PEAFNFVAANEDHNLY-T------YDMRNLSRPLNVHKDHVSAVMDVDFSPTGQEFVSGSYDKSIRIFPVNHGHSRDIYH 311 (433)
T ss_pred ccccceeeccccccce-e------hhhhhhcccchhhcccceeEEEeccCCCcchhccccccceEEEeecCCCcchhhhh
Confidence 965677666 344431 1 1222233455444442 455665 677664433 357887777665
Q ss_pred ---ccccccCCCCCCCCCCcCCcCCCC
Q 001090 200 ---EKGSLSSTSSGDSECSAEDSEDGE 223 (1159)
Q Consensus 200 ---e~~~~~~~ss~~~~~s~~~s~~~~ 223 (1159)
=|-..|--=|-|+-|--..||+.+
T Consensus 312 tkRMq~V~~Vk~S~Dskyi~SGSdd~n 338 (433)
T KOG0268|consen 312 TKRMQHVFCVKYSMDSKYIISGSDDGN 338 (433)
T ss_pred HhhhheeeEEEEeccccEEEecCCCcc
Confidence 155555555666666666666655
No 172
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=95.37 E-value=0.025 Score=65.15 Aligned_cols=77 Identities=19% Similarity=0.351 Sum_probs=62.8
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeE-EEEecCCCCCccC--------CccccccccCCc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQL-YILNTGQGESQKD--------AKYDQFFLGDYR 76 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l-~Ifg~G~g~~~~~--------~p~EQFF~tDYr 76 (1159)
-|.|+|||+.+=+++..+..|.+.|-.++|||||++||++.+.|+| .+|.+-.|..... ..|-.-|+.|-.
T Consensus 152 ~GdV~l~d~~nl~~v~~I~aH~~~lAalafs~~G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~ 231 (391)
T KOG2110|consen 152 SGDVVLFDTINLQPVNTINAHKGPLAALAFSPDGTLLATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQ 231 (391)
T ss_pred CceEEEEEcccceeeeEEEecCCceeEEEECCCCCEEEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEECCCCC
Confidence 4889999999999999999999999999999999999999999995 6888866653211 124446999987
Q ss_pred cceecC
Q 001090 77 PLVQDT 82 (1159)
Q Consensus 77 pLirD~ 82 (1159)
-|.--.
T Consensus 232 ~L~~sS 237 (391)
T KOG2110|consen 232 FLAASS 237 (391)
T ss_pred eEEEec
Confidence 666543
No 173
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=95.35 E-value=0.014 Score=71.86 Aligned_cols=59 Identities=10% Similarity=0.070 Sum_probs=53.0
Q ss_pred CcccCCcEEEEEccCCe-eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGI-PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|+...|.|.+|||..=. +.+.|..|.|.|+.|-|+|++..||+|..++++.||+.+.+.
T Consensus 194 s~~dsG~lqlWDlRqp~r~~~k~~AH~GpV~c~nwhPnr~~lATGGRDK~vkiWd~t~~~ 253 (839)
T KOG0269|consen 194 SIHDSGYLQLWDLRQPDRCEKKLTAHNGPVLCLNWHPNREWLATGGRDKMVKIWDMTDSR 253 (839)
T ss_pred EecCCceEEEeeccCchhHHHHhhcccCceEEEeecCCCceeeecCCCccEEEEeccCCC
Confidence 45556899999998765 888899999999999999999999999999999999998754
No 174
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=95.18 E-value=0.017 Score=64.38 Aligned_cols=68 Identities=18% Similarity=0.191 Sum_probs=62.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYD 68 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~E 68 (1159)
++||.|..|-|||+..=.|+|+|....-+|..+-||=||++||++.++=-|=|-.+.+|......|.+
T Consensus 205 A~GsADAlvSLWD~~ELiC~R~isRldwpVRTlSFS~dg~~lASaSEDh~IDIA~vetGd~~~eI~~~ 272 (313)
T KOG1407|consen 205 ATGSADALVSLWDVDELICERCISRLDWPVRTLSFSHDGRMLASASEDHFIDIAEVETGDRVWEIPCE 272 (313)
T ss_pred eeccccceeeccChhHhhhheeeccccCceEEEEeccCcceeeccCccceEEeEecccCCeEEEeecc
Confidence 57999999999999999999999999999999999999999999999999999999888866655544
No 175
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=95.17 E-value=0.012 Score=72.99 Aligned_cols=70 Identities=16% Similarity=0.143 Sum_probs=62.2
Q ss_pred cccccCcccccCCCCCCHHHHHHHHHcCCC--------------C----------CHHH------HHHHHHHHHHhhhhc
Q 001090 1075 QVSQKTNFTNRFPVPLSLDVIQSRLENNYY--------------R----------GLEA------VKHDIAVMLSNAESY 1124 (1159)
Q Consensus 1075 ~~~e~PdY~~iIk~PMDLsTIr~RLeN~~Y--------------R----------slea------f~~DVrLI~sNA~tF 1124 (1159)
+...+|-|+.+..+|.+|+|++.+|.+++| + ++.+ ..|.+.+|..||+.|
T Consensus 91 d~~~pp~~~~~a~vpTlLgtg~qsLl~r~k~~~~~~~~~s~~~~~h~~~~~~~~~sl~s~~~~~~~h~~a~~i~~at~~~ 170 (1113)
T KOG0644|consen 91 DKPIPPRYCTIARVPTLLGTGRQSLLRRAKDIRHTVWKGSAFRWPHMHADQVRGVSLRSIGGGFEIHHRAPSIGCATFSI 170 (1113)
T ss_pred CCCCCcceeeeecccchhcchhHHHHhhhhhcccccccccccccccccCcccccceeccCCcchhhhhcCcccccceeee
Confidence 345678899999999999999999999999 4 5555 899999999999999
Q ss_pred cCCCCHHHHHHHHHHHHHHHH
Q 001090 1125 FGRNTDLSTKIKRLSDLVTRT 1145 (1159)
Q Consensus 1125 N~p~S~I~k~Ak~L~d~F~~~ 1145 (1159)
+.|++ +|+.++.+.+++..+
T Consensus 171 akPgt-mvqkmk~ikrLlgH~ 190 (1113)
T KOG0644|consen 171 AKPGT-MVQKMKNIKRLLGHR 190 (1113)
T ss_pred cCcHH-HHHHHHHHHHHHhhh
Confidence 99999 999999999988665
No 176
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=95.05 E-value=0.046 Score=63.11 Aligned_cols=91 Identities=19% Similarity=0.136 Sum_probs=63.3
Q ss_pred ccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCc-----cccccccCCccc
Q 001090 4 GYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAK-----YDQFFLGDYRPL 78 (1159)
Q Consensus 4 G~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p-----~EQFF~tDYrpL 78 (1159)
.-+|+|.|.|..+++.++.|..+...-...+|||||+++.++..+|.|.++|+.+.+..+..+ .---|+.|.+-|
T Consensus 13 ~~~~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~~~~i~~s~DG~~~ 92 (369)
T PF02239_consen 13 RGSGSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGNPRGIAVSPDGKYV 92 (369)
T ss_dssp GGGTEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSEEEEEEE--TTTEE
T ss_pred cCCCEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCCcceEEEcCCCCEE
Confidence 457999999999999999998765555667899999999999999999999999987554433 333678888766
Q ss_pred ee---cCCCC-ccccccCcC
Q 001090 79 VQ---DTYGN-VLDQETQLA 94 (1159)
Q Consensus 79 ir---D~~g~-vlDe~Tq~~ 94 (1159)
+- ..+.. |+|.+|..+
T Consensus 93 ~v~n~~~~~v~v~D~~tle~ 112 (369)
T PF02239_consen 93 YVANYEPGTVSVIDAETLEP 112 (369)
T ss_dssp EEEEEETTEEEEEETTT--E
T ss_pred EEEecCCCceeEeccccccc
Confidence 63 22333 668777544
No 177
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=95.01 E-value=0.021 Score=63.84 Aligned_cols=62 Identities=19% Similarity=0.190 Sum_probs=52.0
Q ss_pred CcccCCcEEEEEccCCee-eEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCcc
Q 001090 2 SAGYDGKTIVWDIWEGIP-IRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQK 63 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~-Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~ 63 (1159)
+|..||.|.+||+.+|.. +.+-..|..+|-|++||||+++|+++..+-+-.+|++-+=+.+|
T Consensus 164 ~Ghe~G~is~~da~~g~~~v~s~~~h~~~Ind~q~s~d~T~FiT~s~Dttakl~D~~tl~v~K 226 (327)
T KOG0643|consen 164 AGHEDGSISIYDARTGKELVDSDEEHSSKINDLQFSRDRTYFITGSKDTTAKLVDVRTLEVLK 226 (327)
T ss_pred EecCCCcEEEEEcccCceeeechhhhccccccccccCCcceEEecccCccceeeeccceeeEE
Confidence 577899999999999964 44455677799999999999999999999999999987654333
No 178
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=94.93 E-value=0.04 Score=66.06 Aligned_cols=54 Identities=17% Similarity=0.143 Sum_probs=50.6
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
++.|+||+.-+=.+++.+.+|...|+.+++||||+.||++..+.+|.+|.+...
T Consensus 410 ~n~i~lw~~ps~~~~~~l~gH~~RVl~la~SPdg~~i~t~a~DETlrfw~~f~~ 463 (484)
T KOG0305|consen 410 ENQITLWKYPSMKLVAELLGHTSRVLYLALSPDGETIVTGAADETLRFWNLFDE 463 (484)
T ss_pred CCcEEEEeccccceeeeecCCcceeEEEEECCCCCEEEEecccCcEEeccccCC
Confidence 568999999999999999999999999999999999999999999999997653
No 179
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=94.88 E-value=0.028 Score=64.77 Aligned_cols=80 Identities=19% Similarity=0.299 Sum_probs=61.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEe------------------------------------------ecCcceeeeeeeCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIY------------------------------------------EISRFRLVDGKFSPD 38 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f------------------------------------------~~g~~~i~D~~FSPD 38 (1159)
+||+.|+.|+|+|+.++.++++. .+|-.+|.|+.|||-
T Consensus 204 as~~sDrsIvLyD~R~~~Pl~KVi~~mRTN~IswnPeafnF~~a~ED~nlY~~DmR~l~~p~~v~~dhvsAV~dVdfspt 283 (433)
T KOG0268|consen 204 ASCASDRSIVLYDLRQASPLKKVILTMRTNTICWNPEAFNFVAANEDHNLYTYDMRNLSRPLNVHKDHVSAVMDVDFSPT 283 (433)
T ss_pred eeeccCCceEEEecccCCccceeeeeccccceecCccccceeeccccccceehhhhhhcccchhhcccceeEEEeccCCC
Confidence 47889999999999999988752 134469999999999
Q ss_pred CCEEEEecCceeEEEEecCCCCCcc--CCcccc-----ccccCCcccee
Q 001090 39 GASIILSDDVGQLYILNTGQGESQK--DAKYDQ-----FFLGDYRPLVQ 80 (1159)
Q Consensus 39 Gt~~a~tD~~G~l~Ifg~G~g~~~~--~~p~EQ-----FF~tDYrpLir 80 (1159)
|+-||+|.-+-+|.||++--+.+-. .++--| -|+-|..-++-
T Consensus 284 G~EfvsgsyDksIRIf~~~~~~SRdiYhtkRMq~V~~Vk~S~Dskyi~S 332 (433)
T KOG0268|consen 284 GQEFVSGSYDKSIRIFPVNHGHSRDIYHTKRMQHVFCVKYSMDSKYIIS 332 (433)
T ss_pred cchhccccccceEEEeecCCCcchhhhhHhhhheeeEEEEeccccEEEe
Confidence 9999999999999999997665432 133223 36666655554
No 180
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=94.65 E-value=0.05 Score=61.99 Aligned_cols=60 Identities=18% Similarity=0.258 Sum_probs=56.4
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+||+.||+|++||+-.=++++.|-+|.++|-|++..|-|..-.+.-.++.|.+|++=.|.
T Consensus 101 lS~sdDG~i~iw~~~~W~~~~slK~H~~~Vt~lsiHPS~KLALsVg~D~~lr~WNLV~Gr 160 (362)
T KOG0294|consen 101 LSGSDDGHIIIWRVGSWELLKSLKAHKGQVTDLSIHPSGKLALSVGGDQVLRTWNLVRGR 160 (362)
T ss_pred eeecCCCcEEEEEcCCeEEeeeecccccccceeEecCCCceEEEEcCCceeeeehhhcCc
Confidence 589999999999999999999999999999999999999999999999999999995554
No 181
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=94.56 E-value=0.047 Score=63.30 Aligned_cols=54 Identities=19% Similarity=0.368 Sum_probs=49.9
Q ss_pred CCcccCCcEEEEEccCCe-eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 1 MSAGYDGKTIVWDIWEGI-PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
+||+|||+++|||+.+-. +|-.+.+|.-+|+++.|. +|-.||+|..+-+|.||-
T Consensus 362 ~S~S~D~t~klWDvRS~k~plydI~~h~DKvl~vdW~-~~~~IvSGGaD~~l~i~~ 416 (423)
T KOG0313|consen 362 VSGSYDNTVKLWDVRSTKAPLYDIAGHNDKVLSVDWN-EGGLIVSGGADNKLRIFK 416 (423)
T ss_pred EEEecCCeEEEEEeccCCCcceeeccCCceEEEEecc-CCceEEeccCcceEEEec
Confidence 589999999999999999 997778899999999996 788999999999999996
No 182
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=94.36 E-value=0.076 Score=64.94 Aligned_cols=59 Identities=20% Similarity=0.299 Sum_probs=49.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEee--------cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE--------ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~--------~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
++|+-||.|++||+.+|.-+.+.. +...-||.+.|=.||+ ||+||+.|++-+|+.-.|.
T Consensus 170 ~~Gs~Dg~Iriwd~~~~~t~~~~~~~~d~l~k~~~~iVWSv~~Lrd~t-I~sgDS~G~V~FWd~~~gT 236 (691)
T KOG2048|consen 170 AGGSIDGVIRIWDVKSGQTLHIITMQLDRLSKREPTIVWSVLFLRDST-IASGDSAGTVTFWDSIFGT 236 (691)
T ss_pred EecccCceEEEEEcCCCceEEEeeecccccccCCceEEEEEEEeecCc-EEEecCCceEEEEcccCcc
Confidence 468999999999999999887443 2367799999999986 7899999999999986664
No 183
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=94.28 E-value=0.035 Score=65.48 Aligned_cols=54 Identities=22% Similarity=0.391 Sum_probs=51.0
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
--|+|.||-+.+|..|.++..|=-.|-.++||-||.+|+++..+|.+++|.+=.
T Consensus 101 i~g~lYlWelssG~LL~v~~aHYQ~ITcL~fs~dgs~iiTgskDg~V~vW~l~~ 154 (476)
T KOG0646|consen 101 ISGNLYLWELSSGILLNVLSAHYQSITCLKFSDDGSHIITGSKDGAVLVWLLTD 154 (476)
T ss_pred ccCcEEEEEeccccHHHHHHhhccceeEEEEeCCCcEEEecCCCccEEEEEEEe
Confidence 789999999999999999988888999999999999999999999999998744
No 184
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=94.27 E-value=0.027 Score=68.24 Aligned_cols=58 Identities=22% Similarity=0.202 Sum_probs=54.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+||+.|-+|+|||.+|-.||+++.||-.-|--++|.|-=-.||+|.++|++.||.-.+
T Consensus 201 IsgaDD~tiKvWDyQtk~CV~TLeGHt~Nvs~v~fhp~lpiiisgsEDGTvriWhs~T 258 (794)
T KOG0276|consen 201 ISGADDLTIKVWDYQTKSCVQTLEGHTNNVSFVFFHPELPIIISGSEDGTVRIWNSKT 258 (794)
T ss_pred EecCCCceEEEeecchHHHHHHhhcccccceEEEecCCCcEEEEecCCccEEEecCcc
Confidence 4899999999999999999999999999999999999999999999999999997443
No 185
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=94.20 E-value=0.037 Score=67.91 Aligned_cols=56 Identities=14% Similarity=0.092 Sum_probs=52.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
+++..|.+|.+.|..+|+|+....||.-.|.-++|++|=+|||++..+|.|+||-+
T Consensus 657 atScsdktl~~~Df~sgEcvA~m~GHsE~VTG~kF~nDCkHlISvsgDgCIFvW~l 712 (1080)
T KOG1408|consen 657 ATSCSDKTLCFVDFVSGECVAQMTGHSEAVTGVKFLNDCKHLISVSGDGCIFVWKL 712 (1080)
T ss_pred EEeecCCceEEEEeccchhhhhhcCcchheeeeeecccchhheeecCCceEEEEEC
Confidence 35678999999999999999999999999999999999999999999999999974
No 186
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=94.14 E-value=0.074 Score=61.46 Aligned_cols=61 Identities=21% Similarity=0.299 Sum_probs=48.0
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec-CceeEEEEecCCCCCccC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD-DVGQLYILNTGQGESQKD 64 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~~G~l~Ifg~G~g~~~~~ 64 (1159)
++.||.|-+||+.+++.++.+..|.. -..+++||||++++++. ..|++.||+..+.+..+.
T Consensus 54 ~~rdg~vsviD~~~~~~v~~i~~G~~-~~~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~ 115 (369)
T PF02239_consen 54 ANRDGTVSVIDLATGKVVATIKVGGN-PRGIAVSPDGKYVYVANYEPGTVSVIDAETLEPVKT 115 (369)
T ss_dssp EETTSEEEEEETTSSSEEEEEE-SSE-EEEEEE--TTTEEEEEEEETTEEEEEETTT--EEEE
T ss_pred EcCCCeEEEEECCcccEEEEEecCCC-cceEEEcCCCCEEEEEecCCCceeEeccccccceee
Confidence 46799999999999999999987765 47799999999999887 689999999877665443
No 187
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=94.10 E-value=0.064 Score=62.03 Aligned_cols=54 Identities=19% Similarity=0.273 Sum_probs=47.2
Q ss_pred Cccc-CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 2 SAGY-DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 2 SAG~-DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
||++ |..|+|||+.+|.++..--.+.+.+.=++|||||..|-++.-++.+.+|.
T Consensus 212 tAS~gsssi~iWdpdtg~~~pL~~~glgg~slLkwSPdgd~lfaAt~davfrlw~ 266 (445)
T KOG2139|consen 212 TASFGSSSIMIWDPDTGQKIPLIPKGLGGFSLLKWSPDGDVLFAATCDAVFRLWQ 266 (445)
T ss_pred ecccCcceEEEEcCCCCCcccccccCCCceeeEEEcCCCCEEEEecccceeeeeh
Confidence 4554 57899999999998887767778888899999999999999999999993
No 188
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=94.09 E-value=0.055 Score=64.41 Aligned_cols=68 Identities=19% Similarity=0.198 Sum_probs=56.2
Q ss_pred CcccCCcEEEEEccCCeeeEEee-cCcceeeeeeeCCCC-CEEEEecCceeEEEEecCCCCCccCCcccc
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYE-ISRFRLVDGKFSPDG-ASIILSDDVGQLYILNTGQGESQKDAKYDQ 69 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~-~g~~~i~D~~FSPDG-t~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQ 69 (1159)
+||++|.|-|||+..-.++..|. .|..+-.++||||-- ..||+..-+-+|+|||+|+..+-+..-|+-
T Consensus 182 ~asd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~~s~~~l~y~~ 251 (673)
T KOG4378|consen 182 IASDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQASTDRLTYSH 251 (673)
T ss_pred eeccCCeEEEEeccCCCcccchhhhccCCcCcceecCCccceEEEecccceEEEeecccccccceeeecC
Confidence 79999999999999888988776 677889999999965 566788899999999999876555544544
No 189
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=94.08 E-value=0.049 Score=69.41 Aligned_cols=57 Identities=14% Similarity=0.172 Sum_probs=50.0
Q ss_pred CcccCCcEEEEEccC---Ce---eeEEeecCcceeeeeeeCCCCC-EEEEecCceeEEEEecCC
Q 001090 2 SAGYDGKTIVWDIWE---GI---PIRIYEISRFRLVDGKFSPDGA-SIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~---G~---~Ir~f~~g~~~i~D~~FSPDGt-~~a~tD~~G~l~Ifg~G~ 58 (1159)
.|+.||+|.|||... |. .|+.+.-|.|.|..+.|.|.+. .||+|++.|.|+|||+-.
T Consensus 85 GG~edG~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDfN~~q~nlLASGa~~geI~iWDlnn 148 (1049)
T KOG0307|consen 85 GGLEDGNIVLYDPASIIANASEEVLATKSKHTGPVLGLDFNPFQGNLLASGADDGEILIWDLNK 148 (1049)
T ss_pred ccccCCceEEecchhhccCcchHHHhhhcccCCceeeeeccccCCceeeccCCCCcEEEeccCC
Confidence 578999999999876 44 4556678999999999999998 999999999999999865
No 190
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=94.07 E-value=0.082 Score=62.52 Aligned_cols=104 Identities=16% Similarity=0.279 Sum_probs=73.8
Q ss_pred CCcccCCcEEEEEccCCe--eeEEeecCcceeeeeeeCCCCC-EEEEecCceeEEEEecCCCCCccCCc----ccc---c
Q 001090 1 MSAGYDGKTIVWDIWEGI--PIRIYEISRFRLVDGKFSPDGA-SIILSDDVGQLYILNTGQGESQKDAK----YDQ---F 70 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~--~Ir~f~~g~~~i~D~~FSPDGt-~~a~tD~~G~l~Ifg~G~g~~~~~~p----~EQ---F 70 (1159)
|+||+||+++|+-|..-+ .|....-..++|..++|+|+|+ .++++...--+++|++-.++..+-.| .++ .
T Consensus 229 lvaG~d~~lrifqvDGk~N~~lqS~~l~~fPi~~a~f~p~G~~~i~~s~rrky~ysyDle~ak~~k~~~~~g~e~~~~e~ 308 (514)
T KOG2055|consen 229 LVAGLDGTLRIFQVDGKVNPKLQSIHLEKFPIQKAEFAPNGHSVIFTSGRRKYLYSYDLETAKVTKLKPPYGVEEKSMER 308 (514)
T ss_pred EEecCCCcEEEEEecCccChhheeeeeccCccceeeecCCCceEEEecccceEEEEeeccccccccccCCCCcccchhhe
Confidence 589999999999886444 5555556789999999999999 77788888899999998887443322 233 3
Q ss_pred c---ccCCccceecCCCC--ccccccCcCCcccCCCCCc
Q 001090 71 F---LGDYRPLVQDTYGN--VLDQETQLAPHRRNLQDPL 104 (1159)
Q Consensus 71 F---~tDYrpLirD~~g~--vlDe~Tq~~PHL~~~~~~L 104 (1159)
| |-+--.+|.--+|| +|-..|..--+-+.|+|.+
T Consensus 309 FeVShd~~fia~~G~~G~I~lLhakT~eli~s~KieG~v 347 (514)
T KOG2055|consen 309 FEVSHDSNFIAIAGNNGHIHLLHAKTKELITSFKIEGVV 347 (514)
T ss_pred eEecCCCCeEEEcccCceEEeehhhhhhhhheeeeccEE
Confidence 4 33333444445565 7777777777666666633
No 191
>KOG4328 consensus WD40 protein [Function unknown]
Probab=94.02 E-value=0.053 Score=63.95 Aligned_cols=56 Identities=16% Similarity=0.291 Sum_probs=47.3
Q ss_pred CCcccCCcEEEEEccCCeeeE----EeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 1 MSAGYDGKTIVWDIWEGIPIR----IYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir----~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
+|||-|++++|||+.+=..=+ .-..|.-+|-.+.|||+|-.|++++-+-.|.||+-
T Consensus 339 aT~s~D~T~kIWD~R~l~~K~sp~lst~~HrrsV~sAyFSPs~gtl~TT~~D~~IRv~ds 398 (498)
T KOG4328|consen 339 ATASLDQTAKIWDLRQLRGKASPFLSTLPHRRSVNSAYFSPSGGTLLTTCQDNEIRVFDS 398 (498)
T ss_pred eecccCcceeeeehhhhcCCCCcceecccccceeeeeEEcCCCCceEeeccCCceEEeec
Confidence 689999999999997533222 22368889999999999999999999999999985
No 192
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=93.91 E-value=0.079 Score=62.21 Aligned_cols=58 Identities=17% Similarity=0.251 Sum_probs=48.1
Q ss_pred CCcccCCcEEEEEccCC--eeeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEG--IPIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G--~~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
.|+|.||.+.|||+.++ ++-+.-..|.+.|..++|.| ++..||+++.+|+|.+||+-.
T Consensus 244 ~sv~dd~~L~iwD~R~~~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S~D~tV~LwDlRn 304 (422)
T KOG0264|consen 244 GSVGDDGKLMIWDTRSNTSKPSHSVKAHSAEVNCVAFNPFNEFILATGSADKTVALWDLRN 304 (422)
T ss_pred eeecCCCeEEEEEcCCCCCCCcccccccCCceeEEEeCCCCCceEEeccCCCcEEEeechh
Confidence 37899999999999954 44445567889999999999 566677888899999999854
No 193
>COG5076 Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription]
Probab=93.85 E-value=0.019 Score=66.06 Aligned_cols=76 Identities=24% Similarity=0.376 Sum_probs=70.8
Q ss_pred ccccccCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcC
Q 001090 1074 KQVSQKTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSL 1149 (1159)
Q Consensus 1074 ~~~~e~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL 1149 (1159)
......|+|+++|..+|++.|.+.+|.+.+|+...++..|..++++||..||++...+++.+..+.+.+...+...
T Consensus 288 ~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (371)
T COG5076 288 VSDEEVPDYYKDIRDPMDLSTKELKLRNNYYRPEETFVRDAKLFFDNCVMYNGEVTDYYKNANVLEDFVIKKTRLI 363 (371)
T ss_pred CCcccccchhhhhhcccccccchhhhhcccCCCccccccccchhhhcccccchhhhhhhhhccchhhhHhhhhhhh
Confidence 3456789999999999999999999999999999999999999999999999999999999999999998877654
No 194
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=93.74 E-value=0.1 Score=60.58 Aligned_cols=59 Identities=17% Similarity=0.293 Sum_probs=49.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCC---CEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDG---ASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDG---t~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|||+|||+++|||. .|++++.+.||.++|.+++|--=- +.|+++..+-+|.+|-+..++
T Consensus 119 ltgsYDg~~riWd~-~Gk~~~~~~Ght~~ik~v~~v~~n~~~~~fvsas~Dqtl~Lw~~~~~~ 180 (423)
T KOG0313|consen 119 LTGSYDGTSRIWDL-KGKSIKTIVGHTGPIKSVAWVIKNSSSCLFVSASMDQTLRLWKWNVGE 180 (423)
T ss_pred EEeecCCeeEEEec-CCceEEEEecCCcceeeeEEEecCCccceEEEecCCceEEEEEecCch
Confidence 68999999999998 799999999999999998884322 358999888888888776654
No 195
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=93.73 E-value=0.11 Score=64.09 Aligned_cols=80 Identities=20% Similarity=0.272 Sum_probs=69.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecC---cceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCc--cc----ccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEIS---RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAK--YD----QFF 71 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g---~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p--~E----QFF 71 (1159)
++++.|.+|+|+||++|++.|.|-|. .|.+.-+..-|-|-+||++=++-+|.||||-+|+...++. -| .-|
T Consensus 612 ~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~lDPSgiY~atScsdktl~~~Df~sgEcvA~m~GHsE~VTG~kF 691 (1080)
T KOG1408|consen 612 VTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVILDPSGIYLATSCSDKTLCFVDFVSGECVAQMTGHSEAVTGVKF 691 (1080)
T ss_pred EEEecccceEEEeccccceeeeecccccCCCceEEEEECCCccEEEEeecCCceEEEEeccchhhhhhcCcchheeeeee
Confidence 47899999999999999999999863 4889999999999999999999999999999998665543 22 369
Q ss_pred ccCCcccee
Q 001090 72 LGDYRPLVQ 80 (1159)
Q Consensus 72 ~tDYrpLir 80 (1159)
.+|-+-||-
T Consensus 692 ~nDCkHlIS 700 (1080)
T KOG1408|consen 692 LNDCKHLIS 700 (1080)
T ss_pred cccchhhee
Confidence 999998887
No 196
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=93.65 E-value=0.14 Score=57.00 Aligned_cols=60 Identities=10% Similarity=0.104 Sum_probs=54.6
Q ss_pred CCcccCCcEEEEEccCCe---eeEEeecCcceeeeeeeCC--CCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGI---PIRIYEISRFRLVDGKFSP--DGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~---~Ir~f~~g~~~i~D~~FSP--DGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|+|.||+|+|..+.... .|..+.||.|+|+-++|.- -|+.||++.-+|++.||--+.|.
T Consensus 27 ATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wahPk~G~iLAScsYDgkVIiWke~~g~ 91 (299)
T KOG1332|consen 27 ATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWAHPKFGTILASCSYDGKVIIWKEENGR 91 (299)
T ss_pred eeecCCccEEEEEEcCCCCceeeeEecCCCCCeeEEeecccccCcEeeEeecCceEEEEecCCCc
Confidence 589999999999999876 4667789999999999998 99999999999999999988774
No 197
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=93.61 E-value=0.081 Score=63.04 Aligned_cols=59 Identities=22% Similarity=0.295 Sum_probs=47.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|-|||.+|+|+|+..-.....+. -.-++-.++|+|+|++||+|.+.|.|+.||+-.-+
T Consensus 225 vsVG~Dkki~~yD~~s~~s~~~l~-y~~Plstvaf~~~G~~L~aG~s~G~~i~YD~R~~k 283 (673)
T KOG4378|consen 225 VSVGYDKKINIYDIRSQASTDRLT-YSHPLSTVAFSECGTYLCAGNSKGELIAYDMRSTK 283 (673)
T ss_pred EEecccceEEEeecccccccceee-ecCCcceeeecCCceEEEeecCCceEEEEecccCC
Confidence 477999999999998655544433 23467788999999999999999999999986543
No 198
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=93.58 E-value=0.11 Score=61.48 Aligned_cols=57 Identities=16% Similarity=0.267 Sum_probs=49.8
Q ss_pred CcccCCcEEEEEccCCeeeEEee-cCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYE-ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~-~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
..|-+|.|.+|||.+-.|++.|. .|..-=..+|.|++|.+||+|...|-+.||+.-+
T Consensus 361 ~~~~~GeV~v~nl~~~~~~~rf~D~G~v~gts~~~S~ng~ylA~GS~~GiVNIYd~~s 418 (514)
T KOG2055|consen 361 ASGGTGEVYVWNLRQNSCLHRFVDDGSVHGTSLCISLNGSYLATGSDSGIVNIYDGNS 418 (514)
T ss_pred EEcCCceEEEEecCCcceEEEEeecCccceeeeeecCCCceEEeccCcceEEEeccch
Confidence 35678999999999999999997 5555667889999999999999999999999654
No 199
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=93.48 E-value=0.039 Score=66.05 Aligned_cols=55 Identities=16% Similarity=0.141 Sum_probs=44.2
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
+-|.||.++|+|..+=+.+-++.-=-+.++.++|||||++||+|-++-=+.||.+
T Consensus 307 ~VSqDGfLRvF~fdt~eLlg~mkSYFGGLLCvcWSPDGKyIvtGGEDDLVtVwSf 361 (636)
T KOG2394|consen 307 TVSQDGFLRIFDFDTQELLGVMKSYFGGLLCVCWSPDGKYIVTGGEDDLVTVWSF 361 (636)
T ss_pred EEecCceEEEeeccHHHHHHHHHhhccceEEEEEcCCccEEEecCCcceEEEEEe
Confidence 4578899988888776655443333467999999999999999999988888887
No 200
>PRK05137 tolB translocation protein TolB; Provisional
Probab=93.48 E-value=0.18 Score=58.66 Aligned_cols=73 Identities=19% Similarity=0.302 Sum_probs=53.4
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec---CceeEEEEecCCCCCccCC--c---cccccccCCcc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD---DVGQLYILNTGQGESQKDA--K---YDQFFLGDYRP 77 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD---~~G~l~Ifg~G~g~~~~~~--p---~EQFF~tDYrp 77 (1159)
-.+|.+||. .|..++.+..+...+...+|||||+.||.+. ..++|+++++..+.....+ + ..--|+.|.+-
T Consensus 181 ~~~l~~~d~-dg~~~~~lt~~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~ 259 (435)
T PRK05137 181 IKRLAIMDQ-DGANVRYLTDGSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRELVGNFPGMTFAPRFSPDGRK 259 (435)
T ss_pred ceEEEEECC-CCCCcEEEecCCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEEEeecCCCcccCcEECCCCCE
Confidence 358999998 5666677888888999999999999999764 3478999999777532211 1 11247777776
Q ss_pred ce
Q 001090 78 LV 79 (1159)
Q Consensus 78 Li 79 (1159)
|+
T Consensus 260 la 261 (435)
T PRK05137 260 VV 261 (435)
T ss_pred EE
Confidence 54
No 201
>PRK05137 tolB translocation protein TolB; Provisional
Probab=93.43 E-value=0.18 Score=58.54 Aligned_cols=74 Identities=15% Similarity=0.211 Sum_probs=49.2
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEE-ecCc--eeEEEEecCCCCCccCCc-----cccccccCCcc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIIL-SDDV--GQLYILNTGQGESQKDAK-----YDQFFLGDYRP 77 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~-tD~~--G~l~Ifg~G~g~~~~~~p-----~EQFF~tDYrp 77 (1159)
+.+|.+||+.+|... .+....+.+...+|||||.+||. .+.. .+|+++++..+....-+. ..--|+.|.+-
T Consensus 225 ~~~i~~~dl~~g~~~-~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~~~~~~~~~~~spDG~~ 303 (435)
T PRK05137 225 RPRVYLLDLETGQRE-LVGNFPGMTFAPRFSPDGRKVVMSLSQGGNTDIYTMDLRSGTTTRLTDSPAIDTSPSYSPDGSQ 303 (435)
T ss_pred CCEEEEEECCCCcEE-EeecCCCcccCcEECCCCCEEEEEEecCCCceEEEEECCCCceEEccCCCCccCceeEcCCCCE
Confidence 478999999999864 34455667889999999999874 3444 458888987765322221 12245666655
Q ss_pred cee
Q 001090 78 LVQ 80 (1159)
Q Consensus 78 Lir 80 (1159)
|+.
T Consensus 304 i~f 306 (435)
T PRK05137 304 IVF 306 (435)
T ss_pred EEE
Confidence 543
No 202
>cd05493 Bromo_ALL-1 Bromodomain, ALL-1 like proteins. ALL-1 is a vertebrate homologue of Drosophila trithorax and is often affected in chromosomal rearrangements that are linked to acute leukemias, such as acute lymphocytic leukemia (ALL). Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine.
Probab=93.41 E-value=0.11 Score=52.93 Aligned_cols=64 Identities=17% Similarity=0.311 Sum_probs=49.0
Q ss_pred CCCHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHHHHHHHHhcCCCC
Q 001090 1089 PLSLDVIQSRLENNYYRGLEAVKHDIAVMLSNAESYFGRNTDLSTKIKRLSDLVTRTLSSLKAP 1152 (1159)
Q Consensus 1089 PMDLsTIr~RLeN~~YRsleaf~~DVrLI~sNA~tFN~p~S~I~k~Ak~L~d~F~~~LkkL~~~ 1152 (1159)
|.||.-|++||++++|+++.+|.+||-+|+.=+..=-+...++-++-..+.-+|.+.|+++..+
T Consensus 59 p~dL~~V~kkl~~G~Y~sv~~F~~DvvkIiqa~l~~e~~~pe~~ka~s~~Ksf~ik~me~vf~W 122 (131)
T cd05493 59 PLDLEAVGKKLEAGFYTSVLDFSDDIVKIIQAALNSEGGQPEIKKANSMAKSFFIKLMESVFPW 122 (131)
T ss_pred cccHHHHHHHHhccceehHHHHHHHHHHHHHHHHhhccCCccccCcchHHHHHHHHHHHHhccc
Confidence 8999999999999999999999999999987665443333444444455667778888776543
No 203
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=93.38 E-value=0.13 Score=60.99 Aligned_cols=89 Identities=17% Similarity=0.167 Sum_probs=64.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCC-CCEEEEecCceeEEEEecCCCC--Cc---cCCcccc-cccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPD-GASIILSDDVGQLYILNTGQGE--SQ---KDAKYDQ-FFLG 73 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPD-Gt~~a~tD~~G~l~Ifg~G~g~--~~---~~~p~EQ-FF~t 73 (1159)
+||+.|+.+.+||+.++..+.-+.+|.--|..++|+|- +..|++|.-+|+|.+|++-+.. .+ -..|.|- .|..
T Consensus 127 ~s~sDd~v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~~~v~elnhg~pVe~vl~lp 206 (487)
T KOG0310|consen 127 VSGSDDKVVKYWDLSTAYVQAELSGHTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSLTSRVVELNHGCPVESVLALP 206 (487)
T ss_pred EecCCCceEEEEEcCCcEEEEEecCCcceeEeeccccCCCeEEEecCCCceEEEEEeccCCceeEEecCCCceeeEEEcC
Confidence 47999999999999999975455688889999999998 4567799999999999987652 11 1245554 3333
Q ss_pred CCccceecCCCC---ccccc
Q 001090 74 DYRPLVQDTYGN---VLDQE 90 (1159)
Q Consensus 74 DYrpLirD~~g~---vlDe~ 90 (1159)
- .-+|-.+.|+ |.|--
T Consensus 207 s-gs~iasAgGn~vkVWDl~ 225 (487)
T KOG0310|consen 207 S-GSLIASAGGNSVKVWDLT 225 (487)
T ss_pred C-CCEEEEcCCCeEEEEEec
Confidence 2 2455556666 66654
No 204
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=93.37 E-value=0.16 Score=61.18 Aligned_cols=56 Identities=16% Similarity=0.327 Sum_probs=50.7
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
.+-.|.||+-.+|.....+..+...|..++|+|||++||+|...|.+.||+....+
T Consensus 195 lg~~vylW~~~s~~v~~l~~~~~~~vtSv~ws~~G~~LavG~~~g~v~iwD~~~~k 250 (484)
T KOG0305|consen 195 LGQSVYLWSASSGSVTELCSFGEELVTSVKWSPDGSHLAVGTSDGTVQIWDVKEQK 250 (484)
T ss_pred ecceEEEEecCCCceEEeEecCCCceEEEEECCCCCEEEEeecCCeEEEEehhhcc
Confidence 34579999999999999998889999999999999999999999999999976543
No 205
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=93.29 E-value=0.14 Score=62.05 Aligned_cols=59 Identities=17% Similarity=0.281 Sum_probs=50.5
Q ss_pred CcccCCcEEEEEccCCeeeEEeec------Ccc-----eeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEI------SRF-----RLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~------g~~-----~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
.|+.||.|-+||-..-..+.++.. +++ .|..++|+-||.+||+|.+.|+++||++-...
T Consensus 192 ~Gt~~g~VEfwDpR~ksrv~~l~~~~~v~s~pg~~~~~svTal~F~d~gL~~aVGts~G~v~iyDLRa~~ 261 (703)
T KOG2321|consen 192 CGTEDGVVEFWDPRDKSRVGTLDAASSVNSHPGGDAAPSVTALKFRDDGLHVAVGTSTGSVLIYDLRASK 261 (703)
T ss_pred ecccCceEEEecchhhhhheeeecccccCCCccccccCcceEEEecCCceeEEeeccCCcEEEEEcccCC
Confidence 466799999999999888877652 233 49999999999999999999999999998765
No 206
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=93.14 E-value=0.093 Score=61.38 Aligned_cols=79 Identities=14% Similarity=0.185 Sum_probs=65.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc------cCCccccccccC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ------KDAKYDQFFLGD 74 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~------~~~p~EQFF~tD 74 (1159)
+||+.||+|.|||...++.-+++.||.-.|..+.|.|.--.||++..+--+.|||.-+|... |.+-...=|..+
T Consensus 196 ~t~SdDg~ikiWdf~~~kee~vL~GHgwdVksvdWHP~kgLiasgskDnlVKlWDprSg~cl~tlh~HKntVl~~~f~~n 275 (464)
T KOG0284|consen 196 LTCSDDGTIKIWDFRMPKEERVLRGHGWDVKSVDWHPTKGLIASGSKDNLVKLWDPRSGSCLATLHGHKNTVLAVKFNPN 275 (464)
T ss_pred EEecCCCeEEEEeccCCchhheeccCCCCcceeccCCccceeEEccCCceeEeecCCCcchhhhhhhccceEEEEEEcCC
Confidence 58999999999999999999999999999999999999999999988889999999888744 333344455555
Q ss_pred Cccce
Q 001090 75 YRPLV 79 (1159)
Q Consensus 75 YrpLi 79 (1159)
..-|+
T Consensus 276 ~N~Ll 280 (464)
T KOG0284|consen 276 GNWLL 280 (464)
T ss_pred CCeeE
Confidence 54333
No 207
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=92.90 E-value=0.16 Score=60.17 Aligned_cols=63 Identities=22% Similarity=0.236 Sum_probs=52.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEee------------------cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE------------------ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~------------------~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
+||+-||+|.=|++.+|..++-+. +|...|+.+|.||||.+||+|..+-+|.||++-+.+..
T Consensus 158 fsask~g~i~kw~v~tgk~~~~i~~~~ev~k~~~~~~k~~r~~h~keil~~avS~Dgkylatgg~d~~v~Iw~~~t~ehv 237 (479)
T KOG0299|consen 158 FSASKDGTILKWDVLTGKKDRYIIERDEVLKSHGNPLKESRKGHVKEILTLAVSSDGKYLATGGRDRHVQIWDCDTLEHV 237 (479)
T ss_pred eecCCCcceeeeehhcCcccccccccchhhhhccCCCCcccccccceeEEEEEcCCCcEEEecCCCceEEEecCcccchh
Confidence 589999999999999999763221 34467999999999999999999999999999887644
Q ss_pred c
Q 001090 63 K 63 (1159)
Q Consensus 63 ~ 63 (1159)
+
T Consensus 238 ~ 238 (479)
T KOG0299|consen 238 K 238 (479)
T ss_pred h
Confidence 3
No 208
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=92.83 E-value=0.16 Score=63.03 Aligned_cols=89 Identities=19% Similarity=0.212 Sum_probs=69.3
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec-CCCCCc-----cCCccccccccCCccc
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT-GQGESQ-----KDAKYDQFFLGDYRPL 78 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~-G~g~~~-----~~~p~EQFF~tDYrpL 78 (1159)
.||.|.|||..+++.+-.|.||..+|--++|+-+|+.||+|.-+|-|.+||+ |-.-.+ +..-.+-||.++-.-|
T Consensus 85 aDGsVqif~~~s~~~~~tfngHK~AVt~l~fd~~G~rlaSGskDt~IIvwDlV~E~Gl~rL~GHkd~iT~~~F~~~~~~l 164 (888)
T KOG0306|consen 85 ADGSVQIFSLESEEILITFNGHKAAVTTLKFDKIGTRLASGSKDTDIIVWDLVGEEGLFRLRGHKDSITQALFLNGDSFL 164 (888)
T ss_pred cCceEEeeccCCCceeeeecccccceEEEEEcccCceEeecCCCccEEEEEeccceeeEEeecchHHHhHHhccCCCeEE
Confidence 5999999999999999999999999999999999999999999999999998 221122 2233455888866666
Q ss_pred ee---cCCCCccccccCc
Q 001090 79 VQ---DTYGNVLDQETQL 93 (1159)
Q Consensus 79 ir---D~~g~vlDe~Tq~ 93 (1159)
|- |.---+-|-+||.
T Consensus 165 vS~sKDs~iK~WdL~tqh 182 (888)
T KOG0306|consen 165 VSVSKDSMIKFWDLETQH 182 (888)
T ss_pred EEeccCceEEEEecccce
Confidence 65 3333355666663
No 209
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=92.79 E-value=0.068 Score=65.13 Aligned_cols=84 Identities=23% Similarity=0.210 Sum_probs=62.0
Q ss_pred CCcccCCcEEEEEcc-CCeeeEEeecCcceeeeeeeCCCC-CEEEEecCceeEEEEecCCCC---CccC-----Cccccc
Q 001090 1 MSAGYDGKTIVWDIW-EGIPIRIYEISRFRLVDGKFSPDG-ASIILSDDVGQLYILNTGQGE---SQKD-----AKYDQF 70 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~-~G~~Ir~f~~g~~~i~D~~FSPDG-t~~a~tD~~G~l~Ifg~G~g~---~~~~-----~p~EQF 70 (1159)
||+| |.+|+||.-. .-.+|-.|......|+|+||||-= +.||++|.+|.|.|||+=... ..++ +...-|
T Consensus 415 ls~g-DW~vriWs~~~~~~Pl~~~~~~~~~v~~vaWSptrpavF~~~d~~G~l~iWDLl~~~~~Pv~s~~~~~~~l~~~~ 493 (555)
T KOG1587|consen 415 LSVG-DWTVRIWSEDVIASPLLSLDSSPDYVTDVAWSPTRPAVFATVDGDGNLDIWDLLQDDEEPVLSQKVCSPALTRVR 493 (555)
T ss_pred eeec-cceeEeccccCCCCcchhhhhccceeeeeEEcCcCceEEEEEcCCCceehhhhhccccCCcccccccccccceee
Confidence 4666 9999999998 777887777888889999999986 789999999999999995532 1111 112335
Q ss_pred cccCCcccee-cCCCC
Q 001090 71 FLGDYRPLVQ-DTYGN 85 (1159)
Q Consensus 71 F~tDYrpLir-D~~g~ 85 (1159)
|.-..+-|.- |++|.
T Consensus 494 ~s~~g~~lavGd~~G~ 509 (555)
T KOG1587|consen 494 WSPNGKLLAVGDANGT 509 (555)
T ss_pred cCCCCcEEEEecCCCc
Confidence 5555554444 78883
No 210
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=92.39 E-value=0.18 Score=57.51 Aligned_cols=59 Identities=15% Similarity=0.255 Sum_probs=53.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCC--EEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGA--SIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt--~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
++||.||++.||||.+| ++..+..|.++|..|+|=+-.. .|++|.=+-+|..||+-+..
T Consensus 88 f~g~~Dk~~k~wDL~S~-Q~~~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R~~~ 148 (347)
T KOG0647|consen 88 FSGGCDKQAKLWDLASG-QVSQVAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTRSSN 148 (347)
T ss_pred EeeccCCceEEEEccCC-CeeeeeecccceeEEEEecCCCcceeEecccccceeecccCCCC
Confidence 47999999999999999 6778889999999999999888 89999999999999988654
No 211
>PRK04922 tolB translocation protein TolB; Provisional
Probab=92.33 E-value=0.19 Score=58.43 Aligned_cols=73 Identities=19% Similarity=0.138 Sum_probs=51.1
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecC---ceeEEEEecCCCCCccC--Ccc---ccccccCCcc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDD---VGQLYILNTGQGESQKD--AKY---DQFFLGDYRP 77 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~---~G~l~Ifg~G~g~~~~~--~p~---EQFF~tDYrp 77 (1159)
..+|.|||.. |...+.+..+...+...+|||||++||.+.. ..+|+++++..+....- .+. .--|+.|.+-
T Consensus 183 ~~~l~i~D~~-g~~~~~lt~~~~~v~~p~wSpDg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~~~g~~~~~~~SpDG~~ 261 (433)
T PRK04922 183 RYALQVADSD-GYNPQTILRSAEPILSPAWSPDGKKLAYVSFERGRSAIYVQDLATGQRELVASFRGINGAPSFSPDGRR 261 (433)
T ss_pred eEEEEEECCC-CCCceEeecCCCccccccCCCCCCEEEEEecCCCCcEEEEEECCCCCEEEeccCCCCccCceECCCCCE
Confidence 3579999985 6666667777888999999999999997653 34799999976653211 111 1146777765
Q ss_pred ce
Q 001090 78 LV 79 (1159)
Q Consensus 78 Li 79 (1159)
|+
T Consensus 262 l~ 263 (433)
T PRK04922 262 LA 263 (433)
T ss_pred EE
Confidence 54
No 212
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=92.27 E-value=0.066 Score=62.63 Aligned_cols=62 Identities=16% Similarity=0.191 Sum_probs=56.7
Q ss_pred CCcccCCcEEEEEccCCe--eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 1 MSAGYDGKTIVWDIWEGI--PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~--~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
++||.|..|.+|++.-++ .+.++.|..+.|.+++|-+||.+++++..++.+.||.+++....
T Consensus 191 atgg~Dr~Ik~W~v~~~k~~~~~tLaGs~g~it~~d~d~~~~~~iAas~d~~~r~Wnvd~~r~~ 254 (459)
T KOG0288|consen 191 ATGGSDRIIKLWNVLGEKSELISTLAGSLGNITSIDFDSDNKHVIAASNDKNLRLWNVDSLRLR 254 (459)
T ss_pred hhcchhhhhhhhhcccchhhhhhhhhccCCCcceeeecCCCceEEeecCCCceeeeeccchhhh
Confidence 479999999999999998 77778888999999999999999999999999999999987533
No 213
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=92.18 E-value=0.13 Score=62.64 Aligned_cols=56 Identities=18% Similarity=0.149 Sum_probs=51.2
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
.+.-|.|.||||.+++.+-.+.+|.-.|-++.||-||..||+++-+-+|.||+--.
T Consensus 146 s~a~g~v~i~D~stqk~~~el~~h~d~vQSa~WseDG~llatscKdkqirifDPRa 201 (1012)
T KOG1445|consen 146 SGAHGSVYITDISTQKTAVELSGHTDKVQSADWSEDGKLLATSCKDKQIRIFDPRA 201 (1012)
T ss_pred eccCceEEEEEcccCceeecccCCchhhhccccccCCceEeeecCCcceEEeCCcc
Confidence 46679999999999999988889999999999999999999999999999999643
No 214
>PRK04922 tolB translocation protein TolB; Provisional
Probab=92.12 E-value=0.25 Score=57.56 Aligned_cols=56 Identities=25% Similarity=0.335 Sum_probs=40.1
Q ss_pred ccCC--cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEe-cCcee--EEEEecCCCC
Q 001090 4 GYDG--KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILS-DDVGQ--LYILNTGQGE 60 (1159)
Q Consensus 4 G~DG--~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~t-D~~G~--l~Ifg~G~g~ 60 (1159)
+.+| .|.+||+.+|.+.+ +..+.....+.+|||||++|+.+ |..|. |+++++..+.
T Consensus 267 s~~g~~~Iy~~d~~~g~~~~-lt~~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~ 327 (433)
T PRK04922 267 SRDGNPEIYVMDLGSRQLTR-LTNHFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGS 327 (433)
T ss_pred eCCCCceEEEEECCCCCeEE-CccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 4445 59999999998654 44444556788999999999855 55554 8888875554
No 215
>PRK03629 tolB translocation protein TolB; Provisional
Probab=91.82 E-value=0.41 Score=55.96 Aligned_cols=54 Identities=19% Similarity=0.382 Sum_probs=38.9
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEe-cCce--eEEEEecCCCC
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILS-DDVG--QLYILNTGQGE 60 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~t-D~~G--~l~Ifg~G~g~ 60 (1159)
+.+|.+||+.+|+...+.. ..+.+.+.+|||||++||.+ +..| +|+++++..+.
T Consensus 222 ~~~i~i~dl~~G~~~~l~~-~~~~~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~ 278 (429)
T PRK03629 222 RSALVIQTLANGAVRQVAS-FPRHNGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQ 278 (429)
T ss_pred CcEEEEEECCCCCeEEccC-CCCCcCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCC
Confidence 4579999999997544322 22334568999999999965 5455 59999987664
No 216
>PRK00178 tolB translocation protein TolB; Provisional
Probab=91.80 E-value=0.48 Score=54.61 Aligned_cols=71 Identities=17% Similarity=0.133 Sum_probs=49.0
Q ss_pred cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEe-cCc--eeEEEEecCCCCCccCC--c---cccccccCCccce
Q 001090 8 KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILS-DDV--GQLYILNTGQGESQKDA--K---YDQFFLGDYRPLV 79 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~t-D~~--G~l~Ifg~G~g~~~~~~--p---~EQFF~tDYrpLi 79 (1159)
+|.++|...+. ++.+..+...+...+|||||++||.+ +.. .+|+++++..+....-+ + ..-.|+.|.+-|+
T Consensus 180 ~l~~~d~~g~~-~~~l~~~~~~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~~~~l~~~~g~~~~~~~SpDG~~la 258 (430)
T PRK00178 180 TLQRSDYDGAR-AVTLLQSREPILSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGRREQITNFEGLNGAPAWSPDGSKLA 258 (430)
T ss_pred EEEEECCCCCC-ceEEecCCCceeeeeECCCCCEEEEEEcCCCCCEEEEEECCCCCEEEccCCCCCcCCeEECCCCCEEE
Confidence 58889997555 45555666788999999999999865 333 47999999777532211 1 1235788877665
No 217
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=91.57 E-value=0.21 Score=56.93 Aligned_cols=57 Identities=18% Similarity=0.294 Sum_probs=47.6
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
-++|||.++|+|+..- .++.-..+..+|+||+|.+ -.++++|+.+|+|..+|+-.+.
T Consensus 30 vssWDgslrlYdv~~~-~l~~~~~~~~plL~c~F~d-~~~~~~G~~dg~vr~~Dln~~~ 86 (323)
T KOG1036|consen 30 VSSWDGSLRLYDVPAN-SLKLKFKHGAPLLDCAFAD-ESTIVTGGLDGQVRRYDLNTGN 86 (323)
T ss_pred EEeccCcEEEEeccch-hhhhheecCCceeeeeccC-CceEEEeccCceEEEEEecCCc
Confidence 4689999999999887 4444446788999999998 5778999999999999996654
No 218
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=91.53 E-value=0.39 Score=57.70 Aligned_cols=38 Identities=21% Similarity=0.351 Sum_probs=31.5
Q ss_pred cceeeeeeeCCCCCEEEEecCceeEEEEecCCCC-CccC
Q 001090 27 RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE-SQKD 64 (1159)
Q Consensus 27 ~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~-~~~~ 64 (1159)
..+|--++|||||.+||++..+++|+||.+..+. +|+.
T Consensus 447 ~~~ls~v~ysp~G~~lAvgs~d~~iyiy~Vs~~g~~y~r 485 (626)
T KOG2106|consen 447 NEQLSVVRYSPDGAFLAVGSHDNHIYIYRVSANGRKYSR 485 (626)
T ss_pred CCceEEEEEcCCCCEEEEecCCCeEEEEEECCCCcEEEE
Confidence 3567788999999999999999999999996643 4443
No 219
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=91.42 E-value=0.35 Score=53.49 Aligned_cols=55 Identities=11% Similarity=0.206 Sum_probs=41.8
Q ss_pred CcccCCcEEEEEccC-Ce--eeEEeecCcceeeeeeeCCCCCEEEEe-cCceeEEEEecC
Q 001090 2 SAGYDGKTIVWDIWE-GI--PIRIYEISRFRLVDGKFSPDGASIILS-DDVGQLYILNTG 57 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~-G~--~Ir~f~~g~~~i~D~~FSPDGt~~a~t-D~~G~l~Ifg~G 57 (1159)
++++||.|.+||+.+ |+ .++.+..+ +...-++|||||.+|.++ -..+.|.+|++.
T Consensus 7 ~~~~~~~I~~~~~~~~g~l~~~~~~~~~-~~~~~l~~spd~~~lyv~~~~~~~i~~~~~~ 65 (330)
T PRK11028 7 ASPESQQIHVWNLNHEGALTLLQVVDVP-GQVQPMVISPDKRHLYVGVRPEFRVLSYRIA 65 (330)
T ss_pred EcCCCCCEEEEEECCCCceeeeeEEecC-CCCccEEECCCCCEEEEEECCCCcEEEEEEC
Confidence 468899999999974 64 56666532 445567999999998655 468999999885
No 220
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=91.37 E-value=0.34 Score=59.76 Aligned_cols=73 Identities=12% Similarity=0.140 Sum_probs=55.8
Q ss_pred cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCccceecCC
Q 001090 8 KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQDTY 83 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLirD~~ 83 (1159)
-|+||...+=..+..+.+|...|--++|||||++|+++..+-++.+|..+-......- --+..++..+|.|+.
T Consensus 553 vI~lw~t~~W~~~~~L~~HsLTVT~l~FSpdg~~LLsvsRDRt~sl~~~~~~~~~e~~---fa~~k~HtRIIWdcs 625 (764)
T KOG1063|consen 553 VIRLWNTANWLQVQELEGHSLTVTRLAFSPDGRYLLSVSRDRTVSLYEVQEDIKDEFR---FACLKAHTRIIWDCS 625 (764)
T ss_pred EEEEEeccchhhhheecccceEEEEEEECCCCcEEEEeecCceEEeeeeecccchhhh---hccccccceEEEEcc
Confidence 3789999998888888999999999999999999999999999999987544321100 113555556666543
No 221
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=91.35 E-value=0.31 Score=55.69 Aligned_cols=56 Identities=25% Similarity=0.418 Sum_probs=45.3
Q ss_pred CCcccCCcEEEEEcc------CC---eeeEEeecCcceeeeeeeCCC--CCEEEEecCceeEEEEec
Q 001090 1 MSAGYDGKTIVWDIW------EG---IPIRIYEISRFRLVDGKFSPD--GASIILSDDVGQLYILNT 56 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~------~G---~~Ir~f~~g~~~i~D~~FSPD--Gt~~a~tD~~G~l~Ifg~ 56 (1159)
++++||++|.||.=. .| +...++...+.+|+|++|+|= |.+||++..+|.|.||..
T Consensus 77 A~cS~Drtv~iWEE~~~~~~~~~~~Wv~~ttl~DsrssV~DV~FaP~hlGLklA~~~aDG~lRIYEA 143 (361)
T KOG2445|consen 77 ATCSYDRTVSIWEEQEKSEEAHGRRWVRRTTLVDSRSSVTDVKFAPKHLGLKLAAASADGILRIYEA 143 (361)
T ss_pred EEEecCCceeeeeecccccccccceeEEEEEeecCCcceeEEEecchhcceEEEEeccCcEEEEEec
Confidence 478999999999752 12 133345567899999999995 999999999999999984
No 222
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=91.32 E-value=0.19 Score=62.31 Aligned_cols=58 Identities=19% Similarity=0.213 Sum_probs=55.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+|+|-||.|+=||=+.=++|+++.+|+..|+.++.+|+|.+++++..+-.|.+|.-+-
T Consensus 608 Ft~gKD~kvKqWDg~kFe~iq~L~~H~~ev~cLav~~~G~~vvs~shD~sIRlwE~td 665 (888)
T KOG0306|consen 608 FTCGKDGKVKQWDGEKFEEIQKLDGHHSEVWCLAVSPNGSFVVSSSHDKSIRLWERTD 665 (888)
T ss_pred EEecCcceEEeechhhhhhheeeccchheeeeeEEcCCCCeEEeccCCceeEeeeccC
Confidence 4899999999999999999999999999999999999999999999999999998554
No 223
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=91.22 E-value=0.36 Score=58.48 Aligned_cols=57 Identities=26% Similarity=0.274 Sum_probs=52.6
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|+|.|++|.+|+..++..+++|..+.-.+..++.||||..++++ .++|.+|++..++
T Consensus 119 S~~ad~~v~~~~~~~~~~~~~~~~~~~~~~sl~is~D~~~l~~a--s~~ik~~~~~~ke 175 (541)
T KOG4547|consen 119 SVGADLKVVYILEKEKVIIRIWKEQKPLVSSLCISPDGKILLTA--SRQIKVLDIETKE 175 (541)
T ss_pred ecCCceeEEEEecccceeeeeeccCCCccceEEEcCCCCEEEec--cceEEEEEccCce
Confidence 78999999999999999999999999999999999999999876 4789999987765
No 224
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=91.18 E-value=0.41 Score=53.55 Aligned_cols=61 Identities=18% Similarity=0.116 Sum_probs=51.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES 61 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~ 61 (1159)
+.||-||.|.-||+++|+.-|.|.||..-|..+.----.-.|.+|.++|++.||++-+++.
T Consensus 130 ~~AgGD~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~~~qilsG~EDGtvRvWd~kt~k~ 190 (325)
T KOG0649|consen 130 LFAGGDGVIYQVDLEDGRIQREYRGHTDYVHSVVGRNANGQILSGAEDGTVRVWDTKTQKH 190 (325)
T ss_pred EEecCCeEEEEEEecCCEEEEEEcCCcceeeeeeecccCcceeecCCCccEEEEeccccce
Confidence 3688999999999999999999999987777777733344577899999999999988763
No 225
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=91.15 E-value=0.47 Score=53.82 Aligned_cols=73 Identities=22% Similarity=0.206 Sum_probs=49.8
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCc---eeEEEEecCCCCCccCC-----ccccccccCCcc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDV---GQLYILNTGQGESQKDA-----KYDQFFLGDYRP 77 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~---G~l~Ifg~G~g~~~~~~-----p~EQFF~tDYrp 77 (1159)
...|.+||.. |...+.+..+...++..+|||||++||.+... .+|++|++..+...... ...-.|+.|..-
T Consensus 169 ~~~l~~~d~~-g~~~~~l~~~~~~~~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~~~~~~~~~~spDg~~ 247 (417)
T TIGR02800 169 RYELQVADYD-GANPQTITRSREPILSPAWSPDGQKLAYVSFESGKPEIYVQDLATGQREKVASFPGMNGAPAFSPDGSK 247 (417)
T ss_pred cceEEEEcCC-CCCCEEeecCCCceecccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEEEeecCCCCccceEECCCCCE
Confidence 4568899985 55556666666789999999999999977543 48999998776422111 112246667655
Q ss_pred ce
Q 001090 78 LV 79 (1159)
Q Consensus 78 Li 79 (1159)
|+
T Consensus 248 l~ 249 (417)
T TIGR02800 248 LA 249 (417)
T ss_pred EE
Confidence 54
No 226
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=91.06 E-value=0.25 Score=59.68 Aligned_cols=61 Identities=11% Similarity=0.111 Sum_probs=49.4
Q ss_pred CCcccCCcEEEEEc------cCCe----eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC
Q 001090 1 MSAGYDGKTIVWDI------WEGI----PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES 61 (1159)
Q Consensus 1 lSAG~DG~I~lWDI------~~G~----~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~ 61 (1159)
.|||+||+|+.|.+ ..+- ..-+|.||-.+||-+++|+--..|++.+.+|++++|..+....
T Consensus 360 ysgg~Dg~I~~w~~p~n~dp~ds~dp~vl~~~l~Ghtdavw~l~~s~~~~~Llscs~DgTvr~w~~~~~~~ 430 (577)
T KOG0642|consen 360 YSGGIDGTIRCWNLPPNQDPDDSYDPSVLSGTLLGHTDAVWLLALSSTKDRLLSCSSDGTVRLWEPTEESP 430 (577)
T ss_pred EeeccCceeeeeccCCCCCcccccCcchhccceeccccceeeeeecccccceeeecCCceEEeeccCCcCc
Confidence 48999999999932 2221 2224778899999999999999999999999999999876653
No 227
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=91.04 E-value=0.55 Score=53.26 Aligned_cols=53 Identities=21% Similarity=0.295 Sum_probs=39.8
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEe-cCce--eEEEEecCCCC
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILS-DDVG--QLYILNTGQGE 60 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~t-D~~G--~l~Ifg~G~g~ 60 (1159)
.+|.+||+.+|...++ ....+.+..++|||||..|+.+ +..| +|++++++.+.
T Consensus 214 ~~i~v~d~~~g~~~~~-~~~~~~~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~~~~ 269 (417)
T TIGR02800 214 PEIYVQDLATGQREKV-ASFPGMNGAPAFSPDGSKLAVSLSKDGNPDIYVMDLDGKQ 269 (417)
T ss_pred cEEEEEECCCCCEEEe-ecCCCCccceEECCCCCEEEEEECCCCCccEEEEECCCCC
Confidence 5799999999976543 3344566779999999998854 4444 69999998765
No 228
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=91.01 E-value=0.22 Score=59.01 Aligned_cols=57 Identities=11% Similarity=0.168 Sum_probs=51.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G 57 (1159)
+|||-|..|.|||..+++.+++|.+|+++|+.++|--.-..+.++.-+..+.||.+.
T Consensus 218 atgg~d~~v~Iw~~~t~ehv~~~~ghr~~V~~L~fr~gt~~lys~s~Drsvkvw~~~ 274 (479)
T KOG0299|consen 218 ATGGRDRHVQIWDCDTLEHVKVFKGHRGAVSSLAFRKGTSELYSASADRSVKVWSID 274 (479)
T ss_pred EecCCCceEEEecCcccchhhcccccccceeeeeeecCccceeeeecCCceEEEehh
Confidence 589999999999999999999999999999999998777778888888999998864
No 229
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=90.98 E-value=0.48 Score=52.41 Aligned_cols=60 Identities=12% Similarity=0.174 Sum_probs=53.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeec----C---cceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEI----S---RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~----g---~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+||+.|.+|++||+.--.|++++.. + ..+|-.++.-|-|+.||++..+-.-.+|++-.+.
T Consensus 198 ~sgsqdktirfwdlrv~~~v~~l~~~~~~~glessavaav~vdpsgrll~sg~~dssc~lydirg~r 264 (350)
T KOG0641|consen 198 ASGSQDKTIRFWDLRVNSCVNTLDNDFHDGGLESSAVAAVAVDPSGRLLASGHADSSCMLYDIRGGR 264 (350)
T ss_pred EccCCCceEEEEeeeccceeeeccCcccCCCcccceeEEEEECCCcceeeeccCCCceEEEEeeCCc
Confidence 5899999999999999999998752 2 4789999999999999999999999999996664
No 230
>PRK03629 tolB translocation protein TolB; Provisional
Probab=90.97 E-value=0.39 Score=56.16 Aligned_cols=52 Identities=21% Similarity=0.333 Sum_probs=38.2
Q ss_pred cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec-Cce--eEEEEecCCCC
Q 001090 8 KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD-DVG--QLYILNTGQGE 60 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~~G--~l~Ifg~G~g~ 60 (1159)
+|.+||+.+|...+.. .+...+...+|||||+.||.+. ..| +|+++++..+.
T Consensus 268 ~I~~~d~~tg~~~~lt-~~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~ 322 (429)
T PRK03629 268 NLYVMDLASGQIRQVT-DGRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGA 322 (429)
T ss_pred EEEEEECCCCCEEEcc-CCCCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCC
Confidence 5999999999876544 4455678999999999998554 444 66777775553
No 231
>PRK00178 tolB translocation protein TolB; Provisional
Probab=90.89 E-value=0.58 Score=53.98 Aligned_cols=53 Identities=17% Similarity=0.240 Sum_probs=39.1
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEE-ecCce--eEEEEecCCCC
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIIL-SDDVG--QLYILNTGQGE 60 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~-tD~~G--~l~Ifg~G~g~ 60 (1159)
.+|.+||+.+|+..++. ...+.+...+|||||..||. .+..| .|+++++..+.
T Consensus 223 ~~l~~~~l~~g~~~~l~-~~~g~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~ 278 (430)
T PRK00178 223 PRIFVQNLDTGRREQIT-NFEGLNGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQ 278 (430)
T ss_pred CEEEEEECCCCCEEEcc-CCCCCcCCeEECCCCCEEEEEEccCCCceEEEEECCCCC
Confidence 36899999999865543 23344567899999999984 45555 68999987765
No 232
>PRK02889 tolB translocation protein TolB; Provisional
Probab=90.70 E-value=0.45 Score=55.48 Aligned_cols=73 Identities=18% Similarity=0.247 Sum_probs=50.6
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec-C--ceeEEEEecCCCCCccCCccc-----cccccCCcc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD-D--VGQLYILNTGQGESQKDAKYD-----QFFLGDYRP 77 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~--~G~l~Ifg~G~g~~~~~~p~E-----QFF~tDYrp 77 (1159)
...|.+||. .|...+.+..+...+...+|||||+.||.+. . ..+|+++++..+.....+... --|+.|.+-
T Consensus 175 ~~~L~~~D~-dG~~~~~l~~~~~~v~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~ 253 (427)
T PRK02889 175 RYQLQISDA-DGQNAQSALSSPEPIISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVVANFKGSNSAPAWSPDGRT 253 (427)
T ss_pred ccEEEEECC-CCCCceEeccCCCCcccceEcCCCCEEEEEEccCCCcEEEEEECCCCCEEEeecCCCCccceEECCCCCE
Confidence 467888898 6766666777778899999999999999654 2 346999999777532222111 136666665
Q ss_pred ce
Q 001090 78 LV 79 (1159)
Q Consensus 78 Li 79 (1159)
|+
T Consensus 254 la 255 (427)
T PRK02889 254 LA 255 (427)
T ss_pred EE
Confidence 54
No 233
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=90.62 E-value=0.36 Score=53.38 Aligned_cols=54 Identities=4% Similarity=-0.042 Sum_probs=39.7
Q ss_pred cccCCcEEEEEccCCeeeE-------EeecCcceeeeeeeCCCCCEEEEecC-ceeEEEEecC
Q 001090 3 AGYDGKTIVWDIWEGIPIR-------IYEISRFRLVDGKFSPDGASIILSDD-VGQLYILNTG 57 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir-------~f~~g~~~i~D~~FSPDGt~~a~tD~-~G~l~Ifg~G 57 (1159)
...+++|.|||+.+...+. .+..| .....++|+|||++|.+++. .+.|.+|++-
T Consensus 144 ~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~g-~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~ 205 (330)
T PRK11028 144 CLKEDRIRLFTLSDDGHLVAQEPAEVTTVEG-AGPRHMVFHPNQQYAYCVNELNSSVDVWQLK 205 (330)
T ss_pred eCCCCEEEEEEECCCCcccccCCCceecCCC-CCCceEEECCCCCEEEEEecCCCEEEEEEEe
Confidence 3457999999998733221 11122 23567899999999998887 9999999984
No 234
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=90.56 E-value=0.23 Score=59.81 Aligned_cols=36 Identities=28% Similarity=0.335 Sum_probs=31.9
Q ss_pred cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 25 ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 25 ~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
.+.+.|..++|||||++||++..+|.|.||++-+-+
T Consensus 288 ~~~g~in~f~FS~DG~~LA~VSqDGfLRvF~fdt~e 323 (636)
T KOG2394|consen 288 IGEGSINEFAFSPDGKYLATVSQDGFLRIFDFDTQE 323 (636)
T ss_pred eccccccceeEcCCCceEEEEecCceEEEeeccHHH
Confidence 456799999999999999999999999999986543
No 235
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=90.51 E-value=0.26 Score=56.61 Aligned_cols=58 Identities=16% Similarity=0.245 Sum_probs=47.8
Q ss_pred CcccCCcEEEEEccCCe------eeEEee--cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 2 SAGYDGKTIVWDIWEGI------PIRIYE--ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~------~Ir~f~--~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
++|.---|++|.+.=|+ ..|+|. ||..+|..+||||+.+.+|++.-+|+|.||++...
T Consensus 245 ~~gFTpDVkVwE~~f~kdG~fqev~rvf~LkGH~saV~~~aFsn~S~r~vtvSkDG~wriwdtdVr 310 (420)
T KOG2096|consen 245 VSGFTPDVKVWEPIFTKDGTFQEVKRVFSLKGHQSAVLAAAFSNSSTRAVTVSKDGKWRIWDTDVR 310 (420)
T ss_pred EecCCCCceEEEEEeccCcchhhhhhhheeccchhheeeeeeCCCcceeEEEecCCcEEEeeccce
Confidence 45556668999986443 456665 78999999999999999999999999999998764
No 236
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=90.36 E-value=0.71 Score=53.76 Aligned_cols=57 Identities=19% Similarity=0.214 Sum_probs=49.2
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCC-EEEEec-CceeEEEEecCCCCCccC
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGA-SIILSD-DVGQLYILNTGQGESQKD 64 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt-~~a~tD-~~G~l~Ifg~G~g~~~~~ 64 (1159)
+.|.++|+.+++.++.+..|+ .++.++|||||+ +|.++. ..|.|+++|+.+++..+.
T Consensus 279 ~~V~ViD~~t~kvi~~i~vG~-~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t~k~i~~ 337 (352)
T TIGR02658 279 RFLFVVDAKTGKRLRKIELGH-EIDSINVSQDAKPLLYALSTGDKTLYIFDAETGKELSS 337 (352)
T ss_pred CEEEEEECCCCeEEEEEeCCC-ceeeEEECCCCCeEEEEeCCCCCcEEEEECcCCeEEee
Confidence 589999999999999988765 788999999999 888888 689999999988864443
No 237
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=90.35 E-value=0.6 Score=58.73 Aligned_cols=58 Identities=12% Similarity=0.191 Sum_probs=52.1
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEE-EEecCceeEEEEecCCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASI-ILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~-a~tD~~G~l~Ifg~G~g~ 60 (1159)
|--+|+|+|.+|..|+.|..|.+..++|..+.|.-||..+ |+++..|++.||++.-.+
T Consensus 220 G~~~G~ViifNlK~dkil~sFk~d~g~VtslSFrtDG~p~las~~~~G~m~~wDLe~kk 278 (910)
T KOG1539|consen 220 GLENGTVIIFNLKFDKILMSFKQDWGRVTSLSFRTDGNPLLASGRSNGDMAFWDLEKKK 278 (910)
T ss_pred eccCceEEEEEcccCcEEEEEEccccceeEEEeccCCCeeEEeccCCceEEEEEcCCCe
Confidence 4568999999999999999999999999999999999965 578888999999987654
No 238
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=90.20 E-value=0.39 Score=58.02 Aligned_cols=52 Identities=25% Similarity=0.346 Sum_probs=38.4
Q ss_pred CCcccCCcEEEEEccCCeeeEE------------------ee-----------cCcceeeeeeeCCCCCEEEEecCceeE
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRI------------------YE-----------ISRFRLVDGKFSPDGASIILSDDVGQL 51 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~------------------f~-----------~g~~~i~D~~FSPDGt~~a~tD~~G~l 51 (1159)
+|||.|=+..|||-. |..|-. |. -.-|.||.+.||||||++++|...|++
T Consensus 202 ~sgGED~kfKvWD~~-G~~Lf~S~~~ey~ITSva~npd~~~~v~S~nt~R~~~p~~GSifnlsWS~DGTQ~a~gt~~G~v 280 (737)
T KOG1524|consen 202 ASGGEDFRFKIWDAQ-GANLFTSAAEEYAITSVAFNPEKDYLLWSYNTARFSSPRVGSIFNLSWSADGTQATCGTSTGQL 280 (737)
T ss_pred eecCCceeEEeeccc-CcccccCChhccceeeeeeccccceeeeeeeeeeecCCCccceEEEEEcCCCceeeccccCceE
Confidence 589999999999964 332210 11 114789999999999999999999988
Q ss_pred EE
Q 001090 52 YI 53 (1159)
Q Consensus 52 ~I 53 (1159)
.+
T Consensus 281 ~~ 282 (737)
T KOG1524|consen 281 IV 282 (737)
T ss_pred EE
Confidence 64
No 239
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=90.12 E-value=0.5 Score=53.38 Aligned_cols=62 Identities=15% Similarity=0.157 Sum_probs=58.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
+|++-|.++-||=-.+|+.|-.|.||.++||.|...-|..+++++..+-++.||++-.|+..
T Consensus 26 FscaKD~~~~vw~s~nGerlGty~GHtGavW~~Did~~s~~liTGSAD~t~kLWDv~tGk~l 87 (327)
T KOG0643|consen 26 FSCAKDSTPTVWYSLNGERLGTYDGHTGAVWCCDIDWDSKHLITGSADQTAKLWDVETGKQL 87 (327)
T ss_pred EEecCCCCceEEEecCCceeeeecCCCceEEEEEecCCcceeeeccccceeEEEEcCCCcEE
Confidence 47899999999999999999999999999999999999999999999999999999998743
No 240
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=90.09 E-value=0.32 Score=55.97 Aligned_cols=182 Identities=20% Similarity=0.260 Sum_probs=100.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeec--C---cceeeeeeeCCCCCEEEEecCceeEEEEec-CCCCCccCCccccccccC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEI--S---RFRLVDGKFSPDGASIILSDDVGQLYILNT-GQGESQKDAKYDQFFLGD 74 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~--g---~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~-G~g~~~~~~p~EQFF~tD 74 (1159)
++.+-|.-|.|||-.+|+.-..|.+ + -.+-.+++|||||.+|.+|+. -.|.||++ -.|..... |--||++-
T Consensus 127 a~ssr~~PIh~wdaftG~lraSy~~ydh~de~taAhsL~Fs~DGeqlfaGyk-rcirvFdt~RpGr~c~v--y~t~~~~k 203 (406)
T KOG2919|consen 127 AVSSRDQPIHLWDAFTGKLRASYRAYDHQDEYTAAHSLQFSPDGEQLFAGYK-RCIRVFDTSRPGRDCPV--YTTVTKGK 203 (406)
T ss_pred eeccccCceeeeeccccccccchhhhhhHHhhhhheeEEecCCCCeEeeccc-ceEEEeeccCCCCCCcc--hhhhhccc
Confidence 3567888999999999998888874 2 347789999999999999986 58999998 33321111 12244432
Q ss_pred Cc--cceecCCCCccccccCcCCcccC-CCCCcccCCCCCCCchhHHhhhhhhhcccCccccCCccccccCCCCcC----
Q 001090 75 YR--PLVQDTYGNVLDQETQLAPHRRN-LQDPLCDSAMIPYPEPYQTMYQQRRLGALGIEWRPSSLKLAVGPDFSL---- 147 (1159)
Q Consensus 75 Yr--pLirD~~g~vlDe~Tq~~PHL~~-~~~~L~D~~~~PYp~~~Q~~~~~rrl~~l~~e~~p~~~~~~vg~~~~~---- 147 (1159)
.. -+|-=-.-++.|. +-++-|-.- .-|+.||-.+.| +|.+. ++.=|---+.|+|--=++=+|+--..
T Consensus 204 ~gq~giisc~a~sP~~~-~~~a~gsY~q~~giy~~~~~~p----l~llg-gh~gGvThL~~~edGn~lfsGaRk~dkIl~ 277 (406)
T KOG2919|consen 204 FGQKGIISCFAFSPMDS-KTLAVGSYGQRVGIYNDDGRRP----LQLLG-GHGGGVTHLQWCEDGNKLFSGARKDDKILC 277 (406)
T ss_pred ccccceeeeeeccCCCC-cceeeecccceeeeEecCCCCc----eeeec-ccCCCeeeEEeccCcCeecccccCCCeEEE
Confidence 21 1111000000110 000111110 014566666555 34333 44334344678877666666643331
Q ss_pred -CCccccccccCcccccCCCccccccccCCccccc-ccCCCCCcccc
Q 001090 148 -DQGYQLQPLADLDVMIDPLPEFIDVMDWEPENEV-QSDDNDSEYNV 192 (1159)
Q Consensus 148 -~~~~~~~p~~~~~~~~~~~~e~~d~~~wepe~ev-~sdd~dseyn~ 192 (1159)
|--+-..|+-.|.|-++---+ +=..|.+|.-|+ .|-|||---+|
T Consensus 278 WDiR~~~~pv~~L~rhv~~TNQ-RI~FDld~~~~~LasG~tdG~V~v 323 (406)
T KOG2919|consen 278 WDIRYSRDPVYALERHVGDTNQ-RILFDLDPKGEILASGDTDGSVRV 323 (406)
T ss_pred EeehhccchhhhhhhhccCccc-eEEEecCCCCceeeccCCCccEEE
Confidence 123445566666665542212 124677888766 46667766554
No 241
>PRK02889 tolB translocation protein TolB; Provisional
Probab=90.09 E-value=0.92 Score=52.98 Aligned_cols=73 Identities=19% Similarity=0.242 Sum_probs=46.0
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEE-ecCcee--EEEEecCCCCCccCCc-----cccccccCCccc
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIIL-SDDVGQ--LYILNTGQGESQKDAK-----YDQFFLGDYRPL 78 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~-tD~~G~--l~Ifg~G~g~~~~~~p-----~EQFF~tDYrpL 78 (1159)
.+|.+||+.+|++.++ ....+.+...+|||||..||. .+..|. |+++++..+....-+. ..--|+.|.+-|
T Consensus 220 ~~I~~~dl~~g~~~~l-~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~~~~~~wSpDG~~l 298 (427)
T PRK02889 220 PVVYVHDLATGRRRVV-ANFKGSNSAPAWSPDGRTLAVALSRDGNSQIYTVNADGSGLRRLTQSSGIDTEPFFSPDGRSI 298 (427)
T ss_pred cEEEEEECCCCCEEEe-ecCCCCccceEECCCCCEEEEEEccCCCceEEEEECCCCCcEECCCCCCCCcCeEEcCCCCEE
Confidence 4699999999987554 333456678999999999985 566665 5566654443221111 112466666655
Q ss_pred ee
Q 001090 79 VQ 80 (1159)
Q Consensus 79 ir 80 (1159)
+.
T Consensus 299 ~f 300 (427)
T PRK02889 299 YF 300 (427)
T ss_pred EE
Confidence 54
No 242
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=89.87 E-value=0.48 Score=57.09 Aligned_cols=79 Identities=16% Similarity=0.232 Sum_probs=57.0
Q ss_pred CCcccCCcEEEEEccCCeeeE--Ee--ecCc--ceeeeeeeCCCCCEEEEecCceeEEEEecCCCC-Ccc-----CCccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIR--IY--EISR--FRLVDGKFSPDGASIILSDDVGQLYILNTGQGE-SQK-----DAKYD 68 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir--~f--~~g~--~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~-~~~-----~~p~E 68 (1159)
+.|.-||.|-+||. .+.-++ .+ ..|. -.|..++||+||.+|++-..++.|.+|++-+-+ .+. -++|.
T Consensus 333 Aagc~DGSIQ~W~~-~~~~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~tLKvWDLrq~kkpL~~~tgL~t~~~ 411 (641)
T KOG0772|consen 333 AAGCLDGSIQIWDK-GSRTVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDDTLKVWDLRQFKKPLNVRTGLPTPFP 411 (641)
T ss_pred hhcccCCceeeeec-CCcccccceEeeeccCCCCceeEEEeccccchhhhccCCCceeeeeccccccchhhhcCCCccCC
Confidence 45788999999997 222222 11 2343 489999999999999999999999999998754 221 12344
Q ss_pred c---ccccCCcccee
Q 001090 69 Q---FFLGDYRPLVQ 80 (1159)
Q Consensus 69 Q---FF~tDYrpLir 80 (1159)
+ -|+.|-+-|+.
T Consensus 412 ~tdc~FSPd~kli~T 426 (641)
T KOG0772|consen 412 GTDCCFSPDDKLILT 426 (641)
T ss_pred CCccccCCCceEEEe
Confidence 4 68888876665
No 243
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=89.62 E-value=0.61 Score=52.45 Aligned_cols=57 Identities=16% Similarity=0.166 Sum_probs=48.9
Q ss_pred CCcccCCcEEEEEccC-CeeeEEeecCcceeeeeeeCC-CCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWE-GIPIRIYEISRFRLVDGKFSP-DGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~-G~~Ir~f~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+|+|-||+++||||.. |+.+. +..|...|+.|-|+- |-..+|+++.++.|.+||+-.
T Consensus 164 as~Sgd~~l~lwdvr~~gk~~~-i~ah~~Eil~cdw~ky~~~vl~Tg~vd~~vr~wDir~ 222 (311)
T KOG0277|consen 164 ASASGDGTLRLWDVRSPGKFMS-IEAHNSEILCCDWSKYNHNVLATGGVDNLVRGWDIRN 222 (311)
T ss_pred EEccCCceEEEEEecCCCceeE-EEeccceeEeecccccCCcEEEecCCCceEEEEehhh
Confidence 5899999999999975 55554 888999999999996 667788999999999999864
No 244
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=89.41 E-value=0.66 Score=60.38 Aligned_cols=86 Identities=15% Similarity=0.101 Sum_probs=58.6
Q ss_pred CCcccCCcEEEEEccCC----e---eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCcc------CCcc
Q 001090 1 MSAGYDGKTIVWDIWEG----I---PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQK------DAKY 67 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G----~---~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~------~~p~ 67 (1159)
+|||.||+|+|||...= . +.-+|.--...+.-+.+=+.|.+||++..+|.+.+++++.....+ ..|.
T Consensus 1065 vsgS~DGtVKvW~~~k~~~~~~s~rS~ltys~~~sr~~~vt~~~~~~~~Av~t~DG~v~~~~id~~~~~~~~~~~~ri~n 1144 (1431)
T KOG1240|consen 1065 VSGSDDGTVKVWNLRKLEGEGGSARSELTYSPEGSRVEKVTMCGNGDQFAVSTKDGSVRVLRIDHYNVSKRVATQVRIPN 1144 (1431)
T ss_pred EEecCCceEEEeeehhhhcCcceeeeeEEEeccCCceEEEEeccCCCeEEEEcCCCeEEEEEccccccccceeeeeeccc
Confidence 58999999999997541 2 222444445566667777899999999999999999999843222 1222
Q ss_pred ccccccCCccceecCCCCcccccc
Q 001090 68 DQFFLGDYRPLVQDTYGNVLDQET 91 (1159)
Q Consensus 68 EQFF~tDYrpLirD~~g~vlDe~T 91 (1159)
.-++. -+.+.++++-+.++
T Consensus 1145 ~~~~g-----~vv~m~a~~~~~~S 1163 (1431)
T KOG1240|consen 1145 LKKDG-----VVVSMHAFTAIVQS 1163 (1431)
T ss_pred ccCCC-----ceEEeecccccccc
Confidence 22332 25567777766666
No 245
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=88.92 E-value=0.35 Score=56.56 Aligned_cols=54 Identities=20% Similarity=0.006 Sum_probs=45.5
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
.||.+..+-+|-+..|.++ -..||-..|+|++||||+++|+++|.++||.|--+
T Consensus 127 kagD~~~~di~s~~~~~~~-~~lGhvSml~dVavS~D~~~IitaDRDEkIRvs~y 180 (390)
T KOG3914|consen 127 KAGDVYSFDILSADSGRCE-PILGHVSMLLDVAVSPDDQFIITADRDEKIRVSRY 180 (390)
T ss_pred ecCCceeeeeecccccCcc-hhhhhhhhhheeeecCCCCEEEEecCCceEEEEec
Confidence 4678888888888886654 34467788999999999999999999999999864
No 246
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=88.88 E-value=0.99 Score=53.04 Aligned_cols=58 Identities=16% Similarity=0.124 Sum_probs=53.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEee--cCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE--ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~--~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
.||+.||+||+-||++-+.|-+|. ..+++||-+.-+|--..||++..+|.++||+.--
T Consensus 121 ~SG~~~~~VI~HDiEt~qsi~V~~~~~~~~~VY~m~~~P~DN~~~~~t~~~~V~~~D~Rd 180 (609)
T KOG4227|consen 121 YSGERWGTVIKHDIETKQSIYVANENNNRGDVYHMDQHPTDNTLIVVTRAKLVSFIDNRD 180 (609)
T ss_pred ecCCCcceeEeeecccceeeeeecccCcccceeecccCCCCceEEEEecCceEEEEeccC
Confidence 489999999999999999999986 3579999999999999999999999999999744
No 247
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=88.86 E-value=0.72 Score=55.99 Aligned_cols=58 Identities=17% Similarity=0.253 Sum_probs=50.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCc-ceeeeeeeCCC--CCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISR-FRLVDGKFSPD--GASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~-~~i~D~~FSPD--Gt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+|||.|-.|+|||...-+++....+|| .-||..||=|- -..++++-.+-.|.+|++-.
T Consensus 66 ~SGSDD~r~ivWd~~~~KllhsI~TgHtaNIFsvKFvP~tnnriv~sgAgDk~i~lfdl~~ 126 (758)
T KOG1310|consen 66 ASGSDDTRLIVWDPFEYKLLHSISTGHTANIFSVKFVPYTNNRIVLSGAGDKLIKLFDLDS 126 (758)
T ss_pred eecCCcceEEeecchhcceeeeeecccccceeEEeeeccCCCeEEEeccCcceEEEEeccc
Confidence 589999999999999999999888665 56999999995 45677888888999999975
No 248
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=88.75 E-value=0.38 Score=54.33 Aligned_cols=48 Identities=17% Similarity=0.182 Sum_probs=34.7
Q ss_pred ceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCc
Q 001090 28 FRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYR 76 (1159)
Q Consensus 28 ~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYr 76 (1159)
-+..-++|||||+.||.+++.|+|.+|++=....+--.| .-=|.+|+.
T Consensus 44 PQWRkl~WSpD~tlLa~a~S~G~i~vfdl~g~~lf~I~p-~~~~~~d~~ 91 (282)
T PF15492_consen 44 PQWRKLAWSPDCTLLAYAESTGTIRVFDLMGSELFVIPP-AMSFPGDLS 91 (282)
T ss_pred chheEEEECCCCcEEEEEcCCCeEEEEecccceeEEcCc-ccccCCccc
Confidence 356789999999999999999999999985444442222 222455553
No 249
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=88.68 E-value=0.35 Score=59.23 Aligned_cols=58 Identities=22% Similarity=0.426 Sum_probs=44.4
Q ss_pred CcccCCcEEEEEccCCe--eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 2 SAGYDGKTIVWDIWEGI--PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~--~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
++.-||+|-|||+..-. ++.+-..+.-.+.-..||+.|+.||+||..|+++|+.++..
T Consensus 459 ~~d~~G~l~iWDLl~~~~~Pv~s~~~~~~~l~~~~~s~~g~~lavGd~~G~~~~~~l~~~ 518 (555)
T KOG1587|consen 459 TVDGDGNLDIWDLLQDDEEPVLSQKVCSPALTRVRWSPNGKLLAVGDANGTTHILKLSES 518 (555)
T ss_pred EEcCCCceehhhhhccccCCcccccccccccceeecCCCCcEEEEecCCCcEEEEEcCch
Confidence 34569999999997665 44333334445666678888999999999999999999754
No 250
>PRK01742 tolB translocation protein TolB; Provisional
Probab=88.66 E-value=0.56 Score=54.69 Aligned_cols=52 Identities=25% Similarity=0.302 Sum_probs=36.6
Q ss_pred ccCCcEEE--EEccCCeeeEEeecCcceeeeeeeCCCCCEEE-EecCceeEEEEec
Q 001090 4 GYDGKTIV--WDIWEGIPIRIYEISRFRLVDGKFSPDGASII-LSDDVGQLYILNT 56 (1159)
Q Consensus 4 G~DG~I~l--WDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a-~tD~~G~l~Ifg~ 56 (1159)
+.||.+.| ||+.+|.. +.+..+...+...+|||||..|+ +++..|...||.+
T Consensus 267 ~~~g~~~Iy~~d~~~~~~-~~lt~~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~ 321 (429)
T PRK01742 267 SKDGVLNIYVMGANGGTP-SQLTSGAGNNTEPSWSPDGQSILFTSDRSGSPQVYRM 321 (429)
T ss_pred ecCCcEEEEEEECCCCCe-EeeccCCCCcCCEEECCCCCEEEEEECCCCCceEEEE
Confidence 46776554 57777765 44566777788999999999887 4566776666554
No 251
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=88.49 E-value=0.4 Score=59.21 Aligned_cols=52 Identities=13% Similarity=0.196 Sum_probs=25.3
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
|||+|.|+++|=+ |++.-+..+|..+||.+++=|++ .+++|..+-.|.||-=
T Consensus 117 SgSWD~TakvW~~--~~l~~~l~gH~asVWAv~~l~e~-~~vTgsaDKtIklWk~ 168 (745)
T KOG0301|consen 117 SGSWDSTAKVWRI--GELVYSLQGHTASVWAVASLPEN-TYVTGSADKTIKLWKG 168 (745)
T ss_pred ecccccceEEecc--hhhhcccCCcchheeeeeecCCC-cEEeccCcceeeeccC
Confidence 4555555555554 33444444555555555555555 4444444445555443
No 252
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=88.01 E-value=0.56 Score=53.76 Aligned_cols=54 Identities=11% Similarity=0.114 Sum_probs=46.0
Q ss_pred CCcccCCcEEEEEccCC--------------------eeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 1 MSAGYDGKTIVWDIWEG--------------------IPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G--------------------~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
+.|++|| |+||.|..- +.|..|.+|+|.|+-+.|-=-|+.|+++.++|.+.+|-
T Consensus 243 AvA~kDg-v~I~~v~~~~s~i~~ee~~~~~~~~~l~v~~vs~~~~H~~~VWrv~wNmtGtiLsStGdDG~VRLWk 316 (361)
T KOG2445|consen 243 AVATKDG-VRIFKVKVARSAIEEEEVLAPDLMTDLPVEKVSELDDHNGEVWRVRWNMTGTILSSTGDDGCVRLWK 316 (361)
T ss_pred EEeecCc-EEEEEEeeccchhhhhcccCCCCccccceEEeeeccCCCCceEEEEEeeeeeEEeecCCCceeeehh
Confidence 4689999 999999831 13334678999999999999999999999999999984
No 253
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=87.87 E-value=1.2 Score=51.44 Aligned_cols=60 Identities=23% Similarity=0.465 Sum_probs=45.7
Q ss_pred CCcccCCcEEEEEccCCeeeEEeec---------C--------------cceeee-----------------------ee
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEI---------S--------------RFRLVD-----------------------GK 34 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~---------g--------------~~~i~D-----------------------~~ 34 (1159)
||||.|..|++||+..|.+++.+.- | .-.|.+ +-
T Consensus 81 ltsS~D~si~lwDl~~gs~l~rirf~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~d~dln~sas~~~ 160 (405)
T KOG1273|consen 81 LTSSRDWSIKLWDLLKGSPLKRIRFDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSDPKHSVLPKDDDGDLNSSASHGV 160 (405)
T ss_pred eeecCCceeEEEeccCCCceeEEEccCccceeeeccccCCeEEEEEecCCcEEEEecCCceeeccCCCcccccccccccc
Confidence 6899999999999999998885420 0 001111 23
Q ss_pred eCCCCCEEEEecCceeEEEEecCCCC
Q 001090 35 FSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 35 FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|-+-|.+|++|.+.|.|++++..+-+
T Consensus 161 fdr~g~yIitGtsKGkllv~~a~t~e 186 (405)
T KOG1273|consen 161 FDRRGKYIITGTSKGKLLVYDAETLE 186 (405)
T ss_pred ccCCCCEEEEecCcceEEEEecchhe
Confidence 77889999999999999999987754
No 254
>PRK04792 tolB translocation protein TolB; Provisional
Probab=87.86 E-value=1.1 Score=53.05 Aligned_cols=52 Identities=21% Similarity=0.351 Sum_probs=37.7
Q ss_pred cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEe-cCce--eEEEEecCCCC
Q 001090 8 KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILS-DDVG--QLYILNTGQGE 60 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~t-D~~G--~l~Ifg~G~g~ 60 (1159)
+|.+||+.+|+..++. ...+.....+|||||++||.+ +..| +|+++++..+.
T Consensus 243 ~L~~~dl~tg~~~~lt-~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~ 297 (448)
T PRK04792 243 EIFVQDIYTQVREKVT-SFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKA 297 (448)
T ss_pred EEEEEECCCCCeEEec-CCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCC
Confidence 6899999999864432 223344578999999999864 5555 58888987765
No 255
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=87.75 E-value=0.88 Score=38.61 Aligned_cols=31 Identities=23% Similarity=0.321 Sum_probs=27.8
Q ss_pred ceeeeeeeCCCCC---EEEEecCceeEEEEecCC
Q 001090 28 FRLVDGKFSPDGA---SIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 28 ~~i~D~~FSPDGt---~~a~tD~~G~l~Ifg~G~ 58 (1159)
|+|..++|||+.. .||.+-..|+++|+|+-+
T Consensus 1 GAvR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~ 34 (43)
T PF10313_consen 1 GAVRCCKFSPEPGGNDLLAWAEHQGRVHIVDTRS 34 (43)
T ss_pred CCeEEEEeCCCCCcccEEEEEccCCeEEEEEccc
Confidence 6899999999666 999999999999999864
No 256
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=87.58 E-value=0.94 Score=57.52 Aligned_cols=59 Identities=12% Similarity=0.181 Sum_probs=56.0
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
-||.|=.|+|-++..+..++++.+|.++|..+.|.|.|..||+++-+|+|.||++-.+.
T Consensus 113 agsdD~~vK~~~~~D~s~~~~lrgh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~ 171 (933)
T KOG1274|consen 113 AGSDDTAVKLLNLDDSSQEKVLRGHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGI 171 (933)
T ss_pred eecCceeEEEEeccccchheeecccCCceeeeeEcCCCCEEEEEecCceEEEEEcccch
Confidence 47889999999999999999999999999999999999999999999999999998765
No 257
>smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=87.31 E-value=1.4 Score=37.29 Aligned_cols=52 Identities=23% Similarity=0.533 Sum_probs=41.3
Q ss_pred CCCCCCeEEEeeccCCCCCCcEEEEEEEEecCCCCCCCCCCcceeEEEEcCCCCcccccCCCccccC
Q 001090 967 NWTCRDKCKVWWKNESDEDGSWWDGRVLSVKPKSSEFPDSPWERYTVQYKTEPTETHLHSPWELFDS 1033 (1159)
Q Consensus 967 nWsvGDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp~wPdSpWe~L~VrWDn~~~E~eRVSPWEIEpi 1033 (1159)
.|.+|+.|.+.| . ++.||.|+|+++.+ + ..+.|.+.+-| ..+.|.+.+|.++
T Consensus 2 ~~~~G~~~~a~~-~----d~~wyra~I~~~~~------~---~~~~V~f~D~G-~~~~v~~~~l~~l 53 (57)
T smart00333 2 TFKVGDKVAARW-E----DGEWYRARIIKVDG------E---QLYEVFFIDYG-NEEVVPPSDLRPL 53 (57)
T ss_pred CCCCCCEEEEEe-C----CCCEEEEEEEEECC------C---CEEEEEEECCC-ccEEEeHHHeecC
Confidence 689999999998 3 57999999999985 1 67899998743 5567777777654
No 258
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=87.29 E-value=1.9 Score=50.29 Aligned_cols=88 Identities=11% Similarity=0.040 Sum_probs=64.8
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecC----------ceeEEEEecCCCCCccCCc----cc----
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDD----------VGQLYILNTGQGESQKDAK----YD---- 68 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~----------~G~l~Ifg~G~g~~~~~~p----~E---- 68 (1159)
|+|.+-|..+++.+..+..|..+-. + +||||+.|.++.. ++.|.+||+.+.+.-...+ ++
T Consensus 27 ~~v~ViD~~~~~v~g~i~~G~~P~~-~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~ 104 (352)
T TIGR02658 27 TQVYTIDGEAGRVLGMTDGGFLPNP-V-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIELPEGPRFLVG 104 (352)
T ss_pred ceEEEEECCCCEEEEEEEccCCCce-e-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEccCCCchhhcc
Confidence 8999999999999999997755544 4 9999999886655 8999999999877443322 13
Q ss_pred ----cc-cccCCccce-ecCC---C-CccccccCcCCc
Q 001090 69 ----QF-FLGDYRPLV-QDTY---G-NVLDQETQLAPH 96 (1159)
Q Consensus 69 ----QF-F~tDYrpLi-rD~~---g-~vlDe~Tq~~PH 96 (1159)
+| |+.|.+.|+ .+.. . -|||.++...--
T Consensus 105 ~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ 142 (352)
T TIGR02658 105 TYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAFVR 142 (352)
T ss_pred CccceEEECCCCCEEEEecCCCCCEEEEEECCCCcEEE
Confidence 54 888888776 3422 1 288887765433
No 259
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=87.28 E-value=0.63 Score=59.51 Aligned_cols=56 Identities=16% Similarity=0.066 Sum_probs=50.8
Q ss_pred CCcccCCcEEEEEccCCeeeE-EeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIR-IYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir-~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+|.|.|.+|+||+|++++.+. +-.+|.+.||+++|.|. .|+++..+=+..+|+.-.
T Consensus 191 ~s~SdDRsiRlW~i~s~~~~~~~~fgHsaRvw~~~~~~n--~i~t~gedctcrvW~~~~ 247 (967)
T KOG0974|consen 191 ASVSDDRSIRLWPIDSREVLGCTGFGHSARVWACCFLPN--RIITVGEDCTCRVWGVNG 247 (967)
T ss_pred EEEecCcceeeeecccccccCcccccccceeEEEEeccc--eeEEeccceEEEEEeccc
Confidence 478999999999999999888 77789999999999999 999999999999996543
No 260
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=87.27 E-value=1.2 Score=51.55 Aligned_cols=137 Identities=23% Similarity=0.226 Sum_probs=83.8
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecC-ceeEEEEecCCCCC-c-cCCc---cccccccCCc
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDD-VGQLYILNTGQGES-Q-KDAK---YDQFFLGDYR 76 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~-~G~l~Ifg~G~g~~-~-~~~p---~EQFF~tDYr 76 (1159)
+.-|+.|.+|+|..-+---+.+-|+..+...+|||||++|..+.+ +-+|.+|.+-+.+- + +..+ +---||.|.+
T Consensus 67 ~yk~~~vqvwsl~Qpew~ckIdeg~agls~~~WSPdgrhiL~tseF~lriTVWSL~t~~~~~~~~pK~~~kg~~f~~dg~ 146 (447)
T KOG4497|consen 67 AYKDPKVQVWSLVQPEWYCKIDEGQAGLSSISWSPDGRHILLTSEFDLRITVWSLNTQKGYLLPHPKTNVKGYAFHPDGQ 146 (447)
T ss_pred eeccceEEEEEeecceeEEEeccCCCcceeeeECCCcceEeeeecceeEEEEEEeccceeEEecccccCceeEEECCCCc
Confidence 345889999999999988888899999999999999999886665 56899999876542 2 2221 1125777766
Q ss_pred c--c--eecCCCCcc---ccccCcCCcccCCCCCcccCCCCCCCch------------hHHhhhhhhhcccCccccCCcc
Q 001090 77 P--L--VQDTYGNVL---DQETQLAPHRRNLQDPLCDSAMIPYPEP------------YQTMYQQRRLGALGIEWRPSSL 137 (1159)
Q Consensus 77 p--L--irD~~g~vl---De~Tq~~PHL~~~~~~L~D~~~~PYp~~------------~Q~~~~~rrl~~l~~e~~p~~~ 137 (1159)
- | .||.--||- =..=++.-|..- .-.|..|+-..|. |--+.=.|-||-+-.||+|..-
T Consensus 147 f~ai~sRrDCkdyv~i~~c~~W~ll~~f~~---dT~DltgieWsPdg~~laVwd~~Leykv~aYe~~lG~k~v~wsP~~q 223 (447)
T KOG4497|consen 147 FCAILSRRDCKDYVQISSCKAWILLKEFKL---DTIDLTGIEWSPDGNWLAVWDNVLEYKVYAYERGLGLKFVEWSPCNQ 223 (447)
T ss_pred eeeeeecccHHHHHHHHhhHHHHHHHhcCC---CcccccCceECCCCcEEEEecchhhheeeeeeeccceeEEEeccccc
Confidence 2 2 124433320 011111222210 1234444444432 2211225567777788888877
Q ss_pred ccccC
Q 001090 138 KLAVG 142 (1159)
Q Consensus 138 ~~~vg 142 (1159)
-+|||
T Consensus 224 flavG 228 (447)
T KOG4497|consen 224 FLAVG 228 (447)
T ss_pred eEEee
Confidence 77777
No 261
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=86.68 E-value=1.4 Score=34.83 Aligned_cols=33 Identities=27% Similarity=0.189 Sum_probs=22.3
Q ss_pred EeecCcceeeeeeeCCCCCEEE-EecCc--eeEEEE
Q 001090 22 IYEISRFRLVDGKFSPDGASII-LSDDV--GQLYIL 54 (1159)
Q Consensus 22 ~f~~g~~~i~D~~FSPDGt~~a-~tD~~--G~l~If 54 (1159)
....+.+.-...+|||||++|+ +++.. |..-||
T Consensus 3 ~~t~~~~~~~~p~~SpDGk~i~f~s~~~~~g~~diy 38 (39)
T PF07676_consen 3 QLTNSPGDDGSPAWSPDGKYIYFTSNRNDRGSFDIY 38 (39)
T ss_dssp EES-SSSSEEEEEE-TTSSEEEEEEECT--SSEEEE
T ss_pred CcccCCccccCEEEecCCCEEEEEecCCCCCCcCEE
Confidence 3445666788899999999999 55555 665554
No 262
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=86.62 E-value=1.7 Score=37.74 Aligned_cols=38 Identities=32% Similarity=0.626 Sum_probs=31.7
Q ss_pred CCCCCCeEEEeeccCCCCCCcEEEEEEEEecCCCCCCCCCCcceeEEEEcC
Q 001090 967 NWTCRDKCKVWWKNESDEDGSWWDGRVLSVKPKSSEFPDSPWERYTVQYKT 1017 (1159)
Q Consensus 967 nWsvGDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp~wPdSpWe~L~VrWDn 1017 (1159)
.|++|+.+.++|.+ ++.||.|+|+.+.. -.+|.|.++.
T Consensus 2 ~~~~G~~Ve~~~~~----~~~W~~a~V~~~~~---------~~~~~V~~~~ 39 (61)
T smart00743 2 DFKKGDRVEVFSKE----EDSWWEAVVTKVLG---------DGKYLVRYLT 39 (61)
T ss_pred CcCCCCEEEEEECC----CCEEEEEEEEEECC---------CCEEEEEECC
Confidence 58999999999964 68999999999874 2358899977
No 263
>PRK04043 tolB translocation protein TolB; Provisional
Probab=86.53 E-value=2.4 Score=50.09 Aligned_cols=74 Identities=19% Similarity=0.283 Sum_probs=49.3
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEe---cCceeEEEEecCCCCCccCC--cc---ccccccCCcc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILS---DDVGQLYILNTGQGESQKDA--KY---DQFFLGDYRP 77 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~t---D~~G~l~Ifg~G~g~~~~~~--p~---EQFF~tDYrp 77 (1159)
+..|.++|+.+|+.-++.. ..+.+....|||||+.||.+ +..++|+++++..+....-+ +. +--|+.|.+-
T Consensus 212 ~~~Iyv~dl~tg~~~~lt~-~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~~d~~p~~SPDG~~ 290 (419)
T PRK04043 212 KPTLYKYNLYTGKKEKIAS-SQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITNYPGIDVNGNFVEDDKR 290 (419)
T ss_pred CCEEEEEECCCCcEEEEec-CCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCcEEEcccCCCccCccEECCCCCE
Confidence 4579999999998766654 45555667899999999855 33468999998666422212 21 1246777765
Q ss_pred cee
Q 001090 78 LVQ 80 (1159)
Q Consensus 78 Lir 80 (1159)
|+.
T Consensus 291 I~F 293 (419)
T PRK04043 291 IVF 293 (419)
T ss_pred EEE
Confidence 554
No 264
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=86.38 E-value=0.62 Score=60.04 Aligned_cols=60 Identities=17% Similarity=0.254 Sum_probs=54.3
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCC-EEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGA-SIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt-~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
||+|-|++|++|+..||+.|-.|-.+...+||++|.|--= .||++.-+|+|.||.+-.+.
T Consensus 270 lSsgkD~~ii~wN~~tgEvl~~~p~~~nW~fdv~w~pr~P~~~A~asfdgkI~I~sl~~~~ 330 (1049)
T KOG0307|consen 270 LSSGKDNRIICWNPNTGEVLGELPAQGNWCFDVQWCPRNPSVMAAASFDGKISIYSLQGTD 330 (1049)
T ss_pred hcccCCCCeeEecCCCceEeeecCCCCcceeeeeecCCCcchhhhheeccceeeeeeecCC
Confidence 6899999999999999999998888888999999999766 88999999999999985543
No 265
>PRK04792 tolB translocation protein TolB; Provisional
Probab=85.43 E-value=1.4 Score=51.99 Aligned_cols=56 Identities=32% Similarity=0.451 Sum_probs=39.5
Q ss_pred ccCCc--EEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec-Cce--eEEEEecCCCC
Q 001090 4 GYDGK--TIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD-DVG--QLYILNTGQGE 60 (1159)
Q Consensus 4 G~DG~--I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~~G--~l~Ifg~G~g~ 60 (1159)
+.+|+ |.+||+.+|+..++ ..+.......+|||||..|+.+. ..| +|+++++..++
T Consensus 281 ~~~g~~~Iy~~dl~tg~~~~l-t~~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~ 341 (448)
T PRK04792 281 SKDGQPEIYVVDIATKALTRI-TRHRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGK 341 (448)
T ss_pred eCCCCeEEEEEECCCCCeEEC-ccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 44554 88889998876544 44445667889999999998653 333 68888876654
No 266
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=85.27 E-value=0.52 Score=56.46 Aligned_cols=56 Identities=27% Similarity=0.433 Sum_probs=47.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcc-eeeeeeeCC--CCCEEEEecCceeEEEEec
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRF-RLVDGKFSP--DGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~-~i~D~~FSP--DGt~~a~tD~~G~l~Ifg~ 56 (1159)
+|+|.|.+|+|||+.+|.+.-.|..||+ -||-.+|-| +-+.||....+|++.+--+
T Consensus 158 ~SgSDD~~vv~WdW~~~~~~l~f~SGH~~NvfQaKFiP~s~d~ti~~~s~dgqvr~s~i 216 (559)
T KOG1334|consen 158 ASGSDDLQVVVWDWVSGSPKLSFESGHCNNVFQAKFIPFSGDRTIVTSSRDGQVRVSEI 216 (559)
T ss_pred eccCccceEEeehhhccCcccccccccccchhhhhccCCCCCcCceeccccCceeeeee
Confidence 5899999999999999999999986655 699999988 4578888888888876544
No 267
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=85.07 E-value=1.5 Score=50.98 Aligned_cols=105 Identities=12% Similarity=0.069 Sum_probs=72.8
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCC--CCEEEEecCceeEEEEecCCCCCccCCccccccccCCccceecC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPD--GASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQDT 82 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPD--Gt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLirD~ 82 (1159)
.-|.|+|+|..+|..+..|.+.+..+-.++|+-+ +..+.++..+|.|.+||+-+.....+..+.|.=.+-+--|-+.+
T Consensus 48 Sngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~ds~h~v~s~ssDG~Vr~wD~Rs~~e~a~~~~~~~~~~~f~~ld~nc 127 (376)
T KOG1188|consen 48 SNGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCDSPHGVISCSSDGTVRLWDIRSQAESARISWTQQSGTPFICLDLNC 127 (376)
T ss_pred cCCeEEEEeccchhhhheecCCCCcccceEEecCCCCCeeEEeccCCeEEEEEeecchhhhheeccCCCCCcceEeeccC
Confidence 4589999999999999999999999999999884 66677999999999999977654444444443222222223334
Q ss_pred CCCccccccCcCCcccCCCCCcccCCCCC
Q 001090 83 YGNVLDQETQLAPHRRNLQDPLCDSAMIP 111 (1159)
Q Consensus 83 ~g~vlDe~Tq~~PHL~~~~~~L~D~~~~P 111 (1159)
+++++=..|.+---.. .=.|-|+.+.-
T Consensus 128 k~~ii~~GtE~~~s~A--~v~lwDvR~~q 154 (376)
T KOG1188|consen 128 KKNIIACGTELTRSDA--SVVLWDVRSEQ 154 (376)
T ss_pred cCCeEEeccccccCce--EEEEEEecccc
Confidence 6777766666554331 12566775543
No 268
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=84.87 E-value=1.3 Score=53.93 Aligned_cols=51 Identities=22% Similarity=0.315 Sum_probs=41.2
Q ss_pred CcEEEEEccCCeeeEEeec--Ccce-eeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 7 GKTIVWDIWEGIPIRIYEI--SRFR-LVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~--g~~~-i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
.+||||||.+|...|.|.. ++.. ..=++||-|+.+||.-.. -.|+||.+-+
T Consensus 282 ~~l~IWDI~tG~lkrsF~~~~~~~~~WP~frWS~DdKy~Arm~~-~sisIyEtps 335 (698)
T KOG2314|consen 282 QQLIIWDIATGLLKRSFPVIKSPYLKWPIFRWSHDDKYFARMTG-NSISIYETPS 335 (698)
T ss_pred ceEEEEEccccchhcceeccCCCccccceEEeccCCceeEEecc-ceEEEEecCc
Confidence 5799999999999999975 5444 344689999999997766 5689998655
No 269
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=84.78 E-value=0.74 Score=52.92 Aligned_cols=69 Identities=16% Similarity=0.189 Sum_probs=56.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEe--ecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccC-Ccccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIY--EISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKD-AKYDQ 69 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f--~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~-~p~EQ 69 (1159)
+||+-|++|.|+|+..-..-|-| .+..-.|..+.|.|-|-+|++|.+|-.+.+|++.+...|-. .|.+|
T Consensus 188 iS~srD~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHPsGefllvgTdHp~~rlYdv~T~QcfvsanPd~q 259 (430)
T KOG0640|consen 188 ISGSRDNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHPSGEFLLVGTDHPTLRLYDVNTYQCFVSANPDDQ 259 (430)
T ss_pred EeccCCCeEEEEecccHHHHHHHHHhhccceeeeEeecCCCceEEEecCCCceeEEeccceeEeeecCcccc
Confidence 58999999999999887655544 36677899999999999999999999999999988765533 34444
No 270
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=84.71 E-value=1.3 Score=56.94 Aligned_cols=60 Identities=20% Similarity=0.100 Sum_probs=51.7
Q ss_pred CcccCCcEEEEEccCCeeeEE-eecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 2 SAGYDGKTIVWDIWEGIPIRI-YEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~-f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
+|+-=|.||+|+.. +.-..+ ..||.|+||++.||-||+++|+..++=.+.||.+++++..
T Consensus 150 ~gsv~~~iivW~~~-~dn~p~~l~GHeG~iF~i~~s~dg~~i~s~SdDRsiRlW~i~s~~~~ 210 (967)
T KOG0974|consen 150 SGSVFGEIIVWKPH-EDNKPIRLKGHEGSIFSIVTSLDGRYIASVSDDRSIRLWPIDSREVL 210 (967)
T ss_pred eccccccEEEEecc-ccCCcceecccCCceEEEEEccCCcEEEEEecCcceeeeeccccccc
Confidence 56667899999998 555555 5688999999999999999999999999999999988643
No 271
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=84.59 E-value=1.1 Score=52.53 Aligned_cols=59 Identities=15% Similarity=0.257 Sum_probs=54.0
Q ss_pred CcccCCcEEEEEccCCe-eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGI-PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
++..=++|+|+|...|. ++..|.-...+|-....-|+|++|.+||..|+|.+||+-.+.
T Consensus 221 t~T~~hqvR~YDt~~qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~~g~l~~FD~r~~k 280 (412)
T KOG3881|consen 221 TITRYHQVRLYDTRHQRRPVAQFDFLENPISSTGLTPSGNFIYTGNTKGQLAKFDLRGGK 280 (412)
T ss_pred EEecceeEEEecCcccCcceeEeccccCcceeeeecCCCcEEEEecccchhheecccCce
Confidence 55667899999999998 999999889999999999999999999999999999987665
No 272
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=84.49 E-value=2.7 Score=50.75 Aligned_cols=57 Identities=25% Similarity=0.369 Sum_probs=50.8
Q ss_pred cccCC-cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 3 AGYDG-KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 3 AG~DG-~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|-+|| .|-|+|..+|+. +.++..-+-|+.++.||||..++++.+.+.|+++++..|.
T Consensus 377 gt~dgD~l~iyd~~~~e~-kr~e~~lg~I~av~vs~dGK~~vvaNdr~el~vididngn 434 (668)
T COG4946 377 GTNDGDKLGIYDKDGGEV-KRIEKDLGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGN 434 (668)
T ss_pred eccCCceEEEEecCCceE-EEeeCCccceEEEEEcCCCcEEEEEcCceEEEEEEecCCC
Confidence 55788 899999988874 5678888999999999999999999999999999998876
No 273
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=83.93 E-value=1.1 Score=54.36 Aligned_cols=57 Identities=16% Similarity=0.218 Sum_probs=49.9
Q ss_pred CCcccCCcEEEEEccC------C--eeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecC
Q 001090 1 MSAGYDGKTIVWDIWE------G--IPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~------G--~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G 57 (1159)
++|+-||+|.||.++. + ++|-.|.+|.++|+.++.++.|.++-++.-+|.|..|.+-
T Consensus 310 it~sed~~lk~WnLqk~~~s~~~~~epi~tfraH~gPVl~v~v~~n~~~~ysgg~Dg~I~~w~~p 374 (577)
T KOG0642|consen 310 ITASEDGTLKLWNLQKAKKSAEKDVEPILTFRAHEGPVLCVVVPSNGEHCYSGGIDGTIRCWNLP 374 (577)
T ss_pred EEeccccchhhhhhcccCCccccceeeeEEEecccCceEEEEecCCceEEEeeccCceeeeeccC
Confidence 4789999999999932 2 2566788999999999999999999999999999999765
No 274
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=83.77 E-value=1.2 Score=54.09 Aligned_cols=52 Identities=12% Similarity=-0.021 Sum_probs=46.3
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCC-----CCEEEEecCceeEEEEec
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPD-----GASIILSDDVGQLYILNT 56 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPD-----Gt~~a~tD~~G~l~Ifg~ 56 (1159)
.-++|.+|||++++.++.|.||.+.|..+.|--+ |.+|.+++.++....+=+
T Consensus 162 as~~ik~~~~~~kevv~~ftgh~s~v~t~~f~~~~~g~~G~~vLssa~~~r~i~~w~ 218 (541)
T KOG4547|consen 162 ASRQIKVLDIETKEVVITFTGHGSPVRTLSFTTLIDGIIGKYVLSSAAAERGITVWV 218 (541)
T ss_pred ccceEEEEEccCceEEEEecCCCcceEEEEEEEeccccccceeeeccccccceeEEE
Confidence 4578999999999999999999999999999998 999999998887766544
No 275
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=83.58 E-value=0.68 Score=56.06 Aligned_cols=52 Identities=21% Similarity=0.396 Sum_probs=43.2
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
++.||++.|-. ..|+.-+....|.++|..+.|+|||+-|.++.++|.|.||.
T Consensus 81 ~s~DGkf~il~-k~~rVE~sv~AH~~A~~~gRW~~dGtgLlt~GEDG~iKiWS 132 (737)
T KOG1524|consen 81 CSNDGRFVILN-KSARVERSISAHAAAISSGRWSPDGAGLLTAGEDGVIKIWS 132 (737)
T ss_pred EcCCceEEEec-ccchhhhhhhhhhhhhhhcccCCCCceeeeecCCceEEEEe
Confidence 35677776654 35666666778999999999999999999999999999997
No 276
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=83.26 E-value=1.4 Score=54.63 Aligned_cols=90 Identities=14% Similarity=0.146 Sum_probs=63.0
Q ss_pred CCcccCCcEEEEEccCCeeeE-Ee---ecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC--CCccCC---c-cccc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIR-IY---EISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG--ESQKDA---K-YDQF 70 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir-~f---~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g--~~~~~~---p-~EQF 70 (1159)
||+|-|.++.||.+.++...- .| ..|.--|+||-|+||+.+||++..+-++.+|+.-.. ...++. + .+-+
T Consensus 588 LsvsRDRt~sl~~~~~~~~~e~~fa~~k~HtRIIWdcsW~pde~~FaTaSRDK~VkVW~~~~~~d~~i~~~a~~~~~~aV 667 (764)
T KOG1063|consen 588 LSVSRDRTVSLYEVQEDIKDEFRFACLKAHTRIIWDCSWSPDEKYFATASRDKKVKVWEEPDLRDKYISRFACLKFSLAV 667 (764)
T ss_pred EEeecCceEEeeeeecccchhhhhccccccceEEEEcccCcccceeEEecCCceEEEEeccCchhhhhhhhchhccCCce
Confidence 589999999999997776432 12 256778999999999999999999999999998655 222221 1 1113
Q ss_pred cccCCccceecCCCCccccc
Q 001090 71 FLGDYRPLVQDTYGNVLDQE 90 (1159)
Q Consensus 71 F~tDYrpLirD~~g~vlDe~ 90 (1159)
=.=+|.|+..-+.++||-..
T Consensus 668 TAv~~~~~~~~e~~~~vavG 687 (764)
T KOG1063|consen 668 TAVAYLPVDHNEKGDVVAVG 687 (764)
T ss_pred eeEEeeccccccccceEEEE
Confidence 34456666666666555433
No 277
>PRK01029 tolB translocation protein TolB; Provisional
Probab=83.19 E-value=2.3 Score=50.08 Aligned_cols=53 Identities=15% Similarity=0.228 Sum_probs=39.4
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec---CceeEEEEecCCCC
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD---DVGQLYILNTGQGE 60 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD---~~G~l~Ifg~G~g~ 60 (1159)
.+|.+||+.+|++.+.. .+...+.+..|||||++|+.+. ..+.|+++++..++
T Consensus 351 ~~I~v~dl~~g~~~~Lt-~~~~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~ 406 (428)
T PRK01029 351 RQICVYDLATGRDYQLT-TSPENKESPSWAIDSLHLVYSAGNSNESELYLISLITKK 406 (428)
T ss_pred cEEEEEECCCCCeEEcc-CCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 47999999999875544 3455678899999999998542 24678888886554
No 278
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=82.69 E-value=1.3 Score=51.39 Aligned_cols=59 Identities=20% Similarity=0.227 Sum_probs=52.8
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCC-CEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDG-ASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDG-t~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
-||+=|.|+|-|+.+|++.+.|.+|..+|-+++|-|+- ++|+++..+-.|.||++-...
T Consensus 110 ~~G~~GvIrVid~~~~~~~~~~~ghG~sINeik~~p~~~qlvls~SkD~svRlwnI~~~~ 169 (385)
T KOG1034|consen 110 AGGYLGVIRVIDVVSGQCSKNYRGHGGSINEIKFHPDRPQLVLSASKDHSVRLWNIQTDV 169 (385)
T ss_pred eecceeEEEEEecchhhhccceeccCccchhhhcCCCCCcEEEEecCCceEEEEeccCCe
Confidence 47899999999999999999999999999999999998 566789999999999885543
No 279
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=82.51 E-value=0.89 Score=54.35 Aligned_cols=56 Identities=11% Similarity=0.065 Sum_probs=52.7
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
|..-|+|-||-=..-++|..+.-|.++|.++++-|+|+++|++.-+-++.|||+-.
T Consensus 269 GhsnGtVSlWSP~skePLvKiLcH~g~V~siAv~~~G~YMaTtG~Dr~~kIWDlR~ 324 (545)
T KOG1272|consen 269 GHSNGTVSLWSPNSKEPLVKILCHRGPVSSIAVDRGGRYMATTGLDRKVKIWDLRN 324 (545)
T ss_pred cCCCceEEecCCCCcchHHHHHhcCCCcceEEECCCCcEEeecccccceeEeeecc
Confidence 56789999999999999999999999999999999999999999999999999854
No 280
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=82.45 E-value=1.3 Score=50.54 Aligned_cols=53 Identities=30% Similarity=0.508 Sum_probs=42.6
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
..+.+.|+||.+|+...+... ...+.+.+|||||..||.+-. +-|+|..+..+
T Consensus 21 ~~~~y~i~d~~~~~~~~l~~~-~~~~~~~~~sP~g~~~~~v~~-~nly~~~~~~~ 73 (353)
T PF00930_consen 21 FKGDYYIYDIETGEITPLTPP-PPKLQDAKWSPDGKYIAFVRD-NNLYLRDLATG 73 (353)
T ss_dssp EEEEEEEEETTTTEEEESS-E-ETTBSEEEE-SSSTEEEEEET-TEEEEESSTTS
T ss_pred cceeEEEEecCCCceEECcCC-ccccccceeecCCCeeEEEec-CceEEEECCCC
Confidence 457889999999988776666 788999999999999998865 68999976554
No 281
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=82.34 E-value=1.7 Score=53.85 Aligned_cols=51 Identities=12% Similarity=0.154 Sum_probs=48.5
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
.|..+.|+|+..|..+..+-+|..-|+.+|||-||..||+|..+-.+.||.
T Consensus 31 Ag~rlliyD~ndG~llqtLKgHKDtVycVAys~dGkrFASG~aDK~VI~W~ 81 (1081)
T KOG1538|consen 31 AGSRLLVYDTSDGTLLQPLKGHKDTVYCVAYAKDGKRFASGSADKSVIIWT 81 (1081)
T ss_pred cCCEEEEEeCCCcccccccccccceEEEEEEccCCceeccCCCceeEEEec
Confidence 467899999999999999999999999999999999999999999999996
No 282
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=82.22 E-value=2.4 Score=50.19 Aligned_cols=58 Identities=16% Similarity=0.140 Sum_probs=53.1
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEE-EecCceeEEEEecCCCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASII-LSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a-~tD~~G~l~Ifg~G~g~ 60 (1159)
.|+-+.+|.|-||++-.++-.|..+ .++|+|+|--|-.+++ +||..|-++|||+-+-+
T Consensus 211 ~asl~nkiki~dlet~~~vssy~a~-~~~wSC~wDlde~h~IYaGl~nG~VlvyD~R~~~ 269 (463)
T KOG1645|consen 211 LASLGNKIKIMDLETSCVVSSYIAY-NQIWSCCWDLDERHVIYAGLQNGMVLVYDMRQPE 269 (463)
T ss_pred eeccCceEEEEecccceeeeheecc-CCceeeeeccCCcceeEEeccCceEEEEEccCCC
Confidence 5788999999999999999999988 8999999999998887 99999999999996644
No 283
>PRK04043 tolB translocation protein TolB; Provisional
Probab=82.15 E-value=2.6 Score=49.74 Aligned_cols=55 Identities=16% Similarity=0.244 Sum_probs=39.1
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecC-ce--eEEEEecCCCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDD-VG--QLYILNTGQGE 60 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~-~G--~l~Ifg~G~g~ 60 (1159)
.+..|.++|+.+|...+ +..+.+......|||||+.|+.+.+ .| +|+++++..+.
T Consensus 255 g~~~Iy~~dl~~g~~~~-LT~~~~~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~ 312 (419)
T PRK04043 255 GQPDIYLYDTNTKTLTQ-ITNYPGIDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNSGS 312 (419)
T ss_pred CCcEEEEEECCCCcEEE-cccCCCccCccEECCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 35678899999887544 4444444556789999999996543 34 68888876654
No 284
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=81.82 E-value=3.3 Score=47.18 Aligned_cols=51 Identities=16% Similarity=0.128 Sum_probs=40.3
Q ss_pred cceeeeeeeCCCCCEEEEecCceeEEEEecCCCC-CccCCccccccccCCcc
Q 001090 27 RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE-SQKDAKYDQFFLGDYRP 77 (1159)
Q Consensus 27 ~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~-~~~~~p~EQFF~tDYrp 77 (1159)
.-.|+-+..||||+.||+.+-.|.|++|++=+=. ..++..-||=+..+-.|
T Consensus 229 ~d~i~kmSlSPdg~~La~ih~sG~lsLW~iPsL~~~~~W~~~eqPg~~e~np 280 (282)
T PF15492_consen 229 QDGIFKMSLSPDGSLLACIHFSGSLSLWEIPSLRLQRSWKQDEQPGYNEINP 280 (282)
T ss_pred CCceEEEEECCCCCEEEEEEcCCeEEEEecCcchhhcccchhhCCCCccCCC
Confidence 5689999999999999999999999999987643 44566677744444443
No 285
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=81.34 E-value=2.7 Score=50.40 Aligned_cols=48 Identities=25% Similarity=0.281 Sum_probs=32.6
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
++.|++||+++|..|+.+.-. +|..+.||+||.++|....+ .++|+..
T Consensus 125 ~~~i~~yDw~~~~~i~~i~v~--~vk~V~Ws~~g~~val~t~~-~i~il~~ 172 (443)
T PF04053_consen 125 SDFICFYDWETGKLIRRIDVS--AVKYVIWSDDGELVALVTKD-SIYILKY 172 (443)
T ss_dssp TTEEEEE-TTT--EEEEESS---E-EEEEE-TTSSEEEEE-S--SEEEEEE
T ss_pred CCCEEEEEhhHcceeeEEecC--CCcEEEEECCCCEEEEEeCC-eEEEEEe
Confidence 457999999999999998855 39999999999999977544 5555544
No 286
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=81.23 E-value=2.1 Score=48.17 Aligned_cols=24 Identities=29% Similarity=0.480 Sum_probs=22.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEee
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE 24 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~ 24 (1159)
+||+.||+++|||..+|+++.++.
T Consensus 172 lsG~EDGtvRvWd~kt~k~v~~ie 195 (325)
T KOG0649|consen 172 LSGAEDGTVRVWDTKTQKHVSMIE 195 (325)
T ss_pred eecCCCccEEEEeccccceeEEec
Confidence 689999999999999999999886
No 287
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=80.52 E-value=3 Score=47.95 Aligned_cols=57 Identities=14% Similarity=0.126 Sum_probs=50.1
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEE-EEecCceeEEEEecCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASI-ILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~-a~tD~~G~l~Ifg~G~g 59 (1159)
|-.|-+|+|-||..|.+-.++.||+..|+.+-|||---.+ |++.-+|.+.+|++--+
T Consensus 164 gtr~~~VrLCDi~SGs~sH~LsGHr~~vlaV~Wsp~~e~vLatgsaDg~irlWDiRra 221 (397)
T KOG4283|consen 164 GTRDVQVRLCDIASGSFSHTLSGHRDGVLAVEWSPSSEWVLATGSADGAIRLWDIRRA 221 (397)
T ss_pred ecCCCcEEEEeccCCcceeeeccccCceEEEEeccCceeEEEecCCCceEEEEEeecc
Confidence 3456689999999999999999999999999999998875 57888999999998443
No 288
>KOG4328 consensus WD40 protein [Function unknown]
Probab=80.06 E-value=1.4 Score=52.55 Aligned_cols=60 Identities=20% Similarity=0.399 Sum_probs=52.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeec--CcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEI--SRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~--g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|+||||+|++=|+++++.--+|.. ....+.++.|+-+-..+..+|..|-+.+|++-.+.
T Consensus 251 ~ssSyDGtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G~f~~iD~R~~~ 312 (498)
T KOG4328|consen 251 YSSSYDGTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVGNFNVIDLRTDG 312 (498)
T ss_pred eeeccCceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEEeecccceEEEEeecCC
Confidence 4899999999999999986555554 68889999999999999999999999999996654
No 289
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=79.55 E-value=0.91 Score=53.26 Aligned_cols=24 Identities=25% Similarity=0.412 Sum_probs=22.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEee
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE 24 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~ 24 (1159)
||||-|++|++||+.+|++++.|.
T Consensus 209 lS~sGD~tlr~Wd~~sgk~L~t~d 232 (390)
T KOG3914|consen 209 LSGSGDKTLRLWDITSGKLLDTCD 232 (390)
T ss_pred eecCCCCcEEEEecccCCcccccc
Confidence 699999999999999999999876
No 290
>PRK01029 tolB translocation protein TolB; Provisional
Probab=78.85 E-value=5.7 Score=46.96 Aligned_cols=56 Identities=21% Similarity=0.306 Sum_probs=36.3
Q ss_pred cCCcEEEE--EccC-CeeeEEeecCcceeeeeeeCCCCCEEEEec-Cc--eeEEEEecCCCC
Q 001090 5 YDGKTIVW--DIWE-GIPIRIYEISRFRLVDGKFSPDGASIILSD-DV--GQLYILNTGQGE 60 (1159)
Q Consensus 5 ~DG~I~lW--DI~~-G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~~--G~l~Ifg~G~g~ 60 (1159)
.+|...|| ++.. |...+.+..+...+...+|||||+.||.+. .. .+|++|++..+.
T Consensus 301 ~~g~~~ly~~~~~~~g~~~~~lt~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~ 362 (428)
T PRK01029 301 KDGRPRIYIMQIDPEGQSPRLLTKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGR 362 (428)
T ss_pred CCCCceEEEEECcccccceEEeccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCC
Confidence 35544444 4432 333444554455677899999999999653 33 479999987775
No 291
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=78.75 E-value=4.7 Score=45.01 Aligned_cols=56 Identities=16% Similarity=0.208 Sum_probs=51.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
+||-.|..-.|+||..|.+|..|..|...|..+.|||--.++.++.-+-+|.+-++
T Consensus 247 ~sg~~dssc~lydirg~r~iq~f~phsadir~vrfsp~a~yllt~syd~~ikltdl 302 (350)
T KOG0641|consen 247 ASGHADSSCMLYDIRGGRMIQRFHPHSADIRCVRFSPGAHYLLTCSYDMKIKLTDL 302 (350)
T ss_pred eeccCCCceEEEEeeCCceeeeeCCCccceeEEEeCCCceEEEEecccceEEEeec
Confidence 36677899999999999999999999999999999999999999998888888876
No 292
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=78.72 E-value=4 Score=49.58 Aligned_cols=54 Identities=7% Similarity=0.124 Sum_probs=47.5
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
+||..+|+|.||+.-+-...+.-..|.+.||.+++-.||+.|- |--+-+|..||
T Consensus 261 iTgDS~G~i~Iw~~~~~~~~k~~~aH~ggv~~L~~lr~GtllS-GgKDRki~~Wd 314 (626)
T KOG2106|consen 261 ITGDSGGNILIWSKGTNRISKQVHAHDGGVFSLCMLRDGTLLS-GGKDRKIILWD 314 (626)
T ss_pred EeecCCceEEEEeCCCceEEeEeeecCCceEEEEEecCccEee-cCccceEEecc
Confidence 4788999999999966666665558999999999999999998 99999999999
No 293
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=78.06 E-value=2.7 Score=51.33 Aligned_cols=40 Identities=23% Similarity=0.466 Sum_probs=34.3
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecC
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDD 47 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~ 47 (1159)
-|.|-|||+.+-++|..|..-...+++ |||||.+|.++.-
T Consensus 335 ~G~mEvwDv~n~K~i~~~~a~~tt~~e--W~PdGe~flTATT 374 (566)
T KOG2315|consen 335 PGDMEVWDVPNRKLIAKFKAANTTVFE--WSPDGEYFLTATT 374 (566)
T ss_pred CCceEEEeccchhhccccccCCceEEE--EcCCCcEEEEEec
Confidence 488999999999999999877777765 9999999997754
No 294
>PF06003 SMN: Survival motor neuron protein (SMN); InterPro: IPR010304 This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.; GO: 0003723 RNA binding, 0006397 mRNA processing, 0005634 nucleus, 0005737 cytoplasm; PDB: 1MHN_A 4A4G_A 3S6N_M 4A4E_A 1G5V_A 4A4H_A 4A4F_A 2D9T_A.
Probab=77.80 E-value=2.4 Score=47.66 Aligned_cols=56 Identities=20% Similarity=0.391 Sum_probs=34.4
Q ss_pred ccCCCCCCeEEEeeccCCCCCCcEEEEEEEEecCCCCCCCCCCcceeEEEEcCCCCcccccCCCccccC
Q 001090 965 QRNWTCRDKCKVWWKNESDEDGSWWDGRVLSVKPKSSEFPDSPWERYTVQYKTEPTETHLHSPWELFDS 1033 (1159)
Q Consensus 965 ~rnWsvGDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp~wPdSpWe~L~VrWDn~~~E~eRVSPWEIEpi 1033 (1159)
...|+|||+|+..|. .+|.||.++|.+|..... .+.|+++.-+ ..+.|..=+|.+.
T Consensus 66 ~~~WkvGd~C~A~~s----~Dg~~Y~A~I~~i~~~~~--------~~~V~f~gYg-n~e~v~l~dL~~~ 121 (264)
T PF06003_consen 66 NKKWKVGDKCMAVYS----EDGQYYPATIESIDEEDG--------TCVVVFTGYG-NEEEVNLSDLKPS 121 (264)
T ss_dssp TT---TT-EEEEE-T----TTSSEEEEEEEEEETTTT--------EEEEEETTTT-EEEEEEGGGEEET
T ss_pred ccCCCCCCEEEEEEC----CCCCEEEEEEEEEcCCCC--------EEEEEEcccC-CeEeeehhhhccc
Confidence 458999999999993 467899999999985321 4558887753 3344444444443
No 295
>PF05641 Agenet: Agenet domain; InterPro: IPR008395 This domain is related to the TUDOR domain IPR008191 from INTERPRO []. The function of the agenet domain is unknown. This signature matches one of the two Agenet domains in the FMR proteins [].; GO: 0003723 RNA binding; PDB: 2BKD_N 3O8V_A 3KUF_A 3H8Z_A.
Probab=77.40 E-value=5.4 Score=35.93 Aligned_cols=41 Identities=22% Similarity=0.447 Sum_probs=27.6
Q ss_pred CCCCCeEEEeeccCCCCCCcEEEEEEEEecCCCCCCCCCCcceeEEEEcCC
Q 001090 968 WTCRDKCKVWWKNESDEDGSWWDGRVLSVKPKSSEFPDSPWERYTVQYKTE 1018 (1159)
Q Consensus 968 WsvGDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp~wPdSpWe~L~VrWDn~ 1018 (1159)
|++|+++.+.-..++ ..+.||.|+|+.....+ .|.|++++-
T Consensus 1 F~~G~~VEV~s~e~g-~~gaWf~a~V~~~~~~~---------~~~V~Y~~~ 41 (68)
T PF05641_consen 1 FKKGDEVEVSSDEDG-FRGAWFPATVLKENGDD---------KYLVEYDDL 41 (68)
T ss_dssp --TT-EEEEEE-SBT-T--EEEEEEEEEEETT----------EEEEEETT-
T ss_pred CCCCCEEEEEEcCCC-CCcEEEEEEEEEeCCCc---------EEEEEECCc
Confidence 578999999765544 58999999999987632 899999764
No 296
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=76.70 E-value=3.8 Score=52.03 Aligned_cols=54 Identities=19% Similarity=0.332 Sum_probs=49.0
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecC-cceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEIS-RFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g-~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
+|||+-|-+.+|-+++|. +.|..- .+.|.++.+||||.+.++.-.+-+|+++-.
T Consensus 267 lSGG~E~VLv~Wq~~T~~--kqfLPRLgs~I~~i~vS~ds~~~sl~~~DNqI~li~~ 321 (792)
T KOG1963|consen 267 LSGGREGVLVLWQLETGK--KQFLPRLGSPILHIVVSPDSDLYSLVLEDNQIHLIKA 321 (792)
T ss_pred eecccceEEEEEeecCCC--cccccccCCeeEEEEEcCCCCeEEEEecCceEEEEec
Confidence 699999999999999999 778754 678999999999999999999999999865
No 297
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=76.64 E-value=1.9 Score=52.18 Aligned_cols=56 Identities=18% Similarity=0.417 Sum_probs=47.7
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
|||-|.+|+-||+.+|..+..+.- ..+||++..+|.|-.||+|=..+++.|+-+-.
T Consensus 568 TGGlDntvRcWDlregrqlqqhdF-~SQIfSLg~cP~~dWlavGMens~vevlh~sk 623 (705)
T KOG0639|consen 568 TGGLDNTVRCWDLREGRQLQQHDF-SSQIFSLGYCPTGDWLAVGMENSNVEVLHTSK 623 (705)
T ss_pred cCCCccceeehhhhhhhhhhhhhh-hhhheecccCCCccceeeecccCcEEEEecCC
Confidence 799999999999999987643221 35799999999999999999999999998643
No 298
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=75.14 E-value=4.8 Score=51.11 Aligned_cols=60 Identities=20% Similarity=0.334 Sum_probs=50.2
Q ss_pred CCcccCCcEEEEEccC--Ce--eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDIWE--GI--PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~--G~--~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
++|--||+|.||-=.. +. -.+.|.=|+..|-+++||+||.+|-+|-.-|-|..|-+|+++
T Consensus 221 Aa~d~dGrI~vw~d~~~~~~~~t~t~lHWH~~~V~~L~fS~~G~~LlSGG~E~VLv~Wq~~T~~ 284 (792)
T KOG1963|consen 221 AAGDSDGRILVWRDFGSSDDSETCTLLHWHHDEVNSLSFSSDGAYLLSGGREGVLVLWQLETGK 284 (792)
T ss_pred EEeccCCcEEEEeccccccccccceEEEecccccceeEEecCCceEeecccceEEEEEeecCCC
Confidence 3567799999995433 43 445566789999999999999999999999999999999886
No 299
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=74.94 E-value=5.4 Score=49.58 Aligned_cols=67 Identities=21% Similarity=0.274 Sum_probs=45.3
Q ss_pred CCccc-CCcEEEEEccCCeeeEEeecC-------------cceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCc
Q 001090 1 MSAGY-DGKTIVWDIWEGIPIRIYEIS-------------RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAK 66 (1159)
Q Consensus 1 lSAG~-DG~I~lWDI~~G~~Ir~f~~g-------------~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p 66 (1159)
+|||. ||.|+||||.+-.+. |.++ ...+..+..-.-|++|-+.=.++.|++|++-+ +...|
T Consensus 233 aSaga~D~~iKVWDLRk~~~~--~r~ep~~~~~~~t~skrs~G~~nL~lDssGt~L~AsCtD~sIy~ynm~s---~s~sP 307 (720)
T KOG0321|consen 233 ASAGAADSTIKVWDLRKNYTA--YRQEPRGSDKYPTHSKRSVGQVNLILDSSGTYLFASCTDNSIYFYNMRS---LSISP 307 (720)
T ss_pred eeccCCCcceEEEeecccccc--cccCCCcccCccCcccceeeeEEEEecCCCCeEEEEecCCcEEEEeccc---cCcCc
Confidence 46777 999999999876654 4322 24555666656677776555599999999753 44455
Q ss_pred cccccc
Q 001090 67 YDQFFL 72 (1159)
Q Consensus 67 ~EQFF~ 72 (1159)
..-||+
T Consensus 308 ~~~~sg 313 (720)
T KOG0321|consen 308 VAEFSG 313 (720)
T ss_pred hhhccC
Confidence 555554
No 300
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=74.31 E-value=3.7 Score=47.86 Aligned_cols=55 Identities=15% Similarity=0.309 Sum_probs=47.3
Q ss_pred ccCCcEEEEEccCCee-eEEee-cCcceeeeeeeCCC-CCEEEEecCceeEEEEecCC
Q 001090 4 GYDGKTIVWDIWEGIP-IRIYE-ISRFRLVDGKFSPD-GASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 4 G~DG~I~lWDI~~G~~-Ir~f~-~g~~~i~D~~FSPD-Gt~~a~tD~~G~l~Ifg~G~ 58 (1159)
..|..|.||||..-.. |+.|. .|+-.|-.++|.|. =..|++|..+|-+.||++-.
T Consensus 140 ~s~A~v~lwDvR~~qq~l~~~~eSH~DDVT~lrFHP~~pnlLlSGSvDGLvnlfD~~~ 197 (376)
T KOG1188|consen 140 RSDASVVLWDVRSEQQLLRQLNESHNDDVTQLRFHPSDPNLLLSGSVDGLVNLFDTKK 197 (376)
T ss_pred cCceEEEEEEeccccchhhhhhhhccCcceeEEecCCCCCeEEeecccceEEeeecCC
Confidence 3577899999999987 88876 67889999999994 57888999999999999743
No 301
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=74.28 E-value=1.6 Score=49.49 Aligned_cols=56 Identities=14% Similarity=0.251 Sum_probs=48.7
Q ss_pred CcccCCcEEEEEccCCeeeEEe-ecCcceeeeeeeCC-CCCEEEEecCceeEEEEecC
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIY-EISRFRLVDGKFSP-DGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f-~~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G 57 (1159)
.|+.||.|-|||+..+..+..+ ..|...|+.+.|.| ++-+|-++.++|.|+.|+--
T Consensus 197 cgt~dg~~~l~d~rn~~~p~S~l~ahk~~i~eV~FHpk~p~~Lft~sedGslw~wdas 254 (319)
T KOG4714|consen 197 CGTDDGIVGLWDARNVAMPVSLLKAHKAEIWEVHFHPKNPEHLFTCSEDGSLWHWDAS 254 (319)
T ss_pred EecCCCeEEEEEcccccchHHHHHHhhhhhhheeccCCCchheeEecCCCcEEEEcCC
Confidence 5789999999999999755544 47889999999999 79999999999999999843
No 302
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=74.18 E-value=4.9 Score=46.22 Aligned_cols=56 Identities=23% Similarity=0.288 Sum_probs=46.7
Q ss_pred cCCcEEEEEccCCeeeEEeecCcc-eeeeeeeCCCCCEE-EEecCceeEEEEecCCCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRF-RLVDGKFSPDGASI-ILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~-~i~D~~FSPDGt~~-a~tD~~G~l~Ifg~G~g~ 60 (1159)
-|+++..||+.+-.++-.....|+ .|.|+-|-|.-+++ |+++++|.|.||++-.-+
T Consensus 191 ~d~tl~~~D~RT~~~~~sI~dAHgq~vrdlDfNpnkq~~lvt~gDdgyvriWD~R~tk 248 (370)
T KOG1007|consen 191 SDSTLQFWDLRTMKKNNSIEDAHGQRVRDLDFNPNKQHILVTCGDDGYVRIWDTRKTK 248 (370)
T ss_pred CCCcEEEEEccchhhhcchhhhhcceeeeccCCCCceEEEEEcCCCccEEEEeccCCC
Confidence 488999999998888877765554 59999999998875 588999999999987654
No 303
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=73.75 E-value=6.2 Score=45.48 Aligned_cols=59 Identities=15% Similarity=0.282 Sum_probs=48.1
Q ss_pred CCcccCCcEEEEEccCCe-eeEEeecCcceeeeeeeCCCCCEEE-EecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGI-PIRIYEISRFRLVDGKFSPDGASII-LSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSPDGt~~a-~tD~~G~l~Ifg~G~g 59 (1159)
+|+|.||.|+|||...-+ ++..+.+|.--||.+.|-|---.|+ ++.++-.+.++-.++-
T Consensus 231 vt~gDdgyvriWD~R~tk~pv~el~~HsHWvW~VRfn~~hdqLiLs~~SDs~V~Lsca~sv 291 (370)
T KOG1007|consen 231 VTCGDDGYVRIWDTRKTKFPVQELPGHSHWVWAVRFNPEHDQLILSGGSDSAVNLSCASSV 291 (370)
T ss_pred EEcCCCccEEEEeccCCCccccccCCCceEEEEEEecCccceEEEecCCCceeEEEecccc
Confidence 589999999999998766 8877888889999999999876666 6667777777765553
No 304
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=73.55 E-value=1.4 Score=33.78 Aligned_cols=13 Identities=46% Similarity=0.800 Sum_probs=12.0
Q ss_pred CCcccCCcEEEEE
Q 001090 1 MSAGYDGKTIVWD 13 (1159)
Q Consensus 1 lSAG~DG~I~lWD 13 (1159)
+|||.||+|+|||
T Consensus 27 ~s~~~D~~i~vwd 39 (39)
T PF00400_consen 27 ASGSSDGTIRVWD 39 (39)
T ss_dssp EEEETTSEEEEEE
T ss_pred eeeCCCCEEEEEC
Confidence 5899999999998
No 305
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=72.64 E-value=4.9 Score=47.70 Aligned_cols=94 Identities=22% Similarity=0.207 Sum_probs=66.9
Q ss_pred CCcccCCcEEEEEccCCee-------eEEeecCcceeeeeeeCCCCCE-EEEecCceeEEEEecCCCCCccCC-ccc---
Q 001090 1 MSAGYDGKTIVWDIWEGIP-------IRIYEISRFRLVDGKFSPDGAS-IILSDDVGQLYILNTGQGESQKDA-KYD--- 68 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~-------Ir~f~~g~~~i~D~~FSPDGt~-~a~tD~~G~l~Ifg~G~g~~~~~~-p~E--- 68 (1159)
+|||.|-+|.||.|=.+-. +..+.||.-.|--++|.|--.- |+++..+-++.||++|+|+..-.. -.+
T Consensus 98 ASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA~NVLlsag~Dn~v~iWnv~tgeali~l~hpd~i~ 177 (472)
T KOG0303|consen 98 ASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTAPNVLLSAGSDNTVSIWNVGTGEALITLDHPDMVY 177 (472)
T ss_pred ecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccchhhHhhccCCceEEEEeccCCceeeecCCCCeEE
Confidence 6899999999999977643 4556688889999999998765 457888889999999999843221 122
Q ss_pred -cccccCCcccee---cCCCCccccccCcC
Q 001090 69 -QFFLGDYRPLVQ---DTYGNVLDQETQLA 94 (1159)
Q Consensus 69 -QFF~tDYrpLir---D~~g~vlDe~Tq~~ 94 (1159)
.=|..|..-|.. |--=-|+|.-|+..
T Consensus 178 S~sfn~dGs~l~TtckDKkvRv~dpr~~~~ 207 (472)
T KOG0303|consen 178 SMSFNRDGSLLCTTCKDKKVRVIDPRRGTV 207 (472)
T ss_pred EEEeccCCceeeeecccceeEEEcCCCCcE
Confidence 247777665543 44344666555543
No 306
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=71.02 E-value=11 Score=44.37 Aligned_cols=119 Identities=18% Similarity=0.287 Sum_probs=73.9
Q ss_pred CcEEEEEc-cCCeeeEEeecC---------------cceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccc
Q 001090 7 GKTIVWDI-WEGIPIRIYEIS---------------RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQF 70 (1159)
Q Consensus 7 G~I~lWDI-~~G~~Ir~f~~g---------------~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQF 70 (1159)
-+|.|||+ -...++.+|.-| -..-|.|+||-|+.++.+|.-.--+.||++..|.... +.
T Consensus 293 ltvk~wD~nme~~pv~t~~vh~~lr~kLc~lYEnD~IfdKFec~~sg~~~~v~TGsy~n~frvf~~~~gsk~d-~t---- 367 (433)
T KOG1354|consen 293 LTVKLWDLNMEAKPVETYPVHEYLRSKLCSLYENDAIFDKFECSWSGNDSYVMTGSYNNVFRVFNLARGSKED-FT---- 367 (433)
T ss_pred ceeEEEeccccCCcceEEeehHhHHHHHHHHhhccchhheeEEEEcCCcceEecccccceEEEecCCCCccee-ec----
Confidence 47899999 778899988744 2456779999999999999999999999955443111 00
Q ss_pred cccCCccceecCCCCccccccCc-CCcccCCCCCcccCCCCCCCchhHHhhhhhhhcccCccccCCccccccCCCCcC
Q 001090 71 FLGDYRPLVQDTYGNVLDQETQL-APHRRNLQDPLCDSAMIPYPEPYQTMYQQRRLGALGIEWRPSSLKLAVGPDFSL 147 (1159)
Q Consensus 71 F~tDYrpLirD~~g~vlDe~Tq~-~PHL~~~~~~L~D~~~~PYp~~~Q~~~~~rrl~~l~~e~~p~~~~~~vg~~~~~ 147 (1159)
+.+.+.. +|+- ...+.+|-..|.--+...--.-...++--|-.-|+|.+=-|||.+.-++
T Consensus 368 ----------------l~asr~~~~~~~-~~k~~~V~~~g~r~~~~~~vd~ldf~kkilh~aWhp~en~ia~aatnnl 428 (433)
T KOG1354|consen 368 ----------------LEASRKNMKPRK-VLKLRLVSSSGKRKRDEISVDALDFRKKILHTAWHPKENSIAVAATNNL 428 (433)
T ss_pred ----------------ccccccCCcccc-cccceeeecCCCccccccccchhhhhhHHHhhccCCccceeeeeecCce
Confidence 0011111 2222 2224556655555443322222233344456679999988888776653
No 307
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=70.72 E-value=8.2 Score=46.12 Aligned_cols=73 Identities=18% Similarity=0.152 Sum_probs=44.9
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEE-EecCceeE--EEEecCCCCCccCCc-----cccccccCCccc
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASII-LSDDVGQL--YILNTGQGESQKDAK-----YDQFFLGDYRPL 78 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a-~tD~~G~l--~Ifg~G~g~~~~~~p-----~EQFF~tDYrpL 78 (1159)
..|++||+.+|..-++.. -.+.-..-+|||||..|| +.+.+|.. +++++..+..-+.+- .+-+|+.|.+-|
T Consensus 218 ~~i~~~~l~~g~~~~i~~-~~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~Lt~~~gi~~~Ps~spdG~~i 296 (425)
T COG0823 218 PRIYYLDLNTGKRPVILN-FNGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRLTNGFGINTSPSWSPDGSKI 296 (425)
T ss_pred ceEEEEeccCCccceeec-cCCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCcceecccCCccccCccCCCCCCEE
Confidence 469999999999766554 344455678999999999 55556655 555554443111111 122566666655
Q ss_pred ee
Q 001090 79 VQ 80 (1159)
Q Consensus 79 ir 80 (1159)
+.
T Consensus 297 vf 298 (425)
T COG0823 297 VF 298 (425)
T ss_pred EE
Confidence 44
No 308
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=70.21 E-value=7.6 Score=46.03 Aligned_cols=57 Identities=14% Similarity=0.268 Sum_probs=47.1
Q ss_pred CcccC--CcEEEEEccCCeeeEEeecC-----------cceeeeeeeCCC--CCEEEEecCceeEEEEecCCCC
Q 001090 2 SAGYD--GKTIVWDIWEGIPIRIYEIS-----------RFRLVDGKFSPD--GASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 2 SAG~D--G~I~lWDI~~G~~Ir~f~~g-----------~~~i~D~~FSPD--Gt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+||-- ..+.|||++.+++| |... +..+-|+.|-|+ -.+||++...|++.+|++-.+.
T Consensus 166 ~GGke~~n~lkiwdle~~~qi--w~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~~qR 237 (412)
T KOG3881|consen 166 TGGKENINELKIWDLEQSKQI--WSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRYHQVRLYDTRHQR 237 (412)
T ss_pred cCchhcccceeeeecccceee--eeccCCCCccccceeeeeeccceecCCCCCceEEEEecceeEEEecCcccC
Confidence 46666 77899999999554 8743 566778999999 9999999999999999987554
No 309
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=70.11 E-value=14 Score=42.45 Aligned_cols=49 Identities=16% Similarity=0.251 Sum_probs=36.3
Q ss_pred EEEEcc-CCeeeEEeecCcceeeeeeeCCCCCEEEEecCc-eeEEEEecCC
Q 001090 10 IVWDIW-EGIPIRIYEISRFRLVDGKFSPDGASIILSDDV-GQLYILNTGQ 58 (1159)
Q Consensus 10 ~lWDI~-~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~-G~l~Ifg~G~ 58 (1159)
.||=|. .|..++.+..+-.---.+||||||+.|-.+|.. +.|+-|++..
T Consensus 144 ~lyr~~p~g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d~ 194 (307)
T COG3386 144 SLYRVDPDGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDP 194 (307)
T ss_pred eEEEEcCCCCEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEecCc
Confidence 344444 566666666554444568999999999999987 8999998864
No 310
>PF09465 LBR_tudor: Lamin-B receptor of TUDOR domain; InterPro: IPR019023 The Lamin-B receptor is a chromatin and lamin binding protein in the inner nuclear membrane. It is one of the integral inner nuclear envelope membrane proteins responsible for targeting nuclear membranes to chromatin, being a downstream effector of Ran, a small Ras-like nuclear GTPase which regulates NE assembly. Lamin-B receptor interacts with importin beta, a Ran-binding protein, thereby directly contributing to the fusion of membrane vesicles and the formation of the nuclear envelope []. ; PDB: 2L8D_A 2DIG_A.
Probab=69.84 E-value=10 Score=34.00 Aligned_cols=42 Identities=33% Similarity=0.663 Sum_probs=30.4
Q ss_pred ccCCCCCCeEEEeeccCCCCCCcEEEEEEEEecCCCCCCCCCCcceeEEEEcCC
Q 001090 965 QRNWTCRDKCKVWWKNESDEDGSWWDGRVLSVKPKSSEFPDSPWERYTVQYKTE 1018 (1159)
Q Consensus 965 ~rnWsvGDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp~wPdSpWe~L~VrWDn~ 1018 (1159)
.+.|..|++++.+|. .+..||.|.|++..... ..++|.++.+
T Consensus 3 ~~k~~~Ge~V~~rWP----~s~lYYe~kV~~~d~~~--------~~y~V~Y~DG 44 (55)
T PF09465_consen 3 SRKFAIGEVVMVRWP----GSSLYYEGKVLSYDSKS--------DRYTVLYEDG 44 (55)
T ss_dssp SSSS-SS-EEEEE-T----TTS-EEEEEEEEEETTT--------TEEEEEETTS
T ss_pred cccccCCCEEEEECC----CCCcEEEEEEEEecccC--------ceEEEEEcCC
Confidence 368999999999995 35689999999977533 4679999876
No 311
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=69.43 E-value=8.1 Score=40.10 Aligned_cols=57 Identities=25% Similarity=0.370 Sum_probs=42.3
Q ss_pred cCCcEEEEEccCCeeeEEeecCc--ceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISR--FRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~--~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
.+|+|..||..+|+.+-.+.-+. ...... ..++|.++.+++..|.|+.|+..+|+..
T Consensus 1 ~~g~l~~~d~~tG~~~W~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~~d~~tG~~~ 59 (238)
T PF13360_consen 1 DDGTLSALDPRTGKELWSYDLGPGIGGPVAT-AVPDGGRVYVASGDGNLYALDAKTGKVL 59 (238)
T ss_dssp -TSEEEEEETTTTEEEEEEECSSSCSSEEET-EEEETTEEEEEETTSEEEEEETTTSEEE
T ss_pred CCCEEEEEECCCCCEEEEEECCCCCCCccce-EEEeCCEEEEEcCCCEEEEEECCCCCEE
Confidence 37999999999999998886422 222222 4557777777799999999998887643
No 312
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=67.52 E-value=12 Score=43.24 Aligned_cols=64 Identities=25% Similarity=0.285 Sum_probs=48.8
Q ss_pred CCcccCCcEEEEEcc-CCeeeEEee---cCcceeeeeeeCC-CCCEEEEecCceeEEEEecC-CCCCccCCc
Q 001090 1 MSAGYDGKTIVWDIW-EGIPIRIYE---ISRFRLVDGKFSP-DGASIILSDDVGQLYILNTG-QGESQKDAK 66 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~-~G~~Ir~f~---~g~~~i~D~~FSP-DGt~~a~tD~~G~l~Ifg~G-~g~~~~~~p 66 (1159)
+|||.||.+.-|||. -++.| |. .|...|.++.=|| ++++||+|.-+-+|.+||+- -+++...++
T Consensus 182 ytGgDD~~l~~~D~R~p~~~i--~~n~kvH~~GV~SI~ss~~~~~~I~TGsYDe~i~~~DtRnm~kPl~~~~ 251 (339)
T KOG0280|consen 182 YTGGDDGSLSCWDIRIPKTFI--WHNSKVHTSGVVSIYSSPPKPTYIATGSYDECIRVLDTRNMGKPLFKAK 251 (339)
T ss_pred EecCCCceEEEEEecCCccee--eecceeeecceEEEecCCCCCceEEEeccccceeeeehhcccCccccCc
Confidence 489999999999998 44443 44 3567777777675 79999999999999999986 344444443
No 313
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=66.82 E-value=10 Score=43.48 Aligned_cols=74 Identities=14% Similarity=0.020 Sum_probs=52.5
Q ss_pred CcccCCcEEEEEccCCeeeEEee-----cCcceeeeeeeCCCCC--EEEEecCceeEEEEecCCCCC-cc-CCccccccc
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYE-----ISRFRLVDGKFSPDGA--SIILSDDVGQLYILNTGQGES-QK-DAKYDQFFL 72 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~-----~g~~~i~D~~FSPDGt--~~a~tD~~G~l~Ifg~G~g~~-~~-~~p~EQFF~ 72 (1159)
.|..||++.||||..=.....+. -|.|+|..|+|||-|. .|--+-..+.+++.|+-.+.. +- ..|.+.||.
T Consensus 220 v~~Qdg~~~I~DVR~~~tpm~~~sstrp~hnGa~R~c~Fsl~g~lDLLf~sEhfs~~hv~D~R~~~~~q~I~i~~d~~~~ 299 (344)
T KOG4532|consen 220 VVFQDGTCAIYDVRNMATPMAEISSTRPHHNGAFRVCRFSLYGLLDLLFISEHFSRVHVVDTRNYVNHQVIVIPDDVERK 299 (344)
T ss_pred EEecCCcEEEEEecccccchhhhcccCCCCCCceEEEEecCCCcceEEEEecCcceEEEEEcccCceeeEEecCcccccc
Confidence 57789999999998755443343 3579999999999775 355788899999999977652 22 234444555
Q ss_pred cCC
Q 001090 73 GDY 75 (1159)
Q Consensus 73 tDY 75 (1159)
.-.
T Consensus 300 ~~t 302 (344)
T KOG4532|consen 300 HNT 302 (344)
T ss_pred ccc
Confidence 443
No 314
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=66.22 E-value=11 Score=32.35 Aligned_cols=30 Identities=17% Similarity=0.226 Sum_probs=27.8
Q ss_pred cceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 27 RFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 27 ~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
...|--++|+|..-.||.+...|+|+||-+
T Consensus 11 ~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl 40 (47)
T PF12894_consen 11 PSRVSCMSWCPTMDLIALGTEDGEVLVYRL 40 (47)
T ss_pred CCcEEEEEECCCCCEEEEEECCCeEEEEEC
Confidence 456889999999999999999999999998
No 315
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=64.47 E-value=3.5 Score=50.91 Aligned_cols=130 Identities=18% Similarity=0.261 Sum_probs=85.5
Q ss_pred CCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC-CccCCc------cccccccCCccc
Q 001090 6 DGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE-SQKDAK------YDQFFLGDYRPL 78 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~-~~~~~p------~EQFF~tDYrpL 78 (1159)
-..|+|+|+..++.++.+..|.--|-.++.+|-|-.|+++.-++++..|++.-+. .|+... .+.-||.-| ||
T Consensus 586 q~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d~k~~WfDldlsskPyk~lr~H~~avr~Va~H~ry-PL 664 (733)
T KOG0650|consen 586 QRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYDKKMCWFDLDLSSKPYKTLRLHEKAVRSVAFHKRY-PL 664 (733)
T ss_pred ccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCCCeeEEEEcccCcchhHHhhhhhhhhhhhhhcccc-ce
Confidence 3579999999999999999999999999999999999999999999999998774 454433 345788888 77
Q ss_pred eecCCCCccccccCcCCcccCCCCCcccCCCCCCCch---hHHhhhhhhhcccCccccCCcc-ccccCCC
Q 001090 79 VQDTYGNVLDQETQLAPHRRNLQDPLCDSAMIPYPEP---YQTMYQQRRLGALGIEWRPSSL-KLAVGPD 144 (1159)
Q Consensus 79 irD~~g~vlDe~Tq~~PHL~~~~~~L~D~~~~PYp~~---~Q~~~~~rrl~~l~~e~~p~~~-~~~vg~~ 144 (1159)
+-.+. |-.|-+--|=|..-++ .-+|-=-| ++.-.--+.||-|..-|+|..+ -|+-|++
T Consensus 665 fas~s----dDgtv~Vfhg~VY~Dl----~qnpliVPlK~L~gH~~~~~~gVLd~~wHP~qpWLfsAGAd 726 (733)
T KOG0650|consen 665 FASGS----DDGTVIVFHGMVYNDL----LQNPLIVPLKRLRGHEKTNDLGVLDTIWHPRQPWLFSAGAD 726 (733)
T ss_pred eeeec----CCCcEEEEeeeeehhh----hcCCceEeeeeccCceeecccceEeecccCCCceEEecCCC
Confidence 65322 2233333343321111 11222111 1111223446777777999853 3444544
No 316
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=64.37 E-value=16 Score=45.75 Aligned_cols=34 Identities=15% Similarity=0.244 Sum_probs=29.9
Q ss_pred eeeeeCCCCCEEEEecCceeEEEEecCCCCCccC
Q 001090 31 VDGKFSPDGASIILSDDVGQLYILNTGQGESQKD 64 (1159)
Q Consensus 31 ~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~ 64 (1159)
..++|-|||+.||+|=.+|+|.|.|...+.....
T Consensus 66 ~sL~W~~DGkllaVg~kdG~I~L~Dve~~~~l~~ 99 (665)
T KOG4640|consen 66 ASLCWRPDGKLLAVGFKDGTIRLHDVEKGGRLVS 99 (665)
T ss_pred eeeeecCCCCEEEEEecCCeEEEEEccCCCceec
Confidence 4899999999999999999999999988764443
No 317
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=64.12 E-value=18 Score=41.17 Aligned_cols=56 Identities=18% Similarity=0.281 Sum_probs=42.0
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
++++|.++.||+.+|+.+-.+..+. . ..++.+|.+|.+++.+|.|+.|+...|+..
T Consensus 247 ~~~~g~l~a~d~~tG~~~W~~~~~~--~--~~p~~~~~~vyv~~~~G~l~~~d~~tG~~~ 302 (377)
T TIGR03300 247 VSYQGRVAALDLRSGRVLWKRDASS--Y--QGPAVDDNRLYVTDADGVVVALDRRSGSEL 302 (377)
T ss_pred EEcCCEEEEEECCCCcEEEeeccCC--c--cCceEeCCEEEEECCCCeEEEEECCCCcEE
Confidence 4578999999999999886665221 1 123346778888899999999999888643
No 318
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=62.85 E-value=16 Score=48.39 Aligned_cols=58 Identities=12% Similarity=0.115 Sum_probs=44.0
Q ss_pred cccCCcEEEEEccCCeeeEEeecCc-------------ceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISR-------------FRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~-------------~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
-.++++|.+||..+|.+..+...|. .....++|+|||+.+|+-...+.|.+|++-.+.
T Consensus 821 Ds~N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~~~~ 891 (1057)
T PLN02919 821 DSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNKGE 891 (1057)
T ss_pred ECCCCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECCCCEEEEEECCCCc
Confidence 3567899999999998776543221 135678999999988888888999999986543
No 319
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=62.67 E-value=12 Score=44.48 Aligned_cols=62 Identities=19% Similarity=0.233 Sum_probs=52.2
Q ss_pred CCcccCCcEEEEEccCC------eeeEEeecCc-ceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 1 MSAGYDGKTIVWDIWEG------IPIRIYEISR-FRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G------~~Ir~f~~g~-~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
+|||.|-+++||.+..- ++|.+...+| .-||.++|---.+.+-+|-..|++..-++.+.++.
T Consensus 72 ~SGGDD~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF~L~F~~~N~~~~SG~~~~~VI~HDiEt~qsi 140 (609)
T KOG4227|consen 72 ASGGDDMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIFSLEFDLENRFLYSGERWGTVIKHDIETKQSI 140 (609)
T ss_pred eecCCcceeeeechHHHHhhcCCCCceeccCccccceEEEEEccCCeeEecCCCcceeEeeecccceee
Confidence 58999999999999754 4555555444 88999999999999999999999999999887644
No 320
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=62.57 E-value=34 Score=39.78 Aligned_cols=95 Identities=20% Similarity=0.289 Sum_probs=63.3
Q ss_pred EEEEEccCCeeeEEeecCcceee--eeeeCCCCCEEEEe-----cCceeEEEEecCCCC-------CccCCccccccccC
Q 001090 9 TIVWDIWEGIPIRIYEISRFRLV--DGKFSPDGASIILS-----DDVGQLYILNTGQGE-------SQKDAKYDQFFLGD 74 (1159)
Q Consensus 9 I~lWDI~~G~~Ir~f~~g~~~i~--D~~FSPDGt~~a~t-----D~~G~l~Ifg~G~g~-------~~~~~p~EQFF~tD 74 (1159)
..+||..+|+.+..+....+--| =++|||||+.|.+| +..|.|-||+.-.+- +.---|-|.-++.|
T Consensus 30 ~~v~D~~~g~~~~~~~a~~gRHFyGHg~fs~dG~~LytTEnd~~~g~G~IgVyd~~~~~~ri~E~~s~GIGPHel~l~pD 109 (305)
T PF07433_consen 30 ALVFDCRTGQLLQRLWAPPGRHFYGHGVFSPDGRLLYTTENDYETGRGVIGVYDAARGYRRIGEFPSHGIGPHELLLMPD 109 (305)
T ss_pred EEEEEcCCCceeeEEcCCCCCEEecCEEEcCCCCEEEEeccccCCCcEEEEEEECcCCcEEEeEecCCCcChhhEEEcCC
Confidence 57899999999988765444333 36899999999976 568999999987221 11112556778888
Q ss_pred CccceecCCCCcc-cccc-CcCCcccCCCCCc
Q 001090 75 YRPLVQDTYGNVL-DQET-QLAPHRRNLQDPL 104 (1159)
Q Consensus 75 YrpLirD~~g~vl-De~T-q~~PHL~~~~~~L 104 (1159)
.+-|+- +||-+. ..++ ...=.|-.|+|.|
T Consensus 110 G~tLvV-ANGGI~Thpd~GR~kLNl~tM~psL 140 (305)
T PF07433_consen 110 GETLVV-ANGGIETHPDSGRAKLNLDTMQPSL 140 (305)
T ss_pred CCEEEE-EcCCCccCcccCceecChhhcCCce
Confidence 888887 666643 3333 2333444466633
No 321
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=62.32 E-value=14 Score=46.55 Aligned_cols=56 Identities=16% Similarity=0.318 Sum_probs=43.7
Q ss_pred cccCCcEEEEEccCCeeeE-Eee-----cCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 3 AGYDGKTIVWDIWEGIPIR-IYE-----ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir-~f~-----~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
|+.-|.|.+.=+..+.+=. +|. .|...|-.++||+||+.|-+||+.|+|...-+-+
T Consensus 94 gt~~g~V~v~ql~~~~p~~~~~~t~~d~~~~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s 155 (726)
T KOG3621|consen 94 GTASGRVSVFQLNKELPRDLDYVTPCDKSHKCRVTALEWSKNGMKLYSGDSQGKVVLTELDS 155 (726)
T ss_pred hcCCceEEeehhhccCCCcceeeccccccCCceEEEEEecccccEEeecCCCceEEEEEech
Confidence 5566777777777765221 222 3578999999999999999999999999988776
No 322
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=61.17 E-value=19 Score=39.37 Aligned_cols=74 Identities=15% Similarity=0.170 Sum_probs=49.0
Q ss_pred cCCcEEEEEccCCeeeEEe-------e-------cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccc
Q 001090 5 YDGKTIVWDIWEGIPIRIY-------E-------ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQF 70 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f-------~-------~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQF 70 (1159)
.+|.+.+|||.+++++-.- . .....|..+.++.+|.=||+- +.|+.+.|+..-+.=.+-+-.--|
T Consensus 30 ~~G~l~vWnl~~~k~~~~~~Si~pll~~~~~~~~~~~~~i~~~~lt~~G~PiV~l-sng~~y~y~~~L~~W~~vsd~w~~ 108 (219)
T PF07569_consen 30 SSGLLYVWNLKKGKAVLPPVSIAPLLNSSPVSDKSSSPNITSCSLTSNGVPIVTL-SNGDSYSYSPDLGCWIRVSDSWWA 108 (219)
T ss_pred CCCeEEEEECCCCeeccCCccHHHHhcccccccCCCCCcEEEEEEcCCCCEEEEE-eCCCEEEeccccceeEEeccchhh
Confidence 5899999999999976432 1 245679999999999988755 458889998655431111111125
Q ss_pred cccCCccce
Q 001090 71 FLGDYRPLV 79 (1159)
Q Consensus 71 F~tDYrpLi 79 (1159)
+.++|....
T Consensus 109 ~~S~~~~~~ 117 (219)
T PF07569_consen 109 IGSQYWDSL 117 (219)
T ss_pred hhccccccc
Confidence 555665433
No 323
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=60.98 E-value=19 Score=47.71 Aligned_cols=92 Identities=13% Similarity=0.003 Sum_probs=61.6
Q ss_pred cccCCcEEEEEccCCeeeEEeecC----------------c------ceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEIS----------------R------FRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g----------------~------~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
.+.+++|++||+.+|.... +.++ . ..-..++|+|||..+++-...++|.+|+...+.
T Consensus 758 Ds~n~~Irv~D~~tg~~~~-~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~ 836 (1057)
T PLN02919 758 DSESSSIRALDLKTGGSRL-LAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKR 836 (1057)
T ss_pred ECCCCeEEEEECCCCcEEE-EEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCCEEEEEECCCCe
Confidence 3457899999999876421 1111 0 123578999999988888888999999874332
Q ss_pred Cc-------------------cCCccccccccCCccceecCCCC---ccccccCcCC
Q 001090 61 SQ-------------------KDAKYDQFFLGDYRPLVQDTYGN---VLDQETQLAP 95 (1159)
Q Consensus 61 ~~-------------------~~~p~EQFF~tDYrpLirD~~g~---vlDe~Tq~~P 95 (1159)
.. -..|..-.|..|.+-+|-|.+++ ++|..|....
T Consensus 837 v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~~~~~~ 893 (1057)
T PLN02919 837 VTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNKGEAA 893 (1057)
T ss_pred EEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECCCCEEEEEECCCCccc
Confidence 11 01344456777888888887776 6787776543
No 324
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=59.30 E-value=13 Score=38.52 Aligned_cols=58 Identities=21% Similarity=0.326 Sum_probs=40.1
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
++.+|.|+.||+.+|+.+-.+... ..+...- ..++..++++...|+|+.|+...|+..
T Consensus 42 ~~~~~~l~~~d~~tG~~~W~~~~~-~~~~~~~-~~~~~~v~v~~~~~~l~~~d~~tG~~~ 99 (238)
T PF13360_consen 42 ASGDGNLYALDAKTGKVLWRFDLP-GPISGAP-VVDGGRVYVGTSDGSLYALDAKTGKVL 99 (238)
T ss_dssp EETTSEEEEEETTTSEEEEEEECS-SCGGSGE-EEETTEEEEEETTSEEEEEETTTSCEE
T ss_pred EcCCCEEEEEECCCCCEEEEeecc-cccccee-eecccccccccceeeeEecccCCccee
Confidence 468999999999999999888752 2211112 334555555557779999998887643
No 325
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=58.81 E-value=16 Score=48.07 Aligned_cols=53 Identities=17% Similarity=0.281 Sum_probs=41.3
Q ss_pred cCCcEEEEEccCCeeeEEeec--C-cceeeeeeeCCCCCEEEEecCceeEEEEecC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEI--S-RFRLVDGKFSPDGASIILSDDVGQLYILNTG 57 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~--g-~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G 57 (1159)
-+|.|.+||+.+|..++.|+. + +..|+-..+.-++..+.++|.-|.+|=.-+-
T Consensus 150 ~~G~V~v~D~~~~k~l~~i~e~~ap~t~vi~v~~t~~nS~llt~D~~Gsf~~lv~n 205 (1206)
T KOG2079|consen 150 GDGHVTVWDMHRAKILKVITEHGAPVTGVIFVGRTSQNSKLLTSDTGGSFWKLVFN 205 (1206)
T ss_pred CCCcEEEEEccCCcceeeeeecCCccceEEEEEEeCCCcEEEEccCCCceEEEEec
Confidence 378999999999999999873 2 3456666677777799999999986655443
No 326
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=57.54 E-value=23 Score=40.44 Aligned_cols=53 Identities=17% Similarity=0.240 Sum_probs=37.3
Q ss_pred cCCcEEEEEc--cCCee--eEEeecCcceeeeeeeCCCCCEEEEec-CceeEEEEecC
Q 001090 5 YDGKTIVWDI--WEGIP--IRIYEISRFRLVDGKFSPDGASIILSD-DVGQLYILNTG 57 (1159)
Q Consensus 5 ~DG~I~lWDI--~~G~~--Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~~G~l~Ifg~G 57 (1159)
-.+.|.+++| .+|.. +..+..+...-.+++|+|||.+|+++. ..+.|.+|.+.
T Consensus 265 ~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d 322 (345)
T PF10282_consen 265 GSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDID 322 (345)
T ss_dssp TTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEEE
T ss_pred cCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEEe
Confidence 3677899999 45653 333333333478999999999999776 67799999874
No 327
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=57.33 E-value=10 Score=45.94 Aligned_cols=51 Identities=20% Similarity=0.328 Sum_probs=40.8
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEE
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLY 52 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~ 52 (1159)
+|+|.|..|.||||..=..+.++.. +...--+.||--|..-++..++-+||
T Consensus 309 aTtG~Dr~~kIWDlR~~~ql~t~~t-p~~a~~ls~SqkglLA~~~G~~v~iw 359 (545)
T KOG1272|consen 309 ATTGLDRKVKIWDLRNFYQLHTYRT-PHPASNLSLSQKGLLALSYGDHVQIW 359 (545)
T ss_pred eecccccceeEeeeccccccceeec-CCCccccccccccceeeecCCeeeee
Confidence 5899999999999998888888777 66777788998888777666654443
No 328
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=56.83 E-value=11 Score=51.83 Aligned_cols=94 Identities=20% Similarity=0.284 Sum_probs=70.6
Q ss_pred CCcccCCcEEEEEccCCeeeEEee-cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCcccc---ccccCCc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYE-ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQ---FFLGDYR 76 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~-~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQ---FF~tDYr 76 (1159)
+||+-||.|++|-.-.|..|-+|. .|+..|--+.|+-.|..|..+|-+|.|.+|-.+... | ..-| ==.+|++
T Consensus 2224 ltgs~dgsv~~~~w~~~~~v~~~rt~g~s~vtr~~f~~qGnk~~i~d~dg~l~l~q~~pk~-~---~s~qchnk~~~Df~ 2299 (2439)
T KOG1064|consen 2224 LTGSQDGSVRMFEWGHGQQVVCFRTAGNSRVTRSRFNHQGNKFGIVDGDGDLSLWQASPKP-Y---TSWQCHNKALSDFR 2299 (2439)
T ss_pred EecCCCceEEEEeccCCCeEEEeeccCcchhhhhhhcccCCceeeeccCCceeecccCCcc-e---eccccCCcccccee
Confidence 589999999999999999999997 567889999999999999999999999999877221 1 0011 1123333
Q ss_pred c---ce----e-cCCCCccccccCcCCccc
Q 001090 77 P---LV----Q-DTYGNVLDQETQLAPHRR 98 (1159)
Q Consensus 77 p---Li----r-D~~g~vlDe~Tq~~PHL~ 98 (1159)
- |+ + ..|++|-+=.|-+||-+.
T Consensus 2300 Fi~s~~~tag~s~d~~n~~lwDtl~~~~~s 2329 (2439)
T KOG1064|consen 2300 FIGSLLATAGRSSDNRNVCLWDTLLPPMNS 2329 (2439)
T ss_pred eeehhhhccccCCCCCcccchhcccCcccc
Confidence 2 22 2 356778888888887663
No 329
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=56.38 E-value=15 Score=45.39 Aligned_cols=56 Identities=14% Similarity=0.146 Sum_probs=52.9
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
+|||.||.-+||+-.+|..++++.+....|..+.=.|+--.+|++--+-+|.||..
T Consensus 640 asgSddgr~fiwek~tg~i~av~~gdssivnciqghP~~~~latSgiDstiKIwsp 695 (758)
T KOG1310|consen 640 ASGSDDGRFFIWEKLTGSILAVIHGDSSIVNCIQGHPRCPTLATSGIDSTIKIWSP 695 (758)
T ss_pred eEecCCCceEEeecCCcceEEEeeCchhheeeccCCCCCceeeeccCccceEEecc
Confidence 58999999999999999999999999999999999999999999999999999984
No 330
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=55.45 E-value=19 Score=42.55 Aligned_cols=63 Identities=17% Similarity=0.145 Sum_probs=48.7
Q ss_pred CCcccCCcEEEEEccCCee---eEEeecCc--ceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCcc
Q 001090 1 MSAGYDGKTIVWDIWEGIP---IRIYEISR--FRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQK 63 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~---Ir~f~~g~--~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~ 63 (1159)
|+++-+|+|.|||+.-=+| |+.|+||- .+-.-+..-|.-..|+++-++=...||.+-+|..+.
T Consensus 315 maS~M~gkikLyD~R~~K~~~~V~qYeGHvN~~a~l~~~v~~eeg~I~s~GdDcytRiWsl~~ghLl~ 382 (425)
T KOG2695|consen 315 MASDMTGKIKLYDLRATKCKKSVMQYEGHVNLSAYLPAHVKEEEGSIFSVGDDCYTRIWSLDSGHLLC 382 (425)
T ss_pred eeccCcCceeEeeehhhhcccceeeeecccccccccccccccccceEEEccCeeEEEEEecccCceee
Confidence 5678899999999987776 99999983 344555666777777777788889999988776443
No 331
>COG4246 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=53.99 E-value=17 Score=41.60 Aligned_cols=66 Identities=26% Similarity=0.387 Sum_probs=49.4
Q ss_pred cceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCccceecCCCCccccccCcCCcccCCCCCccc
Q 001090 27 RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQDTYGNVLDQETQLAPHRRNLQDPLCD 106 (1159)
Q Consensus 27 ~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLirD~~g~vlDe~Tq~~PHL~~~~~~L~D 106 (1159)
-+++.+++|-|||..|++.-++|||+- | .+.||++|-+.- +.-.. |+ +++|
T Consensus 73 fgalSairf~~dG~~fiav~DtG~wfe-g---------------------~i~rDa~grl~G----l~dgr--~~-pm~d 123 (340)
T COG4246 73 FGALSAIRFLPDGSQFIAVTDTGHWFE-G---------------------KIQRDANGRLAG----LTDGR--LT-PMRD 123 (340)
T ss_pred ccchheeEeccCCceeEEEeecCceEE-E---------------------EEEeccCCCccc----ccccc--ee-eccc
Confidence 478999999999999999999999863 2 578999998432 22222 22 5889
Q ss_pred CCCCCCCchhHHhhh
Q 001090 107 SAMIPYPEPYQTMYQ 121 (1159)
Q Consensus 107 ~~~~PYp~~~Q~~~~ 121 (1159)
.+|.|-+..+|-.-.
T Consensus 124 ~~Gqpi~~K~e~DaE 138 (340)
T COG4246 124 LDGQPIQEKWEVDAE 138 (340)
T ss_pred CCCCCCcchhccccc
Confidence 999999987775444
No 332
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=53.50 E-value=8.7 Score=45.01 Aligned_cols=72 Identities=29% Similarity=0.514 Sum_probs=43.8
Q ss_pred cccCcCCcccCCCCCcccCCCCCCCchhHHhhhhhhhcccCccccCCccccccCCCCc-C---CCccccccccCcccccC
Q 001090 89 QETQLAPHRRNLQDPLCDSAMIPYPEPYQTMYQQRRLGALGIEWRPSSLKLAVGPDFS-L---DQGYQLQPLADLDVMID 164 (1159)
Q Consensus 89 e~Tq~~PHL~~~~~~L~D~~~~PYp~~~Q~~~~~rrl~~l~~e~~p~~~~~~vg~~~~-~---~~~~~~~p~~~~~~~~~ 164 (1159)
.--|++||||..|| ..+.|-|..-|..|+--.++..| -|+..+.|..|.++ + +...|++||.|
T Consensus 293 P~m~l~phl~gppP----ga~pPaph~NpaffpPP~~~s~G---ppP~~~~~~~pPyn~~~~r~~~p~~~plSe------ 359 (498)
T KOG4849|consen 293 PMMQLSPHLMGPPP----GAGPPAPHNNPAFFPPPQLGSMG---PPPQMNTAMRPPYNGLPPRHVNPQMFPLSE------ 359 (498)
T ss_pred cccccCcccCCCCC----CCCCCCcccCcccCCCCCcCcCC---CCCCCccCCCCCCCCCCcccCCCCCccchH------
Confidence 34457788876443 12224444445555544466666 67777888887777 2 24567777665
Q ss_pred CCccccccccC
Q 001090 165 PLPEFIDVMDW 175 (1159)
Q Consensus 165 ~~~e~~d~~~w 175 (1159)
+||.|.|.-
T Consensus 360 --AEFEdiM~R 368 (498)
T KOG4849|consen 360 --AEFEDIMTR 368 (498)
T ss_pred --HHHHHHHhh
Confidence 478888853
No 333
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=53.17 E-value=24 Score=45.56 Aligned_cols=52 Identities=15% Similarity=0.129 Sum_probs=41.6
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
|-+-+|.-.+-+.--+-..|+.+|.-+.|||||+.+.++|..|.+++|-+..
T Consensus 81 g~~~v~~~~~~e~htv~~th~a~i~~l~wS~~G~~l~t~d~~g~v~lwr~d~ 132 (1416)
T KOG3617|consen 81 GVSDVQKTNTTETHTVVETHPAPIQGLDWSHDGTVLMTLDNPGSVHLWRYDV 132 (1416)
T ss_pred ceeEEEecCCceeeeeccCCCCCceeEEecCCCCeEEEcCCCceeEEEEeee
Confidence 4555666555554444557889999999999999999999999999998764
No 334
>PF14655 RAB3GAP2_N: Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=52.78 E-value=24 Score=42.37 Aligned_cols=38 Identities=18% Similarity=0.275 Sum_probs=32.9
Q ss_pred cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 25 ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 25 ~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
-..-.+.+++.+|+|+..|++|+.|+|+++|+..+...
T Consensus 305 D~~R~~~~i~~sP~~~laA~tDslGRV~LiD~~~~~vv 342 (415)
T PF14655_consen 305 DSKREGESICLSPSGRLAAVTDSLGRVLLIDVARGIVV 342 (415)
T ss_pred cCCceEEEEEECCCCCEEEEEcCCCcEEEEECCCChhh
Confidence 34566888999999999999999999999999887643
No 335
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=51.91 E-value=18 Score=46.34 Aligned_cols=62 Identities=24% Similarity=0.242 Sum_probs=51.4
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCC---CC-CEEEEecCceeEEEEecCCCCCc
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSP---DG-ASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSP---DG-t~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
+||..-|+|+|||...|..+..|..+..+|.|++|=| |. -.|++-..--.|.+|.+-+|..+
T Consensus 83 AsaD~~GrIil~d~~~~s~~~~l~~~~~~~qdl~W~~~rd~Srd~LlaIh~ss~lvLwntdtG~k~ 148 (1062)
T KOG1912|consen 83 ASADISGRIILVDFVLASVINWLSHSNDSVQDLCWVPARDDSRDVLLAIHGSSTLVLWNTDTGEKF 148 (1062)
T ss_pred EeccccCcEEEEEehhhhhhhhhcCCCcchhheeeeeccCcchheeEEecCCcEEEEEEccCCcee
Confidence 4688899999999999999999999999999999854 55 45667777788999988887643
No 336
>PRK02888 nitrous-oxide reductase; Validated
Probab=51.77 E-value=28 Score=43.91 Aligned_cols=54 Identities=19% Similarity=0.139 Sum_probs=42.2
Q ss_pred CCcEEEEEccC----CeeeEEeecCcceeeeeeeCCCCCEEEEecC-ceeEEEEecCCC
Q 001090 6 DGKTIVWDIWE----GIPIRIYEISRFRLVDGKFSPDGASIILSDD-VGQLYILNTGQG 59 (1159)
Q Consensus 6 DG~I~lWDI~~----G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~-~G~l~Ifg~G~g 59 (1159)
+++|.+.|..+ |..+..|..=.-..+-+++||||++|+++.- ..+++|+++-.-
T Consensus 295 gn~V~VID~~t~~~~~~~v~~yIPVGKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~ 353 (635)
T PRK02888 295 GSKVPVVDGRKAANAGSALTRYVPVPKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKL 353 (635)
T ss_pred CCEEEEEECCccccCCcceEEEEECCCCccceEECCCCCEEEEeCCCCCcEEEEEChhh
Confidence 56799999998 6666666655556677899999999986655 899999998653
No 337
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=48.67 E-value=43 Score=35.82 Aligned_cols=50 Identities=20% Similarity=0.175 Sum_probs=34.9
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec-CceeEEEEecCC
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD-DVGQLYILNTGQ 58 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~~G~l~Ifg~G~ 58 (1159)
|.|..+|.. |+....+.+ -...--++|||||.+|.++| ..++|+.|++..
T Consensus 115 g~v~~~~~~-~~~~~~~~~-~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~ 165 (246)
T PF08450_consen 115 GSVYRIDPD-GKVTVVADG-LGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDA 165 (246)
T ss_dssp EEEEEEETT-SEEEEEEEE-ESSEEEEEEETTSSEEEEEETTTTEEEEEEEET
T ss_pred cceEEECCC-CeEEEEecC-cccccceEECCcchheeecccccceeEEEeccc
Confidence 567777777 665444433 33456789999999887555 567899998854
No 338
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=48.45 E-value=56 Score=34.99 Aligned_cols=108 Identities=19% Similarity=0.182 Sum_probs=61.0
Q ss_pred CcEEEEEccCCeeeEEeec-----CcceeeeeeeCCCCCEEEEecCc--------eeEEEEecCCCCCcc-----CCccc
Q 001090 7 GKTIVWDIWEGIPIRIYEI-----SRFRLVDGKFSPDGASIILSDDV--------GQLYILNTGQGESQK-----DAKYD 68 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~-----g~~~i~D~~FSPDGt~~a~tD~~--------G~l~Ifg~G~g~~~~-----~~p~E 68 (1159)
+.+.++|+.+|+....+.. .....-|++|.|||...++.... |.|+.++.+ +.... ..|.-
T Consensus 60 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~~~~pNG 138 (246)
T PF08450_consen 60 GGIAVVDPDTGKVTVLADLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADGLGFPNG 138 (246)
T ss_dssp TCEEEEETTTTEEEEEEEEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEEESSEEE
T ss_pred CceEEEecCCCcEEEEeeccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecCcccccc
Confidence 4455669999976555542 45788999999999966654433 889999988 43111 12223
Q ss_pred cccccCCccce-ecCCCCc-----cccccC---------cCCcccCCC-CCcccCCCCCCCch
Q 001090 69 QFFLGDYRPLV-QDTYGNV-----LDQETQ---------LAPHRRNLQ-DPLCDSAMIPYPEP 115 (1159)
Q Consensus 69 QFF~tDYrpLi-rD~~g~v-----lDe~Tq---------~~PHL~~~~-~~L~D~~~~PYp~~ 115 (1159)
--|..|.+-|+ .|...+. +|..+. ..+.....| |+.+|..|+-|-..
T Consensus 139 i~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~ 201 (246)
T PF08450_consen 139 IAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVAD 201 (246)
T ss_dssp EEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEE
T ss_pred eEECCcchheeecccccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEE
Confidence 35777776654 4544432 222222 011111113 47888888877653
No 339
>KOG0732 consensus AAA+-type ATPase containing the bromodomain [Posttranslational modification, protein turnover, chaperones]
Probab=48.39 E-value=8.5 Score=50.64 Aligned_cols=76 Identities=8% Similarity=0.014 Sum_probs=59.3
Q ss_pred cCcccccCCCCCCHHHHHHHHHcCCCCCHHHHHH--HHHHHHHhhhhccCCC--------CHHHHHHHHHHHHHHHHHhc
Q 001090 1079 KTNFTNRFPVPLSLDVIQSRLENNYYRGLEAVKH--DIAVMLSNAESYFGRN--------TDLSTKIKRLSDLVTRTLSS 1148 (1159)
Q Consensus 1079 ~PdY~~iIk~PMDLsTIr~RLeN~~YRsleaf~~--DVrLI~sNA~tFN~p~--------S~I~k~Ak~L~d~F~~~Lkk 1148 (1159)
.++|..+|+.|||+...-.+++...|..+-++.. ++.|||.|+..||+.. ..|.+++..+...-....+.
T Consensus 550 ~~~~~~~iq~~~~va~~~~k~~e~~~~~v~~~e~~~~i~lic~~~lli~~~~~~g~~~lg~aIlh~~~~~~v~s~~issl 629 (1080)
T KOG0732|consen 550 FQDALEDIQGLMDVASSMAKIEEHLKLLVRSFESNFAIRLICRPRLLINGGKGSGQDYLGPAILHRLEGLPVQSLDISSL 629 (1080)
T ss_pred hHHHHHHhhcchhHHhhhhhHHHHhHHHHHhhhcccchhhhcCcHHhcCCCcccccCcccHHHHHHHhccchHHHHHHHH
Confidence 6788999999999999999999999999999999 9999999999999975 34555555554444444444
Q ss_pred CCCCCC
Q 001090 1149 LKAPQF 1154 (1159)
Q Consensus 1149 L~~~q~ 1154 (1159)
|-.+.+
T Consensus 630 l~d~~~ 635 (1080)
T KOG0732|consen 630 LSDEGT 635 (1080)
T ss_pred Hhcccc
Confidence 444433
No 340
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=44.04 E-value=37 Score=27.81 Aligned_cols=34 Identities=21% Similarity=0.564 Sum_probs=25.7
Q ss_pred CCeEEEeeccCCCCCCcEEEEEEEEecCCCCCCCCCCcceeEEEEcC
Q 001090 971 RDKCKVWWKNESDEDGSWWDGRVLSVKPKSSEFPDSPWERYTVQYKT 1017 (1159)
Q Consensus 971 GDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp~wPdSpWe~L~VrWDn 1017 (1159)
|+.|...|.+ ++.||.|+|+++.+ . ..++|.+.+
T Consensus 1 G~~c~a~~~~----d~~wyra~V~~~~~-~--------~~~~V~f~D 34 (48)
T cd04508 1 GDLCLAKYSD----DGKWYRAKITSILS-D--------GKVEVFFVD 34 (48)
T ss_pred CCEEEEEECC----CCeEEEEEEEEECC-C--------CcEEEEEEc
Confidence 7889998853 58999999999985 1 235677765
No 341
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=43.96 E-value=51 Score=38.12 Aligned_cols=57 Identities=18% Similarity=0.090 Sum_probs=37.5
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeee-eeeCCCCCEEEEec--Cc---------eeEEEEecCCCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVD-GKFSPDGASIILSD--DV---------GQLYILNTGQGES 61 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D-~~FSPDGt~~a~tD--~~---------G~l~Ifg~G~g~~ 61 (1159)
|+.--+|+|.||.+|+.+.--... .-+. +.|++||+.|..+. .. =+|+.+.+|++..
T Consensus 146 G~e~~~l~v~Dl~tg~~l~d~i~~--~~~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~~gt~~~ 214 (414)
T PF02897_consen 146 GSEWYTLRVFDLETGKFLPDGIEN--PKFSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHKLGTPQS 214 (414)
T ss_dssp TSSEEEEEEEETTTTEEEEEEEEE--EESEEEEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEETTS-GG
T ss_pred CCceEEEEEEECCCCcCcCCcccc--cccceEEEeCCCCEEEEEEeCcccccccCCCCcEEEEEECCCChH
Confidence 344456999999999776532111 2222 99999999988553 32 2388999998753
No 342
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=42.98 E-value=46 Score=39.11 Aligned_cols=144 Identities=20% Similarity=0.295 Sum_probs=83.4
Q ss_pred eEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCcccccc-ccCCccceecCCCCccccccCcCCccc
Q 001090 20 IRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFF-LGDYRPLVQDTYGNVLDQETQLAPHRR 98 (1159)
Q Consensus 20 Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF-~tDYrpLirD~~g~vlDe~Tq~~PHL~ 98 (1159)
+..|.....+|+.-||++|++.||++-..-.++||..-.+. +++.+- ..++ |..=..|| -+|++-
T Consensus 3 ~~~~~~~~~pitchAwn~drt~iAv~~~~~evhiy~~~~~~-----~w~~~htls~H-----d~~vtgvd----Wap~sn 68 (361)
T KOG1523|consen 3 VVVFHRLLEPITCHAWNSDRTQIAVSPNNHEVHIYSMLGAD-----LWEPAHTLSEH-----DKIVTGVD----WAPKSN 68 (361)
T ss_pred eEEeeeccCceeeeeecCCCceEEeccCCceEEEEEecCCC-----Cceeceehhhh-----CcceeEEe----ecCCCC
Confidence 34566677899999999999999999999999999854432 122110 0111 11001133 245551
Q ss_pred CCCCCcccCCCCCCCchh-------HHhhhhhh-hcccCccccCCccccccCCCCcCC-C-ccccccccCcccc----c-
Q 001090 99 NLQDPLCDSAMIPYPEPY-------QTMYQQRR-LGALGIEWRPSSLKLAVGPDFSLD-Q-GYQLQPLADLDVM----I- 163 (1159)
Q Consensus 99 ~~~~~L~D~~~~PYp~~~-------Q~~~~~rr-l~~l~~e~~p~~~~~~vg~~~~~~-~-~~~~~p~~~~~~~----~- 163 (1159)
. =.-|-.+++-|-... +.+|.-|- -+|.++.|-|.+=+||||.+.-+- . -|+. +-|=| |
T Consensus 69 r--Ivtcs~drnayVw~~~~~~~WkptlvLlRiNrAAt~V~WsP~enkFAVgSgar~isVcy~E~----ENdWWVsKhik 142 (361)
T KOG1523|consen 69 R--IVTCSHDRNAYVWTQPSGGTWKPTLVLLRINRAATCVKWSPKENKFAVGSGARLISVCYYEQ----ENDWWVSKHIK 142 (361)
T ss_pred c--eeEccCCCCccccccCCCCeeccceeEEEeccceeeEeecCcCceEEeccCccEEEEEEEec----ccceehhhhhC
Confidence 1 134455555554432 23444332 367899999999999999887632 1 1111 11211 2
Q ss_pred CCCccccccccCCccccccc
Q 001090 164 DPLPEFIDVMDWEPENEVQS 183 (1159)
Q Consensus 164 ~~~~e~~d~~~wepe~ev~s 183 (1159)
.|+---+--.+|-|-|-.++
T Consensus 143 kPirStv~sldWhpnnVLla 162 (361)
T KOG1523|consen 143 KPIRSTVTSLDWHPNNVLLA 162 (361)
T ss_pred CccccceeeeeccCCcceec
Confidence 35555555688998775543
No 343
>KOG2002 consensus TPR-containing nuclear phosphoprotein that regulates K(+) uptake [Inorganic ion transport and metabolism]
Probab=41.39 E-value=19 Score=46.89 Aligned_cols=50 Identities=32% Similarity=0.244 Sum_probs=24.1
Q ss_pred ccchHHHHHHHHhhhhhhccCCCCCCCCCCCCCCCCCcccccccccccchhhhhhh
Q 001090 289 SLRPQRAAARNARSFFSKITGASTDGEDVDGSEGELSESESDLQDSYIESEESGRS 344 (1159)
Q Consensus 289 ~~rpqr~aarnal~~fski~~~~~~~ed~d~~~~~~s~s~~~~~ds~~~~~~~~~~ 344 (1159)
..-++|+-+ .||-.-++.+.+|+|-.-...+.++++.+++.--.++++++
T Consensus 940 ~~~~~~~k~------~sk~~~~t~e~~D~~~~k~~~~~~~~ds~~~~~~~~~~~~~ 989 (1018)
T KOG2002|consen 940 LSESDRRKP------KSKAFISTSERSDDDVVKKAESDSDDDSQDSREASEESDRP 989 (1018)
T ss_pred cChhhccch------hhhhhhcccccccccccCcccCCcccccccccchhhccCCc
Confidence 345666322 24433344442332222234455555666666666666655
No 344
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=40.38 E-value=73 Score=39.26 Aligned_cols=76 Identities=18% Similarity=0.032 Sum_probs=53.8
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCce----eEEEEecCCCCCccCC-ccc----cccccCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVG----QLYILNTGQGESQKDA-KYD----QFFLGDY 75 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G----~l~Ifg~G~g~~~~~~-p~E----QFF~tDY 75 (1159)
+-+.|-+.||.+|.+-.+=...-+.|.|+.|||+++.||-+=-.| +|.||++..++.+..+ |+- --|-.|.
T Consensus 421 dr~el~vididngnv~~idkS~~~lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~~~Kiy~vTT~ta~DfsPaFD~d~ 500 (668)
T COG4946 421 DRFELWVIDIDNGNVRLIDKSEYGLITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMDGGKIYDVTTPTAYDFSPAFDPDG 500 (668)
T ss_pred CceEEEEEEecCCCeeEecccccceeEEEEEcCCceeEEEecCcceeeeeEEEEecCCCeEEEecCCcccccCcccCCCC
Confidence 344577889999986543334568899999999999999655544 7899999888877543 422 2355565
Q ss_pred cccee
Q 001090 76 RPLVQ 80 (1159)
Q Consensus 76 rpLir 80 (1159)
+-|..
T Consensus 501 ryLYf 505 (668)
T COG4946 501 RYLYF 505 (668)
T ss_pred cEEEE
Confidence 55544
No 345
>PRK13616 lipoprotein LpqB; Provisional
Probab=40.22 E-value=36 Score=42.60 Aligned_cols=45 Identities=13% Similarity=0.074 Sum_probs=30.1
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
|+|.+.++..|+.-+ ...+.|.+++|||||+.||..- .|+|++-+
T Consensus 430 gql~~~~vd~ge~~~---~~~g~Issl~wSpDG~RiA~i~-~g~v~Va~ 474 (591)
T PRK13616 430 GQLARTPVDASAVAS---RVPGPISELQLSRDGVRAAMII-GGKVYLAV 474 (591)
T ss_pred ceEEEEeccCchhhh---ccCCCcCeEEECCCCCEEEEEE-CCEEEEEE
Confidence 344444555554433 1235699999999999999765 48998843
No 346
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=39.63 E-value=68 Score=38.14 Aligned_cols=34 Identities=26% Similarity=0.422 Sum_probs=29.6
Q ss_pred cceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 27 RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 27 ~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
.++|.-+++||+|++||.-.+.|+|+|......+
T Consensus 216 ~~~i~~iavSpng~~iAl~t~~g~l~v~ssDf~~ 249 (410)
T PF04841_consen 216 DGPIIKIAVSPNGKFIALFTDSGNLWVVSSDFSE 249 (410)
T ss_pred CCCeEEEEECCCCCEEEEEECCCCEEEEECcccc
Confidence 3689999999999999999999999998766554
No 347
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=39.56 E-value=95 Score=38.74 Aligned_cols=77 Identities=14% Similarity=0.180 Sum_probs=47.3
Q ss_pred cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCc--eeEEEEecCCCCCc--cCCc-cccccccCCccceecC
Q 001090 8 KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDV--GQLYILNTGQGESQ--KDAK-YDQFFLGDYRPLVQDT 82 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~--G~l~Ifg~G~g~~~--~~~p-~EQFF~tDYrpLirD~ 82 (1159)
++.+-++. |.-.++=..+.|+|+|++|+|+|+-||++=.. -++.||++-+...+ ..-| ..-||..-.+-|+.-.
T Consensus 252 ~Lyll~t~-g~s~~V~L~k~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr~~~v~df~egpRN~~~fnp~g~ii~lAG 330 (566)
T KOG2315|consen 252 TLYLLATQ-GESVSVPLLKEGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLRGKPVFDFPEGPRNTAFFNPHGNIILLAG 330 (566)
T ss_pred eEEEEEec-CceEEEecCCCCCceEEEECCCCCEEEEEEecccceEEEEcCCCCEeEeCCCCCccceEECCCCCEEEEee
Confidence 34455555 55444444568999999999999999976322 35666666443321 1222 3347777666666655
Q ss_pred CCC
Q 001090 83 YGN 85 (1159)
Q Consensus 83 ~g~ 85 (1159)
.||
T Consensus 331 FGN 333 (566)
T KOG2315|consen 331 FGN 333 (566)
T ss_pred cCC
Confidence 555
No 348
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=38.57 E-value=33 Score=42.48 Aligned_cols=57 Identities=26% Similarity=0.279 Sum_probs=45.5
Q ss_pred CCcccCCcEEEEEccC-Ce------eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWE-GI------PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~-G~------~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+||+.|.+|.||-|.. |. |--+|+.|.-+|.|+.|=-|-+++|++| |-|++|+-=-|
T Consensus 751 iSASkDKTVKLWSik~EgD~~~tsaCQfTY~aHkk~i~~igfL~~lr~i~ScD--~giHlWDPFig 814 (1034)
T KOG4190|consen 751 ISASKDKTVKLWSIKPEGDEIGTSACQFTYQAHKKPIHDIGFLADLRSIASCD--GGIHLWDPFIG 814 (1034)
T ss_pred eeccCCceEEEEEeccccCccccceeeeEhhhccCcccceeeeeccceeeecc--Ccceeeccccc
Confidence 5899999999999852 22 4446778899999999999999999997 56778875333
No 349
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=38.27 E-value=26 Score=41.42 Aligned_cols=59 Identities=15% Similarity=0.199 Sum_probs=49.8
Q ss_pred CCcccCCcEEEEEccCCeeeEE-eecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRI-YEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~-f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g 59 (1159)
+|+..|-.|++|||--+.-+.. ..++...|-.+.-.+--+.+++.+++|.|.+|+.-..
T Consensus 213 fSg~~d~~vi~wdigg~~g~~~el~gh~~kV~~l~~~~~t~~l~S~~edg~i~~w~mn~~ 272 (404)
T KOG1409|consen 213 FSGASDHSVIMWDIGGRKGTAYELQGHNDKVQALSYAQHTRQLISCGEDGGIVVWNMNVK 272 (404)
T ss_pred EeccccCceEEEeccCCcceeeeeccchhhhhhhhhhhhheeeeeccCCCeEEEEeccce
Confidence 5788999999999987765443 3467788888889999999999999999999998765
No 350
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=38.02 E-value=33 Score=41.03 Aligned_cols=42 Identities=17% Similarity=0.070 Sum_probs=33.4
Q ss_pred cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCc
Q 001090 25 ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAK 66 (1159)
Q Consensus 25 ~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p 66 (1159)
-..-.|-.++-|+||+.+|.|.-+|.+.||..-.-+.+...|
T Consensus 279 ~~~~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq~~~~vk 320 (398)
T KOG0771|consen 279 KRFKSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQRLQYVK 320 (398)
T ss_pred hccCcceeEEEcCCCcEEEEeccCCcEEEEEeceeeeeEeeh
Confidence 345689999999999999999999999999965544433333
No 351
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=36.96 E-value=69 Score=36.48 Aligned_cols=58 Identities=21% Similarity=0.392 Sum_probs=42.7
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
++.+|.|.-+|..+|+.+-.+..+.. + ......+|..+.+++..|.|+-|+.-+|+..
T Consensus 71 ~~~~g~v~a~d~~tG~~~W~~~~~~~-~-~~~p~v~~~~v~v~~~~g~l~ald~~tG~~~ 128 (377)
T TIGR03300 71 ADADGTVVALDAETGKRLWRVDLDER-L-SGGVGADGGLVFVGTEKGEVIALDAEDGKEL 128 (377)
T ss_pred ECCCCeEEEEEccCCcEeeeecCCCC-c-ccceEEcCCEEEEEcCCCEEEEEECCCCcEe
Confidence 56789999999999999877663321 1 1122346788888889999999998777643
No 352
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=36.81 E-value=43 Score=43.13 Aligned_cols=53 Identities=19% Similarity=0.316 Sum_probs=41.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCC-----CCEEEEecCceeEEEEe
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPD-----GASIILSDDVGQLYILN 55 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPD-----Gt~~a~tD~~G~l~Ifg 55 (1159)
+|++.||+|+|--+.++.-+.+|.-+ .++..++|+|| ..+|++|+..| |.++.
T Consensus 87 asCS~DGkv~I~sl~~~~~~~~~df~-rpiksial~Pd~~~~~sk~fv~GG~ag-lvL~e 144 (846)
T KOG2066|consen 87 ASCSDDGKVVIGSLFTDDEITQYDFK-RPIKSIALHPDFSRQQSKQFVSGGMAG-LVLSE 144 (846)
T ss_pred EEecCCCcEEEeeccCCccceeEecC-CcceeEEeccchhhhhhhheeecCcce-EEEeh
Confidence 47788888888888888877666543 46778888998 77888888888 77765
No 353
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=36.04 E-value=43 Score=38.94 Aligned_cols=43 Identities=16% Similarity=0.156 Sum_probs=35.8
Q ss_pred ccCCcEEEEEcc-CCeeeEEeecCcceeeeeeeCCCCCEEEEec
Q 001090 4 GYDGKTIVWDIW-EGIPIRIYEISRFRLVDGKFSPDGASIILSD 46 (1159)
Q Consensus 4 G~DG~I~lWDI~-~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD 46 (1159)
..-|.|-|||+. +-..|.-|..+-..-.++.|.|||.+||++.
T Consensus 74 ~g~G~IgVyd~~~~~~ri~E~~s~GIGPHel~l~pDG~tLvVAN 117 (305)
T PF07433_consen 74 TGRGVIGVYDAARGYRRIGEFPSHGIGPHELLLMPDGETLVVAN 117 (305)
T ss_pred CCcEEEEEEECcCCcEEEeEecCCCcChhhEEEcCCCCEEEEEc
Confidence 346899999999 6668888998888888999999997777653
No 354
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=35.19 E-value=61 Score=42.71 Aligned_cols=29 Identities=21% Similarity=0.422 Sum_probs=26.8
Q ss_pred cceeeeeeeCCCCCEEEEecCceeEEEEe
Q 001090 27 RFRLVDGKFSPDGASIILSDDVGQLYILN 55 (1159)
Q Consensus 27 ~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg 55 (1159)
.+.|..++||||+..||.++..|+|.+..
T Consensus 120 d~GI~a~~WSPD~Ella~vT~~~~l~~mt 148 (928)
T PF04762_consen 120 DSGILAASWSPDEELLALVTGEGNLLLMT 148 (928)
T ss_pred cCcEEEEEECCCcCEEEEEeCCCEEEEEe
Confidence 56899999999999999999999999985
No 355
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=34.76 E-value=38 Score=45.36 Aligned_cols=54 Identities=11% Similarity=0.118 Sum_probs=45.1
Q ss_pred CcEEEEEccCCeeeEEe--ecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 7 GKTIVWDIWEGIPIRIY--EISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f--~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+.|++||+..-..+-.. .-.+|.|-+++.+|-|..|+.|.+.|.|.+||+-++.
T Consensus 1173 ~~iv~~D~r~~~~~w~lk~~~~hG~vTSi~idp~~~WlviGts~G~l~lWDLRF~~ 1228 (1431)
T KOG1240|consen 1173 SRIVSWDTRMRHDAWRLKNQLRHGLVTSIVIDPWCNWLVIGTSRGQLVLWDLRFRV 1228 (1431)
T ss_pred cceEEecchhhhhHHhhhcCccccceeEEEecCCceEEEEecCCceEEEEEeecCc
Confidence 56889998766544333 3569999999999999999999999999999988764
No 356
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=34.52 E-value=59 Score=39.48 Aligned_cols=59 Identities=19% Similarity=0.214 Sum_probs=41.1
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCC-EEEEecCceeEEEEecCCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGA-SIILSDDVGQLYILNTGQG 59 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt-~~a~tD~~G~l~Ifg~G~g 59 (1159)
|||||||+|.|-|...-.+.-..-.=.+.|--++|-|-.. +|.++.++|.|+-|++-..
T Consensus 303 LsGs~D~~V~l~D~R~~~~s~~~wk~~g~VEkv~w~~~se~~f~~~tddG~v~~~D~R~~ 362 (463)
T KOG0270|consen 303 LSGSYDGTVALKDCRDPSNSGKEWKFDGEVEKVAWDPHSENSFFVSTDDGTVYYFDIRNP 362 (463)
T ss_pred EeccccceEEeeeccCccccCceEEeccceEEEEecCCCceeEEEecCCceEEeeecCCC
Confidence 6899999999999974111111111235566677777665 4668999999999998654
No 357
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=34.26 E-value=94 Score=36.65 Aligned_cols=76 Identities=17% Similarity=0.185 Sum_probs=57.5
Q ss_pred CCcEEEEEccCCee--eEEeecCcceeeeeeeCCCCCEEEEecCceeE-EEEecCCCCCccC--------CccccccccC
Q 001090 6 DGKTIVWDIWEGIP--IRIYEISRFRLVDGKFSPDGASIILSDDVGQL-YILNTGQGESQKD--------AKYDQFFLGD 74 (1159)
Q Consensus 6 DG~I~lWDI~~G~~--Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l-~Ifg~G~g~~~~~--------~p~EQFF~tD 74 (1159)
=|+|.|=|+..-+. -.+...|...|--++..-||+.+|++...|+| .||++-.|..... ..|.--|+.|
T Consensus 158 ~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~ 237 (346)
T KOG2111|consen 158 TGQVQIVDLASTKPNAPSIINAHDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSPN 237 (346)
T ss_pred cceEEEEEhhhcCcCCceEEEcccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCCC
Confidence 47888888876665 23455788999999999999999999999996 6999988864321 2344468888
Q ss_pred Cccceec
Q 001090 75 YRPLVQD 81 (1159)
Q Consensus 75 YrpLirD 81 (1159)
--.|.-.
T Consensus 238 ~s~Lavs 244 (346)
T KOG2111|consen 238 SSWLAVS 244 (346)
T ss_pred ccEEEEE
Confidence 7766554
No 358
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=33.75 E-value=64 Score=37.35 Aligned_cols=56 Identities=21% Similarity=0.251 Sum_probs=44.3
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
|.|.|.+.+-++.+|.++-.|..-..-=......|||..|-+|..+|+++.+|.-+
T Consensus 69 GCy~g~lYfl~~~tGs~~w~f~~~~~vk~~a~~d~~~glIycgshd~~~yalD~~~ 124 (354)
T KOG4649|consen 69 GCYSGGLYFLCVKTGSQIWNFVILETVKVRAQCDFDGGLIYCGSHDGNFYALDPKT 124 (354)
T ss_pred EEccCcEEEEEecchhheeeeeehhhhccceEEcCCCceEEEecCCCcEEEecccc
Confidence 67889999999999999998875322222334568999999999999999999744
No 359
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=33.35 E-value=1.1e+02 Score=35.60 Aligned_cols=56 Identities=29% Similarity=0.472 Sum_probs=39.3
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ 62 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~ 62 (1159)
++.||.+...|+.+|+.+=.+..+. ..+ +..+|..|.+++.+|+|+.++...|+..
T Consensus 262 ~~~~g~l~ald~~tG~~~W~~~~~~--~~~--~~~~~~~vy~~~~~g~l~ald~~tG~~~ 317 (394)
T PRK11138 262 LAYNGNLVALDLRSGQIVWKREYGS--VND--FAVDGGRIYLVDQNDRVYALDTRGGVEL 317 (394)
T ss_pred EEcCCeEEEEECCCCCEEEeecCCC--ccC--cEEECCEEEEEcCCCeEEEEECCCCcEE
Confidence 4578889999999998775443221 111 2336777888889999999999888643
No 360
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=32.93 E-value=62 Score=38.12 Aligned_cols=60 Identities=15% Similarity=0.176 Sum_probs=50.4
Q ss_pred CCcccCCcEEEEEc-cCCe--eeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 1 MSAGYDGKTIVWDI-WEGI--PIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 1 lSAG~DG~I~lWDI-~~G~--~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
+|+++|.+-.||-. ..|+ +.-++.....+..-++|||.+..||++...-.|.|+=+.+-.
T Consensus 71 vtcs~drnayVw~~~~~~~WkptlvLlRiNrAAt~V~WsP~enkFAVgSgar~isVcy~E~EN 133 (361)
T KOG1523|consen 71 VTCSHDRNAYVWTQPSGGTWKPTLVLLRINRAATCVKWSPKENKFAVGSGARLISVCYYEQEN 133 (361)
T ss_pred eEccCCCCccccccCCCCeeccceeEEEeccceeeEeecCcCceEEeccCccEEEEEEEeccc
Confidence 47899999999999 6665 444566788999999999999999999999999998776543
No 361
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=32.76 E-value=1e+02 Score=36.09 Aligned_cols=54 Identities=13% Similarity=0.181 Sum_probs=44.1
Q ss_pred cCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEec-CceeEEEEecCCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSD-DVGQLYILNTGQG 59 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD-~~G~l~Ifg~G~g 59 (1159)
.+++|-+-|-.++..++....|..+ ..++|+|||..+.++| ..+.|++|++-..
T Consensus 138 ~~~~vsvid~~t~~~~~~~~vG~~P-~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~ 192 (381)
T COG3391 138 GNNTVSVIDAATNKVTATIPVGNTP-TGVAVDPDGNKVYVTNSDDNTVSVIDTSGN 192 (381)
T ss_pred CCceEEEEeCCCCeEEEEEecCCCc-ceEEECCCCCeEEEEecCCCeEEEEeCCCc
Confidence 3678888888999988885555545 9999999999888887 8999999996444
No 362
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=32.54 E-value=22 Score=49.13 Aligned_cols=65 Identities=15% Similarity=0.220 Sum_probs=48.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCcccc----ccccCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQ----FFLGDY 75 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQ----FF~tDY 75 (1159)
+|||-+|.|+||||..-+.++.|.. +| --.+|+.+.+-|.|-||.+-.-......|.|. ||..-.
T Consensus 2352 isggr~G~v~l~D~rqrql~h~~~~-----~~-----~~~~f~~~ss~g~ikIw~~s~~~ll~~~p~e~ak~gfFr~~g 2420 (2439)
T KOG1064|consen 2352 ISGGRKGEVCLFDIRQRQLRHTFQA-----LD-----TREYFVTGSSEGNIKIWRLSEFGLLHTFPSEHAKQGFFRNIG 2420 (2439)
T ss_pred EecCCcCcEEEeehHHHHHHHHhhh-----hh-----hhheeeccCcccceEEEEccccchhhcCchhhcccchhhhcC
Confidence 5899999999999999888877664 32 45689999999999999986544445555543 655443
No 363
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=30.67 E-value=40 Score=39.52 Aligned_cols=57 Identities=19% Similarity=0.198 Sum_probs=41.2
Q ss_pred cccCCcEEEEEccCCe-eeE---Eee------------cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 3 AGYDGKTIVWDIWEGI-PIR---IYE------------ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~-~Ir---~f~------------~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
++.-|+|.|-|+.... |.. .|. +--..|.|++|||.|++|++-|- -++.||++-..+
T Consensus 240 SsSkG~Ikl~DlRq~alcdn~~klfe~~~D~v~~~ff~eivsSISD~kFs~ngryIlsRdy-ltvkiwDvnm~k 312 (460)
T COG5170 240 SSSKGEIKLNDLRQSALCDNSKKLFELTIDGVDVDFFEEIVSSISDFKFSDNGRYILSRDY-LTVKIWDVNMAK 312 (460)
T ss_pred ecCCCcEEehhhhhhhhccCchhhhhhccCcccchhHHHHhhhhcceEEcCCCcEEEEecc-ceEEEEeccccc
Confidence 4567899999987433 221 222 22468999999999999988764 589999987654
No 364
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=30.67 E-value=1.3e+02 Score=33.73 Aligned_cols=72 Identities=22% Similarity=0.343 Sum_probs=45.6
Q ss_pred CCcEEEEEccCCeeeEEeec------------CcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCC--c----cCCcc
Q 001090 6 DGKTIVWDIWEGIPIRIYEI------------SRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGES--Q----KDAKY 67 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~------------g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~--~----~~~p~ 67 (1159)
...|++=|+..|..+-.|.. ....+.+++|-+|+..|+++|..|.|++|.+-.... . +-...
T Consensus 139 ~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~gnl~~l~~~~~~~~~~~~~~~L~~~ 218 (321)
T PF03178_consen 139 KNYILVGDAMKSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKDGNLFVLRYNPEIPNSRDGDPKLERI 218 (321)
T ss_dssp TTEEEEEESSSSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETTSEEEEEEE-SS-SSTTTTTTBEEEE
T ss_pred ccEEEEEEcccCEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCCCeEEEEEECCCCcccccccccceeE
Confidence 34566667766666555541 256799999999999999999999999999864221 1 11234
Q ss_pred ccccccCCcc
Q 001090 68 DQFFLGDYRP 77 (1159)
Q Consensus 68 EQFF~tDYrp 77 (1159)
-+|+.+|.--
T Consensus 219 ~~f~lg~~v~ 228 (321)
T PF03178_consen 219 SSFHLGDIVN 228 (321)
T ss_dssp EEEE-SS-EE
T ss_pred EEEECCCccc
Confidence 5577777643
No 365
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=30.21 E-value=53 Score=39.51 Aligned_cols=48 Identities=23% Similarity=0.129 Sum_probs=33.1
Q ss_pred cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEE-EecCceeEEEEec
Q 001090 8 KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASII-LSDDVGQLYILNT 56 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a-~tD~~G~l~Ifg~ 56 (1159)
.|.+-|+.++..-+ +..+.+--..--|||||+.|| ++|..|+--||=+
T Consensus 263 ~iy~~dl~~~~~~~-Lt~~~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~ 311 (425)
T COG0823 263 DIYLMDLDGKNLPR-LTNGFGINTSPSWSPDGSKIVFTSDRGGRPQIYLY 311 (425)
T ss_pred cEEEEcCCCCccee-cccCCccccCccCCCCCCEEEEEeCCCCCcceEEE
Confidence 46677888888444 555544444888999999999 6777776544433
No 366
>KOG2727 consensus Rab3 GTPase-activating protein, non-catalytic subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=30.16 E-value=41 Score=43.99 Aligned_cols=29 Identities=24% Similarity=0.425 Sum_probs=26.0
Q ss_pred eeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 32 DGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 32 D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
.+..||+||+.|+||+.|+++++|+..+-
T Consensus 326 ~lslSP~gtlAAVTD~lgRVlLlDta~~i 354 (1244)
T KOG2727|consen 326 KLSLSPSGTLAAVTDSLGRVLLLDTAALI 354 (1244)
T ss_pred eeeeCCCccEEEEecccCcEEEEehhhhh
Confidence 34789999999999999999999998764
No 367
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=30.01 E-value=52 Score=25.70 Aligned_cols=25 Identities=16% Similarity=0.512 Sum_probs=19.9
Q ss_pred CCCCEEEEec-CceeEEEEecCCCCC
Q 001090 37 PDGASIILSD-DVGQLYILNTGQGES 61 (1159)
Q Consensus 37 PDGt~~a~tD-~~G~l~Ifg~G~g~~ 61 (1159)
|||..|.+++ ..++|.+|+.+.+..
T Consensus 1 pd~~~lyv~~~~~~~v~~id~~~~~~ 26 (42)
T TIGR02276 1 PDGTKLYVTNSGSNTVSVIDTATNKV 26 (42)
T ss_pred CCCCEEEEEeCCCCEEEEEECCCCeE
Confidence 7888777766 589999999877653
No 368
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=28.97 E-value=77 Score=37.63 Aligned_cols=99 Identities=19% Similarity=0.118 Sum_probs=58.2
Q ss_pred cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCccceecCCCCcc
Q 001090 8 KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQDTYGNVL 87 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLirD~~g~vl 87 (1159)
-+.+||-----.|-.|+.| ..|....|||=|+.||+|..+|.|.|+. -+.+.|.-.|.|- --++|.--.+|
T Consensus 192 ~laVwd~~Leykv~aYe~~-lG~k~v~wsP~~qflavGsyD~~lrvln-----h~tWk~f~eflhl---~s~~dp~~~~~ 262 (447)
T KOG4497|consen 192 WLAVWDNVLEYKVYAYERG-LGLKFVEWSPCNQFLAVGSYDQMLRVLN-----HFTWKPFGEFLHL---CSYHDPTLHLL 262 (447)
T ss_pred EEEEecchhhheeeeeeec-cceeEEEeccccceEEeeccchhhhhhc-----eeeeeehhhhccc---hhccCchhhhh
Confidence 4556664333333233322 3466677788777777777777777765 2333444445443 23567777788
Q ss_pred ccccCcCCcccC--CCCCcccCCCCCCCch
Q 001090 88 DQETQLAPHRRN--LQDPLCDSAMIPYPEP 115 (1159)
Q Consensus 88 De~Tq~~PHL~~--~~~~L~D~~~~PYp~~ 115 (1159)
-.+|.-+|.|.. +.-+=.|..+.|.+..
T Consensus 263 ~ke~~~~~ql~~~cLsf~p~~~~a~~~~~s 292 (447)
T KOG4497|consen 263 EKETFSIVQLLHHCLSFTPTDLEAHIWEES 292 (447)
T ss_pred hhhhcchhhhcccccccCCCccccCccccc
Confidence 888877776631 1223357777777754
No 369
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=28.90 E-value=88 Score=36.23 Aligned_cols=74 Identities=18% Similarity=0.187 Sum_probs=57.6
Q ss_pred CCcccCCcEEEEEccCC--e-eeEEee-cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCcc------ccc
Q 001090 1 MSAGYDGKTIVWDIWEG--I-PIRIYE-ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKY------DQF 70 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G--~-~Ir~f~-~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~------EQF 70 (1159)
+|+|.|-+++.|.+..- . +|+++. ..+..|-.+..-|||+-||++-=+|+|.||..-+...+...+| ..-
T Consensus 221 isgga~dkl~~~Sl~~s~gslq~~~e~~lknpGv~gvrIRpD~KIlATAGWD~RiRVyswrtl~pLAVLkyHsagvn~vA 300 (323)
T KOG0322|consen 221 ISGGADDKLVMYSLNHSTGSLQIRKEITLKNPGVSGVRIRPDGKILATAGWDHRIRVYSWRTLNPLAVLKYHSAGVNAVA 300 (323)
T ss_pred cCCCccccceeeeeccccCcccccceEEecCCCccceEEccCCcEEeecccCCcEEEEEeccCCchhhhhhhhcceeEEE
Confidence 47889999999988654 3 677775 4677888899999999999999999999999877665443332 335
Q ss_pred cccC
Q 001090 71 FLGD 74 (1159)
Q Consensus 71 F~tD 74 (1159)
|+.|
T Consensus 301 fspd 304 (323)
T KOG0322|consen 301 FSPD 304 (323)
T ss_pred eCCC
Confidence 6666
No 370
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=28.63 E-value=95 Score=36.83 Aligned_cols=57 Identities=7% Similarity=0.109 Sum_probs=50.5
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecC-ceeEEEEecCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDD-VGQLYILNTGQG 59 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~-~G~l~Ifg~G~g 59 (1159)
+||-.++=|+.--.+.++.++.+|.+.|.-++|-|||.+|-+|-. .-+|+.|++-..
T Consensus 226 gsY~q~~giy~~~~~~pl~llggh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~~ 283 (406)
T KOG2919|consen 226 GSYGQRVGIYNDDGRRPLQLLGGHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRYS 283 (406)
T ss_pred ecccceeeeEecCCCCceeeecccCCCeeeEEeccCcCeecccccCCCeEEEEeehhc
Confidence 688888889999999999999999999999999999999987765 678999998654
No 371
>KOG3026 consensus Splicing factor SPF30 [RNA processing and modification]
Probab=28.56 E-value=47 Score=37.55 Aligned_cols=29 Identities=21% Similarity=0.552 Sum_probs=25.8
Q ss_pred CCCCCCeEEEeeccCCCCCCcEEEEEEEEecCC
Q 001090 967 NWTCRDKCKVWWKNESDEDGSWWDGRVLSVKPK 999 (1159)
Q Consensus 967 nWsvGDRFrm~fe~ED~idg~Ww~GtIvgv~p~ 999 (1159)
.|.+|++|...|. .+++||..+|..+.++
T Consensus 90 ~w~vg~K~~A~~~----ddg~~y~AtIe~ita~ 118 (262)
T KOG3026|consen 90 GWKVGDKVQAVFS----DDGQIYDATIEHITAM 118 (262)
T ss_pred ccccCCEEEEeec----CCCceEEeehhhccCC
Confidence 8999999999994 4689999999999875
No 372
>PF11715 Nup160: Nucleoporin Nup120/160; InterPro: IPR021717 Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear core complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth []. The vertebrate NPC is estimated to contain between 30 and 60 different proteins. most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup 133, interacts with these two and itself plays a role in mRNA export []. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins []. ; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=28.26 E-value=37 Score=41.09 Aligned_cols=22 Identities=14% Similarity=0.264 Sum_probs=17.1
Q ss_pred CcccCCcEEEEEccCCeeeEEe
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIY 23 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f 23 (1159)
+-+.|++|+|||+.+|++|+.+
T Consensus 235 tl~~D~~LRiW~l~t~~~~~~~ 256 (547)
T PF11715_consen 235 TLSRDHTLRIWSLETGQCLATI 256 (547)
T ss_dssp EEETTSEEEEEETTTTCEEEEE
T ss_pred EEeCCCeEEEEECCCCeEEEEe
Confidence 4467888888888888887765
No 373
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=28.21 E-value=1.5e+02 Score=33.72 Aligned_cols=54 Identities=24% Similarity=0.283 Sum_probs=39.2
Q ss_pred cEEEEEccCCeeeEEeec------CcceeeeeeeCC-CC----CEEEEecCc-eeEEEEecCCCCC
Q 001090 8 KTIVWDIWEGIPIRIYEI------SRFRLVDGKFSP-DG----ASIILSDDV-GQLYILNTGQGES 61 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~------g~~~i~D~~FSP-DG----t~~a~tD~~-G~l~Ifg~G~g~~ 61 (1159)
+|.+||+.+++.|+.|.- ...-+-|+++-. +| .++-.+|.. +.|.+|++..+.+
T Consensus 35 KLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~~~~s 100 (287)
T PF03022_consen 35 KLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLATGKS 100 (287)
T ss_dssp EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETTTTEE
T ss_pred EEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeCCCcCcEEEEEccCCcE
Confidence 799999999999999973 245566766655 22 333356655 7999999998864
No 374
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=27.31 E-value=54 Score=36.82 Aligned_cols=55 Identities=27% Similarity=0.532 Sum_probs=0.0
Q ss_pred eeeeeeCCC-CCEEEEecCceeEEEEecCCCCCccCCccccccccCCccceecCCCCcc
Q 001090 30 LVDGKFSPD-GASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQDTYGNVL 87 (1159)
Q Consensus 30 i~D~~FSPD-Gt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLirD~~g~vl 87 (1159)
|-.+.|.|| |+.||++|..|.|+.+++ .|+.....+..-| .||.-+-.-.+|.++
T Consensus 24 ~SGLTy~pd~~tLfaV~d~~~~i~els~-~G~vlr~i~l~g~--~D~EgI~y~g~~~~v 79 (248)
T PF06977_consen 24 LSGLTYNPDTGTLFAVQDEPGEIYELSL-DGKVLRRIPLDGF--GDYEGITYLGNGRYV 79 (248)
T ss_dssp EEEEEEETTTTEEEEEETTTTEEEEEET-T--EEEEEE-SS---SSEEEEEE-STTEEE
T ss_pred ccccEEcCCCCeEEEEECCCCEEEEEcC-CCCEEEEEeCCCC--CCceeEEEECCCEEE
No 375
>PF08940 DUF1918: Domain of unknown function (DUF1918); InterPro: IPR015035 This domain is found in various hypothetical bacterial proteins, and has no known function. ; PDB: 2A7Y_A.
Probab=27.28 E-value=1.3e+02 Score=27.51 Aligned_cols=41 Identities=17% Similarity=0.282 Sum_probs=23.7
Q ss_pred CCCCeEEEeeccCCCCCCcEEEEEEEEecCCCCCCCCCCcceeEEEEcCC
Q 001090 969 TCRDKCKVWWKNESDEDGSWWDGRVLSVKPKSSEFPDSPWERYTVQYKTE 1018 (1159)
Q Consensus 969 svGDRFrm~fe~ED~idg~Ww~GtIvgv~p~dp~wPdSpWe~L~VrWDn~ 1018 (1159)
++|||+.+.=.. .+..-..|.|+++.-.+..=| |.|+|+++
T Consensus 4 ~vGD~lvv~g~~---vg~~~r~GeIveV~g~dG~PP------Y~VRw~D~ 44 (58)
T PF08940_consen 4 SVGDRLVVHGRT---VGQPDRHGEIVEVRGPDGSPP------YLVRWDDT 44 (58)
T ss_dssp -TTEEEEES-TT---TS--EEEEEEEE-S-SSS-S-------EEEEETTT
T ss_pred CCCCEEEEcCCc---CCCCCcEeEEEEEECCCCCCC------EEEEecCC
Confidence 589998884322 244557999999985432212 68999985
No 376
>KOG2130 consensus Phosphatidylserine-specific receptor PtdSerR, contains JmjC domain [Chromatin structure and dynamics; Signal transduction mechanisms]
Probab=27.23 E-value=1.1e+02 Score=36.03 Aligned_cols=96 Identities=22% Similarity=0.302 Sum_probs=56.3
Q ss_pred EEecCceeEEEEecCCCCCccC--Cc--c--cccc------------ccCCcccee----cCCCCcccccc---------
Q 001090 43 ILSDDVGQLYILNTGQGESQKD--AK--Y--DQFF------------LGDYRPLVQ----DTYGNVLDQET--------- 91 (1159)
Q Consensus 43 a~tD~~G~l~Ifg~G~g~~~~~--~p--~--EQFF------------~tDYrpLir----D~~g~vlDe~T--------- 91 (1159)
-++-++--|+||+--.|+..+. .. | -.|| -.+||.+|- ...+.-||...
T Consensus 130 ~~~RddsPLYiFDssFgE~~~~rkLl~dY~VPk~F~dDlF~y~g~e~RPpyRWfvmGParSGtsiHIDPlgTSAWNtll~ 209 (407)
T KOG2130|consen 130 KSTRDDSPLYIFDSSFGEHAPRRKLLEDYSVPKYFRDDLFQYLGEERRPPYRWFVMGPARSGTSIHIDPLGTSAWNTLLQ 209 (407)
T ss_pred hccccCCCeEEecchhhcccchhhhhhhcCcchhhhHHHHHhcCcccCCCceeEEecCCCCCceeEECCcchHHHHHHhh
Confidence 3566677899999777764331 10 1 1244 246777764 45556677642
Q ss_pred ----------CcCCcccCCCCCcccCCCCCCCc----hhHHhhhhhhhcccCccccCCccccccC
Q 001090 92 ----------QLAPHRRNLQDPLCDSAMIPYPE----PYQTMYQQRRLGALGIEWRPSSLKLAVG 142 (1159)
Q Consensus 92 ----------q~~PHL~~~~~~L~D~~~~PYp~----~~Q~~~~~rrl~~l~~e~~p~~~~~~vg 142 (1159)
+.||-|.. +-+|-.| -+|. .||..++.-++-.+.-|..|-++-|+-|
T Consensus 210 GhKrW~LfPp~~p~~lvk---v~~~e~g-~~~de~itwf~~~y~rt~~Pswp~E~kPIEc~q~pG 270 (407)
T KOG2130|consen 210 GHKRWVLFPPGTPPELVK---VTVDEGG-KQPDEIITWFSTIYPRTQLPSWPDEYKPIECLQKPG 270 (407)
T ss_pred ccceeEEcCCCCCCCcee---ecccccC-CCCcceechhhhccccccCCCCccccCCceeeecCC
Confidence 22222211 3345554 3332 4888888777777788888877777666
No 377
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=26.89 E-value=60 Score=37.25 Aligned_cols=55 Identities=16% Similarity=0.149 Sum_probs=32.4
Q ss_pred cCCcEEEEEccCCeeeEEeecCc-----------------ceeeeeeeCCCCCEEEEe--c--CceeEEEEecCCC
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISR-----------------FRLVDGKFSPDGASIILS--D--DVGQLYILNTGQG 59 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~-----------------~~i~D~~FSPDGt~~a~t--D--~~G~l~Ifg~G~g 59 (1159)
++++|.|+++.+|..+++=..|. ..-..+-|||||++||.. | ....+.+...+..
T Consensus 61 ~~~nly~~~~~~~~~~~lT~dg~~~i~nG~~dwvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~~~~~~~~ 136 (353)
T PF00930_consen 61 RDNNLYLRDLATGQETQLTTDGEPGIYNGVPDWVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYPLPDYSPP 136 (353)
T ss_dssp ETTEEEEESSTTSEEEESES--TTTEEESB--HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEEEEEESSS
T ss_pred ecCceEEEECCCCCeEEeccccceeEEcCccceeccccccccccceEECCCCCEEEEEEECCcCCceEEeeccCCc
Confidence 57899999999997666422221 112346699999999943 2 2334445444443
No 378
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=26.87 E-value=1.4e+02 Score=38.19 Aligned_cols=98 Identities=10% Similarity=0.082 Sum_probs=70.5
Q ss_pred EEEccCCeeeEEeec---------CcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCc--cccccccCCccce
Q 001090 11 VWDIWEGIPIRIYEI---------SRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAK--YDQFFLGDYRPLV 79 (1159)
Q Consensus 11 lWDI~~G~~Ir~f~~---------g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p--~EQFF~tDYrpLi 79 (1159)
+--+..|...|+|.. ...+|..+-++..-+.||+.|+.|...++|+.+++.+-+.| +---|.|--..++
T Consensus 431 ~vGlknGqV~kiF~dN~~PiLl~~~~tAvrClDINA~R~kLAvVDD~~~c~v~DI~t~elL~QEpg~nSV~wNT~~E~Ml 510 (1081)
T KOG1538|consen 431 LVGLKNGQVLKIFVDNLFPILLLKQATAVRCLDINASRKKLAVVDDNDTCLVYDIDTKELLFQEPGANSVAWNTQCEDML 510 (1081)
T ss_pred EEEecCCcEEEEEecCCchhheecccceeEEeeccCCcceEEEEccCCeEEEEEccCCceEeecCCCceEEeeccccceE
Confidence 346778999999973 35688889999999999999999999999999998665555 3335555544333
Q ss_pred -ecCCCCccccccCcCCcccCCCCCcccCC
Q 001090 80 -QDTYGNVLDQETQLAPHRRNLQDPLCDSA 108 (1159)
Q Consensus 80 -rD~~g~vlDe~Tq~~PHL~~~~~~L~D~~ 108 (1159)
...+|++==...-.|||..+|+|+.+-+-
T Consensus 511 cfT~~g~L~VR~~~~P~h~qk~~G~VvG~~ 540 (1081)
T KOG1538|consen 511 CFTGGGYLNVRASTFPVHRQKLQGFVVGYN 540 (1081)
T ss_pred EEecCCceEEEeccCCcchhcceEEEEEec
Confidence 33444443345556888888888766553
No 379
>PF05764 YL1: YL1 nuclear protein; InterPro: IPR008895 The proteins in this family are designated YL1 []. They have been shown to be DNA-binding and may be transcription factors [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=25.76 E-value=1e+02 Score=34.38 Aligned_cols=29 Identities=28% Similarity=0.485 Sum_probs=16.0
Q ss_pred ccccCCcccccccCCCCCccccccccccc
Q 001090 171 DVMDWEPENEVQSDDNDSEYNVAEEYSTE 199 (1159)
Q Consensus 171 d~~~wepe~ev~sdd~dseyn~~ee~ss~ 199 (1159)
|-++|-...=..=|++|.||+..++....
T Consensus 25 de~~~~~~~~f~Eee~D~ef~~~~~eed~ 53 (240)
T PF05764_consen 25 DEFFWNQYGLFQEEEDDEEFESEEEEEDE 53 (240)
T ss_pred hhhhhhhcccccccCCCccccCCCccccc
Confidence 55666554412225778899875443333
No 380
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=25.10 E-value=2.1e+02 Score=33.12 Aligned_cols=63 Identities=25% Similarity=0.362 Sum_probs=42.0
Q ss_pred cccCCcEEEEEccCCeeeEEeecCc---------ceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISR---------FRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDA 65 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~---------~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~ 65 (1159)
++.+|.|.-+|..+|+.+=.+.... .....+.-.-+|..|.+++..|.|+-|+..+|+.....
T Consensus 75 ~~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~~~~v~v~~~~g~l~ald~~tG~~~W~~ 146 (394)
T PRK11138 75 ADRAGLVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVAGGKVYIGSEKGQVYALNAEDGEVAWQT 146 (394)
T ss_pred ECCCCeEEEEECCCCcEeeEEcCCCcccccccccccccccccEEECCEEEEEcCCCEEEEEECCCCCCcccc
Confidence 4567888999999999886664321 11111111225667778889999999999888755443
No 381
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=24.44 E-value=1.1e+02 Score=35.81 Aligned_cols=52 Identities=19% Similarity=0.223 Sum_probs=44.4
Q ss_pred CcEEEEEccCCe-eeEEeecCcceeeeeeeCCCC-CEEEEecCceeEEEEecCC
Q 001090 7 GKTIVWDIWEGI-PIRIYEISRFRLVDGKFSPDG-ASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 7 G~I~lWDI~~G~-~Ir~f~~g~~~i~D~~FSPDG-t~~a~tD~~G~l~Ifg~G~ 58 (1159)
..|+|-||.--+ ++..+.+|.+.|-.+||.|-. .+|+++-++-+.+||++-+
T Consensus 266 ~~V~iLDiR~P~tpva~L~~H~a~VNgIaWaPhS~~hictaGDD~qaliWDl~q 319 (364)
T KOG0290|consen 266 NKVVILDIRVPCTPVARLRNHQASVNGIAWAPHSSSHICTAGDDCQALIWDLQQ 319 (364)
T ss_pred ceEEEEEecCCCcceehhhcCcccccceEecCCCCceeeecCCcceEEEEeccc
Confidence 468899998755 888899999999999999985 5677888899999999865
No 382
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=24.29 E-value=97 Score=40.17 Aligned_cols=38 Identities=16% Similarity=0.152 Sum_probs=30.0
Q ss_pred CCCEEEEecCceeEEEEecCCCC-----CccCCccccccccCC
Q 001090 38 DGASIILSDDVGQLYILNTGQGE-----SQKDAKYDQFFLGDY 75 (1159)
Q Consensus 38 DGt~~a~tD~~G~l~Ifg~G~g~-----~~~~~p~EQFF~tDY 75 (1159)
+|.+||++..+|++.|.|+++.. .++..++--.||.||
T Consensus 82 ~Gey~asCS~DGkv~I~sl~~~~~~~~~df~rpiksial~Pd~ 124 (846)
T KOG2066|consen 82 EGEYVASCSDDGKVVIGSLFTDDEITQYDFKRPIKSIALHPDF 124 (846)
T ss_pred CCceEEEecCCCcEEEeeccCCccceeEecCCcceeEEeccch
Confidence 69999999999999999999875 245555555677775
No 383
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=23.85 E-value=1.3e+02 Score=38.21 Aligned_cols=32 Identities=13% Similarity=0.118 Sum_probs=26.6
Q ss_pred ecCcceeeeeeeCCCCCEEE-EecCceeEEEEe
Q 001090 24 EISRFRLVDGKFSPDGASII-LSDDVGQLYILN 55 (1159)
Q Consensus 24 ~~g~~~i~D~~FSPDGt~~a-~tD~~G~l~Ifg 55 (1159)
..++|-|..+||.+||+.|+ +..+.=|-|||+
T Consensus 152 i~~~G~IhCACWT~DG~RLVVAvGSsLHSyiWd 184 (671)
T PF15390_consen 152 IKTSGLIHCACWTKDGQRLVVAVGSSLHSYIWD 184 (671)
T ss_pred ccCCceEEEEEecCcCCEEEEEeCCeEEEEEec
Confidence 35689999999999999988 556677888887
No 384
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=23.34 E-value=1.4e+02 Score=40.15 Aligned_cols=72 Identities=15% Similarity=0.176 Sum_probs=53.9
Q ss_pred cccCCcEEEEEccCCe---eeEEeecCcce--eeeeeeCCCCCE-EEEecCceeEEEEecCCCCCccCCccccccccCCc
Q 001090 3 AGYDGKTIVWDIWEGI---PIRIYEISRFR--LVDGKFSPDGAS-IILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYR 76 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~---~Ir~f~~g~~~--i~D~~FSPDGt~-~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYr 76 (1159)
|-.||.|+++|...-- .|.++..|... |.-+.|-+.|.- |+++...|.|.+|++-.. .+.++..+-.+-+|.
T Consensus 1227 GfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~--~~e~~~~iv~~~~yG 1304 (1387)
T KOG1517|consen 1227 GFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQDGDIQLLDLRMS--SKETFLTIVAHWEYG 1304 (1387)
T ss_pred eecCCceEEeecccCCccccceeecccCCcccceeEEeecCCCcceeeeccCCeEEEEecccC--cccccceeeeccccC
Confidence 4469999999997654 46778777655 999999998877 999999999999998653 333444444444444
No 385
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=23.30 E-value=1.2e+02 Score=40.78 Aligned_cols=57 Identities=14% Similarity=0.065 Sum_probs=45.2
Q ss_pred CCcccCCcEEEEEccCCeeeEEeec--Cc--c-eeeeeeeCCCCCEEEEecCceeEEEEecCC
Q 001090 1 MSAGYDGKTIVWDIWEGIPIRIYEI--SR--F-RLVDGKFSPDGASIILSDDVGQLYILNTGQ 58 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~~G~~Ir~f~~--g~--~-~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~ 58 (1159)
+||+.||-|.+||+..-.....+.. +. | ++..+...++--.||+|.. +.|-||++-.
T Consensus 1273 vSgs~~G~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~hapiiAsGs~-q~ikIy~~~G 1334 (1387)
T KOG1517|consen 1273 VSGSQDGDIQLLDLRMSSKETFLTIVAHWEYGSALTALTVHEHAPIIASGSA-QLIKIYSLSG 1334 (1387)
T ss_pred eeeccCCeEEEEecccCcccccceeeeccccCccceeeeeccCCCeeeecCc-ceEEEEecCh
Confidence 5899999999999987654444442 11 3 5888888999999999999 9999999753
No 386
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=22.84 E-value=1.4e+02 Score=35.98 Aligned_cols=51 Identities=24% Similarity=0.339 Sum_probs=0.0
Q ss_pred CCcEEEEEccCCeeeEEee-cCcceeee---------eeeCCCCCEEE-EecCceeEEEEec
Q 001090 6 DGKTIVWDIWEGIPIRIYE-ISRFRLVD---------GKFSPDGASII-LSDDVGQLYILNT 56 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~-~g~~~i~D---------~~FSPDGt~~a-~tD~~G~l~Ifg~ 56 (1159)
|-.|.|+|+++|...+.-. ...-.|.+ ..|||||+.|+ .+|-.|.-.||.+
T Consensus 319 ~p~i~~~~~~~~~~~~l~~h~~sw~v~~~~~q~~hPhp~FSPDgk~VlF~Sd~~G~~~vY~v 380 (386)
T PF14583_consen 319 DPWIYLFDVEAGRFRKLARHDTSWKVLDGDRQVTHPHPSFSPDGKWVLFRSDMEGPPAVYLV 380 (386)
T ss_dssp --EEEEEETTTTEEEEEEE-------BTTBSSTT----EE-TTSSEEEEEE-TTSS-EEEEE
T ss_pred CcEEEEeccccCceeeeeeccCcceeecCCCccCCCCCccCCCCCEEEEECCCCCCccEEEE
No 387
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=22.31 E-value=55 Score=43.45 Aligned_cols=57 Identities=12% Similarity=0.176 Sum_probs=42.5
Q ss_pred cCCcEEEEEccCCeeeEEee---cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCcc
Q 001090 5 YDGKTIVWDIWEGIPIRIYE---ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQK 63 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~---~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~ 63 (1159)
.-|+|.+-|+..- |+-|. ...++|-.++|+-||+.++.|=..|+|.||++-.++.++
T Consensus 107 s~ghvl~~d~~~n--L~~~~~ne~v~~~Vtsvafn~dg~~l~~G~~~G~V~v~D~~~~k~l~ 166 (1206)
T KOG2079|consen 107 SHGHVLLSDMTGN--LGPLHQNERVQGPVTSVAFNQDGSLLLAGLGDGHVTVWDMHRAKILK 166 (1206)
T ss_pred Cchhhhhhhhhcc--cchhhcCCccCCcceeeEecCCCceeccccCCCcEEEEEccCCccee
Confidence 3455555555332 22233 247899999999999999999999999999998876543
No 388
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=22.21 E-value=1.1e+02 Score=29.60 Aligned_cols=43 Identities=12% Similarity=0.136 Sum_probs=26.5
Q ss_pred CcccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEe
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILS 45 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~t 45 (1159)
.+..+|.|+-||..+|+.--...+ =.---.+++||||.+|+++
T Consensus 32 e~~~~GRll~ydp~t~~~~vl~~~-L~fpNGVals~d~~~vlv~ 74 (89)
T PF03088_consen 32 EGRPTGRLLRYDPSTKETTVLLDG-LYFPNGVALSPDESFVLVA 74 (89)
T ss_dssp HT---EEEEEEETTTTEEEEEEEE-ESSEEEEEE-TTSSEEEEE
T ss_pred cCCCCcCEEEEECCCCeEEEehhC-CCccCeEEEcCCCCEEEEE
Confidence 356789999999999985333332 1222346889999987765
No 389
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=21.69 E-value=1.6e+02 Score=36.90 Aligned_cols=72 Identities=14% Similarity=0.116 Sum_probs=49.6
Q ss_pred EEEEEccCCeeeEEee---cCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCc-cCC---ccccccccCCcccee
Q 001090 9 TIVWDIWEGIPIRIYE---ISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQ-KDA---KYDQFFLGDYRPLVQ 80 (1159)
Q Consensus 9 I~lWDI~~G~~Ir~f~---~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~-~~~---p~EQFF~tDYrpLir 80 (1159)
.+++++.+++.=++-. .-+..|..|+++|+-..|+.|=.+|.|.+|+...+.+. .++ |.--=.|.|..-++-
T Consensus 238 ~ciYE~~r~klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~DgSiiLyD~~~~~t~~~ka~~~P~~iaWHp~gai~~V 316 (545)
T PF11768_consen 238 SCIYECSRNKLQRVSVTSIPLPSQVICCARSPSEDKLVLGCEDGSIILYDTTRGVTLLAKAEFIPTLIAWHPDGAIFVV 316 (545)
T ss_pred EEEEEeecCceeEEEEEEEecCCcceEEecCcccceEEEEecCCeEEEEEcCCCeeeeeeecccceEEEEcCCCcEEEE
Confidence 3677887775333322 34788999999999999999999999999998776522 222 222236666554443
No 390
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=21.67 E-value=60 Score=37.93 Aligned_cols=38 Identities=21% Similarity=0.220 Sum_probs=30.6
Q ss_pred CCcccCCcEEEEEcc-CCeeeEEeecC-cceeeeeeeCCCCC
Q 001090 1 MSAGYDGKTIVWDIW-EGIPIRIYEIS-RFRLVDGKFSPDGA 40 (1159)
Q Consensus 1 lSAG~DG~I~lWDI~-~G~~Ir~f~~g-~~~i~D~~FSPDGt 40 (1159)
+||+||-+|++||.. =|++| |.+. .|.||-++++|-=.
T Consensus 227 ~TGsYDe~i~~~DtRnm~kPl--~~~~v~GGVWRi~~~p~~~ 266 (339)
T KOG0280|consen 227 ATGSYDECIRVLDTRNMGKPL--FKAKVGGGVWRIKHHPEIF 266 (339)
T ss_pred EEeccccceeeeehhcccCcc--ccCccccceEEEEecchhh
Confidence 479999999999998 68887 5543 47899999999543
No 391
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=21.62 E-value=2.9e+02 Score=31.74 Aligned_cols=53 Identities=13% Similarity=0.154 Sum_probs=40.6
Q ss_pred cCCcEEEEEccC--Ce--eeEEee-cCcceeeeeeeCCCCCEEEEec-CceeEEEEecCC
Q 001090 5 YDGKTIVWDIWE--GI--PIRIYE-ISRFRLVDGKFSPDGASIILSD-DVGQLYILNTGQ 58 (1159)
Q Consensus 5 ~DG~I~lWDI~~--G~--~Ir~f~-~g~~~i~D~~FSPDGt~~a~tD-~~G~l~Ifg~G~ 58 (1159)
.+|.|..|+|.. |. .|.... .|..+ --++++|||.+|+++. ..|.+.+|.+..
T Consensus 60 ~~g~v~~~~i~~~~g~L~~~~~~~~~g~~p-~~i~~~~~g~~l~vany~~g~v~v~~l~~ 118 (345)
T PF10282_consen 60 DSGGVSSYRIDPDTGTLTLLNSVPSGGSSP-CHIAVDPDGRFLYVANYGGGSVSVFPLDD 118 (345)
T ss_dssp TTTEEEEEEEETTTTEEEEEEEEEESSSCE-EEEEECTTSSEEEEEETTTTEEEEEEECT
T ss_pred CCCCEEEEEECCCcceeEEeeeeccCCCCc-EEEEEecCCCEEEEEEccCCeEEEEEccC
Confidence 578999999997 45 333333 34433 4588999999999998 589999999977
No 392
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=21.55 E-value=2.5e+02 Score=27.13 Aligned_cols=46 Identities=13% Similarity=0.190 Sum_probs=29.7
Q ss_pred CcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCc-eeEEEEe
Q 001090 7 GKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDV-GQLYILN 55 (1159)
Q Consensus 7 G~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~-G~l~Ifg 55 (1159)
|.|+.||-.+ .++-..+-..---+++|||+++|-+++.. +.|++|.
T Consensus 36 ~~Vvyyd~~~---~~~va~g~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~ 82 (86)
T PF01731_consen 36 GNVVYYDGKE---VKVVASGFSFANGIAISPDKKYLYVASSLAHSIHVYK 82 (86)
T ss_pred ceEEEEeCCE---eEEeeccCCCCceEEEcCCCCEEEEEeccCCeEEEEE
Confidence 6778887533 33333332223346789999999988876 6677765
No 393
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=20.77 E-value=71 Score=40.07 Aligned_cols=48 Identities=19% Similarity=0.274 Sum_probs=40.2
Q ss_pred cEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 8 KTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 8 ~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
.++|||=.+|+.+...+.-.. |-|+||=||+-+|-++-+.+.+..|=+
T Consensus 295 ~~kiWd~~~Gk~~asiEpt~~-lND~C~~p~sGm~f~Ane~~~m~~yyi 342 (703)
T KOG2321|consen 295 ILKIWDECTGKPMASIEPTSD-LNDFCFVPGSGMFFTANESSKMHTYYI 342 (703)
T ss_pred HhhhcccccCCceeeccccCC-cCceeeecCCceEEEecCCCcceeEEc
Confidence 578999999999887765444 999999999999999999887776643
No 394
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=20.61 E-value=85 Score=40.36 Aligned_cols=55 Identities=18% Similarity=0.287 Sum_probs=44.0
Q ss_pred CcccCCcEEEEEccCCeeeEEee--cCcceeeeeeeCCCCCEEEEecCceeEEEEec
Q 001090 2 SAGYDGKTIVWDIWEGIPIRIYE--ISRFRLVDGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 2 SAG~DG~I~lWDI~~G~~Ir~f~--~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
|+..+|.|++|=+.+|.-+--.. ....-|.++.|.-||+.|+-+=.+|.+.+=.+
T Consensus 88 tSDt~GlIiVWmlykgsW~EEMiNnRnKSvV~SmsWn~dG~kIcIvYeDGavIVGsv 144 (1189)
T KOG2041|consen 88 TSDTSGLIIVWMLYKGSWCEEMINNRNKSVVVSMSWNLDGTKICIVYEDGAVIVGSV 144 (1189)
T ss_pred ccCCCceEEEEeeecccHHHHHhhCcCccEEEEEEEcCCCcEEEEEEccCCEEEEee
Confidence 56789999999998887544333 34578999999999999999999998865444
No 395
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=20.49 E-value=1e+02 Score=38.29 Aligned_cols=51 Identities=24% Similarity=0.198 Sum_probs=30.0
Q ss_pred cCCcEEEEEccCCeeeEEeecCcce--ee-eeeeCCCCCEEEEecCceeEEEEec
Q 001090 5 YDGKTIVWDIWEGIPIRIYEISRFR--LV-DGKFSPDGASIILSDDVGQLYILNT 56 (1159)
Q Consensus 5 ~DG~I~lWDI~~G~~Ir~f~~g~~~--i~-D~~FSPDGt~~a~tD~~G~l~Ifg~ 56 (1159)
++.++++||+.+|..+..|.+-.-. .+ =++||=|-.+.|-. ..-.|+|+.+
T Consensus 106 ~~n~~~vwd~~sg~iv~sf~~~~q~~~~Wp~~k~s~~D~y~ARv-v~~sl~i~e~ 159 (561)
T COG5354 106 SKNNVFVWDIASGMIVFSFNGISQPYLGWPVLKFSIDDKYVARV-VGSSLYIHEI 159 (561)
T ss_pred ccCceeEEeccCceeEeeccccCCcccccceeeeeecchhhhhh-ccCeEEEEec
Confidence 4456777777777777777643222 33 56677777666633 2234556653
No 396
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=20.45 E-value=4.4e+02 Score=25.69 Aligned_cols=75 Identities=16% Similarity=0.228 Sum_probs=36.7
Q ss_pred CCcEEEEEccCCeeeEEeec---C-cceeeeeeeCCCCCEEEEecCceeEEEEecCCCCCccCCccccccccCCccceec
Q 001090 6 DGKTIVWDIWEGIPIRIYEI---S-RFRLVDGKFSPDGASIILSDDVGQLYILNTGQGESQKDAKYDQFFLGDYRPLVQD 81 (1159)
Q Consensus 6 DG~I~lWDI~~G~~Ir~f~~---g-~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~~~~~~p~EQFF~tDYrpLirD 81 (1159)
||.+++.+-..+.++ ... + ......+.|..||.. +..|..|.. ||..+... ....+--.+..|..-.|++
T Consensus 30 dgnlv~~~~~~~~~v--W~snt~~~~~~~~~l~l~~dGnL-vl~~~~g~~-vW~S~~~~--~~~~~~~~L~ddGnlvl~~ 103 (116)
T cd00028 30 DYNLILYKGSSRTVV--WVANRDNPSGSSCTLTLQSDGNL-VIYDGSGTV-VWSSNTTR--VNGNYVLVLLDDGNLVLYD 103 (116)
T ss_pred eEEEEEEeCCCCeEE--EECCCCCCCCCCEEEEEecCCCe-EEEcCCCcE-EEEecccC--CCCceEEEEeCCCCEEEEC
Confidence 777777665433322 321 1 123445777788854 444555643 66655432 1112222444555555555
Q ss_pred CCCCc
Q 001090 82 TYGNV 86 (1159)
Q Consensus 82 ~~g~v 86 (1159)
.+|.+
T Consensus 104 ~~~~~ 108 (116)
T cd00028 104 SDGNF 108 (116)
T ss_pred CCCCE
Confidence 55443
No 397
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=20.19 E-value=2.1e+02 Score=33.42 Aligned_cols=58 Identities=9% Similarity=-0.078 Sum_probs=39.4
Q ss_pred cccCCcEEEEEccCCeeeEEeecCcceeeeeeeCCCCCEEEEecCceeEEEEecCCCC
Q 001090 3 AGYDGKTIVWDIWEGIPIRIYEISRFRLVDGKFSPDGASIILSDDVGQLYILNTGQGE 60 (1159)
Q Consensus 3 AG~DG~I~lWDI~~G~~Ir~f~~g~~~i~D~~FSPDGt~~a~tD~~G~l~Ifg~G~g~ 60 (1159)
|+-|.++.+.++.-|..=.........+..++.||||..++++.+.-++..|.+.-..
T Consensus 134 ~sndht~k~~~~~~~s~~~~~h~~~~~~ns~~~snd~~~~~~Vgds~~Vf~y~id~~s 191 (344)
T KOG4532|consen 134 ASNDHTGKTMVVSGDSNKFAVHNQNLTQNSLHYSNDPSWGSSVGDSRRVFRYAIDDES 191 (344)
T ss_pred ccCCcceeEEEEecCcccceeeccccceeeeEEcCCCceEEEecCCCcceEEEeCCcc
Confidence 3445555555555444221111122559999999999999999999999999986643
Done!