Query 040693
Match_columns 382
No_of_seqs 120 out of 1512
Neff 8.8
Searched_HMMs 46136
Date Fri Mar 29 10:50:15 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/040693.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/040693hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR03075 PQQ_enz_alc_DH PQQ-d 100.0 9E-38 1.9E-42 314.1 35.1 313 1-361 134-524 (527)
2 cd00216 PQQ_DH Dehydrogenases 100.0 4.9E-34 1.1E-38 286.2 33.7 332 1-382 124-488 (488)
3 TIGR03074 PQQ_membr_DH membran 100.0 1.2E-33 2.5E-38 291.8 35.7 339 1-382 274-763 (764)
4 PRK11138 outer membrane biogen 100.0 2E-33 4.4E-38 275.3 30.1 267 1-382 83-363 (394)
5 TIGR03300 assembly_YfgL outer 100.0 1.2E-31 2.7E-36 261.3 30.4 263 1-382 79-348 (377)
6 PRK11138 outer membrane biogen 100.0 2.5E-31 5.4E-36 260.5 30.4 268 5-381 42-321 (394)
7 TIGR03300 assembly_YfgL outer 100.0 2E-30 4.2E-35 252.8 31.0 258 5-380 38-305 (377)
8 cd00216 PQQ_DH Dehydrogenases 100.0 2.7E-30 5.8E-35 259.2 28.6 300 1-382 75-434 (488)
9 TIGR03075 PQQ_enz_alc_DH PQQ-d 100.0 4.6E-27 9.9E-32 236.6 32.0 296 4-382 44-500 (527)
10 PF13360 PQQ_2: PQQ-like domai 99.9 1.4E-25 3.1E-30 204.2 27.3 227 54-380 2-238 (238)
11 PF13360 PQQ_2: PQQ-like domai 99.9 3.7E-24 8.1E-29 194.8 28.0 225 1-337 7-238 (238)
12 TIGR03074 PQQ_membr_DH membran 99.9 3.1E-24 6.8E-29 222.0 30.4 282 3-338 160-486 (764)
13 COG4993 Gcd Glucose dehydrogen 99.9 7.8E-23 1.7E-27 197.7 27.2 340 1-382 286-771 (773)
14 COG1520 FOG: WD40-like repeat 99.9 1.1E-21 2.4E-26 190.7 29.4 273 3-382 39-332 (370)
15 COG1520 FOG: WD40-like repeat 99.9 3.6E-19 7.9E-24 173.1 28.3 262 1-362 82-356 (370)
16 KOG4649 PQQ (pyrrolo-quinoline 99.8 2.7E-18 5.9E-23 150.9 22.9 254 1-374 37-353 (354)
17 KOG4649 PQQ (pyrrolo-quinoline 99.8 1.1E-16 2.4E-21 140.9 21.7 244 10-380 2-256 (354)
18 COG4993 Gcd Glucose dehydrogen 99.8 7.3E-17 1.6E-21 156.7 22.0 270 6-337 183-496 (773)
19 TIGR02658 TTQ_MADH_Hv methylam 98.9 8.5E-07 1.8E-11 84.7 24.9 280 2-380 32-338 (352)
20 TIGR03866 PQQ_ABC_repeats PQQ- 98.8 2.7E-05 5.8E-10 72.3 30.2 263 2-381 16-288 (300)
21 TIGR02658 TTQ_MADH_Hv methylam 98.8 4.5E-06 9.8E-11 79.8 25.0 135 30-234 11-147 (352)
22 TIGR03866 PQQ_ABC_repeats PQQ- 98.7 3.1E-05 6.7E-10 71.9 27.0 184 54-338 10-196 (300)
23 cd00200 WD40 WD40 domain, foun 98.5 8.8E-05 1.9E-09 67.0 25.3 208 54-362 72-284 (289)
24 cd00200 WD40 WD40 domain, foun 98.5 6.6E-05 1.4E-09 67.9 23.1 221 54-379 30-256 (289)
25 PF13570 PQQ_3: PQQ-like domai 98.4 3E-07 6.5E-12 59.1 4.4 40 7-64 1-40 (40)
26 PF02239 Cytochrom_D1: Cytochr 98.4 5.4E-05 1.2E-09 73.5 21.0 194 54-341 15-214 (369)
27 KOG0296 Angio-associated migra 98.4 0.00013 2.9E-09 67.9 21.0 183 143-368 212-396 (399)
28 PF08450 SGL: SMP-30/Gluconola 98.4 0.00024 5.2E-09 64.9 23.0 120 215-359 115-245 (246)
29 PF14269 Arylsulfotran_2: Aryl 98.3 0.00091 2E-08 63.0 26.1 186 142-339 95-298 (299)
30 PF05096 Glu_cyclase_2: Glutam 98.2 9.3E-05 2E-09 67.4 17.2 144 142-340 67-214 (264)
31 PF06433 Me-amine-dh_H: Methyl 98.2 0.00019 4.1E-09 67.5 18.3 126 205-339 195-330 (342)
32 PF01011 PQQ: PQQ enzyme repea 98.2 4.3E-06 9.3E-11 53.0 5.0 37 310-348 1-37 (38)
33 PF13570 PQQ_3: PQQ-like domai 98.2 1.8E-06 4E-11 55.4 3.3 40 331-373 1-40 (40)
34 PF02239 Cytochrom_D1: Cytochr 98.2 0.00014 2.9E-09 70.8 17.6 249 2-357 21-286 (369)
35 PF10282 Lactonase: Lactonase, 98.1 0.00087 1.9E-08 64.6 22.9 157 206-379 156-331 (345)
36 KOG2048 WD40 repeat protein [G 98.1 0.00059 1.3E-08 68.2 21.4 202 30-340 79-286 (691)
37 KOG0296 Angio-associated migra 98.1 0.0015 3.2E-08 61.1 21.9 152 205-381 202-365 (399)
38 PF01011 PQQ: PQQ enzyme repea 98.0 8.7E-06 1.9E-10 51.6 4.3 22 54-75 9-30 (38)
39 PF05935 Arylsulfotrans: Aryls 98.0 0.0022 4.7E-08 64.6 23.5 147 54-297 127-313 (477)
40 PF06433 Me-amine-dh_H: Methyl 98.0 0.0031 6.8E-08 59.5 22.4 272 2-379 22-327 (342)
41 PF05935 Arylsulfotrans: Aryls 97.9 0.0031 6.8E-08 63.5 23.2 280 6-341 136-468 (477)
42 PF09910 DUF2139: Uncharacteri 97.9 0.0044 9.5E-08 56.8 21.5 207 54-333 8-234 (339)
43 PRK11028 6-phosphogluconolacto 97.9 0.011 2.3E-07 56.5 26.0 149 206-375 92-263 (330)
44 KOG0316 Conserved WD40 repeat- 97.9 0.00068 1.5E-08 59.9 15.6 192 142-362 80-293 (307)
45 KOG0291 WD40-repeat-containing 97.9 0.0038 8.2E-08 63.5 22.8 149 205-375 362-513 (893)
46 PF05096 Glu_cyclase_2: Glutam 97.9 0.0021 4.6E-08 58.6 18.9 164 23-295 46-213 (264)
47 smart00564 PQQ beta-propeller 97.8 4.2E-05 9.1E-10 46.6 4.5 31 28-72 3-33 (33)
48 PRK11028 6-phosphogluconolacto 97.8 0.015 3.3E-07 55.4 24.1 153 206-377 138-311 (330)
49 PF10282 Lactonase: Lactonase, 97.7 0.051 1.1E-06 52.4 27.4 234 57-378 17-283 (345)
50 smart00564 PQQ beta-propeller 97.7 9.5E-05 2E-09 45.0 4.6 32 305-338 2-33 (33)
51 KOG2055 WD40 repeat protein [G 97.6 0.0039 8.5E-08 60.0 15.9 177 143-369 237-416 (514)
52 PLN02919 haloacid dehalogenase 97.6 0.19 4E-06 55.7 31.0 148 207-371 697-889 (1057)
53 PTZ00421 coronin; Provisional 97.6 0.11 2.3E-06 52.6 27.1 150 205-379 138-297 (493)
54 KOG2103 Uncharacterized conser 97.5 0.005 1.1E-07 63.3 16.8 200 5-329 22-227 (910)
55 PF08450 SGL: SMP-30/Gluconola 97.5 0.0066 1.4E-07 55.4 16.5 159 205-381 51-221 (246)
56 COG3386 Gluconolactonase [Carb 97.5 0.063 1.4E-06 50.8 22.9 53 308-361 222-277 (307)
57 PRK00178 tolB translocation pr 97.5 0.081 1.7E-06 52.5 25.1 107 207-334 257-368 (430)
58 PLN02919 haloacid dehalogenase 97.5 0.17 3.6E-06 56.1 29.0 137 207-360 754-922 (1057)
59 KOG0310 Conserved WD40 repeat- 97.4 0.014 3E-07 56.7 17.9 133 205-363 166-303 (487)
60 KOG0278 Serine/threonine kinas 97.4 0.0027 5.9E-08 56.7 12.2 149 205-378 155-305 (334)
61 PTZ00421 coronin; Provisional 97.4 0.12 2.7E-06 52.2 25.5 150 142-338 147-299 (493)
62 PRK04922 tolB translocation pr 97.4 0.094 2E-06 52.2 24.5 143 207-372 262-411 (433)
63 PLN00181 protein SPA1-RELATED; 97.4 0.093 2E-06 56.5 26.1 186 53-338 553-747 (793)
64 COG2706 3-carboxymuconate cycl 97.4 0.12 2.6E-06 48.6 25.6 154 207-377 158-328 (346)
65 KOG0649 WD40 repeat protein [G 97.3 0.038 8.3E-07 49.4 18.1 171 54-316 80-263 (325)
66 COG3823 Glutamine cyclotransfe 97.3 0.009 2E-07 52.2 13.4 166 24-297 48-216 (262)
67 KOG1539 WD repeat protein [Gen 97.3 0.057 1.2E-06 55.8 21.0 266 54-374 55-322 (910)
68 PHA02713 hypothetical protein; 97.3 0.23 4.9E-06 51.2 26.1 152 216-375 368-536 (557)
69 KOG0271 Notchless-like WD40 re 97.3 0.0074 1.6E-07 56.9 13.5 233 54-381 136-406 (480)
70 KOG0279 G protein beta subunit 97.3 0.023 5E-07 51.6 16.1 180 140-371 82-263 (315)
71 PF14269 Arylsulfotran_2: Aryl 97.3 0.019 4E-07 54.2 16.5 68 141-233 230-298 (299)
72 PRK05137 tolB translocation pr 97.2 0.17 3.7E-06 50.4 24.3 147 206-374 259-414 (435)
73 PHA02713 hypothetical protein; 97.2 0.11 2.5E-06 53.4 23.4 103 216-337 433-539 (557)
74 COG3823 Glutamine cyclotransfe 97.2 0.013 2.7E-07 51.3 13.6 145 143-341 68-216 (262)
75 PRK03629 tolB translocation pr 97.2 0.23 5E-06 49.4 24.7 143 206-370 256-404 (429)
76 KOG0274 Cdc4 and related F-box 97.2 0.04 8.8E-07 56.1 19.4 175 142-377 310-487 (537)
77 TIGR03548 mutarot_permut cycli 97.2 0.22 4.8E-06 47.4 27.1 127 215-360 139-312 (323)
78 PRK04792 tolB translocation pr 97.2 0.21 4.5E-06 50.0 24.2 143 206-371 275-424 (448)
79 TIGR02800 propeller_TolB tol-p 97.1 0.33 7.1E-06 47.7 24.5 131 207-361 248-386 (417)
80 PTZ00420 coronin; Provisional 97.1 0.42 9.1E-06 49.1 25.4 144 205-371 138-294 (568)
81 KOG0316 Conserved WD40 repeat- 97.1 0.015 3.3E-07 51.6 12.8 180 143-380 39-221 (307)
82 COG3391 Uncharacterized conser 97.1 0.34 7.4E-06 47.4 25.0 207 31-342 85-296 (381)
83 KOG1539 WD repeat protein [Gen 97.0 0.066 1.4E-06 55.4 18.5 118 27-233 168-285 (910)
84 PF05567 Neisseria_PilC: Neiss 96.9 0.009 1.9E-07 57.3 11.3 83 275-359 179-278 (335)
85 KOG2048 WD40 repeat protein [G 96.9 0.033 7.1E-07 56.2 15.3 174 140-365 87-271 (691)
86 KOG0295 WD40 repeat-containing 96.8 0.074 1.6E-06 50.1 15.8 83 274-360 311-395 (406)
87 COG4257 Vgb Streptogramin lyas 96.8 0.15 3.3E-06 46.6 17.2 107 205-332 199-308 (353)
88 PRK02889 tolB translocation pr 96.8 0.62 1.3E-05 46.3 24.9 142 207-371 254-402 (427)
89 KOG0271 Notchless-like WD40 re 96.8 0.1 2.2E-06 49.5 16.6 53 319-371 387-440 (480)
90 KOG0274 Cdc4 and related F-box 96.8 0.38 8.3E-06 49.1 22.2 220 54-379 227-448 (537)
91 KOG2103 Uncharacterized conser 96.8 0.032 7E-07 57.6 14.1 187 63-357 22-216 (910)
92 PRK04792 tolB translocation pr 96.8 0.71 1.5E-05 46.2 24.2 114 206-341 319-436 (448)
93 PTZ00420 coronin; Provisional 96.7 0.9 2E-05 46.7 25.5 147 142-333 147-297 (568)
94 PF07433 DUF1513: Protein of u 96.7 0.55 1.2E-05 44.0 20.5 240 54-364 27-280 (305)
95 KOG0643 Translation initiation 96.7 0.37 8.1E-06 43.8 18.4 142 205-369 64-219 (327)
96 KOG0643 Translation initiation 96.7 0.5 1.1E-05 43.0 21.4 84 274-359 166-251 (327)
97 KOG0318 WD40 repeat stress pro 96.6 0.66 1.4E-05 45.9 21.2 121 23-232 196-317 (603)
98 KOG0266 WD40 repeat-containing 96.6 0.52 1.1E-05 47.2 21.2 147 142-340 267-420 (456)
99 PRK04043 tolB translocation pr 96.5 1 2.2E-05 44.7 23.6 177 143-372 213-400 (419)
100 KOG0285 Pleiotropic regulator 96.5 0.23 4.9E-06 46.9 16.4 165 142-362 214-382 (460)
101 PLN00181 protein SPA1-RELATED; 96.5 0.45 9.7E-06 51.3 21.3 140 205-371 545-691 (793)
102 KOG0646 WD40 repeat protein [G 96.4 0.058 1.3E-06 52.2 12.5 51 274-329 195-247 (476)
103 KOG0275 Conserved WD40 repeat- 96.4 0.12 2.7E-06 47.9 13.9 150 205-380 318-475 (508)
104 KOG0649 WD40 repeat protein [G 96.4 0.19 4E-06 45.1 14.3 125 205-357 126-262 (325)
105 KOG0293 WD40 repeat-containing 96.4 0.071 1.5E-06 51.0 12.5 225 47-370 283-513 (519)
106 KOG1446 Histone H3 (Lys4) meth 96.3 0.83 1.8E-05 42.3 18.9 73 205-296 199-273 (311)
107 PRK04922 tolB translocation pr 96.3 1.3 2.8E-05 44.1 23.9 113 206-340 305-421 (433)
108 KOG0279 G protein beta subunit 96.2 0.91 2E-05 41.6 18.1 112 205-341 162-274 (315)
109 PHA03098 kelch-like protein; P 96.1 2.1 4.5E-05 43.8 23.6 106 215-337 406-517 (534)
110 PF09910 DUF2139: Uncharacteri 96.0 1.3 2.9E-05 41.0 20.2 194 143-381 9-239 (339)
111 COG4946 Uncharacterized protei 96.0 1.8 3.9E-05 42.6 23.8 144 207-381 373-523 (668)
112 KOG0646 WD40 repeat protein [G 96.0 0.42 9.2E-06 46.4 15.7 133 205-363 93-241 (476)
113 TIGR02800 propeller_TolB tol-p 96.0 1.8 4E-05 42.4 23.8 101 216-340 303-407 (417)
114 KOG0266 WD40 repeat-containing 95.9 1.5 3.3E-05 43.9 20.7 143 205-371 258-410 (456)
115 PRK02888 nitrous-oxide reducta 95.9 1.3 2.9E-05 45.4 19.9 185 127-371 197-405 (635)
116 KOG0315 G-protein beta subunit 95.9 1.3 2.8E-05 40.1 18.6 143 205-372 95-247 (311)
117 KOG0278 Serine/threonine kinas 95.9 0.076 1.6E-06 47.7 9.7 100 275-380 163-262 (334)
118 KOG0282 mRNA splicing factor [ 95.9 0.059 1.3E-06 52.4 9.7 104 205-331 270-374 (503)
119 KOG4441 Proteins containing BT 95.8 2.6 5.7E-05 43.5 22.1 236 1-339 305-554 (571)
120 PRK03629 tolB translocation pr 95.8 2.3 5.1E-05 42.2 23.5 109 207-339 301-415 (429)
121 PRK01742 tolB translocation pr 95.8 2.3 5.1E-05 42.2 25.1 103 207-334 262-366 (429)
122 KOG0318 WD40 repeat stress pro 95.7 1.4 3.1E-05 43.7 18.3 147 139-333 208-354 (603)
123 PRK00178 tolB translocation pr 95.7 1.9 4.1E-05 42.7 20.1 151 205-376 211-367 (430)
124 KOG0270 WD40 repeat-containing 95.7 0.2 4.4E-06 48.3 12.2 143 205-376 256-410 (463)
125 PRK05137 tolB translocation pr 95.6 2.7 5.9E-05 41.8 23.4 107 206-333 303-416 (435)
126 KOG2055 WD40 repeat protein [G 95.6 0.66 1.4E-05 45.2 15.4 136 142-329 279-417 (514)
127 KOG0319 WD40-repeat-containing 95.6 0.6 1.3E-05 47.9 15.8 229 49-377 34-273 (775)
128 TIGR03548 mutarot_permut cycli 95.5 2.3 4.9E-05 40.4 24.0 38 277-316 271-310 (323)
129 TIGR03547 muta_rot_YjhT mutatr 95.5 2.4 5.3E-05 40.6 27.0 131 215-360 168-330 (346)
130 KOG0319 WD40-repeat-containing 95.5 0.42 9.1E-06 49.0 14.2 168 142-363 126-305 (775)
131 KOG1027 Serine/threonine prote 95.4 0.16 3.4E-06 53.2 11.4 112 204-345 106-217 (903)
132 PRK02889 tolB translocation pr 95.4 3.2 6.9E-05 41.2 24.9 62 277-342 352-415 (427)
133 KOG0285 Pleiotropic regulator 95.3 0.54 1.2E-05 44.4 13.5 165 84-337 147-315 (460)
134 PF07433 DUF1513: Protein of u 95.3 2.6 5.6E-05 39.6 21.9 123 2-163 33-158 (305)
135 KOG0291 WD40-repeat-containing 95.3 3.4 7.3E-05 42.9 19.7 100 275-377 370-471 (893)
136 COG3419 PilY1 Tfp pilus assemb 95.2 0.79 1.7E-05 49.2 15.7 186 134-338 583-800 (1036)
137 KOG0282 mRNA splicing factor [ 95.2 0.064 1.4E-06 52.1 7.2 142 205-369 227-371 (503)
138 KOG4441 Proteins containing BT 95.0 2.4 5.1E-05 43.9 18.6 128 215-361 349-485 (571)
139 COG3386 Gluconolactonase [Carb 94.9 2.5 5.4E-05 40.0 17.1 53 205-260 223-277 (307)
140 KOG0275 Conserved WD40 repeat- 94.9 0.23 5E-06 46.2 9.7 181 142-366 234-429 (508)
141 KOG3881 Uncharacterized conser 94.9 1.6 3.6E-05 41.7 15.4 158 143-341 173-332 (412)
142 KOG1446 Histone H3 (Lys4) meth 94.8 3.5 7.6E-05 38.3 24.6 183 141-380 78-270 (311)
143 COG3391 Uncharacterized conser 94.8 4.5 9.7E-05 39.5 20.9 166 31-297 127-295 (381)
144 PHA03098 kelch-like protein; P 94.7 5.8 0.00013 40.5 23.2 128 215-360 358-496 (534)
145 KOG0315 G-protein beta subunit 94.6 3.4 7.3E-05 37.5 20.2 148 205-380 136-296 (311)
146 KOG2321 WD40 repeat protein [G 94.6 1.6 3.5E-05 43.8 15.2 215 55-358 155-379 (703)
147 TIGR03547 muta_rot_YjhT mutatr 94.6 4.5 9.7E-05 38.7 21.7 82 277-360 168-266 (346)
148 TIGR03032 conserved hypothetic 94.5 4.3 9.4E-05 38.2 18.1 95 189-297 204-302 (335)
149 KOG0379 Kelch repeat-containin 94.5 6.2 0.00013 39.9 20.6 157 205-379 123-306 (482)
150 PRK14131 N-acetylneuraminic ac 94.4 5.5 0.00012 38.8 26.0 129 216-360 190-352 (376)
151 KOG0286 G-protein beta subunit 94.3 4.4 9.5E-05 37.5 20.3 114 205-340 156-270 (343)
152 PHA02790 Kelch-like protein; P 94.3 6.8 0.00015 39.6 22.0 131 205-360 318-455 (480)
153 KOG4693 Uncharacterized conser 94.3 2 4.3E-05 39.3 13.8 67 214-293 215-282 (392)
154 PHA02790 Kelch-like protein; P 94.2 6.9 0.00015 39.5 23.8 102 205-332 362-473 (480)
155 KOG3881 Uncharacterized conser 94.1 3.5 7.5E-05 39.5 15.7 145 207-362 163-312 (412)
156 KOG2110 Uncharacterized conser 94.0 5.8 0.00013 37.8 20.9 224 54-379 105-339 (391)
157 KOG0639 Transducin-like enhanc 93.9 1 2.2E-05 44.4 12.1 172 142-368 530-702 (705)
158 PLN02153 epithiospecifier prot 93.9 6.3 0.00014 37.7 22.3 134 216-360 102-260 (341)
159 COG3419 PilY1 Tfp pilus assemb 93.7 1.2 2.7E-05 47.8 13.3 175 204-382 581-789 (1036)
160 KOG2110 Uncharacterized conser 93.3 7.7 0.00017 37.0 16.4 150 205-373 97-251 (391)
161 PF14583 Pectate_lyase22: Olig 93.2 8.7 0.00019 37.3 20.7 131 214-361 167-303 (386)
162 COG4946 Uncharacterized protei 93.2 5.7 0.00012 39.2 15.7 79 242-340 227-305 (668)
163 KOG0310 Conserved WD40 repeat- 93.2 9.1 0.0002 37.7 17.1 136 141-329 174-309 (487)
164 KOG1272 WD40-repeat-containing 93.1 1.2 2.6E-05 43.5 11.1 111 206-340 222-334 (545)
165 KOG0639 Transducin-like enhanc 93.1 6.2 0.00014 39.2 15.9 137 205-365 521-658 (705)
166 COG4257 Vgb Streptogramin lyas 93.1 2.4 5.2E-05 39.0 12.3 142 207-374 75-220 (353)
167 KOG0265 U5 snRNP-specific prot 93.0 3.4 7.4E-05 38.3 13.3 104 142-297 111-216 (338)
168 PLN02193 nitrile-specifier pro 92.8 12 0.00026 37.7 27.0 104 216-337 295-416 (470)
169 KOG0293 WD40 repeat-containing 92.7 3.7 8E-05 39.7 13.6 146 192-362 270-417 (519)
170 PRK01029 tolB translocation pr 92.7 11 0.00025 37.3 26.3 69 308-377 337-408 (428)
171 PF14583 Pectate_lyase22: Olig 92.6 1.6 3.5E-05 42.2 11.3 123 219-358 14-142 (386)
172 COG2706 3-carboxymuconate cycl 92.6 9.7 0.00021 36.2 25.3 100 276-377 166-281 (346)
173 KOG0647 mRNA export protein (c 92.5 7.4 0.00016 36.2 14.8 162 191-380 72-238 (347)
174 KOG2321 WD40 repeat protein [G 92.4 3 6.4E-05 42.0 12.9 133 54-256 196-331 (703)
175 PRK04043 tolB translocation pr 92.3 13 0.00028 36.9 22.8 117 206-340 290-410 (419)
176 PF03022 MRJP: Major royal jel 92.1 10 0.00022 35.5 20.8 185 143-359 34-254 (287)
177 PRK02888 nitrous-oxide reducta 92.1 8.2 0.00018 39.9 16.1 178 51-330 211-405 (635)
178 KOG0281 Beta-TrCP (transducin 92.0 0.99 2.1E-05 42.6 8.7 91 275-371 338-429 (499)
179 COG3490 Uncharacterized protei 91.9 11 0.00023 35.1 17.5 43 319-363 300-342 (366)
180 KOG1036 Mitotic spindle checkp 91.3 3.2 7E-05 38.5 11.1 131 205-362 25-156 (323)
181 KOG4547 WD40 repeat-containing 91.3 6.6 0.00014 39.5 14.1 113 206-342 71-185 (541)
182 PF03022 MRJP: Major royal jel 91.0 9.7 0.00021 35.7 14.5 80 276-357 33-126 (287)
183 KOG0263 Transcription initiati 90.7 10 0.00022 39.4 15.2 99 274-375 554-654 (707)
184 PRK14131 N-acetylneuraminic ac 90.7 17 0.00037 35.3 21.1 82 277-360 189-288 (376)
185 KOG1273 WD40 repeat protein [G 90.7 15 0.00032 34.6 16.7 141 206-368 78-224 (405)
186 KOG0281 Beta-TrCP (transducin 90.7 1.5 3.2E-05 41.4 8.4 100 205-332 330-431 (499)
187 KOG0301 Phospholipase A2-activ 90.4 14 0.00031 38.1 15.5 129 205-362 151-281 (745)
188 KOG0303 Actin-binding protein 90.0 5.5 0.00012 38.4 11.6 136 205-363 144-288 (472)
189 PRK01029 tolB translocation pr 90.0 22 0.00047 35.3 25.4 57 277-335 351-409 (428)
190 KOG1272 WD40-repeat-containing 89.9 0.98 2.1E-05 44.1 6.8 174 192-371 131-324 (545)
191 KOG0273 Beta-transducin family 89.7 22 0.00048 35.1 21.0 68 309-380 422-490 (524)
192 PLN02193 nitrile-specifier pro 89.7 24 0.00053 35.5 23.8 129 215-360 244-386 (470)
193 PRK01742 tolB translocation pr 89.6 23 0.00049 35.1 22.9 57 277-339 353-413 (429)
194 KOG4499 Ca2+-binding protein R 89.4 4.5 9.7E-05 36.5 9.9 50 309-359 222-274 (310)
195 KOG2106 Uncharacterized conser 88.9 27 0.00058 34.9 19.1 107 142-297 221-327 (626)
196 KOG4328 WD40 protein [Function 88.7 12 0.00025 36.9 13.0 31 308-340 431-461 (498)
197 KOG0295 WD40 repeat-containing 88.7 18 0.00039 34.6 13.9 144 205-373 205-367 (406)
198 PF05567 Neisseria_PilC: Neiss 88.4 3.6 7.7E-05 39.5 9.7 60 142-224 180-240 (335)
199 KOG0303 Actin-binding protein 88.1 14 0.00029 35.8 12.8 81 275-359 152-234 (472)
200 KOG1273 WD40 repeat protein [G 87.9 9 0.0002 35.9 11.3 150 205-381 35-192 (405)
201 PF14727 PHTB1_N: PTHB1 N-term 87.8 8.2 0.00018 38.2 11.8 20 54-74 261-280 (418)
202 KOG0270 WD40 repeat-containing 87.6 22 0.00047 34.9 14.0 114 134-298 257-374 (463)
203 KOG0289 mRNA splicing factor [ 87.5 19 0.0004 35.3 13.5 84 274-358 322-406 (506)
204 KOG0294 WD40 repeat-containing 87.2 26 0.00057 32.9 18.1 141 205-372 139-283 (362)
205 KOG1036 Mitotic spindle checkp 86.7 28 0.0006 32.6 17.3 104 205-336 65-170 (323)
206 KOG1407 WD40 repeat protein [F 86.6 26 0.00056 32.2 17.5 187 54-344 86-276 (313)
207 KOG0379 Kelch repeat-containin 86.6 39 0.00084 34.2 18.6 114 141-294 188-308 (482)
208 PF02897 Peptidase_S9_N: Proly 86.6 34 0.00073 33.5 22.9 110 249-372 237-357 (414)
209 KOG2106 Uncharacterized conser 86.6 37 0.0008 34.0 20.4 197 54-334 221-441 (626)
210 KOG0289 mRNA splicing factor [ 86.5 35 0.00075 33.5 16.4 114 205-340 315-430 (506)
211 KOG1274 WD40 repeat protein [G 86.4 51 0.0011 35.4 20.5 138 143-330 118-263 (933)
212 PLN02153 epithiospecifier prot 86.4 31 0.00067 32.9 30.4 134 216-360 160-323 (341)
213 cd00028 B_lectin Bulb-type man 86.2 9.4 0.0002 30.2 9.5 49 143-232 65-113 (116)
214 KOG0772 Uncharacterized conser 86.2 39 0.00086 33.9 18.3 88 275-363 384-481 (641)
215 PF00930 DPPIV_N: Dipeptidyl p 86.1 15 0.00034 35.2 12.8 128 214-358 209-345 (353)
216 COG3292 Predicted periplasmic 85.9 42 0.00091 34.3 15.4 85 275-362 352-440 (671)
217 KOG0308 Conserved WD40 repeat- 85.7 40 0.00087 34.8 15.3 108 205-332 130-246 (735)
218 KOG2919 Guanine nucleotide-bin 85.6 33 0.00071 32.5 14.0 47 54-108 132-178 (406)
219 KOG4547 WD40 repeat-containing 84.9 41 0.0009 34.0 14.9 109 140-298 77-185 (541)
220 TIGR02276 beta_rpt_yvtn 40-res 84.8 3.1 6.7E-05 25.9 5.0 33 308-341 2-34 (42)
221 KOG0288 WD40 repeat protein Ti 84.7 21 0.00046 34.7 12.2 112 206-341 313-429 (459)
222 KOG1188 WD40 repeat protein [G 84.6 36 0.00077 32.4 13.5 183 143-371 50-243 (376)
223 smart00108 B_lectin Bulb-type 84.3 13 0.00029 29.2 9.6 23 208-231 89-111 (114)
224 COG0823 TolB Periplasmic compo 84.0 48 0.001 33.0 16.5 131 205-357 250-386 (425)
225 KOG0280 Uncharacterized conser 83.8 9.9 0.00021 35.3 9.3 110 206-344 134-257 (339)
226 KOG1407 WD40 repeat protein [F 83.5 33 0.00072 31.5 12.4 140 205-370 77-219 (313)
227 KOG0265 U5 snRNP-specific prot 82.9 33 0.00073 32.0 12.4 104 205-330 59-164 (338)
228 PF05262 Borrelia_P83: Borreli 82.8 7.2 0.00016 39.2 8.8 96 274-371 372-471 (489)
229 KOG0272 U4/U6 small nuclear ri 82.6 51 0.0011 32.2 15.0 139 205-368 315-458 (459)
230 smart00108 B_lectin Bulb-type 81.9 23 0.0005 27.8 10.2 21 212-233 61-81 (114)
231 KOG1912 WD40 repeat protein [G 80.8 63 0.0014 34.3 14.6 52 309-363 243-298 (1062)
232 KOG4378 Nuclear protein COP1 [ 80.4 35 0.00075 34.2 12.2 103 205-330 177-281 (673)
233 COG0823 TolB Periplasmic compo 80.2 44 0.00096 33.2 13.4 122 215-357 218-342 (425)
234 KOG1274 WD40 repeat protein [G 79.9 51 0.0011 35.4 13.9 111 54-234 117-229 (933)
235 KOG0772 Uncharacterized conser 79.4 42 0.00092 33.7 12.5 104 205-326 376-484 (641)
236 KOG0272 U4/U6 small nuclear ri 79.4 65 0.0014 31.5 13.5 141 205-368 273-416 (459)
237 KOG0640 mRNA cleavage stimulat 79.1 19 0.00041 33.8 9.5 153 205-380 184-343 (430)
238 KOG0647 mRNA export protein (c 78.9 57 0.0012 30.6 20.1 100 206-331 128-230 (347)
239 PF06977 SdiA-regulated: SdiA- 77.5 57 0.0012 29.8 21.2 56 303-360 174-241 (248)
240 PF14783 BBS2_Mid: Ciliary BBS 77.5 34 0.00073 27.1 10.3 77 193-297 5-81 (111)
241 KOG4328 WD40 protein [Function 77.4 10 0.00022 37.2 7.6 100 274-375 254-358 (498)
242 PF02897 Peptidase_S9_N: Proly 77.4 75 0.0016 31.1 15.5 107 141-290 299-409 (414)
243 KOG4693 Uncharacterized conser 77.3 61 0.0013 30.0 22.0 134 215-359 157-310 (392)
244 PF08553 VID27: VID27 cytoplas 77.1 22 0.00048 38.1 10.6 124 212-360 501-638 (794)
245 PF09826 Beta_propel: Beta pro 76.4 95 0.0021 31.8 17.9 81 274-356 301-385 (521)
246 KOG0288 WD40 repeat protein Ti 75.7 51 0.0011 32.2 11.6 108 54-234 321-428 (459)
247 cd00028 B_lectin Bulb-type man 75.5 38 0.00082 26.7 9.6 21 212-233 62-82 (116)
248 KOG0276 Vesicle coat complex C 74.9 1.1E+02 0.0024 31.7 18.0 67 53-165 33-99 (794)
249 KOG0650 WD40 repeat nucleolar 74.9 81 0.0018 32.4 13.2 51 310-363 579-631 (733)
250 KOG0301 Phospholipase A2-activ 74.0 1.1E+02 0.0024 32.0 14.0 59 278-340 348-406 (745)
251 KOG1027 Serine/threonine prote 73.8 21 0.00045 38.0 9.2 177 54-297 36-213 (903)
252 COG3490 Uncharacterized protei 73.6 80 0.0017 29.6 16.0 168 145-359 51-244 (366)
253 KOG0286 G-protein beta subunit 73.6 80 0.0017 29.5 23.5 172 143-365 166-340 (343)
254 PF14339 DUF4394: Domain of un 73.5 71 0.0015 28.9 14.8 27 30-70 37-63 (236)
255 KOG0308 Conserved WD40 repeat- 72.7 1.2E+02 0.0027 31.4 14.4 101 206-330 184-286 (735)
256 PF08553 VID27: VID27 cytoplas 72.6 98 0.0021 33.4 14.0 137 142-329 503-647 (794)
257 KOG0263 Transcription initiati 72.1 1.4E+02 0.0029 31.5 16.2 103 205-330 547-650 (707)
258 PF03178 CPSF_A: CPSF A subuni 71.6 87 0.0019 29.5 12.6 142 216-371 3-158 (321)
259 KOG2395 Protein involved in va 71.5 29 0.00063 35.0 9.2 130 206-360 346-491 (644)
260 KOG4499 Ca2+-binding protein R 70.2 60 0.0013 29.5 10.0 50 249-316 221-273 (310)
261 KOG0283 WD40 repeat-containing 69.2 1.6E+02 0.0034 31.2 14.6 145 31-257 421-565 (712)
262 TIGR02276 beta_rpt_yvtn 40-res 69.1 15 0.00033 22.6 4.7 31 31-74 3-33 (42)
263 smart00284 OLF Olfactomedin-li 69.0 95 0.0021 28.5 12.9 115 246-377 79-213 (255)
264 PF14298 DUF4374: Domain of un 68.1 20 0.00044 35.4 7.4 59 275-333 365-428 (435)
265 KOG0306 WD40-repeat-containing 67.1 78 0.0017 33.4 11.4 90 277-370 394-484 (888)
266 KOG0273 Beta-transducin family 67.0 1.4E+02 0.0031 29.7 19.4 85 274-362 429-516 (524)
267 KOG2111 Uncharacterized conser 66.6 37 0.00079 32.0 8.3 96 276-373 158-259 (346)
268 PF14298 DUF4374: Domain of un 65.2 9.2 0.0002 37.7 4.4 57 2-67 372-428 (435)
269 KOG0306 WD40-repeat-containing 64.8 55 0.0012 34.5 9.8 94 275-372 85-181 (888)
270 COG4880 Secreted protein conta 64.5 90 0.0019 30.9 10.7 47 205-258 149-196 (603)
271 COG5276 Uncharacterized conser 63.7 1.3E+02 0.0029 28.3 17.0 130 204-359 95-231 (370)
272 TIGR03054 photo_alph_chp1 puta 62.6 51 0.0011 27.1 7.6 71 205-292 41-120 (135)
273 KOG0299 U3 snoRNP-associated p 62.5 1.7E+02 0.0037 29.1 18.1 60 310-371 339-411 (479)
274 KOG1332 Vesicle coat complex C 62.4 1.1E+02 0.0024 27.9 10.2 95 274-372 77-195 (299)
275 PF02333 Phytase: Phytase; In 62.3 1.6E+02 0.0035 28.8 18.1 37 288-329 199-237 (381)
276 KOG0283 WD40 repeat-containing 62.2 61 0.0013 34.1 9.8 144 205-372 422-578 (712)
277 PF14339 DUF4394: Domain of un 62.1 25 0.00055 31.8 6.3 63 308-375 37-106 (236)
278 KOG0284 Polyadenylation factor 61.7 22 0.00047 34.6 6.0 60 275-338 242-303 (464)
279 KOG1007 WD repeat protein TSSC 61.0 1.4E+02 0.003 28.0 10.7 56 274-330 190-246 (370)
280 PF00930 DPPIV_N: Dipeptidyl p 59.4 49 0.0011 31.8 8.4 146 216-376 159-320 (353)
281 PF14781 BBS2_N: Ciliary BBSom 59.0 1E+02 0.0022 25.4 9.0 57 276-338 72-134 (136)
282 PF01453 B_lectin: D-mannose b 58.9 44 0.00095 26.4 6.7 22 210-232 58-79 (114)
283 KOG1188 WD40 repeat protein [G 58.9 1.2E+02 0.0025 29.1 10.1 19 205-223 178-196 (376)
284 COG5276 Uncharacterized conser 57.3 1.7E+02 0.0038 27.6 18.8 131 205-362 138-278 (370)
285 PF14727 PHTB1_N: PTHB1 N-term 56.5 2.1E+02 0.0046 28.4 22.9 164 205-381 145-325 (418)
286 COG5167 VID27 Protein involved 55.3 2.1E+02 0.0045 29.2 11.6 24 142-165 489-512 (776)
287 PF13964 Kelch_6: Kelch motif 54.8 34 0.00075 22.2 4.6 37 26-71 6-42 (50)
288 PF05262 Borrelia_P83: Borreli 54.4 1E+02 0.0022 31.2 9.7 84 213-316 373-457 (489)
289 KOG0322 G-protein beta subunit 54.1 1.8E+02 0.004 26.9 11.3 146 204-360 164-313 (323)
290 KOG0642 Cell-cycle nuclear pro 53.7 1.5E+02 0.0032 30.2 10.5 62 310-373 502-564 (577)
291 KOG3914 WD repeat protein WDR4 53.4 1E+02 0.0022 30.0 8.9 68 311-382 165-233 (390)
292 KOG0313 Microtubule binding pr 52.9 2.3E+02 0.0049 27.6 12.1 31 205-235 271-301 (423)
293 KOG0276 Vesicle coat complex C 50.4 3.2E+02 0.0069 28.5 13.2 143 205-371 25-172 (794)
294 COG4880 Secreted protein conta 49.9 1.7E+02 0.0038 29.0 10.0 55 301-359 141-196 (603)
295 KOG0640 mRNA cleavage stimulat 48.5 1.6E+02 0.0035 27.8 9.2 152 205-375 228-386 (430)
296 PF01453 B_lectin: D-mannose b 47.5 90 0.002 24.6 6.8 59 206-293 20-78 (114)
297 PF12894 Apc4_WD40: Anaphase-p 46.5 25 0.00054 23.1 2.8 25 31-70 23-47 (47)
298 KOG0294 WD40 repeat-containing 46.4 2.7E+02 0.0058 26.5 17.9 54 276-331 228-283 (362)
299 KOG1645 RING-finger-containing 43.3 3.3E+02 0.0072 26.8 16.8 92 278-370 361-461 (463)
300 KOG0642 Cell-cycle nuclear pro 42.4 3.9E+02 0.0085 27.3 12.1 50 275-326 509-558 (577)
301 KOG0268 Sof1-like rRNA process 42.3 1.7E+02 0.0037 28.2 8.5 29 207-235 81-109 (433)
302 TIGR02604 Piru_Ver_Nterm putat 42.1 3.2E+02 0.007 26.3 11.1 78 278-359 48-142 (367)
303 PF02191 OLF: Olfactomedin-lik 41.9 2.8E+02 0.006 25.4 13.9 101 276-377 88-208 (250)
304 PF08309 LVIVD: LVIVD repeat; 41.0 69 0.0015 20.4 4.1 26 303-330 5-30 (42)
305 KOG4190 Uncharacterized conser 40.3 87 0.0019 31.8 6.6 109 209-335 798-912 (1034)
306 TIGR03118 PEPCTERM_chp_1 conse 40.2 3.3E+02 0.0072 25.9 13.6 68 141-231 220-287 (336)
307 COG3292 Predicted periplasmic 40.2 4.4E+02 0.0096 27.3 14.4 50 309-360 344-395 (671)
308 KOG2395 Protein involved in va 39.7 4.4E+02 0.0094 27.1 12.2 22 143-164 356-377 (644)
309 PF14781 BBS2_N: Ciliary BBSom 39.2 80 0.0017 25.9 5.2 30 30-74 63-92 (136)
310 PF07893 DUF1668: Protein of u 38.8 3.6E+02 0.0078 25.8 11.3 99 274-375 83-210 (342)
311 PF14517 Tachylectin: Tachylec 38.4 3E+02 0.0065 24.8 9.3 28 206-235 180-207 (229)
312 TIGR03032 conserved hypothetic 38.4 3.6E+02 0.0078 25.7 23.3 160 206-379 114-297 (335)
313 PF14783 BBS2_Mid: Ciliary BBS 37.8 2E+02 0.0044 22.7 10.6 20 143-164 63-82 (111)
314 PF03088 Str_synth: Strictosid 37.5 87 0.0019 23.7 4.9 61 32-107 10-75 (89)
315 COG3055 Uncharacterized protei 37.5 63 0.0014 31.1 4.9 58 306-363 43-104 (381)
316 PF08662 eIF2A: Eukaryotic tra 37.3 2.7E+02 0.0059 24.0 12.2 92 277-380 83-178 (194)
317 KOG0269 WD40 repeat-containing 36.7 2.2E+02 0.0048 30.3 9.0 102 205-329 146-250 (839)
318 PF09826 Beta_propel: Beta pro 36.7 2.1E+02 0.0046 29.3 9.0 51 304-358 17-70 (521)
319 KOG0305 Anaphase promoting com 36.7 4.7E+02 0.01 26.5 16.1 188 101-377 189-381 (484)
320 PF06977 SdiA-regulated: SdiA- 36.5 1.4E+02 0.0031 27.2 7.1 62 249-328 32-93 (248)
321 PF02333 Phytase: Phytase; In 35.4 4.4E+02 0.0095 25.8 13.2 142 205-361 68-229 (381)
322 KOG4283 Transcription-coupled 35.3 3.9E+02 0.0085 25.2 10.0 115 139-288 120-234 (397)
323 PF02191 OLF: Olfactomedin-lik 35.2 3.5E+02 0.0077 24.7 17.7 187 142-359 30-238 (250)
324 KOG0280 Uncharacterized conser 35.0 1.9E+02 0.004 27.2 7.4 73 205-296 178-253 (339)
325 PF14779 BBS1: Ciliary BBSome 34.0 2.3E+02 0.0051 26.0 7.9 60 134-221 196-256 (257)
326 KOG1912 WD40 repeat protein [G 33.9 4.6E+02 0.0099 28.3 10.7 74 206-297 80-155 (1062)
327 KOG3914 WD repeat protein WDR4 33.5 4.7E+02 0.01 25.5 11.9 36 303-340 199-234 (390)
328 PF14779 BBS1: Ciliary BBSome 32.4 1.4E+02 0.003 27.5 6.2 51 311-363 197-252 (257)
329 KOG0284 Polyadenylation factor 32.2 1.7E+02 0.0038 28.6 7.0 71 205-297 234-306 (464)
330 PF14517 Tachylectin: Tachylec 30.7 1.8E+02 0.004 26.2 6.6 78 275-359 139-223 (229)
331 PF11768 DUF3312: Protein of u 30.3 2.5E+02 0.0054 28.8 8.1 68 132-231 270-337 (545)
332 KOG1240 Protein kinase contain 30.2 8.8E+02 0.019 27.7 16.1 69 206-294 1164-1234(1431)
333 KOG0299 U3 snoRNP-associated p 29.8 5.8E+02 0.012 25.5 13.4 27 205-231 154-183 (479)
334 TIGR03118 PEPCTERM_chp_1 conse 29.4 5E+02 0.011 24.7 15.1 88 247-339 195-289 (336)
335 KOG2695 WD40 repeat protein [G 27.3 4E+02 0.0087 25.7 8.3 117 205-341 264-388 (425)
336 KOG0313 Microtubule binding pr 27.1 3.4E+02 0.0074 26.4 7.9 97 274-373 278-379 (423)
337 PRK10115 protease 2; Provision 26.6 8E+02 0.017 26.1 25.3 67 307-374 277-348 (686)
338 KOG1445 Tumor-specific antigen 26.0 3.9E+02 0.0085 28.0 8.5 58 276-335 149-207 (1012)
339 PF01344 Kelch_1: Kelch motif; 25.9 1.4E+02 0.0031 18.6 4.0 35 26-70 6-41 (47)
340 KOG0300 WD40 repeat-containing 24.5 6.2E+02 0.013 24.1 11.2 28 205-232 284-311 (481)
341 KOG3545 Olfactomedin and relat 22.6 6E+02 0.013 23.2 9.0 102 276-377 87-207 (249)
342 PF00400 WD40: WD domain, G-be 22.4 1.8E+02 0.0038 17.0 4.0 8 54-61 32-39 (39)
343 PF05694 SBP56: 56kDa selenium 22.2 1.3E+02 0.0029 29.9 4.4 51 308-359 322-393 (461)
344 KOG1240 Protein kinase contain 22.1 1.2E+03 0.027 26.7 14.0 63 143-233 1173-1237(1431)
345 PHA02581 9 baseplate wedge tai 21.9 6.4E+02 0.014 23.3 8.6 34 320-353 145-178 (284)
346 KOG1275 PAB-dependent poly(A) 21.6 4.8E+02 0.01 28.6 8.5 77 277-358 157-234 (1118)
347 PF03178 CPSF_A: CPSF A subuni 21.4 6.7E+02 0.014 23.3 18.4 175 143-370 2-202 (321)
348 PF08596 Lgl_C: Lethal giant l 21.4 7.8E+02 0.017 24.1 13.1 141 142-338 106-252 (395)
349 COG5167 VID27 Protein involved 21.0 7E+02 0.015 25.6 9.0 102 205-329 479-591 (776)
350 PRK03999 translation initiatio 20.9 4.6E+02 0.0099 21.3 6.8 59 278-340 43-101 (129)
351 KOG1445 Tumor-specific antigen 20.9 1.4E+02 0.0031 31.0 4.4 58 307-367 139-197 (1012)
352 KOG0302 Ribosome Assembly prot 20.1 8.3E+02 0.018 23.9 10.1 99 274-372 277-380 (440)
353 smart00284 OLF Olfactomedin-li 20.0 6.9E+02 0.015 23.0 17.5 150 143-328 94-251 (255)
No 1
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three-subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=100.00 E-value=9e-38 Score=314.09 Aligned_cols=313 Identities=28% Similarity=0.491 Sum_probs=245.7
Q ss_pred CCcCCCCceeeeeecCcCc-cceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCC
Q 040693 1 AVKRSNGKLVWKTKLDDHA-RSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFG 79 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~~-~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~ 79 (382)
|||++|||++|++.+.+.. ...+.++|++.+++||++....+.+. +|.|+|||++||+++|+++..+....
T Consensus 134 ALDa~TGk~~W~~~~~~~~~~~~~tssP~v~~g~Vivg~~~~~~~~--------~G~v~AlD~~TG~~lW~~~~~p~~~~ 205 (527)
T TIGR03075 134 ALDAKTGKVVWSKKNGDYKAGYTITAAPLVVKGKVITGISGGEFGV--------RGYVTAYDAKTGKLVWRRYTVPGDMG 205 (527)
T ss_pred EEECCCCCEEeecccccccccccccCCcEEECCEEEEeecccccCC--------CcEEEEEECCCCceeEeccCcCCCcc
Confidence 7899999999999875432 34577899999999999886555442 78999999999999999988654211
Q ss_pred --------------------CCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCC
Q 040693 80 --------------------KLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPE 139 (382)
Q Consensus 80 --------------------~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (382)
.....+|+++|. .+++|++.++||++++|+ +|.....|..++
T Consensus 206 ~~~~~~~~~~~~~~~~tw~~~~~~~gg~~~W~-~~s~D~~~~lvy~~tGnp-----------------~p~~~~~r~gdn 267 (527)
T TIGR03075 206 YLDKADKPVGGEPGAKTWPGDAWKTGGGATWG-TGSYDPETNLIYFGTGNP-----------------SPWNSHLRPGDN 267 (527)
T ss_pred cccccccccccccccCCCCCCccccCCCCccC-ceeEcCCCCeEEEeCCCC-----------------CCCCCCCCCCCC
Confidence 111247889997 689999999999999983 343345567788
Q ss_pred CCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEE
Q 040693 140 NHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWA 219 (382)
Q Consensus 140 ~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~a 219 (382)
-+...|+|||++|||++|.+|..+++.| +++..+.|+|+++..+|.....|+.++++|.+|+
T Consensus 268 l~~~s~vAld~~TG~~~W~~Q~~~~D~w------------------D~d~~~~p~l~d~~~~G~~~~~v~~~~K~G~~~v 329 (527)
T TIGR03075 268 LYTSSIVARDPDTGKIKWHYQTTPHDEW------------------DYDGVNEMILFDLKKDGKPRKLLAHADRNGFFYV 329 (527)
T ss_pred ccceeEEEEccccCCEEEeeeCCCCCCc------------------cccCCCCcEEEEeccCCcEEEEEEEeCCCceEEE
Confidence 8889999999999999999999999999 5667799999998878877789999999999999
Q ss_pred EeCCCCCeeee----------eccCC-------------------------CCCCCCcccceee---eCCeEEEEecCcc
Q 040693 220 LDRDSGSLIWS----------MEAGP-------------------------GGLGGGAMWGAAT---DERRIYTNIANSQ 261 (382)
Q Consensus 220 ld~~tG~~~W~----------~~~~~-------------------------~~~~g~~~~~~~~---~~~~v~~~~~~~~ 261 (382)
||++|||.+|. ....+ +...|+..|+++. ..+++|+...+..
T Consensus 330 lDr~tG~~i~~~~~~~~~~w~~~~~~~~g~p~~~~~~~~~~~~~~~~~~~~Pg~~Gg~~W~~~A~Dp~~g~~yvp~~~~~ 409 (527)
T TIGR03075 330 LDRTNGKLLSAEPFVDTVNWATGVDLKTGRPIEVPEARSADGKKGKPVGVCPGFLGGKNWQPMAYSPKTGLFYVPANEVC 409 (527)
T ss_pred EECCCCceeccccccCCcccccccCCCCCCCccChhhCcCCCCCCCeeEECCCCcCCCCCCCceECCCCCEEEEeccccc
Confidence 99999999733 21110 2344566777554 5788998877631
Q ss_pred cc------------cc-----ccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEE
Q 040693 262 HK------------NF-----NLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIY 324 (382)
Q Consensus 262 ~~------------~~-----~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~ 324 (382)
.. .+ ...|. .....+.|.|+|++|||++|+.+.+.+ ...+++...+++||+++. +|.|+
T Consensus 410 ~~~~~~~~~~~~g~~~~~~~~~~~p~-~~~~~g~l~AiD~~tGk~~W~~~~~~p-~~~~~l~t~g~lvf~g~~--~G~l~ 485 (527)
T TIGR03075 410 MDYEPEKVSYKKGAAYLGAGLTIKPP-PDDHMGSLIAWDPITGKIVWEHKEDFP-LWGGVLATAGDLVFYGTL--EGYFK 485 (527)
T ss_pred ccccccccccCCCCceeccccccCCC-CCCCceeEEEEeCCCCceeeEecCCCC-CCCcceEECCcEEEEECC--CCeEE
Confidence 10 01 01121 112368899999999999999987663 456677778899999874 89999
Q ss_pred EEeCCCCcEeEEEecCCceecceEE--eCCEEEEEeCce
Q 040693 325 AMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGNGYK 361 (382)
Q Consensus 325 ~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~~~g 361 (382)
|+|.+|||++|++++++.+.++|+. ++|++||....|
T Consensus 486 a~D~~TGe~lw~~~~g~~~~a~P~ty~~~G~qYv~~~~G 524 (527)
T TIGR03075 486 AFDAKTGEELWKFKTGSGIVGPPVTYEQDGKQYVAVLSG 524 (527)
T ss_pred EEECCCCCEeEEEeCCCCceecCEEEEeCCEEEEEEEec
Confidence 9999999999999999999999998 999999987554
No 2
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=100.00 E-value=4.9e-34 Score=286.20 Aligned_cols=332 Identities=27% Similarity=0.397 Sum_probs=232.4
Q ss_pred CCcCCCCceeeeeecCcC--ccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCC
Q 040693 1 AVKRSNGKLVWKTKLDDH--ARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNF 78 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~--~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~ 78 (382)
|||++|||++|++++... ....+.++|++.+++||+++...+. .+|..++.|+|||++||+++|++++.+...
T Consensus 124 AlD~~TG~~~W~~~~~~~~~~~~~i~ssP~v~~~~v~vg~~~~~~-----~~~~~~g~v~alD~~TG~~~W~~~~~~~~~ 198 (488)
T cd00216 124 ALDAETGKQVWKFGNNDQVPPGYTMTGAPTIVKKLVIIGSSGAEF-----FACGVRGALRAYDVETGKLLWRFYTTEPDP 198 (488)
T ss_pred EEECCCCCEeeeecCCCCcCcceEecCCCEEECCEEEEecccccc-----ccCCCCcEEEEEECCCCceeeEeeccCCCc
Confidence 689999999999998643 2233688999999999998754332 123347899999999999999998853211
Q ss_pred -CCC--------CCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEE
Q 040693 79 -GKL--------NEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALD 149 (382)
Q Consensus 79 -~~~--------~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald 149 (382)
..+ ...+++.+|. +|++++.+++||+++++.+.. .+. .. .+...+.+.+.|+|||
T Consensus 199 ~~~~~~~~~~~~~~~~g~~vw~-~pa~d~~~g~V~vg~~~g~~~--~~~------~~-------~~~~~~~~~~~l~Ald 262 (488)
T cd00216 199 NAFPTWGPDRQMWGPGGGTSWA-SPTYDPKTNLVYVGTGNGSPW--NWG------GR-------RTPGDNLYTDSIVALD 262 (488)
T ss_pred CCCCCCCCCcceecCCCCCccC-CeeEeCCCCEEEEECCCCCCC--ccC------Cc-------cCCCCCCceeeEEEEc
Confidence 100 0124566675 689998889999999874211 110 00 1122345568999999
Q ss_pred CCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEee-eCceeecEEEEEccCcEEEEEeCCCCCee
Q 040693 150 LDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMY-RNKVKHDIVVAVQKSGFAWALDRDSGSLI 228 (382)
Q Consensus 150 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~-~~g~~~~~v~~~~~~g~l~ald~~tG~~~ 228 (382)
++||+++|+++....+.|.+ +..++|++.++. .+|.....|++++.+|.++|||++||+++
T Consensus 263 ~~tG~~~W~~~~~~~~~~~~------------------~~~s~p~~~~~~~~~g~~~~~V~~g~~~G~l~ald~~tG~~~ 324 (488)
T cd00216 263 ADTGKVKWFYQTTPHDLWDY------------------DGPNQPSLADIKPKDGKPVPAIVHAPKNGFFYVLDRTTGKLI 324 (488)
T ss_pred CCCCCEEEEeeCCCCCCccc------------------ccCCCCeEEeccccCCCeeEEEEEECCCceEEEEECCCCcEe
Confidence 99999999999876655522 233567776543 44443467899999999999999999999
Q ss_pred eeeccCCCCCCCCcccceeeeCCeEEEEecCcccccc-ccCCCCCCCCCceEEEEECCCCcEEeeecCCC--------CC
Q 040693 229 WSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNF-NLKPSKNSTIAGGWVAMDASNGNVLWSTADPS--------NG 299 (382)
Q Consensus 229 W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~-~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~--------~~ 299 (382)
|+++... .....+.+.||+.......... ...........+.|+|||++||+++|+++... ..
T Consensus 325 W~~~~~~--------~~~~~~~~~vyv~~~~~~~~~~~~~~~~~~~~~~G~l~AlD~~tG~~~W~~~~~~~~~~~~~g~~ 396 (488)
T cd00216 325 SARPEVE--------QPMAYDPGLVYLGAFHIPLGLPPQKKKRCKKPGKGGLAALDPKTGKVVWEKREGTIRDSWNIGFP 396 (488)
T ss_pred eEeEeec--------cccccCCceEEEccccccccCcccccCCCCCCCceEEEEEeCCCCcEeeEeeCCccccccccCCc
Confidence 9988531 1122234788886432110000 00000112346889999999999999998762 12
Q ss_pred CCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceE--EeCCEEEEEeCceeEe----------ecC
Q 040693 300 TAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGAS--VSNGCIYMGNGYKVTV----------GFG 367 (382)
Q Consensus 300 ~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~--~~~g~lyv~~~~g~~~----------~~~ 367 (382)
...+++++.+++||+++. +|.|+|||.+||+++|+++++..+.++|+ +.++++||.+..|... +++
T Consensus 397 ~~~~~~~~~g~~v~~g~~--dG~l~ald~~tG~~lW~~~~~~~~~a~P~~~~~~g~~yv~~~~g~~~~~~~~~~~~~~~~ 474 (488)
T cd00216 397 HWGGSLATAGNLVFAGAA--DGYFRAFDATTGKELWKFRTPSGIQATPMTYEVNGKQYVGVMVGGGGSFPTGMGGVAKLD 474 (488)
T ss_pred ccCcceEecCCeEEEECC--CCeEEEEECCCCceeeEEECCCCceEcCEEEEeCCEEEEEEEecCCccccccccccchhc
Confidence 234566788899999985 99999999999999999999999999998 5699999998776411 112
Q ss_pred CccCCCCCeEEEEEC
Q 040693 368 NKNFTSGTSLYAFCV 382 (382)
Q Consensus 368 ~~~~~~g~~l~~~~~ 382 (382)
..- .-|..|.+|+|
T Consensus 475 ~~~-~~~~~~~~~~l 488 (488)
T cd00216 475 RWT-AMGGYIIAFSL 488 (488)
T ss_pred ccC-CCCCEEEEEEC
Confidence 222 35899999986
No 3
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=100.00 E-value=1.2e-33 Score=291.75 Aligned_cols=339 Identities=23% Similarity=0.308 Sum_probs=247.2
Q ss_pred CCcCCCCceeeeeecCc----------Cc--cceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCcee
Q 040693 1 AVKRSNGKLVWKTKLDD----------HA--RSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRIL 68 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~----------~~--~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~l 68 (382)
|||++|||++|++.... .+ ...+.++|++.+++||++....|+ ..+|.+.|.|+|||++||+++
T Consensus 274 ALDA~TGk~~W~fg~~G~vdl~~~~g~~~~g~~~~ts~P~V~~g~VIvG~~v~d~----~~~~~~~G~I~A~Da~TGkl~ 349 (764)
T TIGR03074 274 ALDADTGKLCEDFGNNGTVDLTAGMGTTPPGYYYPTSPPLVAGTTVVIGGRVADN----YSTDEPSGVIRAFDVNTGALV 349 (764)
T ss_pred EEECCCCCEEEEecCCCceeeecccCcCCCcccccccCCEEECCEEEEEeccccc----ccccCCCcEEEEEECCCCcEe
Confidence 68999999999875321 11 123678999999999998765443 234456899999999999999
Q ss_pred eeeeccCCCCCC---C-CC--CcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCc
Q 040693 69 WQTFMLPDNFGK---L-NE--YAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHS 142 (382)
Q Consensus 69 W~~~~~~~~~~~---~-~~--~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (382)
|+.+........ . .. .++++.|. .+++|++.+++|+++++.. |+.+ ....+..++.+.
T Consensus 350 W~~~~g~p~~~~~~~~g~~~~~gg~n~W~-~~s~D~~~glvy~ptGn~~--pd~~-------------g~~r~~~~n~y~ 413 (764)
T TIGR03074 350 WAWDPGNPDPTAPPAPGETYTRNTPNSWS-VASYDEKLGLVYLPMGNQT--PDQW-------------GGDRTPADEKYS 413 (764)
T ss_pred eEEecCCCCcccCCCCCCEeccCCCCccC-ceEEcCCCCeEEEeCCCcc--cccc-------------CCccccCccccc
Confidence 999875322111 1 11 25778886 6899999999999999831 1111 011124567788
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeee-CceeecEEEEEccCcEEEEEe
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYR-NKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~-~g~~~~~v~~~~~~g~l~ald 221 (382)
+.|+|||++|||++|.++..+++.| +++++++|+++++.. +|.....|+.++++|.+++||
T Consensus 414 ~slvALD~~TGk~~W~~Q~~~hD~W------------------D~D~~~~p~L~d~~~~~G~~~~~v~~~~K~G~~~vlD 475 (764)
T TIGR03074 414 SSLVALDATTGKERWVFQTVHHDLW------------------DMDVPAQPSLVDLPDADGTTVPALVAPTKQGQIYVLD 475 (764)
T ss_pred ceEEEEeCCCCceEEEecccCCccc------------------cccccCCceEEeeecCCCcEeeEEEEECCCCEEEEEE
Confidence 9999999999999999999999999 677889999998865 665677899999999999999
Q ss_pred CCCCCeeeeeccCC------------------------------------------------------------------
Q 040693 222 RDSGSLIWSMEAGP------------------------------------------------------------------ 235 (382)
Q Consensus 222 ~~tG~~~W~~~~~~------------------------------------------------------------------ 235 (382)
++|||++|..+.-+
T Consensus 476 r~tG~~l~~~~e~~vp~~~~~ge~~sptQp~~~~~~~~~~~~~~d~~g~t~~dq~~cr~~~~~~~~~g~~tPps~~~~~~ 555 (764)
T TIGR03074 476 RRTGEPIVPVEEVPVPQGAVPGERYSPTQPFSVLTFGPPTLTESDMWGATPFDQLACRIQFKSLRYEGLYTPPSEQGSLV 555 (764)
T ss_pred CCCCCEEeeceeecCCccCCCCccccccccccccccCCcccchhhccCCChhHhhhhhhhhcccccCCCcCCCCCCceEE
Confidence 99999999854210
Q ss_pred -CCCCCCcccceee---eCCeEEEEecCccc-------------------------------ccccc------CCC-CCC
Q 040693 236 -GGLGGGAMWGAAT---DERRIYTNIANSQH-------------------------------KNFNL------KPS-KNS 273 (382)
Q Consensus 236 -~~~~g~~~~~~~~---~~~~v~~~~~~~~~-------------------------------~~~~~------~~~-~~~ 273 (382)
+...|+.+|++.. +.+++|+...+... ..|.. .|. ..+
T Consensus 556 ~Pg~~Gg~nW~~~a~dP~~g~~yv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~py~~~~~~~~~~~g~p~ 635 (764)
T TIGR03074 556 FPGNLGGFNWGGVAVDPTRQVMFVNPMRLPFVSQLVPRAPGDEAPSGAKGKGTEMGLNPNKGTPYAVNMNPFLSPLGIPC 635 (764)
T ss_pred ecCCcccCCCCCceECCCCCEEEEEChhcceeeEeeeccccccccccccccccccccccCCCCcceeecccccCcccCCC
Confidence 1122444555443 45667766443100 00110 000 011
Q ss_pred --CCCceEEEEECCCCcEEeeecCCC------------------CCCCCcceEEeCCEEEE-eeecCCCcEEEEeCCCCc
Q 040693 274 --TIAGGWVAMDASNGNVLWSTADPS------------------NGTAPGPVTVANGVLFG-GSTYRQGPIYAMDVKTGK 332 (382)
Q Consensus 274 --~~~g~v~a~d~~tG~~~W~~~~~~------------------~~~~~~~~~~~~~~v~~-~~~~~~g~l~~ld~~tG~ 332 (382)
..-|.|.|+|++|||++|+.+... .....+++..++++||+ ++ .++.|+|||.+|||
T Consensus 636 ~~pp~G~l~AiDl~tGk~~W~~~~g~~~~~~p~~~~~~~~~~~g~p~~gG~l~TagglvF~~gt--~d~~l~A~D~~tGk 713 (764)
T TIGR03074 636 QAPPWGYMAAIDLKTGKVVWQHPNGTVRDTGPMGIRMPLPIPIGVPTLGGPLATAGGLVFIGAT--QDNYLRAYDLSTGK 713 (764)
T ss_pred CCCCcEEEEEEECCCCcEeeeeECCccccccccccccccccccCCcccCCcEEEcCCEEEEEeC--CCCEEEEEECCCCc
Confidence 134889999999999999998841 12346678888999998 56 48999999999999
Q ss_pred EeEEEecCCceecceEEe---CCEEEEEeCceeEeecCCccCCCCCeEEEEEC
Q 040693 333 ILWSYDTGATIYGGASVS---NGCIYMGNGYKVTVGFGNKNFTSGTSLYAFCV 382 (382)
Q Consensus 333 ilw~~~~~~~~~~~p~~~---~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~~ 382 (382)
++|+.+++.+..++|+.+ +|+.||.-..|... .+..+.|..|++|.|
T Consensus 714 ~lW~~~l~~~~~a~P~tY~~~~GkQYVvi~aGg~~---~~~~~~Gd~~~afaL 763 (764)
T TIGR03074 714 ELWKARLPAGGQATPMTYMGKDGKQYVVIVAGGHG---SSGTKRGDYVIAYAL 763 (764)
T ss_pred eeeEeeCCCCcccCCEEEEecCCEEEEEEEeCCCc---cCCCCCCCEEEEEeC
Confidence 999999999999998764 69999986654311 256677999999986
No 4
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=100.00 E-value=2e-33 Score=275.32 Aligned_cols=267 Identities=22% Similarity=0.327 Sum_probs=209.0
Q ss_pred CCcCCCCceeeeeecCcCc-------cceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeec
Q 040693 1 AVKRSNGKLVWKTKLDDHA-------RSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFM 73 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~~-------~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~ 73 (382)
|||++|||++|++++.... +..+.++|++.+++||++.. ++.|+|||++||+++|++++
T Consensus 83 ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~~~~v~v~~~--------------~g~l~ald~~tG~~~W~~~~ 148 (394)
T PRK11138 83 ALDADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVAGGKVYIGSE--------------KGQVYALNAEDGEVAWQTKV 148 (394)
T ss_pred EEECCCCcEeeEEcCCCcccccccccccccccccEEECCEEEEEcC--------------CCEEEEEECCCCCCcccccC
Confidence 6899999999999886421 11234578999999999885 88999999999999999987
Q ss_pred cCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCC
Q 040693 74 LPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTG 153 (382)
Q Consensus 74 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG 153 (382)
.... .++|++. ++.||+.+++ +.|+|||++||
T Consensus 149 ~~~~-------------~ssP~v~--~~~v~v~~~~---------------------------------g~l~ald~~tG 180 (394)
T PRK11138 149 AGEA-------------LSRPVVS--DGLVLVHTSN---------------------------------GMLQALNESDG 180 (394)
T ss_pred CCce-------------ecCCEEE--CCEEEEECCC---------------------------------CEEEEEEccCC
Confidence 4221 1357776 3688887764 89999999999
Q ss_pred cEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeecc
Q 040693 154 KIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEA 233 (382)
Q Consensus 154 ~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~ 233 (382)
+++|+++....... ....++|++. ++.++++..++.++++|.++|+++|+++.
T Consensus 181 ~~~W~~~~~~~~~~-------------------~~~~~sP~v~--------~~~v~~~~~~g~v~a~d~~~G~~~W~~~~ 233 (394)
T PRK11138 181 AVKWTVNLDVPSLT-------------------LRGESAPATA--------FGGAIVGGDNGRVSAVLMEQGQLIWQQRI 233 (394)
T ss_pred CEeeeecCCCCccc-------------------ccCCCCCEEE--------CCEEEEEcCCCEEEEEEccCChhhheecc
Confidence 99999986532100 0012578776 56788899999999999999999999875
Q ss_pred CCCCCC------CCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEE
Q 040693 234 GPGGLG------GGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTV 307 (382)
Q Consensus 234 ~~~~~~------g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~ 307 (382)
..+... ......|.+.++.||+. ...+.++|+|+++|+++|+.+... ...+.+
T Consensus 234 ~~~~~~~~~~~~~~~~~sP~v~~~~vy~~-----------------~~~g~l~ald~~tG~~~W~~~~~~----~~~~~~ 292 (394)
T PRK11138 234 SQPTGATEIDRLVDVDTTPVVVGGVVYAL-----------------AYNGNLVALDLRSGQIVWKREYGS----VNDFAV 292 (394)
T ss_pred ccCCCccchhcccccCCCcEEECCEEEEE-----------------EcCCeEEEEECCCCCEEEeecCCC----ccCcEE
Confidence 432100 01234566788999987 345889999999999999998653 123457
Q ss_pred eCCEEEEeeecCCCcEEEEeCCCCcEeEEEec-CCceecceEEeCCEEEEEeCceeEeecCCccCCCCCeEEEEEC
Q 040693 308 ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDT-GATIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFCV 382 (382)
Q Consensus 308 ~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~-~~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~~ 382 (382)
.+++||+.+. +|.|+|||+++|+++|+.+. .....++|++.+++||+.+.+|. ++++|..+|+.+|.+.+
T Consensus 293 ~~~~vy~~~~--~g~l~ald~~tG~~~W~~~~~~~~~~~sp~v~~g~l~v~~~~G~---l~~ld~~tG~~~~~~~~ 363 (394)
T PRK11138 293 DGGRIYLVDQ--NDRVYALDTRGGVELWSQSDLLHRLLTAPVLYNGYLVVGDSEGY---LHWINREDGRFVAQQKV 363 (394)
T ss_pred ECCEEEEEcC--CCeEEEEECCCCcEEEcccccCCCcccCCEEECCEEEEEeCCCE---EEEEECCCCCEEEEEEc
Confidence 8899999985 99999999999999998864 34567899999999999999886 67899999999999864
No 5
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=100.00 E-value=1.2e-31 Score=261.27 Aligned_cols=263 Identities=22% Similarity=0.322 Sum_probs=206.7
Q ss_pred CCcCCCCceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCC
Q 040693 1 AVKRSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGK 80 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~ 80 (382)
|||++||+++|+++++. .+.++|++.++++|+++. ++.|+|||++||+++|+.++....
T Consensus 79 a~d~~tG~~~W~~~~~~----~~~~~p~v~~~~v~v~~~--------------~g~l~ald~~tG~~~W~~~~~~~~--- 137 (377)
T TIGR03300 79 ALDAETGKRLWRVDLDE----RLSGGVGADGGLVFVGTE--------------KGEVIALDAEDGKELWRAKLSSEV--- 137 (377)
T ss_pred EEEccCCcEeeeecCCC----CcccceEEcCCEEEEEcC--------------CCEEEEEECCCCcEeeeeccCcee---
Confidence 68999999999999864 356789999999999885 889999999999999998874221
Q ss_pred CCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEe
Q 040693 81 LNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQ 160 (382)
Q Consensus 81 ~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~ 160 (382)
.++|.+. ++.+|+...+ +.|+|+|++||+++|+++
T Consensus 138 ----------~~~p~v~--~~~v~v~~~~---------------------------------g~l~a~d~~tG~~~W~~~ 172 (377)
T TIGR03300 138 ----------LSPPLVA--NGLVVVRTND---------------------------------GRLTALDAATGERLWTYS 172 (377)
T ss_pred ----------ecCCEEE--CCEEEEECCC---------------------------------CeEEEEEcCCCceeeEEc
Confidence 1356665 3578887654 889999999999999998
Q ss_pred cCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCC-
Q 040693 161 LGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLG- 239 (382)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~- 239 (382)
....... ....++|++. ++.++++..++.++++|+++|+.+|+.+...+...
T Consensus 173 ~~~~~~~-------------------~~~~~sp~~~--------~~~v~~~~~~g~v~ald~~tG~~~W~~~~~~~~g~~ 225 (377)
T TIGR03300 173 RVTPALT-------------------LRGSASPVIA--------DGGVLVGFAGGKLVALDLQTGQPLWEQRVALPKGRT 225 (377)
T ss_pred cCCCcee-------------------ecCCCCCEEE--------CCEEEEECCCCEEEEEEccCCCEeeeeccccCCCCC
Confidence 7543110 0112567765 56788888899999999999999999875421100
Q ss_pred -----CCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEE
Q 040693 240 -----GGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFG 314 (382)
Q Consensus 240 -----g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~ 314 (382)
......+.+.++.+|+. ...+.++++|+++|+++|+.+... ...+.+.++.||+
T Consensus 226 ~~~~~~~~~~~p~~~~~~vy~~-----------------~~~g~l~a~d~~tG~~~W~~~~~~----~~~p~~~~~~vyv 284 (377)
T TIGR03300 226 ELERLVDVDGDPVVDGGQVYAV-----------------SYQGRVAALDLRSGRVLWKRDASS----YQGPAVDDNRLYV 284 (377)
T ss_pred chhhhhccCCccEEECCEEEEE-----------------EcCCEEEEEECCCCcEEEeeccCC----ccCceEeCCEEEE
Confidence 01123455678899987 345889999999999999998532 2344577899999
Q ss_pred eeecCCCcEEEEeCCCCcEeEEE-ecCCceecceEEeCCEEEEEeCceeEeecCCccCCCCCeEEEEEC
Q 040693 315 GSTYRQGPIYAMDVKTGKILWSY-DTGATIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFCV 382 (382)
Q Consensus 315 ~~~~~~g~l~~ld~~tG~ilw~~-~~~~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~~ 382 (382)
.+. +|.|+++|.++|+++|+. ..+....++|++.+++||+.+.+|. +|++|..+|+++|.+.+
T Consensus 285 ~~~--~G~l~~~d~~tG~~~W~~~~~~~~~~ssp~i~g~~l~~~~~~G~---l~~~d~~tG~~~~~~~~ 348 (377)
T TIGR03300 285 TDA--DGVVVALDRRSGSELWKNDELKYRQLTAPAVVGGYLVVGDFEGY---LHWLSREDGSFVARLKT 348 (377)
T ss_pred ECC--CCeEEEEECCCCcEEEccccccCCccccCEEECCEEEEEeCCCE---EEEEECCCCCEEEEEEc
Confidence 884 899999999999999998 5566678899999999999998887 68899999999999863
No 6
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=100.00 E-value=2.5e-31 Score=260.53 Aligned_cols=268 Identities=19% Similarity=0.350 Sum_probs=203.2
Q ss_pred CCCceeeeeecCcCcc-ceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCC
Q 040693 5 SNGKLVWKTKLDDHAR-SFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNE 83 (382)
Q Consensus 5 ~tGk~~W~~~~~~~~~-~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~ 83 (382)
-++|++|++++++... ....++|++.+++||++.. ++.|+|||++||+++|++++..... .
T Consensus 42 ~~~~~~W~~~~g~g~~~~~~~~sPvv~~~~vy~~~~--------------~g~l~ald~~tG~~~W~~~~~~~~~----~ 103 (394)
T PRK11138 42 FTPTTVWSTSVGDGVGDYYSRLHPAVAYNKVYAADR--------------AGLVKALDADTGKEIWSVDLSEKDG----W 103 (394)
T ss_pred CCcceeeEEEcCCCCccceeeeccEEECCEEEEECC--------------CCeEEEEECCCCcEeeEEcCCCccc----c
Confidence 4689999999865322 1245689999999999886 7899999999999999998854210 0
Q ss_pred CcC--ccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEec
Q 040693 84 YAG--AAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQL 161 (382)
Q Consensus 84 ~~g--~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~ 161 (382)
++. +....+.|+++ ++.||+++.+ +.|+|||++||+++|+++.
T Consensus 104 ~~~~~~~~~~~~~~v~--~~~v~v~~~~---------------------------------g~l~ald~~tG~~~W~~~~ 148 (394)
T PRK11138 104 FSKNKSALLSGGVTVA--GGKVYIGSEK---------------------------------GQVYALNAEDGEVAWQTKV 148 (394)
T ss_pred cccccccccccccEEE--CCEEEEEcCC---------------------------------CEEEEEECCCCCCcccccC
Confidence 000 01112245555 4688987654 8999999999999999987
Q ss_pred CCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCC
Q 040693 162 GGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGG 241 (382)
Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~ 241 (382)
... +.++|++. ++.|++.+.++.++|||++||+++|+++...+.....
T Consensus 149 ~~~------------------------~~ssP~v~--------~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~~~~ 196 (394)
T PRK11138 149 AGE------------------------ALSRPVVS--------DGLVLVHTSNGMLQALNESDGAVKWTVNLDVPSLTLR 196 (394)
T ss_pred CCc------------------------eecCCEEE--------CCEEEEECCCCEEEEEEccCCCEeeeecCCCCccccc
Confidence 643 23678887 6789999999999999999999999998653211111
Q ss_pred cccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCC---------CCCcceEEeCCEE
Q 040693 242 AMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNG---------TAPGPVTVANGVL 312 (382)
Q Consensus 242 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~---------~~~~~~~~~~~~v 312 (382)
....|++.++.+|+. ...+.++++|+++|+++|+.+..... .....+.+.++.|
T Consensus 197 ~~~sP~v~~~~v~~~-----------------~~~g~v~a~d~~~G~~~W~~~~~~~~~~~~~~~~~~~~~sP~v~~~~v 259 (394)
T PRK11138 197 GESAPATAFGGAIVG-----------------GDNGRVSAVLMEQGQLIWQQRISQPTGATEIDRLVDVDTTPVVVGGVV 259 (394)
T ss_pred CCCCCEEECCEEEEE-----------------cCCCEEEEEEccCChhhheeccccCCCccchhcccccCCCcEEECCEE
Confidence 224566677888887 34588999999999999998753311 0123344678899
Q ss_pred EEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeCceeEeecCCccCCCCCeEEEEE
Q 040693 313 FGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 313 ~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
|+.+. +|.++|+|+++|+++|+.+.+. ...|++.+++||+.+.+|. ++++|..+|+++|.+.
T Consensus 260 y~~~~--~g~l~ald~~tG~~~W~~~~~~--~~~~~~~~~~vy~~~~~g~---l~ald~~tG~~~W~~~ 321 (394)
T PRK11138 260 YALAY--NGNLVALDLRSGQIVWKREYGS--VNDFAVDGGRIYLVDQNDR---VYALDTRGGVELWSQS 321 (394)
T ss_pred EEEEc--CCeEEEEECCCCCEEEeecCCC--ccCcEEECCEEEEEcCCCe---EEEEECCCCcEEEccc
Confidence 99884 8999999999999999988653 3467889999999998876 6889999999999763
No 7
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=100.00 E-value=2e-30 Score=252.82 Aligned_cols=258 Identities=23% Similarity=0.419 Sum_probs=201.1
Q ss_pred CCCceeeeeecCcCcc-ceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCC
Q 040693 5 SNGKLVWKTKLDDHAR-SFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNE 83 (382)
Q Consensus 5 ~tGk~~W~~~~~~~~~-~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~ 83 (382)
.+|+++|+++++.... .....+|++.+++||++.. ++.|+|||++||+++|++++.....
T Consensus 38 ~~~~~~W~~~~~~~~~~~~~~~~p~v~~~~v~v~~~--------------~g~v~a~d~~tG~~~W~~~~~~~~~----- 98 (377)
T TIGR03300 38 VKVDQVWSASVGDGVGHYYLRLQPAVAGGKVYAADA--------------DGTVVALDAETGKRLWRVDLDERLS----- 98 (377)
T ss_pred CcceeeeEEEcCCCcCccccccceEEECCEEEEECC--------------CCeEEEEEccCCcEeeeecCCCCcc-----
Confidence 5799999999865321 1245789999999999986 7899999999999999999853221
Q ss_pred CcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCC
Q 040693 84 YAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGG 163 (382)
Q Consensus 84 ~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~ 163 (382)
..|+++ ++.+|+++.+ +.|+|||++||+++|+.....
T Consensus 99 --------~~p~v~--~~~v~v~~~~---------------------------------g~l~ald~~tG~~~W~~~~~~ 135 (377)
T TIGR03300 99 --------GGVGAD--GGLVFVGTEK---------------------------------GEVIALDAEDGKELWRAKLSS 135 (377)
T ss_pred --------cceEEc--CCEEEEEcCC---------------------------------CEEEEEECCCCcEeeeeccCc
Confidence 246665 5689987754 899999999999999987654
Q ss_pred CcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcc
Q 040693 164 YDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAM 243 (382)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~ 243 (382)
. +.++|++. ++.|++...++.|+++|.++|+++|+++...+.......
T Consensus 136 ~------------------------~~~~p~v~--------~~~v~v~~~~g~l~a~d~~tG~~~W~~~~~~~~~~~~~~ 183 (377)
T TIGR03300 136 E------------------------VLSPPLVA--------NGLVVVRTNDGRLTALDAATGERLWTYSRVTPALTLRGS 183 (377)
T ss_pred e------------------------eecCCEEE--------CCEEEEECCCCeEEEEEcCCCceeeEEccCCCceeecCC
Confidence 3 22567775 678888888999999999999999999875432111112
Q ss_pred cceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCC---------CCCcceEEeCCEEEE
Q 040693 244 WGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNG---------TAPGPVTVANGVLFG 314 (382)
Q Consensus 244 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~---------~~~~~~~~~~~~v~~ 314 (382)
..+.+.++.+|+. ...+.++++|+++|+++|+.+..... ...+.+.+.++.+|+
T Consensus 184 ~sp~~~~~~v~~~-----------------~~~g~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~~~~~vy~ 246 (377)
T TIGR03300 184 ASPVIADGGVLVG-----------------FAGGKLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVVDGGQVYA 246 (377)
T ss_pred CCCEEECCEEEEE-----------------CCCCEEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEEECCEEEE
Confidence 3455566788886 34588999999999999997753211 012334567889999
Q ss_pred eeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeCceeEeecCCccCCCCCeEEEE
Q 040693 315 GSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 315 ~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
.+. +|.++++|+++|+++|+.+.+ ...+|++.+++||+.+.+|. ++++|..+|+++|.+
T Consensus 247 ~~~--~g~l~a~d~~tG~~~W~~~~~--~~~~p~~~~~~vyv~~~~G~---l~~~d~~tG~~~W~~ 305 (377)
T TIGR03300 247 VSY--QGRVAALDLRSGRVLWKRDAS--SYQGPAVDDNRLYVTDADGV---VVALDRRSGSELWKN 305 (377)
T ss_pred EEc--CCEEEEEECCCCcEEEeeccC--CccCceEeCCEEEEECCCCe---EEEEECCCCcEEEcc
Confidence 885 899999999999999999854 35788899999999998877 688999999999987
No 8
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=99.98 E-value=2.7e-30 Score=259.22 Aligned_cols=300 Identities=22% Similarity=0.340 Sum_probs=196.3
Q ss_pred CCcCCCCceeeeeecCcCc----cceeeeceEEEc-CEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccC
Q 040693 1 AVKRSNGKLVWKTKLDDHA----RSFITMSGTYYK-GAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLP 75 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~~----~~~~~~~p~v~~-~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~ 75 (382)
|||++|||++|+++..... ...+...+.+.+ ++||+++. ++.|+|||++||+++|++++..
T Consensus 75 AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~v~~~--------------~g~v~AlD~~TG~~~W~~~~~~ 140 (488)
T cd00216 75 ALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVFFGTF--------------DGRLVALDAETGKQVWKFGNND 140 (488)
T ss_pred EEECCCChhhceeCCCCCccccccccccCCcEEccCCeEEEecC--------------CCeEEEEECCCCCEeeeecCCC
Confidence 6899999999999875420 011222334557 99999886 8999999999999999999854
Q ss_pred CCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcE
Q 040693 76 DNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKI 155 (382)
Q Consensus 76 ~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~ 155 (382)
.... .....++|.+.. +.+|+++.... ...+...+.|+|||++||++
T Consensus 141 ~~~~-------~~~i~ssP~v~~--~~v~vg~~~~~------------------------~~~~~~~g~v~alD~~TG~~ 187 (488)
T cd00216 141 QVPP-------GYTMTGAPTIVK--KLVIIGSSGAE------------------------FFACGVRGALRAYDVETGKL 187 (488)
T ss_pred CcCc-------ceEecCCCEEEC--CEEEEeccccc------------------------cccCCCCcEEEEEECCCCce
Confidence 3100 001124677764 68888764310 00012358999999999999
Q ss_pred EEEEecCCCcccccccccCCCC-CCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCc------------------E
Q 040693 156 VWYKQLGGYDVWFGACNWYLNP-NCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSG------------------F 216 (382)
Q Consensus 156 ~W~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g------------------~ 216 (382)
+|+++........... +...+ .|... ...+.++|++. ..++.|++++.++ .
T Consensus 188 ~W~~~~~~~~~~~~~~-~~~~~~~~~~~---g~~vw~~pa~d------~~~g~V~vg~~~g~~~~~~~~~~~~~~~~~~~ 257 (488)
T cd00216 188 LWRFYTTEPDPNAFPT-WGPDRQMWGPG---GGTSWASPTYD------PKTNLVYVGTGNGSPWNWGGRRTPGDNLYTDS 257 (488)
T ss_pred eeEeeccCCCcCCCCC-CCCCcceecCC---CCCccCCeeEe------CCCCEEEEECCCCCCCccCCccCCCCCCceee
Confidence 9999885331100000 00000 00000 01122455553 2267888887665 7
Q ss_pred EEEEeCCCCCeeeeeccCCCCCCC-Cccccee------eeCC---eEEEEecCccccccccCCCCCCCCCceEEEEECCC
Q 040693 217 AWALDRDSGSLIWSMEAGPGGLGG-GAMWGAA------TDER---RIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASN 286 (382)
Q Consensus 217 l~ald~~tG~~~W~~~~~~~~~~g-~~~~~~~------~~~~---~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~t 286 (382)
|+|||.+||+++|+++......-. .....+. ++++ .||++ ...|.++|||++|
T Consensus 258 l~Ald~~tG~~~W~~~~~~~~~~~~~~~s~p~~~~~~~~~g~~~~~V~~g-----------------~~~G~l~ald~~t 320 (488)
T cd00216 258 IVALDADTGKVKWFYQTTPHDLWDYDGPNQPSLADIKPKDGKPVPAIVHA-----------------PKNGFFYVLDRTT 320 (488)
T ss_pred EEEEcCCCCCEEEEeeCCCCCCcccccCCCCeEEeccccCCCeeEEEEEE-----------------CCCceEEEEECCC
Confidence 999999999999999865321100 0111111 1222 46665 4568899999999
Q ss_pred CcEEeeecCCCCCCCCcceEEeCCEEEEeee----------------cCCCcEEEEeCCCCcEeEEEecC---------C
Q 040693 287 GNVLWSTADPSNGTAPGPVTVANGVLFGGST----------------YRQGPIYAMDVKTGKILWSYDTG---------A 341 (382)
Q Consensus 287 G~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~----------------~~~g~l~~ld~~tG~ilw~~~~~---------~ 341 (382)
|+++|+.+..... +....++||+.+. ..+|.|+|||++||+++|+.+.+ .
T Consensus 321 G~~~W~~~~~~~~-----~~~~~~~vyv~~~~~~~~~~~~~~~~~~~~~~G~l~AlD~~tG~~~W~~~~~~~~~~~~~g~ 395 (488)
T cd00216 321 GKLISARPEVEQP-----MAYDPGLVYLGAFHIPLGLPPQKKKRCKKPGKGGLAALDPKTGKVVWEKREGTIRDSWNIGF 395 (488)
T ss_pred CcEeeEeEeeccc-----cccCCceEEEccccccccCcccccCCCCCCCceEEEEEeCCCCcEeeEeeCCccccccccCC
Confidence 9999998864211 1223377877531 12689999999999999999987 2
Q ss_pred cee-cceEEeCCEEEEEeCceeEeecCCccCCCCCeEEEEEC
Q 040693 342 TIY-GGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFCV 382 (382)
Q Consensus 342 ~~~-~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~~ 382 (382)
... +++++.++.||+++.+|. +|++|.+||+++|.+.+
T Consensus 396 ~~~~~~~~~~g~~v~~g~~dG~---l~ald~~tG~~lW~~~~ 434 (488)
T cd00216 396 PHWGGSLATAGNLVFAGAADGY---FRAFDATTGKELWKFRT 434 (488)
T ss_pred cccCcceEecCCeEEEECCCCe---EEEEECCCCceeeEEEC
Confidence 333 456789999999998876 79999999999999975
No 9
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family are often found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three-subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=99.96 E-value=4.6e-27 Score=236.64 Aligned_cols=296 Identities=26% Similarity=0.418 Sum_probs=189.3
Q ss_pred CCCCceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCC
Q 040693 4 RSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNE 83 (382)
Q Consensus 4 ~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~ 83 (382)
.++.++.|+++++.. ..+.++|++.+++||+++. .+.|+|||++||+++|+++........+ .
T Consensus 44 v~~L~~~W~~~~g~~--~g~~stPvv~~g~vyv~s~--------------~g~v~AlDa~TGk~lW~~~~~~~~~~~~-~ 106 (527)
T TIGR03075 44 VKKLQPAWTFSLGKL--RGQESQPLVVDGVMYVTTS--------------YSRVYALDAKTGKELWKYDPKLPDDVIP-V 106 (527)
T ss_pred hccceEEEEEECCCC--CCcccCCEEECCEEEEECC--------------CCcEEEEECCCCceeeEecCCCCccccc-c
Confidence 356889999998532 2367899999999999885 7899999999999999998743211000 0
Q ss_pred CcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCC
Q 040693 84 YAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGG 163 (382)
Q Consensus 84 ~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~ 163 (382)
.... .....+++. ++.||+++.+ +.|+|||++|||++|+++...
T Consensus 107 ~~~~-~~~rg~av~--~~~v~v~t~d---------------------------------g~l~ALDa~TGk~~W~~~~~~ 150 (527)
T TIGR03075 107 MCCD-VVNRGVALY--DGKVFFGTLD---------------------------------ARLVALDAKTGKVVWSKKNGD 150 (527)
T ss_pred cccc-cccccceEE--CCEEEEEcCC---------------------------------CEEEEEECCCCCEEeeccccc
Confidence 0000 000123443 3578887654 899999999999999998753
Q ss_pred CcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc------CcEEEEEeCCCCCeeeeeccCCCC
Q 040693 164 YDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK------SGFAWALDRDSGSLIWSMEAGPGG 237 (382)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~------~g~l~ald~~tG~~~W~~~~~~~~ 237 (382)
... .+.+.++|++. ++.|+++.. +|.|+|||++||+++|++...+..
T Consensus 151 ~~~-------------------~~~~tssP~v~--------~g~Vivg~~~~~~~~~G~v~AlD~~TG~~lW~~~~~p~~ 203 (527)
T TIGR03075 151 YKA-------------------GYTITAAPLVV--------KGKVITGISGGEFGVRGYVTAYDAKTGKLVWRRYTVPGD 203 (527)
T ss_pred ccc-------------------cccccCCcEEE--------CCEEEEeecccccCCCcEEEEEECCCCceeEeccCcCCC
Confidence 310 23356789888 567777643 689999999999999998765321
Q ss_pred --------------------------CCCCcccceee-e--CCeEEEEecCccccccccCCCCC----------------
Q 040693 238 --------------------------LGGGAMWGAAT-D--ERRIYTNIANSQHKNFNLKPSKN---------------- 272 (382)
Q Consensus 238 --------------------------~~g~~~~~~~~-~--~~~v~~~~~~~~~~~~~~~~~~~---------------- 272 (382)
..+...|.... | .++||+.+.+...-....+|+-+
T Consensus 204 ~~~~~~~~~~~~~~~~~~tw~~~~~~~gg~~~W~~~s~D~~~~lvy~~tGnp~p~~~~~r~gdnl~~~s~vAld~~TG~~ 283 (527)
T TIGR03075 204 MGYLDKADKPVGGEPGAKTWPGDAWKTGGGATWGTGSYDPETNLIYFGTGNPSPWNSHLRPGDNLYTSSIVARDPDTGKI 283 (527)
T ss_pred cccccccccccccccccCCCCCCccccCCCCccCceeEcCCCCeEEEeCCCCCCCCCCCCCCCCccceeEEEEccccCCE
Confidence 12445666544 3 67999998773220001111110
Q ss_pred -------------------------------------CCCCceEEEEECCCCcEEeee----------cCC--CC-CC--
Q 040693 273 -------------------------------------STIAGGWVAMDASNGNVLWST----------ADP--SN-GT-- 300 (382)
Q Consensus 273 -------------------------------------~~~~g~v~a~d~~tG~~~W~~----------~~~--~~-~~-- 300 (382)
....|.+++||.+|||++|.. .+. .. +.
T Consensus 284 ~W~~Q~~~~D~wD~d~~~~p~l~d~~~~G~~~~~v~~~~K~G~~~vlDr~tG~~i~~~~~~~~~~w~~~~~~~~g~p~~~ 363 (527)
T TIGR03075 284 KWHYQTTPHDEWDYDGVNEMILFDLKKDGKPRKLLAHADRNGFFYVLDRTNGKLLSAEPFVDTVNWATGVDLKTGRPIEV 363 (527)
T ss_pred EEeeeCCCCCCccccCCCCcEEEEeccCCcEEEEEEEeCCCceEEEEECCCCceeccccccCCcccccccCCCCCCCccC
Confidence 123344555555555554221 000 00 00
Q ss_pred ------------------------CCcceEEe--CCEEEEeeec-------------------------------CCCcE
Q 040693 301 ------------------------APGPVTVA--NGVLFGGSTY-------------------------------RQGPI 323 (382)
Q Consensus 301 ------------------------~~~~~~~~--~~~v~~~~~~-------------------------------~~g~l 323 (382)
.-.+.+++ .+++|+.+.. ..|.|
T Consensus 364 ~~~~~~~~~~~~~~~~~Pg~~Gg~~W~~~A~Dp~~g~~yvp~~~~~~~~~~~~~~~~~g~~~~~~~~~~~p~~~~~~g~l 443 (527)
T TIGR03075 364 PEARSADGKKGKPVGVCPGFLGGKNWQPMAYSPKTGLFYVPANEVCMDYEPEKVSYKKGAAYLGAGLTIKPPPDDHMGSL 443 (527)
T ss_pred hhhCcCCCCCCCeeEECCCCcCCCCCCCceECCCCCEEEEecccccccccccccccCCCCceeccccccCCCCCCCceeE
Confidence 00011222 2455555431 02469
Q ss_pred EEEeCCCCcEeEEEecCCceecceEE-eCCEEEEEeCceeEeecCCccCCCCCeEEEEEC
Q 040693 324 YAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFCV 382 (382)
Q Consensus 324 ~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~~ 382 (382)
.|||++|||++|+++.+....+++++ .++.+|+++.+|. ++++|.+||++||.+++
T Consensus 444 ~AiD~~tGk~~W~~~~~~p~~~~~l~t~g~lvf~g~~~G~---l~a~D~~TGe~lw~~~~ 500 (527)
T TIGR03075 444 IAWDPITGKIVWEHKEDFPLWGGVLATAGDLVFYGTLEGY---FKAFDAKTGEELWKFKT 500 (527)
T ss_pred EEEeCCCCceeeEecCCCCCCCcceEECCcEEEEECCCCe---EEEEECCCCCEeEEEeC
Confidence 99999999999999887777777755 5566667666766 79999999999999985
No 10
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=99.95 E-value=1.4e-25 Score=204.20 Aligned_cols=227 Identities=28% Similarity=0.481 Sum_probs=168.4
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
+|.|+|+|+.||+++|+..+.+... +. . +.++. .++.+|+.+.+
T Consensus 2 ~g~l~~~d~~tG~~~W~~~~~~~~~-------~~-~--~~~~~--~~~~v~~~~~~------------------------ 45 (238)
T PF13360_consen 2 DGTLSALDPRTGKELWSYDLGPGIG-------GP-V--ATAVP--DGGRVYVASGD------------------------ 45 (238)
T ss_dssp TSEEEEEETTTTEEEEEEECSSSCS-------SE-E--ETEEE--ETTEEEEEETT------------------------
T ss_pred CCEEEEEECCCCCEEEEEECCCCCC-------Cc-c--ceEEE--eCCEEEEEcCC------------------------
Confidence 6899999999999999998843221 10 0 01222 45789998654
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
+.|+|+|+.||+++|++...... ...|.+. ++.|++...
T Consensus 46 ---------~~l~~~d~~tG~~~W~~~~~~~~------------------------~~~~~~~--------~~~v~v~~~ 84 (238)
T PF13360_consen 46 ---------GNLYALDAKTGKVLWRFDLPGPI------------------------SGAPVVD--------GGRVYVGTS 84 (238)
T ss_dssp ---------SEEEEEETTTSEEEEEEECSSCG------------------------GSGEEEE--------TTEEEEEET
T ss_pred ---------CEEEEEECCCCCEEEEeeccccc------------------------cceeeec--------ccccccccc
Confidence 89999999999999999985431 1235444 788888888
Q ss_pred CcEEEEEeCCCCCeeeee-ccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEee
Q 040693 214 SGFAWALDRDSGSLIWSM-EAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWS 292 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~-~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~ 292 (382)
++.++++|.+||+++|+. ....+.........+.+.++.+|+.. ..+.|+++|++||+++|+
T Consensus 85 ~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~g~l~~~d~~tG~~~w~ 147 (238)
T PF13360_consen 85 DGSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGT-----------------SSGKLVALDPKTGKLLWK 147 (238)
T ss_dssp TSEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEE-----------------TCSEEEEEETTTTEEEEE
T ss_pred eeeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEe-----------------ccCcEEEEecCCCcEEEE
Confidence 889999999999999994 54322122222334455788888873 358899999999999999
Q ss_pred ecCCCCCCC---------CcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeCceeE
Q 040693 293 TADPSNGTA---------PGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNGYKVT 363 (382)
Q Consensus 293 ~~~~~~~~~---------~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~~ 363 (382)
......... .+.+++.++.+|+.+. ++.++++|.++|+.+|+.+ .......+...++.||+.+.++.
T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~--~g~~~~~d~~tg~~~w~~~-~~~~~~~~~~~~~~l~~~~~~~~- 223 (238)
T PF13360_consen 148 YPVGEPRGSSPISSFSDINGSPVISDGRVYVSSG--DGRVVAVDLATGEKLWSKP-ISGIYSLPSVDGGTLYVTSSDGR- 223 (238)
T ss_dssp EESSTT-SS--EEEETTEEEEEECCTTEEEEECC--TSSEEEEETTTTEEEEEEC-SS-ECECEECCCTEEEEEETTTE-
T ss_pred eecCCCCCCcceeeecccccceEEECCEEEEEcC--CCeEEEEECCCCCEEEEec-CCCccCCceeeCCEEEEEeCCCE-
Confidence 988542211 2445566779999885 7878999999999999777 55566668999999999997655
Q ss_pred eecCCccCCCCCeEEEE
Q 040693 364 VGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 364 ~~~~~~~~~~g~~l~~~ 380 (382)
++++|..||+.+|.+
T Consensus 224 --l~~~d~~tG~~~W~~ 238 (238)
T PF13360_consen 224 --LYALDLKTGKVVWQQ 238 (238)
T ss_dssp --EEEEETTTTEEEEEE
T ss_pred --EEEEECCCCCEEeEC
Confidence 688999999999986
No 11
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=99.94 E-value=3.7e-24 Score=194.81 Aligned_cols=225 Identities=29% Similarity=0.467 Sum_probs=160.4
Q ss_pred CCcCCCCceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCC
Q 040693 1 AVKRSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGK 80 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~ 80 (382)
|+|++||+++|+..+... .......++..++.+|++.. ++.|+|+|++||+++|++++.....
T Consensus 7 ~~d~~tG~~~W~~~~~~~-~~~~~~~~~~~~~~v~~~~~--------------~~~l~~~d~~tG~~~W~~~~~~~~~-- 69 (238)
T PF13360_consen 7 ALDPRTGKELWSYDLGPG-IGGPVATAVPDGGRVYVASG--------------DGNLYALDAKTGKVLWRFDLPGPIS-- 69 (238)
T ss_dssp EEETTTTEEEEEEECSSS-CSSEEETEEEETTEEEEEET--------------TSEEEEEETTTSEEEEEEECSSCGG--
T ss_pred EEECCCCCEEEEEECCCC-CCCccceEEEeCCEEEEEcC--------------CCEEEEEECCCCCEEEEeecccccc--
Confidence 689999999999998432 22233346679999999864 8899999999999999999832221
Q ss_pred CCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEE-
Q 040693 81 LNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYK- 159 (382)
Q Consensus 81 ~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~- 159 (382)
..|.+. ++.+|+.+.+ +.|++||.+||+++|+.
T Consensus 70 -----------~~~~~~--~~~v~v~~~~---------------------------------~~l~~~d~~tG~~~W~~~ 103 (238)
T PF13360_consen 70 -----------GAPVVD--GGRVYVGTSD---------------------------------GSLYALDAKTGKVLWSIY 103 (238)
T ss_dssp -----------SGEEEE--TTEEEEEETT---------------------------------SEEEEEETTTSCEEEEEE
T ss_pred -----------ceeeec--ccccccccce---------------------------------eeeEecccCCcceeeeec
Confidence 124444 4688887754 79999999999999995
Q ss_pred ecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCC
Q 040693 160 QLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLG 239 (382)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~ 239 (382)
....... .......+.+. ++.++++..++.++++|++||+++|+++...+...
T Consensus 104 ~~~~~~~-------------------~~~~~~~~~~~--------~~~~~~~~~~g~l~~~d~~tG~~~w~~~~~~~~~~ 156 (238)
T PF13360_consen 104 LTSSPPA-------------------GVRSSSSPAVD--------GDRLYVGTSSGKLVALDPKTGKLLWKYPVGEPRGS 156 (238)
T ss_dssp E-SSCTC-------------------STB--SEEEEE--------TTEEEEEETCSEEEEEETTTTEEEEEEESSTT-SS
T ss_pred ccccccc-------------------ccccccCceEe--------cCEEEEEeccCcEEEEecCCCcEEEEeecCCCCCC
Confidence 4442210 00011223332 67888888899999999999999999987432211
Q ss_pred C------CcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEE
Q 040693 240 G------GAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLF 313 (382)
Q Consensus 240 g------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~ 313 (382)
. .....+...++.+|+. ...+.++++|.++|+.+|+..... ....+...++.+|
T Consensus 157 ~~~~~~~~~~~~~~~~~~~v~~~-----------------~~~g~~~~~d~~tg~~~w~~~~~~---~~~~~~~~~~~l~ 216 (238)
T PF13360_consen 157 SPISSFSDINGSPVISDGRVYVS-----------------SGDGRVVAVDLATGEKLWSKPISG---IYSLPSVDGGTLY 216 (238)
T ss_dssp --EEEETTEEEEEECCTTEEEEE-----------------CCTSSEEEEETTTTEEEEEECSS----ECECEECCCTEEE
T ss_pred cceeeecccccceEEECCEEEEE-----------------cCCCeEEEEECCCCCEEEEecCCC---ccCCceeeCCEEE
Confidence 1 1123344466788887 334668999999999999777322 2332457789999
Q ss_pred EeeecCCCcEEEEeCCCCcEeEEE
Q 040693 314 GGSTYRQGPIYAMDVKTGKILWSY 337 (382)
Q Consensus 314 ~~~~~~~g~l~~ld~~tG~ilw~~ 337 (382)
+.+. ++.|+++|++||+++|+.
T Consensus 217 ~~~~--~~~l~~~d~~tG~~~W~~ 238 (238)
T PF13360_consen 217 VTSS--DGRLYALDLKTGKVVWQQ 238 (238)
T ss_dssp EEET--TTEEEEEETTTTEEEEEE
T ss_pred EEeC--CCEEEEEECCCCCEEeEC
Confidence 9884 899999999999999974
No 12
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=99.94 E-value=3.1e-24 Score=221.99 Aligned_cols=282 Identities=21% Similarity=0.282 Sum_probs=186.6
Q ss_pred cCCCCceeeeeecCcCcc------ceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCC
Q 040693 3 KRSNGKLVWKTKLDDHAR------SFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPD 76 (382)
Q Consensus 3 d~~tGk~~W~~~~~~~~~------~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~ 76 (382)
+.++.|+.|+++.++... ..+.++|++.+|+||+++. .+.|+|||++|||++|+++....
T Consensus 160 NV~~L~~aWt~~tGd~~~~~~~~~~~~e~TPlvvgg~lYv~t~--------------~~~V~ALDa~TGk~lW~~d~~~~ 225 (764)
T TIGR03074 160 NVGNLKVAWTYHTGDLKTPDDPGEATFQATPLKVGDTLYLCTP--------------HNKVIALDAATGKEKWKFDPKLK 225 (764)
T ss_pred cccCceEEEEEECCCccccccccccccccCCEEECCEEEEECC--------------CCeEEEEECCCCcEEEEEcCCCC
Confidence 345788999998764211 3467899999999999885 78999999999999999998533
Q ss_pred CCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEE
Q 040693 77 NFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIV 156 (382)
Q Consensus 77 ~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~ 156 (382)
.... +.....++..|....... ......++|..++++++..+.+++|+|||++|||++
T Consensus 226 ~~~~-------------~~~~~cRGvay~~~p~~~---------~~~~~~~~p~~~~~rV~~~T~Dg~LiALDA~TGk~~ 283 (764)
T TIGR03074 226 TEAG-------------RQHQTCRGVSYYDAPAAA---------AGPAAPAAPADCARRIILPTSDARLIALDADTGKLC 283 (764)
T ss_pred cccc-------------cccccccceEEecCCccc---------ccccccccccccCCEEEEecCCCeEEEEECCCCCEE
Confidence 2110 000112344444321100 001122457778888888888899999999999999
Q ss_pred EEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc----------CcEEEEEeCCCCC
Q 040693 157 WYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK----------SGFAWALDRDSGS 226 (382)
Q Consensus 157 W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~----------~g~l~ald~~tG~ 226 (382)
|++.....-.|. .... ......+.+.++|++. ++.|+++.. +|.|+|||++||+
T Consensus 284 W~fg~~G~vdl~-------~~~g-~~~~g~~~~ts~P~V~--------~g~VIvG~~v~d~~~~~~~~G~I~A~Da~TGk 347 (764)
T TIGR03074 284 EDFGNNGTVDLT-------AGMG-TTPPGYYYPTSPPLVA--------GTTVVIGGRVADNYSTDEPSGVIRAFDVNTGA 347 (764)
T ss_pred EEecCCCceeee-------cccC-cCCCcccccccCCEEE--------CCEEEEEecccccccccCCCcEEEEEECCCCc
Confidence 998764432121 0000 0111134567889988 677887742 6899999999999
Q ss_pred eeeeeccCCCC------------CCCCcccceee---eCCeEEEEecCccccccccCC-CCCCCCCceEEEEECCCCcEE
Q 040693 227 LIWSMEAGPGG------------LGGGAMWGAAT---DERRIYTNIANSQHKNFNLKP-SKNSTIAGGWVAMDASNGNVL 290 (382)
Q Consensus 227 ~~W~~~~~~~~------------~~g~~~~~~~~---~~~~v~~~~~~~~~~~~~~~~-~~~~~~~g~v~a~d~~tG~~~ 290 (382)
++|+++...+. ..+...|.... ..+++|+...+....-+.... .......+.|+|+|++|||++
T Consensus 348 l~W~~~~g~p~~~~~~~~g~~~~~gg~n~W~~~s~D~~~glvy~ptGn~~pd~~g~~r~~~~n~y~~slvALD~~TGk~~ 427 (764)
T TIGR03074 348 LVWAWDPGNPDPTAPPAPGETYTRNTPNSWSVASYDEKLGLVYLPMGNQTPDQWGGDRTPADEKYSSSLVALDATTGKER 427 (764)
T ss_pred EeeEEecCCCCcccCCCCCCEeccCCCCccCceEEcCCCCeEEEeCCCccccccCCccccCcccccceEEEEeCCCCceE
Confidence 99999864211 12334455444 468999988764432222211 123456789999999999999
Q ss_pred eeecCCCCC-----CCCcceEEe----CC----EEEEeeecCCCcEEEEeCCCCcEeEEEe
Q 040693 291 WSTADPSNG-----TAPGPVTVA----NG----VLFGGSTYRQGPIYAMDVKTGKILWSYD 338 (382)
Q Consensus 291 W~~~~~~~~-----~~~~~~~~~----~~----~v~~~~~~~~g~l~~ld~~tG~ilw~~~ 338 (382)
|++..-... ....+++++ ++ .|+.++ ++|.+++||.+|||++|..+
T Consensus 428 W~~Q~~~hD~WD~D~~~~p~L~d~~~~~G~~~~~v~~~~--K~G~~~vlDr~tG~~l~~~~ 486 (764)
T TIGR03074 428 WVFQTVHHDLWDMDVPAQPSLVDLPDADGTTVPALVAPT--KQGQIYVLDRRTGEPIVPVE 486 (764)
T ss_pred EEecccCCccccccccCCceEEeeecCCCcEeeEEEEEC--CCCEEEEEECCCCCEEeece
Confidence 999762211 123344332 44 677777 59999999999999999864
No 13
>COG4993 Gcd Glucose dehydrogenase [Carbohydrate transport and metabolism]
Probab=99.92 E-value=7.8e-23 Score=197.70 Aligned_cols=340 Identities=25% Similarity=0.349 Sum_probs=236.8
Q ss_pred CCcCCCCceeeeeecCc------------CccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCcee
Q 040693 1 AVKRSNGKLVWKTKLDD------------HARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRIL 68 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~------------~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~l 68 (382)
|||++|||++|++.... +..+...|.|.+....++++.+..++....+ ..|.+.++|..||+++
T Consensus 286 ALdA~tGkvc~~Fa~~Ga~~l~tgm~~~k~g~y~~tS~p~~~~~~~v~~g~v~Dn~st~e----~sgVir~fdv~tG~l~ 361 (773)
T COG4993 286 ALDADTGKVCWSFANKGALNLETGMKDTKDGLYYGTSPPEFGVKGIVIAGSVADNESTWE----PSGVIRGFDVLTGKLT 361 (773)
T ss_pred EEeCCCCcEeheeccCceeeeeccCCCCCCCeEeecCCCcccceeEEEeeccCCCceeec----cCccccccccccCceE
Confidence 68999999999976331 1123345677788888888777665554433 3578999999999999
Q ss_pred eeeeccCCCCCCC------CCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCc
Q 040693 69 WQTFMLPDNFGKL------NEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHS 142 (382)
Q Consensus 69 W~~~~~~~~~~~~------~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (382)
|..+...+....+ ...+++++|. ++++|++-++||++.+|+ .|+.| +.+++ ..+..+.
T Consensus 362 w~~D~gnpD~t~p~~~g~tyt~nspn~W~-~~SyD~~lnlVy~p~Gn~--~pd~w-----g~trt--------p~dekys 425 (773)
T COG4993 362 WAGDPGNPDPTAPTAPGQTYTRNSPNSWA-SASYDAKLNLVYVPMGNQ--TPDTW-----GGTRT--------PGDEKYS 425 (773)
T ss_pred EccCCCCCCCCCCCCCCceeecCCCCccc-ccccCCCCCeEEEeCCCC--Chhhc-----cCCCC--------ccccccc
Confidence 9998754332222 2357889996 799999999999999995 44444 33443 3455678
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDR 222 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~ 222 (382)
..++|+|+.||+.+|.++..+++.| +++.+++|++.++..+|.....++..+++|++|.+|+
T Consensus 426 ssivAlD~~TG~~kW~yQtvhhDlW------------------DmDvp~qp~L~D~~~DG~~vpalv~ptk~G~~YVlDR 487 (773)
T COG4993 426 SSIVALDATTGKLKWVYQTVHHDLW------------------DMDVPAQPTLLDITKDGKVVPALVHPTKNGFIYVLDR 487 (773)
T ss_pred ceeEEecCCCcceeeeeeccCcchh------------------cccCCCCceEEEeecCCcEeeeeecccccCcEEEEEc
Confidence 9999999999999999999999999 6778899999999989988888999999999999999
Q ss_pred CCCCeeeeeccCC---------------C----------CCCCCcccce-----------------------eeeCCeEE
Q 040693 223 DSGSLIWSMEAGP---------------G----------GLGGGAMWGA-----------------------ATDERRIY 254 (382)
Q Consensus 223 ~tG~~~W~~~~~~---------------~----------~~~g~~~~~~-----------------------~~~~~~v~ 254 (382)
.||+++=.++..+ + ......+|+. ...+..+|
T Consensus 488 rtGe~lv~~~evp~p~gA~~~d~~~ptqp~s~l~~~~a~pLtE~dmwg~t~fdqlvCri~f~~~ryeg~ytpPst~gsl~ 567 (773)
T COG4993 488 RTGELLVPIPEVPVPQGAIEGDYTAPTQPFSGLPFRPAPPLTEADMWGATMFDQLVCRIAFGGLRYEGRYTPPSTQGSLY 567 (773)
T ss_pred CCCcccccccccCCccccccccccCCCCcccCCCCCCCCCCCcccccCCchhhceechhhhcCccccccccCCCCCeeEE
Confidence 9999874443210 0 0000001110 01122233
Q ss_pred EEecCcc---------------------------------c---------------------cccc------cCCCC---
Q 040693 255 TNIANSQ---------------------------------H---------------------KNFN------LKPSK--- 271 (382)
Q Consensus 255 ~~~~~~~---------------------------------~---------------------~~~~------~~~~~--- 271 (382)
+..+... . ..|. +.|..
T Consensus 568 ~PgN~g~fnwg~~sVdp~r~~~fg~p~~laf~s~~~prd~~~~~~~g~~~~g~e~gv~~~~g~Py~V~~gpflsp~glpc 647 (773)
T COG4993 568 VPGNHGMFNWGGVSVDPVRQVTFGNPYYLAFVSKLVPRDPVGPMENGAAGDGTEGGVVPNYGEPYGVWMGPFLSPGGLPC 647 (773)
T ss_pred EeccccceeccceEeccccceEecCcchhhheeeccccCCCCCcccccccCCcccccccCCCCcceeeeccccCCcCccc
Confidence 3221100 0 0000 01111
Q ss_pred CCCCCceEEEEECCCCcEEeeecCCCC--------------CCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEE
Q 040693 272 NSTIAGGWVAMDASNGNVLWSTADPSN--------------GTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSY 337 (382)
Q Consensus 272 ~~~~~g~v~a~d~~tG~~~W~~~~~~~--------------~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~ 337 (382)
.....|.+.++|++|||++|+.+.... ....+|+.+.++.+|.+.. .+.+|.++|..+||.+|+.
T Consensus 648 qap~wGyv~a~DlkTgk~~wk~~~gt~~d~~p~plp~~~g~pt~Ggp~~t~Ggv~f~a~~-~dqYLrayd~~~G~~lW~a 726 (773)
T COG4993 648 QAPPWGYVKAIDLKTGKELWKHRNGTVYDMTPVPLPFKVGFPTLGGPIGTAGGVAFIAAT-GDQYLRAYDGTGGKQLWQA 726 (773)
T ss_pred ccCCcceeeeeeccccceeeeccCCccccCcccCcccccccccCCCccccccceEEEecc-hheeeeeeeccCCceeeee
Confidence 013358899999999999999443221 2235677788898888764 5889999999999999999
Q ss_pred ecCCceecceEE---eCCEEEEEeCceeEeecCCccCCCCCeEEEEEC
Q 040693 338 DTGATIYGGASV---SNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFCV 382 (382)
Q Consensus 338 ~~~~~~~~~p~~---~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~~ 382 (382)
..+.+..++|+. .+++.|+.-+.|.-. .+-.+-|+.+-+|+|
T Consensus 727 rlpaGGq~tPmty~v~~GkqYvvi~agghg---s~gtk~GDyviAyaL 771 (773)
T COG4993 727 RLPAGGQATPMTYTVAGGKQYVVISAGGHG---SFGTKMGDYVIAYAL 771 (773)
T ss_pred ccccCCcCCCceeecCCCceEEEEEcCCCC---ccCcCCCceEEEEeC
Confidence 998887877765 567788876554422 255677888888875
No 14
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=99.91 E-value=1.1e-21 Score=190.75 Aligned_cols=273 Identities=24% Similarity=0.462 Sum_probs=189.0
Q ss_pred cCCCCceeeeeecCcCccceeeec-eEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCC
Q 040693 3 KRSNGKLVWKTKLDDHARSFITMS-GTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKL 81 (382)
Q Consensus 3 d~~tGk~~W~~~~~~~~~~~~~~~-p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~ 81 (382)
+..+|+++|...+........... |+..++++|++.. +|.|+|+|+++|+++|+..+....
T Consensus 39 ~~~~g~~~W~~~~~~~~~~~~~~~~~~~~dg~v~~~~~--------------~G~i~A~d~~~g~~~W~~~~~~~~---- 100 (370)
T COG1520 39 NNTSGTLLWSVSLGSGGGGIYAGPAPADGDGTVYVGTR--------------DGNIFALNPDTGLVKWSYPLLGAV---- 100 (370)
T ss_pred cccCcceeeeeecccCccceEeccccEeeCCeEEEecC--------------CCcEEEEeCCCCcEEecccCcCcc----
Confidence 345799999987544322233333 4899999999975 889999999999999998875300
Q ss_pred CCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEec
Q 040693 82 NEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQL 161 (382)
Q Consensus 82 ~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~ 161 (382)
.. .+.|.+.. .+.||+++.+ +.++|||..||+++|++..
T Consensus 101 ------~~-~~~~~~~~-~G~i~~g~~~---------------------------------g~~y~ld~~~G~~~W~~~~ 139 (370)
T COG1520 101 ------AQ-LSGPILGS-DGKIYVGSWD---------------------------------GKLYALDASTGTLVWSRNV 139 (370)
T ss_pred ------ee-ccCceEEe-CCeEEEeccc---------------------------------ceEEEEECCCCcEEEEEec
Confidence 00 12344443 4679998765 6899999999999999998
Q ss_pred CCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCC
Q 040693 162 GGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGG 241 (382)
Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~ 241 (382)
.. . .. +.+.|++. +..|+..+.+++++|+|.+||+.+|+++...+ ....
T Consensus 140 ~~-~-~~--------------------~~~~~v~~--------~~~v~~~s~~g~~~al~~~tG~~~W~~~~~~~-~~~~ 188 (370)
T COG1520 140 GG-S-PY--------------------YASPPVVG--------DGTVYVGTDDGHLYALNADTGTLKWTYETPAP-LSLS 188 (370)
T ss_pred CC-C-eE--------------------EecCcEEc--------CcEEEEecCCCeEEEEEccCCcEEEEEecCCc-cccc
Confidence 76 1 00 11333333 67888887889999999999999999887653 2222
Q ss_pred cccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec----CCCC-----CCCCcceEEeCCEE
Q 040693 242 AMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA----DPSN-----GTAPGPVTVANGVL 312 (382)
Q Consensus 242 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~----~~~~-----~~~~~~~~~~~~~v 312 (382)
....+.+.++.+|+...+ . .+.++++|+.+|+.+|+.+ .... +....+.+..++.+
T Consensus 189 ~~~~~~~~~~~vy~~~~~--------------~-~~~~~a~~~~~G~~~w~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~ 253 (370)
T COG1520 189 IYGSPAIASGTVYVGSDG--------------Y-DGILYALNAEDGTLKWSQKVSQTIGRTAISTTPAVDGGPVYVDGGV 253 (370)
T ss_pred cccCceeecceEEEecCC--------------C-cceEEEEEccCCcEeeeeeeecccCcccccccccccCceEEECCcE
Confidence 333344678899987331 1 3579999999999999953 3221 11122222334444
Q ss_pred EEeeecCCCcEEEEeCCCCcEeEEEecCC-----ceecceEE-eCCEEEEEeCcee---EeecCCccCCCCCe--EEEEE
Q 040693 313 FGGSTYRQGPIYAMDVKTGKILWSYDTGA-----TIYGGASV-SNGCIYMGNGYKV---TVGFGNKNFTSGTS--LYAFC 381 (382)
Q Consensus 313 ~~~~~~~~g~l~~ld~~tG~ilw~~~~~~-----~~~~~p~~-~~g~lyv~~~~g~---~~~~~~~~~~~g~~--l~~~~ 381 (382)
|.... .+.++|+|..+|+++|+++.+. ..+..+.. .+|++|+...... ...+|+++..+|+. +|.|.
T Consensus 254 ~~~~~--~g~~~~l~~~~G~~~W~~~~~~~~~~~~~~~~~~~~~dG~v~~~~~~~~~~~~~~~~~~~~~~g~~~~~w~~~ 331 (370)
T COG1520 254 YAGSY--GGKLLCLDADTGELIWSFPAGGSVQGSGLYTTPVAGADGKVYIGFTDNDGRGSGSLYALADVPGGTLLKWSYP 331 (370)
T ss_pred EEEec--CCeEEEEEcCCCceEEEEecccEeccCCeeEEeecCCCccEEEEEeccccccccceEEEeccCCCeeEEEEEe
Confidence 55553 7889999999999999999962 33444444 4999999865433 34567788788888 88886
Q ss_pred C
Q 040693 382 V 382 (382)
Q Consensus 382 ~ 382 (382)
.
T Consensus 332 ~ 332 (370)
T COG1520 332 V 332 (370)
T ss_pred C
Confidence 3
No 15
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=99.86 E-value=3.6e-19 Score=173.09 Aligned_cols=262 Identities=25% Similarity=0.427 Sum_probs=176.8
Q ss_pred CCcCCCCceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCC
Q 040693 1 AVKRSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGK 80 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~ 80 (382)
|+|+++|+++|+..+.. ....+.+.+++.++++|+++. ++.++|||..||+++|+++... .
T Consensus 82 A~d~~~g~~~W~~~~~~-~~~~~~~~~~~~~G~i~~g~~--------------~g~~y~ld~~~G~~~W~~~~~~-~--- 142 (370)
T COG1520 82 ALNPDTGLVKWSYPLLG-AVAQLSGPILGSDGKIYVGSW--------------DGKLYALDASTGTLVWSRNVGG-S--- 142 (370)
T ss_pred EEeCCCCcEEecccCcC-cceeccCceEEeCCeEEEecc--------------cceEEEEECCCCcEEEEEecCC-C---
Confidence 68999999999998743 012333444566799999996 8899999999999999999843 0
Q ss_pred CCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEe
Q 040693 81 LNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQ 160 (382)
Q Consensus 81 ~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~ 160 (382)
..|...+++. ...||+.+. ++.++|||+.||+++|+++
T Consensus 143 -------~~~~~~~v~~--~~~v~~~s~---------------------------------~g~~~al~~~tG~~~W~~~ 180 (370)
T COG1520 143 -------PYYASPPVVG--DGTVYVGTD---------------------------------DGHLYALNADTGTLKWTYE 180 (370)
T ss_pred -------eEEecCcEEc--CcEEEEecC---------------------------------CCeEEEEEccCCcEEEEEe
Confidence 1122234443 357888653 3899999999999999998
Q ss_pred cCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc--CcEEEEEeCCCCCeeeeeccCCCCC
Q 040693 161 LGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK--SGFAWALDRDSGSLIWSMEAGPGGL 238 (382)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~--~g~l~ald~~tG~~~W~~~~~~~~~ 238 (382)
.... . .....++|.+. .+.++.... ++.++++|+++|+.+|+.+......
T Consensus 181 ~~~~-~-------------------~~~~~~~~~~~--------~~~vy~~~~~~~~~~~a~~~~~G~~~w~~~~~~~~~ 232 (370)
T COG1520 181 TPAP-L-------------------SLSIYGSPAIA--------SGTVYVGSDGYDGILYALNAEDGTLKWSQKVSQTIG 232 (370)
T ss_pred cCCc-c-------------------ccccccCceee--------cceEEEecCCCcceEEEEEccCCcEeeeeeeecccC
Confidence 8652 0 11133555543 678888877 7789999999999999964332111
Q ss_pred CCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC----CCCCCcceEEeCCEEEE
Q 040693 239 GGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS----NGTAPGPVTVANGVLFG 314 (382)
Q Consensus 239 ~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~----~~~~~~~~~~~~~~v~~ 314 (382)
.......+.+..+.|++.... +. ....+.++|+|..+|+++|+++.+. ......+....++.+|+
T Consensus 233 ~~~~~~~~~~~~~~v~v~~~~-----~~------~~~~g~~~~l~~~~G~~~W~~~~~~~~~~~~~~~~~~~~~dG~v~~ 301 (370)
T COG1520 233 RTAISTTPAVDGGPVYVDGGV-----YA------GSYGGKLLCLDADTGELIWSFPAGGSVQGSGLYTTPVAGADGKVYI 301 (370)
T ss_pred cccccccccccCceEEECCcE-----EE------EecCCeEEEEEcCCCceEEEEecccEeccCCeeEEeecCCCccEEE
Confidence 000001123333444433110 00 0344679999999999999999852 12223333334677777
Q ss_pred eeecC----CCcEEEEeCCCCcE--eEEEecCC-ceecceEEeCCEEEEEeCcee
Q 040693 315 GSTYR----QGPIYAMDVKTGKI--LWSYDTGA-TIYGGASVSNGCIYMGNGYKV 362 (382)
Q Consensus 315 ~~~~~----~g~l~~ld~~tG~i--lw~~~~~~-~~~~~p~~~~g~lyv~~~~g~ 362 (382)
..... .+.+++++...|.. +|.++..+ .....++..++.+|....++.
T Consensus 302 ~~~~~~~~~~~~~~~~~~~~g~~~~~w~~~~~g~~~~~~~~~~~g~~y~~~~~~~ 356 (370)
T COG1520 302 GFTDNDGRGSGSLYALADVPGGTLLKWSYPVGGGYSLSTVAGSDGTLYFGGDDGR 356 (370)
T ss_pred EEeccccccccceEEEeccCCCeeEEEEEeCCCceecccceeccCeEEecccCCc
Confidence 65311 24689999877888 99999887 445777889999999988865
No 16
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=99.83 E-value=2.7e-18 Score=150.90 Aligned_cols=254 Identities=18% Similarity=0.271 Sum_probs=184.3
Q ss_pred CCcCCCCceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCC
Q 040693 1 AVKRSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGK 80 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~ 80 (382)
|+|.++|++.|+.-+ +..+..++.+.+|.|++|-. .|.||.|+.+||++.|.+...+...
T Consensus 37 avd~~sG~~~We~il----g~RiE~sa~vvgdfVV~GCy--------------~g~lYfl~~~tGs~~w~f~~~~~vk-- 96 (354)
T KOG4649|consen 37 AVDPQSGNLIWEAIL----GVRIECSAIVVGDFVVLGCY--------------SGGLYFLCVKTGSQIWNFVILETVK-- 96 (354)
T ss_pred EecCCCCcEEeehhh----CceeeeeeEEECCEEEEEEc--------------cCcEEEEEecchhheeeeeehhhhc--
Confidence 689999999999988 45588899999999999873 8999999999999999998765443
Q ss_pred CCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEe
Q 040693 81 LNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQ 160 (382)
Q Consensus 81 ~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~ 160 (382)
+.+..|.+.+++|.++.+ +.+||||+++-.-+|+.+
T Consensus 97 -----------~~a~~d~~~glIycgshd---------------------------------~~~yalD~~~~~cVyksk 132 (354)
T KOG4649|consen 97 -----------VRAQCDFDGGLIYCGSHD---------------------------------GNFYALDPKTYGCVYKSK 132 (354)
T ss_pred -----------cceEEcCCCceEEEecCC---------------------------------CcEEEecccccceEEecc
Confidence 468889999999998865 899999999999999988
Q ss_pred cCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCC--CeeeeeccCCCCC
Q 040693 161 LGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSG--SLIWSMEAGPGGL 238 (382)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG--~~~W~~~~~~~~~ 238 (382)
.++. +..+|++.. -++.+|+....|.+.+...+++ ..+|..+...+
T Consensus 133 cgG~------------------------~f~sP~i~~------g~~sly~a~t~G~vlavt~~~~~~~~~w~~~~~~P-- 180 (354)
T KOG4649|consen 133 CGGG------------------------TFVSPVIAP------GDGSLYAAITAGAVLAVTKNPYSSTEFWAATRFGP-- 180 (354)
T ss_pred cCCc------------------------eeccceecC------CCceEEEEeccceEEEEccCCCCcceehhhhcCCc--
Confidence 7765 336788873 2678999999999999999999 89999986543
Q ss_pred CCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCC---c-c----eEEe--
Q 040693 239 GGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAP---G-P----VTVA-- 308 (382)
Q Consensus 239 ~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~---~-~----~~~~-- 308 (382)
....|..-+..+.++ ..+|.+.++| ..|+.+|+...+++.++. + | +...
T Consensus 181 ---iF~splcv~~sv~i~-----------------~VdG~l~~f~-~sG~qvwr~~t~GpIf~~Pc~s~Ps~q~i~~~~~ 239 (354)
T KOG4649|consen 181 ---IFASPLCVGSSVIIT-----------------TVDGVLTSFD-ESGRQVWRPATKGPIFMEPCESRPSCQQISLENE 239 (354)
T ss_pred ---cccCceeccceEEEE-----------------EeccEEEEEc-CCCcEEEeecCCCceecccccCCCcceEEEEecC
Confidence 222233334444443 3457777777 567888876655532111 0 1 0111
Q ss_pred ----------CCEEEEeee------------------------------cCCCcEEEE---------eCCCCcE--eEEE
Q 040693 309 ----------NGVLFGGST------------------------------YRQGPIYAM---------DVKTGKI--LWSY 337 (382)
Q Consensus 309 ----------~~~v~~~~~------------------------------~~~g~l~~l---------d~~tG~i--lw~~ 337 (382)
+.+++.... ..+|+++.| +.+.|++ +.+.
T Consensus 240 ~Cf~~~~p~~ghL~w~~~~g~t~~vy~~p~l~F~~h~~~~S~~~ll~~~s~dgkv~il~~~~sl~~~~s~~g~lq~~~~~ 319 (354)
T KOG4649|consen 240 NCFCAPLPIAGHLLWATQSGTTLHVYLSPKLRFDLHSPGISYPKLLRRSSGDGKVMILMTSKSLAEISSNGGELQNLEAI 319 (354)
T ss_pred CeEEEeccccceEEEEecCCcEEEEEeCcccceeccCCCCcchhhhhhhcCCCcEEEEEecccccccccCCCccceEEEe
Confidence 111111100 046777777 6667766 5566
Q ss_pred ecCCceecceEEeCCEEEEEeCceeEeecCCccCCCC
Q 040693 338 DTGATIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSG 374 (382)
Q Consensus 338 ~~~~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g 374 (382)
+++..++++|.+.+++|+++-.+.- ++++|..++
T Consensus 320 el~~eIFsSPvii~grl~igcRDdY---v~cldl~~~ 353 (354)
T KOG4649|consen 320 ELSNEIFSSPVIIDGRLLIGCRDDY---VRCLDLDTW 353 (354)
T ss_pred ecCcccccCCeEEccEEEEEEccCe---EEEEecccc
Confidence 8888899999999999999877633 455655543
No 17
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=99.77 E-value=1.1e-16 Score=140.86 Aligned_cols=244 Identities=17% Similarity=0.291 Sum_probs=179.2
Q ss_pred eeeeecCcCccceeeeceEEE-c---CEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCc
Q 040693 10 VWKTKLDDHARSFITMSGTYY-K---GAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYA 85 (382)
Q Consensus 10 ~W~~~~~~~~~~~~~~~p~v~-~---~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~ 85 (382)
+|...+ ..-+.++|+++ + -.||+++. .+.+.|+|+.+|++.|+.-++....
T Consensus 2 rW~vd~----~kCVDaspLVV~~dskT~v~igSH--------------s~~~~avd~~sG~~~We~ilg~RiE------- 56 (354)
T KOG4649|consen 2 RWAVDL----RKCVDASPLVVCNDSKTLVVIGSH--------------SGIVIAVDPQSGNLIWEAILGVRIE------- 56 (354)
T ss_pred ceeccc----hhhccCCcEEEecCCceEEEEecC--------------CceEEEecCCCCcEEeehhhCceee-------
Confidence 677776 34566777755 4 56888885 8899999999999999988853322
Q ss_pred CccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCc
Q 040693 86 GAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYD 165 (382)
Q Consensus 86 g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~ 165 (382)
+++.+- ++.|+++.-+ +.+|.|+.+||++.|.+...+.
T Consensus 57 ------~sa~vv--gdfVV~GCy~---------------------------------g~lYfl~~~tGs~~w~f~~~~~- 94 (354)
T KOG4649|consen 57 ------CSAIVV--GDFVVLGCYS---------------------------------GGLYFLCVKTGSQIWNFVILET- 94 (354)
T ss_pred ------eeeEEE--CCEEEEEEcc---------------------------------CcEEEEEecchhheeeeeehhh-
Confidence 345553 4678887654 8999999999999999988754
Q ss_pred ccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccc
Q 040693 166 VWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWG 245 (382)
Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~ 245 (382)
+..+|... -+.+.|+.++.|+++||||+.+-.-.|+.+. .|+....
T Consensus 95 -----------------------vk~~a~~d------~~~glIycgshd~~~yalD~~~~~cVykskc-----gG~~f~s 140 (354)
T KOG4649|consen 95 -----------------------VKVRAQCD------FDGGLIYCGSHDGNFYALDPKTYGCVYKSKC-----GGGTFVS 140 (354)
T ss_pred -----------------------hccceEEc------CCCceEEEecCCCcEEEecccccceEEeccc-----CCceecc
Confidence 23444443 2368999999999999999999999999774 3455566
Q ss_pred eee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCC--cEEeeecCCCCCCCCcceEEeCCEEEEeeecCCC
Q 040693 246 AAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNG--NVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQG 321 (382)
Q Consensus 246 ~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG--~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g 321 (382)
|+. .++.+|+. ..+|.|.|+.++++ ...|.....++ .-++|+ .-+..+..++. +|
T Consensus 141 P~i~~g~~sly~a-----------------~t~G~vlavt~~~~~~~~~w~~~~~~P-iF~spl-cv~~sv~i~~V--dG 199 (354)
T KOG4649|consen 141 PVIAPGDGSLYAA-----------------ITAGAVLAVTKNPYSSTEFWAATRFGP-IFASPL-CVGSSVIITTV--DG 199 (354)
T ss_pred ceecCCCceEEEE-----------------eccceEEEEccCCCCcceehhhhcCCc-cccCce-eccceEEEEEe--cc
Confidence 665 36789987 55699999999999 89999887663 334444 45566666665 99
Q ss_pred cEEEEeCCCCcEeEEEecCCceecceEEe---CCEEEEEeCceeEeecCCccCCCCCeEEEE
Q 040693 322 PIYAMDVKTGKILWSYDTGATIYGGASVS---NGCIYMGNGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 322 ~l~~ld~~tG~ilw~~~~~~~~~~~p~~~---~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
.|.+|| ..|+++||..+.+++++.|.-. ...+++.+.+ = +++.-+-.|-++|.+
T Consensus 200 ~l~~f~-~sG~qvwr~~t~GpIf~~Pc~s~Ps~q~i~~~~~~-C---f~~~~p~~ghL~w~~ 256 (354)
T KOG4649|consen 200 VLTSFD-ESGRQVWRPATKGPIFMEPCESRPSCQQISLENEN-C---FCAPLPIAGHLLWAT 256 (354)
T ss_pred EEEEEc-CCCcEEEeecCCCceecccccCCCcceEEEEecCC-e---EEEeccccceEEEEe
Confidence 999999 6799999999999998887653 2344444332 1 234444446666654
No 18
>COG4993 Gcd Glucose dehydrogenase [Carbohydrate transport and metabolism]
Probab=99.76 E-value=7.3e-17 Score=156.67 Aligned_cols=270 Identities=21% Similarity=0.273 Sum_probs=169.1
Q ss_pred CCceeeeeecCcCcc------ceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCC
Q 040693 6 NGKLVWKTKLDDHAR------SFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFG 79 (382)
Q Consensus 6 tGk~~W~~~~~~~~~------~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~ 79 (382)
+-++.|++..++-+. .....+|+.++|.+|+.+. ..+++|||++|||++|+++...+..-
T Consensus 183 nL~~AWty~TGD~k~~~d~~e~t~e~tPLkvgdtlYvcTp--------------hn~v~ALDa~TGkekWkydp~~~~nv 248 (773)
T COG4993 183 NLQVAWTYRTGDVKQPEDPGETTNEVTPLKVGDTLYVCTP--------------HNRVFALDAATGKEKWKYDPNLKSNV 248 (773)
T ss_pred ccceeEEEecCcccCCCCcccccccccceEECCEEEEecC--------------cceeEEeeccCCceeeecCCCCCCCc
Confidence 457889999765321 1246789999999999775 78999999999999999997533211
Q ss_pred CCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEE
Q 040693 80 KLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYK 159 (382)
Q Consensus 80 ~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~ 159 (382)
.+-+ .+ .++.-|..... ..+.-|..|++..+.+.+|+|||++|||++|++
T Consensus 249 ~~~~------------~t-CrgVsy~~a~a-----------------~~k~pc~~rIflpt~DarlIALdA~tGkvc~~F 298 (773)
T COG4993 249 DPQH------------QT-CRGVSYGAAKA-----------------DAKSPCPRRIFLPTADARLIALDADTGKVCWSF 298 (773)
T ss_pred cccc------------cc-ccceecccccc-----------------cccCCCceeEEeecCCceEEEEeCCCCcEehee
Confidence 1000 00 11111221111 111227777888888899999999999999998
Q ss_pred ecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEc---------cCcEEEEEeCCCCCeeee
Q 040693 160 QLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQ---------KSGFAWALDRDSGSLIWS 230 (382)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~---------~~g~l~ald~~tG~~~W~ 230 (382)
...+.-. +...+ .+.+...+--.+.|.+.. ..+|+.+. ..|.+.++|..||+.+|.
T Consensus 299 a~~Ga~~-------l~tgm-~~~k~g~y~~tS~p~~~~-------~~~v~~g~v~Dn~st~e~sgVir~fdv~tG~l~w~ 363 (773)
T COG4993 299 ANKGALN-------LETGM-KDTKDGLYYGTSPPEFGV-------KGIVIAGSVADNESTWEPSGVIRGFDVLTGKLTWA 363 (773)
T ss_pred ccCceee-------eeccC-CCCCCCeEeecCCCcccc-------eeEEEeeccCCCceeeccCccccccccccCceEEc
Confidence 7654311 00001 111111233334444431 33444432 247899999999999999
Q ss_pred eccCCCC------------CCCCcccceee---eCCeEEEEecCcccccccc--CCCCCCCCCceEEEEECCCCcEEeee
Q 040693 231 MEAGPGG------------LGGGAMWGAAT---DERRIYTNIANSQHKNFNL--KPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 231 ~~~~~~~------------~~g~~~~~~~~---~~~~v~~~~~~~~~~~~~~--~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
.+...+. ..+...|+.+. +-++||+...+.....+.. .|.- ...+..++|+|+.||+.+|.+
T Consensus 364 ~D~gnpD~t~p~~~g~tyt~nspn~W~~~SyD~~lnlVy~p~Gn~~pd~wg~trtp~d-ekysssivAlD~~TG~~kW~y 442 (773)
T COG4993 364 GDPGNPDPTAPTAPGQTYTRNSPNSWASASYDAKLNLVYVPMGNQTPDTWGGTRTPGD-EKYSSSIVALDATTGKLKWVY 442 (773)
T ss_pred cCCCCCCCCCCCCCCceeecCCCCcccccccCCCCCeEEEeCCCCChhhccCCCCccc-ccccceeEEecCCCcceeeee
Confidence 9875431 23445677665 4689999998865555543 3332 236788999999999999998
Q ss_pred cCCCCCCC-----CcceEE----eCC---EEEEeeecCCCcEEEEeCCCCcEeEEE
Q 040693 294 ADPSNGTA-----PGPVTV----ANG---VLFGGSTYRQGPIYAMDVKTGKILWSY 337 (382)
Q Consensus 294 ~~~~~~~~-----~~~~~~----~~~---~v~~~~~~~~g~l~~ld~~tG~ilw~~ 337 (382)
..--...+ .-+..+ ++. .++..+ ++|.+|++|..|||++=.+
T Consensus 443 QtvhhDlWDmDvp~qp~L~D~~~DG~~vpalv~pt--k~G~~YVlDRrtGe~lv~~ 496 (773)
T COG4993 443 QTVHHDLWDMDVPAQPTLLDITKDGKVVPALVHPT--KNGFIYVLDRRTGELLVPI 496 (773)
T ss_pred eccCcchhcccCCCCceEEEeecCCcEeeeeeccc--ccCcEEEEEcCCCcccccc
Confidence 64331111 222222 232 233334 6899999999999987654
No 19
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=98.92 E-value=8.5e-07 Score=84.74 Aligned_cols=280 Identities=11% Similarity=0.048 Sum_probs=156.3
Q ss_pred CcCCCCceeeeeecCcCccceeeeceEEEcCEEEEeccC-ccccccccccccccceEEEEeCccCceeeeeeccCCCCCC
Q 040693 2 VKRSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSS-IEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGK 80 (382)
Q Consensus 2 ld~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~-~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~ 80 (382)
||.++++++=+.+.+..+.. + .+ -.+..+|++... .+..+ -..+..|..+|.+|++++.+..+.++...
T Consensus 32 iD~~~~~v~g~i~~G~~P~~-~-~s--pDg~~lyva~~~~~R~~~-----G~~~d~V~v~D~~t~~~~~~i~~p~~p~~- 101 (352)
T TIGR02658 32 IDGEAGRVLGMTDGGFLPNP-V-VA--SDGSFFAHASTVYSRIAR-----GKRTDYVEVIDPQTHLPIADIELPEGPRF- 101 (352)
T ss_pred EECCCCEEEEEEEccCCCce-e-EC--CCCCEEEEEecccccccc-----CCCCCEEEEEECccCcEEeEEccCCCchh-
Confidence 67888888777766543322 1 12 235678887751 00000 00157899999999999999998544221
Q ss_pred CCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEe
Q 040693 81 LNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQ 160 (382)
Q Consensus 81 ~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~ 160 (382)
..+... ....+.+++..+||...+ ....|..+|..+++++=+..
T Consensus 102 ---~~~~~~--~~~~ls~dgk~l~V~n~~-------------------------------p~~~V~VvD~~~~kvv~ei~ 145 (352)
T TIGR02658 102 ---LVGTYP--WMTSLTPDNKTLLFYQFS-------------------------------PSPAVGVVDLEGKAFVRMMD 145 (352)
T ss_pred ---hccCcc--ceEEECCCCCEEEEecCC-------------------------------CCCEEEEEECCCCcEEEEEe
Confidence 112111 236788888899986543 24899999999999999998
Q ss_pred cCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeec-c--CCCC
Q 040693 161 LGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSME-A--GPGG 237 (382)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~-~--~~~~ 237 (382)
.+... .++... .....+.+.+..-..+.+|. +|+..-... . +...
T Consensus 146 vp~~~----------------------------~vy~t~---e~~~~~~~~Dg~~~~v~~d~-~g~~~~~~~~vf~~~~~ 193 (352)
T TIGR02658 146 VPDCY----------------------------HIFPTA---NDTFFMHCRDGSLAKVGYGT-KGNPKIKPTEVFHPEDE 193 (352)
T ss_pred CCCCc----------------------------EEEEec---CCccEEEeecCceEEEEecC-CCceEEeeeeeecCCcc
Confidence 86431 111110 11223333333333444443 355331111 1 0000
Q ss_pred CC-CCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCc----EEeee-cCCC---C-CCCC-cceE
Q 040693 238 LG-GGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGN----VLWST-ADPS---N-GTAP-GPVT 306 (382)
Q Consensus 238 ~~-g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~----~~W~~-~~~~---~-~~~~-~~~~ 306 (382)
.. ..+.+ ...++..+|++ ..|+|+.+|..+.+ ..|.. .... . .... -++.
T Consensus 194 ~v~~rP~~-~~~dg~~~~vs------------------~eG~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia 254 (352)
T TIGR02658 194 YLINHPAY-SNKSGRLVWPT------------------YTGKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVA 254 (352)
T ss_pred ccccCCce-EcCCCcEEEEe------------------cCCeEEEEecCCCcceecceeeeccccccccccCCCcceeEE
Confidence 00 01111 12266777775 33889999954433 23332 2211 0 1111 1243
Q ss_pred E--eCCEEEEeeec--------CCCcEEEEeCCCCcEeEEEecCCceecceEE-eCC-EEEEEeCceeEeecCCccCCCC
Q 040693 307 V--ANGVLFGGSTY--------RQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNG-CIYMGNGYKVTVGFGNKNFTSG 374 (382)
Q Consensus 307 ~--~~~~v~~~~~~--------~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g-~lyv~~~~g~~~~~~~~~~~~g 374 (382)
+ +++++|+.... ..+.|..||.++++++-+.+++....+-.+. .+. .||+++..... +.-+|..++
T Consensus 255 ~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~~s~~--VsViD~~t~ 332 (352)
T TIGR02658 255 YHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIELGHEIDSINVSQDAKPLLYALSTGDKT--LYIFDAETG 332 (352)
T ss_pred EcCCCCEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEeCCCceeeEEECCCCCeEEEEeCCCCCc--EEEEECcCC
Confidence 4 46799996421 1258999999999999999987755433333 445 67777753332 234778888
Q ss_pred CeEEEE
Q 040693 375 TSLYAF 380 (382)
Q Consensus 375 ~~l~~~ 380 (382)
+.+-+.
T Consensus 333 k~i~~i 338 (352)
T TIGR02658 333 KELSSV 338 (352)
T ss_pred eEEeee
Confidence 877553
No 20
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=98.79 E-value=2.7e-05 Score=72.34 Aligned_cols=263 Identities=16% Similarity=0.172 Sum_probs=143.6
Q ss_pred CcCCCCceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCC
Q 040693 2 VKRSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKL 81 (382)
Q Consensus 2 ld~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~ 81 (382)
+|+++++.+...+....+. .+..+| .+..+|+... .++.|+.+|.++|+++.+.......
T Consensus 16 ~d~~t~~~~~~~~~~~~~~-~l~~~~--dg~~l~~~~~-------------~~~~v~~~d~~~~~~~~~~~~~~~~---- 75 (300)
T TIGR03866 16 IDTATLEVTRTFPVGQRPR-GITLSK--DGKLLYVCAS-------------DSDTIQVIDLATGEVIGTLPSGPDP---- 75 (300)
T ss_pred EECCCCceEEEEECCCCCC-ceEECC--CCCEEEEEEC-------------CCCeEEEEECCCCcEEEeccCCCCc----
Confidence 4677777766655432111 111111 1345666443 1678999999999987654432110
Q ss_pred CCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEec
Q 040693 82 NEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQL 161 (382)
Q Consensus 82 ~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~ 161 (382)
....+.+++..+|+.... ++.|..+|..+++.+.+++.
T Consensus 76 ----------~~~~~~~~g~~l~~~~~~--------------------------------~~~l~~~d~~~~~~~~~~~~ 113 (300)
T TIGR03866 76 ----------ELFALHPNGKILYIANED--------------------------------DNLVTVIDIETRKVLAEIPV 113 (300)
T ss_pred ----------cEEEECCCCCEEEEEcCC--------------------------------CCeEEEEECCCCeEEeEeeC
Confidence 124566676777776542 37899999999888776653
Q ss_pred CCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccC-cEEEEEeCCCCCeeeeeccCCCCCCC
Q 040693 162 GGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKS-GFAWALDRDSGSLIWSMEAGPGGLGG 240 (382)
Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~-g~l~ald~~tG~~~W~~~~~~~~~~g 240 (382)
... |.-..+.++ +..++++..+ ..+..+|..+++..........
T Consensus 114 ~~~----------------------------~~~~~~~~d---g~~l~~~~~~~~~~~~~d~~~~~~~~~~~~~~~---- 158 (300)
T TIGR03866 114 GVE----------------------------PEGMAVSPD---GKIVVNTSETTNMAHFIDTKTYEIVDNVLVDQR---- 158 (300)
T ss_pred CCC----------------------------cceEEECCC---CCEEEEEecCCCeEEEEeCCCCeEEEEEEcCCC----
Confidence 211 111111122 3445555444 3466679888877655432211
Q ss_pred CcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCC-----CCCCcceEE--eCCEEE
Q 040693 241 GAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSN-----GTAPGPVTV--ANGVLF 313 (382)
Q Consensus 241 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~-----~~~~~~~~~--~~~~v~ 313 (382)
........++..++++. ..++.+..+|.++++.+-++..... ......+.+ ++..+|
T Consensus 159 ~~~~~~s~dg~~l~~~~----------------~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~ 222 (300)
T TIGR03866 159 PRFAEFTADGKELWVSS----------------EIGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAF 222 (300)
T ss_pred ccEEEECCCCCEEEEEc----------------CCCCEEEEEEcCcceeeeeeeecccccccccCCccceEECCCCCEEE
Confidence 01111111556666652 2357899999999987544432210 111112223 245667
Q ss_pred EeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEeCceeEeecCCccCCCCCeEEEEE
Q 040693 314 GGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 314 ~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
++.. .++.+..+|.++++++-....+... .+..+ .+..||+++..... +.-+|..+++.+..+.
T Consensus 223 ~~~~-~~~~i~v~d~~~~~~~~~~~~~~~~-~~~~~~~~g~~l~~~~~~~~~--i~v~d~~~~~~~~~~~ 288 (300)
T TIGR03866 223 VALG-PANRVAVVDAKTYEVLDYLLVGQRV-WQLAFTPDEKYLLTTNGVSND--VSVIDVAALKVIKSIK 288 (300)
T ss_pred EEcC-CCCeEEEEECCCCcEEEEEEeCCCc-ceEEECCCCCEEEEEcCCCCe--EEEEECCCCcEEEEEE
Confidence 6543 3678999999999987655443322 22333 45567776553232 3446666788876654
No 21
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=98.79 E-value=4.5e-06 Score=79.80 Aligned_cols=135 Identities=15% Similarity=0.051 Sum_probs=91.8
Q ss_pred EcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCC
Q 040693 30 YKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGN 109 (382)
Q Consensus 30 ~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~ 109 (382)
...++||....... ..+.|+.||.++++++=+.+.+... ...+.+++..+|++...
T Consensus 11 ~~~~v~V~d~~~~~---------~~~~v~ViD~~~~~v~g~i~~G~~P---------------~~~~spDg~~lyva~~~ 66 (352)
T TIGR02658 11 DARRVYVLDPGHFA---------ATTQVYTIDGEAGRVLGMTDGGFLP---------------NPVVASDGSFFAHASTV 66 (352)
T ss_pred CCCEEEEECCcccc---------cCceEEEEECCCCEEEEEEEccCCC---------------ceeECCCCCEEEEEecc
Confidence 34667886642111 0378999999999999998875322 12477888999998752
Q ss_pred CCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCC
Q 040693 110 LYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADF 189 (382)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (382)
. .|..-+.-...|..+|++|++++.+...++.. .+..
T Consensus 67 ~-----------------------~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~p--------------------~~~~ 103 (352)
T TIGR02658 67 Y-----------------------SRIARGKRTDYVEVIDPQTHLPIADIELPEGP--------------------RFLV 103 (352)
T ss_pred c-----------------------cccccCCCCCEEEEEECccCcEEeEEccCCCc--------------------hhhc
Confidence 1 01111222488999999999999999987541 1112
Q ss_pred CCCceEEEeeeCceeecEEEEE--ccCcEEEEEeCCCCCeeeeeccC
Q 040693 190 GEAPMMLSMYRNKVKHDIVVAV--QKSGFAWALDRDSGSLIWSMEAG 234 (382)
Q Consensus 190 ~~~p~~~~~~~~g~~~~~v~~~--~~~g~l~ald~~tG~~~W~~~~~ 234 (382)
...|..+.+..+| ..+|+. +.+..+..+|.++++++-+.+.+
T Consensus 104 ~~~~~~~~ls~dg---k~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp 147 (352)
T TIGR02658 104 GTYPWMTSLTPDN---KTLLFYQFSPSPAVGVVDLEGKAFVRMMDVP 147 (352)
T ss_pred cCccceEEECCCC---CEEEEecCCCCCEEEEEECCCCcEEEEEeCC
Confidence 3456666666666 345544 33788999999999999999864
No 22
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=98.68 E-value=3.1e-05 Score=71.91 Aligned_cols=184 Identities=14% Similarity=0.176 Sum_probs=110.4
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
++.|+.+|.++++.+...+..... ....+++++..+|+....
T Consensus 10 d~~v~~~d~~t~~~~~~~~~~~~~--------------~~l~~~~dg~~l~~~~~~------------------------ 51 (300)
T TIGR03866 10 DNTISVIDTATLEVTRTFPVGQRP--------------RGITLSKDGKLLYVCASD------------------------ 51 (300)
T ss_pred CCEEEEEECCCCceEEEEECCCCC--------------CceEECCCCCEEEEEECC------------------------
Confidence 889999999999988777642111 135666676677775432
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEE-c
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAV-Q 212 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~-~ 212 (382)
++.|+.+|..+|+.+.+.+.... +.......+ +..+++. .
T Consensus 52 --------~~~v~~~d~~~~~~~~~~~~~~~----------------------------~~~~~~~~~---g~~l~~~~~ 92 (300)
T TIGR03866 52 --------SDTIQVIDLATGEVIGTLPSGPD----------------------------PELFALHPN---GKILYIANE 92 (300)
T ss_pred --------CCeEEEEECCCCcEEEeccCCCC----------------------------ccEEEECCC---CCEEEEEcC
Confidence 37899999999988765543211 111222222 2445444 4
Q ss_pred cCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEE
Q 040693 213 KSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVL 290 (382)
Q Consensus 213 ~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~ 290 (382)
.++.+..+|..+++.+..++... ....... ++..+++... ....++.+|.++++..
T Consensus 93 ~~~~l~~~d~~~~~~~~~~~~~~------~~~~~~~~~dg~~l~~~~~----------------~~~~~~~~d~~~~~~~ 150 (300)
T TIGR03866 93 DDNLVTVIDIETRKVLAEIPVGV------EPEGMAVSPDGKIVVNTSE----------------TTNMAHFIDTKTYEIV 150 (300)
T ss_pred CCCeEEEEECCCCeEEeEeeCCC------CcceEEECCCCCEEEEEec----------------CCCeEEEEeCCCCeEE
Confidence 57899999999988877765321 1111222 4555555421 1234667899998887
Q ss_pred eeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEe
Q 040693 291 WSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYD 338 (382)
Q Consensus 291 W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~ 338 (382)
......... ......-++..+++... .++.++.+|.++++.+-++.
T Consensus 151 ~~~~~~~~~-~~~~~s~dg~~l~~~~~-~~~~v~i~d~~~~~~~~~~~ 196 (300)
T TIGR03866 151 DNVLVDQRP-RFAEFTADGKELWVSSE-IGGTVSVIDVATRKVIKKIT 196 (300)
T ss_pred EEEEcCCCc-cEEEECCCCCEEEEEcC-CCCEEEEEEcCcceeeeeee
Confidence 665443311 11111123456666542 47899999999998766554
No 23
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=98.55 E-value=8.8e-05 Score=66.99 Aligned_cols=208 Identities=16% Similarity=0.180 Sum_probs=120.6
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
++.|+.+|..+++.+.+....... . ....+.+++..++.+..
T Consensus 72 ~~~i~i~~~~~~~~~~~~~~~~~~-----------i--~~~~~~~~~~~~~~~~~------------------------- 113 (289)
T cd00200 72 DKTIRLWDLETGECVRTLTGHTSY-----------V--SSVAFSPDGRILSSSSR------------------------- 113 (289)
T ss_pred CCeEEEEEcCcccceEEEeccCCc-----------E--EEEEEcCCCCEEEEecC-------------------------
Confidence 789999999888777766532110 0 12344444334444332
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
++.+..+|..+++.+..+...... ..-.....+ +..++.+..
T Consensus 114 --------~~~i~~~~~~~~~~~~~~~~~~~~---------------------------i~~~~~~~~---~~~l~~~~~ 155 (289)
T cd00200 114 --------DKTIKVWDVETGKCLTTLRGHTDW---------------------------VNSVAFSPD---GTFVASSSQ 155 (289)
T ss_pred --------CCeEEEEECCCcEEEEEeccCCCc---------------------------EEEEEEcCc---CCEEEEEcC
Confidence 388999999989888877632211 111111111 355666666
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEe
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLW 291 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W 291 (382)
++.+..+|..+++.+..+..... ....... ++..++++ ..++.+..+|.++++.+-
T Consensus 156 ~~~i~i~d~~~~~~~~~~~~~~~-----~i~~~~~~~~~~~l~~~-----------------~~~~~i~i~d~~~~~~~~ 213 (289)
T cd00200 156 DGTIKLWDLRTGKCVATLTGHTG-----EVNSVAFSPDGEKLLSS-----------------SSDGTIKLWDLSTGKCLG 213 (289)
T ss_pred CCcEEEEEccccccceeEecCcc-----ccceEEECCCcCEEEEe-----------------cCCCcEEEEECCCCceec
Confidence 89999999998988877763221 1222222 33466665 235789999999988877
Q ss_pred eecCCCCCCCCcceEEe-CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEe-C-CEEEEEeCcee
Q 040693 292 STADPSNGTAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVS-N-GCIYMGNGYKV 362 (382)
Q Consensus 292 ~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~-~-g~lyv~~~~g~ 362 (382)
+...... ....+... ++.+++... .++.|+.+|..+++.+..++............ + ..|++++.+|.
T Consensus 214 ~~~~~~~--~i~~~~~~~~~~~~~~~~-~~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~d~~ 284 (289)
T cd00200 214 TLRGHEN--GVNSVAFSPDGYLLASGS-EDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGT 284 (289)
T ss_pred chhhcCC--ceEEEEEcCCCcEEEEEc-CCCcEEEEEcCCceeEEEccccCCcEEEEEECCCCCEEEEecCCCe
Confidence 6643221 11222232 244444432 48999999999999888877433333334443 3 44554555543
No 24
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=98.50 E-value=6.6e-05 Score=67.85 Aligned_cols=221 Identities=14% Similarity=0.154 Sum_probs=128.7
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
++.+..+|..+++........... . ......+++..++++..+
T Consensus 30 ~g~i~i~~~~~~~~~~~~~~~~~~-----------i--~~~~~~~~~~~l~~~~~~------------------------ 72 (289)
T cd00200 30 DGTIKVWDLETGELLRTLKGHTGP-----------V--RDVAASADGTYLASGSSD------------------------ 72 (289)
T ss_pred CcEEEEEEeeCCCcEEEEecCCcc-----------e--eEEEECCCCCEEEEEcCC------------------------
Confidence 789999999988876665432111 0 123444444566665433
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
+.+..+|..+++.+.++....... .-.....+ +..++.+..
T Consensus 73 ---------~~i~i~~~~~~~~~~~~~~~~~~i---------------------------~~~~~~~~---~~~~~~~~~ 113 (289)
T cd00200 73 ---------KTIRLWDLETGECVRTLTGHTSYV---------------------------SSVAFSPD---GRILSSSSR 113 (289)
T ss_pred ---------CeEEEEEcCcccceEEEeccCCcE---------------------------EEEEEcCC---CCEEEEecC
Confidence 889999999888777766432211 00111111 345666666
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEe
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLW 291 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W 291 (382)
++.+..+|..+++....+.... ........ ++..++.+ ..++.+..+|.++++...
T Consensus 114 ~~~i~~~~~~~~~~~~~~~~~~-----~~i~~~~~~~~~~~l~~~-----------------~~~~~i~i~d~~~~~~~~ 171 (289)
T cd00200 114 DKTIKVWDVETGKCLTTLRGHT-----DWVNSVAFSPDGTFVASS-----------------SQDGTIKLWDLRTGKCVA 171 (289)
T ss_pred CCeEEEEECCCcEEEEEeccCC-----CcEEEEEEcCcCCEEEEE-----------------cCCCcEEEEEccccccce
Confidence 8999999999888887776221 11122222 23444443 235789999999998887
Q ss_pred eecCCCCCCCCcceEEe-C-CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEEeC-ceeEeecC
Q 040693 292 STADPSNGTAPGPVTVA-N-GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMGNG-YKVTVGFG 367 (382)
Q Consensus 292 ~~~~~~~~~~~~~~~~~-~-~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~~~-~g~~~~~~ 367 (382)
........ ...+... + ..+++++. ++.+..+|..+++.+-++............ .++.++++.+ +|. +.+|
T Consensus 172 ~~~~~~~~--i~~~~~~~~~~~l~~~~~--~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~-i~i~ 246 (289)
T cd00200 172 TLTGHTGE--VNSVAFSPDGEKLLSSSS--DGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGT-IRVW 246 (289)
T ss_pred eEecCccc--cceEEECCCcCEEEEecC--CCcEEEEECCCCceecchhhcCCceEEEEEcCCCcEEEEEcCCCc-EEEE
Confidence 77644321 2222232 3 36777664 899999999998888776432222233333 3366666665 444 3444
Q ss_pred CccCCCCCeEEE
Q 040693 368 NKNFTSGTSLYA 379 (382)
Q Consensus 368 ~~~~~~g~~l~~ 379 (382)
.+ .+++.+..
T Consensus 247 ~~--~~~~~~~~ 256 (289)
T cd00200 247 DL--RTGECVQT 256 (289)
T ss_pred Ec--CCceeEEE
Confidence 43 33554443
No 25
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=98.45 E-value=3e-07 Score=59.12 Aligned_cols=40 Identities=40% Similarity=0.552 Sum_probs=28.3
Q ss_pred CceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCcc
Q 040693 7 GKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKT 64 (382)
Q Consensus 7 Gk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~t 64 (382)
||++|+++++ ..+.++|++.+++||+++. +|.|+|||++|
T Consensus 1 G~~~W~~~~~----~~~~~~~~v~~g~vyv~~~--------------dg~l~ald~~t 40 (40)
T PF13570_consen 1 GKVLWSYDTG----GPIWSSPAVAGGRVYVGTG--------------DGNLYALDAAT 40 (40)
T ss_dssp S-EEEEEE-S----S---S--EECTSEEEEE-T--------------TSEEEEEETT-
T ss_pred CceeEEEECC----CCcCcCCEEECCEEEEEcC--------------CCEEEEEeCCC
Confidence 8999999985 3577899999999999996 89999999976
No 26
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.41 E-value=5.4e-05 Score=73.53 Aligned_cols=194 Identities=20% Similarity=0.232 Sum_probs=112.8
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
++.|..+|.+|.+++.+.+.....+ ....+.+++..+|+...+
T Consensus 15 ~~~v~viD~~t~~~~~~i~~~~~~h-------------~~~~~s~Dgr~~yv~~rd------------------------ 57 (369)
T PF02239_consen 15 SGSVAVIDGATNKVVARIPTGGAPH-------------AGLKFSPDGRYLYVANRD------------------------ 57 (369)
T ss_dssp GTEEEEEETTT-SEEEEEE-STTEE-------------EEEE-TT-SSEEEEEETT------------------------
T ss_pred CCEEEEEECCCCeEEEEEcCCCCce-------------eEEEecCCCCEEEEEcCC------------------------
Confidence 7899999999999999998742211 124566777789997543
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEE-c
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAV-Q 212 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~-~ 212 (382)
+.|..+|+.+++++-+.+.... |.=..++.+| ..++++ .
T Consensus 58 ---------g~vsviD~~~~~~v~~i~~G~~----------------------------~~~i~~s~DG---~~~~v~n~ 97 (369)
T PF02239_consen 58 ---------GTVSVIDLATGKVVATIKVGGN----------------------------PRGIAVSPDG---KYVYVANY 97 (369)
T ss_dssp ---------SEEEEEETTSSSEEEEEE-SSE----------------------------EEEEEE--TT---TEEEEEEE
T ss_pred ---------CeEEEEECCcccEEEEEecCCC----------------------------cceEEEcCCC---CEEEEEec
Confidence 8999999999999999987643 2223333344 345544 4
Q ss_pred cCcEEEEEeCCCCCeeeeeccCCCCC--CCCcccce-eeeCCe-EEEEecCccccccccCCCCCCCCCceEEEEECCCCc
Q 040693 213 KSGFAWALDRDSGSLIWSMEAGPGGL--GGGAMWGA-ATDERR-IYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGN 288 (382)
Q Consensus 213 ~~g~l~ald~~tG~~~W~~~~~~~~~--~g~~~~~~-~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~ 288 (382)
..+.+..+|.+|.+++-..+...... ......+. ...... .+++. ...+.+..+|.++.+
T Consensus 98 ~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~l----------------kd~~~I~vVdy~d~~ 161 (369)
T PF02239_consen 98 EPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNL----------------KDTGEIWVVDYSDPK 161 (369)
T ss_dssp ETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEE----------------TTTTEEEEEETTTSS
T ss_pred CCCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEE----------------ccCCeEEEEEecccc
Confidence 57889999999999999887542111 11111111 112222 22322 234788899988877
Q ss_pred EEeeecCCCCCCCCcceEEeCC-EEEEeeecCCCcEEEEeCCCCcEeEEEecCC
Q 040693 289 VLWSTADPSNGTAPGPVTVANG-VLFGGSTYRQGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 289 ~~W~~~~~~~~~~~~~~~~~~~-~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
.+..........-..-..-.++ +++++.. ....+.++|.++++.+...+.+.
T Consensus 162 ~~~~~~i~~g~~~~D~~~dpdgry~~va~~-~sn~i~viD~~~~k~v~~i~~g~ 214 (369)
T PF02239_consen 162 NLKVTTIKVGRFPHDGGFDPDGRYFLVAAN-GSNKIAVIDTKTGKLVALIDTGK 214 (369)
T ss_dssp CEEEEEEE--TTEEEEEE-TTSSEEEEEEG-GGTEEEEEETTTTEEEEEEE-SS
T ss_pred ccceeeecccccccccccCcccceeeeccc-ccceeEEEeeccceEEEEeeccc
Confidence 6655443321111111212233 5555543 46789999999999999887653
No 27
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=98.36 E-value=0.00013 Score=67.86 Aligned_cols=183 Identities=13% Similarity=0.074 Sum_probs=109.7
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDR 222 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~ 222 (382)
+.|+++|++||+++-+..-.... ..|.+.. ......++.+..++.++.++.
T Consensus 212 gti~~Wn~ktg~p~~~~~~~e~~-------------------------~~~~~~~----~~~~~~~~~g~~e~~~~~~~~ 262 (399)
T KOG0296|consen 212 GTIIVWNPKTGQPLHKITQAEGL-------------------------ELPCISL----NLAGSTLTKGNSEGVACGVNN 262 (399)
T ss_pred ceEEEEecCCCceeEEecccccC-------------------------cCCcccc----ccccceeEeccCCccEEEEcc
Confidence 89999999999999887643320 1111110 012467778888898999999
Q ss_pred CCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCC
Q 040693 223 DSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAP 302 (382)
Q Consensus 223 ~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~ 302 (382)
.+||++-..+...+......... +.-.-++.. . ...-.... +..+|+|..+|.++-+++..-..+.+ .
T Consensus 263 ~sgKVv~~~n~~~~~l~~~~e~~---~esve~~~~-s--s~lpL~A~---G~vdG~i~iyD~a~~~~R~~c~he~~---V 330 (399)
T KOG0296|consen 263 GSGKVVNCNNGTVPELKPSQEEL---DESVESIPS-S--SKLPLAAC---GSVDGTIAIYDLAASTLRHICEHEDG---V 330 (399)
T ss_pred ccceEEEecCCCCccccccchhh---hhhhhhccc-c--cccchhhc---ccccceEEEEecccchhheeccCCCc---e
Confidence 99999988774211110000000 000000000 0 00000001 14568999999998888877665542 3
Q ss_pred cceEEeC-CEEEEeeecCCCcEEEEeCCCCcEeEEEecCC-ceecceEEeCCEEEEEeCceeEeecCC
Q 040693 303 GPVTVAN-GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA-TIYGGASVSNGCIYMGNGYKVTVGFGN 368 (382)
Q Consensus 303 ~~~~~~~-~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~-~~~~~p~~~~g~lyv~~~~g~~~~~~~ 368 (382)
..+...+ .++|.+.. +|.|+.+|+.||..+..+.-.. .++--.+..+.++.|+.++.....+|.
T Consensus 331 ~~l~w~~t~~l~t~c~--~g~v~~wDaRtG~l~~~y~GH~~~Il~f~ls~~~~~vvT~s~D~~a~VF~ 396 (399)
T KOG0296|consen 331 TKLKWLNTDYLLTACA--NGKVRQWDARTGQLKFTYTGHQMGILDFALSPQKRLVVTVSDDNTALVFE 396 (399)
T ss_pred EEEEEcCcchheeecc--CceEEeeeccccceEEEEecCchheeEEEEcCCCcEEEEecCCCeEEEEe
Confidence 3444455 67888875 9999999999999999886433 233333346888888877766554443
No 28
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=98.35 E-value=0.00024 Score=64.93 Aligned_cols=120 Identities=19% Similarity=0.261 Sum_probs=72.7
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECC--CCcEE
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDAS--NGNVL 290 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~--tG~~~ 290 (382)
|.++.++.. |+...-.+.-. ..-+.++ +++.+|+.. ...+.|+.+|.. +++..
T Consensus 115 g~v~~~~~~-~~~~~~~~~~~------~pNGi~~s~dg~~lyv~d----------------s~~~~i~~~~~~~~~~~~~ 171 (246)
T PF08450_consen 115 GSVYRIDPD-GKVTVVADGLG------FPNGIAFSPDGKTLYVAD----------------SFNGRIWRFDLDADGGELS 171 (246)
T ss_dssp EEEEEEETT-SEEEEEEEEES------SEEEEEEETTSSEEEEEE----------------TTTTEEEEEEEETTTCCEE
T ss_pred cceEEECCC-CeEEEEecCcc------cccceEECCcchheeecc----------------cccceeEEEecccccccee
Confidence 679999988 66443332110 1112222 567888763 345779999985 33121
Q ss_pred ee---ecCCCCCCCCcceEEe-CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEe---CCEEEEEeC
Q 040693 291 WS---TADPSNGTAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVS---NGCIYMGNG 359 (382)
Q Consensus 291 W~---~~~~~~~~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~---~g~lyv~~~ 359 (382)
-+ ...+.......-+.++ ++.+|++.. ..+.|+.+|++ |+++-+.+++.....++++. .++|||++.
T Consensus 172 ~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~-~~~~I~~~~p~-G~~~~~i~~p~~~~t~~~fgg~~~~~L~vTta 245 (246)
T PF08450_consen 172 NRRVFIDFPGGPGYPDGLAVDSDGNLWVADW-GGGRIVVFDPD-GKLLREIELPVPRPTNCAFGGPDGKTLYVTTA 245 (246)
T ss_dssp EEEEEEE-SSSSCEEEEEEEBTTS-EEEEEE-TTTEEEEEETT-SCEEEEEE-SSSSEEEEEEESTTSSEEEEEEB
T ss_pred eeeeEEEcCCCCcCCCcceEcCCCCEEEEEc-CCCEEEEECCC-ccEEEEEcCCCCCEEEEEEECCCCCEEEEEeC
Confidence 11 2222211011224454 588999865 58999999998 99999999986666677773 488999975
No 29
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=98.31 E-value=0.00091 Score=63.01 Aligned_cols=186 Identities=13% Similarity=0.177 Sum_probs=98.5
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCC--CCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEE
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPG--PSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWA 219 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~a 219 (382)
++.|+-||++||+++|+.+....-... .+.....|....+ ...+.+.. =+-++... ..+++|+..-.-..++.
T Consensus 95 d~~~~EiDi~TgevlfeW~a~DH~~~~-~~~~~~~~~~~~g~~~~~~~D~~---HiNsV~~~-~~G~yLiS~R~~~~i~~ 169 (299)
T PF14269_consen 95 DDVFQEIDIETGEVLFEWSASDHVDPN-DSYDSQDPLPGSGGSSSFPWDYF---HINSVDKD-DDGDYLISSRNTSTIYK 169 (299)
T ss_pred cceeEEeccCCCCEEEEEEhhheeccc-ccccccccccCCCcCCCCCCCcc---Eeeeeeec-CCccEEEEecccCEEEE
Confidence 588999999999999999876542110 0000001111100 00111111 11111111 22578888888889999
Q ss_pred EeCCCCCeeeeeccCCC-C-------CCCCccccee--eeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcE
Q 040693 220 LDRDSGSLIWSMEAGPG-G-------LGGGAMWGAA--TDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNV 289 (382)
Q Consensus 220 ld~~tG~~~W~~~~~~~-~-------~~g~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~ 289 (382)
+|+.||+++|...-... . +......... ..+...+....|..... .......+.++.+|+++.+.
T Consensus 170 I~~~tG~I~W~lgG~~~~df~~~~~~f~~QHdar~~~~~~~~~~IslFDN~~~~~-----~~~~~s~~~v~~ld~~~~~~ 244 (299)
T PF14269_consen 170 IDPSTGKIIWRLGGKRNSDFTLPATNFSWQHDARFLNESNDDGTISLFDNANSDF-----NGTEPSRGLVLELDPETMTV 244 (299)
T ss_pred EECCCCcEEEEeCCCCCCcccccCCcEeeccCCEEeccCCCCCEEEEEcCCCCCC-----CCCcCCCceEEEEECCCCEE
Confidence 99999999999863310 0 0011111111 01222222222210000 11123457899999998866
Q ss_pred EeeecCC---C---CCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEec
Q 040693 290 LWSTADP---S---NGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDT 339 (382)
Q Consensus 290 ~W~~~~~---~---~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~ 339 (382)
.+..... . ....+..-...++.++++-. ..+++.=++++ |+++|++..
T Consensus 245 ~~~~~~~~~~~~~~s~~~G~~Q~L~nGn~li~~g-~~g~~~E~~~~-G~vv~~~~f 298 (299)
T PF14269_consen 245 TLVREYSDHPDGFYSPSQGSAQRLPNGNVLIGWG-NNGRISEFTPD-GEVVWEAQF 298 (299)
T ss_pred EEEEEeecCCCcccccCCCcceECCCCCEEEecC-CCceEEEECCC-CCEEEEEEC
Confidence 6555432 1 11223333456777777653 57889999965 999999864
No 30
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=98.24 E-value=9.3e-05 Score=67.38 Aligned_cols=144 Identities=22% Similarity=0.284 Sum_probs=102.2
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc-CcEEEEE
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK-SGFAWAL 220 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~-~g~l~al 220 (382)
.+.|..+|++||+++.+.++++.. |...-++. ++.|+.-+. ++..+.+
T Consensus 67 ~S~l~~~d~~tg~~~~~~~l~~~~-----------------------FgEGit~~--------~d~l~qLTWk~~~~f~y 115 (264)
T PF05096_consen 67 QSSLRKVDLETGKVLQSVPLPPRY-----------------------FGEGITIL--------GDKLYQLTWKEGTGFVY 115 (264)
T ss_dssp EEEEEEEETTTSSEEEEEE-TTT-------------------------EEEEEEE--------TTEEEEEESSSSEEEEE
T ss_pred cEEEEEEECCCCcEEEEEECCccc-----------------------cceeEEEE--------CCEEEEEEecCCeEEEE
Confidence 379999999999999999988752 22333444 677777765 5789999
Q ss_pred eCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCC-
Q 040693 221 DRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNG- 299 (382)
Q Consensus 221 d~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~- 299 (382)
|++|-+++-+++.+. .-|+...++..++++ +.+.+|+-+|+++-++.-++.+....
T Consensus 116 d~~tl~~~~~~~y~~------EGWGLt~dg~~Li~S-----------------DGS~~L~~~dP~~f~~~~~i~V~~~g~ 172 (264)
T PF05096_consen 116 DPNTLKKIGTFPYPG------EGWGLTSDGKRLIMS-----------------DGSSRLYFLDPETFKEVRTIQVTDNGR 172 (264)
T ss_dssp ETTTTEEEEEEE-SS------S--EEEECSSCEEEE------------------SSSEEEEE-TTT-SEEEEEE-EETTE
T ss_pred ccccceEEEEEecCC------cceEEEcCCCEEEEE-----------------CCccceEEECCcccceEEEEEEEECCE
Confidence 999999998887542 568888888999998 56688999999999988887764311
Q ss_pred --CCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 300 --TAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 300 --~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
..-.-+-..+|.||+.-. ....|..||++||+++-.+++.
T Consensus 173 pv~~LNELE~i~G~IyANVW-~td~I~~Idp~tG~V~~~iDls 214 (264)
T PF05096_consen 173 PVSNLNELEYINGKIYANVW-QTDRIVRIDPETGKVVGWIDLS 214 (264)
T ss_dssp E---EEEEEEETTEEEEEET-TSSEEEEEETTT-BEEEEEE-H
T ss_pred ECCCcEeEEEEcCEEEEEeC-CCCeEEEEeCCCCeEEEEEEhh
Confidence 111223456899999875 6889999999999999888664
No 31
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. MADH is induced when the bacteria are grown on methylamine as a carbon source, and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO): RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=98.18 E-value=0.00019 Score=67.55 Aligned_cols=126 Identities=15% Similarity=0.115 Sum_probs=75.3
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCC-------CCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGP-------GGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTI 275 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~-------~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~ 275 (382)
++..++.+.+|.++.+|....+..|..+..- ..+..+.....+. ..+++|+...+. .++...+.
T Consensus 195 ~~~~~F~Sy~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyvLMh~g-------~~gsHKdp 267 (342)
T PF06433_consen 195 GGRLYFVSYEGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYVLMHQG-------GEGSHKDP 267 (342)
T ss_dssp TTEEEEEBTTSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEEEEEE---------TT-TTS-
T ss_pred CCeEEEEecCCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEEEecCC-------CCCCccCC
Confidence 4678889999999999988666554433211 0111011111222 468899865432 22333455
Q ss_pred CceEEEEECCCCcEEeeecCCCCCCCCcceEEeC-CEEEEeeecCCCcEEEEeCCCCcEeEEEec
Q 040693 276 AGGWVAMDASNGNVLWSTADPSNGTAPGPVTVAN-GVLFGGSTYRQGPIYAMDVKTGKILWSYDT 339 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~ 339 (382)
+..|+.+|++|+|++-+++++.. ..+-.+.-++ .++|..+. .++.|+.+|+.|||.+-+.+-
T Consensus 268 gteVWv~D~~t~krv~Ri~l~~~-~~Si~Vsqd~~P~L~~~~~-~~~~l~v~D~~tGk~~~~~~~ 330 (342)
T PF06433_consen 268 GTEVWVYDLKTHKRVARIPLEHP-IDSIAVSQDDKPLLYALSA-GDGTLDVYDAATGKLVRSIEQ 330 (342)
T ss_dssp EEEEEEEETTTTEEEEEEEEEEE-ESEEEEESSSS-EEEEEET-TTTEEEEEETTT--EEEEE--
T ss_pred ceEEEEEECCCCeEEEEEeCCCc-cceEEEccCCCcEEEEEcC-CCCeEEEEeCcCCcEEeehhc
Confidence 67799999999999999987652 1222222223 47887663 468899999999999988864
No 32
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=98.17 E-value=4.3e-06 Score=53.01 Aligned_cols=37 Identities=38% Similarity=0.841 Sum_probs=32.1
Q ss_pred CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceE
Q 040693 310 GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGAS 348 (382)
Q Consensus 310 ~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~ 348 (382)
++||+++. +|.|+|||++||+++|+++.+....+.|+
T Consensus 1 ~~v~~~~~--~g~l~AlD~~TG~~~W~~~~~~~~~~~p~ 37 (38)
T PF01011_consen 1 GRVYVGTP--DGYLYALDAKTGKVLWKFQTGPPVDSSPI 37 (38)
T ss_dssp TEEEEETT--TSEEEEEETTTTSEEEEEESSSGGGSCBE
T ss_pred CEEEEeCC--CCEEEEEECCCCCEEEeeeCCCCCccCcC
Confidence 57888874 99999999999999999999887777665
No 33
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=98.17 E-value=1.8e-06 Score=55.40 Aligned_cols=40 Identities=38% Similarity=0.773 Sum_probs=28.1
Q ss_pred CcEeEEEecCCceecceEEeCCEEEEEeCceeEeecCCccCCC
Q 040693 331 GKILWSYDTGATIYGGASVSNGCIYMGNGYKVTVGFGNKNFTS 373 (382)
Q Consensus 331 G~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~ 373 (382)
|+++|+++++..+.++|++.+++||+++.+|. +|+||+.|
T Consensus 1 G~~~W~~~~~~~~~~~~~v~~g~vyv~~~dg~---l~ald~~t 40 (40)
T PF13570_consen 1 GKVLWSYDTGGPIWSSPAVAGGRVYVGTGDGN---LYALDAAT 40 (40)
T ss_dssp S-EEEEEE-SS---S--EECTSEEEEE-TTSE---EEEEETT-
T ss_pred CceeEEEECCCCcCcCCEEECCEEEEEcCCCE---EEEEeCCC
Confidence 78999999999899999999999999999988 57788764
No 34
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.16 E-value=0.00014 Score=70.77 Aligned_cols=249 Identities=14% Similarity=0.150 Sum_probs=131.3
Q ss_pred CcCCCCceeeeeecCcCccceeeeceEEE--cCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCC
Q 040693 2 VKRSNGKLVWKTKLDDHARSFITMSGTYY--KGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFG 79 (382)
Q Consensus 2 ld~~tGk~~W~~~~~~~~~~~~~~~p~v~--~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~ 79 (382)
||.+|.+++-+.+.... ........ +..+|+..+ +|.|..+|+.+++++-+.+.+....
T Consensus 21 iD~~t~~~~~~i~~~~~----~h~~~~~s~Dgr~~yv~~r--------------dg~vsviD~~~~~~v~~i~~G~~~~- 81 (369)
T PF02239_consen 21 IDGATNKVVARIPTGGA----PHAGLKFSPDGRYLYVANR--------------DGTVSVIDLATGKVVATIKVGGNPR- 81 (369)
T ss_dssp EETTT-SEEEEEE-STT----EEEEEE-TT-SSEEEEEET--------------TSEEEEEETTSSSEEEEEE-SSEEE-
T ss_pred EECCCCeEEEEEcCCCC----ceeEEEecCCCCEEEEEcC--------------CCeEEEEECCcccEEEEEecCCCcc-
Confidence 67888888887776421 11122222 356888765 8899999999999999998853321
Q ss_pred CCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEE
Q 040693 80 KLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYK 159 (382)
Q Consensus 80 ~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~ 159 (382)
...+.+++.++|++... .+.+..+|.+|.+++-+.
T Consensus 82 -------------~i~~s~DG~~~~v~n~~--------------------------------~~~v~v~D~~tle~v~~I 116 (369)
T PF02239_consen 82 -------------GIAVSPDGKYVYVANYE--------------------------------PGTVSVIDAETLEPVKTI 116 (369)
T ss_dssp -------------EEEE--TTTEEEEEEEE--------------------------------TTEEEEEETTT--EEEEE
T ss_pred -------------eEEEcCCCCEEEEEecC--------------------------------CCceeEeccccccceeec
Confidence 35677788888887543 389999999999999988
Q ss_pred ecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEee-eCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCC
Q 040693 160 QLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMY-RNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGL 238 (382)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~-~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~ 238 (382)
+...... +. ..+.+..+. ... ....|+.....+.++.+|..+.+.+.......
T Consensus 117 ~~~~~~~---------------------~~-~~~Rv~aIv~s~~-~~~fVv~lkd~~~I~vVdy~d~~~~~~~~i~~--- 170 (369)
T PF02239_consen 117 PTGGMPV---------------------DG-PESRVAAIVASPG-RPEFVVNLKDTGEIWVVDYSDPKNLKVTTIKV--- 170 (369)
T ss_dssp E--EE-T---------------------TT-S---EEEEEE-SS-SSEEEEEETTTTEEEEEETTTSSCEEEEEEE----
T ss_pred ccccccc---------------------cc-cCCCceeEEecCC-CCEEEEEEccCCeEEEEEeccccccceeeecc---
Confidence 7764310 00 122222221 122 12345555666889999988777665444321
Q ss_pred CCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCC---CcceEEeCCEEE
Q 040693 239 GGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTA---PGPVTVANGVLF 313 (382)
Q Consensus 239 ~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~---~~~~~~~~~~v~ 313 (382)
+........ ++.++++... .++.|..+|.++++..+..+....+.. ...+-...+.+|
T Consensus 171 -g~~~~D~~~dpdgry~~va~~----------------~sn~i~viD~~~~k~v~~i~~g~~p~~~~~~~~php~~g~vw 233 (369)
T PF02239_consen 171 -GRFPHDGGFDPDGRYFLVAAN----------------GSNKIAVIDTKTGKLVALIDTGKKPHPGPGANFPHPGFGPVW 233 (369)
T ss_dssp --TTEEEEEE-TTSSEEEEEEG----------------GGTEEEEEETTTTEEEEEEE-SSSBEETTEEEEEETTTEEEE
T ss_pred -cccccccccCcccceeeeccc----------------ccceeEEEeeccceEEEEeeccccccccccccccCCCcceEE
Confidence 111122222 3444555432 246799999999999998876442110 001111123444
Q ss_pred EeeecC--------CCcEEEEeCCCCcEeEEEecCCceec-ceEEeCCEEEEE
Q 040693 314 GGSTYR--------QGPIYAMDVKTGKILWSYDTGATIYG-GASVSNGCIYMG 357 (382)
Q Consensus 314 ~~~~~~--------~g~l~~ld~~tG~ilw~~~~~~~~~~-~p~~~~g~lyv~ 357 (382)
...... ...+..+|..+.|++-+.++.++-.. ..--....||+.
T Consensus 234 ~~~~~~~~~~~~ig~~~v~v~d~~~wkvv~~I~~~G~glFi~thP~s~~vwvd 286 (369)
T PF02239_consen 234 ATSGLGYFAIPLIGTDPVSVHDDYAWKVVKTIPTQGGGLFIKTHPDSRYVWVD 286 (369)
T ss_dssp EEEBSSSSEEEEEE--TTT-STTTBTSEEEEEE-SSSS--EE--TT-SEEEEE
T ss_pred eeccccceecccccCCccccchhhcCeEEEEEECCCCcceeecCCCCccEEee
Confidence 443201 22233455566777777777553221 111244566666
No 35
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=98.14 E-value=0.00087 Score=64.62 Aligned_cols=157 Identities=15% Similarity=0.130 Sum_probs=85.1
Q ss_pred cEEEE-EccCcEEEEEeCCCCC--eeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 206 DIVVA-VQKSGFAWALDRDSGS--LIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 206 ~~v~~-~~~~g~l~ald~~tG~--~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
..|++ .-....|+.++.+..+ +.-......+...|.......-++..+|+... .+++|..+
T Consensus 156 ~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e----------------~s~~v~v~ 219 (345)
T PF10282_consen 156 RFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNE----------------LSNTVSVF 219 (345)
T ss_dssp SEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEET----------------TTTEEEEE
T ss_pred CEEEEEecCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecC----------------CCCcEEEE
Confidence 34444 3344556666665443 43322211111122222222226788888732 35677777
Q ss_pred ECC--CCcEEeeecCCC---CC---CCCcceEEe--CCEEEEeeecCCCcEEEEeC--CCCcEeEEE--ecCCceecceE
Q 040693 283 DAS--NGNVLWSTADPS---NG---TAPGPVTVA--NGVLFGGSTYRQGPIYAMDV--KTGKILWSY--DTGATIYGGAS 348 (382)
Q Consensus 283 d~~--tG~~~W~~~~~~---~~---~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~--~tG~ilw~~--~~~~~~~~~p~ 348 (382)
+.. +|+......... .. .....+.+. +..+|+... ..+.|.+|+. ++|++.... +..+..-....
T Consensus 220 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr-~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~ 298 (345)
T PF10282_consen 220 DYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNR-GSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFA 298 (345)
T ss_dssp EEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEEC-TTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEE
T ss_pred eecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEec-cCCEEEEEEEecCCCceEEEEEEeCCCCCccEEE
Confidence 766 776554433221 10 112233343 568898774 3556666665 678775533 33332223334
Q ss_pred E--eCCEEEEEeCceeEeecCCccCCCCCeEEE
Q 040693 349 V--SNGCIYMGNGYKVTVGFGNKNFTSGTSLYA 379 (382)
Q Consensus 349 ~--~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~ 379 (382)
+ .+..|||++.....+.+|.+|..+|++...
T Consensus 299 ~s~~g~~l~Va~~~s~~v~vf~~d~~tG~l~~~ 331 (345)
T PF10282_consen 299 FSPDGRYLYVANQDSNTVSVFDIDPDTGKLTPV 331 (345)
T ss_dssp E-TTSSEEEEEETTTTEEEEEEEETTTTEEEEE
T ss_pred EeCCCCEEEEEecCCCeEEEEEEeCCCCcEEEe
Confidence 4 788899999888888899999999987543
No 36
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=98.13 E-value=0.00059 Score=68.22 Aligned_cols=202 Identities=19% Similarity=0.252 Sum_probs=142.9
Q ss_pred EcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCC
Q 040693 30 YKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGN 109 (382)
Q Consensus 30 ~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~ 109 (382)
-+++||-... .|.|.=+|+.+++++-..+.. |+..| ++++.+.++.+.++..+
T Consensus 79 e~~RLFS~g~--------------sg~i~EwDl~~lk~~~~~d~~-----------gg~IW--siai~p~~~~l~Igcdd 131 (691)
T KOG2048|consen 79 EGGRLFSSGL--------------SGSITEWDLHTLKQKYNIDSN-----------GGAIW--SIAINPENTILAIGCDD 131 (691)
T ss_pred cCCeEEeecC--------------CceEEEEecccCceeEEecCC-----------Cccee--EEEeCCccceEEeecCC
Confidence 3677776654 889999999999999888762 66777 47888888888887654
Q ss_pred CCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCC
Q 040693 110 LYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADF 189 (382)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (382)
|.++-++..++++..+..+...
T Consensus 132 ---------------------------------Gvl~~~s~~p~~I~~~r~l~rq------------------------- 153 (691)
T KOG2048|consen 132 ---------------------------------GVLYDFSIGPDKITYKRSLMRQ------------------------- 153 (691)
T ss_pred ---------------------------------ceEEEEecCCceEEEEeecccc-------------------------
Confidence 7888999988998888776644
Q ss_pred CCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCC---CCcccceee-eCCeEEEEecCcccccc
Q 040693 190 GEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLG---GGAMWGAAT-DERRIYTNIANSQHKNF 265 (382)
Q Consensus 190 ~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~---g~~~~~~~~-~~~~v~~~~~~~~~~~~ 265 (382)
.-.++++..+.. +..|+.|..||.+.+.|..+|..+-..++.-.... ....|...+ .++.+..+
T Consensus 154 --~sRvLslsw~~~-~~~i~~Gs~Dg~Iriwd~~~~~t~~~~~~~~d~l~k~~~~iVWSv~~Lrd~tI~sg--------- 221 (691)
T KOG2048|consen 154 --KSRVLSLSWNPT-GTKIAGGSIDGVIRIWDVKSGQTLHIITMQLDRLSKREPTIVWSVLFLRDSTIASG--------- 221 (691)
T ss_pred --cceEEEEEecCC-ccEEEecccCceEEEEEcCCCceEEEeeecccccccCCceEEEEEEEeecCcEEEe---------
Confidence 123444443331 34588999999999999999998885544322111 123466544 55666655
Q ss_pred ccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 266 NLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 266 ~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
+..|+|.-.|.++|.+.=........ .-.++++ +++||.+.. +++++.+..++.+-.|.....
T Consensus 222 --------DS~G~V~FWd~~~gTLiqS~~~h~ad--Vl~Lav~~~~d~vfsaGv--d~~ii~~~~~~~~~~wv~~~~ 286 (691)
T KOG2048|consen 222 --------DSAGTVTFWDSIFGTLIQSHSCHDAD--VLALAVADNEDRVFSAGV--DPKIIQYSLTTNKSEWVINSR 286 (691)
T ss_pred --------cCCceEEEEcccCcchhhhhhhhhcc--eeEEEEcCCCCeEEEccC--CCceEEEEecCCccceeeecc
Confidence 67799999999999888666654432 2233333 479999885 999999998877666865443
No 37
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=98.10 E-value=0.0015 Score=61.11 Aligned_cols=152 Identities=9% Similarity=0.092 Sum_probs=92.4
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+..++.+..||.+.++|++||.++-+..-..... ..+.... ....++.. ...+.+....
T Consensus 202 GKr~~tgy~dgti~~Wn~ktg~p~~~~~~~e~~~---~~~~~~~~~~~~~~~g-----------------~~e~~~~~~~ 261 (399)
T KOG0296|consen 202 GKRILTGYDDGTIIVWNPKTGQPLHKITQAEGLE---LPCISLNLAGSTLTKG-----------------NSEGVACGVN 261 (399)
T ss_pred CceEEEEecCceEEEEecCCCceeEEecccccCc---CCccccccccceeEec-----------------cCCccEEEEc
Confidence 4677888889999999999999998877322100 0001111 22233332 2345566667
Q ss_pred CCCCcEEeeecC--CC---------CCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCC
Q 040693 284 ASNGNVLWSTAD--PS---------NGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNG 352 (382)
Q Consensus 284 ~~tG~~~W~~~~--~~---------~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g 352 (382)
..+||++--... +. ....+.+....=.+.-++. -+|.|...|..+-+++-+.+...++..-.-....
T Consensus 262 ~~sgKVv~~~n~~~~~l~~~~e~~~esve~~~~ss~lpL~A~G~--vdG~i~iyD~a~~~~R~~c~he~~V~~l~w~~t~ 339 (399)
T KOG0296|consen 262 NGSGKVVNCNNGTVPELKPSQEELDESVESIPSSSKLPLAACGS--VDGTIAIYDLAASTLRHICEHEDGVTKLKWLNTD 339 (399)
T ss_pred cccceEEEecCCCCccccccchhhhhhhhhcccccccchhhccc--ccceEEEEecccchhheeccCCCceEEEEEcCcc
Confidence 777777655542 11 0011111111112333333 4899999999888887777776664433333457
Q ss_pred EEEEEeCceeEeecCCccCCCCCeEEEEE
Q 040693 353 CIYMGNGYKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 353 ~lyv~~~~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
.||.+..+|. ++.+|++||.++..|.
T Consensus 340 ~l~t~c~~g~---v~~wDaRtG~l~~~y~ 365 (399)
T KOG0296|consen 340 YLLTACANGK---VRQWDARTGQLKFTYT 365 (399)
T ss_pred hheeeccCce---EEeeeccccceEEEEe
Confidence 8888888887 5789999999998874
No 38
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=98.03 E-value=8.7e-06 Score=51.60 Aligned_cols=22 Identities=50% Similarity=0.813 Sum_probs=20.5
Q ss_pred cceEEEEeCccCceeeeeeccC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLP 75 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~ 75 (382)
+|.|+|||++||+++|+++...
T Consensus 9 ~g~l~AlD~~TG~~~W~~~~~~ 30 (38)
T PF01011_consen 9 DGYLYALDAKTGKVLWKFQTGP 30 (38)
T ss_dssp TSEEEEEETTTTSEEEEEESSS
T ss_pred CCEEEEEECCCCCEEEeeeCCC
Confidence 9999999999999999999853
No 39
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=98.02 E-value=0.0022 Score=64.62 Aligned_cols=147 Identities=18% Similarity=0.317 Sum_probs=75.8
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
....+++|. +|.++|.......... ..... .++.+++..+
T Consensus 127 ~~~~~~iD~-~G~Vrw~~~~~~~~~~-------------~~~~l-~nG~ll~~~~------------------------- 166 (477)
T PF05935_consen 127 SSYTYLIDN-NGDVRWYLPLDSGSDN-------------SFKQL-PNGNLLIGSG------------------------- 166 (477)
T ss_dssp EEEEEEEET-TS-EEEEE-GGGT--S-------------SEEE--TTS-EEEEEB-------------------------
T ss_pred CceEEEECC-CccEEEEEccCccccc-------------eeeEc-CCCCEEEecC-------------------------
Confidence 678999995 9999999987432210 11222 3345555443
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCc---ccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEE
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYD---VWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVA 210 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~ 210 (382)
..++.+|.. |+++|+++++... .+++. ..++ ++.|+.
T Consensus 167 ---------~~~~e~D~~-G~v~~~~~l~~~~~~~HHD~~-----------------~l~n-------------Gn~L~l 206 (477)
T PF05935_consen 167 ---------NRLYEIDLL-GKVIWEYDLPGGYYDFHHDID-----------------ELPN-------------GNLLIL 206 (477)
T ss_dssp ---------TEEEEE-TT---EEEEEE--TTEE-B-S-EE-----------------E-TT-------------S-EEEE
T ss_pred ---------CceEEEcCC-CCEEEeeecCCcccccccccE-----------------ECCC-------------CCEEEE
Confidence 689999975 9999999987632 11000 0111 222222
Q ss_pred Ec-------------cCcEEEEEeCCCCCeeeeeccCCCC------------------CCCCcccceee------eCCeE
Q 040693 211 VQ-------------KSGFAWALDRDSGSLIWSMEAGPGG------------------LGGGAMWGAAT------DERRI 253 (382)
Q Consensus 211 ~~-------------~~g~l~ald~~tG~~~W~~~~~~~~------------------~~g~~~~~~~~------~~~~v 253 (382)
.. ....++.+| .+|++.|..++...- ......|.-.. .++.|
T Consensus 207 ~~~~~~~~~~~~~~~~~D~Ivevd-~tG~vv~~wd~~d~ld~~~~~~~~~~~~~~~~~~~~~~DW~H~Nsi~yd~~dd~i 285 (477)
T PF05935_consen 207 ASETKYVDEDKDVDTVEDVIVEVD-PTGEVVWEWDFFDHLDPYRDTVLKPYPYGDISGSGGGRDWLHINSIDYDPSDDSI 285 (477)
T ss_dssp EEETTEE-TS-EE---S-EEEEE--TTS-EEEEEEGGGTS-TT--TTGGT--SSSSS-SSTTSBS--EEEEEEETTTTEE
T ss_pred EeecccccCCCCccEecCEEEEEC-CCCCEEEEEehHHhCCcccccccccccccccccCCCCCCccccCccEEeCCCCeE
Confidence 22 134699999 899999998854210 01223333221 25677
Q ss_pred EEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 254 YTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 254 ~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
+++.-+ -+.|+.||.+||+++|....+.
T Consensus 286 ivSsR~----------------~s~V~~Id~~t~~i~Wilg~~~ 313 (477)
T PF05935_consen 286 IVSSRH----------------QSAVIKIDYRTGKIKWILGPPG 313 (477)
T ss_dssp EEEETT----------------T-EEEEEE-TTS-EEEEES-ST
T ss_pred EEEcCc----------------ceEEEEEECCCCcEEEEeCCCC
Confidence 776432 3579999999999999987654
No 40
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. It is induced when grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=97.99 E-value=0.0031 Score=59.49 Aligned_cols=272 Identities=13% Similarity=0.062 Sum_probs=148.4
Q ss_pred CcCCCCceeeeeecCcCccceeeeceEEE--cCEEEEeccC----ccccccccccccccceEEEEeCccCceeeeeeccC
Q 040693 2 VKRSNGKLVWKTKLDDHARSFITMSGTYY--KGAYYVGTSS----IEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLP 75 (382)
Q Consensus 2 ld~~tGk~~W~~~~~~~~~~~~~~~p~v~--~~~v~v~~~~----~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~ 75 (382)
||.++||++=..+.+ +....++. +..+|+++.- ....+ .-.|..+|.+|-++.+...+.+
T Consensus 22 iD~d~~k~lGmi~~g------~~~~~~~spdgk~~y~a~T~~sR~~rG~R--------tDvv~~~D~~TL~~~~EI~iP~ 87 (342)
T PF06433_consen 22 IDADSGKLLGMIDTG------FLGNVALSPDGKTIYVAETFYSRGTRGER--------TDVVEIWDTQTLSPTGEIEIPP 87 (342)
T ss_dssp EETTTTEEEEEEEEE------SSEEEEE-TTSSEEEEEEEEEEETTEEEE--------EEEEEEEETTTTEEEEEEEETT
T ss_pred EECCCCcEEEEeecc------cCCceeECCCCCEEEEEEEEEeccccccc--------eeEEEEEecCcCcccceEecCC
Confidence 577788876666542 11122223 5667765431 12222 3478999999999999999854
Q ss_pred CCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcE
Q 040693 76 DNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKI 155 (382)
Q Consensus 76 ~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~ 155 (382)
.... ..+... ....+..++..+||---. ....|-.+|.+.+|.
T Consensus 88 k~R~----~~~~~~--~~~~ls~dgk~~~V~N~T-------------------------------Pa~SVtVVDl~~~kv 130 (342)
T PF06433_consen 88 KPRA----QVVPYK--NMFALSADGKFLYVQNFT-------------------------------PATSVTVVDLAAKKV 130 (342)
T ss_dssp S-B------BS--G--GGEEE-TTSSEEEEEEES-------------------------------SSEEEEEEETTTTEE
T ss_pred cchh----eecccc--cceEEccCCcEEEEEccC-------------------------------CCCeEEEEECCCCce
Confidence 3111 111111 134566677788885432 237999999999999
Q ss_pred EEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCC-CCCeeeeeccC
Q 040693 156 VWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRD-SGSLIWSMEAG 234 (382)
Q Consensus 156 ~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~-tG~~~W~~~~~ 234 (382)
+=+.+.++=. .++. .+ ..-......||.+..+..+ .||.. +....
T Consensus 131 v~ei~~PGC~----------------------------~iyP---~~--~~~F~~lC~DGsl~~v~Ld~~Gk~~-~~~t~ 176 (342)
T PF06433_consen 131 VGEIDTPGCW----------------------------LIYP---SG--NRGFSMLCGDGSLLTVTLDADGKEA-QKSTK 176 (342)
T ss_dssp EEEEEGTSEE----------------------------EEEE---EE--TTEEEEEETTSCEEEEEETSTSSEE-EEEEE
T ss_pred eeeecCCCEE----------------------------EEEe---cC--CCceEEEecCCceEEEEECCCCCEe-Eeecc
Confidence 9888776421 1111 01 1223344445545444444 57776 32211
Q ss_pred CCC-CCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC--------C-CCC-
Q 040693 235 PGG-LGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS--------N-GTA- 301 (382)
Q Consensus 235 ~~~-~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~--------~-~~~- 301 (382)
.-. ..-.....++. .++..|+. +..|.|+.+|.+..+..|..+... . ...
T Consensus 177 ~F~~~~dp~f~~~~~~~~~~~~~F~-----------------Sy~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG 239 (342)
T PF06433_consen 177 VFDPDDDPLFEHPAYSRDGGRLYFV-----------------SYEGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGG 239 (342)
T ss_dssp ESSTTTS-B-S--EEETTTTEEEEE-----------------BTTSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-S
T ss_pred ccCCCCcccccccceECCCCeEEEE-----------------ecCCEEEEEeccCCcccccCcccccCccccccCcCCcc
Confidence 000 00011122332 34455544 567999999998887665543221 0 011
Q ss_pred CcceEE--eCCEEEEeeec--------CCCcEEEEeCCCCcEeEEEecCCceecceEE---eCCEEEEEeC-ceeEeecC
Q 040693 302 PGPVTV--ANGVLFGGSTY--------RQGPIYAMDVKTGKILWSYDTGATIYGGASV---SNGCIYMGNG-YKVTVGFG 367 (382)
Q Consensus 302 ~~~~~~--~~~~v~~~~~~--------~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~---~~g~lyv~~~-~g~~~~~~ 367 (382)
.-.+.+ ..+++|+.... ..-.|+++|++++|.+-|.++...+. +..+ ..-.||..+. ++. ++
T Consensus 240 ~Q~~A~~~~~~rlyvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l~~~~~-Si~Vsqd~~P~L~~~~~~~~~---l~ 315 (342)
T PF06433_consen 240 WQLIAYHAASGRLYVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPLEHPID-SIAVSQDDKPLLYALSAGDGT---LD 315 (342)
T ss_dssp SS-EEEETTTTEEEEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEEEEEEES-EEEEESSSS-EEEEEETTTTE---EE
T ss_pred eeeeeeccccCeEEEEecCCCCCCccCCceEEEEEECCCCeEEEEEeCCCccc-eEEEccCCCcEEEEEcCCCCe---EE
Confidence 122333 35788886531 12259999999999999999876543 3333 2346776655 333 56
Q ss_pred CccCCCCCeEEE
Q 040693 368 NKNFTSGTSLYA 379 (382)
Q Consensus 368 ~~~~~~g~~l~~ 379 (382)
-+|..||+.+-+
T Consensus 316 v~D~~tGk~~~~ 327 (342)
T PF06433_consen 316 VYDAATGKLVRS 327 (342)
T ss_dssp EEETTT--EEEE
T ss_pred EEeCcCCcEEee
Confidence 789999998754
No 41
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=97.94 E-value=0.0031 Score=63.47 Aligned_cols=280 Identities=16% Similarity=0.242 Sum_probs=120.8
Q ss_pred CCceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCc
Q 040693 6 NGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYA 85 (382)
Q Consensus 6 tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~ 85 (382)
+|.++|..+....... .--..-++.++++. ...++.+|. .|+++|.+++....
T Consensus 136 ~G~Vrw~~~~~~~~~~---~~~~l~nG~ll~~~---------------~~~~~e~D~-~G~v~~~~~l~~~~-------- 188 (477)
T PF05935_consen 136 NGDVRWYLPLDSGSDN---SFKQLPNGNLLIGS---------------GNRLYEIDL-LGKVIWEYDLPGGY-------- 188 (477)
T ss_dssp TS-EEEEE-GGGT--S---SEEE-TTS-EEEEE---------------BTEEEEE-T-T--EEEEEE--TTE--------
T ss_pred CccEEEEEccCccccc---eeeEcCCCCEEEec---------------CCceEEEcC-CCCEEEeeecCCcc--------
Confidence 6999999987542111 01234567777766 568999998 89999999984321
Q ss_pred CccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCc
Q 040693 86 GAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYD 165 (382)
Q Consensus 86 g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~ 165 (382)
..+.......+.+++++++....+.... .........|+-+| .||+++|+.++...-
T Consensus 189 --~~~HHD~~~l~nGn~L~l~~~~~~~~~~--------------------~~~~~~~D~Ivevd-~tG~vv~~wd~~d~l 245 (477)
T PF05935_consen 189 --YDFHHDIDELPNGNLLILASETKYVDED--------------------KDVDTVEDVIVEVD-PTGEVVWEWDFFDHL 245 (477)
T ss_dssp --E-B-S-EEE-TTS-EEEEEEETTEE-TS---------------------EE---S-EEEEE--TTS-EEEEEEGGGTS
T ss_pred --cccccccEECCCCCEEEEEeecccccCC--------------------CCccEecCEEEEEC-CCCCEEEEEehHHhC
Confidence 0011234555666777766521100000 00112247899999 899999999876541
Q ss_pred c-cccccccCCCCCCC-CCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCC----
Q 040693 166 V-WFGACNWYLNPNCP-PGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLG---- 239 (382)
Q Consensus 166 ~-~~~~~~~~~~~~~~-~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~---- 239 (382)
. ...... ...+... .......++.= +-++.-+...+.+|+..-....++.+|..||++.|..-....-..
T Consensus 246 d~~~~~~~-~~~~~~~~~~~~~~~DW~H---~Nsi~yd~~dd~iivSsR~~s~V~~Id~~t~~i~Wilg~~~~w~~~~~~ 321 (477)
T PF05935_consen 246 DPYRDTVL-KPYPYGDISGSGGGRDWLH---INSIDYDPSDDSIIVSSRHQSAVIKIDYRTGKIKWILGPPGGWNGTYQD 321 (477)
T ss_dssp -TT--TTG-GT--SSSSS-SSTTSBS-----EEEEEEETTTTEEEEEETTT-EEEEEE-TTS-EEEEES-STT--TTTGG
T ss_pred Cccccccc-ccccccccccCCCCCCccc---cCccEEeCCCCeEEEEcCcceEEEEEECCCCcEEEEeCCCCCCCcccch
Confidence 0 000000 0000000 00000011111 111111122256666667778999999999999999875421100
Q ss_pred -----------------CCcc--cce----ee-eC---CeEEEEecCcccccccc-CCCCCCCCCce--EEEEECCCC--
Q 040693 240 -----------------GGAM--WGA----AT-DE---RRIYTNIANSQHKNFNL-KPSKNSTIAGG--WVAMDASNG-- 287 (382)
Q Consensus 240 -----------------g~~~--~~~----~~-~~---~~v~~~~~~~~~~~~~~-~~~~~~~~~g~--v~a~d~~tG-- 287 (382)
+... ++. .. ++ .++++-.+ ..+.... .........++ .+.+|.+++
T Consensus 322 ~ll~~vd~~G~~~~~~~~~~~~~~gQH~~~~~~~g~~~~l~vFDNg--~~r~~~~~~~~~~~~~~Sr~v~Y~Ide~~~T~ 399 (477)
T PF05935_consen 322 YLLTPVDSNGNPIDCGDGDFDWFWGQHTAHLIPDGPQGNLLVFDNG--NGRGYGQPAYVSPKDNYSRAVEYRIDENKMTV 399 (477)
T ss_dssp GB-EEB-TTS-B-EBSSSS----SS-EEEEE-TTS---SEEEEE----TTGGGS--SSCCG-----EEEEEEEETTTTEE
T ss_pred heeeeeccCCceeeccCCCCcccccccceEEcCCCCeEEEEEEECC--CCCCCCCccccccccccceEEEEEecCCCceE
Confidence 0001 110 11 23 23333222 2222211 00000011223 457888776
Q ss_pred cEEeeecCCC-----CCCCCcceEEeC-CEEEEeee--cC-------CCcEEEEeCCCCcEeEEEecCC
Q 040693 288 NVLWSTADPS-----NGTAPGPVTVAN-GVLFGGST--YR-------QGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 288 ~~~W~~~~~~-----~~~~~~~~~~~~-~~v~~~~~--~~-------~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
+..|++..+. .+..++.-...+ +.+++... +. .+.+.-++..+.++++++.+..
T Consensus 400 ~~vw~y~~~~g~~~yS~~~s~aq~l~n~gn~li~~g~~~~~~~~~~~~~~i~ev~~~~~~v~~e~~~~~ 468 (477)
T PF05935_consen 400 EQVWEYGKPRGNEFYSPIVSSAQYLPNKGNTLITSGMAGLFSNGKPSKGIIIEVDPETKEVVFELTIPS 468 (477)
T ss_dssp EEEEEESGGGGGGG--SS--EEEEETTTTEEEEEEEEETTTSTTSEEEEEEEEEETTT--EEEEEEEEE
T ss_pred EEEEEeCCCCCCCccCCcceeeEEecCCCCEEEEeCcccccccCCCCCceEEEEEcCCCEEEEEEEEec
Confidence 5699998762 233344444556 55444332 10 1257889999999999988754
No 42
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=97.94 E-value=0.0044 Score=56.84 Aligned_cols=207 Identities=14% Similarity=0.218 Sum_probs=128.9
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
++.-+-+|.++++.+.++++..+.+ .+|+-++.+.-.+ .+.||++.-- . +|..-.
T Consensus 8 eaeahfi~~~d~~~iY~felvG~~P-----~SGGDTYNAV~~v---Dd~IyFGGWV---------------H--APa~y~ 62 (339)
T PF09910_consen 8 EAEAHFIDRDDSEKIYRFELVGPPP-----TSGGDTYNAVEWV---DDFIYFGGWV---------------H--APAVYE 62 (339)
T ss_pred eeeeEEEecCCceEEEEeeeccCCC-----CCCCccceeeeee---cceEEEeeee---------------c--CCceee
Confidence 6777888889999999999854443 2566666543334 4789997532 1 111221
Q ss_pred CC------CCCCCCcceEEEEECCCC--cEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceee
Q 040693 134 KC------IEPENHSNSLLALDLDTG--KIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKH 205 (382)
Q Consensus 134 ~~------~~~~~~~g~v~ald~~tG--~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~ 205 (382)
++ +-+.+-.+.|..+|.+++ +++|+-+......|... -+-++++ .-+
T Consensus 63 gk~~g~~~IdF~NKYSHVH~yd~e~~~VrLLWkesih~~~~WaGE--------------------VSdIlYd-----P~~ 117 (339)
T PF09910_consen 63 GKGDGRATIDFRNKYSHVHEYDTENDSVRLLWKESIHDKTKWAGE--------------------VSDILYD-----PYE 117 (339)
T ss_pred eccCCceEEEEeeccceEEEEEcCCCeEEEEEecccCCccccccc--------------------hhheeeC-----CCc
Confidence 11 223344678999998888 46899888877777432 1334442 114
Q ss_pred cEEEEEccCc----EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEE
Q 040693 206 DIVVAVQKSG----FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA 281 (382)
Q Consensus 206 ~~v~~~~~~g----~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a 281 (382)
+.|+.+-.|| -+|.+|+.+|+..+-.+.+. .-+..+. +..+++..+.. ..-..|+|
T Consensus 118 D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~~ps-------~KG~~~~-D~a~F~i~~~~------------~g~~~i~~ 177 (339)
T PF09910_consen 118 DRLLLARADGHANLGVYSLDRRTGKAEKLSSNPS-------LKGTLVH-DYACFGINNFH------------KGVSGIHC 177 (339)
T ss_pred CEEEEEecCCcceeeeEEEcccCCceeeccCCCC-------cCceEee-eeEEEeccccc------------cCCceEEE
Confidence 5666666665 48999999999998877542 1222223 34444442211 22356999
Q ss_pred EECCCCcEEeeec-CCC----C---CCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcE
Q 040693 282 MDASNGNVLWSTA-DPS----N---GTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKI 333 (382)
Q Consensus 282 ~d~~tG~~~W~~~-~~~----~---~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~i 333 (382)
+|+.+||.+-+.- ... . ....+.+....+++|+-- .|.+++.|+-.+++
T Consensus 178 ~Dli~~~~~~e~f~~~~s~Dg~~~~~~~~G~~~s~ynR~faF~---rGGi~vgnP~~~e~ 234 (339)
T PF09910_consen 178 LDLISGKWVIESFDVSLSVDGGPVIRPELGAMASAYNRLFAFV---RGGIFVGNPYNGEE 234 (339)
T ss_pred EEccCCeEEEEecccccCCCCCceEeeccccEEEEeeeEEEEE---eccEEEeCCCCCCc
Confidence 9999998744321 111 0 122455555677888877 79999999998766
No 43
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=97.94 E-value=0.011 Score=56.48 Aligned_cols=149 Identities=9% Similarity=0.098 Sum_probs=72.8
Q ss_pred cEEEEEc-cCcEEEEEeCCC-CCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEE
Q 040693 206 DIVVAVQ-KSGFAWALDRDS-GSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA 281 (382)
Q Consensus 206 ~~v~~~~-~~g~l~ald~~t-G~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a 281 (382)
..+++.. .++.+..+|.++ |.+.-.....+ .........+ +++.+|+.. ...+.|..
T Consensus 92 ~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~---~~~~~~~~~~~p~g~~l~v~~----------------~~~~~v~v 152 (330)
T PRK11028 92 RFLFSASYNANCVSVSPLDKDGIPVAPIQIIE---GLEGCHSANIDPDNRTLWVPC----------------LKEDRIRL 152 (330)
T ss_pred CEEEEEEcCCCeEEEEEECCCCCCCCceeecc---CCCcccEeEeCCCCCEEEEee----------------CCCCEEEE
Confidence 4555543 467788887753 43211111100 0001111112 567788763 23577999
Q ss_pred EECCC-CcEE----eeecCCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCC--CCcEeEEEecC---Cc----eec
Q 040693 282 MDASN-GNVL----WSTADPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVK--TGKILWSYDTG---AT----IYG 345 (382)
Q Consensus 282 ~d~~t-G~~~----W~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~--tG~ilw~~~~~---~~----~~~ 345 (382)
+|..+ |++. .....+.+ .....+.+ ++.++|+... ..+.|.++|.+ +|+......+. .. .+.
T Consensus 153 ~d~~~~g~l~~~~~~~~~~~~g-~~p~~~~~~pdg~~lyv~~~-~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~ 230 (330)
T PRK11028 153 FTLSDDGHLVAQEPAEVTTVEG-AGPRHMVFHPNQQYAYCVNE-LNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWA 230 (330)
T ss_pred EEECCCCcccccCCCceecCCC-CCCceEEECCCCCEEEEEec-CCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccc
Confidence 99876 4321 11122111 01112233 3458888763 46888888876 45543322221 11 111
Q ss_pred -ceEE--eCCEEEEEeCceeEeecCCccCCCCC
Q 040693 346 -GASV--SNGCIYMGNGYKVTVGFGNKNFTSGT 375 (382)
Q Consensus 346 -~p~~--~~g~lyv~~~~g~~~~~~~~~~~~g~ 375 (382)
...+ .+..||+++.....+.+|.++..+++
T Consensus 231 ~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~ 263 (330)
T PRK11028 231 ADIHITPDGRHLYACDRTASLISVFSVSEDGSV 263 (330)
T ss_pred eeEEECCCCCEEEEecCCCCeEEEEEEeCCCCe
Confidence 1111 45678998766566677777665543
No 44
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=97.93 E-value=0.00068 Score=59.94 Aligned_cols=192 Identities=16% Similarity=0.107 Sum_probs=112.3
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCC-------------CCCCCCCCCCCCCC--CceEEEeeeCceeec
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNP-------------NCPPGPSPDADFGE--APMMLSMYRNKVKHD 206 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~--~p~~~~~~~~g~~~~ 206 (382)
+..++.+|..|||++-+++....++.....+..+.- .|......+..+-. .-.+.++ ..++.
T Consensus 80 Dk~v~vwDV~TGkv~Rr~rgH~aqVNtV~fNeesSVv~SgsfD~s~r~wDCRS~s~ePiQildea~D~V~Si---~v~~h 156 (307)
T KOG0316|consen 80 DKAVQVWDVNTGKVDRRFRGHLAQVNTVRFNEESSVVASGSFDSSVRLWDCRSRSFEPIQILDEAKDGVSSI---DVAEH 156 (307)
T ss_pred CceEEEEEcccCeeeeecccccceeeEEEecCcceEEEeccccceeEEEEcccCCCCccchhhhhcCceeEE---Eeccc
Confidence 478999999999999888765443221111111110 13222211111000 0001111 13468
Q ss_pred EEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccc--eeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 207 IVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWG--AATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 207 ~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~--~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
.|+.++-||.+..+|...|.+--.+-- .+... ..-+++.+.++ ..++.+..+|.
T Consensus 157 eIvaGS~DGtvRtydiR~G~l~sDy~g-------~pit~vs~s~d~nc~La~-----------------~l~stlrLlDk 212 (307)
T KOG0316|consen 157 EIVAGSVDGTVRTYDIRKGTLSSDYFG-------HPITSVSFSKDGNCSLAS-----------------SLDSTLRLLDK 212 (307)
T ss_pred EEEeeccCCcEEEEEeecceeehhhcC-------CcceeEEecCCCCEEEEe-----------------eccceeeeccc
Confidence 899999999999999998865433321 11111 11266777777 45688999999
Q ss_pred CCCcEEeeecCCCC--CCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee-cceEE--eCCEEEEEeC
Q 040693 285 SNGNVLWSTADPSN--GTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY-GGASV--SNGCIYMGNG 359 (382)
Q Consensus 285 ~tG~~~W~~~~~~~--~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~-~~p~~--~~g~lyv~~~ 359 (382)
.|||++=.+....+ ......+.-.+..|+-++ ++|.+|..|+.+++++-+++.+.... ..... ...++++++.
T Consensus 213 ~tGklL~sYkGhkn~eykldc~l~qsdthV~sgS--EDG~Vy~wdLvd~~~~sk~~~~~~v~v~dl~~hp~~~~f~~A~~ 290 (307)
T KOG0316|consen 213 ETGKLLKSYKGHKNMEYKLDCCLNQSDTHVFSGS--EDGKVYFWDLVDETQISKLSVVSTVIVTDLSCHPTMDDFITATG 290 (307)
T ss_pred chhHHHHHhcccccceeeeeeeecccceeEEecc--CCceEEEEEeccceeeeeeccCCceeEEeeecccCccceeEecC
Confidence 99999877765442 111223322345677777 59999999999999999998776442 22222 2356666655
Q ss_pred cee
Q 040693 360 YKV 362 (382)
Q Consensus 360 ~g~ 362 (382)
.+.
T Consensus 291 ~~~ 293 (307)
T KOG0316|consen 291 HGD 293 (307)
T ss_pred Cce
Confidence 433
No 45
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.93 E-value=0.0038 Score=63.51 Aligned_cols=149 Identities=16% Similarity=0.177 Sum_probs=100.8
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+..|..+..||.+..+|..+|--.=.+.... .+.........++.++.+ ..+|+|.|+|.
T Consensus 362 gq~iaTG~eDgKVKvWn~~SgfC~vTFteHt---s~Vt~v~f~~~g~~llss-----------------SLDGtVRAwDl 421 (893)
T KOG0291|consen 362 GQLIATGAEDGKVKVWNTQSGFCFVTFTEHT---SGVTAVQFTARGNVLLSS-----------------SLDGTVRAWDL 421 (893)
T ss_pred CcEEEeccCCCcEEEEeccCceEEEEeccCC---CceEEEEEEecCCEEEEe-----------------ecCCeEEeeee
Confidence 3677788888999999988887777766432 111222222355656655 56799999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEEeCce
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMGNGYK 361 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~~~~g 361 (382)
+.++---++..|. +...+.++++ |.+|++++. +.=.|+..+.+||+++--..-..+-.++... ..+.+.++.+..
T Consensus 422 kRYrNfRTft~P~-p~QfscvavD~sGelV~AG~~-d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWD 499 (893)
T KOG0291|consen 422 KRYRNFRTFTSPE-PIQFSCVAVDPSGELVCAGAQ-DSFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWD 499 (893)
T ss_pred cccceeeeecCCC-ceeeeEEEEcCCCCEEEeecc-ceEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEecccc
Confidence 9998877777776 3556667776 667777774 3346999999999988765432222222222 456677777888
Q ss_pred eEeecCCccCCCCC
Q 040693 362 VTVGFGNKNFTSGT 375 (382)
Q Consensus 362 ~~~~~~~~~~~~g~ 375 (382)
.++.+|.+...+|.
T Consensus 500 kTVRiW~if~s~~~ 513 (893)
T KOG0291|consen 500 KTVRIWDIFSSSGT 513 (893)
T ss_pred ceEEEEEeeccCce
Confidence 88888888777543
No 46
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=97.90 E-value=0.0021 Score=58.62 Aligned_cols=164 Identities=21% Similarity=0.252 Sum_probs=106.0
Q ss_pred eeeceEE-EcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCC
Q 040693 23 ITMSGTY-YKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRN 101 (382)
Q Consensus 23 ~~~~p~v-~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 101 (382)
+..-... .++.+|.++.. ++ +..|..+|++||+++.+.++.+..++ . .++.-++
T Consensus 46 FTQGL~~~~~g~LyESTG~--yG---------~S~l~~~d~~tg~~~~~~~l~~~~Fg-------E-------Git~~~d 100 (264)
T PF05096_consen 46 FTQGLEFLDDGTLYESTGL--YG---------QSSLRKVDLETGKVLQSVPLPPRYFG-------E-------GITILGD 100 (264)
T ss_dssp EEEEEEEEETTEEEEEECS--TT---------EEEEEEEETTTSSEEEEEE-TTT--E-------E-------EEEEETT
T ss_pred cCccEEecCCCEEEEeCCC--CC---------cEEEEEEECCCCcEEEEEECCccccc-------e-------eEEEECC
Confidence 3333444 67899987742 22 56899999999999999998544332 1 1222246
Q ss_pred eEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCC
Q 040693 102 HVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPP 181 (382)
Q Consensus 102 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~ 181 (382)
.+|.-+-. .+..+.+|++|-+++=++++.. ..|....
T Consensus 101 ~l~qLTWk--------------------------------~~~~f~yd~~tl~~~~~~~y~~-EGWGLt~---------- 137 (264)
T PF05096_consen 101 KLYQLTWK--------------------------------EGTGFVYDPNTLKKIGTFPYPG-EGWGLTS---------- 137 (264)
T ss_dssp EEEEEESS--------------------------------SSEEEEEETTTTEEEEEEE-SS-S--EEEE----------
T ss_pred EEEEEEec--------------------------------CCeEEEEccccceEEEEEecCC-cceEEEc----------
Confidence 78887753 3788999999999999998874 3552211
Q ss_pred CCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccc---eeeeCCeEEEEec
Q 040693 182 GPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWG---AATDERRIYTNIA 258 (382)
Q Consensus 182 ~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~---~~~~~~~v~~~~~ 258 (382)
.+..+++.++...|+-+|++|-++.-+.+.... +.+... ....++.||....
T Consensus 138 ----------------------dg~~Li~SDGS~~L~~~dP~~f~~~~~i~V~~~---g~pv~~LNELE~i~G~IyANVW 192 (264)
T PF05096_consen 138 ----------------------DGKRLIMSDGSSRLYFLDPETFKEVRTIQVTDN---GRPVSNLNELEYINGKIYANVW 192 (264)
T ss_dssp ----------------------CSSCEEEE-SSSEEEEE-TTT-SEEEEEE-EET---TEE---EEEEEEETTEEEEEET
T ss_pred ----------------------CCCEEEEECCccceEEECCcccceEEEEEEEEC---CEECCCcEeEEEEcCEEEEEeC
Confidence 156788888889999999999888777765321 111111 1236899999864
Q ss_pred CccccccccCCCCCCCCCceEEEEECCCCcEEeeecC
Q 040693 259 NSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTAD 295 (382)
Q Consensus 259 ~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~ 295 (382)
....|+.||++||+++=.+++
T Consensus 193 ----------------~td~I~~Idp~tG~V~~~iDl 213 (264)
T PF05096_consen 193 ----------------QTDRIVRIDPETGKVVGWIDL 213 (264)
T ss_dssp ----------------TSSEEEEEETTT-BEEEEEE-
T ss_pred ----------------CCCeEEEEeCCCCeEEEEEEh
Confidence 457899999999998877654
No 47
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=97.81 E-value=4.2e-05 Score=46.61 Aligned_cols=31 Identities=48% Similarity=0.824 Sum_probs=26.5
Q ss_pred EEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeee
Q 040693 28 TYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTF 72 (382)
Q Consensus 28 ~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~ 72 (382)
...++.||+++. ++.|+|+|+++|+++|+++
T Consensus 3 ~~~~~~v~~~~~--------------~g~l~a~d~~~G~~~W~~~ 33 (33)
T smart00564 3 VLSDGTVYVGST--------------DGTLYALDAKTGEILWTYK 33 (33)
T ss_pred EEECCEEEEEcC--------------CCEEEEEEcccCcEEEEcC
Confidence 345679999886 8999999999999999863
No 48
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=97.78 E-value=0.015 Score=55.40 Aligned_cols=153 Identities=8% Similarity=-0.030 Sum_probs=78.6
Q ss_pred cEEE-EEccCcEEEEEeCCC-CCee----eeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceE
Q 040693 206 DIVV-AVQKSGFAWALDRDS-GSLI----WSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGW 279 (382)
Q Consensus 206 ~~v~-~~~~~g~l~ald~~t-G~~~----W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v 279 (382)
..++ +...++.+..+|.++ |++. ...+... ..+.......-++..+|+... .++.|
T Consensus 138 ~~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~--g~~p~~~~~~pdg~~lyv~~~----------------~~~~v 199 (330)
T PRK11028 138 RTLWVPCLKEDRIRLFTLSDDGHLVAQEPAEVTTVE--GAGPRHMVFHPNQQYAYCVNE----------------LNSSV 199 (330)
T ss_pred CEEEEeeCCCCEEEEEEECCCCcccccCCCceecCC--CCCCceEEECCCCCEEEEEec----------------CCCEE
Confidence 3444 455668899999876 3321 1111111 011111111126678888632 25778
Q ss_pred EEEECC--CCcEEeeecCCCC------CCCCcceEE--eCCEEEEeeecCCCcEEEEeCCC--CcE--eEEEecCC-cee
Q 040693 280 VAMDAS--NGNVLWSTADPSN------GTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKT--GKI--LWSYDTGA-TIY 344 (382)
Q Consensus 280 ~a~d~~--tG~~~W~~~~~~~------~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~t--G~i--lw~~~~~~-~~~ 344 (382)
..+|.. +|+.......... ..+...+.+ ++..+|++.. ..+.|..++.++ ++. +-..+.+. +..
T Consensus 200 ~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~-~~~~I~v~~i~~~~~~~~~~~~~~~~~~p~~ 278 (330)
T PRK11028 200 DVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDR-TASLISVFSVSEDGSVLSFEGHQPTETQPRG 278 (330)
T ss_pred EEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecC-CCCeEEEEEEeCCCCeEEEeEEEeccccCCc
Confidence 888876 5655443332110 011111222 3457888752 357788887643 322 22223221 111
Q ss_pred cceEEeCCEEEEEeCceeEeecCCccCCCCCeE
Q 040693 345 GGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSL 377 (382)
Q Consensus 345 ~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l 377 (382)
......+..||+++.....+.+|.++..+|.+.
T Consensus 279 ~~~~~dg~~l~va~~~~~~v~v~~~~~~~g~l~ 311 (330)
T PRK11028 279 FNIDHSGKYLIAAGQKSHHISVYEIDGETGLLT 311 (330)
T ss_pred eEECCCCCEEEEEEccCCcEEEEEEcCCCCcEE
Confidence 111125779999987666788899988888753
No 49
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL, EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE type (InterPro IPR022528) and the Pseudomonas aeruginosa DevB type (InterPro IPR005900). This entry contains bacterial YbhE-type 6-phosphogluconolactonases (6PGL, EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.72 E-value=0.051 Score=52.37 Aligned_cols=234 Identities=15% Similarity=0.186 Sum_probs=126.7
Q ss_pred EEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCC
Q 040693 57 LAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCI 136 (382)
Q Consensus 57 l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 136 (382)
++.||.++|++.-........ .. +...+++++..+|+.....
T Consensus 17 ~~~~d~~~g~l~~~~~~~~~~---------~P---s~l~~~~~~~~LY~~~e~~-------------------------- 58 (345)
T PF10282_consen 17 VFRFDEETGTLTLVQTVAEGE---------NP---SWLAVSPDGRRLYVVNEGS-------------------------- 58 (345)
T ss_dssp EEEEETTTTEEEEEEEEEESS---------SE---CCEEE-TTSSEEEEEETTS--------------------------
T ss_pred EEEEcCCCCCceEeeeecCCC---------CC---ceEEEEeCCCEEEEEEccc--------------------------
Confidence 444566888876555432111 11 2357888889999987630
Q ss_pred CCCCCcceE--EEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccC
Q 040693 137 EPENHSNSL--LALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKS 214 (382)
Q Consensus 137 ~~~~~~g~v--~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~ 214 (382)
...+.| +.++.++|++.-.-+.... ...|+-..+.+++ +-++++.-..
T Consensus 59 ---~~~g~v~~~~i~~~~g~L~~~~~~~~~-------------------------g~~p~~i~~~~~g--~~l~vany~~ 108 (345)
T PF10282_consen 59 ---GDSGGVSSYRIDPDTGTLTLLNSVPSG-------------------------GSSPCHIAVDPDG--RFLYVANYGG 108 (345)
T ss_dssp ---STTTEEEEEEEETTTTEEEEEEEEEES-------------------------SSCEEEEEECTTS--SEEEEEETTT
T ss_pred ---cCCCCEEEEEECCCcceeEEeeeeccC-------------------------CCCcEEEEEecCC--CEEEEEEccC
Confidence 012555 5555555776665544321 1456666555555 3334444457
Q ss_pred cEEEEEeCCC-CCeeeeecc--------CCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 215 GFAWALDRDS-GSLIWSMEA--------GPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 215 g~l~ald~~t-G~~~W~~~~--------~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
|.+..++... |++.-.... .+....+........ +++.+|+... ....|+.++
T Consensus 109 g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dl----------------G~D~v~~~~ 172 (345)
T PF10282_consen 109 GSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDL----------------GADRVYVYD 172 (345)
T ss_dssp TEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEET----------------TTTEEEEEE
T ss_pred CeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEec----------------CCCEEEEEE
Confidence 7787777765 665544221 110011111222222 5678888632 235677777
Q ss_pred CCCCc--EEeee--cCCCCCCCCcc--eEE--eCCEEEEeeecCCCcEEEEeCC--CCcEeEEEecCC---c-----eec
Q 040693 284 ASNGN--VLWST--ADPSNGTAPGP--VTV--ANGVLFGGSTYRQGPIYAMDVK--TGKILWSYDTGA---T-----IYG 345 (382)
Q Consensus 284 ~~tG~--~~W~~--~~~~~~~~~~~--~~~--~~~~v~~~~~~~~g~l~~ld~~--tG~ilw~~~~~~---~-----~~~ 345 (382)
....+ +.-.. ..+. ..+| +.+ ++.++|+... ..+.|.+++.. +|+......++. . ..+
T Consensus 173 ~~~~~~~l~~~~~~~~~~---G~GPRh~~f~pdg~~~Yv~~e-~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 248 (345)
T PF10282_consen 173 IDDDTGKLTPVDSIKVPP---GSGPRHLAFSPDGKYAYVVNE-LSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPA 248 (345)
T ss_dssp E-TTS-TEEEEEEEECST---TSSEEEEEE-TTSSEEEEEET-TTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEE
T ss_pred EeCCCceEEEeecccccc---CCCCcEEEEcCCcCEEEEecC-CCCcEEEEeecccCCceeEEEEeeeccccccccCCce
Confidence 66554 43322 2222 2333 333 3468988874 67788888776 675544332221 1 112
Q ss_pred ceEE--eCCEEEEEeCceeEeecCCccCCCCCeEE
Q 040693 346 GASV--SNGCIYMGNGYKVTVGFGNKNFTSGTSLY 378 (382)
Q Consensus 346 ~p~~--~~g~lyv~~~~g~~~~~~~~~~~~g~~l~ 378 (382)
...+ .+..||+++.....+.+|.+|..+|++-.
T Consensus 249 ~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~ 283 (345)
T PF10282_consen 249 EIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTL 283 (345)
T ss_dssp EEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEE
T ss_pred eEEEecCCCEEEEEeccCCEEEEEEEecCCCceEE
Confidence 2233 36789999988778888999988887654
No 50
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=97.68 E-value=9.5e-05 Score=45.01 Aligned_cols=32 Identities=50% Similarity=1.079 Sum_probs=27.3
Q ss_pred eEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEe
Q 040693 305 VTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYD 338 (382)
Q Consensus 305 ~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~ 338 (382)
+...++++|+++. +|.|+++|.++|+++|+++
T Consensus 2 ~~~~~~~v~~~~~--~g~l~a~d~~~G~~~W~~~ 33 (33)
T smart00564 2 VVLSDGTVYVGST--DGTLYALDAKTGEILWTYK 33 (33)
T ss_pred cEEECCEEEEEcC--CCEEEEEEcccCcEEEEcC
Confidence 3456779999985 8999999999999999863
No 51
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=97.57 E-value=0.0039 Score=60.00 Aligned_cols=177 Identities=11% Similarity=0.113 Sum_probs=115.3
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceE-EEeeeCceeecEEEEEccCcEEEEEe
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMM-LSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~-~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
-+||.+|.++-..+-+..+... |+- ..+.++| ...|++++..-++|.+|
T Consensus 237 lrifqvDGk~N~~lqS~~l~~f----------------------------Pi~~a~f~p~G--~~~i~~s~rrky~ysyD 286 (514)
T KOG2055|consen 237 LRIFQVDGKVNPKLQSIHLEKF----------------------------PIQKAEFAPNG--HSVIFTSGRRKYLYSYD 286 (514)
T ss_pred EEEEEecCccChhheeeeeccC----------------------------ccceeeecCCC--ceEEEecccceEEEEee
Confidence 5789999888776666655433 221 1122334 44788888888999999
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTA 301 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~ 301 (382)
..++|+.--.++......+-..+-..-+++.|.+. ...|.|+.+-.+|++.+-..++.+. ..
T Consensus 287 le~ak~~k~~~~~g~e~~~~e~FeVShd~~fia~~-----------------G~~G~I~lLhakT~eli~s~KieG~-v~ 348 (514)
T KOG2055|consen 287 LETAKVTKLKPPYGVEEKSMERFEVSHDSNFIAIA-----------------GNNGHIHLLHAKTKELITSFKIEGV-VS 348 (514)
T ss_pred ccccccccccCCCCcccchhheeEecCCCCeEEEc-----------------ccCceEEeehhhhhhhhheeeeccE-Ee
Confidence 99988765554432211111222222255555544 3458899999999999988888763 22
Q ss_pred CcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEeCceeEeecCCc
Q 040693 302 PGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGNGYKVTVGFGNK 369 (382)
Q Consensus 302 ~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~~~g~~~~~~~~ 369 (382)
.-....++..+++... .|.|+.+|+.+-+.+-++...+++.+.-+. -++.++.+.++-.++-+|..
T Consensus 349 ~~~fsSdsk~l~~~~~--~GeV~v~nl~~~~~~~rf~D~G~v~gts~~~S~ng~ylA~GS~~GiVNIYd~ 416 (514)
T KOG2055|consen 349 DFTFSSDSKELLASGG--TGEVYVWNLRQNSCLHRFVDDGSVHGTSLCISLNGSYLATGSDSGIVNIYDG 416 (514)
T ss_pred eEEEecCCcEEEEEcC--CceEEEEecCCcceEEEEeecCccceeeeeecCCCceEEeccCcceEEEecc
Confidence 2223334567777764 899999999999999988777766544333 56666666666667888873
No 52
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.55 E-value=0.19 Score=55.69 Aligned_cols=148 Identities=16% Similarity=0.212 Sum_probs=80.9
Q ss_pred EEEEEccCcEEEEEeCCCCCeeeeeccCC-CCCCC--------Ccccceee--eCCeEEEEecCccccccccCCCCCCCC
Q 040693 207 IVVAVQKSGFAWALDRDSGSLIWSMEAGP-GGLGG--------GAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTI 275 (382)
Q Consensus 207 ~v~~~~~~g~l~ald~~tG~~~W~~~~~~-~~~~g--------~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~ 275 (382)
+.++...++.++.+|..+|...--...+. ....+ ....+.++ +++.+|+... .
T Consensus 697 LyVad~~~~~I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs----------------~ 760 (1057)
T PLN02919 697 VYIAMAGQHQIWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADS----------------E 760 (1057)
T ss_pred EEEEECCCCeEEEEECCCCeEEEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEEC----------------C
Confidence 34444567889999998886542111000 00000 01122223 4566888732 3
Q ss_pred CceEEEEECCCCcEEeeecCCC---------CC--------C--CCcceEE-eCCEEEEeeecCCCcEEEEeCCCCcEeE
Q 040693 276 AGGWVAMDASNGNVLWSTADPS---------NG--------T--APGPVTV-ANGVLFGGSTYRQGPIYAMDVKTGKILW 335 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~---------~~--------~--~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~tG~ilw 335 (382)
.+.|..+|+++|+..+...... +. . ....+.+ .++.+|++.. .++.|..+|++++++..
T Consensus 761 n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs-~N~rIrviD~~tg~v~t 839 (1057)
T PLN02919 761 SSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADS-YNHKIKKLDPATKRVTT 839 (1057)
T ss_pred CCeEEEEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEEC-CCCEEEEEECCCCeEEE
Confidence 4789999999987655331000 00 0 0112233 3467888775 68899999999888775
Q ss_pred EEecCCc----------eecce---EE-eCCEEEEEeCceeEeecCCccC
Q 040693 336 SYDTGAT----------IYGGA---SV-SNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 336 ~~~~~~~----------~~~~p---~~-~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
....+.. .+..| ++ .+|+|||++...+.+.++.++.
T Consensus 840 iaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~~ 889 (1057)
T PLN02919 840 LAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889 (1057)
T ss_pred EeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECCCCEEEEEECCC
Confidence 4433211 11122 22 4789999987766655555444
No 53
>PTZ00421 coronin; Provisional
Probab=97.55 E-value=0.11 Score=52.61 Aligned_cols=150 Identities=9% Similarity=0.074 Sum_probs=83.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
.++|++++.|+.+..+|..+++.+-.+..-. .......+ ++..++.+ ..++.|..+
T Consensus 138 ~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h~-----~~V~sla~spdG~lLatg-----------------s~Dg~IrIw 195 (493)
T PTZ00421 138 MNVLASAGADMVVNVWDVERGKAVEVIKCHS-----DQITSLEWNLDGSLLCTT-----------------SKDKKLNII 195 (493)
T ss_pred CCEEEEEeCCCEEEEEECCCCeEEEEEcCCC-----CceEEEEEECCCCEEEEe-----------------cCCCEEEEE
Confidence 4678888899999999999998765554211 11112211 44555544 456889999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeee--cCCCcEEEEeCCCCc-EeEEEecCC-ceecceEE--eCCEEE
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGST--YRQGPIYAMDVKTGK-ILWSYDTGA-TIYGGASV--SNGCIY 355 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~--~~~g~l~~ld~~tG~-ilw~~~~~~-~~~~~p~~--~~g~ly 355 (382)
|+++|+.+.+............... .++.++.... ..++.|...|..+.+ .+-...... .....|.+ ..+.||
T Consensus 196 D~rsg~~v~tl~~H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~~~d~~~~~~~~~~d~d~~~L~ 275 (493)
T PTZ00421 196 DPRDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDTNLLY 275 (493)
T ss_pred ECCCCcEEEEEecCCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEeCCCCCCceeEeccCCCCceEEEEEcCCCCEEE
Confidence 9999998877654432111111112 2345554321 136789999988654 333333222 12223333 345566
Q ss_pred EEe-CceeEeecCCccCCCCCeEEE
Q 040693 356 MGN-GYKVTVGFGNKNFTSGTSLYA 379 (382)
Q Consensus 356 v~~-~~g~~~~~~~~~~~~g~~l~~ 379 (382)
+++ +++. +.+|.+ .+++.+..
T Consensus 276 lggkgDg~-Iriwdl--~~~~~~~~ 297 (493)
T PTZ00421 276 IGSKGEGN-IRCFEL--MNERLTFC 297 (493)
T ss_pred EEEeCCCe-EEEEEe--eCCceEEE
Confidence 665 3544 445544 44665543
No 54
>KOG2103 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.52 E-value=0.005 Score=63.27 Aligned_cols=200 Identities=20% Similarity=0.259 Sum_probs=115.5
Q ss_pred CCCceeeeeecCcCccceeeeceEEE---cCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCC
Q 040693 5 SNGKLVWKTKLDDHARSFITMSGTYY---KGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKL 81 (382)
Q Consensus 5 ~tGk~~W~~~~~~~~~~~~~~~p~v~---~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~ 81 (382)
+-||..|+..+= +....+-... ..+++|.+. +|.|..|++.||+++|+.-+.++..+.
T Consensus 22 q~gkfdwr~~~v----G~~k~~~~~~~t~~~rlivsT~--------------~~vlAsL~~~tGei~WRqvl~~~~~~~- 82 (910)
T KOG2103|consen 22 QAGKFDWRQQLV----GVKKVNFLVYDTKSKRLIVSTE--------------KGVLASLNLRTGEIIWRQVLEPKTSGL- 82 (910)
T ss_pred Hhhhcchhhhcc----cceeEEEEeecCCCceEEEEec--------------cchhheecccCCcEEEEEeccCCCccc-
Confidence 458889988862 2222233333 378888885 899999999999999999886543210
Q ss_pred CCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEec
Q 040693 82 NEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQL 161 (382)
Q Consensus 82 ~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~ 161 (382)
| .+.. -++..+ ...+++.|..+|-..|+...
T Consensus 83 ----~------~~~~------~~iS~d---------------------------------g~~lr~wn~~~g~l~~~i~l 113 (910)
T KOG2103|consen 83 ----G------VPLT------NTISVD---------------------------------GRYLRSWNTNNGILDWEIEL 113 (910)
T ss_pred ----C------ccee------EEEccC---------------------------------CcEEEeecCCCceeeeeccc
Confidence 0 0111 123222 26789999999999999987
Q ss_pred CCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCC
Q 040693 162 GGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGG 241 (382)
Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~ 241 (382)
... .+..+.. +..++.+... .....|++.|........ -
T Consensus 114 ~~g--------------------------~~~~~~~-----v~~~i~v~~g-------~~~~~g~l~w~~~~~~~~---~ 152 (910)
T KOG2103|consen 114 ADG--------------------------FKGLLLE-----VNKGIAVLNG-------HTRKFGELKWVESFSISI---E 152 (910)
T ss_pred ccc--------------------------cceeEEE-----EccceEEEcc-------eeccccceeehhhccccc---h
Confidence 643 0111111 1112111111 556679999998875421 1
Q ss_pred cccce--eeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEE-eeecCCCCCCCCcceEEeCCEEEEeeec
Q 040693 242 AMWGA--ATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVL-WSTADPSNGTAPGPVTVANGVLFGGSTY 318 (382)
Q Consensus 242 ~~~~~--~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~-W~~~~~~~~~~~~~~~~~~~~v~~~~~~ 318 (382)
..... ....+.+|+-..- . .....|.+++..+|+.. |+...-.+....-+...+.+.++.++
T Consensus 153 ~~~q~~~~~~t~vvy~~~~l--------~-----~s~~~V~~~~~~~g~v~~~~~~v~~pw~~~~~c~~~k~~vl~~s-- 217 (910)
T KOG2103|consen 153 EDLQDAKIYGTDVVYVLGLL--------K-----RSGSCVQQVFSDDGEVTGPQSTVLGPWFKVLSCSTDKEVVLVCS-- 217 (910)
T ss_pred hHHHHhhhccCcEEEEEEEE--------e-----cCCceEEEEEccCCcEecceeeeecCcccccccccccceEEEcC--
Confidence 11221 1134455543211 0 12355889999999887 77765543322222234567777777
Q ss_pred CCCcEEEEeCC
Q 040693 319 RQGPIYAMDVK 329 (382)
Q Consensus 319 ~~g~l~~ld~~ 329 (382)
+|.+.-+|..
T Consensus 218 -~g~l~s~di~ 227 (910)
T KOG2103|consen 218 -NGTLISLDIS 227 (910)
T ss_pred -CCCeEEEEEE
Confidence 6766666654
No 55
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE.; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=97.51 E-value=0.0066 Score=55.37 Aligned_cols=159 Identities=16% Similarity=0.158 Sum_probs=80.9
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCC-CCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGG-LGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~-~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
++.++++...+ +..+|.++|+..--.+..... .....+-.....++.+|++........ ....+.|+.++
T Consensus 51 ~g~l~v~~~~~-~~~~d~~~g~~~~~~~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~--------~~~~g~v~~~~ 121 (246)
T PF08450_consen 51 DGRLYVADSGG-IAVVDPDTGKVTVLADLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGAS--------GIDPGSVYRID 121 (246)
T ss_dssp TSEEEEEETTC-EEEEETTTTEEEEEEEEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTT--------CGGSEEEEEEE
T ss_pred CCEEEEEEcCc-eEEEecCCCcEEEEeeccCCCcccCCCceEEEcCCCCEEEEecCCCccc--------cccccceEEEC
Confidence 46666776655 455599999766555542110 000111111224677998855422100 00117899999
Q ss_pred CCCCcEEeeec-CCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCC--CCcEeEEE---ecCCc--eecceEE-eCC
Q 040693 284 ASNGNVLWSTA-DPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVK--TGKILWSY---DTGAT--IYGGASV-SNG 352 (382)
Q Consensus 284 ~~tG~~~W~~~-~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~--tG~ilw~~---~~~~~--~~~~p~~-~~g 352 (382)
+. |+..-... +.. ..-+.+ ++..+|+... ..++|+.++.+ ++++.-+. +.+.. ..-..++ .+|
T Consensus 122 ~~-~~~~~~~~~~~~----pNGi~~s~dg~~lyv~ds-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G 195 (246)
T PF08450_consen 122 PD-GKVTVVADGLGF----PNGIAFSPDGKTLYVADS-FNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDG 195 (246)
T ss_dssp TT-SEEEEEEEEESS----EEEEEEETTSSEEEEEET-TTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS
T ss_pred CC-CeEEEEecCccc----ccceEECCcchheeeccc-ccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCCC
Confidence 98 76443332 221 112223 3457777654 57889999885 33222211 22221 2333444 579
Q ss_pred EEEEEeCceeEeecCCccCCCCCeEEEEE
Q 040693 353 CIYMGNGYKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 353 ~lyv~~~~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
+||++.-.+. .++.+++. |+.+-.+.
T Consensus 196 ~l~va~~~~~--~I~~~~p~-G~~~~~i~ 221 (246)
T PF08450_consen 196 NLWVADWGGG--RIVVFDPD-GKLLREIE 221 (246)
T ss_dssp -EEEEEETTT--EEEEEETT-SCEEEEEE
T ss_pred CEEEEEcCCC--EEEEECCC-ccEEEEEc
Confidence 9999853222 23556666 88876654
No 56
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=97.48 E-value=0.063 Score=50.76 Aligned_cols=53 Identities=15% Similarity=0.280 Sum_probs=42.2
Q ss_pred eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeC---CEEEEEeCce
Q 040693 308 ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSN---GCIYMGNGYK 361 (382)
Q Consensus 308 ~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~---g~lyv~~~~g 361 (382)
.++.+|++.....+.|.+++++ |+.+-++.++....++++..+ +.|||++...
T Consensus 222 adG~lw~~a~~~g~~v~~~~pd-G~l~~~i~lP~~~~t~~~FgG~~~~~L~iTs~~~ 277 (307)
T COG3386 222 ADGNLWVAAVWGGGRVVRFNPD-GKLLGEIKLPVKRPTNPAFGGPDLNTLYITSARS 277 (307)
T ss_pred CCCCEEEecccCCceEEEECCC-CcEEEEEECCCCCCccceEeCCCcCEEEEEecCC
Confidence 4678886543122399999999 999999999987788888877 9999998754
No 57
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.48 E-value=0.081 Score=52.49 Aligned_cols=107 Identities=17% Similarity=0.249 Sum_probs=54.7
Q ss_pred EEEEEccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 207 IVVAVQKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 207 ~v~~~~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+++....+ ..++.+|.++|+..- ..... .......|.+ ++..+++.... .....|+.+|+
T Consensus 257 la~~~~~~g~~~Iy~~d~~~~~~~~-lt~~~-~~~~~~~~sp--Dg~~i~f~s~~--------------~g~~~iy~~d~ 318 (430)
T PRK00178 257 LAFVLSKDGNPEIYVMDLASRQLSR-VTNHP-AIDTEPFWGK--DGRTLYFTSDR--------------GGKPQIYKVNV 318 (430)
T ss_pred EEEEEccCCCceEEEEECCCCCeEE-cccCC-CCcCCeEECC--CCCEEEEEECC--------------CCCceEEEEEC
Confidence 34444433 379999999887532 21111 0111122222 56666665322 12346899999
Q ss_pred CCCcEEeeecCCCCCCCCcce-EEeCCEEEEeeecCCC--cEEEEeCCCCcEe
Q 040693 285 SNGNVLWSTADPSNGTAPGPV-TVANGVLFGGSTYRQG--PIYAMDVKTGKIL 334 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~-~~~~~~v~~~~~~~~g--~l~~ld~~tG~il 334 (382)
.+|+..--... . .....+. .-+++.+++... .++ .|+.+|+++|++.
T Consensus 319 ~~g~~~~lt~~-~-~~~~~~~~Spdg~~i~~~~~-~~~~~~l~~~dl~tg~~~ 368 (430)
T PRK00178 319 NGGRAERVTFV-G-NYNARPRLSADGKTLVMVHR-QDGNFHVAAQDLQRGSVR 368 (430)
T ss_pred CCCCEEEeecC-C-CCccceEECCCCCEEEEEEc-cCCceEEEEEECCCCCEE
Confidence 99875432211 1 1112222 223455555542 122 5899999988753
No 58
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.45 E-value=0.17 Score=56.06 Aligned_cols=137 Identities=15% Similarity=0.097 Sum_probs=75.1
Q ss_pred EEEEEccCcEEEEEeCCCCCeeeeeccC-----------CCCCC-----CCcccceee-eCCeEEEEecCccccccccCC
Q 040693 207 IVVAVQKSGFAWALDRDSGSLIWSMEAG-----------PGGLG-----GGAMWGAAT-DERRIYTNIANSQHKNFNLKP 269 (382)
Q Consensus 207 ~v~~~~~~g~l~ald~~tG~~~W~~~~~-----------~~~~~-----g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~ 269 (382)
+.++...++.+..+|.++|+..+..-.. ..... .....+.++ .++.+|++.
T Consensus 754 LYVADs~n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVAD------------ 821 (1057)
T PLN02919 754 LYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVAD------------ 821 (1057)
T ss_pred EEEEECCCCeEEEEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEE------------
Confidence 4445566789999999988765432100 00000 001122233 345678763
Q ss_pred CCCCCCCceEEEEECCCCcEEeeecCCCC----------CC-CCcceEEe-CCEEEEeeecCCCcEEEEeCCCCcEe--E
Q 040693 270 SKNSTIAGGWVAMDASNGNVLWSTADPSN----------GT-APGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKIL--W 335 (382)
Q Consensus 270 ~~~~~~~g~v~a~d~~tG~~~W~~~~~~~----------~~-~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~il--w 335 (382)
...++|..||++++++......... .. ....+.++ ++.+|++.. .++.|..+|++++++. -
T Consensus 822 ----s~N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt-~Nn~Irvid~~~~~~~~~~ 896 (1057)
T PLN02919 822 ----SYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADT-NNSLIRYLDLNKGEAAEIL 896 (1057)
T ss_pred ----CCCCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEEC-CCCEEEEEECCCCccceeE
Confidence 2357899999999987755432210 00 11122333 567888764 6789999999998763 3
Q ss_pred EEecCCceecceEE-eCCEEEEEeCc
Q 040693 336 SYDTGATIYGGASV-SNGCIYMGNGY 360 (382)
Q Consensus 336 ~~~~~~~~~~~p~~-~~g~lyv~~~~ 360 (382)
..++.+.....+.. .+.+||..++.
T Consensus 897 ~l~~~~~~~~~~~~~~~~~~~~~~~~ 922 (1057)
T PLN02919 897 TLELKGVQPPRPKSKSLKRLRRRSSA 922 (1057)
T ss_pred eeccccccCCCCcccchhhhhhcccc
Confidence 33433322222333 45677776443
No 59
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=97.45 E-value=0.014 Score=56.66 Aligned_cols=133 Identities=16% Similarity=0.113 Sum_probs=85.1
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCc-ccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGA-MWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+.+++.+++||.+..+|..+-. -|..+.+- |.+ ......-.+.++++. ..+.|..+|
T Consensus 166 ~hivvtGsYDg~vrl~DtR~~~-~~v~elnh----g~pVe~vl~lpsgs~iasA-----------------gGn~vkVWD 223 (487)
T KOG0310|consen 166 DHIVVTGSYDGKVRLWDTRSLT-SRVVELNH----GCPVESVLALPSGSLIASA-----------------GGNSVKVWD 223 (487)
T ss_pred CeEEEecCCCceEEEEEeccCC-ceeEEecC----CCceeeEEEcCCCCEEEEc-----------------CCCeEEEEE
Confidence 6788999999999999988653 55555432 111 111222334455542 236688999
Q ss_pred CCCCcE-EeeecCCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEEeC
Q 040693 284 ASNGNV-LWSTADPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMGNG 359 (382)
Q Consensus 284 ~~tG~~-~W~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~~~ 359 (382)
..+|.. +-+..... ...+.+.+ ++.+++.++- ++.+-+||..+-|++..+..++++.+-.+. .+..++++.+
T Consensus 224 l~~G~qll~~~~~H~--KtVTcL~l~s~~~rLlS~sL--D~~VKVfd~t~~Kvv~s~~~~~pvLsiavs~dd~t~viGms 299 (487)
T KOG0310|consen 224 LTTGGQLLTSMFNHN--KTVTCLRLASDSTRLLSGSL--DRHVKVFDTTNYKVVHSWKYPGPVLSIAVSPDDQTVVIGMS 299 (487)
T ss_pred ecCCceehhhhhccc--ceEEEEEeecCCceEeeccc--ccceEEEEccceEEEEeeecccceeeEEecCCCceEEEecc
Confidence 996644 33333111 11223323 3468888774 999999999999999888888877654444 5577888888
Q ss_pred ceeE
Q 040693 360 YKVT 363 (382)
Q Consensus 360 ~g~~ 363 (382)
+|-.
T Consensus 300 nGlv 303 (487)
T KOG0310|consen 300 NGLV 303 (487)
T ss_pred ccee
Confidence 8764
No 60
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=97.45 E-value=0.0027 Score=56.69 Aligned_cols=149 Identities=17% Similarity=0.164 Sum_probs=101.2
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+..|+..+.++.+...|..||+..-+...+.+ ....-...++.++.. ...+.|.-+|+
T Consensus 155 D~~iLSSadd~tVRLWD~rTgt~v~sL~~~s~-----VtSlEvs~dG~ilTi-----------------a~gssV~Fwda 212 (334)
T KOG0278|consen 155 DKCILSSADDKTVRLWDHRTGTEVQSLEFNSP-----VTSLEVSQDGRILTI-----------------AYGSSVKFWDA 212 (334)
T ss_pred CceEEeeccCCceEEEEeccCcEEEEEecCCC-----CcceeeccCCCEEEE-----------------ecCceeEEecc
Confidence 56677778899999999999999988886542 333333344444444 35578999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC--ceecceEEeCCEEEEEeCcee
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA--TIYGGASVSNGCIYMGNGYKV 362 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~--~~~~~p~~~~g~lyv~~~~g~ 362 (382)
++-+++=.+++|.. ..+..+..+.-+|++.. .+..+|.+|..||+.+-.++-+. ++..---.-+|.+|.+.+...
T Consensus 213 ksf~~lKs~k~P~n--V~SASL~P~k~~fVaGg-ed~~~~kfDy~TgeEi~~~nkgh~gpVhcVrFSPdGE~yAsGSEDG 289 (334)
T KOG0278|consen 213 KSFGLLKSYKMPCN--VESASLHPKKEFFVAGG-EDFKVYKFDYNTGEEIGSYNKGHFGPVHCVRFSPDGELYASGSEDG 289 (334)
T ss_pred ccccceeeccCccc--cccccccCCCceEEecC-cceEEEEEeccCCceeeecccCCCCceEEEEECCCCceeeccCCCc
Confidence 99999999999874 23333344555666553 58999999999999998863322 222212226899999776555
Q ss_pred EeecCCccCCCCCeEE
Q 040693 363 TVGFGNKNFTSGTSLY 378 (382)
Q Consensus 363 ~~~~~~~~~~~g~~l~ 378 (382)
..-++-..+.+-..+|
T Consensus 290 TirlWQt~~~~~~~~~ 305 (334)
T KOG0278|consen 290 TIRLWQTTPGKTYGLW 305 (334)
T ss_pred eEEEEEecCCCchhhc
Confidence 5566666665544344
No 61
>PTZ00421 coronin; Provisional
Probab=97.41 E-value=0.12 Score=52.19 Aligned_cols=150 Identities=14% Similarity=0.149 Sum_probs=84.7
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
++.|..+|..+|+.+-.+......+ .-.....+ +..++++..|+.+..+|
T Consensus 147 DgtVrIWDl~tg~~~~~l~~h~~~V---------------------------~sla~spd---G~lLatgs~Dg~IrIwD 196 (493)
T PTZ00421 147 DMVVNVWDVERGKAVEVIKCHSDQI---------------------------TSLEWNLD---GSLLCTTSKDKKLNIID 196 (493)
T ss_pred CCEEEEEECCCCeEEEEEcCCCCce---------------------------EEEEEECC---CCEEEEecCCCEEEEEE
Confidence 4889999999998765543221110 00112222 46788889999999999
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEee-ecCCCCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWS-TADPSNGT 300 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~-~~~~~~~~ 300 (382)
..+|+.+.+.............|.+ +++.++..... ...++.|..+|+++.+.... ........
T Consensus 197 ~rsg~~v~tl~~H~~~~~~~~~w~~--~~~~ivt~G~s-------------~s~Dr~VklWDlr~~~~p~~~~~~d~~~~ 261 (493)
T PTZ00421 197 PRDGTIVSSVEAHASAKSQRCLWAK--RKDLIITLGCS-------------KSQQRQIMLWDTRKMASPYSTVDLDQSSA 261 (493)
T ss_pred CCCCcEEEEEecCCCCcceEEEEcC--CCCeEEEEecC-------------CCCCCeEEEEeCCCCCCceeEeccCCCCc
Confidence 9999988776532210001111221 34444443211 02357899999987654333 22222111
Q ss_pred CCcceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEe
Q 040693 301 APGPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYD 338 (382)
Q Consensus 301 ~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~ 338 (382)
...+.+ +++++|++.. .++.|..+|..+++++...+
T Consensus 262 -~~~~~~d~d~~~L~lggk-gDg~Iriwdl~~~~~~~~~~ 299 (493)
T PTZ00421 262 -LFIPFFDEDTNLLYIGSK-GEGNIRCFELMNERLTFCSS 299 (493)
T ss_pred -eEEEEEcCCCCEEEEEEe-CCCeEEEEEeeCCceEEEee
Confidence 111122 2456776652 37899999999999876543
No 62
>PRK04922 tolB translocation protein TolB; Provisional
Probab=97.40 E-value=0.094 Score=52.17 Aligned_cols=143 Identities=15% Similarity=0.139 Sum_probs=70.0
Q ss_pred EEEEEccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 207 IVVAVQKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 207 ~v~~~~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+++....+ ..|+.+|.++|+..- ...... ......|.+ ++..+++.... .....|+.+|.
T Consensus 262 l~~~~s~~g~~~Iy~~d~~~g~~~~-lt~~~~-~~~~~~~sp--DG~~l~f~sd~--------------~g~~~iy~~dl 323 (433)
T PRK04922 262 LALTLSRDGNPEIYVMDLGSRQLTR-LTNHFG-IDTEPTWAP--DGKSIYFTSDR--------------GGRPQIYRVAA 323 (433)
T ss_pred EEEEEeCCCCceEEEEECCCCCeEE-CccCCC-CccceEECC--CCCEEEEEECC--------------CCCceEEEEEC
Confidence 34444433 469999999887532 221110 011122221 55666655322 11235899999
Q ss_pred CCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCC--CcEEEEeCCCCcEeEEEecCCceecceEE-eCC-EEEEEeC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQ--GPIYAMDVKTGKILWSYDTGATIYGGASV-SNG-CIYMGNG 359 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~--g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g-~lyv~~~ 359 (382)
.+|+..-...... ....+... ++..+++.+. .+ ..|+.+|+++|++. ....+ ....+|.. .+| .|+..+.
T Consensus 324 ~~g~~~~lt~~g~--~~~~~~~SpDG~~Ia~~~~-~~~~~~I~v~d~~~g~~~-~Lt~~-~~~~~p~~spdG~~i~~~s~ 398 (433)
T PRK04922 324 SGGSAERLTFQGN--YNARASVSPDGKKIAMVHG-SGGQYRIAVMDLSTGSVR-TLTPG-SLDESPSFAPNGSMVLYATR 398 (433)
T ss_pred CCCCeEEeecCCC--CccCEEECCCCCEEEEEEC-CCCceeEEEEECCCCCeE-ECCCC-CCCCCceECCCCCEEEEEEe
Confidence 8887653332111 11222222 3556655542 12 36999999988865 33222 12234444 344 4455544
Q ss_pred ceeEeecCCccCC
Q 040693 360 YKVTVGFGNKNFT 372 (382)
Q Consensus 360 ~g~~~~~~~~~~~ 372 (382)
.+....+|.++..
T Consensus 399 ~~g~~~L~~~~~~ 411 (433)
T PRK04922 399 EGGRGVLAAVSTD 411 (433)
T ss_pred cCCceEEEEEECC
Confidence 4333455656553
No 63
>PLN00181 protein SPA1-RELATED; Provisional
Probab=97.40 E-value=0.093 Score=56.47 Aligned_cols=186 Identities=12% Similarity=0.093 Sum_probs=102.7
Q ss_pred ccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeC-CCCeEEEEcCCCCCCCcchhhcccccCCCCCCC
Q 040693 53 FQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDP-IRNHVYIATGNLYSVPLHIRQCQEENNQTTPTS 131 (382)
Q Consensus 53 ~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~-~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (382)
+++.|..+|..+++.+..+... ...++ +..+.+ +...++.+..
T Consensus 553 ~Dg~v~lWd~~~~~~~~~~~~H-----------~~~V~--~l~~~p~~~~~L~Sgs~----------------------- 596 (793)
T PLN00181 553 FEGVVQVWDVARSQLVTEMKEH-----------EKRVW--SIDYSSADPTLLASGSD----------------------- 596 (793)
T ss_pred CCCeEEEEECCCCeEEEEecCC-----------CCCEE--EEEEcCCCCCEEEEEcC-----------------------
Confidence 4899999999999888776431 11111 233443 2234444433
Q ss_pred CCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEE
Q 040693 132 PDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAV 211 (382)
Q Consensus 132 ~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~ 211 (382)
++.|..+|..+++.+-.+..... ..-.....+ .+..+.++
T Consensus 597 ----------Dg~v~iWd~~~~~~~~~~~~~~~----------------------------v~~v~~~~~--~g~~latg 636 (793)
T PLN00181 597 ----------DGSVKLWSINQGVSIGTIKTKAN----------------------------ICCVQFPSE--SGRSLAFG 636 (793)
T ss_pred ----------CCEEEEEECCCCcEEEEEecCCC----------------------------eEEEEEeCC--CCCEEEEE
Confidence 38899999998887655543211 001111111 14678888
Q ss_pred ccCcEEEEEeCCCCCe-eeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCC--
Q 040693 212 QKSGFAWALDRDSGSL-IWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNG-- 287 (382)
Q Consensus 212 ~~~g~l~ald~~tG~~-~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG-- 287 (382)
..++.++.+|..+++. +-....- ......... ++..++.+ ..++.|..+|+.++
T Consensus 637 s~dg~I~iwD~~~~~~~~~~~~~h-----~~~V~~v~f~~~~~lvs~-----------------s~D~~ikiWd~~~~~~ 694 (793)
T PLN00181 637 SADHKVYYYDLRNPKLPLCTMIGH-----SKTVSYVRFVDSSTLVSS-----------------STDNTLKLWDLSMSIS 694 (793)
T ss_pred eCCCeEEEEECCCCCccceEecCC-----CCCEEEEEEeCCCEEEEE-----------------ECCCEEEEEeCCCCcc
Confidence 8999999999987763 2222210 011111212 44555544 34577888888754
Q ss_pred ----cEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEe
Q 040693 288 ----NVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYD 338 (382)
Q Consensus 288 ----~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~ 338 (382)
+.+.+................++++..++ .++.++.++......+|.+.
T Consensus 695 ~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs--~D~~v~iw~~~~~~~~~s~~ 747 (793)
T PLN00181 695 GINETPLHSFMGHTNVKNFVGLSVSDGYIATGS--ETNEVFVYHKAFPMPVLSYK 747 (793)
T ss_pred ccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEe--CCCEEEEEECCCCCceEEEe
Confidence 33333332221111111112345666666 48999999988887776554
No 64
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=97.40 E-value=0.12 Score=48.61 Aligned_cols=154 Identities=12% Similarity=0.059 Sum_probs=87.3
Q ss_pred EEEE-EccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEE--EE
Q 040693 207 IVVA-VQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA--MD 283 (382)
Q Consensus 207 ~v~~-~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a--~d 283 (382)
.|++ .-+...++.++.+.|++.-..+..-+...|......--.+..+|+.. -.+++|.+ +|
T Consensus 158 ~l~v~DLG~Dri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~----------------EL~stV~v~~y~ 221 (346)
T COG2706 158 YLVVPDLGTDRIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVN----------------ELNSTVDVLEYN 221 (346)
T ss_pred EEEEeecCCceEEEEEcccCccccccccccCCCCCcceEEEcCCCcEEEEEe----------------ccCCEEEEEEEc
Confidence 4443 33456788899988987655443322223333333333677788763 23456554 45
Q ss_pred CCCCcEEeeecC---CC---CCCCCcceEE--eCCEEEEeeecCCC--cEEEEeCCCCcEeEEEecC--Cce--ecceEE
Q 040693 284 ASNGNVLWSTAD---PS---NGTAPGPVTV--ANGVLFGGSTYRQG--PIYAMDVKTGKILWSYDTG--ATI--YGGASV 349 (382)
Q Consensus 284 ~~tG~~~W~~~~---~~---~~~~~~~~~~--~~~~v~~~~~~~~g--~l~~ld~~tG~ilw~~~~~--~~~--~~~p~~ 349 (382)
...|+..=.... |. ...+.+.+.+ ++..+|++.- ... .++.+|..+|++.-....+ +.. ......
T Consensus 222 ~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNR-g~dsI~~f~V~~~~g~L~~~~~~~teg~~PR~F~i~~ 300 (346)
T COG2706 222 PAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNR-GHDSIAVFSVDPDGGKLELVGITPTEGQFPRDFNINP 300 (346)
T ss_pred CCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecC-CCCeEEEEEEcCCCCEEEEEEEeccCCcCCccceeCC
Confidence 444654333222 22 1223444444 3456777652 112 3667788888765443332 221 123334
Q ss_pred eCCEEEEEeCceeEeecCCccCCCCCeE
Q 040693 350 SNGCIYMGNGYKVTVGFGNKNFTSGTSL 377 (382)
Q Consensus 350 ~~g~lyv~~~~g~~~~~~~~~~~~g~~l 377 (382)
.++-|++++.++..+.+|+.|..||++=
T Consensus 301 ~g~~Liaa~q~sd~i~vf~~d~~TG~L~ 328 (346)
T COG2706 301 SGRFLIAANQKSDNITVFERDKETGRLT 328 (346)
T ss_pred CCCEEEEEccCCCcEEEEEEcCCCceEE
Confidence 6788888998888899999999999864
No 65
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=97.34 E-value=0.038 Score=49.36 Aligned_cols=171 Identities=13% Similarity=0.102 Sum_probs=106.2
Q ss_pred cceEEEEeCc------cCceeeeeeccCCCC--CCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccC
Q 040693 54 QGSLAKLDAK------TGRILWQTFMLPDNF--GKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENN 125 (382)
Q Consensus 54 ~g~l~ald~~------tG~~lW~~~~~~~~~--~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~ 125 (382)
+|.|+++.-. --|.+|++....... ..|. . ..+.+++..+.|+++.|+
T Consensus 80 dG~V~gw~W~E~~es~~~K~lwe~~~P~~~~~~evPe-I-------Nam~ldP~enSi~~AgGD---------------- 135 (325)
T KOG0649|consen 80 DGLVYGWEWNEEEESLATKRLWEVKIPMQVDAVEVPE-I-------NAMWLDPSENSILFAGGD---------------- 135 (325)
T ss_pred CceEEEeeehhhhhhccchhhhhhcCccccCcccCCc-c-------ceeEeccCCCcEEEecCC----------------
Confidence 7888887542 346788876532211 1111 1 246788888888888775
Q ss_pred CCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceee
Q 040693 126 QTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKH 205 (382)
Q Consensus 126 ~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~ 205 (382)
+.+|++|.++|++.-.++-....+..... . .+.
T Consensus 136 -----------------~~~y~~dlE~G~i~r~~rGHtDYvH~vv~------------------------R------~~~ 168 (325)
T KOG0649|consen 136 -----------------GVIYQVDLEDGRIQREYRGHTDYVHSVVG------------------------R------NAN 168 (325)
T ss_pred -----------------eEEEEEEecCCEEEEEEcCCcceeeeeee------------------------c------ccC
Confidence 89999999999999888776554321110 0 126
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccc-----eeeeCCeEEEEecCccccccccCCCCCCCCCceEE
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWG-----AATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWV 280 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~-----~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~ 280 (382)
.+|+.+..||.+...|.+|+|..-.++.-.....-++.++ .+.+.+.++.+ ..-.+.
T Consensus 169 ~qilsG~EDGtvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCG------------------gGp~ls 230 (325)
T KOG0649|consen 169 GQILSGAEDGTVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCG------------------GGPKLS 230 (325)
T ss_pred cceeecCCCccEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEec------------------CCCcee
Confidence 7899999999999999999998776665332222222222 23355656654 123355
Q ss_pred EEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEee
Q 040693 281 AMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGS 316 (382)
Q Consensus 281 a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~ 316 (382)
.+.++.-+..-.++++.. .--+.++++.|+++.
T Consensus 231 lwhLrsse~t~vfpipa~---v~~v~F~~d~vl~~G 263 (325)
T KOG0649|consen 231 LWHLRSSESTCVFPIPAR---VHLVDFVDDCVLIGG 263 (325)
T ss_pred EEeccCCCceEEEecccc---eeEeeeecceEEEec
Confidence 666666666666666552 223445667666655
No 66
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=97.30 E-value=0.009 Score=52.17 Aligned_cols=166 Identities=16% Similarity=0.140 Sum_probs=108.6
Q ss_pred eeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeE
Q 040693 24 TMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHV 103 (382)
Q Consensus 24 ~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v 103 (382)
..-....++.++.++. .++ ...|+..|+.+|+++|+..+.++. + |+ -.++.-++.+
T Consensus 48 TQGL~~~~g~i~esTG--~yg---------~S~ir~~~L~~gq~~~s~~l~~~~------~-----Fg--EGit~~gd~~ 103 (262)
T COG3823 48 TQGLEYLDGHILESTG--LYG---------FSKIRVSDLTTGQEIFSEKLAPDT------V-----FG--EGITKLGDYF 103 (262)
T ss_pred hcceeeeCCEEEEecc--ccc---------cceeEEEeccCceEEEEeecCCcc------c-----cc--cceeeccceE
Confidence 3345566777777664 232 458999999999999999985321 1 11 1122234677
Q ss_pred EEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCC
Q 040693 104 YIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGP 183 (382)
Q Consensus 104 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (382)
|.-+-. .+--+.+|++|=+.+=++++.+. .|...
T Consensus 104 y~LTw~--------------------------------egvaf~~d~~t~~~lg~~~y~Ge-GWgLt------------- 137 (262)
T COG3823 104 YQLTWK--------------------------------EGVAFKYDADTLEELGRFSYEGE-GWGLT------------- 137 (262)
T ss_pred EEEEec--------------------------------cceeEEEChHHhhhhcccccCCc-ceeee-------------
Confidence 777654 25668889999898888888765 34211
Q ss_pred CCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccc---eeeeCCeEEEEecCc
Q 040693 184 SPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWG---AATDERRIYTNIANS 260 (382)
Q Consensus 184 ~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~---~~~~~~~v~~~~~~~ 260 (382)
.+ +..++++++..+++-.|++|=+.+-+...... |.+... ....++.+|....
T Consensus 138 ----------------~d---~~~LimsdGsatL~frdP~tfa~~~~v~VT~~---g~pv~~LNELE~VdG~lyANVw-- 193 (262)
T COG3823 138 ----------------SD---DKNLIMSDGSATLQFRDPKTFAELDTVQVTDD---GVPVSKLNELEWVDGELYANVW-- 193 (262)
T ss_pred ----------------cC---CcceEeeCCceEEEecCHHHhhhcceEEEEEC---CeecccccceeeeccEEEEeee--
Confidence 11 34477788888999999998777766664321 111111 2236889998764
Q ss_pred cccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 261 QHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 261 ~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
+..+|..||+.+|+++=-++...
T Consensus 194 --------------~t~~I~rI~p~sGrV~~widlS~ 216 (262)
T COG3823 194 --------------QTTRIARIDPDSGRVVAWIDLSG 216 (262)
T ss_pred --------------eecceEEEcCCCCcEEEEEEccC
Confidence 45679999999999876666543
No 67
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=97.28 E-value=0.057 Score=55.83 Aligned_cols=266 Identities=12% Similarity=0.108 Sum_probs=146.8
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
....+.+|.++-.++-.-+..++. -.++..+++.+|++.++....-...+.... . ..+.. .
T Consensus 55 gksfqvYd~~kl~ll~vs~~lp~~---------------I~alas~~~~vy~A~g~~i~~~~rgk~i~~--~-~~~~~-a 115 (910)
T KOG1539|consen 55 GKSFQVYDVNKLNLLFVSKPLPDK---------------ITALASDKDYVYVASGNKIYAYARGKHIRH--T-TLLHG-A 115 (910)
T ss_pred CceEEEEeccceEEEEecCCCCCc---------------eEEEEecCceEEEecCcEEEEEEccceEEE--E-ecccc-c
Confidence 556788888877777665332221 134556677888888762211111100000 0 00000 0
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
..-.....+..+.|+|.+.+=-+|.........+ . .. +.+.....- +..+.-+..-=+.|+.+..
T Consensus 116 ~v~~l~~fGe~lia~d~~~~l~vw~~s~~~~e~~--l----~~--------~~~~~~~~~-Ital~HP~TYLNKIvvGs~ 180 (910)
T KOG1539|consen 116 KVHLLLPFGEHLIAVDISNILFVWKTSSIQEELY--L----QS--------TFLKVEGDF-ITALLHPSTYLNKIVVGSS 180 (910)
T ss_pred eEEEEeeecceEEEEEccCcEEEEEecccccccc--c----cc--------eeeeccCCc-eeeEecchhheeeEEEeec
Confidence 0011122357889999988888998876422110 0 00 000000000 0000000111367888999
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
.|.+..++..|||++.+++.-... .-.....|+ -+.|-++ ...|+|+.++++.+|++-++
T Consensus 181 ~G~lql~Nvrt~K~v~~f~~~~s~-IT~ieqsPa--LDVVaiG-----------------~~~G~ViifNlK~dkil~sF 240 (910)
T KOG1539|consen 181 QGRLQLWNVRTGKVVYTFQEFFSR-ITAIEQSPA--LDVVAIG-----------------LENGTVIIFNLKFDKILMSF 240 (910)
T ss_pred CCcEEEEEeccCcEEEEecccccc-eeEeccCCc--ceEEEEe-----------------ccCceEEEEEcccCcEEEEE
Confidence 999999999999999998853210 000112222 2455555 55699999999999999999
Q ss_pred cCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC--ceecceEEeCCEEEEEeCceeEeecCCccC
Q 040693 294 ADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA--TIYGGASVSNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 294 ~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~--~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
..+-+....-..-.+|..+.+... ..|.+...|++..+..|.....+ +........+..+.++.+..+..+.|-||+
T Consensus 241 k~d~g~VtslSFrtDG~p~las~~-~~G~m~~wDLe~kkl~~v~~nah~~sv~~~~fl~~epVl~ta~~DnSlk~~vfD~ 319 (910)
T KOG1539|consen 241 KQDWGRVTSLSFRTDGNPLLASGR-SNGDMAFWDLEKKKLINVTRNAHYGSVTGATFLPGEPVLVTAGADNSLKVWVFDS 319 (910)
T ss_pred EccccceeEEEeccCCCeeEEecc-CCceEEEEEcCCCeeeeeeeccccCCcccceecCCCceEeeccCCCceeEEEeeC
Confidence 886333333343345555555442 46899999999999999876332 222222335556666666556566666664
Q ss_pred CCC
Q 040693 372 TSG 374 (382)
Q Consensus 372 ~~g 374 (382)
.-|
T Consensus 320 ~dg 322 (910)
T KOG1539|consen 320 GDG 322 (910)
T ss_pred CCC
Confidence 444
No 68
>PHA02713 hypothetical protein; Provisional
Probab=97.27 E-value=0.23 Score=51.18 Aligned_cols=152 Identities=13% Similarity=0.119 Sum_probs=83.0
Q ss_pred EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccc-cC------CCCCCCCCceEEEEECCCCc
Q 040693 216 FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFN-LK------PSKNSTIAGGWVAMDASNGN 288 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~-~~------~~~~~~~~g~v~a~d~~tG~ 288 (382)
.+.++|+.+. .|+.-.+-+ ......+.++-++.||+..+......+. .. ..........+.++|+++.
T Consensus 368 sve~Ydp~~~--~W~~~~~mp--~~r~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td- 442 (557)
T PHA02713 368 TIECYTMGDD--KWKMLPDMP--IALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNN- 442 (557)
T ss_pred eEEEEECCCC--eEEECCCCC--cccccccEEEECCEEEEEeCCCcccccccccccccccccccccccceEEEECCCCC-
Confidence 4888999864 688744321 1122233445688999875432111000 00 0000011356999999886
Q ss_pred EEeeecC--CCCCCCCcceEEeCCEEEEeeecC-----CCcEEEEeCCCCcEeEEEe--cCCc-eecceEEeCCEEEEEe
Q 040693 289 VLWSTAD--PSNGTAPGPVTVANGVLFGGSTYR-----QGPIYAMDVKTGKILWSYD--TGAT-IYGGASVSNGCIYMGN 358 (382)
Q Consensus 289 ~~W~~~~--~~~~~~~~~~~~~~~~v~~~~~~~-----~g~l~~ld~~tG~ilw~~~--~~~~-~~~~p~~~~g~lyv~~ 358 (382)
.|+.-. +... ....+++-++.+|+..... ...+.++|+.+ .-.|+.- ++.. ...+.++.+|+||+..
T Consensus 443 -~W~~v~~m~~~r-~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~-~~~W~~~~~m~~~r~~~~~~~~~~~iyv~G 519 (557)
T PHA02713 443 -IWETLPNFWTGT-IRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNT-YNGWELITTTESRLSALHTILHDNTIMMLH 519 (557)
T ss_pred -eEeecCCCCccc-ccCcEEEECCEEEEEeCCCCCCccceeEEEecCCC-CCCeeEccccCcccccceeEEECCEEEEEe
Confidence 587533 2222 2334455678888865311 12367899986 1248752 2222 3456777899999987
Q ss_pred CceeEeecCCccCCCCC
Q 040693 359 GYKVTVGFGNKNFTSGT 375 (382)
Q Consensus 359 ~~g~~~~~~~~~~~~g~ 375 (382)
++.....+-.+|+.|.+
T Consensus 520 g~~~~~~~e~yd~~~~~ 536 (557)
T PHA02713 520 CYESYMLQDTFNVYTYE 536 (557)
T ss_pred eecceeehhhcCccccc
Confidence 75443344556665543
No 69
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=97.27 E-value=0.0074 Score=56.94 Aligned_cols=233 Identities=16% Similarity=0.197 Sum_probs=129.4
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
|.++..+|+.|-.++-+.. |+..|..+....+++..|.-+.-
T Consensus 136 D~TvR~WD~~TeTp~~t~K-------------gH~~WVlcvawsPDgk~iASG~~------------------------- 177 (480)
T KOG0271|consen 136 DTTVRLWDLDTETPLFTCK-------------GHKNWVLCVAWSPDGKKIASGSK------------------------- 177 (480)
T ss_pred CceEEeeccCCCCcceeec-------------CCccEEEEEEECCCcchhhcccc-------------------------
Confidence 5567777777766665544 44556556677777666655443
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
+|.|...|+++|+++=+.- .++..|+.+-. =.|.=.. .+ ...+..+++
T Consensus 178 --------dg~I~lwdpktg~~~g~~l-~gH~K~It~La------------------wep~hl~---p~--~r~las~sk 225 (480)
T KOG0271|consen 178 --------DGSIRLWDPKTGQQIGRAL-RGHKKWITALA------------------WEPLHLV---PP--CRRLASSSK 225 (480)
T ss_pred --------CCeEEEecCCCCCcccccc-cCcccceeEEe------------------ecccccC---CC--ccceecccC
Confidence 4899999999998875432 22223322111 1111110 00 225556778
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
||.+...|..-|+.+-...--.. ......|+ .++++|.+ .++.+|..++...|+..-+.
T Consensus 226 Dg~vrIWd~~~~~~~~~lsgHT~-~VTCvrwG---G~gliySg-----------------S~DrtIkvw~a~dG~~~r~l 284 (480)
T KOG0271|consen 226 DGSVRIWDTKLGTCVRTLSGHTA-SVTCVRWG---GEGLIYSG-----------------SQDRTIKVWRALDGKLCREL 284 (480)
T ss_pred CCCEEEEEccCceEEEEeccCcc-ceEEEEEc---CCceEEec-----------------CCCceEEEEEccchhHHHhh
Confidence 88899899887777666542110 01111222 46888887 45577777777776543222
Q ss_pred cCCCCCCC-----------Cc----------c---------------eEEeCCEEEEeeecCCCcEEEEeCC-CCcEeEE
Q 040693 294 ADPSNGTA-----------PG----------P---------------VTVANGVLFGGSTYRQGPIYAMDVK-TGKILWS 336 (382)
Q Consensus 294 ~~~~~~~~-----------~~----------~---------------~~~~~~~v~~~~~~~~g~l~~ld~~-tG~ilw~ 336 (382)
....-... .+ + .-..+.+++-++ ++..++..++. +-+++-|
T Consensus 285 kGHahwvN~lalsTdy~LRtgaf~~t~~~~~~~se~~~~Al~rY~~~~~~~~erlVSgs--Dd~tlflW~p~~~kkpi~r 362 (480)
T KOG0271|consen 285 KGHAHWVNHLALSTDYVLRTGAFDHTGRKPKSFSEEQKKALERYEAVLKDSGERLVSGS--DDFTLFLWNPFKSKKPITR 362 (480)
T ss_pred cccchheeeeeccchhhhhccccccccccCCChHHHHHHHHHHHHHhhccCcceeEEec--CCceEEEecccccccchhh
Confidence 21110000 00 0 001123566666 47888888875 3445555
Q ss_pred EecCCceecceEEe-CCEEEEEeCceeEeecCCccCCCCCeEEEEE
Q 040693 337 YDTGATIYGGASVS-NGCIYMGNGYKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 337 ~~~~~~~~~~p~~~-~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
+.-......+..+. |++..++.+...-++++... +|+.|-+||
T Consensus 363 mtgHq~lVn~V~fSPd~r~IASaSFDkSVkLW~g~--tGk~lasfR 406 (480)
T KOG0271|consen 363 MTGHQALVNHVSFSPDGRYIASASFDKSVKLWDGR--TGKFLASFR 406 (480)
T ss_pred hhchhhheeeEEECCCccEEEEeecccceeeeeCC--Ccchhhhhh
Confidence 54333444555554 45555555677777887654 488887775
No 70
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=97.26 E-value=0.023 Score=51.62 Aligned_cols=180 Identities=14% Similarity=0.083 Sum_probs=112.2
Q ss_pred CCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEE
Q 040693 140 NHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWA 219 (382)
Q Consensus 140 ~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~a 219 (382)
++++.+...|+.+|+..=++-....+.... .++ ..+-+|+.++.|-.+-.
T Consensus 82 swD~~lrlWDl~~g~~t~~f~GH~~dVlsv---------------------------a~s---~dn~qivSGSrDkTikl 131 (315)
T KOG0279|consen 82 SWDGTLRLWDLATGESTRRFVGHTKDVLSV---------------------------AFS---TDNRQIVSGSRDKTIKL 131 (315)
T ss_pred cccceEEEEEecCCcEEEEEEecCCceEEE---------------------------Eec---CCCceeecCCCcceeee
Confidence 356999999999998877776655532211 111 11467888888888888
Q ss_pred EeCCCCCeeeeeccCC-CCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCC
Q 040693 220 LDRDSGSLIWSMEAGP-GGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSN 298 (382)
Q Consensus 220 ld~~tG~~~W~~~~~~-~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~ 298 (382)
+|.. |....+..... ........+.|--+ +.++++. .-+..|...|+.+-+.+-.+.....
T Consensus 132 wnt~-g~ck~t~~~~~~~~WVscvrfsP~~~-~p~Ivs~----------------s~DktvKvWnl~~~~l~~~~~gh~~ 193 (315)
T KOG0279|consen 132 WNTL-GVCKYTIHEDSHREWVSCVRFSPNES-NPIIVSA----------------SWDKTVKVWNLRNCQLRTTFIGHSG 193 (315)
T ss_pred eeec-ccEEEEEecCCCcCcEEEEEEcCCCC-CcEEEEc----------------cCCceEEEEccCCcchhhccccccc
Confidence 8877 44444443221 22222223333212 3333332 2357899999998888877665543
Q ss_pred CCCCcceEE-eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeCceeEeecCCccC
Q 040693 299 GTAPGPVTV-ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 299 ~~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
. ...+.+ .+|-+.+. .+.+|.++..|++.||-+..++.. ....+.++..++..+....+.-++++.++.
T Consensus 194 ~--v~t~~vSpDGslcas-Ggkdg~~~LwdL~~~k~lysl~a~-~~v~sl~fspnrywL~~at~~sIkIwdl~~ 263 (315)
T KOG0279|consen 194 Y--VNTVTVSPDGSLCAS-GGKDGEAMLWDLNEGKNLYSLEAF-DIVNSLCFSPNRYWLCAATATSIKIWDLES 263 (315)
T ss_pred c--EEEEEECCCCCEEec-CCCCceEEEEEccCCceeEeccCC-CeEeeEEecCCceeEeeccCCceEEEeccc
Confidence 2 222323 23433332 336899999999999988877653 356677888888888877777677777765
No 71
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=97.26 E-value=0.019 Score=54.19 Aligned_cols=68 Identities=16% Similarity=0.293 Sum_probs=42.0
Q ss_pred CcceEEEEECCCCcEEEEEecC-CCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEE
Q 040693 141 HSNSLLALDLDTGKIVWYKQLG-GYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWA 219 (382)
Q Consensus 141 ~~g~v~ald~~tG~~~W~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~a 219 (382)
..+.++.||+++.+..+..+.. ..... + + .-.++...|. +++.++.=+..+.+..
T Consensus 230 s~~~v~~ld~~~~~~~~~~~~~~~~~~~-~------s----------~~~G~~Q~L~-------nGn~li~~g~~g~~~E 285 (299)
T PF14269_consen 230 SRGLVLELDPETMTVTLVREYSDHPDGF-Y------S----------PSQGSAQRLP-------NGNVLIGWGNNGRISE 285 (299)
T ss_pred CCceEEEEECCCCEEEEEEEeecCCCcc-c------c----------cCCCcceECC-------CCCEEEecCCCceEEE
Confidence 3589999999977777666654 22110 0 0 0011222222 2456666667789999
Q ss_pred EeCCCCCeeeeecc
Q 040693 220 LDRDSGSLIWSMEA 233 (382)
Q Consensus 220 ld~~tG~~~W~~~~ 233 (382)
++++ |+++|++..
T Consensus 286 ~~~~-G~vv~~~~f 298 (299)
T PF14269_consen 286 FTPD-GEVVWEAQF 298 (299)
T ss_pred ECCC-CCEEEEEEC
Confidence 9976 999999863
No 72
>PRK05137 tolB translocation protein TolB; Provisional
Probab=97.25 E-value=0.17 Score=50.36 Aligned_cols=147 Identities=13% Similarity=0.091 Sum_probs=68.4
Q ss_pred cEEEEEccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.+++....+ ..++.+|.++|+..- ...... ......|.+ ++..|++.... .....|+.+|
T Consensus 259 ~la~~~~~~g~~~Iy~~d~~~~~~~~-Lt~~~~-~~~~~~~sp--DG~~i~f~s~~--------------~g~~~Iy~~d 320 (435)
T PRK05137 259 KVVMSLSQGGNTDIYTMDLRSGTTTR-LTDSPA-IDTSPSYSP--DGSQIVFESDR--------------SGSPQLYVMN 320 (435)
T ss_pred EEEEEEecCCCceEEEEECCCCceEE-ccCCCC-ccCceeEcC--CCCEEEEEECC--------------CCCCeEEEEE
Confidence 344444433 469999999886532 221110 111122222 56656554322 1124689999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeec-CCCcEEEEeCCCCcEeEEEecCCceecceEEe--CCEEEEEeC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTY-RQGPIYAMDVKTGKILWSYDTGATIYGGASVS--NGCIYMGNG 359 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~-~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~--~g~lyv~~~ 359 (382)
+.+++..--..... . ...+... ++..+++.... ....|+.+|+++++.. ....+ ....+|... +..|+..+.
T Consensus 321 ~~g~~~~~lt~~~~-~-~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~~~~-~lt~~-~~~~~p~~spDG~~i~~~~~ 396 (435)
T PRK05137 321 ADGSNPRRISFGGG-R-YSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGSGER-ILTSG-FLVEGPTWAPNGRVIMFFRQ 396 (435)
T ss_pred CCCCCeEEeecCCC-c-ccCeEECCCCCEEEEEEcCCCceEEEEEECCCCceE-eccCC-CCCCCCeECCCCCEEEEEEc
Confidence 88876654333221 1 1222222 34555554421 1236889998765532 22222 223455543 344544443
Q ss_pred ceeE---eecCCccCCCC
Q 040693 360 YKVT---VGFGNKNFTSG 374 (382)
Q Consensus 360 ~g~~---~~~~~~~~~~g 374 (382)
.+.. ..+|.++...+
T Consensus 397 ~~~~~~~~~L~~~dl~g~ 414 (435)
T PRK05137 397 TPGSGGAPKLYTVDLTGR 414 (435)
T ss_pred cCCCCCcceEEEEECCCC
Confidence 2221 23555555443
No 73
>PHA02713 hypothetical protein; Provisional
Probab=97.24 E-value=0.11 Score=53.37 Aligned_cols=103 Identities=12% Similarity=0.100 Sum_probs=58.2
Q ss_pred EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee--
Q 040693 216 FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST-- 293 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~-- 293 (382)
.+.++|+.+. .|+.-.+-. ......+.++-++.||+..+..+. ......+.++|+.+ ...|+.
T Consensus 433 ~ve~YDP~td--~W~~v~~m~--~~r~~~~~~~~~~~IYv~GG~~~~----------~~~~~~ve~Ydp~~-~~~W~~~~ 497 (557)
T PHA02713 433 KVIRYDTVNN--IWETLPNFW--TGTIRPGVVSHKDDIYVVCDIKDE----------KNVKTCIFRYNTNT-YNGWELIT 497 (557)
T ss_pred eEEEECCCCC--eEeecCCCC--cccccCcEEEECCEEEEEeCCCCC----------CccceeEEEecCCC-CCCeeEcc
Confidence 4788998865 687654321 122333455578899987432110 01123478999987 124884
Q ss_pred cCCCCCCCCcceEEeCCEEEEeeecCCC--cEEEEeCCCCcEeEEE
Q 040693 294 ADPSNGTAPGPVTVANGVLFGGSTYRQG--PIYAMDVKTGKILWSY 337 (382)
Q Consensus 294 ~~~~~~~~~~~~~~~~~~v~~~~~~~~g--~l~~ld~~tG~ilw~~ 337 (382)
+++.... ...+++-++.||+... .++ .+.++|+.|.+ |+.
T Consensus 498 ~m~~~r~-~~~~~~~~~~iyv~Gg-~~~~~~~e~yd~~~~~--W~~ 539 (557)
T PHA02713 498 TTESRLS-ALHTILHDNTIMMLHC-YESYMLQDTFNVYTYE--WNH 539 (557)
T ss_pred ccCcccc-cceeEEECCEEEEEee-ecceeehhhcCccccc--ccc
Confidence 3444333 3344455777777643 234 57778877554 654
No 74
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=97.23 E-value=0.013 Score=51.27 Aligned_cols=145 Identities=17% Similarity=0.225 Sum_probs=101.7
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc-CcEEEEEe
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK-SGFAWALD 221 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~-~g~l~ald 221 (382)
..|...|+.+|+++|+.++.+...+ ....+.. ++.++.-+. ++.-+.+|
T Consensus 68 S~ir~~~L~~gq~~~s~~l~~~~~F----------------------gEGit~~--------gd~~y~LTw~egvaf~~d 117 (262)
T COG3823 68 SKIRVSDLTTGQEIFSEKLAPDTVF----------------------GEGITKL--------GDYFYQLTWKEGVAFKYD 117 (262)
T ss_pred ceeEEEeccCceEEEEeecCCcccc----------------------ccceeec--------cceEEEEEeccceeEEEC
Confidence 7899999999999999998843211 1122222 567777664 57889999
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCC---
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSN--- 298 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~--- 298 (382)
++|-+.+=+++.+. .-|+...|+..+..+ +....++-.|++|-+..=++.+-..
T Consensus 118 ~~t~~~lg~~~y~G------eGWgLt~d~~~Lims-----------------dGsatL~frdP~tfa~~~~v~VT~~g~p 174 (262)
T COG3823 118 ADTLEELGRFSYEG------EGWGLTSDDKNLIMS-----------------DGSATLQFRDPKTFAELDTVQVTDDGVP 174 (262)
T ss_pred hHHhhhhcccccCC------cceeeecCCcceEee-----------------CCceEEEecCHHHhhhcceEEEEECCee
Confidence 99988887777542 558877788777776 4457788889998877766654321
Q ss_pred CCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC
Q 040693 299 GTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 299 ~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
...-.-+-.-+|.+|+.-. +...|..||+++|+++--.++++
T Consensus 175 v~~LNELE~VdG~lyANVw-~t~~I~rI~p~sGrV~~widlS~ 216 (262)
T COG3823 175 VSKLNELEWVDGELYANVW-QTTRIARIDPDSGRVVAWIDLSG 216 (262)
T ss_pred cccccceeeeccEEEEeee-eecceEEEcCCCCcEEEEEEccC
Confidence 1111223334678888764 67899999999999988777764
No 75
>PRK03629 tolB translocation protein TolB; Provisional
Probab=97.21 E-value=0.23 Score=49.40 Aligned_cols=143 Identities=14% Similarity=0.110 Sum_probs=67.0
Q ss_pred cEEEEEccCc--EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKSG--FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~g--~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.+++....++ .|+.+|.++|+..--..... ......|.+ ++..|++.... ...-.|+.+|
T Consensus 256 ~La~~~~~~g~~~I~~~d~~tg~~~~lt~~~~--~~~~~~wSP--DG~~I~f~s~~--------------~g~~~Iy~~d 317 (429)
T PRK03629 256 KLAFALSKTGSLNLYVMDLASGQIRQVTDGRS--NNTEPTWFP--DSQNLAYTSDQ--------------AGRPQVYKVN 317 (429)
T ss_pred EEEEEEcCCCCcEEEEEECCCCCEEEccCCCC--CcCceEECC--CCCEEEEEeCC--------------CCCceEEEEE
Confidence 3444444333 69999999887542222110 111122222 56656554322 1123688999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeee-cCCCcEEEEeCCCCcEeEEEecCCceecceEE-eCC-EEEEEeC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGST-YRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNG-CIYMGNG 359 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~-~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g-~lyv~~~ 359 (382)
+.+|+..--..... . ...+... ++..++.... .....|+.+|+++|++. ..... ....+|.. .|| .|+..+.
T Consensus 318 ~~~g~~~~lt~~~~-~-~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~-~Lt~~-~~~~~p~~SpDG~~i~~~s~ 393 (429)
T PRK03629 318 INGGAPQRITWEGS-Q-NQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQ-VLTDT-FLDETPSIAPNGTMVIYSSS 393 (429)
T ss_pred CCCCCeEEeecCCC-C-ccCEEECCCCCEEEEEEccCCCceEEEEECCCCCeE-EeCCC-CCCCCceECCCCCEEEEEEc
Confidence 99887643322211 1 1222222 3445544432 11246899999988753 22211 11234444 334 4555554
Q ss_pred ceeEeecCCcc
Q 040693 360 YKVTVGFGNKN 370 (382)
Q Consensus 360 ~g~~~~~~~~~ 370 (382)
++....++.++
T Consensus 394 ~~~~~~l~~~~ 404 (429)
T PRK03629 394 QGMGSVLNLVS 404 (429)
T ss_pred CCCceEEEEEE
Confidence 44433334333
No 76
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=97.21 E-value=0.04 Score=56.08 Aligned_cols=175 Identities=14% Similarity=0.205 Sum_probs=111.1
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
+..|.+.|..+|+.+=...... .|+- .+. ..++.+++++.|+.+-.+|
T Consensus 310 D~tVkVW~v~n~~~l~l~~~h~----------------------------~~V~-~v~---~~~~~lvsgs~d~~v~VW~ 357 (537)
T KOG0274|consen 310 DNTVKVWDVTNGACLNLLRGHT----------------------------GPVN-CVQ---LDEPLLVSGSYDGTVKVWD 357 (537)
T ss_pred CceEEEEeccCcceEEEecccc----------------------------ccEE-EEE---ecCCEEEEEecCceEEEEE
Confidence 5789999988888877665421 1111 111 1268899999999999999
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceeeeC-CeEEEEecCccccccccCCCCCCCCCceEEEEECCCC-cEEeeecCCCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAATDE-RRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNG-NVLWSTADPSNG 299 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~~~-~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG-~~~W~~~~~~~~ 299 (382)
+.+++.+-....- ........++. +.+|-+ ..++.|.+.|+.++ +.+-.......
T Consensus 358 ~~~~~cl~sl~gH-----~~~V~sl~~~~~~~~~Sg-----------------s~D~~IkvWdl~~~~~c~~tl~~h~~- 414 (537)
T KOG0274|consen 358 PRTGKCLKSLSGH-----TGRVYSLIVDSENRLLSG-----------------SLDTTIKVWDLRTKRKCIHTLQGHTS- 414 (537)
T ss_pred hhhceeeeeecCC-----cceEEEEEecCcceEEee-----------------eeccceEeecCCchhhhhhhhcCCcc-
Confidence 9999999887732 12333333455 566655 34578999999999 55555544432
Q ss_pred CCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC-ceecceEEeCCEEEEEeCceeEeecCCccCCCCCeE
Q 040693 300 TAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA-TIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSL 377 (382)
Q Consensus 300 ~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~-~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l 377 (382)
....+...+..+...+. ++.|..-|.++++.+-..+.+. .........+..++++..+|. +.+| |.++|+..
T Consensus 415 -~v~~l~~~~~~Lvs~~a--D~~Ik~WD~~~~~~~~~~~~~~~~~v~~l~~~~~~il~s~~~~~-~~l~--dl~~~~~~ 487 (537)
T KOG0274|consen 415 -LVSSLLLRDNFLVSSSA--DGTIKLWDAEEGECLRTLEGRHVGGVSALALGKEEILCSSDDGS-VKLW--DLRSGTLI 487 (537)
T ss_pred -cccccccccceeEeccc--cccEEEeecccCceeeeeccCCcccEEEeecCcceEEEEecCCe-eEEE--ecccCchh
Confidence 12333445566666554 8999999999999999888751 222222222456666655544 6666 44445543
No 77
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid and, through this ability, to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=97.19 E-value=0.22 Score=47.37 Aligned_cols=127 Identities=13% Similarity=0.143 Sum_probs=70.7
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA 294 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~ 294 (382)
..++++|+.+ ..|+.-.+-+. ........++.++.||+...... .....+.++|+++. .|+.-
T Consensus 139 ~~v~~yd~~~--~~W~~~~~~p~-~~r~~~~~~~~~~~iYv~GG~~~------------~~~~~~~~yd~~~~--~W~~~ 201 (323)
T TIGR03548 139 NKSYLFNLET--QEWFELPDFPG-EPRVQPVCVKLQNELYVFGGGSN------------IAYTDGYKYSPKKN--QWQKV 201 (323)
T ss_pred ceEEEEcCCC--CCeeECCCCCC-CCCCcceEEEECCEEEEEcCCCC------------ccccceEEEecCCC--eeEEC
Confidence 4689999885 45886432111 11122233346788888644311 11124689999876 48754
Q ss_pred CCC-----CC--CCCcceEEeCCEEEEeeecC------------------------------------CCcEEEEeCCCC
Q 040693 295 DPS-----NG--TAPGPVTVANGVLFGGSTYR------------------------------------QGPIYAMDVKTG 331 (382)
Q Consensus 295 ~~~-----~~--~~~~~~~~~~~~v~~~~~~~------------------------------------~g~l~~ld~~tG 331 (382)
.+. +. .....+.+.++.+|+..... ...+.++|+.+.
T Consensus 202 ~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~yd~~~~ 281 (323)
T TIGR03548 202 ADPTTDSEPISLLGAASIKINESLLLCIGGFNKDVYNDAVIDLATMKDESLKGYKKEYFLKPPEWYNWNRKILIYNVRTG 281 (323)
T ss_pred CCCCCCCCceeccceeEEEECCCEEEEECCcCHHHHHHHHhhhhhccchhhhhhHHHHhCCCccccCcCceEEEEECCCC
Confidence 321 10 11222334466777654211 135899999876
Q ss_pred cEeEEEec--C--CceecceEEeCCEEEEEeCc
Q 040693 332 KILWSYDT--G--ATIYGGASVSNGCIYMGNGY 360 (382)
Q Consensus 332 ~ilw~~~~--~--~~~~~~p~~~~g~lyv~~~~ 360 (382)
+ |+.-. + .....+.++.+++||+..+.
T Consensus 282 ~--W~~~~~~p~~~r~~~~~~~~~~~iyv~GG~ 312 (323)
T TIGR03548 282 K--WKSIGNSPFFARCGAALLLTGNNIFSINGE 312 (323)
T ss_pred e--eeEcccccccccCchheEEECCEEEEEecc
Confidence 5 86522 2 12334567799999998875
No 78
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.19 E-value=0.21 Score=50.01 Aligned_cols=143 Identities=12% Similarity=0.138 Sum_probs=69.5
Q ss_pred cEEEEEccCc--EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKSG--FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~g--~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.++++...++ .|+.+|.++|+..- ..... .......|.+ ++..+++.... .....|+.+|
T Consensus 275 ~La~~~~~~g~~~Iy~~dl~tg~~~~-lt~~~-~~~~~p~wSp--DG~~I~f~s~~--------------~g~~~Iy~~d 336 (448)
T PRK04792 275 KLALVLSKDGQPEIYVVDIATKALTR-ITRHR-AIDTEPSWHP--DGKSLIFTSER--------------GGKPQIYRVN 336 (448)
T ss_pred EEEEEEeCCCCeEEEEEECCCCCeEE-CccCC-CCccceEECC--CCCEEEEEECC--------------CCCceEEEEE
Confidence 3444444444 69999999886532 21111 0111122222 56666665322 1224699999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCC--CcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEe
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQ--GPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGN 358 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~--g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~ 358 (382)
+.+|+..--.. .... ...+... ++..+++.+. .. ..|+.+|+++|++.- ... ...-.+|.. .+..|+..+
T Consensus 337 l~~g~~~~Lt~-~g~~-~~~~~~SpDG~~l~~~~~-~~g~~~I~~~dl~~g~~~~-lt~-~~~d~~ps~spdG~~I~~~~ 411 (448)
T PRK04792 337 LASGKVSRLTF-EGEQ-NLGGSITPDGRSMIMVNR-TNGKFNIARQDLETGAMQV-LTS-TRLDESPSVAPNGTMVIYST 411 (448)
T ss_pred CCCCCEEEEec-CCCC-CcCeeECCCCCEEEEEEe-cCCceEEEEEECCCCCeEE-ccC-CCCCCCceECCCCCEEEEEE
Confidence 99997643322 1111 1222222 4456655543 22 368899999887532 111 112223433 344555555
Q ss_pred CceeEeecCCccC
Q 040693 359 GYKVTVGFGNKNF 371 (382)
Q Consensus 359 ~~g~~~~~~~~~~ 371 (382)
..+....+|.++.
T Consensus 412 ~~~g~~~l~~~~~ 424 (448)
T PRK04792 412 TYQGKQVLAAVSI 424 (448)
T ss_pred ecCCceEEEEEEC
Confidence 4433333454544
No 79
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-Pal system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Studies of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.10 E-value=0.33 Score=47.70 Aligned_cols=131 Identities=18% Similarity=0.165 Sum_probs=63.5
Q ss_pred EEEEEccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 207 IVVAVQKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 207 ~v~~~~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+++....+ ..++.+|..+++..--..... ......|.+ ++..|++.... .....|+.+|+
T Consensus 248 l~~~~~~~~~~~i~~~d~~~~~~~~l~~~~~--~~~~~~~s~--dg~~l~~~s~~--------------~g~~~iy~~d~ 309 (417)
T TIGR02800 248 LAVSLSKDGNPDIYVMDLDGKQLTRLTNGPG--IDTEPSWSP--DGKSIAFTSDR--------------GGSPQIYMMDA 309 (417)
T ss_pred EEEEECCCCCccEEEEECCCCCEEECCCCCC--CCCCEEECC--CCCEEEEEECC--------------CCCceEEEEEC
Confidence 44444433 469999998876432211110 111122222 56666554322 11246999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCCC---cEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEe
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQG---PIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGN 358 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~g---~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~ 358 (382)
.+++..-... .. .....+... ++..+++... .+ .|+.+|..+++..--.. ......|.. .++.|++.+
T Consensus 310 ~~~~~~~l~~-~~-~~~~~~~~spdg~~i~~~~~--~~~~~~i~~~d~~~~~~~~l~~--~~~~~~p~~spdg~~l~~~~ 383 (417)
T TIGR02800 310 DGGEVRRLTF-RG-GYNASPSWSPDGDLIAFVHR--EGGGFNIAVMDLDGGGERVLTD--TGLDESPSFAPNGRMILYAT 383 (417)
T ss_pred CCCCEEEeec-CC-CCccCeEECCCCCEEEEEEc--cCCceEEEEEeCCCCCeEEccC--CCCCCCceECCCCCEEEEEE
Confidence 9887542222 21 112222222 3456666553 33 79999998865432111 112233443 345566655
Q ss_pred Cce
Q 040693 359 GYK 361 (382)
Q Consensus 359 ~~g 361 (382)
..+
T Consensus 384 ~~~ 386 (417)
T TIGR02800 384 TRG 386 (417)
T ss_pred eCC
Confidence 544
No 80
>PTZ00420 coronin; Provisional
Probab=97.09 E-value=0.42 Score=49.09 Aligned_cols=144 Identities=9% Similarity=0.041 Sum_probs=82.9
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
..++++++.|+.+..+|..+++.+..+.... ......+ ++.++.++ ..++.|..+
T Consensus 138 ~~iLaSgS~DgtIrIWDl~tg~~~~~i~~~~------~V~SlswspdG~lLat~-----------------s~D~~IrIw 194 (568)
T PTZ00420 138 YYIMCSSGFDSFVNIWDIENEKRAFQINMPK------KLSSLKWNIKGNLLSGT-----------------CVGKHMHII 194 (568)
T ss_pred CeEEEEEeCCCeEEEEECCCCcEEEEEecCC------cEEEEEECCCCCEEEEE-----------------ecCCEEEEE
Confidence 3455677889999999999999887765321 1112222 44444433 345789999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEE-----eCCEEEEeeecC--CCcEEEEeCCC-CcEeEEEecCCcee-cceEE--eC
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTV-----ANGVLFGGSTYR--QGPIYAMDVKT-GKILWSYDTGATIY-GGASV--SN 351 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~-----~~~~v~~~~~~~--~g~l~~ld~~t-G~ilw~~~~~~~~~-~~p~~--~~ 351 (382)
|+++|+.+-+............+.. ++++++.+.... ...|...|..+ ++++-...+..... -.|.. ..
T Consensus 195 D~Rsg~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~~~pl~~~~ld~~~~~L~p~~D~~t 274 (568)
T PTZ00420 195 DPRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLKNTTSALVTMSIDNASAPLIPHYDEST 274 (568)
T ss_pred ECCCCcEEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECCCCCCceEEEEecCCccceEEeeeCCC
Confidence 9999998876654432111111111 235555544311 13689999884 67776555443211 12333 34
Q ss_pred CEEEEEeCceeEeecCCccC
Q 040693 352 GCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 352 g~lyv~~~~g~~~~~~~~~~ 371 (382)
+.+|++...+..+.+|.+..
T Consensus 275 g~l~lsGkGD~tIr~~e~~~ 294 (568)
T PTZ00420 275 GLIYLIGKGDGNCRYYQHSL 294 (568)
T ss_pred CCEEEEEECCCeEEEEEccC
Confidence 77887654445566777654
No 81
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=97.08 E-value=0.015 Score=51.61 Aligned_cols=180 Identities=11% Similarity=0.088 Sum_probs=110.3
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDR 222 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~ 222 (382)
..|...|+..|..+-++.-.+..+.+.+. . ..+..+..+..|-+++.+|.
T Consensus 39 rtvrLWNp~rg~liktYsghG~EVlD~~~-----------------------s-------~Dnskf~s~GgDk~v~vwDV 88 (307)
T KOG0316|consen 39 RTVRLWNPLRGALIKTYSGHGHEVLDAAL-----------------------S-------SDNSKFASCGGDKAVQVWDV 88 (307)
T ss_pred ceEEeecccccceeeeecCCCceeeeccc-----------------------c-------ccccccccCCCCceEEEEEc
Confidence 68889999999999988776654321110 0 11456777888999999999
Q ss_pred CCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCc--EEeeecCCCCCC
Q 040693 223 DSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGN--VLWSTADPSNGT 300 (382)
Q Consensus 223 ~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~--~~W~~~~~~~~~ 300 (382)
+||+++-++.--. +..+..--.++..|+++. ..+..+.++|.++-. ++-..+... .
T Consensus 89 ~TGkv~Rr~rgH~----aqVNtV~fNeesSVv~Sg----------------sfD~s~r~wDCRS~s~ePiQildea~--D 146 (307)
T KOG0316|consen 89 NTGKVDRRFRGHL----AQVNTVRFNEESSVVASG----------------SFDSSVRLWDCRSRSFEPIQILDEAK--D 146 (307)
T ss_pred ccCeeeeeccccc----ceeeEEEecCcceEEEec----------------cccceeEEEEcccCCCCccchhhhhc--C
Confidence 9999987776211 111111111344444442 456889999998663 332222211 2
Q ss_pred CCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEEeCceeEeecCCccCCCCCeEEE
Q 040693 301 APGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMGNGYKVTVGFGNKNFTSGTSLYA 379 (382)
Q Consensus 301 ~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~ 379 (382)
....+.+.+..++.++. +|.+..+|...|++.-- ..+.++.+.-.. .++.+.+++-++. .. -+|-.||++|=+
T Consensus 147 ~V~Si~v~~heIvaGS~--DGtvRtydiR~G~l~sD-y~g~pit~vs~s~d~nc~La~~l~st-lr--LlDk~tGklL~s 220 (307)
T KOG0316|consen 147 GVSSIDVAEHEIVAGSV--DGTVRTYDIRKGTLSSD-YFGHPITSVSFSKDGNCSLASSLDST-LR--LLDKETGKLLKS 220 (307)
T ss_pred ceeEEEecccEEEeecc--CCcEEEEEeecceeehh-hcCCcceeEEecCCCCEEEEeeccce-ee--ecccchhHHHHH
Confidence 23455566778888886 99999999998875432 233332222222 5566666666544 33 367788988765
Q ss_pred E
Q 040693 380 F 380 (382)
Q Consensus 380 ~ 380 (382)
|
T Consensus 221 Y 221 (307)
T KOG0316|consen 221 Y 221 (307)
T ss_pred h
Confidence 5
No 82
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=97.07 E-value=0.34 Score=47.35 Aligned_cols=207 Identities=19% Similarity=0.215 Sum_probs=129.1
Q ss_pred cCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCC
Q 040693 31 KGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNL 110 (382)
Q Consensus 31 ~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~ 110 (382)
+.++|+.... ...+..+|.++-+++-....+... ...+++++++.+|+....
T Consensus 85 ~~~vyv~~~~-------------~~~v~vid~~~~~~~~~~~vG~~P--------------~~~~~~~~~~~vYV~n~~- 136 (381)
T COG3391 85 GNKVYVTTGD-------------SNTVSVIDTATNTVLGSIPVGLGP--------------VGLAVDPDGKYVYVANAG- 136 (381)
T ss_pred CCeEEEecCC-------------CCeEEEEcCcccceeeEeeeccCC--------------ceEEECCCCCEEEEEecc-
Confidence 4567776642 568999997777777666664321 136788888899998652
Q ss_pred CCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCC
Q 040693 111 YSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFG 190 (382)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (382)
...+.+..+|..|++++=......
T Consensus 137 -----------------------------~~~~~vsvid~~t~~~~~~~~vG~--------------------------- 160 (381)
T COG3391 137 -----------------------------NGNNTVSVIDAATNKVTATIPVGN--------------------------- 160 (381)
T ss_pred -----------------------------cCCceEEEEeCCCCeEEEEEecCC---------------------------
Confidence 024899999999999887744432
Q ss_pred CCceEEEeeeCceeecEEEEE-ccCcEEEEEeCCCCCeeeeeccCC-CCCCCCcccceee--eCCeEEEEecCccccccc
Q 040693 191 EAPMMLSMYRNKVKHDIVVAV-QKSGFAWALDRDSGSLIWSMEAGP-GGLGGGAMWGAAT--DERRIYTNIANSQHKNFN 266 (382)
Q Consensus 191 ~~p~~~~~~~~g~~~~~v~~~-~~~g~l~ald~~tG~~~W~~~~~~-~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~ 266 (382)
.|.-..+..+ +..+++. ..++.+..+|.. +..+|+ .... ....+...+...+ ++..+|+...+
T Consensus 161 -~P~~~a~~p~---g~~vyv~~~~~~~v~vi~~~-~~~v~~-~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~------- 227 (381)
T COG3391 161 -TPTGVAVDPD---GNKVYVTNSDDNTVSVIDTS-GNSVVR-GSVGSLVGVGTGPAGIAVDPDGNRVYVANDG------- 227 (381)
T ss_pred -CcceEEECCC---CCeEEEEecCCCeEEEEeCC-Ccceec-cccccccccCCCCceEEECCCCCEEEEEecc-------
Confidence 1211111222 3445444 467889999965 556664 2211 1112223344444 67889987544
Q ss_pred cCCCCCCCCCceEEEEECCCCcEEee-ecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCc
Q 040693 267 LKPSKNSTIAGGWVAMDASNGNVLWS-TADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGAT 342 (382)
Q Consensus 267 ~~~~~~~~~~g~v~a~d~~tG~~~W~-~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~ 342 (382)
..++.+..+|..+++..+. ..............-++..+|+... ..+.+..+|..+.++.-....+..
T Consensus 228 -------~~~~~v~~id~~~~~v~~~~~~~~~~~~~~v~~~p~g~~~yv~~~-~~~~V~vid~~~~~v~~~~~~~~~ 296 (381)
T COG3391 228 -------SGSNNVLKIDTATGNVTATDLPVGSGAPRGVAVDPAGKAAYVANS-QGGTVSVIDGATDRVVKTGPTGNE 296 (381)
T ss_pred -------CCCceEEEEeCCCceEEEeccccccCCCCceeECCCCCEEEEEec-CCCeEEEEeCCCCceeeeeccccc
Confidence 2236899999999999988 3333311112222224678888764 468999999999888877766543
No 83
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=97.02 E-value=0.066 Score=55.38 Aligned_cols=118 Identities=21% Similarity=0.245 Sum_probs=87.2
Q ss_pred eEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEE
Q 040693 27 GTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIA 106 (382)
Q Consensus 27 p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~ 106 (382)
|..+=+++.+|.. +|.+..+|..|||++.+++--. ++-.....+|++| .|.++
T Consensus 168 P~TYLNKIvvGs~--------------~G~lql~Nvrt~K~v~~f~~~~---------s~IT~ieqsPaLD----VVaiG 220 (910)
T KOG1539|consen 168 PSTYLNKIVVGSS--------------QGRLQLWNVRTGKVVYTFQEFF---------SRITAIEQSPALD----VVAIG 220 (910)
T ss_pred chhheeeEEEeec--------------CCcEEEEEeccCcEEEEecccc---------cceeEeccCCcce----EEEEe
Confidence 3344477888886 8999999999999999987532 2333444578886 78888
Q ss_pred cCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCC
Q 040693 107 TGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPD 186 (382)
Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (382)
..+ |.|+.+|.+.+|++-+++-....+
T Consensus 221 ~~~---------------------------------G~ViifNlK~dkil~sFk~d~g~V-------------------- 247 (910)
T KOG1539|consen 221 LEN---------------------------------GTVIIFNLKFDKILMSFKQDWGRV-------------------- 247 (910)
T ss_pred ccC---------------------------------ceEEEEEcccCcEEEEEEccccce--------------------
Confidence 776 999999999999999998762211
Q ss_pred CCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeecc
Q 040693 187 ADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEA 233 (382)
Q Consensus 187 ~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~ 233 (382)
.-.+..-|| ..++.++...|.+..+|.+.-+++|...-
T Consensus 248 -------tslSFrtDG--~p~las~~~~G~m~~wDLe~kkl~~v~~n 285 (910)
T KOG1539|consen 248 -------TSLSFRTDG--NPLLASGRSNGDMAFWDLEKKKLINVTRN 285 (910)
T ss_pred -------eEEEeccCC--CeeEEeccCCceEEEEEcCCCeeeeeeec
Confidence 112222334 56666778889999999999999998874
No 84
>PF05567 Neisseria_PilC: Neisseria PilC beta-propeller domain; InterPro: IPR008707 This domain is found in several PilC protein sequences from Neisseria gonorrhoeae and Neisseria meningitidis. PilC is a phase-variable protein associated with pilus-mediated adherence of pathogenic Neisseria to target cells.; PDB: 3HX6_A.
Probab=96.95 E-value=0.009 Score=57.25 Aligned_cols=83 Identities=19% Similarity=0.404 Sum_probs=40.6
Q ss_pred CCceEEEEECCC-CcEEeeecCCCCCC-CCcceEEe-C-----CEEEEeeecCCCcEEEEeCCCCc-EeEEEecC----C
Q 040693 275 IAGGWVAMDASN-GNVLWSTADPSNGT-APGPVTVA-N-----GVLFGGSTYRQGPIYAMDVKTGK-ILWSYDTG----A 341 (382)
Q Consensus 275 ~~g~v~a~d~~t-G~~~W~~~~~~~~~-~~~~~~~~-~-----~~v~~~~~~~~g~l~~ld~~tG~-ilw~~~~~----~ 341 (382)
....|+.+|++| |+++|++..+.... ...+..++ + +++|++.. .|.|+-||..+.. -.|+...- .
T Consensus 179 ~~~~lyi~d~~t~G~l~~~i~~~~~~~gl~~~~~~D~d~DG~~D~vYaGDl--~GnlwR~dl~~~~~~~~~~~~~~~g~~ 256 (335)
T PF05567_consen 179 GGAALYILDADTTGALIKKIDVPGGSGGLSSPAVVDSDGDGYVDRVYAGDL--GGNLWRFDLSSANPSSWSVRTIFSGTQ 256 (335)
T ss_dssp --EEEEEEETTT---EEEEEEE--STT-EEEEEEE-TTSSSEE-EEEEEET--TSEEEEEE--TTSTT-GG-EESGGG--
T ss_pred CCcEEEEEECCCCCceEEEEecCCCCccccccEEEeccCCCeEEEEEEEcC--CCcEEEEECCCCCcccceeeecccCcC
Confidence 346799999999 99999988655221 12233333 1 28999985 8999999987422 23543211 2
Q ss_pred ceecceEEe----CCEEEEEeC
Q 040693 342 TIYGGASVS----NGCIYMGNG 359 (382)
Q Consensus 342 ~~~~~p~~~----~g~lyv~~~ 359 (382)
++.+.|.+. ...||++++
T Consensus 257 PIt~aP~v~~~~~~~~V~fGTG 278 (335)
T PF05567_consen 257 PITAAPAVVRDPDGRWVFFGTG 278 (335)
T ss_dssp ---S--EEEE-TTSSEEEEE--
T ss_pred CeEecceEEecCCCCEEEEEeC
Confidence 456666652 345566654
No 85
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=96.94 E-value=0.033 Score=56.16 Aligned_cols=174 Identities=14% Similarity=0.132 Sum_probs=115.3
Q ss_pred CCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEE
Q 040693 140 NHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWA 219 (382)
Q Consensus 140 ~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~a 219 (382)
..+|.|.-+|+.+++++-..+..+..+|..+-. .....+.++..+|.++-
T Consensus 87 g~sg~i~EwDl~~lk~~~~~d~~gg~IWsiai~------------------------------p~~~~l~IgcddGvl~~ 136 (691)
T KOG2048|consen 87 GLSGSITEWDLHTLKQKYNIDSNGGAIWSIAIN------------------------------PENTILAIGCDDGVLYD 136 (691)
T ss_pred cCCceEEEEecccCceeEEecCCCcceeEEEeC------------------------------CccceEEeecCCceEEE
Confidence 345899999999999999999888877744321 11466677778899999
Q ss_pred EeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 220 LDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 220 ld~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
++...+++..+..+.-. -+....... ++-.++.+ ..+|.|.++|.++|..+.......
T Consensus 137 ~s~~p~~I~~~r~l~rq---~sRvLslsw~~~~~~i~~G-----------------s~Dg~Iriwd~~~~~t~~~~~~~~ 196 (691)
T KOG2048|consen 137 FSIGPDKITYKRSLMRQ---KSRVLSLSWNPTGTKIAGG-----------------SIDGVIRIWDVKSGQTLHIITMQL 196 (691)
T ss_pred EecCCceEEEEeecccc---cceEEEEEecCCccEEEec-----------------ccCceEEEEEcCCCceEEEeeecc
Confidence 99999999998887531 112222222 22334544 456889999999999988554433
Q ss_pred CCCCC-------cceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEeCceeEee
Q 040693 298 NGTAP-------GPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGNGYKVTVG 365 (382)
Q Consensus 298 ~~~~~-------~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~~~g~~~~ 365 (382)
....- +....-++.+..+.. .|.|-..|.++|..+-.+..-.+..-+.++ .+++||++..++.+..
T Consensus 197 d~l~k~~~~iVWSv~~Lrd~tI~sgDS--~G~V~FWd~~~gTLiqS~~~h~adVl~Lav~~~~d~vfsaGvd~~ii~ 271 (691)
T KOG2048|consen 197 DRLSKREPTIVWSVLFLRDSTIASGDS--AGTVTFWDSIFGTLIQSHSCHDADVLALAVADNEDRVFSAGVDPKIIQ 271 (691)
T ss_pred cccccCCceEEEEEEEeecCcEEEecC--CceEEEEcccCcchhhhhhhhhcceeEEEEcCCCCeEEEccCCCceEE
Confidence 11111 122234566666664 788888888888877766554444444444 3479999988887544
No 86
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=96.84 E-value=0.074 Score=50.06 Aligned_cols=83 Identities=12% Similarity=0.121 Sum_probs=62.4
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeC
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSN 351 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~ 351 (382)
..+++|..+|..||..+.+.....+ +...+++. |.+++-.. +++.|.+.|.++++-+-..+....+..+.-...
T Consensus 311 SrDktIk~wdv~tg~cL~tL~ghdn--wVr~~af~p~Gkyi~Sca--DDktlrvwdl~~~~cmk~~~ah~hfvt~lDfh~ 386 (406)
T KOG0295|consen 311 SRDKTIKIWDVSTGMCLFTLVGHDN--WVRGVAFSPGGKYILSCA--DDKTLRVWDLKNLQCMKTLEAHEHFVTSLDFHK 386 (406)
T ss_pred cccceEEEEeccCCeEEEEEecccc--eeeeeEEcCCCeEEEEEe--cCCcEEEEEeccceeeeccCCCcceeEEEecCC
Confidence 5678999999999999999876654 33344443 45666666 599999999999988777776555666777778
Q ss_pred CEEEEEeCc
Q 040693 352 GCIYMGNGY 360 (382)
Q Consensus 352 g~lyv~~~~ 360 (382)
...||.++.
T Consensus 387 ~~p~VvTGs 395 (406)
T KOG0295|consen 387 TAPYVVTGS 395 (406)
T ss_pred CCceEEecc
Confidence 888888763
No 87
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=96.83 E-value=0.15 Score=46.61 Aligned_cols=107 Identities=17% Similarity=0.194 Sum_probs=62.0
Q ss_pred ecEEEEEc-cCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQ-KSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~-~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
++.|+... ....|.-+|+.+|..-=--.++.........|.. .-+.+.++. ...+.++.||
T Consensus 199 dGsvwyaslagnaiaridp~~~~aev~p~P~~~~~gsRriwsd--pig~~witt----------------wg~g~l~rfd 260 (353)
T COG4257 199 DGSVWYASLAGNAIARIDPFAGHAEVVPQPNALKAGSRRIWSD--PIGRAWITT----------------WGTGSLHRFD 260 (353)
T ss_pred CCcEEEEeccccceEEcccccCCcceecCCCcccccccccccC--ccCcEEEec----------------cCCceeeEeC
Confidence 45565553 3457888999988322111111100111222221 236677764 3357899999
Q ss_pred CCCCcEEee-ecCCCCCCCCcceEEeC-CEEEEeeecCCCcEEEEeCCCCc
Q 040693 284 ASNGNVLWS-TADPSNGTAPGPVTVAN-GVLFGGSTYRQGPIYAMDVKTGK 332 (382)
Q Consensus 284 ~~tG~~~W~-~~~~~~~~~~~~~~~~~-~~v~~~~~~~~g~l~~ld~~tG~ 332 (382)
+++-. |. ++++......-.+.+++ ++|+.... ..|.|.-||+.+-+
T Consensus 261 Ps~~s--W~eypLPgs~arpys~rVD~~grVW~sea-~agai~rfdpeta~ 308 (353)
T COG4257 261 PSVTS--WIEYPLPGSKARPYSMRVDRHGRVWLSEA-DAGAIGRFDPETAR 308 (353)
T ss_pred ccccc--ceeeeCCCCCCCcceeeeccCCcEEeecc-ccCceeecCcccce
Confidence 98876 65 45555333344555664 89998654 57899999988544
No 88
>PRK02889 tolB translocation protein TolB; Provisional
Probab=96.82 E-value=0.62 Score=46.28 Aligned_cols=142 Identities=19% Similarity=0.175 Sum_probs=67.8
Q ss_pred EEEEEccCc--EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 207 IVVAVQKSG--FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 207 ~v~~~~~~g--~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+++....++ .+|.+|..+++.. +..... .......|.+ |+..+++.... .....|+.+|.
T Consensus 254 la~~~~~~g~~~Iy~~d~~~~~~~-~lt~~~-~~~~~~~wSp--DG~~l~f~s~~--------------~g~~~Iy~~~~ 315 (427)
T PRK02889 254 LAVALSRDGNSQIYTVNADGSGLR-RLTQSS-GIDTEPFFSP--DGRSIYFTSDR--------------GGAPQIYRMPA 315 (427)
T ss_pred EEEEEccCCCceEEEEECCCCCcE-ECCCCC-CCCcCeEEcC--CCCEEEEEecC--------------CCCcEEEEEEC
Confidence 444444444 6888898766532 222111 0111122322 56666654321 11245888888
Q ss_pred CCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCC--CcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEeC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQ--GPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGNG 359 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~--g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~~ 359 (382)
.+|+..-.. .... ....+... ++..++..+. .. ..|+.+|.++|++..-.. ......|.. .+..|+..+.
T Consensus 316 ~~g~~~~lt-~~g~-~~~~~~~SpDG~~Ia~~s~-~~g~~~I~v~d~~~g~~~~lt~--~~~~~~p~~spdg~~l~~~~~ 390 (427)
T PRK02889 316 SGGAAQRVT-FTGS-YNTSPRISPDGKLLAYISR-VGGAFKLYVQDLATGQVTALTD--TTRDESPSFAPNGRYILYATQ 390 (427)
T ss_pred CCCceEEEe-cCCC-CcCceEECCCCCEEEEEEc-cCCcEEEEEEECCCCCeEEccC--CCCccCceECCCCCEEEEEEe
Confidence 877643222 1111 11222222 3455554442 12 269999999888654322 222244544 3444555554
Q ss_pred ceeEeecCCccC
Q 040693 360 YKVTVGFGNKNF 371 (382)
Q Consensus 360 ~g~~~~~~~~~~ 371 (382)
.+....+|.++.
T Consensus 391 ~~g~~~l~~~~~ 402 (427)
T PRK02889 391 QGGRSVLAAVSS 402 (427)
T ss_pred cCCCEEEEEEEC
Confidence 443344555555
No 89
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=96.82 E-value=0.1 Score=49.49 Aligned_cols=53 Identities=19% Similarity=0.179 Sum_probs=40.3
Q ss_pred CCCcEEEEeCCCCcEeEEEecC-CceecceEEeCCEEEEEeCceeEeecCCccC
Q 040693 319 RQGPIYAMDVKTGKILWSYDTG-ATIYGGASVSNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 319 ~~g~l~~ld~~tG~ilw~~~~~-~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
-+..|...|..+|+-+-.+.-. +.++.-....|.||.|+.+...+.++|.+.+
T Consensus 387 FDkSVkLW~g~tGk~lasfRGHv~~VYqvawsaDsRLlVS~SkDsTLKvw~V~t 440 (480)
T KOG0271|consen 387 FDKSVKLWDGRTGKFLASFRGHVAAVYQVAWSADSRLLVSGSKDSTLKVWDVRT 440 (480)
T ss_pred cccceeeeeCCCcchhhhhhhccceeEEEEeccCccEEEEcCCCceEEEEEeee
Confidence 4888999999999988877522 3455555568999999999888888776554
No 90
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=96.80 E-value=0.38 Score=49.07 Aligned_cols=220 Identities=15% Similarity=0.162 Sum_probs=138.2
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
+..|..+|..+|+.+=..-.+ +.+.+|. ..+....+.++-+..
T Consensus 227 ~~tl~~~~~~~~~~i~~~l~G----------H~g~V~~--l~~~~~~~~lvsgS~------------------------- 269 (537)
T KOG0274|consen 227 DSTLHLWDLNNGYLILTRLVG----------HFGGVWG--LAFPSGGDKLVSGST------------------------- 269 (537)
T ss_pred CceeEEeecccceEEEeeccC----------CCCCcee--EEEecCCCEEEEEec-------------------------
Confidence 778889999998887662221 1222332 233333456666554
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
+..+...|..||+-.=.+..... +-....+ ....++.++.
T Consensus 270 --------D~t~rvWd~~sg~C~~~l~gh~s---------------------------tv~~~~~-----~~~~~~sgs~ 309 (537)
T KOG0274|consen 270 --------DKTERVWDCSTGECTHSLQGHTS---------------------------SVRCLTI-----DPFLLVSGSR 309 (537)
T ss_pred --------CCcEEeEecCCCcEEEEecCCCc---------------------------eEEEEEc-----cCceEeeccC
Confidence 37888889999988877764332 1122211 1345556678
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
|..+.+.|..+|+.+=.... ..........+++.++.+ ..++.|-.+|+.+++.+-..
T Consensus 310 D~tVkVW~v~n~~~l~l~~~-----h~~~V~~v~~~~~~lvsg-----------------s~d~~v~VW~~~~~~cl~sl 367 (537)
T KOG0274|consen 310 DNTVKVWDVTNGACLNLLRG-----HTGPVNCVQLDEPLLVSG-----------------SYDGTVKVWDPRTGKCLKSL 367 (537)
T ss_pred CceEEEEeccCcceEEEecc-----ccccEEEEEecCCEEEEE-----------------ecCceEEEEEhhhceeeeee
Confidence 99999999999988877662 111223333367778877 55689999999999999887
Q ss_pred cCCCCCCCCcceEEeC-CEEEEeeecCCCcEEEEeCCCC-cEeEEEecCCceecceEEeCCEEEEEeCceeEeecCCccC
Q 040693 294 ADPSNGTAPGPVTVAN-GVLFGGSTYRQGPIYAMDVKTG-KILWSYDTGATIYGGASVSNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 294 ~~~~~~~~~~~~~~~~-~~v~~~~~~~~g~l~~ld~~tG-~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
..... ....+.+++ ..++-++. ++.|.+.|+.+. +-+-.++.......... ..+..+++.+....++++ |.
T Consensus 368 ~gH~~--~V~sl~~~~~~~~~Sgs~--D~~IkvWdl~~~~~c~~tl~~h~~~v~~l~-~~~~~Lvs~~aD~~Ik~W--D~ 440 (537)
T KOG0274|consen 368 SGHTG--RVYSLIVDSENRLLSGSL--DTTIKVWDLRTKRKCIHTLQGHTSLVSSLL-LRDNFLVSSSADGTIKLW--DA 440 (537)
T ss_pred cCCcc--eEEEEEecCcceEEeeee--ccceEeecCCchhhhhhhhcCCcccccccc-cccceeEeccccccEEEe--ec
Confidence 76543 234456677 78888885 899999999988 55555444444343333 344455554444456667 44
Q ss_pred CCCCeEEE
Q 040693 372 TSGTSLYA 379 (382)
Q Consensus 372 ~~g~~l~~ 379 (382)
.+|+.+-.
T Consensus 441 ~~~~~~~~ 448 (537)
T KOG0274|consen 441 EEGECLRT 448 (537)
T ss_pred ccCceeee
Confidence 44665544
No 91
>KOG2103 consensus Uncharacterized conserved protein [Function unknown]
Probab=96.79 E-value=0.032 Score=57.56 Aligned_cols=187 Identities=13% Similarity=0.188 Sum_probs=108.9
Q ss_pred ccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCc
Q 040693 63 KTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHS 142 (382)
Q Consensus 63 ~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (382)
.-||..|+...-..... ..-.++..+.++++.+..
T Consensus 22 q~gkfdwr~~~vG~~k~------------~~~~~~t~~~rlivsT~~--------------------------------- 56 (910)
T KOG2103|consen 22 QAGKFDWRQQLVGVKKV------------NFLVYDTKSKRLIVSTEK--------------------------------- 56 (910)
T ss_pred HhhhcchhhhcccceeE------------EEEeecCCCceEEEEecc---------------------------------
Confidence 46788888766321110 134566677789998764
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccC-cEEEEEe
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKS-GFAWALD 221 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~-g~l~ald 221 (382)
+-+.+||+.||+++|+.-+.+...- . ..|.. . + -+.+ ..+++.|
T Consensus 57 ~vlAsL~~~tGei~WRqvl~~~~~~---------------------~-~~~~~---------~---~-iS~dg~~lr~wn 101 (910)
T KOG2103|consen 57 GVLASLNLRTGEIIWRQVLEPKTSG---------------------L-GVPLT---------N---T-ISVDGRYLRSWN 101 (910)
T ss_pred chhheecccCCcEEEEEeccCCCcc---------------------c-Cccee---------E---E-EccCCcEEEeec
Confidence 7899999999999999987765210 0 11111 1 1 2223 3699999
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTA 301 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~ 301 (382)
...|-+.|+.+.... . ......+-.+..++. ....+.|+++|...+......
T Consensus 102 ~~~g~l~~~i~l~~g-~---~~~~~~v~~~i~v~~------------------------g~~~~~g~l~w~~~~~~~~~~ 153 (910)
T KOG2103|consen 102 TNNGILDWEIELADG-F---KGLLLEVNKGIAVLN------------------------GHTRKFGELKWVESFSISIEE 153 (910)
T ss_pred CCCceeeeecccccc-c---ceeEEEEccceEEEc------------------------ceeccccceeehhhccccchh
Confidence 999999999997542 1 111111222333332 145678999999876542111
Q ss_pred C-c-ceEEeCCEEEEee--ecCCCcEEEEeCCCCcEe-EEEecCCceecceE--EeCCEEEEE
Q 040693 302 P-G-PVTVANGVLFGGS--TYRQGPIYAMDVKTGKIL-WSYDTGATIYGGAS--VSNGCIYMG 357 (382)
Q Consensus 302 ~-~-~~~~~~~~v~~~~--~~~~g~l~~ld~~tG~il-w~~~~~~~~~~~p~--~~~g~lyv~ 357 (382)
. . ......+.+|... ......+.+++.++|++. |+.....+...... ...+.++++
T Consensus 154 ~~q~~~~~~t~vvy~~~~l~~s~~~V~~~~~~~g~v~~~~~~v~~pw~~~~~c~~~k~~vl~~ 216 (910)
T KOG2103|consen 154 DLQDAKIYGTDVVYVLGLLKRSGSCVQQVFSDDGEVTGPQSTVLGPWFKVLSCSTDKEVVLVC 216 (910)
T ss_pred HHHHhhhccCcEEEEEEEEecCCceEEEEEccCCcEecceeeeecCcccccccccccceEEEc
Confidence 1 1 1112234444322 112556889999999998 77776665554443 244444444
No 92
>PRK04792 tolB translocation protein TolB; Provisional
Probab=96.76 E-value=0.71 Score=46.18 Aligned_cols=114 Identities=12% Similarity=0.117 Sum_probs=57.3
Q ss_pred cEEEEEcc--CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQK--SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~--~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.+++.... ...++.+|.++|+..--..... ......+.+ +++.+++.... .....|+.+|
T Consensus 319 ~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g~--~~~~~~~Sp--DG~~l~~~~~~--------------~g~~~I~~~d 380 (448)
T PRK04792 319 SLIFTSERGGKPQIYRVNLASGKVSRLTFEGE--QNLGGSITP--DGRSMIMVNRT--------------NGKFNIARQD 380 (448)
T ss_pred EEEEEECCCCCceEEEEECCCCCEEEEecCCC--CCcCeeECC--CCCEEEEEEec--------------CCceEEEEEE
Confidence 34444433 2479999999887532211110 111112221 66677665332 1124688899
Q ss_pred CCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeee-cCCCcEEEEeCCCCcEeEEEecCC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGST-YRQGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~-~~~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
+.+|+..-..... ....|... ++..++..+. .....|+.+|. +|+...+...+.
T Consensus 381 l~~g~~~~lt~~~---~d~~ps~spdG~~I~~~~~~~g~~~l~~~~~-~G~~~~~l~~~~ 436 (448)
T PRK04792 381 LETGAMQVLTSTR---LDESPSVAPNGTMVIYSTTYQGKQVLAAVSI-DGRFKARLPAGQ 436 (448)
T ss_pred CCCCCeEEccCCC---CCCCceECCCCCEEEEEEecCCceEEEEEEC-CCCceEECcCCC
Confidence 9999764322221 11222222 3445555443 11234888887 577777665543
No 93
>PTZ00420 coronin; Provisional
Probab=96.71 E-value=0.9 Score=46.71 Aligned_cols=147 Identities=12% Similarity=0.094 Sum_probs=80.8
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
++.|..+|..+++.+.+...... ..-..+..+ +.++.++..++.+..+|
T Consensus 147 DgtIrIWDl~tg~~~~~i~~~~~----------------------------V~Slswspd---G~lLat~s~D~~IrIwD 195 (568)
T PTZ00420 147 DSFVNIWDIENEKRAFQINMPKK----------------------------LSSLKWNIK---GNLLSGTCVGKHMHIID 195 (568)
T ss_pred CCeEEEEECCCCcEEEEEecCCc----------------------------EEEEEECCC---CCEEEEEecCCEEEEEE
Confidence 48899999999998877643211 001112222 35666677889999999
Q ss_pred CCCCCeeeeeccCCCCCCCCccccee--eeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCC-CcEEeeecCCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAA--TDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASN-GNVLWSTADPSN 298 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~t-G~~~W~~~~~~~ 298 (382)
..+|+.+-++..-.........|... .+++.++.+..+. ...+.|..+|+++ ++++-.......
T Consensus 196 ~Rsg~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~-------------~~~R~VkLWDlr~~~~pl~~~~ld~~ 262 (568)
T PTZ00420 196 PRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSK-------------NNMREMKLWDLKNTTSALVTMSIDNA 262 (568)
T ss_pred CCCCcEEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCC-------------CCccEEEEEECCCCCCceEEEEecCC
Confidence 99999886654321100001112111 1445555432210 1224699999985 666655544331
Q ss_pred CCCCcceEE-eCCEEEEeeecCCCcEEEEeCCCCcE
Q 040693 299 GTAPGPVTV-ANGVLFGGSTYRQGPIYAMDVKTGKI 333 (382)
Q Consensus 299 ~~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~tG~i 333 (382)
.....|..- +.+.+|++.. .++.|+.++..++.+
T Consensus 263 ~~~L~p~~D~~tg~l~lsGk-GD~tIr~~e~~~~~~ 297 (568)
T PTZ00420 263 SAPLIPHYDESTGLIYLIGK-GDGNCRYYQHSLGSI 297 (568)
T ss_pred ccceEEeeeCCCCCEEEEEE-CCCeEEEEEccCCcE
Confidence 111112211 2355565442 589999999887754
No 94
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=96.70 E-value=0.55 Score=44.02 Aligned_cols=240 Identities=15% Similarity=0.148 Sum_probs=120.8
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
--.++.||..+|+++=.....+. .++-|+ .++.+++.++|..-++ +
T Consensus 27 G~~~~v~D~~~g~~~~~~~a~~g-----RHFyGH------g~fs~dG~~LytTEnd-~---------------------- 72 (305)
T PF07433_consen 27 GTFALVFDCRTGQLLQRLWAPPG-----RHFYGH------GVFSPDGRLLYTTEND-Y---------------------- 72 (305)
T ss_pred CcEEEEEEcCCCceeeEEcCCCC-----CEEecC------EEEcCCCCEEEEeccc-c----------------------
Confidence 44689999999998865543221 134455 3456666677765433 1
Q ss_pred CCCCCCCCcceEEEEECC-CCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeC-----ceeecE
Q 040693 134 KCIEPENHSNSLLALDLD-TGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRN-----KVKHDI 207 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~-tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~-----g~~~~~ 207 (382)
.+..|.|-..|+. +=+.+=+++..... |++-. -.+..+.|+..+.. ...+..
T Consensus 73 -----~~g~G~IgVyd~~~~~~ri~E~~s~GIG---------------PHel~--l~pDG~tLvVANGGI~Thpd~GR~k 130 (305)
T PF07433_consen 73 -----ETGRGVIGVYDAARGYRRIGEFPSHGIG---------------PHELL--LMPDGETLVVANGGIETHPDSGRAK 130 (305)
T ss_pred -----CCCcEEEEEEECcCCcEEEeEecCCCcC---------------hhhEE--EcCCCCEEEEEcCCCccCcccCcee
Confidence 1234888899987 44555455433221 00000 00112222211100 000122
Q ss_pred EEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCC
Q 040693 208 VVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASN 286 (382)
Q Consensus 208 v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~t 286 (382)
+-..+-+..|.-||..+|+++=+..+++.--. .+..-.++ .++.|.+...... ..... --+.++-...
T Consensus 131 LNl~tM~psL~~ld~~sG~ll~q~~Lp~~~~~-lSiRHLa~~~~G~V~~a~Q~qg-~~~~~---------~PLva~~~~g 199 (305)
T PF07433_consen 131 LNLDTMQPSLVYLDARSGALLEQVELPPDLHQ-LSIRHLAVDGDGTVAFAMQYQG-DPGDA---------PPLVALHRRG 199 (305)
T ss_pred cChhhcCCceEEEecCCCceeeeeecCccccc-cceeeEEecCCCcEEEEEecCC-CCCcc---------CCeEEEEcCC
Confidence 22223346688899999999977776542111 11111222 4566666533211 11111 1144444333
Q ss_pred CcEEeeecCCCC-----CCCCcceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeC
Q 040693 287 GNVLWSTADPSN-----GTAPGPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNG 359 (382)
Q Consensus 287 G~~~W~~~~~~~-----~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~ 359 (382)
+... ....+.. .-..+++.+. ++.|.+++. +.+.+..+|.++|+.+-...++.. ...+..++...++++
T Consensus 200 ~~~~-~~~~p~~~~~~l~~Y~gSIa~~~~g~~ia~tsP-rGg~~~~~d~~tg~~~~~~~l~D~--cGva~~~~~f~~ssG 275 (305)
T PF07433_consen 200 GALR-LLPAPEEQWRRLNGYIGSIAADRDGRLIAVTSP-RGGRVAVWDAATGRLLGSVPLPDA--CGVAPTDDGFLVSSG 275 (305)
T ss_pred Ccce-eccCChHHHHhhCCceEEEEEeCCCCEEEEECC-CCCEEEEEECCCCCEeeccccCce--eeeeecCCceEEeCC
Confidence 3222 1222110 1234555554 346666665 677888899999999998887642 233334444677777
Q ss_pred ceeEe
Q 040693 360 YKVTV 364 (382)
Q Consensus 360 ~g~~~ 364 (382)
.|.+.
T Consensus 276 ~G~~~ 280 (305)
T PF07433_consen 276 QGQLI 280 (305)
T ss_pred CccEE
Confidence 77743
No 95
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=96.67 E-value=0.37 Score=43.79 Aligned_cols=142 Identities=11% Similarity=0.111 Sum_probs=88.3
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccce--eeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGA--ATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~--~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
...++.++.|..+..+|.+|||.+...+.+.+ .... -.+++++.+...+.. ...+.|..+
T Consensus 64 s~~liTGSAD~t~kLWDv~tGk~la~~k~~~~------Vk~~~F~~~gn~~l~~tD~~m------------g~~~~v~~f 125 (327)
T KOG0643|consen 64 SKHLITGSADQTAKLWDVETGKQLATWKTNSP------VKRVDFSFGGNLILASTDKQM------------GYTCFVSVF 125 (327)
T ss_pred cceeeeccccceeEEEEcCCCcEEEEeecCCe------eEEEeeccCCcEEEEEehhhc------------CcceEEEEE
Confidence 35677888888899999999999988886542 1111 125566666544422 234668888
Q ss_pred ECC-------CCcEEeeecCCCCCCCCcceEE---eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC-CceecceE-Ee
Q 040693 283 DAS-------NGNVLWSTADPSNGTAPGPVTV---ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG-ATIYGGAS-VS 350 (382)
Q Consensus 283 d~~-------tG~~~W~~~~~~~~~~~~~~~~---~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~-~~~~~~p~-~~ 350 (382)
|++ .-++.-++..+.. ....+. -+..++.+- .+|.|...|+.+|+++=+.... ...+.... ..
T Consensus 126 di~~~~~~~~s~ep~~kI~t~~s---kit~a~Wg~l~~~ii~Gh--e~G~is~~da~~g~~~v~s~~~h~~~Ind~q~s~ 200 (327)
T KOG0643|consen 126 DIRDDSSDIDSEEPYLKIPTPDS---KITSALWGPLGETIIAGH--EDGSISIYDARTGKELVDSDEEHSSKINDLQFSR 200 (327)
T ss_pred EccCChhhhcccCceEEecCCcc---ceeeeeecccCCEEEEec--CCCcEEEEEcccCceeeechhhhccccccccccC
Confidence 887 3345555554431 111111 145666666 4999999999999877655322 22233333 36
Q ss_pred CCEEEEEeCceeEeecCCc
Q 040693 351 NGCIYMGNGYKVTVGFGNK 369 (382)
Q Consensus 351 ~g~lyv~~~~g~~~~~~~~ 369 (382)
+...|++.+.....+++.+
T Consensus 201 d~T~FiT~s~Dttakl~D~ 219 (327)
T KOG0643|consen 201 DRTYFITGSKDTTAKLVDV 219 (327)
T ss_pred CcceEEecccCccceeeec
Confidence 7778888887776666443
No 96
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=96.66 E-value=0.5 Score=43.02 Aligned_cols=84 Identities=19% Similarity=0.215 Sum_probs=49.5
Q ss_pred CCCceEEEEECCCCcEEeeec-CCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-eC
Q 040693 274 TIAGGWVAMDASNGNVLWSTA-DPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SN 351 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~-~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~ 351 (382)
..+|.|..+|+++|+++=... ........-.+ ..+...|+... .+..-..+|..+-+++-++.+..+.-...+. ..
T Consensus 166 he~G~is~~da~~g~~~v~s~~~h~~~Ind~q~-s~d~T~FiT~s-~Dttakl~D~~tl~v~Kty~te~PvN~aaisP~~ 243 (327)
T KOG0643|consen 166 HEDGSISIYDARTGKELVDSDEEHSSKINDLQF-SRDRTYFITGS-KDTTAKLVDVRTLEVLKTYTTERPVNTAAISPLL 243 (327)
T ss_pred cCCCcEEEEEcccCceeeechhhhccccccccc-cCCcceEEecc-cCccceeeeccceeeEEEeeecccccceeccccc
Confidence 456999999999997765543 22211111122 22344455443 5777888999998898888776654333222 34
Q ss_pred CEEEEEeC
Q 040693 352 GCIYMGNG 359 (382)
Q Consensus 352 g~lyv~~~ 359 (382)
+.+.++.+
T Consensus 244 d~VilgGG 251 (327)
T KOG0643|consen 244 DHVILGGG 251 (327)
T ss_pred ceEEecCC
Confidence 55555433
No 97
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=96.64 E-value=0.66 Score=45.91 Aligned_cols=121 Identities=17% Similarity=0.189 Sum_probs=76.7
Q ss_pred eeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCe
Q 040693 23 ITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNH 102 (382)
Q Consensus 23 ~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 102 (382)
++.+|- |..|+...+ ||.++++|-+||+.+-.+.-.+... +.+| ...-.++..+
T Consensus 196 VRysPD---G~~Fat~gs-------------Dgki~iyDGktge~vg~l~~~~aHk--------GsIf--alsWsPDs~~ 249 (603)
T KOG0318|consen 196 VRYSPD---GSRFATAGS-------------DGKIYIYDGKTGEKVGELEDSDAHK--------GSIF--ALSWSPDSTQ 249 (603)
T ss_pred EEECCC---CCeEEEecC-------------CccEEEEcCCCccEEEEecCCCCcc--------ccEE--EEEECCCCce
Confidence 344554 666665553 9999999999999998877532221 1222 1334455555
Q ss_pred EEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCc-ccccccccCCCCCCCC
Q 040693 103 VYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYD-VWFGACNWYLNPNCPP 181 (382)
Q Consensus 103 v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~-~~~~~~~~~~~~~~~~ 181 (382)
+....++ ..+...|..+.+++-++.+...- ....+
T Consensus 250 ~~T~SaD---------------------------------kt~KIWdVs~~slv~t~~~~~~v~dqqvG----------- 285 (603)
T KOG0318|consen 250 FLTVSAD---------------------------------KTIKIWDVSTNSLVSTWPMGSTVEDQQVG----------- 285 (603)
T ss_pred EEEecCC---------------------------------ceEEEEEeeccceEEEeecCCchhceEEE-----------
Confidence 5554443 67778888888888887776440 01111
Q ss_pred CCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeec
Q 040693 182 GPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSME 232 (382)
Q Consensus 182 ~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~ 232 (382)
++. .++.|+..+-+|++.-|++.+++++-.+.
T Consensus 286 ------------~lW-------qkd~lItVSl~G~in~ln~~d~~~~~~i~ 317 (603)
T KOG0318|consen 286 ------------CLW-------QKDHLITVSLSGTINYLNPSDPSVLKVIS 317 (603)
T ss_pred ------------EEE-------eCCeEEEEEcCcEEEEecccCCChhheec
Confidence 122 15667777778999999999988665554
No 98
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.56 E-value=0.52 Score=47.24 Aligned_cols=147 Identities=16% Similarity=0.220 Sum_probs=90.6
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
++.|...|..+|+.+=........+ ....+..+ +..++++..|+.+..+|
T Consensus 267 D~tvriWd~~~~~~~~~l~~hs~~i---------------------------s~~~f~~d---~~~l~s~s~d~~i~vwd 316 (456)
T KOG0266|consen 267 DGTVRIWDVRTGECVRKLKGHSDGI---------------------------SGLAFSPD---GNLLVSASYDGTIRVWD 316 (456)
T ss_pred CCcEEEEeccCCeEEEeeeccCCce---------------------------EEEEECCC---CCEEEEcCCCccEEEEE
Confidence 4899999999999888776654311 11122222 47788888999999999
Q ss_pred CCCCCee--eeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 222 RDSGSLI--WSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 222 ~~tG~~~--W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
..+|+.+ -...... ... ....... ++..++.. ..++.+..+|+..++..=.+....
T Consensus 317 ~~~~~~~~~~~~~~~~--~~~-~~~~~~fsp~~~~ll~~-----------------~~d~~~~~w~l~~~~~~~~~~~~~ 376 (456)
T KOG0266|consen 317 LETGSKLCLKLLSGAE--NSA-PVTSVQFSPNGKYLLSA-----------------SLDRTLKLWDLRSGKSVGTYTGHS 376 (456)
T ss_pred CCCCceeeeecccCCC--CCC-ceeEEEECCCCcEEEEe-----------------cCCCeEEEEEccCCcceeeecccC
Confidence 9999965 2211111 010 1111111 44555554 344678888998887766655433
Q ss_pred CC--CCCcceEEe-CCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 298 NG--TAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 298 ~~--~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
.. -...+.... +..++.+. .++.|+.+|..++.++-+.+..
T Consensus 377 ~~~~~~~~~~~~~~~~~i~sg~--~d~~v~~~~~~s~~~~~~l~~h 420 (456)
T KOG0266|consen 377 NLVRCIFSPTLSTGGKLIYSGS--EDGSVYVWDSSSGGILQRLEGH 420 (456)
T ss_pred CcceeEecccccCCCCeEEEEe--CCceEEEEeCCccchhhhhcCC
Confidence 21 112222233 34566666 5999999999999888877655
No 99
>PRK04043 tolB translocation protein TolB; Provisional
Probab=96.50 E-value=1 Score=44.67 Aligned_cols=177 Identities=12% Similarity=0.118 Sum_probs=90.0
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEc--cCcEEEEE
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQ--KSGFAWAL 220 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~--~~g~l~al 220 (382)
..|+.+|..+|+..--....... ..| ...++| ..+++... .+..++.+
T Consensus 213 ~~Iyv~dl~tg~~~~lt~~~g~~-------------------------~~~---~~SPDG--~~la~~~~~~g~~~Iy~~ 262 (419)
T PRK04043 213 PTLYKYNLYTGKKEKIASSQGML-------------------------VVS---DVSKDG--SKLLLTMAPKGQPDIYLY 262 (419)
T ss_pred CEEEEEECCCCcEEEEecCCCcE-------------------------Eee---EECCCC--CEEEEEEccCCCcEEEEE
Confidence 58999999998765433322110 111 123344 33444433 34689999
Q ss_pred eCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCC
Q 040693 221 DRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGT 300 (382)
Q Consensus 221 d~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~ 300 (382)
|.++|+.. .....+. ......|.+ |+..|++.... .....|+.+|+.+|+..-.+....
T Consensus 263 dl~~g~~~-~LT~~~~-~d~~p~~SP--DG~~I~F~Sdr--------------~g~~~Iy~~dl~~g~~~rlt~~g~--- 321 (419)
T PRK04043 263 DTNTKTLT-QITNYPG-IDVNGNFVE--DDKRIVFVSDR--------------LGYPNIFMKKLNSGSVEQVVFHGK--- 321 (419)
T ss_pred ECCCCcEE-EcccCCC-ccCccEECC--CCCEEEEEECC--------------CCCceEEEEECCCCCeEeCccCCC---
Confidence 99888633 2221111 111223333 67777776543 122469999999998732222111
Q ss_pred CCcceEEeCCE-EEEeeecC-----C-CcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEeCceeEeecCCccC
Q 040693 301 APGPVTVANGV-LFGGSTYR-----Q-GPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 301 ~~~~~~~~~~~-v~~~~~~~-----~-g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
......-+++. +|+..... . ..|+.+|+++|++. +.... .....|.. .+..|+..+..+....++.++.
T Consensus 322 ~~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~-~LT~~-~~~~~p~~SPDG~~I~f~~~~~~~~~L~~~~l 399 (419)
T PRK04043 322 NNSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIR-RLTAN-GVNQFPRFSSDGGSIMFIKYLGNQSALGIIRL 399 (419)
T ss_pred cCceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeE-ECCCC-CCcCCeEECCCCCEEEEEEccCCcEEEEEEec
Confidence 11222234554 44433210 1 47999999988742 22222 22334554 3455666655544445555555
Q ss_pred C
Q 040693 372 T 372 (382)
Q Consensus 372 ~ 372 (382)
.
T Consensus 400 ~ 400 (419)
T PRK04043 400 N 400 (419)
T ss_pred C
Confidence 3
No 100
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=96.50 E-value=0.23 Score=46.86 Aligned_cols=165 Identities=13% Similarity=0.183 Sum_probs=92.0
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
++.|-|.|+..-|++-.+.-.-...+ | +.--|. .++|+.++.|..+...|
T Consensus 214 dk~VKCwDLe~nkvIR~YhGHlS~V~-----------~---------L~lhPT----------ldvl~t~grDst~RvWD 263 (460)
T KOG0285|consen 214 DKQVKCWDLEYNKVIRHYHGHLSGVY-----------C---------LDLHPT----------LDVLVTGGRDSTIRVWD 263 (460)
T ss_pred CCeeEEEechhhhhHHHhccccceeE-----------E---------Eecccc----------ceeEEecCCcceEEEee
Confidence 48999999998888766643311111 0 111222 36788888899899888
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNG 299 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~ 299 (382)
..|-...-...--. ........ .+..||.+ ..+++|...|.+.|+..-...-...
T Consensus 264 iRtr~~V~~l~GH~-----~~V~~V~~~~~dpqvit~-----------------S~D~tvrlWDl~agkt~~tlt~hkk- 320 (460)
T KOG0285|consen 264 IRTRASVHVLSGHT-----NPVASVMCQPTDPQVITG-----------------SHDSTVRLWDLRAGKTMITLTHHKK- 320 (460)
T ss_pred ecccceEEEecCCC-----CcceeEEeecCCCceEEe-----------------cCCceEEEeeeccCceeEeeecccc-
Confidence 88765554443110 01111112 36777776 5678899999999987766543321
Q ss_pred CCCcceEE-eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEE-EeCcee
Q 040693 300 TAPGPVTV-ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYM-GNGYKV 362 (382)
Q Consensus 300 ~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv-~~~~g~ 362 (382)
....+.. .....|+... ...+-+.+.-.|+.+-.+..-.++..+..+..+.+|+ +..+|.
T Consensus 321 -svral~lhP~e~~fASas--~dnik~w~~p~g~f~~nlsgh~~iintl~~nsD~v~~~G~dng~ 382 (460)
T KOG0285|consen 321 -SVRALCLHPKENLFASAS--PDNIKQWKLPEGEFLQNLSGHNAIINTLSVNSDGVLVSGGDNGS 382 (460)
T ss_pred -eeeEEecCCchhhhhccC--CccceeccCCccchhhccccccceeeeeeeccCceEEEcCCceE
Confidence 1112111 1124555442 5667777776677666533333455566554444444 444443
No 101
>PLN00181 protein SPA1-RELATED; Provisional
Probab=96.46 E-value=0.45 Score=51.26 Aligned_cols=140 Identities=14% Similarity=0.133 Sum_probs=82.9
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee---eCCeEEEEecCccccccccCCCCCCCCCceEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT---DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA 281 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~---~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a 281 (382)
...|+++..|+.+..+|..+++.+..+..-. ...+.... ++..++.+ ..++.|..
T Consensus 545 ~~~las~~~Dg~v~lWd~~~~~~~~~~~~H~-----~~V~~l~~~p~~~~~L~Sg-----------------s~Dg~v~i 602 (793)
T PLN00181 545 KSQVASSNFEGVVQVWDVARSQLVTEMKEHE-----KRVWSIDYSSADPTLLASG-----------------SDDGSVKL 602 (793)
T ss_pred CCEEEEEeCCCeEEEEECCCCeEEEEecCCC-----CCEEEEEEcCCCCCEEEEE-----------------cCCCEEEE
Confidence 4578888889999999999998887775321 12333333 23344444 34588999
Q ss_pred EECCCCcEEeeecCCCCCCCCcceEE--e-CCEEEEeeecCCCcEEEEeCCCCcE-eEEEecCCceecceEEeCCEEEEE
Q 040693 282 MDASNGNVLWSTADPSNGTAPGPVTV--A-NGVLFGGSTYRQGPIYAMDVKTGKI-LWSYDTGATIYGGASVSNGCIYMG 357 (382)
Q Consensus 282 ~d~~tG~~~W~~~~~~~~~~~~~~~~--~-~~~v~~~~~~~~g~l~~ld~~tG~i-lw~~~~~~~~~~~p~~~~g~lyv~ 357 (382)
+|..+++..-....... ...+.+ . +..+..++ .++.|+.+|..+++. +-...............++..+++
T Consensus 603 Wd~~~~~~~~~~~~~~~---v~~v~~~~~~g~~latgs--~dg~I~iwD~~~~~~~~~~~~~h~~~V~~v~f~~~~~lvs 677 (793)
T PLN00181 603 WSINQGVSIGTIKTKAN---ICCVQFPSESGRSLAFGS--ADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRFVDSSTLVS 677 (793)
T ss_pred EECCCCcEEEEEecCCC---eEEEEEeCCCCCEEEEEe--CCCeEEEEECCCCCccceEecCCCCCEEEEEEeCCCEEEE
Confidence 99999987766554321 112222 2 34555555 489999999987763 333322122222333345555555
Q ss_pred eCceeEeecCCccC
Q 040693 358 NGYKVTVGFGNKNF 371 (382)
Q Consensus 358 ~~~g~~~~~~~~~~ 371 (382)
.+....+.+|.+..
T Consensus 678 ~s~D~~ikiWd~~~ 691 (793)
T PLN00181 678 SSTDNTLKLWDLSM 691 (793)
T ss_pred EECCCEEEEEeCCC
Confidence 44445577777653
No 102
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=96.43 E-value=0.058 Score=52.20 Aligned_cols=51 Identities=24% Similarity=0.250 Sum_probs=39.3
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCC
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVK 329 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~ 329 (382)
..+..+.++|+..|+++=++.++.. -..+++ .+.++|+++. +|.|+.++..
T Consensus 195 S~D~t~k~wdlS~g~LLlti~fp~s---i~av~lDpae~~~yiGt~--~G~I~~~~~~ 247 (476)
T KOG0646|consen 195 SEDRTIKLWDLSLGVLLLTITFPSS---IKAVALDPAERVVYIGTE--EGKIFQNLLF 247 (476)
T ss_pred cCCceEEEEEeccceeeEEEecCCc---ceeEEEcccccEEEecCC--cceEEeeehh
Confidence 4568899999999999999988763 233334 3568999995 9999988765
No 103
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=96.39 E-value=0.12 Score=47.91 Aligned_cols=150 Identities=12% Similarity=0.112 Sum_probs=95.4
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+..|+.++.|-.+..--.++||.+-+++--.... .......++..+... ..+|+|..++.
T Consensus 318 ~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsSyv---n~a~ft~dG~~iisa-----------------SsDgtvkvW~~ 377 (508)
T KOG0275|consen 318 NSQILSASFDQTVRIHGLKSGKCLKEFRGHSSYV---NEATFTDDGHHIISA-----------------SSDGTVKVWHG 377 (508)
T ss_pred cchhhcccccceEEEeccccchhHHHhcCccccc---cceEEcCCCCeEEEe-----------------cCCccEEEecC
Confidence 4677888888888888888999998887321100 001111144545444 45688999999
Q ss_pred CCCcEEeeecCCCCCCC-CcceEEe-CC-EEEEeeecCCCcEEEEeCCCCcEeEEEecCC----ceecceE-EeCCEEEE
Q 040693 285 SNGNVLWSTADPSNGTA-PGPVTVA-NG-VLFGGSTYRQGPIYAMDVKTGKILWSYDTGA----TIYGGAS-VSNGCIYM 356 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~-~~~~~~~-~~-~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~----~~~~~p~-~~~g~lyv 356 (382)
+|++-+-++........ .+.+.+. +. -.+++. +...+|.++.. |.++-.+.-+. .+....+ ..+.-+|+
T Consensus 378 KtteC~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCN--rsntv~imn~q-GQvVrsfsSGkREgGdFi~~~lSpkGewiYc 454 (508)
T KOG0275|consen 378 KTTECLSTFKPLGTDYPVNSVILLPKNPEHFIVCN--RSNTVYIMNMQ-GQVVRSFSSGKREGGDFINAILSPKGEWIYC 454 (508)
T ss_pred cchhhhhhccCCCCcccceeEEEcCCCCceEEEEc--CCCeEEEEecc-ceEEeeeccCCccCCceEEEEecCCCcEEEE
Confidence 99988877765543222 2222222 32 233333 47899999976 88888875542 3333333 36778888
Q ss_pred EeCceeEeecCCccCCCCCeEEEE
Q 040693 357 GNGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 357 ~~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
...+++ +|+|+..+|++--..
T Consensus 455 igED~v---lYCF~~~sG~LE~tl 475 (508)
T KOG0275|consen 455 IGEDGV---LYCFSVLSGKLERTL 475 (508)
T ss_pred EccCcE---EEEEEeecCceeeee
Confidence 888877 799999999876543
No 104
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=96.36 E-value=0.19 Score=45.12 Aligned_cols=125 Identities=12% Similarity=0.194 Sum_probs=78.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccc--eee--eCCeEEEEecCccccccccCCCCCCCCCceEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWG--AAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWV 280 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~--~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~ 280 (382)
.+.|+.+.+|+.+|++|.++|++.-.++-- .++. .+. ..+.|+.+ ..+|++.
T Consensus 126 enSi~~AgGD~~~y~~dlE~G~i~r~~rGH-------tDYvH~vv~R~~~~qilsG-----------------~EDGtvR 181 (325)
T KOG0649|consen 126 ENSILFAGGDGVIYQVDLEDGRIQREYRGH-------TDYVHSVVGRNANGQILSG-----------------AEDGTVR 181 (325)
T ss_pred CCcEEEecCCeEEEEEEecCCEEEEEEcCC-------cceeeeeeecccCcceeec-----------------CCCccEE
Confidence 455666668999999999999999888732 2222 111 34555554 4569999
Q ss_pred EEECCCCcEEeeecCCCC--------CCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCC
Q 040693 281 AMDASNGNVLWSTADPSN--------GTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNG 352 (382)
Q Consensus 281 a~d~~tG~~~W~~~~~~~--------~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g 352 (382)
..|.+|+|-.=.+..... ..+-+.+..+.+.++++. .-.+..+++.+-+..-.+++++... ....+++
T Consensus 182 vWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGg---Gp~lslwhLrsse~t~vfpipa~v~-~v~F~~d 257 (325)
T KOG0649|consen 182 VWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGG---GPKLSLWHLRSSESTCVFPIPARVH-LVDFVDD 257 (325)
T ss_pred EEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEecC---CCceeEEeccCCCceEEEeccccee-Eeeeecc
Confidence 999999998777654331 112244444455565554 4556666777677777777765322 2333555
Q ss_pred EEEEE
Q 040693 353 CIYMG 357 (382)
Q Consensus 353 ~lyv~ 357 (382)
.+.++
T Consensus 258 ~vl~~ 262 (325)
T KOG0649|consen 258 CVLIG 262 (325)
T ss_pred eEEEe
Confidence 55554
No 105
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=96.36 E-value=0.071 Score=51.00 Aligned_cols=225 Identities=13% Similarity=0.054 Sum_probs=122.6
Q ss_pred ccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCC
Q 040693 47 FELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQ 126 (382)
Q Consensus 47 ~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~ 126 (382)
+...|.++-.+..+|+.||+.+=.+.-+-... .++.+-.+++..++.+..
T Consensus 283 yLlaCg~~e~~~lwDv~tgd~~~~y~~~~~~S------------~~sc~W~pDg~~~V~Gs~------------------ 332 (519)
T KOG0293|consen 283 YLLACGFDEVLSLWDVDTGDLRHLYPSGLGFS------------VSSCAWCPDGFRFVTGSP------------------ 332 (519)
T ss_pred eEEecCchHheeeccCCcchhhhhcccCcCCC------------cceeEEccCCceeEecCC------------------
Confidence 34556667778888999998877765431100 124455556555555432
Q ss_pred CCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEee--eCcee
Q 040693 127 TTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMY--RNKVK 204 (382)
Q Consensus 127 ~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~--~~g~~ 204 (382)
+..+++.|. +|+++-..+.. ..|.+.++. .||
T Consensus 333 ---------------dr~i~~wdl-Dgn~~~~W~gv----------------------------r~~~v~dlait~Dg-- 366 (519)
T KOG0293|consen 333 ---------------DRTIIMWDL-DGNILGNWEGV----------------------------RDPKVHDLAITYDG-- 366 (519)
T ss_pred ---------------CCcEEEecC-Ccchhhccccc----------------------------ccceeEEEEEcCCC--
Confidence 488999997 58885333222 235566543 333
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
..++..+.|..+..++.++-..+=..+...+ -......-++..+.+. .....++..|.
T Consensus 367 -k~vl~v~~d~~i~l~~~e~~~dr~lise~~~----its~~iS~d~k~~Lvn-----------------L~~qei~LWDl 424 (519)
T KOG0293|consen 367 -KYVLLVTVDKKIRLYNREARVDRGLISEEQP----ITSFSISKDGKLALVN-----------------LQDQEIHLWDL 424 (519)
T ss_pred -cEEEEEecccceeeechhhhhhhccccccCc----eeEEEEcCCCcEEEEE-----------------cccCeeEEeec
Confidence 4555555677788888776433322222111 0111111144445554 33466888887
Q ss_pred CCCcEEeeecCCCC-CCCCcceEEeC-CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceE--EeCCEEEEEeCc
Q 040693 285 SNGNVLWSTADPSN-GTAPGPVTVAN-GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGAS--VSNGCIYMGNGY 360 (382)
Q Consensus 285 ~tG~~~W~~~~~~~-~~~~~~~~~~~-~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~--~~~g~lyv~~~~ 360 (382)
+.-++.-++..... .+.-... +.+ +--|+++..++++||..+..+|+++-...-........+ ..+-++|.+.++
T Consensus 425 ~e~~lv~kY~Ghkq~~fiIrSC-Fgg~~~~fiaSGSED~kvyIWhr~sgkll~~LsGHs~~vNcVswNP~~p~m~ASasD 503 (519)
T KOG0293|consen 425 EENKLVRKYFGHKQGHFIIRSC-FGGGNDKFIASGSEDSKVYIWHRISGKLLAVLSGHSKTVNCVSWNPADPEMFASASD 503 (519)
T ss_pred chhhHHHHhhcccccceEEEec-cCCCCcceEEecCCCceEEEEEccCCceeEeecCCcceeeEEecCCCCHHHhhccCC
Confidence 75555444443321 1111111 222 224555444699999999999999987643222111111 156777887777
Q ss_pred eeEeecCCcc
Q 040693 361 KVTVGFGNKN 370 (382)
Q Consensus 361 g~~~~~~~~~ 370 (382)
...+.++...
T Consensus 504 DgtIRIWg~~ 513 (519)
T KOG0293|consen 504 DGTIRIWGPS 513 (519)
T ss_pred CCeEEEecCC
Confidence 7766666544
No 106
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=96.34 E-value=0.83 Score=42.35 Aligned_cols=73 Identities=14% Similarity=0.189 Sum_probs=52.2
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCC--CCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGL--GGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~--~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
+..|+..+..+.++.+|+=+|.++-.++..+... .....+.| |+..|+.+ ..+|+|+++
T Consensus 199 GK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ftP--ds~Fvl~g-----------------s~dg~i~vw 259 (311)
T KOG1446|consen 199 GKSILLSTNASFIYLLDAFDGTVKSTFSGYPNAGNLPLSATFTP--DSKFVLSG-----------------SDDGTIHVW 259 (311)
T ss_pred CCEEEEEeCCCcEEEEEccCCcEeeeEeeccCCCCcceeEEECC--CCcEEEEe-----------------cCCCcEEEE
Confidence 3678888889999999999999888887554211 11111222 66666666 456999999
Q ss_pred ECCCCcEEeeecCC
Q 040693 283 DASNGNVLWSTADP 296 (382)
Q Consensus 283 d~~tG~~~W~~~~~ 296 (382)
++++|+++-+...+
T Consensus 260 ~~~tg~~v~~~~~~ 273 (311)
T KOG1446|consen 260 NLETGKKVAVLRGP 273 (311)
T ss_pred EcCCCcEeeEecCC
Confidence 99999987776654
No 107
>PRK04922 tolB translocation protein TolB; Provisional
Probab=96.34 E-value=1.3 Score=44.09 Aligned_cols=113 Identities=17% Similarity=0.157 Sum_probs=56.6
Q ss_pred cEEEEEccCc--EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKSG--FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~g--~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.+++.....+ .++.+|..+|+..--...+. ......|.+ ++..|++...+ .....|..+|
T Consensus 305 ~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g~--~~~~~~~Sp--DG~~Ia~~~~~--------------~~~~~I~v~d 366 (433)
T PRK04922 305 SIYFTSDRGGRPQIYRVAASGGSAERLTFQGN--YNARASVSP--DGKKIAMVHGS--------------GGQYRIAVMD 366 (433)
T ss_pred EEEEEECCCCCceEEEEECCCCCeEEeecCCC--CccCEEECC--CCCEEEEEECC--------------CCceeEEEEE
Confidence 3444443333 59999988886532211110 111122222 66666665322 1123689999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeC-CEEEEeee-cCCCcEEEEeCCCCcEeEEEecC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVAN-GVLFGGST-YRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~-~~v~~~~~-~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
+.+|+..--.... ....+....+ ..++..+. .....|+.+|.+ |+...+...+
T Consensus 367 ~~~g~~~~Lt~~~---~~~~p~~spdG~~i~~~s~~~g~~~L~~~~~~-g~~~~~l~~~ 421 (433)
T PRK04922 367 LSTGSVRTLTPGS---LDESPSFAPNGSMVLYATREGGRGVLAAVSTD-GRVRQRLVSA 421 (433)
T ss_pred CCCCCeEECCCCC---CCCCceECCCCCEEEEEEecCCceEEEEEECC-CCceEEcccC
Confidence 9999876322211 1122332333 44444442 123569999985 6666655543
No 108
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=96.21 E-value=0.91 Score=41.55 Aligned_cols=112 Identities=12% Similarity=0.069 Sum_probs=72.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+.+|+.++.|..+...|..+-+++-.+.-- .+........-++.+-++. ..+|.++..|+
T Consensus 162 ~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh----~~~v~t~~vSpDGslcasG----------------gkdg~~~LwdL 221 (315)
T KOG0279|consen 162 NPIIVSASWDKTVKVWNLRNCQLRTTFIGH----SGYVNTVTVSPDGSLCASG----------------GKDGEAMLWDL 221 (315)
T ss_pred CcEEEEccCCceEEEEccCCcchhhccccc----cccEEEEEECCCCCEEecC----------------CCCceEEEEEc
Confidence 566778888999999999887766555421 1112233333455555542 34688999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEEe-CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
..||-+...+... ....+.+. +.+..++.. ...|..+|.++++++..+++..
T Consensus 222 ~~~k~lysl~a~~---~v~sl~fspnrywL~~at--~~sIkIwdl~~~~~v~~l~~d~ 274 (315)
T KOG0279|consen 222 NEGKNLYSLEAFD---IVNSLCFSPNRYWLCAAT--ATSIKIWDLESKAVVEELKLDG 274 (315)
T ss_pred cCCceeEeccCCC---eEeeEEecCCceeEeecc--CCceEEEeccchhhhhhccccc
Confidence 9999976666432 12233344 445555543 6678889999999988876653
No 109
>PHA03098 kelch-like protein; Provisional
Probab=96.06 E-value=2.1 Score=43.80 Aligned_cols=106 Identities=13% Similarity=0.076 Sum_probs=55.9
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA 294 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~ 294 (382)
..+..+|+.+. .|+.-.+.+. .......+..++.+|+..+...... . .....+..+|++++ .|+.-
T Consensus 406 ~~v~~yd~~t~--~W~~~~~~p~--~r~~~~~~~~~~~iyv~GG~~~~~~----~----~~~~~v~~yd~~~~--~W~~~ 471 (534)
T PHA03098 406 KTVECFSLNTN--KWSKGSPLPI--SHYGGCAIYHDGKIYVIGGISYIDN----I----KVYNIVESYNPVTN--KWTEL 471 (534)
T ss_pred ceEEEEeCCCC--eeeecCCCCc--cccCceEEEECCEEEEECCccCCCC----C----cccceEEEecCCCC--ceeeC
Confidence 35888998764 5876433211 1112223446788888644311000 0 01234899999875 57653
Q ss_pred C--CCCCCCCcceEEeCCEEEEeeecC----CCcEEEEeCCCCcEeEEE
Q 040693 295 D--PSNGTAPGPVTVANGVLFGGSTYR----QGPIYAMDVKTGKILWSY 337 (382)
Q Consensus 295 ~--~~~~~~~~~~~~~~~~v~~~~~~~----~g~l~~ld~~tG~ilw~~ 337 (382)
. +.+. ........++.+|+..... ...++++|+++. .|+.
T Consensus 472 ~~~~~~r-~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~~~--~W~~ 517 (534)
T PHA03098 472 SSLNFPR-INASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTN--TWTL 517 (534)
T ss_pred CCCCccc-ccceEEEECCEEEEEcCCcCCcccceeEEEeCCCC--EEEe
Confidence 2 2222 2233334467777654211 246889998855 4764
No 110
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=95.99 E-value=1.3 Score=41.02 Aligned_cols=194 Identities=16% Similarity=0.167 Sum_probs=112.9
Q ss_pred ceEEEEECCCCcEEEEEecCCCc---------------ccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecE
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYD---------------VWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDI 207 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~ 207 (382)
+.-.-+|.++++.+.+++..+.. ...|++ ++--.|.++.-..++ +..
T Consensus 9 aeahfi~~~d~~~iY~felvG~~P~SGGDTYNAV~~vDd~IyFG----------------GWVHAPa~y~gk~~g--~~~ 70 (339)
T PF09910_consen 9 AEAHFIDRDDSEKIYRFELVGPPPTSGGDTYNAVEWVDDFIYFG----------------GWVHAPAVYEGKGDG--RAT 70 (339)
T ss_pred eeeEEEecCCceEEEEeeeccCCCCCCCccceeeeeecceEEEe----------------eeecCCceeeeccCC--ceE
Confidence 45556778889999999865431 111211 222355555443333 344
Q ss_pred EEEEccCcEEEEEeCCCCC--eeeeeccCCCCCCCCcccceee-------eCCeEEEEecCccccccccCCCCCCCCCce
Q 040693 208 VVAVQKSGFAWALDRDSGS--LIWSMEAGPGGLGGGAMWGAAT-------DERRIYTNIANSQHKNFNLKPSKNSTIAGG 278 (382)
Q Consensus 208 v~~~~~~g~l~ald~~tG~--~~W~~~~~~~~~~g~~~~~~~~-------~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~ 278 (382)
|-+..+-+.++.+|.++++ ++|+.....+ ..|...+ .++.|+++..++. ..=.
T Consensus 71 IdF~NKYSHVH~yd~e~~~VrLLWkesih~~-----~~WaGEVSdIlYdP~~D~LLlAR~DGh-------------~nLG 132 (339)
T PF09910_consen 71 IDFRNKYSHVHEYDTENDSVRLLWKESIHDK-----TKWAGEVSDILYDPYEDRLLLARADGH-------------ANLG 132 (339)
T ss_pred EEEeeccceEEEEEcCCCeEEEEEecccCCc-----cccccchhheeeCCCcCEEEEEecCCc-------------ceee
Confidence 5555556789999999886 6799875432 4444332 3566777644311 1123
Q ss_pred EEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeee---cCCCcEEEEeCCCCcEeEEE-ecCCc---------eec
Q 040693 279 WVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGST---YRQGPIYAMDVKTGKILWSY-DTGAT---------IYG 345 (382)
Q Consensus 279 v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~---~~~g~l~~ld~~tG~ilw~~-~~~~~---------~~~ 345 (382)
|+.+|.++|+.++-.+.+.. -+.+ +. +.++.... .....|+|+|+.+||.+.+. +.+.. ..+
T Consensus 133 vy~ldr~~g~~~~L~~~ps~---KG~~-~~-D~a~F~i~~~~~g~~~i~~~Dli~~~~~~e~f~~~~s~Dg~~~~~~~~G 207 (339)
T PF09910_consen 133 VYSLDRRTGKAEKLSSNPSL---KGTL-VH-DYACFGINNFHKGVSGIHCLDLISGKWVIESFDVSLSVDGGPVIRPELG 207 (339)
T ss_pred eEEEcccCCceeeccCCCCc---CceE-ee-eeEEEeccccccCCceEEEEEccCCeEEEEecccccCCCCCceEeeccc
Confidence 99999999999998887652 2222 22 33333322 12457999999999874322 22211 113
Q ss_pred ceEEeCCEEEEEeCceeEeecCCccCCCCCeEEEEE
Q 040693 346 GASVSNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 346 ~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
..+-..+|+|.....|. +..|+..++..--|+
T Consensus 208 ~~~s~ynR~faF~rGGi----~vgnP~~~e~~~f~R 239 (339)
T PF09910_consen 208 AMASAYNRLFAFVRGGI----FVGNPYNGEEFRFYR 239 (339)
T ss_pred cEEEEeeeEEEEEeccE----EEeCCCCCCceeEEE
Confidence 33446788888766544 447887777654444
No 111
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=95.99 E-value=1.8 Score=42.56 Aligned_cols=144 Identities=15% Similarity=0.088 Sum_probs=84.2
Q ss_pred EEEEEccCc-EEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 207 IVVAVQKSG-FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 207 ~v~~~~~~g-~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
.++.++++| .+-.+|..+|++.-.... .|.....-.. ++..+.++ .....++.+|+
T Consensus 373 ~~vigt~dgD~l~iyd~~~~e~kr~e~~-----lg~I~av~vs~dGK~~vva-----------------Ndr~el~vidi 430 (668)
T COG4946 373 GDVIGTNDGDKLGIYDKDGGEVKRIEKD-----LGNIEAVKVSPDGKKVVVA-----------------NDRFELWVIDI 430 (668)
T ss_pred ceEEeccCCceEEEEecCCceEEEeeCC-----ccceEEEEEcCCCcEEEEE-----------------cCceEEEEEEe
Confidence 566788888 899999999884433321 1222222222 44556665 33477999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEEeCCEEEEeee---cCCCcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEeC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVANGVLFGGST---YRQGPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGNG 359 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~---~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~~ 359 (382)
.||++.=.-..... .........+.+.++=+. .....|..+|..+||+. ...++.+.-.+|+. .+..||..+.
T Consensus 431 dngnv~~idkS~~~-lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~~~Kiy-~vTT~ta~DfsPaFD~d~ryLYfLs~ 508 (668)
T COG4946 431 DNGNVRLIDKSEYG-LITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMDGGKIY-DVTTPTAYDFSPAFDPDGRYLYFLSA 508 (668)
T ss_pred cCCCeeEecccccc-eeEEEEEcCCceeEEEecCcceeeeeEEEEecCCCeEE-EecCCcccccCcccCCCCcEEEEEec
Confidence 99987633222211 111111223443333221 12456888999877754 45566666667776 5667777755
Q ss_pred ceeEeecCCccCCCCCeEEEEE
Q 040693 360 YKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 360 ~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
. . +|+.+-+..+.|.
T Consensus 509 R-s------LdPs~Drv~fnf~ 523 (668)
T COG4946 509 R-S------LDPSNDRVIFNFS 523 (668)
T ss_pred c-c------cCCCCCeeEEEEE
Confidence 3 3 7888877777764
No 112
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=95.96 E-value=0.42 Score=46.45 Aligned_cols=133 Identities=16% Similarity=0.141 Sum_probs=82.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+..++.++-.|.+|++...||+++-.+.. .......... +++..+++.+ .+|.|+++.
T Consensus 93 G~~l~ag~i~g~lYlWelssG~LL~v~~a-----HYQ~ITcL~fs~dgs~iiTgs----------------kDg~V~vW~ 151 (476)
T KOG0646|consen 93 GYFLLAGTISGNLYLWELSSGILLNVLSA-----HYQSITCLKFSDDGSHIITGS----------------KDGAVLVWL 151 (476)
T ss_pred ceEEEeecccCcEEEEEeccccHHHHHHh-----hccceeEEEEeCCCcEEEecC----------------CCccEEEEE
Confidence 45666777899999999999998866541 1112222222 5566665533 345555544
Q ss_pred CC---------CCcEEeeecCCCCCCCCcceEEe----CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-
Q 040693 284 AS---------NGNVLWSTADPSNGTAPGPVTVA----NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV- 349 (382)
Q Consensus 284 ~~---------tG~~~W~~~~~~~~~~~~~~~~~----~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~- 349 (382)
+. +=+++.......-. ..-+.+. +.++|.++. +..+.+.|...|+++-++-.+.... +.++
T Consensus 152 l~~lv~a~~~~~~~p~~~f~~Htls--ITDl~ig~Gg~~~rl~TaS~--D~t~k~wdlS~g~LLlti~fp~si~-av~lD 226 (476)
T KOG0646|consen 152 LTDLVSADNDHSVKPLHIFSDHTLS--ITDLQIGSGGTNARLYTASE--DRTIKLWDLSLGVLLLTITFPSSIK-AVALD 226 (476)
T ss_pred EEeecccccCCCccceeeeccCcce--eEEEEecCCCccceEEEecC--CceEEEEEeccceeeEEEecCCcce-eEEEc
Confidence 33 22334333332211 1111121 358888874 8889999999999999988776443 3333
Q ss_pred -eCCEEEEEeCceeE
Q 040693 350 -SNGCIYMGNGYKVT 363 (382)
Q Consensus 350 -~~g~lyv~~~~g~~ 363 (382)
.+-++|++++.|.+
T Consensus 227 pae~~~yiGt~~G~I 241 (476)
T KOG0646|consen 227 PAERVVYIGTEEGKI 241 (476)
T ss_pred ccccEEEecCCcceE
Confidence 78999999999985
No 113
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=95.96 E-value=1.8 Score=42.38 Aligned_cols=101 Identities=16% Similarity=0.104 Sum_probs=50.9
Q ss_pred EEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee
Q 040693 216 FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
.++.+|..+++..- .... +.....+.. ++..+++.... .....|+.+|+.+++..-..
T Consensus 303 ~iy~~d~~~~~~~~-l~~~-----~~~~~~~~~spdg~~i~~~~~~--------------~~~~~i~~~d~~~~~~~~l~ 362 (417)
T TIGR02800 303 QIYMMDADGGEVRR-LTFR-----GGYNASPSWSPDGDLIAFVHRE--------------GGGFNIAVMDLDGGGERVLT 362 (417)
T ss_pred eEEEEECCCCCEEE-eecC-----CCCccCeEECCCCCEEEEEEcc--------------CCceEEEEEeCCCCCeEEcc
Confidence 79999998877432 2211 111222222 55666665322 11247999999987654332
Q ss_pred cCCCCCCCCcceEE-eCCEEEEeeecCC-CcEEEEeCCCCcEeEEEecC
Q 040693 294 ADPSNGTAPGPVTV-ANGVLFGGSTYRQ-GPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 294 ~~~~~~~~~~~~~~-~~~~v~~~~~~~~-g~l~~ld~~tG~ilw~~~~~ 340 (382)
... ....+... +++.+++...... ..++.++ .+|+...+...+
T Consensus 363 ~~~---~~~~p~~spdg~~l~~~~~~~~~~~l~~~~-~~g~~~~~~~~~ 407 (417)
T TIGR02800 363 DTG---LDESPSFAPNGRMILYATTRGGRGVLGLVS-TDGRFRARLPLG 407 (417)
T ss_pred CCC---CCCCceECCCCCEEEEEEeCCCcEEEEEEE-CCCceeeECCCC
Confidence 211 11222222 4555655554112 2355444 568877776654
No 114
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=95.92 E-value=1.5 Score=43.89 Aligned_cols=143 Identities=15% Similarity=0.196 Sum_probs=83.7
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
+..|+.+..|+.++..|..+|+..-....-.. ....... ++..+..+ ..++.+..+
T Consensus 258 g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~-----~is~~~f~~d~~~l~s~-----------------s~d~~i~vw 315 (456)
T KOG0266|consen 258 GNLLVSGSDDGTVRIWDVRTGECVRKLKGHSD-----GISGLAFSPDGNLLVSA-----------------SYDGTIRVW 315 (456)
T ss_pred CCEEEEecCCCcEEEEeccCCeEEEeeeccCC-----ceEEEEECCCCCEEEEc-----------------CCCccEEEE
Confidence 37889999999999999999998887774321 1222122 44545444 346899999
Q ss_pred ECCCCcEE--eeecCCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC----ceecceEEeCCEE
Q 040693 283 DASNGNVL--WSTADPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA----TIYGGASVSNGCI 354 (382)
Q Consensus 283 d~~tG~~~--W~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~----~~~~~p~~~~g~l 354 (382)
|..+|+.+ -.............+.+ .+.+++.+. .++.+...|...++.+-.+.... .++......+++.
T Consensus 316 d~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~--~d~~~~~w~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (456)
T KOG0266|consen 316 DLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSAS--LDRTLKLWDLRSGKSVGTYTGHSNLVRCIFSPTLSTGGKL 393 (456)
T ss_pred ECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEec--CCCeEEEEEccCCcceeeecccCCcceeEecccccCCCCe
Confidence 99999954 22221111101122222 235666666 37788888988887765554322 1222222445666
Q ss_pred EEEeCceeEeecCCccC
Q 040693 355 YMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 355 yv~~~~g~~~~~~~~~~ 371 (382)
.++.+....+.++....
T Consensus 394 i~sg~~d~~v~~~~~~s 410 (456)
T KOG0266|consen 394 IYSGSEDGSVYVWDSSS 410 (456)
T ss_pred EEEEeCCceEEEEeCCc
Confidence 66555445455555443
No 115
>PRK02888 nitrous-oxide reductase; Validated
Probab=95.90 E-value=1.3 Score=45.44 Aligned_cols=185 Identities=10% Similarity=0.026 Sum_probs=99.8
Q ss_pred CCCCCCCCCCC--CCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCcee
Q 040693 127 TTPTSPDKCIE--PENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVK 204 (382)
Q Consensus 127 ~~~~~~~~~~~--~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~ 204 (382)
..|...+|+.. ...+.+.+.+||++|-++.|+....... -......+
T Consensus 197 ~~PlpnDGk~l~~~~ey~~~vSvID~etmeV~~qV~Vdgnp----------------------------d~v~~spd--- 245 (635)
T PRK02888 197 RIPLPNDGKDLDDPKKYRSLFTAVDAETMEVAWQVMVDGNL----------------------------DNVDTDYD--- 245 (635)
T ss_pred ccccCCCCCEeecccceeEEEEEEECccceEEEEEEeCCCc----------------------------ccceECCC---
Confidence 34444455432 2456789999999999999999886531 11111111
Q ss_pred ecEEEEEccC----cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEE
Q 040693 205 HDIVVAVQKS----GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWV 280 (382)
Q Consensus 205 ~~~v~~~~~~----g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~ 280 (382)
+..+++..++ ..+..++..+-..+..++... .....-+++..++. +++|.
T Consensus 246 Gk~afvTsyNsE~G~tl~em~a~e~d~~vvfni~~-------iea~vkdGK~~~V~-------------------gn~V~ 299 (635)
T PRK02888 246 GKYAFSTCYNSEEGVTLAEMMAAERDWVVVFNIAR-------IEEAVKAGKFKTIG-------------------GSKVP 299 (635)
T ss_pred CCEEEEeccCcccCcceeeeccccCceEEEEchHH-------HHHhhhCCCEEEEC-------------------CCEEE
Confidence 3444444321 234444443333222222110 00111144555552 36799
Q ss_pred EEECCC----C-cEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcE------------eEEEecCCce
Q 040693 281 AMDASN----G-NVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKI------------LWSYDTGATI 343 (382)
Q Consensus 281 a~d~~t----G-~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~i------------lw~~~~~~~~ 343 (382)
.+|.++ + +.+-.++.+.. ...-.+.-++.++|++.. .+..+.+||.++-+. .-+.+++-+=
T Consensus 300 VID~~t~~~~~~~v~~yIPVGKs-PHGV~vSPDGkylyVank-lS~tVSVIDv~k~k~~~~~~~~~~~~vvaevevGlGP 377 (635)
T PRK02888 300 VVDGRKAANAGSALTRYVPVPKN-PHGVNTSPDGKYFIANGK-LSPTVTVIDVRKLDDLFDGKIKPRDAVVAEPELGLGP 377 (635)
T ss_pred EEECCccccCCcceEEEEECCCC-ccceEECCCCCEEEEeCC-CCCcEEEEEChhhhhhhhccCCccceEEEeeccCCCc
Confidence 999999 4 67777787773 345555445677887763 578899999988664 3343332211
Q ss_pred ecceEE-eCCEEEEEeCceeEeecCCccC
Q 040693 344 YGGASV-SNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 344 ~~~p~~-~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
. +..+ .+|..|++-.-..++..|+++.
T Consensus 378 L-HTaFDg~G~aytslf~dsqv~kwn~~~ 405 (635)
T PRK02888 378 L-HTAFDGRGNAYTTLFLDSQIVKWNIEA 405 (635)
T ss_pred c-eEEECCCCCEEEeEeecceeEEEehHH
Confidence 1 1111 3456777655444455555443
No 116
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=95.90 E-value=1.3 Score=40.07 Aligned_cols=143 Identities=11% Similarity=0.024 Sum_probs=80.8
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccc-eeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWG-AATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~-~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+.-+|.++.||.+...|...-...-.++.+.+ .... +--....++++ +++|.|...|
T Consensus 95 grWMyTgseDgt~kIWdlR~~~~qR~~~~~sp-----Vn~vvlhpnQteLis~-----------------dqsg~irvWD 152 (311)
T KOG0315|consen 95 GRWMYTGSEDGTVKIWDLRSLSCQRNYQHNSP-----VNTVVLHPNQTELISG-----------------DQSGNIRVWD 152 (311)
T ss_pred CeEEEecCCCceEEEEeccCcccchhccCCCC-----cceEEecCCcceEEee-----------------cCCCcEEEEE
Confidence 35688888899888888776444444443321 1111 11133445555 6789999999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCCCC------cEeEEEecCCceecc-eEEeCCEE
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKTG------KILWSYDTGATIYGG-ASVSNGCI 354 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG------~ilw~~~~~~~~~~~-p~~~~g~l 354 (382)
+.+-.-.-+ .+|........+.+ ++.++.++.. .|..|+.+.-++ +++-+++...+..-. ...-+++.
T Consensus 153 l~~~~c~~~-liPe~~~~i~sl~v~~dgsml~a~nn--kG~cyvW~l~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~ 229 (311)
T KOG0315|consen 153 LGENSCTHE-LIPEDDTSIQSLTVMPDGSMLAAANN--KGNCYVWRLLNHQTASELEPVHKFQAHNGHILRCLLSPDVKY 229 (311)
T ss_pred ccCCccccc-cCCCCCcceeeEEEcCCCcEEEEecC--CccEEEEEccCCCccccceEhhheecccceEEEEEECCCCcE
Confidence 975522211 12222222223333 4556666664 899999988754 334444444443333 33356676
Q ss_pred EEEeCceeEeecCCccCC
Q 040693 355 YMGNGYKVTVGFGNKNFT 372 (382)
Q Consensus 355 yv~~~~g~~~~~~~~~~~ 372 (382)
.++.+....+++++.+.-
T Consensus 230 lat~ssdktv~iwn~~~~ 247 (311)
T KOG0315|consen 230 LATCSSDKTVKIWNTDDF 247 (311)
T ss_pred EEeecCCceEEEEecCCc
Confidence 666666677888876653
No 117
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=95.89 E-value=0.076 Score=47.69 Aligned_cols=100 Identities=12% Similarity=0.095 Sum_probs=74.6
Q ss_pred CCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEE
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCI 354 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~l 354 (382)
.+++|...|.+||++.-+...+.. . .+.-+..++.++..+. .+.|...|+++-+++-.++++..+-+.-+--+..+
T Consensus 163 dd~tVRLWD~rTgt~v~sL~~~s~-V-tSlEvs~dG~ilTia~--gssV~Fwdaksf~~lKs~k~P~nV~SASL~P~k~~ 238 (334)
T KOG0278|consen 163 DDKTVRLWDHRTGTEVQSLEFNSP-V-TSLEVSQDGRILTIAY--GSSVKFWDAKSFGLLKSYKMPCNVESASLHPKKEF 238 (334)
T ss_pred cCCceEEEEeccCcEEEEEecCCC-C-cceeeccCCCEEEEec--CceeEEeccccccceeeccCccccccccccCCCce
Confidence 458899999999999999888763 1 2222234555555553 78899999999999999999987666555556677
Q ss_pred EEEeCceeEeecCCccCCCCCeEEEE
Q 040693 355 YMGNGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 355 yv~~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
||+.+.. ...|.+|..||+.+-+|
T Consensus 239 fVaGged--~~~~kfDy~TgeEi~~~ 262 (334)
T KOG0278|consen 239 FVAGGED--FKVYKFDYNTGEEIGSY 262 (334)
T ss_pred EEecCcc--eEEEEEeccCCceeeec
Confidence 7765532 35688999999999876
No 118
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=95.89 E-value=0.059 Score=52.37 Aligned_cols=104 Identities=11% Similarity=0.056 Sum_probs=70.5
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeC-CeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDE-RRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~-~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+..++.++.|.++..+|.+||+.+-+++.+.. .....+. .++ +.++++ ..++.|..+|
T Consensus 270 g~~fLS~sfD~~lKlwDtETG~~~~~f~~~~~--~~cvkf~--pd~~n~fl~G-----------------~sd~ki~~wD 328 (503)
T KOG0282|consen 270 GTSFLSASFDRFLKLWDTETGQVLSRFHLDKV--PTCVKFH--PDNQNIFLVG-----------------GSDKKIRQWD 328 (503)
T ss_pred CCeeeeeecceeeeeeccccceEEEEEecCCC--ceeeecC--CCCCcEEEEe-----------------cCCCcEEEEe
Confidence 45678889999999999999999999987642 1111122 255 445554 4468899999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTG 331 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG 331 (382)
.++|+++-++....+.. ..-..++++.-|+.+. +++.+...+-..+
T Consensus 329 iRs~kvvqeYd~hLg~i-~~i~F~~~g~rFissS-Ddks~riWe~~~~ 374 (503)
T KOG0282|consen 329 IRSGKVVQEYDRHLGAI-LDITFVDEGRRFISSS-DDKSVRIWENRIP 374 (503)
T ss_pred ccchHHHHHHHhhhhhe-eeeEEccCCceEeeec-cCccEEEEEcCCC
Confidence 99999998887655322 2223356777777765 5666666665443
No 119
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=95.80 E-value=2.6 Score=43.55 Aligned_cols=236 Identities=15% Similarity=0.185 Sum_probs=127.7
Q ss_pred CCcCCCCceeeeeecCcCccceeeeceEEEcCEEEEeccCcc-ccccccccccccceEEEEeCccCceeeeeeccCCCCC
Q 040693 1 AVKRSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIE-EGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFG 79 (382)
Q Consensus 1 ald~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~-~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~ 79 (382)
++|++++ .|..-.+- +......+-++.++.||+...... .. .-..+..+|+.+.+ |+. +.+-...
T Consensus 305 ~yd~~~~--~w~~~a~m-~~~r~~~~~~~~~~~lYv~GG~~~~~~--------~l~~ve~YD~~~~~--W~~-~a~M~~~ 370 (571)
T KOG4441|consen 305 CYDPKTN--EWSSLAPM-PSPRCRVGVAVLNGKLYVVGGYDSGSD--------RLSSVERYDPRTNQ--WTP-VAPMNTK 370 (571)
T ss_pred EecCCcC--cEeecCCC-CcccccccEEEECCEEEEEccccCCCc--------ccceEEEecCCCCc--eec-cCCccCc
Confidence 4677766 45443221 122235567788999999665331 11 13578999998888 997 3221111
Q ss_pred CCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEE
Q 040693 80 KLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYK 159 (382)
Q Consensus 80 ~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~ 159 (382)
+ . ...+..-.+.||+..|.. ....-..+-++|+. +-.|+.
T Consensus 371 R----~-------~~~v~~l~g~iYavGG~d---------------------------g~~~l~svE~YDp~--~~~W~~ 410 (571)
T KOG4441|consen 371 R----S-------DFGVAVLDGKLYAVGGFD---------------------------GEKSLNSVECYDPV--TNKWTP 410 (571)
T ss_pred c----c-------cceeEEECCEEEEEeccc---------------------------cccccccEEEecCC--CCcccc
Confidence 1 1 122222346888877641 11223567888875 445765
Q ss_pred ecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc-C------cEEEEEeCCCCCeeeeec
Q 040693 160 QLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK-S------GFAWALDRDSGSLIWSME 232 (382)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~-~------g~l~ald~~tG~~~W~~~ 232 (382)
-.+-.. ++ ....+. +-++.||+..+ + ..+.++|+.|. .|+..
T Consensus 411 va~m~~-----------~r------------~~~gv~------~~~g~iYi~GG~~~~~~~l~sve~YDP~t~--~W~~~ 459 (571)
T KOG4441|consen 411 VAPMLT-----------RR------------SGHGVA------VLGGKLYIIGGGDGSSNCLNSVECYDPETN--TWTLI 459 (571)
T ss_pred cCCCCc-----------ce------------eeeEEE------EECCEEEEEcCcCCCccccceEEEEcCCCC--ceeec
Confidence 332110 00 111111 11566665443 2 45888999864 67765
Q ss_pred cCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec--CCCCCCCCcceEEeCC
Q 040693 233 AGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA--DPSNGTAPGPVTVANG 310 (382)
Q Consensus 233 ~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~--~~~~~~~~~~~~~~~~ 310 (382)
.+- .......+.++.++.||+..+..+. .....+-++|+++-+ |+.- ... .....-+.+.++
T Consensus 460 ~~M--~~~R~~~g~a~~~~~iYvvGG~~~~-----------~~~~~VE~ydp~~~~--W~~v~~m~~-~rs~~g~~~~~~ 523 (571)
T KOG4441|consen 460 APM--NTRRSGFGVAVLNGKIYVVGGFDGT-----------SALSSVERYDPETNQ--WTMVAPMTS-PRSAVGVVVLGG 523 (571)
T ss_pred CCc--ccccccceEEEECCEEEEECCccCC-----------CccceEEEEcCCCCc--eeEcccCcc-ccccccEEEECC
Confidence 432 1223444566688999998554220 122448899997654 5543 322 233444456678
Q ss_pred EEEEeeec----CCCcEEEEeCCCCcEeEEEec
Q 040693 311 VLFGGSTY----RQGPIYAMDVKTGKILWSYDT 339 (382)
Q Consensus 311 ~v~~~~~~----~~g~l~~ld~~tG~ilw~~~~ 339 (382)
.+|+.... .-..+-++|+.+.+ |+...
T Consensus 524 ~ly~vGG~~~~~~l~~ve~ydp~~d~--W~~~~ 554 (571)
T KOG4441|consen 524 KLYAVGGFDGNNNLNTVECYDPETDT--WTEVT 554 (571)
T ss_pred EEEEEecccCccccceeEEcCCCCCc--eeeCC
Confidence 88876531 12357788887655 66543
No 120
>PRK03629 tolB translocation protein TolB; Provisional
Probab=95.78 E-value=2.3 Score=42.20 Aligned_cols=109 Identities=12% Similarity=0.068 Sum_probs=53.4
Q ss_pred EEEEEccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 207 IVVAVQKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 207 ~v~~~~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
++++.... -.++.+|.++|+..--.... .....+.. ++..+++.... .....|+.+
T Consensus 301 I~f~s~~~g~~~Iy~~d~~~g~~~~lt~~~------~~~~~~~~SpDG~~Ia~~~~~--------------~g~~~I~~~ 360 (429)
T PRK03629 301 LAYTSDQAGRPQVYKVNINGGAPQRITWEG------SQNQDADVSSDGKFMVMVSSN--------------GGQQHIAKQ 360 (429)
T ss_pred EEEEeCCCCCceEEEEECCCCCeEEeecCC------CCccCEEECCCCCEEEEEEcc--------------CCCceEEEE
Confidence 44444333 37899999888654221111 11112222 56666554322 122458889
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEEeC-CEEEEeeec-CCCcEEEEeCCCCcEeEEEec
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTVAN-GVLFGGSTY-RQGPIYAMDVKTGKILWSYDT 339 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~~~-~~v~~~~~~-~~g~l~~ld~~tG~ilw~~~~ 339 (382)
|+.+|+..--... . ....+....+ ..++..+.. ....|+.++. +|+...+...
T Consensus 361 dl~~g~~~~Lt~~-~--~~~~p~~SpDG~~i~~~s~~~~~~~l~~~~~-~G~~~~~l~~ 415 (429)
T PRK03629 361 DLATGGVQVLTDT-F--LDETPSIAPNGTMVIYSSSQGMGSVLNLVST-DGRFKARLPA 415 (429)
T ss_pred ECCCCCeEEeCCC-C--CCCCceECCCCCEEEEEEcCCCceEEEEEEC-CCCCeEECcc
Confidence 9999975422221 1 1122332334 445555431 1224777787 4666665543
No 121
>PRK01742 tolB translocation protein TolB; Provisional
Probab=95.77 E-value=2.3 Score=42.16 Aligned_cols=103 Identities=17% Similarity=0.243 Sum_probs=51.2
Q ss_pred EEEEEccCc--EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 207 IVVAVQKSG--FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 207 ~v~~~~~~g--~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+++....+| .++.+|.++++..- ...... ......|.+ ++..+++.... ...-.|+.++.
T Consensus 262 La~~~~~~g~~~Iy~~d~~~~~~~~-lt~~~~-~~~~~~wSp--DG~~i~f~s~~--------------~g~~~I~~~~~ 323 (429)
T PRK01742 262 LAFASSKDGVLNIYVMGANGGTPSQ-LTSGAG-NNTEPSWSP--DGQSILFTSDR--------------SGSPQVYRMSA 323 (429)
T ss_pred EEEEEecCCcEEEEEEECCCCCeEe-eccCCC-CcCCEEECC--CCCEEEEEECC--------------CCCceEEEEEC
Confidence 344444444 58888988776432 221110 111122222 55555554321 11235777787
Q ss_pred CCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEe
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKIL 334 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~il 334 (382)
.+++.... .. . . ......-++..+++.. ...++.+|+.+|+..
T Consensus 324 ~~~~~~~l-~~-~-~-~~~~~SpDG~~ia~~~---~~~i~~~Dl~~g~~~ 366 (429)
T PRK01742 324 SGGGASLV-GG-R-G-YSAQISADGKTLVMIN---GDNVVKQDLTSGSTE 366 (429)
T ss_pred CCCCeEEe-cC-C-C-CCccCCCCCCEEEEEc---CCCEEEEECCCCCeE
Confidence 77765433 11 1 1 1222222455666655 356778999998754
No 122
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=95.72 E-value=1.4 Score=43.68 Aligned_cols=147 Identities=16% Similarity=0.151 Sum_probs=95.6
Q ss_pred CCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEE
Q 040693 139 ENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAW 218 (382)
Q Consensus 139 ~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ 218 (382)
...++.++.+|.+||+.+-++...... ..+...+...+| ..+++..+.|-...
T Consensus 208 ~gsDgki~iyDGktge~vg~l~~~~aH------------------------kGsIfalsWsPD---s~~~~T~SaDkt~K 260 (603)
T KOG0318|consen 208 AGSDGKIYIYDGKTGEKVGELEDSDAH------------------------KGSIFALSWSPD---STQFLTVSADKTIK 260 (603)
T ss_pred ecCCccEEEEcCCCccEEEEecCCCCc------------------------cccEEEEEECCC---CceEEEecCCceEE
Confidence 445799999999999999988743221 011111222223 35677777788899
Q ss_pred EEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCC
Q 040693 219 ALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSN 298 (382)
Q Consensus 219 ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~ 298 (382)
..|..+.++.-++.++.. ...-..+-.+.++.|+.. ..+|.|--+++.+++++-.+.....
T Consensus 261 IWdVs~~slv~t~~~~~~--v~dqqvG~lWqkd~lItV-----------------Sl~G~in~ln~~d~~~~~~i~GHnK 321 (603)
T KOG0318|consen 261 IWDVSTNSLVSTWPMGST--VEDQQVGCLWQKDHLITV-----------------SLSGTINYLNPSDPSVLKVISGHNK 321 (603)
T ss_pred EEEeeccceEEEeecCCc--hhceEEEEEEeCCeEEEE-----------------EcCcEEEEecccCCChhheeccccc
Confidence 999999999988887653 111111222345555554 3568999999999998777765443
Q ss_pred CCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcE
Q 040693 299 GTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKI 333 (382)
Q Consensus 299 ~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~i 333 (382)
....-.+.-++..+|-++. +|.|.-.|..+|+-
T Consensus 322 ~ITaLtv~~d~~~i~Sgsy--DG~I~~W~~~~g~~ 354 (603)
T KOG0318|consen 322 SITALTVSPDGKTIYSGSY--DGHINSWDSGSGTS 354 (603)
T ss_pred ceeEEEEcCCCCEEEeecc--CceEEEEecCCccc
Confidence 2222233234568888886 99999999887753
No 123
>PRK00178 tolB translocation protein TolB; Provisional
Probab=95.68 E-value=1.9 Score=42.71 Aligned_cols=151 Identities=13% Similarity=0.137 Sum_probs=75.6
Q ss_pred ecEEEEEcc--CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQK--SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~--~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
..++++... ...|+.+|.++|+..--..... ......|.+ ++..+++.... .....|+.+
T Consensus 211 ~~la~~s~~~~~~~l~~~~l~~g~~~~l~~~~g--~~~~~~~Sp--DG~~la~~~~~--------------~g~~~Iy~~ 272 (430)
T PRK00178 211 KRIAYVSFEQKRPRIFVQNLDTGRREQITNFEG--LNGAPAWSP--DGSKLAFVLSK--------------DGNPEIYVM 272 (430)
T ss_pred CEEEEEEcCCCCCEEEEEECCCCCEEEccCCCC--CcCCeEECC--CCCEEEEEEcc--------------CCCceEEEE
Confidence 334454433 3479999999887543222110 111122322 56666654322 112469999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeee-cCCCcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEe
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGST-YRQGPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGN 358 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~-~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~ 358 (382)
|+++|+..--..... ....+... ++..++..+. .....||.+|..+|++.- ....+.....|.+ .++.|++.+
T Consensus 273 d~~~~~~~~lt~~~~--~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~-lt~~~~~~~~~~~Spdg~~i~~~~ 349 (430)
T PRK00178 273 DLASRQLSRVTNHPA--IDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAER-VTFVGNYNARPRLSADGKTLVMVH 349 (430)
T ss_pred ECCCCCeEEcccCCC--CcCCeEECCCCCEEEEEECCCCCceEEEEECCCCCEEE-eecCCCCccceEECCCCCEEEEEE
Confidence 999987643222221 12222222 3455555442 112469999998887532 2222223334443 455666665
Q ss_pred CceeEeecCCccCCCCCe
Q 040693 359 GYKVTVGFGNKNFTSGTS 376 (382)
Q Consensus 359 ~~g~~~~~~~~~~~~g~~ 376 (382)
..+....++.+|..+|+.
T Consensus 350 ~~~~~~~l~~~dl~tg~~ 367 (430)
T PRK00178 350 RQDGNFHVAAQDLQRGSV 367 (430)
T ss_pred ccCCceEEEEEECCCCCE
Confidence 433333455566666654
No 124
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=95.66 E-value=0.2 Score=48.29 Aligned_cols=143 Identities=17% Similarity=0.207 Sum_probs=92.5
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
++++..++-|..+...|.++|+..-.+..-. .......+ ....+.++. ..+++|...
T Consensus 256 ~nVLaSgsaD~TV~lWD~~~g~p~~s~~~~~-----k~Vq~l~wh~~~p~~LLsG----------------s~D~~V~l~ 314 (463)
T KOG0270|consen 256 RNVLASGSADKTVKLWDVDTGKPKSSITHHG-----KKVQTLEWHPYEPSVLLSG----------------SYDGTVALK 314 (463)
T ss_pred ceeEEecCCCceEEEEEcCCCCcceehhhcC-----CceeEEEecCCCceEEEec----------------cccceEEee
Confidence 6788888899999999999999887776321 12222211 233333321 556888888
Q ss_pred ECC---CCcEEeeecCCCCCCCCcceEEe---CCEEEEeeecCCCcEEEEeCCC-CcEeEEEecCCceecceEE---eCC
Q 040693 283 DAS---NGNVLWSTADPSNGTAPGPVTVA---NGVLFGGSTYRQGPIYAMDVKT-GKILWSYDTGATIYGGASV---SNG 352 (382)
Q Consensus 283 d~~---tG~~~W~~~~~~~~~~~~~~~~~---~~~v~~~~~~~~g~l~~ld~~t-G~ilw~~~~~~~~~~~p~~---~~g 352 (382)
|.+ .--..|++...-. .++.. ....++++ ++|.|+-+|..+ |+.+|..+...+-+++..+ ..+
T Consensus 315 D~R~~~~s~~~wk~~g~VE-----kv~w~~~se~~f~~~t--ddG~v~~~D~R~~~~~vwt~~AHd~~ISgl~~n~~~p~ 387 (463)
T KOG0270|consen 315 DCRDPSNSGKEWKFDGEVE-----KVAWDPHSENSFFVST--DDGTVYYFDIRNPGKPVWTLKAHDDEISGLSVNIQTPG 387 (463)
T ss_pred eccCccccCceEEeccceE-----EEEecCCCceeEEEec--CCceEEeeecCCCCCceeEEEeccCCcceEEecCCCCc
Confidence 888 5567888775432 11111 23455555 599999999875 8999999877655555554 234
Q ss_pred EEEEEeCceeEeecCCccCCCCCe
Q 040693 353 CIYMGNGYKVTVGFGNKNFTSGTS 376 (382)
Q Consensus 353 ~lyv~~~~g~~~~~~~~~~~~g~~ 376 (382)
.|-. .+....++++.|+..+++.
T Consensus 388 ~l~t-~s~d~~Vklw~~~~~~~~~ 410 (463)
T KOG0270|consen 388 LLST-ASTDKVVKLWKFDVDSPKS 410 (463)
T ss_pred ceee-ccccceEEEEeecCCCCcc
Confidence 4443 3444568889888888754
No 125
>PRK05137 tolB translocation protein TolB; Provisional
Probab=95.61 E-value=2.7 Score=41.77 Aligned_cols=107 Identities=14% Similarity=0.077 Sum_probs=51.2
Q ss_pred cEEEEEccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.+++....+ ..++.+|.++++..--.. ... ......|.+ ++..|++...+ .....|..+|
T Consensus 303 ~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~-~~~-~~~~~~~Sp--dG~~ia~~~~~--------------~~~~~i~~~d 364 (435)
T PRK05137 303 QIVFESDRSGSPQLYVMNADGSNPRRISF-GGG-RYSTPVWSP--RGDLIAFTKQG--------------GGQFSIGVMK 364 (435)
T ss_pred EEEEEECCCCCCeEEEEECCCCCeEEeec-CCC-cccCeEECC--CCCEEEEEEcC--------------CCceEEEEEE
Confidence 344444333 379999988776543221 110 011122222 66666655322 1124688889
Q ss_pred CCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecC-C---CcEEEEeCCCCcE
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYR-Q---GPIYAMDVKTGKI 333 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~-~---g~l~~ld~~tG~i 333 (382)
+.+++...... .. ....+... ++..|+...... . ..|+.+|.++++.
T Consensus 365 ~~~~~~~~lt~-~~--~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~~~ 416 (435)
T PRK05137 365 PDGSGERILTS-GF--LVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGRNE 416 (435)
T ss_pred CCCCceEeccC-CC--CCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCCCce
Confidence 87665432222 11 12223223 344555443211 1 3699999876654
No 126
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=95.61 E-value=0.66 Score=45.19 Aligned_cols=136 Identities=15% Similarity=0.202 Sum_probs=83.2
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceE--EEeeeCceeecEEEEEccCcEEEE
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMM--LSMYRNKVKHDIVVAVQKSGFAWA 219 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~--~~~~~~g~~~~~v~~~~~~g~l~a 219 (382)
...+|.+|..++|+.--.+..+.. .+.+ +.+..+ +..|...+..|.++.
T Consensus 279 rky~ysyDle~ak~~k~~~~~g~e--------------------------~~~~e~FeVShd---~~fia~~G~~G~I~l 329 (514)
T KOG2055|consen 279 RKYLYSYDLETAKVTKLKPPYGVE--------------------------EKSMERFEVSHD---SNFIAIAGNNGHIHL 329 (514)
T ss_pred ceEEEEeeccccccccccCCCCcc--------------------------cchhheeEecCC---CCeEEEcccCceEEe
Confidence 378999999988876544333221 1122 222222 357777888999999
Q ss_pred EeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCC
Q 040693 220 LDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNG 299 (382)
Q Consensus 220 ld~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~ 299 (382)
|...|++.+-.+..+.. ........++..|+++ ...|.|+.+|+.+-+.+.+....+.
T Consensus 330 LhakT~eli~s~KieG~----v~~~~fsSdsk~l~~~-----------------~~~GeV~v~nl~~~~~~~rf~D~G~- 387 (514)
T KOG2055|consen 330 LHAKTKELITSFKIEGV----VSDFTFSSDSKELLAS-----------------GGTGEVYVWNLRQNSCLHRFVDDGS- 387 (514)
T ss_pred ehhhhhhhhheeeeccE----EeeEEEecCCcEEEEE-----------------cCCceEEEEecCCcceEEEEeecCc-
Confidence 99999999988886531 1122222467777776 3458899999999988888765553
Q ss_pred CCCcceEE-eCCEEEEeeecCCCcEEEEeCC
Q 040693 300 TAPGPVTV-ANGVLFGGSTYRQGPIYAMDVK 329 (382)
Q Consensus 300 ~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~ 329 (382)
..+..+.. -++. |+++..+.|.|-..|.+
T Consensus 388 v~gts~~~S~ng~-ylA~GS~~GiVNIYd~~ 417 (514)
T KOG2055|consen 388 VHGTSLCISLNGS-YLATGSDSGIVNIYDGN 417 (514)
T ss_pred cceeeeeecCCCc-eEEeccCcceEEEeccc
Confidence 23333332 2333 22222246666666643
No 127
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=95.60 E-value=0.6 Score=47.93 Aligned_cols=229 Identities=15% Similarity=0.108 Sum_probs=118.0
Q ss_pred ccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCC
Q 040693 49 LCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTT 128 (382)
Q Consensus 49 ~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~ 128 (382)
++|.....|.-+|.+||+.. .+...... ... -....+++++..+|....+
T Consensus 34 L~t~~~d~Vi~idv~t~~~~--l~s~~~ed---~d~------ita~~l~~d~~~L~~a~rs------------------- 83 (775)
T KOG0319|consen 34 LYTACGDRVIIIDVATGSIA--LPSGSNED---EDE------ITALALTPDEEVLVTASRS------------------- 83 (775)
T ss_pred EEEecCceEEEEEccCCcee--cccCCccc---hhh------hheeeecCCccEEEEeecc-------------------
Confidence 33444667999999999997 22211100 000 1235677777777776554
Q ss_pred CCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEE
Q 040693 129 PTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIV 208 (382)
Q Consensus 129 ~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v 208 (382)
..+..++..+|+.+-...... ..|++.....++ +.++
T Consensus 84 --------------~llrv~~L~tgk~irswKa~H---------------------------e~Pvi~ma~~~~--g~Ll 120 (775)
T KOG0319|consen 84 --------------QLLRVWSLPTGKLIRSWKAIH---------------------------EAPVITMAFDPT--GTLL 120 (775)
T ss_pred --------------ceEEEEEcccchHhHhHhhcc---------------------------CCCeEEEEEcCC--CceE
Confidence 556667777887654443321 356554322222 3566
Q ss_pred EEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeee----CCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 209 VAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATD----ERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 209 ~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~----~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
-.++-|+.+...|.+.+...=.++- .++..+..... -..++.+ ..++.+.++|.
T Consensus 121 AtggaD~~v~VWdi~~~~~th~fkG-----~gGvVssl~F~~~~~~~lL~sg-----------------~~D~~v~vwnl 178 (775)
T KOG0319|consen 121 ATGGADGRVKVWDIKNGYCTHSFKG-----HGGVVSSLLFHPHWNRWLLASG-----------------ATDGTVRVWNL 178 (775)
T ss_pred EeccccceEEEEEeeCCEEEEEecC-----CCceEEEEEeCCccchhheeec-----------------CCCceEEEEEc
Confidence 6777788888888887776666652 12233332221 1223333 34588999999
Q ss_pred CCCcE-EeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeC------CEEEEE
Q 040693 285 SNGNV-LWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSN------GCIYMG 357 (382)
Q Consensus 285 ~tG~~-~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~------g~lyv~ 357 (382)
.++.. +-..........+-.+..++..++... ++..++..|..+-+.+-..+.-...-+.....+ ..+|..
T Consensus 179 ~~~~tcl~~~~~H~S~vtsL~~~~d~~~~ls~~--RDkvi~vwd~~~~~~l~~lp~ye~~E~vv~l~~~~~~~~~~~~Ta 256 (775)
T KOG0319|consen 179 NDKRTCLHTMILHKSAVTSLAFSEDSLELLSVG--RDKVIIVWDLVQYKKLKTLPLYESLESVVRLREELGGKGEYIITA 256 (775)
T ss_pred ccCchHHHHHHhhhhheeeeeeccCCceEEEec--cCcEEEEeehhhhhhhheechhhheeeEEEechhcCCcceEEEEe
Confidence 88876 221111111111111112233333333 477788888876555555444332222222222 244445
Q ss_pred eCceeEeecCCccCCCCCeE
Q 040693 358 NGYKVTVGFGNKNFTSGTSL 377 (382)
Q Consensus 358 ~~~g~~~~~~~~~~~~g~~l 377 (382)
.+.|... -++..+|+.+
T Consensus 257 G~~g~~~---~~d~es~~~~ 273 (775)
T KOG0319|consen 257 GGSGVVQ---YWDSESGKCV 273 (775)
T ss_pred cCCceEE---EEecccchhh
Confidence 5555532 2455555544
No 128
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=95.54 E-value=2.3 Score=40.40 Aligned_cols=38 Identities=11% Similarity=0.250 Sum_probs=22.0
Q ss_pred ceEEEEECCCCcEEeeecCCC--CCCCCcceEEeCCEEEEee
Q 040693 277 GGWVAMDASNGNVLWSTADPS--NGTAPGPVTVANGVLFGGS 316 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~~~--~~~~~~~~~~~~~~v~~~~ 316 (382)
..+.++|+++.+ |+.-.+. .......+..-++.+|+..
T Consensus 271 ~~v~~yd~~~~~--W~~~~~~p~~~r~~~~~~~~~~~iyv~G 310 (323)
T TIGR03548 271 RKILIYNVRTGK--WKSIGNSPFFARCGAALLLTGNNIFSIN 310 (323)
T ss_pred ceEEEEECCCCe--eeEcccccccccCchheEEECCEEEEEe
Confidence 569999999885 9864321 1222333444456666544
No 129
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=95.51 E-value=2.4 Score=40.58 Aligned_cols=131 Identities=15% Similarity=0.137 Sum_probs=68.4
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA 294 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~ 294 (382)
..+.++|+.+ ..|+.-.+.+ .......+.++.++.||+...... + ......+..+|+...+..|+.-
T Consensus 168 ~~v~~YDp~t--~~W~~~~~~p-~~~r~~~~~~~~~~~iyv~GG~~~-------~---~~~~~~~~~y~~~~~~~~W~~~ 234 (346)
T TIGR03547 168 KNVLSYDPST--NQWRNLGENP-FLGTAGSAIVHKGNKLLLINGEIK-------P---GLRTAEVKQYLFTGGKLEWNKL 234 (346)
T ss_pred ceEEEEECCC--CceeECccCC-CCcCCCceEEEECCEEEEEeeeeC-------C---CccchheEEEEecCCCceeeec
Confidence 3588899875 4688754321 111122333446788888744311 0 0112346677777777889854
Q ss_pred CCC--CCC---C---CcceEEeCCEEEEeeecC--C-------------------CcEEEEeCCCCcEeEEEe--cCCc-
Q 040693 295 DPS--NGT---A---PGPVTVANGVLFGGSTYR--Q-------------------GPIYAMDVKTGKILWSYD--TGAT- 342 (382)
Q Consensus 295 ~~~--~~~---~---~~~~~~~~~~v~~~~~~~--~-------------------g~l~~ld~~tG~ilw~~~--~~~~- 342 (382)
.+- +.. . .....+.++.||+..... . ..+.++|+++. .|+.- ++..
T Consensus 235 ~~m~~~r~~~~~~~~~~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~--~W~~~~~lp~~~ 312 (346)
T TIGR03547 235 PPLPPPKSSSQEGLAGAFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSEVYALDNG--KWSKVGKLPQGL 312 (346)
T ss_pred CCCCCCCCCccccccEEeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEeeEEEecCC--cccccCCCCCCc
Confidence 322 110 0 111234577777764210 0 13456677654 47642 2222
Q ss_pred eecceEEeCCEEEEEeCc
Q 040693 343 IYGGASVSNGCIYMGNGY 360 (382)
Q Consensus 343 ~~~~p~~~~g~lyv~~~~ 360 (382)
...+.++.+++|||..+.
T Consensus 313 ~~~~~~~~~~~iyv~GG~ 330 (346)
T TIGR03547 313 AYGVSVSWNNGVLLIGGE 330 (346)
T ss_pred eeeEEEEcCCEEEEEecc
Confidence 233445679999998764
No 130
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=95.45 E-value=0.42 Score=49.02 Aligned_cols=168 Identities=10% Similarity=-0.018 Sum_probs=94.7
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
++.+...|.+.+...-.++-.+..++.. .+.++ ....+++.+..|+.+.++|
T Consensus 126 D~~v~VWdi~~~~~th~fkG~gGvVssl---------------------------~F~~~-~~~~lL~sg~~D~~v~vwn 177 (775)
T KOG0319|consen 126 DGRVKVWDIKNGYCTHSFKGHGGVVSSL---------------------------LFHPH-WNRWLLASGATDGTVRVWN 177 (775)
T ss_pred cceEEEEEeeCCEEEEEecCCCceEEEE---------------------------EeCCc-cchhheeecCCCceEEEEE
Confidence 4788888887777776666544322211 11111 1246778889999999999
Q ss_pred CCCCCe---eeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCC
Q 040693 222 RDSGSL---IWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSN 298 (382)
Q Consensus 222 ~~tG~~---~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~ 298 (382)
..++.. .|+...... .......++..++.. .++..+..+|..+-+.+-..+...
T Consensus 178 l~~~~tcl~~~~~H~S~v-----tsL~~~~d~~~~ls~-----------------~RDkvi~vwd~~~~~~l~~lp~ye- 234 (775)
T KOG0319|consen 178 LNDKRTCLHTMILHKSAV-----TSLAFSEDSLELLSV-----------------GRDKVIIVWDLVQYKKLKTLPLYE- 234 (775)
T ss_pred cccCchHHHHHHhhhhhe-----eeeeeccCCceEEEe-----------------ccCcEEEEeehhhhhhhheechhh-
Confidence 998776 344332210 111111133333332 344567778876665555544432
Q ss_pred CCCCcceEEeC------CEEEEeeecCCCcEEEEeCCCCcEeEEEecCC-c-e-ecceEEeCCEEEEEeCceeE
Q 040693 299 GTAPGPVTVAN------GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA-T-I-YGGASVSNGCIYMGNGYKVT 363 (382)
Q Consensus 299 ~~~~~~~~~~~------~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~-~-~-~~~p~~~~g~lyv~~~~g~~ 363 (382)
...+.....+ .+++... .+|.+..+|.++++.+.+...+. . + ...++...+++++.+.+.++
T Consensus 235 -~~E~vv~l~~~~~~~~~~~~TaG--~~g~~~~~d~es~~~~~~~~~~~~~e~~~~~~~~~~~~~l~vtaeQnl 305 (775)
T KOG0319|consen 235 -SLESVVRLREELGGKGEYIITAG--GSGVVQYWDSESGKCVYKQRQSDSEEIDHLLAIESMSQLLLVTAEQNL 305 (775)
T ss_pred -heeeEEEechhcCCcceEEEEec--CCceEEEEecccchhhhhhccCCchhhhcceeccccCceEEEEccceE
Confidence 1222222322 3455444 58999999999999988776552 1 1 22344466777777777663
No 131
>KOG1027 consensus Serine/threonine protein kinase and endoribonuclease ERN1/IRE1, sensor of the unfolded protein response pathway [Signal transduction mechanisms]
Probab=95.45 E-value=0.16 Score=53.16 Aligned_cols=112 Identities=14% Similarity=0.168 Sum_probs=77.5
Q ss_pred eecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 204 KHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 204 ~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+++++|++..++..+.+|.+||+..|.+....+ ....+|++. ..-+|...|
T Consensus 106 sdGi~ysg~k~d~~~lvD~~tg~~~~tf~~~~~------------~~~~v~~gr-----------------t~ytv~m~d 156 (903)
T KOG1027|consen 106 SDGILYSGSKQDIWYLVDPKTGEIDYTFNTAEP------------IKQLVYLGR-----------------TNYTVTMYD 156 (903)
T ss_pred CCCeEEecccccceEEecCCccceeEEEecCCc------------chhheeccc-----------------ceeEEeccc
Confidence 378999999999999999999999999986531 224566652 224577788
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceec
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYG 345 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~ 345 (382)
.++-...|......-.....+......+...... .+|.+.-+|.++|+.+|.-+...++..
T Consensus 157 ~~~~~~~wn~t~~dy~a~~~~~~~~~~~~~~~~~-~~g~i~t~D~~~g~~~~~q~~~spvv~ 217 (903)
T KOG1027|consen 157 KNVRGKTWNTTFGDYSAQYPSGVRGEKMSHFHSL-GNGYIVTVDSESGEKLWLQDLLSPVVA 217 (903)
T ss_pred CcccCceeeccccchhccCCCccCCceeEEEeec-CCccEEeccCcccceeeccccCCceEE
Confidence 8888888887754311112222222334444443 388999999999999999887655443
No 132
>PRK02889 tolB translocation protein TolB; Provisional
Probab=95.40 E-value=3.2 Score=41.20 Aligned_cols=62 Identities=13% Similarity=0.073 Sum_probs=36.1
Q ss_pred ceEEEEECCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeee-cCCCcEEEEeCCCCcEeEEEecCCc
Q 040693 277 GGWVAMDASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGST-YRQGPIYAMDVKTGKILWSYDTGAT 342 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~-~~~g~l~~ld~~tG~ilw~~~~~~~ 342 (382)
..|+.+|..+|+...-..... ...+... ++..++..+. .....|+.++. +|+...+...+.+
T Consensus 352 ~~I~v~d~~~g~~~~lt~~~~---~~~p~~spdg~~l~~~~~~~g~~~l~~~~~-~g~~~~~l~~~~g 415 (427)
T PRK02889 352 FKLYVQDLATGQVTALTDTTR---DESPSFAPNGRYILYATQQGGRSVLAAVSS-DGRIKQRLSVQGG 415 (427)
T ss_pred EEEEEEECCCCCeEEccCCCC---ccCceECCCCCEEEEEEecCCCEEEEEEEC-CCCceEEeecCCC
Confidence 368999999998765443221 1223223 3445555443 11235889998 5887777765433
No 133
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=95.34 E-value=0.54 Score=44.42 Aligned_cols=165 Identities=14% Similarity=0.140 Sum_probs=105.3
Q ss_pred CcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCC
Q 040693 84 YAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGG 163 (382)
Q Consensus 84 ~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~ 163 (382)
.+|+.-|..+..+++.+.....++. +..+-.+|.+||+++=...-.-
T Consensus 147 i~gHlgWVr~vavdP~n~wf~tgs~---------------------------------DrtikIwDlatg~LkltltGhi 193 (460)
T KOG0285|consen 147 ISGHLGWVRSVAVDPGNEWFATGSA---------------------------------DRTIKIWDLATGQLKLTLTGHI 193 (460)
T ss_pred hhhccceEEEEeeCCCceeEEecCC---------------------------------CceeEEEEcccCeEEEeecchh
Confidence 3577778777788877443333322 4889999999999987765321
Q ss_pred CcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcc
Q 040693 164 YDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAM 243 (382)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~ 243 (382)
... .-+.++ .....++++..|+.+-|+|.+.-+++-.+.-- -+..
T Consensus 194 ~~v-------------------------r~vavS-----~rHpYlFs~gedk~VKCwDLe~nkvIR~YhGH-----lS~V 238 (460)
T KOG0285|consen 194 ETV-------------------------RGVAVS-----KRHPYLFSAGEDKQVKCWDLEYNKVIRHYHGH-----LSGV 238 (460)
T ss_pred hee-------------------------eeeeec-----ccCceEEEecCCCeeEEEechhhhhHHHhccc-----ccee
Confidence 100 001110 22678888899999999999998888777621 1122
Q ss_pred cceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEe--CCEEEEeeecC
Q 040693 244 WGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVA--NGVLFGGSTYR 319 (382)
Q Consensus 244 ~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~--~~~v~~~~~~~ 319 (382)
+.... ..+.|+.+ ..+..+...|.+|-..+......... ...+... ++-|+.++.
T Consensus 239 ~~L~lhPTldvl~t~-----------------grDst~RvWDiRtr~~V~~l~GH~~~--V~~V~~~~~dpqvit~S~-- 297 (460)
T KOG0285|consen 239 YCLDLHPTLDVLVTG-----------------GRDSTIRVWDIRTRASVHVLSGHTNP--VASVMCQPTDPQVITGSH-- 297 (460)
T ss_pred EEEeccccceeEEec-----------------CCcceEEEeeecccceEEEecCCCCc--ceeEEeecCCCceEEecC--
Confidence 22222 24555554 45677889999998888777755432 2222233 677888875
Q ss_pred CCcEEEEeCCCCcEeEEE
Q 040693 320 QGPIYAMDVKTGKILWSY 337 (382)
Q Consensus 320 ~g~l~~ld~~tG~ilw~~ 337 (382)
++.|...|...|+-.-..
T Consensus 298 D~tvrlWDl~agkt~~tl 315 (460)
T KOG0285|consen 298 DSTVRLWDLRAGKTMITL 315 (460)
T ss_pred CceEEEeeeccCceeEee
Confidence 888888898888765444
No 134
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=95.31 E-value=2.6 Score=39.61 Aligned_cols=123 Identities=18% Similarity=0.171 Sum_probs=69.0
Q ss_pred CcCCCCceeeeeecCcCccceeeeceEE--EcCEEEEeccCccccccccccccccceEEEEeCc-cCceeeeeeccCCCC
Q 040693 2 VKRSNGKLVWKTKLDDHARSFITMSGTY--YKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAK-TGRILWQTFMLPDNF 78 (382)
Q Consensus 2 ld~~tGk~~W~~~~~~~~~~~~~~~p~v--~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~-tG~~lW~~~~~~~~~ 78 (382)
||.++|++.=...-++ +--+..--+. .+..||+..+.-+. .+|.|-.+|++ +-+.+=.+....-
T Consensus 33 ~D~~~g~~~~~~~a~~--gRHFyGHg~fs~dG~~LytTEnd~~~---------g~G~IgVyd~~~~~~ri~E~~s~GI-- 99 (305)
T PF07433_consen 33 FDCRTGQLLQRLWAPP--GRHFYGHGVFSPDGRLLYTTENDYET---------GRGVIGVYDAARGYRRIGEFPSHGI-- 99 (305)
T ss_pred EEcCCCceeeEEcCCC--CCEEecCEEEcCCCCEEEEeccccCC---------CcEEEEEEECcCCcEEEeEecCCCc--
Confidence 5777777654333221 1111111222 24566665543332 27899999987 4444444443211
Q ss_pred CCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEE
Q 040693 79 GKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWY 158 (382)
Q Consensus 79 ~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~ 158 (382)
|. ....+.+++..++|+.|.. .+-|-..+..+...+-+..|.-||..||+++=+
T Consensus 100 -------GP----Hel~l~pDG~tLvVANGGI---------------~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q 153 (305)
T PF07433_consen 100 -------GP----HELLLMPDGETLVVANGGI---------------ETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQ 153 (305)
T ss_pred -------Ch----hhEEEcCCCCEEEEEcCCC---------------ccCcccCceecChhhcCCceEEEecCCCceeee
Confidence 11 1245667776888877653 333333333344444467899999999999988
Q ss_pred EecCC
Q 040693 159 KQLGG 163 (382)
Q Consensus 159 ~~~~~ 163 (382)
..+++
T Consensus 154 ~~Lp~ 158 (305)
T PF07433_consen 154 VELPP 158 (305)
T ss_pred eecCc
Confidence 87754
No 135
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=95.25 E-value=3.4 Score=42.94 Aligned_cols=100 Identities=15% Similarity=0.169 Sum_probs=69.8
Q ss_pred CCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeC--C
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSN--G 352 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~--g 352 (382)
.++.|..+|...|-=.-++..+......-.....++.++..+ -+|.|.+.|.+.++---.+..|.++..+.+.+| |
T Consensus 370 eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~~~g~~llssS--LDGtVRAwDlkRYrNfRTft~P~p~QfscvavD~sG 447 (893)
T KOG0291|consen 370 EDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSS--LDGTVRAWDLKRYRNFRTFTSPEPIQFSCVAVDPSG 447 (893)
T ss_pred CCCcEEEEeccCceEEEEeccCCCceEEEEEEecCCEEEEee--cCCeEEeeeecccceeeeecCCCceeeeEEEEcCCC
Confidence 457788888888877777766553222222333455566666 599999999998887778888888877777766 7
Q ss_pred EEEEEeCceeEeecCCccCCCCCeE
Q 040693 353 CIYMGNGYKVTVGFGNKNFTSGTSL 377 (382)
Q Consensus 353 ~lyv~~~~g~~~~~~~~~~~~g~~l 377 (382)
.|.+..+..+ .-+|--+.+||++|
T Consensus 448 elV~AG~~d~-F~IfvWS~qTGqll 471 (893)
T KOG0291|consen 448 ELVCAGAQDS-FEIFVWSVQTGQLL 471 (893)
T ss_pred CEEEeeccce-EEEEEEEeecCeee
Confidence 7777666544 44566778888876
No 136
>COG3419 PilY1 Tfp pilus assembly protein, tip-associated adhesin PilY1 [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=95.17 E-value=0.79 Score=49.19 Aligned_cols=186 Identities=13% Similarity=0.141 Sum_probs=99.6
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCC--cccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEE
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGY--DVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAV 211 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~ 211 (382)
..++.+..+|.|.+||+.+|.++..+-.... .+..+ ..|. +....+.+-.+|.+.++..++.-+.+|+.+
T Consensus 583 ~~VyvgandGmLhaFd~~tG~E~fA~~P~avl~~l~~~-----t~~~---y~~h~yyVDg~p~~~da~~ng~wrsvL~g~ 654 (1036)
T COG3419 583 PVVYVGANDGMLHAFDANTGSERFAYVPSAVLSTLHSL-----TAPG---YTAHQYYVDGSPTAADAYDNGQWRSVLVGG 654 (1036)
T ss_pred ceEEEecCCceeeeccCCccceeeecCcHHHHhhhhhh-----cCCC---cccccceecCCceeehhhcCCcceEEEEee
Confidence 3455666789999999999999987642211 00000 0000 001245577899999887777545555555
Q ss_pred ccC--cEEEEEeCCCC-----CeeeeeccCCCCCCCCcccceee---eCCeEEEEecCccccccccCCCCCCCCCceEEE
Q 040693 212 QKS--GFAWALDRDSG-----SLIWSMEAGPGGLGGGAMWGAAT---DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA 281 (382)
Q Consensus 212 ~~~--g~l~ald~~tG-----~~~W~~~~~~~~~~g~~~~~~~~---~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a 281 (382)
... .-+||||..+- +++|++........|...-.|.+ .++.=++...+.-. - + .....+..
T Consensus 655 ~G~GG~glyALDVTdP~~~~~~~Lw~~~~~d~~~LG~t~gkP~Iv~l~~gswavl~GNGyn----S-~----~n~~al~~ 725 (1036)
T COG3419 655 LGAGGRGLYALDVTDPDFSNSNLLWENNSNDDPDLGYTMGKPRIVPLHDGSWAVLLGNGYN----S-P----ANGAALLV 725 (1036)
T ss_pred cCCCCceeEEEEccCccccCCcchhcccCCCccccccccCCCeEEEcCCCceEEEEccCCC----C-C----CCCcceEE
Confidence 433 36999988754 48898876654334433333322 22222222222110 0 0 11233667
Q ss_pred EECCCC----cEEeeecCCCCCCCCc-----------ceEEe-C---CEEEEeeecCCCcEEEEeCCCCcE-eEEEe
Q 040693 282 MDASNG----NVLWSTADPSNGTAPG-----------PVTVA-N---GVLFGGSTYRQGPIYAMDVKTGKI-LWSYD 338 (382)
Q Consensus 282 ~d~~tG----~~~W~~~~~~~~~~~~-----------~~~~~-~---~~v~~~~~~~~g~l~~ld~~tG~i-lw~~~ 338 (382)
+++.++ +..|+..........+ ++..+ + +++|++. ..|.|+-||+.-... -|.+.
T Consensus 726 ~~L~t~~~~~~~~v~~g~~~~~g~~P~~~~~g~~~~~~~d~~~dG~vd~aYAGD--l~GnlWRFdLsg~~~n~W~va 800 (1036)
T COG3419 726 LNLLTLDATRKVPVQSGTGYGAGVSPVCVGVGGLDVAVLDLDGDGIVDYAYAGD--LGGNLWRFDLSGNAPNSWTVA 800 (1036)
T ss_pred EEeecCCcceeEEEeccCCccccccCccccccccccceeecCCCceEEEEEeec--cCCcEEEEEecCCCCCCccee
Confidence 777766 4555544322111111 11112 2 2788887 489999999874443 66654
No 137
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=95.17 E-value=0.064 Score=52.15 Aligned_cols=142 Identities=13% Similarity=0.107 Sum_probs=89.2
Q ss_pred ecEEEEEccCcEEEEEeCCC-CCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDS-GSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~t-G~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
..+++.++-|+.++.++.-+ ++.+-.+.--.. ........ ..+.-|.+. ..+..+..+|
T Consensus 227 ~hLlLS~gmD~~vklW~vy~~~~~lrtf~gH~k---~Vrd~~~s-~~g~~fLS~----------------sfD~~lKlwD 286 (503)
T KOG0282|consen 227 GHLLLSGGMDGLVKLWNVYDDRRCLRTFKGHRK---PVRDASFN-NCGTSFLSA----------------SFDRFLKLWD 286 (503)
T ss_pred eeEEEecCCCceEEEEEEecCcceehhhhcchh---hhhhhhcc-ccCCeeeee----------------ecceeeeeec
Confidence 56777788888888877665 666655542110 00111111 234444442 4568899999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeC-CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCc-eecceEEeCCEEEEEeCce
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVAN-GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGAT-IYGGASVSNGCIYMGNGYK 361 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~-~~~~p~~~~g~lyv~~~~g 361 (382)
.+||+.+-+..........- .-.++ +.++++.. +++|...|..+|+++-++...-+ +..-.-+.+|+-||++++.
T Consensus 287 tETG~~~~~f~~~~~~~cvk-f~pd~~n~fl~G~s--d~ki~~wDiRs~kvvqeYd~hLg~i~~i~F~~~g~rFissSDd 363 (503)
T KOG0282|consen 287 TETGQVLSRFHLDKVPTCVK-FHPDNQNIFLVGGS--DKKIRQWDIRSGKVVQEYDRHLGAILDITFVDEGRRFISSSDD 363 (503)
T ss_pred cccceEEEEEecCCCceeee-cCCCCCcEEEEecC--CCcEEEEeccchHHHHHHHhhhhheeeeEEccCCceEeeeccC
Confidence 99999998888765321111 11233 56666664 99999999999999887754433 3344455889999999886
Q ss_pred eEeecCCc
Q 040693 362 VTVGFGNK 369 (382)
Q Consensus 362 ~~~~~~~~ 369 (382)
..+=+|.+
T Consensus 364 ks~riWe~ 371 (503)
T KOG0282|consen 364 KSVRIWEN 371 (503)
T ss_pred ccEEEEEc
Confidence 65444433
No 138
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=94.98 E-value=2.4 Score=43.85 Aligned_cols=128 Identities=18% Similarity=0.194 Sum_probs=81.9
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA 294 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~ 294 (382)
..+..+|+.+.+ |..-++ - ...+...+.++-++.||+..+..+. ..-..+-++|+++ ..|+.-
T Consensus 349 ~~ve~YD~~~~~--W~~~a~-M-~~~R~~~~v~~l~g~iYavGG~dg~-----------~~l~svE~YDp~~--~~W~~v 411 (571)
T KOG4441|consen 349 SSVERYDPRTNQ--WTPVAP-M-NTKRSDFGVAVLDGKLYAVGGFDGE-----------KSLNSVECYDPVT--NKWTPV 411 (571)
T ss_pred ceEEEecCCCCc--eeccCC-c-cCccccceeEEECCEEEEEeccccc-----------cccccEEEecCCC--Cccccc
Confidence 468889999877 887322 1 2334556666678999988554311 2335688999865 568765
Q ss_pred CCCC-CCCCcceEEeCCEEEEeeecC-----CCcEEEEeCCCCcEeEEEecCC---ceecceEEeCCEEEEEeCce
Q 040693 295 DPSN-GTAPGPVTVANGVLFGGSTYR-----QGPIYAMDVKTGKILWSYDTGA---TIYGGASVSNGCIYMGNGYK 361 (382)
Q Consensus 295 ~~~~-~~~~~~~~~~~~~v~~~~~~~-----~g~l~~ld~~tG~ilw~~~~~~---~~~~~p~~~~g~lyv~~~~g 361 (382)
.+-. ........+-++++|+..... -..+.++|+.+.+ |+...+- ......++.+++||+..+..
T Consensus 412 a~m~~~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~--W~~~~~M~~~R~~~g~a~~~~~iYvvGG~~ 485 (571)
T KOG4441|consen 412 APMLTRRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDPETNT--WTLIAPMNTRRSGFGVAVLNGKIYVVGGFD 485 (571)
T ss_pred CCCCcceeeeEEEEECCEEEEEcCcCCCccccceEEEEcCCCCc--eeecCCcccccccceEEEECCEEEEECCcc
Confidence 4221 223444456688888875311 1468899998655 7765443 24456788999999987754
No 139
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=94.91 E-value=2.5 Score=40.00 Aligned_cols=53 Identities=17% Similarity=0.219 Sum_probs=34.4
Q ss_pred ecEEEEEcc-Cc-EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCc
Q 040693 205 HDIVVAVQK-SG-FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANS 260 (382)
Q Consensus 205 ~~~v~~~~~-~g-~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~ 260 (382)
++.|++... .| .|.+++++ |+++=+++++... .....++.. +.+.||+++...
T Consensus 223 dG~lw~~a~~~g~~v~~~~pd-G~l~~~i~lP~~~-~t~~~FgG~-~~~~L~iTs~~~ 277 (307)
T COG3386 223 DGNLWVAAVWGGGRVVRFNPD-GKLLGEIKLPVKR-PTNPAFGGP-DLNTLYITSARS 277 (307)
T ss_pred CCCEEEecccCCceEEEECCC-CcEEEEEECCCCC-CccceEeCC-CcCEEEEEecCC
Confidence 566774433 33 89999999 9999999987421 111222211 468999987654
No 140
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=94.91 E-value=0.23 Score=46.17 Aligned_cols=181 Identities=14% Similarity=0.132 Sum_probs=102.4
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEE-EeeeCceeecEEEEEccCcEEEEE
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMML-SMYRNKVKHDIVVAVQKSGFAWAL 220 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~-~~~~~g~~~~~v~~~~~~g~l~al 220 (382)
+|.|-..|-.|||+.--.++.+.+.++. ...+++. .+.++ ...+-.+..||.+-.+
T Consensus 234 DGFiEVWny~~GKlrKDLkYQAqd~fMM--------------------md~aVlci~FSRD---sEMlAsGsqDGkIKvW 290 (508)
T KOG0275|consen 234 DGFIEVWNYTTGKLRKDLKYQAQDNFMM--------------------MDDAVLCISFSRD---SEMLASGSQDGKIKVW 290 (508)
T ss_pred cceeeeehhccchhhhhhhhhhhcceee--------------------cccceEEEeeccc---HHHhhccCcCCcEEEE
Confidence 4889999999999988777776654321 1122221 11211 1234445567777777
Q ss_pred eCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCC
Q 040693 221 DRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGT 300 (382)
Q Consensus 221 d~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~ 300 (382)
...||..+-+++-.- ..|.......-|+..+... ..+.++...-.++||.+=++.......
T Consensus 291 ri~tG~ClRrFdrAH--tkGvt~l~FSrD~SqiLS~-----------------sfD~tvRiHGlKSGK~LKEfrGHsSyv 351 (508)
T KOG0275|consen 291 RIETGQCLRRFDRAH--TKGVTCLSFSRDNSQILSA-----------------SFDQTVRIHGLKSGKCLKEFRGHSSYV 351 (508)
T ss_pred EEecchHHHHhhhhh--ccCeeEEEEccCcchhhcc-----------------cccceEEEeccccchhHHHhcCccccc
Confidence 777777666665221 1111111111133333332 234556666778899887776655322
Q ss_pred CCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee------------cceEE--eCCEEEEEeCceeEeec
Q 040693 301 APGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY------------GGASV--SNGCIYMGNGYKVTVGF 366 (382)
Q Consensus 301 ~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~------------~~p~~--~~g~lyv~~~~g~~~~~ 366 (382)
......-++..+..++. +|.+.+.+.+|++-+-.++.++.-+ ...++ --+.+|+-+..|.++.-
T Consensus 352 n~a~ft~dG~~iisaSs--DgtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCNrsntv~imn~qGQvVrs 429 (508)
T KOG0275|consen 352 NEATFTDDGHHIISASS--DGTVKVWHGKTTECLSTFKPLGTDYPVNSVILLPKNPEHFIVCNRSNTVYIMNMQGQVVRS 429 (508)
T ss_pred cceEEcCCCCeEEEecC--CccEEEecCcchhhhhhccCCCCcccceeEEEcCCCCceEEEEcCCCeEEEEeccceEEee
Confidence 22333334567777774 9999999999998888876655211 11122 23677777777665543
No 141
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=94.88 E-value=1.6 Score=41.67 Aligned_cols=158 Identities=15% Similarity=0.078 Sum_probs=95.5
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDR 222 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~ 222 (382)
..+-..|+...+++|+.+..+.+.... ..|. +.....+. +|.....+...+.-+++..+|.
T Consensus 173 n~lkiwdle~~~qiw~aKNvpnD~L~L-----rVPv----------W~tdi~Fl----~g~~~~~fat~T~~hqvR~YDt 233 (412)
T KOG3881|consen 173 NELKIWDLEQSKQIWSAKNVPNDRLGL-----RVPV----------WITDIRFL----EGSPNYKFATITRYHQVRLYDT 233 (412)
T ss_pred cceeeeecccceeeeeccCCCCccccc-----eeee----------eeccceec----CCCCCceEEEEecceeEEEecC
Confidence 566777888899999998877643211 1111 00111111 1222456778888899999999
Q ss_pred CCCC-eeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCC
Q 040693 223 DSGS-LIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTA 301 (382)
Q Consensus 223 ~tG~-~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~ 301 (382)
..+. +.-+++.... .-...+...+++.||++ +..+.+..||.++|+..=..-... .-.
T Consensus 234 ~~qRRPV~~fd~~E~---~is~~~l~p~gn~Iy~g-----------------n~~g~l~~FD~r~~kl~g~~~kg~-tGs 292 (412)
T KOG3881|consen 234 RHQRRPVAQFDFLEN---PISSTGLTPSGNFIYTG-----------------NTKGQLAKFDLRGGKLLGCGLKGI-TGS 292 (412)
T ss_pred cccCcceeEeccccC---cceeeeecCCCcEEEEe-----------------cccchhheecccCceeeccccCCc-cCC
Confidence 9774 5555554321 11112222377889987 566889999999999876522211 011
Q ss_pred CcceEEeCC-EEEEeeecCCCcEEEEeCCCCcEeEEEecCC
Q 040693 302 PGPVTVANG-VLFGGSTYRQGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 302 ~~~~~~~~~-~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
...+....+ .+.+.. +-+-.|..+|.+|.+++-+..+..
T Consensus 293 irsih~hp~~~~las~-GLDRyvRIhD~ktrkll~kvYvKs 332 (412)
T KOG3881|consen 293 IRSIHCHPTHPVLASC-GLDRYVRIHDIKTRKLLHKVYVKS 332 (412)
T ss_pred cceEEEcCCCceEEee-ccceeEEEeecccchhhhhhhhhc
Confidence 233333333 344433 257889999999988888776654
No 142
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=94.77 E-value=3.5 Score=38.33 Aligned_cols=183 Identities=13% Similarity=0.048 Sum_probs=104.9
Q ss_pred CcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEE
Q 040693 141 HSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWAL 220 (382)
Q Consensus 141 ~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~al 220 (382)
.+..|..|+..|-|-+-.++-....+. .+..+| + ++..+.++.|..+..+
T Consensus 78 ~d~tIryLsl~dNkylRYF~GH~~~V~--------------------sL~~sP--------~--~d~FlS~S~D~tvrLW 127 (311)
T KOG1446|consen 78 EDDTIRYLSLHDNKYLRYFPGHKKRVN--------------------SLSVSP--------K--DDTFLSSSLDKTVRLW 127 (311)
T ss_pred CCCceEEEEeecCceEEEcCCCCceEE--------------------EEEecC--------C--CCeEEecccCCeEEee
Confidence 357899999998888877765543211 111222 1 5778888888888888
Q ss_pred eCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCC--cEEeeecCCC
Q 040693 221 DRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNG--NVLWSTADPS 297 (382)
Q Consensus 221 d~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG--~~~W~~~~~~ 297 (382)
|..+-+.+=....... + ..+. ..+++|+...+ ...|..+|.+.- .+-=++.+..
T Consensus 128 DlR~~~cqg~l~~~~~-----p--i~AfDp~GLifA~~~~----------------~~~IkLyD~Rs~dkgPF~tf~i~~ 184 (311)
T KOG1446|consen 128 DLRVKKCQGLLNLSGR-----P--IAAFDPEGLIFALANG----------------SELIKLYDLRSFDKGPFTTFSITD 184 (311)
T ss_pred EecCCCCceEEecCCC-----c--ceeECCCCcEEEEecC----------------CCeEEEEEecccCCCCceeEccCC
Confidence 8775443333332211 1 1122 35677766433 235777777643 1111122111
Q ss_pred CC-CCCcceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee---cceEE-eCCEEEEEeCceeEeecCCcc
Q 040693 298 NG-TAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY---GGASV-SNGCIYMGNGYKVTVGFGNKN 370 (382)
Q Consensus 298 ~~-~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~---~~p~~-~~g~lyv~~~~g~~~~~~~~~ 370 (382)
+. ..-..+-+ +|..+.+.+. .+.++.||+=+|.++-.+....... .+... -+++.+++.++...+++|++
T Consensus 185 ~~~~ew~~l~FS~dGK~iLlsT~--~s~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ftPds~Fvl~gs~dg~i~vw~~- 261 (311)
T KOG1446|consen 185 NDEAEWTDLEFSPDGKSILLSTN--ASFIYLLDAFDGTVKSTFSGYPNAGNLPLSATFTPDSKFVLSGSDDGTIHVWNL- 261 (311)
T ss_pred CCccceeeeEEcCCCCEEEEEeC--CCcEEEEEccCCcEeeeEeeccCCCCcceeEEECCCCcEEEEecCCCcEEEEEc-
Confidence 00 01112222 3557777774 8999999999999988886554322 22223 56777776666555778887
Q ss_pred CCCCCeEEEE
Q 040693 371 FTSGTSLYAF 380 (382)
Q Consensus 371 ~~~g~~l~~~ 380 (382)
.+|+..-++
T Consensus 262 -~tg~~v~~~ 270 (311)
T KOG1446|consen 262 -ETGKKVAVL 270 (311)
T ss_pred -CCCcEeeEe
Confidence 558776655
No 143
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=94.76 E-value=4.5 Score=39.53 Aligned_cols=166 Identities=15% Similarity=0.163 Sum_probs=104.6
Q ss_pred cCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCC
Q 040693 31 KGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNL 110 (382)
Q Consensus 31 ~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~ 110 (382)
++.+|++.... ..+.+..+|.++++++=+...+... .-.++++++..+|+....
T Consensus 127 ~~~vYV~n~~~-----------~~~~vsvid~~t~~~~~~~~vG~~P--------------~~~a~~p~g~~vyv~~~~- 180 (381)
T COG3391 127 GKYVYVANAGN-----------GNNTVSVIDAATNKVTATIPVGNTP--------------TGVAVDPDGNKVYVTNSD- 180 (381)
T ss_pred CCEEEEEeccc-----------CCceEEEEeCCCCeEEEEEecCCCc--------------ceEEECCCCCeEEEEecC-
Confidence 56888877521 1678999999999998776553211 136788888899998733
Q ss_pred CCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCC
Q 040693 111 YSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFG 190 (382)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (382)
.+.|..+|. .+..+|+ ..... .....
T Consensus 181 -------------------------------~~~v~vi~~-~~~~v~~-~~~~~---------------------~~~~~ 206 (381)
T COG3391 181 -------------------------------DNTVSVIDT-SGNSVVR-GSVGS---------------------LVGVG 206 (381)
T ss_pred -------------------------------CCeEEEEeC-CCcceec-ccccc---------------------ccccC
Confidence 389999996 4777776 33221 11223
Q ss_pred CCceEEEeeeCceeecEEEEE-ccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCcccccccc
Q 040693 191 EAPMMLSMYRNKVKHDIVVAV-QKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNL 267 (382)
Q Consensus 191 ~~p~~~~~~~~g~~~~~v~~~-~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~ 267 (382)
..|....+..+| ..+++. ..+ +.+..+|..+++..+....... . ........-++..+|+...
T Consensus 207 ~~P~~i~v~~~g---~~~yV~~~~~~~~~v~~id~~~~~v~~~~~~~~~-~-~~~~v~~~p~g~~~yv~~~--------- 272 (381)
T COG3391 207 TGPAGIAVDPDG---NRVYVANDGSGSNNVLKIDTATGNVTATDLPVGS-G-APRGVAVDPAGKAAYVANS--------- 272 (381)
T ss_pred CCCceEEECCCC---CEEEEEeccCCCceEEEEeCCCceEEEecccccc-C-CCCceeECCCCCEEEEEec---------
Confidence 455555555455 334443 333 6999999999999988332211 0 0111111226777887632
Q ss_pred CCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 268 KPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 268 ~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
..+.+..+|..+.++.=......
T Consensus 273 -------~~~~V~vid~~~~~v~~~~~~~~ 295 (381)
T COG3391 273 -------QGGTVSVIDGATDRVVKTGPTGN 295 (381)
T ss_pred -------CCCeEEEEeCCCCceeeeecccc
Confidence 24789999999988877665544
No 144
>PHA03098 kelch-like protein; Provisional
Probab=94.69 E-value=5.8 Score=40.51 Aligned_cols=128 Identities=13% Similarity=0.080 Sum_probs=70.9
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA 294 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~ 294 (382)
..+..+|+.+. .|+.-.+.+ ........+..++.+|+..+.... ......+..+|++++ .|+.-
T Consensus 358 ~~v~~yd~~~~--~W~~~~~lp--~~r~~~~~~~~~~~iYv~GG~~~~----------~~~~~~v~~yd~~t~--~W~~~ 421 (534)
T PHA03098 358 NTVESWKPGES--KWREEPPLI--FPRYNPCVVNVNNLIYVIGGISKN----------DELLKTVECFSLNTN--KWSKG 421 (534)
T ss_pred ceEEEEcCCCC--ceeeCCCcC--cCCccceEEEECCEEEEECCcCCC----------CcccceEEEEeCCCC--eeeec
Confidence 35778888764 587644322 112223334467888886442110 011356899999875 58764
Q ss_pred CCCC-CCCCcceEEeCCEEEEeeecC-------CCcEEEEeCCCCcEeEEEecC--C-ceecceEEeCCEEEEEeCc
Q 040693 295 DPSN-GTAPGPVTVANGVLFGGSTYR-------QGPIYAMDVKTGKILWSYDTG--A-TIYGGASVSNGCIYMGNGY 360 (382)
Q Consensus 295 ~~~~-~~~~~~~~~~~~~v~~~~~~~-------~g~l~~ld~~tG~ilw~~~~~--~-~~~~~p~~~~g~lyv~~~~ 360 (382)
.+.+ ..........++.+|+..... -..++++|+++. .|+.-.+ . ....+.++.++++|+..+.
T Consensus 422 ~~~p~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~v~~yd~~~~--~W~~~~~~~~~r~~~~~~~~~~~iyv~GG~ 496 (534)
T PHA03098 422 SPLPISHYGGCAIYHDGKIYVIGGISYIDNIKVYNIVESYNPVTN--KWTELSSLNFPRINASLCIFNNKIYVVGGD 496 (534)
T ss_pred CCCCccccCceEEEECCEEEEECCccCCCCCcccceEEEecCCCC--ceeeCCCCCcccccceEEEECCEEEEEcCC
Confidence 3221 122334445677777754210 123889998865 4654221 1 2234456679999998764
No 145
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=94.62 E-value=3.4 Score=37.45 Aligned_cols=148 Identities=9% Similarity=-0.010 Sum_probs=73.3
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCccccee--eeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAA--TDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
-..+++++.+|.+...|..+- +-..++-+... ....... .++.++... ...|+.++.
T Consensus 136 QteLis~dqsg~irvWDl~~~--~c~~~liPe~~--~~i~sl~v~~dgsml~a~-----------------nnkG~cyvW 194 (311)
T KOG0315|consen 136 QTELISGDQSGNIRVWDLGEN--SCTHELIPEDD--TSIQSLTVMPDGSMLAAA-----------------NNKGNCYVW 194 (311)
T ss_pred cceEEeecCCCcEEEEEccCC--ccccccCCCCC--cceeeEEEcCCCcEEEEe-----------------cCCccEEEE
Confidence 356788888999999998642 22223222111 1222222 245555443 345777877
Q ss_pred ECCCCcEEe------eecCCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC--ce-ecceEEeC
Q 040693 283 DASNGNVLW------STADPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA--TI-YGGASVSN 351 (382)
Q Consensus 283 d~~tG~~~W------~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~--~~-~~~p~~~~ 351 (382)
+.-++...= ++..... ......+ ++.++..++ .+..++.++.++- +.-+..+.+ .. +-..-..|
T Consensus 195 ~l~~~~~~s~l~P~~k~~ah~~--~il~C~lSPd~k~lat~s--sdktv~iwn~~~~-~kle~~l~gh~rWvWdc~FS~d 269 (311)
T KOG0315|consen 195 RLLNHQTASELEPVHKFQAHNG--HILRCLLSPDVKYLATCS--SDKTVKIWNTDDF-FKLELVLTGHQRWVWDCAFSAD 269 (311)
T ss_pred EccCCCccccceEhhheecccc--eEEEEEECCCCcEEEeec--CCceEEEEecCCc-eeeEEEeecCCceEEeeeeccC
Confidence 777653322 2222211 1111222 234444444 3777888887654 211122211 12 22223356
Q ss_pred CEEEEEeCceeEeecCCccCCCCCeEEEE
Q 040693 352 GCIYMGNGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 352 g~lyv~~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
++..|+.+....+.++.+.. |+.+-.|
T Consensus 270 g~YlvTassd~~~rlW~~~~--~k~v~qy 296 (311)
T KOG0315|consen 270 GEYLVTASSDHTARLWDLSA--GKEVRQY 296 (311)
T ss_pred ccEEEecCCCCceeeccccc--Cceeeec
Confidence 66666555557788887766 6655444
No 146
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=94.61 E-value=1.6 Score=43.77 Aligned_cols=215 Identities=12% Similarity=0.047 Sum_probs=113.2
Q ss_pred ceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCCC
Q 040693 55 GSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDK 134 (382)
Q Consensus 55 g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (382)
..||.||++-|+.|=-+.+. ++..| ...+.+..+++.+++.+
T Consensus 155 ~evYRlNLEqGrfL~P~~~~-----------~~~lN--~v~in~~hgLla~Gt~~------------------------- 196 (703)
T KOG2321|consen 155 SEVYRLNLEQGRFLNPFETD-----------SGELN--VVSINEEHGLLACGTED------------------------- 196 (703)
T ss_pred cceEEEEccccccccccccc-----------cccce--eeeecCccceEEecccC-------------------------
Confidence 35677777777665554442 22222 24566666777776654
Q ss_pred CCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEE--eeeCceeecEEEEEc
Q 040693 135 CIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLS--MYRNKVKHDIVVAVQ 212 (382)
Q Consensus 135 ~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~--~~~~g~~~~~v~~~~ 212 (382)
|.|-++|+.+-+.+=.......- .+. -+....|.+-. +..+ +-.+-+++
T Consensus 197 --------g~VEfwDpR~ksrv~~l~~~~~v--------~s~----------pg~~~~~svTal~F~d~---gL~~aVGt 247 (703)
T KOG2321|consen 197 --------GVVEFWDPRDKSRVGTLDAASSV--------NSH----------PGGDAAPSVTALKFRDD---GLHVAVGT 247 (703)
T ss_pred --------ceEEEecchhhhhheeeeccccc--------CCC----------ccccccCcceEEEecCC---ceeEEeec
Confidence 88999999888777776654330 000 01123333433 3322 35677899
Q ss_pred cCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEee
Q 040693 213 KSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWS 292 (382)
Q Consensus 213 ~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~ 292 (382)
.+|.++.+|..+-+++-..+-.-....-...|...-..+.|+.. ....+..+|..+|+..=.
T Consensus 248 s~G~v~iyDLRa~~pl~~kdh~~e~pi~~l~~~~~~~q~~v~S~------------------Dk~~~kiWd~~~Gk~~as 309 (703)
T KOG2321|consen 248 STGSVLIYDLRASKPLLVKDHGYELPIKKLDWQDTDQQNKVVSM------------------DKRILKIWDECTGKPMAS 309 (703)
T ss_pred cCCcEEEEEcccCCceeecccCCccceeeecccccCCCceEEec------------------chHHhhhcccccCCceee
Confidence 99999999999988887766432111112234333122334332 223455678888887655
Q ss_pred ecCCCCCCCCcceEE-eCCEEEEeeecCCCcEEEEeCC--CCcEeEEEecCC---c--eecceEEeCCEEEEEe
Q 040693 293 TADPSNGTAPGPVTV-ANGVLFGGSTYRQGPIYAMDVK--TGKILWSYDTGA---T--IYGGASVSNGCIYMGN 358 (382)
Q Consensus 293 ~~~~~~~~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~--tG~ilw~~~~~~---~--~~~~p~~~~g~lyv~~ 358 (382)
++.-.. . .....+ +.|++|.+.. ...++.+=.- .-.+.|=..+.. . --..+.++++.-||+-
T Consensus 310 iEpt~~-l-ND~C~~p~sGm~f~Ane--~~~m~~yyiP~LGPaPrWCSfLdnlTEElEE~~~~TVYDnYkFvTk 379 (703)
T KOG2321|consen 310 IEPTSD-L-NDFCFVPGSGMFFTANE--SSKMHTYYIPSLGPAPRWCSFLDNLTEELEENPETTVYDNYKFVTK 379 (703)
T ss_pred ccccCC-c-CceeeecCCceEEEecC--CCcceeEEccccCCCchhhhHHHhHHHHHhcCCccccccceeeeeH
Confidence 543221 1 111222 3466666652 4443332211 123444322211 0 1134567888888874
No 147
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=94.59 E-value=4.5 Score=38.75 Aligned_cols=82 Identities=20% Similarity=0.214 Sum_probs=47.2
Q ss_pred ceEEEEECCCCcEEeeecC--CCCCCCCcceEEeCCEEEEeeecC-----CCcEEEEeCCCCcEeEEEecCCce------
Q 040693 277 GGWVAMDASNGNVLWSTAD--PSNGTAPGPVTVANGVLFGGSTYR-----QGPIYAMDVKTGKILWSYDTGATI------ 343 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~--~~~~~~~~~~~~~~~~v~~~~~~~-----~g~l~~ld~~tG~ilw~~~~~~~~------ 343 (382)
..+..+|+++. .|+... +........+...++.+|+..... ...++.+|.+..+-.|+.-.+-..
T Consensus 168 ~~v~~YDp~t~--~W~~~~~~p~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~y~~~~~~~~W~~~~~m~~~r~~~~ 245 (346)
T TIGR03547 168 KNVLSYDPSTN--QWRNLGENPFLGTAGSAIVHKGNKLLLINGEIKPGLRTAEVKQYLFTGGKLEWNKLPPLPPPKSSSQ 245 (346)
T ss_pred ceEEEEECCCC--ceeECccCCCCcCCCceEEEECCEEEEEeeeeCCCccchheEEEEecCCCceeeecCCCCCCCCCcc
Confidence 56999999865 588643 321223334445577777754211 123566776656677865332111
Q ss_pred ----ecceEEeCCEEEEEeCc
Q 040693 344 ----YGGASVSNGCIYMGNGY 360 (382)
Q Consensus 344 ----~~~p~~~~g~lyv~~~~ 360 (382)
....++.+++||+..+.
T Consensus 246 ~~~~~~~a~~~~~~Iyv~GG~ 266 (346)
T TIGR03547 246 EGLAGAFAGISNGVLLVAGGA 266 (346)
T ss_pred ccccEEeeeEECCEEEEeecC
Confidence 11245689999998764
No 148
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=94.49 E-value=4.3 Score=38.19 Aligned_cols=95 Identities=13% Similarity=0.098 Sum_probs=59.5
Q ss_pred CCCCceEEEeeeCceeecEEE-EEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCcc-ccccc
Q 040693 189 FGEAPMMLSMYRNKVKHDIVV-AVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQ-HKNFN 266 (382)
Q Consensus 189 ~~~~p~~~~~~~~g~~~~~v~-~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~-~~~~~ 266 (382)
++-+|..+ ++.++ .....|.+..+|.++|+..=-...+. ...+....++++|++.+... ...+.
T Consensus 204 mPhSPRWh--------dgrLwvldsgtGev~~vD~~~G~~e~Va~vpG------~~rGL~f~G~llvVgmSk~R~~~~f~ 269 (335)
T TIGR03032 204 MPHSPRWY--------QGKLWLLNSGRGELGYVDPQAGKFQPVAFLPG------FTRGLAFAGDFAFVGLSKLRESRVFG 269 (335)
T ss_pred CCcCCcEe--------CCeEEEEECCCCEEEEEcCCCCcEEEEEECCC------CCcccceeCCEEEEEeccccCCCCcC
Confidence 34577777 44554 45667899999998897655555432 22333445789999887755 33343
Q ss_pred cCCCCC--CCCCceEEEEECCCCcEEeeecCCC
Q 040693 267 LKPSKN--STIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 267 ~~~~~~--~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
.-|-.. ......|..+|++||+++--..+..
T Consensus 270 glpl~~~l~~~~CGv~vidl~tG~vv~~l~feg 302 (335)
T TIGR03032 270 GLPIEERLDALGCGVAVIDLNSGDVVHWLRFEG 302 (335)
T ss_pred CCchhhhhhhhcccEEEEECCCCCEEEEEEeCC
Confidence 333222 1223669999999999776666544
No 149
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=94.48 E-value=6.2 Score=39.92 Aligned_cols=157 Identities=13% Similarity=0.079 Sum_probs=85.3
Q ss_pred ecEEEEEccCc------EEEEEeCCCCCeeeeeccCCCC-CCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCc
Q 040693 205 HDIVVAVQKSG------FAWALDRDSGSLIWSMEAGPGG-LGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAG 277 (382)
Q Consensus 205 ~~~v~~~~~~g------~l~ald~~tG~~~W~~~~~~~~-~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g 277 (382)
..+++++..+. .|+.+|..|++ |.+...... ..........+.+++||+-.+.... ....+
T Consensus 123 ~~l~lfGG~~~~~~~~~~l~~~d~~t~~--W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~----------~~~~n 190 (482)
T KOG0379|consen 123 DKLYLFGGTDKKYRNLNELHSLDLSTRT--WSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGT----------GDSLN 190 (482)
T ss_pred CeEEEEccccCCCCChhheEeccCCCCc--EEEecCcCCCCCCcccceEEEECCEEEEECCccCc----------cccee
Confidence 44555555552 69999998764 444332211 1222333344466777765332211 12457
Q ss_pred eEEEEECCCCcEEeeecCCCC----CCCCcceEEeCCEEEEeeecC-----CCcEEEEeCCCCcEeEEEecCC------c
Q 040693 278 GWVAMDASNGNVLWSTADPSN----GTAPGPVTVANGVLFGGSTYR-----QGPIYAMDVKTGKILWSYDTGA------T 342 (382)
Q Consensus 278 ~v~a~d~~tG~~~W~~~~~~~----~~~~~~~~~~~~~v~~~~~~~-----~g~l~~ld~~tG~ilw~~~~~~------~ 342 (382)
.++++|+++-+ |..-.-.+ +..+..+.+.++++++.-.+. -+.+++||.++ ..|+..... .
T Consensus 191 dl~i~d~~~~~--W~~~~~~g~~P~pR~gH~~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~--~~W~~~~~~g~~p~~R 266 (482)
T KOG0379|consen 191 DLHIYDLETST--WSELDTQGEAPSPRYGHAMVVVGNKLLVFGGGDDGDVYLNDVHILDLST--WEWKLLPTGGDLPSPR 266 (482)
T ss_pred eeeeecccccc--ceecccCCCCCCCCCCceEEEECCeEEEEeccccCCceecceEeeeccc--ceeeeccccCCCCCCc
Confidence 79999999888 98643221 123334445555544433211 23599999997 778743321 2
Q ss_pred eecceEEeCCEEEEEeCcee-----EeecCCccCCCCCeEEE
Q 040693 343 IYGGASVSNGCIYMGNGYKV-----TVGFGNKNFTSGTSLYA 379 (382)
Q Consensus 343 ~~~~p~~~~g~lyv~~~~g~-----~~~~~~~~~~~g~~l~~ 379 (382)
...++++.+.++++..+... ..-+|.++.. +..|.
T Consensus 267 ~~h~~~~~~~~~~l~gG~~~~~~~~l~~~~~l~~~--~~~w~ 306 (482)
T KOG0379|consen 267 SGHSLTVSGDHLLLFGGGTDPKQEPLGDLYGLDLE--TLVWS 306 (482)
T ss_pred ceeeeEEECCEEEEEcCCccccccccccccccccc--cccee
Confidence 34455577788877655332 3345666665 44444
No 150
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=94.36 E-value=5.5 Score=38.79 Aligned_cols=129 Identities=16% Similarity=0.199 Sum_probs=64.1
Q ss_pred EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecC
Q 040693 216 FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTAD 295 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~ 295 (382)
.+.++|+.+ ..|+.-.+.+ .........+..++.||+...+.. ++ .....++.++....+..|+.-.
T Consensus 190 ~v~~YD~~t--~~W~~~~~~p-~~~~~~~a~v~~~~~iYv~GG~~~-------~~---~~~~~~~~~~~~~~~~~W~~~~ 256 (376)
T PRK14131 190 EVLSYDPST--NQWKNAGESP-FLGTAGSAVVIKGNKLWLINGEIK-------PG---LRTDAVKQGKFTGNNLKWQKLP 256 (376)
T ss_pred eEEEEECCC--CeeeECCcCC-CCCCCcceEEEECCEEEEEeeeEC-------CC---cCChhheEEEecCCCcceeecC
Confidence 588899875 4587643221 111222334446788887654311 11 1123345444444556698643
Q ss_pred --CCCCCC------Ccc-eEEeCCEEEEeeecCC----------C------------cEEEEeCCCCcEeEEEe--cCCc
Q 040693 296 --PSNGTA------PGP-VTVANGVLFGGSTYRQ----------G------------PIYAMDVKTGKILWSYD--TGAT 342 (382)
Q Consensus 296 --~~~~~~------~~~-~~~~~~~v~~~~~~~~----------g------------~l~~ld~~tG~ilw~~~--~~~~ 342 (382)
+..... ... ..+.++.||+... .. + .+.++|+++. .|+.- ++..
T Consensus 257 ~~p~~~~~~~~~~~~~~~a~~~~~~iyv~GG-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~--~W~~~~~lp~~ 333 (376)
T PRK14131 257 DLPPAPGGSSQEGVAGAFAGYSNGVLLVAGG-ANFPGARENYQNGKLYAHEGLKKSWSDEIYALVNG--KWQKVGELPQG 333 (376)
T ss_pred CCCCCCcCCcCCccceEeceeECCEEEEeec-cCCCCChhhhhcCCcccccCCcceeehheEEecCC--cccccCcCCCC
Confidence 221100 011 2344666776542 11 0 1346787765 47542 2222
Q ss_pred -eecceEEeCCEEEEEeCc
Q 040693 343 -IYGGASVSNGCIYMGNGY 360 (382)
Q Consensus 343 -~~~~p~~~~g~lyv~~~~ 360 (382)
...+.+..+++|||..+.
T Consensus 334 r~~~~av~~~~~iyv~GG~ 352 (376)
T PRK14131 334 LAYGVSVSWNNGVLLIGGE 352 (376)
T ss_pred ccceEEEEeCCEEEEEcCC
Confidence 233456689999998764
No 151
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=94.31 E-value=4.4 Score=37.51 Aligned_cols=114 Identities=10% Similarity=0.086 Sum_probs=75.3
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+++|+.+++|......|.++|+..-.+.--. |....-... .+...|++.+ .+..-...|
T Consensus 156 D~~ilT~SGD~TCalWDie~g~~~~~f~GH~----gDV~slsl~p~~~ntFvSg~----------------cD~~aklWD 215 (343)
T KOG0286|consen 156 DNHILTGSGDMTCALWDIETGQQTQVFHGHT----GDVMSLSLSPSDGNTFVSGG----------------CDKSAKLWD 215 (343)
T ss_pred CCceEecCCCceEEEEEcccceEEEEecCCc----ccEEEEecCCCCCCeEEecc----------------cccceeeee
Confidence 6889999999999999999999988877211 111111111 2455666533 235566778
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
.+.|.-+=.+....... .+.-.+.+|..|+... +++....+|+...+.+-.+...
T Consensus 216 ~R~~~c~qtF~ghesDI-Nsv~ffP~G~afatGS-DD~tcRlyDlRaD~~~a~ys~~ 270 (343)
T KOG0286|consen 216 VRSGQCVQTFEGHESDI-NSVRFFPSGDAFATGS-DDATCRLYDLRADQELAVYSHD 270 (343)
T ss_pred ccCcceeEeeccccccc-ceEEEccCCCeeeecC-CCceeEEEeecCCcEEeeeccC
Confidence 88887666666554322 2333356777777654 6899999999988888877644
No 152
>PHA02790 Kelch-like protein; Provisional
Probab=94.29 E-value=6.8 Score=39.58 Aligned_cols=131 Identities=9% Similarity=-0.037 Sum_probs=77.5
Q ss_pred ecEEEEEccC---cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEE
Q 040693 205 HDIVVAVQKS---GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA 281 (382)
Q Consensus 205 ~~~v~~~~~~---g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a 281 (382)
++.||+.++. ..+..+|+.+ ..|..-.+-+ ......+.++-++.||+..+.. .....+.+
T Consensus 318 ~~~iYviGG~~~~~sve~ydp~~--n~W~~~~~l~--~~r~~~~~~~~~g~IYviGG~~-------------~~~~~ve~ 380 (480)
T PHA02790 318 NNKLYVVGGLPNPTSVERWFHGD--AAWVNMPSLL--KPRCNPAVASINNVIYVIGGHS-------------ETDTTTEY 380 (480)
T ss_pred CCEEEEECCcCCCCceEEEECCC--CeEEECCCCC--CCCcccEEEEECCEEEEecCcC-------------CCCccEEE
Confidence 4556554332 3467788754 4687643321 1223344555789999874431 11245788
Q ss_pred EECCCCcEEeeecCCCC-CCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC--C-ceecceEEeCCEEEEE
Q 040693 282 MDASNGNVLWSTADPSN-GTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG--A-TIYGGASVSNGCIYMG 357 (382)
Q Consensus 282 ~d~~tG~~~W~~~~~~~-~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~--~-~~~~~p~~~~g~lyv~ 357 (382)
+|+++. .|+.-.+-. +........-++.+|+.. |...++|+++. .|+.-.+ . ....+.++.+|+||+.
T Consensus 381 ydp~~~--~W~~~~~m~~~r~~~~~~~~~~~IYv~G----G~~e~ydp~~~--~W~~~~~m~~~r~~~~~~v~~~~IYvi 452 (480)
T PHA02790 381 LLPNHD--QWQFGPSTYYPHYKSCALVFGRRLFLVG----RNAEFYCESSN--TWTLIDDPIYPRDNPELIIVDNKLLLI 452 (480)
T ss_pred EeCCCC--EEEeCCCCCCccccceEEEECCEEEEEC----CceEEecCCCC--cEeEcCCCCCCccccEEEEECCEEEEE
Confidence 998765 688743221 122333445688888864 56778898754 6875322 2 2345667899999998
Q ss_pred eCc
Q 040693 358 NGY 360 (382)
Q Consensus 358 ~~~ 360 (382)
.+.
T Consensus 453 GG~ 455 (480)
T PHA02790 453 GGF 455 (480)
T ss_pred CCc
Confidence 764
No 153
>KOG4693 consensus Uncharacterized conserved protein, contains kelch repeat [General function prediction only]
Probab=94.28 E-value=2 Score=39.29 Aligned_cols=67 Identities=16% Similarity=0.230 Sum_probs=39.4
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCC-cccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEee
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGG-AMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWS 292 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~ 292 (382)
...+.+||.+|| .|...+.....+++ .....-+.++.+|+=.... +..+.+-..+++||++| ..|+
T Consensus 215 c~~i~~ld~~T~--aW~r~p~~~~~P~GRRSHS~fvYng~~Y~FGGYn---------g~ln~HfndLy~FdP~t--~~W~ 281 (392)
T KOG4693|consen 215 CDTIMALDLATG--AWTRTPENTMKPGGRRSHSTFVYNGKMYMFGGYN---------GTLNVHFNDLYCFDPKT--SMWS 281 (392)
T ss_pred cceeEEEecccc--ccccCCCCCcCCCcccccceEEEcceEEEecccc---------hhhhhhhcceeeccccc--chhe
Confidence 467999999876 68776554433333 3333334777888642211 11123346699999986 4566
Q ss_pred e
Q 040693 293 T 293 (382)
Q Consensus 293 ~ 293 (382)
.
T Consensus 282 ~ 282 (392)
T KOG4693|consen 282 V 282 (392)
T ss_pred e
Confidence 4
No 154
>PHA02790 Kelch-like protein; Provisional
Probab=94.25 E-value=6.9 Score=39.52 Aligned_cols=102 Identities=11% Similarity=-0.002 Sum_probs=57.9
Q ss_pred ecEEEEEcc-C---cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEE
Q 040693 205 HDIVVAVQK-S---GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWV 280 (382)
Q Consensus 205 ~~~v~~~~~-~---g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~ 280 (382)
++.||+.++ + ..+.++|+.+ -.|+.-.+.+. .....+.++-++.||+.. |.+.
T Consensus 362 ~g~IYviGG~~~~~~~ve~ydp~~--~~W~~~~~m~~--~r~~~~~~~~~~~IYv~G-------------------G~~e 418 (480)
T PHA02790 362 NNVIYVIGGHSETDTTTEYLLPNH--DQWQFGPSTYY--PHYKSCALVFGRRLFLVG-------------------RNAE 418 (480)
T ss_pred CCEEEEecCcCCCCccEEEEeCCC--CEEEeCCCCCC--ccccceEEEECCEEEEEC-------------------CceE
Confidence 566665432 2 3466788875 47987544321 112233445788999862 4467
Q ss_pred EEECCCCcEEeeec--CCCCCCCCcceEEeCCEEEEeeecC----CCcEEEEeCCCCc
Q 040693 281 AMDASNGNVLWSTA--DPSNGTAPGPVTVANGVLFGGSTYR----QGPIYAMDVKTGK 332 (382)
Q Consensus 281 a~d~~tG~~~W~~~--~~~~~~~~~~~~~~~~~v~~~~~~~----~g~l~~ld~~tG~ 332 (382)
++|+++. .|+.- .+.+. .....++-++.+|+..... ...+.++|+++.+
T Consensus 419 ~ydp~~~--~W~~~~~m~~~r-~~~~~~v~~~~IYviGG~~~~~~~~~ve~Yd~~~~~ 473 (480)
T PHA02790 419 FYCESSN--TWTLIDDPIYPR-DNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYS 473 (480)
T ss_pred EecCCCC--cEeEcCCCCCCc-cccEEEEECCEEEEECCcCCCcccceEEEEECCCCe
Confidence 8899765 78753 33322 3334456677788765311 1347777877543
No 155
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=94.10 E-value=3.5 Score=39.53 Aligned_cols=145 Identities=15% Similarity=0.101 Sum_probs=85.8
Q ss_pred EEEEEccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 207 IVVAVQKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 207 ~v~~~~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
++..++.. ..+-.+|.+.++.+|+..-.+....+ ...|.++.+..|+...- ...++. ++..+.|..||+
T Consensus 163 Iva~GGke~~n~lkiwdle~~~qiw~aKNvpnD~L~--LrVPvW~tdi~Fl~g~~--~~~fat-----~T~~hqvR~YDt 233 (412)
T KOG3881|consen 163 IVATGGKENINELKIWDLEQSKQIWSAKNVPNDRLG--LRVPVWITDIRFLEGSP--NYKFAT-----ITRYHQVRLYDT 233 (412)
T ss_pred eEecCchhcccceeeeecccceeeeeccCCCCcccc--ceeeeeeccceecCCCC--CceEEE-----EecceeEEEecC
Confidence 33335554 56777888889999998755432221 22333344444442100 011111 155688999999
Q ss_pred CCC-cEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEE-EecCCceecceEEeCC-EEEEEeCce
Q 040693 285 SNG-NVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWS-YDTGATIYGGASVSNG-CIYMGNGYK 361 (382)
Q Consensus 285 ~tG-~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~-~~~~~~~~~~p~~~~g-~lyv~~~~g 361 (382)
+.+ +++=.+++.....+...+...++.||++.. .+.|..||..+|+.+-. +.--.+...+...+.+ .+..+.+=.
T Consensus 234 ~~qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~--~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~~~~las~GLD 311 (412)
T KOG3881|consen 234 RHQRRPVAQFDFLENPISSTGLTPSGNFIYTGNT--KGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPTHPVLASCGLD 311 (412)
T ss_pred cccCcceeEeccccCcceeeeecCCCcEEEEecc--cchhheecccCceeeccccCCccCCcceEEEcCCCceEEeeccc
Confidence 988 566667766544444444445678999985 99999999999998876 3322234445555555 344444433
Q ss_pred e
Q 040693 362 V 362 (382)
Q Consensus 362 ~ 362 (382)
+
T Consensus 312 R 312 (412)
T KOG3881|consen 312 R 312 (412)
T ss_pred e
Confidence 4
No 156
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=94.03 E-value=5.8 Score=37.82 Aligned_cols=224 Identities=11% Similarity=0.049 Sum_probs=119.1
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
+..+|.+|.++-|++-+.+..+++.. | ..++.+...-.|++-..
T Consensus 105 ee~IyIydI~~MklLhTI~t~~~n~~------g------l~AlS~n~~n~ylAyp~------------------------ 148 (391)
T KOG2110|consen 105 EESIYIYDIKDMKLLHTIETTPPNPK------G------LCALSPNNANCYLAYPG------------------------ 148 (391)
T ss_pred cccEEEEecccceeehhhhccCCCcc------c------eEeeccCCCCceEEecC------------------------
Confidence 55799999999999999988655432 1 12333333322332211
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
....|.|+.+|..+=+.+=........+ ....++.+ +..|-.++.
T Consensus 149 -----s~t~GdV~l~d~~nl~~v~~I~aH~~~l---------------------------Aalafs~~---G~llATASe 193 (391)
T KOG2110|consen 149 -----STTSGDVVLFDTINLQPVNTINAHKGPL---------------------------AALAFSPD---GTLLATASE 193 (391)
T ss_pred -----CCCCceEEEEEcccceeeeEEEecCCce---------------------------eEEEECCC---CCEEEEecc
Confidence 0114899999999888887776443211 11122222 244555555
Q ss_pred Cc-EEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEE
Q 040693 214 SG-FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVL 290 (382)
Q Consensus 214 ~g-~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~ 290 (382)
.| .+..++..+|+.+.+++-+.. ....+..+. +...+-++ ...++||.|.+++-.
T Consensus 194 KGTVIRVf~v~~G~kl~eFRRG~~---~~~IySL~Fs~ds~~L~~s-----------------S~TeTVHiFKL~~~~-- 251 (391)
T KOG2110|consen 194 KGTVIRVFSVPEGQKLYEFRRGTY---PVSIYSLSFSPDSQFLAAS-----------------SNTETVHIFKLEKVS-- 251 (391)
T ss_pred CceEEEEEEcCCccEeeeeeCCce---eeEEEEEEECCCCCeEEEe-----------------cCCCeEEEEEecccc--
Confidence 56 578889999999999996542 112233333 33433333 334789999887554
Q ss_pred eeecCCCCCC---CCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee-----cceEEeCCEEEEEeCcee
Q 040693 291 WSTADPSNGT---APGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY-----GGASVSNGCIYMGNGYKV 362 (382)
Q Consensus 291 W~~~~~~~~~---~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~-----~~p~~~~g~lyv~~~~g~ 362 (382)
......+.. +..++..+ -.-|.++. -..+ +| +.+.--..+++.... ..+.-...+++|.+.+|.
T Consensus 252 -~~~~~~p~~~~~~~~~~sk~-~~sylps~--V~~~--~~--~~R~FAt~~l~~s~~~~~~~l~~~~~~~~v~vas~dG~ 323 (391)
T KOG2110|consen 252 -NNPPESPTAGTSWFGKVSKA-ATSYLPSQ--VSSV--LD--QSRKFATAKLPESGRKNICSLSSIQKIPRVLVASYDGH 323 (391)
T ss_pred -cCCCCCCCCCCcccchhhhh-hhhhcchh--hhhh--hh--hccceeEEEccCCCccceEEeeccCCCCEEEEEEcCCe
Confidence 111111000 11111000 01133331 1112 22 223333344443322 112225689999999887
Q ss_pred EeecCCccCCCCCeEEE
Q 040693 363 TVGFGNKNFTSGTSLYA 379 (382)
Q Consensus 363 ~~~~~~~~~~~g~~l~~ 379 (382)
.+.|++++++|-..+.
T Consensus 324 -~y~y~l~~~~gGec~l 339 (391)
T KOG2110|consen 324 -LYSYRLPPKEGGECAL 339 (391)
T ss_pred -EEEEEcCCCCCceeEE
Confidence 5779999987776654
No 157
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=93.94 E-value=1 Score=44.44 Aligned_cols=172 Identities=8% Similarity=0.032 Sum_probs=96.6
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
+|.|...|+.+-.++-+++-.... .-.| ++..+ +-.|+.+.-|..|.|+|
T Consensus 530 dGnI~vwDLhnq~~VrqfqGhtDG--------------------------ascI-dis~d---GtklWTGGlDntvRcWD 579 (705)
T KOG0639|consen 530 DGNIAVWDLHNQTLVRQFQGHTDG--------------------------ASCI-DISKD---GTKLWTGGLDNTVRCWD 579 (705)
T ss_pred CCcEEEEEcccceeeecccCCCCC--------------------------ceeE-EecCC---CceeecCCCccceeehh
Confidence 389999999877777776654331 1111 22222 46789999999999999
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTA 301 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~ 301 (382)
..+|+.+-+.++.....+ .+-.-.++.|-++..+ +.+..+. .+|..+...-.... .
T Consensus 580 lregrqlqqhdF~SQIfS----Lg~cP~~dWlavGMen-----------------s~vevlh-~skp~kyqlhlheS--c 635 (705)
T KOG0639|consen 580 LREGRQLQQHDFSSQIFS----LGYCPTGDWLAVGMEN-----------------SNVEVLH-TSKPEKYQLHLHES--C 635 (705)
T ss_pred hhhhhhhhhhhhhhhhee----cccCCCccceeeeccc-----------------CcEEEEe-cCCccceeeccccc--E
Confidence 999999988886432111 1111134455555333 3344444 24555555444331 1
Q ss_pred CcceEEe-CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeCceeEeecCC
Q 040693 302 PGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNGYKVTVGFGN 368 (382)
Q Consensus 302 ~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~~~~~~~ 368 (382)
.-.+-++ =|..|+.+. .+.-|-+...--|.-+.+.+....+.+.-+..|++++|+.+...-..+|+
T Consensus 636 VLSlKFa~cGkwfvStG-kDnlLnawrtPyGasiFqskE~SsVlsCDIS~ddkyIVTGSGdkkATVYe 702 (705)
T KOG0639|consen 636 VLSLKFAYCGKWFVSTG-KDNLLNAWRTPYGASIFQSKESSSVLSCDISFDDKYIVTGSGDKKATVYE 702 (705)
T ss_pred EEEEEecccCceeeecC-chhhhhhccCccccceeeccccCcceeeeeccCceEEEecCCCcceEEEE
Confidence 1222222 366666653 44555555544466666665555555555667888888755433333444
No 158
>PLN02153 epithiospecifier protein
Probab=93.88 E-value=6.3 Score=37.72 Aligned_cols=134 Identities=16% Similarity=0.196 Sum_probs=69.7
Q ss_pred EEEEEeCCCCCeeeeeccCCC--C-CCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEee
Q 040693 216 FAWALDRDSGSLIWSMEAGPG--G-LGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWS 292 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~--~-~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~ 292 (382)
.++++|..+ ..|+.-.... . .........++.++.+|+........... .+ ..-..+.++|+++. .|+
T Consensus 102 ~v~~yd~~t--~~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~-~~----~~~~~v~~yd~~~~--~W~ 172 (341)
T PLN02153 102 DFYSYDTVK--NEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMK-TP----ERFRTIEAYNIADG--KWV 172 (341)
T ss_pred cEEEEECCC--CEEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccC-CC----cccceEEEEECCCC--eEe
Confidence 588999875 5687532210 0 11122233345677888764432110000 00 11246899999865 588
Q ss_pred ecCCC----CCCCCcceEEeCCEEEEeeec------------CCCcEEEEeCCCCcEeEEEec-----CC-ceecceEEe
Q 040693 293 TADPS----NGTAPGPVTVANGVLFGGSTY------------RQGPIYAMDVKTGKILWSYDT-----GA-TIYGGASVS 350 (382)
Q Consensus 293 ~~~~~----~~~~~~~~~~~~~~v~~~~~~------------~~g~l~~ld~~tG~ilw~~~~-----~~-~~~~~p~~~ 350 (382)
.-... .........+.++.+|+.... ....++++|+++.+ |+.-. +. ....+.++.
T Consensus 173 ~l~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~~--W~~~~~~g~~P~~r~~~~~~~~ 250 (341)
T PLN02153 173 QLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASGK--WTEVETTGAKPSARSVFAHAVV 250 (341)
T ss_pred eCCCCCCCCCCCCcceEEEECCeEEEEeccccccccCCccceecCceEEEEcCCCc--EEeccccCCCCCCcceeeeEEE
Confidence 63211 111222334456666663210 02468999988544 76421 21 233455678
Q ss_pred CCEEEEEeCc
Q 040693 351 NGCIYMGNGY 360 (382)
Q Consensus 351 ~g~lyv~~~~ 360 (382)
+++|||..+.
T Consensus 251 ~~~iyv~GG~ 260 (341)
T PLN02153 251 GKYIIIFGGE 260 (341)
T ss_pred CCEEEEECcc
Confidence 9999998774
No 159
>COG3419 PilY1 Tfp pilus assembly protein, tip-associated adhesin PilY1 [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=93.73 E-value=1.2 Score=47.80 Aligned_cols=175 Identities=17% Similarity=0.169 Sum_probs=90.6
Q ss_pred eecEEEEEccCcEEEEEeCCCCCeeeeeccCCCC-----CCCCcc-cceeeeCCeEEEEecCcc--ccccccCCCCCCCC
Q 040693 204 KHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGG-----LGGGAM-WGAATDERRIYTNIANSQ--HKNFNLKPSKNSTI 275 (382)
Q Consensus 204 ~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~-----~~g~~~-~~~~~~~~~v~~~~~~~~--~~~~~~~~~~~~~~ 275 (382)
+...|+++.+||.|+++|..+|+.+..+-+.... ...... ......++...+...... .+.+++ +.....
T Consensus 581 R~~~VyvgandGmLhaFd~~tG~E~fA~~P~avl~~l~~~t~~~y~~h~yyVDg~p~~~da~~ng~wrsvL~--g~~G~G 658 (1036)
T COG3419 581 RAPVVYVGANDGMLHAFDANTGSERFAYVPSAVLSTLHSLTAPGYTAHQYYVDGSPTAADAYDNGQWRSVLV--GGLGAG 658 (1036)
T ss_pred ccceEEEecCCceeeeccCCccceeeecCcHHHHhhhhhhcCCCcccccceecCCceeehhhcCCcceEEEE--eecCCC
Confidence 4678999999999999999999999988754210 000000 001112222222221111 111111 111133
Q ss_pred CceEEEEECCCC-----cEEeeecCCC----CCCCCcceEE--eCCE--EEEeeec----CCCcEEEEeCCCC----cEe
Q 040693 276 AGGWVAMDASNG-----NVLWSTADPS----NGTAPGPVTV--ANGV--LFGGSTY----RQGPIYAMDVKTG----KIL 334 (382)
Q Consensus 276 ~g~v~a~d~~tG-----~~~W~~~~~~----~~~~~~~~~~--~~~~--v~~~~~~----~~g~l~~ld~~tG----~il 334 (382)
...++|+|+.+= +++|.+.... +..+..|.++ .++. |+++... ..-.++.+++.++ ++-
T Consensus 659 G~glyALDVTdP~~~~~~~Lw~~~~~d~~~LG~t~gkP~Iv~l~~gswavl~GNGynS~~n~~al~~~~L~t~~~~~~~~ 738 (1036)
T COG3419 659 GRGLYALDVTDPDFSNSNLLWENNSNDDPDLGYTMGKPRIVPLHDGSWAVLLGNGYNSPANGAALLVLNLLTLDATRKVP 738 (1036)
T ss_pred CceeEEEEccCccccCCcchhcccCCCccccccccCCCeEEEcCCCceEEEEccCCCCCCCCcceEEEEeecCCcceeEE
Confidence 466999998754 5889876543 2234455544 2443 2222211 1234677776654 445
Q ss_pred EEEecCCceecceEE-eCCEEEE----EeCceeEeecCCccCCCCCeEEEEEC
Q 040693 335 WSYDTGATIYGGASV-SNGCIYM----GNGYKVTVGFGNKNFTSGTSLYAFCV 382 (382)
Q Consensus 335 w~~~~~~~~~~~p~~-~~g~lyv----~~~~g~~~~~~~~~~~~g~~l~~~~~ 382 (382)
|+.-........|.. ..+-+++ .+.+|.+-..|+-|. +=-||.|+|
T Consensus 739 v~~g~~~~~g~~P~~~~~g~~~~~~~d~~~dG~vd~aYAGDl--~GnlWRFdL 789 (1036)
T COG3419 739 VQSGTGYGAGVSPVCVGVGGLDVAVLDLDGDGIVDYAYAGDL--GGNLWRFDL 789 (1036)
T ss_pred EeccCCccccccCccccccccccceeecCCCceEEEEEeecc--CCcEEEEEe
Confidence 555444333333433 3333333 355666666677665 666888875
No 160
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=93.34 E-value=7.7 Score=37.01 Aligned_cols=150 Identities=16% Similarity=0.186 Sum_probs=92.2
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+..+++.-... +|.+|..+-+++=.++..++...|.....+-..+..+-.. . ....|.|+.+|.
T Consensus 97 r~RLvV~Lee~-IyIydI~~MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp--~-------------s~t~GdV~l~d~ 160 (391)
T KOG2110|consen 97 RKRLVVCLEES-IYIYDIKDMKLLHTIETTPPNPKGLCALSPNNANCYLAYP--G-------------STTSGDVVLFDT 160 (391)
T ss_pred cceEEEEEccc-EEEEecccceeehhhhccCCCccceEeeccCCCCceEEec--C-------------CCCCceEEEEEc
Confidence 44555544333 9999999999998887654322222222221122222221 1 134688999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEEe-CCEEEEeeecCCCc-EEEEeCCCCcEeEEEecCCc--eecceEE-eCCEEEEEeC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVA-NGVLFGGSTYRQGP-IYAMDVKTGKILWSYDTGAT--IYGGASV-SNGCIYMGNG 359 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~g~-l~~ld~~tG~ilw~~~~~~~--~~~~p~~-~~g~lyv~~~ 359 (382)
.+-+..=.++..... -..+.+. +|.+.+... ++|. |.+|...+|+.+.++.-+.- ...+.+. .+..+..+++
T Consensus 161 ~nl~~v~~I~aH~~~--lAalafs~~G~llATAS-eKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS 237 (391)
T KOG2110|consen 161 INLQPVNTINAHKGP--LAALAFSPDGTLLATAS-EKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASS 237 (391)
T ss_pred ccceeeeEEEecCCc--eeEEEECCCCCEEEEec-cCceEEEEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEec
Confidence 999999888866532 2233332 455555443 4564 67889999999999987642 2233444 4556666777
Q ss_pred ceeEeecCCccCCC
Q 040693 360 YKVTVGFGNKNFTS 373 (382)
Q Consensus 360 ~g~~~~~~~~~~~~ 373 (382)
+-.++|+|.++...
T Consensus 238 ~TeTVHiFKL~~~~ 251 (391)
T KOG2110|consen 238 NTETVHIFKLEKVS 251 (391)
T ss_pred CCCeEEEEEecccc
Confidence 77789998887643
No 161
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=93.22 E-value=8.7 Score=37.33 Aligned_cols=131 Identities=14% Similarity=0.127 Sum_probs=58.5
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
...|+.+|..||+..=-++... ..+...+.|. +..+|...... |- ...+.+|+.++ ..|+..|..
T Consensus 167 ~~~i~~idl~tG~~~~v~~~~~--wlgH~~fsP~-dp~li~fCHEG---------pw--~~Vd~RiW~i~-~dg~~~~~v 231 (386)
T PF14583_consen 167 HCRIFTIDLKTGERKVVFEDTD--WLGHVQFSPT-DPTLIMFCHEG---------PW--DLVDQRIWTIN-TDGSNVKKV 231 (386)
T ss_dssp -EEEEEEETTT--EEEEEEESS---EEEEEEETT-EEEEEEEEE-S----------T--TTSS-SEEEEE-TTS---EES
T ss_pred CceEEEEECCCCceeEEEecCc--cccCcccCCC-CCCEEEEeccC---------Cc--ceeceEEEEEE-cCCCcceee
Confidence 4679999999998654444321 2233333332 33333332111 11 12346799999 567777776
Q ss_pred cCCCCC-CCCcceEE-eCC-EEEEeee--cCCCcEEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEEeCce
Q 040693 294 ADPSNG-TAPGPVTV-ANG-VLFGGST--YRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMGNGYK 361 (382)
Q Consensus 294 ~~~~~~-~~~~~~~~-~~~-~v~~~~~--~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~~~~g 361 (382)
...... ..+-..-. ++. +.|..-. +.+-.|+.+|++|++..|-.+.+. .++... .+++|+|+.+.+
T Consensus 232 ~~~~~~e~~gHEfw~~DG~~i~y~~~~~~~~~~~i~~~d~~t~~~~~~~~~p~--~~H~~ss~Dg~L~vGDG~d 303 (386)
T PF14583_consen 232 HRRMEGESVGHEFWVPDGSTIWYDSYTPGGQDFWIAGYDPDTGERRRLMEMPW--CSHFMSSPDGKLFVGDGGD 303 (386)
T ss_dssp S---TTEEEEEEEE-TTSS-EEEEEEETTT--EEEEEE-TTT--EEEEEEE-S--EEEEEE-TTSSEEEEEE--
T ss_pred ecCCCCcccccccccCCCCEEEEEeecCCCCceEEEeeCCCCCCceEEEeCCc--eeeeEEcCCCCEEEecCCC
Confidence 532210 00111111 233 3443321 012358899999999887666653 334444 688999987654
No 162
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=93.20 E-value=5.7 Score=39.23 Aligned_cols=79 Identities=18% Similarity=0.162 Sum_probs=51.8
Q ss_pred cccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCC
Q 040693 242 AMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQG 321 (382)
Q Consensus 242 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g 321 (382)
....|.+.+.+||+.+.. ..-|+++..|+.....+-++++.. +.......++.+++... .|
T Consensus 227 ~vS~PmIV~~RvYFlsD~--------------eG~GnlYSvdldGkDlrrHTnFtd--YY~R~~nsDGkrIvFq~---~G 287 (668)
T COG4946 227 NVSSPMIVGERVYFLSDH--------------EGVGNLYSVDLDGKDLRRHTNFTD--YYPRNANSDGKRIVFQN---AG 287 (668)
T ss_pred CcCCceEEcceEEEEecc--------------cCccceEEeccCCchhhhcCCchh--ccccccCCCCcEEEEec---CC
Confidence 445667788888887654 446889999997555555556543 23334445667777765 79
Q ss_pred cEEEEeCCCCcEeEEEecC
Q 040693 322 PIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 322 ~l~~ld~~tG~ilw~~~~~ 340 (382)
.||.+|++|-+ +-+.+++
T Consensus 288 dIylydP~td~-lekldI~ 305 (668)
T COG4946 288 DIYLYDPETDS-LEKLDIG 305 (668)
T ss_pred cEEEeCCCcCc-ceeeecC
Confidence 99999998643 3344443
No 163
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=93.16 E-value=9.1 Score=37.74 Aligned_cols=136 Identities=11% Similarity=0.039 Sum_probs=75.0
Q ss_pred CcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEE
Q 040693 141 HSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWAL 220 (382)
Q Consensus 141 ~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~al 220 (382)
|++.|...|..+-. -|...+.... +-.-+++- .++.+|..+++ ..+..+
T Consensus 174 YDg~vrl~DtR~~~-~~v~elnhg~------------------------pVe~vl~l-----psgs~iasAgG-n~vkVW 222 (487)
T KOG0310|consen 174 YDGKVRLWDTRSLT-SRVVELNHGC------------------------PVESVLAL-----PSGSLIASAGG-NSVKVW 222 (487)
T ss_pred CCceEEEEEeccCC-ceeEEecCCC------------------------ceeeEEEc-----CCCCEEEEcCC-CeEEEE
Confidence 57999999887654 5655554331 11222221 11344444444 348889
Q ss_pred eCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCC
Q 040693 221 DRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGT 300 (382)
Q Consensus 221 d~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~ 300 (382)
|..+|..+=....+- .........+.++..++.+ ..++.|-.||..+-|.......+.+ .
T Consensus 223 Dl~~G~qll~~~~~H--~KtVTcL~l~s~~~rLlS~-----------------sLD~~VKVfd~t~~Kvv~s~~~~~p-v 282 (487)
T KOG0310|consen 223 DLTTGGQLLTSMFNH--NKTVTCLRLASDSTRLLSG-----------------SLDRHVKVFDTTNYKVVHSWKYPGP-V 282 (487)
T ss_pred EecCCceehhhhhcc--cceEEEEEeecCCceEeec-----------------ccccceEEEEccceEEEEeeecccc-e
Confidence 988665443322210 0000111112244556655 4568999999999999888887763 2
Q ss_pred CCcceEEeCCEEEEeeecCCCcEEEEeCC
Q 040693 301 APGPVTVANGVLFGGSTYRQGPIYAMDVK 329 (382)
Q Consensus 301 ~~~~~~~~~~~v~~~~~~~~g~l~~ld~~ 329 (382)
....+.-++..++++.. +|-+..-+..
T Consensus 283 Lsiavs~dd~t~viGms--nGlv~~rr~~ 309 (487)
T KOG0310|consen 283 LSIAVSPDDQTVVIGMS--NGLVSIRRRE 309 (487)
T ss_pred eeEEecCCCceEEEecc--cceeeeehhh
Confidence 33333334567777774 6766655433
No 164
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=93.12 E-value=1.2 Score=43.54 Aligned_cols=111 Identities=17% Similarity=0.144 Sum_probs=65.6
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
=+++.++..|++.=.|..+|++.-++..... ....+--. .+..+-++ ...|+|..+.+
T Consensus 222 fLL~~~~~~G~L~Y~DVS~GklVa~~~t~~G----~~~vm~qNP~NaVih~G-----------------hsnGtVSlWSP 280 (545)
T KOG1272|consen 222 FLLVAASEAGFLKYQDVSTGKLVASIRTGAG----RTDVMKQNPYNAVIHLG-----------------HSNGTVSLWSP 280 (545)
T ss_pred heeeecccCCceEEEeechhhhhHHHHccCC----ccchhhcCCccceEEEc-----------------CCCceEEecCC
Confidence 3455666789999999999999998886532 11111000 22333333 23466777777
Q ss_pred CCCcEEeeecCCCCCCCCcceEEeC-CEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVAN-GVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
..-+++=++--..+ ....++++. |...+.+ +.+..+-..|+.+-+.+-.+..+
T Consensus 281 ~skePLvKiLcH~g--~V~siAv~~~G~YMaTt-G~Dr~~kIWDlR~~~ql~t~~tp 334 (545)
T KOG1272|consen 281 NSKEPLVKILCHRG--PVSSIAVDRGGRYMATT-GLDRKVKIWDLRNFYQLHTYRTP 334 (545)
T ss_pred CCcchHHHHHhcCC--CcceEEECCCCcEEeec-ccccceeEeeeccccccceeecC
Confidence 76666644433222 244556664 5444443 36788999999877766655553
No 165
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=93.10 E-value=6.2 Score=39.19 Aligned_cols=137 Identities=11% Similarity=0.102 Sum_probs=84.6
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
..+.+....||.+..+|..+-.+.-+++--+. |....-..-|+..|+.+ ..++.|.++|+
T Consensus 521 akvcFsccsdGnI~vwDLhnq~~VrqfqGhtD---GascIdis~dGtklWTG-----------------GlDntvRcWDl 580 (705)
T KOG0639|consen 521 AKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTD---GASCIDISKDGTKLWTG-----------------GLDNTVRCWDL 580 (705)
T ss_pred cceeeeeccCCcEEEEEcccceeeecccCCCC---CceeEEecCCCceeecC-----------------CCccceeehhh
Confidence 35666778899999999998777666653221 11111111145566665 45688999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceE-EeCCEEEEEeCceeE
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGAS-VSNGCIYMGNGYKVT 363 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~-~~~g~lyv~~~~g~~ 363 (382)
++|+.+=+.++....++-+-. -.++.|.++.. ++.+.++.. ++....++.......-+.- ..=|+-||+++..+.
T Consensus 581 regrqlqqhdF~SQIfSLg~c-P~~dWlavGMe--ns~vevlh~-skp~kyqlhlheScVLSlKFa~cGkwfvStGkDnl 656 (705)
T KOG0639|consen 581 REGRQLQQHDFSSQIFSLGYC-PTGDWLAVGME--NSNVEVLHT-SKPEKYQLHLHESCVLSLKFAYCGKWFVSTGKDNL 656 (705)
T ss_pred hhhhhhhhhhhhhhheecccC-CCccceeeecc--cCcEEEEec-CCccceeecccccEEEEEEecccCceeeecCchhh
Confidence 999998887776533222211 12466777764 778888874 4666666655443333332 255888888877664
Q ss_pred ee
Q 040693 364 VG 365 (382)
Q Consensus 364 ~~ 365 (382)
..
T Consensus 657 Ln 658 (705)
T KOG0639|consen 657 LN 658 (705)
T ss_pred hh
Confidence 33
No 166
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=93.10 E-value=2.4 Score=39.03 Aligned_cols=142 Identities=14% Similarity=0.169 Sum_probs=87.5
Q ss_pred EEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEECC
Q 040693 207 IVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDAS 285 (382)
Q Consensus 207 ~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~ 285 (382)
+-+.....+.+=-||++||+.. .+.++. |..-.+..+ -++..++. +....|..+|++
T Consensus 75 VWft~qg~gaiGhLdP~tGev~-~ypLg~----Ga~Phgiv~gpdg~~Wit-----------------d~~~aI~R~dpk 132 (353)
T COG4257 75 VWFTAQGTGAIGHLDPATGEVE-TYPLGS----GASPHGIVVGPDGSAWIT-----------------DTGLAIGRLDPK 132 (353)
T ss_pred eEEecCccccceecCCCCCceE-EEecCC----CCCCceEEECCCCCeeEe-----------------cCcceeEEecCc
Confidence 3345566678888999999764 345443 223333333 34555654 333479999999
Q ss_pred CCcE-EeeecCCCCCCCCcceEEe-CCEEEEeeecCCCcEEEEeCCCCcE-eEEEecCCceecceEEeCCEEEEEeCcee
Q 040693 286 NGNV-LWSTADPSNGTAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKI-LWSYDTGATIYGGASVSNGCIYMGNGYKV 362 (382)
Q Consensus 286 tG~~-~W~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~i-lw~~~~~~~~~~~p~~~~g~lyv~~~~g~ 362 (382)
|++. .|...........-...++ .+++++... .|.-=-||+.++.+ +|..+.+.+-++-.+..+|.||+++-.|+
T Consensus 133 t~evt~f~lp~~~a~~nlet~vfD~~G~lWFt~q--~G~yGrLdPa~~~i~vfpaPqG~gpyGi~atpdGsvwyaslagn 210 (353)
T COG4257 133 TLEVTRFPLPLEHADANLETAVFDPWGNLWFTGQ--IGAYGRLDPARNVISVFPAPQGGGPYGICATPDGSVWYASLAGN 210 (353)
T ss_pred ccceEEeecccccCCCcccceeeCCCccEEEeec--cccceecCcccCceeeeccCCCCCCcceEECCCCcEEEEecccc
Confidence 9864 5665543322222333455 467777652 55555788886654 67777665566666778999999988777
Q ss_pred EeecCCccCCCC
Q 040693 363 TVGFGNKNFTSG 374 (382)
Q Consensus 363 ~~~~~~~~~~~g 374 (382)
. +-.+|+.+|
T Consensus 211 a--iaridp~~~ 220 (353)
T COG4257 211 A--IARIDPFAG 220 (353)
T ss_pred c--eEEcccccC
Confidence 3 344666555
No 167
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=93.04 E-value=3.4 Score=38.31 Aligned_cols=104 Identities=13% Similarity=0.087 Sum_probs=61.6
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
+..|+.+|.+||+.+-+++....-+- . ..|.-. .-..|..+..|+++..+|
T Consensus 111 Dk~v~~wD~~tG~~~rk~k~h~~~vN--s--------------------~~p~rr-------g~~lv~SgsdD~t~kl~D 161 (338)
T KOG0265|consen 111 DKTVRGWDAETGKRIRKHKGHTSFVN--S--------------------LDPSRR-------GPQLVCSGSDDGTLKLWD 161 (338)
T ss_pred CceEEEEecccceeeehhccccceee--e--------------------cCcccc-------CCeEEEecCCCceEEEEe
Confidence 48999999999999999887644100 0 011111 124556677789999999
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
..+.+.+-.++... ...+... ....++.+ ..++.|...|++.++.+.......
T Consensus 162 ~R~k~~~~t~~~ky------qltAv~f~d~s~qv~sg-----------------gIdn~ikvWd~r~~d~~~~lsGh~ 216 (338)
T KOG0265|consen 162 IRKKEAIKTFENKY------QLTAVGFKDTSDQVISG-----------------GIDNDIKVWDLRKNDGLYTLSGHA 216 (338)
T ss_pred ecccchhhccccce------eEEEEEecccccceeec-----------------cccCceeeeccccCcceEEeeccc
Confidence 88665555443211 0011111 33455554 234667777887777777766544
No 168
>PLN02193 nitrile-specifier protein
Probab=92.77 E-value=12 Score=37.69 Aligned_cols=104 Identities=10% Similarity=0.056 Sum_probs=54.5
Q ss_pred EEEEEeCCCCCeeeeeccCCCC-CCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec
Q 040693 216 FAWALDRDSGSLIWSMEAGPGG-LGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA 294 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~~-~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~ 294 (382)
.+.++|+.+ ..|+.-..+.. .........++.++.+|+...... .....++++|+++.+ |+..
T Consensus 295 ~~~~yd~~t--~~W~~~~~~~~~~~~R~~~~~~~~~gkiyviGG~~g------------~~~~dv~~yD~~t~~--W~~~ 358 (470)
T PLN02193 295 TLDSYNIVD--KKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNG------------CEVDDVHYYDPVQDK--WTQV 358 (470)
T ss_pred eEEEEECCC--CEEEeCCCCCCCCCCCCCcEEEEECCcEEEEECCCC------------CccCceEEEECCCCE--EEEe
Confidence 578899886 46875432211 111122233345667776533211 123569999998764 8764
Q ss_pred CCC---C-CCCCcceEEeCCEEEEeeecC-------------CCcEEEEeCCCCcEeEEE
Q 040693 295 DPS---N-GTAPGPVTVANGVLFGGSTYR-------------QGPIYAMDVKTGKILWSY 337 (382)
Q Consensus 295 ~~~---~-~~~~~~~~~~~~~v~~~~~~~-------------~g~l~~ld~~tG~ilw~~ 337 (382)
... + .......+..++.+|+..... ...++++|+.+. .|+.
T Consensus 359 ~~~g~~P~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~ndv~~~D~~t~--~W~~ 416 (470)
T PLN02193 359 ETFGVRPSERSVFASAAVGKHIVIFGGEIAMDPLAHVGPGQLTDGTFALDTETL--QWER 416 (470)
T ss_pred ccCCCCCCCcceeEEEEECCEEEEECCccCCccccccCccceeccEEEEEcCcC--EEEE
Confidence 321 0 112233344566666654211 124899998854 4763
No 169
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=92.74 E-value=3.7 Score=39.74 Aligned_cols=146 Identities=15% Similarity=0.108 Sum_probs=82.5
Q ss_pred CceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCC
Q 040693 192 APMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSK 271 (382)
Q Consensus 192 ~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 271 (382)
.|+.+-+-.++ +..+++...+..+...|+.||..+-.++-+-.....+..|.| |+-.++++
T Consensus 270 ~~V~yi~wSPD--dryLlaCg~~e~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~p--Dg~~~V~G--------------- 330 (519)
T KOG0293|consen 270 QPVSYIMWSPD--DRYLLACGFDEVLSLWDVDTGDLRHLYPSGLGFSVSSCAWCP--DGFRFVTG--------------- 330 (519)
T ss_pred CceEEEEECCC--CCeEEecCchHheeeccCCcchhhhhcccCcCCCcceeEEcc--CCceeEec---------------
Confidence 45555432222 456777777777999999999988887755222222233443 55555554
Q ss_pred CCCCCceEEEEECCCCcE--EeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE
Q 040693 272 NSTIAGGWVAMDASNGNV--LWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV 349 (382)
Q Consensus 272 ~~~~~g~v~a~d~~tG~~--~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~ 349 (382)
..++.+.+.|. +|++ .|+.-.. +....-.++.++..++..+. +..+..++..+...+-......++.+--+.
T Consensus 331 --s~dr~i~~wdl-Dgn~~~~W~gvr~-~~v~dlait~Dgk~vl~v~~--d~~i~l~~~e~~~dr~lise~~~its~~iS 404 (519)
T KOG0293|consen 331 --SPDRTIIMWDL-DGNILGNWEGVRD-PKVHDLAITYDGKYVLLVTV--DKKIRLYNREARVDRGLISEEQPITSFSIS 404 (519)
T ss_pred --CCCCcEEEecC-Ccchhhccccccc-ceeEEEEEcCCCcEEEEEec--ccceeeechhhhhhhccccccCceeEEEEc
Confidence 34477888886 4544 4553211 11233444456667777664 777777776654443333333444444455
Q ss_pred eCCEEEEEeCcee
Q 040693 350 SNGCIYMGNGYKV 362 (382)
Q Consensus 350 ~~g~lyv~~~~g~ 362 (382)
.++++++.+-...
T Consensus 405 ~d~k~~LvnL~~q 417 (519)
T KOG0293|consen 405 KDGKLALVNLQDQ 417 (519)
T ss_pred CCCcEEEEEcccC
Confidence 6677766654433
No 170
>PRK01029 tolB translocation protein TolB; Provisional
Probab=92.72 E-value=11 Score=37.34 Aligned_cols=69 Identities=10% Similarity=-0.048 Sum_probs=36.2
Q ss_pred eCCEEEEeee-cCCCcEEEEeCCCCcEeEEEecCCceecceEEe-C-CEEEEEeCceeEeecCCccCCCCCeE
Q 040693 308 ANGVLFGGST-YRQGPIYAMDVKTGKILWSYDTGATIYGGASVS-N-GCIYMGNGYKVTVGFGNKNFTSGTSL 377 (382)
Q Consensus 308 ~~~~v~~~~~-~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~-~-g~lyv~~~~g~~~~~~~~~~~~g~~l 377 (382)
++..++.... .....|+.+|+++|++.--... .....+|... + ..|++....+....+|.++..+|+..
T Consensus 337 DG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~~-~~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~~ 408 (428)
T PRK01029 337 DGKKIAFCSVIKGVRQICVYDLATGRDYQLTTS-PENKESPSWAIDSLHLVYSAGNSNESELYLISLITKKTR 408 (428)
T ss_pred CCCEEEEEEcCCCCcEEEEEECCCCCeEEccCC-CCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEE
Confidence 3444444432 1124699999999886432222 1223445543 3 44555544333345677777666654
No 171
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=92.62 E-value=1.6 Score=42.24 Aligned_cols=123 Identities=15% Similarity=0.144 Sum_probs=58.1
Q ss_pred EEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCC
Q 040693 219 ALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADP 296 (382)
Q Consensus 219 ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~ 296 (382)
-.|+.||..+-+....+.....-....... ++..+++.+.. ....+++.+|++||+..=-++.+
T Consensus 14 ~~D~~TG~~VtrLT~~~~~~h~~YF~~~~ft~dG~kllF~s~~--------------dg~~nly~lDL~t~~i~QLTdg~ 79 (386)
T PF14583_consen 14 WIDPDTGHRVTRLTPPDGHSHRLYFYQNCFTDDGRKLLFASDF--------------DGNRNLYLLDLATGEITQLTDGP 79 (386)
T ss_dssp EE-TTT--EEEE-S-TTS-EE---TTS--B-TTS-EEEEEE-T--------------TSS-EEEEEETTT-EEEE---SS
T ss_pred EeCCCCCceEEEecCCCCcccceeecCCCcCCCCCEEEEEecc--------------CCCcceEEEEcccCEEEECccCC
Confidence 368899988877765432111112222222 45555554332 34577999999999998777765
Q ss_pred CCCCCCcceEEeCCEE-EEeeecCCCcEEEEeCCCCcEeEEEecCCcee--cceEE-eCCEEEEEe
Q 040693 297 SNGTAPGPVTVANGVL-FGGSTYRQGPIYAMDVKTGKILWSYDTGATIY--GGASV-SNGCIYMGN 358 (382)
Q Consensus 297 ~~~~~~~~~~~~~~~v-~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~--~~p~~-~~g~lyv~~ 358 (382)
....-+..+...+..+ |+-. ...|..+|++|.|+.--+..+..+- .+.++ .+++++++.
T Consensus 80 g~~~~g~~~s~~~~~~~Yv~~---~~~l~~vdL~T~e~~~vy~~p~~~~g~gt~v~n~d~t~~~g~ 142 (386)
T PF14583_consen 80 GDNTFGGFLSPDDRALYYVKN---GRSLRRVDLDTLEERVVYEVPDDWKGYGTWVANSDCTKLVGI 142 (386)
T ss_dssp -B-TTT-EE-TTSSEEEEEET---TTEEEEEETTT--EEEEEE--TTEEEEEEEEE-TTSSEEEEE
T ss_pred CCCccceEEecCCCeEEEEEC---CCeEEEEECCcCcEEEEEECCcccccccceeeCCCccEEEEE
Confidence 4222233332234444 5544 5789999999998865556665433 33333 467777663
No 172
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=92.56 E-value=9.7 Score=36.16 Aligned_cols=100 Identities=10% Similarity=0.105 Sum_probs=58.2
Q ss_pred CceEEEEECCCCcEEeeecCCCCCCCCcc--eEEe--CCEEEEeeecCCCcEEEEeCCC--CcEeEEEe---cCCce---
Q 040693 276 AGGWVAMDASNGNVLWSTADPSNGTAPGP--VTVA--NGVLFGGSTYRQGPIYAMDVKT--GKILWSYD---TGATI--- 343 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~~~~~~~~--~~~~--~~~v~~~~~~~~g~l~~ld~~t--G~ilw~~~---~~~~~--- 343 (382)
..+|..++...|++.=....... ..++| +++- +..+|+... -++.|.++..+. |+..---. ++..+
T Consensus 166 ~Dri~~y~~~dg~L~~~~~~~v~-~G~GPRHi~FHpn~k~aY~v~E-L~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~ 243 (346)
T COG2706 166 TDRIFLYDLDDGKLTPADPAEVK-PGAGPRHIVFHPNGKYAYLVNE-LNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGT 243 (346)
T ss_pred CceEEEEEcccCccccccccccC-CCCCcceEEEcCCCcEEEEEec-cCCEEEEEEEcCCCceEEEeeeeccCccccCCC
Confidence 45689999998876554432221 12333 3332 357888764 567777666554 54432221 22221
Q ss_pred -ecceEE---eCCEEEEEeCceeEeecCCccCCCCCeE
Q 040693 344 -YGGASV---SNGCIYMGNGYKVTVGFGNKNFTSGTSL 377 (382)
Q Consensus 344 -~~~p~~---~~g~lyv~~~~g~~~~~~~~~~~~g~~l 377 (382)
..+.+. .+.-||+++........|.++..+|++.
T Consensus 244 ~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g~L~ 281 (346)
T COG2706 244 NWAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGGKLE 281 (346)
T ss_pred CceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCCEEE
Confidence 222221 5667888887777788899999988754
No 173
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=92.52 E-value=7.4 Score=36.21 Aligned_cols=162 Identities=10% Similarity=0.005 Sum_probs=90.6
Q ss_pred CCceEEEe-eeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCC
Q 040693 191 EAPMMLSM-YRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKP 269 (382)
Q Consensus 191 ~~p~~~~~-~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 269 (382)
..|+|-.- ..+ +..|+.+.-|+.+..+|..+|++.---.-..+ .-...|....--..|..+
T Consensus 72 ~~PvL~v~Wsdd---gskVf~g~~Dk~~k~wDL~S~Q~~~v~~Hd~p--vkt~~wv~~~~~~cl~TG------------- 133 (347)
T KOG0647|consen 72 DGPVLDVCWSDD---GSKVFSGGCDKQAKLWDLASGQVSQVAAHDAP--VKTCHWVPGMNYQCLVTG------------- 133 (347)
T ss_pred CCCeEEEEEccC---CceEEeeccCCceEEEEccCCCeeeeeecccc--eeEEEEecCCCcceeEec-------------
Confidence 35666432 333 37899999999999999999965433221110 000111111011122332
Q ss_pred CCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC---ceecc
Q 040693 270 SKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA---TIYGG 346 (382)
Q Consensus 270 ~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~---~~~~~ 346 (382)
.-+.+|.-.|.+.-+++-+..+|...+ ...+...++++++. +..|.++++.++-...+...+. .+..-
T Consensus 134 ----SWDKTlKfWD~R~~~pv~t~~LPeRvY---a~Dv~~pm~vVata--~r~i~vynL~n~~te~k~~~SpLk~Q~R~v 204 (347)
T KOG0647|consen 134 ----SWDKTLKFWDTRSSNPVATLQLPERVY---AADVLYPMAVVATA--ERHIAVYNLENPPTEFKRIESPLKWQTRCV 204 (347)
T ss_pred ----ccccceeecccCCCCeeeeeeccceee---ehhccCceeEEEec--CCcEEEEEcCCCcchhhhhcCcccceeeEE
Confidence 234778999999999999999987422 22234567777774 7889999998776555443222 22223
Q ss_pred eEEeCCEE-EEEeCceeEeecCCccCCCCCeEEEE
Q 040693 347 ASVSNGCI-YMGNGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 347 p~~~~g~l-yv~~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
.+.-+... .+++-.|+ +.+..+|..+.+.-++|
T Consensus 205 a~f~d~~~~alGsiEGr-v~iq~id~~~~~~nFtF 238 (347)
T KOG0647|consen 205 ACFQDKDGFALGSIEGR-VAIQYIDDPNPKDNFTF 238 (347)
T ss_pred EEEecCCceEeeeecce-EEEEecCCCCccCceeE
Confidence 33333333 34555555 44555666444555554
No 174
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=92.41 E-value=3 Score=42.03 Aligned_cols=133 Identities=16% Similarity=0.099 Sum_probs=74.1
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccC--CCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWG--SSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTS 131 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~--~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (382)
+|.|-++|+.+-+.+-+.+........++ ....- ++..+..++-.+-|++..
T Consensus 196 ~g~VEfwDpR~ksrv~~l~~~~~v~s~pg----~~~~~svTal~F~d~gL~~aVGts~---------------------- 249 (703)
T KOG2321|consen 196 DGVVEFWDPRDKSRVGTLDAASSVNSHPG----GDAAPSVTALKFRDDGLHVAVGTST---------------------- 249 (703)
T ss_pred CceEEEecchhhhhheeeecccccCCCcc----ccccCcceEEEecCCceeEEeeccC----------------------
Confidence 89999999999998888887544332221 11111 122333333345554443
Q ss_pred CCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEE-EeeeCceeecEEEE
Q 040693 132 PDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMML-SMYRNKVKHDIVVA 210 (382)
Q Consensus 132 ~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~-~~~~~g~~~~~v~~ 210 (382)
|.++.+|+.+-+++-...-.. ..|+.. .....+ ..+.|+
T Consensus 250 -----------G~v~iyDLRa~~pl~~kdh~~---------------------------e~pi~~l~~~~~~-~q~~v~- 289 (703)
T KOG2321|consen 250 -----------GSVLIYDLRASKPLLVKDHGY---------------------------ELPIKKLDWQDTD-QQNKVV- 289 (703)
T ss_pred -----------CcEEEEEcccCCceeecccCC---------------------------ccceeeecccccC-CCceEE-
Confidence 889999999888877653221 122221 111111 022333
Q ss_pred EccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEE
Q 040693 211 VQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTN 256 (382)
Q Consensus 211 ~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~ 256 (382)
......+..+|..||+..-..+... .-.+.....+.+++|++
T Consensus 290 S~Dk~~~kiWd~~~Gk~~asiEpt~----~lND~C~~p~sGm~f~A 331 (703)
T KOG2321|consen 290 SMDKRILKIWDECTGKPMASIEPTS----DLNDFCFVPGSGMFFTA 331 (703)
T ss_pred ecchHHhhhcccccCCceeeccccC----CcCceeeecCCceEEEe
Confidence 3334457777888888877776532 22444554577888876
No 175
>PRK04043 tolB translocation protein TolB; Provisional
Probab=92.28 E-value=13 Score=36.89 Aligned_cols=117 Identities=11% Similarity=0.032 Sum_probs=56.2
Q ss_pred cEEEEEccC--cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKS--GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~--g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.+++..... ..||.+|.++|+..--...+. . ...|.+ +++.+.......+.. + ......|+.+|
T Consensus 290 ~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~--~--~~~~SP--DG~~Ia~~~~~~~~~-~-------~~~~~~I~v~d 355 (419)
T PRK04043 290 RIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGK--N--NSSVST--YKNYIVYSSRETNNE-F-------GKNTFNLYLIS 355 (419)
T ss_pred EEEEEECCCCCceEEEEECCCCCeEeCccCCC--c--CceECC--CCCEEEEEEcCCCcc-c-------CCCCcEEEEEE
Confidence 344444332 379999999887631111111 0 112222 666555443221100 0 00124699999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeee-cCCCcEEEEeCCCCcEeEEEecC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGST-YRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~-~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
+.+|+..=-+.. . ....|... ++..|+..+. .....|+.++++ |+..-+++..
T Consensus 356 ~~~g~~~~LT~~-~--~~~~p~~SPDG~~I~f~~~~~~~~~L~~~~l~-g~~~~~l~~~ 410 (419)
T PRK04043 356 TNSDYIRRLTAN-G--VNQFPRFSSDGGSIMFIKYLGNQSALGIIRLN-YNKSFLFPLK 410 (419)
T ss_pred CCCCCeEECCCC-C--CcCCeEECCCCCEEEEEEccCCcEEEEEEecC-CCeeEEeecC
Confidence 999975322222 1 12223323 3444544443 123358888875 6666666553
No 176
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content, of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from high genetic variability. This polymorphism may be useful for genotyping individual bees.; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=92.12 E-value=10 Score=35.47 Aligned_cols=185 Identities=14% Similarity=0.153 Sum_probs=87.6
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCce----eecEEEEEcc-CcEE
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKV----KHDIVVAVQK-SGFA 217 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~----~~~~v~~~~~-~g~l 217 (382)
-.|+++|++|++++-++.+++... .....+.++..+.. .+..+|..+. .+.|
T Consensus 34 pKLv~~Dl~t~~li~~~~~p~~~~-----------------------~~~s~lndl~VD~~~~~~~~~~aYItD~~~~gl 90 (287)
T PF03022_consen 34 PKLVAFDLKTNQLIRRYPFPPDIA-----------------------PPDSFLNDLVVDVRDGNCDDGFAYITDSGGPGL 90 (287)
T ss_dssp -EEEEEETTTTCEEEEEE--CCCS------------------------TCGGEEEEEEECTTTTS-SEEEEEEETTTCEE
T ss_pred cEEEEEECCCCcEEEEEECChHHc-----------------------ccccccceEEEEccCCCCcceEEEEeCCCcCcE
Confidence 689999999999999999987621 12223333332221 1356776654 3579
Q ss_pred EEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 218 WALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 218 ~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
..+|..+|+. |+...... ...+.......++..+--... ..+..+.|. ..+++..-|.+-+|..+++++...
T Consensus 91 IV~dl~~~~s-~Rv~~~~~--~~~p~~~~~~i~g~~~~~~dg--~~gial~~~---~~d~r~LYf~~lss~~ly~v~T~~ 162 (287)
T PF03022_consen 91 IVYDLATGKS-WRVLHNSF--SPDPDAGPFTIGGESFQWPDG--IFGIALSPI---SPDGRWLYFHPLSSRKLYRVPTSV 162 (287)
T ss_dssp EEEETTTTEE-EEEETCGC--TTS-SSEEEEETTEEEEETTS--EEEEEE-TT---STTS-EEEEEETT-SEEEEEEHHH
T ss_pred EEEEccCCcE-EEEecCCc--ceeccccceeccCceEecCCC--ccccccCCC---CCCccEEEEEeCCCCcEEEEEHHH
Confidence 9999998865 77765421 111111222223333321100 011111111 112233334444444444432110
Q ss_pred ---------------------CCCCCcceEEe-CCEEEEeeecCCCcEEEEeCCC------CcEeEEEecCCceecceEE
Q 040693 298 ---------------------NGTAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKT------GKILWSYDTGATIYGGASV 349 (382)
Q Consensus 298 ---------------------~~~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~t------G~ilw~~~~~~~~~~~p~~ 349 (382)
.......+.++ +|.+|++.. ....|.+.|+++ -+++-+-+..-.+-.+..+
T Consensus 163 L~~~~~~~~~~~~~~v~~lG~k~~~s~g~~~D~~G~ly~~~~-~~~aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd~~~i 241 (287)
T PF03022_consen 163 LRDPSLSDAQALASQVQDLGDKGSQSDGMAIDPNGNLYFTDV-EQNAIGCWDPDGPYTPENFEILAQDPRTLQWPDGLKI 241 (287)
T ss_dssp HCSTT--HHH-HHHT-EEEEE---SECEEEEETTTEEEEEEC-CCTEEEEEETTTSB-GCCEEEEEE-CC-GSSEEEEEE
T ss_pred hhCccccccccccccceeccccCCCCceEEECCCCcEEEecC-CCCeEEEEeCCCCcCccchheeEEcCceeeccceeee
Confidence 00112233344 588888875 578899999874 1233332221123355666
Q ss_pred eC---CEEEEEeC
Q 040693 350 SN---GCIYMGNG 359 (382)
Q Consensus 350 ~~---g~lyv~~~ 359 (382)
.+ +.||+.+.
T Consensus 242 ~~~~~g~L~v~sn 254 (287)
T PF03022_consen 242 DPEGDGYLWVLSN 254 (287)
T ss_dssp -T--TS-EEEEE-
T ss_pred ccccCceEEEEEC
Confidence 55 99999875
No 177
>PRK02888 nitrous-oxide reductase; Validated
Probab=92.06 E-value=8.2 Score=39.89 Aligned_cols=178 Identities=13% Similarity=0.119 Sum_probs=99.8
Q ss_pred ccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCC
Q 040693 51 CTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPT 130 (382)
Q Consensus 51 ~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~ 130 (382)
..+.+.+.++|.+|-+++|+..+..... ...++.++.++|+...|.
T Consensus 211 ~ey~~~vSvID~etmeV~~qV~Vdgnpd--------------~v~~spdGk~afvTsyNs-------------------- 256 (635)
T PRK02888 211 KKYRSLFTAVDAETMEVAWQVMVDGNLD--------------NVDTDYDGKYAFSTCYNS-------------------- 256 (635)
T ss_pred cceeEEEEEEECccceEEEEEEeCCCcc--------------cceECCCCCEEEEeccCc--------------------
Confidence 3468999999999999999999853221 235666778888876431
Q ss_pred CCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEE
Q 040693 131 SPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVA 210 (382)
Q Consensus 131 ~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~ 210 (382)
+ ....+..++..+-..+..+...... ....+ +...++
T Consensus 257 ------E---~G~tl~em~a~e~d~~vvfni~~ie-------------------------------a~vkd---GK~~~V 293 (635)
T PRK02888 257 ------E---EGVTLAEMMAAERDWVVVFNIARIE-------------------------------EAVKA---GKFKTI 293 (635)
T ss_pred ------c---cCcceeeeccccCceEEEEchHHHH-------------------------------HhhhC---CCEEEE
Confidence 0 0134555554333322222221100 00011 223333
Q ss_pred EccCcEEEEEeCCC----C-CeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 211 VQKSGFAWALDRDS----G-SLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 211 ~~~~g~l~ald~~t----G-~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.++.+..+|..+ + +++-.++.+. ..++..+ |+..+|++. ..+..+..||
T Consensus 294 --~gn~V~VID~~t~~~~~~~v~~yIPVGK------sPHGV~vSPDGkylyVan----------------klS~tVSVID 349 (635)
T PRK02888 294 --GGSKVPVVDGRKAANAGSALTRYVPVPK------NPHGVNTSPDGKYFIANG----------------KLSPTVTVID 349 (635)
T ss_pred --CCCEEEEEECCccccCCcceEEEEECCC------CccceEECCCCCEEEEeC----------------CCCCcEEEEE
Confidence 245688999998 4 4555555443 3444433 788888863 3467899999
Q ss_pred CCCCcEEeeecCCC-------CCCCCcce--EEeC-CEEEEeeecCCCcEEEEeCCC
Q 040693 284 ASNGNVLWSTADPS-------NGTAPGPV--TVAN-GVLFGGSTYRQGPIYAMDVKT 330 (382)
Q Consensus 284 ~~tG~~~W~~~~~~-------~~~~~~~~--~~~~-~~v~~~~~~~~g~l~~ld~~t 330 (382)
.++.+.+..-++.. .....+|+ .+++ +.+|..-. -+.+|.-.|.++
T Consensus 350 v~k~k~~~~~~~~~~~~vvaevevGlGPLHTaFDg~G~aytslf-~dsqv~kwn~~~ 405 (635)
T PRK02888 350 VRKLDDLFDGKIKPRDAVVAEPELGLGPLHTAFDGRGNAYTTLF-LDSQIVKWNIEA 405 (635)
T ss_pred ChhhhhhhhccCCccceEEEeeccCCCcceEEECCCCCEEEeEe-ecceeEEEehHH
Confidence 99877543222111 01122333 2333 46777654 467777777664
No 178
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=91.99 E-value=0.99 Score=42.55 Aligned_cols=91 Identities=9% Similarity=0.010 Sum_probs=56.4
Q ss_pred CCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCE-EEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCE
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGV-LFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGC 353 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~-v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~ 353 (382)
.+.++...+..|++-+-+...... +.+.+-++++ |+.++ .+..|..+|+..|+.+--.+-..... -.+..+++
T Consensus 338 gDRTikvW~~st~efvRtl~gHkR---GIAClQYr~rlvVSGS--SDntIRlwdi~~G~cLRvLeGHEeLv-RciRFd~k 411 (499)
T KOG0281|consen 338 GDRTIKVWSTSTCEFVRTLNGHKR---GIACLQYRDRLVVSGS--SDNTIRLWDIECGACLRVLEGHEELV-RCIRFDNK 411 (499)
T ss_pred CCceEEEEeccceeeehhhhcccc---cceehhccCeEEEecC--CCceEEEEeccccHHHHHHhchHHhh-hheeecCc
Confidence 347799999999998877776542 2333345555 44445 48899999999988765443222211 22335555
Q ss_pred EEEEeCceeEeecCCccC
Q 040693 354 IYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 354 lyv~~~~g~~~~~~~~~~ 371 (382)
-.|+.+|...++++.|.+
T Consensus 412 rIVSGaYDGkikvWdl~a 429 (499)
T KOG0281|consen 412 RIVSGAYDGKIKVWDLQA 429 (499)
T ss_pred eeeeccccceEEEEeccc
Confidence 556666655566666654
No 179
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=91.92 E-value=11 Score=35.13 Aligned_cols=43 Identities=16% Similarity=0.190 Sum_probs=29.2
Q ss_pred CCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeCceeE
Q 040693 319 RQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNGYKVT 363 (382)
Q Consensus 319 ~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~~ 363 (382)
+.+....+|++||+++-...+.. .+..+...+-+.|++..|.+
T Consensus 300 ~GN~~vi~da~tG~vv~~a~l~d--aaGva~~~~gf~vssg~G~~ 342 (366)
T COG3490 300 RGNRAVIWDAATGAVVSEAALPD--AAGVAAAKGGFAVSSGQGRI 342 (366)
T ss_pred CCCeEEEEEcCCCcEEecccccc--cccceeccCceEEecCCceE
Confidence 34456677888888877766553 24556677777778777774
No 180
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=91.31 E-value=3.2 Score=38.52 Aligned_cols=131 Identities=18% Similarity=0.153 Sum_probs=79.6
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCccccee-eeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAA-TDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.+.++++..||.|..+|...-+++=+++...+ ...-+ .++..++++ ..++.|..+|
T Consensus 25 ~~~LLvssWDgslrlYdv~~~~l~~~~~~~~p------lL~c~F~d~~~~~~G-----------------~~dg~vr~~D 81 (323)
T KOG1036|consen 25 SSDLLVSSWDGSLRLYDVPANSLKLKFKHGAP------LLDCAFADESTIVTG-----------------GLDGQVRRYD 81 (323)
T ss_pred CCcEEEEeccCcEEEEeccchhhhhheecCCc------eeeeeccCCceEEEe-----------------ccCceEEEEE
Confidence 56778888999999999887666555554321 11111 155678887 5568999999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeCcee
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNGYKV 362 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~ 362 (382)
+.+|+..---.-..+. ..-.-....+.++.++. ++.|-++|+.+-...-.++.+. -..+.-+.+++|.|++.+-.
T Consensus 82 ln~~~~~~igth~~~i-~ci~~~~~~~~vIsgsW--D~~ik~wD~R~~~~~~~~d~~k-kVy~~~v~g~~LvVg~~~r~ 156 (323)
T KOG1036|consen 82 LNTGNEDQIGTHDEGI-RCIEYSYEVGCVISGSW--DKTIKFWDPRNKVVVGTFDQGK-KVYCMDVSGNRLVVGTSDRK 156 (323)
T ss_pred ecCCcceeeccCCCce-EEEEeeccCCeEEEccc--CccEEEEeccccccccccccCc-eEEEEeccCCEEEEeecCce
Confidence 9999765433322210 00011122466777776 8889999987633344444433 33345567788888766533
No 181
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=91.30 E-value=6.6 Score=39.48 Aligned_cols=113 Identities=11% Similarity=0.078 Sum_probs=71.2
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccc-eeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWG-AATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~-~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
..++-+++.|.+..++...|++-|++..+... +..... ..-.-+.+|... -+..+.-++.
T Consensus 71 ~~lvlgt~~g~v~~ys~~~g~it~~~st~~h~--~~v~~~~~~~~~~ciyS~~-----------------ad~~v~~~~~ 131 (541)
T KOG4547|consen 71 SMLVLGTPQGSVLLYSVAGGEITAKLSTDKHY--GNVNEILDAQRLGCIYSVG-----------------ADLKVVYILE 131 (541)
T ss_pred eEEEeecCCccEEEEEecCCeEEEEEecCCCC--CcceeeecccccCceEecC-----------------CceeEEEEec
Confidence 45667889999999999999999999854311 111111 011345666652 3467888899
Q ss_pred CCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCc
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGAT 342 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~ 342 (382)
++++..-........ ...+.+ .++.+.+.. .+.|-.+|.+++|++-.++-.++
T Consensus 132 ~~~~~~~~~~~~~~~--~~sl~is~D~~~l~~a---s~~ik~~~~~~kevv~~ftgh~s 185 (541)
T KOG4547|consen 132 KEKVIIRIWKEQKPL--VSSLCISPDGKILLTA---SRQIKVLDIETKEVVITFTGHGS 185 (541)
T ss_pred ccceeeeeeccCCCc--cceEEEcCCCCEEEec---cceEEEEEccCceEEEEecCCCc
Confidence 988765554433321 222222 233333333 58999999999999999875544
No 182
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content, of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from high genetic variability. This polymorphism may be useful for genotyping individual bees.; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=90.97 E-value=9.7 Score=35.66 Aligned_cols=80 Identities=18% Similarity=0.214 Sum_probs=50.7
Q ss_pred CceEEEEECCCCcEEeeecCCCCCCCCc----ceEEeC-------CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee
Q 040693 276 AGGWVAMDASNGNVLWSTADPSNGTAPG----PVTVAN-------GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY 344 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~~~~~~~----~~~~~~-------~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~ 344 (382)
.-+|++||++|++++-++.++....... .+.++. +.+|++.. ..+.|.++|.++|+. ||+.......
T Consensus 33 ~pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD~-~~~glIV~dl~~~~s-~Rv~~~~~~~ 110 (287)
T PF03022_consen 33 PPKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITDS-GGPGLIVYDLATGKS-WRVLHNSFSP 110 (287)
T ss_dssp --EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEET-TTCEEEEEETTTTEE-EEEETCGCTT
T ss_pred CcEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeCC-CcCcEEEEEccCCcE-EEEecCCcce
Confidence 3579999999999999999886432211 234443 58999985 456899999998876 8886653211
Q ss_pred ---cceEEeCCEEEEE
Q 040693 345 ---GGASVSNGCIYMG 357 (382)
Q Consensus 345 ---~~p~~~~g~lyv~ 357 (382)
..+...++..|-.
T Consensus 111 ~p~~~~~~i~g~~~~~ 126 (287)
T PF03022_consen 111 DPDAGPFTIGGESFQW 126 (287)
T ss_dssp S-SSEEEEETTEEEEE
T ss_pred eccccceeccCceEec
Confidence 1234456665543
No 183
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=90.73 E-value=10 Score=39.41 Aligned_cols=99 Identities=14% Similarity=0.095 Sum_probs=66.6
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEEe-CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-eC
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SN 351 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~ 351 (382)
..+.++..+|..+|..+-.+.....+ ...+.+. .|+ |+++...++.|...|..+|+++-++.-..+...+..+ .+
T Consensus 554 SsD~tVRlWDv~~G~~VRiF~GH~~~--V~al~~Sp~Gr-~LaSg~ed~~I~iWDl~~~~~v~~l~~Ht~ti~SlsFS~d 630 (707)
T KOG0263|consen 554 SSDRTVRLWDVSTGNSVRIFTGHKGP--VTALAFSPCGR-YLASGDEDGLIKIWDLANGSLVKQLKGHTGTIYSLSFSRD 630 (707)
T ss_pred CCCceEEEEEcCCCcEEEEecCCCCc--eEEEEEcCCCc-eEeecccCCcEEEEEcCCCcchhhhhcccCceeEEEEecC
Confidence 44578999999999998888765532 2222332 343 3333336899999999999998777544444444443 67
Q ss_pred CEEEEEeCceeEeecCCccCCCCC
Q 040693 352 GCIYMGNGYKVTVGFGNKNFTSGT 375 (382)
Q Consensus 352 g~lyv~~~~g~~~~~~~~~~~~g~ 375 (382)
|.++++.+.+..+-+|.+...++.
T Consensus 631 g~vLasgg~DnsV~lWD~~~~~~~ 654 (707)
T KOG0263|consen 631 GNVLASGGADNSVRLWDLTKVIEL 654 (707)
T ss_pred CCEEEecCCCCeEEEEEchhhccc
Confidence 777777777777777776666555
No 184
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=90.69 E-value=17 Score=35.29 Aligned_cols=82 Identities=21% Similarity=0.262 Sum_probs=44.0
Q ss_pred ceEEEEECCCCcEEeeecC--CCCCCCCcceEEeCCEEEEeeec-----CCCcEEEEeCCCCcEeEEEecCC--ce----
Q 040693 277 GGWVAMDASNGNVLWSTAD--PSNGTAPGPVTVANGVLFGGSTY-----RQGPIYAMDVKTGKILWSYDTGA--TI---- 343 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~--~~~~~~~~~~~~~~~~v~~~~~~-----~~g~l~~ld~~tG~ilw~~~~~~--~~---- 343 (382)
..+.++|+++.+ |+.-. +........++..++.+|+.... +...++.++.+..+-.|+.-.+- .-
T Consensus 189 ~~v~~YD~~t~~--W~~~~~~p~~~~~~~a~v~~~~~iYv~GG~~~~~~~~~~~~~~~~~~~~~~W~~~~~~p~~~~~~~ 266 (376)
T PRK14131 189 KEVLSYDPSTNQ--WKNAGESPFLGTAGSAVVIKGNKLWLINGEIKPGLRTDAVKQGKFTGNNLKWQKLPDLPPAPGGSS 266 (376)
T ss_pred ceEEEEECCCCe--eeECCcCCCCCCCcceEEEECCEEEEEeeeECCCcCChhheEEEecCCCcceeecCCCCCCCcCCc
Confidence 569999987664 77632 32122233445557777765421 11234544443345568753221 10
Q ss_pred ----e-cceEEeCCEEEEEeCc
Q 040693 344 ----Y-GGASVSNGCIYMGNGY 360 (382)
Q Consensus 344 ----~-~~p~~~~g~lyv~~~~ 360 (382)
. ...++.+++|||..+.
T Consensus 267 ~~~~~~~~a~~~~~~iyv~GG~ 288 (376)
T PRK14131 267 QEGVAGAFAGYSNGVLLVAGGA 288 (376)
T ss_pred CCccceEeceeECCEEEEeecc
Confidence 1 1145689999998764
No 185
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=90.69 E-value=15 Score=34.55 Aligned_cols=141 Identities=13% Similarity=0.116 Sum_probs=78.7
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECC
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDAS 285 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~ 285 (382)
..++..+.|-.+..+|...|.++-++..+.+.. ...|.|. +.+..++...+ ..-+.++..
T Consensus 78 r~LltsS~D~si~lwDl~~gs~l~rirf~spv~--~~q~hp~-k~n~~va~~~~-----------------~sp~vi~~s 137 (405)
T KOG1273|consen 78 RKLLTSSRDWSIKLWDLLKGSPLKRIRFDSPVW--GAQWHPR-KRNKCVATIME-----------------ESPVVIDFS 137 (405)
T ss_pred CEeeeecCCceeEEEeccCCCceeEEEccCccc--eeeeccc-cCCeEEEEEec-----------------CCcEEEEec
Confidence 567778888889999999999999999876321 1233333 33444444222 123334443
Q ss_pred CCcEEee-ecCCCC-CCCCcceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC-ceecceEE-eCCEEEEEeC
Q 040693 286 NGNVLWS-TADPSN-GTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA-TIYGGASV-SNGCIYMGNG 359 (382)
Q Consensus 286 tG~~~W~-~~~~~~-~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~-~~~~~p~~-~~g~lyv~~~ 359 (382)
+++-.-- .+.+.. ....+-..+ .++++|.++. .|.+..+|+.|-+.+-.+.+.. ...-+..+ -.++.++.++
T Consensus 138 ~~~h~~Lp~d~d~dln~sas~~~fdr~g~yIitGts--KGkllv~~a~t~e~vas~rits~~~IK~I~~s~~g~~liiNt 215 (405)
T KOG1273|consen 138 DPKHSVLPKDDDGDLNSSASHGVFDRRGKYIITGTS--KGKLLVYDAETLECVASFRITSVQAIKQIIVSRKGRFLIINT 215 (405)
T ss_pred CCceeeccCCCccccccccccccccCCCCEEEEecC--cceEEEEecchheeeeeeeechheeeeEEEEeccCcEEEEec
Confidence 3221111 111110 111111122 3678999986 9999999999999888777654 33333333 4555555544
Q ss_pred ceeEeecCC
Q 040693 360 YKVTVGFGN 368 (382)
Q Consensus 360 ~g~~~~~~~ 368 (382)
..++.-.|.
T Consensus 216 sDRvIR~ye 224 (405)
T KOG1273|consen 216 SDRVIRTYE 224 (405)
T ss_pred CCceEEEEe
Confidence 444433343
No 186
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=90.66 E-value=1.5 Score=41.43 Aligned_cols=100 Identities=12% Similarity=0.119 Sum_probs=70.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
+..|+.+++|-.+...+..|++.+-....-. .+.+. ..+.++++. ..+.+|..+
T Consensus 330 ~kyIVsASgDRTikvW~~st~efvRtl~gHk--------RGIAClQYr~rlvVSG----------------SSDntIRlw 385 (499)
T KOG0281|consen 330 DKYIVSASGDRTIKVWSTSTCEFVRTLNGHK--------RGIACLQYRDRLVVSG----------------SSDNTIRLW 385 (499)
T ss_pred cceEEEecCCceEEEEeccceeeehhhhccc--------ccceehhccCeEEEec----------------CCCceEEEE
Confidence 4578888999999999999998776655322 22222 445555542 456889999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCc
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGK 332 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ 332 (382)
|+..|+.+--.+.... ....+-+++.+++.+.. +|.+-..|..++.
T Consensus 386 di~~G~cLRvLeGHEe--LvRciRFd~krIVSGaY--DGkikvWdl~aal 431 (499)
T KOG0281|consen 386 DIECGACLRVLEGHEE--LVRCIRFDNKRIVSGAY--DGKIKVWDLQAAL 431 (499)
T ss_pred eccccHHHHHHhchHH--hhhheeecCceeeeccc--cceEEEEeccccc
Confidence 9999988755554432 23456678889998885 9998888877553
No 187
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=90.36 E-value=14 Score=38.13 Aligned_cols=129 Identities=14% Similarity=0.183 Sum_probs=77.5
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
++.++.++.|-.+..+.. |+.+-.+.-- -....+.++ ++.. |++. ..+|.|.-.+
T Consensus 151 e~~~vTgsaDKtIklWk~--~~~l~tf~gH-----tD~VRgL~vl~~~~-flSc----------------sNDg~Ir~w~ 206 (745)
T KOG0301|consen 151 ENTYVTGSADKTIKLWKG--GTLLKTFSGH-----TDCVRGLAVLDDSH-FLSC----------------SNDGSIRLWD 206 (745)
T ss_pred CCcEEeccCcceeeeccC--Cchhhhhccc-----hhheeeeEEecCCC-eEee----------------cCCceEEEEe
Confidence 457777777877776665 4444443311 012233333 3322 3332 3457888888
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC-ceecceEEeCCEEEEEeCcee
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA-TIYGGASVSNGCIYMGNGYKV 362 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~-~~~~~p~~~~g~lyv~~~~g~ 362 (382)
. +|+.+-+......-...-..+..++.++.+. +++.+...+.. +..-...+|+ .+++.....+|.++++.++|.
T Consensus 207 ~-~ge~l~~~~ghtn~vYsis~~~~~~~Ivs~g--EDrtlriW~~~--e~~q~I~lPttsiWsa~~L~NgDIvvg~SDG~ 281 (745)
T KOG0301|consen 207 L-DGEVLLEMHGHTNFVYSISMALSDGLIVSTG--EDRTLRIWKKD--ECVQVITLPTTSIWSAKVLLNGDIVVGGSDGR 281 (745)
T ss_pred c-cCceeeeeeccceEEEEEEecCCCCeEEEec--CCceEEEeecC--ceEEEEecCccceEEEEEeeCCCEEEeccCce
Confidence 7 8888877765543211122223445555555 47877777754 6666667776 577777888999999999987
No 188
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=89.97 E-value=5.5 Score=38.37 Aligned_cols=136 Identities=10% Similarity=0.147 Sum_probs=74.1
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCccccee-eeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAA-TDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.++++..+.|..+...|..||+.+-+...+. ..+... ..++.+++++ ..+..|..+|
T Consensus 144 ~NVLlsag~Dn~v~iWnv~tgeali~l~hpd------~i~S~sfn~dGs~l~Tt----------------ckDKkvRv~d 201 (472)
T KOG0303|consen 144 PNVLLSAGSDNTVSIWNVGTGEALITLDHPD------MVYSMSFNRDGSLLCTT----------------CKDKKVRVID 201 (472)
T ss_pred hhhHhhccCCceEEEEeccCCceeeecCCCC------eEEEEEeccCCceeeee----------------cccceeEEEc
Confidence 4666677788899999999999777765321 111111 1233333332 4567799999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeec-CCC-cEEEEeCCC---CcEeEEEecCCceecceEE--eCCEEEE
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTY-RQG-PIYAMDVKT---GKILWSYDTGATIYGGASV--SNGCIYM 356 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~g-~l~~ld~~t---G~ilw~~~~~~~~~~~p~~--~~g~lyv 356 (382)
+++|+++++.....+......+...++.++..... ... ++...|+++ .-.+-.++.+.++.. |.+ ..+.||+
T Consensus 202 pr~~~~v~e~~~heG~k~~Raifl~~g~i~tTGfsr~seRq~aLwdp~nl~eP~~~~elDtSnGvl~-PFyD~dt~ivYl 280 (472)
T KOG0303|consen 202 PRRGTVVSEGVAHEGAKPARAIFLASGKIFTTGFSRMSERQIALWDPNNLEEPIALQELDTSNGVLL-PFYDPDTSIVYL 280 (472)
T ss_pred CCCCcEeeecccccCCCcceeEEeccCceeeeccccccccceeccCcccccCcceeEEeccCCceEE-eeecCCCCEEEE
Confidence 99999999985444333344555566665544321 122 233335543 222333444333332 223 3356776
Q ss_pred Ee-CceeE
Q 040693 357 GN-GYKVT 363 (382)
Q Consensus 357 ~~-~~g~~ 363 (382)
.. +++.|
T Consensus 281 ~GKGD~~I 288 (472)
T KOG0303|consen 281 CGKGDSSI 288 (472)
T ss_pred EecCCcce
Confidence 53 44443
No 189
>PRK01029 tolB translocation protein TolB; Provisional
Probab=89.96 E-value=22 Score=35.34 Aligned_cols=57 Identities=16% Similarity=0.120 Sum_probs=32.6
Q ss_pred ceEEEEECCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeee-cCCCcEEEEeCCCCcEeE
Q 040693 277 GGWVAMDASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGST-YRQGPIYAMDVKTGKILW 335 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~-~~~g~l~~ld~~tG~ilw 335 (382)
..|+.+|+.+|+..-...... ....+... ++..+++... .....|+.+|+++|+..-
T Consensus 351 ~~I~v~dl~~g~~~~Lt~~~~--~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~~~ 409 (428)
T PRK01029 351 RQICVYDLATGRDYQLTTSPE--NKESPSWAIDSLHLVYSAGNSNESELYLISLITKKTRK 409 (428)
T ss_pred cEEEEEECCCCCeEEccCCCC--CccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEE
Confidence 569999999998764443321 11223223 3344444332 124579999998877543
No 190
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=89.91 E-value=0.98 Score=44.14 Aligned_cols=174 Identities=17% Similarity=0.101 Sum_probs=106.1
Q ss_pred CceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCC----CCCCcccceeeeCCeEEEEecCcccccccc
Q 040693 192 APMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGG----LGGGAMWGAATDERRIYTNIANSQHKNFNL 267 (382)
Q Consensus 192 ~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~----~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~ 267 (382)
.|.-++...+| -.++.+...|.|.+||-.|+++..++.....- +.....+-.+...+.+|+-..++ ....++
T Consensus 131 GPY~~~ytrnG---rhlllgGrKGHlAa~Dw~t~~L~~Ei~v~Etv~Dv~~LHneq~~AVAQK~y~yvYD~~G-tElHCl 206 (545)
T KOG1272|consen 131 GPYHLDYTRNG---RHLLLGGRKGHLAAFDWVTKKLHFEINVMETVRDVTFLHNEQFFAVAQKKYVYVYDNNG-TELHCL 206 (545)
T ss_pred CCeeeeecCCc---cEEEecCCccceeeeecccceeeeeeehhhhhhhhhhhcchHHHHhhhhceEEEecCCC-cEEeeh
Confidence 56666666666 46777888899999999999999998865321 01111111122566777753331 111111
Q ss_pred CCC------------C---CCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCc
Q 040693 268 KPS------------K---NSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGK 332 (382)
Q Consensus 268 ~~~------------~---~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ 332 (382)
+.. . .....|.+.-.|.++|+++=.+....+....-..--.|..+.++-. +|.|....++.-+
T Consensus 207 k~~~~v~rLeFLPyHfLL~~~~~~G~L~Y~DVS~GklVa~~~t~~G~~~vm~qNP~NaVih~Ghs--nGtVSlWSP~ske 284 (545)
T KOG1272|consen 207 KRHIRVARLEFLPYHFLLVAASEAGFLKYQDVSTGKLVASIRTGAGRTDVMKQNPYNAVIHLGHS--NGTVSLWSPNSKE 284 (545)
T ss_pred hhcCchhhhcccchhheeeecccCCceEEEeechhhhhHHHHccCCccchhhcCCccceEEEcCC--CceEEecCCCCcc
Confidence 110 0 0245688999999999998777665432111110012345555553 8888888999888
Q ss_pred EeEEEecCCceecceEE-eCCEEEEEeCceeEeecCCccC
Q 040693 333 ILWSYDTGATIYGGASV-SNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 333 ilw~~~~~~~~~~~p~~-~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
+|=+.-...+..++.++ .+|+..++++-.+.+++|.+..
T Consensus 285 PLvKiLcH~g~V~siAv~~~G~YMaTtG~Dr~~kIWDlR~ 324 (545)
T KOG1272|consen 285 PLVKILCHRGPVSSIAVDRGGRYMATTGLDRKVKIWDLRN 324 (545)
T ss_pred hHHHHHhcCCCcceEEECCCCcEEeecccccceeEeeecc
Confidence 88877655556667777 4566666777777777766543
No 191
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=89.74 E-value=22 Score=35.11 Aligned_cols=68 Identities=18% Similarity=0.187 Sum_probs=41.8
Q ss_pred CCEEEEeeecCCCcEEEEeCCCCcEeEEEecC-CceecceEEeCCEEEEEeCceeEeecCCccCCCCCeEEEE
Q 040693 309 NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG-ATIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 309 ~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~-~~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
|-++..++. ++.|.+.|...|..+-.+.-. .++++-.-..+++...+.+.+..++++. .++|++.=+|
T Consensus 422 ~~~l~sas~--dstV~lwdv~~gv~i~~f~kH~~pVysvafS~~g~ylAsGs~dg~V~iws--~~~~~l~~s~ 490 (524)
T KOG0273|consen 422 NLMLASASF--DSTVKLWDVESGVPIHTLMKHQEPVYSVAFSPNGRYLASGSLDGCVHIWS--TKTGKLVKSY 490 (524)
T ss_pred CceEEEeec--CCeEEEEEccCCceeEeeccCCCceEEEEecCCCcEEEecCCCCeeEecc--ccchheeEee
Confidence 344444443 899999999999999988433 3444333335566666555555566543 4556655443
No 192
>PLN02193 nitrile-specifier protein
Probab=89.67 E-value=24 Score=35.47 Aligned_cols=129 Identities=12% Similarity=0.104 Sum_probs=70.2
Q ss_pred cEEEEEeCCCCCeeeeeccCCCC-CCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGG-LGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~-~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
..++++|..+ ..|+.-.+... .........+..++.||+-...... .....+.++|+++. .|+.
T Consensus 244 ndv~~yD~~t--~~W~~l~~~~~~P~~R~~h~~~~~~~~iYv~GG~~~~-----------~~~~~~~~yd~~t~--~W~~ 308 (470)
T PLN02193 244 NGFYSFDTTT--NEWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSAT-----------ARLKTLDSYNIVDK--KWFH 308 (470)
T ss_pred ccEEEEECCC--CEEEEcCcCCCCCCCccceEEEEECCEEEEECCCCCC-----------CCcceEEEEECCCC--EEEe
Confidence 3589999986 46876432110 1111222333467788876433110 12356899999875 5875
Q ss_pred cCC-C---CCCCCcceEEeCCEEEEeeec---CCCcEEEEeCCCCcEeEEEecC-----C-ceecceEEeCCEEEEEeCc
Q 040693 294 ADP-S---NGTAPGPVTVANGVLFGGSTY---RQGPIYAMDVKTGKILWSYDTG-----A-TIYGGASVSNGCIYMGNGY 360 (382)
Q Consensus 294 ~~~-~---~~~~~~~~~~~~~~v~~~~~~---~~g~l~~ld~~tG~ilw~~~~~-----~-~~~~~p~~~~g~lyv~~~~ 360 (382)
-.. . .......+.+.++.+|+.... ....++++|+++.+ |+.-.. . ....+.++.+++|||..+.
T Consensus 309 ~~~~~~~~~~R~~~~~~~~~gkiyviGG~~g~~~~dv~~yD~~t~~--W~~~~~~g~~P~~R~~~~~~~~~~~iyv~GG~ 386 (470)
T PLN02193 309 CSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDK--WTQVETFGVRPSERSVFASAAVGKHIVIFGGE 386 (470)
T ss_pred CCCCCCCCCCCCCcEEEEECCcEEEEECCCCCccCceEEEECCCCE--EEEeccCCCCCCCcceeEEEEECCEEEEECCc
Confidence 321 1 111222333445666654320 12569999998654 865321 1 1334456789999998774
No 193
>PRK01742 tolB translocation protein TolB; Provisional
Probab=89.65 E-value=23 Score=35.11 Aligned_cols=57 Identities=16% Similarity=0.112 Sum_probs=28.2
Q ss_pred ceEEEEECCCCcEEeeecCCCCCCCCcceEEe-CCEEEEeeecCCCc---EEEEeCCCCcEeEEEec
Q 040693 277 GGWVAMDASNGNVLWSTADPSNGTAPGPVTVA-NGVLFGGSTYRQGP---IYAMDVKTGKILWSYDT 339 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~g~---l~~ld~~tG~ilw~~~~ 339 (382)
..++.+|+.+|+..-..... . ...+.... +..++.++. ++. ++.++ .+|+.+-++..
T Consensus 353 ~~i~~~Dl~~g~~~~lt~~~--~-~~~~~~sPdG~~i~~~s~--~g~~~~l~~~~-~~G~~~~~l~~ 413 (429)
T PRK01742 353 DNVVKQDLTSGSTEVLSSTF--L-DESPSISPNGIMIIYSST--QGLGKVLQLVS-ADGRFKARLPG 413 (429)
T ss_pred CCEEEEECCCCCeEEecCCC--C-CCCceECCCCCEEEEEEc--CCCceEEEEEE-CCCCceEEccC
Confidence 34777999999765322211 1 12222222 345555553 443 33334 45777666653
No 194
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=89.35 E-value=4.5 Score=36.45 Aligned_cols=50 Identities=20% Similarity=0.329 Sum_probs=40.0
Q ss_pred CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeC---CEEEEEeC
Q 040693 309 NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSN---GCIYMGNG 359 (382)
Q Consensus 309 ~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~---g~lyv~~~ 359 (382)
.+.+|+++. ..+.++-+|+.|||+|-++.++..-.++....+ +.+|+++.
T Consensus 222 eG~L~Va~~-ng~~V~~~dp~tGK~L~eiklPt~qitsccFgGkn~d~~yvT~a 274 (310)
T KOG4499|consen 222 EGNLYVATF-NGGTVQKVDPTTGKILLEIKLPTPQITSCCFGGKNLDILYVTTA 274 (310)
T ss_pred CCcEEEEEe-cCcEEEEECCCCCcEEEEEEcCCCceEEEEecCCCccEEEEEeh
Confidence 578888876 578999999999999999999977666666654 56777654
No 195
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=88.88 E-value=27 Score=34.93 Aligned_cols=107 Identities=15% Similarity=0.119 Sum_probs=61.9
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEe
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
.+.++-+++++|...-+...-... ..-.+..+.-.. ++.++.++.+|++..++
T Consensus 221 k~H~~Fw~~~~~~l~k~~~~fek~-------------------------ekk~Vl~v~F~e--ngdviTgDS~G~i~Iw~ 273 (626)
T KOG2106|consen 221 KGHLYFWTLRGGSLVKRQGIFEKR-------------------------EKKFVLCVTFLE--NGDVITGDSGGNILIWS 273 (626)
T ss_pred CceEEEEEccCCceEEEeeccccc-------------------------cceEEEEEEEcC--CCCEEeecCCceEEEEe
Confidence 378888888877666555443221 112233332222 67788999999999999
Q ss_pred CCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 222 RDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
+.+-++.-+...- .|+...--...++.++.+ ..+..|.+.| .+=+.+-.+++|.
T Consensus 274 ~~~~~~~k~~~aH----~ggv~~L~~lr~GtllSG-----------------gKDRki~~Wd-~~y~k~r~~elPe 327 (626)
T KOG2106|consen 274 KGTNRISKQVHAH----DGGVFSLCMLRDGTLLSG-----------------GKDRKIILWD-DNYRKLRETELPE 327 (626)
T ss_pred CCCceEEeEeeec----CCceEEEEEecCccEeec-----------------CccceEEecc-ccccccccccCch
Confidence 8765555444421 122222222355555553 3346688888 5666666677765
No 196
>KOG4328 consensus WD40 protein [Function unknown]
Probab=88.73 E-value=12 Score=36.85 Aligned_cols=31 Identities=16% Similarity=0.275 Sum_probs=22.3
Q ss_pred eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 308 ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 308 ~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
+..+++++.. ...|=.||.+.|+.+-++..+
T Consensus 431 ~~~li~vg~~--~r~IDv~~~~~~q~v~el~~P 461 (498)
T KOG4328|consen 431 DYNLIVVGRY--PRPIDVFDGNGGQMVCELHDP 461 (498)
T ss_pred CccEEEEecc--CcceeEEcCCCCEEeeeccCc
Confidence 3467777774 677999999988877665444
No 197
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=88.67 E-value=18 Score=34.58 Aligned_cols=144 Identities=15% Similarity=0.168 Sum_probs=89.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee---eCCeEEEEecCccccccccCCCCCCCCCceEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT---DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA 281 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~---~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a 281 (382)
++.|+..+.|..+.+.+.+||-.+..+... +.|.-.+ .++.|+.+-. .+.+|..
T Consensus 205 gd~ilS~srD~tik~We~~tg~cv~t~~~h-------~ewvr~v~v~~DGti~As~s----------------~dqtl~v 261 (406)
T KOG0295|consen 205 GDHILSCSRDNTIKAWECDTGYCVKTFPGH-------SEWVRMVRVNQDGTIIASCS----------------NDQTLRV 261 (406)
T ss_pred CCeeeecccccceeEEecccceeEEeccCc-------hHhEEEEEecCCeeEEEecC----------------CCceEEE
Confidence 588999999999999999999888877743 3344222 5677776633 3466666
Q ss_pred EECCCC--------------cEEeeecCCCCCCCCcceEEeC-CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecc
Q 040693 282 MDASNG--------------NVLWSTADPSNGTAPGPVTVAN-GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGG 346 (382)
Q Consensus 282 ~d~~tG--------------~~~W~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~ 346 (382)
.-..|+ .+.|.-....+.........++ ..+..++ +++.+-..|..+|+.|...---......
T Consensus 262 W~~~t~~~k~~lR~hEh~vEci~wap~~~~~~i~~at~~~~~~~~l~s~S--rDktIk~wdv~tg~cL~tL~ghdnwVr~ 339 (406)
T KOG0295|consen 262 WVVATKQCKAELREHEHPVECIAWAPESSYPSISEATGSTNGGQVLGSGS--RDKTIKIWDVSTGMCLFTLVGHDNWVRG 339 (406)
T ss_pred EEeccchhhhhhhccccceEEEEecccccCcchhhccCCCCCccEEEeec--ccceEEEEeccCCeEEEEEecccceeee
Confidence 666666 2344433222111111111111 2444444 6999999999999999887655555566
Q ss_pred eEEe-CCEEEEEeCceeEeecCCccCCC
Q 040693 347 ASVS-NGCIYMGNGYKVTVGFGNKNFTS 373 (382)
Q Consensus 347 p~~~-~g~lyv~~~~g~~~~~~~~~~~~ 373 (382)
.++. +|+..++-.+...+.+|.+...+
T Consensus 340 ~af~p~Gkyi~ScaDDktlrvwdl~~~~ 367 (406)
T KOG0295|consen 340 VAFSPGGKYILSCADDKTLRVWDLKNLQ 367 (406)
T ss_pred eEEcCCCeEEEEEecCCcEEEEEeccce
Confidence 6664 56655555666666666665543
No 198
>PF05567 Neisseria_PilC: Neisseria PilC beta-propeller domain; InterPro: IPR008707 This domain is found in several PilC protein sequences from Neisseria gonorrhoeae and Neisseria meningitidis. PilC is a phase-variable protein associated with pilus-mediated adherence of pathogenic Neisseria to target cells [].; PDB: 3HX6_A.
Probab=88.44 E-value=3.6 Score=39.49 Aligned_cols=60 Identities=25% Similarity=0.256 Sum_probs=38.8
Q ss_pred cceEEEEECCC-CcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEE
Q 040693 142 SNSLLALDLDT-GKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWAL 220 (382)
Q Consensus 142 ~g~v~ald~~t-G~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~al 220 (382)
...||.+|++| |+.+|++....... .-+.|.+++...+|. -+.+|+++-.|+|+-+
T Consensus 180 ~~~lyi~d~~t~G~l~~~i~~~~~~~----------------------gl~~~~~~D~d~DG~-~D~vYaGDl~GnlwR~ 236 (335)
T PF05567_consen 180 GAALYILDADTTGALIKKIDVPGGSG----------------------GLSSPAVVDSDGDGY-VDRVYAGDLGGNLWRF 236 (335)
T ss_dssp -EEEEEEETTT---EEEEEEE--STT-----------------------EEEEEEE-TTSSSE-E-EEEEEETTSEEEEE
T ss_pred CcEEEEEECCCCCceEEEEecCCCCc----------------------cccccEEEeccCCCe-EEEEEEEcCCCcEEEE
Confidence 47899999999 99999987654310 114577776666664 5788999999999999
Q ss_pred eCCC
Q 040693 221 DRDS 224 (382)
Q Consensus 221 d~~t 224 (382)
|...
T Consensus 237 dl~~ 240 (335)
T PF05567_consen 237 DLSS 240 (335)
T ss_dssp E--T
T ss_pred ECCC
Confidence 9864
No 199
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=88.12 E-value=14 Score=35.80 Aligned_cols=81 Identities=11% Similarity=0.152 Sum_probs=50.9
Q ss_pred CCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE--eCC
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV--SNG 352 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g 352 (382)
.+..|...+..||+.+-+.+.+. ...+.-...+|-.++++- ++.+|..+|+.+|++++.-....+.-..-++ .++
T Consensus 152 ~Dn~v~iWnv~tgeali~l~hpd--~i~S~sfn~dGs~l~Ttc-kDKkvRv~dpr~~~~v~e~~~heG~k~~Raifl~~g 228 (472)
T KOG0303|consen 152 SDNTVSIWNVGTGEALITLDHPD--MVYSMSFNRDGSLLCTTC-KDKKVRVIDPRRGTVVSEGVAHEGAKPARAIFLASG 228 (472)
T ss_pred CCceEEEEeccCCceeeecCCCC--eEEEEEeccCCceeeeec-ccceeEEEcCCCCcEeeecccccCCCcceeEEeccC
Confidence 45779999999999988877443 122222223454555443 6899999999999999987433333333232 455
Q ss_pred EEEEEeC
Q 040693 353 CIYMGNG 359 (382)
Q Consensus 353 ~lyv~~~ 359 (382)
. +.+++
T Consensus 229 ~-i~tTG 234 (472)
T KOG0303|consen 229 K-IFTTG 234 (472)
T ss_pred c-eeeec
Confidence 5 44443
No 200
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=87.91 E-value=9 Score=35.94 Aligned_cols=150 Identities=11% Similarity=-0.014 Sum_probs=78.9
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
+..+-++-.+|.++.+|-.|-.+.--+.. .-.+.....+ ++..++.+ ..+..+..+
T Consensus 35 G~~lAvGc~nG~vvI~D~~T~~iar~lsa-----H~~pi~sl~WS~dgr~Llts-----------------S~D~si~lw 92 (405)
T KOG1273|consen 35 GDYLAVGCANGRVVIYDFDTFRIARMLSA-----HVRPITSLCWSRDGRKLLTS-----------------SRDWSIKLW 92 (405)
T ss_pred cceeeeeccCCcEEEEEccccchhhhhhc-----cccceeEEEecCCCCEeeee-----------------cCCceeEEE
Confidence 45666777888888888877543222111 0011111122 55555554 455779999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecc-----eEEeCCEEEE
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGG-----ASVSNGCIYM 356 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~-----p~~~~g~lyv 356 (382)
|+..|.++-++.++.+ .+.....- ..+.+++.-....-.+.-|+...-+.|=+-+ ++....+ .-..+..+|.
T Consensus 93 Dl~~gs~l~rirf~sp-v~~~q~hp~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~-d~dln~sas~~~fdr~g~yIit 170 (405)
T KOG1273|consen 93 DLLKGSPLKRIRFDSP-VWGAQWHPRKRNKCVATIMEESPVVIDFSDPKHSVLPKDD-DGDLNSSASHGVFDRRGKYIIT 170 (405)
T ss_pred eccCCCceeEEEccCc-cceeeeccccCCeEEEEEecCCcEEEEecCCceeeccCCC-ccccccccccccccCCCCEEEE
Confidence 9999999999988773 33333322 2344444433122333333321122222211 1111111 2236788999
Q ss_pred EeCceeEeecCCccCCCCCeEEEEE
Q 040693 357 GNGYKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 357 ~~~~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
+++.|.+. + +++.|-+.+-+|+
T Consensus 171 GtsKGkll-v--~~a~t~e~vas~r 192 (405)
T KOG1273|consen 171 GTSKGKLL-V--YDAETLECVASFR 192 (405)
T ss_pred ecCcceEE-E--Eecchheeeeeee
Confidence 99998842 3 3444456666665
No 201
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=87.82 E-value=8.2 Score=38.16 Aligned_cols=20 Identities=25% Similarity=0.260 Sum_probs=17.3
Q ss_pred cceEEEEeCccCceeeeeecc
Q 040693 54 QGSLAKLDAKTGRILWQTFML 74 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~ 74 (382)
+..|+||+. +|+++|++++.
T Consensus 261 er~Lf~l~~-~G~l~~~krLd 280 (418)
T PF14727_consen 261 ERSLFCLKD-NGSLRFQKRLD 280 (418)
T ss_pred cceEEEEcC-CCeEEEEEecC
Confidence 447999996 89999999984
No 202
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=87.55 E-value=22 Score=34.86 Aligned_cols=114 Identities=16% Similarity=0.219 Sum_probs=71.1
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
..+..+..+..|...|..+|+..=........+....- .+.. ...++.++.
T Consensus 257 nVLaSgsaD~TV~lWD~~~g~p~~s~~~~~k~Vq~l~w-----------------h~~~------------p~~LLsGs~ 307 (463)
T KOG0270|consen 257 NVLASGSADKTVKLWDVDTGKPKSSITHHGKKVQTLEW-----------------HPYE------------PSVLLSGSY 307 (463)
T ss_pred eeEEecCCCceEEEEEcCCCCcceehhhcCCceeEEEe-----------------cCCC------------ceEEEeccc
Confidence 33445566789999999999998877644332210000 1112 356778889
Q ss_pred CcEEEEEeCC---CCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCC-CcE
Q 040693 214 SGFAWALDRD---SGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASN-GNV 289 (382)
Q Consensus 214 ~g~l~ald~~---tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~t-G~~ 289 (382)
|+++...|.. .-..-|++...- -...|-+. .....+++ +.+|+|+-+|+++ |++
T Consensus 308 D~~V~l~D~R~~~~s~~~wk~~g~V----Ekv~w~~~-se~~f~~~-----------------tddG~v~~~D~R~~~~~ 365 (463)
T KOG0270|consen 308 DGTVALKDCRDPSNSGKEWKFDGEV----EKVAWDPH-SENSFFVS-----------------TDDGTVYYFDIRNPGKP 365 (463)
T ss_pred cceEEeeeccCccccCceEEeccce----EEEEecCC-CceeEEEe-----------------cCCceEEeeecCCCCCc
Confidence 9999988877 445667776321 11222221 23444554 4569999999985 599
Q ss_pred EeeecCCCC
Q 040693 290 LWSTADPSN 298 (382)
Q Consensus 290 ~W~~~~~~~ 298 (382)
+|+......
T Consensus 366 vwt~~AHd~ 374 (463)
T KOG0270|consen 366 VWTLKAHDD 374 (463)
T ss_pred eeEEEeccC
Confidence 999987654
No 203
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=87.53 E-value=19 Score=35.31 Aligned_cols=84 Identities=19% Similarity=0.311 Sum_probs=55.3
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCC
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNG 352 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g 352 (382)
+.++..---|.++|..+-...............+ .++++|.... .++.|-.+|.+++..+-+|+...+-..+....+|
T Consensus 322 s~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHpDgLifgtgt-~d~~vkiwdlks~~~~a~Fpght~~vk~i~FsEN 400 (506)
T KOG0289|consen 322 SNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHPDGLIFGTGT-PDGVVKIWDLKSQTNVAKFPGHTGPVKAISFSEN 400 (506)
T ss_pred cCCceEEEEEccCCcEEEEEeeccccceeEEeeEcCCceEEeccC-CCceEEEEEcCCccccccCCCCCCceeEEEeccC
Confidence 3456666778888988877766432233333333 4678777643 6899999999999988888765444455555555
Q ss_pred EEEEEe
Q 040693 353 CIYMGN 358 (382)
Q Consensus 353 ~lyv~~ 358 (382)
.-|+.+
T Consensus 401 GY~Lat 406 (506)
T KOG0289|consen 401 GYWLAT 406 (506)
T ss_pred ceEEEE
Confidence 555543
No 204
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=87.22 E-value=26 Score=32.91 Aligned_cols=141 Identities=8% Similarity=0.048 Sum_probs=80.4
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+.+-+....|+.+..+|..+|+.....++...... ..|.+ .+...+++. ...|-.+-.
T Consensus 139 ~KLALsVg~D~~lr~WNLV~Gr~a~v~~L~~~at~--v~w~~--~Gd~F~v~~------------------~~~i~i~q~ 196 (362)
T KOG0294|consen 139 GKLALSVGGDQVLRTWNLVRGRVAFVLNLKNKATL--VSWSP--QGDHFVVSG------------------RNKIDIYQL 196 (362)
T ss_pred CceEEEEcCCceeeeehhhcCccceeeccCCccee--eEEcC--CCCEEEEEe------------------ccEEEEEec
Confidence 45566777799999999999998888876543111 22221 344444431 133444454
Q ss_pred CCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeC---CEEEE-EeCc
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSN---GCIYM-GNGY 360 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~---g~lyv-~~~~ 360 (382)
.+-++.-+...+.. ... ....+++.++++. +++.+.++|-+++.++-.+.....-.-...... +.+.+ .+++
T Consensus 197 d~A~v~~~i~~~~r-~l~-~~~l~~~~L~vG~--d~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~~~~~~~~lvTaSSD 272 (362)
T KOG0294|consen 197 DNASVFREIENPKR-ILC-ATFLDGSELLVGG--DNEWISLKDTDSDTPLTEFLAHENRVKDIASYTNPEHEYLVTASSD 272 (362)
T ss_pred ccHhHhhhhhcccc-cee-eeecCCceEEEec--CCceEEEeccCCCccceeeecchhheeeeEEEecCCceEEEEeccC
Confidence 45554444444431 112 2223567777776 388899999988888877754433222333222 34444 4556
Q ss_pred eeEeecCCccCC
Q 040693 361 KVTVGFGNKNFT 372 (382)
Q Consensus 361 g~~~~~~~~~~~ 372 (382)
|. ..+|.++..
T Consensus 273 G~-I~vWd~~~~ 283 (362)
T KOG0294|consen 273 GF-IKVWDIDME 283 (362)
T ss_pred ce-EEEEEcccc
Confidence 55 456777665
No 205
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=86.65 E-value=28 Score=32.57 Aligned_cols=104 Identities=11% Similarity=0.019 Sum_probs=64.3
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCccccee--eeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAA--TDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
...+++++-||.+..+|..+|+..---.-.. ...... ...+.++.+. -+++|-.+
T Consensus 65 ~~~~~~G~~dg~vr~~Dln~~~~~~igth~~------~i~ci~~~~~~~~vIsgs-----------------WD~~ik~w 121 (323)
T KOG1036|consen 65 ESTIVTGGLDGQVRRYDLNTGNEDQIGTHDE------GIRCIEYSYEVGCVISGS-----------------WDKTIKFW 121 (323)
T ss_pred CceEEEeccCceEEEEEecCCcceeeccCCC------ceEEEEeeccCCeEEEcc-----------------cCccEEEE
Confidence 4688999999999999999986543322211 111111 1345565553 34779999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEE
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWS 336 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~ 336 (382)
|+++-...-.++.+.. ...+.+.++++++++. +-.+..+|+.+=++-.+
T Consensus 122 D~R~~~~~~~~d~~kk---Vy~~~v~g~~LvVg~~--~r~v~iyDLRn~~~~~q 170 (323)
T KOG1036|consen 122 DPRNKVVVGTFDQGKK---VYCMDVSGNRLVVGTS--DRKVLIYDLRNLDEPFQ 170 (323)
T ss_pred eccccccccccccCce---EEEEeccCCEEEEeec--CceEEEEEcccccchhh
Confidence 9987444444444331 2233455777778774 78899999886555443
No 206
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=86.65 E-value=26 Score=32.17 Aligned_cols=187 Identities=16% Similarity=0.112 Sum_probs=98.4
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
+-.+..+|..+||..-+.....++ . .....++++++.++..+
T Consensus 86 dk~ir~wd~r~~k~~~~i~~~~en-----------i---~i~wsp~g~~~~~~~kd------------------------ 127 (313)
T KOG1407|consen 86 DKTIRIWDIRSGKCTARIETKGEN-----------I---NITWSPDGEYIAVGNKD------------------------ 127 (313)
T ss_pred CceEEEEEeccCcEEEEeeccCcc-----------e---EEEEcCCCCEEEEecCc------------------------
Confidence 456777888888887777663222 1 23455666777775543
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
..|.-||.++-+++=+.+..-... .+... -+++..+..++
T Consensus 128 ---------D~it~id~r~~~~~~~~~~~~e~n--------------------------e~~w~-----~~nd~Fflt~G 167 (313)
T KOG1407|consen 128 ---------DRITFIDARTYKIVNEEQFKFEVN--------------------------EISWN-----NSNDLFFLTNG 167 (313)
T ss_pred ---------ccEEEEEecccceeehhcccceee--------------------------eeeec-----CCCCEEEEecC
Confidence 789999998888877666543210 00000 11456666666
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEe
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLW 291 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W 291 (382)
.|.+-.|..-.-|++-.++.-+ ........ ++..+-++.+ +..+-..|+.-- +-
T Consensus 168 lG~v~ILsypsLkpv~si~AH~-----snCicI~f~p~GryfA~GsA-----------------DAlvSLWD~~EL--iC 223 (313)
T KOG1407|consen 168 LGCVEILSYPSLKPVQSIKAHP-----SNCICIEFDPDGRYFATGSA-----------------DALVSLWDVDEL--IC 223 (313)
T ss_pred CceEEEEeccccccccccccCC-----cceEEEEECCCCceEeeccc-----------------cceeeccChhHh--hh
Confidence 7888888877666666665432 12222222 2232222211 122223333211 11
Q ss_pred eecCCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee
Q 040693 292 STADPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY 344 (382)
Q Consensus 292 ~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~ 344 (382)
..-+..-..-...+.+ ++.++-.++ ++..|=.-+.+||..+|+.+..++.+
T Consensus 224 ~R~isRldwpVRTlSFS~dg~~lASaS--EDh~IDIA~vetGd~~~eI~~~~~t~ 276 (313)
T KOG1407|consen 224 ERCISRLDWPVRTLSFSHDGRMLASAS--EDHFIDIAEVETGDRVWEIPCEGPTF 276 (313)
T ss_pred heeeccccCceEEEEeccCcceeeccC--ccceEEeEecccCCeEEEeeccCCce
Confidence 1111110001122223 334444444 46666666788999999999887644
No 207
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=86.64 E-value=39 Score=34.22 Aligned_cols=114 Identities=18% Similarity=0.212 Sum_probs=61.7
Q ss_pred CcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccC------
Q 040693 141 HSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKS------ 214 (382)
Q Consensus 141 ~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~------ 214 (382)
....++++|.+|=+ |+.-..... ...|+ ..+.+.. ..+..+++++.+
T Consensus 188 ~~ndl~i~d~~~~~--W~~~~~~g~--------~P~pR------------~gH~~~~-----~~~~~~v~gG~~~~~~~l 240 (482)
T KOG0379|consen 188 SLNDLHIYDLETST--WSELDTQGE--------APSPR------------YGHAMVV-----VGNKLLVFGGGDDGDVYL 240 (482)
T ss_pred ceeeeeeecccccc--ceecccCCC--------CCCCC------------CCceEEE-----ECCeEEEEeccccCCcee
Confidence 35789999998877 987554331 11222 1222221 113444444443
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccc-eeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWG-AATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~-~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
+.+++||..+ ..|+.............+. ..+.+..+++........ ....+.++.+|.. +..|..
T Consensus 241 ~D~~~ldl~~--~~W~~~~~~g~~p~~R~~h~~~~~~~~~~l~gG~~~~~---------~~~l~~~~~l~~~--~~~w~~ 307 (482)
T KOG0379|consen 241 NDVHILDLST--WEWKLLPTGGDLPSPRSGHSLTVSGDHLLLFGGGTDPK---------QEPLGDLYGLDLE--TLVWSK 307 (482)
T ss_pred cceEeeeccc--ceeeeccccCCCCCCcceeeeEEECCEEEEEcCCcccc---------ccccccccccccc--ccceee
Confidence 4699999998 8888444333233333333 334666666654443210 0134668888888 667765
Q ss_pred c
Q 040693 294 A 294 (382)
Q Consensus 294 ~ 294 (382)
-
T Consensus 308 ~ 308 (482)
T KOG0379|consen 308 V 308 (482)
T ss_pred e
Confidence 4
No 208
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=86.64 E-value=34 Score=33.53 Aligned_cols=110 Identities=11% Similarity=0.084 Sum_probs=63.9
Q ss_pred eCCeEEEEecCccccccccCCCCCCCCC-ceEEEEECCCC---cEEeeecCCCCCCCCcceEEeCCEEEEeeec--CCCc
Q 040693 249 DERRIYTNIANSQHKNFNLKPSKNSTIA-GGWVAMDASNG---NVLWSTADPSNGTAPGPVTVANGVLFGGSTY--RQGP 322 (382)
Q Consensus 249 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~-g~v~a~d~~tG---~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~--~~g~ 322 (382)
++..+++.... ... ..++.+|..++ ...|+.-.+........+...++.+|+.+.. ..++
T Consensus 237 d~~~l~i~~~~--------------~~~~s~v~~~d~~~~~~~~~~~~~l~~~~~~~~~~v~~~~~~~yi~Tn~~a~~~~ 302 (414)
T PF02897_consen 237 DGRYLFISSSS--------------GTSESEVYLLDLDDGGSPDAKPKLLSPREDGVEYYVDHHGDRLYILTNDDAPNGR 302 (414)
T ss_dssp TSSEEEEEEES--------------SSSEEEEEEEECCCTTTSS-SEEEEEESSSS-EEEEEEETTEEEEEE-TT-TT-E
T ss_pred cccEEEEEEEc--------------cccCCeEEEEeccccCCCcCCcEEEeCCCCceEEEEEccCCEEEEeeCCCCCCcE
Confidence 56667766544 223 77999999986 5555543322111233344457888887642 3578
Q ss_pred EEEEeCCCCcE-eEEEec-CC---ceecceEEeCCEEEEEeCceeEeecCCccCC
Q 040693 323 IYAMDVKTGKI-LWSYDT-GA---TIYGGASVSNGCIYMGNGYKVTVGFGNKNFT 372 (382)
Q Consensus 323 l~~ld~~tG~i-lw~~~~-~~---~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~ 372 (382)
|+.++.++... -|+..+ +. .......+.+++|++....+....+..++..
T Consensus 303 l~~~~l~~~~~~~~~~~l~~~~~~~~l~~~~~~~~~Lvl~~~~~~~~~l~v~~~~ 357 (414)
T PF02897_consen 303 LVAVDLADPSPAEWWTVLIPEDEDVSLEDVSLFKDYLVLSYRENGSSRLRVYDLD 357 (414)
T ss_dssp EEEEETTSTSGGGEEEEEE--SSSEEEEEEEEETTEEEEEEEETTEEEEEEEETT
T ss_pred EEEecccccccccceeEEcCCCCceeEEEEEEECCEEEEEEEECCccEEEEEECC
Confidence 99999988764 455322 21 2456666788999988765554444444444
No 209
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=86.62 E-value=37 Score=34.00 Aligned_cols=197 Identities=14% Similarity=0.119 Sum_probs=104.0
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
++.++-+++++|.+.=+..+.+.... .+..+..+.+ ++.|+.+..
T Consensus 221 k~H~~Fw~~~~~~l~k~~~~fek~ek---------k~Vl~v~F~e-ngdviTgDS------------------------- 265 (626)
T KOG2106|consen 221 KGHLYFWTLRGGSLVKRQGIFEKREK---------KFVLCVTFLE-NGDVITGDS------------------------- 265 (626)
T ss_pred CceEEEEEccCCceEEEeeccccccc---------eEEEEEEEcC-CCCEEeecC-------------------------
Confidence 88999999999988877776433211 0111233332 234444443
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
+|.|...+..+-+..-+....+..+ .+- +++ .++.|+.+.+
T Consensus 266 --------~G~i~Iw~~~~~~~~k~~~aH~ggv-----------------------~~L-~~l-------r~GtllSGgK 306 (626)
T KOG2106|consen 266 --------GGNILIWSKGTNRISKQVHAHDGGV-----------------------FSL-CML-------RDGTLLSGGK 306 (626)
T ss_pred --------CceEEEEeCCCceEEeEeeecCCce-----------------------EEE-EEe-------cCccEeecCc
Confidence 3888888876655555555222211 111 111 1567777888
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecC---------cc---------cccc--ccCCCC--
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIAN---------SQ---------HKNF--NLKPSK-- 271 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~---------~~---------~~~~--~~~~~~-- 271 (382)
|-.+.+.| .+=+++-..+++.. .|... ..+-..+-+|+++.. .+ .+.+ ++-|..
T Consensus 307 DRki~~Wd-~~y~k~r~~elPe~--~G~iR-tv~e~~~di~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~hps~~q 382 (626)
T KOG2106|consen 307 DRKIILWD-DNYRKLRETELPEQ--FGPIR-TVAEGKGDILVGTTRNFILQGTLENGFTLTVQGHGDELWGLATHPSKNQ 382 (626)
T ss_pred cceEEecc-ccccccccccCchh--cCCee-EEecCCCcEEEeeccceEEEeeecCCceEEEEecccceeeEEcCCChhh
Confidence 88888888 44555555555432 11111 000011113333221 11 0111 223322
Q ss_pred --CCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEe
Q 040693 272 --NSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKIL 334 (382)
Q Consensus 272 --~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~il 334 (382)
.+.+++.+...+ .-+++|+..+..+.. .... -.-+.|.+++. .|++++||.++-..+
T Consensus 383 ~~T~gqdk~v~lW~--~~k~~wt~~~~d~~~-~~~f-hpsg~va~Gt~--~G~w~V~d~e~~~lv 441 (626)
T KOG2106|consen 383 LLTCGQDKHVRLWN--DHKLEWTKIIEDPAE-CADF-HPSGVVAVGTA--TGRWFVLDTETQDLV 441 (626)
T ss_pred eeeccCcceEEEcc--CCceeEEEEecCcee-Eeec-cCcceEEEeec--cceEEEEecccceeE
Confidence 135567777777 779999988765311 1111 11246666775 899999999884443
No 210
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=86.49 E-value=35 Score=33.52 Aligned_cols=114 Identities=14% Similarity=0.087 Sum_probs=72.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+..++..+.++..---|..+|+.+-...... ........+. -++.+|... ..++.|-.+|
T Consensus 315 geYllsAs~d~~w~Fsd~~~g~~lt~vs~~~---s~v~~ts~~fHpDgLifgtg----------------t~d~~vkiwd 375 (506)
T KOG0289|consen 315 GEYLLSASNDGTWAFSDISSGSQLTVVSDET---SDVEYTSAAFHPDGLIFGTG----------------TPDGVVKIWD 375 (506)
T ss_pred CcEEEEecCCceEEEEEccCCcEEEEEeecc---ccceeEEeeEcCCceEEecc----------------CCCceEEEEE
Confidence 5788899989977777899999888776531 1112222233 356666542 4568899999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
++.+..+=+++...+. ...+.+ +||+..+... +++.|.++|+..-+-.-++.+.
T Consensus 376 lks~~~~a~Fpght~~--vk~i~FsENGY~Lat~a-dd~~V~lwDLRKl~n~kt~~l~ 430 (506)
T KOG0289|consen 376 LKSQTNVAKFPGHTGP--VKAISFSENGYWLATAA-DDGSVKLWDLRKLKNFKTIQLD 430 (506)
T ss_pred cCCccccccCCCCCCc--eeEEEeccCceEEEEEe-cCCeEEEEEehhhcccceeecc
Confidence 9999877666654322 223334 4666555443 5777999998755544444443
No 211
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=86.42 E-value=51 Score=35.37 Aligned_cols=138 Identities=12% Similarity=0.071 Sum_probs=81.6
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEE-EeeeCceeecEEEEEccCcEEEEEe
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMML-SMYRNKVKHDIVVAVQKSGFAWALD 221 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~-~~~~~g~~~~~v~~~~~~g~l~ald 221 (382)
-.|.+++.+++...-..+-... |++. ++.+. +..+.+.+-||.|.++|
T Consensus 118 ~~vK~~~~~D~s~~~~lrgh~a----------------------------pVl~l~~~p~---~~fLAvss~dG~v~iw~ 166 (933)
T KOG1274|consen 118 TAVKLLNLDDSSQEKVLRGHDA----------------------------PVLQLSYDPK---GNFLAVSSCDGKVQIWD 166 (933)
T ss_pred eeEEEEeccccchheeecccCC----------------------------ceeeeeEcCC---CCEEEEEecCceEEEEE
Confidence 6889999988887776654422 2221 22222 46777888899999999
Q ss_pred CCCCCeeeeeccCCCC--CC-CCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCC
Q 040693 222 RDSGSLIWSMEAGPGG--LG-GGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADP 296 (382)
Q Consensus 222 ~~tG~~~W~~~~~~~~--~~-g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~ 296 (382)
.++|.+.-....-.+. .. ......+++ +++.+.+. ..++.|..|+..+.+++......
T Consensus 167 ~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~-----------------~~d~~Vkvy~r~~we~~f~Lr~~ 229 (933)
T KOG1274|consen 167 LQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVP-----------------PVDNTVKVYSRKGWELQFKLRDK 229 (933)
T ss_pred cccchhhhhcccCCccccccccceeeeeeecCCCCeEEee-----------------ccCCeEEEEccCCceeheeeccc
Confidence 9999877655432210 00 111111222 33444433 24577999999999999887765
Q ss_pred CCCCCCcceEEe-CC-EEEEeeecCCCcEEEEeCCC
Q 040693 297 SNGTAPGPVTVA-NG-VLFGGSTYRQGPIYAMDVKT 330 (382)
Q Consensus 297 ~~~~~~~~~~~~-~~-~v~~~~~~~~g~l~~ld~~t 330 (382)
.......-+... +| ++-+++ .+|.|...|.++
T Consensus 230 ~~ss~~~~~~wsPnG~YiAAs~--~~g~I~vWnv~t 263 (933)
T KOG1274|consen 230 LSSSKFSDLQWSPNGKYIAAST--LDGQILVWNVDT 263 (933)
T ss_pred ccccceEEEEEcCCCcEEeeec--cCCcEEEEeccc
Confidence 421111222222 34 444434 588899888885
No 212
>PLN02153 epithiospecifier protein
Probab=86.41 E-value=31 Score=32.90 Aligned_cols=134 Identities=18% Similarity=0.230 Sum_probs=64.8
Q ss_pred EEEEEeCCCCCeeeeeccCCCC-CCCCcccceeeeCCeEEEEecCccccccccCCCC-CCCCCceEEEEECCCCcEEeee
Q 040693 216 FAWALDRDSGSLIWSMEAGPGG-LGGGAMWGAATDERRIYTNIANSQHKNFNLKPSK-NSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~~-~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
.+.++|+++. .|+.-..... .........++.++.+|+....... ..++. .......+.+||+++. .|+.
T Consensus 160 ~v~~yd~~~~--~W~~l~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~----~~~gG~~~~~~~~v~~yd~~~~--~W~~ 231 (341)
T PLN02153 160 TIEAYNIADG--KWVQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATS----ILPGGKSDYESNAVQFFDPASG--KWTE 231 (341)
T ss_pred eEEEEECCCC--eEeeCCCCCCCCCCCCcceEEEECCeEEEEeccccc----cccCCccceecCceEEEEcCCC--cEEe
Confidence 4788998854 6886432210 0111222233456777775332100 00110 0011356999999765 4875
Q ss_pred cC-----CCCCCCCcceEEeCCEEEEeeecC-------------CCcEEEEeCCCCcEeEEEec-------CCce--ecc
Q 040693 294 AD-----PSNGTAPGPVTVANGVLFGGSTYR-------------QGPIYAMDVKTGKILWSYDT-------GATI--YGG 346 (382)
Q Consensus 294 ~~-----~~~~~~~~~~~~~~~~v~~~~~~~-------------~g~l~~ld~~tG~ilw~~~~-------~~~~--~~~ 346 (382)
-. +... ......+.++.+|+..... ...|+++|+++ ..|+.-. +.+. +..
T Consensus 232 ~~~~g~~P~~r-~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~--~~W~~~~~~~~~~~pr~~~~~~~ 308 (341)
T PLN02153 232 VETTGAKPSAR-SVFAHAVVGKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTET--LVWEKLGECGEPAMPRGWTAYTT 308 (341)
T ss_pred ccccCCCCCCc-ceeeeEEECCEEEEECcccCCccccccccccccccEEEEEcCc--cEEEeccCCCCCCCCCccccccc
Confidence 32 2222 2223344566666654210 12689999874 4576321 1111 122
Q ss_pred eE-EeCCEEEEEeCc
Q 040693 347 AS-VSNGCIYMGNGY 360 (382)
Q Consensus 347 p~-~~~g~lyv~~~~ 360 (382)
.. ..++++|+..+.
T Consensus 309 ~~v~~~~~~~~~gG~ 323 (341)
T PLN02153 309 ATVYGKNGLLMHGGK 323 (341)
T ss_pred cccCCcceEEEEcCc
Confidence 23 345688886554
No 213
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=86.21 E-value=9.4 Score=30.24 Aligned_cols=49 Identities=22% Similarity=0.453 Sum_probs=32.9
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDR 222 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~ 222 (382)
|.|+..|. +|+++|+...... . ...++.-..+|.++.++.
T Consensus 65 GnLvl~~~-~g~~vW~S~~~~~--------------------------~-------------~~~~~~L~ddGnlvl~~~ 104 (116)
T cd00028 65 GNLVIYDG-SGTVVWSSNTTRV--------------------------N-------------GNYVLVLLDDGNLVLYDS 104 (116)
T ss_pred CCeEEEcC-CCcEEEEecccCC--------------------------C-------------CceEEEEeCCCCEEEECC
Confidence 77888886 5899998655421 0 223334456788888886
Q ss_pred CCCCeeeeec
Q 040693 223 DSGSLIWSME 232 (382)
Q Consensus 223 ~tG~~~W~~~ 232 (382)
. |+++|+--
T Consensus 105 ~-~~~~W~Sf 113 (116)
T cd00028 105 D-GNFLWQSF 113 (116)
T ss_pred C-CCEEEcCC
Confidence 5 89999853
No 214
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=86.20 E-value=39 Score=33.91 Aligned_cols=88 Identities=13% Similarity=0.124 Sum_probs=53.1
Q ss_pred CCceEEEEECCCCcE-Eeee-cCCCCCCCCcceEE--eCCEEEEeeec----CCCcEEEEeCCCCcEeEEEecCCceecc
Q 040693 275 IAGGWVAMDASNGNV-LWST-ADPSNGTAPGPVTV--ANGVLFGGSTY----RQGPIYAMDVKTGKILWSYDTGATIYGG 346 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~-~W~~-~~~~~~~~~~~~~~--~~~~v~~~~~~----~~g~l~~ld~~tG~ilw~~~~~~~~~~~ 346 (382)
.++++..+|+++-+. +-.. .++. .+-.+-..+ ++.+|+.++.- ..|.|+.||..|-+++.+..++.....-
T Consensus 384 ~D~tLKvWDLrq~kkpL~~~tgL~t-~~~~tdc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~t~d~v~ki~i~~aSvv~ 462 (641)
T KOG0772|consen 384 FDDTLKVWDLRQFKKPLNVRTGLPT-PFPGTDCCFSPDDKLILTGTSAPNGMTAGTLFFFDRMTLDTVYKIDISTASVVR 462 (641)
T ss_pred CCCceeeeeccccccchhhhcCCCc-cCCCCccccCCCceEEEecccccCCCCCceEEEEeccceeeEEEecCCCceEEE
Confidence 457788888887643 3222 1222 111222222 34566665431 2467999999999999999887543322
Q ss_pred eEE--eCCEEEEEeCceeE
Q 040693 347 ASV--SNGCIYMGNGYKVT 363 (382)
Q Consensus 347 p~~--~~g~lyv~~~~g~~ 363 (382)
.+. -=|.|++++++|.+
T Consensus 463 ~~WhpkLNQi~~gsgdG~~ 481 (641)
T KOG0772|consen 463 CLWHPKLNQIFAGSGDGTA 481 (641)
T ss_pred EeecchhhheeeecCCCce
Confidence 222 23889999999885
No 215
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity.
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis.
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=86.08 E-value=15 Score=35.25 Aligned_cols=128 Identities=12% Similarity=-0.003 Sum_probs=67.7
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCccccee--eeCCe-EEEEecCccccccccCCCCCCCCCceEEEEECCCCcEE
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAA--TDERR-IYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVL 290 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~--~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~ 290 (382)
...+..+|..+|+..+..+............... .+++. +++.. . +.-.+|+.++..++++.
T Consensus 209 ~~~l~~~d~~tg~~~~~~~e~~~~Wv~~~~~~~~~~~~~~~~l~~s~-~--------------~G~~hly~~~~~~~~~~ 273 (353)
T PF00930_consen 209 RLDLVLCDASTGETRVVLEETSDGWVDVYDPPHFLGPDGNEFLWISE-R--------------DGYRHLYLYDLDGGKPR 273 (353)
T ss_dssp EEEEEEEEECTTTCEEEEEEESSSSSSSSSEEEE-TTTSSEEEEEEE-T--------------TSSEEEEEEETTSSEEE
T ss_pred EEEEEEEECCCCceeEEEEecCCcceeeecccccccCCCCEEEEEEE-c--------------CCCcEEEEEccccccee
Confidence 3578889999998887776544333321111111 13333 33332 2 33577999999999877
Q ss_pred eeecCCCCCCCCcceEEe--CCEEEEeeec---CCCcEEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEEe
Q 040693 291 WSTADPSNGTAPGPVTVA--NGVLFGGSTY---RQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMGN 358 (382)
Q Consensus 291 W~~~~~~~~~~~~~~~~~--~~~v~~~~~~---~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~~ 358 (382)
+-+..... ....+.++ ++.||..+.. ..-.||.++.+++..+-++........++.+ .+++.|+-+
T Consensus 274 ~lT~G~~~--V~~i~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~~~~~~~LT~~~~~~~~~~~Spdg~y~v~~ 345 (353)
T PF00930_consen 274 QLTSGDWE--VTSILGWDEDNNRIYFTANGDNPGERHLYRVSLDSGGEPKCLTCEDGDHYSASFSPDGKYYVDT 345 (353)
T ss_dssp ESS-SSS---EEEEEEEECTSSEEEEEESSGGTTSBEEEEEETTETTEEEESSTTSSTTEEEEE-TTSSEEEEE
T ss_pred ccccCcee--ecccceEcCCCCEEEEEecCCCCCceEEEEEEeCCCCCeEeccCCCCCceEEEECCCCCEEEEE
Confidence 55554321 11233333 4677776641 1336999998834444444433322223433 345555543
No 216
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=85.88 E-value=42 Score=34.27 Aligned_cols=85 Identities=15% Similarity=0.177 Sum_probs=46.6
Q ss_pred CCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC-CcEeEEEecCCceecceEE---e
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT-GKILWSYDTGATIYGGASV---S 350 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t-G~ilw~~~~~~~~~~~p~~---~ 350 (382)
..+.++..+..+|+.+-..-.-.+........-.++.+++++. +..|.-++.++ +-++-...-.. .....++ .
T Consensus 352 s~g~L~van~stG~~v~sv~q~Rg~nit~~~~d~~g~lWlgs~--q~GLsrl~n~n~~avlde~agl~-ss~V~aived~ 428 (671)
T COG3292 352 SIGELMVANGSTGELVRSVHQLRGMNITTTLEDSRGRLWLGSM--QNGLSRLDNKNEWAVLDEDAGLP-SSEVSAIVEDP 428 (671)
T ss_pred ccceEEEecCCCCcEEEEeeeccccccchhhhccCCcEEEEec--ccchhhhccCCcccccccccCCc-ccceeeeeecC
Confidence 3467889999999877664322211122222223678999885 55666677665 33332221111 1111112 4
Q ss_pred CCEEEEEeCcee
Q 040693 351 NGCIYMGNGYKV 362 (382)
Q Consensus 351 ~g~lyv~~~~g~ 362 (382)
++.|.+++++|.
T Consensus 429 dnsLWIGTs~Gl 440 (671)
T COG3292 429 DNSLWIGTSGGL 440 (671)
T ss_pred CCCEEEeccCCe
Confidence 577999988866
No 217
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=85.70 E-value=40 Score=34.76 Aligned_cols=108 Identities=16% Similarity=0.188 Sum_probs=61.4
Q ss_pred ecEEEEEccCcEEEEEeCCCCC--eeeeeccCCCC-CC-CC--cccceee-eCCeEEEEecCccccccccCCCCCCCCCc
Q 040693 205 HDIVVAVQKSGFAWALDRDSGS--LIWSMEAGPGG-LG-GG--AMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAG 277 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~--~~W~~~~~~~~-~~-g~--~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g 277 (382)
+.++..++-|..++.+|..+|. +.-+++.-+.. .. |. +.+..+. ..+.++++. ...+
T Consensus 130 ~~lvaSgGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ivsG----------------gtek 193 (735)
T KOG0308|consen 130 NELVASGGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTIIVSG----------------GTEK 193 (735)
T ss_pred ceeEEecCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceEEEec----------------Cccc
Confidence 5666677789999999999883 33333321111 11 11 2233222 334455542 2235
Q ss_pred eEEEEECCCCcEEeeecCCCCCCCCcceEE-e-CCEEEEeeecCCCcEEEEeCCCCc
Q 040693 278 GWVAMDASNGNVLWSTADPSNGTAPGPVTV-A-NGVLFGGSTYRQGPIYAMDVKTGK 332 (382)
Q Consensus 278 ~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~-~-~~~v~~~~~~~~g~l~~ld~~tG~ 332 (382)
-+..+|++|++.+=+...... ....+.+ + +.+++-++. +|.|...|+..-+
T Consensus 194 ~lr~wDprt~~kimkLrGHTd--NVr~ll~~dDGt~~ls~sS--DgtIrlWdLgqQr 246 (735)
T KOG0308|consen 194 DLRLWDPRTCKKIMKLRGHTD--NVRVLLVNDDGTRLLSASS--DGTIRLWDLGQQR 246 (735)
T ss_pred ceEEeccccccceeeeecccc--ceEEEEEcCCCCeEeecCC--CceEEeeeccccc
Confidence 689999999998877664431 2223333 2 346776664 8888888875433
No 218
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=85.65 E-value=33 Score=32.47 Aligned_cols=47 Identities=19% Similarity=0.242 Sum_probs=30.4
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATG 108 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~ 108 (382)
+.-|+.+|+-||+++-+++.-... .++-+. -+..+.+++..||.+..
T Consensus 132 ~~PIh~wdaftG~lraSy~~ydh~----de~taA----hsL~Fs~DGeqlfaGyk 178 (406)
T KOG2919|consen 132 DQPIHLWDAFTGKLRASYRAYDHQ----DEYTAA----HSLQFSPDGEQLFAGYK 178 (406)
T ss_pred cCceeeeeccccccccchhhhhhH----Hhhhhh----eeEEecCCCCeEeeccc
Confidence 446888999999999998763221 112111 14567778888888653
No 219
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=84.93 E-value=41 Score=34.02 Aligned_cols=109 Identities=9% Similarity=-0.013 Sum_probs=72.4
Q ss_pred CCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEE
Q 040693 140 NHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWA 219 (382)
Q Consensus 140 ~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~a 219 (382)
++.|.|..++..-|++-|+......+. ++.... ++..-+.|+..+.+..+.-
T Consensus 77 t~~g~v~~ys~~~g~it~~~st~~h~~--------------------------~v~~~~--~~~~~~ciyS~~ad~~v~~ 128 (541)
T KOG4547|consen 77 TPQGSVLLYSVAGGEITAKLSTDKHYG--------------------------NVNEIL--DAQRLGCIYSVGADLKVVY 128 (541)
T ss_pred cCCccEEEEEecCCeEEEEEecCCCCC--------------------------cceeee--cccccCceEecCCceeEEE
Confidence 345999999999999999998665421 111100 0112467888888999999
Q ss_pred EeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCC
Q 040693 220 LDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSN 298 (382)
Q Consensus 220 ld~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~ 298 (382)
++.++++.+-.....++. .....+.-|+.++.+. ++.|..+|++++|++=++.....
T Consensus 129 ~~~~~~~~~~~~~~~~~~---~~sl~is~D~~~l~~a-------------------s~~ik~~~~~~kevv~~ftgh~s 185 (541)
T KOG4547|consen 129 ILEKEKVIIRIWKEQKPL---VSSLCISPDGKILLTA-------------------SRQIKVLDIETKEVVITFTGHGS 185 (541)
T ss_pred EecccceeeeeeccCCCc---cceEEEcCCCCEEEec-------------------cceEEEEEccCceEEEEecCCCc
Confidence 999999877665544321 1222222255555543 36699999999999988886653
No 220
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=84.81 E-value=3.1 Score=25.93 Aligned_cols=33 Identities=15% Similarity=0.282 Sum_probs=27.0
Q ss_pred eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC
Q 040693 308 ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 308 ~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
+++.+|++.. ..+.|..||+++++++-+.+++.
T Consensus 2 d~~~lyv~~~-~~~~v~~id~~~~~~~~~i~vg~ 34 (42)
T TIGR02276 2 DGTKLYVTNS-GSNTVSVIDTATNKVIATIPVGG 34 (42)
T ss_pred CCCEEEEEeC-CCCEEEEEECCCCeEEEEEECCC
Confidence 3567888774 57899999999999999988853
No 221
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=84.69 E-value=21 Score=34.68 Aligned_cols=112 Identities=14% Similarity=0.066 Sum_probs=75.5
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
..++.+..|..|..+|..++++.-..++.. ........ ++..|... ..+.++-.||.
T Consensus 313 ~~~~SgH~DkkvRfwD~Rs~~~~~sv~~gg-----~vtSl~ls~~g~~lLss-----------------sRDdtl~viDl 370 (459)
T KOG0288|consen 313 SDVISGHFDKKVRFWDIRSADKTRSVPLGG-----RVTSLDLSMDGLELLSS-----------------SRDDTLKVIDL 370 (459)
T ss_pred eeeeecccccceEEEeccCCceeeEeecCc-----ceeeEeeccCCeEEeee-----------------cCCCceeeeec
Confidence 345566778899999999898888887542 22222222 33444443 34578999999
Q ss_pred CCCcEEeeecCCCC---CCCCcceEEe-CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC
Q 040693 285 SNGNVLWSTADPSN---GTAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 285 ~tG~~~W~~~~~~~---~~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
++-++.-.+....- ..+...+... +.+|.+++ .+|.||..+..+||+.-+...+.
T Consensus 371 Rt~eI~~~~sA~g~k~asDwtrvvfSpd~~YvaAGS--~dgsv~iW~v~tgKlE~~l~~s~ 429 (459)
T KOG0288|consen 371 RTKEIRQTFSAEGFKCASDWTRVVFSPDGSYVAAGS--ADGSVYIWSVFTGKLEKVLSLST 429 (459)
T ss_pred ccccEEEEeeccccccccccceeEECCCCceeeecc--CCCcEEEEEccCceEEEEeccCC
Confidence 99999888765541 1223333223 34666666 49999999999999998887665
No 222
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=84.63 E-value=36 Score=32.38 Aligned_cols=183 Identities=14% Similarity=0.076 Sum_probs=94.5
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDR 222 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~ 222 (382)
+.+..+|..||+.+=+++..+... +...+.+ .+ +.+.|++++.||.+.++|.
T Consensus 50 gsv~lyd~~tg~~l~~fk~~~~~~------------------------N~vrf~~--~d--s~h~v~s~ssDG~Vr~wD~ 101 (376)
T KOG1188|consen 50 GSVRLYDKGTGQLLEEFKGPPATT------------------------NGVRFIS--CD--SPHGVISCSSDGTVRLWDI 101 (376)
T ss_pred CeEEEEeccchhhhheecCCCCcc------------------------cceEEec--CC--CCCeeEEeccCCeEEEEEe
Confidence 899999999999999988765522 1111111 11 2577889999999999998
Q ss_pred CCCCeeeeeccCCCCCCCCcccceeee-CCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEE-eeecCCCCCC
Q 040693 223 DSGSLIWSMEAGPGGLGGGAMWGAATD-ERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVL-WSTADPSNGT 300 (382)
Q Consensus 223 ~tG~~~W~~~~~~~~~~g~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~-W~~~~~~~~~ 300 (382)
.+-...-.+..... ++.+....... ++.++....+. ......|+.+|.+.-+.. -.+...-..
T Consensus 102 Rs~~e~a~~~~~~~--~~~~f~~ld~nck~~ii~~GtE~------------~~s~A~v~lwDvR~~qq~l~~~~eSH~D- 166 (376)
T KOG1188|consen 102 RSQAESARISWTQQ--SGTPFICLDLNCKKNIIACGTEL------------TRSDASVVLWDVRSEQQLLRQLNESHND- 166 (376)
T ss_pred ecchhhhheeccCC--CCCcceEeeccCcCCeEEecccc------------ccCceEEEEEEeccccchhhhhhhhccC-
Confidence 87544433332211 11122222111 23333332110 122355888888765542 111110001
Q ss_pred CCcceEE---eCCEEEEeeecCCCcEEEEeCCCCcE----eEEEecCCceecceEEeCC--EEEEEeCceeEeecCCccC
Q 040693 301 APGPVTV---ANGVLFGGSTYRQGPIYAMDVKTGKI----LWSYDTGATIYGGASVSNG--CIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 301 ~~~~~~~---~~~~v~~~~~~~~g~l~~ld~~tG~i----lw~~~~~~~~~~~p~~~~g--~lyv~~~~g~~~~~~~~~~ 371 (382)
..+.+-+ +.+++.-++. +|.+-.||.+.-.+ +-..+.+..+..---..++ ++|+.+-.+. .++|+++-
T Consensus 167 DVT~lrFHP~~pnlLlSGSv--DGLvnlfD~~~d~EeDaL~~viN~~sSI~~igw~~~~ykrI~clTH~Et-f~~~ele~ 243 (376)
T KOG1188|consen 167 DVTQLRFHPSDPNLLLSGSV--DGLVNLFDTKKDNEEDALLHVINHGSSIHLIGWLSKKYKRIMCLTHMET-FAIYELED 243 (376)
T ss_pred cceeEEecCCCCCeEEeecc--cceEEeeecCCCcchhhHHHhhcccceeeeeeeecCCcceEEEEEccCc-eeEEEccC
Confidence 1122222 3467777775 99999999863211 1122222222222223455 7888876644 45666654
No 223
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=84.28 E-value=13 Score=29.20 Aligned_cols=23 Identities=22% Similarity=0.518 Sum_probs=16.4
Q ss_pred EEEEccCcEEEEEeCCCCCeeeee
Q 040693 208 VVAVQKSGFAWALDRDSGSLIWSM 231 (382)
Q Consensus 208 v~~~~~~g~l~ald~~tG~~~W~~ 231 (382)
.+.-..+|.++.+|. .|+++|+-
T Consensus 89 ~~~L~ddGnlvl~~~-~~~~~W~S 111 (114)
T smart00108 89 VLVLLDDGNLVIYDS-DGNFLWQS 111 (114)
T ss_pred EEEEeCCCCEEEECC-CCCEEeCC
Confidence 334456788888886 47899974
No 224
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=83.99 E-value=48 Score=32.97 Aligned_cols=131 Identities=18% Similarity=0.188 Sum_probs=66.6
Q ss_pred ecEEEEEccCc--EEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSG--FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g--~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
..++++...+| .+|.+|..++. +++...... ......|.+ ++..++..+.. ...-.|+.+
T Consensus 250 ~~l~f~~~rdg~~~iy~~dl~~~~-~~~Lt~~~g-i~~~Ps~sp--dG~~ivf~Sdr--------------~G~p~I~~~ 311 (425)
T COG0823 250 SKLAFSSSRDGSPDIYLMDLDGKN-LPRLTNGFG-INTSPSWSP--DGSKIVFTSDR--------------GGRPQIYLY 311 (425)
T ss_pred CEEEEEECCCCCccEEEEcCCCCc-ceecccCCc-cccCccCCC--CCCEEEEEeCC--------------CCCcceEEE
Confidence 45556665554 79999988555 666443221 122233433 66666665332 122358999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEEeC-CEEEEeeecCCCc--EEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEE
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTVAN-GVLFGGSTYRQGP--IYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMG 357 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~g~--l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~ 357 (382)
|+..++..--+..... .. .|....+ .++.+.+. ..|. +...|+.++.- |+.-.......+|.. .+++.++.
T Consensus 312 ~~~g~~~~riT~~~~~-~~-~p~~SpdG~~i~~~~~-~~g~~~i~~~~~~~~~~-~~~lt~~~~~e~ps~~~ng~~i~~ 386 (425)
T COG0823 312 DLEGSQVTRLTFSGGG-NS-NPVWSPDGDKIVFESS-SGGQWDIDKNDLASGGK-IRILTSTYLNESPSWAPNGRMIMF 386 (425)
T ss_pred CCCCCceeEeeccCCC-Cc-CccCCCCCCEEEEEec-cCCceeeEEeccCCCCc-EEEccccccCCCCCcCCCCceEEE
Confidence 9888876333332221 22 3333444 44333332 2455 77777777665 555443333344433 44444443
No 225
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=83.82 E-value=9.9 Score=35.31 Aligned_cols=110 Identities=18% Similarity=0.325 Sum_probs=66.9
Q ss_pred cEEEEEccCcEEEEEeCCCCCee----eeeccCCCCCCCCcccceee---eCCeEEEEecCccccccccCCCCCCCCCce
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLI----WSMEAGPGGLGGGAMWGAAT---DERRIYTNIANSQHKNFNLKPSKNSTIAGG 278 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~----W~~~~~~~~~~g~~~~~~~~---~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~ 278 (382)
..|++...+|.+..++-+.+.+. |+...- ..|.... +-++||.+ ..++.
T Consensus 134 ~~i~vs~s~G~~~~v~~t~~~le~vq~wk~He~-------E~Wta~f~~~~pnlvytG-----------------gDD~~ 189 (339)
T KOG0280|consen 134 TKIFVSDSRGSISGVYETEMVLEKVQTWKVHEF-------EAWTAKFSDKEPNLVYTG-----------------GDDGS 189 (339)
T ss_pred ceEEEEcCCCcEEEEecceeeeeecccccccce-------eeeeeecccCCCceEEec-----------------CCCce
Confidence 44777777777766665544332 322210 2233222 33677776 45689
Q ss_pred EEEEECC-CCcEEeeecCCCCCCCCcceEE-----eCCEEEEeeecCCCcEEEEeCCC-CcEeEEEecCCcee
Q 040693 279 WVAMDAS-NGNVLWSTADPSNGTAPGPVTV-----ANGVLFGGSTYRQGPIYAMDVKT-GKILWSYDTGATIY 344 (382)
Q Consensus 279 v~a~d~~-tG~~~W~~~~~~~~~~~~~~~~-----~~~~v~~~~~~~~g~l~~ld~~t-G~ilw~~~~~~~~~ 344 (382)
+.++|.+ .++-+|+....- .++...+ ....++.+++ +..+..+|..+ ||++.+.+++++++
T Consensus 190 l~~~D~R~p~~~i~~n~kvH---~~GV~SI~ss~~~~~~I~TGsY--De~i~~~DtRnm~kPl~~~~v~GGVW 257 (339)
T KOG0280|consen 190 LSCWDIRIPKTFIWHNSKVH---TSGVVSIYSSPPKPTYIATGSY--DECIRVLDTRNMGKPLFKAKVGGGVW 257 (339)
T ss_pred EEEEEecCCcceeeecceee---ecceEEEecCCCCCceEEEecc--ccceeeeehhcccCccccCccccceE
Confidence 9999999 788899854322 1222222 2347888885 89999999884 88877766666543
No 226
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=83.47 E-value=33 Score=31.46 Aligned_cols=140 Identities=12% Similarity=0.075 Sum_probs=85.9
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
.+++.+...+..+..+|..+||..-..+..... --..|.| +++.+.++ ..+..|.-||.
T Consensus 77 ~d~~atas~dk~ir~wd~r~~k~~~~i~~~~en--i~i~wsp--~g~~~~~~-----------------~kdD~it~id~ 135 (313)
T KOG1407|consen 77 PDLFATASGDKTIRIWDIRSGKCTARIETKGEN--INITWSP--DGEYIAVG-----------------NKDDRITFIDA 135 (313)
T ss_pred CcceEEecCCceEEEEEeccCcEEEEeeccCcc--eEEEEcC--CCCEEEEe-----------------cCcccEEEEEe
Confidence 355566667778889999999988877743210 0012222 56666665 45577999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE---eCCEEEEEeCce
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV---SNGCIYMGNGYK 361 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~---~~g~lyv~~~~g 361 (382)
++-+++-+.+++.... ...--..|++.|+.+. .|.+..|..-.-|++-..+.... ....+ -+|+-|...+..
T Consensus 136 r~~~~~~~~~~~~e~n-e~~w~~~nd~Fflt~G--lG~v~ILsypsLkpv~si~AH~s--nCicI~f~p~GryfA~GsAD 210 (313)
T KOG1407|consen 136 RTYKIVNEEQFKFEVN-EISWNNSNDLFFLTNG--LGCVEILSYPSLKPVQSIKAHPS--NCICIEFDPDGRYFATGSAD 210 (313)
T ss_pred cccceeehhcccceee-eeeecCCCCEEEEecC--CceEEEEeccccccccccccCCc--ceEEEEECCCCceEeecccc
Confidence 9999988877766311 1111234566666653 68888888776777766654321 11122 356666666665
Q ss_pred eEeecCCcc
Q 040693 362 VTVGFGNKN 370 (382)
Q Consensus 362 ~~~~~~~~~ 370 (382)
..+.|+.++
T Consensus 211 AlvSLWD~~ 219 (313)
T KOG1407|consen 211 ALVSLWDVD 219 (313)
T ss_pred ceeeccChh
Confidence 667776543
No 227
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=82.94 E-value=33 Score=32.01 Aligned_cols=104 Identities=12% Similarity=0.065 Sum_probs=62.2
Q ss_pred ecEEEEEccCcEEEEEe-CCCCCeeeeeccCCCCCCCCcccc-eeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALD-RDSGSLIWSMEAGPGGLGGGAMWG-AATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald-~~tG~~~W~~~~~~~~~~g~~~~~-~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
+..+..++.|..++..+ ..+-+..|..+.- ++..+-. ...++..||.. ..+.+|+.+
T Consensus 59 gs~~aSgG~Dr~I~LWnv~gdceN~~~lkgH----sgAVM~l~~~~d~s~i~S~-----------------gtDk~v~~w 117 (338)
T KOG0265|consen 59 GSCFASGGSDRAIVLWNVYGDCENFWVLKGH----SGAVMELHGMRDGSHILSC-----------------GTDKTVRGW 117 (338)
T ss_pred CCeEeecCCcceEEEEeccccccceeeeccc----cceeEeeeeccCCCEEEEe-----------------cCCceEEEE
Confidence 35566677788888777 3345566666521 1111111 11156666665 345789999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT 330 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t 330 (382)
|.+||+...+......-...-.+ ..-|...+.+..+++.+...|..+
T Consensus 118 D~~tG~~~rk~k~h~~~vNs~~p-~rrg~~lv~SgsdD~t~kl~D~R~ 164 (338)
T KOG0265|consen 118 DAETGKRIRKHKGHTSFVNSLDP-SRRGPQLVCSGSDDGTLKLWDIRK 164 (338)
T ss_pred ecccceeeehhccccceeeecCc-cccCCeEEEecCCCceEEEEeecc
Confidence 99999999888876632222222 223444444444689999999874
No 228
>PF05262 Borrelia_P83: Borrelia P83/100 protein; InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=82.84 E-value=7.2 Score=39.19 Aligned_cols=96 Identities=10% Similarity=0.020 Sum_probs=59.1
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecCCC--cEEEEeCCCCcEeEEEecCCceecceEEe
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYRQG--PIYAMDVKTGKILWSYDTGATIYGGASVS 350 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~g--~l~~ld~~tG~ilw~~~~~~~~~~~p~~~ 350 (382)
...+.|+.+|+.+|+.+-+..+.. .....+.. .+.+|.++..+..+ .|+.||+.|=++.-+....-...|..++.
T Consensus 372 ~~ls~LvllD~~tg~~l~~S~~~~--Ir~r~~~~~~~~~vaI~g~~G~~~ikLvlid~~tLev~kes~~~i~~~S~l~~~ 449 (489)
T PF05262_consen 372 HYLSELVLLDSDTGDTLKRSPVNG--IRGRTFYEREDDLVAIAGCSGNAAIKLVLIDPETLEVKKESEDEISWQSSLIVD 449 (489)
T ss_pred CcceeEEEEeCCCCceecccccce--eccceeEEcCCCEEEEeccCCchheEEEecCcccceeeeeccccccccCceEEc
Confidence 345889999999999998877755 22333333 44555554321222 47777888888877665544444666668
Q ss_pred CCEEEEEe-CceeEeecCCccC
Q 040693 351 NGCIYMGN-GYKVTVGFGNKNF 371 (382)
Q Consensus 351 ~g~lyv~~-~~g~~~~~~~~~~ 371 (382)
++.+|+.- .....-+|..||.
T Consensus 450 ~~~iyaVv~~~~g~~~L~rF~~ 471 (489)
T PF05262_consen 450 GQMIYAVVKKDNGKWYLGRFDS 471 (489)
T ss_pred CCeEEEEEEcCCCeEEEeecCc
Confidence 88888665 3323233444443
No 229
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=82.58 E-value=51 Score=32.24 Aligned_cols=139 Identities=12% Similarity=0.038 Sum_probs=82.4
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+.++..++.|..-...|..||+.+-...--.. ..+.... -++....+. ..+++....|
T Consensus 315 GSL~~tGGlD~~~RvWDlRtgr~im~L~gH~k-----~I~~V~fsPNGy~lATg----------------s~Dnt~kVWD 373 (459)
T KOG0272|consen 315 GSLAATGGLDSLGRVWDLRTGRCIMFLAGHIK-----EILSVAFSPNGYHLATG----------------SSDNTCKVWD 373 (459)
T ss_pred CceeeccCccchhheeecccCcEEEEeccccc-----ceeeEeECCCceEEeec----------------CCCCcEEEee
Confidence 35666777777777788888887766652111 1111111 234444432 3457788889
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEe--CC-EEEEeeecCCCcEEEEeCCCCcEeEEEecCCc-eecceEEeCCEEEEEeC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVA--NG-VLFGGSTYRQGPIYAMDVKTGKILWSYDTGAT-IYGGASVSNGCIYMGNG 359 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~--~~-~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~-~~~~p~~~~g~lyv~~~ 359 (382)
++.-+.+.+++..... .+.+-+. .| .+..++ .++.+-.....++..+-...-..+ +++--+..++..+++++
T Consensus 374 LR~r~~ly~ipAH~nl--VS~Vk~~p~~g~fL~Tas--yD~t~kiWs~~~~~~~ksLaGHe~kV~s~Dis~d~~~i~t~s 449 (459)
T KOG0272|consen 374 LRMRSELYTIPAHSNL--VSQVKYSPQEGYFLVTAS--YDNTVKIWSTRTWSPLKSLAGHEGKVISLDISPDSQAIATSS 449 (459)
T ss_pred ecccccceecccccch--hhheEecccCCeEEEEcc--cCcceeeecCCCcccchhhcCCccceEEEEeccCCceEEEec
Confidence 8888887777765532 2222222 23 333344 477777777777777665543333 44444557788888888
Q ss_pred ceeEeecCC
Q 040693 360 YKVTVGFGN 368 (382)
Q Consensus 360 ~g~~~~~~~ 368 (382)
+.+..+++.
T Consensus 450 ~DRT~KLW~ 458 (459)
T KOG0272|consen 450 FDRTIKLWR 458 (459)
T ss_pred cCceeeecc
Confidence 888777653
No 230
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=81.91 E-value=23 Score=27.83 Aligned_cols=21 Identities=33% Similarity=0.604 Sum_probs=16.2
Q ss_pred ccCcEEEEEeCCCCCeeeeecc
Q 040693 212 QKSGFAWALDRDSGSLIWSMEA 233 (382)
Q Consensus 212 ~~~g~l~ald~~tG~~~W~~~~ 233 (382)
..+|.|+.+|.. |+.+|+...
T Consensus 61 ~~dGnLvl~~~~-g~~vW~S~t 81 (114)
T smart00108 61 QSDGNLVLYDGD-GRVVWSSNT 81 (114)
T ss_pred eCCCCEEEEeCC-CCEEEEecc
Confidence 457888888865 899999764
No 231
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=80.76 E-value=63 Score=34.28 Aligned_cols=52 Identities=8% Similarity=-0.029 Sum_probs=34.6
Q ss_pred CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCc---ee-cceEEeCCEEEEEeCceeE
Q 040693 309 NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGAT---IY-GGASVSNGCIYMGNGYKVT 363 (382)
Q Consensus 309 ~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~---~~-~~p~~~~g~lyv~~~~g~~ 363 (382)
-+.+|+.- .-.|+.||++=-..+...+++.+ +. --|-.-.+-||+...+|.+
T Consensus 243 rn~lfi~~---prellv~dle~~~~l~vvpier~~akfv~vlP~~~rd~LfclH~nG~l 298 (1062)
T KOG1912|consen 243 RNILFITF---PRELLVFDLEYECCLAVVPIERGGAKFVDVLPDPRRDALFCLHSNGRL 298 (1062)
T ss_pred hceEEEEe---ccceEEEcchhhceeEEEEeccCCcceeEeccCCCcceEEEEecCCeE
Confidence 45666665 67888888886667777777654 11 1233356788888888873
No 232
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=80.42 E-value=35 Score=34.15 Aligned_cols=103 Identities=12% Similarity=0.127 Sum_probs=58.5
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+.++.....+|.+..+|...-.++..+...-.....+..+.| .+..++++.+ .+.+|..||.
T Consensus 177 r~lL~~asd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfsp--sne~l~vsVG----------------~Dkki~~yD~ 238 (673)
T KOG4378|consen 177 RFLLSIASDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSP--SNEALLVSVG----------------YDKKINIYDI 238 (673)
T ss_pred ceeeEeeccCCeEEEEeccCCCcccchhhhccCCcCcceecC--CccceEEEec----------------ccceEEEeec
Confidence 567777888999999998866666555422111111222332 4555666543 3567888998
Q ss_pred CCCcEEeeecCCCCCCCCcceEE-eCCEE-EEeeecCCCcEEEEeCCC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTV-ANGVL-FGGSTYRQGPIYAMDVKT 330 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~-~~~~v-~~~~~~~~g~l~~ld~~t 330 (382)
...+..=+.....+ .+.+.+ ++|.+ .+++ ..|+|++.|...
T Consensus 239 ~s~~s~~~l~y~~P---lstvaf~~~G~~L~aG~--s~G~~i~YD~R~ 281 (673)
T KOG4378|consen 239 RSQASTDRLTYSHP---LSTVAFSECGTYLCAGN--SKGELIAYDMRS 281 (673)
T ss_pred ccccccceeeecCC---cceeeecCCceEEEeec--CCceEEEEeccc
Confidence 75544333222211 223333 34544 4444 489999999874
No 233
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=80.23 E-value=44 Score=33.20 Aligned_cols=122 Identities=13% Similarity=0.069 Sum_probs=63.8
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeec
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTA 294 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~ 294 (382)
..++.+|.++|+.--..+.+. ..+.+.|.| |+..|.+.... +..-.|+.+|..+++ +++..
T Consensus 218 ~~i~~~~l~~g~~~~i~~~~g--~~~~P~fsp--DG~~l~f~~~r--------------dg~~~iy~~dl~~~~-~~~Lt 278 (425)
T COG0823 218 PRIYYLDLNTGKRPVILNFNG--NNGAPAFSP--DGSKLAFSSSR--------------DGSPDIYLMDLDGKN-LPRLT 278 (425)
T ss_pred ceEEEEeccCCccceeeccCC--ccCCccCCC--CCCEEEEEECC--------------CCCccEEEEcCCCCc-ceecc
Confidence 458888888886554444221 111122222 66666665433 334569999998776 55533
Q ss_pred CCCCCCCCcceEEeCCEEEEeeecCCC--cEEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEE
Q 040693 295 DPSNGTAPGPVTVANGVLFGGSTYRQG--PIYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMG 357 (382)
Q Consensus 295 ~~~~~~~~~~~~~~~~~v~~~~~~~~g--~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~ 357 (382)
........+...-++..++..+. +.| .|+.+|++.+.. -+.....+....|.+ .+|+.++.
T Consensus 279 ~~~gi~~~Ps~spdG~~ivf~Sd-r~G~p~I~~~~~~g~~~-~riT~~~~~~~~p~~SpdG~~i~~ 342 (425)
T COG0823 279 NGFGINTSPSWSPDGSKIVFTSD-RGGRPQIYLYDLEGSQV-TRLTFSGGGNSNPVWSPDGDKIVF 342 (425)
T ss_pred cCCccccCccCCCCCCEEEEEeC-CCCCcceEEECCCCCce-eEeeccCCCCcCccCCCCCCEEEE
Confidence 22222222233234444444442 344 599999886554 455444444446655 44444443
No 234
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=79.89 E-value=51 Score=35.37 Aligned_cols=111 Identities=11% Similarity=0.017 Sum_probs=73.0
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
+-.|-+++.+++...=..+-... .+ ....+++.++++.+.+.+
T Consensus 117 D~~vK~~~~~D~s~~~~lrgh~a-----------pV--l~l~~~p~~~fLAvss~d------------------------ 159 (933)
T KOG1274|consen 117 DTAVKLLNLDDSSQEKVLRGHDA-----------PV--LQLSYDPKGNFLAVSSCD------------------------ 159 (933)
T ss_pred ceeEEEEeccccchheeecccCC-----------ce--eeeeEcCCCCEEEEEecC------------------------
Confidence 56788888888777665543211 01 135688888888887765
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCC-CCceEE-EeeeCceeecEEEEE
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFG-EAPMML-SMYRNKVKHDIVVAV 211 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~p~~~-~~~~~g~~~~~v~~~ 211 (382)
|.|+++|.++|.+.-........ .++- +.++.. ...++ ++.+.+.
T Consensus 160 ---------G~v~iw~~~~~~~~~tl~~v~k~---------------------n~~~~s~i~~~~aW~Pk---~g~la~~ 206 (933)
T KOG1274|consen 160 ---------GKVQIWDLQDGILSKTLTGVDKD---------------------NEFILSRICTRLAWHPK---GGTLAVP 206 (933)
T ss_pred ---------ceEEEEEcccchhhhhcccCCcc---------------------ccccccceeeeeeecCC---CCeEEee
Confidence 89999999999888776654431 1111 222211 12233 3566666
Q ss_pred ccCcEEEEEeCCCCCeeeeeccC
Q 040693 212 QKSGFAWALDRDSGSLIWSMEAG 234 (382)
Q Consensus 212 ~~~g~l~ald~~tG~~~W~~~~~ 234 (382)
.-++.|..+++++.+++......
T Consensus 207 ~~d~~Vkvy~r~~we~~f~Lr~~ 229 (933)
T KOG1274|consen 207 PVDNTVKVYSRKGWELQFKLRDK 229 (933)
T ss_pred ccCCeEEEEccCCceeheeeccc
Confidence 77888999999999999888865
No 235
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=79.45 E-value=42 Score=33.71 Aligned_cols=104 Identities=13% Similarity=0.079 Sum_probs=55.6
Q ss_pred ecEEEEEccCcEEEEEeCCCCCe-eeee-ccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSL-IWSM-EAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~-~W~~-~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
+++|+.-+.|+.|-.+|...-+. +-.+ .+..+.......+.| ++.+|+.+..- .++...+.|+-|
T Consensus 376 g~~LlSRg~D~tLKvWDLrq~kkpL~~~tgL~t~~~~tdc~FSP--d~kli~TGtS~-----------~~~~~~g~L~f~ 442 (641)
T KOG0772|consen 376 GNYLLSRGFDDTLKVWDLRQFKKPLNVRTGLPTPFPGTDCCFSP--DDKLILTGTSA-----------PNGMTAGTLFFF 442 (641)
T ss_pred cchhhhccCCCceeeeeccccccchhhhcCCCccCCCCccccCC--CceEEEecccc-----------cCCCCCceEEEE
Confidence 35566666677777777664332 2111 111111111122222 56666665332 122445789999
Q ss_pred ECCCCcEEeeecCCCCCCCCcceEEeC---CEEEEeeecCCCcEEEE
Q 040693 283 DASNGNVLWSTADPSNGTAPGPVTVAN---GVLFGGSTYRQGPIYAM 326 (382)
Q Consensus 283 d~~tG~~~W~~~~~~~~~~~~~~~~~~---~~v~~~~~~~~g~l~~l 326 (382)
|..|-+.+.++.+... ....+ .-+ +-|++++. +|.++++
T Consensus 443 d~~t~d~v~ki~i~~a--Svv~~-~WhpkLNQi~~gsg--dG~~~vy 484 (641)
T KOG0772|consen 443 DRMTLDTVYKIDISTA--SVVRC-LWHPKLNQIFAGSG--DGTAHVY 484 (641)
T ss_pred eccceeeEEEecCCCc--eEEEE-eecchhhheeeecC--CCceEEE
Confidence 9999999999987642 11111 112 35777774 6777665
No 236
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=79.43 E-value=65 Score=31.51 Aligned_cols=141 Identities=11% Similarity=0.073 Sum_probs=84.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+..+..+..|..-..+|..|++.+=..+--. ...+..+. -++.|..+.+ .+..-...|
T Consensus 273 G~~L~TasfD~tWRlWD~~tk~ElL~QEGHs-----~~v~~iaf~~DGSL~~tGG----------------lD~~~RvWD 331 (459)
T KOG0272|consen 273 GKFLGTASFDSTWRLWDLETKSELLLQEGHS-----KGVFSIAFQPDGSLAATGG----------------LDSLGRVWD 331 (459)
T ss_pred CceeeecccccchhhcccccchhhHhhcccc-----cccceeEecCCCceeeccC----------------ccchhheee
Confidence 4677888889888889999988775554211 12222222 3444444322 223334568
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE--eCCEEEEEeCce
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV--SNGCIYMGNGYK 361 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~--~~g~lyv~~~~g 361 (382)
++||+-.=-........ .+.-...||+..+... .++..-..|+.-.+.+..++......+..-+ ..|+..++.++.
T Consensus 332 lRtgr~im~L~gH~k~I-~~V~fsPNGy~lATgs-~Dnt~kVWDLR~r~~ly~ipAH~nlVS~Vk~~p~~g~fL~TasyD 409 (459)
T KOG0272|consen 332 LRTGRCIMFLAGHIKEI-LSVAFSPNGYHLATGS-SDNTCKVWDLRMRSELYTIPAHSNLVSQVKYSPQEGYFLVTASYD 409 (459)
T ss_pred cccCcEEEEecccccce-eeEeECCCceEEeecC-CCCcEEEeeecccccceecccccchhhheEecccCCeEEEEcccC
Confidence 88887665444332111 1111224677666543 6888888898877777776654433333222 578888888888
Q ss_pred eEeecCC
Q 040693 362 VTVGFGN 368 (382)
Q Consensus 362 ~~~~~~~ 368 (382)
..+.+|.
T Consensus 410 ~t~kiWs 416 (459)
T KOG0272|consen 410 NTVKIWS 416 (459)
T ss_pred cceeeec
Confidence 8887765
No 237
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=79.08 E-value=19 Score=33.76 Aligned_cols=153 Identities=14% Similarity=0.151 Sum_probs=83.4
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+.++..+..|+.+-.+|...-...-.++.-+. .-........-.+..|.++ +....+..+|+
T Consensus 184 e~ILiS~srD~tvKlFDfsK~saKrA~K~~qd-~~~vrsiSfHPsGefllvg-----------------TdHp~~rlYdv 245 (430)
T KOG0640|consen 184 ETILISGSRDNTVKLFDFSKTSAKRAFKVFQD-TEPVRSISFHPSGEFLLVG-----------------TDHPTLRLYDV 245 (430)
T ss_pred hheEEeccCCCeEEEEecccHHHHHHHHHhhc-cceeeeEeecCCCceEEEe-----------------cCCCceeEEec
Confidence 45667778888888888653222111111000 0000001111146777776 34466778887
Q ss_pred CCCcEEeeecCCCCCCCCcc---eEE-eCCEEEEeeecCCCcEEEEeCCCCcEeEEEec--CC-ceecceEEeCCEEEEE
Q 040693 285 SNGNVLWSTADPSNGTAPGP---VTV-ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDT--GA-TIYGGASVSNGCIYMG 357 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~---~~~-~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~--~~-~~~~~p~~~~g~lyv~ 357 (382)
.|-+--=.-. |.. ..... +-+ ..+.+|+... .+|.|-.+|.-+++-+-.+.. ++ .+.+..-..+++..++
T Consensus 246 ~T~Qcfvsan-Pd~-qht~ai~~V~Ys~t~~lYvTaS-kDG~IklwDGVS~rCv~t~~~AH~gsevcSa~Ftkn~kyiLs 322 (430)
T KOG0640|consen 246 NTYQCFVSAN-PDD-QHTGAITQVRYSSTGSLYVTAS-KDGAIKLWDGVSNRCVRTIGNAHGGSEVCSAVFTKNGKYILS 322 (430)
T ss_pred cceeEeeecC-ccc-ccccceeEEEecCCccEEEEec-cCCcEEeeccccHHHHHHHHhhcCCceeeeEEEccCCeEEee
Confidence 7654332221 211 11111 111 2467777654 688888888777665544432 22 2333333467888888
Q ss_pred eCceeEeecCCccCCCCCeEEEE
Q 040693 358 NGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 358 ~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
++...++++|++.+ |+.|-.|
T Consensus 323 SG~DS~vkLWEi~t--~R~l~~Y 343 (430)
T KOG0640|consen 323 SGKDSTVKLWEIST--GRMLKEY 343 (430)
T ss_pred cCCcceeeeeeecC--CceEEEE
Confidence 88999999999987 6666555
No 238
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=78.95 E-value=57 Score=30.57 Aligned_cols=100 Identities=11% Similarity=0.010 Sum_probs=59.7
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECC
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDAS 285 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~ 285 (382)
..|..++.|-.|.-.|...-++.-..+++. ..+..-+...+++++.++ ..|..++++
T Consensus 128 ~cl~TGSWDKTlKfWD~R~~~pv~t~~LPe------RvYa~Dv~~pm~vVata~-----------------r~i~vynL~ 184 (347)
T KOG0647|consen 128 QCLVTGSWDKTLKFWDTRSSNPVATLQLPE------RVYAADVLYPMAVVATAE-----------------RHIAVYNLE 184 (347)
T ss_pred ceeEecccccceeecccCCCCeeeeeeccc------eeeehhccCceeEEEecC-----------------CcEEEEEcC
Confidence 456788889889999998888888888764 223332345666676444 668889998
Q ss_pred CCcEEeee-cCCCCCCCCcceE--EeCCEEEEeeecCCCcEEEEeCCCC
Q 040693 286 NGNVLWST-ADPSNGTAPGPVT--VANGVLFGGSTYRQGPIYAMDVKTG 331 (382)
Q Consensus 286 tG~~~W~~-~~~~~~~~~~~~~--~~~~~v~~~~~~~~g~l~~ld~~tG 331 (382)
++....+. ..+.. .....++ .+...-.+++. +|++.....+.+
T Consensus 185 n~~te~k~~~SpLk-~Q~R~va~f~d~~~~alGsi--EGrv~iq~id~~ 230 (347)
T KOG0647|consen 185 NPPTEFKRIESPLK-WQTRCVACFQDKDGFALGSI--EGRVAIQYIDDP 230 (347)
T ss_pred CCcchhhhhcCccc-ceeeEEEEEecCCceEeeee--cceEEEEecCCC
Confidence 88555442 22211 1112222 23333344443 788777766666
No 239
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=77.54 E-value=57 Score=29.81 Aligned_cols=56 Identities=16% Similarity=0.271 Sum_probs=31.3
Q ss_pred cceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee---------cceEE-eCCEEEEEeCc
Q 040693 303 GPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY---------GGASV-SNGCIYMGNGY 360 (382)
Q Consensus 303 ~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~---------~~p~~-~~g~lyv~~~~ 360 (382)
+.+.++ .+.+|+.+. ....|..+| .+|+++-.+.+..+.. -..+. .+|+|||++.-
T Consensus 174 S~l~~~p~t~~lliLS~-es~~l~~~d-~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsEp 241 (248)
T PF06977_consen 174 SGLSYDPRTGHLLILSD-ESRLLLELD-RQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSEP 241 (248)
T ss_dssp -EEEEETTTTEEEEEET-TTTEEEEE--TT--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEETT
T ss_pred cceEEcCCCCeEEEEEC-CCCeEEEEC-CCCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcCC
Confidence 444444 356777764 567899999 5699988888776321 11222 57999999873
No 240
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=77.46 E-value=34 Score=27.10 Aligned_cols=77 Identities=8% Similarity=0.047 Sum_probs=50.0
Q ss_pred ceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCC
Q 040693 193 PMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKN 272 (382)
Q Consensus 193 p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 272 (382)
.++.++..+| .+.|++++.|..+..++.+ +++++...... ....... ..+....+
T Consensus 5 l~~~d~d~dg--~~eLlvGs~D~~IRvf~~~--e~~~Ei~e~~~----v~~L~~~-~~~~F~Y~---------------- 59 (111)
T PF14783_consen 5 LCLFDFDGDG--ENELLVGSDDFEIRVFKGD--EIVAEITETDK----VTSLCSL-GGGRFAYA---------------- 59 (111)
T ss_pred EEEEecCCCC--cceEEEecCCcEEEEEeCC--cEEEEEecccc----eEEEEEc-CCCEEEEE----------------
Confidence 3445555555 6889999999999999975 79999886432 0111111 22333333
Q ss_pred CCCCceEEEEECCCCcEEeeecCCC
Q 040693 273 STIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 273 ~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
...|+|-.++. .+.+|+.+.+.
T Consensus 60 -l~NGTVGvY~~--~~RlWRiKSK~ 81 (111)
T PF14783_consen 60 -LANGTVGVYDR--SQRLWRIKSKN 81 (111)
T ss_pred -ecCCEEEEEeC--cceeeeeccCC
Confidence 23477888875 78999999765
No 241
>KOG4328 consensus WD40 protein [Function unknown]
Probab=77.42 E-value=10 Score=37.17 Aligned_cols=100 Identities=10% Similarity=0.028 Sum_probs=55.5
Q ss_pred CCCceEEEEECCCC--cEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC-CcEeEEEecCCceecceEE-
Q 040693 274 TIAGGWVAMDASNG--NVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT-GKILWSYDTGATIYGGASV- 349 (382)
Q Consensus 274 ~~~g~v~a~d~~tG--~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t-G~ilw~~~~~~~~~~~p~~- 349 (382)
..+|++.+.|.+++ +++-+.+.....+....+...+..|+++.. -|.+..+|..+ |+..|...+...-..+..+
T Consensus 254 SyDGtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~--~G~f~~iD~R~~~s~~~~~~lh~kKI~sv~~N 331 (498)
T KOG4328|consen 254 SYDGTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDN--VGNFNVIDLRTDGSEYENLRLHKKKITSVALN 331 (498)
T ss_pred ccCceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEEeec--ccceEEEEeecCCccchhhhhhhcccceeecC
Confidence 56799999999887 333333222212223333345566666664 57889999876 5557766554433333333
Q ss_pred -eCCEEEEEeCceeEeecCCccCCCCC
Q 040693 350 -SNGCIYMGNGYKVTVGFGNKNFTSGT 375 (382)
Q Consensus 350 -~~g~lyv~~~~g~~~~~~~~~~~~g~ 375 (382)
....++++.+-..++.+|.+..-.++
T Consensus 332 P~~p~~laT~s~D~T~kIWD~R~l~~K 358 (498)
T KOG4328|consen 332 PVCPWFLATASLDQTAKIWDLRQLRGK 358 (498)
T ss_pred CCCchheeecccCcceeeeehhhhcCC
Confidence 34455555555555555554444443
No 242
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=77.37 E-value=75 Score=31.05 Aligned_cols=107 Identities=17% Similarity=0.156 Sum_probs=55.8
Q ss_pred CcceEEEEECCCCcE-EEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccC--cEE
Q 040693 141 HSNSLLALDLDTGKI-VWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKS--GFA 217 (382)
Q Consensus 141 ~~g~v~ald~~tG~~-~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~--g~l 217 (382)
..+.|++++..+... -|+.-+.+.. ..-.+..+...+ +.+|+....+ ..|
T Consensus 299 ~~~~l~~~~l~~~~~~~~~~~l~~~~-------------------------~~~~l~~~~~~~--~~Lvl~~~~~~~~~l 351 (414)
T PF02897_consen 299 PNGRLVAVDLADPSPAEWWTVLIPED-------------------------EDVSLEDVSLFK--DYLVLSYRENGSSRL 351 (414)
T ss_dssp TT-EEEEEETTSTSGGGEEEEEE--S-------------------------SSEEEEEEEEET--TEEEEEEEETTEEEE
T ss_pred CCcEEEEecccccccccceeEEcCCC-------------------------CceeEEEEEEEC--CEEEEEEEECCccEE
Confidence 358999999988875 4554333221 011233332222 3344443333 478
Q ss_pred EEEeCCCCCeeeeeccCCCCCCCCc-ccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEE
Q 040693 218 WALDRDSGSLIWSMEAGPGGLGGGA-MWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVL 290 (382)
Q Consensus 218 ~ald~~tG~~~W~~~~~~~~~~g~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~ 290 (382)
..+|...++..-...++.. +.. ......+.+.+++..... ..-+.++.+|.++|+..
T Consensus 352 ~v~~~~~~~~~~~~~~p~~---g~v~~~~~~~~~~~~~~~~ss~-------------~~P~~~y~~d~~t~~~~ 409 (414)
T PF02897_consen 352 RVYDLDDGKESREIPLPEA---GSVSGVSGDFDSDELRFSYSSF-------------TTPPTVYRYDLATGELT 409 (414)
T ss_dssp EEEETT-TEEEEEEESSSS---SEEEEEES-TT-SEEEEEEEET-------------TEEEEEEEEETTTTCEE
T ss_pred EEEECCCCcEEeeecCCcc---eEEeccCCCCCCCEEEEEEeCC-------------CCCCEEEEEECCCCCEE
Confidence 8899886667777666532 111 111112456666654331 33478999999999753
No 243
>KOG4693 consensus Uncharacterized conserved protein, contains kelch repeat [General function prediction only]
Probab=77.32 E-value=61 Score=29.96 Aligned_cols=134 Identities=19% Similarity=0.247 Sum_probs=70.9
Q ss_pred cEEEEEeCCCCCeeeeeccCCCCCCCCcccc-eeeeCCeEEEEecCcccc-ccccCCCCCCCCCceEEEEECCCCcEEee
Q 040693 215 GFAWALDRDSGSLIWSMEAGPGGLGGGAMWG-AATDERRIYTNIANSQHK-NFNLKPSKNSTIAGGWVAMDASNGNVLWS 292 (382)
Q Consensus 215 g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~-~~~~~~~v~~~~~~~~~~-~~~~~~~~~~~~~g~v~a~d~~tG~~~W~ 292 (382)
..++.||.+| .-|+.-........-..+- ..+-+++.|+=....+.. +|.- .......+|+++|.+||. |.
T Consensus 157 ~d~h~ld~~T--mtWr~~~Tkg~PprwRDFH~a~~~~~~MYiFGGR~D~~gpfHs---~~e~Yc~~i~~ld~~T~a--W~ 229 (392)
T KOG4693|consen 157 QDTHVLDFAT--MTWREMHTKGDPPRWRDFHTASVIDGMMYIFGGRSDESGPFHS---IHEQYCDTIMALDLATGA--WT 229 (392)
T ss_pred ccceeEeccc--eeeeehhccCCCchhhhhhhhhhccceEEEeccccccCCCccc---hhhhhcceeEEEeccccc--cc
Confidence 4578888775 5676543211100101111 122446677643333221 1111 022556889999998874 65
Q ss_pred ecCCC-----CCCCCcceEEeCCEEEEeeecCC-------CcEEEEeCCCCcEeEEEecCCc------eecceEEeCCEE
Q 040693 293 TADPS-----NGTAPGPVTVANGVLFGGSTYRQ-------GPIYAMDVKTGKILWSYDTGAT------IYGGASVSNGCI 354 (382)
Q Consensus 293 ~~~~~-----~~~~~~~~~~~~~~v~~~~~~~~-------g~l~~ld~~tG~ilw~~~~~~~------~~~~p~~~~g~l 354 (382)
...+. ++. +-+..+.|+.+|+-.. .. ..||+||++ ..+|+.-...+ -....++.++++
T Consensus 230 r~p~~~~~P~GRR-SHS~fvYng~~Y~FGG-Yng~ln~HfndLy~FdP~--t~~W~~I~~~Gk~P~aRRRqC~~v~g~kv 305 (392)
T KOG4693|consen 230 RTPENTMKPGGRR-SHSTFVYNGKMYMFGG-YNGTLNVHFNDLYCFDPK--TSMWSVISVRGKYPSARRRQCSVVSGGKV 305 (392)
T ss_pred cCCCCCcCCCccc-ccceEEEcceEEEecc-cchhhhhhhcceeecccc--cchheeeeccCCCCCcccceeEEEECCEE
Confidence 43222 222 3334467887777542 22 359999998 45676632222 224456789999
Q ss_pred EEEeC
Q 040693 355 YMGNG 359 (382)
Q Consensus 355 yv~~~ 359 (382)
|+..+
T Consensus 306 ~LFGG 310 (392)
T KOG4693|consen 306 YLFGG 310 (392)
T ss_pred EEecC
Confidence 98644
No 244
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase).
Probab=77.14 E-value=22 Score=38.09 Aligned_cols=124 Identities=13% Similarity=0.120 Sum_probs=70.8
Q ss_pred ccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-----eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCC
Q 040693 212 QKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-----DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASN 286 (382)
Q Consensus 212 ~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-----~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~t 286 (382)
.....||-+|.++||++=+.+...... ....++.. .....|++ ...+.|+.+|++-
T Consensus 501 ~~~~~ly~mDLe~GKVV~eW~~~~~~~--v~~~~p~~K~aqlt~e~tflG-----------------ls~n~lfriDpR~ 561 (794)
T PF08553_consen 501 NNPNKLYKMDLERGKVVEEWKVHDDIP--VVDIAPDSKFAQLTNEQTFLG-----------------LSDNSLFRIDPRL 561 (794)
T ss_pred CCCCceEEEecCCCcEEEEeecCCCcc--eeEecccccccccCCCceEEE-----------------ECCCceEEeccCC
Confidence 345789999999999985554432110 11222211 23445665 3447789999885
Q ss_pred -C-cEEeeecCCC-CCCCCcc-eEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-----eCCEEEEE
Q 040693 287 -G-NVLWSTADPS-NGTAPGP-VTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-----SNGCIYMG 357 (382)
Q Consensus 287 -G-~~~W~~~~~~-~~~~~~~-~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-----~~g~lyv~ 357 (382)
| +++|...-.. .....+. .+..+|.|.+++. .|.|..+|. .|+ .-+..++ ..+.|++ .||+-.++
T Consensus 562 ~~~k~v~~~~k~Y~~~~~Fs~~aTt~~G~iavgs~--~G~IRLyd~-~g~-~AKT~lp--~lG~pI~~iDvt~DGkwila 635 (794)
T PF08553_consen 562 SGNKLVDSQSKQYSSKNNFSCFATTEDGYIAVGSN--KGDIRLYDR-LGK-RAKTALP--GLGDPIIGIDVTADGKWILA 635 (794)
T ss_pred CCCceeeccccccccCCCceEEEecCCceEEEEeC--CCcEEeecc-cch-hhhhcCC--CCCCCeeEEEecCCCcEEEE
Confidence 3 4666432111 1112222 3345789999985 999999994 454 2333333 2345544 67887777
Q ss_pred eCc
Q 040693 358 NGY 360 (382)
Q Consensus 358 ~~~ 360 (382)
+..
T Consensus 636 Tc~ 638 (794)
T PF08553_consen 636 TCK 638 (794)
T ss_pred eec
Confidence 664
No 245
>PF09826 Beta_propel: Beta propeller domain; InterPro: IPR019198 This entry consists of predicted secreted proteins containing a C-terminal beta-propeller domain distantly related to WD-40 repeats.
Probab=76.41 E-value=95 Score=31.78 Aligned_cols=81 Identities=12% Similarity=0.117 Sum_probs=54.2
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC---CcEeEEEecCC-ceecceEE
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT---GKILWSYDTGA-TIYGGASV 349 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t---G~ilw~~~~~~-~~~~~p~~ 349 (382)
...++|+.+| ++-+++=+.+.-.+.-.--...+-+++.|+.+..+---|++||+++ =+++-+.++++ .-+-+|.
T Consensus 301 ~s~N~lyVLD-~~L~~vG~l~~la~gE~IysvRF~Gd~~Y~VTFrqvDPLfviDLsdP~~P~vlGeLKIPGfS~YLHP~- 378 (521)
T PF09826_consen 301 TSSNNLYVLD-EDLKIVGSLEGLAPGERIYSVRFMGDRAYLVTFRQVDPLFVIDLSDPANPKVLGELKIPGFSDYLHPY- 378 (521)
T ss_pred CceEEEEEEC-CCCcEeEEccccCCCceEEEEEEeCCeEEEEEEeecCceEEEECCCCCCCceeeEEECccchhceeEC-
Confidence 3457799998 7777777765322111122334678999999974557899999875 78999999987 3445555
Q ss_pred eCCEEEE
Q 040693 350 SNGCIYM 356 (382)
Q Consensus 350 ~~g~lyv 356 (382)
.+++|.-
T Consensus 379 ~e~~LlG 385 (521)
T PF09826_consen 379 DENHLLG 385 (521)
T ss_pred CCCeEEE
Confidence 4444443
No 246
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=75.69 E-value=51 Score=32.15 Aligned_cols=108 Identities=15% Similarity=0.099 Sum_probs=71.5
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
+..|..+|..+++.+-..+++... ++..+..++..|...+.
T Consensus 321 DkkvRfwD~Rs~~~~~sv~~gg~v--------------tSl~ls~~g~~lLsssR------------------------- 361 (459)
T KOG0288|consen 321 DKKVRFWDIRSADKTRSVPLGGRV--------------TSLDLSMDGLELLSSSR------------------------- 361 (459)
T ss_pred ccceEEEeccCCceeeEeecCcce--------------eeEeeccCCeEEeeecC-------------------------
Confidence 778999998898888888774322 23344444445555432
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
+..+-.+|..|-+++=.++...... ++ + -+.++++ +| +.+|.+|+.
T Consensus 362 --------Ddtl~viDlRt~eI~~~~sA~g~k~---as----------------D--wtrvvfS---pd--~~YvaAGS~ 407 (459)
T KOG0288|consen 362 --------DDTLKVIDLRTKEIRQTFSAEGFKC---AS----------------D--WTRVVFS---PD--GSYVAAGSA 407 (459)
T ss_pred --------CCceeeeecccccEEEEeecccccc---cc----------------c--cceeEEC---CC--CceeeeccC
Confidence 3778899998888877776654310 00 0 1223331 12 578888999
Q ss_pred CcEEEEEeCCCCCeeeeeccC
Q 040693 214 SGFAWALDRDSGSLIWSMEAG 234 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~ 234 (382)
||.|+..+..+||+-.....+
T Consensus 408 dgsv~iW~v~tgKlE~~l~~s 428 (459)
T KOG0288|consen 408 DGSVYIWSVFTGKLEKVLSLS 428 (459)
T ss_pred CCcEEEEEccCceEEEEeccC
Confidence 999999999999988877754
No 247
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=75.52 E-value=38 Score=26.70 Aligned_cols=21 Identities=29% Similarity=0.571 Sum_probs=16.0
Q ss_pred ccCcEEEEEeCCCCCeeeeecc
Q 040693 212 QKSGFAWALDRDSGSLIWSMEA 233 (382)
Q Consensus 212 ~~~g~l~ald~~tG~~~W~~~~ 233 (382)
..+|.|+.+|.. |+++|+...
T Consensus 62 ~~dGnLvl~~~~-g~~vW~S~~ 82 (116)
T cd00028 62 QSDGNLVIYDGS-GTVVWSSNT 82 (116)
T ss_pred ecCCCeEEEcCC-CcEEEEecc
Confidence 457788888864 789998764
No 248
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=74.88 E-value=1.1e+02 Score=31.71 Aligned_cols=67 Identities=9% Similarity=0.073 Sum_probs=48.5
Q ss_pred ccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCC
Q 040693 53 FQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSP 132 (382)
Q Consensus 53 ~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (382)
|.|.|..+|-+|-..+=++++.+.+- .+..+-+..+.+++++.+
T Consensus 33 ynG~V~IWnyetqtmVksfeV~~~Pv-------------Ra~kfiaRknWiv~GsDD----------------------- 76 (794)
T KOG0276|consen 33 YNGDVQIWNYETQTMVKSFEVSEVPV-------------RAAKFIARKNWIVTGSDD----------------------- 76 (794)
T ss_pred ecCeeEEEecccceeeeeeeecccch-------------hhheeeeccceEEEecCC-----------------------
Confidence 48888888888888887777743221 123344466778777654
Q ss_pred CCCCCCCCCcceEEEEECCCCcEEEEEecCCCc
Q 040693 133 DKCIEPENHSNSLLALDLDTGKIVWYKQLGGYD 165 (382)
Q Consensus 133 ~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~ 165 (382)
..|..+|..|++.+-++......
T Consensus 77 ----------~~IrVfnynt~ekV~~FeAH~Dy 99 (794)
T KOG0276|consen 77 ----------MQIRVFNYNTGEKVKTFEAHSDY 99 (794)
T ss_pred ----------ceEEEEecccceeeEEeeccccc
Confidence 78999999999999999877653
No 249
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=74.85 E-value=81 Score=32.41 Aligned_cols=51 Identities=18% Similarity=0.201 Sum_probs=40.5
Q ss_pred CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeC--CEEEEEeCceeE
Q 040693 310 GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSN--GCIYMGNGYKVT 363 (382)
Q Consensus 310 ~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~--g~lyv~~~~g~~ 363 (382)
..+|+++ +..|...|+...+++-+...+.-..++..++. ++|++++-+..+
T Consensus 579 p~lfVaT---q~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d~k~ 631 (733)
T KOG0650|consen 579 PYLFVAT---QRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYDKKM 631 (733)
T ss_pred ceEEEEe---ccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCCCee
Confidence 5788888 78899999888877777777777788888854 899999877653
No 250
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=73.99 E-value=1.1e+02 Score=31.99 Aligned_cols=59 Identities=17% Similarity=0.212 Sum_probs=30.3
Q ss_pred eEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 278 GWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 278 ~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
++.|+-=..++-+|..+.... ....+- -.+..++-+-. =..++-+|..+|++-.+.+..
T Consensus 348 ~v~ayqws~~e~r~ikdvig~-~~~~~~-~s~K~l~EGKe--YDyvF~VDi~dGep~~kLPyN 406 (745)
T KOG0301|consen 348 NVEAYQWSNGEWRWIKDVIGE-VVAAQG-NSGKVLHEGKE--YDYVFDVDIGDGEPPYKLPYN 406 (745)
T ss_pred cceeEEeecccceeecccccc-ccccCC-CCcceeecccc--cceEEEEEccCCCCceecCcC
Confidence 345555556666776644321 111111 11223343321 245777888899987777654
No 251
>KOG1027 consensus Serine/threonine protein kinase and endoribonuclease ERN1/IRE1, sensor of the unfolded protein response pathway [Signal transduction mechanisms]
Probab=73.78 E-value=21 Score=38.05 Aligned_cols=177 Identities=19% Similarity=0.155 Sum_probs=96.9
Q ss_pred cceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCCCCCCcchhhcccccCCCCCCCCC
Q 040693 54 QGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNLYSVPLHIRQCQEENNQTTPTSPD 133 (382)
Q Consensus 54 ~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (382)
+ .+.|.+.++|...|+..-.+-... +. +... .....++..+.+|....+ ..+-...-+..+....++=...+
T Consensus 36 d-~l~a~s~~~g~~~~~l~~~pvv~~-~~----~~~~-~~fl~~p~dgsly~l~~~-~sL~Klpftipelv~~~pcrssd 107 (903)
T KOG1027|consen 36 D-SLHAPSSETGFIKWTLSDDPVVAS-PD----GVLQ-PAFLPDPRDGSLYTLGNN-LSLTKLPFTIPELVNASPCRSSD 107 (903)
T ss_pred c-cccCccccccceeeeeccCccccC-Cc----cccc-cccCCCccccceeeccCC-CccccCCccchhhhccCcccCCC
Confidence 6 789999999999999775433211 10 0000 011233333445554332 01000000011122223333445
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK 213 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~ 213 (382)
|.++.+.-.+..+.+|++||+..|.+..... . ...|+.+-.
T Consensus 108 Gi~ysg~k~d~~~lvD~~tg~~~~tf~~~~~--------------------------~-------------~~~v~~grt 148 (903)
T KOG1027|consen 108 GILYSGSKQDIWYLVDPKTGEIDYTFNTAEP--------------------------I-------------KQLVYLGRT 148 (903)
T ss_pred CeEEecccccceEEecCCccceeEEEecCCc--------------------------c-------------hhheecccc
Confidence 6666777778999999999999999876542 1 344555555
Q ss_pred CcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEee
Q 040693 214 SGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWS 292 (382)
Q Consensus 214 ~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~ 292 (382)
+-.+...|..+-...|........ ....+.. ...+..+.. ...|.+.-+|.++|+.+|.
T Consensus 149 ~ytv~m~d~~~~~~~wn~t~~dy~----a~~~~~~~~~~~~~~~~----------------~~~g~i~t~D~~~g~~~~~ 208 (903)
T KOG1027|consen 149 NYTVTMYDKNVRGKTWNTTFGDYS----AQYPSGVRGEKMSHFHS----------------LGNGYIVTVDSESGEKLWL 208 (903)
T ss_pred eeEEecccCcccCceeeccccchh----ccCCCccCCceeEEEee----------------cCCccEEeccCcccceeec
Confidence 556666777777777877654211 1111111 122222221 2246677899999999999
Q ss_pred ecCCC
Q 040693 293 TADPS 297 (382)
Q Consensus 293 ~~~~~ 297 (382)
.+...
T Consensus 209 q~~~s 213 (903)
T KOG1027|consen 209 QDLLS 213 (903)
T ss_pred cccCC
Confidence 88654
No 252
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=73.64 E-value=80 Score=29.57 Aligned_cols=168 Identities=18% Similarity=0.244 Sum_probs=89.9
Q ss_pred EEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCC
Q 040693 145 LLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDS 224 (382)
Q Consensus 145 v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~t 224 (382)
+..++ ..||.+-...++.. -.++.-.|.+- ..+.++-..--+.+.+|+.+
T Consensus 51 ~a~~~-eaGk~v~~~~lpaR---------------------~Hgi~~~p~~~--------ravafARrPGtf~~vfD~~~ 100 (366)
T COG3490 51 AATLS-EAGKIVFATALPAR---------------------GHGIAFHPALP--------RAVAFARRPGTFAMVFDPNG 100 (366)
T ss_pred EEEEc-cCCceeeeeecccc---------------------cCCeecCCCCc--------ceEEEEecCCceEEEECCCC
Confidence 34444 56999999888764 22333445442 33333334444678889998
Q ss_pred CCeeeeeccCCCCCCCCcccceee---eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCC-cEEeeecCCCCCC
Q 040693 225 GSLIWSMEAGPGGLGGGAMWGAAT---DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNG-NVLWSTADPSNGT 300 (382)
Q Consensus 225 G~~~W~~~~~~~~~~g~~~~~~~~---~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG-~~~W~~~~~~~~~ 300 (382)
+...-...... +...++-.+ |+..+|.+.++.+ ...|-|=.+|...+ +..=+++...-..
T Consensus 101 ~~~pv~~~s~~----~RHfyGHGvfs~dG~~LYATEndfd------------~~rGViGvYd~r~~fqrvgE~~t~GiGp 164 (366)
T COG3490 101 AQEPVTLVSQE----GRHFYGHGVFSPDGRLLYATENDFD------------PNRGVIGVYDAREGFQRVGEFSTHGIGP 164 (366)
T ss_pred CcCcEEEeccc----CceeecccccCCCCcEEEeecCCCC------------CCCceEEEEecccccceecccccCCcCc
Confidence 87766655432 334444333 6778888754422 23355667777644 2222222211000
Q ss_pred CCcceEEeCCE-EEEeeec----------------CCCcEEEEeCCCCcEeEEEecCCc----eecceEE-eCCEEEEEe
Q 040693 301 APGPVTVANGV-LFGGSTY----------------RQGPIYAMDVKTGKILWSYDTGAT----IYGGASV-SNGCIYMGN 358 (382)
Q Consensus 301 ~~~~~~~~~~~-v~~~~~~----------------~~g~l~~ld~~tG~ilw~~~~~~~----~~~~p~~-~~g~lyv~~ 358 (382)
..-. ...+++ +.++..+ .+-.+..+|..||+++-|..++.. ..-+..+ .+|+++++-
T Consensus 165 Hev~-lm~DGrtlvvanGGIethpdfgR~~lNldsMePSlvlld~atG~liekh~Lp~~l~~lSiRHld~g~dgtvwfgc 243 (366)
T COG3490 165 HEVT-LMADGRTLVVANGGIETHPDFGRTELNLDSMEPSLVLLDAATGNLIEKHTLPASLRQLSIRHLDIGRDGTVWFGC 243 (366)
T ss_pred ceeE-EecCCcEEEEeCCceecccccCccccchhhcCccEEEEeccccchhhhccCchhhhhcceeeeeeCCCCcEEEEE
Confidence 1111 223443 3332210 234688999999999999988843 2233334 466766664
Q ss_pred C
Q 040693 359 G 359 (382)
Q Consensus 359 ~ 359 (382)
.
T Consensus 244 Q 244 (366)
T COG3490 244 Q 244 (366)
T ss_pred E
Confidence 3
No 253
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=73.61 E-value=80 Score=29.55 Aligned_cols=172 Identities=15% Similarity=0.022 Sum_probs=98.5
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDR 222 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~ 222 (382)
......|.++|+.+=.+.-...++.. +++.+ ...+..+.+.-|......|.
T Consensus 166 ~TCalWDie~g~~~~~f~GH~gDV~s---------------------------lsl~p--~~~ntFvSg~cD~~aklWD~ 216 (343)
T KOG0286|consen 166 MTCALWDIETGQQTQVFHGHTGDVMS---------------------------LSLSP--SDGNTFVSGGCDKSAKLWDV 216 (343)
T ss_pred ceEEEEEcccceEEEEecCCcccEEE---------------------------EecCC--CCCCeEEecccccceeeeec
Confidence 67888999999999888755443221 11111 11466667777777777777
Q ss_pred CCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCC
Q 040693 223 DSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAP 302 (382)
Q Consensus 223 ~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~ 302 (382)
..|.-.-.+.-.... .+..--.-++.-|++. ..+++...||++..+.+=.++-+......
T Consensus 217 R~~~c~qtF~ghesD----INsv~ffP~G~afatG----------------SDD~tcRlyDlRaD~~~a~ys~~~~~~gi 276 (343)
T KOG0286|consen 217 RSGQCVQTFEGHESD----INSVRFFPSGDAFATG----------------SDDATCRLYDLRADQELAVYSHDSIICGI 276 (343)
T ss_pred cCcceeEeecccccc----cceEEEccCCCeeeec----------------CCCceeEEEeecCCcEEeeeccCcccCCc
Confidence 777666665532211 1111001234444442 34578899999999888777644322222
Q ss_pred cceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-eCCEEEEEeCceeEee
Q 040693 303 GPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNGCIYMGNGYKVTVG 365 (382)
Q Consensus 303 ~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g~lyv~~~~g~~~~ 365 (382)
..+.+ .+.++|++- .+..+.+.|.-.|+.+-...-...-.+..-+ .||..+.+.+=.....
T Consensus 277 tSv~FS~SGRlLfagy--~d~~c~vWDtlk~e~vg~L~GHeNRvScl~~s~DG~av~TgSWDs~lr 340 (343)
T KOG0286|consen 277 TSVAFSKSGRLLFAGY--DDFTCNVWDTLKGERVGVLAGHENRVSCLGVSPDGMAVATGSWDSTLR 340 (343)
T ss_pred eeEEEcccccEEEeee--cCCceeEeeccccceEEEeeccCCeeEEEEECCCCcEEEecchhHhee
Confidence 22223 355666664 4888999998778877766533333344433 4555555544333333
No 254
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=73.54 E-value=71 Score=28.94 Aligned_cols=27 Identities=26% Similarity=0.240 Sum_probs=21.2
Q ss_pred EcCEEEEeccCccccccccccccccceEEEEeCccCceeee
Q 040693 30 YKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQ 70 (382)
Q Consensus 30 ~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~ 70 (382)
.++.||--.. .+.||.+|+.||.....
T Consensus 37 a~G~LYgl~~--------------~g~lYtIn~~tG~aT~v 63 (236)
T PF14339_consen 37 ANGQLYGLGS--------------TGRLYTINPATGAATPV 63 (236)
T ss_pred CCCCEEEEeC--------------CCcEEEEECCCCeEEEe
Confidence 4677887654 88999999999995444
No 255
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=72.72 E-value=1.2e+02 Score=31.39 Aligned_cols=101 Identities=16% Similarity=0.159 Sum_probs=59.3
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
-.|+.++-.+.|..+|+.|++.+-+..--.. ..-...+ |+.+++.+ ..+|+|...|
T Consensus 184 t~ivsGgtek~lr~wDprt~~kimkLrGHTd-----NVr~ll~~dDGt~~ls~-----------------sSDgtIrlWd 241 (735)
T KOG0308|consen 184 TIIVSGGTEKDLRLWDPRTCKKIMKLRGHTD-----NVRVLLVNDDGTRLLSA-----------------SSDGTIRLWD 241 (735)
T ss_pred eEEEecCcccceEEeccccccceeeeecccc-----ceEEEEEcCCCCeEeec-----------------CCCceEEeee
Confidence 4566677788899999999999888772211 1112222 33444443 4457788888
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT 330 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t 330 (382)
+..-+=+-++-.+....|+-...-.=..||.+. +++.|+.-|+.+
T Consensus 242 LgqQrCl~T~~vH~e~VWaL~~~~sf~~vYsG~--rd~~i~~Tdl~n 286 (735)
T KOG0308|consen 242 LGQQRCLATYIVHKEGVWALQSSPSFTHVYSGG--RDGNIYRTDLRN 286 (735)
T ss_pred ccccceeeeEEeccCceEEEeeCCCcceEEecC--CCCcEEecccCC
Confidence 766665555555443222211110114677766 478888877765
No 256
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function; it possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase).
Probab=72.64 E-value=98 Score=33.35 Aligned_cols=137 Identities=20% Similarity=0.130 Sum_probs=72.8
Q ss_pred cceEEEEECCCCcEEEEEecCCCc-ccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEE
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYD-VWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWAL 220 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~al 220 (382)
...||-+|+.+||++=++...... +..++.. .+ ..... ....+.+-.+..|+-+
T Consensus 503 ~~~ly~mDLe~GKVV~eW~~~~~~~v~~~~p~---------~K--~aqlt--------------~e~tflGls~n~lfri 557 (794)
T PF08553_consen 503 PNKLYKMDLERGKVVEEWKVHDDIPVVDIAPD---------SK--FAQLT--------------NEQTFLGLSDNSLFRI 557 (794)
T ss_pred CCceEEEecCCCcEEEEeecCCCcceeEeccc---------cc--ccccC--------------CCceEEEECCCceEEe
Confidence 378999999999998777776542 1111100 00 00011 2234445455668889
Q ss_pred eCCC-C-CeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC
Q 040693 221 DRDS-G-SLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS 297 (382)
Q Consensus 221 d~~t-G-~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~ 297 (382)
|+.- | ++.|...-. ........+.+. .+|.|.++ ...|.|..|| +.|+. =++.+|+
T Consensus 558 DpR~~~~k~v~~~~k~--Y~~~~~Fs~~aTt~~G~iavg-----------------s~~G~IRLyd-~~g~~-AKT~lp~ 616 (794)
T PF08553_consen 558 DPRLSGNKLVDSQSKQ--YSSKNNFSCFATTEDGYIAVG-----------------SNKGDIRLYD-RLGKR-AKTALPG 616 (794)
T ss_pred ccCCCCCceeeccccc--cccCCCceEEEecCCceEEEE-----------------eCCCcEEeec-ccchh-hhhcCCC
Confidence 9874 3 345532211 112222333333 67888887 4558899998 45643 2333433
Q ss_pred CCCCCcce----EEeCCEEEEeeecCCCcEEEEeCC
Q 040693 298 NGTAPGPV----TVANGVLFGGSTYRQGPIYAMDVK 329 (382)
Q Consensus 298 ~~~~~~~~----~~~~~~v~~~~~~~~g~l~~ld~~ 329 (382)
.+.|+ +..+|+.++++- +..|..++..
T Consensus 617 ---lG~pI~~iDvt~DGkwilaTc--~tyLlLi~t~ 647 (794)
T PF08553_consen 617 ---LGDPIIGIDVTADGKWILATC--KTYLLLIDTL 647 (794)
T ss_pred ---CCCCeeEEEecCCCcEEEEee--cceEEEEEEe
Confidence 12333 223554444442 8888888853
No 257
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=72.10 E-value=1.4e+02 Score=31.54 Aligned_cols=103 Identities=15% Similarity=0.120 Sum_probs=62.1
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
...+..++.|-++...|..+|...--+.-- -++....++ -.++ |++.+ ..++.|..+|
T Consensus 547 s~Y~aTGSsD~tVRlWDv~~G~~VRiF~GH-----~~~V~al~~Sp~Gr-~LaSg---------------~ed~~I~iWD 605 (707)
T KOG0263|consen 547 SNYVATGSSDRTVRLWDVSTGNSVRIFTGH-----KGPVTALAFSPCGR-YLASG---------------DEDGLIKIWD 605 (707)
T ss_pred ccccccCCCCceEEEEEcCCCcEEEEecCC-----CCceEEEEEcCCCc-eEeec---------------ccCCcEEEEE
Confidence 466667778889999999999887666421 112222222 1222 22222 4568899999
Q ss_pred CCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC
Q 040693 284 ASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT 330 (382)
Q Consensus 284 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t 330 (382)
+.+|+++=+.........+-....+++.+.++. .+..|...|...
T Consensus 606 l~~~~~v~~l~~Ht~ti~SlsFS~dg~vLasgg--~DnsV~lWD~~~ 650 (707)
T KOG0263|consen 606 LANGSLVKQLKGHTGTIYSLSFSRDGNVLASGG--ADNSVRLWDLTK 650 (707)
T ss_pred cCCCcchhhhhcccCceeEEEEecCCCEEEecC--CCCeEEEEEchh
Confidence 999988766554443222333334555555555 377888888763
No 258
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=71.59 E-value=87 Score=29.45 Aligned_cols=142 Identities=16% Similarity=0.109 Sum_probs=76.0
Q ss_pred EEEEEeCCCCCeeeeeccCCCCCCCCcccceeee-----CCeEEEEecCccccccccCCCCCCCCCceEEEEECCCC---
Q 040693 216 FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATD-----ERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNG--- 287 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~-----~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG--- 287 (382)
.+..+|+.+.+.+-++++.+.+..-......... ...+.++++... +.......|+++.|+....
T Consensus 3 ~i~l~d~~~~~~~~~~~l~~~E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~-------~~~~~~~~Gri~v~~i~~~~~~ 75 (321)
T PF03178_consen 3 SIRLVDPTTFEVLDSFELEPNEHVTSLCSVKLKGDSTGKKEYIVVGTAFNY-------GEDPEPSSGRILVFEISESPEN 75 (321)
T ss_dssp EEEEEETTTSSEEEEEEEETTEEEEEEEEEEETTS---SSEEEEEEEEE---------TTSSS-S-EEEEEEEECSS---
T ss_pred EEEEEeCCCCeEEEEEECCCCceEEEEEEEEEcCccccccCEEEEEecccc-------cccccccCcEEEEEEEEccccc
Confidence 4667788777777776665422110000000001 234555443211 0001112289999999984
Q ss_pred ----cEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCc-EeEEEecCCc-eecceEEeCCEEEEEeCce
Q 040693 288 ----NVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGK-ILWSYDTGAT-IYGGASVSNGCIYMGNGYK 361 (382)
Q Consensus 288 ----~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~-ilw~~~~~~~-~~~~p~~~~g~lyv~~~~g 361 (382)
+.+.+.+.+.+ ...+...+++++++. ++.|+.++.+..+ ++-....... ...+..+.++.+++++...
T Consensus 76 ~~~l~~i~~~~~~g~---V~ai~~~~~~lv~~~---g~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~~~~~I~vgD~~~ 149 (321)
T PF03178_consen 76 NFKLKLIHSTEVKGP---VTAICSFNGRLVVAV---GNKLYVYDLDNSKTLLKKAFYDSPFYITSLSVFKNYILVGDAMK 149 (321)
T ss_dssp --EEEEEEEEEESS----EEEEEEETTEEEEEE---TTEEEEEEEETTSSEEEEEEE-BSSSEEEEEEETTEEEEEESSS
T ss_pred ceEEEEEEEEeecCc---ceEhhhhCCEEEEee---cCEEEEEEccCcccchhhheecceEEEEEEeccccEEEEEEccc
Confidence 33344444432 233333478877776 7888888887777 5554444332 3455666899999998865
Q ss_pred eEeecCCccC
Q 040693 362 VTVGFGNKNF 371 (382)
Q Consensus 362 ~~~~~~~~~~ 371 (382)
. +.++.++.
T Consensus 150 s-v~~~~~~~ 158 (321)
T PF03178_consen 150 S-VSLLRYDE 158 (321)
T ss_dssp S-EEEEEEET
T ss_pred C-EEEEEEEc
Confidence 5 33444444
No 259
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=71.53 E-value=29 Score=35.04 Aligned_cols=130 Identities=13% Similarity=0.165 Sum_probs=68.3
Q ss_pred cEEE-EEccCcEEEEEeCCCCCee--eeeccCCCCCCCCcccceee-----eCCeEEEEecCccccccccCCCCCCCCCc
Q 040693 206 DIVV-AVQKSGFAWALDRDSGSLI--WSMEAGPGGLGGGAMWGAAT-----DERRIYTNIANSQHKNFNLKPSKNSTIAG 277 (382)
Q Consensus 206 ~~v~-~~~~~g~l~ald~~tG~~~--W~~~~~~~~~~g~~~~~~~~-----~~~~v~~~~~~~~~~~~~~~~~~~~~~~g 277 (382)
++|+ .+.....|+-+|.++||++ |++.-.- ......+-. +...-+++ ....
T Consensus 346 nlil~~~~~~~~l~klDIE~GKIVeEWk~~~di----~mv~~t~d~K~~Ql~~e~TlvG-----------------Ls~n 404 (644)
T KOG2395|consen 346 NLILMDGGEQDKLYKLDIERGKIVEEWKFEDDI----NMVDITPDFKFAQLTSEQTLVG-----------------LSDN 404 (644)
T ss_pred ceEeeCCCCcCcceeeecccceeeeEeeccCCc----ceeeccCCcchhcccccccEEe-----------------ecCC
Confidence 4444 4445567999999999998 6655320 001111100 11111222 2345
Q ss_pred eEEEEECC-CC--cEEeeecCCC--CCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC---ceecceEE
Q 040693 278 GWVAMDAS-NG--NVLWSTADPS--NGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA---TIYGGASV 349 (382)
Q Consensus 278 ~v~a~d~~-tG--~~~W~~~~~~--~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~---~~~~~p~~ 349 (382)
.|..+|++ .| ++.|...-.. ...-.+..+..+|.|.+++. +|.|..+|. .|+.. +..+|+ ++..--+.
T Consensus 405 ~vfriDpRv~~~~kl~~~q~kqy~~k~nFsc~aTT~sG~IvvgS~--~GdIRLYdr-i~~~A-KTAlPgLG~~I~hVdvt 480 (644)
T KOG2395|consen 405 SVFRIDPRVQGKNKLAVVQSKQYSTKNNFSCFATTESGYIVVGSL--KGDIRLYDR-IGRRA-KTALPGLGDAIKHVDVT 480 (644)
T ss_pred ceEEecccccCcceeeeeeccccccccccceeeecCCceEEEeec--CCcEEeehh-hhhhh-hhcccccCCceeeEEee
Confidence 67777776 22 4566643222 11113334456789999985 999999996 45422 222332 22222334
Q ss_pred eCCEEEEEeCc
Q 040693 350 SNGCIYMGNGY 360 (382)
Q Consensus 350 ~~g~lyv~~~~ 360 (382)
++|+-.+++..
T Consensus 481 adGKwil~Tc~ 491 (644)
T KOG2395|consen 481 ADGKWILATCK 491 (644)
T ss_pred ccCcEEEEecc
Confidence 67777776654
No 260
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=70.18 E-value=60 Score=29.50 Aligned_cols=50 Identities=18% Similarity=0.315 Sum_probs=35.6
Q ss_pred eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeC---CEEEEee
Q 040693 249 DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVAN---GVLFGGS 316 (382)
Q Consensus 249 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~---~~v~~~~ 316 (382)
.++.+|+..-+ .++|+.+|+.|||++=++.+|... .+...+.+ +++|+.+
T Consensus 221 ~eG~L~Va~~n----------------g~~V~~~dp~tGK~L~eiklPt~q--itsccFgGkn~d~~yvT~ 273 (310)
T KOG4499|consen 221 TEGNLYVATFN----------------GGTVQKVDPTTGKILLEIKLPTPQ--ITSCCFGGKNLDILYVTT 273 (310)
T ss_pred cCCcEEEEEec----------------CcEEEEECCCCCcEEEEEEcCCCc--eEEEEecCCCccEEEEEe
Confidence 36778887544 488999999999999999998643 22333444 3666665
No 261
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=69.17 E-value=1.6e+02 Score=31.17 Aligned_cols=145 Identities=12% Similarity=0.031 Sum_probs=83.9
Q ss_pred cCEEEEeccCccccccccccccccceEEEEeCccCceeeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEEcCCC
Q 040693 31 KGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIATGNL 110 (382)
Q Consensus 31 ~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~~~~~ 110 (382)
|++.|+..+- |+.|..++...-+++-=.++..- .++..+.+++...+||+-+
T Consensus 421 DDryFiSGSL-------------D~KvRiWsI~d~~Vv~W~Dl~~l--------------ITAvcy~PdGk~avIGt~~- 472 (712)
T KOG0283|consen 421 DDRYFISGSL-------------DGKVRLWSISDKKVVDWNDLRDL--------------ITAVCYSPDGKGAVIGTFN- 472 (712)
T ss_pred CCCcEeeccc-------------ccceEEeecCcCeeEeehhhhhh--------------heeEEeccCCceEEEEEec-
Confidence 6777876652 88999999888887744444211 1356788888888888754
Q ss_pred CCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCC
Q 040693 111 YSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFG 190 (382)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (382)
|.+..++...=+..=..+...... ..- ++. +.+++.
T Consensus 473 --------------------------------G~C~fY~t~~lk~~~~~~I~~~~~--------Kk~---~~~-rITG~Q 508 (712)
T KOG0283|consen 473 --------------------------------GYCRFYDTEGLKLVSDFHIRLHNK--------KKK---QGK-RITGLQ 508 (712)
T ss_pred --------------------------------cEEEEEEccCCeEEEeeeEeeccC--------ccc---cCc-eeeeeE
Confidence 778888876555544443322100 000 000 122332
Q ss_pred CCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEe
Q 040693 191 EAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNI 257 (382)
Q Consensus 191 ~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~ 257 (382)
..|.- -+.|++.+.|..+..+|..+-+++=+++-... ...........|+..|+.+.
T Consensus 509 ~~p~~---------~~~vLVTSnDSrIRI~d~~~~~lv~KfKG~~n-~~SQ~~Asfs~Dgk~IVs~s 565 (712)
T KOG0283|consen 509 FFPGD---------PDEVLVTSNDSRIRIYDGRDKDLVHKFKGFRN-TSSQISASFSSDGKHIVSAS 565 (712)
T ss_pred ecCCC---------CCeEEEecCCCceEEEeccchhhhhhhccccc-CCcceeeeEccCCCEEEEee
Confidence 22211 24788999999999999987777777663211 11122223333666666653
No 262
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. The archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=69.06 E-value=15 Score=22.64 Aligned_cols=31 Identities=16% Similarity=0.241 Sum_probs=24.5
Q ss_pred cCEEEEeccCccccccccccccccceEEEEeCccCceeeeeecc
Q 040693 31 KGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFML 74 (382)
Q Consensus 31 ~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~ 74 (382)
+.++|++... .+.|..||+++++++=+..++
T Consensus 3 ~~~lyv~~~~-------------~~~v~~id~~~~~~~~~i~vg 33 (42)
T TIGR02276 3 GTKLYVTNSG-------------SNTVSVIDTATNKVIATIPVG 33 (42)
T ss_pred CCEEEEEeCC-------------CCEEEEEECCCCeEEEEEECC
Confidence 3568886652 678999999999998888774
No 263
>smart00284 OLF Olfactomedin-like domains.
Probab=69.03 E-value=95 Score=28.53 Aligned_cols=115 Identities=14% Similarity=0.138 Sum_probs=67.3
Q ss_pred eeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCC-----------CCcceEEeC-CE--
Q 040693 246 AATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGT-----------APGPVTVAN-GV-- 311 (382)
Q Consensus 246 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~-----------~~~~~~~~~-~~-- 311 (382)
.++.++.+|.... ....|..+|+++++..=+..++.... ..--++++. |+
T Consensus 79 ~VVYngslYY~~~----------------~s~~iiKydL~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWv 142 (255)
T smart00284 79 VVVYNGSLYFNKF----------------NSHDICRFDLTTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWV 142 (255)
T ss_pred EEEECceEEEEec----------------CCccEEEEECCCCcEEEEEecCccccccccccccCCCccEEEEEcCCceEE
Confidence 3457788888532 23669999999999864444432110 112234443 43
Q ss_pred EEEeeecCCCcEE--EEeCCCCcE--eEEEecCCceecceEEeCCEEEEEeCcee--EeecCCccCCCCCeE
Q 040693 312 LFGGSTYRQGPIY--AMDVKTGKI--LWSYDTGATIYGGASVSNGCIYMGNGYKV--TVGFGNKNFTSGTSL 377 (382)
Q Consensus 312 v~~~~~~~~g~l~--~ld~~tG~i--lw~~~~~~~~~~~p~~~~g~lyv~~~~g~--~~~~~~~~~~~g~~l 377 (382)
||.... .+|.|. -||+.|=++ .|....+....+..-+.=|.||+..+... .--.|++|+.+++..
T Consensus 143 IYat~~-~~g~ivvSkLnp~tL~ve~tW~T~~~k~sa~naFmvCGvLY~~~s~~~~~~~I~yayDt~t~~~~ 213 (255)
T smart00284 143 IYATEQ-NAGKIVISKLNPATLTIENTWITTYNKRSASNAFMICGILYVTRSLGSKGEKVFYAYDTNTGKEG 213 (255)
T ss_pred EEeccC-CCCCEEEEeeCcccceEEEEEEcCCCcccccccEEEeeEEEEEccCCCCCcEEEEEEECCCCccc
Confidence 333322 346554 788887666 55555555566666677789999975221 111367777776654
No 264
>PF14298 DUF4374: Domain of unknown function (DUF4374)
Probab=68.15 E-value=20 Score=35.40 Aligned_cols=59 Identities=27% Similarity=0.307 Sum_probs=43.0
Q ss_pred CCceEEEEECCCCcEEeeecCCCC--CCCCcceEEeCCEEEEeeecCCC---cEEEEeCCCCcE
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSN--GTAPGPVTVANGVLFGGSTYRQG---PIYAMDVKTGKI 333 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~--~~~~~~~~~~~~~v~~~~~~~~g---~l~~ld~~tG~i 333 (382)
....+..+|..+++..|...+|.. ......+.++++.+|++-...+| .||.||+.|++.
T Consensus 365 ~~~~laI~d~~~kt~t~V~glP~~~is~~~~~~~ve~G~aYi~Vtt~~g~~~~IY~iDp~TatA 428 (435)
T PF14298_consen 365 DAKKLAIFDVSNKTFTWVTGLPADLISGFGNAPYVENGKAYIPVTTEDGSDPYIYKIDPATATA 428 (435)
T ss_pred ccceEEEEEccCceeEEeccCChhhccccccceEeeCCEEEEEEeecCCCceeEEEEcCccccc
Confidence 345688899999999999888764 12233556788998887542334 699999998764
No 265
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=67.09 E-value=78 Score=33.42 Aligned_cols=90 Identities=8% Similarity=0.084 Sum_probs=62.3
Q ss_pred ceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC-ceecceEEeCCEEE
Q 040693 277 GGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA-TIYGGASVSNGCIY 355 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~-~~~~~p~~~~g~ly 355 (382)
+.+..++..|++-.-++.-.. .......-.+.+|+.+. ..|.|-.+|..+..++-+.+... .+++-.+..+++=+
T Consensus 394 ~SikiWn~~t~kciRTi~~~y--~l~~~Fvpgd~~Iv~G~--k~Gel~vfdlaS~~l~Eti~AHdgaIWsi~~~pD~~g~ 469 (888)
T KOG0306|consen 394 ESIKIWNRDTLKCIRTITCGY--ILASKFVPGDRYIVLGT--KNGELQVFDLASASLVETIRAHDGAIWSISLSPDNKGF 469 (888)
T ss_pred CcEEEEEccCcceeEEecccc--EEEEEecCCCceEEEec--cCCceEEEEeehhhhhhhhhccccceeeeeecCCCCce
Confidence 668888888888877776542 22333322345677776 59999999999988887776433 34555556788888
Q ss_pred EEeCceeEeecCCcc
Q 040693 356 MGNGYKVTVGFGNKN 370 (382)
Q Consensus 356 v~~~~g~~~~~~~~~ 370 (382)
++.+...++++|.|-
T Consensus 470 vT~saDktVkfWdf~ 484 (888)
T KOG0306|consen 470 VTGSADKTVKFWDFK 484 (888)
T ss_pred EEecCCcEEEEEeEE
Confidence 887777888887653
No 266
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=66.98 E-value=1.4e+02 Score=29.75 Aligned_cols=85 Identities=19% Similarity=0.173 Sum_probs=55.5
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEEe-CC-EEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecc-eEEe
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVA-NG-VLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGG-ASVS 350 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~-~~-~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~-p~~~ 350 (382)
..+++|.++|+..|..+....-...+. ..+.+. ++ ++.-++ .+|.|+..+.++|++.-.+.-.+.++.- -...
T Consensus 429 s~dstV~lwdv~~gv~i~~f~kH~~pV--ysvafS~~g~ylAsGs--~dg~V~iws~~~~~l~~s~~~~~~Ifel~Wn~~ 504 (524)
T KOG0273|consen 429 SFDSTVKLWDVESGVPIHTLMKHQEPV--YSVAFSPNGRYLASGS--LDGCVHIWSTKTGKLVKSYQGTGGIFELCWNAA 504 (524)
T ss_pred ecCCeEEEEEccCCceeEeeccCCCce--EEEEecCCCcEEEecC--CCCeeEeccccchheeEeecCCCeEEEEEEcCC
Confidence 356889999999999999875444322 223333 33 444444 5899999999999988777655543322 1235
Q ss_pred CCEEEEEeCcee
Q 040693 351 NGCIYMGNGYKV 362 (382)
Q Consensus 351 ~g~lyv~~~~g~ 362 (382)
+++|-+.-++|.
T Consensus 505 G~kl~~~~sd~~ 516 (524)
T KOG0273|consen 505 GDKLGACASDGS 516 (524)
T ss_pred CCEEEEEecCCC
Confidence 677777766655
No 267
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=66.56 E-value=37 Score=31.99 Aligned_cols=96 Identities=16% Similarity=0.079 Sum_probs=56.5
Q ss_pred CceEEEEECCCCcEE--eeecCCCCCCCCcceEEeCCEEEEeeecCCCc-EEEEeCCCCcEeEEEecCCc--eecceEE-
Q 040693 276 AGGWVAMDASNGNVL--WSTADPSNGTAPGPVTVANGVLFGGSTYRQGP-IYAMDVKTGKILWSYDTGAT--IYGGASV- 349 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~--W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~-l~~ld~~tG~ilw~~~~~~~--~~~~p~~- 349 (382)
.|.|...|+..-+.. -.++........-.+ .-+|.+++... .+|. |..||..+|+.+-++.-+.. ..-..+.
T Consensus 158 ~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~L-n~~Gt~vATaS-tkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFS 235 (346)
T KOG2111|consen 158 TGQVQIVDLASTKPNAPSIINAHDSDIACVAL-NLQGTLVATAS-TKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFS 235 (346)
T ss_pred cceEEEEEhhhcCcCCceEEEcccCceeEEEE-cCCccEEEEec-cCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeC
Confidence 488999998766553 222222211111122 22455555443 3665 66789999999999877642 2223333
Q ss_pred eCCEEEEEeCceeEeecCCccCCC
Q 040693 350 SNGCIYMGNGYKVTVGFGNKNFTS 373 (382)
Q Consensus 350 ~~g~lyv~~~~g~~~~~~~~~~~~ 373 (382)
.+....+.+++..++|+|++..+.
T Consensus 236 p~~s~LavsSdKgTlHiF~l~~~~ 259 (346)
T KOG2111|consen 236 PNSSWLAVSSDKGTLHIFSLRDTE 259 (346)
T ss_pred CCccEEEEEcCCCeEEEEEeecCC
Confidence 566666677777788998876643
No 268
>PF14298 DUF4374: Domain of unknown function (DUF4374)
Probab=65.19 E-value=9.2 Score=37.71 Aligned_cols=57 Identities=19% Similarity=0.207 Sum_probs=40.8
Q ss_pred CcCCCCceeeeeecCcCccceeeeceEEEcCEEEEeccCccccccccccccccceEEEEeCccCce
Q 040693 2 VKRSNGKLVWKTKLDDHARSFITMSGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRI 67 (382)
Q Consensus 2 ld~~tGk~~W~~~~~~~~~~~~~~~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~ 67 (382)
+|+.++++.|...++.+....+..+|.+.++.+|++....+.. ...||-+|+.|++.
T Consensus 372 ~d~~~kt~t~V~glP~~~is~~~~~~~ve~G~aYi~Vtt~~g~---------~~~IY~iDp~TatA 428 (435)
T PF14298_consen 372 FDVSNKTFTWVTGLPADLISGFGNAPYVENGKAYIPVTTEDGS---------DPYIYKIDPATATA 428 (435)
T ss_pred EEccCceeEEeccCChhhccccccceEeeCCEEEEEEeecCCC---------ceeEEEEcCccccc
Confidence 5888999999988875422334457889999999977533221 24799999988764
No 269
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=64.78 E-value=55 Score=34.46 Aligned_cols=94 Identities=11% Similarity=0.076 Sum_probs=63.4
Q ss_pred CCceEEEEECCCCcEEeeecCCCCCCCCcceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEE-eC
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SN 351 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~ 351 (382)
-+|.|..++..++++.-+.+..... .+.+-++ +.++.-++ .++.|.+.|.-.=+-+.+..-.......... .+
T Consensus 85 aDGsVqif~~~s~~~~~tfngHK~A--Vt~l~fd~~G~rlaSGs--kDt~IIvwDlV~E~Gl~rL~GHkd~iT~~~F~~~ 160 (888)
T KOG0306|consen 85 ADGSVQIFSLESEEILITFNGHKAA--VTTLKFDKIGTRLASGS--KDTDIIVWDLVGEEGLFRLRGHKDSITQALFLNG 160 (888)
T ss_pred cCceEEeeccCCCceeeeecccccc--eEEEEEcccCceEeecC--CCccEEEEEeccceeeEEeecchHHHhHHhccCC
Confidence 4588999999999888888766532 2333333 44555555 5899999998755556666543333333333 36
Q ss_pred CEEEEEeCceeEeecCCccCC
Q 040693 352 GCIYMGNGYKVTVGFGNKNFT 372 (382)
Q Consensus 352 g~lyv~~~~g~~~~~~~~~~~ 372 (382)
+++.|+++.....++|.++.+
T Consensus 161 ~~~lvS~sKDs~iK~WdL~tq 181 (888)
T KOG0306|consen 161 DSFLVSVSKDSMIKFWDLETQ 181 (888)
T ss_pred CeEEEEeccCceEEEEecccc
Confidence 899999998888888887764
No 270
>COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only]
Probab=64.48 E-value=90 Score=30.86 Aligned_cols=47 Identities=11% Similarity=0.145 Sum_probs=29.7
Q ss_pred ecEEEEEccCcEEEEEeC-CCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEec
Q 040693 205 HDIVVAVQKSGFAWALDR-DSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIA 258 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~-~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~ 258 (382)
++.+..+...| +++++. ++-|+.|.++.+. .. -..-..++.+|+...
T Consensus 149 ~nvL~i~~~~g-it~yn~~e~~k~vw~~~fnG-----sy-vdaRlynG~lYiv~r 196 (603)
T COG4880 149 GNVLAIGEVGG-ITLYNLYESSKKVWVYNFNG-----SY-VDARLYNGELYIVAR 196 (603)
T ss_pred CcEEEEEEeCC-EEEEEeccccceeEEEecCC-----ce-eeeeeeCCEEEEEEc
Confidence 45555555544 777877 8999999999753 11 111226778887643
No 271
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=63.75 E-value=1.3e+02 Score=28.30 Aligned_cols=130 Identities=18% Similarity=0.178 Sum_probs=80.7
Q ss_pred eecEEEEEccCcEEEEEeCCCCCeeeee-ccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 204 KHDIVVAVQKSGFAWALDRDSGSLIWSM-EAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 204 ~~~~v~~~~~~g~l~ald~~tG~~~W~~-~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
+++.++....+.-|..+|..+-+.=-.. ..+ ..+..++..+.++..|++. .+..+.-+
T Consensus 95 se~yvyvad~ssGL~IvDIS~P~sP~~~~~ln----t~gyaygv~vsGn~aYVad-----------------lddgfLiv 153 (370)
T COG5276 95 SEEYVYVADWSSGLRIVDISTPDSPTLIGFLN----TDGYAYGVYVSGNYAYVAD-----------------LDDGFLIV 153 (370)
T ss_pred cccEEEEEcCCCceEEEeccCCCCcceecccc----CCceEEEEEecCCEEEEee-----------------ccCcEEEE
Confidence 3678888888777888888753211111 111 1124455555788888873 23456667
Q ss_pred ECCCC---cEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC---CcEeEEEecCCceecceEEeCCEEEE
Q 040693 283 DASNG---NVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT---GKILWSYDTGATIYGGASVSNGCIYM 356 (382)
Q Consensus 283 d~~tG---~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t---G~ilw~~~~~~~~~~~p~~~~g~lyv 356 (382)
|..+- .+.=+...+.. ..--+.+.+++.|++.. ++.|..+|..+ -+.+-+++.+.+. -+..+.+++.|+
T Consensus 154 dvsdpssP~lagrya~~~~--d~~~v~ISGn~AYvA~~--d~GL~ivDVSnp~sPvli~~~n~g~g~-~sv~vsdnr~y~ 228 (370)
T COG5276 154 DVSDPSSPQLAGRYALPGG--DTHDVAISGNYAYVAWR--DGGLTIVDVSNPHSPVLIGSYNTGPGT-YSVSVSDNRAYL 228 (370)
T ss_pred ECCCCCCceeeeeeccCCC--CceeEEEecCeEEEEEe--CCCeEEEEccCCCCCeEEEEEecCCce-EEEEecCCeeEE
Confidence 76654 34444444432 12355678999999985 89999998764 5567777776543 355667778877
Q ss_pred EeC
Q 040693 357 GNG 359 (382)
Q Consensus 357 ~~~ 359 (382)
..-
T Consensus 229 vvy 231 (370)
T COG5276 229 VVY 231 (370)
T ss_pred EEc
Confidence 754
No 272
>TIGR03054 photo_alph_chp1 putative photosynthetic complex assembly protein. In twenty or so anoxygenic photosynthetic alpha-Proteobacteria known so far, a gene for a member of this protein family is present and is found in the vicinity of puhA, which encodes a component of the photosynthetic reaction center, and other genes associated with photosynthesis. This protein family is suggested, consequently, as a probable assembly factor for the photosynthetic reaction center, but it seems its actual function has not yet been demonstrated.
Probab=62.62 E-value=51 Score=27.08 Aligned_cols=71 Identities=18% Similarity=0.163 Sum_probs=46.3
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCccccee-------e--eCCeEEEEecCccccccccCCCCCCCC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAA-------T--DERRIYTNIANSQHKNFNLKPSKNSTI 275 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~-------~--~~~~v~~~~~~~~~~~~~~~~~~~~~~ 275 (382)
..+.+....+|.+..+|..+|+.+-..+.+..++.....-+.. + +.....+- ..
T Consensus 41 r~l~f~d~~~G~v~V~~~~~G~~va~~~~g~~GFvrgvlR~l~R~R~~~gv~~~~Pf~L~r-----------------~~ 103 (135)
T TIGR03054 41 LWLVFEDRPDGAVAVVETPDGRLVAILEPGQNGFVRVMLRGLARARARAGVAAEPPFRLTR-----------------YD 103 (135)
T ss_pred EEEEEecCCCCeEEEEECCCCCEEEEecCCCCchhhHhHHHHHHHHHHcCCCCCCCEEEEE-----------------Ee
Confidence 6788888899999988999999999998775433221111100 0 11222221 44
Q ss_pred CceEEEEECCCCcEEee
Q 040693 276 AGGWVAMDASNGNVLWS 292 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~ 292 (382)
+|++...|+.||+..=-
T Consensus 104 dGrltL~Dp~Tg~~i~L 120 (135)
T TIGR03054 104 NGRLTLTDPATGWSIEL 120 (135)
T ss_pred CCcEEEEcCCCCcEEEE
Confidence 68899999999976533
No 273
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=62.47 E-value=1.7e+02 Score=29.06 Aligned_cols=60 Identities=12% Similarity=0.109 Sum_probs=36.8
Q ss_pred CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee------------cce-EEeCCEEEEEeCceeEeecCCccC
Q 040693 310 GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY------------GGA-SVSNGCIYMGNGYKVTVGFGNKNF 371 (382)
Q Consensus 310 ~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~------------~~p-~~~~g~lyv~~~~g~~~~~~~~~~ 371 (382)
.-.+.++ ++|.|+..+...-+++...+...+.. .+. ++.+..|+++.+....+.+|.++.
T Consensus 339 ~HfvsGS--dnG~IaLWs~~KKkplf~~~~AHgv~~~~~~~~~~~Witsla~i~~sdL~asGS~~G~vrLW~i~~ 411 (479)
T KOG0299|consen 339 EHFVSGS--DNGSIALWSLLKKKPLFTSRLAHGVIPELDPVNGNFWITSLAVIPGSDLLASGSWSGCVRLWKIED 411 (479)
T ss_pred cceeecc--CCceEEEeeecccCceeEeeccccccCCccccccccceeeeEecccCceEEecCCCCceEEEEecC
Confidence 3333444 48999999999999999887765432 222 335566666655333455555443
No 274
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=62.44 E-value=1.1e+02 Score=27.90 Aligned_cols=95 Identities=16% Similarity=0.239 Sum_probs=52.9
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCc--ceE----EeCCEEEEeeecCCCcEEEEeCCCC------cEeEEEecCC
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPG--PVT----VANGVLFGGSTYRQGPIYAMDVKTG------KILWSYDTGA 341 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~--~~~----~~~~~v~~~~~~~~g~l~~ld~~tG------~ilw~~~~~~ 341 (382)
..++.|....-.+| +|+..........+ .+. -.+-++++++. +|.|..|+.++- +|.--.+++.
T Consensus 77 sYDgkVIiWke~~g--~w~k~~e~~~h~~SVNsV~wapheygl~LacasS--DG~vsvl~~~~~g~w~t~ki~~aH~~Gv 152 (299)
T KOG1332|consen 77 SYDGKVIIWKEENG--RWTKAYEHAAHSASVNSVAWAPHEYGLLLACASS--DGKVSVLTYDSSGGWTTSKIVFAHEIGV 152 (299)
T ss_pred ecCceEEEEecCCC--chhhhhhhhhhcccceeecccccccceEEEEeeC--CCcEEEEEEcCCCCccchhhhhcccccc
Confidence 45677888888777 78765433211111 111 12335666664 999888887632 2222223332
Q ss_pred -ceecceEEeCC-----------EEEEEeCceeEeecCCccCC
Q 040693 342 -TIYGGASVSNG-----------CIYMGNGYKVTVGFGNKNFT 372 (382)
Q Consensus 342 -~~~~~p~~~~g-----------~lyv~~~~g~~~~~~~~~~~ 372 (382)
.+.-.|+...| +-|++.+-...+++++++..
T Consensus 153 nsVswapa~~~g~~~~~~~~~~~krlvSgGcDn~VkiW~~~~~ 195 (299)
T KOG1332|consen 153 NSVSWAPASAPGSLVDQGPAAKVKRLVSGGCDNLVKIWKFDSD 195 (299)
T ss_pred ceeeecCcCCCccccccCcccccceeeccCCccceeeeecCCc
Confidence 22333443333 55677777788899888874
No 275
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture.; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=62.30 E-value=1.6e+02 Score=28.78 Aligned_cols=37 Identities=22% Similarity=0.320 Sum_probs=23.9
Q ss_pred cEEeeecCCCCCCCCcceEEeC--CEEEEeeecCCCcEEEEeCC
Q 040693 288 NVLWSTADPSNGTAPGPVTVAN--GVLFGGSTYRQGPIYAMDVK 329 (382)
Q Consensus 288 ~~~W~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~g~l~~ld~~ 329 (382)
++++++..+.. .--+.+++ +.+|++-. +-.|+.++++
T Consensus 199 ~lVR~f~~~sQ---~EGCVVDDe~g~LYvgEE--~~GIW~y~Ae 237 (381)
T PF02333_consen 199 TLVREFKVGSQ---PEGCVVDDETGRLYVGEE--DVGIWRYDAE 237 (381)
T ss_dssp EEEEEEE-SS----EEEEEEETTTTEEEEEET--TTEEEEEESS
T ss_pred EEEEEecCCCc---ceEEEEecccCCEEEecC--ccEEEEEecC
Confidence 46777776542 22234554 79999884 7889999876
No 276
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=62.15 E-value=61 Score=34.14 Aligned_cols=144 Identities=10% Similarity=0.007 Sum_probs=80.8
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
+..++.|.-|+.+...+..+-++.--.++.. ......+.| ++...+++ +..|....|+.
T Consensus 422 DryFiSGSLD~KvRiWsI~d~~Vv~W~Dl~~--lITAvcy~P--dGk~avIG-----------------t~~G~C~fY~t 480 (712)
T KOG0283|consen 422 DRYFISGSLDGKVRLWSISDKKVVDWNDLRD--LITAVCYSP--DGKGAVIG-----------------TFNGYCRFYDT 480 (712)
T ss_pred CCcEeecccccceEEeecCcCeeEeehhhhh--hheeEEecc--CCceEEEE-----------------EeccEEEEEEc
Confidence 5667777777777777776665553334321 111111222 56666666 34466666666
Q ss_pred CCCcEE--eeecCCCC-----CCCCcceE--EeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC----CceecceEEeC
Q 040693 285 SNGNVL--WSTADPSN-----GTAPGPVT--VANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG----ATIYGGASVSN 351 (382)
Q Consensus 285 ~tG~~~--W~~~~~~~-----~~~~~~~~--~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~----~~~~~~p~~~~ 351 (382)
..-+.. |.+..... ....+--. ...+.|.+.+. +.+|..+|..+-+++-+++-. ..+.++... +
T Consensus 481 ~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vLVTSn--DSrIRI~d~~~~~lv~KfKG~~n~~SQ~~Asfs~-D 557 (712)
T KOG0283|consen 481 EGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVLVTSN--DSRIRIYDGRDKDLVHKFKGFRNTSSQISASFSS-D 557 (712)
T ss_pred cCCeEEEeeeEeeccCccccCceeeeeEecCCCCCeEEEecC--CCceEEEeccchhhhhhhcccccCCcceeeeEcc-C
Confidence 655433 22222210 00111111 12246888886 999999999777887777532 234445555 6
Q ss_pred CEEEEEeCceeEeecCCccCC
Q 040693 352 GCIYMGNGYKVTVGFGNKNFT 372 (382)
Q Consensus 352 g~lyv~~~~g~~~~~~~~~~~ 372 (382)
|+-.|+.+++..+++|.++..
T Consensus 558 gk~IVs~seDs~VYiW~~~~~ 578 (712)
T KOG0283|consen 558 GKHIVSASEDSWVYIWKNDSF 578 (712)
T ss_pred CCEEEEeecCceEEEEeCCCC
Confidence 666666667788888887543
No 277
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=62.09 E-value=25 Score=31.78 Aligned_cols=63 Identities=21% Similarity=0.217 Sum_probs=40.0
Q ss_pred eCCEEEEeeecCCCcEEEEeCCCCcEeEE--EecCCceecceEE-----eCCEEEEEeCceeEeecCCccCCCCC
Q 040693 308 ANGVLFGGSTYRQGPIYAMDVKTGKILWS--YDTGATIYGGASV-----SNGCIYMGNGYKVTVGFGNKNFTSGT 375 (382)
Q Consensus 308 ~~~~v~~~~~~~~g~l~~ld~~tG~ilw~--~~~~~~~~~~p~~-----~~g~lyv~~~~g~~~~~~~~~~~~g~ 375 (382)
.++.+|..+. .++||.||+.+|..... .++.........- .-+||-|.+..|.- +.+++.+|.
T Consensus 37 a~G~LYgl~~--~g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvvs~~GqN---lR~npdtGa 106 (236)
T PF14339_consen 37 ANGQLYGLGS--TGRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVVSNTGQN---LRLNPDTGA 106 (236)
T ss_pred CCCCEEEEeC--CCcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEEccCCcE---EEECCCCCC
Confidence 4678888874 89999999999986554 2222222122111 23788888776653 557777776
No 278
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=61.65 E-value=22 Score=34.58 Aligned_cols=60 Identities=23% Similarity=0.242 Sum_probs=37.9
Q ss_pred CCceEEEEECCCCcEEeeecCCCCCCCCcceEE-eC-CEEEEeeecCCCcEEEEeCCCCcEeEEEe
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSNGTAPGPVTV-AN-GVLFGGSTYRQGPIYAMDVKTGKILWSYD 338 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~-~~-~~v~~~~~~~~g~l~~ld~~tG~ilw~~~ 338 (382)
.++.|-..|+++|+-+=+.-..+.. ...+.+ .+ +.+..++ ++..+..||..+=+++..+.
T Consensus 242 kDnlVKlWDprSg~cl~tlh~HKnt--Vl~~~f~~n~N~Llt~s--kD~~~kv~DiR~mkEl~~~r 303 (464)
T KOG0284|consen 242 KDNLVKLWDPRSGSCLATLHGHKNT--VLAVKFNPNGNWLLTGS--KDQSCKVFDIRTMKELFTYR 303 (464)
T ss_pred CCceeEeecCCCcchhhhhhhccce--EEEEEEcCCCCeeEEcc--CCceEEEEehhHhHHHHHhh
Confidence 3457889999999887666554421 111122 23 4444444 68889999998766666554
No 279
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=61.05 E-value=1.4e+02 Score=28.03 Aligned_cols=56 Identities=13% Similarity=-0.051 Sum_probs=35.4
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEEe-CCEEEEeeecCCCcEEEEeCCC
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVA-NGVLFGGSTYRQGPIYAMDVKT 330 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~g~l~~ld~~t 330 (382)
..++++.++|.+|-+..|.++..-+. ....+-+. |.--++.+-+++|+|...|...
T Consensus 190 t~d~tl~~~D~RT~~~~~sI~dAHgq-~vrdlDfNpnkq~~lvt~gDdgyvriWD~R~ 246 (370)
T KOG1007|consen 190 TSDSTLQFWDLRTMKKNNSIEDAHGQ-RVRDLDFNPNKQHILVTCGDDGYVRIWDTRK 246 (370)
T ss_pred eCCCcEEEEEccchhhhcchhhhhcc-eeeeccCCCCceEEEEEcCCCccEEEEeccC
Confidence 45688999999999999998754421 11122122 2223333334789999999763
No 280
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity.
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis.
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigen nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=59.38 E-value=49 Score=31.77 Aligned_cols=146 Identities=16% Similarity=0.132 Sum_probs=72.1
Q ss_pred EEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCe-EEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeee
Q 040693 216 FAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERR-IYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWST 293 (382)
Q Consensus 216 ~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~-v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~ 293 (382)
.|+.+|.++|+..+-.................. .++. +++...+.. ...-.+..+|+++|+..+..
T Consensus 159 ~l~v~~~~~~~~~~~~~~~~~~~~~~yl~~v~W~~d~~~l~~~~~nR~------------q~~~~l~~~d~~tg~~~~~~ 226 (353)
T PF00930_consen 159 SLFVVDLASGKTTELDPPNSLNPQDYYLTRVGWSPDGKRLWVQWLNRD------------QNRLDLVLCDASTGETRVVL 226 (353)
T ss_dssp EEEEEESSSTCCCEE---HHHHTSSEEEEEEEEEETTEEEEEEEEETT------------STEEEEEEEEECTTTCEEEE
T ss_pred EEEEEECCCCcEEEeeeccccCCCccCcccceecCCCcEEEEEEcccC------------CCEEEEEEEECCCCceeEEE
Confidence 478889999987643222000000011111111 3344 666544432 33455889999999777665
Q ss_pred cCCCC-CC-CCcceE-E--eCC-EEEEeeecCCC--cEEEEeCCCCcEeEEEecCC-ceecceEE--eCCEEEEEeCcee
Q 040693 294 ADPSN-GT-APGPVT-V--ANG-VLFGGSTYRQG--PIYAMDVKTGKILWSYDTGA-TIYGGASV--SNGCIYMGNGYKV 362 (382)
Q Consensus 294 ~~~~~-~~-~~~~~~-~--~~~-~v~~~~~~~~g--~l~~ld~~tG~ilw~~~~~~-~~~~~p~~--~~g~lyv~~~~g~ 362 (382)
..... .. ...++. . +++ .+++.. ++| .|+.++.+++++. +...+. .+.....+ .++.||.......
T Consensus 227 ~e~~~~Wv~~~~~~~~~~~~~~~~l~~s~--~~G~~hly~~~~~~~~~~-~lT~G~~~V~~i~~~d~~~~~iyf~a~~~~ 303 (353)
T PF00930_consen 227 EETSDGWVDVYDPPHFLGPDGNEFLWISE--RDGYRHLYLYDLDGGKPR-QLTSGDWEVTSILGWDEDNNRIYFTANGDN 303 (353)
T ss_dssp EEESSSSSSSSSEEEE-TTTSSEEEEEEE--TTSSEEEEEEETTSSEEE-ESS-SSS-EEEEEEEECTSSEEEEEESSGG
T ss_pred EecCCcceeeecccccccCCCCEEEEEEE--cCCCcEEEEEccccccee-ccccCceeecccceEcCCCCEEEEEecCCC
Confidence 43221 11 122222 3 233 444444 455 6999999877744 222221 12222222 3578887655422
Q ss_pred --EeecCCccCC-CCCe
Q 040693 363 --TVGFGNKNFT-SGTS 376 (382)
Q Consensus 363 --~~~~~~~~~~-~g~~ 376 (382)
.-++|.++.. .|+.
T Consensus 304 p~~r~lY~v~~~~~~~~ 320 (353)
T PF00930_consen 304 PGERHLYRVSLDSGGEP 320 (353)
T ss_dssp TTSBEEEEEETTETTEE
T ss_pred CCceEEEEEEeCCCCCe
Confidence 4567777777 5443
No 281
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=58.97 E-value=1e+02 Score=25.36 Aligned_cols=57 Identities=18% Similarity=0.372 Sum_probs=41.6
Q ss_pred CceEEEEECCCCcEEeeecCCCCCCCCcceEE------eCCEEEEeeecCCCcEEEEeCCCCcEeEEEe
Q 040693 276 AGGWVAMDASNGNVLWSTADPSNGTAPGPVTV------ANGVLFGGSTYRQGPIYAMDVKTGKILWSYD 338 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~------~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~ 338 (382)
..+|.|+|..+-..+...+++.+ ...+.+ ...+++++. +-.|..||.+--++.|.+.
T Consensus 72 ~t~llaYDV~~N~d~Fyke~~DG---vn~i~~g~~~~~~~~l~ivGG---ncsi~Gfd~~G~e~fWtVt 134 (136)
T PF14781_consen 72 QTSLLAYDVENNSDLFYKEVPDG---VNAIVIGKLGDIPSPLVIVGG---NCSIQGFDYEGNEIFWTVT 134 (136)
T ss_pred cceEEEEEcccCchhhhhhCccc---eeEEEEEecCCCCCcEEEECc---eEEEEEeCCCCcEEEEEec
Confidence 36799999999988888887653 222223 346777766 6789999988777888763
No 282
>PF01453 B_lectin: D-mannose binding lectin; InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Alliaceae) contains a ~115-residue-long domain whose overall three-dimensional fold is very similar to that of Dictyostelium discoideum comitin, an actin binding protein, and Curculigo latifolia curculin, a sweet tasting and taste-modifying protein. This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity. Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=58.93 E-value=44 Score=26.40 Aligned_cols=22 Identities=27% Similarity=0.496 Sum_probs=17.3
Q ss_pred EEccCcEEEEEeCCCCCeeeeec
Q 040693 210 AVQKSGFAWALDRDSGSLIWSME 232 (382)
Q Consensus 210 ~~~~~g~l~ald~~tG~~~W~~~ 232 (382)
.-..+|.|+.+| .+++++|+-.
T Consensus 58 ~L~~~GNlvl~d-~~~~~lW~Sf 79 (114)
T PF01453_consen 58 VLQDDGNLVLYD-SSGNVLWQSF 79 (114)
T ss_dssp EEETTSEEEEEE-TTSEEEEEST
T ss_pred EEeCCCCEEEEe-ecceEEEeec
Confidence 344588899999 5799999973
No 283
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=58.88 E-value=1.2e+02 Score=29.07 Aligned_cols=19 Identities=11% Similarity=0.268 Sum_probs=15.9
Q ss_pred ecEEEEEccCcEEEEEeCC
Q 040693 205 HDIVVAVQKSGFAWALDRD 223 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~ 223 (382)
.++++.++-||-+-.||.+
T Consensus 178 pnlLlSGSvDGLvnlfD~~ 196 (376)
T KOG1188|consen 178 PNLLLSGSVDGLVNLFDTK 196 (376)
T ss_pred CCeEEeecccceEEeeecC
Confidence 4778899999988888876
No 284
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=57.27 E-value=1.7e+02 Score=27.55 Aligned_cols=131 Identities=18% Similarity=0.137 Sum_probs=76.2
Q ss_pred ecEEEEEccCcEEEEEeCCCCC---eeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGS---LIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA 281 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~---~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a 281 (382)
++..|+..-+.-+..+|..+-+ +.=++. ..+...+-.++.++..|+. +.++.++.
T Consensus 138 Gn~aYVadlddgfLivdvsdpssP~lagrya-----~~~~d~~~v~ISGn~AYvA-----------------~~d~GL~i 195 (370)
T COG5276 138 GNYAYVADLDDGFLIVDVSDPSSPQLAGRYA-----LPGGDTHDVAISGNYAYVA-----------------WRDGGLTI 195 (370)
T ss_pred CCEEEEeeccCcEEEEECCCCCCceeeeeec-----cCCCCceeEEEecCeEEEE-----------------EeCCCeEE
Confidence 4556666533335556665421 111111 1122234556688888887 56788999
Q ss_pred EECCCC---cEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEec---CCc-eecceEEeCCEE
Q 040693 282 MDASNG---NVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDT---GAT-IYGGASVSNGCI 354 (382)
Q Consensus 282 ~d~~tG---~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~---~~~-~~~~p~~~~g~l 354 (382)
+|.++- +.+=+++... ..-.+.+.+++.|+... +..|.-+|..+-+-.|.+-. ... ..++..+.+++.
T Consensus 196 vDVSnp~sPvli~~~n~g~---g~~sv~vsdnr~y~vvy--~egvlivd~s~~ssp~~~gsyet~~p~~~s~v~Vs~~~~ 270 (370)
T COG5276 196 VDVSNPHSPVLIGSYNTGP---GTYSVSVSDNRAYLVVY--DEGVLIVDVSGPSSPTVFGSYETSNPVSISTVPVSGEYA 270 (370)
T ss_pred EEccCCCCCeEEEEEecCC---ceEEEEecCCeeEEEEc--ccceEEEecCCCCCceEeeccccCCcccccceeccccee
Confidence 998765 2333333321 12344466788888875 77788888876665665532 221 224567789999
Q ss_pred EEEeCcee
Q 040693 355 YMGNGYKV 362 (382)
Q Consensus 355 yv~~~~g~ 362 (382)
|+......
T Consensus 271 Yvadga~g 278 (370)
T COG5276 271 YVADGAKG 278 (370)
T ss_pred eeeccccC
Confidence 99876533
No 285
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=56.55 E-value=2.1e+02 Score=28.36 Aligned_cols=164 Identities=10% Similarity=0.045 Sum_probs=80.1
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccc-----cCCCCCCCCCceE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFN-----LKPSKNSTIAGGW 279 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~-----~~~~~~~~~~g~v 279 (382)
++.|++-+-||.|.-++.++ ......++..-.+|...+.+ ..+.+++......-..|. ..........+..
T Consensus 145 ~~~IcVQS~DG~L~~feqe~--~~f~~~lp~~llPgPl~Y~~--~tDsfvt~sss~~l~~Yky~~La~~s~~~~~~~~~~ 220 (418)
T PF14727_consen 145 RDFICVQSMDGSLSFFEQES--FAFSRFLPDFLLPGPLCYCP--RTDSFVTASSSWTLECYKYQDLASASEASSRQSGTE 220 (418)
T ss_pred ceEEEEEecCceEEEEeCCc--EEEEEEcCCCCCCcCeEEee--cCCEEEEecCceeEEEecHHHhhhcccccccccccc
Confidence 68899999999999999774 44454444322223232322 233333332221111110 0000000000000
Q ss_pred E-EEECCCCcEEeeecCCCCCCCCcceEE--eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCc-----eecceEEeC
Q 040693 280 V-AMDASNGNVLWSTADPSNGTAPGPVTV--ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGAT-----IYGGASVSN 351 (382)
Q Consensus 280 ~-a~d~~tG~~~W~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~-----~~~~p~~~~ 351 (382)
. .-..+.=++.|++.+.........+.. ....|++.. +..|++|+. +|++.|...+... .+..+...+
T Consensus 221 ~~~~~~k~l~~dWs~nlGE~~l~i~v~~~~~~~~~IvvLg---er~Lf~l~~-~G~l~~~krLd~~p~~~~~Y~~~~~~~ 296 (418)
T PF14727_consen 221 QDISSGKKLNPDWSFNLGEQALDIQVVRFSSSESDIVVLG---ERSLFCLKD-NGSLRFQKRLDYNPSCFCPYRVPWYNE 296 (418)
T ss_pred ccccccccccceeEEECCceeEEEEEEEcCCCCceEEEEe---cceEEEEcC-CCeEEEEEecCCceeeEEEEEeecccC
Confidence 0 002223357899998764222122111 123555555 789999996 6999998887642 122222223
Q ss_pred C----EEEEEeCceeEeecCCccCCCCCeEEEEE
Q 040693 352 G----CIYMGNGYKVTVGFGNKNFTSGTSLYAFC 381 (382)
Q Consensus 352 g----~lyv~~~~g~~~~~~~~~~~~g~~l~~~~ 381 (382)
+ .+.|++..+. ..+| +..+++|+-.
T Consensus 297 ~~~~~~llV~t~t~~-LlVy----~d~~L~WsA~ 325 (418)
T PF14727_consen 297 PSTRLNLLVGTHTGT-LLVY----EDTTLVWSAQ 325 (418)
T ss_pred CCCceEEEEEecCCe-EEEE----eCCeEEEecC
Confidence 2 3777777655 3333 3456677643
No 286
>COG5167 VID27 Protein involved in vacuole import and degradation [Intracellular trafficking and secretion]
Probab=55.35 E-value=2.1e+02 Score=29.17 Aligned_cols=24 Identities=25% Similarity=0.314 Sum_probs=19.7
Q ss_pred cceEEEEECCCCcEEEEEecCCCc
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYD 165 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~ 165 (382)
...+|.+|..-|+++=+.......
T Consensus 489 ~~kLykmDIErGkvveeW~~~ddv 512 (776)
T COG5167 489 RDKLYKMDIERGKVVEEWDLKDDV 512 (776)
T ss_pred cccceeeecccceeeeEeecCCcc
Confidence 378999999999999777776653
No 287
>PF13964 Kelch_6: Kelch motif
Probab=54.77 E-value=34 Score=22.17 Aligned_cols=37 Identities=22% Similarity=0.432 Sum_probs=24.8
Q ss_pred ceEEEcCEEEEeccCccccccccccccccceEEEEeCccCceeeee
Q 040693 26 SGTYYKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQT 71 (382)
Q Consensus 26 ~p~v~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~ 71 (382)
+-++.+++|||........ .+...+.++|++|.+ |+.
T Consensus 6 s~v~~~~~iyv~GG~~~~~-------~~~~~v~~yd~~t~~--W~~ 42 (50)
T PF13964_consen 6 SAVVVGGKIYVFGGYDNSG-------KYSNDVERYDPETNT--WEQ 42 (50)
T ss_pred EEEEECCEEEEECCCCCCC-------CccccEEEEcCCCCc--EEE
Confidence 4477889999966543311 025689999998865 875
No 288
>PF05262 Borrelia_P83: Borrelia P83/100 protein; InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=54.45 E-value=1e+02 Score=31.20 Aligned_cols=84 Identities=12% Similarity=-0.002 Sum_probs=51.8
Q ss_pred cCcEEEEEeCCCCCeeeeeccCCCCCCCCcccce-eeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEe
Q 040693 213 KSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGA-ATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLW 291 (382)
Q Consensus 213 ~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~-~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W 291 (382)
.-+.|+.||++||+.+-+-....- ..... ...+..|.+.... + ...=.|+.||+.|-++.=
T Consensus 373 ~ls~LvllD~~tg~~l~~S~~~~I-----r~r~~~~~~~~~vaI~g~~-G------------~~~ikLvlid~~tLev~k 434 (489)
T PF05262_consen 373 YLSELVLLDSDTGDTLKRSPVNGI-----RGRTFYEREDDLVAIAGCS-G------------NAAIKLVLIDPETLEVKK 434 (489)
T ss_pred cceeEEEEeCCCCceeccccccee-----ccceeEEcCCCEEEEeccC-C------------chheEEEecCcccceeee
Confidence 357899999999999988776431 11121 2244444443111 0 111236777898888876
Q ss_pred eecCCCCCCCCcceEEeCCEEEEee
Q 040693 292 STADPSNGTAPGPVTVANGVLFGGS 316 (382)
Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~v~~~~ 316 (382)
+-... ..+.++++++++.+|+..
T Consensus 435 es~~~--i~~~S~l~~~~~~iyaVv 457 (489)
T PF05262_consen 435 ESEDE--ISWQSSLIVDGQMIYAVV 457 (489)
T ss_pred ecccc--ccccCceEEcCCeEEEEE
Confidence 66554 345678888888888665
No 289
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=54.11 E-value=1.8e+02 Score=26.88 Aligned_cols=146 Identities=11% Similarity=0.039 Sum_probs=73.0
Q ss_pred eecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCC-CCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 204 KHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLG-GGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 204 ~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~-g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
+.-.++++-.+|.+..+|..+|.++=+++...+... .....+++. ..=|...-+ ++....+ ..+=.++.+
T Consensus 164 s~~lllaGyEsghvv~wd~S~~~~~~~~~~~~kv~~~~ash~qpvl--sldyas~~~---rGisgga----~dkl~~~Sl 234 (323)
T KOG0322|consen 164 STFLLLAGYESGHVVIWDLSTGDKIIQLPQSSKVESPNASHKQPVL--SLDYASSCD---RGISGGA----DDKLVMYSL 234 (323)
T ss_pred ceEEEEEeccCCeEEEEEccCCceeeccccccccccchhhccCcce--eeeechhhc---CCcCCCc----cccceeeee
Confidence 355667788889999999999865544442211000 000111110 000111000 0000001 112235677
Q ss_pred ECCCCcEEeeec--CCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCC-EEEEEeC
Q 040693 283 DASNGNVLWSTA--DPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNG-CIYMGNG 359 (382)
Q Consensus 283 d~~tG~~~W~~~--~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g-~lyv~~~ 359 (382)
+-.+|...-+.. .+.+.. .+.-+..++.|++... =++++.+++-+++.+|--.+-..+.....++.-+ .|....+
T Consensus 235 ~~s~gslq~~~e~~lknpGv-~gvrIRpD~KIlATAG-WD~RiRVyswrtl~pLAVLkyHsagvn~vAfspd~~lmAaas 312 (323)
T KOG0322|consen 235 NHSTGSLQIRKEITLKNPGV-SGVRIRPDGKILATAG-WDHRIRVYSWRTLNPLAVLKYHSAGVNAVAFSPDCELMAAAS 312 (323)
T ss_pred ccccCcccccceEEecCCCc-cceEEccCCcEEeecc-cCCcEEEEEeccCCchhhhhhhhcceeEEEeCCCCchhhhcc
Confidence 777776544433 222222 2333345666776553 5899999999999998766554444555555433 4444444
Q ss_pred c
Q 040693 360 Y 360 (382)
Q Consensus 360 ~ 360 (382)
.
T Consensus 313 k 313 (323)
T KOG0322|consen 313 K 313 (323)
T ss_pred C
Confidence 3
No 290
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=53.75 E-value=1.5e+02 Score=30.21 Aligned_cols=62 Identities=21% Similarity=0.233 Sum_probs=45.3
Q ss_pred CEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEeCcee-EeecCCccCCC
Q 040693 310 GVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGNGYKV-TVGFGNKNFTS 373 (382)
Q Consensus 310 ~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~~g~-~~~~~~~~~~~ 373 (382)
++.|.+- .+..+..+|-.+|+++-..........+.+++.+..|+-++... -+.++.++.++
T Consensus 502 ~~~~~~h--ed~~Ir~~dn~~~~~l~s~~a~~~svtslai~~ng~~l~s~s~d~sv~l~kld~k~ 564 (577)
T KOG0642|consen 502 DITFTAH--EDRSIRFFDNKTGKILHSMVAHKDSVTSLAIDPNGPYLMSGSHDGSVRLWKLDVKT 564 (577)
T ss_pred CeeEecc--cCCceecccccccccchheeeccceecceeecCCCceEEeecCCceeehhhccchh
Confidence 4666655 48889999999999999887776778888888888777654322 25566666554
No 291
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=53.36 E-value=1e+02 Score=29.96 Aligned_cols=68 Identities=13% Similarity=0.131 Sum_probs=47.2
Q ss_pred EEEEeeecCCCcEEEEeCCCCcEeEEEecCC-ceecceEEeCCEEEEEeCceeEeecCCccCCCCCeEEEEEC
Q 040693 311 VLFGGSTYRQGPIYAMDVKTGKILWSYDTGA-TIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSGTSLYAFCV 382 (382)
Q Consensus 311 ~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~-~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g~~l~~~~~ 382 (382)
.|..+. +++.|.+.....--++-.+-+++ .+.+.+.+.++++.++++... +++.-|-++|+.|-+|+|
T Consensus 165 ~IitaD--RDEkIRvs~ypa~f~IesfclGH~eFVS~isl~~~~~LlS~sGD~--tlr~Wd~~sgk~L~t~dl 233 (390)
T KOG3914|consen 165 FIITAD--RDEKIRVSRYPATFVIESFCLGHKEFVSTISLTDNYLLLSGSGDK--TLRLWDITSGKLLDTCDL 233 (390)
T ss_pred EEEEec--CCceEEEEecCcccchhhhccccHhheeeeeeccCceeeecCCCC--cEEEEecccCCcccccch
Confidence 444444 57888877766555666666655 477889999999977665444 357777888999876653
No 292
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=52.87 E-value=2.3e+02 Score=27.57 Aligned_cols=31 Identities=13% Similarity=0.157 Sum_probs=26.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGP 235 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~ 235 (382)
...+|..+.|..+...|.++|+..=+...+.
T Consensus 271 ~~v~yS~SwDHTIk~WDletg~~~~~~~~~k 301 (423)
T KOG0313|consen 271 ATVIYSVSWDHTIKVWDLETGGLKSTLTTNK 301 (423)
T ss_pred CCceEeecccceEEEEEeecccceeeeecCc
Confidence 5778889999999999999998887776543
No 293
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=50.38 E-value=3.2e+02 Score=28.53 Aligned_cols=143 Identities=10% Similarity=-0.032 Sum_probs=86.2
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
...++++-++|.+...|.+|-.+.-+++....+ .........++.++++ ..+..|..++.
T Consensus 25 ePw~la~LynG~V~IWnyetqtmVksfeV~~~P---vRa~kfiaRknWiv~G-----------------sDD~~IrVfny 84 (794)
T KOG0276|consen 25 EPWILAALYNGDVQIWNYETQTMVKSFEVSEVP---VRAAKFIARKNWIVTG-----------------SDDMQIRVFNY 84 (794)
T ss_pred CceEEEeeecCeeEEEecccceeeeeeeecccc---hhhheeeeccceEEEe-----------------cCCceEEEEec
Confidence 456777888999999999999888888865321 1112233366777776 44578999999
Q ss_pred CCCcEEeeecCCCCCCCCcceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCce-ecceEE--eCCEEEEEeC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATI-YGGASV--SNGCIYMGNG 359 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~-~~~p~~--~~g~lyv~~~ 359 (382)
.|++.+-.++.... .-..+.+. ...+..++ ++-.|-+.|-+.+=..-+.-.+... .+..++ .|..-|++.+
T Consensus 85 nt~ekV~~FeAH~D--yIR~iavHPt~P~vLtsS--DDm~iKlW~we~~wa~~qtfeGH~HyVMqv~fnPkD~ntFaS~s 160 (794)
T KOG0276|consen 85 NTGEKVKTFEAHSD--YIRSIAVHPTLPYVLTSS--DDMTIKLWDWENEWACEQTFEGHEHYVMQVAFNPKDPNTFASAS 160 (794)
T ss_pred ccceeeEEeecccc--ceeeeeecCCCCeEEecC--CccEEEEeeccCceeeeeEEcCcceEEEEEEecCCCccceeeee
Confidence 99999988886652 12223232 23444444 3556666666533222222222222 233333 5677777777
Q ss_pred ceeEeecCCccC
Q 040693 360 YKVTVGFGNKNF 371 (382)
Q Consensus 360 ~g~~~~~~~~~~ 371 (382)
=.++++++.+..
T Consensus 161 LDrTVKVWslgs 172 (794)
T KOG0276|consen 161 LDRTVKVWSLGS 172 (794)
T ss_pred ccccEEEEEcCC
Confidence 666777776654
No 294
>COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only]
Probab=49.88 E-value=1.7e+02 Score=28.96 Aligned_cols=55 Identities=24% Similarity=0.411 Sum_probs=38.6
Q ss_pred CCcceEEeCCEEEEeeecCCCcEEEEeC-CCCcEeEEEecCCceecceEEeCCEEEEEeC
Q 040693 301 APGPVTVANGVLFGGSTYRQGPIYAMDV-KTGKILWSYDTGATIYGGASVSNGCIYMGNG 359 (382)
Q Consensus 301 ~~~~~~~~~~~v~~~~~~~~g~l~~ld~-~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~~ 359 (382)
..+-+...++.+.+.. -+.+.+++. ++-|++|.++..+... ..-..+|.||+...
T Consensus 141 ecg~l~l~~nvL~i~~---~~git~yn~~e~~k~vw~~~fnGsyv-daRlynG~lYiv~r 196 (603)
T COG4880 141 ECGILALGGNVLAIGE---VGGITLYNLYESSKKVWVYNFNGSYV-DARLYNGELYIVAR 196 (603)
T ss_pred cceEEEEcCcEEEEEE---eCCEEEEEeccccceeEEEecCCcee-eeeeeCCEEEEEEc
Confidence 4555555566666655 688888887 7899999999876322 23347889998876
No 295
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=48.45 E-value=1.6e+02 Score=27.82 Aligned_cols=152 Identities=11% Similarity=0.101 Sum_probs=84.7
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCccccee-eeCCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAA-TDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
+..|++++..-.+..+|.+|-+---...+.. ......-..- ...+.+|++. ..+|.|..+|
T Consensus 228 GefllvgTdHp~~rlYdv~T~QcfvsanPd~--qht~ai~~V~Ys~t~~lYvTa----------------SkDG~IklwD 289 (430)
T KOG0640|consen 228 GEFLLVGTDHPTLRLYDVNTYQCFVSANPDD--QHTGAITQVRYSSTGSLYVTA----------------SKDGAIKLWD 289 (430)
T ss_pred CceEEEecCCCceeEEeccceeEeeecCccc--ccccceeEEEecCCccEEEEe----------------ccCCcEEeec
Confidence 5788889888889999998765443333221 1111111111 1467788874 3467788888
Q ss_pred CCCCcEEeeecCCC-CCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC----c-eecceEEeCCEEEEE
Q 040693 284 ASNGNVLWSTADPS-NGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA----T-IYGGASVSNGCIYMG 357 (382)
Q Consensus 284 ~~tG~~~W~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~----~-~~~~p~~~~g~lyv~ 357 (382)
--+++-+-++.-.- +.-..+.....|+..++.+. .+..+....+.||+.+-++.-.+ . .....+..+..=||-
T Consensus 290 GVS~rCv~t~~~AH~gsevcSa~Ftkn~kyiLsSG-~DS~vkLWEi~t~R~l~~YtGAg~tgrq~~rtqAvFNhtEdyVl 368 (430)
T KOG0640|consen 290 GVSNRCVRTIGNAHGGSEVCSAVFTKNGKYILSSG-KDSTVKLWEISTGRMLKEYTGAGTTGRQKHRTQAVFNHTEDYVL 368 (430)
T ss_pred cccHHHHHHHHhhcCCceeeeEEEccCCeEEeecC-CcceeeeeeecCCceEEEEecCCcccchhhhhhhhhcCccceEE
Confidence 77776555543221 11223333344665555443 57778888999999998875332 1 223344445555554
Q ss_pred eCceeEeecCCccCCCCC
Q 040693 358 NGYKVTVGFGNKNFTSGT 375 (382)
Q Consensus 358 ~~~g~~~~~~~~~~~~g~ 375 (382)
.-+-....+-.-|++|++
T Consensus 369 ~pDEas~slcsWdaRtad 386 (430)
T KOG0640|consen 369 FPDEASNSLCSWDARTAD 386 (430)
T ss_pred ccccccCceeeccccchh
Confidence 443333334445555554
No 296
>PF01453 B_lectin: D-mannose binding lectin; InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Alliaceae) contains a ~115-residue-long domain whose overall three-dimensional fold is very similar to that of Dictyostelium discoideum comitin, an actin-binding protein, and Curculigo latifolia curculin, a sweet-tasting and taste-modifying protein. This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity. Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=47.54 E-value=90 Score=24.58 Aligned_cols=59 Identities=31% Similarity=0.477 Sum_probs=35.7
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECC
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDAS 285 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~ 285 (382)
...+.-..||.|+.+|.. ++.+|...... +.... .-.+.+ ..+|.++.+| .
T Consensus 20 ~~~L~l~~dGnLvl~~~~-~~~iWss~~t~----~~~~~-----~~~~~L------------------~~~GNlvl~d-~ 70 (114)
T PF01453_consen 20 NYTLILQSDGNLVLYDSN-GSVIWSSNNTS----GRGNS-----GCYLVL------------------QDDGNLVLYD-S 70 (114)
T ss_dssp TEEEEEETTSEEEEEETT-TEEEEE--S-T----TSS-S-----SEEEEE------------------ETTSEEEEEE-T
T ss_pred cccceECCCCeEEEEcCC-CCEEEEecccC----Ccccc-----CeEEEE------------------eCCCCEEEEe-e
Confidence 345556668999999876 78899983211 00000 012222 2357788888 6
Q ss_pred CCcEEeee
Q 040693 286 NGNVLWST 293 (382)
Q Consensus 286 tG~~~W~~ 293 (382)
+++.+|+.
T Consensus 71 ~~~~lW~S 78 (114)
T PF01453_consen 71 SGNVLWQS 78 (114)
T ss_dssp TSEEEEES
T ss_pred cceEEEee
Confidence 89999998
No 297
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=46.47 E-value=25 Score=23.07 Aligned_cols=25 Identities=16% Similarity=0.316 Sum_probs=19.9
Q ss_pred cCEEEEeccCccccccccccccccceEEEEeCccCceeee
Q 040693 31 KGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQ 70 (382)
Q Consensus 31 ~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~ 70 (382)
.+.+.+++. +|.|..+.. +|+.+|+
T Consensus 23 mdLiA~~t~--------------~g~v~v~Rl-~~qriw~ 47 (47)
T PF12894_consen 23 MDLIALGTE--------------DGEVLVYRL-NWQRIWS 47 (47)
T ss_pred CCEEEEEEC--------------CCeEEEEEC-CCcCccC
Confidence 367888876 888888887 8888885
No 298
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=46.44 E-value=2.7e+02 Score=26.49 Aligned_cols=54 Identities=15% Similarity=0.008 Sum_probs=32.4
Q ss_pred CceEEEEECCCCcEEeeecCCCCCCCCcceEEe-CC-EEEEeeecCCCcEEEEeCCCC
Q 040693 276 AGGWVAMDASNGNVLWSTADPSNGTAPGPVTVA-NG-VLFGGSTYRQGPIYAMDVKTG 331 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~-~~-~v~~~~~~~~g~l~~ld~~tG 331 (382)
.+.+..+|..++.++-.......+.-.-.+... .+ ++..++ .+|.|...|.+.-
T Consensus 228 ~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~~~~~~~~lvTaS--SDG~I~vWd~~~~ 283 (362)
T KOG0294|consen 228 NEWISLKDTDSDTPLTEFLAHENRVKDIASYTNPEHEYLVTAS--SDGFIKVWDIDME 283 (362)
T ss_pred CceEEEeccCCCccceeeecchhheeeeEEEecCCceEEEEec--cCceEEEEEcccc
Confidence 377899999888888777665533222222122 22 333334 4898888887643
No 299
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=43.35 E-value=3.3e+02 Score=26.75 Aligned_cols=92 Identities=9% Similarity=0.010 Sum_probs=52.5
Q ss_pred eEEEEECCCCcEEeeecCCC-C----CCCCcceEE---eCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCcee-cceE
Q 040693 278 GWVAMDASNGNVLWSTADPS-N----GTAPGPVTV---ANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIY-GGAS 348 (382)
Q Consensus 278 ~v~a~d~~tG~~~W~~~~~~-~----~~~~~~~~~---~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~-~~p~ 348 (382)
.+.-+|-.+|..+|...... + .....+.++ ++++|++... ....+...|+.+++++-++++...+. --|.
T Consensus 361 il~~~d~~dG~pVc~~r~~~~Gs~~~kl~t~~ai~~~~~nn~iv~~gd-~tn~lil~D~~s~evvQ~l~~~epv~Dicp~ 439 (463)
T KOG1645|consen 361 ILGRIDFRDGFPVCGKRRTYFGSKQTKLSTTQAIRAVEDNNYIVVVGD-STNELILQDPHSFEVVQTLALSEPVLDICPN 439 (463)
T ss_pred eeeeeccccCceeeeecccccCCcccccccccceeccccccEEEEecC-CcceeEEeccchhheeeecccCcceeeccee
Confidence 35578888999999865322 1 112223222 3555555442 46789999999999999998875443 2233
Q ss_pred EeCCEEEEEeCceeEeecCCcc
Q 040693 349 VSNGCIYMGNGYKVTVGFGNKN 370 (382)
Q Consensus 349 ~~~g~lyv~~~~g~~~~~~~~~ 370 (382)
-.++.=|++.--...+++|..+
T Consensus 440 ~~n~~syLa~LTd~~v~Iyk~e 461 (463)
T KOG1645|consen 440 DTNGSSYLALLTDDRVHIYKNE 461 (463)
T ss_pred ecCCcchhhheecceEEEEecC
Confidence 3333333332222345555543
No 300
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=42.39 E-value=3.9e+02 Score=27.31 Aligned_cols=50 Identities=12% Similarity=0.102 Sum_probs=29.5
Q ss_pred CCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEE
Q 040693 275 IAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAM 326 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~l 326 (382)
....|..+|-.+|+++........ ..+.++++.+..|+.+..-++++...
T Consensus 509 ed~~Ir~~dn~~~~~l~s~~a~~~--svtslai~~ng~~l~s~s~d~sv~l~ 558 (577)
T KOG0642|consen 509 EDRSIRFFDNKTGKILHSMVAHKD--SVTSLAIDPNGPYLMSGSHDGSVRLW 558 (577)
T ss_pred cCCceecccccccccchheeeccc--eecceeecCCCceEEeecCCceeehh
Confidence 346789999999999988765542 34455555433333332135555443
No 301
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=42.28 E-value=1.7e+02 Score=28.25 Aligned_cols=29 Identities=14% Similarity=0.026 Sum_probs=23.8
Q ss_pred EEEEEccCcEEEEEeCCCCCeeeeeccCC
Q 040693 207 IVVAVQKSGFAWALDRDSGSLIWSMEAGP 235 (382)
Q Consensus 207 ~v~~~~~~g~l~ald~~tG~~~W~~~~~~ 235 (382)
.+..++.||.+..+|...-+.+|.++...
T Consensus 81 ~~aSGs~DG~VkiWnlsqR~~~~~f~AH~ 109 (433)
T KOG0268|consen 81 TVASGSCDGEVKIWNLSQRECIRTFKAHE 109 (433)
T ss_pred hhhccccCceEEEEehhhhhhhheeeccc
Confidence 45677789999999999999999998643
No 302
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=42.06 E-value=3.2e+02 Score=26.27 Aligned_cols=78 Identities=13% Similarity=0.272 Sum_probs=36.4
Q ss_pred eEEEEECCC--CcE-EeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCC------c--EeE-EEecCC----
Q 040693 278 GWVAMDASN--GNV-LWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTG------K--ILW-SYDTGA---- 341 (382)
Q Consensus 278 ~v~a~d~~t--G~~-~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG------~--ilw-~~~~~~---- 341 (382)
+|+.++..+ |+. .++.-........+.....++ ||++. ...|+.+...+| + ++- .+....
T Consensus 48 rI~~l~d~dgdG~~d~~~vfa~~l~~p~Gi~~~~~G-lyV~~---~~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~ 123 (367)
T TIGR02604 48 RILILEDADGDGKYDKSNVFAEELSMVTGLAVAVGG-VYVAT---PPDILFLRDKDGDDKADGEREVLLSGFGGQINNHH 123 (367)
T ss_pred EEEEEEcCCCCCCcceeEEeecCCCCccceeEecCC-EEEeC---CCeEEEEeCCCCCCCCCCccEEEEEccCCCCCccc
Confidence 677777664 332 233211111111222223456 88876 677887843322 1 222 222221
Q ss_pred ceecceEE-eCCEEEEEeC
Q 040693 342 TIYGGASV-SNGCIYMGNG 359 (382)
Q Consensus 342 ~~~~~p~~-~~g~lyv~~~ 359 (382)
.....+.+ .+|+||++.+
T Consensus 124 ~~~~~l~~gpDG~LYv~~G 142 (367)
T TIGR02604 124 HSLNSLAWGPDGWLYFNHG 142 (367)
T ss_pred ccccCceECCCCCEEEecc
Confidence 11233444 4789999766
No 303
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins.; GO: 0005515 protein binding
Probab=41.92 E-value=2.8e+02 Score=25.39 Aligned_cols=101 Identities=19% Similarity=0.272 Sum_probs=59.4
Q ss_pred CceEEEEECCCCcEEeeecCCCCCCC-----------CcceEEeC-CE--EEEeeecCCCc--EEEEeCCCCcE--eEEE
Q 040693 276 AGGWVAMDASNGNVLWSTADPSNGTA-----------PGPVTVAN-GV--LFGGSTYRQGP--IYAMDVKTGKI--LWSY 337 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~~~~~-----------~~~~~~~~-~~--v~~~~~~~~g~--l~~ld~~tG~i--lw~~ 337 (382)
...|..+|+++++..=+..++..... .--++++. |+ ||.... .+|. |--||+++-++ .|..
T Consensus 88 s~~IvkydL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~-~~g~ivvskld~~tL~v~~tw~T 166 (250)
T PF02191_consen 88 SRNIVKYDLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATED-NNGNIVVSKLDPETLSVEQTWNT 166 (250)
T ss_pred CceEEEEECcCCcEEEEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCC-CCCcEEEEeeCcccCceEEEEEe
Confidence 47799999999999844444332111 11233443 43 333332 2343 45678877666 5555
Q ss_pred ecCCceecceEEeCCEEEEEeCcee--EeecCCccCCCCCeE
Q 040693 338 DTGATIYGGASVSNGCIYMGNGYKV--TVGFGNKNFTSGTSL 377 (382)
Q Consensus 338 ~~~~~~~~~p~~~~g~lyv~~~~g~--~~~~~~~~~~~g~~l 377 (382)
.......+..-++=|.||+..+... .--.|++|..+++..
T Consensus 167 ~~~k~~~~naFmvCGvLY~~~s~~~~~~~I~yafDt~t~~~~ 208 (250)
T PF02191_consen 167 SYPKRSAGNAFMVCGVLYATDSYDTRDTEIFYAFDTYTGKEE 208 (250)
T ss_pred ccCchhhcceeeEeeEEEEEEECCCCCcEEEEEEECCCCcee
Confidence 5555566667778899999876431 111377777777654
No 304
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=40.97 E-value=69 Score=20.43 Aligned_cols=26 Identities=12% Similarity=0.225 Sum_probs=21.0
Q ss_pred cceEEeCCEEEEeeecCCCcEEEEeCCC
Q 040693 303 GPVTVANGVLFGGSTYRQGPIYAMDVKT 330 (382)
Q Consensus 303 ~~~~~~~~~v~~~~~~~~g~l~~ld~~t 330 (382)
..+.+.++++|++.. .+.|..+|..+
T Consensus 5 ~~v~v~g~yaYva~~--~~Gl~IvDISn 30 (42)
T PF08309_consen 5 RDVAVSGNYAYVADG--NNGLVIVDISN 30 (42)
T ss_pred EEEEEECCEEEEEeC--CCCEEEEECCC
Confidence 346678999999974 78899999864
No 305
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=40.28 E-value=87 Score=31.85 Aligned_cols=109 Identities=19% Similarity=0.224 Sum_probs=63.5
Q ss_pred EEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee-eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCC
Q 040693 209 VAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT-DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNG 287 (382)
Q Consensus 209 ~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG 287 (382)
++.+-||.+..+|+=-|+++-+....+....++..-.... +...+..+ ++-..+|..+|.+.+
T Consensus 798 ~i~ScD~giHlWDPFigr~Laq~~dapk~~a~~~ikcl~nv~~~iliAg----------------csaeSTVKl~DaRsc 861 (1034)
T KOG4190|consen 798 SIASCDGGIHLWDPFIGRLLAQMEDAPKEGAGGNIKCLENVDRHILIAG----------------CSAESTVKLFDARSC 861 (1034)
T ss_pred eeeeccCcceeecccccchhHhhhcCcccCCCceeEecccCcchheeee----------------ccchhhheeeecccc
Confidence 4455577799999999988877665544333333333222 33333332 123477999999998
Q ss_pred c--EEeeecC-CCCCCCCcceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeE
Q 040693 288 N--VLWSTAD-PSNGTAPGPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILW 335 (382)
Q Consensus 288 ~--~~W~~~~-~~~~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw 335 (382)
+ -.|+.-. +.+......+.+. ++.+-++- .+|-+..+|..+|+++-
T Consensus 862 e~~~E~kVcna~~Pna~~R~iaVa~~GN~lAa~L--SnGci~~LDaR~G~vIN 912 (1034)
T KOG4190|consen 862 EWTCELKVCNAPGPNALTRAIAVADKGNKLAAAL--SNGCIAILDARNGKVIN 912 (1034)
T ss_pred cceeeEEeccCCCCchheeEEEeccCcchhhHHh--cCCcEEEEecCCCceec
Confidence 6 3455432 2211222334443 33444443 38999999999999754
No 306
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes an uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=40.24 E-value=3.3e+02 Score=25.85 Aligned_cols=68 Identities=18% Similarity=0.203 Sum_probs=42.0
Q ss_pred CcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEE
Q 040693 141 HSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWAL 220 (382)
Q Consensus 141 ~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~al 220 (382)
-.|.|-.+|+ +|+.+-++..... ++.| .++.-.|.=+. .-++.+|+-...||++.++
T Consensus 220 G~G~VdvFd~-~G~l~~r~as~g~---------LNaP---------WG~a~APa~FG----~~sg~lLVGNFGDG~InaF 276 (336)
T TIGR03118 220 GLGYVNVFTL-NGQLLRRVASSGR---------LNAP---------WGLAIAPESFG----SLSGALLVGNFGDGTINAY 276 (336)
T ss_pred CcceEEEEcC-CCcEEEEeccCCc---------ccCC---------ceeeeChhhhC----CCCCCeEEeecCCceeEEe
Confidence 3488999996 5999999865543 2222 12222221110 0125566666789999999
Q ss_pred eCCCCCeeeee
Q 040693 221 DRDSGSLIWSM 231 (382)
Q Consensus 221 d~~tG~~~W~~ 231 (382)
|+.+|+.+=..
T Consensus 277 D~~sG~~~g~L 287 (336)
T TIGR03118 277 DPQSGAQLGQL 287 (336)
T ss_pred cCCCCceeeee
Confidence 99999765333
No 307
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=40.21 E-value=4.4e+02 Score=27.28 Aligned_cols=50 Identities=14% Similarity=0.098 Sum_probs=30.7
Q ss_pred CCEEEEeeecCCCcEEEEeCCCCcEeEEE-ecCCceecceE-EeCCEEEEEeCc
Q 040693 309 NGVLFGGSTYRQGPIYAMDVKTGKILWSY-DTGATIYGGAS-VSNGCIYMGNGY 360 (382)
Q Consensus 309 ~~~v~~~~~~~~g~l~~ld~~tG~ilw~~-~~~~~~~~~p~-~~~g~lyv~~~~ 360 (382)
...+.... ..|.+.+.+..+|+++-.. ...+....... -.++++.+++..
T Consensus 344 ~~~v~~~n--s~g~L~van~stG~~v~sv~q~Rg~nit~~~~d~~g~lWlgs~q 395 (671)
T COG3292 344 SWGVRQLN--SIGELMVANGSTGELVRSVHQLRGMNITTTLEDSRGRLWLGSMQ 395 (671)
T ss_pred ccceeecc--ccceEEEecCCCCcEEEEeeeccccccchhhhccCCcEEEEecc
Confidence 34444444 2678899999998875543 33333333322 257899998876
No 308
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=39.73 E-value=4.4e+02 Score=27.06 Aligned_cols=22 Identities=32% Similarity=0.404 Sum_probs=18.1
Q ss_pred ceEEEEECCCCcEEEEEecCCC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGY 164 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~ 164 (382)
..|+-+|.+.||++=+......
T Consensus 356 ~~l~klDIE~GKIVeEWk~~~d 377 (644)
T KOG2395|consen 356 DKLYKLDIERGKIVEEWKFEDD 377 (644)
T ss_pred CcceeeecccceeeeEeeccCC
Confidence 7899999999999966666544
No 309
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=39.17 E-value=80 Score=25.95 Aligned_cols=30 Identities=23% Similarity=0.246 Sum_probs=25.0
Q ss_pred EcCEEEEeccCccccccccccccccceEEEEeCccCceeeeeecc
Q 040693 30 YKGAYYVGTSSIEEGLTFELCCTFQGSLAKLDAKTGRILWQTFML 74 (382)
Q Consensus 30 ~~~~v~v~~~~~~~~~~~~~~~~~~g~l~ald~~tG~~lW~~~~~ 74 (382)
..+.|++|+ +..|.|+|.+.-+-+..+++.
T Consensus 63 ~~D~LliGt---------------~t~llaYDV~~N~d~Fyke~~ 92 (136)
T PF14781_consen 63 GRDCLLIGT---------------QTSLLAYDVENNSDLFYKEVP 92 (136)
T ss_pred CcCEEEEec---------------cceEEEEEcccCchhhhhhCc
Confidence 458899999 568999999998888887774
No 310
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=38.76 E-value=3.6e+02 Score=25.82 Aligned_cols=99 Identities=14% Similarity=0.150 Sum_probs=51.9
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEE-eCCEEEEeeecC----CC-----cEEEEeCC------CCcEeEEE
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTV-ANGVLFGGSTYR----QG-----PIYAMDVK------TGKILWSY 337 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~-~~~~v~~~~~~~----~g-----~l~~ld~~------tG~ilw~~ 337 (382)
+..+....+|.+|..+. ..+...... ..++.+ .++.||+-+... .+ .+.+|... ....-|..
T Consensus 83 d~~~~t~vyDt~t~av~-~~P~l~~pk-~~pisv~VG~~LY~m~~~~~~~~~~~~~~~~FE~l~~~~~~~~~~~~~~w~W 160 (342)
T PF07893_consen 83 DQSGRTLVYDTDTRAVA-TGPRLHSPK-RCPISVSVGDKLYAMDRSPFPEPAGRPDFPCFEALVYRPPPDDPSPEESWSW 160 (342)
T ss_pred cCCCCeEEEECCCCeEe-ccCCCCCCC-cceEEEEeCCeEEEeeccCccccccCccceeEEEeccccccccccCCCcceE
Confidence 34467889999988877 221111111 223322 366688876411 11 55566332 34555554
Q ss_pred ecC--Cce----------ecceEEe-CCEEEEEeCceeEeecCCccCCCCC
Q 040693 338 DTG--ATI----------YGGASVS-NGCIYMGNGYKVTVGFGNKNFTSGT 375 (382)
Q Consensus 338 ~~~--~~~----------~~~p~~~-~g~lyv~~~~g~~~~~~~~~~~~g~ 375 (382)
..- .++ ..+-+++ +..|||+..... .+.|+||+.+++
T Consensus 161 ~~LP~PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~~-~GTysfDt~~~~ 210 (342)
T PF07893_consen 161 RSLPPPPFVRDRRYSDYRITSYAVVDGRTIFVSVNGRR-WGTYSFDTESHE 210 (342)
T ss_pred EcCCCCCccccCCcccceEEEEEEecCCeEEEEecCCc-eEEEEEEcCCcc
Confidence 322 111 3344445 788888665421 346999997764
No 311
>PF14517 Tachylectin: Tachylectin; PDB: 1TL2_A.
Probab=38.42 E-value=3e+02 Score=24.84 Aligned_cols=28 Identities=18% Similarity=0.323 Sum_probs=15.8
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCC
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGP 235 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~ 235 (382)
-..+....++.|++++ ..|++ |+...+.
T Consensus 180 ~~~i~~~~~g~L~~V~-~~G~l-yr~~~p~ 207 (229)
T PF14517_consen 180 FHFIFFSPDGNLWAVK-SNGKL-YRGRPPQ 207 (229)
T ss_dssp EEEEEE-TTS-EEEE--ETTEE-EEES---
T ss_pred ceEEeeCCCCcEEEEe-cCCEE-eccCCcc
Confidence 4556777889999994 34655 8877654
No 312
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=38.40 E-value=3.6e+02 Score=25.73 Aligned_cols=160 Identities=16% Similarity=0.148 Sum_probs=89.2
Q ss_pred cEEEEEccCcEEEEEeCC-CCCeeeeeccCCCCCC--CCcccceee-eCCeEEEEecCcc--ccccccCCCCCCCCCceE
Q 040693 206 DIVVAVQKSGFAWALDRD-SGSLIWSMEAGPGGLG--GGAMWGAAT-DERRIYTNIANSQ--HKNFNLKPSKNSTIAGGW 279 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~-tG~~~W~~~~~~~~~~--g~~~~~~~~-~~~~v~~~~~~~~--~~~~~~~~~~~~~~~g~v 279 (382)
..+++.+.=+=|..+|.. +=+++|+-+.-..-.. ....=+.+. ++...|++.-... ...|... ..+|.
T Consensus 114 ~l~fVNT~fSCLatl~~~~SF~P~WkPpFIs~la~eDRCHLNGlA~~~g~p~yVTa~~~sD~~~gWR~~-----~~~gG- 187 (335)
T TIGR03032 114 RLLFVNTLFSCLATVSPDYSFVPLWKPPFISKLAPEDRCHLNGMALDDGEPRYVTALSQSDVADGWREG-----RRDGG- 187 (335)
T ss_pred cEEEEECcceeEEEECCCCccccccCCccccccCccCceeecceeeeCCeEEEEEEeeccCCccccccc-----ccCCe-
Confidence 445555544445555554 4578888553321100 011112222 5566776643211 1122211 12233
Q ss_pred EEEECCCCcEEeee-cCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEEEEe
Q 040693 280 VAMDASNGNVLWST-ADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIYMGN 358 (382)
Q Consensus 280 ~a~d~~tG~~~W~~-~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~ 358 (382)
..+|+.+++++=+- .++. +| -..++++|+... ..|.|+.+|+++|+..=-...++ ........++.+||+.
T Consensus 188 ~vidv~s~evl~~GLsmPh-----SP-RWhdgrLwvlds-gtGev~~vD~~~G~~e~Va~vpG-~~rGL~f~G~llvVgm 259 (335)
T TIGR03032 188 CVIDIPSGEVVASGLSMPH-----SP-RWYQGKLWLLNS-GRGELGYVDPQAGKFQPVAFLPG-FTRGLAFAGDFAFVGL 259 (335)
T ss_pred EEEEeCCCCEEEcCccCCc-----CC-cEeCCeEEEEEC-CCCEEEEEcCCCCcEEEEEECCC-CCcccceeCCEEEEEe
Confidence 33899999887432 2222 23 356677777764 58999999999998776667765 4455556678888876
Q ss_pred Ccee-----------------EeecCCccCCCCCeEEE
Q 040693 359 GYKV-----------------TVGFGNKNFTSGTSLYA 379 (382)
Q Consensus 359 ~~g~-----------------~~~~~~~~~~~g~~l~~ 379 (382)
+.-+ ..++..+|-+||+++=.
T Consensus 260 Sk~R~~~~f~glpl~~~l~~~~CGv~vidl~tG~vv~~ 297 (335)
T TIGR03032 260 SKLRESRVFGGLPIEERLDALGCGVAVIDLNSGDVVHW 297 (335)
T ss_pred ccccCCCCcCCCchhhhhhhhcccEEEEECCCCCEEEE
Confidence 5211 25667778888886543
No 313
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=37.79 E-value=2e+02 Score=22.72 Aligned_cols=20 Identities=10% Similarity=0.177 Sum_probs=15.6
Q ss_pred ceEEEEECCCCcEEEEEecCCC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGY 164 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~ 164 (382)
|.|-+++. .+.+|+.+....
T Consensus 63 GTVGvY~~--~~RlWRiKSK~~ 82 (111)
T PF14783_consen 63 GTVGVYDR--SQRLWRIKSKNQ 82 (111)
T ss_pred CEEEEEeC--cceeeeeccCCC
Confidence 78888874 799999986643
No 314
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (EC 4.3.3.2), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom.; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=37.46 E-value=87 Score=23.67 Aligned_cols=61 Identities=20% Similarity=0.153 Sum_probs=32.1
Q ss_pred CEEEEeccCccccccc----cccccccceEEEEeCccCce-eeeeeccCCCCCCCCCCcCccccCCCceeeCCCCeEEEE
Q 040693 32 GAYYVGTSSIEEGLTF----ELCCTFQGSLAKLDAKTGRI-LWQTFMLPDNFGKLNEYAGAAIWGSSPSIDPIRNHVYIA 106 (382)
Q Consensus 32 ~~v~v~~~~~~~~~~~----~~~~~~~g~l~ald~~tG~~-lW~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~v~ 106 (382)
+.||+..++.++.... ..--.+.|+|..+|++|++. +=...+.- .. -..+.+++..|+|+
T Consensus 10 g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~~~~vl~~~L~f---------pN------GVals~d~~~vlv~ 74 (89)
T PF03088_consen 10 GTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTKETTVLLDGLYF---------PN------GVALSPDESFVLVA 74 (89)
T ss_dssp --EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTTEEEEEEEEESS---------EE------EEEE-TTSSEEEEE
T ss_pred CEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCCeEEEehhCCCc---------cC------eEEEcCCCCEEEEE
Confidence 6788887776555441 11122479999999999986 33333321 11 15677777777775
Q ss_pred c
Q 040693 107 T 107 (382)
Q Consensus 107 ~ 107 (382)
-
T Consensus 75 E 75 (89)
T PF03088_consen 75 E 75 (89)
T ss_dssp E
T ss_pred e
Confidence 3
No 315
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.46 E-value=63 Score=31.06 Aligned_cols=58 Identities=19% Similarity=0.265 Sum_probs=41.2
Q ss_pred EEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEe--cCC--ceecceEEeCCEEEEEeCceeE
Q 040693 306 TVANGVLFGGSTYRQGPIYAMDVKTGKILWSYD--TGA--TIYGGASVSNGCIYMGNGYKVT 363 (382)
Q Consensus 306 ~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~--~~~--~~~~~p~~~~g~lyv~~~~g~~ 363 (382)
.+.++.+|++-.+.....|.+|++...--|+.. .++ .-....++.+++||+..+.|..
T Consensus 43 a~ig~~~YVGLGs~G~afy~ldL~~~~k~W~~~a~FpG~~rnqa~~a~~~~kLyvFgG~Gk~ 104 (381)
T COG3055 43 ALIGDTVYVGLGSAGTAFYVLDLKKPGKGWTKIADFPGGARNQAVAAVIGGKLYVFGGYGKS 104 (381)
T ss_pred ceecceEEEEeccCCccceehhhhcCCCCceEcccCCCcccccchheeeCCeEEEeeccccC
Confidence 355667888754236689999999887777653 223 2345567799999999988774
No 316
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta-propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=37.34 E-value=2.7e+02 Score=24.05 Aligned_cols=92 Identities=12% Similarity=0.164 Sum_probs=50.9
Q ss_pred ceEEEEECCCCcEEeeecCCCCCCCCcceEE--eCCEEEEeee-cCCCcEEEEeCCCCcEeEEEecCCceecceEE-eCC
Q 040693 277 GGWVAMDASNGNVLWSTADPSNGTAPGPVTV--ANGVLFGGST-YRQGPIYAMDVKTGKILWSYDTGATIYGGASV-SNG 352 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~--~~~~v~~~~~-~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~-~~g 352 (382)
..+..+|.+ ++++.+..... ...+.. .+.++.++.. ...|.|...|.++.+++-+.+.... ....- .+|
T Consensus 83 ~~v~lyd~~-~~~i~~~~~~~----~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~~~~i~~~~~~~~--t~~~WsPdG 155 (194)
T PF08662_consen 83 AKVTLYDVK-GKKIFSFGTQP----RNTISWSPDGRFLVLAGFGNLNGDLEFWDVRKKKKISTFEHSDA--TDVEWSPDG 155 (194)
T ss_pred cccEEEcCc-ccEeEeecCCC----ceEEEECCCCCEEEEEEccCCCcEEEEEECCCCEEeeccccCcE--EEEEEcCCC
Confidence 468888886 77777765321 122322 2445555542 1257899999999888877665432 11111 456
Q ss_pred EEEEEeCceeEeecCCccCCCCCeEEEE
Q 040693 353 CIYMGNGYKVTVGFGNKNFTSGTSLYAF 380 (382)
Q Consensus 353 ~lyv~~~~g~~~~~~~~~~~~g~~l~~~ 380 (382)
+.+++... ...+...+|=.||.|
T Consensus 156 r~~~ta~t-----~~r~~~dng~~Iw~~ 178 (194)
T PF08662_consen 156 RYLATATT-----SPRLRVDNGFKIWSF 178 (194)
T ss_pred CEEEEEEe-----ccceeccccEEEEEe
Confidence 66655331 122444455555554
No 317
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=36.73 E-value=2.2e+02 Score=30.27 Aligned_cols=102 Identities=12% Similarity=0.046 Sum_probs=56.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEEC
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDA 284 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~ 284 (382)
-.+|+.++.||.+-++|...-+-.=.+..+.. ..-...+.+. . +..|++. ...|.|..+|+
T Consensus 146 p~iliSGSQDg~vK~~DlR~~~S~~t~~~nSE-SiRDV~fsp~-~-~~~F~s~----------------~dsG~lqlWDl 206 (839)
T KOG0269|consen 146 PNILISGSQDGTVKCWDLRSKKSKSTFRSNSE-SIRDVKFSPG-Y-GNKFASI----------------HDSGYLQLWDL 206 (839)
T ss_pred ccEEEecCCCceEEEEeeecccccccccccch-hhhceeeccC-C-CceEEEe----------------cCCceEEEeec
Confidence 46788999999999999886655544443221 1111233332 2 3334432 23577888887
Q ss_pred CCCcEEeeecCCCCCCCCcceEE---eCCEEEEeeecCCCcEEEEeCC
Q 040693 285 SNGNVLWSTADPSNGTAPGPVTV---ANGVLFGGSTYRQGPIYAMDVK 329 (382)
Q Consensus 285 ~tG~~~W~~~~~~~~~~~~~~~~---~~~~v~~~~~~~~g~l~~ld~~ 329 (382)
+.- .+|...+.. ..+|+.. .-++.|+++.++++.+...|..
T Consensus 207 Rqp-~r~~~k~~A---H~GpV~c~nwhPnr~~lATGGRDK~vkiWd~t 250 (839)
T KOG0269|consen 207 RQP-DRCEKKLTA---HNGPVLCLNWHPNREWLATGGRDKMVKIWDMT 250 (839)
T ss_pred cCc-hhHHHHhhc---ccCceEEEeecCCCceeeecCCCccEEEEecc
Confidence 632 233333221 2334322 1266777776667776666655
No 318
>PF09826 Beta_propel: Beta propeller domain; InterPro: IPR019198 This entry consists of predicted secreted proteins containing a C-terminal beta-propeller domain distantly related to WD-40 repeats.
Probab=36.70 E-value=2.1e+02 Score=29.31 Aligned_cols=51 Identities=14% Similarity=0.154 Sum_probs=39.6
Q ss_pred ceEEeCCEEEEeeecCCCcEEEEeC---CCCcEeEEEecCCceecceEEeCCEEEEEe
Q 040693 304 PVTVANGVLFGGSTYRQGPIYAMDV---KTGKILWSYDTGATIYGGASVSNGCIYMGN 358 (382)
Q Consensus 304 ~~~~~~~~v~~~~~~~~g~l~~ld~---~tG~ilw~~~~~~~~~~~p~~~~g~lyv~~ 358 (382)
-+-.++.+||+.. +++|+.+|+ ++-+++-+.+.+.. .....+.+++|.|-.
T Consensus 17 iVKTDG~yIY~v~---~~~l~Iida~p~~~~~~~s~I~~~~~-~~eLyl~gdrLvVi~ 70 (521)
T PF09826_consen 17 IVKTDGEYIYVVS---GGRLYIIDAYPAEEMKVVSRIDLDGS-PQELYLDGDRLVVIG 70 (521)
T ss_pred EEEECCCEEEEEe---CCEEEEEECCCchhceEEEEEecCCC-hhheEEcCCEEEEEE
Confidence 3344677899988 799999998 56788888888766 556677889988865
No 319
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=36.67 E-value=4.7e+02 Score=26.54 Aligned_cols=188 Identities=12% Similarity=0.080 Sum_probs=0.0
Q ss_pred CeEEEEcCCCCCCCcchhhcccccCCCCCCCCCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCC
Q 040693 101 NHVYIATGNLYSVPLHIRQCQEENNQTTPTSPDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCP 180 (382)
Q Consensus 101 ~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~ 180 (382)
+.|.|+.+ ..||..+..+|++.=-.... .
T Consensus 189 n~laValg----------------------------------~~vylW~~~s~~v~~l~~~~-~---------------- 217 (484)
T KOG0305|consen 189 NVLAVALG----------------------------------QSVYLWSASSGSVTELCSFG-E---------------- 217 (484)
T ss_pred CeEEEEec----------------------------------ceEEEEecCCCceEEeEecC-C----------------
Q ss_pred CCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCC-CCCcccceeeeCCeEEEEecC
Q 040693 181 PGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGL-GGGAMWGAATDERRIYTNIAN 259 (382)
Q Consensus 181 ~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~-~g~~~~~~~~~~~~v~~~~~~ 259 (382)
....-+....+| ..|.++..+|.+..+|.++-+.+=... . ........+..+..+-.+
T Consensus 218 ----------~~vtSv~ws~~G---~~LavG~~~g~v~iwD~~~~k~~~~~~-----~~h~~rvg~laW~~~~lssG--- 276 (484)
T KOG0305|consen 218 ----------ELVTSVKWSPDG---SHLAVGTSDGTVQIWDVKEQKKTRTLR-----GSHASRVGSLAWNSSVLSSG--- 276 (484)
T ss_pred ----------CceEEEEECCCC---CEEEEeecCCeEEEEehhhcccccccc-----CCcCceeEEEeccCceEEEe---
Q ss_pred ccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEec
Q 040693 260 SQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDT 339 (382)
Q Consensus 260 ~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~ 339 (382)
..++.+.-+|.+..+..=+ ....-.....-+...-+..+.++.+.++.++..|..+-+++-++..
T Consensus 277 --------------sr~~~I~~~dvR~~~~~~~-~~~~H~qeVCgLkws~d~~~lASGgnDN~~~Iwd~~~~~p~~~~~~ 341 (484)
T KOG0305|consen 277 --------------SRDGKILNHDVRISQHVVS-TLQGHRQEVCGLKWSPDGNQLASGGNDNVVFIWDGLSPEPKFTFTE 341 (484)
T ss_pred --------------cCCCcEEEEEEecchhhhh-hhhcccceeeeeEECCCCCeeccCCCccceEeccCCCccccEEEec
Q ss_pred CCceecceEE---eCCEEEEEeC-ceeEeecCCccCCCCCeE
Q 040693 340 GATIYGGASV---SNGCIYMGNG-YKVTVGFGNKNFTSGTSL 377 (382)
Q Consensus 340 ~~~~~~~p~~---~~g~lyv~~~-~g~~~~~~~~~~~~g~~l 377 (382)
-.+..-..+. ..+-|-++.+ ...+.+|++.+ +|+.+
T Consensus 342 H~aAVKA~awcP~q~~lLAsGGGs~D~~i~fwn~~--~g~~i 381 (484)
T KOG0305|consen 342 HTAAVKALAWCPWQSGLLATGGGSADRCIKFWNTN--TGARI 381 (484)
T ss_pred cceeeeEeeeCCCccCceEEcCCCcccEEEEEEcC--CCcEe
No 320
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the InterPro IPR001258 repeat.; PDB: 3QQZ_A.
Probab=36.47 E-value=1.4e+02 Score=27.21 Aligned_cols=62 Identities=18% Similarity=0.269 Sum_probs=37.0
Q ss_pred eCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeC
Q 040693 249 DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDV 328 (382)
Q Consensus 249 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~ 328 (382)
+.+.+|+...+ .+.|+.++. +|+++.++++.......+-..+.++.+++.. ++.+.|+.++.
T Consensus 32 d~~tLfaV~d~----------------~~~i~els~-~G~vlr~i~l~g~~D~EgI~y~g~~~~vl~~-Er~~~L~~~~~ 93 (248)
T PF06977_consen 32 DTGTLFAVQDE----------------PGEIYELSL-DGKVLRRIPLDGFGDYEGITYLGNGRYVLSE-ERDQRLYIFTI 93 (248)
T ss_dssp TTTEEEEEETT----------------TTEEEEEET-T--EEEEEE-SS-SSEEEEEE-STTEEEEEE-TTTTEEEEEEE
T ss_pred CCCeEEEEECC----------------CCEEEEEcC-CCCEEEEEeCCCCCCceeEEEECCCEEEEEE-cCCCcEEEEEE
Confidence 45778877543 478999996 6999999987652211222234456555544 36788888877
No 321
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (EC 3.1.3.8) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture.; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=35.40 E-value=4.4e+02 Score=25.83 Aligned_cols=142 Identities=15% Similarity=0.291 Sum_probs=63.5
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeE-EEEecCccccccccCCCCCCCCCc--eEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRI-YTNIANSQHKNFNLKPSKNSTIAG--GWVA 281 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v-~~~~~~~~~~~~~~~~~~~~~~~g--~v~a 281 (382)
..+|+...+++-|+.+|.+ |+.+-.++.+... .-...++....+..+ ++...+.. .... .++.
T Consensus 68 kSlIigTdK~~GL~VYdL~-Gk~lq~~~~Gr~N-NVDvrygf~l~g~~vDlavas~R~------------~g~n~l~~f~ 133 (381)
T PF02333_consen 68 KSLIIGTDKKGGLYVYDLD-GKELQSLPVGRPN-NVDVRYGFPLNGKTVDLAVASDRS------------DGRNSLRLFR 133 (381)
T ss_dssp G-EEEEEETTTEEEEEETT-S-EEEEE-SS-EE-EEEEEEEEEETTEEEEEEEEEE-C------------CCT-EEEEEE
T ss_pred cceEEEEeCCCCEEEEcCC-CcEEEeecCCCcc-eeeeecceecCCceEEEEEEecCc------------CCCCeEEEEE
Confidence 4567777778889999987 9988777643210 000112211122222 11222210 0012 3788
Q ss_pred EECCCCcEEeeecCCCC---CC--CCcceEEe---CCEEEEeeecCCCcEEEEeC---CCC----cEeEEEecCCceecc
Q 040693 282 MDASNGNVLWSTADPSN---GT--APGPVTVA---NGVLFGGSTYRQGPIYAMDV---KTG----KILWSYDTGATIYGG 346 (382)
Q Consensus 282 ~d~~tG~~~W~~~~~~~---~~--~~~~~~~~---~~~v~~~~~~~~g~l~~ld~---~tG----~ilw~~~~~~~~~~~ 346 (382)
||+.+|.+.-......+ .. ..+..++. ++.+|+-....+|.+..+-+ .+| +++.++.++.. ...
T Consensus 134 id~~~g~L~~v~~~~~p~~~~~~e~yGlcly~~~~~g~~ya~v~~k~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ-~EG 212 (381)
T PF02333_consen 134 IDPDTGELTDVTDPAAPIATDLSEPYGLCLYRSPSTGALYAFVNGKDGRVEQYELTDDGDGKVSATLVREFKVGSQ-PEG 212 (381)
T ss_dssp EETTTTEEEE-CBTTC-EE-SSSSEEEEEEEE-TTT--EEEEEEETTSEEEEEEEEE-TTSSEEEEEEEEEE-SS--EEE
T ss_pred ecCCCCcceEcCCCCcccccccccceeeEEeecCCCCcEEEEEecCCceEEEEEEEeCCCCcEeeEEEEEecCCCc-ceE
Confidence 89888865422211110 00 01222221 23333322224665544432 344 45777777552 344
Q ss_pred eEEeC--CEEEEEeCce
Q 040693 347 ASVSN--GCIYMGNGYK 361 (382)
Q Consensus 347 p~~~~--g~lyv~~~~g 361 (382)
.++++ +.||++-.+-
T Consensus 213 CVVDDe~g~LYvgEE~~ 229 (381)
T PF02333_consen 213 CVVDDETGRLYVGEEDV 229 (381)
T ss_dssp EEEETTTTEEEEEETTT
T ss_pred EEEecccCCEEEecCcc
Confidence 55544 7999998763
No 322
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=35.26 E-value=3.9e+02 Score=25.22 Aligned_cols=115 Identities=17% Similarity=0.203 Sum_probs=66.7
Q ss_pred CCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEE
Q 040693 139 ENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAW 218 (382)
Q Consensus 139 ~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ 218 (382)
.+++..|...|..|=++.-.+.+++. .|...++|+..+ ..+|-+++.+-.+.
T Consensus 120 sSFDhtlKVWDtnTlQ~a~~F~me~~---------------------VYshamSp~a~s-------HcLiA~gtr~~~Vr 171 (397)
T KOG4283|consen 120 SSFDHTLKVWDTNTLQEAVDFKMEGK---------------------VYSHAMSPMAMS-------HCLIAAGTRDVQVR 171 (397)
T ss_pred ccccceEEEeecccceeeEEeecCce---------------------eehhhcChhhhc-------ceEEEEecCCCcEE
Confidence 34568899999999999999998876 233345666542 56677777788999
Q ss_pred EEeCCCCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCc
Q 040693 219 ALDRDSGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGN 288 (382)
Q Consensus 219 ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~ 288 (382)
..|.++|.--=...--.. ..-...|.|- .+-.++.+..+..-+.|-++. -+|-+.++|..+++
T Consensus 172 LCDi~SGs~sH~LsGHr~-~vlaV~Wsp~-~e~vLatgsaDg~irlWDiRr-----asgcf~~lD~hn~k 234 (397)
T KOG4283|consen 172 LCDIASGSFSHTLSGHRD-GVLAVEWSPS-SEWVLATGSADGAIRLWDIRR-----ASGCFRVLDQHNTK 234 (397)
T ss_pred EEeccCCcceeeeccccC-ceEEEEeccC-ceeEEEecCCCceEEEEEeec-----ccceeEEeecccCc
Confidence 999999864333221000 0001122221 233444444443334444432 13557777777663
No 323
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins.; GO: 0005515 protein binding
Probab=35.15 E-value=3.5e+02 Score=24.68 Aligned_cols=187 Identities=18% Similarity=0.141 Sum_probs=94.5
Q ss_pred cceEEEEECCCCcEEEEEecCCCc-ccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEE
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYD-VWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWAL 220 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~al 220 (382)
...++.++...|..+.+++....- ...-....+.-|.. ..+..-+++. +.+.+.......+.-+
T Consensus 30 ~~~iy~~~~~~~~~v~ey~~~~~f~~~~~~~~~~~Lp~~--------~~GtG~vVYn-------gslYY~~~~s~~Ivky 94 (250)
T PF02191_consen 30 SEKIYVTSGFSGNTVYEYRNYEDFLRNGRSSRTYKLPYP--------WQGTGHVVYN-------GSLYYNKYNSRNIVKY 94 (250)
T ss_pred CCCEEEECccCCCEEEEEcCHhHHhhcCCCceEEEEece--------eccCCeEEEC-------CcEEEEecCCceEEEE
Confidence 378999999988988888755431 00000000000000 1112223330 2223333456789999
Q ss_pred eCCCCCeeeeeccCCCCCC--------CCcccceeeeCCeEEEEecCccccccccCCCCCCCCCce--EEEEECCCCc--
Q 040693 221 DRDSGSLIWSMEAGPGGLG--------GGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGG--WVAMDASNGN-- 288 (382)
Q Consensus 221 d~~tG~~~W~~~~~~~~~~--------g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~--v~a~d~~tG~-- 288 (382)
|..+++..-+..++..... +....-.++|+.-|++.-... ...|. |.-+|+.+-+
T Consensus 95 dL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~-------------~~~g~ivvskld~~tL~v~ 161 (250)
T PF02191_consen 95 DLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATE-------------DNNGNIVVSKLDPETLSVE 161 (250)
T ss_pred ECcCCcEEEEEECCccccccccceecCCCceEEEEEcCCCEEEEEecC-------------CCCCcEEEEeeCcccCceE
Confidence 9999998833343321111 111122344554444432111 11233 4578888884
Q ss_pred EEeeecCCCCCCCCcceEEeCCEEEEeeecC---CCcEEEEeCCCCcEeEEEecC--Cc--eecceEE--eCCEEEEEeC
Q 040693 289 VLWSTADPSNGTAPGPVTVANGVLFGGSTYR---QGPIYAMDVKTGKILWSYDTG--AT--IYGGASV--SNGCIYMGNG 359 (382)
Q Consensus 289 ~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~---~g~l~~ld~~tG~ilw~~~~~--~~--~~~~p~~--~~g~lyv~~~ 359 (382)
..|++..... ..+...+.=|.+|+..... ....+++|..+++.. ...++ .. ..+.... .+.+||+=+.
T Consensus 162 ~tw~T~~~k~--~~~naFmvCGvLY~~~s~~~~~~~I~yafDt~t~~~~-~~~i~f~~~~~~~~~l~YNP~dk~LY~wd~ 238 (250)
T PF02191_consen 162 QTWNTSYPKR--SAGNAFMVCGVLYATDSYDTRDTEIFYAFDTYTGKEE-DVSIPFPNPYGNISMLSYNPRDKKLYAWDN 238 (250)
T ss_pred EEEEeccCch--hhcceeeEeeEEEEEEECCCCCcEEEEEEECCCCcee-ceeeeeccccCceEeeeECCCCCeEEEEEC
Confidence 5677776653 2333334457888876522 234689999988765 22222 21 2222222 6788888754
No 324
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=34.99 E-value=1.9e+02 Score=27.22 Aligned_cols=73 Identities=15% Similarity=0.194 Sum_probs=42.8
Q ss_pred ecEEEEEccCcEEEEEeCC-CCCeeeeeccCCCCCCCC-cccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRD-SGSLIWSMEAGPGGLGGG-AMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~-tG~~~W~~~~~~~~~~g~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
.++||.++.|+.+.|.|.. -++-+|+...-- ..|. +.....-+...|+++ ..+..+..+
T Consensus 178 pnlvytGgDD~~l~~~D~R~p~~~i~~n~kvH--~~GV~SI~ss~~~~~~I~TG-----------------sYDe~i~~~ 238 (339)
T KOG0280|consen 178 PNLVYTGGDDGSLSCWDIRIPKTFIWHNSKVH--TSGVVSIYSSPPKPTYIATG-----------------SYDECIRVL 238 (339)
T ss_pred CceEEecCCCceEEEEEecCCcceeeecceee--ecceEEEecCCCCCceEEEe-----------------ccccceeee
Confidence 4788999999999999988 667788743210 0000 001100123345544 455778888
Q ss_pred ECC-CCcEEeeecCC
Q 040693 283 DAS-NGNVLWSTADP 296 (382)
Q Consensus 283 d~~-tG~~~W~~~~~ 296 (382)
|.+ -||++...+..
T Consensus 239 DtRnm~kPl~~~~v~ 253 (339)
T KOG0280|consen 239 DTRNMGKPLFKAKVG 253 (339)
T ss_pred ehhcccCccccCccc
Confidence 887 56665544443
No 325
>PF14779 BBS1: Ciliary BBSome complex subunit 1
Probab=34.01 E-value=2.3e+02 Score=26.03 Aligned_cols=60 Identities=18% Similarity=0.246 Sum_probs=40.4
Q ss_pred CCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCc-eeecEEEEEc
Q 040693 134 KCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNK-VKHDIVVAVQ 212 (382)
Q Consensus 134 ~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g-~~~~~v~~~~ 212 (382)
.|+..+++.+.|+.||+..=.++-+.+.+. .|+.......= ..+-.|++..
T Consensus 196 scLViGTE~~~i~iLd~~af~il~~~~lps----------------------------vPv~i~~~G~~devdyRI~Va~ 247 (257)
T PF14779_consen 196 SCLVIGTESGEIYILDPQAFTILKQVQLPS----------------------------VPVFISVSGQYDEVDYRIVVAC 247 (257)
T ss_pred ceEEEEecCCeEEEECchhheeEEEEecCC----------------------------CceEEEEEeeeeccceEEEEEe
Confidence 355556677999999998778887777763 34443322110 1267788999
Q ss_pred cCcEEEEEe
Q 040693 213 KSGFAWALD 221 (382)
Q Consensus 213 ~~g~l~ald 221 (382)
.||.+|.+-
T Consensus 248 Rdg~iy~ir 256 (257)
T PF14779_consen 248 RDGKIYTIR 256 (257)
T ss_pred CCCEEEEEe
Confidence 999998874
No 326
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=33.90 E-value=4.6e+02 Score=28.27 Aligned_cols=74 Identities=18% Similarity=0.270 Sum_probs=45.7
Q ss_pred cEEEEEccCcEEEEEeCCCCCee-eeeccCCCCCCCCcccceeeeCC-eEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLI-WSMEAGPGGLGGGAMWGAATDER-RIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~-W~~~~~~~~~~g~~~~~~~~~~~-~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
.+|-+++..|.+...|...+..+ |-.+...+ .-...|.+.-++. .+.... ...+.+..++
T Consensus 80 lliAsaD~~GrIil~d~~~~s~~~~l~~~~~~--~qdl~W~~~rd~Srd~LlaI----------------h~ss~lvLwn 141 (1062)
T KOG1912|consen 80 LLIASADISGRIILVDFVLASVINWLSHSNDS--VQDLCWVPARDDSRDVLLAI----------------HGSSTLVLWN 141 (1062)
T ss_pred eeEEeccccCcEEEEEehhhhhhhhhcCCCcc--hhheeeeeccCcchheeEEe----------------cCCcEEEEEE
Confidence 44456677888999998888755 44443321 1123444443333 222221 3357899999
Q ss_pred CCCCcEEeeecCCC
Q 040693 284 ASNGNVLWSTADPS 297 (382)
Q Consensus 284 ~~tG~~~W~~~~~~ 297 (382)
..||+..|+++...
T Consensus 142 tdtG~k~Wk~~ys~ 155 (1062)
T KOG1912|consen 142 TDTGEKFWKYDYSH 155 (1062)
T ss_pred ccCCceeeccccCC
Confidence 99999999998654
No 327
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=33.47 E-value=4.7e+02 Score=25.55 Aligned_cols=36 Identities=25% Similarity=0.315 Sum_probs=24.6
Q ss_pred cceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 303 GPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 303 ~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
+....++..++-++ .++.|++.|..+|+.+-..+.+
T Consensus 199 ~isl~~~~~LlS~s--GD~tlr~Wd~~sgk~L~t~dl~ 234 (390)
T KOG3914|consen 199 TISLTDNYLLLSGS--GDKTLRLWDITSGKLLDTCDLS 234 (390)
T ss_pred eeeeccCceeeecC--CCCcEEEEecccCCcccccchh
Confidence 33333444445444 5899999999999998666654
No 328
>PF14779 BBS1: Ciliary BBSome complex subunit 1
Probab=32.36 E-value=1.4e+02 Score=27.49 Aligned_cols=51 Identities=20% Similarity=0.162 Sum_probs=39.1
Q ss_pred EEEEeeecCCCcEEEEeCCCCcEeEEEecCCc-e--e-cceEE-eCCEEEEEeCceeE
Q 040693 311 VLFGGSTYRQGPIYAMDVKTGKILWSYDTGAT-I--Y-GGASV-SNGCIYMGNGYKVT 363 (382)
Q Consensus 311 ~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~-~--~-~~p~~-~~g~lyv~~~~g~~ 363 (382)
-+++++. .+.|+.||++.-.++-++++++- . . .+-.- +|-|++|.+.+|.+
T Consensus 197 cLViGTE--~~~i~iLd~~af~il~~~~lpsvPv~i~~~G~~devdyRI~Va~Rdg~i 252 (257)
T PF14779_consen 197 CLVIGTE--SGEIYILDPQAFTILKQVQLPSVPVFISVSGQYDEVDYRIVVACRDGKI 252 (257)
T ss_pred eEEEEec--CCeEEEECchhheeEEEEecCCCceEEEEEeeeeccceEEEEEeCCCEE
Confidence 5677774 99999999999999999998752 1 1 22222 78999999999874
No 329
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=32.20 E-value=1.7e+02 Score=28.60 Aligned_cols=71 Identities=11% Similarity=0.140 Sum_probs=45.0
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceee--eCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAAT--DERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
.++|.+++.|.-+-..|+.+|+.+-+...-.. ...+... .++.+... ..+..+..|
T Consensus 234 kgLiasgskDnlVKlWDprSg~cl~tlh~HKn-----tVl~~~f~~n~N~Llt~-----------------skD~~~kv~ 291 (464)
T KOG0284|consen 234 KGLIASGSKDNLVKLWDPRSGSCLATLHGHKN-----TVLAVKFNPNGNWLLTG-----------------SKDQSCKVF 291 (464)
T ss_pred cceeEEccCCceeEeecCCCcchhhhhhhccc-----eEEEEEEcCCCCeeEEc-----------------cCCceEEEE
Confidence 46777888888888889999987766552110 1111111 23444433 445779999
Q ss_pred ECCCCcEEeeecCCC
Q 040693 283 DASNGNVLWSTADPS 297 (382)
Q Consensus 283 d~~tG~~~W~~~~~~ 297 (382)
|+++-+.+.++....
T Consensus 292 DiR~mkEl~~~r~Hk 306 (464)
T KOG0284|consen 292 DIRTMKELFTYRGHK 306 (464)
T ss_pred ehhHhHHHHHhhcch
Confidence 999888887776443
No 330
>PF14517 Tachylectin: Tachylectin; PDB: 1TL2_A.
Probab=30.67 E-value=1.8e+02 Score=26.18 Aligned_cols=78 Identities=21% Similarity=0.289 Sum_probs=32.3
Q ss_pred CCceEEEEECCCCcEEeeec--CCCCCCC-CcceEE-eCC---EEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecce
Q 040693 275 IAGGWVAMDASNGNVLWSTA--DPSNGTA-PGPVTV-ANG---VLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGA 347 (382)
Q Consensus 275 ~~g~v~a~d~~tG~~~W~~~--~~~~~~~-~~~~~~-~~~---~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p 347 (382)
..|.|+++++ +| .+++.. ......+ ....++ .++ ..++... .++.|++++ .+|++ ||...+. ...|
T Consensus 139 ~~GvLY~i~~-dg-~~~~~~~p~~~~~~W~~~s~~v~~~gw~~~~~i~~~-~~g~L~~V~-~~G~l-yr~~~p~--~~~~ 211 (229)
T PF14517_consen 139 PNGVLYAITP-DG-RLYRRYRPDGGSDRWLSGSGLVGGGGWDSFHFIFFS-PDGNLWAVK-SNGKL-YRGRPPQ--NGCP 211 (229)
T ss_dssp TTS-EEEEET-TE--EEEE---SSTT--HHHH-EEEESSSGGGEEEEEE--TTS-EEEE--ETTEE-EEES-----STT-
T ss_pred CCccEEEEcC-CC-ceEEeCCCCCCCCccccccceeccCCcccceEEeeC-CCCcEEEEe-cCCEE-eccCCcc--cCCc
Confidence 4577888884 44 333432 2221111 122233 344 4444433 489999994 44665 8776543 2344
Q ss_pred EEeCCEEEEEeC
Q 040693 348 SVSNGCIYMGNG 359 (382)
Q Consensus 348 ~~~~g~lyv~~~ 359 (382)
...++--.++++
T Consensus 212 ~~~~~a~~~g~g 223 (229)
T PF14517_consen 212 VYWENAKLIGSG 223 (229)
T ss_dssp -HHHH-EEEEES
T ss_pred hhhhhheeeccc
Confidence 444443333433
No 331
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=30.33 E-value=2.5e+02 Score=28.77 Aligned_cols=68 Identities=13% Similarity=0.141 Sum_probs=45.3
Q ss_pred CCCCCCCCCCcceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEE
Q 040693 132 PDKCIEPENHSNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAV 211 (382)
Q Consensus 132 ~~~~~~~~~~~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~ 211 (382)
.+.++..+..+|.|..+|..++-..+.... -.|.+....++| .+++++
T Consensus 270 ~E~kLvlGC~DgSiiLyD~~~~~t~~~ka~-----------------------------~~P~~iaWHp~g---ai~~V~ 317 (545)
T PF11768_consen 270 SEDKLVLGCEDGSIILYDTTRGVTLLAKAE-----------------------------FIPTLIAWHPDG---AIFVVG 317 (545)
T ss_pred ccceEEEEecCCeEEEEEcCCCeeeeeeec-----------------------------ccceEEEEcCCC---cEEEEE
Confidence 344445555679999999887766554211 245565555444 688889
Q ss_pred ccCcEEEEEeCCCCCeeeee
Q 040693 212 QKSGFAWALDRDSGSLIWSM 231 (382)
Q Consensus 212 ~~~g~l~ald~~tG~~~W~~ 231 (382)
+..|.+.++|.+-.-+.-+.
T Consensus 318 s~qGelQ~FD~ALspi~~qL 337 (545)
T PF11768_consen 318 SEQGELQCFDMALSPIKMQL 337 (545)
T ss_pred cCCceEEEEEeecCccceee
Confidence 99999999998865444443
No 332
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=30.19 E-value=8.8e+02 Score=27.74 Aligned_cols=69 Identities=14% Similarity=0.073 Sum_probs=47.4
Q ss_pred cEEEEEccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccceeee--CCeEEEEecCccccccccCCCCCCCCCceEEEEE
Q 040693 206 DIVVAVQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGAATD--ERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMD 283 (382)
Q Consensus 206 ~~v~~~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~~~~--~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d 283 (382)
..|+.++..+.++.+|..+-..+|..+.+.. .+.....+++ ...+.++ +..|.+.++|
T Consensus 1164 ~~lvy~T~~~~iv~~D~r~~~~~w~lk~~~~---hG~vTSi~idp~~~WlviG-----------------ts~G~l~lWD 1223 (1431)
T KOG1240|consen 1164 HVLVYATDLSRIVSWDTRMRHDAWRLKNQLR---HGLVTSIVIDPWCNWLVIG-----------------TSRGQLVLWD 1223 (1431)
T ss_pred eeEEEEEeccceEEecchhhhhHHhhhcCcc---ccceeEEEecCCceEEEEe-----------------cCCceEEEEE
Confidence 4677777788899999999999999987642 2233333443 3444454 5568899999
Q ss_pred CCCCcEEeeec
Q 040693 284 ASNGNVLWSTA 294 (382)
Q Consensus 284 ~~tG~~~W~~~ 294 (382)
++=+.+.-+..
T Consensus 1224 LRF~~~i~sw~ 1234 (1431)
T KOG1240|consen 1224 LRFRVPILSWE 1234 (1431)
T ss_pred eecCceeeccc
Confidence 98776643333
No 333
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=29.84 E-value=5.8e+02 Score=25.50 Aligned_cols=27 Identities=22% Similarity=0.180 Sum_probs=22.1
Q ss_pred ecEEEEEccCcEEEEEeCCCCCe---eeee
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSL---IWSM 231 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~---~W~~ 231 (382)
+..++..+.++.+.-.+..+|+. +|..
T Consensus 154 ~~~~fsask~g~i~kw~v~tgk~~~~i~~~ 183 (479)
T KOG0299|consen 154 DKRVFSASKDGTILKWDVLTGKKDRYIIER 183 (479)
T ss_pred ccceeecCCCcceeeeehhcCccccccccc
Confidence 45788899999999999999984 4655
No 334
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes an uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=29.43 E-value=5e+02 Score=24.69 Aligned_cols=88 Identities=19% Similarity=0.185 Sum_probs=51.5
Q ss_pred eeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecCCC--CCCCCcce---EE--eCCEEEEeeecC
Q 040693 247 ATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTADPS--NGTAPGPV---TV--ANGVLFGGSTYR 319 (382)
Q Consensus 247 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~~~--~~~~~~~~---~~--~~~~v~~~~~~~ 319 (382)
..-++.||+.-...+.......+ ....|.|-.||+ +|+.+-|..... ...|.-.+ .+ -.+.++++.. .
T Consensus 195 qnig~~lyVtYA~qd~~~~d~v~---G~G~G~VdvFd~-~G~l~~r~as~g~LNaPWG~a~APa~FG~~sg~lLVGNF-G 269 (336)
T TIGR03118 195 QNLGGTLYVTYAQQDADRNDEVA---GAGLGYVNVFTL-NGQLLRRVASSGRLNAPWGLAIAPESFGSLSGALLVGNF-G 269 (336)
T ss_pred EEECCeEEEEEEecCCccccccc---CCCcceEEEEcC-CCcEEEEeccCCcccCCceeeeChhhhCCCCCCeEEeec-C
Confidence 34688999985554332211111 122356778885 699888875443 12222211 11 1245566654 5
Q ss_pred CCcEEEEeCCCCcEeEEEec
Q 040693 320 QGPIYAMDVKTGKILWSYDT 339 (382)
Q Consensus 320 ~g~l~~ld~~tG~ilw~~~~ 339 (382)
+|+|-+||+.+|+-+-+..-
T Consensus 270 DG~InaFD~~sG~~~g~L~~ 289 (336)
T TIGR03118 270 DGTINAYDPQSGAQLGQLLD 289 (336)
T ss_pred CceeEEecCCCCceeeeecC
Confidence 99999999999997766643
No 335
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=27.26 E-value=4e+02 Score=25.71 Aligned_cols=117 Identities=12% Similarity=0.118 Sum_probs=65.3
Q ss_pred ecEEEEEccCcEEEEEeCCCCC--eeeeeccCCCCCCCCccccee-eeCCeEEEEecCccccccccCCCCCCCCCceEEE
Q 040693 205 HDIVVAVQKSGFAWALDRDSGS--LIWSMEAGPGGLGGGAMWGAA-TDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVA 281 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~--~~W~~~~~~~~~~g~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a 281 (382)
+++|+.+-.+|.++++|..++. ..|....--.... ....... .++..+.++ +..|+|..
T Consensus 264 ~nLv~~GcRngeI~~iDLR~rnqG~~~~a~rlyh~Ss-vtslq~Lq~s~q~LmaS-----------------~M~gkikL 325 (425)
T KOG2695|consen 264 DNLVFNGCRNGEIFVIDLRCRNQGNGWCAQRLYHDSS-VTSLQILQFSQQKLMAS-----------------DMTGKIKL 325 (425)
T ss_pred CCeeEecccCCcEEEEEeeecccCCCcceEEEEcCcc-hhhhhhhccccceEeec-----------------cCcCceeE
Confidence 6788888899999999998762 2343321000000 0001111 144555554 56688888
Q ss_pred EECCCCcE---EeeecCCCCCCCCcceEEe--CCEEEEeeecCCCcEEEEeCCCCcEeEEEecCC
Q 040693 282 MDASNGNV---LWSTADPSNGTAPGPVTVA--NGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGA 341 (382)
Q Consensus 282 ~d~~tG~~---~W~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~ 341 (382)
+|.+--|- +-+++...+....-+..+. ++.|+.+. ++=.......++|+++-..+.+-
T Consensus 326 yD~R~~K~~~~V~qYeGHvN~~a~l~~~v~~eeg~I~s~G--dDcytRiWsl~~ghLl~tipf~~ 388 (425)
T KOG2695|consen 326 YDLRATKCKKSVMQYEGHVNLSAYLPAHVKEEEGSIFSVG--DDCYTRIWSLDSGHLLCTIPFPY 388 (425)
T ss_pred eeehhhhcccceeeeecccccccccccccccccceEEEcc--CeeEEEEEecccCceeeccCCCC
Confidence 88775444 5455544332223344343 35566544 46667777777888888776553
No 336
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=27.12 E-value=3.4e+02 Score=26.43 Aligned_cols=97 Identities=15% Similarity=0.102 Sum_probs=60.4
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC--CcEeEEEecCCcee-cceEE-
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT--GKILWSYDTGATIY-GGASV- 349 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t--G~ilw~~~~~~~~~-~~p~~- 349 (382)
..+-+|...|+.+|...=+...++.- ......-...++..++. +-.+...|+++ |++.-+.-++...+ ++...
T Consensus 278 SwDHTIk~WDletg~~~~~~~~~ksl-~~i~~~~~~~Ll~~gss--dr~irl~DPR~~~gs~v~~s~~gH~nwVssvkws 354 (423)
T KOG0313|consen 278 SWDHTIKVWDLETGGLKSTLTTNKSL-NCISYSPLSKLLASGSS--DRHIRLWDPRTGDGSVVSQSLIGHKNWVSSVKWS 354 (423)
T ss_pred cccceEEEEEeecccceeeeecCcce-eEeecccccceeeecCC--CCceeecCCCCCCCceeEEeeecchhhhhheecC
Confidence 34577999999999888777655521 11111112356666664 78888999985 55555544444322 22222
Q ss_pred -eCCEEEEEeCceeEeecCCccCCC
Q 040693 350 -SNGCIYMGNGYKVTVGFGNKNFTS 373 (382)
Q Consensus 350 -~~g~lyv~~~~g~~~~~~~~~~~~ 373 (382)
-+..+|++.++.++++++.+.+..
T Consensus 355 p~~~~~~~S~S~D~t~klWDvRS~k 379 (423)
T KOG0313|consen 355 PTNEFQLVSGSYDNTVKLWDVRSTK 379 (423)
T ss_pred CCCceEEEEEecCCeEEEEEeccCC
Confidence 567888888888888887665543
No 337
>PRK10115 protease 2; Provisional
Probab=26.64 E-value=8e+02 Score=26.10 Aligned_cols=67 Identities=9% Similarity=0.084 Sum_probs=38.4
Q ss_pred EeCCEEEEeeec--CCCcEEEEeCCCCcEeEEEecCC---ceecceEEeCCEEEEEeCceeEeecCCccCCCC
Q 040693 307 VANGVLFGGSTY--RQGPIYAMDVKTGKILWSYDTGA---TIYGGASVSNGCIYMGNGYKVTVGFGNKNFTSG 374 (382)
Q Consensus 307 ~~~~~v~~~~~~--~~g~l~~ld~~tG~ilw~~~~~~---~~~~~p~~~~g~lyv~~~~g~~~~~~~~~~~~g 374 (382)
..++.+|+.+.. ....|..++..+ .-.|+.=++. .......+.++.|++....+....++.++..++
T Consensus 277 ~~~~~ly~~tn~~~~~~~l~~~~~~~-~~~~~~l~~~~~~~~i~~~~~~~~~l~~~~~~~g~~~l~~~~~~~~ 348 (686)
T PRK10115 277 HYQHRFYLRSNRHGKNFGLYRTRVRD-EQQWEELIPPRENIMLEGFTLFTDWLVVEERQRGLTSLRQINRKTR 348 (686)
T ss_pred eCCCEEEEEEcCCCCCceEEEecCCC-cccCeEEECCCCCCEEEEEEEECCEEEEEEEeCCEEEEEEEcCCCC
Confidence 345778887741 234688888763 2245543333 234455556788888766555555666665433
No 338
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=26.01 E-value=3.9e+02 Score=28.01 Aligned_cols=58 Identities=14% Similarity=0.114 Sum_probs=38.3
Q ss_pred CceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC-CcEeE
Q 040693 276 AGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT-GKILW 335 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t-G~ilw 335 (382)
.+.++.+|+.|+|.+-+....... ..+..-..+|.++..+- .+.+|..||+.. ++.+-
T Consensus 149 ~g~v~i~D~stqk~~~el~~h~d~-vQSa~WseDG~llatsc-KdkqirifDPRa~~~piQ 207 (1012)
T KOG1445|consen 149 HGSVYITDISTQKTAVELSGHTDK-VQSADWSEDGKLLATSC-KDKQIRIFDPRASMEPIQ 207 (1012)
T ss_pred CceEEEEEcccCceeecccCCchh-hhccccccCCceEeeec-CCcceEEeCCccCCCccc
Confidence 478999999999999887765532 23333344555555542 577899999863 44443
No 339
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=25.93 E-value=1.4e+02 Score=18.59 Aligned_cols=35 Identities=17% Similarity=0.189 Sum_probs=22.1
Q ss_pred ceEEEcCEEEEeccCcc-ccccccccccccceEEEEeCccCceeee
Q 040693 26 SGTYYKGAYYVGTSSIE-EGLTFELCCTFQGSLAKLDAKTGRILWQ 70 (382)
Q Consensus 26 ~p~v~~~~v~v~~~~~~-~~~~~~~~~~~~g~l~ald~~tG~~lW~ 70 (382)
+-++.++.||+...... .. .-..+.++|++++ .|+
T Consensus 6 ~~~~~~~~iyv~GG~~~~~~--------~~~~v~~yd~~~~--~W~ 41 (47)
T PF01344_consen 6 AAVVVGNKIYVIGGYDGNNQ--------PTNSVEVYDPETN--TWE 41 (47)
T ss_dssp EEEEETTEEEEEEEBESTSS--------BEEEEEEEETTTT--EEE
T ss_pred EEEEECCEEEEEeeecccCc--------eeeeEEEEeCCCC--EEE
Confidence 45688899998554332 11 1457888997754 454
No 340
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=24.49 E-value=6.2e+02 Score=24.12 Aligned_cols=28 Identities=14% Similarity=0.123 Sum_probs=20.9
Q ss_pred ecEEEEEccCcEEEEEeCCCCCeeeeec
Q 040693 205 HDIVVAVQKSGFAWALDRDSGSLIWSME 232 (382)
Q Consensus 205 ~~~v~~~~~~g~l~ald~~tG~~~W~~~ 232 (382)
+.+++.++.|-....+|.+||+++=...
T Consensus 284 g~Q~vTaSWDRTAnlwDVEtge~v~~Lt 311 (481)
T KOG0300|consen 284 GQQMVTASWDRTANLWDVETGEVVNILT 311 (481)
T ss_pred cceeeeeeccccceeeeeccCceecccc
Confidence 5677788888888888888887765443
No 341
>KOG3545 consensus Olfactomedin and related extracellular matrix glycoproteins [Extracellular structures]
Probab=22.57 E-value=6e+02 Score=23.24 Aligned_cols=102 Identities=22% Similarity=0.216 Sum_probs=59.6
Q ss_pred CceEEEEECCCCcEEeeecCCCCCCCC-----------cceEEeC-CE-EEEeeecCCCcEE--EEeCCCC--cEeEEEe
Q 040693 276 AGGWVAMDASNGNVLWSTADPSNGTAP-----------GPVTVAN-GV-LFGGSTYRQGPIY--AMDVKTG--KILWSYD 338 (382)
Q Consensus 276 ~g~v~a~d~~tG~~~W~~~~~~~~~~~-----------~~~~~~~-~~-v~~~~~~~~g~l~--~ld~~tG--~ilw~~~ 338 (382)
...|+.+|++++...=...++...... -.++++. |+ |.-++.+.+|.+. -||+.+= +-.|+..
T Consensus 87 t~~ivky~l~~~~~~~~~~lp~a~y~~~~~y~~~g~sdiD~avDE~GLWviYat~~~~g~iv~skLdp~tl~~e~tW~T~ 166 (249)
T KOG3545|consen 87 TRNIIKYDLETRTVAGSAALPYAGYHNPSPYYWGGHSDIDLAVDENGLWVIYATPENAGTIVLSKLDPETLEVERTWNTT 166 (249)
T ss_pred CcceEEEEeecceeeeeeeccccccCCCcccccCCCccccceecccceeEEecccccCCcEEeeccCHHHhheeeeeccc
Confidence 355889999998766665555421111 2334443 42 2222322344433 5677553 4467777
Q ss_pred cCCceecceEEeCCEEEEEeCcee--EeecCCccCCCCCeE
Q 040693 339 TGATIYGGASVSNGCIYMGNGYKV--TVGFGNKNFTSGTSL 377 (382)
Q Consensus 339 ~~~~~~~~p~~~~g~lyv~~~~g~--~~~~~~~~~~~g~~l 377 (382)
.+....+..-+.=|.||+..+.-. ..--|++|..+|+..
T Consensus 167 ~~k~~~~~aF~iCGvLY~v~S~~~~~~~i~yaydt~~~~~~ 207 (249)
T KOG3545|consen 167 LPKRSAGNAFMICGVLYVVHSYNCTHTQISYAYDTTTGTQE 207 (249)
T ss_pred cCCCCcCceEEEeeeeEEEeccccCCceEEEEEEcCCCcee
Confidence 766667777777789999876322 222378888888763
No 342
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=22.43 E-value=1.8e+02 Score=17.05 Aligned_cols=8 Identities=25% Similarity=0.563 Sum_probs=5.5
Q ss_pred cceEEEEe
Q 040693 54 QGSLAKLD 61 (382)
Q Consensus 54 ~g~l~ald 61 (382)
++.|..+|
T Consensus 32 D~~i~vwd 39 (39)
T PF00400_consen 32 DGTIRVWD 39 (39)
T ss_dssp TSEEEEEE
T ss_pred CCEEEEEC
Confidence 77777665
No 343
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=22.18 E-value=1.3e+02 Score=29.90 Aligned_cols=51 Identities=18% Similarity=0.237 Sum_probs=32.3
Q ss_pred eCCEEEEeeecCCCcEEEEeCCC---CcEeEEEecCCcee------------------cceEEeCCEEEEEeC
Q 040693 308 ANGVLFGGSTYRQGPIYAMDVKT---GKILWSYDTGATIY------------------GGASVSNGCIYMGNG 359 (382)
Q Consensus 308 ~~~~v~~~~~~~~g~l~~ld~~t---G~ilw~~~~~~~~~------------------~~p~~~~g~lyv~~~ 359 (382)
++..+|+... ..|.+..+|..+ =|+..++.+++.+. -.....+.|||++++
T Consensus 322 DDrfLYvs~W-~~GdvrqYDISDP~~Pkl~gqv~lGG~~~~~~~~~v~g~~l~GgPqMvqlS~DGkRlYvTnS 393 (461)
T PF05694_consen 322 DDRFLYVSNW-LHGDVRQYDISDPFNPKLVGQVFLGGSIRKGDHPVVKGKRLRGGPQMVQLSLDGKRLYVTNS 393 (461)
T ss_dssp TS-EEEEEET-TTTEEEEEE-SSTTS-EEEEEEE-BTTTT-B--TTS------S----EEE-TTSSEEEEE--
T ss_pred CCCEEEEEcc-cCCcEEEEecCCCCCCcEEeEEEECcEeccCCCccccccccCCCCCeEEEccCCeEEEEEee
Confidence 4668888775 789999999764 67788888776431 122337899999986
No 344
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=22.09 E-value=1.2e+03 Score=26.67 Aligned_cols=63 Identities=13% Similarity=0.041 Sum_probs=42.9
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEccCcEEEEEeC
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQKSGFAWALDR 222 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~~g~l~ald~ 222 (382)
++++.+|..+-+.+|+.+...... ...+-++. .-...++.|+..|.+.++|.
T Consensus 1173 ~~iv~~D~r~~~~~w~lk~~~~hG----------------------~vTSi~id------p~~~WlviGts~G~l~lWDL 1224 (1431)
T KOG1240|consen 1173 SRIVSWDTRMRHDAWRLKNQLRHG----------------------LVTSIVID------PWCNWLVIGTSRGQLVLWDL 1224 (1431)
T ss_pred cceEEecchhhhhHHhhhcCcccc----------------------ceeEEEec------CCceEEEEecCCceEEEEEe
Confidence 789999999999999887765421 11222221 11457888999999999998
Q ss_pred CCCCee--eeecc
Q 040693 223 DSGSLI--WSMEA 233 (382)
Q Consensus 223 ~tG~~~--W~~~~ 233 (382)
.=+.++ |+++.
T Consensus 1225 RF~~~i~sw~~P~ 1237 (1431)
T KOG1240|consen 1225 RFRVPILSWEHPA 1237 (1431)
T ss_pred ecCceeecccCcc
Confidence 866655 55543
No 345
>PHA02581 9 baseplate wedge tail fiber connector; Provisional
Probab=21.85 E-value=6.4e+02 Score=23.33 Aligned_cols=34 Identities=9% Similarity=0.247 Sum_probs=23.7
Q ss_pred CCcEEEEeCCCCcEeEEEecCCceecceEEeCCE
Q 040693 320 QGPIYAMDVKTGKILWSYDTGATIYGGASVSNGC 353 (382)
Q Consensus 320 ~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~ 353 (382)
+=.++|++...+...|.+.+...+-...+.++..
T Consensus 145 ~v~~wc~~~~~~~~~W~y~iesmFGd~~~Pv~~t 178 (284)
T PHA02581 145 KVTLWCIKSEGSTSVWDYSIESMFGDKTMPVNKT 178 (284)
T ss_pred EEEEEEEeCCCCceeeeeEEEEcccCCcccccce
Confidence 4468899988999999998865443333445555
No 346
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=21.64 E-value=4.8e+02 Score=28.63 Aligned_cols=77 Identities=13% Similarity=0.186 Sum_probs=57.5
Q ss_pred ceEEEEECCCCcEEeeecCCCCCCCCcceEEeCC-EEEEeeecCCCcEEEEeCCCCcEeEEEecCCceecceEEeCCEEE
Q 040693 277 GGWVAMDASNGNVLWSTADPSNGTAPGPVTVANG-VLFGGSTYRQGPIYAMDVKTGKILWSYDTGATIYGGASVSNGCIY 355 (382)
Q Consensus 277 g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~-~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~~~~~p~~~~g~ly 355 (382)
..+.-+|..++++.-......+ ...++..|+ .+|++. ..|.|...|+++-+.+-+++...+..+..-+.++.|.
T Consensus 157 ~~li~~Dl~~~~e~r~~~v~a~---~v~imR~Nnr~lf~G~--t~G~V~LrD~~s~~~iht~~aHs~siSDfDv~GNlLi 231 (1118)
T KOG1275|consen 157 EKLIHIDLNTEKETRTTNVSAS---GVTIMRYNNRNLFCGD--TRGTVFLRDPNSFETIHTFDAHSGSISDFDVQGNLLI 231 (1118)
T ss_pred hheeeeecccceeeeeeeccCC---ceEEEEecCcEEEeec--ccceEEeecCCcCceeeeeeccccceeeeeccCCeEE
Confidence 4588899999998877776542 345555665 566666 4899999999999999999887777766666666666
Q ss_pred EEe
Q 040693 356 MGN 358 (382)
Q Consensus 356 v~~ 358 (382)
...
T Consensus 232 tCG 234 (1118)
T KOG1275|consen 232 TCG 234 (1118)
T ss_pred Eee
Confidence 553
No 347
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=21.41 E-value=6.7e+02 Score=23.35 Aligned_cols=175 Identities=15% Similarity=0.098 Sum_probs=90.6
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCc---eeecEEEEEcc------
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNK---VKHDIVVAVQK------ 213 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g---~~~~~v~~~~~------ 213 (382)
..|..+|+.+.+++=++++.+.... .+ +..+...+ .....|++++.
T Consensus 2 s~i~l~d~~~~~~~~~~~l~~~E~~-----------------------~s--~~~~~l~~~~~~~~~~ivVGT~~~~~~~ 56 (321)
T PF03178_consen 2 SSIRLVDPTTFEVLDSFELEPNEHV-----------------------TS--LCSVKLKGDSTGKKEYIVVGTAFNYGED 56 (321)
T ss_dssp -EEEEEETTTSSEEEEEEEETTEEE-----------------------EE--EEEEEETTS---SSEEEEEEEEE--TTS
T ss_pred cEEEEEeCCCCeEEEEEECCCCceE-----------------------EE--EEEEEEcCccccccCEEEEEeccccccc
Confidence 5688899999999988887765321 00 11111111 11355555543
Q ss_pred ---C-cEEEEEeCCCC-C------eeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEE
Q 040693 214 ---S-GFAWALDRDSG-S------LIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAM 282 (382)
Q Consensus 214 ---~-g~l~ald~~tG-~------~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~ 282 (382)
. |.++.++.... . .+.+.+. .+ ........++.++++. .+.|+.+
T Consensus 57 ~~~~~Gri~v~~i~~~~~~~~~l~~i~~~~~-----~g-~V~ai~~~~~~lv~~~------------------g~~l~v~ 112 (321)
T PF03178_consen 57 PEPSSGRILVFEISESPENNFKLKLIHSTEV-----KG-PVTAICSFNGRLVVAV------------------GNKLYVY 112 (321)
T ss_dssp SS-S-EEEEEEEECSS-----EEEEEEEEEE-----SS--EEEEEEETTEEEEEE------------------TTEEEEE
T ss_pred ccccCcEEEEEEEEcccccceEEEEEEEEee-----cC-cceEhhhhCCEEEEee------------------cCEEEEE
Confidence 2 88999998874 2 2222232 12 3333333567766653 3668888
Q ss_pred ECCCCc-EEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEE--eCCCCcEeE--EEecCCceecceEE-eCCEEEE
Q 040693 283 DASNGN-VLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAM--DVKTGKILW--SYDTGATIYGGASV-SNGCIYM 356 (382)
Q Consensus 283 d~~tG~-~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~l--d~~tG~ilw--~~~~~~~~~~~p~~-~~g~lyv 356 (382)
+....+ .+=...... ......+...++++++++. ...+..+ +.+.-+..- +-..+....+.-.+ .++.+++
T Consensus 113 ~l~~~~~l~~~~~~~~-~~~i~sl~~~~~~I~vgD~--~~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~ 189 (321)
T PF03178_consen 113 DLDNSKTLLKKAFYDS-PFYITSLSVFKNYILVGDA--MKSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIV 189 (321)
T ss_dssp EEETTSSEEEEEEE-B-SSSEEEEEEETTEEEEEES--SSSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEE
T ss_pred EccCcccchhhheecc-eEEEEEEeccccEEEEEEc--ccCEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEE
Confidence 887777 333333322 2234455556889999885 5555544 654332322 22222222333333 4457777
Q ss_pred EeCceeEeecCCcc
Q 040693 357 GNGYKVTVGFGNKN 370 (382)
Q Consensus 357 ~~~~g~~~~~~~~~ 370 (382)
++.+|++ .++..+
T Consensus 190 ~D~~gnl-~~l~~~ 202 (321)
T PF03178_consen 190 GDKDGNL-FVLRYN 202 (321)
T ss_dssp EETTSEE-EEEEE-
T ss_pred EcCCCeE-EEEEEC
Confidence 7788774 334444
No 348
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=21.39 E-value=7.8e+02 Score=24.15 Aligned_cols=141 Identities=13% Similarity=0.070 Sum_probs=66.1
Q ss_pred cceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEE---e--eeCceeecEEEEEccCcE
Q 040693 142 SNSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLS---M--YRNKVKHDIVVAVQKSGF 216 (382)
Q Consensus 142 ~g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~---~--~~~g~~~~~v~~~~~~g~ 216 (382)
+|.|..+|..--.++.+-.......- ......+...+ + ..|+-+.-.++++++.|.
T Consensus 106 ~G~l~viD~RGPavI~~~~i~~~~~~-------------------~~~~~~vt~ieF~vm~~~~D~ySSi~L~vGTn~G~ 166 (395)
T PF08596_consen 106 SGSLVVIDLRGPAVIYNENIRESFLS-------------------KSSSSYVTSIEFSVMTLGGDGYSSICLLVGTNSGN 166 (395)
T ss_dssp TSEEEEEETTTTEEEEEEEGGG--T--------------------SS----EEEEEEEEEE-TTSSSEEEEEEEEETTSE
T ss_pred CCcEEEEECCCCeEEeeccccccccc-------------------cccccCeeEEEEEEEecCCCcccceEEEEEeCCCC
Confidence 48999999998888888766651000 00011122221 1 233334567888999998
Q ss_pred EEEEeCC-CCCeeeeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEeeecC
Q 040693 217 AWALDRD-SGSLIWSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWSTAD 295 (382)
Q Consensus 217 l~ald~~-tG~~~W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~~~~ 295 (382)
+..+... ...-.|....... ....++. --.|..||.++|+...-...
T Consensus 167 v~~fkIlp~~~g~f~v~~~~~---------~~~~~~~-----------------------i~~I~~i~~~~G~~a~At~~ 214 (395)
T PF08596_consen 167 VLTFKILPSSNGRFSVQFAGA---------TTNHDSP-----------------------ILSIIPINADTGESALATIS 214 (395)
T ss_dssp EEEEEEEE-GGG-EEEEEEEE---------E--SS---------------------------EEEEEETTT--B-B-BHH
T ss_pred EEEEEEecCCCCceEEEEeec---------cccCCCc-----------------------eEEEEEEECCCCCcccCchh
Confidence 8777543 2333455442100 0001111 23367788889987765432
Q ss_pred CCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEe
Q 040693 296 PSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYD 338 (382)
Q Consensus 296 ~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~ 338 (382)
.-.. ...-...++.+++++ +..+.++.+-+.|..-+..
T Consensus 215 ~~~~--l~~g~~i~g~vVvvS---e~~irv~~~~~~k~~~K~~ 252 (395)
T PF08596_consen 215 AMQG--LSKGISIPGYVVVVS---ESDIRVFKPPKSKGAHKSF 252 (395)
T ss_dssp HHHG--GGGT----EEEEEE----SSEEEEE-TT---EEEEE-
T ss_pred Hhhc--cccCCCcCcEEEEEc---ccceEEEeCCCCcccceee
Confidence 1100 111112356777776 7889999988888766665
No 349
>COG5167 VID27 Protein involved in vacuole import and degradation [Intracellular trafficking and secretion]
Probab=21.04 E-value=7e+02 Score=25.60 Aligned_cols=102 Identities=14% Similarity=0.154 Sum_probs=52.3
Q ss_pred ecEEEE-EccCcEEEEEeCCCCCeeeeeccCCCCCCCCcccce------eeeCCeEEEEecCccccccccCCCCCCCCCc
Q 040693 205 HDIVVA-VQKSGFAWALDRDSGSLIWSMEAGPGGLGGGAMWGA------ATDERRIYTNIANSQHKNFNLKPSKNSTIAG 277 (382)
Q Consensus 205 ~~~v~~-~~~~g~l~ald~~tG~~~W~~~~~~~~~~g~~~~~~------~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g 277 (382)
..+|+. +.....||-+|.+.|+++=+.+..... ...+.| ..+.+.++- ....
T Consensus 479 ssli~~dg~~~~kLykmDIErGkvveeW~~~ddv---vVqy~p~~kf~qmt~eqtlvG------------------lS~~ 537 (776)
T COG5167 479 SSLIYLDGGERDKLYKMDIERGKVVEEWDLKDDV---VVQYNPYFKFQQMTDEQTLVG------------------LSDY 537 (776)
T ss_pred cceEEecCCCcccceeeecccceeeeEeecCCcc---eeecCCchhHHhcCccceEEe------------------eccc
Confidence 344444 344568999999999988444432210 011111 122333322 2234
Q ss_pred eEEEEECC-CCcEEee---ecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCC
Q 040693 278 GWVAMDAS-NGNVLWS---TADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVK 329 (382)
Q Consensus 278 ~v~a~d~~-tG~~~W~---~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~ 329 (382)
.|+.||++ .|..+=- .+......-++..+..+|+|.+++. .|.|..+|.-
T Consensus 538 svFrIDPR~~gNKi~v~esKdY~tKn~Fss~~tTesGyIa~as~--kGDirLyDRi 591 (776)
T COG5167 538 SVFRIDPRARGNKIKVVESKDYKTKNKFSSGMTTESGYIAAASR--KGDIRLYDRI 591 (776)
T ss_pred ceEEecccccCCceeeeeehhccccccccccccccCceEEEecC--CCceeeehhh
Confidence 57777776 3321111 1111111123334467888888884 8999988853
No 350
>PRK03999 translation initiation factor IF-5A; Provisional
Probab=20.91 E-value=4.6e+02 Score=21.25 Aligned_cols=59 Identities=15% Similarity=0.102 Sum_probs=34.7
Q ss_pred eEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecC
Q 040693 278 GWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTG 340 (382)
Q Consensus 278 ~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~ 340 (382)
++...|+.||+.. +...+.......+.+..-...|+-. +++.++.+|++|++.. .++.+
T Consensus 43 r~k~knL~tG~~~-e~~~~s~d~~e~~~ve~~~~qylY~--dg~~~~fMd~eTyeq~-~i~~~ 101 (129)
T PRK03999 43 RIVAIGIFDGQKR-SLVQPVDAKVEVPIIEKKTGQVLSI--MGDVVQLMDLETYETF-EIPIP 101 (129)
T ss_pred EEEEEECCCCCEE-EEEecCCCceeeeeEEeEEEEEEEe--cCCEEEEecCCCceEE-EecCC
Confidence 4788899999854 4433332222233333334555555 2568899999999954 44443
No 351
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=20.89 E-value=1.4e+02 Score=30.96 Aligned_cols=58 Identities=17% Similarity=0.273 Sum_probs=42.2
Q ss_pred EeCCEEEEeeecCCCcEEEEeCCCCcEeEEEecCCc-eecceEEeCCEEEEEeCceeEeecC
Q 040693 307 VANGVLFGGSTYRQGPIYAMDVKTGKILWSYDTGAT-IYGGASVSNGCIYMGNGYKVTVGFG 367 (382)
Q Consensus 307 ~~~~~v~~~~~~~~g~l~~ld~~tG~ilw~~~~~~~-~~~~p~~~~g~lyv~~~~g~~~~~~ 367 (382)
.++++++.++ .|.++.+|+.++|.+-+..-... +.+.-..-||.|..++..+..+-+|
T Consensus 139 TaDgil~s~a---~g~v~i~D~stqk~~~el~~h~d~vQSa~WseDG~llatscKdkqirif 197 (1012)
T KOG1445|consen 139 TADGILASGA---HGSVYITDISTQKTAVELSGHTDKVQSADWSEDGKLLATSCKDKQIRIF 197 (1012)
T ss_pred CcCceEEecc---CceEEEEEcccCceeecccCCchhhhccccccCCceEeeecCCcceEEe
Confidence 3577888777 79999999999999887765444 3444455788888888877755443
No 352
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=20.09 E-value=8.3e+02 Score=23.92 Aligned_cols=99 Identities=8% Similarity=0.073 Sum_probs=53.1
Q ss_pred CCCceEEEEECCCCcEEeeecCCCCCCCCcceEEeCCEEEEeeecCCCcEEEEeCCC---CcEeEEEecCCceecceEE-
Q 040693 274 TIAGGWVAMDASNGNVLWSTADPSNGTAPGPVTVANGVLFGGSTYRQGPIYAMDVKT---GKILWSYDTGATIYGGASV- 349 (382)
Q Consensus 274 ~~~g~v~a~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~v~~~~~~~~g~l~~ld~~t---G~ilw~~~~~~~~~~~p~~- 349 (382)
..++.|...|++.|...=....+.-......+......=++++.+++|.+...|+.+ |+++-.++-...-+.+.-.
T Consensus 277 S~DgsIrIWDiRs~~~~~~~~~kAh~sDVNVISWnr~~~lLasG~DdGt~~iwDLR~~~~~~pVA~fk~Hk~pItsieW~ 356 (440)
T KOG0302|consen 277 SCDGSIRIWDIRSGPKKAAVSTKAHNSDVNVISWNRREPLLASGGDDGTLSIWDLRQFKSGQPVATFKYHKAPITSIEWH 356 (440)
T ss_pred ecCceEEEEEecCCCccceeEeeccCCceeeEEccCCcceeeecCCCceEEEEEhhhccCCCcceeEEeccCCeeEEEec
Confidence 456889999999883322222111011112222222222344444689999999874 6666666543322222222
Q ss_pred -eCCEEEEEeCceeEeecCCccCC
Q 040693 350 -SNGCIYMGNGYKVTVGFGNKNFT 372 (382)
Q Consensus 350 -~~g~lyv~~~~g~~~~~~~~~~~ 372 (382)
.+..++.+++...++.++.+...
T Consensus 357 p~e~s~iaasg~D~QitiWDlsvE 380 (440)
T KOG0302|consen 357 PHEDSVIAASGEDNQITIWDLSVE 380 (440)
T ss_pred cccCceEEeccCCCcEEEEEeecc
Confidence 45777777777777777665543
No 353
>smart00284 OLF Olfactomedin-like domains.
Probab=20.02 E-value=6.9e+02 Score=22.96 Aligned_cols=150 Identities=11% Similarity=0.074 Sum_probs=73.9
Q ss_pred ceEEEEECCCCcEEEEEecCCCcccccccccCCCCCCCCCCCCCCCCCCCceEEEeeeCceeecEEEEEcc-CcEEE--E
Q 040693 143 NSLLALDLDTGKIVWYKQLGGYDVWFGACNWYLNPNCPPGPSPDADFGEAPMMLSMYRNKVKHDIVVAVQK-SGFAW--A 219 (382)
Q Consensus 143 g~v~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~g~~~~~v~~~~~-~g~l~--a 219 (382)
..|+.+|+.++++.=+..++... + . ..-|.+ . +...-.++.+|...=..|++... .|.++ -
T Consensus 94 ~~iiKydL~t~~v~~~~~Lp~a~-y--~---~~~~Y~---------~-~~~sdiDlAvDE~GLWvIYat~~~~g~ivvSk 157 (255)
T smart00284 94 HDICRFDLTTETYQKEPLLNGAG-Y--N---NRFPYA---------W-GGFSDIDLAVDENGLWVIYATEQNAGKIVISK 157 (255)
T ss_pred ccEEEEECCCCcEEEEEecCccc-c--c---cccccc---------c-CCCccEEEEEcCCceEEEEeccCCCCCEEEEe
Confidence 67999999999997555554210 0 0 000111 1 12223344334321455655533 35444 6
Q ss_pred EeCCCCCee--eeeccCCCCCCCCcccceeeeCCeEEEEecCccccccccCCCCCCCCCceEEEEECCCCcEEee-ecCC
Q 040693 220 LDRDSGSLI--WSMEAGPGGLGGGAMWGAATDERRIYTNIANSQHKNFNLKPSKNSTIAGGWVAMDASNGNVLWS-TADP 296 (382)
Q Consensus 220 ld~~tG~~~--W~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~g~v~a~d~~tG~~~W~-~~~~ 296 (382)
||+.|-++. |....+... .+..++ ..+.||+...... ....-.+++|..|++.... ++++
T Consensus 158 Lnp~tL~ve~tW~T~~~k~s--a~naFm---vCGvLY~~~s~~~------------~~~~I~yayDt~t~~~~~~~i~f~ 220 (255)
T smart00284 158 LNPATLTIENTWITTYNKRS--ASNAFM---ICGILYVTRSLGS------------KGEKVFYAYDTNTGKEGHLDIPFE 220 (255)
T ss_pred eCcccceEEEEEEcCCCccc--ccccEE---EeeEEEEEccCCC------------CCcEEEEEEECCCCccceeeeeec
Confidence 788877654 666544321 112222 5588998743110 1123367999999987754 3343
Q ss_pred CCCCCCcceEEe--CCEEEEeeecCCCcEEEEeC
Q 040693 297 SNGTAPGPVTVA--NGVLFGGSTYRQGPIYAMDV 328 (382)
Q Consensus 297 ~~~~~~~~~~~~--~~~v~~~~~~~~g~l~~ld~ 328 (382)
......+.+... +..+|+-. +|.+...|+
T Consensus 221 n~y~~~s~l~YNP~d~~LY~wd---ng~~l~Y~v 251 (255)
T smart00284 221 NMYEYISMLDYNPNDRKLYAWN---NGHLVHYDI 251 (255)
T ss_pred cccccceeceeCCCCCeEEEEe---CCeEEEEEE
Confidence 321112222121 24566554 566555543
Done!