Query 039705
Match_columns 539
No_of_seqs 355 out of 1774
Neff 7.7
Searched_HMMs 46136
Date Fri Mar 29 12:22:36 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/039705.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/039705hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF07250 Glyoxal_oxid_N: Glyox 100.0 3.9E-47 8.4E-52 369.6 23.6 237 34-276 1-243 (243)
2 KOG4441 Proteins containing BT 100.0 5.3E-39 1.2E-43 352.0 27.9 268 41-385 283-568 (571)
3 KOG4441 Proteins containing BT 100.0 3.6E-37 7.8E-42 337.6 25.8 250 110-425 283-549 (571)
4 PHA02713 hypothetical protein; 100.0 3.9E-35 8.4E-40 322.4 24.1 236 81-373 273-543 (557)
5 PHA02713 hypothetical protein; 100.0 9.6E-34 2.1E-38 311.3 25.5 254 113-424 259-535 (557)
6 cd02851 Galactose_oxidase_C_te 100.0 4.7E-35 1E-39 246.2 11.6 99 427-539 1-101 (101)
7 PF09118 DUF1929: Domain of un 100.0 2.7E-34 5.8E-39 242.1 9.6 97 432-538 1-98 (98)
8 TIGR03547 muta_rot_YjhT mutatr 100.0 1.5E-31 3.2E-36 278.5 28.7 251 105-400 11-333 (346)
9 PRK14131 N-acetylneuraminic ac 100.0 1.4E-29 3.1E-34 266.5 28.0 280 91-422 18-368 (376)
10 PHA02790 Kelch-like protein; P 100.0 1.4E-29 3E-34 274.4 24.7 218 85-371 251-478 (480)
11 PLN02153 epithiospecifier prot 100.0 3.3E-28 7.2E-33 252.9 30.2 280 88-400 5-326 (341)
12 TIGR03548 mutarot_permut cycli 100.0 1.1E-27 2.5E-32 247.0 25.6 248 106-400 8-315 (323)
13 PHA02790 Kelch-like protein; P 100.0 1.1E-27 2.4E-32 259.5 24.5 194 156-423 268-471 (480)
14 PHA03098 kelch-like protein; P 100.0 3.2E-27 6.9E-32 259.9 25.8 244 81-377 265-525 (534)
15 TIGR03547 muta_rot_YjhT mutatr 100.0 7.8E-27 1.7E-31 243.0 26.8 244 41-338 16-331 (346)
16 PRK14131 N-acetylneuraminic ac 100.0 3.3E-27 7.1E-32 248.6 24.2 291 14-369 11-374 (376)
17 PLN02153 epithiospecifier prot 100.0 3.8E-26 8.2E-31 237.5 29.5 284 18-360 5-339 (341)
18 PLN02193 nitrile-specifier pro 100.0 9.9E-26 2.1E-30 243.7 33.0 274 82-399 139-454 (470)
19 TIGR03548 mutarot_permut cycli 100.0 2.4E-26 5.3E-31 237.1 26.7 267 29-338 2-313 (323)
20 PHA03098 kelch-like protein; P 99.9 4E-26 8.6E-31 251.2 26.4 248 113-424 252-513 (534)
21 PLN02193 nitrile-specifier pro 99.9 1.2E-24 2.6E-29 235.2 29.6 264 109-425 118-413 (470)
22 KOG4693 Uncharacterized conser 99.8 1.8E-18 3.9E-23 165.2 21.8 256 102-399 14-313 (392)
23 KOG4693 Uncharacterized conser 99.8 6.6E-17 1.4E-21 154.6 20.2 262 21-338 3-312 (392)
24 KOG0379 Kelch repeat-containin 99.5 2.7E-12 5.9E-17 139.2 23.0 248 99-391 58-333 (482)
25 KOG0379 Kelch repeat-containin 99.4 5E-12 1.1E-16 137.2 20.8 209 146-400 57-287 (482)
26 KOG4152 Host cell transcriptio 99.3 5.1E-11 1.1E-15 123.1 17.8 277 90-399 17-343 (830)
27 KOG1230 Protein containing rep 99.3 1.5E-10 3.3E-15 117.3 17.0 244 111-426 78-338 (521)
28 COG3055 Uncharacterized protei 99.1 1.2E-08 2.6E-13 102.5 20.0 260 95-401 30-362 (381)
29 KOG1230 Protein containing rep 99.0 3.3E-09 7.2E-14 107.7 14.2 192 19-233 107-345 (521)
30 KOG4152 Host cell transcriptio 99.0 6.1E-09 1.3E-13 108.0 14.6 225 138-397 18-273 (830)
31 PF07250 Glyoxal_oxid_N: Glyox 99.0 1.1E-08 2.3E-13 100.5 14.2 135 222-398 48-190 (243)
32 COG3055 Uncharacterized protei 98.9 4.9E-08 1.1E-12 98.1 18.4 229 12-279 62-360 (381)
33 PF13964 Kelch_6: Kelch motif 98.9 4.9E-09 1.1E-13 77.4 6.0 50 317-377 1-50 (50)
34 smart00612 Kelch Kelch domain. 98.5 1.8E-07 3.9E-12 67.3 4.8 45 330-385 1-45 (47)
35 PF01344 Kelch_1: Kelch motif; 98.5 7.9E-08 1.7E-12 69.7 2.4 47 317-374 1-47 (47)
36 PF13964 Kelch_6: Kelch motif 98.4 2.8E-07 6.1E-12 67.9 4.6 46 102-150 2-50 (50)
37 smart00612 Kelch Kelch domain. 98.1 3.9E-06 8.4E-11 60.2 4.4 45 113-161 1-47 (47)
38 PF13418 Kelch_4: Galactose ox 98.1 2.4E-06 5.1E-11 62.5 3.1 48 317-374 1-48 (49)
39 PF07646 Kelch_2: Kelch motif; 98.1 7.8E-06 1.7E-10 59.9 5.4 49 317-374 1-49 (49)
40 PF13415 Kelch_3: Galactose ox 98.0 1.1E-05 2.3E-10 59.2 5.3 48 328-384 1-48 (49)
41 PF01344 Kelch_1: Kelch motif; 97.9 5.4E-06 1.2E-10 60.0 2.2 43 102-147 2-47 (47)
42 PF07646 Kelch_2: Kelch motif; 97.7 7.4E-05 1.6E-09 54.7 5.0 37 203-240 3-49 (49)
43 PF13415 Kelch_3: Galactose ox 97.7 7.5E-05 1.6E-09 54.6 5.0 44 111-157 1-48 (49)
44 PRK11138 outer membrane biogen 97.4 0.095 2E-06 55.6 25.5 244 82-396 132-384 (394)
45 PF13418 Kelch_4: Galactose ox 97.3 0.00044 9.4E-09 50.4 4.9 35 205-240 5-48 (49)
46 PLN02772 guanylate kinase 97.3 0.00076 1.6E-08 70.6 7.9 70 317-398 24-96 (398)
47 PRK11138 outer membrane biogen 97.1 0.47 1E-05 50.3 28.4 219 105-396 114-343 (394)
48 PLN02772 guanylate kinase 96.6 0.0047 1E-07 64.8 7.2 68 101-170 24-96 (398)
49 KOG0286 G-protein beta subunit 96.6 0.81 1.8E-05 45.8 21.8 246 82-397 79-335 (343)
50 TIGR03300 assembly_YfgL outer 96.5 1.3 2.9E-05 46.4 25.7 247 81-397 116-370 (377)
51 PRK13684 Ycf48-like protein; P 96.5 1.3 2.8E-05 46.0 29.6 197 136-397 118-322 (334)
52 KOG0310 Conserved WD40 repeat- 96.0 0.57 1.2E-05 49.6 18.3 245 80-399 48-301 (487)
53 COG4257 Vgb Streptogramin lyas 95.9 1.4 3.1E-05 43.9 19.2 228 83-385 86-324 (353)
54 PF13854 Kelch_5: Kelch motif 95.7 0.022 4.7E-07 40.1 4.5 41 314-361 1-41 (42)
55 PF10282 Lactonase: Lactonase, 95.6 3.6 7.8E-05 42.8 25.5 263 81-397 16-311 (345)
56 PRK13684 Ycf48-like protein; P 95.5 3.7 7.9E-05 42.7 24.5 75 306-397 203-279 (334)
57 KOG0310 Conserved WD40 repeat- 95.4 1 2.2E-05 47.8 17.4 250 29-341 35-302 (487)
58 TIGR03300 assembly_YfgL outer 95.0 5.5 0.00012 41.7 22.9 216 105-396 59-287 (377)
59 PF13088 BNR_2: BNR repeat-lik 94.6 2.5 5.3E-05 42.1 17.6 218 136-393 29-275 (275)
60 cd00200 WD40 WD40 domain, foun 94.5 4.5 9.8E-05 38.4 26.6 245 82-399 33-283 (289)
61 TIGR03866 PQQ_ABC_repeats PQQ- 94.4 5.6 0.00012 39.1 28.5 131 82-233 13-150 (300)
62 KOG2437 Muskelin [Signal trans 94.4 0.067 1.4E-06 56.8 5.5 160 206-397 265-456 (723)
63 TIGR03866 PQQ_ABC_repeats PQQ- 94.1 6.6 0.00014 38.6 26.4 131 82-233 55-192 (300)
64 PRK11028 6-phosphogluconolacto 94.0 8.4 0.00018 39.4 29.1 70 106-180 85-157 (330)
65 KOG0286 G-protein beta subunit 93.7 8.5 0.00018 38.7 18.2 191 160-427 76-275 (343)
66 TIGR01640 F_box_assoc_1 F-box 93.6 1.7 3.6E-05 42.3 13.7 142 80-230 70-230 (230)
67 PRK11028 6-phosphogluconolacto 93.4 10 0.00022 38.8 25.2 137 82-230 14-158 (330)
68 PF14870 PSII_BNR: Photosynthe 93.3 11 0.00024 38.6 20.3 242 89-398 5-253 (302)
69 KOG2437 Muskelin [Signal trans 93.2 0.11 2.4E-06 55.2 4.7 129 100-233 259-417 (723)
70 TIGR01640 F_box_assoc_1 F-box 93.2 2.4 5.3E-05 41.2 14.1 152 211-397 5-162 (230)
71 COG4257 Vgb Streptogramin lyas 93.1 10 0.00023 38.0 19.8 147 82-246 126-279 (353)
72 PLN00181 protein SPA1-RELATED; 92.4 15 0.00032 42.9 21.4 230 106-399 489-730 (793)
73 PF13854 Kelch_5: Kelch motif 92.3 0.16 3.6E-06 35.6 3.2 25 373-399 1-25 (42)
74 PF14870 PSII_BNR: Photosynthe 92.2 6.3 0.00014 40.4 15.7 161 20-222 134-297 (302)
75 PF08450 SGL: SMP-30/Gluconola 92.1 5.1 0.00011 39.2 14.8 156 210-422 50-213 (246)
76 KOG0315 G-protein beta subunit 90.8 18 0.00039 35.7 19.7 218 78-362 59-290 (311)
77 PF08450 SGL: SMP-30/Gluconola 90.3 10 0.00022 37.0 15.0 140 83-231 63-216 (246)
78 KOG0315 G-protein beta subunit 90.0 21 0.00046 35.3 20.7 224 108-400 48-281 (311)
79 PF07893 DUF1668: Protein of u 89.9 5.7 0.00012 41.4 13.4 118 109-234 74-213 (342)
80 PF13360 PQQ_2: PQQ-like domai 88.9 22 0.00048 34.0 23.6 107 109-232 34-144 (238)
81 COG1520 FOG: WD40-like repeat 88.7 34 0.00073 35.8 24.5 261 82-397 80-354 (370)
82 PF13360 PQQ_2: PQQ-like domai 86.9 29 0.00063 33.1 21.5 203 125-395 3-219 (238)
83 PF07893 DUF1668: Protein of u 86.1 22 0.00047 37.1 14.8 59 33-120 68-126 (342)
84 cd00200 WD40 WD40 domain, foun 86.1 31 0.00066 32.6 23.6 218 108-397 17-239 (289)
85 PF03089 RAG2: Recombination a 85.9 24 0.00051 35.6 13.8 181 145-394 83-281 (337)
86 KOG0266 WD40 repeat-containing 85.4 59 0.0013 35.2 21.6 205 98-362 201-411 (456)
87 KOG0279 G protein beta subunit 84.0 34 0.00075 34.3 13.9 137 81-231 86-225 (315)
88 PF10282 Lactonase: Lactonase, 83.5 59 0.0013 33.7 21.2 279 17-371 22-332 (345)
89 PF13088 BNR_2: BNR repeat-lik 83.0 13 0.00028 36.8 11.2 127 87-216 142-275 (275)
90 KOG0278 Serine/threonine kinas 82.5 28 0.0006 34.5 12.3 128 111-278 155-288 (334)
91 KOG0296 Angio-associated migra 81.6 43 0.00092 34.8 13.9 133 81-233 87-225 (399)
92 PTZ00421 coronin; Provisional 80.7 69 0.0015 35.2 16.5 139 81-233 149-295 (493)
93 KOG0649 WD40 repeat protein [G 77.3 79 0.0017 31.3 14.0 157 20-227 99-273 (325)
94 KOG2055 WD40 repeat protein [G 76.9 51 0.0011 35.3 13.1 131 81-230 281-419 (514)
95 KOG0289 mRNA splicing factor [ 75.8 63 0.0014 34.4 13.3 141 203-397 350-496 (506)
96 PLN02919 haloacid dehalogenase 74.9 98 0.0021 37.5 16.8 61 322-399 808-880 (1057)
97 PLN00181 protein SPA1-RELATED; 74.6 1.8E+02 0.0038 34.0 21.9 131 81-229 556-691 (793)
98 KOG0271 Notchless-like WD40 re 74.0 14 0.0003 38.6 8.0 113 108-233 123-240 (480)
99 PTZ00421 coronin; Provisional 73.1 1.5E+02 0.0033 32.6 22.6 107 111-233 87-203 (493)
100 KOG0271 Notchless-like WD40 re 73.0 6.7 0.00015 40.8 5.5 57 326-400 124-180 (480)
101 KOG0291 WD40-repeat-containing 72.9 1.8E+02 0.0039 33.3 26.6 50 81-133 331-380 (893)
102 PTZ00420 coronin; Provisional 72.8 55 0.0012 36.7 13.1 135 81-231 149-296 (568)
103 PRK01742 tolB translocation pr 72.3 1.4E+02 0.003 31.9 15.9 134 81-233 229-366 (429)
104 KOG0278 Serine/threonine kinas 71.7 1.1E+02 0.0024 30.5 13.1 135 207-399 151-289 (334)
105 KOG0272 U4/U6 small nuclear ri 71.1 1.2E+02 0.0026 32.2 14.0 183 101-339 262-452 (459)
106 PLN00033 photosystem II stabil 70.7 1.5E+02 0.0033 31.6 30.7 222 109-397 144-390 (398)
107 KOG0285 Pleiotropic regulator 70.7 89 0.0019 32.6 12.7 124 220-399 215-341 (460)
108 KOG2055 WD40 repeat protein [G 68.8 1.7E+02 0.0038 31.5 15.1 221 110-399 223-457 (514)
109 COG5184 ATS1 Alpha-tubulin sup 68.4 1.8E+02 0.0039 31.5 20.2 109 321-457 351-465 (476)
110 KOG0266 WD40 repeat-containing 65.3 2.1E+02 0.0045 31.0 23.2 232 107-424 166-411 (456)
111 KOG0272 U4/U6 small nuclear ri 63.7 84 0.0018 33.3 11.2 85 84-177 287-372 (459)
112 PF03089 RAG2: Recombination a 61.2 25 0.00053 35.5 6.5 83 313-400 83-176 (337)
113 KOG1036 Mitotic spindle checkp 59.6 1E+02 0.0022 31.4 10.6 89 74-177 71-160 (323)
114 KOG0303 Actin-binding protein 59.4 1.1E+02 0.0024 32.3 11.1 81 80-169 154-236 (472)
115 cd00216 PQQ_DH Dehydrogenases 59.0 2.7E+02 0.0059 30.4 15.2 27 208-234 58-87 (488)
116 KOG0641 WD40 repeat protein [G 58.6 1.9E+02 0.004 28.3 16.2 27 208-234 239-267 (350)
117 PTZ00420 coronin; Provisional 58.1 3.1E+02 0.0068 30.8 25.6 107 111-233 86-202 (568)
118 PLN00033 photosystem II stabil 57.6 2.7E+02 0.0058 29.8 25.1 72 308-397 271-347 (398)
119 COG1520 FOG: WD40-like repeat 56.9 2E+02 0.0044 29.9 13.3 136 208-396 65-205 (370)
120 PF12768 Rax2: Cortical protei 56.9 91 0.002 31.6 10.1 99 18-144 24-130 (281)
121 cd02849 CGTase_C_term Cgtase ( 54.9 1.1E+02 0.0023 24.8 8.4 75 432-534 2-77 (81)
122 PF07433 DUF1513: Protein of u 53.8 2.6E+02 0.0056 28.8 12.7 97 211-336 16-118 (305)
123 KOG0291 WD40-repeat-containing 53.7 3.5E+02 0.0076 31.1 14.4 50 81-133 459-508 (893)
124 PF07433 DUF1513: Protein of u 50.5 2E+02 0.0044 29.5 11.3 110 30-169 4-120 (305)
125 PRK04792 tolB translocation pr 50.3 3.6E+02 0.0077 29.1 15.5 91 81-177 287-377 (448)
126 PRK03629 tolB translocation pr 49.9 3.5E+02 0.0076 28.9 19.0 141 81-237 224-371 (429)
127 PF15418 DUF4625: Domain of un 49.7 1.1E+02 0.0025 27.2 8.4 90 431-526 13-115 (132)
128 KOG0316 Conserved WD40 repeat- 49.6 91 0.002 30.8 8.2 89 81-177 82-170 (307)
129 TIGR03075 PQQ_enz_alc_DH PQQ-d 48.7 2.3E+02 0.0049 31.5 12.5 121 208-368 66-196 (527)
130 KOG0301 Phospholipase A2-activ 48.5 4.6E+02 0.0099 29.9 14.2 61 154-227 145-207 (745)
131 KOG0263 Transcription initiati 47.4 99 0.0022 35.1 9.2 93 74-177 553-646 (707)
132 TIGR02608 delta_60_rpt delta-6 46.5 19 0.0004 27.0 2.4 17 383-399 6-22 (55)
133 KOG0647 mRNA export protein (c 45.6 3.5E+02 0.0077 27.7 13.0 101 110-231 82-187 (347)
134 PF13540 RCC1_2: Regulator of 44.3 28 0.00062 22.3 2.8 21 378-399 8-28 (30)
135 TIGR02800 propeller_TolB tol-p 42.3 4.2E+02 0.0091 27.6 17.9 140 81-236 215-361 (417)
136 PRK03629 tolB translocation pr 42.3 4.6E+02 0.0099 28.0 16.3 84 81-170 268-351 (429)
137 KOG0279 G protein beta subunit 41.1 4E+02 0.0087 27.0 13.4 88 326-438 157-251 (315)
138 KOG0318 WD40 repeat stress pro 39.3 5.7E+02 0.012 28.3 13.9 144 153-338 447-593 (603)
139 KOG0263 Transcription initiati 39.2 4.8E+02 0.01 29.9 12.8 85 296-400 558-642 (707)
140 COG3490 Uncharacterized protei 38.1 1.8E+02 0.0038 29.7 8.4 88 316-423 111-203 (366)
141 PRK04792 tolB translocation pr 37.8 5.5E+02 0.012 27.7 17.6 137 81-233 243-387 (448)
142 KOG0308 Conserved WD40 repeat- 37.5 3.4E+02 0.0074 30.6 11.1 142 207-398 80-234 (735)
143 KOG1036 Mitotic spindle checkp 36.2 2E+02 0.0043 29.4 8.5 79 296-397 117-197 (323)
144 PF08662 eIF2A: Eukaryotic tra 36.1 78 0.0017 29.9 5.6 56 325-397 108-163 (194)
145 PF08662 eIF2A: Eukaryotic tra 36.0 2.2E+02 0.0048 26.8 8.7 80 81-169 84-163 (194)
146 TIGR03075 PQQ_enz_alc_DH PQQ-d 34.4 1.7E+02 0.0036 32.5 8.6 95 268-395 69-172 (527)
147 PF12768 Rax2: Cortical protei 34.4 3E+02 0.0066 27.9 9.8 59 83-144 19-81 (281)
148 KOG1427 Uncharacterized conser 34.2 89 0.0019 31.8 5.7 91 380-492 120-214 (443)
149 KOG1332 Vesicle coat complex C 33.8 4.2E+02 0.0091 26.4 10.0 134 212-393 70-237 (299)
150 KOG0294 WD40 repeat-containing 33.4 5.6E+02 0.012 26.5 18.9 175 107-338 48-228 (362)
151 PRK02889 tolB translocation pr 32.6 6.4E+02 0.014 26.8 17.8 137 81-233 221-365 (427)
152 KOG0299 U3 snoRNP-associated p 31.6 5.3E+02 0.011 27.9 11.0 57 156-225 293-351 (479)
153 KOG0639 Transducin-like enhanc 29.5 3.5E+02 0.0076 29.7 9.4 140 212-399 432-573 (705)
154 PF00868 Transglut_N: Transglu 29.4 3.8E+02 0.0081 23.2 8.8 22 501-525 94-115 (118)
155 TIGR02658 TTQ_MADH_Hv methylam 29.1 5.5E+02 0.012 26.9 10.9 35 203-237 48-94 (352)
156 KOG0305 Anaphase promoting com 29.0 8.1E+02 0.017 26.9 20.8 129 81-228 198-331 (484)
157 KOG0640 mRNA cleavage stimulat 29.0 6.6E+02 0.014 25.9 14.9 32 99-133 111-142 (430)
158 PF07705 CARDB: CARDB; InterP 28.3 3.1E+02 0.0067 21.9 8.0 74 435-526 8-83 (101)
159 KOG0649 WD40 repeat protein [G 28.2 4.3E+02 0.0094 26.3 9.1 49 81-133 179-235 (325)
160 PRK05137 tolB translocation pr 27.9 7.5E+02 0.016 26.3 18.6 138 81-233 227-371 (435)
161 KOG0639 Transducin-like enhanc 27.2 2.3E+02 0.0049 31.0 7.5 134 82-231 442-584 (705)
162 COG3490 Uncharacterized protei 27.2 3.7E+02 0.008 27.5 8.6 83 81-167 92-179 (366)
163 KOG0308 Conserved WD40 repeat- 26.5 3.2E+02 0.007 30.8 8.7 98 296-424 96-203 (735)
164 KOG0296 Angio-associated migra 25.7 8E+02 0.017 25.8 14.1 135 210-398 74-211 (399)
165 PRK01029 tolB translocation pr 25.3 4E+02 0.0087 28.5 9.5 59 81-142 352-410 (428)
166 PRK00178 tolB translocation pr 24.9 8.3E+02 0.018 25.7 19.3 139 81-236 224-370 (430)
167 KOG0268 Sof1-like rRNA process 24.9 2.8E+02 0.006 29.2 7.4 133 211-400 158-295 (433)
168 PF13570 PQQ_3: PQQ-like domai 24.4 1.3E+02 0.0027 20.3 3.6 24 324-361 17-40 (40)
169 PF05096 Glu_cyclase_2: Glutam 24.1 7.4E+02 0.016 24.9 10.4 110 200-361 43-158 (264)
170 PF13895 Ig_2: Immunoglobulin 23.1 3E+02 0.0064 20.6 6.1 37 432-474 1-37 (80)
171 PF01436 NHL: NHL repeat; Int 22.9 96 0.0021 19.4 2.5 15 380-395 5-19 (28)
172 KOG0300 WD40 repeat-containing 22.7 8.7E+02 0.019 25.1 11.7 131 82-232 296-432 (481)
173 KOG0322 G-protein beta subunit 21.7 80 0.0017 31.6 2.8 53 324-398 258-314 (323)
174 KOG3881 Uncharacterized conser 21.5 9.9E+02 0.022 25.4 11.8 111 111-232 160-281 (412)
175 PRK04922 tolB translocation pr 21.1 1E+03 0.022 25.3 18.3 138 81-233 229-373 (433)
176 KOG1517 Guanine nucleotide bin 21.1 1.2E+03 0.025 28.5 12.1 140 210-397 1176-1324(1387)
177 smart00120 HX Hemopexin-like r 21.0 1.5E+02 0.0032 20.2 3.5 22 208-229 6-27 (45)
178 PF10670 DUF4198: Domain of un 21.0 5.2E+02 0.011 24.1 8.5 68 441-526 144-211 (215)
179 KOG0265 U5 snRNP-specific prot 20.5 1.9E+02 0.0041 29.5 5.2 53 325-398 55-111 (338)
180 PF11090 DUF2833: Protein of u 20.0 47 0.001 27.3 0.8 35 383-423 3-37 (86)
No 1
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium.
Probab=100.00 E-value=3.9e-47 Score=369.59 Aligned_cols=237 Identities=41% Similarity=0.747 Sum_probs=211.8
Q ss_pred eEEEecCCCCEEEEEccccCCCCCccCCCcccccC-CCccccccccceeEEEEECCCCCEEeCccCCCcccccceecCCC
Q 039705 34 MHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLN-PGAWQKYVDYRALAVEYDAESAAIRPLKILTDTWSSSGGLSANG 112 (539)
Q Consensus 34 ~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~-~~~~~g~~~~~~~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG 112 (539)
||++|+ +++||+++|+.+.|+|+|+|++|.||.. .+.. .+.||++++.+||+.|++++++...++.||+++++++||
T Consensus 1 mh~~~~-~~~~v~~~d~t~~g~s~~~~~~~~c~~~~~~~~-~~~d~~a~s~~yD~~tn~~rpl~v~td~FCSgg~~L~dG 78 (243)
T PF07250_consen 1 MHMALL-HNNKVIMFDRTNFGPSNISLPDGRCRDNPEDNA-LKFDGPAHSVEYDPNTNTFRPLTVQTDTFCSGGAFLPDG 78 (243)
T ss_pred CeEeEc-cCCEEEEEeCCCcccccccCCCCccccCccccc-cccCceEEEEEEecCCCcEEeccCCCCCcccCcCCCCCC
Confidence 899999 9999999999999999999999999986 3333 378999999999999999999999999999999999999
Q ss_pred cEEEecCCCCCCCeEEEEeCCC--CccceeecccccccccccceeEEccCCcEEEEcCccCCeEEEEecCC-Ccceeecc
Q 039705 113 TIVISGGWSSRGRSVRYLSGCY--HACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRREFSYEYILKEG-KRIIYDLP 189 (539)
Q Consensus 113 ~l~v~GG~~~g~~~v~~ydP~~--~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~yP~~~-~~~w~~~~ 189 (539)
+++++||+.+|.+.++.|+|+. ++++|.+.+..|..+|||+++++|+||+|+|+||+..+++|++|+.. ......++
T Consensus 79 ~ll~tGG~~~G~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~L~DG~vlIvGG~~~~t~E~~P~~~~~~~~~~~~ 158 (243)
T PF07250_consen 79 RLLQTGGDNDGNKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATTLPDGRVLIVGGSNNPTYEFWPPKGPGPGPVTLP 158 (243)
T ss_pred CEEEeCCCCccccceEEEecCCCCCCCCceECcccccCCCccccceECCCCCEEEEeCcCCCcccccCCccCCCCceeee
Confidence 9999999999999999999982 24899998766999999999999999999999999999999996632 22334566
Q ss_pred CccccCCCCCCCCcceEEEeeCCcEEEEEcCceeEeeCCCCeEEEEcccCCCCCCccCCCccEEeccc--ccCCCCCCCc
Q 039705 190 ILNETTNPSENNLYPFVFLSTDGNLFIFANDRSILLNPETNEILHVFPILRGGSRNYPASATSALLPI--KLQDPNSNAI 267 (539)
Q Consensus 190 ~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl--~~~~~~~~~~ 267 (539)
+|..+.+..+.|+||++++++||+||+++++.+++||+++|++.+.+|.||++.|+||.+|++||||| .+ +++ .
T Consensus 159 ~l~~~~~~~~~nlYP~~~llPdG~lFi~an~~s~i~d~~~n~v~~~lP~lPg~~R~YP~sgssvmLPl~~~~---~~~-~ 234 (243)
T PF07250_consen 159 FLSQTSDTLPNNLYPFVHLLPDGNLFIFANRGSIIYDYKTNTVVRTLPDLPGGPRNYPASGSSVMLPLTDTP---PNN-Y 234 (243)
T ss_pred cchhhhccCccccCceEEEcCCCCEEEEEcCCcEEEeCCCCeEEeeCCCCCCCceecCCCcceEEecCccCC---CCC-C
Confidence 77766666789999999999999999999999999999999987789999999999999999999999 54 333 5
Q ss_pred ccEEEEecC
Q 039705 268 RAEVLICGG 276 (539)
Q Consensus 268 ~g~Iyv~GG 276 (539)
..+|+||||
T Consensus 235 ~~evlvCGG 243 (243)
T PF07250_consen 235 TAEVLVCGG 243 (243)
T ss_pred CeEEEEeCC
Confidence 899999998
No 2
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=100.00 E-value=5.3e-39 Score=352.03 Aligned_cols=268 Identities=21% Similarity=0.291 Sum_probs=225.7
Q ss_pred CCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCC
Q 039705 41 NTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGW 120 (539)
Q Consensus 41 ~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~ 120 (539)
..+++|++||... ++. ....+++|||.+++|..++.|+..+|..++++++|.||++||.
T Consensus 283 ~~~~l~~vGG~~~--------~~~-------------~~~~ve~yd~~~~~w~~~a~m~~~r~~~~~~~~~~~lYv~GG~ 341 (571)
T KOG4441|consen 283 VSGKLVAVGGYNR--------QGQ-------------SLRSVECYDPKTNEWSSLAPMPSPRCRVGVAVLNGKLYVVGGY 341 (571)
T ss_pred CCCeEEEECCCCC--------CCc-------------ccceeEEecCCcCcEeecCCCCcccccccEEEECCEEEEEccc
Confidence 4689999999752 011 1346889999999999999999999999999999999999998
Q ss_pred CC---CCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccC----CeEEEE-ecCCCcceeec-cCc
Q 039705 121 SS---RGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRRE----FSYEYI-LKEGKRIIYDL-PIL 191 (539)
Q Consensus 121 ~~---g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~----~~~E~y-P~~~~~~w~~~-~~l 191 (539)
+. .++++++|||. +++|+.+++ |+.+|+.++++++ +|+||++||+++ .++|+| |.++ +|... +|+
T Consensus 342 ~~~~~~l~~ve~YD~~--~~~W~~~a~-M~~~R~~~~v~~l-~g~iYavGG~dg~~~l~svE~YDp~~~--~W~~va~m~ 415 (571)
T KOG4441|consen 342 DSGSDRLSSVERYDPR--TNQWTPVAP-MNTKRSDFGVAVL-DGKLYAVGGFDGEKSLNSVECYDPVTN--KWTPVAPML 415 (571)
T ss_pred cCCCcccceEEEecCC--CCceeccCC-ccCccccceeEEE-CCEEEEEeccccccccccEEEecCCCC--cccccCCCC
Confidence 73 36899999999 999999998 9999999999999 899999999986 479999 9998 99754 455
Q ss_pred cccCCCCCCCCcceEEEeeCCcEEEEEc--------CceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCC
Q 039705 192 NETTNPSENNLYPFVFLSTDGNLFIFAN--------DRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPN 263 (539)
Q Consensus 192 ~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--------~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~ 263 (539)
..+ +-++++..+|+||++|| +++|+|||.+|+|+ .+|+|+. +|.+ .|.++
T Consensus 416 ~~r--------~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~~W~-~~~~M~~-~R~~--~g~a~---------- 473 (571)
T KOG4441|consen 416 TRR--------SGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDPETNTWT-LIAPMNT-RRSG--FGVAV---------- 473 (571)
T ss_pred cce--------eeeEEEEECCEEEEEcCcCCCccccceEEEEcCCCCcee-ecCCccc-cccc--ceEEE----------
Confidence 432 34778899999999999 35899999999998 8999998 6664 35454
Q ss_pred CCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCC
Q 039705 264 SNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTA 342 (539)
Q Consensus 264 ~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~ 342 (539)
++++||++||.+. . ..+.++|+|||. +++|+.. +|+.+|..++.+++ +++||++||.+ |..
T Consensus 474 ---~~~~iYvvGG~~~-~----------~~~~~VE~ydp~--~~~W~~v~~m~~~rs~~g~~~~-~~~ly~vGG~~-~~~ 535 (571)
T KOG4441|consen 474 ---LNGKIYVVGGFDG-T----------SALSSVERYDPE--TNQWTMVAPMTSPRSAVGVVVL-GGKLYAVGGFD-GNN 535 (571)
T ss_pred ---ECCEEEEECCccC-C----------CccceEEEEcCC--CCceeEcccCccccccccEEEE-CCEEEEEeccc-Ccc
Confidence 3899999999873 2 356789999997 6999998 89999998886655 99999999976 332
Q ss_pred CcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEee
Q 039705 343 GWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVV 385 (539)
Q Consensus 343 g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~l 385 (539)
-+.++|+|||++| +|+..+++...|...+++++
T Consensus 536 -------~l~~ve~ydp~~d---~W~~~~~~~~~~~~~~~~~~ 568 (571)
T KOG4441|consen 536 -------NLNTVECYDPETD---TWTEVTEPESGRGGAGVAVI 568 (571)
T ss_pred -------ccceeEEcCCCCC---ceeeCCCccccccCcceEEe
Confidence 3458999999999 99999998888887777665
No 3
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=100.00 E-value=3.6e-37 Score=337.61 Aligned_cols=250 Identities=20% Similarity=0.302 Sum_probs=208.6
Q ss_pred CCCcEEEecCCCC---CCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCcc-C----CeEEEE-ecC
Q 039705 110 ANGTIVISGGWSS---RGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRR-E----FSYEYI-LKE 180 (539)
Q Consensus 110 ~dG~l~v~GG~~~---g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~-~----~~~E~y-P~~ 180 (539)
..+.|+++||... ..+++++|||. ++.|..+++ |+.+|..++++++ +|+|||+||.+ + .++|+| |.+
T Consensus 283 ~~~~l~~vGG~~~~~~~~~~ve~yd~~--~~~w~~~a~-m~~~r~~~~~~~~-~~~lYv~GG~~~~~~~l~~ve~YD~~~ 358 (571)
T KOG4441|consen 283 VSGKLVAVGGYNRQGQSLRSVECYDPK--TNEWSSLAP-MPSPRCRVGVAVL-NGKLYVVGGYDSGSDRLSSVERYDPRT 358 (571)
T ss_pred CCCeEEEECCCCCCCcccceeEEecCC--cCcEeecCC-CCcccccccEEEE-CCEEEEEccccCCCcccceEEEecCCC
Confidence 4689999999863 36899999999 999999998 9999998888888 89999999999 3 479999 999
Q ss_pred CCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-------ceeEeeCCCCeEEEEcccCCCCCCccCCCccEE
Q 039705 181 GKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-------RSILLNPETNEILHVFPILRGGSRNYPASATSA 253 (539)
Q Consensus 181 ~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-------~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av 253 (539)
+ +|...+.|...+. -+.++..+|+||++||. ++|+|||++|+|+ ..++|+. +|.. +|+++
T Consensus 359 ~--~W~~~a~M~~~R~-------~~~v~~l~g~iYavGG~dg~~~l~svE~YDp~~~~W~-~va~m~~-~r~~--~gv~~ 425 (571)
T KOG4441|consen 359 N--QWTPVAPMNTKRS-------DFGVAVLDGKLYAVGGFDGEKSLNSVECYDPVTNKWT-PVAPMLT-RRSG--HGVAV 425 (571)
T ss_pred C--ceeccCCccCccc-------cceeEEECCEEEEEeccccccccccEEEecCCCCccc-ccCCCCc-ceee--eEEEE
Confidence 8 8987655544432 25688899999999995 4899999999998 8999987 5542 23333
Q ss_pred ecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEE
Q 039705 254 LLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVL 332 (539)
Q Consensus 254 ~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~ 332 (539)
++++||++||.+... ..++++|+|||. +++|+.. +|+.+|.+++++++ +|+||
T Consensus 426 -------------~~g~iYi~GG~~~~~----------~~l~sve~YDP~--t~~W~~~~~M~~~R~~~g~a~~-~~~iY 479 (571)
T KOG4441|consen 426 -------------LGGKLYIIGGGDGSS----------NCLNSVECYDPE--TNTWTLIAPMNTRRSGFGVAVL-NGKIY 479 (571)
T ss_pred -------------ECCEEEEEcCcCCCc----------cccceEEEEcCC--CCceeecCCcccccccceEEEE-CCEEE
Confidence 389999999987321 257899999997 6999998 99999999997766 99999
Q ss_pred EEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCCCCCccCCCCCCCc
Q 039705 333 IINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPHSRYNLTSGSKYPT 412 (539)
Q Consensus 333 vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~~~~~~~~~~~p~ 412 (539)
++||.+ +. ....++|+|||+++ +|+.+++|+.+|..++++++ ++++|+.||.+.. .+
T Consensus 480 vvGG~~-~~-------~~~~~VE~ydp~~~---~W~~v~~m~~~rs~~g~~~~--~~~ly~vGG~~~~--------~~-- 536 (571)
T KOG4441|consen 480 VVGGFD-GT-------SALSSVERYDPETN---QWTMVAPMTSPRSAVGVVVL--GGKLYAVGGFDGN--------NN-- 536 (571)
T ss_pred EECCcc-CC-------CccceEEEEcCCCC---ceeEcccCccccccccEEEE--CCEEEEEecccCc--------cc--
Confidence 999987 42 23447999999999 99999999999999999988 9999999995443 22
Q ss_pred ceeeEEecCCCCC
Q 039705 413 ELRIEKFYPPYFD 425 (539)
Q Consensus 413 ~~~vE~y~Ppyl~ 425 (539)
..+||+|+|..=.
T Consensus 537 l~~ve~ydp~~d~ 549 (571)
T KOG4441|consen 537 LNTVECYDPETDT 549 (571)
T ss_pred cceeEEcCCCCCc
Confidence 5789999998744
No 4
>PHA02713 hypothetical protein; Provisional
Probab=100.00 E-value=3.9e-35 Score=322.36 Aligned_cols=236 Identities=11% Similarity=0.129 Sum_probs=193.2
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCC-C--CCCeEEEEeCCCCccceeecccccccccccceeEE
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWS-S--RGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHI 157 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~-~--g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~ 157 (539)
...+|||.+++|+.++.++..++..+++..+++||++||.. . ..+++++|||. +++|.++++ |+.+|.++++++
T Consensus 273 ~v~~yd~~~~~W~~l~~mp~~r~~~~~a~l~~~IYviGG~~~~~~~~~~v~~Yd~~--~n~W~~~~~-m~~~R~~~~~~~ 349 (557)
T PHA02713 273 CILVYNINTMEYSVISTIPNHIINYASAIVDNEIIIAGGYNFNNPSLNKVYKINIE--NKIHVELPP-MIKNRCRFSLAV 349 (557)
T ss_pred CEEEEeCCCCeEEECCCCCccccceEEEEECCEEEEEcCCCCCCCccceEEEEECC--CCeEeeCCC-CcchhhceeEEE
Confidence 35789999999999999988887777888999999999974 1 25789999999 999999997 999999998888
Q ss_pred ccCCcEEEEcCccC----CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC------------
Q 039705 158 LPDGSFIVVGGRRE----FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND------------ 220 (539)
Q Consensus 158 L~dG~VyvvGG~~~----~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~------------ 220 (539)
+ +|+|||+||.++ .++|+| |.++ +|...+.|+..+. -+++++.+|+||++||.
T Consensus 350 ~-~g~IYviGG~~~~~~~~sve~Ydp~~~--~W~~~~~mp~~r~-------~~~~~~~~g~IYviGG~~~~~~~~~~~~~ 419 (557)
T PHA02713 350 I-DDTIYAIGGQNGTNVERTIECYTMGDD--KWKMLPDMPIALS-------SYGMCVLDQYIYIIGGRTEHIDYTSVHHM 419 (557)
T ss_pred E-CCEEEEECCcCCCCCCceEEEEECCCC--eEEECCCCCcccc-------cccEEEECCEEEEEeCCCccccccccccc
Confidence 8 899999999864 469999 9988 9986655544322 14567789999999984
Q ss_pred -------------ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCC
Q 039705 221 -------------RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGK 287 (539)
Q Consensus 221 -------------~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~ 287 (539)
++++|||++|+|+ .+++|+. +|.. +++++ ++++|||+||.+. ..
T Consensus 420 ~~~~~~~~~~~~~~ve~YDP~td~W~-~v~~m~~-~r~~--~~~~~-------------~~~~IYv~GG~~~-~~----- 476 (557)
T PHA02713 420 NSIDMEEDTHSSNKVIRYDTVNNIWE-TLPNFWT-GTIR--PGVVS-------------HKDDIYVVCDIKD-EK----- 476 (557)
T ss_pred ccccccccccccceEEEECCCCCeEe-ecCCCCc-cccc--CcEEE-------------ECCEEEEEeCCCC-CC-----
Confidence 2789999999998 8999987 5553 34444 3899999999752 11
Q ss_pred CcccccCCceEEEEeeCCC-Cceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCC
Q 039705 288 GEFMNALQDCGRIEITNKS-ATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINE 365 (539)
Q Consensus 288 ~~~~~a~~s~~~~d~~~~~-~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~ 365 (539)
...+++|+|||. + ++|+.. +|+.+|..++++++ +|+|||+||.+ |. .++|+|||.|+
T Consensus 477 ----~~~~~ve~Ydp~--~~~~W~~~~~m~~~r~~~~~~~~-~~~iyv~Gg~~-~~----------~~~e~yd~~~~--- 535 (557)
T PHA02713 477 ----NVKTCIFRYNTN--TYNGWELITTTESRLSALHTILH-DNTIMMLHCYE-SY----------MLQDTFNVYTY--- 535 (557)
T ss_pred ----ccceeEEEecCC--CCCCeeEccccCcccccceeEEE-CCEEEEEeeec-ce----------eehhhcCcccc---
Confidence 123468999997 5 799997 99999999987776 99999999986 32 27899999999
Q ss_pred ceEecCCC
Q 039705 366 RFSELTPT 373 (539)
Q Consensus 366 ~Wt~~a~~ 373 (539)
+|+.+++.
T Consensus 536 ~W~~~~~~ 543 (557)
T PHA02713 536 EWNHICHQ 543 (557)
T ss_pred cccchhhh
Confidence 99998764
No 5
>PHA02713 hypothetical protein; Provisional
Probab=100.00 E-value=9.6e-34 Score=311.33 Aligned_cols=254 Identities=11% Similarity=0.112 Sum_probs=193.3
Q ss_pred cEEEecCCCC-CCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccC-----CeEEEE-ecCCCcce
Q 039705 113 TIVISGGWSS-RGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRRE-----FSYEYI-LKEGKRII 185 (539)
Q Consensus 113 ~l~v~GG~~~-g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~-----~~~E~y-P~~~~~~w 185 (539)
.+++.||... ....+++|||. +++|..+++ |+.+|.+++++++ +|+|||+||.+. .++|+| |.++ +|
T Consensus 259 ~l~~~~g~~~~~~~~v~~yd~~--~~~W~~l~~-mp~~r~~~~~a~l-~~~IYviGG~~~~~~~~~~v~~Yd~~~n--~W 332 (557)
T PHA02713 259 CLVCHDTKYNVCNPCILVYNIN--TMEYSVIST-IPNHIINYASAIV-DNEIIIAGGYNFNNPSLNKVYKINIENK--IH 332 (557)
T ss_pred EEEEecCccccCCCCEEEEeCC--CCeEEECCC-CCccccceEEEEE-CCEEEEEcCCCCCCCccceEEEEECCCC--eE
Confidence 3555555321 23568999999 999999997 9999998888888 899999999742 468999 9988 99
Q ss_pred eeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-------ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccc
Q 039705 186 YDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-------RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIK 258 (539)
Q Consensus 186 ~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-------~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~ 258 (539)
...+.|...+. -++++..+|+||++||. ++|+|||.+|+|. .+++||. +|.. .++++
T Consensus 333 ~~~~~m~~~R~-------~~~~~~~~g~IYviGG~~~~~~~~sve~Ydp~~~~W~-~~~~mp~-~r~~--~~~~~----- 396 (557)
T PHA02713 333 VELPPMIKNRC-------RFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWK-MLPDMPI-ALSS--YGMCV----- 396 (557)
T ss_pred eeCCCCcchhh-------ceeEEEECCEEEEECCcCCCCCCceEEEEECCCCeEE-ECCCCCc-cccc--ccEEE-----
Confidence 86665544322 25678889999999994 3899999999998 7999998 5553 23333
Q ss_pred cCCCCCCCcccEEEEecCCCCCc-ccccC--CC----cccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCc
Q 039705 259 LQDPNSNAIRAEVLICGGAKPEA-GVLAG--KG----EFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGD 330 (539)
Q Consensus 259 ~~~~~~~~~~g~Iyv~GG~~~~~-~~~~~--~~----~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~ 330 (539)
++++||++||.+... +.... +. .....++++++|||. +++|+.. +|+.+|..++++++ +|+
T Consensus 397 --------~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~--td~W~~v~~m~~~r~~~~~~~~-~~~ 465 (557)
T PHA02713 397 --------LDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTV--NNIWETLPNFWTGTIRPGVVSH-KDD 465 (557)
T ss_pred --------ECCEEEEEeCCCcccccccccccccccccccccccceEEEECCC--CCeEeecCCCCcccccCcEEEE-CCE
Confidence 389999999975210 00000 00 000125789999997 6999997 99999999987766 999
Q ss_pred EEEEcCcCCCCCCcccCCCCCCccEEEcCCC-CCCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCCCCCccCCCCC
Q 039705 331 VLIINGAKKGTAGWNFATDPNTTPVLYEPDD-PINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPHSRYNLTSGSK 409 (539)
Q Consensus 331 I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t-~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~~~~~~~~~~ 409 (539)
|||+||.+ +.. .-...+|+|||++ + +|+.+++|+.+|..|+++++ ||+|||+||....
T Consensus 466 IYv~GG~~-~~~------~~~~~ve~Ydp~~~~---~W~~~~~m~~~r~~~~~~~~--~~~iyv~Gg~~~~--------- 524 (557)
T PHA02713 466 IYVVCDIK-DEK------NVKTCIFRYNTNTYN---GWELITTTESRLSALHTILH--DNTIMMLHCYESY--------- 524 (557)
T ss_pred EEEEeCCC-CCC------ccceeEEEecCCCCC---CeeEccccCcccccceeEEE--CCEEEEEeeecce---------
Confidence 99999976 221 1122589999999 9 99999999999999999998 9999999996431
Q ss_pred CCcceeeEEecCCCC
Q 039705 410 YPTELRIEKFYPPYF 424 (539)
Q Consensus 410 ~p~~~~vE~y~Ppyl 424 (539)
..+|+|+|..=
T Consensus 525 ----~~~e~yd~~~~ 535 (557)
T PHA02713 525 ----MLQDTFNVYTY 535 (557)
T ss_pred ----eehhhcCcccc
Confidence 26899999764
No 6
>cd02851 Galactose_oxidase_C_term Galactose oxidase C-terminus domain. Galactose oxidase is an extracellular monomeric enzyme which catalyses the stereospecific oxidation of a broad range of primary alcohol substrates and possesses a unique mononuclear copper site essential for catalysing a two-electron transfer reaction during the oxidation of primary alcohols to corresponding aldehydes. The second redox active center necessary for the reaction was found to be situated at a tyrosine residue. The C-terminus of galactose oxidase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=100.00 E-value=4.7e-35 Score=246.18 Aligned_cols=99 Identities=25% Similarity=0.297 Sum_probs=88.9
Q ss_pred CcCCCCCceeecCCC-ceeecCCEEEEEEEecccccccCcEEEEEEcCCccccccCCCceeEeccceeeeecCCceEEEE
Q 039705 427 SFASYRPSIVSKFKG-KMLKYGQNFVIQFKLDELEVSLNDLKVTMYAPPFTTHGVSMGQRLLVPATKELIDVGSGIFQVS 505 (539)
Q Consensus 427 ~~~~~RP~i~~~~~p-~~~~~g~~~~v~~~~~~~~~~~~~~~v~l~~~~~~TH~~n~~QR~~~L~~~~~~~~g~~~~~~~ 505 (539)
|.+|.||+|+++ | .+++||++|+|+++. .+.+|+|+|++|+||++|||||+|+|+++. ..+ .+++
T Consensus 1 g~~a~RP~I~~~--p~~~i~yG~~f~v~~~~-------~i~~v~Lvr~~~~THs~~~~QR~v~L~~~~--~~~---~~~~ 66 (101)
T cd02851 1 GTLASRPVITSA--STQTAKVGDTITVSTDS-------PISSASLVRYGSATHTVNTDQRRIPLTLFS--VGG---NSYS 66 (101)
T ss_pred CCCCCCCeeccC--CccccccCCEEEEEEec-------cceEEEEEecccccccccCCccEEEeeeEe--cCC---CEEE
Confidence 356899999999 9 899999999999973 489999999999999999999999999975 223 3567
Q ss_pred EEcCCCCCcCCCcceEEEEEc-CCCCCccEEEEeC
Q 039705 506 VMAPPTAKIAPPSFYLLFVVY-RQVPSPGTWVQIG 539 (539)
Q Consensus 506 ~~~P~~~~~~ppG~ymlf~~~-~gvPS~~~~v~i~ 539 (539)
+++|+|++|||||||||||++ +||||+|+||+|+
T Consensus 67 v~~P~n~~vaPPGyYmLFvv~~~GvPS~a~wV~i~ 101 (101)
T cd02851 67 VQIPSDPGVALPGYYMLFVMNSAGVPSVAKTIRIT 101 (101)
T ss_pred EEcCCCCCcCCCcCeEEEEECCCCcccccEEEEeC
Confidence 888999999999999999995 9999999999986
No 7
>PF09118 DUF1929: Domain of unknown function (DUF1929); InterPro: IPR015202 This domain adopts a secondary structure consisting of a bundle of seven, mostly antiparallel, beta-strands surrounding a hydrophobic core. The 7 strands are arranged in 2 sheets, in a Greek-key topology. Its precise function has not yet been defined, though it is mostly found in sugar-utilising enzymes, such as galactose oxidase. PDB: 2JKX_A 2EIC_A 1K3I_A 1GOH_A 2EIB_A 2WQ8_A 2VZ1_A 1GOF_A 2VZ3_A 1GOG_A ....
Probab=100.00 E-value=2.7e-34 Score=242.06 Aligned_cols=97 Identities=40% Similarity=0.724 Sum_probs=68.2
Q ss_pred CCceeecCCCceeecCCEEEEEEEecccccccCcEEEEEEcCCccccccCCCceeEeccceeeeecCCceEEEEEEcCCC
Q 039705 432 RPSIVSKFKGKMLKYGQNFVIQFKLDELEVSLNDLKVTMYAPPFTTHGVSMGQRLLVPATKELIDVGSGIFQVSVMAPPT 511 (539)
Q Consensus 432 RP~i~~~~~p~~~~~g~~~~v~~~~~~~~~~~~~~~v~l~~~~~~TH~~n~~QR~~~L~~~~~~~~g~~~~~~~~~~P~~ 511 (539)
||+|+++ |..+.||++|+|+++.++ ..++.+|+|+|+||+|||+|||||+|+|++.. ..+ +++++++|+|
T Consensus 1 RP~i~~~--p~~i~yg~~~tv~~~~~~---~~~~~~v~L~~~~~~THs~~~~QR~v~L~~~~--~~~---~~~~v~~P~~ 70 (98)
T PF09118_consen 1 RPVITSA--PTTIKYGQTFTVTVTVPS---AASIVKVSLVRPGFVTHSFNMGQRMVELEFVS--GGG---NTVTVTAPPN 70 (98)
T ss_dssp ---EEES---SEEETT-EEEEEE--SS------ESEEEEEE--EEETTB-SS-EEEEE-EEE--ESS---SEEEEE--S-
T ss_pred CCccccC--CCeEecCCEEEEEEECCC---ccceEEEEEEeCCcccccccCCCCEEeeeeec--CCC---CEEEEECCCC
Confidence 9999998 999999999999998653 24789999999999999999999999999943 233 6899999999
Q ss_pred CCcCCCcceEEEEEc-CCCCCccEEEEe
Q 039705 512 AKIAPPSFYLLFVVY-RQVPSPGTWVQI 538 (539)
Q Consensus 512 ~~~~ppG~ymlf~~~-~gvPS~~~~v~i 538 (539)
++|||||||||||++ +||||+|+||+|
T Consensus 71 ~~vaPPG~YmLFvv~~~GvPS~a~wV~v 98 (98)
T PF09118_consen 71 PNVAPPGYYMLFVVNDDGVPSVAKWVQV 98 (98)
T ss_dssp TTTS-SEEEEEEEEETTS-B---EEEEE
T ss_pred CccCCCcCEEEEEEcCCCcccccEEEEC
Confidence 999999999999999 999999999997
No 8
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=100.00 E-value=1.5e-31 Score=278.53 Aligned_cols=251 Identities=15% Similarity=0.118 Sum_probs=180.0
Q ss_pred cceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccc-cccccceeEEccCCcEEEEcCccC----------Ce
Q 039705 105 SGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELS-AKRWFSTQHILPDGSFIVVGGRRE----------FS 173 (539)
Q Consensus 105 ~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~-~~R~y~s~~~L~dG~VyvvGG~~~----------~~ 173 (539)
+.+++.+++|||+||.. .+.+.+||+.+.+++|+++++ |+ .+|..++++++ |++|||+||... .+
T Consensus 11 ~~~~~~~~~vyv~GG~~--~~~~~~~d~~~~~~~W~~l~~-~p~~~R~~~~~~~~-~~~iYv~GG~~~~~~~~~~~~~~~ 86 (346)
T TIGR03547 11 GTGAIIGDKVYVGLGSA--GTSWYKLDLKKPSKGWQKIAD-FPGGPRNQAVAAAI-DGKLYVFGGIGKANSEGSPQVFDD 86 (346)
T ss_pred ceEEEECCEEEEEcccc--CCeeEEEECCCCCCCceECCC-CCCCCcccceEEEE-CCEEEEEeCCCCCCCCCcceeccc
Confidence 44667799999999974 367889996322789999997 98 58988888887 899999999742 35
Q ss_pred EEEE-ecCCCcceeeccC-ccccCCCCCCCCcceEEE-eeCCcEEEEEcC------------------------------
Q 039705 174 YEYI-LKEGKRIIYDLPI-LNETTNPSENNLYPFVFL-STDGNLFIFAND------------------------------ 220 (539)
Q Consensus 174 ~E~y-P~~~~~~w~~~~~-l~~~~~~~~~~~yp~~~~-~~~G~Iyv~Gg~------------------------------ 220 (539)
+|+| |.++ +|...+. +... .+.++.+ +.+|+||++||.
T Consensus 87 v~~Yd~~~~--~W~~~~~~~p~~-------~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (346)
T TIGR03547 87 VYRYDPKKN--SWQKLDTRSPVG-------LLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYF 157 (346)
T ss_pred EEEEECCCC--EEecCCCCCCCc-------ccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHh
Confidence 8999 9988 9976542 2221 2223344 689999999984
Q ss_pred -----------ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCc
Q 039705 221 -----------RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGE 289 (539)
Q Consensus 221 -----------~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~ 289 (539)
++|+|||.+|+|+ .+++||..+|.. +++++ ++++|||+||.....
T Consensus 158 ~~~~~~~~~~~~v~~YDp~t~~W~-~~~~~p~~~r~~--~~~~~-------------~~~~iyv~GG~~~~~-------- 213 (346)
T TIGR03547 158 SQPPEDYFWNKNVLSYDPSTNQWR-NLGENPFLGTAG--SAIVH-------------KGNKLLLINGEIKPG-------- 213 (346)
T ss_pred CCChhHcCccceEEEEECCCCcee-ECccCCCCcCCC--ceEEE-------------ECCEEEEEeeeeCCC--------
Confidence 4789999999998 799998533432 22222 389999999975211
Q ss_pred ccccCCceEEEEeeCCCCceeee-ccCCCce-------eceeEEecCCcEEEEcCcCCCCC--------Cccc-CCCCCC
Q 039705 290 FMNALQDCGRIEITNKSATWQRE-MMPSPRV-------MGEMLLLPTGDVLIINGAKKGTA--------GWNF-ATDPNT 352 (539)
Q Consensus 290 ~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~-------~~~~vvlpdG~I~vvGG~~~g~~--------g~~~-~~~~~~ 352 (539)
..+..+++||+...+++|+.. +|+.+|. .+.+++ .+|+|||+||.+.... .+.. ....+.
T Consensus 214 --~~~~~~~~y~~~~~~~~W~~~~~m~~~r~~~~~~~~~~~a~~-~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (346)
T TIGR03547 214 --LRTAEVKQYLFTGGKLEWNKLPPLPPPKSSSQEGLAGAFAGI-SNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAW 290 (346)
T ss_pred --ccchheEEEEecCCCceeeecCCCCCCCCCccccccEEeeeE-ECCEEEEeecCCCCCchhhhhcCCccccCCCCcee
Confidence 112345667764335799987 9988762 333444 4999999999752100 0000 001123
Q ss_pred ccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCCC
Q 039705 353 TPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPHS 400 (539)
Q Consensus 353 ~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~ 400 (539)
++|+|||+++ +|+.+++|+.+|.+|+++++ +|+|||+||....
T Consensus 291 ~~e~yd~~~~---~W~~~~~lp~~~~~~~~~~~--~~~iyv~GG~~~~ 333 (346)
T TIGR03547 291 SSEVYALDNG---KWSKVGKLPQGLAYGVSVSW--NNGVLLIGGENSG 333 (346)
T ss_pred EeeEEEecCC---cccccCCCCCCceeeEEEEc--CCEEEEEeccCCC
Confidence 6899999999 99999999999998877666 9999999997554
No 9
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.97 E-value=1.4e-29 Score=266.49 Aligned_cols=280 Identities=15% Similarity=0.133 Sum_probs=190.2
Q ss_pred CEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccc-cccccceeEEccCCcEEEEcCc
Q 039705 91 AIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELS-AKRWFSTQHILPDGSFIVVGGR 169 (539)
Q Consensus 91 ~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~-~~R~y~s~~~L~dG~VyvvGG~ 169 (539)
.++.++.++..+-...++..+++|||+||.. .+.+.+||....+++|..+++ |+ .+|..++++++ +++|||+||.
T Consensus 18 ~~~~l~~lP~~~~~~~~~~~~~~iyv~gG~~--~~~~~~~d~~~~~~~W~~l~~-~p~~~r~~~~~v~~-~~~IYV~GG~ 93 (376)
T PRK14131 18 NAEQLPDLPVPFKNGTGAIDNNTVYVGLGSA--GTSWYKLDLNAPSKGWTKIAA-FPGGPREQAVAAFI-DGKLYVFGGI 93 (376)
T ss_pred ecccCCCCCcCccCCeEEEECCEEEEEeCCC--CCeEEEEECCCCCCCeEECCc-CCCCCcccceEEEE-CCEEEEEcCC
Confidence 3445555554433334566799999999974 356788987522478999987 87 47988877777 8999999997
Q ss_pred cC----------CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEe-eCCcEEEEEcC-----------------
Q 039705 170 RE----------FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLS-TDGNLFIFAND----------------- 220 (539)
Q Consensus 170 ~~----------~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~-~~G~Iyv~Gg~----------------- 220 (539)
.. ..+++| |.++ +|...+.+..+ ..+-++.++ .+++||++||.
T Consensus 94 ~~~~~~~~~~~~~~v~~YD~~~n--~W~~~~~~~p~------~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~~d~~~~~~ 165 (376)
T PRK14131 94 GKTNSEGSPQVFDDVYKYDPKTN--SWQKLDTRSPV------GLAGHVAVSLHNGKAYITGGVNKNIFDGYFEDLAAAGK 165 (376)
T ss_pred CCCCCCCceeEcccEEEEeCCCC--EEEeCCCCCCC------cccceEEEEeeCCEEEEECCCCHHHHHHHHhhhhhccc
Confidence 53 358899 8887 99766532111 112234444 79999999993
Q ss_pred ------------------------ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecC
Q 039705 221 ------------------------RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGG 276 (539)
Q Consensus 221 ------------------------~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG 276 (539)
++++|||.+|+|+ .+++||..+|. ++++.. .+++|||+||
T Consensus 166 ~~~~~~~i~~~~~~~~~~~~~~~~~v~~YD~~t~~W~-~~~~~p~~~~~----~~a~v~-----------~~~~iYv~GG 229 (376)
T PRK14131 166 DKTPKDKINDAYFDKKPEDYFFNKEVLSYDPSTNQWK-NAGESPFLGTA----GSAVVI-----------KGNKLWLING 229 (376)
T ss_pred chhhhhhhHHHHhcCChhhcCcCceEEEEECCCCeee-ECCcCCCCCCC----cceEEE-----------ECCEEEEEee
Confidence 3789999999998 78888852333 222221 3899999999
Q ss_pred CCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCcee-------ceeEEecCCcEEEEcCcCCCCC------
Q 039705 277 AKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRVM-------GEMLLLPTGDVLIINGAKKGTA------ 342 (539)
Q Consensus 277 ~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~-------~~~vvlpdG~I~vvGG~~~g~~------ 342 (539)
..... .....+..+++...+++|+.. +|+.+|.. +.++++.+|+|||+||......
T Consensus 230 ~~~~~----------~~~~~~~~~~~~~~~~~W~~~~~~p~~~~~~~~~~~~~~~a~~~~~~iyv~GG~~~~~~~~~~~~ 299 (376)
T PRK14131 230 EIKPG----------LRTDAVKQGKFTGNNLKWQKLPDLPPAPGGSSQEGVAGAFAGYSNGVLLVAGGANFPGARENYQN 299 (376)
T ss_pred eECCC----------cCChhheEEEecCCCcceeecCCCCCCCcCCcCCccceEeceeECCEEEEeeccCCCCChhhhhc
Confidence 64211 012223333332235899987 89887742 1223445999999999752110
Q ss_pred C--cc-cCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCCCCCccCCCCCCCcceeeEEe
Q 039705 343 G--WN-FATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPHSRYNLTSGSKYPTELRIEKF 419 (539)
Q Consensus 343 g--~~-~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~~~~~~~~~~~p~~~~vE~y 419 (539)
+ +. .....+.++|+|||+++ +|+.+++|+.+|.+|+++++ +|+|||+||...... ...+|++|
T Consensus 300 ~~~~~~~~~~~~~~~e~yd~~~~---~W~~~~~lp~~r~~~~av~~--~~~iyv~GG~~~~~~---------~~~~v~~~ 365 (376)
T PRK14131 300 GKLYAHEGLKKSWSDEIYALVNG---KWQKVGELPQGLAYGVSVSW--NNGVLLIGGETAGGK---------AVSDVTLL 365 (376)
T ss_pred CCcccccCCcceeehheEEecCC---cccccCcCCCCccceEEEEe--CCEEEEEcCCCCCCc---------EeeeEEEE
Confidence 0 00 00112236899999999 99999999999999987666 999999999754321 14588888
Q ss_pred cCC
Q 039705 420 YPP 422 (539)
Q Consensus 420 ~Pp 422 (539)
.|.
T Consensus 366 ~~~ 368 (376)
T PRK14131 366 SWD 368 (376)
T ss_pred EEc
Confidence 876
No 10
>PHA02790 Kelch-like protein; Provisional
Probab=99.97 E-value=1.4e-29 Score=274.37 Aligned_cols=218 Identities=15% Similarity=0.180 Sum_probs=171.3
Q ss_pred EECCCCCEEeCccCCCcccccceecCCCcEEEecCCCC--CCCeEEEEeCCCCccceeecccccccccccceeEEccCCc
Q 039705 85 YDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSS--RGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGS 162 (539)
Q Consensus 85 yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~--g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~ 162 (539)
|++.+++|..+. ..| .++..++.||++||..+ ..+++++|||. +++|..+++ |+.+|.+++++++ ||+
T Consensus 251 ~~~~~~~~~~~~----~~~--~~~~~~~~lyviGG~~~~~~~~~v~~Ydp~--~~~W~~~~~-m~~~r~~~~~v~~-~~~ 320 (480)
T PHA02790 251 YPMNMDQIIDIF----HMC--TSTHVGEVVYLIGGWMNNEIHNNAIAVNYI--SNNWIPIPP-MNSPRLYASGVPA-NNK 320 (480)
T ss_pred cCCcccceeecc----CCc--ceEEECCEEEEEcCCCCCCcCCeEEEEECC--CCEEEECCC-CCchhhcceEEEE-CCE
Confidence 456666776633 112 23447899999999753 35789999999 999999997 9999999988887 899
Q ss_pred EEEEcCccC-CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-----ceeEeeCCCCeEEEE
Q 039705 163 FIVVGGRRE-FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-----RSILLNPETNEILHV 235 (539)
Q Consensus 163 VyvvGG~~~-~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-----~~e~yDp~tn~W~~~ 235 (539)
||++||.+. .++|+| |.++ +|...+.|+..+ +-++.+..+|+||++||. .+|+|||++|+|+ .
T Consensus 321 iYviGG~~~~~sve~ydp~~n--~W~~~~~l~~~r-------~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~~W~-~ 390 (480)
T PHA02790 321 LYVVGGLPNPTSVERWFHGDA--AWVNMPSLLKPR-------CNPAVASINNVIYVIGGHSETDTTTEYLLPNHDQWQ-F 390 (480)
T ss_pred EEEECCcCCCCceEEEECCCC--eEEECCCCCCCC-------cccEEEEECCEEEEecCcCCCCccEEEEeCCCCEEE-e
Confidence 999999854 578999 9888 998766554432 125677889999999994 4789999999998 7
Q ss_pred cccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-cc
Q 039705 236 FPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MM 314 (539)
Q Consensus 236 ~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M 314 (539)
+++|+. +|.. +++++ ++++||++||. +|+|||. +++|+.. +|
T Consensus 391 ~~~m~~-~r~~--~~~~~-------------~~~~IYv~GG~-------------------~e~ydp~--~~~W~~~~~m 433 (480)
T PHA02790 391 GPSTYY-PHYK--SCALV-------------FGRRLFLVGRN-------------------AEFYCES--SNTWTLIDDP 433 (480)
T ss_pred CCCCCC-cccc--ceEEE-------------ECCEEEEECCc-------------------eEEecCC--CCcEeEcCCC
Confidence 999987 5653 23333 38999999983 4789986 6999997 99
Q ss_pred CCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecC
Q 039705 315 PSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELT 371 (539)
Q Consensus 315 ~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a 371 (539)
+.+|..++++++ +|+|||+||.+. . ..+.++|+|||+++ +|+.+.
T Consensus 434 ~~~r~~~~~~v~-~~~IYviGG~~~-~-------~~~~~ve~Yd~~~~---~W~~~~ 478 (480)
T PHA02790 434 IYPRDNPELIIV-DNKLLLIGGFYR-G-------SYIDTIEVYNNRTY---SWNIWD 478 (480)
T ss_pred CCCccccEEEEE-CCEEEEECCcCC-C-------cccceEEEEECCCC---eEEecC
Confidence 999999987766 999999999762 1 11347999999999 998763
No 11
>PLN02153 epithiospecifier protein
Probab=99.97 E-value=3.3e-28 Score=252.88 Aligned_cols=280 Identities=13% Similarity=0.093 Sum_probs=192.4
Q ss_pred CCCCEEeCcc----CCCcccccceecCCCcEEEecCCCCC----CCeEEEEeCCCCccceeecccccc-cccc---ccee
Q 039705 88 ESAAIRPLKI----LTDTWSSSGGLSANGTIVISGGWSSR----GRSVRYLSGCYHACYWKEHHWELS-AKRW---FSTQ 155 (539)
Q Consensus 88 ~t~~w~~l~~----~~~~~c~~~~~l~dG~l~v~GG~~~g----~~~v~~ydP~~~t~~W~~~~~~m~-~~R~---y~s~ 155 (539)
...+|+.+.. ++..++.++++..+++|||+||.... .+.+++||+. +++|+.+++ |. .+|. .+++
T Consensus 5 ~~~~W~~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~~~~yd~~--~~~W~~~~~-~~~~p~~~~~~~~~ 81 (341)
T PLN02153 5 LQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFN--THTWSIAPA-NGDVPRISCLGVRM 81 (341)
T ss_pred cCCeEEEecCCCCCCCCCCCcceEEEECCEEEEECCccCCCCceeCcEEEEECC--CCEEEEcCc-cCCCCCCccCceEE
Confidence 5567998875 45677777778889999999997421 3679999999 999999876 53 4442 4566
Q ss_pred EEccCCcEEEEcCccC----CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC----------
Q 039705 156 HILPDGSFIVVGGRRE----FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND---------- 220 (539)
Q Consensus 156 ~~L~dG~VyvvGG~~~----~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~---------- 220 (539)
+++ +++|||+||.+. ..+++| |+++ +|..++.+.... .+...+-|++++.+++||++||.
T Consensus 82 ~~~-~~~iyv~GG~~~~~~~~~v~~yd~~t~--~W~~~~~~~~~~--~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~ 156 (341)
T PLN02153 82 VAV-GTKLYIFGGRDEKREFSDFYSYDTVKN--EWTFLTKLDEEG--GPEARTFHSMASDENHVYVFGGVSKGGLMKTPE 156 (341)
T ss_pred EEE-CCEEEEECCCCCCCccCcEEEEECCCC--EEEEeccCCCCC--CCCCceeeEEEEECCEEEEECCccCCCccCCCc
Confidence 666 899999999753 368999 9887 997655432110 01122335677889999999994
Q ss_pred ---ceeEeeCCCCeEEEEcccCCCC--CCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCC
Q 039705 221 ---RSILLNPETNEILHVFPILRGG--SRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQ 295 (539)
Q Consensus 221 ---~~e~yDp~tn~W~~~~p~mp~~--~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~ 295 (539)
++++||+++++|+ .++++... +|.. .++++ ++++||++||.+. .+... +.....++
T Consensus 157 ~~~~v~~yd~~~~~W~-~l~~~~~~~~~r~~---~~~~~------------~~~~iyv~GG~~~-~~~~g--G~~~~~~~ 217 (341)
T PLN02153 157 RFRTIEAYNIADGKWV-QLPDPGENFEKRGG---AGFAV------------VQGKIWVVYGFAT-SILPG--GKSDYESN 217 (341)
T ss_pred ccceEEEEECCCCeEe-eCCCCCCCCCCCCc---ceEEE------------ECCeEEEEecccc-ccccC--CccceecC
Confidence 3688999999998 68776421 3332 12222 3899999999642 11000 00001256
Q ss_pred ceEEEEeeCCCCceeee----ccCCCceeceeEEecCCcEEEEcCcCCCC-CCcccCCCCCCccEEEcCCCCCCCceEec
Q 039705 296 DCGRIEITNKSATWQRE----MMPSPRVMGEMLLLPTGDVLIINGAKKGT-AGWNFATDPNTTPVLYEPDDPINERFSEL 370 (539)
Q Consensus 296 s~~~~d~~~~~~~W~~~----~M~~~R~~~~~vvlpdG~I~vvGG~~~g~-~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~ 370 (539)
.+++||+. +++|+.. .||.+|..++++++ +++|||+||..... .+.........++++|||+++ +|+.+
T Consensus 218 ~v~~yd~~--~~~W~~~~~~g~~P~~r~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~~---~W~~~ 291 (341)
T PLN02153 218 AVQFFDPA--SGKWTEVETTGAKPSARSVFAHAVV-GKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTETL---VWEKL 291 (341)
T ss_pred ceEEEEcC--CCcEEeccccCCCCCCcceeeeEEE-CCEEEEECcccCCccccccccccccccEEEEEcCcc---EEEec
Confidence 79999987 6999985 37889998887655 99999999974110 000000111236899999999 99988
Q ss_pred C-----CCCCCccceeeEeeCCCCeEEEecCCCCC
Q 039705 371 T-----PTSKPRMCHSTSVVLPDGKILVAGSNPHS 400 (539)
Q Consensus 371 a-----~~~~~R~yhs~a~llpdG~V~v~GG~~~~ 400 (539)
. +++..|..|+++++.-+++||+.||....
T Consensus 292 ~~~~~~~~pr~~~~~~~~~v~~~~~~~~~gG~~~~ 326 (341)
T PLN02153 292 GECGEPAMPRGWTAYTTATVYGKNGLLMHGGKLPT 326 (341)
T ss_pred cCCCCCCCCCccccccccccCCcceEEEEcCcCCC
Confidence 5 56666766666666556799999997554
No 12
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid and, through this ability, to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.96 E-value=1.1e-27 Score=247.02 Aligned_cols=248 Identities=17% Similarity=0.177 Sum_probs=175.7
Q ss_pred ceecCCCcEEEecCCCCC------------CCeEEEEe-CCCCccceeecccccccccccceeEEccCCcEEEEcCccC-
Q 039705 106 GGLSANGTIVISGGWSSR------------GRSVRYLS-GCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRRE- 171 (539)
Q Consensus 106 ~~~l~dG~l~v~GG~~~g------------~~~v~~yd-P~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~- 171 (539)
.+.+.++.||++||.+.. .+++.+|+ +. .+.+|.++++ |+.+|.+++++++ +++||++||.+.
T Consensus 8 ~~~~~~~~l~v~GG~~~~~~~~~~~g~~~~~~~v~~~~~~~-~~~~W~~~~~-lp~~r~~~~~~~~-~~~lyviGG~~~~ 84 (323)
T TIGR03548 8 YAGIIGDYILVAGGCNFPEDPLAEGGKKKNYKGIYIAKDEN-SNLKWVKDGQ-LPYEAAYGASVSV-ENGIYYIGGSNSS 84 (323)
T ss_pred eeeEECCEEEEeeccCCCCCchhhCCcEEeeeeeEEEecCC-CceeEEEccc-CCccccceEEEEE-CCEEEEEcCCCCC
Confidence 345679999999997521 12455554 43 1337999987 9999988888877 899999999864
Q ss_pred ---CeEEEE-ecCCCcce----eeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-------ceeEeeCCCCeEEEEc
Q 039705 172 ---FSYEYI-LKEGKRII----YDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-------RSILLNPETNEILHVF 236 (539)
Q Consensus 172 ---~~~E~y-P~~~~~~w----~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-------~~e~yDp~tn~W~~~~ 236 (539)
.++++| +.++ +| ...+.++..+ .-++.++.+++||++||. ++++||+++++|+ .+
T Consensus 85 ~~~~~v~~~d~~~~--~w~~~~~~~~~lp~~~-------~~~~~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~~W~-~~ 154 (323)
T TIGR03548 85 ERFSSVYRITLDES--KEELICETIGNLPFTF-------ENGSACYKDGTLYVGGGNRNGKPSNKSYLFNLETQEWF-EL 154 (323)
T ss_pred CCceeEEEEEEcCC--ceeeeeeEcCCCCcCc-------cCceEEEECCEEEEEeCcCCCccCceEEEEcCCCCCee-EC
Confidence 467888 7766 65 4444443322 225677789999999994 5899999999998 79
Q ss_pred ccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccC
Q 039705 237 PILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMP 315 (539)
Q Consensus 237 p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~ 315 (539)
++||..+|.. .+++. ++++|||+||.+.. ...++++||+. +++|+.. +|+
T Consensus 155 ~~~p~~~r~~---~~~~~------------~~~~iYv~GG~~~~------------~~~~~~~yd~~--~~~W~~~~~~~ 205 (323)
T TIGR03548 155 PDFPGEPRVQ---PVCVK------------LQNELYVFGGGSNI------------AYTDGYKYSPK--KNQWQKVADPT 205 (323)
T ss_pred CCCCCCCCCc---ceEEE------------ECCEEEEEcCCCCc------------cccceEEEecC--CCeeEECCCCC
Confidence 9888534542 22322 38999999997521 12457899997 5999987 763
Q ss_pred ---CCc--eeceeEEecCCcEEEEcCcCCCCC-----Ccc-------------------cCCCCCCccEEEcCCCCCCCc
Q 039705 316 ---SPR--VMGEMLLLPTGDVLIINGAKKGTA-----GWN-------------------FATDPNTTPVLYEPDDPINER 366 (539)
Q Consensus 316 ---~~R--~~~~~vvlpdG~I~vvGG~~~g~~-----g~~-------------------~~~~~~~~~e~YdP~t~~g~~ 366 (539)
.+| ..+.++++.+++|||+||.+.... .+. ....-..++|+|||+++ +
T Consensus 206 ~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~yd~~~~---~ 282 (323)
T TIGR03548 206 TDSEPISLLGAASIKINESLLLCIGGFNKDVYNDAVIDLATMKDESLKGYKKEYFLKPPEWYNWNRKILIYNVRTG---K 282 (323)
T ss_pred CCCCceeccceeEEEECCCEEEEECCcCHHHHHHHHhhhhhccchhhhhhHHHHhCCCccccCcCceEEEEECCCC---e
Confidence 343 334445566899999999862100 000 00001236999999999 9
Q ss_pred eEecCCCC-CCccceeeEeeCCCCeEEEecCCCCC
Q 039705 367 FSELTPTS-KPRMCHSTSVVLPDGKILVAGSNPHS 400 (539)
Q Consensus 367 Wt~~a~~~-~~R~yhs~a~llpdG~V~v~GG~~~~ 400 (539)
|+.+++++ .+|..|+++++ |++||++||....
T Consensus 283 W~~~~~~p~~~r~~~~~~~~--~~~iyv~GG~~~p 315 (323)
T TIGR03548 283 WKSIGNSPFFARCGAALLLT--GNNIFSINGELKP 315 (323)
T ss_pred eeEcccccccccCchheEEE--CCEEEEEeccccC
Confidence 99999887 68999888777 9999999997554
No 13
>PHA02790 Kelch-like protein; Provisional
Probab=99.96 E-value=1.1e-27 Score=259.51 Aligned_cols=194 Identities=14% Similarity=0.171 Sum_probs=156.1
Q ss_pred EEccCCcEEEEcCccC----CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC----ceeEee
Q 039705 156 HILPDGSFIVVGGRRE----FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND----RSILLN 226 (539)
Q Consensus 156 ~~L~dG~VyvvGG~~~----~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~----~~e~yD 226 (539)
+.+ ++.||++||.+. .++++| |.++ +|...+.|...+. + +..+..+|+||++||. ++|+||
T Consensus 268 ~~~-~~~lyviGG~~~~~~~~~v~~Ydp~~~--~W~~~~~m~~~r~------~-~~~v~~~~~iYviGG~~~~~sve~yd 337 (480)
T PHA02790 268 THV-GEVVYLIGGWMNNEIHNNAIAVNYISN--NWIPIPPMNSPRL------Y-ASGVPANNKLYVVGGLPNPTSVERWF 337 (480)
T ss_pred EEE-CCEEEEEcCCCCCCcCCeEEEEECCCC--EEEECCCCCchhh------c-ceEEEECCEEEEECCcCCCCceEEEE
Confidence 335 899999999754 468999 9988 9987665544321 1 4567789999999994 589999
Q ss_pred CCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCC
Q 039705 227 PETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKS 306 (539)
Q Consensus 227 p~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~ 306 (539)
|.+|+|+ .+|+||. +|.. .++++ ++++||++||.+. ..+++|+|||. +
T Consensus 338 p~~n~W~-~~~~l~~-~r~~--~~~~~-------------~~g~IYviGG~~~-------------~~~~ve~ydp~--~ 385 (480)
T PHA02790 338 HGDAAWV-NMPSLLK-PRCN--PAVAS-------------INNVIYVIGGHSE-------------TDTTTEYLLPN--H 385 (480)
T ss_pred CCCCeEE-ECCCCCC-CCcc--cEEEE-------------ECCEEEEecCcCC-------------CCccEEEEeCC--C
Confidence 9999998 8999997 5653 23333 3899999999751 13578999997 6
Q ss_pred Cceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEee
Q 039705 307 ATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVV 385 (539)
Q Consensus 307 ~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~l 385 (539)
++|+.. +|+.+|..++++++ +|+|||+||. +|+|||++| +|+.+++|+.+|..|+++++
T Consensus 386 ~~W~~~~~m~~~r~~~~~~~~-~~~IYv~GG~----------------~e~ydp~~~---~W~~~~~m~~~r~~~~~~v~ 445 (480)
T PHA02790 386 DQWQFGPSTYYPHYKSCALVF-GRRLFLVGRN----------------AEFYCESSN---TWTLIDDPIYPRDNPELIIV 445 (480)
T ss_pred CEEEeCCCCCCccccceEEEE-CCEEEEECCc----------------eEEecCCCC---cEeEcCCCCCCccccEEEEE
Confidence 999998 99999999887765 9999999973 489999999 99999999999999999888
Q ss_pred CCCCeEEEecCCCCCCCccCCCCCCCcceeeEEecCCC
Q 039705 386 LPDGKILVAGSNPHSRYNLTSGSKYPTELRIEKFYPPY 423 (539)
Q Consensus 386 lpdG~V~v~GG~~~~~~~~~~~~~~p~~~~vE~y~Ppy 423 (539)
+|+|||+||..... + ...+|+|+|..
T Consensus 446 --~~~IYviGG~~~~~--------~--~~~ve~Yd~~~ 471 (480)
T PHA02790 446 --DNKLLLIGGFYRGS--------Y--IDTIEVYNNRT 471 (480)
T ss_pred --CCEEEEECCcCCCc--------c--cceEEEEECCC
Confidence 99999999975321 1 35799999974
No 14
>PHA03098 kelch-like protein; Provisional
Probab=99.96 E-value=3.2e-27 Score=259.89 Aligned_cols=244 Identities=13% Similarity=0.133 Sum_probs=188.4
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCC---CCeEEEEeCCCCccceeecccccccccccceeEE
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSR---GRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHI 157 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g---~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~ 157 (539)
....|++.+++|..+...+...| .+++..+++||++||.... .+.+.+||+. +++|..+++ |+.+|.++++++
T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~lyv~GG~~~~~~~~~~v~~yd~~--~~~W~~~~~-~~~~R~~~~~~~ 340 (534)
T PHA03098 265 NYITNYSPLSEINTIIDIHYVYC-FGSVVLNNVIYFIGGMNKNNLSVNSVVSYDTK--TKSWNKVPE-LIYPRKNPGVTV 340 (534)
T ss_pred eeeecchhhhhcccccCcccccc-ceEEEECCEEEEECCCcCCCCeeccEEEEeCC--CCeeeECCC-CCcccccceEEE
Confidence 34578888999999876655455 3567789999999997532 3578999999 999999997 999999998888
Q ss_pred ccCCcEEEEcCccC----CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc--------CceeE
Q 039705 158 LPDGSFIVVGGRRE----FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN--------DRSIL 224 (539)
Q Consensus 158 L~dG~VyvvGG~~~----~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--------~~~e~ 224 (539)
+ +|+||++||.+. .++|+| |.++ +|...+.++.. .+.++.+..+|+||++|| +++++
T Consensus 341 ~-~~~lyv~GG~~~~~~~~~v~~yd~~~~--~W~~~~~lp~~-------r~~~~~~~~~~~iYv~GG~~~~~~~~~~v~~ 410 (534)
T PHA03098 341 F-NNRIYVIGGIYNSISLNTVESWKPGES--KWREEPPLIFP-------RYNPCVVNVNNLIYVIGGISKNDELLKTVEC 410 (534)
T ss_pred E-CCEEEEEeCCCCCEecceEEEEcCCCC--ceeeCCCcCcC-------CccceEEEECCEEEEECCcCCCCcccceEEE
Confidence 8 899999999863 468999 9988 99765544332 233567788999999999 34799
Q ss_pred eeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeC
Q 039705 225 LNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITN 304 (539)
Q Consensus 225 yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~ 304 (539)
|||.+++|+ .+++||. +|.. ++++. .+++||++||.+.... ....+.+++||+.
T Consensus 411 yd~~t~~W~-~~~~~p~-~r~~---~~~~~------------~~~~iyv~GG~~~~~~--------~~~~~~v~~yd~~- 464 (534)
T PHA03098 411 FSLNTNKWS-KGSPLPI-SHYG---GCAIY------------HDGKIYVIGGISYIDN--------IKVYNIVESYNPV- 464 (534)
T ss_pred EeCCCCeee-ecCCCCc-cccC---ceEEE------------ECCEEEEECCccCCCC--------CcccceEEEecCC-
Confidence 999999998 7899987 4542 22332 3899999999752110 0124568999987
Q ss_pred CCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCc
Q 039705 305 KSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPR 377 (539)
Q Consensus 305 ~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R 377 (539)
+++|+.. +|+.+|..++++++ +|+|||+||.+. .. ...++|+|||+++ +|+.+..++...
T Consensus 465 -~~~W~~~~~~~~~r~~~~~~~~-~~~iyv~GG~~~-~~-------~~~~v~~yd~~~~---~W~~~~~~p~~~ 525 (534)
T PHA03098 465 -TNKWTELSSLNFPRINASLCIF-NNKIYVVGGDKY-EY-------YINEIEVYDDKTN---TWTLFCKFPKVI 525 (534)
T ss_pred -CCceeeCCCCCcccccceEEEE-CCEEEEEcCCcC-Cc-------ccceeEEEeCCCC---EEEecCCCcccc
Confidence 6999997 89999998887766 999999999862 11 1347999999999 999988765433
No 15
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=99.96 E-value=7.8e-27 Score=243.01 Aligned_cols=244 Identities=12% Similarity=0.090 Sum_probs=172.7
Q ss_pred CCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEEC--CCCCEEeCccCC-CcccccceecCCCcEEEe
Q 039705 41 NTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDA--ESAAIRPLKILT-DTWSSSGGLSANGTIVIS 117 (539)
Q Consensus 41 ~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp--~t~~w~~l~~~~-~~~c~~~~~l~dG~l~v~ 117 (539)
.+++||++||... ....+||+ .+++|+.++.|+ ..++..+++..+++||++
T Consensus 16 ~~~~vyv~GG~~~--------------------------~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~~~~iYv~ 69 (346)
T TIGR03547 16 IGDKVYVGLGSAG--------------------------TSWYKLDLKKPSKGWQKIADFPGGPRNQAVAAAIDGKLYVF 69 (346)
T ss_pred ECCEEEEEccccC--------------------------CeeEEEECCCCCCCceECCCCCCCCcccceEEEECCEEEEE
Confidence 4899999998531 12456775 678999999987 578888888899999999
Q ss_pred cCCCC--------CCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccC------------------
Q 039705 118 GGWSS--------RGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRRE------------------ 171 (539)
Q Consensus 118 GG~~~--------g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~------------------ 171 (539)
||... ..+++++|||. +++|++++..|+..|..++++++.+|+|||+||.+.
T Consensus 70 GG~~~~~~~~~~~~~~~v~~Yd~~--~~~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~ 147 (346)
T TIGR03547 70 GGIGKANSEGSPQVFDDVYRYDPK--KNSWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSE 147 (346)
T ss_pred eCCCCCCCCCcceecccEEEEECC--CCEEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccch
Confidence 99742 14689999999 999999873256667666666344999999999752
Q ss_pred --------------------CeEEEE-ecCCCcceeeccCccc-cCCCCCCCCcceEEEeeCCcEEEEEcCc--------
Q 039705 172 --------------------FSYEYI-LKEGKRIIYDLPILNE-TTNPSENNLYPFVFLSTDGNLFIFANDR-------- 221 (539)
Q Consensus 172 --------------------~~~E~y-P~~~~~~w~~~~~l~~-~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~-------- 221 (539)
.++|+| |.++ +|...+.|.. .+ +-++++..+++||++||..
T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~v~~YDp~t~--~W~~~~~~p~~~r-------~~~~~~~~~~~iyv~GG~~~~~~~~~~ 218 (346)
T TIGR03547 148 PKDKLIAAYFSQPPEDYFWNKNVLSYDPSTN--QWRNLGENPFLGT-------AGSAIVHKGNKLLLINGEIKPGLRTAE 218 (346)
T ss_pred hhhhhHHHHhCCChhHcCccceEEEEECCCC--ceeECccCCCCcC-------CCceEEEECCEEEEEeeeeCCCccchh
Confidence 468999 9988 9986654432 21 2245677899999999942
Q ss_pred eeEee--CCCCeEEEEcccCCCCCCcc-CC--Ccc-EEecccccCCCCCCCcccEEEEecCCCCCccccc--CCCcc---
Q 039705 222 SILLN--PETNEILHVFPILRGGSRNY-PA--SAT-SALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLA--GKGEF--- 290 (539)
Q Consensus 222 ~e~yD--p~tn~W~~~~p~mp~~~r~y-p~--~g~-av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~--~~~~~--- 290 (539)
.++|| +++++|+ .+++||. +|.. +. +++ ++. ++++|||+||.+....... ....+
T Consensus 219 ~~~y~~~~~~~~W~-~~~~m~~-~r~~~~~~~~~~~a~~------------~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~ 284 (346)
T TIGR03547 219 VKQYLFTGGKLEWN-KLPPLPP-PKSSSQEGLAGAFAGI------------SNGVLLVAGGANFPGAQENYKNGKLYAHE 284 (346)
T ss_pred eEEEEecCCCceee-ecCCCCC-CCCCccccccEEeeeE------------ECCEEEEeecCCCCCchhhhhcCCccccC
Confidence 34454 5788998 7999987 4431 11 122 222 3899999999752100000 00000
Q ss_pred -cccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcC
Q 039705 291 -MNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAK 338 (539)
Q Consensus 291 -~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~ 338 (539)
...+.++|+||+. +++|+.. +||.+|..+.++++ +|+|||+||.+
T Consensus 285 ~~~~~~~~e~yd~~--~~~W~~~~~lp~~~~~~~~~~~-~~~iyv~GG~~ 331 (346)
T TIGR03547 285 GLIKAWSSEVYALD--NGKWSKVGKLPQGLAYGVSVSW-NNGVLLIGGEN 331 (346)
T ss_pred CCCceeEeeEEEec--CCcccccCCCCCCceeeEEEEc-CCEEEEEeccC
Confidence 0113468999997 5899997 99999988775544 99999999986
No 16
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.95 E-value=3.3e-27 Score=248.59 Aligned_cols=291 Identities=11% Similarity=0.070 Sum_probs=195.3
Q ss_pred ccccCCCceEEccCC-CcccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECC--CC
Q 039705 14 TLYEFKGKWELASEN-SGISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAE--SA 90 (539)
Q Consensus 14 ~~~~~~g~W~~~~~~-~~~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~--t~ 90 (539)
++....=.++.+++. .++... .++. .+++||+++|... ....+||+. ++
T Consensus 11 ~~~~~~~~~~~l~~lP~~~~~~-~~~~-~~~~iyv~gG~~~--------------------------~~~~~~d~~~~~~ 62 (376)
T PRK14131 11 AASSFAANAEQLPDLPVPFKNG-TGAI-DNNTVYVGLGSAG--------------------------TSWYKLDLNAPSK 62 (376)
T ss_pred HhhhcceecccCCCCCcCccCC-eEEE-ECCEEEEEeCCCC--------------------------CeEEEEECCCCCC
Confidence 333344456666532 233333 3444 5899999988520 023467775 57
Q ss_pred CEEeCccCC-CcccccceecCCCcEEEecCCCC----C----CCeEEEEeCCCCccceeecccccccccccceeEEccCC
Q 039705 91 AIRPLKILT-DTWSSSGGLSANGTIVISGGWSS----R----GRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDG 161 (539)
Q Consensus 91 ~w~~l~~~~-~~~c~~~~~l~dG~l~v~GG~~~----g----~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG 161 (539)
+|+.++.++ ..++..+++..+++||++||... + .+++++|||. +++|+.++..++..|..++++++.|+
T Consensus 63 ~W~~l~~~p~~~r~~~~~v~~~~~IYV~GG~~~~~~~~~~~~~~~v~~YD~~--~n~W~~~~~~~p~~~~~~~~~~~~~~ 140 (376)
T PRK14131 63 GWTKIAAFPGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPK--TNSWQKLDTRSPVGLAGHVAVSLHNG 140 (376)
T ss_pred CeEECCcCCCCCcccceEEEECCEEEEEcCCCCCCCCCceeEcccEEEEeCC--CCEEEeCCCCCCCcccceEEEEeeCC
Confidence 999999876 46777777888999999999753 1 4679999999 99999987524566666776664599
Q ss_pred cEEEEcCccC--------------------------------------CeEEEE-ecCCCcceeeccCccccCCCCCCCC
Q 039705 162 SFIVVGGRRE--------------------------------------FSYEYI-LKEGKRIIYDLPILNETTNPSENNL 202 (539)
Q Consensus 162 ~VyvvGG~~~--------------------------------------~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~ 202 (539)
+|||+||.+. ..+++| |.++ +|...+.++... .
T Consensus 141 ~IYv~GG~~~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~YD~~t~--~W~~~~~~p~~~------~ 212 (376)
T PRK14131 141 KAYITGGVNKNIFDGYFEDLAAAGKDKTPKDKINDAYFDKKPEDYFFNKEVLSYDPSTN--QWKNAGESPFLG------T 212 (376)
T ss_pred EEEEECCCCHHHHHHHHhhhhhcccchhhhhhhHHHHhcCChhhcCcCceEEEEECCCC--eeeECCcCCCCC------C
Confidence 9999999742 358999 9988 997655443211 1
Q ss_pred cceEEEeeCCcEEEEEcC------cee----EeeCCCCeEEEEcccCCCCCCcc--CC--Ccc-EEecccccCCCCCCCc
Q 039705 203 YPFVFLSTDGNLFIFAND------RSI----LLNPETNEILHVFPILRGGSRNY--PA--SAT-SALLPIKLQDPNSNAI 267 (539)
Q Consensus 203 yp~~~~~~~G~Iyv~Gg~------~~e----~yDp~tn~W~~~~p~mp~~~r~y--p~--~g~-av~lpl~~~~~~~~~~ 267 (539)
..++++..+++||++||. ..+ .||+++++|. .+++||. +|.. +. +++ +++ .
T Consensus 213 ~~~a~v~~~~~iYv~GG~~~~~~~~~~~~~~~~~~~~~~W~-~~~~~p~-~~~~~~~~~~~~~~a~~------------~ 278 (376)
T PRK14131 213 AGSAVVIKGNKLWLINGEIKPGLRTDAVKQGKFTGNNLKWQ-KLPDLPP-APGGSSQEGVAGAFAGY------------S 278 (376)
T ss_pred CcceEEEECCEEEEEeeeECCCcCChhheEEEecCCCccee-ecCCCCC-CCcCCcCCccceEecee------------E
Confidence 224577789999999983 222 4588999998 7999987 4431 11 121 222 3
Q ss_pred ccEEEEecCCCCCcccc-cCCCc-----ccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCC
Q 039705 268 RAEVLICGGAKPEAGVL-AGKGE-----FMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKG 340 (539)
Q Consensus 268 ~g~Iyv~GG~~~~~~~~-~~~~~-----~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g 340 (539)
+++|||+||.+...... ...+. ......++|+||+. +++|+.. +||.+|.++.++++ +|+|||+||...+
T Consensus 279 ~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~--~~~W~~~~~lp~~r~~~~av~~-~~~iyv~GG~~~~ 355 (376)
T PRK14131 279 NGVLLVAGGANFPGARENYQNGKLYAHEGLKKSWSDEIYALV--NGKWQKVGELPQGLAYGVSVSW-NNGVLLIGGETAG 355 (376)
T ss_pred CCEEEEeeccCCCCChhhhhcCCcccccCCcceeehheEEec--CCcccccCcCCCCccceEEEEe-CCEEEEEcCCCCC
Confidence 88999999975210000 00000 00112357899997 5899987 99999998876655 9999999997521
Q ss_pred CCCcccCCCCCCccEEEcCCCCCCCceEe
Q 039705 341 TAGWNFATDPNTTPVLYEPDDPINERFSE 369 (539)
Q Consensus 341 ~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~ 369 (539)
. ..+.++++|+++.+ .++.
T Consensus 356 ~-------~~~~~v~~~~~~~~---~~~~ 374 (376)
T PRK14131 356 G-------KAVSDVTLLSWDGK---KLTV 374 (376)
T ss_pred C-------cEeeeEEEEEEcCC---EEEE
Confidence 1 23457999999987 7754
No 17
>PLN02153 epithiospecifier protein
Probab=99.95 E-value=3.8e-26 Score=237.47 Aligned_cols=284 Identities=13% Similarity=0.116 Sum_probs=187.5
Q ss_pred CCCceEEccCC---Cc-ccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEE
Q 039705 18 FKGKWELASEN---SG-ISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIR 93 (539)
Q Consensus 18 ~~g~W~~~~~~---~~-~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~ 93 (539)
..++|+.+... .| -+.-|.++. .+++||++||...... . ......+||+.+++|+
T Consensus 5 ~~~~W~~~~~~~~~~P~pR~~h~~~~-~~~~iyv~GG~~~~~~------~--------------~~~~~~~yd~~~~~W~ 63 (341)
T PLN02153 5 LQGGWIKVEQKGGKGPGPRCSHGIAV-VGDKLYSFGGELKPNE------H--------------IDKDLYVFDFNTHTWS 63 (341)
T ss_pred cCCeEEEecCCCCCCCCCCCcceEEE-ECCEEEEECCccCCCC------c--------------eeCcEEEEECCCCEEE
Confidence 45789998641 22 223455555 6899999999642100 0 0124778999999999
Q ss_pred eCccCCC-c--cc-ccceecCCCcEEEecCCCCC--CCeEEEEeCCCCccceeeccccc-----ccccccceeEEccCCc
Q 039705 94 PLKILTD-T--WS-SSGGLSANGTIVISGGWSSR--GRSVRYLSGCYHACYWKEHHWEL-----SAKRWFSTQHILPDGS 162 (539)
Q Consensus 94 ~l~~~~~-~--~c-~~~~~l~dG~l~v~GG~~~g--~~~v~~ydP~~~t~~W~~~~~~m-----~~~R~y~s~~~L~dG~ 162 (539)
.++++.. + .| ..+++..+++||++||.... .+.+++|||. +++|+++++ | +.+|..++++++ +++
T Consensus 64 ~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~--t~~W~~~~~-~~~~~~p~~R~~~~~~~~-~~~ 139 (341)
T PLN02153 64 IAPANGDVPRISCLGVRMVAVGTKLYIFGGRDEKREFSDFYSYDTV--KNEWTFLTK-LDEEGGPEARTFHSMASD-ENH 139 (341)
T ss_pred EcCccCCCCCCccCceEEEEECCEEEEECCCCCCCccCcEEEEECC--CCEEEEecc-CCCCCCCCCceeeEEEEE-CCE
Confidence 9876532 2 23 34466779999999997532 4689999999 999999876 7 678988888777 899
Q ss_pred EEEEcCccC----------CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc------------
Q 039705 163 FIVVGGRRE----------FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN------------ 219 (539)
Q Consensus 163 VyvvGG~~~----------~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg------------ 219 (539)
|||+||.+. .++++| |+++ +|..++.+... +....-+.+++.+|+||++||
T Consensus 140 iyv~GG~~~~~~~~~~~~~~~v~~yd~~~~--~W~~l~~~~~~----~~~r~~~~~~~~~~~iyv~GG~~~~~~~gG~~~ 213 (341)
T PLN02153 140 VYVFGGVSKGGLMKTPERFRTIEAYNIADG--KWVQLPDPGEN----FEKRGGAGFAVVQGKIWVVYGFATSILPGGKSD 213 (341)
T ss_pred EEEECCccCCCccCCCcccceEEEEECCCC--eEeeCCCCCCC----CCCCCcceEEEECCeEEEEeccccccccCCccc
Confidence 999999752 257899 9988 99866543210 001122456778999999986
Q ss_pred ---CceeEeeCCCCeEEEEccc---CCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCccccc
Q 039705 220 ---DRSILLNPETNEILHVFPI---LRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNA 293 (539)
Q Consensus 220 ---~~~e~yDp~tn~W~~~~p~---mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a 293 (539)
+++++||+.+++|+ .++. +|. +|.. .++++ ++++|||+||....... .. ......
T Consensus 214 ~~~~~v~~yd~~~~~W~-~~~~~g~~P~-~r~~---~~~~~------------~~~~iyv~GG~~~~~~~-~~-~~~~~~ 274 (341)
T PLN02153 214 YESNAVQFFDPASGKWT-EVETTGAKPS-ARSV---FAHAV------------VGKYIIIFGGEVWPDLK-GH-LGPGTL 274 (341)
T ss_pred eecCceEEEEcCCCcEE-eccccCCCCC-Ccce---eeeEE------------ECCEEEEECcccCCccc-cc-cccccc
Confidence 24789999999998 5654 454 4432 22332 38999999996310000 00 000012
Q ss_pred CCceEEEEeeCCCCceeee------ccCCCceeceeEEe-cCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCC
Q 039705 294 LQDCGRIEITNKSATWQRE------MMPSPRVMGEMLLL-PTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPD 360 (539)
Q Consensus 294 ~~s~~~~d~~~~~~~W~~~------~M~~~R~~~~~vvl-pdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~ 360 (539)
++.+++||+. +++|+.. +||.+|..++++.+ -+++||++||..... +.+.++.+|+..
T Consensus 275 ~n~v~~~d~~--~~~W~~~~~~~~~~~pr~~~~~~~~~v~~~~~~~~~gG~~~~~-------~~~~~~~~~~~~ 339 (341)
T PLN02153 275 SNEGYALDTE--TLVWEKLGECGEPAMPRGWTAYTTATVYGKNGLLMHGGKLPTN-------ERTDDLYFYAVN 339 (341)
T ss_pred cccEEEEEcC--ccEEEeccCCCCCCCCCccccccccccCCcceEEEEcCcCCCC-------ccccceEEEecc
Confidence 4578999986 6999963 45665653333333 245899999986321 234467788754
No 18
>PLN02193 nitrile-specifier protein
Probab=99.95 E-value=9.9e-26 Score=243.70 Aligned_cols=274 Identities=11% Similarity=0.078 Sum_probs=191.2
Q ss_pred EEEEECCC----CCEEeCccC---CCcccccceecCCCcEEEecCCCCC----CCeEEEEeCCCCccceeeccc--cccc
Q 039705 82 AVEYDAES----AAIRPLKIL---TDTWSSSGGLSANGTIVISGGWSSR----GRSVRYLSGCYHACYWKEHHW--ELSA 148 (539)
Q Consensus 82 ~~~yDp~t----~~w~~l~~~---~~~~c~~~~~l~dG~l~v~GG~~~g----~~~v~~ydP~~~t~~W~~~~~--~m~~ 148 (539)
+..+||.+ ++|..+..+ +..|+.++++..+++||++||.... .+.+++||+. +++|+.++. .++.
T Consensus 139 ~y~~~~~~~~~~~~W~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~v~~yD~~--~~~W~~~~~~g~~P~ 216 (470)
T PLN02193 139 AYISLPSTPKLLGKWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYVFDLE--TRTWSISPATGDVPH 216 (470)
T ss_pred EEEecCCChhhhceEEEcccCCCCCCCccccEEEEECCEEEEECCcCCCCCCeeCcEEEEECC--CCEEEeCCCCCCCCC
Confidence 34457755 899988763 5678888888889999999997421 2568999999 999998764 1222
Q ss_pred -ccccceeEEccCCcEEEEcCccC----CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC--
Q 039705 149 -KRWFSTQHILPDGSFIVVGGRRE----FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-- 220 (539)
Q Consensus 149 -~R~y~s~~~L~dG~VyvvGG~~~----~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-- 220 (539)
+|..++++++ +++|||+||.+. ..+++| |.++ +|..++.+... +...+-|++++.+++||++||.
T Consensus 217 ~~~~~~~~v~~-~~~lYvfGG~~~~~~~ndv~~yD~~t~--~W~~l~~~~~~----P~~R~~h~~~~~~~~iYv~GG~~~ 289 (470)
T PLN02193 217 LSCLGVRMVSI-GSTLYVFGGRDASRQYNGFYSFDTTTN--EWKLLTPVEEG----PTPRSFHSMAADEENVYVFGGVSA 289 (470)
T ss_pred CcccceEEEEE-CCEEEEECCCCCCCCCccEEEEECCCC--EEEEcCcCCCC----CCCccceEEEEECCEEEEECCCCC
Confidence 2445666666 899999999864 468899 8887 99765443110 1123346677789999999994
Q ss_pred -----ceeEeeCCCCeEEEEccc---CCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccc
Q 039705 221 -----RSILLNPETNEILHVFPI---LRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMN 292 (539)
Q Consensus 221 -----~~e~yDp~tn~W~~~~p~---mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~ 292 (539)
+.++||+.+++|+ .+++ ++. .|.. ..+++ ++++||++||.+. .
T Consensus 290 ~~~~~~~~~yd~~t~~W~-~~~~~~~~~~-~R~~---~~~~~------------~~gkiyviGG~~g-~----------- 340 (470)
T PLN02193 290 TARLKTLDSYNIVDKKWF-HCSTPGDSFS-IRGG---AGLEV------------VQGKVWVVYGFNG-C----------- 340 (470)
T ss_pred CCCcceEEEEECCCCEEE-eCCCCCCCCC-CCCC---cEEEE------------ECCcEEEEECCCC-C-----------
Confidence 3789999999998 5654 232 3432 22232 3889999999752 1
Q ss_pred cCCceEEEEeeCCCCceeee-cc---CCCceeceeEEecCCcEEEEcCcCCCC-CCcccCCCCCCccEEEcCCCCCCCce
Q 039705 293 ALQDCGRIEITNKSATWQRE-MM---PSPRVMGEMLLLPTGDVLIINGAKKGT-AGWNFATDPNTTPVLYEPDDPINERF 367 (539)
Q Consensus 293 a~~s~~~~d~~~~~~~W~~~-~M---~~~R~~~~~vvlpdG~I~vvGG~~~g~-~g~~~~~~~~~~~e~YdP~t~~g~~W 367 (539)
.++++++||+. +++|+.. +| |.+|..++++++ +++|||+||..... ...........++++|||+++ +|
T Consensus 341 ~~~dv~~yD~~--t~~W~~~~~~g~~P~~R~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~~~ndv~~~D~~t~---~W 414 (470)
T PLN02193 341 EVDDVHYYDPV--QDKWTQVETFGVRPSERSVFASAAV-GKHIVIFGGEIAMDPLAHVGPGQLTDGTFALDTETL---QW 414 (470)
T ss_pred ccCceEEEECC--CCEEEEeccCCCCCCCcceeEEEEE-CCEEEEECCccCCccccccCccceeccEEEEEcCcC---EE
Confidence 25679999987 6999986 54 889999887765 99999999975211 000000011236899999999 99
Q ss_pred EecCCC------CCCccceeeEe--eCCCCeEEEecCCCC
Q 039705 368 SELTPT------SKPRMCHSTSV--VLPDGKILVAGSNPH 399 (539)
Q Consensus 368 t~~a~~------~~~R~yhs~a~--llpdG~V~v~GG~~~ 399 (539)
+.+..+ +.+|..|+.+. +..+.++++.||...
T Consensus 415 ~~~~~~~~~~~~P~~R~~~~~~~~~~~~~~~~~~fGG~~~ 454 (470)
T PLN02193 415 ERLDKFGEEEETPSSRGWTASTTGTIDGKKGLVMHGGKAP 454 (470)
T ss_pred EEcccCCCCCCCCCCCccccceeeEEcCCceEEEEcCCCC
Confidence 988643 56788886432 322334999999754
No 19
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=99.95 E-value=2.4e-26 Score=237.12 Aligned_cols=267 Identities=18% Similarity=0.134 Sum_probs=182.4
Q ss_pred CcccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEE-CCCC-CEEeCccCCCcccccc
Q 039705 29 SGISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYD-AESA-AIRPLKILTDTWSSSG 106 (539)
Q Consensus 29 ~~~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yD-p~t~-~w~~l~~~~~~~c~~~ 106 (539)
.++.++-++++ ++++|++||.+. +. ..+.++ |+..+.....+|+ +..+ +|+.+..++..++.++
T Consensus 2 ~~~~g~~~~~~--~~~l~v~GG~~~-~~-~~~~~~----------g~~~~~~~v~~~~~~~~~~~W~~~~~lp~~r~~~~ 67 (323)
T TIGR03548 2 LGVAGCYAGII--GDYILVAGGCNF-PE-DPLAEG----------GKKKNYKGIYIAKDENSNLKWVKDGQLPYEAAYGA 67 (323)
T ss_pred CceeeEeeeEE--CCEEEEeeccCC-CC-CchhhC----------CcEEeeeeeEEEecCCCceeEEEcccCCccccceE
Confidence 35666667775 899999999753 21 011111 1111233344454 4433 7999998888877667
Q ss_pred eecCCCcEEEecCCCC--CCCeEEEEeCCCCccce----eecccccccccccceeEEccCCcEEEEcCccC----CeEEE
Q 039705 107 GLSANGTIVISGGWSS--RGRSVRYLSGCYHACYW----KEHHWELSAKRWFSTQHILPDGSFIVVGGRRE----FSYEY 176 (539)
Q Consensus 107 ~~l~dG~l~v~GG~~~--g~~~v~~ydP~~~t~~W----~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~----~~~E~ 176 (539)
++..+++||++||... ..+++++||+. +++| ..+++ |+.+|..++++++ +++|||+||... .++++
T Consensus 68 ~~~~~~~lyviGG~~~~~~~~~v~~~d~~--~~~w~~~~~~~~~-lp~~~~~~~~~~~-~~~iYv~GG~~~~~~~~~v~~ 143 (323)
T TIGR03548 68 SVSVENGIYYIGGSNSSERFSSVYRITLD--ESKEELICETIGN-LPFTFENGSACYK-DGTLYVGGGNRNGKPSNKSYL 143 (323)
T ss_pred EEEECCEEEEEcCCCCCCCceeEEEEEEc--CCceeeeeeEcCC-CCcCccCceEEEE-CCEEEEEeCcCCCccCceEEE
Confidence 7778999999999753 25789999998 7777 77776 9999988888877 899999999742 46899
Q ss_pred E-ecCCCcceeeccCcc-ccCCCCCCCCcceEEEeeCCcEEEEEcC------ceeEeeCCCCeEEEEcccCCCC--CCcc
Q 039705 177 I-LKEGKRIIYDLPILN-ETTNPSENNLYPFVFLSTDGNLFIFAND------RSILLNPETNEILHVFPILRGG--SRNY 246 (539)
Q Consensus 177 y-P~~~~~~w~~~~~l~-~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~------~~e~yDp~tn~W~~~~p~mp~~--~r~y 246 (539)
| |.++ +|..++.+. ..+ ..++++..+++||++||. ++++|||++++|+ .+++|+.. ++..
T Consensus 144 yd~~~~--~W~~~~~~p~~~r-------~~~~~~~~~~~iYv~GG~~~~~~~~~~~yd~~~~~W~-~~~~~~~~~~p~~~ 213 (323)
T TIGR03548 144 FNLETQ--EWFELPDFPGEPR-------VQPVCVKLQNELYVFGGGSNIAYTDGYKYSPKKNQWQ-KVADPTTDSEPISL 213 (323)
T ss_pred EcCCCC--CeeECCCCCCCCC-------CcceEEEECCEEEEEcCCCCccccceEEEecCCCeeE-ECCCCCCCCCceec
Confidence 9 9988 998765332 221 124567789999999995 3689999999998 78887531 2221
Q ss_pred CCCccEEecccccCCCCCCCcccEEEEecCCCCCccccc-C-----CC---------------cccccCCceEEEEeeCC
Q 039705 247 PASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLA-G-----KG---------------EFMNALQDCGRIEITNK 305 (539)
Q Consensus 247 p~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~-~-----~~---------------~~~~a~~s~~~~d~~~~ 305 (539)
. ..+++.+ .+++|||+||.+...+.+. + .+ ....-.+++++||+.
T Consensus 214 ~-~~~~~~~-----------~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~yd~~-- 279 (323)
T TIGR03548 214 L-GAASIKI-----------NESLLLCIGGFNKDVYNDAVIDLATMKDESLKGYKKEYFLKPPEWYNWNRKILIYNVR-- 279 (323)
T ss_pred c-ceeEEEE-----------CCCEEEEECCcCHHHHHHHHhhhhhccchhhhhhHHHHhCCCccccCcCceEEEEECC--
Confidence 1 1222221 3789999999763110000 0 00 000013579999997
Q ss_pred CCceeee-ccC-CCceeceeEEecCCcEEEEcCcC
Q 039705 306 SATWQRE-MMP-SPRVMGEMLLLPTGDVLIINGAK 338 (539)
Q Consensus 306 ~~~W~~~-~M~-~~R~~~~~vvlpdG~I~vvGG~~ 338 (539)
+++|+.. +|+ .+|..++++++ |++||++||..
T Consensus 280 ~~~W~~~~~~p~~~r~~~~~~~~-~~~iyv~GG~~ 313 (323)
T TIGR03548 280 TGKWKSIGNSPFFARCGAALLLT-GNNIFSINGEL 313 (323)
T ss_pred CCeeeEcccccccccCchheEEE-CCEEEEEeccc
Confidence 5899987 787 58888877665 99999999975
No 20
>PHA03098 kelch-like protein; Provisional
Probab=99.95 E-value=4e-26 Score=251.19 Aligned_cols=248 Identities=15% Similarity=0.126 Sum_probs=184.0
Q ss_pred cEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccC-----CeEEEE-ecCCCccee
Q 039705 113 TIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRRE-----FSYEYI-LKEGKRIIY 186 (539)
Q Consensus 113 ~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~-----~~~E~y-P~~~~~~w~ 186 (539)
.+++.||..+....+..|++. .++|..+++ ++..+ .++++++ +++||++||.+. ..+.+| +.++ +|.
T Consensus 252 ~~~~~~g~~~~~~~~~~~~~~--~~~~~~~~~-~~~~~-~~~~~~~-~~~lyv~GG~~~~~~~~~~v~~yd~~~~--~W~ 324 (534)
T PHA03098 252 IIYIHITMSIFTYNYITNYSP--LSEINTIID-IHYVY-CFGSVVL-NNVIYFIGGMNKNNLSVNSVVSYDTKTK--SWN 324 (534)
T ss_pred ceEeecccchhhceeeecchh--hhhcccccC-ccccc-cceEEEE-CCEEEEECCCcCCCCeeccEEEEeCCCC--eee
Confidence 344555543223456678887 788988875 55333 3455666 899999999864 257788 8887 998
Q ss_pred eccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-------ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEeccccc
Q 039705 187 DLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-------RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKL 259 (539)
Q Consensus 187 ~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-------~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~ 259 (539)
..+.+...+ +-+.++..+|+||++||. ++++||+.+++|+ .+++||. +|.. .+++.
T Consensus 325 ~~~~~~~~R-------~~~~~~~~~~~lyv~GG~~~~~~~~~v~~yd~~~~~W~-~~~~lp~-~r~~---~~~~~----- 387 (534)
T PHA03098 325 KVPELIYPR-------KNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWR-EEPPLIF-PRYN---PCVVN----- 387 (534)
T ss_pred ECCCCCccc-------ccceEEEECCEEEEEeCCCCCEecceEEEEcCCCCcee-eCCCcCc-CCcc---ceEEE-----
Confidence 766554322 225677889999999994 4899999999998 7999987 5643 22232
Q ss_pred CCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcC
Q 039705 260 QDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAK 338 (539)
Q Consensus 260 ~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~ 338 (539)
.+++||++||...+. ..++++++||+. +++|+.. +||.+|..++++++ +++|||+||..
T Consensus 388 -------~~~~iYv~GG~~~~~----------~~~~~v~~yd~~--t~~W~~~~~~p~~r~~~~~~~~-~~~iyv~GG~~ 447 (534)
T PHA03098 388 -------VNNLIYVIGGISKND----------ELLKTVECFSLN--TNKWSKGSPLPISHYGGCAIYH-DGKIYVIGGIS 447 (534)
T ss_pred -------ECCEEEEECCcCCCC----------cccceEEEEeCC--CCeeeecCCCCccccCceEEEE-CCEEEEECCcc
Confidence 389999999965211 236789999987 5999997 99999998887665 99999999975
Q ss_pred CCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCCCCCccCCCCCCCcceeeEE
Q 039705 339 KGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPHSRYNLTSGSKYPTELRIEK 418 (539)
Q Consensus 339 ~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~~~~~~~~~~~p~~~~vE~ 418 (539)
. .. .......+++|||+++ +|+.+++|+.+|..|+.+++ ||+|||.||..... + ..++|+
T Consensus 448 ~-~~----~~~~~~~v~~yd~~~~---~W~~~~~~~~~r~~~~~~~~--~~~iyv~GG~~~~~--------~--~~~v~~ 507 (534)
T PHA03098 448 Y-ID----NIKVYNIVESYNPVTN---KWTELSSLNFPRINASLCIF--NNKIYVVGGDKYEY--------Y--INEIEV 507 (534)
T ss_pred C-CC----CCcccceEEEecCCCC---ceeeCCCCCcccccceEEEE--CCEEEEEcCCcCCc--------c--cceeEE
Confidence 2 11 0011235899999999 99999999999999988777 99999999975432 1 347999
Q ss_pred ecCCCC
Q 039705 419 FYPPYF 424 (539)
Q Consensus 419 y~Ppyl 424 (539)
|+|..-
T Consensus 508 yd~~~~ 513 (534)
T PHA03098 508 YDDKTN 513 (534)
T ss_pred EeCCCC
Confidence 999864
No 21
>PLN02193 nitrile-specifier protein
Probab=99.94 E-value=1.2e-24 Score=235.24 Aligned_cols=264 Identities=15% Similarity=0.143 Sum_probs=187.5
Q ss_pred cCCCcEEEecCCCC-CCCeEEEE--eCCC--Cccceeeccc--ccccccccceeEEccCCcEEEEcCccC------CeEE
Q 039705 109 SANGTIVISGGWSS-RGRSVRYL--SGCY--HACYWKEHHW--ELSAKRWFSTQHILPDGSFIVVGGRRE------FSYE 175 (539)
Q Consensus 109 l~dG~l~v~GG~~~-g~~~v~~y--dP~~--~t~~W~~~~~--~m~~~R~y~s~~~L~dG~VyvvGG~~~------~~~E 175 (539)
+.+++|+.++|... .+.++-.| +|.. ..++|..+.+ +++.+|..|+++++ +++|||+||... ..++
T Consensus 118 ~~~~~ivgf~G~~~~~~~~ig~y~~~~~~~~~~~~W~~~~~~~~~P~pR~~h~~~~~-~~~iyv~GG~~~~~~~~~~~v~ 196 (470)
T PLN02193 118 LQGGKIVGFHGRSTDVLHSLGAYISLPSTPKLLGKWIKVEQKGEGPGLRCSHGIAQV-GNKIYSFGGEFTPNQPIDKHLY 196 (470)
T ss_pred EcCCeEEEEeccCCCcEEeeEEEEecCCChhhhceEEEcccCCCCCCCccccEEEEE-CCEEEEECCcCCCCCCeeCcEE
Confidence 45889999999753 24444444 7751 1379998764 25789999998888 799999999742 2578
Q ss_pred EE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-------ceeEeeCCCCeEEEEcccC---CCCCC
Q 039705 176 YI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-------RSILLNPETNEILHVFPIL---RGGSR 244 (539)
Q Consensus 176 ~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-------~~e~yDp~tn~W~~~~p~m---p~~~r 244 (539)
+| +.++ +|...+...... .....-+++++.+++||++||. ++++||+.+++|+ .+++| |. +|
T Consensus 197 ~yD~~~~--~W~~~~~~g~~P---~~~~~~~~~v~~~~~lYvfGG~~~~~~~ndv~~yD~~t~~W~-~l~~~~~~P~-~R 269 (470)
T PLN02193 197 VFDLETR--TWSISPATGDVP---HLSCLGVRMVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEWK-LLTPVEEGPT-PR 269 (470)
T ss_pred EEECCCC--EEEeCCCCCCCC---CCcccceEEEEECCEEEEECCCCCCCCCccEEEEECCCCEEE-EcCcCCCCCC-Cc
Confidence 89 8877 997544321100 0011235577789999999993 5899999999998 68887 43 45
Q ss_pred ccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee----ccCCCcee
Q 039705 245 NYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE----MMPSPRVM 320 (539)
Q Consensus 245 ~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~----~M~~~R~~ 320 (539)
.+ .+++. .+++|||+||.+.. ..++.+++||+. +++|+.. .|+.+|..
T Consensus 270 ~~---h~~~~------------~~~~iYv~GG~~~~-----------~~~~~~~~yd~~--t~~W~~~~~~~~~~~~R~~ 321 (470)
T PLN02193 270 SF---HSMAA------------DEENVYVFGGVSAT-----------ARLKTLDSYNIV--DKKWFHCSTPGDSFSIRGG 321 (470)
T ss_pred cc---eEEEE------------ECCEEEEECCCCCC-----------CCcceEEEEECC--CCEEEeCCCCCCCCCCCCC
Confidence 43 22222 37899999998621 135678999987 5999975 27788998
Q ss_pred ceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCC---CCCccceeeEeeCCCCeEEEecCC
Q 039705 321 GEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPT---SKPRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 321 ~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~---~~~R~yhs~a~llpdG~V~v~GG~ 397 (539)
+.++++ +|+|||+||.+ +. ....+++|||+++ +|+.+.++ +.+|..|+++++ +++|||.||.
T Consensus 322 ~~~~~~-~gkiyviGG~~-g~--------~~~dv~~yD~~t~---~W~~~~~~g~~P~~R~~~~~~~~--~~~iyv~GG~ 386 (470)
T PLN02193 322 AGLEVV-QGKVWVVYGFN-GC--------EVDDVHYYDPVQD---KWTQVETFGVRPSERSVFASAAV--GKHIVIFGGE 386 (470)
T ss_pred cEEEEE-CCcEEEEECCC-CC--------ccCceEEEECCCC---EEEEeccCCCCCCCcceeEEEEE--CCEEEEECCc
Confidence 887665 99999999975 32 1247899999999 99998765 889999998877 9999999997
Q ss_pred CCCCCc-cCCCCCCCcceeeEEecCCCCC
Q 039705 398 PHSRYN-LTSGSKYPTELRIEKFYPPYFD 425 (539)
Q Consensus 398 ~~~~~~-~~~~~~~p~~~~vE~y~Ppyl~ 425 (539)
...... ..+...+ ..++++|+|...-
T Consensus 387 ~~~~~~~~~~~~~~--~ndv~~~D~~t~~ 413 (470)
T PLN02193 387 IAMDPLAHVGPGQL--TDGTFALDTETLQ 413 (470)
T ss_pred cCCccccccCccce--eccEEEEEcCcCE
Confidence 532100 0000011 2368999998764
No 22
>KOG4693 consensus Uncharacterized conserved protein, contains kelch repeat [General function prediction only]
Probab=99.82 E-value=1.8e-18 Score=165.24 Aligned_cols=256 Identities=14% Similarity=0.182 Sum_probs=173.4
Q ss_pred ccccceecCCCcEEEecCCCCC-------CCeEEEEeCCCCccceeecccc------------cccccccceeEEccCCc
Q 039705 102 WSSSGGLSANGTIVISGGWSSR-------GRSVRYLSGCYHACYWKEHHWE------------LSAKRWFSTQHILPDGS 162 (539)
Q Consensus 102 ~c~~~~~l~dG~l~v~GG~~~g-------~~~v~~ydP~~~t~~W~~~~~~------------m~~~R~y~s~~~L~dG~ 162 (539)
+-.++++....+||-+||+..| --.+.+++.. +-.|+.+++. .+..|..|+++.. +++
T Consensus 14 RVNHAavaVG~riYSFGGYCsGedy~~~~piDVH~lNa~--~~RWtk~pp~~~ka~i~~~yp~VPyqRYGHtvV~y-~d~ 90 (392)
T KOG4693|consen 14 RVNHAAVAVGSRIYSFGGYCSGEDYDAKDPIDVHVLNAE--NYRWTKMPPGITKATIESPYPAVPYQRYGHTVVEY-QDK 90 (392)
T ss_pred cccceeeeecceEEecCCcccccccccCCcceeEEeecc--ceeEEecCcccccccccCCCCccchhhcCceEEEE-cce
Confidence 4445566678899999998533 1356778887 8899988751 3345888887776 899
Q ss_pred EEEEcCccC-----CeEEEE-ecCCCccee---eccCccccCCCCCCCCcceEEEeeCCcEEEEEcC---------ceeE
Q 039705 163 FIVVGGRRE-----FSYEYI-LKEGKRIIY---DLPILNETTNPSENNLYPFVFLSTDGNLFIFAND---------RSIL 224 (539)
Q Consensus 163 VyvvGG~~~-----~~~E~y-P~~~~~~w~---~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~---------~~e~ 224 (539)
+||-||++. +..-.| |+++ +|. ..-+++..+|. |.+++.+..+|+|||. ++..
T Consensus 91 ~yvWGGRND~egaCN~Ly~fDp~t~--~W~~p~v~G~vPgaRDG-------HsAcV~gn~MyiFGGye~~a~~FS~d~h~ 161 (392)
T KOG4693|consen 91 AYVWGGRNDDEGACNLLYEFDPETN--VWKKPEVEGFVPGARDG-------HSACVWGNQMYIFGGYEEDAQRFSQDTHV 161 (392)
T ss_pred EEEEcCccCcccccceeeeeccccc--cccccceeeecCCccCC-------ceeeEECcEEEEecChHHHHHhhhcccee
Confidence 999999975 234456 9988 884 23455555553 5688889999999994 4678
Q ss_pred eeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeC
Q 039705 225 LNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITN 304 (539)
Q Consensus 225 yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~ 304 (539)
+|..|-+|. .+...-. +..|.-..++++ +++..||+||... ... +.-.+...-.+....+|..
T Consensus 162 ld~~TmtWr-~~~Tkg~-PprwRDFH~a~~------------~~~~MYiFGGR~D-~~g-pfHs~~e~Yc~~i~~ld~~- 224 (392)
T KOG4693|consen 162 LDFATMTWR-EMHTKGD-PPRWRDFHTASV------------IDGMMYIFGGRSD-ESG-PFHSIHEQYCDTIMALDLA- 224 (392)
T ss_pred Eeccceeee-ehhccCC-Cchhhhhhhhhh------------ccceEEEeccccc-cCC-CccchhhhhcceeEEEecc-
Confidence 999999997 5533222 212222244444 3899999999752 211 0000011112233345554
Q ss_pred CCCceeee---c-cCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecC---CCCCCc
Q 039705 305 KSATWQRE---M-MPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELT---PTSKPR 377 (539)
Q Consensus 305 ~~~~W~~~---~-M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a---~~~~~R 377 (539)
+..|... . .|.+|..|.+-+. ||++|++||++ |.- +.-.....+|||.|. .|+.+. .-+.+|
T Consensus 225 -T~aW~r~p~~~~~P~GRRSHS~fvY-ng~~Y~FGGYn-g~l-----n~HfndLy~FdP~t~---~W~~I~~~Gk~P~aR 293 (392)
T KOG4693|consen 225 -TGAWTRTPENTMKPGGRRSHSTFVY-NGKMYMFGGYN-GTL-----NVHFNDLYCFDPKTS---MWSVISVRGKYPSAR 293 (392)
T ss_pred -ccccccCCCCCcCCCcccccceEEE-cceEEEecccc-hhh-----hhhhcceeecccccc---hheeeeccCCCCCcc
Confidence 6889963 4 4888999987665 99999999997 432 111225789999999 999763 456677
Q ss_pred cceeeEeeCCCCeEEEecCCCC
Q 039705 378 MCHSTSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 378 ~yhs~a~llpdG~V~v~GG~~~ 399 (539)
.-|++.+. ++|||..||...
T Consensus 294 RRqC~~v~--g~kv~LFGGTsP 313 (392)
T KOG4693|consen 294 RRQCSVVS--GGKVYLFGGTSP 313 (392)
T ss_pred cceeEEEE--CCEEEEecCCCC
Confidence 77877766 999999999643
No 23
>KOG4693 consensus Uncharacterized conserved protein, contains kelch repeat [General function prediction only]
Probab=99.76 E-value=6.6e-17 Score=154.60 Aligned_cols=262 Identities=14% Similarity=0.132 Sum_probs=180.0
Q ss_pred ceEEccCCCcccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEeCccC--
Q 039705 21 KWELASENSGISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRPLKIL-- 98 (539)
Q Consensus 21 ~W~~~~~~~~~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~l~~~-- 98 (539)
.|+.-...-|.+--|+++- ...+||-|||.-.|.+- + . . -.-.+.+++.++-+|+.+++.
T Consensus 3 ~WTVHLeGGPrRVNHAava-VG~riYSFGGYCsGedy-~-------~-~--------~piDVH~lNa~~~RWtk~pp~~~ 64 (392)
T KOG4693|consen 3 TWTVHLEGGPRRVNHAAVA-VGSRIYSFGGYCSGEDY-D-------A-K--------DPIDVHVLNAENYRWTKMPPGIT 64 (392)
T ss_pred eEEEEecCCcccccceeee-ecceEEecCCccccccc-c-------c-C--------CcceeEEeeccceeEEecCcccc
Confidence 4766544456777788887 89999999997655421 0 0 0 012466788888999988752
Q ss_pred -----------CCcccccceecCCCcEEEecCCCC--C-CCeEEEEeCCCCccceeecc--cccccccccceeEEccCCc
Q 039705 99 -----------TDTWSSSGGLSANGTIVISGGWSS--R-GRSVRYLSGCYHACYWKEHH--WELSAKRWFSTQHILPDGS 162 (539)
Q Consensus 99 -----------~~~~c~~~~~l~dG~l~v~GG~~~--g-~~~v~~ydP~~~t~~W~~~~--~~m~~~R~y~s~~~L~dG~ 162 (539)
+-.+..+.++..++++|+-||.+| + -+....|||. +++|.... .-++-+|-.|++|++ ++.
T Consensus 65 ka~i~~~yp~VPyqRYGHtvV~y~d~~yvWGGRND~egaCN~Ly~fDp~--t~~W~~p~v~G~vPgaRDGHsAcV~-gn~ 141 (392)
T KOG4693|consen 65 KATIESPYPAVPYQRYGHTVVEYQDKAYVWGGRNDDEGACNLLYEFDPE--TNVWKKPEVEGFVPGARDGHSACVW-GNQ 141 (392)
T ss_pred cccccCCCCccchhhcCceEEEEcceEEEEcCccCcccccceeeeeccc--cccccccceeeecCCccCCceeeEE-CcE
Confidence 235677778888999999999875 2 3567889999 99997632 236778999999999 789
Q ss_pred EEEEcCccC----CeEEEE---ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCc--------------
Q 039705 163 FIVVGGRRE----FSYEYI---LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDR-------------- 221 (539)
Q Consensus 163 VyvvGG~~~----~~~E~y---P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~-------------- 221 (539)
.||.||... .+-+.+ -.+- +|+.+.. ...-+++.--|...+.++.+|+|||+.
T Consensus 142 MyiFGGye~~a~~FS~d~h~ld~~Tm--tWr~~~T----kg~PprwRDFH~a~~~~~~MYiFGGR~D~~gpfHs~~e~Yc 215 (392)
T KOG4693|consen 142 MYIFGGYEEDAQRFSQDTHVLDFATM--TWREMHT----KGDPPRWRDFHTASVIDGMMYIFGGRSDESGPFHSIHEQYC 215 (392)
T ss_pred EEEecChHHHHHhhhccceeEeccce--eeeehhc----cCCCchhhhhhhhhhccceEEEeccccccCCCccchhhhhc
Confidence 999999853 122232 2332 6753321 110111112256777899999999952
Q ss_pred --eeEeeCCCCeEEEEccc---CCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCc
Q 039705 222 --SILLNPETNEILHVFPI---LRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQD 296 (539)
Q Consensus 222 --~e~yDp~tn~W~~~~p~---mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s 296 (539)
...+|.+|..|++ .|+ +|+++|+ .++-. +++++|++||.+. .-. .-.+.
T Consensus 216 ~~i~~ld~~T~aW~r-~p~~~~~P~GRRS----HS~fv------------Yng~~Y~FGGYng-~ln--------~Hfnd 269 (392)
T KOG4693|consen 216 DTIMALDLATGAWTR-TPENTMKPGGRRS----HSTFV------------YNGKMYMFGGYNG-TLN--------VHFND 269 (392)
T ss_pred ceeEEEecccccccc-CCCCCcCCCcccc----cceEE------------EcceEEEecccch-hhh--------hhhcc
Confidence 4568999999984 443 4554554 33332 5999999999872 211 23456
Q ss_pred eEEEEeeCCCCceeee----ccCCCceeceeEEecCCcEEEEcCcC
Q 039705 297 CGRIEITNKSATWQRE----MMPSPRVMGEMLLLPTGDVLIINGAK 338 (539)
Q Consensus 297 ~~~~d~~~~~~~W~~~----~M~~~R~~~~~vvlpdG~I~vvGG~~ 338 (539)
..+|||. +..|... .-|.+|.-.++++. ++|||++||..
T Consensus 270 Ly~FdP~--t~~W~~I~~~Gk~P~aRRRqC~~v~-g~kv~LFGGTs 312 (392)
T KOG4693|consen 270 LYCFDPK--TSMWSVISVRGKYPSARRRQCSVVS-GGKVYLFGGTS 312 (392)
T ss_pred eeecccc--cchheeeeccCCCCCcccceeEEEE-CCEEEEecCCC
Confidence 7889986 5889974 46778887887766 99999999975
No 24
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=99.49 E-value=2.7e-12 Score=139.21 Aligned_cols=248 Identities=14% Similarity=0.216 Sum_probs=170.9
Q ss_pred CCcccccceecCCCcEEEecCCCC--CCC--eEEEEeCCCCccceeeccc--ccccccccceeEEccCCcEEEEcCccC-
Q 039705 99 TDTWSSSGGLSANGTIVISGGWSS--RGR--SVRYLSGCYHACYWKEHHW--ELSAKRWFSTQHILPDGSFIVVGGRRE- 171 (539)
Q Consensus 99 ~~~~c~~~~~l~dG~l~v~GG~~~--g~~--~v~~ydP~~~t~~W~~~~~--~m~~~R~y~s~~~L~dG~VyvvGG~~~- 171 (539)
+..+..+.+.+.+.+++|+||... ... ++.+||-. +..|..... .-+.+|..++.+++ +.++|++||.+.
T Consensus 58 p~~R~~hs~~~~~~~~~vfGG~~~~~~~~~~dl~~~d~~--~~~w~~~~~~g~~p~~r~g~~~~~~-~~~l~lfGG~~~~ 134 (482)
T KOG0379|consen 58 PIPRAGHSAVLIGNKLYVFGGYGSGDRLTDLDLYVLDLE--SQLWTKPAATGDEPSPRYGHSLSAV-GDKLYLFGGTDKK 134 (482)
T ss_pred cchhhccceeEECCEEEEECCCCCCCccccceeEEeecC--CcccccccccCCCCCcccceeEEEE-CCeEEEEccccCC
Confidence 344555666677999999999753 223 48899988 888987542 24567888888888 799999999974
Q ss_pred ----CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc--------CceeEeeCCCCeEEEEccc
Q 039705 172 ----FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN--------DRSILLNPETNEILHVFPI 238 (539)
Q Consensus 172 ----~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--------~~~e~yDp~tn~W~~~~p~ 238 (539)
..+.+| +.+. +|..+..... .+-...-|.+++.+.+|||+|| ++.++||.++.+|. .+..
T Consensus 135 ~~~~~~l~~~d~~t~--~W~~l~~~~~----~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~-~~~~ 207 (482)
T KOG0379|consen 135 YRNLNELHSLDLSTR--TWSLLSPTGD----PPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETSTWS-ELDT 207 (482)
T ss_pred CCChhheEeccCCCC--cEEEecCcCC----CCCCcccceEEEECCEEEEECCccCcccceeeeeeeccccccce-eccc
Confidence 234556 6666 8964432111 1223344778888999999999 35899999999998 4432
Q ss_pred CCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee----cc
Q 039705 239 LRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE----MM 314 (539)
Q Consensus 239 mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~----~M 314 (539)
....+ -|+.+.+..+ .+.+++++||.+.+. ..++++.++|+. +.+|... .+
T Consensus 208 ~g~~P--~pR~gH~~~~-----------~~~~~~v~gG~~~~~----------~~l~D~~~ldl~--~~~W~~~~~~g~~ 262 (482)
T KOG0379|consen 208 QGEAP--SPRYGHAMVV-----------VGNKLLVFGGGDDGD----------VYLNDVHILDLS--TWEWKLLPTGGDL 262 (482)
T ss_pred CCCCC--CCCCCceEEE-----------ECCeEEEEeccccCC----------ceecceEeeecc--cceeeeccccCCC
Confidence 21111 1222333221 388999999976222 247789999987 5889852 58
Q ss_pred CCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCC----CCCCccceeeEeeCCCCe
Q 039705 315 PSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTP----TSKPRMCHSTSVVLPDGK 390 (539)
Q Consensus 315 ~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~----~~~~R~yhs~a~llpdG~ 390 (539)
|.+|..|.++ ....+++++||...+.. .+..+...||.++. .|+.+.. .+.+|..|...+.-..++
T Consensus 263 p~~R~~h~~~-~~~~~~~l~gG~~~~~~------~~l~~~~~l~~~~~---~w~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (482)
T KOG0379|consen 263 PSPRSGHSLT-VSGDHLLLFGGGTDPKQ------EPLGDLYGLDLETL---VWSKVESVGVVRPSPRLGHAAELIDELGK 332 (482)
T ss_pred CCCcceeeeE-EECCEEEEEcCCccccc------cccccccccccccc---ceeeeeccccccccccccccceeeccCCc
Confidence 9999999988 55999999999763210 14457888999998 8987644 356888888766644444
Q ss_pred E
Q 039705 391 I 391 (539)
Q Consensus 391 V 391 (539)
.
T Consensus 333 ~ 333 (482)
T KOG0379|consen 333 D 333 (482)
T ss_pred c
Confidence 3
No 25
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=99.45 E-value=5e-12 Score=137.15 Aligned_cols=209 Identities=16% Similarity=0.246 Sum_probs=146.2
Q ss_pred cccccccceeEEccCCcEEEEcCccCC----eEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC
Q 039705 146 LSAKRWFSTQHILPDGSFIVVGGRREF----SYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND 220 (539)
Q Consensus 146 m~~~R~y~s~~~L~dG~VyvvGG~~~~----~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~ 220 (539)
.+.+|+.|+++.. ++++||.||.... ..++| .......|..... .. ..+...|-+..+..+.+||++||.
T Consensus 57 ~p~~R~~hs~~~~-~~~~~vfGG~~~~~~~~~~dl~~~d~~~~~w~~~~~-~g---~~p~~r~g~~~~~~~~~l~lfGG~ 131 (482)
T KOG0379|consen 57 GPIPRAGHSAVLI-GNKLYVFGGYGSGDRLTDLDLYVLDLESQLWTKPAA-TG---DEPSPRYGHSLSAVGDKLYLFGGT 131 (482)
T ss_pred CcchhhccceeEE-CCEEEEECCCCCCCccccceeEEeecCCcccccccc-cC---CCCCcccceeEEEECCeEEEEccc
Confidence 5678999988888 8999999997541 11355 3333235642211 11 112234556778889999999995
Q ss_pred --------ceeEeeCCCCeEEEEcccCCCC--CCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcc
Q 039705 221 --------RSILLNPETNEILHVFPILRGG--SRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEF 290 (539)
Q Consensus 221 --------~~e~yDp~tn~W~~~~p~mp~~--~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~ 290 (539)
+...||..+++|. .+.+.... +|.. .++++ .+.+|||+||.+...
T Consensus 132 ~~~~~~~~~l~~~d~~t~~W~-~l~~~~~~P~~r~~---Hs~~~------------~g~~l~vfGG~~~~~--------- 186 (482)
T KOG0379|consen 132 DKKYRNLNELHSLDLSTRTWS-LLSPTGDPPPPRAG---HSATV------------VGTKLVVFGGIGGTG--------- 186 (482)
T ss_pred cCCCCChhheEeccCCCCcEE-EecCcCCCCCCccc---ceEEE------------ECCEEEEECCccCcc---------
Confidence 4789999999998 44433221 3432 23332 378999999986311
Q ss_pred cccCCceEEEEeeCCCCceeee----ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCc
Q 039705 291 MNALQDCGRIEITNKSATWQRE----MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINER 366 (539)
Q Consensus 291 ~~a~~s~~~~d~~~~~~~W~~~----~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~ 366 (539)
..++++.+||+. +.+|... +-|.||..|.+++. ++++||+||...+. ..+..+.++|-.+- +
T Consensus 187 -~~~ndl~i~d~~--~~~W~~~~~~g~~P~pR~gH~~~~~-~~~~~v~gG~~~~~-------~~l~D~~~ldl~~~---~ 252 (482)
T KOG0379|consen 187 -DSLNDLHIYDLE--TSTWSELDTQGEAPSPRYGHAMVVV-GNKLLVFGGGDDGD-------VYLNDVHILDLSTW---E 252 (482)
T ss_pred -cceeeeeeeccc--cccceecccCCCCCCCCCCceEEEE-CCeEEEEeccccCC-------ceecceEeeecccc---e
Confidence 146789999987 5789973 57889999997766 99999999976221 12336889999998 9
Q ss_pred eEecC---CCCCCccceeeEeeCCCCeEEEecCCCCC
Q 039705 367 FSELT---PTSKPRMCHSTSVVLPDGKILVAGSNPHS 400 (539)
Q Consensus 367 Wt~~a---~~~~~R~yhs~a~llpdG~V~v~GG~~~~ 400 (539)
|..+. ..+.+|++|+.++. ..++++.||.+..
T Consensus 253 W~~~~~~g~~p~~R~~h~~~~~--~~~~~l~gG~~~~ 287 (482)
T KOG0379|consen 253 WKLLPTGGDLPSPRSGHSLTVS--GDHLLLFGGGTDP 287 (482)
T ss_pred eeeccccCCCCCCcceeeeEEE--CCEEEEEcCCccc
Confidence 99654 57889999998865 7889999997664
No 26
>KOG4152 consensus Host cell transcription factor HCFC1 [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.33 E-value=5.1e-11 Score=123.08 Aligned_cols=277 Identities=14% Similarity=0.180 Sum_probs=168.7
Q ss_pred CCEEeCcc----CCCcccccceecCCCcEEEecCCCCC-CCeEEEEeCCCCccceeecc--cccccccccceeEEccCCc
Q 039705 90 AAIRPLKI----LTDTWSSSGGLSANGTIVISGGWSSR-GRSVRYLSGCYHACYWKEHH--WELSAKRWFSTQHILPDGS 162 (539)
Q Consensus 90 ~~w~~l~~----~~~~~c~~~~~l~dG~l~v~GG~~~g-~~~v~~ydP~~~t~~W~~~~--~~m~~~R~y~s~~~L~dG~ 162 (539)
-.|+.+.. .+..|..+.++.....|+|+||-++| ......|+-+ +++|..-+ .+.+.+..-++.+.. ..|
T Consensus 17 ~rWrrV~~~tGPvPrpRHGHRAVaikELiviFGGGNEGiiDELHvYNTa--tnqWf~PavrGDiPpgcAA~Gfvcd-Gtr 93 (830)
T KOG4152|consen 17 VRWRRVQQSTGPVPRPRHGHRAVAIKELIVIFGGGNEGIIDELHVYNTA--TNQWFAPAVRGDIPPGCAAFGFVCD-GTR 93 (830)
T ss_pred cceEEEecccCCCCCccccchheeeeeeEEEecCCcccchhhhhhhccc--cceeecchhcCCCCCchhhcceEec-Cce
Confidence 36776643 24456666677788999999998776 4678899998 99997532 135555544454444 569
Q ss_pred EEEEcCccC---CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCceeEeeCCCC--------
Q 039705 163 FIVVGGRRE---FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRSILLNPETN-------- 230 (539)
Q Consensus 163 VyvvGG~~~---~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~e~yDp~tn-------- 230 (539)
||++||... .+-|.| -+.....|..+..-....+..+-..--|.+.+...|-|+|||-.-+.=||+.|
T Consensus 94 ilvFGGMvEYGkYsNdLYELQasRWeWkrlkp~~p~nG~pPCPRlGHSFsl~gnKcYlFGGLaNdseDpknNvPrYLnDl 173 (830)
T KOG4152|consen 94 ILVFGGMVEYGKYSNDLYELQASRWEWKRLKPKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNVPRYLNDL 173 (830)
T ss_pred EEEEccEeeeccccchHHHhhhhhhhHhhcCCCCCCCCCCCCCccCceeEEeccEeEEeccccccccCcccccchhhcce
Confidence 999999753 233455 33321133333211111122222233367888899999999932222233332
Q ss_pred ------------eEEEEc--ccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCc
Q 039705 231 ------------EILHVF--PILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQD 296 (539)
Q Consensus 231 ------------~W~~~~--p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s 296 (539)
.|...+ -.+|. +|. ++.+|++-- .+ .-..|++|.||.. |. .+.+
T Consensus 174 Y~leL~~Gsgvv~W~ip~t~Gv~P~-pRE---SHTAViY~e-----KD-s~~skmvvyGGM~-G~-----------RLgD 231 (830)
T KOG4152|consen 174 YILELRPGSGVVAWDIPITYGVLPP-PRE---SHTAVIYTE-----KD-SKKSKMVVYGGMS-GC-----------RLGD 231 (830)
T ss_pred EEEEeccCCceEEEecccccCCCCC-Ccc---cceeEEEEe-----cc-CCcceEEEEcccc-cc-----------cccc
Confidence 243111 12232 444 356666421 11 1257999999986 32 3667
Q ss_pred eEEEEeeCCCCceeee----ccCCCceeceeEEecCCcEEEEcCcCC--C----CCCcccCCCCCCccEEEcCCCCCCCc
Q 039705 297 CGRIEITNKSATWQRE----MMPSPRVMGEMLLLPTGDVLIINGAKK--G----TAGWNFATDPNTTPVLYEPDDPINER 366 (539)
Q Consensus 297 ~~~~d~~~~~~~W~~~----~M~~~R~~~~~vvlpdG~I~vvGG~~~--g----~~g~~~~~~~~~~~e~YdP~t~~g~~ 366 (539)
...+|++ +-+|... --|.+|+.|.+++. .+|+||+||.-- + .+--..+=....+.-|+|-++. +
T Consensus 232 LW~Ldl~--Tl~W~kp~~~G~~PlPRSLHsa~~I-GnKMyvfGGWVPl~~~~~~~~~hekEWkCTssl~clNldt~---~ 305 (830)
T KOG4152|consen 232 LWTLDLD--TLTWNKPSLSGVAPLPRSLHSATTI-GNKMYVFGGWVPLVMDDVKVATHEKEWKCTSSLACLNLDTM---A 305 (830)
T ss_pred eeEEecc--eeecccccccCCCCCCcccccceee-cceeEEecceeeeeccccccccccceeeeccceeeeeecch---h
Confidence 7888886 5789873 25788999997655 999999999620 0 0000000012235678999999 9
Q ss_pred eEecC-------CCCCCccceeeEeeCCCCeEEEecCCCC
Q 039705 367 FSELT-------PTSKPRMCHSTSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 367 Wt~~a-------~~~~~R~yhs~a~llpdG~V~v~GG~~~ 399 (539)
|+.+- ..+.+|..|+++.+ +-|+|+=-|.+.
T Consensus 306 W~tl~~d~~ed~tiPR~RAGHCAvAi--gtRlYiWSGRDG 343 (830)
T KOG4152|consen 306 WETLLMDTLEDNTIPRARAGHCAVAI--GTRLYIWSGRDG 343 (830)
T ss_pred eeeeeeccccccccccccccceeEEe--ccEEEEEeccch
Confidence 98741 25677888987776 899999988754
No 27
>KOG1230 consensus Protein containing repeated kelch motifs [General function prediction only]
Probab=99.27 E-value=1.5e-10 Score=117.29 Aligned_cols=244 Identities=17% Similarity=0.250 Sum_probs=158.2
Q ss_pred CCcEEEecCCC-CC-----CCeEEEEeCCCCccceeeccc-ccccccccceeEEccCCcEEEEcCccCCeEEEEecCCCc
Q 039705 111 NGTIVISGGWS-SR-----GRSVRYLSGCYHACYWKEHHW-ELSAKRWFSTQHILPDGSFIVVGGRREFSYEYILKEGKR 183 (539)
Q Consensus 111 dG~l~v~GG~~-~g-----~~~v~~ydP~~~t~~W~~~~~-~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~yP~~~~~ 183 (539)
...|+++||-. +| .+....||-. ++.|..+.. .-+.+|..|.+++.+.|.+++.||.... |...
T Consensus 78 keELilfGGEf~ngqkT~vYndLy~Yn~k--~~eWkk~~spn~P~pRsshq~va~~s~~l~~fGGEfaS-----Pnq~-- 148 (521)
T KOG1230|consen 78 KEELILFGGEFYNGQKTHVYNDLYSYNTK--KNEWKKVVSPNAPPPRSSHQAVAVPSNILWLFGGEFAS-----PNQE-- 148 (521)
T ss_pred cceeEEecceeecceeEEEeeeeeEEecc--ccceeEeccCCCcCCCccceeEEeccCeEEEeccccCC-----cchh--
Confidence 35899999954 33 3566778888 999998753 2557899999999998999999995321 2211
Q ss_pred ceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCC
Q 039705 184 IIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPN 263 (539)
Q Consensus 184 ~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~ 263 (539)
+ +- -| .+.|+||.++++|++ +. .++++ -|++|.- |..
T Consensus 149 q-----F~----------HY----------------kD~W~fd~~trkweq-l~-~~g~P--S~RSGHR-Mva------- 185 (521)
T KOG1230|consen 149 Q-----FH----------HY----------------KDLWLFDLKTRKWEQ-LE-FGGGP--SPRSGHR-MVA------- 185 (521)
T ss_pred h-----hh----------hh----------------hheeeeeeccchhee-ec-cCCCC--CCCccce-eEE-------
Confidence 0 00 01 247899999999994 42 22221 1223543 221
Q ss_pred CCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee--c--cCCCceeceeEEecCCcEEEEcCcCC
Q 039705 264 SNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE--M--MPSPRVMGEMLLLPTGDVLIINGAKK 339 (539)
Q Consensus 264 ~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~--~--M~~~R~~~~~vvlpdG~I~vvGG~~~ 339 (539)
...+|+++||... ... .....+.+.+||+. +=+|+.. + -|.+|+++++.+-|+|.|+|.||+.+
T Consensus 186 ---wK~~lilFGGFhd-~nr------~y~YyNDvy~FdLd--tykW~Klepsga~PtpRSGcq~~vtpqg~i~vyGGYsK 253 (521)
T KOG1230|consen 186 ---WKRQLILFGGFHD-SNR------DYIYYNDVYAFDLD--TYKWSKLEPSGAGPTPRSGCQFSVTPQGGIVVYGGYSK 253 (521)
T ss_pred ---eeeeEEEEcceec-CCC------ceEEeeeeEEEecc--ceeeeeccCCCCCCCCCCcceEEecCCCcEEEEcchhH
Confidence 4889999999752 111 11346789999986 5899985 3 48999999999999999999999863
Q ss_pred CCCCcccCCC-CCCccEEEcCCCCC--CCceEecCC---CCCCccceeeEeeCCCCeEEEecCCCCCCCccCCCCCCCcc
Q 039705 340 GTAGWNFATD-PNTTPVLYEPDDPI--NERFSELTP---TSKPRMCHSTSVVLPDGKILVAGSNPHSRYNLTSGSKYPTE 413 (539)
Q Consensus 340 g~~g~~~~~~-~~~~~e~YdP~t~~--g~~Wt~~a~---~~~~R~yhs~a~llpdG~V~v~GG~~~~~~~~~~~~~~p~~ 413 (539)
-..-=..+.. ......+-+|+++. --.|+.|.+ -+.||...|+++ .++++-|..||-..-. - --..
T Consensus 254 ~~~kK~~dKG~~hsDmf~L~p~~~~~dKw~W~kvkp~g~kPspRsgfsv~v-a~n~kal~FGGV~D~e--e-----eeEs 325 (521)
T KOG1230|consen 254 QRVKKDVDKGTRHSDMFLLKPEDGREDKWVWTKVKPSGVKPSPRSGFSVAV-AKNHKALFFGGVCDLE--E-----EEES 325 (521)
T ss_pred hhhhhhhhcCceeeeeeeecCCcCCCcceeEeeccCCCCCCCCCCceeEEE-ecCCceEEecceeccc--c-----cchh
Confidence 1100000011 12246778888731 125777755 467999998765 5899999999952210 0 0013
Q ss_pred eeeEEecCCCCCC
Q 039705 414 LRIEKFYPPYFDE 426 (539)
Q Consensus 414 ~~vE~y~Ppyl~~ 426 (539)
+.-|.|+=-|+|.
T Consensus 326 l~g~F~NDLy~fd 338 (521)
T KOG1230|consen 326 LSGEFFNDLYFFD 338 (521)
T ss_pred hhhhhhhhhhhee
Confidence 4567777777664
No 28
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=99.08 E-value=1.2e-08 Score=102.49 Aligned_cols=260 Identities=15% Similarity=0.197 Sum_probs=157.9
Q ss_pred CccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccc-cccccceeEEccCCcEEEEcCccC--
Q 039705 95 LKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELS-AKRWFSTQHILPDGSFIVVGGRRE-- 171 (539)
Q Consensus 95 l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~-~~R~y~s~~~L~dG~VyvvGG~~~-- 171 (539)
++..+..+-.+..++.+..+||-=|.. ..+-...|.+.....|++.+. .+ .+|-.+.++++ +|++|+.||...
T Consensus 30 lPdlPvg~KnG~Ga~ig~~~YVGLGs~--G~afy~ldL~~~~k~W~~~a~-FpG~~rnqa~~a~~-~~kLyvFgG~Gk~~ 105 (381)
T COG3055 30 LPDLPVGFKNGAGALIGDTVYVGLGSA--GTAFYVLDLKKPGKGWTKIAD-FPGGARNQAVAAVI-GGKLYVFGGYGKSV 105 (381)
T ss_pred CCCCCccccccccceecceEEEEeccC--CccceehhhhcCCCCceEccc-CCCcccccchheee-CCeEEEeeccccCC
Confidence 444444454454555555777743421 122333444322578999987 54 56877766666 899999999742
Q ss_pred -------CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCC-cEEEEEcC----------------------
Q 039705 172 -------FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDG-NLFIFAND---------------------- 220 (539)
Q Consensus 172 -------~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G-~Iyv~Gg~---------------------- 220 (539)
.++-+| |.++ +|..++....+. +--+..+.+++ +||++||.
T Consensus 106 ~~~~~~~nd~Y~y~p~~n--sW~kl~t~sP~g------l~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~ 177 (381)
T COG3055 106 SSSPQVFNDAYRYDPSTN--SWHKLDTRSPTG------LVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAV 177 (381)
T ss_pred CCCceEeeeeEEecCCCC--hhheeccccccc------cccceeEecCCceEEEEccccHHhhhhhHHhhhhhcccHHHH
Confidence 134567 9988 996554322221 11133455666 99999981
Q ss_pred -------------------ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCc
Q 039705 221 -------------------RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEA 281 (539)
Q Consensus 221 -------------------~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~ 281 (539)
.+..|||.++.|. .+-..|. ++.+|++++. -+++|.++-|.-.
T Consensus 178 ~~i~~~yf~~~~~dy~~n~ev~sy~p~~n~W~-~~G~~pf----~~~aGsa~~~-----------~~n~~~lInGEiK-- 239 (381)
T COG3055 178 DKIIAHYFDKKAEDYFFNKEVLSYDPSTNQWR-NLGENPF----YGNAGSAVVI-----------KGNKLTLINGEIK-- 239 (381)
T ss_pred HHHHHHHhCCCHHHhcccccccccccccchhh-hcCcCcc----cCccCcceee-----------cCCeEEEEcceec--
Confidence 1568999999997 5544443 5667877753 2667888877531
Q ss_pred ccccCCCcccccCCce--EEEEeeCCCCceeee-ccCCCceec------eeEEecCCcEEEEcCcCCCCC------Cccc
Q 039705 282 GVLAGKGEFMNALQDC--GRIEITNKSATWQRE-MMPSPRVMG------EMLLLPTGDVLIINGAKKGTA------GWNF 346 (539)
Q Consensus 282 ~~~~~~~~~~~a~~s~--~~~d~~~~~~~W~~~-~M~~~R~~~------~~vvlpdG~I~vvGG~~~g~~------g~~~ 346 (539)
|.+++. .+++.....-+|... ++|.+-... +.---.+|.++|.||+..-.+ |.-.
T Consensus 240 ----------pGLRt~~~k~~~~~~~~~~w~~l~~lp~~~~~~~eGvAGaf~G~s~~~~lv~GGAnF~Ga~~~y~~Gk~~ 309 (381)
T COG3055 240 ----------PGLRTAEVKQADFGGDNLKWLKLSDLPAPIGSNKEGVAGAFSGKSNGEVLVAGGANFPGALKAYKNGKFY 309 (381)
T ss_pred ----------CCccccceeEEEeccCceeeeeccCCCCCCCCCccccceeccceeCCeEEEecCCCChhHHHHHHhcccc
Confidence 334444 456766556789986 554432211 111234899999999753110 1111
Q ss_pred CCCCC-----CccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCCCC
Q 039705 347 ATDPN-----TTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPHSR 401 (539)
Q Consensus 347 ~~~~~-----~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~~ 401 (539)
+.+.+ ..+.++| .+ .|+.+..++.++.|-. + ++-.+.||++||....+
T Consensus 310 AH~Gl~K~w~~~Vy~~d--~g---~Wk~~GeLp~~l~YG~-s-~~~nn~vl~IGGE~~~G 362 (381)
T COG3055 310 AHEGLSKSWNSEVYIFD--NG---SWKIVGELPQGLAYGV-S-LSYNNKVLLIGGETSGG 362 (381)
T ss_pred cccchhhhhhceEEEEc--CC---ceeeecccCCCccceE-E-EecCCcEEEEccccCCC
Confidence 22211 1355555 77 9999999999998853 3 34488999999986553
No 29
>KOG1230 consensus Protein containing repeated kelch motifs [General function prediction only]
Probab=99.04 E-value=3.3e-09 Score=107.74 Aligned_cols=192 Identities=13% Similarity=0.160 Sum_probs=132.2
Q ss_pred CCceEEccC---CCcccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEeC
Q 039705 19 KGKWELASE---NSGISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRPL 95 (539)
Q Consensus 19 ~g~W~~~~~---~~~~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~l 95 (539)
..+|..+.. ..+++++++++. +.|.+|++||--..|.+.+ +.+....-+||..+++|+++
T Consensus 107 ~~eWkk~~spn~P~pRsshq~va~-~s~~l~~fGGEfaSPnq~q----------------F~HYkD~W~fd~~trkweql 169 (521)
T KOG1230|consen 107 KNEWKKVVSPNAPPPRSSHQAVAV-PSNILWLFGGEFASPNQEQ----------------FHHYKDLWLFDLKTRKWEQL 169 (521)
T ss_pred ccceeEeccCCCcCCCccceeEEe-ccCeEEEeccccCCcchhh----------------hhhhhheeeeeeccchheee
Confidence 478998843 457788888888 8889999999533333222 12233345799999999999
Q ss_pred ccC--CCcccccceecCCCcEEEecCCCCC------CCeEEEEeCCCCccceeecccc--cccccccceeEEccCCcEEE
Q 039705 96 KIL--TDTWSSSGGLSANGTIVISGGWSSR------GRSVRYLSGCYHACYWKEHHWE--LSAKRWFSTQHILPDGSFIV 165 (539)
Q Consensus 96 ~~~--~~~~c~~~~~l~dG~l~v~GG~~~g------~~~v~~ydP~~~t~~W~~~~~~--m~~~R~y~s~~~L~dG~Vyv 165 (539)
... +..|..+..++.-.+|+++||+.+. .+.+.+||.. +-+|+++.+. -+.+|..+...+.++|.|||
T Consensus 170 ~~~g~PS~RSGHRMvawK~~lilFGGFhd~nr~y~YyNDvy~FdLd--tykW~Klepsga~PtpRSGcq~~vtpqg~i~v 247 (521)
T KOG1230|consen 170 EFGGGPSPRSGHRMVAWKRQLILFGGFHDSNRDYIYYNDVYAFDLD--TYKWSKLEPSGAGPTPRSGCQFSVTPQGGIVV 247 (521)
T ss_pred ccCCCCCCCccceeEEeeeeEEEEcceecCCCceEEeeeeEEEecc--ceeeeeccCCCCCCCCCCcceEEecCCCcEEE
Confidence 764 5667777788889999999998653 4778889998 9999988642 36789888888888999999
Q ss_pred EcCccC-----------CeEEEE---ecCC-Cccee--ec-cCccccCCCCCCCCcceEEEeeCCcEEEEEcC-------
Q 039705 166 VGGRRE-----------FSYEYI---LKEG-KRIIY--DL-PILNETTNPSENNLYPFVFLSTDGNLFIFAND------- 220 (539)
Q Consensus 166 vGG~~~-----------~~~E~y---P~~~-~~~w~--~~-~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~------- 220 (539)
.||+.. .....| |..+ .++|. .+ |.-.. ..++--| .+.+.++++-+.|||-
T Consensus 248 yGGYsK~~~kK~~dKG~~hsDmf~L~p~~~~~dKw~W~kvkp~g~k---PspRsgf-sv~va~n~kal~FGGV~D~eeee 323 (521)
T KOG1230|consen 248 YGGYSKQRVKKDVDKGTRHSDMFLLKPEDGREDKWVWTKVKPSGVK---PSPRSGF-SVAVAKNHKALFFGGVCDLEEEE 323 (521)
T ss_pred EcchhHhhhhhhhhcCceeeeeeeecCCcCCCcceeEeeccCCCCC---CCCCCce-eEEEecCCceEEecceecccccc
Confidence 999742 112333 5543 23453 22 21111 1122223 2355679999999992
Q ss_pred ---------ceeEeeCCCCeEE
Q 039705 221 ---------RSILLNPETNEIL 233 (539)
Q Consensus 221 ---------~~e~yDp~tn~W~ 233 (539)
+...||...|+|.
T Consensus 324 Esl~g~F~NDLy~fdlt~nrW~ 345 (521)
T KOG1230|consen 324 ESLSGEFFNDLYFFDLTRNRWS 345 (521)
T ss_pred hhhhhhhhhhhhheecccchhh
Confidence 2346899999997
No 30
>KOG4152 consensus Host cell transcription factor HCFC1 [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.00 E-value=6.1e-09 Score=108.01 Aligned_cols=225 Identities=15% Similarity=0.218 Sum_probs=142.4
Q ss_pred ceeeccc---ccccccccceeEEccCCcEEEEcCccC---CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEee
Q 039705 138 YWKEHHW---ELSAKRWFSTQHILPDGSFIVVGGRRE---FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLST 210 (539)
Q Consensus 138 ~W~~~~~---~m~~~R~y~s~~~L~dG~VyvvGG~~~---~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~ 210 (539)
.|+.+.. ..+.+|..|-++++ ...|.|+||-+. ....+| ..++ +|... ..++..+-..-.|.+++.
T Consensus 18 rWrrV~~~tGPvPrpRHGHRAVai-kELiviFGGGNEGiiDELHvYNTatn--qWf~P----avrGDiPpgcAA~Gfvcd 90 (830)
T KOG4152|consen 18 RWRRVQQSTGPVPRPRHGHRAVAI-KELIVIFGGGNEGIIDELHVYNTATN--QWFAP----AVRGDIPPGCAAFGFVCD 90 (830)
T ss_pred ceEEEecccCCCCCccccchheee-eeeEEEecCCcccchhhhhhhccccc--eeecc----hhcCCCCCchhhcceEec
Confidence 4665432 25677888888888 577888898775 245578 7777 88521 111111111223557777
Q ss_pred CCcEEEEEc------CceeEeeCCCCeEEE-Eccc-CCC-CCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCc
Q 039705 211 DGNLFIFAN------DRSILLNPETNEILH-VFPI-LRG-GSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEA 281 (539)
Q Consensus 211 ~G~Iyv~Gg------~~~e~yDp~tn~W~~-~~p~-mp~-~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~ 281 (539)
.-+||+||| ++-++|....-+|+. .+.+ .|. +..-+|+-|.+..| ...|-|++||...+.
T Consensus 91 GtrilvFGGMvEYGkYsNdLYELQasRWeWkrlkp~~p~nG~pPCPRlGHSFsl-----------~gnKcYlFGGLaNds 159 (830)
T KOG4152|consen 91 GTRILVFGGMVEYGKYSNDLYELQASRWEWKRLKPKTPKNGPPPCPRLGHSFSL-----------VGNKCYLFGGLANDS 159 (830)
T ss_pred CceEEEEccEeeeccccchHHHhhhhhhhHhhcCCCCCCCCCCCCCccCceeEE-----------eccEeEEeccccccc
Confidence 789999999 244667766665541 2222 121 12234555655433 378999999975322
Q ss_pred ccccCCCccc-ccCCceEEEEeeCCC--Cceeee----ccCCCceeceeEEec-----CCcEEEEcCcCCCCCCcccCCC
Q 039705 282 GVLAGKGEFM-NALQDCGRIEITNKS--ATWQRE----MMPSPRVMGEMLLLP-----TGDVLIINGAKKGTAGWNFATD 349 (539)
Q Consensus 282 ~~~~~~~~~~-~a~~s~~~~d~~~~~--~~W~~~----~M~~~R~~~~~vvlp-----dG~I~vvGG~~~g~~g~~~~~~ 349 (539)
.++ ..+. ..+++...+++.... -.|... .+|.+|..|.+|+.- --|++|.||.. |..
T Consensus 160 -eDp--knNvPrYLnDlY~leL~~Gsgvv~W~ip~t~Gv~P~pRESHTAViY~eKDs~~skmvvyGGM~-G~R------- 228 (830)
T KOG4152|consen 160 -EDP--KNNVPRYLNDLYILELRPGSGVVAWDIPITYGVLPPPRESHTAVIYTEKDSKKSKMVVYGGMS-GCR------- 228 (830)
T ss_pred -cCc--ccccchhhcceEEEEeccCCceEEEecccccCCCCCCcccceeEEEEeccCCcceEEEEcccc-ccc-------
Confidence 111 1122 246777777775322 358862 689999999988761 34899999987 542
Q ss_pred CCCccEEEcCCCCCCCceEec---CCCCCCccceeeEeeCCCCeEEEecCC
Q 039705 350 PNTTPVLYEPDDPINERFSEL---TPTSKPRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 350 ~~~~~e~YdP~t~~g~~Wt~~---a~~~~~R~yhs~a~llpdG~V~v~GG~ 397 (539)
+-.....|-+|- .|++. .-.+.||.-|++.++ ..|.||.||-
T Consensus 229 -LgDLW~Ldl~Tl---~W~kp~~~G~~PlPRSLHsa~~I--GnKMyvfGGW 273 (830)
T KOG4152|consen 229 -LGDLWTLDLDTL---TWNKPSLSGVAPLPRSLHSATTI--GNKMYVFGGW 273 (830)
T ss_pred -ccceeEEeccee---ecccccccCCCCCCcccccceee--cceeEEecce
Confidence 235677788888 89763 234568999998877 9999999993
No 31
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium.
Probab=98.96 E-value=1.1e-08 Score=100.47 Aligned_cols=135 Identities=23% Similarity=0.374 Sum_probs=88.4
Q ss_pred eeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEE
Q 039705 222 SILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIE 301 (539)
Q Consensus 222 ~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d 301 (539)
+..||+.++++. .+... . -.+ |++.++|+ +|+++++||...+ .+.+..|+
T Consensus 48 s~~yD~~tn~~r-pl~v~-t--d~F--CSgg~~L~-----------dG~ll~tGG~~~G-------------~~~ir~~~ 97 (243)
T PF07250_consen 48 SVEYDPNTNTFR-PLTVQ-T--DTF--CSGGAFLP-----------DGRLLQTGGDNDG-------------NKAIRIFT 97 (243)
T ss_pred EEEEecCCCcEE-eccCC-C--CCc--ccCcCCCC-----------CCCEEEeCCCCcc-------------ccceEEEe
Confidence 568999999987 55322 1 123 44455664 8999999997522 23455577
Q ss_pred eeC--CCCceeee--ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCC--CceEecCCC--
Q 039705 302 ITN--KSATWQRE--MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPIN--ERFSELTPT-- 373 (539)
Q Consensus 302 ~~~--~~~~W~~~--~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g--~~Wt~~a~~-- 373 (539)
|.. ....|... .|..+|.+++++.|+||+|+|+||... .+.|.|.++.... ..|..+...
T Consensus 98 p~~~~~~~~w~e~~~~m~~~RWYpT~~~L~DG~vlIvGG~~~------------~t~E~~P~~~~~~~~~~~~~l~~~~~ 165 (243)
T PF07250_consen 98 PCTSDGTCDWTESPNDMQSGRWYPTATTLPDGRVLIVGGSNN------------PTYEFWPPKGPGPGPVTLPFLSQTSD 165 (243)
T ss_pred cCCCCCCCCceECcccccCCCccccceECCCCCEEEEeCcCC------------CcccccCCccCCCCceeeecchhhhc
Confidence 653 23579875 699999999999999999999999762 2457666543211 123323221
Q ss_pred CCCccceeeEeeCCCCeEEEecCCC
Q 039705 374 SKPRMCHSTSVVLPDGKILVAGSNP 398 (539)
Q Consensus 374 ~~~R~yhs~a~llpdG~V~v~GG~~ 398 (539)
..+....=-..|||||+||+.+...
T Consensus 166 ~~~~nlYP~~~llPdG~lFi~an~~ 190 (243)
T PF07250_consen 166 TLPNNLYPFVHLLPDGNLFIFANRG 190 (243)
T ss_pred cCccccCceEEEcCCCCEEEEEcCC
Confidence 2233333345678999999998863
No 32
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.95 E-value=4.9e-08 Score=98.15 Aligned_cols=229 Identities=14% Similarity=0.128 Sum_probs=135.0
Q ss_pred ecccccCCCceEEccC-CCcccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCC
Q 039705 12 PLTLYEFKGKWELASE-NSGISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESA 90 (539)
Q Consensus 12 ~~~~~~~~g~W~~~~~-~~~~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~ 90 (539)
+++.....-.|+.++. .-+-+-..+... .++|+|++++.....+ +.-+ ....+.+|||.+|
T Consensus 62 ~ldL~~~~k~W~~~a~FpG~~rnqa~~a~-~~~kLyvFgG~Gk~~~------~~~~-----------~~nd~Y~y~p~~n 123 (381)
T COG3055 62 VLDLKKPGKGWTKIADFPGGARNQAVAAV-IGGKLYVFGGYGKSVS------SSPQ-----------VFNDAYRYDPSTN 123 (381)
T ss_pred ehhhhcCCCCceEcccCCCcccccchhee-eCCeEEEeeccccCCC------CCce-----------EeeeeEEecCCCC
Confidence 3344344456998863 112222333333 5899999999753221 1001 2456789999999
Q ss_pred CEEeCccCC-CcccccceecCCC-cEEEecCCC-----------------------------C-------CCCeEEEEeC
Q 039705 91 AIRPLKILT-DTWSSSGGLSANG-TIVISGGWS-----------------------------S-------RGRSVRYLSG 132 (539)
Q Consensus 91 ~w~~l~~~~-~~~c~~~~~l~dG-~l~v~GG~~-----------------------------~-------g~~~v~~ydP 132 (539)
+|..+.... ...-.+.++.+++ +|+++||.+ + -.+.+-.|||
T Consensus 124 sW~kl~t~sP~gl~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~~~i~~~yf~~~~~dy~~n~ev~sy~p 203 (381)
T COG3055 124 SWHKLDTRSPTGLVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAVDKIIAHYFDKKAEDYFFNKEVLSYDP 203 (381)
T ss_pred hhheeccccccccccceeEecCCceEEEEccccHHhhhhhHHhhhhhcccHHHHHHHHHHHhCCCHHHhccccccccccc
Confidence 999987653 2344444556666 999999941 0 1356778999
Q ss_pred CCCccceeecccccc-cccccceeEEccCCcEEEEcCccC---CeEEEE---ecCCCcceeeccCccccCCCCCCCCcce
Q 039705 133 CYHACYWKEHHWELS-AKRWFSTQHILPDGSFIVVGGRRE---FSYEYI---LKEGKRIIYDLPILNETTNPSENNLYPF 205 (539)
Q Consensus 133 ~~~t~~W~~~~~~m~-~~R~y~s~~~L~dG~VyvvGG~~~---~~~E~y---P~~~~~~w~~~~~l~~~~~~~~~~~yp~ 205 (539)
+ +++|+.+.. .+ .++.. ++++..++++.+|-|.-- .+.|.+ =..++.+|..++.++.........+--+
T Consensus 204 ~--~n~W~~~G~-~pf~~~aG-sa~~~~~n~~~lInGEiKpGLRt~~~k~~~~~~~~~~w~~l~~lp~~~~~~~eGvAGa 279 (381)
T COG3055 204 S--TNQWRNLGE-NPFYGNAG-SAVVIKGNKLTLINGEIKPGLRTAEVKQADFGGDNLKWLKLSDLPAPIGSNKEGVAGA 279 (381)
T ss_pred c--cchhhhcCc-CcccCccC-cceeecCCeEEEEcceecCCccccceeEEEeccCceeeeeccCCCCCCCCCcccccee
Confidence 9 999998764 44 45654 455565788999988632 233333 1122457865433322111100000000
Q ss_pred EEEeeCCcEEEEEcC------------------------ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCC
Q 039705 206 VFLSTDGNLFIFAND------------------------RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQD 261 (539)
Q Consensus 206 ~~~~~~G~Iyv~Gg~------------------------~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~ 261 (539)
..--.+|.+.+.||. +.++|-...+.|. .+..||. .+.| |+++.
T Consensus 280 f~G~s~~~~lv~GGAnF~Ga~~~y~~Gk~~AH~Gl~K~w~~~Vy~~d~g~Wk-~~GeLp~-~l~Y---G~s~~------- 347 (381)
T COG3055 280 FSGKSNGEVLVAGGANFPGALKAYKNGKFYAHEGLSKSWNSEVYIFDNGSWK-IVGELPQ-GLAY---GVSLS------- 347 (381)
T ss_pred ccceeCCeEEEecCCCChhHHHHHHhcccccccchhhhhhceEEEEcCCcee-eecccCC-Cccc---eEEEe-------
Confidence 011246777777762 1345555599997 7888988 5666 55543
Q ss_pred CCCCCcccEEEEecCCCC
Q 039705 262 PNSNAIRAEVLICGGAKP 279 (539)
Q Consensus 262 ~~~~~~~g~Iyv~GG~~~ 279 (539)
.+++||++||.+.
T Consensus 348 -----~nn~vl~IGGE~~ 360 (381)
T COG3055 348 -----YNNKVLLIGGETS 360 (381)
T ss_pred -----cCCcEEEEccccC
Confidence 3789999999764
No 33
>PF13964 Kelch_6: Kelch motif
Probab=98.86 E-value=4.9e-09 Score=77.37 Aligned_cols=50 Identities=20% Similarity=0.480 Sum_probs=41.7
Q ss_pred CceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCc
Q 039705 317 PRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPR 377 (539)
Q Consensus 317 ~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R 377 (539)
+|..+++|++ +++|||+||.... .....++|+|||+++ +|+.+++|+.||
T Consensus 1 pR~~~s~v~~-~~~iyv~GG~~~~-------~~~~~~v~~yd~~t~---~W~~~~~mp~pR 50 (50)
T PF13964_consen 1 PRYGHSAVVV-GGKIYVFGGYDNS-------GKYSNDVERYDPETN---TWEQLPPMPTPR 50 (50)
T ss_pred CCccCEEEEE-CCEEEEECCCCCC-------CCccccEEEEcCCCC---cEEECCCCCCCC
Confidence 5888887665 9999999998731 134558999999999 999999999998
No 34
>smart00612 Kelch Kelch domain.
Probab=98.49 E-value=1.8e-07 Score=67.28 Aligned_cols=45 Identities=20% Similarity=0.532 Sum_probs=37.9
Q ss_pred cEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEee
Q 039705 330 DVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVV 385 (539)
Q Consensus 330 ~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~l 385 (539)
+|||+||.. +. ....++|+|||+++ +|+.+++|+.+|.+|+++++
T Consensus 1 ~iyv~GG~~-~~-------~~~~~v~~yd~~~~---~W~~~~~~~~~r~~~~~~~~ 45 (47)
T smart00612 1 KIYVVGGFD-GG-------QRLKSVEVYDPETN---KWTPLPSMPTPRSGHGVAVI 45 (47)
T ss_pred CEEEEeCCC-CC-------ceeeeEEEECCCCC---eEccCCCCCCccccceEEEe
Confidence 589999975 21 23457999999999 99999999999999998776
No 35
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin, and in galactose oxidase from the fungus Dactylium dendroides. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=98.47 E-value=7.9e-08 Score=69.72 Aligned_cols=47 Identities=21% Similarity=0.526 Sum_probs=37.7
Q ss_pred CceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCC
Q 039705 317 PRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTS 374 (539)
Q Consensus 317 ~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~ 374 (539)
+|.+++++++ +++|||+||... . .....++|+|||+++ +|+.+++|+
T Consensus 1 pR~~~~~~~~-~~~iyv~GG~~~-~------~~~~~~v~~yd~~~~---~W~~~~~mp 47 (47)
T PF01344_consen 1 PRSGHAAVVV-GNKIYVIGGYDG-N------NQPTNSVEVYDPETN---TWEELPPMP 47 (47)
T ss_dssp -BBSEEEEEE-TTEEEEEEEBES-T------SSBEEEEEEEETTTT---EEEEEEEES
T ss_pred CCccCEEEEE-CCEEEEEeeecc-c------CceeeeEEEEeCCCC---EEEEcCCCC
Confidence 5888887666 999999999873 1 234558999999999 999999885
No 36
>PF13964 Kelch_6: Kelch motif
Probab=98.44 E-value=2.8e-07 Score=67.89 Aligned_cols=46 Identities=15% Similarity=0.282 Sum_probs=40.5
Q ss_pred ccccceecCCCcEEEecCCCC---CCCeEEEEeCCCCccceeeccccccccc
Q 039705 102 WSSSGGLSANGTIVISGGWSS---RGRSVRYLSGCYHACYWKEHHWELSAKR 150 (539)
Q Consensus 102 ~c~~~~~l~dG~l~v~GG~~~---g~~~v~~ydP~~~t~~W~~~~~~m~~~R 150 (539)
+|.++++..+++|||+||..+ ..+++++|||. +++|+.+++ |+.+|
T Consensus 2 R~~~s~v~~~~~iyv~GG~~~~~~~~~~v~~yd~~--t~~W~~~~~-mp~pR 50 (50)
T PF13964_consen 2 RYGHSAVVVGGKIYVFGGYDNSGKYSNDVERYDPE--TNTWEQLPP-MPTPR 50 (50)
T ss_pred CccCEEEEECCEEEEECCCCCCCCccccEEEEcCC--CCcEEECCC-CCCCC
Confidence 567778889999999999865 26899999999 999999997 99887
No 37
>smart00612 Kelch Kelch domain.
Probab=98.11 E-value=3.9e-06 Score=60.18 Aligned_cols=45 Identities=20% Similarity=0.307 Sum_probs=38.4
Q ss_pred cEEEecCCCC--CCCeEEEEeCCCCccceeecccccccccccceeEEccCC
Q 039705 113 TIVISGGWSS--RGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDG 161 (539)
Q Consensus 113 ~l~v~GG~~~--g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG 161 (539)
+||++||... ..+++++|||. +++|++.++ |+.+|.+++++++ ||
T Consensus 1 ~iyv~GG~~~~~~~~~v~~yd~~--~~~W~~~~~-~~~~r~~~~~~~~-~g 47 (47)
T smart00612 1 KIYVVGGFDGGQRLKSVEVYDPE--TNKWTPLPS-MPTPRSGHGVAVI-NG 47 (47)
T ss_pred CEEEEeCCCCCceeeeEEEECCC--CCeEccCCC-CCCccccceEEEe-CC
Confidence 5899999853 35789999999 999999997 9999999988877 43
No 38
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=98.10 E-value=2.4e-06 Score=62.55 Aligned_cols=48 Identities=10% Similarity=0.295 Sum_probs=29.8
Q ss_pred CceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCC
Q 039705 317 PRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTS 374 (539)
Q Consensus 317 ~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~ 374 (539)
+|..|+++.+.+++|||+||.+. . ...+..+++||++++ +|+.+++||
T Consensus 1 pR~~h~~~~~~~~~i~v~GG~~~-~------~~~~~d~~~~d~~~~---~W~~~~~~P 48 (49)
T PF13418_consen 1 PRYGHSAVSIGDNSIYVFGGRDS-S------GSPLNDLWIFDIETN---TWTRLPSMP 48 (49)
T ss_dssp --BS-EEEEE-TTEEEEE--EEE--------TEE---EEEEETTTT---EEEE--SS-
T ss_pred CcceEEEEEEeCCeEEEECCCCC-C------CcccCCEEEEECCCC---EEEECCCCC
Confidence 68999988887899999999873 1 123447899999999 999998776
No 39
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin, and in galactose oxidase from the fungus Dactylium dendroides. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=98.07 E-value=7.8e-06 Score=59.91 Aligned_cols=49 Identities=14% Similarity=0.394 Sum_probs=35.8
Q ss_pred CceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCC
Q 039705 317 PRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTS 374 (539)
Q Consensus 317 ~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~ 374 (539)
+|..|.++ ++|+||||+||...+. .......+++||++++ +|+.+++|+
T Consensus 1 ~r~~hs~~-~~~~kiyv~GG~~~~~-----~~~~~~~v~~~d~~t~---~W~~~~~~g 49 (49)
T PF07646_consen 1 PRYGHSAV-VLDGKIYVFGGYGTDN-----GGSSSNDVWVFDTETN---QWTELSPMG 49 (49)
T ss_pred CccceEEE-EECCEEEEECCcccCC-----CCcccceeEEEECCCC---EEeecCCCC
Confidence 57777765 5599999999991111 1122347999999999 999998874
No 40
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=98.02 E-value=1.1e-05 Score=59.19 Aligned_cols=48 Identities=13% Similarity=0.275 Sum_probs=39.0
Q ss_pred CCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEe
Q 039705 328 TGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSV 384 (539)
Q Consensus 328 dG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~ 384 (539)
+++|||+||..... .....++.+||++++ +|+++++++.+|..|++++
T Consensus 1 g~~~~vfGG~~~~~------~~~~nd~~~~~~~~~---~W~~~~~~P~~R~~h~~~~ 48 (49)
T PF13415_consen 1 GNKLYVFGGYDDDG------GTRLNDVWVFDLDTN---TWTRIGDLPPPRSGHTATV 48 (49)
T ss_pred CCEEEEECCcCCCC------CCEecCEEEEECCCC---EEEECCCCCCCccceEEEE
Confidence 57899999987211 123447899999999 9999999999999998765
No 41
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin, and in galactose oxidase from the fungus Dactylium dendroides. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=97.92 E-value=5.4e-06 Score=59.95 Aligned_cols=43 Identities=19% Similarity=0.282 Sum_probs=36.8
Q ss_pred ccccceecCCCcEEEecCCCC---CCCeEEEEeCCCCccceeecccccc
Q 039705 102 WSSSGGLSANGTIVISGGWSS---RGRSVRYLSGCYHACYWKEHHWELS 147 (539)
Q Consensus 102 ~c~~~~~l~dG~l~v~GG~~~---g~~~v~~ydP~~~t~~W~~~~~~m~ 147 (539)
++..+++..+++||++||... ..+++++||+. +++|+++++ |+
T Consensus 2 R~~~~~~~~~~~iyv~GG~~~~~~~~~~v~~yd~~--~~~W~~~~~-mp 47 (47)
T PF01344_consen 2 RSGHAAVVVGNKIYVIGGYDGNNQPTNSVEVYDPE--TNTWEELPP-MP 47 (47)
T ss_dssp BBSEEEEEETTEEEEEEEBESTSSBEEEEEEEETT--TTEEEEEEE-ES
T ss_pred CccCEEEEECCEEEEEeeecccCceeeeEEEEeCC--CCEEEEcCC-CC
Confidence 566778889999999999864 26789999999 999999987 75
No 42
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin, and in galactose oxidase from the fungus Dactylium dendroides. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=97.69 E-value=7.4e-05 Score=54.67 Aligned_cols=37 Identities=19% Similarity=0.377 Sum_probs=31.9
Q ss_pred cceEEEeeCCcEEEEEcC----------ceeEeeCCCCeEEEEcccCC
Q 039705 203 YPFVFLSTDGNLFIFAND----------RSILLNPETNEILHVFPILR 240 (539)
Q Consensus 203 yp~~~~~~~G~Iyv~Gg~----------~~e~yDp~tn~W~~~~p~mp 240 (539)
+.|+++++++|||++||. ++++||+++++|+ .+++|+
T Consensus 3 ~~hs~~~~~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~-~~~~~g 49 (49)
T PF07646_consen 3 YGHSAVVLDGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWT-ELSPMG 49 (49)
T ss_pred cceEEEEECCEEEEECCcccCCCCcccceeEEEECCCCEEe-ecCCCC
Confidence 557889999999999996 3789999999998 787764
No 43
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=97.69 E-value=7.5e-05 Score=54.63 Aligned_cols=44 Identities=16% Similarity=0.155 Sum_probs=37.9
Q ss_pred CCcEEEecCCC-CC---CCeEEEEeCCCCccceeecccccccccccceeEE
Q 039705 111 NGTIVISGGWS-SR---GRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHI 157 (539)
Q Consensus 111 dG~l~v~GG~~-~g---~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~ 157 (539)
+++|||+||.. ++ .+++..||+. +++|+++++ ++.+|..|++++
T Consensus 1 g~~~~vfGG~~~~~~~~~nd~~~~~~~--~~~W~~~~~-~P~~R~~h~~~~ 48 (49)
T PF13415_consen 1 GNKLYVFGGYDDDGGTRLNDVWVFDLD--TNTWTRIGD-LPPPRSGHTATV 48 (49)
T ss_pred CCEEEEECCcCCCCCCEecCEEEEECC--CCEEEECCC-CCCCccceEEEE
Confidence 57899999987 22 6889999999 999999976 999999998875
No 44
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=97.36 E-value=0.095 Score=55.63 Aligned_cols=244 Identities=12% Similarity=0.129 Sum_probs=121.6
Q ss_pred EEEEECCCCC--EEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccc--cccccceeEE
Q 039705 82 AVEYDAESAA--IRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELS--AKRWFSTQHI 157 (539)
Q Consensus 82 ~~~yDp~t~~--w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~--~~R~y~s~~~ 157 (539)
...+|.+|++ |+.-.. ....+ .-++.+++||+..+. ..+..||+.++.-.|+.-.. .+ ..|...+.+
T Consensus 132 l~ald~~tG~~~W~~~~~-~~~~s--sP~v~~~~v~v~~~~----g~l~ald~~tG~~~W~~~~~-~~~~~~~~~~sP~- 202 (394)
T PRK11138 132 VYALNAEDGEVAWQTKVA-GEALS--RPVVSDGLVLVHTSN----GMLQALNESDGAVKWTVNLD-VPSLTLRGESAPA- 202 (394)
T ss_pred EEEEECCCCCCcccccCC-Cceec--CCEEECCEEEEECCC----CEEEEEEccCCCEeeeecCC-CCcccccCCCCCE-
Confidence 5568887764 654321 12222 234557888875432 46889999855567876432 11 112223333
Q ss_pred ccCCcEEEEcCccCCeEEEE-ecCCCcceee-ccCccccCCCCCCCCcceEEEeeCCcEEEEEc-CceeEeeCCCCe--E
Q 039705 158 LPDGSFIVVGGRREFSYEYI-LKEGKRIIYD-LPILNETTNPSENNLYPFVFLSTDGNLFIFAN-DRSILLNPETNE--I 232 (539)
Q Consensus 158 L~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~-~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg-~~~e~yDp~tn~--W 232 (539)
+.+|.||+..+. + .+-.+ +.++...|.. ...................-++.+|.||+.+. ....++|+++++ |
T Consensus 203 v~~~~v~~~~~~-g-~v~a~d~~~G~~~W~~~~~~~~~~~~~~~~~~~~~sP~v~~~~vy~~~~~g~l~ald~~tG~~~W 280 (394)
T PRK11138 203 TAFGGAIVGGDN-G-RVSAVLMEQGQLIWQQRISQPTGATEIDRLVDVDTTPVVVGGVVYALAYNGNLVALDLRSGQIVW 280 (394)
T ss_pred EECCEEEEEcCC-C-EEEEEEccCChhhheeccccCCCccchhcccccCCCcEEECCEEEEEEcCCeEEEEECCCCCEEE
Confidence 336777775442 2 22233 4444335642 11000000000000000112456899998764 357789998865 6
Q ss_pred EEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee
Q 039705 233 LHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE 312 (539)
Q Consensus 233 ~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~ 312 (539)
.+.. .. . ...++ .+++||++.... .+.++|+.+....|+..
T Consensus 281 ~~~~---~~----~--~~~~~-------------~~~~vy~~~~~g-----------------~l~ald~~tG~~~W~~~ 321 (394)
T PRK11138 281 KREY---GS----V--NDFAV-------------DGGRIYLVDQND-----------------RVYALDTRGGVELWSQS 321 (394)
T ss_pred eecC---CC----c--cCcEE-------------ECCEEEEEcCCC-----------------eEEEEECCCCcEEEccc
Confidence 5321 11 0 01111 278999875321 35678876545678764
Q ss_pred ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEE
Q 039705 313 MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKIL 392 (539)
Q Consensus 313 ~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~ 392 (539)
.+. .+.....+ +.+|+||+... + | .+.++|+++.+ ..|+.- ......+.+-++. ||+||
T Consensus 322 ~~~-~~~~~sp~-v~~g~l~v~~~-~-G------------~l~~ld~~tG~-~~~~~~--~~~~~~~s~P~~~--~~~l~ 380 (394)
T PRK11138 322 DLL-HRLLTAPV-LYNGYLVVGDS-E-G------------YLHWINREDGR-FVAQQK--VDSSGFLSEPVVA--DDKLL 380 (394)
T ss_pred ccC-CCcccCCE-EECCEEEEEeC-C-C------------EEEEEECCCCC-EEEEEE--cCCCcceeCCEEE--CCEEE
Confidence 322 23333333 44999987432 2 2 46778988862 135531 1112334444444 89999
Q ss_pred EecC
Q 039705 393 VAGS 396 (539)
Q Consensus 393 v~GG 396 (539)
|..-
T Consensus 381 v~t~ 384 (394)
T PRK11138 381 IQAR 384 (394)
T ss_pred EEeC
Confidence 8743
No 45
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=97.31 E-value=0.00044 Score=50.40 Aligned_cols=35 Identities=20% Similarity=0.375 Sum_probs=23.2
Q ss_pred eEEEee-CCcEEEEEcC--------ceeEeeCCCCeEEEEcccCC
Q 039705 205 FVFLST-DGNLFIFAND--------RSILLNPETNEILHVFPILR 240 (539)
Q Consensus 205 ~~~~~~-~G~Iyv~Gg~--------~~e~yDp~tn~W~~~~p~mp 240 (539)
|+.+.. +++||++||. ++++||+++++|+ .+++||
T Consensus 5 h~~~~~~~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~-~~~~~P 48 (49)
T PF13418_consen 5 HSAVSIGDNSIYVFGGRDSSGSPLNDLWIFDIETNTWT-RLPSMP 48 (49)
T ss_dssp -EEEEE-TTEEEEE--EEE-TEE---EEEEETTTTEEE-E--SS-
T ss_pred EEEEEEeCCeEEEECCCCCCCcccCCEEEEECCCCEEE-ECCCCC
Confidence 445544 7999999993 5899999999998 688887
No 46
>PLN02772 guanylate kinase
Probab=97.26 E-value=0.00076 Score=70.62 Aligned_cols=70 Identities=19% Similarity=0.258 Sum_probs=53.7
Q ss_pred CceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEe---cCCCCCCccceeeEeeCCCCeEEE
Q 039705 317 PRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSE---LTPTSKPRMCHSTSVVLPDGKILV 393 (539)
Q Consensus 317 ~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~---~a~~~~~R~yhs~a~llpdG~V~v 393 (539)
+|..+.+++. ++++||+||.+... .....+.+||+.|. +|+. ...-|.+|-.||++ ++-|.||||
T Consensus 24 ~~~~~tav~i-gdk~yv~GG~~d~~-------~~~~~v~i~D~~t~---~W~~P~V~G~~P~~r~GhSa~-v~~~~rilv 91 (398)
T PLN02772 24 PKNRETSVTI-GDKTYVIGGNHEGN-------TLSIGVQILDKITN---NWVSPIVLGTGPKPCKGYSAV-VLNKDRILV 91 (398)
T ss_pred CCCcceeEEE-CCEEEEEcccCCCc-------cccceEEEEECCCC---cEecccccCCCCCCCCcceEE-EECCceEEE
Confidence 6677776655 99999999976311 12236899999999 9985 46788899999855 557999999
Q ss_pred ecCCC
Q 039705 394 AGSNP 398 (539)
Q Consensus 394 ~GG~~ 398 (539)
.+++.
T Consensus 92 ~~~~~ 96 (398)
T PLN02772 92 IKKGS 96 (398)
T ss_pred EeCCC
Confidence 98753
No 47
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=97.15 E-value=0.47 Score=50.29 Aligned_cols=219 Identities=14% Similarity=0.178 Sum_probs=110.2
Q ss_pred cceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCc
Q 039705 105 SGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKR 183 (539)
Q Consensus 105 ~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~ 183 (539)
++.+..+++||+.+. ...+..+|..++.-.|+.- +...- +.+ -++.+++||+..+. ..+..+ ++++..
T Consensus 114 ~~~~v~~~~v~v~~~----~g~l~ald~~tG~~~W~~~---~~~~~-~ss-P~v~~~~v~v~~~~--g~l~ald~~tG~~ 182 (394)
T PRK11138 114 GGVTVAGGKVYIGSE----KGQVYALNAEDGEVAWQTK---VAGEA-LSR-PVVSDGLVLVHTSN--GMLQALNESDGAV 182 (394)
T ss_pred cccEEECCEEEEEcC----CCEEEEEECCCCCCccccc---CCCce-ecC-CEEECCEEEEECCC--CEEEEEEccCCCE
Confidence 344556788876543 2468899987556679653 22211 222 23448888875543 234455 555544
Q ss_pred ceeec-c--CccccCCCCCCCCcceEEEeeCCcEEEEEcC-ceeEeeCCCCe--EEEEcccCCCCC----CccCCCccEE
Q 039705 184 IIYDL-P--ILNETTNPSENNLYPFVFLSTDGNLFIFAND-RSILLNPETNE--ILHVFPILRGGS----RNYPASATSA 253 (539)
Q Consensus 184 ~w~~~-~--~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-~~e~yDp~tn~--W~~~~p~mp~~~----r~yp~~g~av 253 (539)
.|... . .+.... . ..-++.+|.+|+..++ ....+|+++++ |...+. .+.+. |......+-+
T Consensus 183 ~W~~~~~~~~~~~~~-------~-~sP~v~~~~v~~~~~~g~v~a~d~~~G~~~W~~~~~-~~~~~~~~~~~~~~~~sP~ 253 (394)
T PRK11138 183 KWTVNLDVPSLTLRG-------E-SAPATAFGGAIVGGDNGRVSAVLMEQGQLIWQQRIS-QPTGATEIDRLVDVDTTPV 253 (394)
T ss_pred eeeecCCCCcccccC-------C-CCCEEECCEEEEEcCCCEEEEEEccCChhhheeccc-cCCCccchhcccccCCCcE
Confidence 57431 1 110000 0 0123346777775543 35668888764 653221 11100 0000001111
Q ss_pred ecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeeeccCCCceeceeEEecCCcEEE
Q 039705 254 LLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQREMMPSPRVMGEMLLLPTGDVLI 333 (539)
Q Consensus 254 ~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~~M~~~R~~~~~vvlpdG~I~v 333 (539)
+ .++.||+++. + ..+.++|+.+....|+.. ....+ . .++.+|+||+
T Consensus 254 v------------~~~~vy~~~~-~----------------g~l~ald~~tG~~~W~~~-~~~~~---~-~~~~~~~vy~ 299 (394)
T PRK11138 254 V------------VGGVVYALAY-N----------------GNLVALDLRSGQIVWKRE-YGSVN---D-FAVDGGRIYL 299 (394)
T ss_pred E------------ECCEEEEEEc-C----------------CeEEEEECCCCCEEEeec-CCCcc---C-cEEECCEEEE
Confidence 1 2678887653 2 135678876545678864 11111 2 2345899998
Q ss_pred EcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecC
Q 039705 334 INGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGS 396 (539)
Q Consensus 334 vGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG 396 (539)
..... .+.++|+++.. ..|+.-. ...+...+.++. +|+||+...
T Consensus 300 ~~~~g--------------~l~ald~~tG~-~~W~~~~--~~~~~~~sp~v~--~g~l~v~~~ 343 (394)
T PRK11138 300 VDQND--------------RVYALDTRGGV-ELWSQSD--LLHRLLTAPVLY--NGYLVVGDS 343 (394)
T ss_pred EcCCC--------------eEEEEECCCCc-EEEcccc--cCCCcccCCEEE--CCEEEEEeC
Confidence 65321 46788887752 2586421 112333333443 899988643
No 48
>PLN02772 guanylate kinase
Probab=96.62 E-value=0.0047 Score=64.78 Aligned_cols=68 Identities=10% Similarity=0.055 Sum_probs=54.1
Q ss_pred cccccceecCCCcEEEecCCCCC---CCeEEEEeCCCCccceeecc--cccccccccceeEEccCCcEEEEcCcc
Q 039705 101 TWSSSGGLSANGTIVISGGWSSR---GRSVRYLSGCYHACYWKEHH--WELSAKRWFSTQHILPDGSFIVVGGRR 170 (539)
Q Consensus 101 ~~c~~~~~l~dG~l~v~GG~~~g---~~~v~~ydP~~~t~~W~~~~--~~m~~~R~y~s~~~L~dG~VyvvGG~~ 170 (539)
..|...++..+.++|++||.++. ...+.+||+. +.+|.... ..-+.+|-.|+++++.|++|+|+++-.
T Consensus 24 ~~~~~tav~igdk~yv~GG~~d~~~~~~~v~i~D~~--t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~~~~ 96 (398)
T PLN02772 24 PKNRETSVTIGDKTYVIGGNHEGNTLSIGVQILDKI--TNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIKKGS 96 (398)
T ss_pred CCCcceeEEECCEEEEEcccCCCccccceEEEEECC--CCcEecccccCCCCCCCCcceEEEECCceEEEEeCCC
Confidence 44555667789999999998763 3578999999 99998643 135678999999999999999998643
No 49
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=96.58 E-value=0.81 Score=45.76 Aligned_cols=246 Identities=17% Similarity=0.256 Sum_probs=129.6
Q ss_pred EEEEECCC-CCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCcccee---ecccccccccccceeEE
Q 039705 82 AVEYDAES-AAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWK---EHHWELSAKRWFSTQHI 157 (539)
Q Consensus 82 ~~~yDp~t-~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~---~~~~~m~~~R~y~s~~~ 157 (539)
.-+||..| |+-..++ ++..|-=..+..+.|..++.||.+ +.+.+|+-. +.+=. .+..++...+.|-+++.
T Consensus 79 lIvWDs~TtnK~haip-l~s~WVMtCA~sPSg~~VAcGGLd---N~Csiy~ls--~~d~~g~~~v~r~l~gHtgylScC~ 152 (343)
T KOG0286|consen 79 LIVWDSFTTNKVHAIP-LPSSWVMTCAYSPSGNFVACGGLD---NKCSIYPLS--TRDAEGNVRVSRELAGHTGYLSCCR 152 (343)
T ss_pred EEEEEcccccceeEEe-cCceeEEEEEECCCCCeEEecCcC---ceeEEEecc--cccccccceeeeeecCccceeEEEE
Confidence 34677643 4443333 222232223456999999999974 567888875 22111 22223556678888887
Q ss_pred ccC-CcEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEee-CCcEEEEEcC--ceeEeeCCCCeE
Q 039705 158 LPD-GSFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLST-DGNLFIFAND--RSILLNPETNEI 232 (539)
Q Consensus 158 L~d-G~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~-~G~Iyv~Gg~--~~e~yDp~tn~W 232 (539)
..| +.|+.-.|- .+.-.| -.+. +- ...+.-.+.|. -...+.| +++.|+.|+. .+.+||.+...-
T Consensus 153 f~dD~~ilT~SGD--~TCalWDie~g--~~-~~~f~GH~gDV------~slsl~p~~~ntFvSg~cD~~aklWD~R~~~c 221 (343)
T KOG0286|consen 153 FLDDNHILTGSGD--MTCALWDIETG--QQ-TQVFHGHTGDV------MSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQC 221 (343)
T ss_pred EcCCCceEecCCC--ceEEEEEcccc--eE-EEEecCCcccE------EEEecCCCCCCeEEecccccceeeeeccCcce
Confidence 554 555554443 344455 2222 11 11111111110 0123345 8999999995 477889888765
Q ss_pred EEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCC--Ccee
Q 039705 233 LHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKS--ATWQ 310 (539)
Q Consensus 233 ~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~--~~W~ 310 (539)
.+.+ ++..-. -.+...+| +|.-++.|--+ .+|-.||+.... ..++
T Consensus 222 ~qtF---~ghesD---INsv~ffP-----------~G~afatGSDD----------------~tcRlyDlRaD~~~a~ys 268 (343)
T KOG0286|consen 222 VQTF---EGHESD---INSVRFFP-----------SGDAFATGSDD----------------ATCRLYDLRADQELAVYS 268 (343)
T ss_pred eEee---cccccc---cceEEEcc-----------CCCeeeecCCC----------------ceeEEEeecCCcEEeeec
Confidence 4333 321000 01233443 45555655422 257778887311 1122
Q ss_pred eeccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCe
Q 039705 311 REMMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGK 390 (539)
Q Consensus 311 ~~~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~ 390 (539)
.++...+-...+ - -.-|+++..|..+ +++++||--+. ++=..+. -..-|. |+.-+.|||.
T Consensus 269 ~~~~~~gitSv~-F-S~SGRlLfagy~d-------------~~c~vWDtlk~--e~vg~L~-GHeNRv--Scl~~s~DG~ 328 (343)
T KOG0286|consen 269 HDSIICGITSVA-F-SKSGRLLFAGYDD-------------FTCNVWDTLKG--ERVGVLA-GHENRV--SCLGVSPDGM 328 (343)
T ss_pred cCcccCCceeEE-E-cccccEEEeeecC-------------CceeEeecccc--ceEEEee-ccCCee--EEEEECCCCc
Confidence 222333322222 2 2379999888544 26789997665 2333443 334453 4555679999
Q ss_pred EEEecCC
Q 039705 391 ILVAGSN 397 (539)
Q Consensus 391 V~v~GG~ 397 (539)
-+..|+=
T Consensus 329 av~TgSW 335 (343)
T KOG0286|consen 329 AVATGSW 335 (343)
T ss_pred EEEecch
Confidence 9998874
No 50
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=96.53 E-value=1.3 Score=46.37 Aligned_cols=247 Identities=14% Similarity=0.138 Sum_probs=116.2
Q ss_pred eEEEEECCCCC--EEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccc-cccccceeEE
Q 039705 81 LAVEYDAESAA--IRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELS-AKRWFSTQHI 157 (539)
Q Consensus 81 ~~~~yDp~t~~--w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~-~~R~y~s~~~ 157 (539)
....+|+++++ |+.-.. ....+ ...+.++++|+..+ ...+..+|+.++...|+.....-. ..+...+.++
T Consensus 116 ~l~ald~~tG~~~W~~~~~-~~~~~--~p~v~~~~v~v~~~----~g~l~a~d~~tG~~~W~~~~~~~~~~~~~~~sp~~ 188 (377)
T TIGR03300 116 EVIALDAEDGKELWRAKLS-SEVLS--PPLVANGLVVVRTN----DGRLTALDAATGERLWTYSRVTPALTLRGSASPVI 188 (377)
T ss_pred EEEEEECCCCcEeeeeccC-ceeec--CCEEECCEEEEECC----CCeEEEEEcCCCceeeEEccCCCceeecCCCCCEE
Confidence 45668887765 654321 11122 22345677776543 245889998744556865322000 1132333344
Q ss_pred ccCCcEEEEcCccCCeEEEE-ecCCCcceee-ccCccccCCCCCCCCcceEEEeeCCcEEEEEc-CceeEeeCCCCe--E
Q 039705 158 LPDGSFIVVGGRREFSYEYI-LKEGKRIIYD-LPILNETTNPSENNLYPFVFLSTDGNLFIFAN-DRSILLNPETNE--I 232 (539)
Q Consensus 158 L~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~-~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg-~~~e~yDp~tn~--W 232 (539)
. ++.+|+ |..++ .+-.+ ++++...|.. ........+...........++.++.||+... ....+||+++++ |
T Consensus 189 ~-~~~v~~-~~~~g-~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~~~~~vy~~~~~g~l~a~d~~tG~~~W 265 (377)
T TIGR03300 189 A-DGGVLV-GFAGG-KLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVVDGGQVYAVSYQGRVAALDLRSGRVLW 265 (377)
T ss_pred E-CCEEEE-ECCCC-EEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEEECCEEEEEEcCCEEEEEECCCCcEEE
Confidence 3 665554 43332 23333 4444334632 11100000000000000112345788888764 357789998765 5
Q ss_pred EEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee
Q 039705 233 LHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE 312 (539)
Q Consensus 233 ~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~ 312 (539)
... .. .+ ...+ + .+++||+.... ..+.++|..+....|+..
T Consensus 266 ~~~---~~----~~--~~p~-~------------~~~~vyv~~~~-----------------G~l~~~d~~tG~~~W~~~ 306 (377)
T TIGR03300 266 KRD---AS----SY--QGPA-V------------DDNRLYVTDAD-----------------GVVVALDRRSGSELWKND 306 (377)
T ss_pred eec---cC----Cc--cCce-E------------eCCEEEEECCC-----------------CeEEEEECCCCcEEEccc
Confidence 421 11 11 1112 2 27889887531 135678876544678865
Q ss_pred ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEE
Q 039705 313 MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKIL 392 (539)
Q Consensus 313 ~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~ 392 (539)
.+.. +.... .++.+++||+. ..+ | .+.++|+++.+ ..|+.- .......-+-++ .|++||
T Consensus 307 ~~~~-~~~ss-p~i~g~~l~~~-~~~-G------------~l~~~d~~tG~-~~~~~~--~~~~~~~~sp~~--~~~~l~ 365 (377)
T TIGR03300 307 ELKY-RQLTA-PAVVGGYLVVG-DFE-G------------YLHWLSREDGS-FVARLK--TDGSGIASPPVV--VGDGLL 365 (377)
T ss_pred cccC-Ccccc-CEEECCEEEEE-eCC-C------------EEEEEECCCCC-EEEEEE--cCCCccccCCEE--ECCEEE
Confidence 4432 22222 23347777764 322 2 46788887761 134321 111112222233 388988
Q ss_pred EecCC
Q 039705 393 VAGSN 397 (539)
Q Consensus 393 v~GG~ 397 (539)
+.+.+
T Consensus 366 v~~~d 370 (377)
T TIGR03300 366 VQTRD 370 (377)
T ss_pred EEeCC
Confidence 77653
No 51
>PRK13684 Ycf48-like protein; Provisional
Probab=96.51 E-value=1.3 Score=46.02 Aligned_cols=197 Identities=12% Similarity=0.092 Sum_probs=95.8
Q ss_pred ccceeecccccccccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcE
Q 039705 136 ACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNL 214 (539)
Q Consensus 136 t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~I 214 (539)
..+|++.......+........+.++.++++|... .+| ......+|......... ..+ .....++|.+
T Consensus 118 G~tW~~~~~~~~~~~~~~~i~~~~~~~~~~~g~~G----~i~~S~DgG~tW~~~~~~~~g------~~~-~i~~~~~g~~ 186 (334)
T PRK13684 118 GKNWTRIPLSEKLPGSPYLITALGPGTAEMATNVG----AIYRTTDGGKNWEALVEDAAG------VVR-NLRRSPDGKY 186 (334)
T ss_pred CCCCeEccCCcCCCCCceEEEEECCCcceeeeccc----eEEEECCCCCCceeCcCCCcc------eEE-EEEECCCCeE
Confidence 57898874211122222234445456677766432 255 33335588644321110 011 1223356655
Q ss_pred EEEEcCceeEe---eCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCccc
Q 039705 215 FIFANDRSILL---NPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFM 291 (539)
Q Consensus 215 yv~Gg~~~e~y---Dp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~ 291 (539)
+++|.+- .+| |....+|+ .++. +. .+.. .+ .+.. .+++++++|... .
T Consensus 187 v~~g~~G-~i~~s~~~gg~tW~-~~~~-~~-~~~l--~~-i~~~-----------~~g~~~~vg~~G--~---------- 236 (334)
T PRK13684 187 VAVSSRG-NFYSTWEPGQTAWT-PHQR-NS-SRRL--QS-MGFQ-----------PDGNLWMLARGG--Q---------- 236 (334)
T ss_pred EEEeCCc-eEEEEcCCCCCeEE-EeeC-CC-cccc--ee-eeEc-----------CCCCEEEEecCC--E----------
Confidence 5554433 333 44446797 4533 22 1111 11 2222 267888887532 1
Q ss_pred ccCCceEEEEeeCCCCceeeeccCCC---ceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceE
Q 039705 292 NALQDCGRIEITNKSATWQREMMPSP---RVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFS 368 (539)
Q Consensus 292 ~a~~s~~~~d~~~~~~~W~~~~M~~~---R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt 368 (539)
.++.-.+...+|+...++.. ....+.+..++++++++|.. | .+|- ..+.|++|+
T Consensus 237 ------~~~~s~d~G~sW~~~~~~~~~~~~~l~~v~~~~~~~~~~~G~~--G--------------~v~~-S~d~G~tW~ 293 (334)
T PRK13684 237 ------IRFNDPDDLESWSKPIIPEITNGYGYLDLAYRTPGEIWAGGGN--G--------------TLLV-SKDGGKTWE 293 (334)
T ss_pred ------EEEccCCCCCccccccCCccccccceeeEEEcCCCCEEEEcCC--C--------------eEEE-eCCCCCCCe
Confidence 11211233578997544421 12234455678899988763 2 1221 245667999
Q ss_pred ecCC-CCCCccceeeEeeCCCCeEEEecCC
Q 039705 369 ELTP-TSKPRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 369 ~~a~-~~~~R~yhs~a~llpdG~V~v~GG~ 397 (539)
.+.. ...+..+..+ ++..++++|++|..
T Consensus 294 ~~~~~~~~~~~~~~~-~~~~~~~~~~~G~~ 322 (334)
T PRK13684 294 KDPVGEEVPSNFYKI-VFLDPEKGFVLGQR 322 (334)
T ss_pred ECCcCCCCCcceEEE-EEeCCCceEEECCC
Confidence 8753 3334344443 44568899888874
No 52
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.02 E-value=0.57 Score=49.61 Aligned_cols=245 Identities=13% Similarity=0.147 Sum_probs=124.1
Q ss_pred eeEEEEECCCCCEEe-CccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccccccc--ccccceeE
Q 039705 80 ALAVEYDAESAAIRP-LKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSA--KRWFSTQH 156 (539)
Q Consensus 80 ~~~~~yDp~t~~w~~-l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~--~R~y~s~~ 156 (539)
..+.+|+..+..-.. +.-..+.-|+ ..+-.||+++.+|+. .-.+++||-. +..- +-. |.. .+-..+--
T Consensus 48 ~rvqly~~~~~~~~k~~srFk~~v~s-~~fR~DG~LlaaGD~---sG~V~vfD~k--~r~i--LR~-~~ah~apv~~~~f 118 (487)
T KOG0310|consen 48 VRVQLYSSVTRSVRKTFSRFKDVVYS-VDFRSDGRLLAAGDE---SGHVKVFDMK--SRVI--LRQ-LYAHQAPVHVTKF 118 (487)
T ss_pred cEEEEEecchhhhhhhHHhhccceeE-EEeecCCeEEEccCC---cCcEEEeccc--cHHH--HHH-HhhccCceeEEEe
Confidence 345567666544433 2222333343 344569999999975 3468899954 3111 111 211 11100110
Q ss_pred EccCCcEEEEcCccCCeEEEEecCCCcceeeccCccccCCCCCCCCcceEEE--eeCCcEEEEEcCc--eeEeeCCCC-e
Q 039705 157 ILPDGSFIVVGGRREFSYEYILKEGKRIIYDLPILNETTNPSENNLYPFVFL--STDGNLFIFANDR--SILLNPETN-E 231 (539)
Q Consensus 157 ~L~dG~VyvvGG~~~~~~E~yP~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~--~~~G~Iyv~Gg~~--~e~yDp~tn-~ 231 (539)
.-.|+.+++.|+-+ ....+|-... ...... +....| |-.+.. -.++.|++.||++ +.+||.++. .
T Consensus 119 ~~~d~t~l~s~sDd-~v~k~~d~s~--a~v~~~-l~~htD------YVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~~ 188 (487)
T KOG0310|consen 119 SPQDNTMLVSGSDD-KVVKYWDLST--AYVQAE-LSGHTD------YVRCGDISPANDHIVVTGSYDGKVRLWDTRSLTS 188 (487)
T ss_pred cccCCeEEEecCCC-ceEEEEEcCC--cEEEEE-ecCCcc------eeEeeccccCCCeEEEecCCCceEEEEEeccCCc
Confidence 12377888877654 3334442221 111111 222212 333322 2368899999976 778999876 5
Q ss_pred EEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceee
Q 039705 232 ILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQR 311 (539)
Q Consensus 232 W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~ 311 (539)
|...+ ..+.+ .-.+++|.+ +..|-.+||. ++-.+|+.. ..+ ..
T Consensus 189 ~v~el---nhg~p------Ve~vl~lps--------gs~iasAgGn------------------~vkVWDl~~-G~q-ll 231 (487)
T KOG0310|consen 189 RVVEL---NHGCP------VESVLALPS--------GSLIASAGGN------------------SVKVWDLTT-GGQ-LL 231 (487)
T ss_pred eeEEe---cCCCc------eeeEEEcCC--------CCEEEEcCCC------------------eEEEEEecC-Cce-eh
Confidence 55322 22211 222343321 3566666663 245677652 111 11
Q ss_pred eccC-CCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCe
Q 039705 312 EMMP-SPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGK 390 (539)
Q Consensus 312 ~~M~-~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~ 390 (539)
..|. +-..--+....-|++=++.||.+. .+-+|| +. .|+.+-.+..|----|.++ -||++
T Consensus 232 ~~~~~H~KtVTcL~l~s~~~rLlS~sLD~-------------~VKVfd--~t---~~Kvv~s~~~~~pvLsiav-s~dd~ 292 (487)
T KOG0310|consen 232 TSMFNHNKTVTCLRLASDSTRLLSGSLDR-------------HVKVFD--TT---NYKVVHSWKYPGPVLSIAV-SPDDQ 292 (487)
T ss_pred hhhhcccceEEEEEeecCCceEeeccccc-------------ceEEEE--cc---ceEEEEeeecccceeeEEe-cCCCc
Confidence 1222 111111222233667777787762 578999 44 6888776665554456554 58999
Q ss_pred EEEecCCCC
Q 039705 391 ILVAGSNPH 399 (539)
Q Consensus 391 V~v~GG~~~ 399 (539)
.+|+|..+.
T Consensus 293 t~viGmsnG 301 (487)
T KOG0310|consen 293 TVVIGMSNG 301 (487)
T ss_pred eEEEecccc
Confidence 999998643
No 53
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=95.85 E-value=1.4 Score=43.86 Aligned_cols=228 Identities=15% Similarity=0.160 Sum_probs=123.6
Q ss_pred EEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccccccccccc---ceeEEcc
Q 039705 83 VEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWF---STQHILP 159 (539)
Q Consensus 83 ~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y---~s~~~L~ 159 (539)
-..||.|++....+...-..-++.+.-+||...++= ....|.++||+ +..-++.+ |+..+.+ .+++.=.
T Consensus 86 GhLdP~tGev~~ypLg~Ga~Phgiv~gpdg~~Witd----~~~aI~R~dpk--t~evt~f~--lp~~~a~~nlet~vfD~ 157 (353)
T COG4257 86 GHLDPATGEVETYPLGSGASPHGIVVGPDGSAWITD----TGLAIGRLDPK--TLEVTRFP--LPLEHADANLETAVFDP 157 (353)
T ss_pred eecCCCCCceEEEecCCCCCCceEEECCCCCeeEec----CcceeEEecCc--ccceEEee--cccccCCCcccceeeCC
Confidence 357999999988776543333344555677777763 23479999998 55444432 3333333 2344444
Q ss_pred CCcEEEEcCccCCeEEEEecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEE--EcCceeEeeCCCCeEEEEcc
Q 039705 160 DGSFIVVGGRREFSYEYILKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIF--ANDRSILLNPETNEILHVFP 237 (539)
Q Consensus 160 dG~VyvvGG~~~~~~E~yP~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~--Gg~~~e~yDp~tn~W~~~~p 237 (539)
+|.+..+|-..- -.+.-|.++ .-...+..+.- .-| ..++.+||.+|+. .|+-.-+.||.+..=+ .+|
T Consensus 158 ~G~lWFt~q~G~-yGrLdPa~~--~i~vfpaPqG~------gpy-Gi~atpdGsvwyaslagnaiaridp~~~~ae-v~p 226 (353)
T COG4257 158 WGNLWFTGQIGA-YGRLDPARN--VISVFPAPQGG------GPY-GICATPDGSVWYASLAGNAIARIDPFAGHAE-VVP 226 (353)
T ss_pred CccEEEeecccc-ceecCcccC--ceeeeccCCCC------CCc-ceEECCCCcEEEEeccccceEEcccccCCcc-eec
Confidence 688888874311 001113322 11111111010 012 2467799999998 6777788899887543 332
Q ss_pred cCCCC----CCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeeec
Q 039705 238 ILRGG----SRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQREM 313 (539)
Q Consensus 238 ~mp~~----~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~~ 313 (539)
. |.. .|.- .+. --+++.+. + + ...++.+|||.. .+|..-+
T Consensus 227 ~-P~~~~~gsRri---wsd--------------pig~~wit---t---w----------g~g~l~rfdPs~--~sW~eyp 270 (353)
T COG4257 227 Q-PNALKAGSRRI---WSD--------------PIGRAWIT---T---W----------GTGSLHRFDPSV--TSWIEYP 270 (353)
T ss_pred C-CCccccccccc---ccC--------------ccCcEEEe---c---c----------CCceeeEeCccc--ccceeee
Confidence 2 221 2220 111 14666665 1 1 123578999874 6698755
Q ss_pred cC--CCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEee
Q 039705 314 MP--SPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVV 385 (539)
Q Consensus 314 M~--~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~l 385 (539)
+| .+|-+.- -+=--|+|+..- ++ ...+..|||++- +|+.+ +++|-..+...|
T Consensus 271 LPgs~arpys~-rVD~~grVW~se-a~------------agai~rfdpeta---~ftv~---p~pr~n~gn~ql 324 (353)
T COG4257 271 LPGSKARPYSM-RVDRHGRVWLSE-AD------------AGAIGRFDPETA---RFTVL---PIPRPNSGNIQL 324 (353)
T ss_pred CCCCCCCccee-eeccCCcEEeec-cc------------cCceeecCcccc---eEEEe---cCCCCCCCceec
Confidence 54 4565432 222246666521 11 115789999999 99886 456654443333
No 54
>PF13854 Kelch_5: Kelch motif
Probab=95.67 E-value=0.022 Score=40.13 Aligned_cols=41 Identities=17% Similarity=0.348 Sum_probs=29.2
Q ss_pred cCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCC
Q 039705 314 MPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDD 361 (539)
Q Consensus 314 M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t 361 (539)
+|.+|..|+++++ +++|||+||.. +. .......+.+||..+
T Consensus 1 ~P~~R~~hs~~~~-~~~iyi~GG~~-~~-----~~~~~~d~~~l~l~s 41 (42)
T PF13854_consen 1 IPSPRYGHSAVVV-GNNIYIFGGYS-GN-----NNSYSNDLYVLDLPS 41 (42)
T ss_pred CCCCccceEEEEE-CCEEEEEcCcc-CC-----CCCEECcEEEEECCC
Confidence 4789999998766 99999999987 21 112233577777654
No 55
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC, which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=95.55 E-value=3.6 Score=42.80 Aligned_cols=263 Identities=13% Similarity=0.169 Sum_probs=123.1
Q ss_pred eEEEEECCCCCEEeCccCCCc-ccccceecCC-CcEEEecCCCCCCCeEEEEeCCCCccceeecccccc-cccccceeEE
Q 039705 81 LAVEYDAESAAIRPLKILTDT-WSSSGGLSAN-GTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELS-AKRWFSTQHI 157 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~-~c~~~~~l~d-G~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~-~~R~y~s~~~ 157 (539)
....||.++++++.+...... -.+.-+..++ ..||++.........+..|+-...+.+.+.+.. .. .+..-...++
T Consensus 16 ~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~-~~~~g~~p~~i~~ 94 (345)
T PF10282_consen 16 YVFRFDEETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNS-VPSGGSSPCHIAV 94 (345)
T ss_dssp EEEEEETTTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEE-EEESSSCEEEEEE
T ss_pred EEEEEcCCCCCceEeeeecCCCCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeee-eccCCCCcEEEEE
Confidence 456678899999876542111 1111122234 556666543112345555543311456766654 43 3343223344
Q ss_pred ccCCcEEEEcCccCCeEEEEecCCCcceeec-cCcc----ccCCCCCCCCcceEEE-eeCCcEEEEE---cCceeEeeCC
Q 039705 158 LPDGSFIVVGGRREFSYEYILKEGKRIIYDL-PILN----ETTNPSENNLYPFVFL-STDGNLFIFA---NDRSILLNPE 228 (539)
Q Consensus 158 L~dG~VyvvGG~~~~~~E~yP~~~~~~w~~~-~~l~----~~~~~~~~~~yp~~~~-~~~G~Iyv~G---g~~~e~yDp~ 228 (539)
-+||+.+++.-..+.++.+|+......-... .... ..........+||.+. .+||+.+++. .+.+.+|+..
T Consensus 95 ~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dlG~D~v~~~~~~ 174 (345)
T PF10282_consen 95 DPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDLGADRVYVYDID 174 (345)
T ss_dssp CTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEETTTTEEEEEEE-
T ss_pred ecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEecCCCEEEEEEEe
Confidence 4678777776555567777733211111100 0000 0000001234566654 4688754443 4567888877
Q ss_pred CCe--EEEEcc--cCCCC--CCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEe
Q 039705 229 TNE--ILHVFP--ILRGG--SRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEI 302 (539)
Q Consensus 229 tn~--W~~~~p--~mp~~--~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~ 302 (539)
.++ .. ... .++.+ +|+ .++-| ....+|++.-.+ +++..|+.
T Consensus 175 ~~~~~l~-~~~~~~~~~G~GPRh------~~f~p----------dg~~~Yv~~e~s----------------~~v~v~~~ 221 (345)
T PF10282_consen 175 DDTGKLT-PVDSIKVPPGSGPRH------LAFSP----------DGKYAYVVNELS----------------NTVSVFDY 221 (345)
T ss_dssp TTS-TEE-EEEEEECSTTSSEEE------EEE-T----------TSSEEEEEETTT----------------TEEEEEEE
T ss_pred CCCceEE-EeeccccccCCCCcE------EEEcC----------CcCEEEEecCCC----------------CcEEEEee
Confidence 655 43 211 12221 343 33332 145678875432 34566666
Q ss_pred eCCCCceee---e-ccCC---Cc-eeceeEEecCCc-EEEEcCcCCCCCCcccCCCCCCccEEEcC--CCCCCCceEecC
Q 039705 303 TNKSATWQR---E-MMPS---PR-VMGEMLLLPTGD-VLIINGAKKGTAGWNFATDPNTTPVLYEP--DDPINERFSELT 371 (539)
Q Consensus 303 ~~~~~~W~~---~-~M~~---~R-~~~~~vvlpdG~-I~vvGG~~~g~~g~~~~~~~~~~~e~YdP--~t~~g~~Wt~~a 371 (539)
......++. . .++. .. ..+...+-|||+ |||.+-.. .++-+|+- ++. +.+.+.
T Consensus 222 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~-------------~sI~vf~~d~~~g---~l~~~~ 285 (345)
T PF10282_consen 222 DPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGS-------------NSISVFDLDPATG---TLTLVQ 285 (345)
T ss_dssp ETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTT-------------TEEEEEEECTTTT---TEEEEE
T ss_pred cccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccC-------------CEEEEEEEecCCC---ceEEEE
Confidence 533445553 2 2322 11 233445678897 56655322 15566665 444 555443
Q ss_pred C----CCCCccceeeEeeCCCCeEEEecCC
Q 039705 372 P----TSKPRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 372 ~----~~~~R~yhs~a~llpdG~V~v~GG~ 397 (539)
. ...|| .. .+-|||+.++++..
T Consensus 286 ~~~~~G~~Pr---~~-~~s~~g~~l~Va~~ 311 (345)
T PF10282_consen 286 TVPTGGKFPR---HF-AFSPDGRYLYVANQ 311 (345)
T ss_dssp EEEESSSSEE---EE-EE-TTSSEEEEEET
T ss_pred EEeCCCCCcc---EE-EEeCCCCEEEEEec
Confidence 2 23356 23 34789997776553
No 56
>PRK13684 Ycf48-like protein; Provisional
Probab=95.52 E-value=3.7 Score=42.68 Aligned_cols=75 Identities=19% Similarity=0.302 Sum_probs=44.9
Q ss_pred CCceeeeccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCcc--ceeeE
Q 039705 306 SATWQREMMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRM--CHSTS 383 (539)
Q Consensus 306 ~~~W~~~~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~--yhs~a 383 (539)
..+|+..+.+..+...+++..++|+++++|.. |. .++. .+|.|++|+....-..... .++++
T Consensus 203 g~tW~~~~~~~~~~l~~i~~~~~g~~~~vg~~--G~-------------~~~~-s~d~G~sW~~~~~~~~~~~~~l~~v~ 266 (334)
T PRK13684 203 QTAWTPHQRNSSRRLQSMGFQPDGNLWMLARG--GQ-------------IRFN-DPDDLESWSKPIIPEITNGYGYLDLA 266 (334)
T ss_pred CCeEEEeeCCCcccceeeeEcCCCCEEEEecC--CE-------------EEEc-cCCCCCccccccCCccccccceeeEE
Confidence 35798875555555556667789999998753 21 2231 3566779997532111122 23333
Q ss_pred eeCCCCeEEEecCC
Q 039705 384 VVLPDGKILVAGSN 397 (539)
Q Consensus 384 ~llpdG~V~v~GG~ 397 (539)
..++++++++|..
T Consensus 267 -~~~~~~~~~~G~~ 279 (334)
T PRK13684 267 -YRTPGEIWAGGGN 279 (334)
T ss_pred -EcCCCCEEEEcCC
Confidence 3578899998864
No 57
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.41 E-value=1 Score=47.78 Aligned_cols=250 Identities=14% Similarity=0.179 Sum_probs=133.7
Q ss_pred CcccceEEEecCCCCEEEEEccccCCCC--CccCCCcccccC-C--C--ccccccccceeEEEEECCCCCE-EeCc----
Q 039705 29 SGISAMHIILFPNTNKAIMLDAVSLGPS--NVRLPVGIYRLN-P--G--AWQKYVDYRALAVEYDAESAAI-RPLK---- 96 (539)
Q Consensus 29 ~~~~~~~~~ll~~~gkv~~~g~~~~~~~--~~~~~~g~~~~~-~--~--~~~g~~~~~~~~~~yDp~t~~w-~~l~---- 96 (539)
+|..+.+.++. ..-+|-+++....... ..+|++..|... + . .+.| |-.-++.+||.++... +.+.
T Consensus 35 sp~~P~d~aVt-~S~rvqly~~~~~~~~k~~srFk~~v~s~~fR~DG~LlaaG--D~sG~V~vfD~k~r~iLR~~~ah~a 111 (487)
T KOG0310|consen 35 SPKHPYDFAVT-SSVRVQLYSSVTRSVRKTFSRFKDVVYSVDFRSDGRLLAAG--DESGHVKVFDMKSRVILRQLYAHQA 111 (487)
T ss_pred CCCCCCceEEe-cccEEEEEecchhhhhhhHHhhccceeEEEeecCCeEEEcc--CCcCcEEEeccccHHHHHHHhhccC
Confidence 46667788887 8888888887553322 123444433221 1 0 0111 2345788999655222 1111
Q ss_pred cCC-CcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccc--eeEEccCCcEEEEcCccCCe
Q 039705 97 ILT-DTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFS--TQHILPDGSFIVVGGRREFS 173 (539)
Q Consensus 97 ~~~-~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~--s~~~L~dG~VyvvGG~~~~~ 173 (539)
+-+ ..|| .-|+++++.|+.. +.+.++|-. +.. . ... +...--|- ..+.=.++.+++.||+++ .
T Consensus 112 pv~~~~f~-----~~d~t~l~s~sDd---~v~k~~d~s--~a~-v-~~~-l~~htDYVR~g~~~~~~~hivvtGsYDg-~ 177 (487)
T KOG0310|consen 112 PVHVTKFS-----PQDNTMLVSGSDD---KVVKYWDLS--TAY-V-QAE-LSGHTDYVRCGDISPANDHIVVTGSYDG-K 177 (487)
T ss_pred ceeEEEec-----ccCCeEEEecCCC---ceEEEEEcC--CcE-E-EEE-ecCCcceeEeeccccCCCeEEEecCCCc-e
Confidence 111 2244 2488999998863 556677765 332 2 222 33221111 112222678999999985 3
Q ss_pred EEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEE-EcCceeEeeCCCCeEEEEcccCCCCCCccCCCcc
Q 039705 174 YEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIF-ANDRSILLNPETNEILHVFPILRGGSRNYPASAT 251 (539)
Q Consensus 174 ~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~-Gg~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~ 251 (539)
+..| .+.. ..|. ..+-.. .+.+ .+..+++|.+++. ||+++-+||..++.- .+..|.. ++ -+
T Consensus 178 vrl~DtR~~-~~~v-~elnhg--~pVe-----~vl~lpsgs~iasAgGn~vkVWDl~~G~q--ll~~~~~---H~---Kt 240 (487)
T KOG0310|consen 178 VRLWDTRSL-TSRV-VELNHG--CPVE-----SVLALPSGSLIASAGGNSVKVWDLTTGGQ--LLTSMFN---HN---KT 240 (487)
T ss_pred EEEEEeccC-Ccee-EEecCC--Ccee-----eEEEcCCCCEEEEcCCCeEEEEEecCCce--ehhhhhc---cc---ce
Confidence 5566 4432 1332 111101 0000 2455678766665 678899999987663 3444432 11 23
Q ss_pred EEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCc
Q 039705 252 SALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGD 330 (539)
Q Consensus 252 av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~ 330 (539)
...|.+.. ++.=++.||.+. .+-.||. ..|... .|.++-.-....+.||++
T Consensus 241 VTcL~l~s--------~~~rLlS~sLD~----------------~VKVfd~----t~~Kvv~s~~~~~pvLsiavs~dd~ 292 (487)
T KOG0310|consen 241 VTCLRLAS--------DSTRLLSGSLDR----------------HVKVFDT----TNYKVVHSWKYPGPVLSIAVSPDDQ 292 (487)
T ss_pred EEEEEeec--------CCceEeeccccc----------------ceEEEEc----cceEEEEeeecccceeeEEecCCCc
Confidence 34444321 456778888762 3556773 457765 554443334456789999
Q ss_pred EEEEcCcCCCC
Q 039705 331 VLIINGAKKGT 341 (539)
Q Consensus 331 I~vvGG~~~g~ 341 (539)
..|+|..+ |.
T Consensus 293 t~viGmsn-Gl 302 (487)
T KOG0310|consen 293 TVVIGMSN-GL 302 (487)
T ss_pred eEEEeccc-ce
Confidence 99999876 53
No 58
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=94.98 E-value=5.5 Score=41.67 Aligned_cols=216 Identities=16% Similarity=0.168 Sum_probs=104.8
Q ss_pred cceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCc
Q 039705 105 SGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKR 183 (539)
Q Consensus 105 ~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~ 183 (539)
.+-++.++++|+.+.. ..+..||+.++.-.|..- +.. +.. +..++.+++||+ |..++ .+..+ +.++..
T Consensus 59 ~~p~v~~~~v~v~~~~----g~v~a~d~~tG~~~W~~~---~~~-~~~-~~p~v~~~~v~v-~~~~g-~l~ald~~tG~~ 127 (377)
T TIGR03300 59 LQPAVAGGKVYAADAD----GTVVALDAETGKRLWRVD---LDE-RLS-GGVGADGGLVFV-GTEKG-EVIALDAEDGKE 127 (377)
T ss_pred cceEEECCEEEEECCC----CeEEEEEccCCcEeeeec---CCC-Ccc-cceEEcCCEEEE-EcCCC-EEEEEECCCCcE
Confidence 3444557777776542 468899987445568653 221 222 233454667765 44332 33444 444433
Q ss_pred ceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc-CceeEeeCCCCe--EEEEcccCCCCCCccCCCccEEecccccC
Q 039705 184 IIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN-DRSILLNPETNE--ILHVFPILRGGSRNYPASATSALLPIKLQ 260 (539)
Q Consensus 184 ~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg-~~~e~yDp~tn~--W~~~~p~mp~~~r~yp~~g~av~lpl~~~ 260 (539)
.|.... ... .+ ...+..++++|+..+ ....++|+++++ |+.....-....+.. ++.+.
T Consensus 128 ~W~~~~--~~~-------~~-~~p~v~~~~v~v~~~~g~l~a~d~~tG~~~W~~~~~~~~~~~~~~---~sp~~------ 188 (377)
T TIGR03300 128 LWRAKL--SSE-------VL-SPPLVANGLVVVRTNDGRLTALDAATGERLWTYSRVTPALTLRGS---ASPVI------ 188 (377)
T ss_pred eeeecc--Cce-------ee-cCCEEECCEEEEECCCCeEEEEEcCCCceeeEEccCCCceeecCC---CCCEE------
Confidence 564210 000 00 012334677777544 346788998765 653221100000111 11111
Q ss_pred CCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCc--------eeceeEEecCCcE
Q 039705 261 DPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPR--------VMGEMLLLPTGDV 331 (539)
Q Consensus 261 ~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R--------~~~~~vvlpdG~I 331 (539)
.++.+| +|..+ ..+..+|+.+....|+.. ..+..+ .... .++.+++|
T Consensus 189 ------~~~~v~-~~~~~----------------g~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~-p~~~~~~v 244 (377)
T TIGR03300 189 ------ADGGVL-VGFAG----------------GKLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGD-PVVDGGQV 244 (377)
T ss_pred ------ECCEEE-EECCC----------------CEEEEEEccCCCEeeeeccccCCCCCchhhhhccCCc-cEEECCEE
Confidence 145554 44322 135567775444568754 322211 1122 33458888
Q ss_pred EEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecC
Q 039705 332 LIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGS 396 (539)
Q Consensus 332 ~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG 396 (539)
|+.+... .+.+||+++.+ ..|+.-. ..+.+.++ .|++||+...
T Consensus 245 y~~~~~g--------------~l~a~d~~tG~-~~W~~~~-----~~~~~p~~--~~~~vyv~~~ 287 (377)
T TIGR03300 245 YAVSYQG--------------RVAALDLRSGR-VLWKRDA-----SSYQGPAV--DDNRLYVTDA 287 (377)
T ss_pred EEEEcCC--------------EEEEEECCCCc-EEEeecc-----CCccCceE--eCCEEEEECC
Confidence 8855321 46789998762 2586531 11222233 4899998753
No 59
>PF13088 BNR_2: BNR repeat-like domain; PDB: 2F11_A 2F0Z_A 1VCU_B 2F25_B 1SO7_A 2F29_A 1SNT_A 2F13_A 2F28_A 2F27_A ....
Probab=94.63 E-value=2.5 Score=42.08 Aligned_cols=218 Identities=14% Similarity=0.125 Sum_probs=104.9
Q ss_pred ccceeecccccc------cccccceeEEccCCcEEEEc--CccC---CeEE---EEecCCCcceeeccCccccCC-CCCC
Q 039705 136 ACYWKEHHWELS------AKRWFSTQHILPDGSFIVVG--GRRE---FSYE---YILKEGKRIIYDLPILNETTN-PSEN 200 (539)
Q Consensus 136 t~~W~~~~~~m~------~~R~y~s~~~L~dG~VyvvG--G~~~---~~~E---~yP~~~~~~w~~~~~l~~~~~-~~~~ 200 (539)
..+|++... +. ..-+.+..+..++|+|+++- +... .... ++...+..+|.....+..... ....
T Consensus 29 G~tWs~~~~-v~~~~~~~~~~~~p~~~~~~~g~l~l~~~~~~~~~~~~~~~~~~~~S~D~G~TWs~~~~l~~~~~~~~~~ 107 (275)
T PF13088_consen 29 GKTWSEPRI-VADGPKPGRRYGNPSLVVDPDGRLWLFYSAGSSGGGWSGSRIYYSRSTDGGKTWSEPTDLPPGWFGNFSG 107 (275)
T ss_dssp TTEEEEEEE-EETSTBTTCEEEEEEEEEETTSEEEEEEEEEETTESCCTCEEEEEEESSTTSS-EEEEEEHHHCCCSCEE
T ss_pred CCeeCCCEE-EeeccccCCcccCcEEEEeCCCCEEEEEEEccCCCCCCceeEEEEEECCCCCCCCCccccccccccceec
Confidence 678987532 21 11234444555699999886 2221 1111 224433567853322211100 0000
Q ss_pred CCcceEEEeeCCcEEEEEc------C-ceeEee-CCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEE
Q 039705 201 NLYPFVFLSTDGNLFIFAN------D-RSILLN-PETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVL 272 (539)
Q Consensus 201 ~~yp~~~~~~~G~Iyv~Gg------~-~~e~yD-p~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iy 272 (539)
......+.+.+|++++..- . ....|. -...+|. .....+..... .-.+++.+ .+|+|+
T Consensus 108 ~~~~~~i~~~~G~l~~~~~~~~~~~~~~~~~~S~D~G~tW~-~~~~~~~~~~~----~e~~~~~~---------~dG~l~ 173 (275)
T PF13088_consen 108 PGRGPPIQLPDGRLIAPYYHESGGSFSAFVYYSDDGGKTWS-SGSPIPDGQGE----CEPSIVEL---------PDGRLL 173 (275)
T ss_dssp CSEEEEEEECTTEEEEEEEEESSCEEEEEEEEESSTTSSEE-EEEECECSEEE----EEEEEEEE---------TTSEEE
T ss_pred cceeeeeEecCCCEEEEEeeccccCcceEEEEeCCCCceee-ccccccccCCc----ceeEEEEC---------CCCcEE
Confidence 0011225567999988641 1 122343 4456687 34333221111 11122211 267888
Q ss_pred EecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee---ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCC
Q 039705 273 ICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE---MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATD 349 (539)
Q Consensus 273 v~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~---~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~ 349 (539)
++--.. .. .....+.-.|...+|+.. .++........+.+.||+++++.....+.
T Consensus 174 ~~~R~~-~~-------------~~~~~~~S~D~G~TWs~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r-------- 231 (275)
T PF13088_consen 174 AVFRTE-GN-------------DDIYISRSTDGGRTWSPPQPTNLPNPNSSISLVRLSDGRLLLVYNNPDGR-------- 231 (275)
T ss_dssp EEEEEC-SS-------------TEEEEEEESSTTSS-EEEEEEECSSCCEEEEEEECTTSEEEEEEECSSTS--------
T ss_pred EEEEcc-CC-------------CcEEEEEECCCCCcCCCceecccCcccCCceEEEcCCCCEEEEEECCCCC--------
Confidence 874321 00 112222223446789962 67777777777788999999988732111
Q ss_pred CCCccEEEcCCCCCCCceEecCCCCCCc---cceeeEeeCCCCeEEE
Q 039705 350 PNTTPVLYEPDDPINERFSELTPTSKPR---MCHSTSVVLPDGKILV 393 (539)
Q Consensus 350 ~~~~~e~YdP~t~~g~~Wt~~a~~~~~R---~yhs~a~llpdG~V~v 393 (539)
..+.+. + ..+.|++|+........- ...+..+.++||+|+|
T Consensus 232 ~~l~l~-~--S~D~g~tW~~~~~i~~~~~~~~~Y~~~~~~~dg~l~i 275 (275)
T PF13088_consen 232 SNLSLY-V--SEDGGKTWSRPKTIDDGPNGDSGYPSLTQLPDGKLYI 275 (275)
T ss_dssp EEEEEE-E--ECTTCEEEEEEEEEEEEE-CCEEEEEEEEEETTEEEE
T ss_pred CceEEE-E--EeCCCCcCCccEEEeCCCCCcEECCeeEEeCCCcCCC
Confidence 111222 2 333466998654333322 3335566678999986
No 60
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=94.51 E-value=4.5 Score=38.45 Aligned_cols=245 Identities=17% Similarity=0.183 Sum_probs=112.5
Q ss_pred EEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccccccccc-ccceeEEccC
Q 039705 82 AVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKR-WFSTQHILPD 160 (539)
Q Consensus 82 ~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R-~y~s~~~L~d 160 (539)
..+||..+++.......+...........+++.+++++.. ..+.+||.. +.+... . +.... .-.+.....+
T Consensus 33 i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~---~~i~i~~~~--~~~~~~--~-~~~~~~~i~~~~~~~~ 104 (289)
T cd00200 33 IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSD---KTIRLWDLE--TGECVR--T-LTGHTSYVSSVAFSPD 104 (289)
T ss_pred EEEEEeeCCCcEEEEecCCcceeEEEECCCCCEEEEEcCC---CeEEEEEcC--cccceE--E-EeccCCcEEEEEEcCC
Confidence 4466666554222111111111223445677788888752 578899886 432211 1 21111 1123344446
Q ss_pred CcEEEEcCccCCeEEEE-ecCCCcce-eeccCccccCCCCCCCCcceEEEeeCCcEEEEEc--CceeEeeCCCCeEEEEc
Q 039705 161 GSFIVVGGRREFSYEYI-LKEGKRII-YDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN--DRSILLNPETNEILHVF 236 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y-P~~~~~~w-~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--~~~e~yDp~tn~W~~~~ 236 (539)
+++++.++.+ ..+.+| ..+. +. .... .... . --.....+++++++.+. ..+.+||..+++-...+
T Consensus 105 ~~~~~~~~~~-~~i~~~~~~~~--~~~~~~~---~~~~----~-i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~ 173 (289)
T cd00200 105 GRILSSSSRD-KTIKVWDVETG--KCLTTLR---GHTD----W-VNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATL 173 (289)
T ss_pred CCEEEEecCC-CeEEEEECCCc--EEEEEec---cCCC----c-EEEEEEcCcCCEEEEEcCCCcEEEEEccccccceeE
Confidence 6777777644 345566 4322 21 1111 0000 0 00122335577777764 45788998765433222
Q ss_pred ccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeeecc-C
Q 039705 237 PILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQREMM-P 315 (539)
Q Consensus 237 p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~~M-~ 315 (539)
. . .... --...+.| +++.+++++.+ + .+..||+.. .+... .+ .
T Consensus 174 ~---~-~~~~--i~~~~~~~-----------~~~~l~~~~~~-~---------------~i~i~d~~~--~~~~~-~~~~ 217 (289)
T cd00200 174 T---G-HTGE--VNSVAFSP-----------DGEKLLSSSSD-G---------------TIKLWDLST--GKCLG-TLRG 217 (289)
T ss_pred e---c-Cccc--cceEEECC-----------CcCEEEEecCC-C---------------cEEEEECCC--Cceec-chhh
Confidence 1 1 1100 01122221 45455555543 1 345677642 11111 11 1
Q ss_pred CCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEec
Q 039705 316 SPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAG 395 (539)
Q Consensus 316 ~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~G 395 (539)
....-......+++++++.++.+ + .+.+||..+.. .-..+. ..... -....+.++++.++++
T Consensus 218 ~~~~i~~~~~~~~~~~~~~~~~~-~------------~i~i~~~~~~~--~~~~~~-~~~~~--i~~~~~~~~~~~l~~~ 279 (289)
T cd00200 218 HENGVNSVAFSPDGYLLASGSED-G------------TIRVWDLRTGE--CVQTLS-GHTNS--VTSLAWSPDGKRLASG 279 (289)
T ss_pred cCCceEEEEEcCCCcEEEEEcCC-C------------cEEEEEcCCce--eEEEcc-ccCCc--EEEEEECCCCCEEEEe
Confidence 11122233456678888877633 1 57889887641 222222 11111 1234456888999988
Q ss_pred CCCC
Q 039705 396 SNPH 399 (539)
Q Consensus 396 G~~~ 399 (539)
+.++
T Consensus 280 ~~d~ 283 (289)
T cd00200 280 SADG 283 (289)
T ss_pred cCCC
Confidence 8643
No 61
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=94.41 E-value=5.6 Score=39.13 Aligned_cols=131 Identities=12% Similarity=0.188 Sum_probs=64.3
Q ss_pred EEEEECCCCCEEeCccCCCcccccceecCCCcE-EEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 82 AVEYDAESAAIRPLKILTDTWSSSGGLSANGTI-VISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 82 ~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l-~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
..+||+.+++....-.... ........+||+. |++++. ...+.+||.. +.+.... +.....-...+..+|
T Consensus 13 v~~~d~~t~~~~~~~~~~~-~~~~l~~~~dg~~l~~~~~~---~~~v~~~d~~--~~~~~~~---~~~~~~~~~~~~~~~ 83 (300)
T TIGR03866 13 ISVIDTATLEVTRTFPVGQ-RPRGITLSKDGKLLYVCASD---SDTIQVIDLA--TGEVIGT---LPSGPDPELFALHPN 83 (300)
T ss_pred EEEEECCCCceEEEEECCC-CCCceEECCCCCEEEEEECC---CCeEEEEECC--CCcEEEe---ccCCCCccEEEECCC
Confidence 4567776665433221111 1122345678875 466553 3578899987 5554331 111111123445567
Q ss_pred CcEEEEcCccCCeEEEE-ecCCCcce-eeccCccccCCCCCCCCcc-eEEEeeCCcEEEEEcCc---eeEeeCCCCeEE
Q 039705 161 GSFIVVGGRREFSYEYI-LKEGKRII-YDLPILNETTNPSENNLYP-FVFLSTDGNLFIFANDR---SILLNPETNEIL 233 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y-P~~~~~~w-~~~~~l~~~~~~~~~~~yp-~~~~~~~G~Iyv~Gg~~---~e~yDp~tn~W~ 233 (539)
|+.+.+.+.....+.+| ..+. +- ...+. . ..+ .....++|++++++..+ ...||..+.+-.
T Consensus 84 g~~l~~~~~~~~~l~~~d~~~~--~~~~~~~~--~--------~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~~~~~ 150 (300)
T TIGR03866 84 GKILYIANEDDNLVTVIDIETR--KVLAEIPV--G--------VEPEGMAVSPDGKIVVNTSETTNMAHFIDTKTYEIV 150 (300)
T ss_pred CCEEEEEcCCCCeEEEEECCCC--eEEeEeeC--C--------CCcceEEECCCCCEEEEEecCCCeEEEEeCCCCeEE
Confidence 77544443333456666 4432 21 11110 0 011 23445799988887643 445788776543
No 62
>KOG2437 consensus Muskelin [Signal transduction mechanisms]
Probab=94.35 E-value=0.067 Score=56.77 Aligned_cols=160 Identities=11% Similarity=0.119 Sum_probs=96.5
Q ss_pred EEEeeC--CcEEEEEcCc-------eeEeeCCCCeEEEEc---ccCCCCCCccCCCccEEecccccCCCCCCCcccEEEE
Q 039705 206 VFLSTD--GNLFIFANDR-------SILLNPETNEILHVF---PILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLI 273 (539)
Q Consensus 206 ~~~~~~--G~Iyv~Gg~~-------~e~yDp~tn~W~~~~---p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv 273 (539)
-++.-+ .+||..||.+ -|.|.-..|.|+ .+ ...|+ .|.+ .-.|. . . -..|+|.
T Consensus 265 QMV~~~~~~CiYLYGGWdG~~~l~DFW~Y~v~e~~W~-~iN~~t~~PG-~RsC---HRMVi-d--------~-S~~KLYL 329 (723)
T KOG2437|consen 265 QMVIDVQTECVYLYGGWDGTQDLADFWAYSVKENQWT-CINRDTEGPG-ARSC---HRMVI-D--------I-SRRKLYL 329 (723)
T ss_pred eEEEeCCCcEEEEecCcccchhHHHHHhhcCCcceeE-EeecCCCCCc-chhh---hhhhh-h--------h-hHhHHhh
Confidence 344444 4999999964 589999999998 43 22455 5653 22221 1 0 1458999
Q ss_pred ecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeeeccCCC-------ceeceeEEecCCc--EEEEcCcCCCCCCc
Q 039705 274 CGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQREMMPSP-------RVMGEMLLLPTGD--VLIINGAKKGTAGW 344 (539)
Q Consensus 274 ~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~~M~~~-------R~~~~~vvlpdG~--I~vvGG~~~g~~g~ 344 (539)
.|-+-..+.. +-..+-+...+||+. ++.|...+|... -.-|+|++- ..| |||.||.....
T Consensus 330 lG~Y~~sS~r-----~~~s~RsDfW~FDi~--~~~W~~ls~dt~~dGGP~~vfDHqM~Vd-~~k~~iyVfGGr~~~~--- 398 (723)
T KOG2437|consen 330 LGRYLDSSVR-----NSKSLRSDFWRFDID--TNTWMLLSEDTAADGGPKLVFDHQMCVD-SEKHMIYVFGGRILTC--- 398 (723)
T ss_pred hhhccccccc-----cccccccceEEEecC--CceeEEecccccccCCcceeecceeeEe-cCcceEEEecCeeccC---
Confidence 9864311111 011234567889986 589997666543 234566544 555 99999986421
Q ss_pred ccCCCCCCc-cEEEcCCCCCCCceEecC----------CCCCCccceeeEeeCCCCeEEEecCC
Q 039705 345 NFATDPNTT-PVLYEPDDPINERFSELT----------PTSKPRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 345 ~~~~~~~~~-~e~YdP~t~~g~~Wt~~a----------~~~~~R~yhs~a~llpdG~V~v~GG~ 397 (539)
+++++. ...||-... .|..++ .-.+.|+.|.+-..--+.+.|+.||.
T Consensus 399 ---~e~~f~GLYaf~~~~~---~w~~l~e~~~~~~~vvE~~~sR~ghcmE~~~~n~~ly~fggq 456 (723)
T KOG2437|consen 399 ---NEPQFSGLYAFNCQCQ---TWKLLREDSCNAGPVVEDIQSRIGHCMEFHSKNRCLYVFGGQ 456 (723)
T ss_pred ---CCccccceEEEecCCc---cHHHHHHHHhhcCcchhHHHHHHHHHHHhcCCCCeEEeccCc
Confidence 224443 567887766 887542 23457888887655445556777764
No 63
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=94.08 E-value=6.6 Score=38.63 Aligned_cols=131 Identities=15% Similarity=0.123 Sum_probs=66.7
Q ss_pred EEEEECCCCCEEe-CccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 82 AVEYDAESAAIRP-LKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 82 ~~~yDp~t~~w~~-l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
..+||..+.+... +...... ...++.++|+.+.+.+..+ ..+.+||.. +.+- +.. +.....-.+.+.-+|
T Consensus 55 v~~~d~~~~~~~~~~~~~~~~--~~~~~~~~g~~l~~~~~~~--~~l~~~d~~--~~~~--~~~-~~~~~~~~~~~~~~d 125 (300)
T TIGR03866 55 IQVIDLATGEVIGTLPSGPDP--ELFALHPNGKILYIANEDD--NLVTVIDIE--TRKV--LAE-IPVGVEPEGMAVSPD 125 (300)
T ss_pred EEEEECCCCcEEEeccCCCCc--cEEEECCCCCEEEEEcCCC--CeEEEEECC--CCeE--EeE-eeCCCCcceEEECCC
Confidence 5578888877654 2222222 1234456777554433222 578999987 4321 111 221221233455678
Q ss_pred CcEEEEcCccCCeEEEE-ecCCCccee-eccCccccCCCCCCCCcc-eEEEeeCCcEEEEEc---CceeEeeCCCCeEE
Q 039705 161 GSFIVVGGRREFSYEYI-LKEGKRIIY-DLPILNETTNPSENNLYP-FVFLSTDGNLFIFAN---DRSILLNPETNEIL 233 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y-P~~~~~~w~-~~~~l~~~~~~~~~~~yp-~~~~~~~G~Iyv~Gg---~~~e~yDp~tn~W~ 233 (539)
|++++++..+......| ..+. +-. ... ... .| .....++|+.+++++ ..+.+||.++.+..
T Consensus 126 g~~l~~~~~~~~~~~~~d~~~~--~~~~~~~--~~~--------~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~~~ 192 (300)
T TIGR03866 126 GKIVVNTSETTNMAHFIDTKTY--EIVDNVL--VDQ--------RPRFAEFTADGKELWVSSEIGGTVSVIDVATRKVI 192 (300)
T ss_pred CCEEEEEecCCCeEEEEeCCCC--eEEEEEE--cCC--------CccEEEECCCCCEEEEEcCCCCEEEEEEcCcceee
Confidence 99988887654334444 3322 111 111 000 11 223456887665543 34778999887654
No 64
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=93.96 E-value=8.4 Score=39.44 Aligned_cols=70 Identities=13% Similarity=0.037 Sum_probs=38.4
Q ss_pred ceecCCCcEEEecCCCCCCCeEEEEeCCCCccc-e-eecccccccccccceeEEccCCcEEEEcCccCCeEEEE-ecC
Q 039705 106 GGLSANGTIVISGGWSSRGRSVRYLSGCYHACY-W-KEHHWELSAKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKE 180 (539)
Q Consensus 106 ~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~-W-~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~ 180 (539)
-++.+||+.+.+..+. ...+.+||.. ++. . ..... +.....-++++.-+||+.+.+.......+.+| ..+
T Consensus 85 i~~~~~g~~l~v~~~~--~~~v~v~~~~--~~g~~~~~~~~-~~~~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~ 157 (330)
T PRK11028 85 ISTDHQGRFLFSASYN--ANCVSVSPLD--KDGIPVAPIQI-IEGLEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSD 157 (330)
T ss_pred EEECCCCCEEEEEEcC--CCeEEEEEEC--CCCCCCCceee-ccCCCcccEeEeCCCCCEEEEeeCCCCEEEEEEECC
Confidence 3456788866666543 3677788764 221 1 11111 22222224455666887776666666678888 443
No 65
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=93.67 E-value=8.5 Score=38.73 Aligned_cols=191 Identities=17% Similarity=0.210 Sum_probs=106.8
Q ss_pred CCcEEEEcCccCCeEEEEecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc--CceeEeeCCCC--eEEEE
Q 039705 160 DGSFIVVGGRREFSYEYILKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN--DRSILLNPETN--EILHV 235 (539)
Q Consensus 160 dG~VyvvGG~~~~~~E~yP~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--~~~e~yDp~tn--~W~~~ 235 (539)
||+++|--......+...|... .|.. .++-.|.|+..+.|| +...+|+.++. .-...
T Consensus 76 DGklIvWDs~TtnK~haipl~s--~WVM-----------------tCA~sPSg~~VAcGGLdN~Csiy~ls~~d~~g~~~ 136 (343)
T KOG0286|consen 76 DGKLIVWDSFTTNKVHAIPLPS--SWVM-----------------TCAYSPSGNFVACGGLDNKCSIYPLSTRDAEGNVR 136 (343)
T ss_pred CCeEEEEEcccccceeEEecCc--eeEE-----------------EEEECCCCCeEEecCcCceeEEEecccccccccce
Confidence 8999987665544334444433 5631 123457888888898 45778988754 21111
Q ss_pred cc-cCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-c
Q 039705 236 FP-ILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-M 313 (539)
Q Consensus 236 ~p-~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~ 313 (539)
.. .+++ .+.|- ...-.+ .++.|+.--| + .+|...|++. .+=... .
T Consensus 137 v~r~l~g-Htgyl--ScC~f~-----------dD~~ilT~SG-D----------------~TCalWDie~--g~~~~~f~ 183 (343)
T KOG0286|consen 137 VSRELAG-HTGYL--SCCRFL-----------DDNHILTGSG-D----------------MTCALWDIET--GQQTQVFH 183 (343)
T ss_pred eeeeecC-cccee--EEEEEc-----------CCCceEecCC-C----------------ceEEEEEccc--ceEEEEec
Confidence 11 1233 34441 223332 1556555434 2 3577888863 333333 3
Q ss_pred cCCCceeceeEEec-CCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEE
Q 039705 314 MPSPRVMGEMLLLP-TGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKIL 392 (539)
Q Consensus 314 M~~~R~~~~~vvlp-dG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~ 392 (539)
-+.+-.+.- -+.| |++.||.||.+. ++.+||-... +=...=.-+..- -.+....|+|--+
T Consensus 184 GH~gDV~sl-sl~p~~~ntFvSg~cD~-------------~aklWD~R~~---~c~qtF~ghesD--INsv~ffP~G~af 244 (343)
T KOG0286|consen 184 GHTGDVMSL-SLSPSDGNTFVSGGCDK-------------SAKLWDVRSG---QCVQTFEGHESD--INSVRFFPSGDAF 244 (343)
T ss_pred CCcccEEEE-ecCCCCCCeEEeccccc-------------ceeeeeccCc---ceeEeecccccc--cceEEEccCCCee
Confidence 333333322 3457 999999999873 4678998776 322211111111 2345678999999
Q ss_pred EecCCCCCC--CccCCCCCCCcceeeEEecCCCCCCC
Q 039705 393 VAGSNPHSR--YNLTSGSKYPTELRIEKFYPPYFDES 427 (539)
Q Consensus 393 v~GG~~~~~--~~~~~~~~~p~~~~vE~y~Ppyl~~~ 427 (539)
++|+.+..- |... ....+++|.++-..-|
T Consensus 245 atGSDD~tcRlyDlR------aD~~~a~ys~~~~~~g 275 (343)
T KOG0286|consen 245 ATGSDDATCRLYDLR------ADQELAVYSHDSIICG 275 (343)
T ss_pred eecCCCceeEEEeec------CCcEEeeeccCcccCC
Confidence 999975431 2111 2457899998877655
No 66
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybrid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=93.61 E-value=1.7 Score=42.32 Aligned_cols=142 Identities=14% Similarity=0.167 Sum_probs=83.7
Q ss_pred eeEEEEECCCCCEEeCccCCC--cccccceecCCCcEEEecCCCCC-C-CeEEEEeCCCCccceee-ccccccccc----
Q 039705 80 ALAVEYDAESAAIRPLKILTD--TWSSSGGLSANGTIVISGGWSSR-G-RSVRYLSGCYHACYWKE-HHWELSAKR---- 150 (539)
Q Consensus 80 ~~~~~yDp~t~~w~~l~~~~~--~~c~~~~~l~dG~l~v~GG~~~g-~-~~v~~ydP~~~t~~W~~-~~~~m~~~R---- 150 (539)
..+++|+..+++|+.+..... ..... .+..||.+|-+.-...+ . ..+-.||.. +.+|.+ ++ ++..+
T Consensus 70 ~~~~Vys~~~~~Wr~~~~~~~~~~~~~~-~v~~~G~lyw~~~~~~~~~~~~IvsFDl~--~E~f~~~i~--~P~~~~~~~ 144 (230)
T TIGR01640 70 SEHQVYTLGSNSWRTIECSPPHHPLKSR-GVCINGVLYYLAYTLKTNPDYFIVSFDVS--SERFKEFIP--LPCGNSDSV 144 (230)
T ss_pred ccEEEEEeCCCCccccccCCCCccccCC-eEEECCEEEEEEEECCCCCcEEEEEEEcc--cceEeeeee--cCccccccc
Confidence 357899999999999874321 12222 56678988877633221 1 268889999 899995 43 33322
Q ss_pred ccceeEEccCCcEEEEcCccC-CeEEEE-ecCC-Ccceee---ccCccccCCCCCCCCcceEEEeeCCcEEEEEcC---c
Q 039705 151 WFSTQHILPDGSFIVVGGRRE-FSYEYI-LKEG-KRIIYD---LPILNETTNPSENNLYPFVFLSTDGNLFIFAND---R 221 (539)
Q Consensus 151 ~y~s~~~L~dG~VyvvGG~~~-~~~E~y-P~~~-~~~w~~---~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~---~ 221 (539)
.+...+.+ +|++.++.-... .++|+| -+.. ...|.. .++.... +.. ...+ .....-+|+|.+.... .
T Consensus 145 ~~~~L~~~-~G~L~~v~~~~~~~~~~IWvl~d~~~~~W~k~~~i~~~~~~-~~~-~~~~-~~~~~~~g~I~~~~~~~~~~ 220 (230)
T TIGR01640 145 DYLSLINY-KGKLAVLKQKKDTNNFDLWVLNDAGKQEWSKLFTVPIPPLP-DLV-DDNF-LSGFTDKGEIVLCCEDENPF 220 (230)
T ss_pred cceEEEEE-CCEEEEEEecCCCCcEEEEEECCCCCCceeEEEEEcCcchh-hhh-hhee-EeEEeeCCEEEEEeCCCCce
Confidence 23445566 699888776432 457888 4322 346852 3321110 100 1112 2356678999887653 2
Q ss_pred -eeEeeCCCC
Q 039705 222 -SILLNPETN 230 (539)
Q Consensus 222 -~e~yDp~tn 230 (539)
...||+++|
T Consensus 221 ~~~~y~~~~~ 230 (230)
T TIGR01640 221 YIFYYNVGEN 230 (230)
T ss_pred EEEEEeccCC
Confidence 677888875
No 67
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=93.42 E-value=10 Score=38.75 Aligned_cols=137 Identities=10% Similarity=-0.030 Sum_probs=66.2
Q ss_pred EEEEECCC-CCEEeCccCCCc-ccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEcc
Q 039705 82 AVEYDAES-AAIRPLKILTDT-WSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILP 159 (539)
Q Consensus 82 ~~~yDp~t-~~w~~l~~~~~~-~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~ 159 (539)
...||..+ ++++.+...... ....-++.+||+.+.+|+.. ...+..|+... ..+++.... .+....-...+.-+
T Consensus 14 I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~--~~~i~~~~~~~-~g~l~~~~~-~~~~~~p~~i~~~~ 89 (330)
T PRK11028 14 IHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRP--EFRVLSYRIAD-DGALTFAAE-SPLPGSPTHISTDH 89 (330)
T ss_pred EEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECC--CCcEEEEEECC-CCceEEeee-ecCCCCceEEEECC
Confidence 45666653 455544332211 11122445688876666543 35677777641 345554332 22222222345566
Q ss_pred CCcEEEEcCccCCeEEEE-ecCCCcce-eeccCccccCCCCCCCCcceEE-EeeCCcEEEEEc---CceeEeeCCCC
Q 039705 160 DGSFIVVGGRREFSYEYI-LKEGKRII-YDLPILNETTNPSENNLYPFVF-LSTDGNLFIFAN---DRSILLNPETN 230 (539)
Q Consensus 160 dG~VyvvGG~~~~~~E~y-P~~~~~~w-~~~~~l~~~~~~~~~~~yp~~~-~~~~G~Iyv~Gg---~~~e~yDp~tn 230 (539)
||+.+.+.......+-+| ..++ ... .....+.. ...|+.+ +.++|+..++.+ +.+.+||..++
T Consensus 90 ~g~~l~v~~~~~~~v~v~~~~~~-g~~~~~~~~~~~-------~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~ 158 (330)
T PRK11028 90 QGRFLFSASYNANCVSVSPLDKD-GIPVAPIQIIEG-------LEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSDD 158 (330)
T ss_pred CCCEEEEEEcCCCeEEEEEECCC-CCCCCceeeccC-------CCcccEeEeCCCCCEEEEeeCCCCEEEEEEECCC
Confidence 887666665544556667 4332 011 11111100 1234544 456876554433 56889998764
No 68
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=93.29 E-value=11 Score=38.64 Aligned_cols=242 Identities=13% Similarity=0.137 Sum_probs=99.8
Q ss_pred CCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccc-cccceeEEccCCcEEEEc
Q 039705 89 SAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAK-RWFSTQHILPDGSFIVVG 167 (539)
Q Consensus 89 t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~-R~y~s~~~L~dG~VyvvG 167 (539)
.+.|+.+....+.......+.-+.+-+++|-+. .+|--.++..+|.........+ .....++...+.+.||+|
T Consensus 5 ~~~W~~v~l~t~~~l~dV~F~d~~~G~~VG~~g------~il~T~DGG~tW~~~~~~~~~~~~~~l~~I~f~~~~g~ivG 78 (302)
T PF14870_consen 5 GNSWQQVSLPTDKPLLDVAFVDPNHGWAVGAYG------TILKTTDGGKTWQPVSLDLDNPFDYHLNSISFDGNEGWIVG 78 (302)
T ss_dssp S--EEEEE-S-SS-EEEEEESSSS-EEEEETTT------EEEEESSTTSS-EE-----S-----EEEEEEEETTEEEEEE
T ss_pred CCCcEEeecCCCCceEEEEEecCCEEEEEecCC------EEEEECCCCccccccccCCCccceeeEEEEEecCCceEEEc
Confidence 356777765554443344445457788887431 1333333367898865323332 223334444578899887
Q ss_pred CccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceE-EEeeCCcEEEEEcCceeEee--CCCCeEEEEcccCCCCC
Q 039705 168 GRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFV-FLSTDGNLFIFANDRSILLN--PETNEILHVFPILRGGS 243 (539)
Q Consensus 168 G~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~-~~~~~G~Iyv~Gg~~~e~yD--p~tn~W~~~~p~mp~~~ 243 (539)
-.. . ++ ...+..+|...+..... + --++. ..+.++.+.+++... .+|- -.-.+|........+
T Consensus 79 ~~g---~-ll~T~DgG~tW~~v~l~~~l----p--gs~~~i~~l~~~~~~l~~~~G-~iy~T~DgG~tW~~~~~~~~g-- 145 (302)
T PF14870_consen 79 EPG---L-LLHTTDGGKTWERVPLSSKL----P--GSPFGITALGDGSAELAGDRG-AIYRTTDGGKTWQAVVSETSG-- 145 (302)
T ss_dssp ETT---E-EEEESSTTSS-EE----TT-----S--S-EEEEEEEETTEEEEEETT---EEEESSTTSSEEEEE-S-----
T ss_pred CCc---e-EEEecCCCCCcEEeecCCCC----C--CCeeEEEEcCCCcEEEEcCCC-cEEEeCCCCCCeeEcccCCcc--
Confidence 421 1 34 33335688765421110 0 01222 334466677766543 2332 234689732211111
Q ss_pred CccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeeeccCCCceecee
Q 039705 244 RNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQREMMPSPRVMGEM 323 (539)
Q Consensus 244 r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~~M~~~R~~~~~ 323 (539)
.. ...... .+|++++++-.. .. ....|+. ...|+.-..+..|..-++
T Consensus 146 -s~---~~~~r~-----------~dG~~vavs~~G--~~--------------~~s~~~G--~~~w~~~~r~~~~riq~~ 192 (302)
T PF14870_consen 146 -SI---NDITRS-----------SDGRYVAVSSRG--NF--------------YSSWDPG--QTTWQPHNRNSSRRIQSM 192 (302)
T ss_dssp --E---EEEEE------------TTS-EEEEETTS--SE--------------EEEE-TT---SS-EEEE--SSS-EEEE
T ss_pred -ee---EeEEEC-----------CCCcEEEEECcc--cE--------------EEEecCC--CccceEEccCccceehhc
Confidence 00 011111 278888887542 11 1123433 356988766655555566
Q ss_pred EEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCcccee--eEeeCCCCeEEEecCCC
Q 039705 324 LLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHS--TSVVLPDGKILVAGSNP 398 (539)
Q Consensus 324 vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs--~a~llpdG~V~v~GG~~ 398 (539)
..-+|+.++++. + |.. ..+....+.+++|+.- ..++.....+ .....+++.|+++||.-
T Consensus 193 gf~~~~~lw~~~--~-Gg~------------~~~s~~~~~~~~w~~~-~~~~~~~~~~~ld~a~~~~~~~wa~gg~G 253 (302)
T PF14870_consen 193 GFSPDGNLWMLA--R-GGQ------------IQFSDDPDDGETWSEP-IIPIKTNGYGILDLAYRPPNEIWAVGGSG 253 (302)
T ss_dssp EE-TTS-EEEEE--T-TTE------------EEEEE-TTEEEEE----B-TTSS--S-EEEEEESSSS-EEEEESTT
T ss_pred eecCCCCEEEEe--C-CcE------------EEEccCCCCccccccc-cCCcccCceeeEEEEecCCCCEEEEeCCc
Confidence 677999998865 1 210 1122223333478772 1233222222 23446899999999974
No 69
>KOG2437 consensus Muskelin [Signal transduction mechanisms]
Probab=93.19 E-value=0.11 Score=55.17 Aligned_cols=129 Identities=9% Similarity=0.075 Sum_probs=84.7
Q ss_pred CcccccceecCCC--cEEEecCCCC--CCCeEEEEeCCCCccceeeccc--ccccccccceeEEc-cCCcEEEEcCccC-
Q 039705 100 DTWSSSGGLSANG--TIVISGGWSS--RGRSVRYLSGCYHACYWKEHHW--ELSAKRWFSTQHIL-PDGSFIVVGGRRE- 171 (539)
Q Consensus 100 ~~~c~~~~~l~dG--~l~v~GG~~~--g~~~v~~ydP~~~t~~W~~~~~--~m~~~R~y~s~~~L-~dG~VyvvGG~~~- 171 (539)
.++..++.+.-++ .||+-|||+. .....+.|+-. .+.|+.... ..+..|..|-++.= +..|+|.+|-.-.
T Consensus 259 ~~RgGHQMV~~~~~~CiYLYGGWdG~~~l~DFW~Y~v~--e~~W~~iN~~t~~PG~RsCHRMVid~S~~KLYLlG~Y~~s 336 (723)
T KOG2437|consen 259 GMRGGHQMVIDVQTECVYLYGGWDGTQDLADFWAYSVK--ENQWTCINRDTEGPGARSCHRMVIDISRRKLYLLGRYLDS 336 (723)
T ss_pred cccCcceEEEeCCCcEEEEecCcccchhHHHHHhhcCC--cceeEEeecCCCCCcchhhhhhhhhhhHhHHhhhhhcccc
Confidence 3455555665555 9999999952 24566788877 889997642 24566776766542 2348999986422
Q ss_pred ---------CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCc--EEEEEcCc----------eeEeeCCC
Q 039705 172 ---------FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGN--LFIFANDR----------SILLNPET 229 (539)
Q Consensus 172 ---------~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~--Iyv~Gg~~----------~e~yDp~t 229 (539)
...+.| -.++ .|..+.+-.. .|..+.+.|-|.+++...| |||+||+. ...||...
T Consensus 337 S~r~~~s~RsDfW~FDi~~~--~W~~ls~dt~-~dGGP~~vfDHqM~Vd~~k~~iyVfGGr~~~~~e~~f~GLYaf~~~~ 413 (723)
T KOG2437|consen 337 SVRNSKSLRSDFWRFDIDTN--TWMLLSEDTA-ADGGPKLVFDHQMCVDSEKHMIYVFGGRILTCNEPQFSGLYAFNCQC 413 (723)
T ss_pred ccccccccccceEEEecCCc--eeEEeccccc-ccCCcceeecceeeEecCcceEEEecCeeccCCCccccceEEEecCC
Confidence 134555 4455 7877665332 3445677888888887666 99999963 24678777
Q ss_pred CeEE
Q 039705 230 NEIL 233 (539)
Q Consensus 230 n~W~ 233 (539)
..|.
T Consensus 414 ~~w~ 417 (723)
T KOG2437|consen 414 QTWK 417 (723)
T ss_pred ccHH
Confidence 7886
No 70
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybrid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=93.18 E-value=2.4 Score=41.16 Aligned_cols=152 Identities=18% Similarity=0.218 Sum_probs=81.0
Q ss_pred CCcEEEEEcCceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcc
Q 039705 211 DGNLFIFANDRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEF 290 (539)
Q Consensus 211 ~G~Iyv~Gg~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~ 290 (539)
||-|.+.......++||.|.+|. .+|+.+. ++.++... ...+...+ ..+ +=||+.+..... .
T Consensus 5 nGLlc~~~~~~~~V~NP~T~~~~-~LP~~~~-~~~~~~~~-~~~~G~d~--~~~---~YKVv~~~~~~~-~--------- 66 (230)
T TIGR01640 5 DGLICFSYGKRLVVWNPSTGQSR-WLPTPKS-RRSNKESD-TYFLGYDP--IEK---QYKVLCFSDRSG-N--------- 66 (230)
T ss_pred ceEEEEecCCcEEEECCCCCCEE-ecCCCCC-cccccccc-eEEEeecc--cCC---cEEEEEEEeecC-C---------
Confidence 45554444455678999999997 7876543 22222111 11222221 112 347777754310 0
Q ss_pred cccCCceEEEEeeCCCCceeee-ccCC-CceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceE
Q 039705 291 MNALQDCGRIEITNKSATWQRE-MMPS-PRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFS 368 (539)
Q Consensus 291 ~~a~~s~~~~d~~~~~~~W~~~-~M~~-~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt 368 (539)
.....++.|+.. +++|... ..+. .......|. .||.||-+.-...+ ++...+..||-++. +|+
T Consensus 67 -~~~~~~~Vys~~--~~~Wr~~~~~~~~~~~~~~~v~-~~G~lyw~~~~~~~--------~~~~~IvsFDl~~E---~f~ 131 (230)
T TIGR01640 67 -RNQSEHQVYTLG--SNSWRTIECSPPHHPLKSRGVC-INGVLYYLAYTLKT--------NPDYFIVSFDVSSE---RFK 131 (230)
T ss_pred -CCCccEEEEEeC--CCCccccccCCCCccccCCeEE-ECCEEEEEEEECCC--------CCcEEEEEEEcccc---eEe
Confidence 012468899987 5799985 3221 111112354 49999998743211 11125788999999 999
Q ss_pred ecCCCCCCcc----ceeeEeeCCCCeEEEecCC
Q 039705 369 ELTPTSKPRM----CHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 369 ~~a~~~~~R~----yhs~a~llpdG~V~v~GG~ 397 (539)
..-+++..+. +...+.+ +|++-++...
T Consensus 132 ~~i~~P~~~~~~~~~~~L~~~--~G~L~~v~~~ 162 (230)
T TIGR01640 132 EFIPLPCGNSDSVDYLSLINY--KGKLAVLKQK 162 (230)
T ss_pred eeeecCccccccccceEEEEE--CCEEEEEEec
Confidence 5223343332 1223333 7888776553
No 71
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=93.13 E-value=10 Score=37.97 Aligned_cols=147 Identities=13% Similarity=0.079 Sum_probs=75.9
Q ss_pred EEEEECCCCCEEeCccCC---CcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEc
Q 039705 82 AVEYDAESAAIRPLKILT---DTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHIL 158 (539)
Q Consensus 82 ~~~yDp~t~~w~~l~~~~---~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L 158 (539)
..++|+++.+.+..+... +.--...++-.+|.|..+|=..- --.+||. ++.-+..+. +.+-.-...|+.
T Consensus 126 I~R~dpkt~evt~f~lp~~~a~~nlet~vfD~~G~lWFt~q~G~----yGrLdPa--~~~i~vfpa--PqG~gpyGi~at 197 (353)
T COG4257 126 IGRLDPKTLEVTRFPLPLEHADANLETAVFDPWGNLWFTGQIGA----YGRLDPA--RNVISVFPA--PQGGGPYGICAT 197 (353)
T ss_pred eEEecCcccceEEeecccccCCCcccceeeCCCccEEEeecccc----ceecCcc--cCceeeecc--CCCCCCcceEEC
Confidence 457899998888765432 22334556677899999884210 1145665 443332221 122223467888
Q ss_pred cCCcEEEEcCccCCeEEEEecCCCcceeeccCccc-cCCCCCCCCcceEEEeeCCcEEEE--EcCceeEeeCCCCeEEEE
Q 039705 159 PDGSFIVVGGRREFSYEYILKEGKRIIYDLPILNE-TTNPSENNLYPFVFLSTDGNLFIF--ANDRSILLNPETNEILHV 235 (539)
Q Consensus 159 ~dG~VyvvGG~~~~~~E~yP~~~~~~w~~~~~l~~-~~~~~~~~~yp~~~~~~~G~Iyv~--Gg~~~e~yDp~tn~W~~~ 235 (539)
+||+|+...=..+.-..+-|... .....+.... ..+. -..+.-+-|++++. |+-+..+|||.+..|. .
T Consensus 198 pdGsvwyaslagnaiaridp~~~--~aev~p~P~~~~~gs------Rriwsdpig~~wittwg~g~l~rfdPs~~sW~-e 268 (353)
T COG4257 198 PDGSVWYASLAGNAIARIDPFAG--HAEVVPQPNALKAGS------RRIWSDPIGRAWITTWGTGSLHRFDPSVTSWI-E 268 (353)
T ss_pred CCCcEEEEeccccceEEcccccC--CcceecCCCcccccc------cccccCccCcEEEeccCCceeeEeCcccccce-e
Confidence 89999998322111111112221 1111111101 0000 01244456777765 4456789999999997 4
Q ss_pred cccCCCC-CCcc
Q 039705 236 FPILRGG-SRNY 246 (539)
Q Consensus 236 ~p~mp~~-~r~y 246 (539)
. +||+. +|-|
T Consensus 269 y-pLPgs~arpy 279 (353)
T COG4257 269 Y-PLPGSKARPY 279 (353)
T ss_pred e-eCCCCCCCcc
Confidence 4 45553 4444
No 72
>PLN00181 protein SPA1-RELATED; Provisional
Probab=92.41 E-value=15 Score=42.88 Aligned_cols=230 Identities=12% Similarity=0.086 Sum_probs=109.8
Q ss_pred ceecCCCcEEEecCCCCCCCeEEEEeCCCCcc--ceeec--cc-ccccccccceeEEcc-CCcEEEEcCccCCeEEEE-e
Q 039705 106 GGLSANGTIVISGGWSSRGRSVRYLSGCYHAC--YWKEH--HW-ELSAKRWFSTQHILP-DGSFIVVGGRREFSYEYI-L 178 (539)
Q Consensus 106 ~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~--~W~~~--~~-~m~~~R~y~s~~~L~-dG~VyvvGG~~~~~~E~y-P 178 (539)
..+.+||.++++||.+ ..+++||.. +. ..... +. .+.....-.+.+..+ ++..++.|+.+ .++.+| .
T Consensus 489 i~fs~dg~~latgg~D---~~I~iwd~~--~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~D-g~v~lWd~ 562 (793)
T PLN00181 489 IGFDRDGEFFATAGVN---KKIKIFECE--SIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFE-GVVQVWDV 562 (793)
T ss_pred EEECCCCCEEEEEeCC---CEEEEEECC--cccccccccccceEEecccCceeeEEeccCCCCEEEEEeCC-CeEEEEEC
Confidence 3456799999999863 578889864 21 11110 00 011110001111111 35667777665 456677 3
Q ss_pred cCCCcceeeccCccccCCCCCCCCcceEEEe--eCCcEEEEEcCc--eeEeeCCCCeEEEEcccCCCCCCccCCCccEEe
Q 039705 179 KEGKRIIYDLPILNETTNPSENNLYPFVFLS--TDGNLFIFANDR--SILLNPETNEILHVFPILRGGSRNYPASATSAL 254 (539)
Q Consensus 179 ~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~--~~G~Iyv~Gg~~--~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~ 254 (539)
.+. +.. ..+....+ .. ..+.. .+|.+++.|+.+ +.+||..+..-...+ .. ... -.++.
T Consensus 563 ~~~--~~~--~~~~~H~~----~V--~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~---~~-~~~----v~~v~ 624 (793)
T PLN00181 563 ARS--QLV--TEMKEHEK----RV--WSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTI---KT-KAN----ICCVQ 624 (793)
T ss_pred CCC--eEE--EEecCCCC----CE--EEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEEEE---ec-CCC----eEEEE
Confidence 332 211 11111111 11 11222 378888888754 778998876533222 11 101 11222
Q ss_pred cccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeeeccCCCceeceeEEecCCcEEEE
Q 039705 255 LPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQREMMPSPRVMGEMLLLPTGDVLII 334 (539)
Q Consensus 255 lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~~M~~~R~~~~~vvlpdG~I~vv 334 (539)
+.. .+++++++|+.+. .+..||+...........-..... ..+...++..++.
T Consensus 625 ~~~---------~~g~~latgs~dg----------------~I~iwD~~~~~~~~~~~~~h~~~V--~~v~f~~~~~lvs 677 (793)
T PLN00181 625 FPS---------ESGRSLAFGSADH----------------KVYYYDLRNPKLPLCTMIGHSKTV--SYVRFVDSSTLVS 677 (793)
T ss_pred EeC---------CCCCEEEEEeCCC----------------eEEEEECCCCCccceEecCCCCCE--EEEEEeCCCEEEE
Confidence 210 2578888888651 356777653111111111111111 2234458888888
Q ss_pred cCcCCCCCCcccCCCCCCccEEEcCCCCC-CCceEecCCCCCCccceeeEeeCCCCeEEEecCCCC
Q 039705 335 NGAKKGTAGWNFATDPNTTPVLYEPDDPI-NERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 335 GG~~~g~~g~~~~~~~~~~~e~YdP~t~~-g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~ 399 (539)
++.+ + ++-+||..+.. +..|..+..............+.++|+.+++|+.++
T Consensus 678 ~s~D-~------------~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs~D~ 730 (793)
T PLN00181 678 SSTD-N------------TLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGSETN 730 (793)
T ss_pred EECC-C------------EEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEeCCC
Confidence 8765 1 57889986531 112332221111011112344678999999999644
No 73
>PF13854 Kelch_5: Kelch motif
Probab=92.33 E-value=0.16 Score=35.58 Aligned_cols=25 Identities=32% Similarity=0.472 Sum_probs=21.7
Q ss_pred CCCCccceeeEeeCCCCeEEEecCCCC
Q 039705 373 TSKPRMCHSTSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 373 ~~~~R~yhs~a~llpdG~V~v~GG~~~ 399 (539)
+|.+|..|++++. +++||+.||...
T Consensus 1 ~P~~R~~hs~~~~--~~~iyi~GG~~~ 25 (42)
T PF13854_consen 1 IPSPRYGHSAVVV--GNNIYIFGGYSG 25 (42)
T ss_pred CCCCccceEEEEE--CCEEEEEcCccC
Confidence 3678999998887 899999999874
No 74
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=92.16 E-value=6.3 Score=40.38 Aligned_cols=161 Identities=14% Similarity=0.252 Sum_probs=78.7
Q ss_pred CceEEccCCCcccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEeCccCC
Q 039705 20 GKWELASENSGISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRPLKILT 99 (539)
Q Consensus 20 g~W~~~~~~~~~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~l~~~~ 99 (539)
.+|+.+.. .....+.......+|++++++... . ....+|+-...|++.....
T Consensus 134 ~tW~~~~~-~~~gs~~~~~r~~dG~~vavs~~G--~-------------------------~~~s~~~G~~~w~~~~r~~ 185 (302)
T PF14870_consen 134 KTWQAVVS-ETSGSINDITRSSDGRYVAVSSRG--N-------------------------FYSSWDPGQTTWQPHNRNS 185 (302)
T ss_dssp SSEEEEE--S----EEEEEE-TTS-EEEEETTS--S-------------------------EEEEE-TT-SS-EEEE--S
T ss_pred CCeeEccc-CCcceeEeEEECCCCcEEEEECcc--c-------------------------EEEEecCCCccceEEccCc
Confidence 47999864 333444443333899988887531 1 2346788888899887665
Q ss_pred CcccccceecCCCcEEEecCCCCCCCeEEEEe-CCCCccceeecccccccccc-cceeEEccCCcEEEEcCccCCeEEEE
Q 039705 100 DTWSSSGGLSANGTIVISGGWSSRGRSVRYLS-GCYHACYWKEHHWELSAKRW-FSTQHILPDGSFIVVGGRREFSYEYI 177 (539)
Q Consensus 100 ~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~yd-P~~~t~~W~~~~~~m~~~R~-y~s~~~L~dG~VyvvGG~~~~~~E~y 177 (539)
..+-..-.+.+|+.++++. .. -.+++=+ +. ...+|.+.........+ +..++.-.++.++++||... +|
T Consensus 186 ~~riq~~gf~~~~~lw~~~-~G---g~~~~s~~~~-~~~~w~~~~~~~~~~~~~~ld~a~~~~~~~wa~gg~G~----l~ 256 (302)
T PF14870_consen 186 SRRIQSMGFSPDGNLWMLA-RG---GQIQFSDDPD-DGETWSEPIIPIKTNGYGILDLAYRPPNEIWAVGGSGT----LL 256 (302)
T ss_dssp SS-EEEEEE-TTS-EEEEE-TT---TEEEEEE-TT-EEEEE---B-TTSS--S-EEEEEESSSS-EEEEESTT-----EE
T ss_pred cceehhceecCCCCEEEEe-CC---cEEEEccCCC-CccccccccCCcccCceeeEEEEecCCCCEEEEeCCcc----EE
Confidence 5665556667888887764 11 1233333 33 26789873221322333 35667777899999999742 44
Q ss_pred -ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCce
Q 039705 178 -LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRS 222 (539)
Q Consensus 178 -P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~ 222 (539)
...+..+|...+.... .+.|+|-..+. .+.+-|++|.+-+
T Consensus 257 ~S~DgGktW~~~~~~~~----~~~n~~~i~f~-~~~~gf~lG~~G~ 297 (302)
T PF14870_consen 257 VSTDGGKTWQKDRVGEN----VPSNLYRIVFV-NPDKGFVLGQDGV 297 (302)
T ss_dssp EESSTTSS-EE-GGGTT----SSS---EEEEE-ETTEEEEE-STTE
T ss_pred EeCCCCccceECccccC----CCCceEEEEEc-CCCceEEECCCcE
Confidence 4444568865543222 23577754333 5679999997654
No 75
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=92.07 E-value=5.1 Score=39.21 Aligned_cols=156 Identities=19% Similarity=0.206 Sum_probs=79.3
Q ss_pred eCCcEEEEEcCceeEeeCCCCeEEEEcccCCCC--CCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCC
Q 039705 210 TDGNLFIFANDRSILLNPETNEILHVFPILRGG--SRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGK 287 (539)
Q Consensus 210 ~~G~Iyv~Gg~~~e~yDp~tn~W~~~~p~mp~~--~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~ 287 (539)
.+|++|+.......++|+.+++++ .+...+.. ....|. -.++- .+|+||+.--.......
T Consensus 50 ~~g~l~v~~~~~~~~~d~~~g~~~-~~~~~~~~~~~~~~~N--D~~vd-----------~~G~ly~t~~~~~~~~~---- 111 (246)
T PF08450_consen 50 PDGRLYVADSGGIAVVDPDTGKVT-VLADLPDGGVPFNRPN--DVAVD-----------PDGNLYVTDSGGGGASG---- 111 (246)
T ss_dssp TTSEEEEEETTCEEEEETTTTEEE-EEEEEETTCSCTEEEE--EEEE------------TTS-EEEEEECCBCTTC----
T ss_pred cCCEEEEEEcCceEEEecCCCcEE-EEeeccCCCcccCCCc--eEEEc-----------CCCCEEEEecCCCcccc----
Confidence 489999998888888999999987 55554321 111111 11221 27888886422110000
Q ss_pred CcccccCCceEEEEeeCCCCceeee--ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCC
Q 039705 288 GEFMNALQDCGRIEITNKSATWQRE--MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINE 365 (539)
Q Consensus 288 ~~~~~a~~s~~~~d~~~~~~~W~~~--~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~ 365 (539)
.....+.++++. .+.+.. .|..+. +.+.-+||+.+.+.-.. ...+..||++.+.+
T Consensus 112 ----~~~g~v~~~~~~---~~~~~~~~~~~~pN---Gi~~s~dg~~lyv~ds~------------~~~i~~~~~~~~~~- 168 (246)
T PF08450_consen 112 ----IDPGSVYRIDPD---GKVTVVADGLGFPN---GIAFSPDGKTLYVADSF------------NGRIWRFDLDADGG- 168 (246)
T ss_dssp ----GGSEEEEEEETT---SEEEEEEEEESSEE---EEEEETTSSEEEEEETT------------TTEEEEEEEETTTC-
T ss_pred ----ccccceEEECCC---CeEEEEecCccccc---ceEECCcchheeecccc------------cceeEEEecccccc-
Confidence 000346667653 344432 454432 33556899754432211 12578888875521
Q ss_pred ceE---ecCCCCCCc-cceeeEeeCCCCeEEEecCCCCCCCccCCCCCCCcceeeEEecCC
Q 039705 366 RFS---ELTPTSKPR-MCHSTSVVLPDGKILVAGSNPHSRYNLTSGSKYPTELRIEKFYPP 422 (539)
Q Consensus 366 ~Wt---~~a~~~~~R-~yhs~a~llpdG~V~v~GG~~~~~~~~~~~~~~p~~~~vE~y~Pp 422 (539)
+++ ......... .--+. ++-.+|+|||+--. ..+|.+|+|.
T Consensus 169 ~~~~~~~~~~~~~~~g~pDG~-~vD~~G~l~va~~~---------------~~~I~~~~p~ 213 (246)
T PF08450_consen 169 ELSNRRVFIDFPGGPGYPDGL-AVDSDGNLWVADWG---------------GGRIVVFDPD 213 (246)
T ss_dssp CEEEEEEEEE-SSSSCEEEEE-EEBTTS-EEEEEET---------------TTEEEEEETT
T ss_pred ceeeeeeEEEcCCCCcCCCcc-eEcCCCCEEEEEcC---------------CCEEEEECCC
Confidence 232 122222221 22344 45679999998321 1267788876
No 76
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=90.80 E-value=18 Score=35.74 Aligned_cols=218 Identities=14% Similarity=0.160 Sum_probs=109.2
Q ss_pred cceeEEEEECCCCCEEeCccC--CCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccc--
Q 039705 78 YRALAVEYDAESAAIRPLKIL--TDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFS-- 153 (539)
Q Consensus 78 ~~~~~~~yDp~t~~w~~l~~~--~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~-- 153 (539)
+..++.+||..++.=.|+... ++.--....+-.||+++.+||.+ -.++++|-. . +...|-|.
T Consensus 59 ~~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseD---gt~kIWdlR--~---------~~~qR~~~~~ 124 (311)
T KOG0315|consen 59 GNQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSED---GTVKIWDLR--S---------LSCQRNYQHN 124 (311)
T ss_pred cCCeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCC---ceEEEEecc--C---------cccchhccCC
Confidence 356788999988765444322 11111233456799999999963 467788865 2 22223322
Q ss_pred ----eeEEccCCcEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCceeEee--
Q 039705 154 ----TQHILPDGSFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRSILLN-- 226 (539)
Q Consensus 154 ----s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~e~yD-- 226 (539)
+++.-++.--+++| ..+..+.+| -.++ .-.... +++. + . .--...+.+||+..+.+++...+|=
T Consensus 125 spVn~vvlhpnQteLis~-dqsg~irvWDl~~~--~c~~~l-iPe~-~-~---~i~sl~v~~dgsml~a~nnkG~cyvW~ 195 (311)
T KOG0315|consen 125 SPVNTVVLHPNQTELISG-DQSGNIRVWDLGEN--SCTHEL-IPED-D-T---SIQSLTVMPDGSMLAAANNKGNCYVWR 195 (311)
T ss_pred CCcceEEecCCcceEEee-cCCCcEEEEEccCC--cccccc-CCCC-C-c---ceeeEEEcCCCcEEEEecCCccEEEEE
Confidence 23333333333444 333456666 4433 221111 1110 0 0 0012356789999988887655543
Q ss_pred CCCCeEEEEcccCCCC-CCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCC
Q 039705 227 PETNEILHVFPILRGG-SRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNK 305 (539)
Q Consensus 227 p~tn~W~~~~p~mp~~-~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~ 305 (539)
.-+..-...+-|+..- .++ ..+...+| .| ++|.++.-+++. +|..++.+
T Consensus 196 l~~~~~~s~l~P~~k~~ah~--~~il~C~l--SP--------d~k~lat~ssdk----------------tv~iwn~~-- 245 (311)
T KOG0315|consen 196 LLNHQTASELEPVHKFQAHN--GHILRCLL--SP--------DVKYLATCSSDK----------------TVKIWNTD-- 245 (311)
T ss_pred ccCCCccccceEhhheeccc--ceEEEEEE--CC--------CCcEEEeecCCc----------------eEEEEecC--
Confidence 3232211123222210 111 01222222 22 778888777652 34445433
Q ss_pred CCceeee-ccC-CCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCC
Q 039705 306 SATWQRE-MMP-SPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDP 362 (539)
Q Consensus 306 ~~~W~~~-~M~-~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~ 362 (539)
+-...+ .+. ..|..-+.+--.||+-+|+|+.+. .+.+||++++
T Consensus 246 -~~~kle~~l~gh~rWvWdc~FS~dg~YlvTassd~-------------~~rlW~~~~~ 290 (311)
T KOG0315|consen 246 -DFFKLELVLTGHQRWVWDCAFSADGEYLVTASSDH-------------TARLWDLSAG 290 (311)
T ss_pred -CceeeEEEeecCCceEEeeeeccCccEEEecCCCC-------------ceeecccccC
Confidence 112222 222 224444545566999999988662 5788999887
No 77
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=90.30 E-value=10 Score=37.03 Aligned_cols=140 Identities=14% Similarity=0.133 Sum_probs=74.1
Q ss_pred EEEECCCCCEEeCccC-----CCcccccceecCCCcEEEecCCC--C-CC--CeEEEEeCCCCccceeeccccccccccc
Q 039705 83 VEYDAESAAIRPLKIL-----TDTWSSSGGLSANGTIVISGGWS--S-RG--RSVRYLSGCYHACYWKEHHWELSAKRWF 152 (539)
Q Consensus 83 ~~yDp~t~~w~~l~~~-----~~~~c~~~~~l~dG~l~v~GG~~--~-g~--~~v~~ydP~~~t~~W~~~~~~m~~~R~y 152 (539)
.++|+.+++++.+... ...++...++..+|++|+.--.. . .. ..+.++++. .+...+...|..+
T Consensus 63 ~~~d~~~g~~~~~~~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~---~~~~~~~~~~~~p--- 136 (246)
T PF08450_consen 63 AVVDPDTGKVTVLADLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD---GKVTVVADGLGFP--- 136 (246)
T ss_dssp EEEETTTTEEEEEEEEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT---SEEEEEEEEESSE---
T ss_pred EEEecCCCcEEEEeeccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC---CeEEEEecCcccc---
Confidence 3459999998877644 34567778888999999864221 1 11 457888885 3333333223322
Q ss_pred ceeEEccCCcEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEe-eCCcEEEE--EcCceeEeeCC
Q 039705 153 STQHILPDGSFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLS-TDGNLFIF--ANDRSILLNPE 228 (539)
Q Consensus 153 ~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~-~~G~Iyv~--Gg~~~e~yDp~ 228 (539)
.+.+.-+||+.+.+.-+....+..| .......+........... ..-+|-.+++ .+|+||+. ++..+.+|||.
T Consensus 137 NGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~---~~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~ 213 (246)
T PF08450_consen 137 NGIAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRRVFIDFPG---GPGYPDGLAVDSDGNLWVADWGGGRIVVFDPD 213 (246)
T ss_dssp EEEEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SS---SSCEEEEEEEBTTS-EEEEEETTTEEEEEETT
T ss_pred cceEECCcchheeecccccceeEEEeccccccceeeeeeEEEcCC---CCcCCCcceEcCCCCEEEEEcCCCEEEEECCC
Confidence 2455566887555443333445566 4322111211110000000 0113444444 58999998 56789999999
Q ss_pred CCe
Q 039705 229 TNE 231 (539)
Q Consensus 229 tn~ 231 (539)
...
T Consensus 214 G~~ 216 (246)
T PF08450_consen 214 GKL 216 (246)
T ss_dssp SCE
T ss_pred ccE
Confidence 443
No 78
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=90.00 E-value=21 Score=35.27 Aligned_cols=224 Identities=14% Similarity=0.153 Sum_probs=108.9
Q ss_pred ecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccccccccccccee-EEccCCcEEEEcCccCCeEEEEecCCCccee
Q 039705 108 LSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQ-HILPDGSFIVVGGRREFSYEYILKEGKRIIY 186 (539)
Q Consensus 108 ~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~-~~L~dG~VyvvGG~~~~~~E~yP~~~~~~w~ 186 (539)
..+|++.++++|+ ..|++||-. ++.=..++. ....+-.-++ ..-.||+-...||.++ ++.+|....-..-+
T Consensus 48 iTpdk~~LAaa~~----qhvRlyD~~--S~np~Pv~t-~e~h~kNVtaVgF~~dgrWMyTgseDg-t~kIWdlR~~~~qR 119 (311)
T KOG0315|consen 48 ITPDKKDLAAAGN----QHVRLYDLN--SNNPNPVAT-FEGHTKNVTAVGFQCDGRWMYTGSEDG-TVKIWDLRSLSCQR 119 (311)
T ss_pred EcCCcchhhhccC----CeeEEEEcc--CCCCCceeE-EeccCCceEEEEEeecCeEEEecCCCc-eEEEEeccCcccch
Confidence 3579999999987 579999987 543212211 1122222222 2334899999998774 44444111100000
Q ss_pred eccCccccCCCCCCCCcceEEEeeCCcEEEEEcC--ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCC
Q 039705 187 DLPILNETTNPSENNLYPFVFLSTDGNLFIFAND--RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNS 264 (539)
Q Consensus 187 ~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~--~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~ 264 (539)
........ | ..++-++.-=.++|.+ .+++||..+|.-+.. .||.. -. +-...+++ |
T Consensus 120 ~~~~~spV------n---~vvlhpnQteLis~dqsg~irvWDl~~~~c~~~--liPe~-~~-~i~sl~v~-~-------- 177 (311)
T KOG0315|consen 120 NYQHNSPV------N---TVVLHPNQTELISGDQSGNIRVWDLGENSCTHE--LIPED-DT-SIQSLTVM-P-------- 177 (311)
T ss_pred hccCCCCc------c---eEEecCCcceEEeecCCCcEEEEEccCCccccc--cCCCC-Cc-ceeeEEEc-C--------
Confidence 00100000 0 1122244333344544 478899999865432 35542 11 11122332 2
Q ss_pred CCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCC--Cceeee---ccCCCceeceeEEecCCcEEEEcCcCC
Q 039705 265 NAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKS--ATWQRE---MMPSPRVMGEMLLLPTGDVLIINGAKK 339 (539)
Q Consensus 265 ~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~--~~W~~~---~M~~~R~~~~~vvlpdG~I~vvGG~~~ 339 (539)
+|+.++. ..+.| +|.+.++.... ..-+.. .|.... ......-||+|.++.-+.++
T Consensus 178 ---dgsml~a-~nnkG---------------~cyvW~l~~~~~~s~l~P~~k~~ah~~~-il~C~lSPd~k~lat~ssdk 237 (311)
T KOG0315|consen 178 ---DGSMLAA-ANNKG---------------NCYVWRLLNHQTASELEPVHKFQAHNGH-ILRCLLSPDVKYLATCSSDK 237 (311)
T ss_pred ---CCcEEEE-ecCCc---------------cEEEEEccCCCccccceEhhheecccce-EEEEEECCCCcEEEeecCCc
Confidence 6675554 33322 35566553200 011111 233322 22335679999999988763
Q ss_pred CCCCcccCCCCCCccEEEcCCCCCCCceEe--cCCCCCCccceeeEeeCCCCeEEEecCCCCC
Q 039705 340 GTAGWNFATDPNTTPVLYEPDDPINERFSE--LTPTSKPRMCHSTSVVLPDGKILVAGSNPHS 400 (539)
Q Consensus 340 g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~--~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~ 400 (539)
++.+|+-+.- +.. .-.-...-+.- +++-.||+-+|.|+.++.
T Consensus 238 -------------tv~iwn~~~~----~kle~~l~gh~rWvWd--c~FS~dg~YlvTassd~~ 281 (311)
T KOG0315|consen 238 -------------TVKIWNTDDF----FKLELVLTGHQRWVWD--CAFSADGEYLVTASSDHT 281 (311)
T ss_pred -------------eEEEEecCCc----eeeEEEeecCCceEEe--eeeccCccEEEecCCCCc
Confidence 5777776554 111 00111111222 345569999999998643
No 79
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=89.94 E-value=5.7 Score=41.41 Aligned_cols=118 Identities=9% Similarity=0.017 Sum_probs=72.2
Q ss_pred cCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccCC---------eEEEE-e
Q 039705 109 SANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRREF---------SYEYI-L 178 (539)
Q Consensus 109 l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~---------~~E~y-P 178 (539)
+.+.+|+.++.. ..+-+||+. +..-..+|. |..+..++.++.+ +++|||+...... ..|.+ -
T Consensus 74 l~gskIv~~d~~----~~t~vyDt~--t~av~~~P~-l~~pk~~pisv~V-G~~LY~m~~~~~~~~~~~~~~~~FE~l~~ 145 (342)
T PF07893_consen 74 LHGSKIVAVDQS----GRTLVYDTD--TRAVATGPR-LHSPKRCPISVSV-GDKLYAMDRSPFPEPAGRPDFPCFEALVY 145 (342)
T ss_pred ecCCeEEEEcCC----CCeEEEECC--CCeEeccCC-CCCCCcceEEEEe-CCeEEEeeccCccccccCccceeEEEecc
Confidence 357788888654 347799998 887777776 8888888877777 6789999876321 55665 1
Q ss_pred c--------CCCcceeeccCccccCCCCCCCCcceEEEee-CCcEEEEEc-C--ceeEeeCCCCeEEE
Q 039705 179 K--------EGKRIIYDLPILNETTNPSENNLYPFVFLST-DGNLFIFAN-D--RSILLNPETNEILH 234 (539)
Q Consensus 179 ~--------~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~-~G~Iyv~Gg-~--~~e~yDp~tn~W~~ 234 (539)
. .....|..+|..+-..+.......-...++. +..|||.-. . .+..||..+.+|.+
T Consensus 146 ~~~~~~~~~~~~w~W~~LP~PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~~~GTysfDt~~~~W~~ 213 (342)
T PF07893_consen 146 RPPPDDPSPEESWSWRSLPPPPFVRDRRYSDYRITSYAVVDGRTIFVSVNGRRWGTYSFDTESHEWRK 213 (342)
T ss_pred ccccccccCCCcceEEcCCCCCccccCCcccceEEEEEEecCCeEEEEecCCceEEEEEEcCCcceee
Confidence 1 1122455554322111110000002234455 457888443 4 58899999999984
No 80
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=88.95 E-value=22 Score=33.99 Aligned_cols=107 Identities=15% Similarity=0.211 Sum_probs=56.7
Q ss_pred cCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCcceee
Q 039705 109 SANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKRIIYD 187 (539)
Q Consensus 109 l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~ 187 (539)
..++++|+..+ ...+.+||+.++.-.|+.- +.. +-.... .+.+++||+..... .+..+ .+++...|..
T Consensus 34 ~~~~~v~~~~~----~~~l~~~d~~tG~~~W~~~---~~~-~~~~~~-~~~~~~v~v~~~~~--~l~~~d~~tG~~~W~~ 102 (238)
T PF13360_consen 34 PDGGRVYVASG----DGNLYALDAKTGKVLWRFD---LPG-PISGAP-VVDGGRVYVGTSDG--SLYALDAKTGKVLWSI 102 (238)
T ss_dssp EETTEEEEEET----TSEEEEEETTTSEEEEEEE---CSS-CGGSGE-EEETTEEEEEETTS--EEEEEETTTSCEEEEE
T ss_pred EeCCEEEEEcC----CCEEEEEECCCCCEEEEee---ccc-ccccee-eeccccccccccee--eeEecccCCcceeeee
Confidence 36777777743 3678999987445578653 222 111112 44488888876332 44455 4554456752
Q ss_pred -ccCccccCCCCCCCCc-ceEEEeeCCcEEEEE-cCceeEeeCCCCeE
Q 039705 188 -LPILNETTNPSENNLY-PFVFLSTDGNLFIFA-NDRSILLNPETNEI 232 (539)
Q Consensus 188 -~~~l~~~~~~~~~~~y-p~~~~~~~G~Iyv~G-g~~~e~yDp~tn~W 232 (539)
....... ..+ +....+.++++|+.. +....++|+++++-
T Consensus 103 ~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~g~l~~~d~~tG~~ 144 (238)
T PF13360_consen 103 YLTSSPPA------GVRSSSSPAVDGDRLYVGTSSGKLVALDPKTGKL 144 (238)
T ss_dssp EE-SSCTC------STB--SEEEEETTEEEEEETCSEEEEEETTTTEE
T ss_pred cccccccc------ccccccCceEecCEEEEEeccCcEEEEecCCCcE
Confidence 2110110 011 112233345565555 45688999998763
No 81
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=88.67 E-value=34 Score=35.79 Aligned_cols=261 Identities=13% Similarity=0.046 Sum_probs=133.9
Q ss_pred EEEEECCCCC--EEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEcc
Q 039705 82 AVEYDAESAA--IRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILP 159 (539)
Q Consensus 82 ~~~yDp~t~~--w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~ 159 (539)
...+|+.+.+ |+.........+++.....||+||+-... ....+||+.+++..|..-.. .. .+|. +.++..
T Consensus 80 i~A~d~~~g~~~W~~~~~~~~~~~~~~~~~~~G~i~~g~~~----g~~y~ld~~~G~~~W~~~~~-~~-~~~~-~~~v~~ 152 (370)
T COG1520 80 IFALNPDTGLVKWSYPLLGAVAQLSGPILGSDGKIYVGSWD----GKLYALDASTGTLVWSRNVG-GS-PYYA-SPPVVG 152 (370)
T ss_pred EEEEeCCCCcEEecccCcCcceeccCceEEeCCeEEEeccc----ceEEEEECCCCcEEEEEecC-CC-eEEe-cCcEEc
Confidence 3456777766 76544332345666677779998776543 26888998545778987554 22 5653 445555
Q ss_pred CCcEEEEcCccCCeEEEE-ecCCCcceee-ccC-ccccCCCCCCCCcceEEEeeCCcEEEEEcC---ceeEeeCCCC--e
Q 039705 160 DGSFIVVGGRREFSYEYI-LKEGKRIIYD-LPI-LNETTNPSENNLYPFVFLSTDGNLFIFAND---RSILLNPETN--E 231 (539)
Q Consensus 160 dG~VyvvGG~~~~~~E~y-P~~~~~~w~~-~~~-l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~---~~e~yDp~tn--~ 231 (539)
|+.||+.. +...+-.. +.+....|.. .+. +..... .....-+|.+|+-... ....+|++++ .
T Consensus 153 ~~~v~~~s--~~g~~~al~~~tG~~~W~~~~~~~~~~~~~--------~~~~~~~~~vy~~~~~~~~~~~a~~~~~G~~~ 222 (370)
T COG1520 153 DGTVYVGT--DDGHLYALNADTGTLKWTYETPAPLSLSIY--------GSPAIASGTVYVGSDGYDGILYALNAEDGTLK 222 (370)
T ss_pred CcEEEEec--CCCeEEEEEccCCcEEEEEecCCccccccc--------cCceeecceEEEecCCCcceEEEEEccCCcEe
Confidence 89998875 21222222 4433446642 211 111110 1123567888876542 3566788665 4
Q ss_pred EEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceee
Q 039705 232 ILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQR 311 (539)
Q Consensus 232 W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~ 311 (539)
|.+.. ..+. .+. ... ..| .+..+.||+.|+.-.+.+ .....++|..+....|+.
T Consensus 223 w~~~~-~~~~-~~~----~~~-~~~--------~~~~~~v~v~~~~~~~~~-----------~g~~~~l~~~~G~~~W~~ 276 (370)
T COG1520 223 WSQKV-SQTI-GRT----AIS-TTP--------AVDGGPVYVDGGVYAGSY-----------GGKLLCLDADTGELIWSF 276 (370)
T ss_pred eeeee-eccc-Ccc----ccc-ccc--------cccCceEEECCcEEEEec-----------CCeEEEEEcCCCceEEEE
Confidence 55211 1111 111 110 011 013788888887311111 123567776655678987
Q ss_pred e-cc--CCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCC-ceEecCCCCCCccceeeEeeCC
Q 039705 312 E-MM--PSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINE-RFSELTPTSKPRMCHSTSVVLP 387 (539)
Q Consensus 312 ~-~M--~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~-~Wt~~a~~~~~R~yhs~a~llp 387 (539)
. ++ ...+.......--||++|+..-..... ....+.++++..+.-. .|.....- .+.......-
T Consensus 277 ~~~~~~~~~~~~~~~~~~~dG~v~~~~~~~~~~--------~~~~~~~~~~~~g~~~~~w~~~~~g----~~~~~~~~~~ 344 (370)
T COG1520 277 PAGGSVQGSGLYTTPVAGADGKVYIGFTDNDGR--------GSGSLYALADVPGGTLLKWSYPVGG----GYSLSTVAGS 344 (370)
T ss_pred ecccEeccCCeeEEeecCCCccEEEEEeccccc--------cccceEEEeccCCCeeEEEEEeCCC----ceecccceec
Confidence 6 52 223333333333599999976443110 1235677777333111 57654333 2222233344
Q ss_pred CCeEEEecCC
Q 039705 388 DGKILVAGSN 397 (539)
Q Consensus 388 dG~V~v~GG~ 397 (539)
||.+|..+-+
T Consensus 345 ~g~~y~~~~~ 354 (370)
T COG1520 345 DGTLYFGGDD 354 (370)
T ss_pred cCeEEecccC
Confidence 7777776654
No 82
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=86.94 E-value=29 Score=33.12 Aligned_cols=203 Identities=12% Similarity=0.064 Sum_probs=100.6
Q ss_pred CeEEEEeCCCCccceeecccccccccccce-eEEccCCcEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCC
Q 039705 125 RSVRYLSGCYHACYWKEHHWELSAKRWFST-QHILPDGSFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNL 202 (539)
Q Consensus 125 ~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s-~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~ 202 (539)
..+..+|+.++...|+.- +..+..... ..+..++.+|+..+. ..+-++ +.++...|.... .... .
T Consensus 3 g~l~~~d~~tG~~~W~~~---~~~~~~~~~~~~~~~~~~v~~~~~~--~~l~~~d~~tG~~~W~~~~--~~~~------~ 69 (238)
T PF13360_consen 3 GTLSALDPRTGKELWSYD---LGPGIGGPVATAVPDGGRVYVASGD--GNLYALDAKTGKVLWRFDL--PGPI------S 69 (238)
T ss_dssp SEEEEEETTTTEEEEEEE---CSSSCSSEEETEEEETTEEEEEETT--SEEEEEETTTSEEEEEEEC--SSCG------G
T ss_pred CEEEEEECCCCCEEEEEE---CCCCCCCccceEEEeCCEEEEEcCC--CEEEEEECCCCCEEEEeec--cccc------c
Confidence 357889997556678762 422222122 144457888887432 334455 445433464221 1110 0
Q ss_pred cceEEEeeCCcEEEEEc-CceeEeeCCCCe--EEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCC
Q 039705 203 YPFVFLSTDGNLFIFAN-DRSILLNPETNE--ILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKP 279 (539)
Q Consensus 203 yp~~~~~~~G~Iyv~Gg-~~~e~yDp~tn~--W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~ 279 (539)
.+ .+..+++||+... +....+|.++++ |.......+. .... ......+ .++++|+... +
T Consensus 70 ~~--~~~~~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~-~~~~--~~~~~~~-----------~~~~~~~~~~-~- 131 (238)
T PF13360_consen 70 GA--PVVDGGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPP-AGVR--SSSSPAV-----------DGDRLYVGTS-S- 131 (238)
T ss_dssp SG--EEEETTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCT-CSTB----SEEEE-----------ETTEEEEEET-C-
T ss_pred ce--eeecccccccccceeeeEecccCCcceeeeeccccccc-cccc--cccCceE-----------ecCEEEEEec-c-
Confidence 11 3566889988864 468889988765 5411111111 1111 1111111 1455555543 1
Q ss_pred CcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCC--------ceeceeEEecCCcEEEEcCcCCCCCCcccCCCC
Q 039705 280 EAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSP--------RVMGEMLLLPTGDVLIINGAKKGTAGWNFATDP 350 (539)
Q Consensus 280 ~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~--------R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~ 350 (539)
..+..+|+.+....|+.. .++.. ... +.+++.+|+||+..+..
T Consensus 132 ---------------g~l~~~d~~tG~~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~~~~g------------ 183 (238)
T PF13360_consen 132 ---------------GKLVALDPKTGKLLWKYPVGEPRGSSPISSFSDIN-GSPVISDGRVYVSSGDG------------ 183 (238)
T ss_dssp ---------------SEEEEEETTTTEEEEEEESSTT-SS--EEEETTEE-EEEECCTTEEEEECCTS------------
T ss_pred ---------------CcEEEEecCCCcEEEEeecCCCCCCcceeeecccc-cceEEECCEEEEEcCCC------------
Confidence 135678876544668875 44321 112 23455578999877643
Q ss_pred CCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEec
Q 039705 351 NTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAG 395 (539)
Q Consensus 351 ~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~G 395 (539)
.+..+|.+++.- .|+.- ... +....+..++.||+..
T Consensus 184 --~~~~~d~~tg~~-~w~~~--~~~----~~~~~~~~~~~l~~~~ 219 (238)
T PF13360_consen 184 --RVVAVDLATGEK-LWSKP--ISG----IYSLPSVDGGTLYVTS 219 (238)
T ss_dssp --SEEEEETTTTEE-EEEEC--SS-----ECECEECCCTEEEEEE
T ss_pred --eEEEEECCCCCE-EEEec--CCC----ccCCceeeCCEEEEEe
Confidence 123348888821 28443 211 2222445577777765
No 83
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=86.14 E-value=22 Score=37.14 Aligned_cols=59 Identities=14% Similarity=0.170 Sum_probs=43.4
Q ss_pred ceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEeCccCCCcccccceecCCC
Q 039705 33 AMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRPLKILTDTWSSSGGLSANG 112 (539)
Q Consensus 33 ~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG 112 (539)
.|+.+.+ .+.||+.++... .+.+||..|.....++.++...+...++...+
T Consensus 68 ~~~F~al-~gskIv~~d~~~----------------------------~t~vyDt~t~av~~~P~l~~pk~~pisv~VG~ 118 (342)
T PF07893_consen 68 SMDFFAL-HGSKIVAVDQSG----------------------------RTLVYDTDTRAVATGPRLHSPKRCPISVSVGD 118 (342)
T ss_pred eeEEEEe-cCCeEEEEcCCC----------------------------CeEEEECCCCeEeccCCCCCCCcceEEEEeCC
Confidence 4556665 688888886531 25689999999999988876655555666688
Q ss_pred cEEEecCC
Q 039705 113 TIVISGGW 120 (539)
Q Consensus 113 ~l~v~GG~ 120 (539)
+||+....
T Consensus 119 ~LY~m~~~ 126 (342)
T PF07893_consen 119 KLYAMDRS 126 (342)
T ss_pred eEEEeecc
Confidence 89998765
No 84
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=86.13 E-value=31 Score=32.56 Aligned_cols=218 Identities=15% Similarity=0.187 Sum_probs=104.2
Q ss_pred ecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccccccccccc-ceeEEccCCcEEEEcCccCCeEEEE-ecCCCcce
Q 039705 108 LSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWF-STQHILPDGSFIVVGGRREFSYEYI-LKEGKRII 185 (539)
Q Consensus 108 ~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y-~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w 185 (539)
..++++++++|+. ...+.+||.. +.+-.. . +...... ......++++.+++++.+ ..+.+| .... +.
T Consensus 17 ~~~~~~~l~~~~~---~g~i~i~~~~--~~~~~~--~-~~~~~~~i~~~~~~~~~~~l~~~~~~-~~i~i~~~~~~--~~ 85 (289)
T cd00200 17 FSPDGKLLATGSG---DGTIKVWDLE--TGELLR--T-LKGHTGPVRDVAASADGTYLASGSSD-KTIRLWDLETG--EC 85 (289)
T ss_pred EcCCCCEEEEeec---CcEEEEEEee--CCCcEE--E-EecCCcceeEEEECCCCCEEEEEcCC-CeEEEEEcCcc--cc
Confidence 3457788888875 2578888876 332111 0 1111111 234556677788888764 345566 3332 11
Q ss_pred -eeccCccccCCCCCCCCcceEEEeeCCcEEEEEc--CceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCC
Q 039705 186 -YDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN--DRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDP 262 (539)
Q Consensus 186 -~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~ 262 (539)
.... .... . --.....+++++++.++ ....+||..+.+-...+.... .. .......|
T Consensus 86 ~~~~~---~~~~----~-i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~----~~--i~~~~~~~------ 145 (289)
T cd00200 86 VRTLT---GHTS----Y-VSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHT----DW--VNSVAFSP------ 145 (289)
T ss_pred eEEEe---ccCC----c-EEEEEEcCCCCEEEEecCCCeEEEEECCCcEEEEEeccCC----Cc--EEEEEEcC------
Confidence 1111 0000 0 00122334677887776 357789998665443332111 10 01112211
Q ss_pred CCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeeeccCCCceeceeEEecCCcEEEEcCcCCCCC
Q 039705 263 NSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQREMMPSPRVMGEMLLLPTGDVLIINGAKKGTA 342 (539)
Q Consensus 263 ~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~ 342 (539)
+++++++|..+ ..+..||+..... .......... -.+....++++.+++++.+ +
T Consensus 146 -----~~~~l~~~~~~----------------~~i~i~d~~~~~~-~~~~~~~~~~-i~~~~~~~~~~~l~~~~~~-~-- 199 (289)
T cd00200 146 -----DGTFVASSSQD----------------GTIKLWDLRTGKC-VATLTGHTGE-VNSVAFSPDGEKLLSSSSD-G-- 199 (289)
T ss_pred -----cCCEEEEEcCC----------------CcEEEEEcccccc-ceeEecCccc-cceEEECCCcCEEEEecCC-C--
Confidence 35666666533 1345676642111 1111111111 2234557888777777754 2
Q ss_pred CcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecCC
Q 039705 343 GWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 343 g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~ 397 (539)
.+.+||..+. +....-. ..... -....+.+|++++++++.
T Consensus 200 ----------~i~i~d~~~~---~~~~~~~-~~~~~-i~~~~~~~~~~~~~~~~~ 239 (289)
T cd00200 200 ----------TIKLWDLSTG---KCLGTLR-GHENG-VNSVAFSPDGYLLASGSE 239 (289)
T ss_pred ----------cEEEEECCCC---ceecchh-hcCCc-eEEEEEcCCCcEEEEEcC
Confidence 5788998765 3322110 11111 123445688889888873
No 85
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portions of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. 
The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=85.92 E-value=24 Score=35.59 Aligned_cols=181 Identities=15% Similarity=0.237 Sum_probs=89.6
Q ss_pred ccccccccceeEEcc-CCc--EEEEcCccCCeEEEE-ecCC--Cccee-eccCccccCCCCCCCCcceEEEeeCCcEEEE
Q 039705 145 ELSAKRWFSTQHILP-DGS--FIVVGGRREFSYEYI-LKEG--KRIIY-DLPILNETTNPSENNLYPFVFLSTDGNLFIF 217 (539)
Q Consensus 145 ~m~~~R~y~s~~~L~-dG~--VyvvGG~~~~~~E~y-P~~~--~~~w~-~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~ 217 (539)
+.+.+|..|+.-++. .|| +.++||+. | |... ...|. +....+ .
T Consensus 83 dvP~aRYGHt~~vV~SrGKta~VlFGGRS------Y~P~~qRTTenWNsVvDC~P--------------------~---- 132 (337)
T PF03089_consen 83 DVPEARYGHTINVVHSRGKTACVLFGGRS------YMPPGQRTTENWNSVVDCPP--------------------Q---- 132 (337)
T ss_pred CCCcccccceEEEEEECCcEEEEEECCcc------cCCccccchhhcceeccCCC--------------------e----
Confidence 588999988875543 454 56678873 4 4332 23563 222211 1
Q ss_pred EcCceeEeeCCCCeEE-EEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCc
Q 039705 218 ANDRSILLNPETNEIL-HVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQD 296 (539)
Q Consensus 218 Gg~~~e~yDp~tn~W~-~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s 296 (539)
+.+.|.+-+-.+ ..+|.+..+ -++ ..++ + .++.||++||..-... .....
T Consensus 133 ----VfLiDleFGC~tah~lpEl~dG-~SF----Hvsl---a--------r~D~VYilGGHsl~sd---------~Rpp~ 183 (337)
T PF03089_consen 133 ----VFLIDLEFGCCTAHTLPELQDG-QSF----HVSL---A--------RNDCVYILGGHSLESD---------SRPPR 183 (337)
T ss_pred ----EEEEeccccccccccchhhcCC-eEE----EEEE---e--------cCceEEEEccEEccCC---------CCCCc
Confidence 235677665543 235555542 222 2222 1 3789999999752111 01122
Q ss_pred eEEE--EeeCCCCceeeeccCCCceeceeEEe--cCCcEEEEcCcCCCCCCc------ccCCCCCCccEEEcCCCCCCCc
Q 039705 297 CGRI--EITNKSATWQREMMPSPRVMGEMLLL--PTGDVLIINGAKKGTAGW------NFATDPNTTPVLYEPDDPINER 366 (539)
Q Consensus 297 ~~~~--d~~~~~~~W~~~~M~~~R~~~~~vvl--pdG~I~vvGG~~~g~~g~------~~~~~~~~~~e~YdP~t~~g~~ 366 (539)
..|+ |+-..++.-+-.-++.+-+...+++. -..+.+|+||++.....- ..++ ..+++=.-+++ +
T Consensus 184 l~rlkVdLllGSP~vsC~vl~~glSisSAIvt~~~~~e~iIlGGY~sdsQKRm~C~~V~Ldd---~~I~ie~~E~P---~ 257 (337)
T PF03089_consen 184 LYRLKVDLLLGSPAVSCTVLQGGLSISSAIVTQTGPHEYIILGGYQSDSQKRMECNTVSLDD---DGIHIEEREPP---E 257 (337)
T ss_pred EEEEEEeecCCCceeEEEECCCCceEeeeeEeecCCCceEEEecccccceeeeeeeEEEEeC---CceEeccCCCC---C
Confidence 3333 33221222222234444444344332 235678889987432210 0000 12344444566 8
Q ss_pred eEecCCCCCCccceeeEeeCCCCeEEEe
Q 039705 367 FSELTPTSKPRMCHSTSVVLPDGKILVA 394 (539)
Q Consensus 367 Wt~~a~~~~~R~yhs~a~llpdG~V~v~ 394 (539)
|+. .....|.+.+..+= .|.+|++
T Consensus 258 Wt~--dI~hSrtWFGgs~G--~G~~Li~ 281 (337)
T PF03089_consen 258 WTG--DIKHSRTWFGGSMG--KGSALIG 281 (337)
T ss_pred CCC--CcCcCccccccccC--CceEEEE
Confidence 974 46667777776543 7777664
No 86
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=85.44 E-value=59 Score=35.24 Aligned_cols=205 Identities=17% Similarity=0.188 Sum_probs=106.8
Q ss_pred CCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccc-eeecccccccccccceeEEccCCcEEEEcCccCCeEEE
Q 039705 98 LTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACY-WKEHHWELSAKRWFSTQHILPDGSFIVVGGRREFSYEY 176 (539)
Q Consensus 98 ~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~-W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~ 176 (539)
.+...+......+||++++.|..+ ..+++||... ... =..+.. +.. +-.+++.-++|+.++.|+.+ .++-+
T Consensus 201 ~h~~~v~~~~fs~d~~~l~s~s~D---~tiriwd~~~-~~~~~~~l~g-H~~--~v~~~~f~p~g~~i~Sgs~D-~tvri 272 (456)
T KOG0266|consen 201 GHTRGVSDVAFSPDGSYLLSGSDD---KTLRIWDLKD-DGRNLKTLKG-HST--YVTSVAFSPDGNLLVSGSDD-GTVRI 272 (456)
T ss_pred ccccceeeeEECCCCcEEEEecCC---ceEEEeeccC-CCeEEEEecC-CCC--ceEEEEecCCCCEEEEecCC-CcEEE
Confidence 355666677788999987777653 7889999841 211 122211 222 22456666688777766655 56667
Q ss_pred E-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCc--eeEeeCCCCeEEEEcccCCCCCCccCCCccEE
Q 039705 177 I-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDR--SILLNPETNEILHVFPILRGGSRNYPASATSA 253 (539)
Q Consensus 177 y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~--~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av 253 (539)
| .++. +- ...+....+.. -.+..-.+|++++.+..+ ..+||..+..-. ....+... ..+..-..+
T Consensus 273 Wd~~~~--~~--~~~l~~hs~~i-----s~~~f~~d~~~l~s~s~d~~i~vwd~~~~~~~-~~~~~~~~--~~~~~~~~~ 340 (456)
T KOG0266|consen 273 WDVRTG--EC--VRKLKGHSDGI-----SGLAFSPDGNLLVSASYDGTIRVWDLETGSKL-CLKLLSGA--ENSAPVTSV 340 (456)
T ss_pred EeccCC--eE--EEeeeccCCce-----EEEEECCCCCEEEEcCCCccEEEEECCCCcee-eeecccCC--CCCCceeEE
Confidence 7 4432 22 11122211110 012233588999888644 678999887721 01112211 110000122
Q ss_pred ecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCC--CceeeeccCCCceeceeEEecCCcE
Q 039705 254 LLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKS--ATWQREMMPSPRVMGEMLLLPTGDV 331 (539)
Q Consensus 254 ~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~--~~W~~~~M~~~R~~~~~vvlpdG~I 331 (539)
..- .+++.++++..+. .+-.+|+.... ..|...... .|.....+..++|+.
T Consensus 341 ~fs----------p~~~~ll~~~~d~----------------~~~~w~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 393 (456)
T KOG0266|consen 341 QFS----------PNGKYLLSASLDR----------------TLKLWDLRSGKSVGTYTGHSNL-VRCIFSPTLSTGGKL 393 (456)
T ss_pred EEC----------CCCcEEEEecCCC----------------eEEEEEccCCcceeeecccCCc-ceeEecccccCCCCe
Confidence 221 2677777766441 23345543211 223322111 255545555678888
Q ss_pred EEEcCcCCCCCCcccCCCCCCccEEEcCCCC
Q 039705 332 LIINGAKKGTAGWNFATDPNTTPVLYEPDDP 362 (539)
Q Consensus 332 ~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~ 362 (539)
.+.|..+. .+++||+.+.
T Consensus 394 i~sg~~d~-------------~v~~~~~~s~ 411 (456)
T KOG0266|consen 394 IYSGSEDG-------------SVYVWDSSSG 411 (456)
T ss_pred EEEEeCCc-------------eEEEEeCCcc
Confidence 88887652 6899999875
No 87
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=84.03 E-value=34 Score=34.32 Aligned_cols=137 Identities=13% Similarity=0.126 Sum_probs=76.6
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...+||.++.+-+..-..|..---+.++..|.+.++.|-.+ +++..||-.. .++.+.... +. .-|-.++...|+
T Consensus 86 ~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qivSGSrD---kTiklwnt~g-~ck~t~~~~-~~-~~WVscvrfsP~ 159 (315)
T KOG0279|consen 86 TLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQIVSGSRD---KTIKLWNTLG-VCKYTIHED-SH-REWVSCVRFSPN 159 (315)
T ss_pred eEEEEEecCCcEEEEEEecCCceEEEEecCCCceeecCCCc---ceeeeeeecc-cEEEEEecC-CC-cCcEEEEEEcCC
Confidence 35678888865554433332211123556789999998653 7888888762 444443332 43 346555556665
Q ss_pred C-cEEEEcCccCCeEEEEecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCcee--EeeCCCCe
Q 039705 161 G-SFIVVGGRREFSYEYILKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRSI--LLNPETNE 231 (539)
Q Consensus 161 G-~VyvvGG~~~~~~E~yP~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~e--~yDp~tn~ 231 (539)
. ..+|+.+.+.+++.+|-..+ .- ...+..... .+-..+.+.+||.+.+.||.+.+ ++|....+
T Consensus 160 ~~~p~Ivs~s~DktvKvWnl~~--~~-l~~~~~gh~-----~~v~t~~vSpDGslcasGgkdg~~~LwdL~~~k 225 (315)
T KOG0279|consen 160 ESNPIIVSASWDKTVKVWNLRN--CQ-LRTTFIGHS-----GYVNTVTVSPDGSLCASGGKDGEAMLWDLNEGK 225 (315)
T ss_pred CCCcEEEEccCCceEEEEccCC--cc-hhhcccccc-----ccEEEEEECCCCCEEecCCCCceEEEEEccCCc
Confidence 3 56666666667877772221 11 111111111 11123456789999999998754 55655443
No 88
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL, EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE type (InterPro IPR022528) and the Pseudomonas aeruginosa DevB type (InterPro IPR005900). This entry contains bacterial YbhE-type 6-phosphogluconolactonases (6PGL, EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=83.54 E-value=59 Score=33.68 Aligned_cols=279 Identities=13% Similarity=0.178 Sum_probs=127.0
Q ss_pred cCCCceEEccC-CCcccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEeC
Q 039705 17 EFKGKWELASE-NSGISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRPL 95 (539)
Q Consensus 17 ~~~g~W~~~~~-~~~~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~l 95 (539)
...|+++.+.. .....+-.+++-+.+..+|+..... .. +| .-.+..++..+++.+.+
T Consensus 22 ~~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~~-~~------~g---------------~v~~~~i~~~~g~L~~~ 79 (345)
T PF10282_consen 22 EETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEGS-GD------SG---------------GVSSYRIDPDTGTLTLL 79 (345)
T ss_dssp TTTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETTS-ST------TT---------------EEEEEEEETTTTEEEEE
T ss_pred CCCCCceEeeeecCCCCCceEEEEeCCCEEEEEEccc-cC------CC---------------CEEEEEECCCcceeEEe
Confidence 45578887753 2334445556654456666665432 00 11 12344566666777776
Q ss_pred ccCC---CcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeec----------ccc-cccccccceeEEccCC
Q 039705 96 KILT---DTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEH----------HWE-LSAKRWFSTQHILPDG 161 (539)
Q Consensus 96 ~~~~---~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~----------~~~-m~~~R~y~s~~~L~dG 161 (539)
.... ..-|+ -++.++|+.+++.-+. ..++.+|+-.. ...-.+. +.. -+..-.-|.+...+||
T Consensus 80 ~~~~~~g~~p~~-i~~~~~g~~l~vany~--~g~v~v~~l~~-~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg 155 (345)
T PF10282_consen 80 NSVPSGGSSPCH-IAVDPDGRFLYVANYG--GGSVSVFPLDD-DGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDG 155 (345)
T ss_dssp EEEEESSSCEEE-EEECTTSSEEEEEETT--TTEEEEEEECT-TSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTS
T ss_pred eeeccCCCCcEE-EEEecCCCEEEEEEcc--CCeEEEEEccC-CcccceeeeecccCCCCCcccccccccceeEEECCCC
Confidence 5432 22242 2445688877765332 24566665441 0111111 000 0111223456677788
Q ss_pred cEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcc-eEEEeeCCc-EEEEEc--CceeEeeC--CCCeEE-
Q 039705 162 SFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYP-FVFLSTDGN-LFIFAN--DRSILLNP--ETNEIL- 233 (539)
Q Consensus 162 ~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp-~~~~~~~G~-Iyv~Gg--~~~e~yDp--~tn~W~- 233 (539)
+.+.+--.....+.+| -.....+......+.. +...-| |+...+||+ +|++.- +.+..|+. .+++++
T Consensus 156 ~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~-----~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~ 230 (345)
T PF10282_consen 156 RFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKV-----PPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLTE 230 (345)
T ss_dssp SEEEEEETTTTEEEEEEE-TTS-TEEEEEEEEC-----STTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEE
T ss_pred CEEEEEecCCCEEEEEEEeCCCceEEEeecccc-----ccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeecccCCceeE
Confidence 7555543333456666 3332111211111100 111234 344456876 555553 34444444 466554
Q ss_pred -EEcccCCCCCCccCCCccEEecccccCCCCCCCccc-EEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceee
Q 039705 234 -HVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRA-EVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQR 311 (539)
Q Consensus 234 -~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g-~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~ 311 (539)
+.++.+|.+...-. .++.+. +.| ++ .+|+.--. .+++..|++...+.+-+.
T Consensus 231 ~~~~~~~~~~~~~~~-~~~~i~--isp--------dg~~lyvsnr~----------------~~sI~vf~~d~~~g~l~~ 283 (345)
T PF10282_consen 231 IQTISTLPEGFTGEN-APAEIA--ISP--------DGRFLYVSNRG----------------SNSISVFDLDPATGTLTL 283 (345)
T ss_dssp EEEEESCETTSCSSS-SEEEEE--E-T--------TSSEEEEEECT----------------TTEEEEEEECTTTTTEEE
T ss_pred EEEeeeccccccccC-CceeEE--Eec--------CCCEEEEEecc----------------CCEEEEEEEecCCCceEE
Confidence 23444543211100 122222 222 44 46664321 235667777433334333
Q ss_pred e---c--cCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEE--cCCCCCCCceEecC
Q 039705 312 E---M--MPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLY--EPDDPINERFSELT 371 (539)
Q Consensus 312 ~---~--M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~Y--dP~t~~g~~Wt~~a 371 (539)
. + -..||.+ .+-+||+.+++.....+ ++.+| |+++. .++.+.
T Consensus 284 ~~~~~~~G~~Pr~~---~~s~~g~~l~Va~~~s~------------~v~vf~~d~~tG---~l~~~~ 332 (345)
T PF10282_consen 284 VQTVPTGGKFPRHF---AFSPDGRYLYVANQDSN------------TVSVFDIDPDTG---KLTPVG 332 (345)
T ss_dssp EEEEEESSSSEEEE---EE-TTSSEEEEEETTTT------------EEEEEEEETTTT---EEEEEE
T ss_pred EEEEeCCCCCccEE---EEeCCCCEEEEEecCCC------------eEEEEEEeCCCC---cEEEec
Confidence 2 2 3457753 45689998888765421 45555 66777 787654
No 89
>PF13088 BNR_2: BNR repeat-like domain; PDB: 2F11_A 2F0Z_A 1VCU_B 2F25_B 1SO7_A 2F29_A 1SNT_A 2F13_A 2F28_A 2F27_A ....
Probab=82.96 E-value=13 Score=36.81 Aligned_cols=127 Identities=11% Similarity=0.098 Sum_probs=70.3
Q ss_pred CCCCCEEeCccC-CC-cccccce-ecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccc-ccccccccceeEEccCCc
Q 039705 87 AESAAIRPLKIL-TD-TWSSSGG-LSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHW-ELSAKRWFSTQHILPDGS 162 (539)
Q Consensus 87 p~t~~w~~l~~~-~~-~~c~~~~-~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~-~m~~~R~y~s~~~L~dG~ 162 (539)
-.-.+|+..... .. ..|.... .+.||+|+++--.. ....+.++--.++..+|++... .++..........+.||+
T Consensus 142 D~G~tW~~~~~~~~~~~~~e~~~~~~~dG~l~~~~R~~-~~~~~~~~~S~D~G~TWs~~~~~~~~~~~~~~~~~~~~~g~ 220 (275)
T PF13088_consen 142 DGGKTWSSGSPIPDGQGECEPSIVELPDGRLLAVFRTE-GNDDIYISRSTDGGRTWSPPQPTNLPNPNSSISLVRLSDGR 220 (275)
T ss_dssp STTSSEEEEEECECSEEEEEEEEEEETTSEEEEEEEEC-SSTEEEEEEESSTTSS-EEEEEEECSSCCEEEEEEECTTSE
T ss_pred CCCceeeccccccccCCcceeEEEECCCCcEEEEEEcc-CCCcEEEEEECCCCCcCCCceecccCcccCCceEEEcCCCC
Confidence 344569877654 22 3444443 35799999875332 1123333322223678987532 245445445566788999
Q ss_pred EEEEcCccC--CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEE
Q 039705 163 FIVVGGRRE--FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFI 216 (539)
Q Consensus 163 VyvvGG~~~--~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv 216 (539)
++++..... ..+.++ ...+..+|.....+.... ....-||.+..+.||+|||
T Consensus 221 ~~~~~~~~~~r~~l~l~~S~D~g~tW~~~~~i~~~~--~~~~~Y~~~~~~~dg~l~i 275 (275)
T PF13088_consen 221 LLLVYNNPDGRSNLSLYVSEDGGKTWSRPKTIDDGP--NGDSGYPSLTQLPDGKLYI 275 (275)
T ss_dssp EEEEEECSSTSEEEEEEEECTTCEEEEEEEEEEEEE---CCEEEEEEEEEETTEEEE
T ss_pred EEEEEECCCCCCceEEEEEeCCCCcCCccEEEeCCC--CCcEECCeeEEeCCCcCCC
Confidence 999988422 234444 333356785432222211 1124599999999999986
No 90
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=82.46 E-value=28 Score=34.51 Aligned_cols=128 Identities=14% Similarity=0.122 Sum_probs=70.9
Q ss_pred CCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCcce---e
Q 039705 111 NGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKRII---Y 186 (539)
Q Consensus 111 dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w---~ 186 (539)
|..|+-. . ..+.||++|-. +.+=.. . |..+.---++.+-.||+++++.= +.++-++ ++.= .- .
T Consensus 155 D~~iLSS--a--dd~tVRLWD~r--Tgt~v~--s-L~~~s~VtSlEvs~dG~ilTia~--gssV~Fwdaksf--~~lKs~ 221 (334)
T KOG0278|consen 155 DKCILSS--A--DDKTVRLWDHR--TGTEVQ--S-LEFNSPVTSLEVSQDGRILTIAY--GSSVKFWDAKSF--GLLKSY 221 (334)
T ss_pred CceEEee--c--cCCceEEEEec--cCcEEE--E-EecCCCCcceeeccCCCEEEEec--CceeEEeccccc--cceeec
Confidence 5566554 1 14789999987 554332 2 44444334566777999998842 1245555 4321 11 0
Q ss_pred eccCccccCCCCCCCCcceEEEeeCCcEEEEEcCce--eEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCC
Q 039705 187 DLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRS--ILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNS 264 (539)
Q Consensus 187 ~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~--e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~ 264 (539)
.+|.-.. .+.+-|+-.+||.||.+. ..||..|+.-. -.-.. -. .|..-.+-.+|
T Consensus 222 k~P~nV~-----------SASL~P~k~~fVaGged~~~~kfDy~TgeEi---~~~nk--gh---~gpVhcVrFSP----- 277 (334)
T KOG0278|consen 222 KMPCNVE-----------SASLHPKKEFFVAGGEDFKVYKFDYNTGEEI---GSYNK--GH---FGPVHCVRFSP----- 277 (334)
T ss_pred cCccccc-----------cccccCCCceEEecCcceEEEEEeccCCcee---eeccc--CC---CCceEEEEECC-----
Confidence 1221111 235667889999999875 56888888643 11111 01 12221222222
Q ss_pred CCcccEEEEecCCC
Q 039705 265 NAIRAEVLICGGAK 278 (539)
Q Consensus 265 ~~~~g~Iyv~GG~~ 278 (539)
+|++|+.|-.+
T Consensus 278 ---dGE~yAsGSED 288 (334)
T KOG0278|consen 278 ---DGELYASGSED 288 (334)
T ss_pred ---CCceeeccCCC
Confidence 89999998765
No 91
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=81.60 E-value=43 Score=34.84 Aligned_cols=133 Identities=17% Similarity=0.216 Sum_probs=78.2
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecc--cccccccccceeEEc
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHH--WELSAKRWFSTQHIL 158 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~--~~m~~~R~y~s~~~L 158 (539)
.+.+||..++.|--.-.-+.---....+..||.++++|+.. -.+.+|.-..+..+|.-.. .+|..-+|.+.
T Consensus 87 ~AflW~~~~ge~~~eltgHKDSVt~~~FshdgtlLATGdms---G~v~v~~~stg~~~~~~~~e~~dieWl~WHp~---- 159 (399)
T KOG0296|consen 87 LAFLWDISTGEFAGELTGHKDSVTCCSFSHDGTLLATGDMS---GKVLVFKVSTGGEQWKLDQEVEDIEWLKWHPR---- 159 (399)
T ss_pred eEEEEEccCCcceeEecCCCCceEEEEEccCceEEEecCCC---ccEEEEEcccCceEEEeecccCceEEEEeccc----
Confidence 57789999998754433332111122456899999999874 4577777653356776431 13777788763
Q ss_pred cCCcEEEEcCccCCeEEEE--ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC--ceeEeeCCCCeEE
Q 039705 159 PDGSFIVVGGRREFSYEYI--LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND--RSILLNPETNEIL 233 (539)
Q Consensus 159 ~dG~VyvvGG~~~~~~E~y--P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~--~~e~yDp~tn~W~ 233 (539)
+.|+..|-.+ .++++| |... .-.. +...... .-..-.+++||..+.|-. +..+|||++..-.
T Consensus 160 --a~illAG~~D-GsvWmw~ip~~~--~~kv---~~Gh~~~-----ct~G~f~pdGKr~~tgy~dgti~~Wn~ktg~p~ 225 (399)
T KOG0296|consen 160 --AHILLAGSTD-GSVWMWQIPSQA--LCKV---MSGHNSP-----CTCGEFIPDGKRILTGYDDGTIIVWNPKTGQPL 225 (399)
T ss_pred --ccEEEeecCC-CcEEEEECCCcc--eeeE---ecCCCCC-----cccccccCCCceEEEEecCceEEEEecCCCcee
Confidence 4565655443 456677 4422 1112 2221110 012246788988877754 4678999998643
No 92
>PTZ00421 coronin; Provisional
Probab=80.73 E-value=69 Score=35.24 Aligned_cols=139 Identities=9% Similarity=0.032 Sum_probs=66.5
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccce-eecccccccccccceeEEcc
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYW-KEHHWELSAKRWFSTQHILP 159 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W-~~~~~~m~~~R~y~s~~~L~ 159 (539)
...+||..+++-...-..+..........+||.++++|+.+ ..+++||+. +.+- ..+.. ........+....
T Consensus 149 tVrIWDl~tg~~~~~l~~h~~~V~sla~spdG~lLatgs~D---g~IrIwD~r--sg~~v~tl~~--H~~~~~~~~~w~~ 221 (493)
T PTZ00421 149 VVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKD---KKLNIIDPR--DGTIVSSVEA--HASAKSQRCLWAK 221 (493)
T ss_pred EEEEEECCCCeEEEEEcCCCCceEEEEEECCCCEEEEecCC---CEEEEEECC--CCcEEEEEec--CCCCcceEEEEcC
Confidence 46789988765432211122222223445799999999863 679999998 4432 11111 1111111223344
Q ss_pred CCcEEEEcCcc---CCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC---ceeEeeCCCCeE
Q 039705 160 DGSFIVVGGRR---EFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND---RSILLNPETNEI 232 (539)
Q Consensus 160 dG~VyvvGG~~---~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~---~~e~yDp~tn~W 232 (539)
++..++..|.+ ...+.+| ..+......... + +...... ..+.-+++++++.||. .+.+||..+++.
T Consensus 222 ~~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~~-~----d~~~~~~--~~~~d~d~~~L~lggkgDg~Iriwdl~~~~~ 294 (493)
T PTZ00421 222 RKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVD-L----DQSSALF--IPFFDEDTNLLYIGSKGEGNIRCFELMNERL 294 (493)
T ss_pred CCCeEEEEecCCCCCCeEEEEeCCCCCCceeEec-c----CCCCceE--EEEEcCCCCEEEEEEeCCCeEEEEEeeCCce
Confidence 54444544432 3467777 433200000000 0 0000000 0112357887777653 467899887775
Q ss_pred E
Q 039705 233 L 233 (539)
Q Consensus 233 ~ 233 (539)
.
T Consensus 295 ~ 295 (493)
T PTZ00421 295 T 295 (493)
T ss_pred E
Confidence 4
No 93
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=77.31 E-value=79 Score=31.31 Aligned_cols=157 Identities=15% Similarity=0.189 Sum_probs=79.8
Q ss_pred CceEEccC----CCcccceEEEec-CCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEe
Q 039705 20 GKWELASE----NSGISAMHIILF-PNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRP 94 (539)
Q Consensus 20 g~W~~~~~----~~~~~~~~~~ll-~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~ 94 (539)
--|+...+ ..++..+.+..+ |..+-|++.+|.. ....+|.++++.+.
T Consensus 99 ~lwe~~~P~~~~~~evPeINam~ldP~enSi~~AgGD~----------------------------~~y~~dlE~G~i~r 150 (325)
T KOG0649|consen 99 RLWEVKIPMQVDAVEVPEINAMWLDPSENSILFAGGDG----------------------------VIYQVDLEDGRIQR 150 (325)
T ss_pred hhhhhcCccccCcccCCccceeEeccCCCcEEEecCCe----------------------------EEEEEEecCCEEEE
Confidence 34777654 234555544444 4567777666421 24578999988876
Q ss_pred CccCCCccccccee--cCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccc-----ccc---cccccceeEEccCCcEE
Q 039705 95 LKILTDTWSSSGGL--SANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHW-----ELS---AKRWFSTQHILPDGSFI 164 (539)
Q Consensus 95 l~~~~~~~c~~~~~--l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~-----~m~---~~R~y~s~~~L~dG~Vy 164 (539)
.--.+.-.-+ .++ -.++.|+ .|+. | -+++++|-+ +.+-..+-. ++. ..||- .+... |-.-+
T Consensus 151 ~~rGHtDYvH-~vv~R~~~~qil-sG~E-D--GtvRvWd~k--t~k~v~~ie~yk~~~~lRp~~g~wi-gala~-~edWl 221 (325)
T KOG0649|consen 151 EYRGHTDYVH-SVVGRNANGQIL-SGAE-D--GTVRVWDTK--TQKHVSMIEPYKNPNLLRPDWGKWI-GALAV-NEDWL 221 (325)
T ss_pred EEcCCcceee-eeeecccCccee-ecCC-C--ccEEEEecc--ccceeEEeccccChhhcCcccCcee-EEEec-cCceE
Confidence 5444322211 111 1245543 4543 3 367788877 444332210 122 34553 33334 67788
Q ss_pred EEcCccCCeEEEE--ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEE-cCceeEeeC
Q 039705 165 VVGGRREFSYEYI--LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFA-NDRSILLNP 227 (539)
Q Consensus 165 vvGG~~~~~~E~y--P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~G-g~~~e~yDp 227 (539)
|.||-.. ..+| +... .....|+.... +.+...+..|.+.| |+.+..|..
T Consensus 222 vCGgGp~--lslwhLrsse--~t~vfpipa~v----------~~v~F~~d~vl~~G~g~~v~~~~l 273 (325)
T KOG0649|consen 222 VCGGGPK--LSLWHLRSSE--STCVFPIPARV----------HLVDFVDDCVLIGGEGNHVQSYTL 273 (325)
T ss_pred EecCCCc--eeEEeccCCC--ceEEEecccce----------eEeeeecceEEEeccccceeeeee
Confidence 8888533 3344 3322 22234432221 22444567777777 677766544
No 94
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=76.95 E-value=51 Score=35.29 Aligned_cols=131 Identities=14% Similarity=0.103 Sum_probs=70.8
Q ss_pred eEEEEECCCCCEEeCccCC---CcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEE
Q 039705 81 LAVEYDAESAAIRPLKILT---DTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHI 157 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~---~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~ 157 (539)
....||+.+.+.+++..+. ...-..-.+.+++..+++-|.+ -.|.+.... ++.|...-. |+ ++- ...+.
T Consensus 281 y~ysyDle~ak~~k~~~~~g~e~~~~e~FeVShd~~fia~~G~~---G~I~lLhak--T~eli~s~K-ie-G~v-~~~~f 352 (514)
T KOG2055|consen 281 YLYSYDLETAKVTKLKPPYGVEEKSMERFEVSHDSNFIAIAGNN---GHIHLLHAK--TKELITSFK-IE-GVV-SDFTF 352 (514)
T ss_pred EEEEeeccccccccccCCCCcccchhheeEecCCCCeEEEcccC---ceEEeehhh--hhhhhheee-ec-cEE-eeEEE
Confidence 3457999999999887652 1121222345678888887764 346666666 777754322 32 221 22333
Q ss_pred ccCCcEEEEcCccCCeEEEE-ecCC--CcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCc--eeEeeCCCC
Q 039705 158 LPDGSFIVVGGRREFSYEYI-LKEG--KRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDR--SILLNPETN 230 (539)
Q Consensus 158 L~dG~VyvvGG~~~~~~E~y-P~~~--~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~--~e~yDp~tn 230 (539)
-.||+.+++.|.++ .+.+| -..+ -.+|..-..+..+ ..+..++|..|++|..+ +-+||..+-
T Consensus 353 sSdsk~l~~~~~~G-eV~v~nl~~~~~~~rf~D~G~v~gt----------s~~~S~ng~ylA~GS~~GiVNIYd~~s~ 419 (514)
T KOG2055|consen 353 SSDSKELLASGGTG-EVYVWNLRQNSCLHRFVDDGSVHGT----------SLCISLNGSYLATGSDSGIVNIYDGNSC 419 (514)
T ss_pred ecCCcEEEEEcCCc-eEEEEecCCcceEEEEeecCcccee----------eeeecCCCceEEeccCcceEEEeccchh
Confidence 45776655555433 23333 2221 1133222112111 22445799988888765 678986653
No 95
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=75.80 E-value=63 Score=34.45 Aligned_cols=141 Identities=11% Similarity=0.101 Sum_probs=75.8
Q ss_pred cceEEEeeCCcEEEEEcCc--eeEeeCCCCeEEEEcccCCCCCCccCCCcc--EEecccccCCCCCCCcccEEEEecCCC
Q 039705 203 YPFVFLSTDGNLFIFANDR--SILLNPETNEILHVFPILRGGSRNYPASAT--SALLPIKLQDPNSNAIRAEVLICGGAK 278 (539)
Q Consensus 203 yp~~~~~~~G~Iyv~Gg~~--~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~--av~lpl~~~~~~~~~~~g~Iyv~GG~~ 278 (539)
|..+..-|||.||..|--+ +.+||.+...- +...|+. +|. ++-+ . -+|--++++-.+
T Consensus 350 ~ts~~fHpDgLifgtgt~d~~vkiwdlks~~~---~a~Fpgh------t~~vk~i~F-------s---ENGY~Lat~add 410 (506)
T KOG0289|consen 350 YTSAAFHPDGLIFGTGTPDGVVKIWDLKSQTN---VAKFPGH------TGPVKAISF-------S---ENGYWLATAADD 410 (506)
T ss_pred eEEeeEcCCceEEeccCCCceEEEEEcCCccc---cccCCCC------CCceeEEEe-------c---cCceEEEEEecC
Confidence 4444556899999998654 56899887653 2233321 121 1111 0 255656664322
Q ss_pred CCcccccCCCcccccCCceEEEEeeCCCCceeeeccCCCceeceeEE-ecCCcEEEEcCcCCCCCCcccCCCCCCccEEE
Q 039705 279 PEAGVLAGKGEFMNALQDCGRIEITNKSATWQREMMPSPRVMGEMLL-LPTGDVLIINGAKKGTAGWNFATDPNTTPVLY 357 (539)
Q Consensus 279 ~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~~M~~~R~~~~~vv-lpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~Y 357 (539)
.++.++|+..- .-.....++... ...++. =..|+.++++|.+ .++.+|
T Consensus 411 ----------------~~V~lwDLRKl-~n~kt~~l~~~~-~v~s~~fD~SGt~L~~~g~~-------------l~Vy~~ 459 (506)
T KOG0289|consen 411 ----------------GSVKLWDLRKL-KNFKTIQLDEKK-EVNSLSFDQSGTYLGIAGSD-------------LQVYIC 459 (506)
T ss_pred ----------------CeEEEEEehhh-cccceeeccccc-cceeEEEcCCCCeEEeecce-------------eEEEEE
Confidence 13677888641 222222333322 112222 2468888888754 378899
Q ss_pred cCCCCCCCceEecCCCCCCccceee-EeeCCCCeEEEecCC
Q 039705 358 EPDDPINERFSELTPTSKPRMCHST-SVVLPDGKILVAGSN 397 (539)
Q Consensus 358 dP~t~~g~~Wt~~a~~~~~R~yhs~-a~llpdG~V~v~GG~ 397 (539)
+-.+. +|+.+...+.-- .-++ +-+--+.+++..||.
T Consensus 460 ~k~~k---~W~~~~~~~~~s-g~st~v~Fg~~aq~l~s~sm 496 (506)
T KOG0289|consen 460 KKKTK---SWTEIKELADHS-GLSTGVRFGEHAQYLASTSM 496 (506)
T ss_pred ecccc---cceeeehhhhcc-cccceeeecccceEEeeccc
Confidence 99999 999876554222 1122 223344556666665
No 96
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=74.95 E-value=98 Score=37.51 Aligned_cols=61 Identities=18% Similarity=0.286 Sum_probs=38.3
Q ss_pred eeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCC----------Cc--cceeeEeeCCCC
Q 039705 322 EMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSK----------PR--MCHSTSVVLPDG 389 (539)
Q Consensus 322 ~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~----------~R--~yhs~a~llpdG 389 (539)
+.++-+||+|||....+ ..+.+||+++. ..+.++.... .+ .-+++ ++.+||
T Consensus 808 Gvavd~dG~LYVADs~N-------------~rIrviD~~tg---~v~tiaG~G~~G~~dG~~~~a~l~~P~GI-avd~dG 870 (1057)
T PLN02919 808 GVLCAKDGQIYVADSYN-------------HKIKKLDPATK---RVTTLAGTGKAGFKDGKALKAQLSEPAGL-ALGENG 870 (1057)
T ss_pred eeeEeCCCcEEEEECCC-------------CEEEEEECCCC---eEEEEeccCCcCCCCCcccccccCCceEE-EEeCCC
Confidence 33456899999977543 26899999988 6665432111 11 11244 446899
Q ss_pred eEEEecCCCC
Q 039705 390 KILVAGSNPH 399 (539)
Q Consensus 390 ~V~v~GG~~~ 399 (539)
+|||+-.+.+
T Consensus 871 ~lyVaDt~Nn 880 (1057)
T PLN02919 871 RLFVADTNNS 880 (1057)
T ss_pred CEEEEECCCC
Confidence 9999866533
No 97
>PLN00181 protein SPA1-RELATED; Provisional
Probab=74.58 E-value=1.8e+02 Score=33.98 Aligned_cols=131 Identities=14% Similarity=0.068 Sum_probs=63.8
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceec-CCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEE-c
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLS-ANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHI-L 158 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l-~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~-L 158 (539)
...+||..+++....-..+.......... .++.++++||.+ ..+++||.. +..- ... +.......++.. -
T Consensus 556 ~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~D---g~v~iWd~~--~~~~--~~~-~~~~~~v~~v~~~~ 627 (793)
T PLN00181 556 VVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDD---GSVKLWSIN--QGVS--IGT-IKTKANICCVQFPS 627 (793)
T ss_pred eEEEEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCC---CEEEEEECC--CCcE--EEE-EecCCCeEEEEEeC
Confidence 35678887765433222232222222233 378999999863 578999976 3321 111 221111111111 2
Q ss_pred cCCcEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC--ceeEeeCCC
Q 039705 159 PDGSFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND--RSILLNPET 229 (539)
Q Consensus 159 ~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~--~~e~yDp~t 229 (539)
.+|..+++|+.++ .+.+| .... +- ....+..... .. ..+...++..++.|+. .+.+||...
T Consensus 628 ~~g~~latgs~dg-~I~iwD~~~~--~~-~~~~~~~h~~----~V--~~v~f~~~~~lvs~s~D~~ikiWd~~~ 691 (793)
T PLN00181 628 ESGRSLAFGSADH-KVYYYDLRNP--KL-PLCTMIGHSK----TV--SYVRFVDSSTLVSSSTDNTLKLWDLSM 691 (793)
T ss_pred CCCCEEEEEeCCC-eEEEEECCCC--Cc-cceEecCCCC----CE--EEEEEeCCCEEEEEECCCEEEEEeCCC
Confidence 3688888888664 56666 4332 10 0000111000 00 1122347777777764 467888764
No 98
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=74.01 E-value=14 Score=38.56 Aligned_cols=113 Identities=16% Similarity=0.146 Sum_probs=63.8
Q ss_pred ecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccc-cccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCcce
Q 039705 108 LSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELS-AKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKRII 185 (539)
Q Consensus 108 ~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~-~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w 185 (539)
+.++|+.++.|+- ..+++++|+. +.+ ++.. +. ...|-.+.+--+||+.++.|-.+ .++.+| |+++ +-
T Consensus 123 fsp~g~~l~tGsG---D~TvR~WD~~--TeT--p~~t-~KgH~~WVlcvawsPDgk~iASG~~d-g~I~lwdpktg--~~ 191 (480)
T KOG0271|consen 123 FSPTGSRLVTGSG---DTTVRLWDLD--TET--PLFT-CKGHKNWVLCVAWSPDGKKIASGSKD-GSIRLWDPKTG--QQ 191 (480)
T ss_pred ecCCCceEEecCC---CceEEeeccC--CCC--ccee-ecCCccEEEEEEECCCcchhhccccC-CeEEEecCCCC--Cc
Confidence 4568888888863 4789999997 433 1111 22 34577777788899998766543 567788 8876 21
Q ss_pred eeccCccccCC-CCCCCCcceEEEeeCCcEEEEEc--CceeEeeCCCCeEE
Q 039705 186 YDLPILNETTN-PSENNLYPFVFLSTDGNLFIFAN--DRSILLNPETNEIL 233 (539)
Q Consensus 186 ~~~~~l~~~~~-~~~~~~yp~~~~~~~G~Iyv~Gg--~~~e~yDp~tn~W~ 233 (539)
.-.+ |..... -....+.| ..+.+..+.++.++ +++.+||....+-.
T Consensus 192 ~g~~-l~gH~K~It~Lawep-~hl~p~~r~las~skDg~vrIWd~~~~~~~ 240 (480)
T KOG0271|consen 192 IGRA-LRGHKKWITALAWEP-LHLVPPCRRLASSSKDGSVRIWDTKLGTCV 240 (480)
T ss_pred cccc-ccCcccceeEEeecc-cccCCCccceecccCCCCEEEEEccCceEE
Confidence 1111 101000 00011112 14455666666655 35677887766544
No 99
>PTZ00421 coronin; Provisional
Probab=73.06 E-value=1.5e+02 Score=32.56 Aligned_cols=107 Identities=8% Similarity=0.110 Sum_probs=55.6
Q ss_pred CCcEEEecCCCCCCCeEEEEeCCCCccce-----eeccccccc-ccccceeEEccC-CcEEEEcCccCCeEEEE-ecCCC
Q 039705 111 NGTIVISGGWSSRGRSVRYLSGCYHACYW-----KEHHWELSA-KRWFSTQHILPD-GSFIVVGGRREFSYEYI-LKEGK 182 (539)
Q Consensus 111 dG~l~v~GG~~~g~~~v~~ydP~~~t~~W-----~~~~~~m~~-~R~y~s~~~L~d-G~VyvvGG~~~~~~E~y-P~~~~ 182 (539)
|+.++++|+.+ ..+++||-. +... ..+.. +.. .+.-.+.+.-++ +.+++.||.+ .++.+| ..+.
T Consensus 87 d~~~LaSgS~D---gtIkIWdi~--~~~~~~~~~~~l~~-L~gH~~~V~~l~f~P~~~~iLaSgs~D-gtVrIWDl~tg- 158 (493)
T PTZ00421 87 DPQKLFTASED---GTIMGWGIP--EEGLTQNISDPIVH-LQGHTKKVGIVSFHPSAMNVLASAGAD-MVVNVWDVERG- 158 (493)
T ss_pred CCCEEEEEeCC---CEEEEEecC--CCccccccCcceEE-ecCCCCcEEEEEeCcCCCCEEEEEeCC-CEEEEEECCCC-
Confidence 67888888863 578889865 3211 11111 221 111122233334 3577877765 456677 4433
Q ss_pred cceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC--ceeEeeCCCCeEE
Q 039705 183 RIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND--RSILLNPETNEIL 233 (539)
Q Consensus 183 ~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~--~~e~yDp~tn~W~ 233 (539)
+-. ..+....+ ..+ .+...++|++++.|+. .+.+||+++++-.
T Consensus 159 -~~~--~~l~~h~~----~V~-sla~spdG~lLatgs~Dg~IrIwD~rsg~~v 203 (493)
T PTZ00421 159 -KAV--EVIKCHSD----QIT-SLEWNLDGSLLCTTSKDKKLNIIDPRDGTIV 203 (493)
T ss_pred -eEE--EEEcCCCC----ceE-EEEEECCCCEEEEecCCCEEEEEECCCCcEE
Confidence 211 00111101 111 1223468999998875 4788999987644
No 100
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=73.04 E-value=6.7 Score=40.80 Aligned_cols=57 Identities=26% Similarity=0.320 Sum_probs=38.7
Q ss_pred ecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCCC
Q 039705 326 LPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPHS 400 (539)
Q Consensus 326 lpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~ 400 (539)
-|+|+.++.|+.+. ++.+||+.|. + .+..+.--+..-.+...-|||+.++.|+-++.
T Consensus 124 sp~g~~l~tGsGD~-------------TvR~WD~~Te---T--p~~t~KgH~~WVlcvawsPDgk~iASG~~dg~ 180 (480)
T KOG0271|consen 124 SPTGSRLVTGSGDT-------------TVRLWDLDTE---T--PLFTCKGHKNWVLCVAWSPDGKKIASGSKDGS 180 (480)
T ss_pred cCCCceEEecCCCc-------------eEEeeccCCC---C--cceeecCCccEEEEEEECCCcchhhccccCCe
Confidence 48999999998652 7899999987 1 11122222333344556899999999996543
No 101
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=72.85 E-value=1.8e+02 Score=33.32 Aligned_cols=50 Identities=12% Similarity=0.082 Sum_probs=31.7
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGC 133 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~ 133 (539)
+-.+|+-+++.+.--...|-.+-..-+..+||.++++|+.+ .+|++||-+
T Consensus 331 QLlVweWqsEsYVlKQQgH~~~i~~l~YSpDgq~iaTG~eD---gKVKvWn~~ 380 (893)
T KOG0291|consen 331 QLLVWEWQSESYVLKQQGHSDRITSLAYSPDGQLIATGAED---GKVKVWNTQ 380 (893)
T ss_pred eEEEEEeeccceeeeccccccceeeEEECCCCcEEEeccCC---CcEEEEecc
Confidence 44566655555543333333333344567899999999964 578889876
No 102
>PTZ00420 coronin; Provisional
Probab=72.82 E-value=55 Score=36.67 Aligned_cols=135 Identities=11% Similarity=0.116 Sum_probs=66.5
Q ss_pred eEEEEECCCCCEE-eCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccc-cceeEE-
Q 039705 81 LAVEYDAESAAIR-PLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRW-FSTQHI- 157 (539)
Q Consensus 81 ~~~~yDp~t~~w~-~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~-y~s~~~- 157 (539)
...+||..+.+-. .+. +..........++|.++++++.. +.+++||+. +.+ .+.. +..... ..+.++
T Consensus 149 tIrIWDl~tg~~~~~i~--~~~~V~SlswspdG~lLat~s~D---~~IrIwD~R--sg~--~i~t-l~gH~g~~~s~~v~ 218 (568)
T PTZ00420 149 FVNIWDIENEKRAFQIN--MPKKLSSLKWNIKGNLLSGTCVG---KHMHIIDPR--KQE--IASS-FHIHDGGKNTKNIW 218 (568)
T ss_pred eEEEEECCCCcEEEEEe--cCCcEEEEEECCCCCEEEEEecC---CEEEEEECC--CCc--EEEE-EecccCCceeEEEE
Confidence 3567888776532 121 11112233445799999988752 679999998 432 1111 221110 001111
Q ss_pred c----cCCcEEEEcCccC---CeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc--CceeEeeC
Q 039705 158 L----PDGSFIVVGGRRE---FSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN--DRSILLNP 227 (539)
Q Consensus 158 L----~dG~VyvvGG~~~---~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--~~~e~yDp 227 (539)
+ +|+..++.+|.+. .++.+| .+.. ...... ...+.....+.|+ +-..+|.+|+.|. ..+.+|+.
T Consensus 219 ~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~-~~pl~~----~~ld~~~~~L~p~-~D~~tg~l~lsGkGD~tIr~~e~ 292 (568)
T PTZ00420 219 IDGLGGDDNYILSTGFSKNNMREMKLWDLKNT-TSALVT----MSIDNASAPLIPH-YDESTGLIYLIGKGDGNCRYYQH 292 (568)
T ss_pred eeeEcCCCCEEEEEEcCCCCccEEEEEECCCC-CCceEE----EEecCCccceEEe-eeCCCCCEEEEEECCCeEEEEEc
Confidence 1 4777777777654 357777 4421 121110 0011111111222 1133589999884 45778888
Q ss_pred CCCe
Q 039705 228 ETNE 231 (539)
Q Consensus 228 ~tn~ 231 (539)
..+.
T Consensus 293 ~~~~ 296 (568)
T PTZ00420 293 SLGS 296 (568)
T ss_pred cCCc
Confidence 7664
No 103
>PRK01742 tolB translocation protein TolB; Provisional
Probab=72.25 E-value=1.4e+02 Score=31.92 Aligned_cols=134 Identities=13% Similarity=0.092 Sum_probs=69.6
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccc-cceeEEcc
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRW-FSTQHILP 159 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~-y~s~~~L~ 159 (539)
...++|..+++-+.+..... ........+||+.++++...++...+..+|.. +.....+.. ... .....--+
T Consensus 229 ~i~i~dl~tg~~~~l~~~~g-~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~--~~~~~~lt~----~~~~~~~~~wSp 301 (429)
T PRK01742 229 QLVVHDLRSGARKVVASFRG-HNGAPAFSPDGSRLAFASSKDGVLNIYVMGAN--GGTPSQLTS----GAGNNTEPSWSP 301 (429)
T ss_pred EEEEEeCCCCceEEEecCCC-ccCceeECCCCCEEEEEEecCCcEEEEEEECC--CCCeEeecc----CCCCcCCEEECC
Confidence 35568888776655543321 11234567899987776544444557777876 555544432 111 12344456
Q ss_pred CCcEEEEcCccCCeEEEE--ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEE-EcCceeEeeCCCCeEE
Q 039705 160 DGSFIVVGGRREFSYEYI--LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIF-ANDRSILLNPETNEIL 233 (539)
Q Consensus 160 dG~VyvvGG~~~~~~E~y--P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~-Gg~~~e~yDp~tn~W~ 233 (539)
||+-+++........++| .... .... .+... . |. ....+||+..++ ++....++|..++++.
T Consensus 302 DG~~i~f~s~~~g~~~I~~~~~~~--~~~~--~l~~~------~-~~-~~~SpDG~~ia~~~~~~i~~~Dl~~g~~~ 366 (429)
T PRK01742 302 DGQSILFTSDRSGSPQVYRMSASG--GGAS--LVGGR------G-YS-AQISADGKTLVMINGDNVVKQDLTSGSTE 366 (429)
T ss_pred CCCEEEEEECCCCCceEEEEECCC--CCeE--EecCC------C-CC-ccCCCCCCEEEEEcCCCEEEEECCCCCeE
Confidence 887544433222234555 2221 1111 01100 1 21 234678875544 4456777899888876
No 104
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=71.66 E-value=1.1e+02 Score=30.47 Aligned_cols=135 Identities=13% Similarity=0.174 Sum_probs=71.2
Q ss_pred EEeeCCcEEEEEc-CceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCccccc
Q 039705 207 FLSTDGNLFIFAN-DRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLA 285 (539)
Q Consensus 207 ~~~~~G~Iyv~Gg-~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~ 285 (539)
++.-|.+|.-... ..+.+||-++++.++.+ ..+. |- -+.-+- .+|+|+.+--..
T Consensus 151 wc~eD~~iLSSadd~tVRLWD~rTgt~v~sL-~~~s-----~V-tSlEvs-----------~dG~ilTia~gs------- 205 (334)
T KOG0278|consen 151 WCHEDKCILSSADDKTVRLWDHRTGTEVQSL-EFNS-----PV-TSLEVS-----------QDGRILTIAYGS------- 205 (334)
T ss_pred EeccCceEEeeccCCceEEEEeccCcEEEEE-ecCC-----CC-cceeec-----------cCCCEEEEecCc-------
Confidence 3333445544332 46889999999877544 1111 10 111111 378888874211
Q ss_pred CCCcccccCCceEEEEeeCCCCceee--e-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCC
Q 039705 286 GKGEFMNALQDCGRIEITNKSATWQR--E-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDP 362 (539)
Q Consensus 286 ~~~~~~~a~~s~~~~d~~~~~~~W~~--~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~ 362 (539)
++.-.|+ ..... . .||.--. .+-+-|+-.+||.||.+. .+..||=.|+
T Consensus 206 ----------sV~Fwda----ksf~~lKs~k~P~nV~--SASL~P~k~~fVaGged~-------------~~~kfDy~Tg 256 (334)
T KOG0278|consen 206 ----------SVKFWDA----KSFGLLKSYKMPCNVE--SASLHPKKEFFVAGGEDF-------------KVYKFDYNTG 256 (334)
T ss_pred ----------eeEEecc----ccccceeeccCccccc--cccccCCCceEEecCcce-------------EEEEEeccCC
Confidence 2333333 22222 2 5665332 234579999999999762 4566776665
Q ss_pred CCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCC
Q 039705 363 INERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 363 ~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~ 399 (539)
. .-... .-...---|+ .-..|||.+|..|+.+.
T Consensus 257 e--Ei~~~-nkgh~gpVhc-VrFSPdGE~yAsGSEDG 289 (334)
T KOG0278|consen 257 E--EIGSY-NKGHFGPVHC-VRFSPDGELYASGSEDG 289 (334)
T ss_pred c--eeeec-ccCCCCceEE-EEECCCCceeeccCCCc
Confidence 1 11110 0111112243 34579999999999754
No 105
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=71.06 E-value=1.2e+02 Score=32.21 Aligned_cols=183 Identities=13% Similarity=0.143 Sum_probs=100.8
Q ss_pred cccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccc--cccceeEEccCCcEEEEcCccCCeEEEE-
Q 039705 101 TWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAK--RWFSTQHILPDGSFIVVGGRREFSYEYI- 177 (539)
Q Consensus 101 ~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~--R~y~s~~~L~dG~VyvvGG~~~~~~E~y- 177 (539)
.+-+..++-++|+.+.++-++ .+-++||-. +.+ ++ .|+.+ +.-++.+.-+||.+.+.||.+.. ..+|
T Consensus 262 ~RVs~VafHPsG~~L~TasfD---~tWRlWD~~--tk~--El--L~QEGHs~~v~~iaf~~DGSL~~tGGlD~~-~RvWD 331 (459)
T KOG0272|consen 262 ARVSRVAFHPSGKFLGTASFD---STWRLWDLE--TKS--EL--LLQEGHSKGVFSIAFQPDGSLAATGGLDSL-GRVWD 331 (459)
T ss_pred hhheeeeecCCCceeeecccc---cchhhcccc--cch--hh--HhhcccccccceeEecCCCceeeccCccch-hheee
Confidence 444555666788888877653 445566665 322 11 13333 33456666779999999998752 2234
Q ss_pred ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC--ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEec
Q 039705 178 LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND--RSILLNPETNEILHVFPILRGGSRNYPASATSALL 255 (539)
Q Consensus 178 P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~--~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~l 255 (539)
-++. .-+ =+|....+ ..| .+.-.|||...+.|+. .+.+||.+.-+- +-.||. .++- =+-|-+
T Consensus 332 lRtg--r~i--m~L~gH~k----~I~-~V~fsPNGy~lATgs~Dnt~kVWDLR~r~~---ly~ipA-H~nl---VS~Vk~ 395 (459)
T KOG0272|consen 332 LRTG--RCI--MFLAGHIK----EIL-SVAFSPNGYHLATGSSDNTCKVWDLRMRSE---LYTIPA-HSNL---VSQVKY 395 (459)
T ss_pred cccC--cEE--EEeccccc----cee-eEeECCCceEEeecCCCCcEEEeeeccccc---ceeccc-ccch---hhheEe
Confidence 3332 111 01111111 011 1223578988888874 577888876543 333554 2221 011221
Q ss_pred ccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-cc--CCCceeceeEEecCCcEE
Q 039705 256 PIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MM--PSPRVMGEMLLLPTGDVL 332 (539)
Q Consensus 256 pl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M--~~~R~~~~~vvlpdG~I~ 332 (539)
- . ..|+.++.++++. ++-.+. +..|+.. .| +..+++.. -..+|+..+
T Consensus 396 ~-------p--~~g~fL~TasyD~----------------t~kiWs----~~~~~~~ksLaGHe~kV~s~-Dis~d~~~i 445 (459)
T KOG0272|consen 396 S-------P--QEGYFLVTASYDN----------------TVKIWS----TRTWSPLKSLAGHEGKVISL-DISPDSQAI 445 (459)
T ss_pred c-------c--cCCeEEEEcccCc----------------ceeeec----CCCcccchhhcCCccceEEE-EeccCCceE
Confidence 1 0 2678888888762 122232 4778875 55 56677655 356799999
Q ss_pred EEcCcCC
Q 039705 333 IINGAKK 339 (539)
Q Consensus 333 vvGG~~~ 339 (539)
+.++.++
T Consensus 446 ~t~s~DR 452 (459)
T KOG0272|consen 446 ATSSFDR 452 (459)
T ss_pred EEeccCc
Confidence 9888774
No 106
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=70.72 E-value=1.5e+02 Score=31.59 Aligned_cols=222 Identities=13% Similarity=0.044 Sum_probs=104.0
Q ss_pred cCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccc--ccccccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCcce
Q 039705 109 SANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHW--ELSAKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKRII 185 (539)
Q Consensus 109 l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~--~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w 185 (539)
..+++.+++|-. -.+|--.++..+|+..+. .++.. . .....+.++.++++|... .+| ......+|
T Consensus 144 f~~~~g~~vG~~------G~il~T~DgG~tW~~~~~~~~~p~~-~-~~i~~~~~~~~~ivg~~G----~v~~S~D~G~tW 211 (398)
T PLN00033 144 FKGKEGWIIGKP------AILLHTSDGGETWERIPLSPKLPGE-P-VLIKATGPKSAEMVTDEG----AIYVTSNAGRNW 211 (398)
T ss_pred EECCEEEEEcCc------eEEEEEcCCCCCceECccccCCCCC-c-eEEEEECCCceEEEeccc----eEEEECCCCCCc
Confidence 346778888632 134433333678987643 12222 1 223345456788887432 255 33324578
Q ss_pred eec--cC----ccccCC--CCCCCCcc----eEEEeeCCcEEEEEcCce-e-EeeCCCCeEEEEcccCCCCCCccCCCcc
Q 039705 186 YDL--PI----LNETTN--PSENNLYP----FVFLSTDGNLFIFANDRS-I-LLNPETNEILHVFPILRGGSRNYPASAT 251 (539)
Q Consensus 186 ~~~--~~----l~~~~~--~~~~~~yp----~~~~~~~G~Iyv~Gg~~~-e-~yDp~tn~W~~~~p~mp~~~r~yp~~g~ 251 (539)
... +. +..... ......|- .+..+.||+++++|-+-. . ..|.-...|+ .+. .+. .+.. .+.
T Consensus 212 ~~~~~~t~~~~l~~~~~s~~~g~~~y~Gsf~~v~~~~dG~~~~vg~~G~~~~s~d~G~~~W~-~~~-~~~-~~~l--~~v 286 (398)
T PLN00033 212 KAAVEETVSATLNRTVSSGISGASYYTGTFSTVNRSPDGDYVAVSSRGNFYLTWEPGQPYWQ-PHN-RAS-ARRI--QNM 286 (398)
T ss_pred eEcccccccccccccccccccccceeccceeeEEEcCCCCEEEEECCccEEEecCCCCcceE-Eec-CCC-ccce--eee
Confidence 543 11 110000 00001111 123457888888875432 2 2344334487 442 222 1111 121
Q ss_pred EEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCce-----eeeccCCCce-eceeEE
Q 039705 252 SALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATW-----QREMMPSPRV-MGEMLL 325 (539)
Q Consensus 252 av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W-----~~~~M~~~R~-~~~~vv 325 (539)
. .. .++.++++|... + .+--.+....| ...+++..+. -.+.+.
T Consensus 287 ~-~~-----------~dg~l~l~g~~G-~------------------l~~S~d~G~~~~~~~f~~~~~~~~~~~l~~v~~ 335 (398)
T PLN00033 287 G-WR-----------ADGGLWLLTRGG-G------------------LYVSKGTGLTEEDFDFEEADIKSRGFGILDVGY 335 (398)
T ss_pred e-Ec-----------CCCCEEEEeCCc-e------------------EEEecCCCCcccccceeecccCCCCcceEEEEE
Confidence 1 11 278888887532 1 01111223344 3334443332 233345
Q ss_pred ecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCC-CCC-CccceeeEeeCCCCeEEEecCC
Q 039705 326 LPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTP-TSK-PRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 326 lpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~-~~~-~R~yhs~a~llpdG~V~v~GG~ 397 (539)
..|+.++++|.. |. +|- .++.|++|+.... -.+ .-+| . .....+++.|++|-+
T Consensus 336 ~~d~~~~a~G~~--G~--------------v~~-s~D~G~tW~~~~~~~~~~~~ly-~-v~f~~~~~g~~~G~~ 390 (398)
T PLN00033 336 RSKKEAWAAGGS--GI--------------LLR-STDGGKSWKRDKGADNIAANLY-S-VKFFDDKKGFVLGND 390 (398)
T ss_pred cCCCcEEEEECC--Cc--------------EEE-eCCCCcceeEccccCCCCccee-E-EEEcCCCceEEEeCC
Confidence 568899888864 21 122 2346779998752 222 2344 2 233567999999864
No 107
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=70.68 E-value=89 Score=32.60 Aligned_cols=124 Identities=16% Similarity=0.257 Sum_probs=66.2
Q ss_pred CceeEeeCCCCeEEEEcccCCCCCCccCCCccEE-ecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceE
Q 039705 220 DRSILLNPETNEILHVFPILRGGSRNYPASATSA-LLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCG 298 (539)
Q Consensus 220 ~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av-~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~ 298 (539)
..+-|||.+.|+.. |.|..+=++| .|.+-| .-.+++.||.+ ..+-
T Consensus 215 k~VKCwDLe~nkvI----------R~YhGHlS~V~~L~lhP--------Tldvl~t~grD----------------st~R 260 (460)
T KOG0285|consen 215 KQVKCWDLEYNKVI----------RHYHGHLSGVYCLDLHP--------TLDVLVTGGRD----------------STIR 260 (460)
T ss_pred CeeEEEechhhhhH----------HHhccccceeEEEeccc--------cceeEEecCCc----------------ceEE
Confidence 35789999999754 4452222222 333332 46788899876 1345
Q ss_pred EEEeeCCCCceeee--ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCC
Q 039705 299 RIEITNKSATWQRE--MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKP 376 (539)
Q Consensus 299 ~~d~~~~~~~W~~~--~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~ 376 (539)
.+|+....+.-... ..+..|++... .|+.|+- |-.+ .++-+||-..+ ++-..+... .
T Consensus 261 vWDiRtr~~V~~l~GH~~~V~~V~~~~---~dpqvit-~S~D-------------~tvrlWDl~ag--kt~~tlt~h--k 319 (460)
T KOG0285|consen 261 VWDIRTRASVHVLSGHTNPVASVMCQP---TDPQVIT-GSHD-------------STVRLWDLRAG--KTMITLTHH--K 319 (460)
T ss_pred EeeecccceEEEecCCCCcceeEEeec---CCCceEE-ecCC-------------ceEEEeeeccC--ceeEeeecc--c
Confidence 57775322222222 44555655443 2888753 3332 16788998776 133333221 2
Q ss_pred ccceeeEeeCCCCeEEEecCCCC
Q 039705 377 RMCHSTSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 377 R~yhs~a~llpdG~V~v~GG~~~ 399 (539)
+.-- +.+|-|+-..|+.++-++
T Consensus 320 ksvr-al~lhP~e~~fASas~dn 341 (460)
T KOG0285|consen 320 KSVR-ALCLHPKENLFASASPDN 341 (460)
T ss_pred ceee-EEecCCchhhhhccCCcc
Confidence 2211 234557777777777543
No 108
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=68.84 E-value=1.7e+02 Score=31.48 Aligned_cols=221 Identities=16% Similarity=0.250 Sum_probs=111.5
Q ss_pred CCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccc-cceeEEccCCc-EEEEcCccCCeEEEE-ecCCCccee
Q 039705 110 ANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRW-FSTQHILPDGS-FIVVGGRREFSYEYI-LKEGKRIIY 186 (539)
Q Consensus 110 ~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~-y~s~~~L~dG~-VyvvGG~~~~~~E~y-P~~~~~~w~ 186 (539)
+.-.+++++|.+ +.+++|--....|. .+.+ |...+. -.+++..++|. +++++|+... +-.| -.+. +-.
T Consensus 223 p~~plllvaG~d---~~lrifqvDGk~N~--~lqS-~~l~~fPi~~a~f~p~G~~~i~~s~rrky-~ysyDle~a--k~~ 293 (514)
T KOG2055|consen 223 PTAPLLLVAGLD---GTLRIFQVDGKVNP--KLQS-IHLEKFPIQKAEFAPNGHSVIFTSGRRKY-LYSYDLETA--KVT 293 (514)
T ss_pred CCCceEEEecCC---CcEEEEEecCccCh--hhee-eeeccCccceeeecCCCceEEEecccceE-EEEeecccc--ccc
Confidence 456788888874 45667644311444 2322 443332 24566777998 8888988542 1223 2221 222
Q ss_pred eccCccccCCCCCCCCcceEEEeeCCcEEEEEcCceeE--eeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCC
Q 039705 187 DLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRSIL--LNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNS 264 (539)
Q Consensus 187 ~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~e~--yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~ 264 (539)
.+..+... +....-.--+.++++..++.|+..++ ..-+|+.|.. --.|++. . +...+-
T Consensus 294 k~~~~~g~----e~~~~e~FeVShd~~fia~~G~~G~I~lLhakT~eli~-s~KieG~--v-----~~~~fs-------- 353 (514)
T KOG2055|consen 294 KLKPPYGV----EEKSMERFEVSHDSNFIAIAGNNGHIHLLHAKTKELIT-SFKIEGV--V-----SDFTFS-------- 353 (514)
T ss_pred cccCCCCc----ccchhheeEecCCCCeEEEcccCceEEeehhhhhhhhh-eeeeccE--E-----eeEEEe--------
Confidence 12111111 00001111466889888888876554 4556777752 2233331 1 111111
Q ss_pred CCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCC--Cceeee-ccCCCceeceeE-EecCCcEEEEcCcCCC
Q 039705 265 NAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKS--ATWQRE-MMPSPRVMGEML-LLPTGDVLIINGAKKG 340 (539)
Q Consensus 265 ~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~--~~W~~~-~M~~~R~~~~~v-vlpdG~I~vvGG~~~g 340 (539)
...-+||++||.. .+..+|+.... .+|... .. ++..+ .-++|..+++|-.. |
T Consensus 354 -Sdsk~l~~~~~~G-----------------eV~v~nl~~~~~~~rf~D~G~v-----~gts~~~S~ng~ylA~GS~~-G 409 (514)
T KOG2055|consen 354 -SDSKELLASGGTG-----------------EVYVWNLRQNSCLHRFVDDGSV-----HGTSLCISLNGSYLATGSDS-G 409 (514)
T ss_pred -cCCcEEEEEcCCc-----------------eEEEEecCCcceEEEEeecCcc-----ceeeeeecCCCceEEeccCc-c
Confidence 0245678887753 24455554321 345544 33 22222 23599977766432 3
Q ss_pred CCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccce-----eeEeeCCCCeEEEecCCCC
Q 039705 341 TAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCH-----STSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 341 ~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yh-----s~a~llpdG~V~v~GG~~~ 399 (539)
-+-+||-++- +...+|-|+.+.-- +.-..-+|+.||.+-+...
T Consensus 410 ------------iVNIYd~~s~----~~s~~PkPik~~dNLtt~Itsl~Fn~d~qiLAiaS~~~ 457 (514)
T KOG2055|consen 410 ------------IVNIYDGNSC----FASTNPKPIKTVDNLTTAITSLQFNHDAQILAIASRVK 457 (514)
T ss_pred ------------eEEEeccchh----hccCCCCchhhhhhhheeeeeeeeCcchhhhhhhhhcc
Confidence 4678986553 55555555544322 1122357888888777543
No 109
>COG5184 ATS1 Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]
Probab=68.36 E-value=1.8e+02 Score=31.50 Aligned_cols=109 Identities=16% Similarity=0.220 Sum_probs=55.6
Q ss_pred ceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCC-----CCccceeeEeeCCCCeEEEec
Q 039705 321 GEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTS-----KPRMCHSTSVVLPDGKILVAG 395 (539)
Q Consensus 321 ~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~-----~~R~yhs~a~llpdG~V~v~G 395 (539)
|..++.-||.+|..|=.+.+.-|... +. .++- + .|+.++... ..+..|+.+. .-||.||.=|
T Consensus 351 H~l~L~~~G~l~a~Gr~~~~qlg~~~--~~-----~~~~--~---~~~~ls~~~~~~~v~~gt~~~~~~-t~~gsvy~wG 417 (476)
T COG5184 351 HSLILRKDGTLYAFGRGDRGQLGIQE--EI-----TIDV--S---TPTKLSVAIKLEQVACGTHHNIAR-TDDGSVYSWG 417 (476)
T ss_pred eEEEEecCceEEEecCCccccccCcc--cc-----eeec--C---CccccccccceEEEEecCccceee-ccCCceEEec
Confidence 45566779999999965544332110 00 1111 1 333333111 2455666655 4688999999
Q ss_pred CCCCCCCccCCCCCCCcceeeEEecCCCCCCCcCCCCCceeecCCCc-eeecCCEEEEEEEec
Q 039705 396 SNPHSRYNLTSGSKYPTELRIEKFYPPYFDESFASYRPSIVSKFKGK-MLKYGQNFVIQFKLD 457 (539)
Q Consensus 396 G~~~~~~~~~~~~~~p~~~~vE~y~Ppyl~~~~~~~RP~i~~~~~p~-~~~~g~~~~v~~~~~ 457 (539)
-+.+..... + ..-|...+|.+- |++..+.- +. .+.||..|.|-....
T Consensus 418 ~ge~gnlG~--g------~~~~~~~~pt~i------~~~~~~~~-~~i~~g~~~~~~v~~~~~ 465 (476)
T COG5184 418 WGEHGNLGN--G------PKEADVLVPTLI------RQPLLSGH-NIILAGYGNQFSVIEETM 465 (476)
T ss_pred CchhhhccC--C------chhhhccccccc------cccccCCC-ceEEeccCcceEEEecch
Confidence 876653211 1 122445556554 33222220 22 246888888776543
No 110
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=65.30 E-value=2.1e+02 Score=31.03 Aligned_cols=232 Identities=17% Similarity=0.178 Sum_probs=121.0
Q ss_pred eecCCCcEEEecCCCCCCCeEEEEeCCCCccce-eecccccc-cccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCc
Q 039705 107 GLSANGTIVISGGWSSRGRSVRYLSGCYHACYW-KEHHWELS-AKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKR 183 (539)
Q Consensus 107 ~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W-~~~~~~m~-~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~ 183 (539)
.+..||+.++.+.. .+.+.+++.. + .+ ...-. +. ..++-...+.-+||+ |++.|.+..++.+| ...+
T Consensus 166 ~fs~~g~~l~~~~~---~~~i~~~~~~--~-~~~~~~~~-l~~h~~~v~~~~fs~d~~-~l~s~s~D~tiriwd~~~~-- 235 (456)
T KOG0266|consen 166 DFSPDGRALAAASS---DGLIRIWKLE--G-IKSNLLRE-LSGHTRGVSDVAFSPDGS-YLLSGSDDKTLRIWDLKDD-- 235 (456)
T ss_pred EEcCCCCeEEEccC---CCcEEEeecc--c-ccchhhcc-ccccccceeeeEECCCCc-EEEEecCCceEEEeeccCC--
Confidence 34678999777654 3566777664 2 22 11111 32 224444555666888 56666666778888 4222
Q ss_pred ceeeccCccccCCCCCCCCcceE-EEeeCCcEEEEEcC--ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccC
Q 039705 184 IIYDLPILNETTNPSENNLYPFV-FLSTDGNLFIFAND--RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQ 260 (539)
Q Consensus 184 ~w~~~~~l~~~~~~~~~~~yp~~-~~~~~G~Iyv~Gg~--~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~ 260 (539)
.-. +..+.... .|.+. .-.++|++.+.|+. .+.+||.++.+-.+.+ ++. .. + -.++.++
T Consensus 236 ~~~-~~~l~gH~------~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l---~~h-s~-~--is~~~f~---- 297 (456)
T KOG0266|consen 236 GRN-LKTLKGHS------TYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKL---KGH-SD-G--ISGLAFS---- 297 (456)
T ss_pred CeE-EEEecCCC------CceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEee---ecc-CC-c--eEEEEEC----
Confidence 111 11111111 12222 23468899999885 4788999986644333 321 11 0 1122222
Q ss_pred CCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCcee-----ee-ccCCCceeceeE-EecCCcEEE
Q 039705 261 DPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQ-----RE-MMPSPRVMGEML-LLPTGDVLI 333 (539)
Q Consensus 261 ~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~-----~~-~M~~~R~~~~~v-vlpdG~I~v 333 (539)
.++.+++.+..+ ..+..+|.. +|. .. ....+. ....+ --|||+.++
T Consensus 298 ------~d~~~l~s~s~d----------------~~i~vwd~~----~~~~~~~~~~~~~~~~~-~~~~~~fsp~~~~ll 350 (456)
T KOG0266|consen 298 ------PDGNLLVSASYD----------------GTIRVWDLE----TGSKLCLKLLSGAENSA-PVTSVQFSPNGKYLL 350 (456)
T ss_pred ------CCCCEEEEcCCC----------------ccEEEEECC----CCceeeeecccCCCCCC-ceeEEEECCCCcEEE
Confidence 278888888654 235567764 233 11 222221 11222 348999888
Q ss_pred EcCcCCCCCCcccCCCCCCccEEEcCCCC-CCCceEecCCCCCCccceeeEeeCCCCeEEEecCCCCCCCccCCCCCCCc
Q 039705 334 INGAKKGTAGWNFATDPNTTPVLYEPDDP-INERFSELTPTSKPRMCHSTSVVLPDGKILVAGSNPHSRYNLTSGSKYPT 412 (539)
Q Consensus 334 vGG~~~g~~g~~~~~~~~~~~e~YdP~t~-~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~~~~~~~~~~~~~p~ 412 (539)
++..+. ++-+||.... .-.+|+..... .|...+ .+..++|+.++.|+.+.
T Consensus 351 ~~~~d~-------------~~~~w~l~~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~~i~sg~~d~------------- 401 (456)
T KOG0266|consen 351 SASLDR-------------TLKLWDLRSGKSVGTYTGHSNL--VRCIFS-PTLSTGGKLIYSGSEDG------------- 401 (456)
T ss_pred EecCCC-------------eEEEEEccCCcceeeecccCCc--ceeEec-ccccCCCCeEEEEeCCc-------------
Confidence 887652 3456666543 11133333222 132222 34467999999998633
Q ss_pred ceeeEEecCCCC
Q 039705 413 ELRIEKFYPPYF 424 (539)
Q Consensus 413 ~~~vE~y~Ppyl 424 (539)
.|++|++..+
T Consensus 402 --~v~~~~~~s~ 411 (456)
T KOG0266|consen 402 --SVYVWDSSSG 411 (456)
T ss_pred --eEEEEeCCcc
Confidence 4677777764
No 111
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=63.65 E-value=84 Score=33.34 Aligned_cols=85 Identities=13% Similarity=0.060 Sum_probs=49.0
Q ss_pred EEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceee-cccccccccccceeEEccCCc
Q 039705 84 EYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKE-HHWELSAKRWFSTQHILPDGS 162 (539)
Q Consensus 84 ~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~-~~~~m~~~R~y~s~~~L~dG~ 162 (539)
+||..|++=--+...|..--...++-+||.++.+||.+ .-.+++|-. +..-.- ++. .-+--.++.--+||.
T Consensus 287 lWD~~tk~ElL~QEGHs~~v~~iaf~~DGSL~~tGGlD---~~~RvWDlR--tgr~im~L~g---H~k~I~~V~fsPNGy 358 (459)
T KOG0272|consen 287 LWDLETKSELLLQEGHSKGVFSIAFQPDGSLAATGGLD---SLGRVWDLR--TGRCIMFLAG---HIKEILSVAFSPNGY 358 (459)
T ss_pred hcccccchhhHhhcccccccceeEecCCCceeeccCcc---chhheeecc--cCcEEEEecc---cccceeeEeECCCce
Confidence 45555544332333343333344667899999999974 334667766 433221 111 112335677788999
Q ss_pred EEEEcCccCCeEEEE
Q 039705 163 FIVVGGRREFSYEYI 177 (539)
Q Consensus 163 VyvvGG~~~~~~E~y 177 (539)
.++.||.++ ++.+|
T Consensus 359 ~lATgs~Dn-t~kVW 372 (459)
T KOG0272|consen 359 HLATGSSDN-TCKVW 372 (459)
T ss_pred EEeecCCCC-cEEEe
Confidence 999999874 34444
No 112
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. 
The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=61.20 E-value=25 Score=35.50 Aligned_cols=83 Identities=14% Similarity=0.223 Sum_probs=50.2
Q ss_pred ccCCCceeceeE-EecCCc--EEEEcCcCC------CCCCcccCCCCCCccEEEcCCCCCCCceE--ecCCCCCCcccee
Q 039705 313 MMPSPRVMGEML-LLPTGD--VLIINGAKK------GTAGWNFATDPNTTPVLYEPDDPINERFS--ELTPTSKPRMCHS 381 (539)
Q Consensus 313 ~M~~~R~~~~~v-vlpdG~--I~vvGG~~~------g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt--~~a~~~~~R~yhs 381 (539)
+.|.+|..|.+- +---|| +.++||... -+..|+.--+....+.+.|.+-. -.+ .++.+......|-
T Consensus 83 dvP~aRYGHt~~vV~SrGKta~VlFGGRSY~P~~qRTTenWNsVvDC~P~VfLiDleFG---C~tah~lpEl~dG~SFHv 159 (337)
T PF03089_consen 83 DVPEARYGHTINVVHSRGKTACVLFGGRSYMPPGQRTTENWNSVVDCPPQVFLIDLEFG---CCTAHTLPELQDGQSFHV 159 (337)
T ss_pred CCCcccccceEEEEEECCcEEEEEECCcccCCccccchhhcceeccCCCeEEEEecccc---ccccccchhhcCCeEEEE
Confidence 689999888753 233444 445677542 12223322222335667777766 555 3566777788884
Q ss_pred eEeeCCCCeEEEecCCCCC
Q 039705 382 TSVVLPDGKILVAGSNPHS 400 (539)
Q Consensus 382 ~a~llpdG~V~v~GG~~~~ 400 (539)
+ |.-+..||+.||....
T Consensus 160 s--lar~D~VYilGGHsl~ 176 (337)
T PF03089_consen 160 S--LARNDCVYILGGHSLE 176 (337)
T ss_pred E--EecCceEEEEccEEcc
Confidence 3 3459999999997554
No 113
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=59.57 E-value=1e+02 Score=31.38 Aligned_cols=89 Identities=15% Similarity=0.157 Sum_probs=50.4
Q ss_pred cccccceeEEEEECCCCCEEeCccCC-CcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccccccccccc
Q 039705 74 KYVDYRALAVEYDAESAAIRPLKILT-DTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWF 152 (539)
Q Consensus 74 g~~~~~~~~~~yDp~t~~w~~l~~~~-~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y 152 (539)
|+.|+ ....||..+..-..+-.-. -.+|-... ..--.+|.|||+ ++++++||+ .. -....- ++..+-|
T Consensus 71 G~~dg--~vr~~Dln~~~~~~igth~~~i~ci~~~--~~~~~vIsgsWD---~~ik~wD~R--~~-~~~~~~-d~~kkVy 139 (323)
T KOG1036|consen 71 GGLDG--QVRRYDLNTGNEDQIGTHDEGIRCIEYS--YEVGCVISGSWD---KTIKFWDPR--NK-VVVGTF-DQGKKVY 139 (323)
T ss_pred eccCc--eEEEEEecCCcceeeccCCCceEEEEee--ccCCeEEEcccC---ccEEEEecc--cc-cccccc-ccCceEE
Confidence 44443 4678999887766654322 23443222 233456789985 789999998 31 112221 3444544
Q ss_pred ceeEEccCCcEEEEcCccCCeEEEE
Q 039705 153 STQHILPDGSFIVVGGRREFSYEYI 177 (539)
Q Consensus 153 ~s~~~L~dG~VyvvGG~~~~~~E~y 177 (539)
++ .+ .|..+|+|+.+. .+-+|
T Consensus 140 -~~-~v-~g~~LvVg~~~r-~v~iy 160 (323)
T KOG1036|consen 140 -CM-DV-SGNRLVVGTSDR-KVLIY 160 (323)
T ss_pred -EE-ec-cCCEEEEeecCc-eEEEE
Confidence 33 34 467888888764 34556
No 114
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=59.42 E-value=1.1e+02 Score=32.27 Aligned_cols=81 Identities=11% Similarity=0.088 Sum_probs=46.8
Q ss_pred eeEEEEECCCCCE-EeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccc-cccccceeEE
Q 039705 80 ALAVEYDAESAAI-RPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELS-AKRWFSTQHI 157 (539)
Q Consensus 80 ~~~~~yDp~t~~w-~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~-~~R~y~s~~~ 157 (539)
..+.+||..|++- ..+. .++ -|-+..+-.||.++++.-. .++++++||. +.+-.... |. .+---.-++.
T Consensus 154 n~v~iWnv~tgeali~l~-hpd-~i~S~sfn~dGs~l~Ttck---DKkvRv~dpr--~~~~v~e~--~~heG~k~~Raif 224 (472)
T KOG0303|consen 154 NTVSIWNVGTGEALITLD-HPD-MVYSMSFNRDGSLLCTTCK---DKKVRVIDPR--RGTVVSEG--VAHEGAKPARAIF 224 (472)
T ss_pred ceEEEEeccCCceeeecC-CCC-eEEEEEeccCCceeeeecc---cceeEEEcCC--CCcEeeec--ccccCCCcceeEE
Confidence 3456777777653 2333 222 2333345568888877643 3899999998 55432221 22 1111234667
Q ss_pred ccCCcEEEEcCc
Q 039705 158 LPDGSFIVVGGR 169 (539)
Q Consensus 158 L~dG~VyvvGG~ 169 (539)
|.+|+++..|=+
T Consensus 225 l~~g~i~tTGfs 236 (472)
T KOG0303|consen 225 LASGKIFTTGFS 236 (472)
T ss_pred eccCceeeeccc
Confidence 889998888865
No 115
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=59.03 E-value=2.7e+02 Score=30.38 Aligned_cols=27 Identities=19% Similarity=0.331 Sum_probs=19.1
Q ss_pred EeeCCcEEEEEc-CceeEeeCCCCe--EEE
Q 039705 208 LSTDGNLFIFAN-DRSILLNPETNE--ILH 234 (539)
Q Consensus 208 ~~~~G~Iyv~Gg-~~~e~yDp~tn~--W~~ 234 (539)
++.+|+||+... ..+..+|.++++ |..
T Consensus 58 vv~~g~vy~~~~~g~l~AlD~~tG~~~W~~ 87 (488)
T cd00216 58 LVVDGDMYFTTSHSALFALDAATGKVLWRY 87 (488)
T ss_pred EEECCEEEEeCCCCcEEEEECCCChhhcee
Confidence 456899998754 457788988755 753
No 116
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=58.61 E-value=1.9e+02 Score=28.33 Aligned_cols=27 Identities=11% Similarity=0.325 Sum_probs=20.6
Q ss_pred EeeCCcEEEEEcC--ceeEeeCCCCeEEE
Q 039705 208 LSTDGNLFIFAND--RSILLNPETNEILH 234 (539)
Q Consensus 208 ~~~~G~Iyv~Gg~--~~e~yDp~tn~W~~ 234 (539)
+-|.|++.+.|-. +..+||.+.++-.+
T Consensus 239 vdpsgrll~sg~~dssc~lydirg~r~iq 267 (350)
T KOG0641|consen 239 VDPSGRLLASGHADSSCMLYDIRGGRMIQ 267 (350)
T ss_pred ECCCcceeeeccCCCceEEEEeeCCceee
Confidence 3468999999864 46789999887553
No 117
>PTZ00420 coronin; Provisional
Probab=58.12 E-value=3.1e+02 Score=30.77 Aligned_cols=107 Identities=17% Similarity=0.117 Sum_probs=53.5
Q ss_pred CCcEEEecCCCCCCCeEEEEeCCCCccc--eeecc-c--cccc-ccccceeEEccCCc-EEEEcCccCCeEEEE-ecCCC
Q 039705 111 NGTIVISGGWSSRGRSVRYLSGCYHACY--WKEHH-W--ELSA-KRWFSTQHILPDGS-FIVVGGRREFSYEYI-LKEGK 182 (539)
Q Consensus 111 dG~l~v~GG~~~g~~~v~~ydP~~~t~~--W~~~~-~--~m~~-~R~y~s~~~L~dG~-VyvvGG~~~~~~E~y-P~~~~ 182 (539)
++.++++||.+ ..+++||.. +.. -.... . .+.. .+.-.+++.-+++. +++.||.+ .++.+| ..+.
T Consensus 86 ~~~lLASgS~D---gtIrIWDi~--t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~D-gtIrIWDl~tg- 158 (568)
T PTZ00420 86 FSEILASGSED---LTIRVWEIP--HNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFD-SFVNIWDIENE- 158 (568)
T ss_pred CCCEEEEEeCC---CeEEEEECC--CCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCC-CeEEEEECCCC-
Confidence 37888998863 578899865 221 11000 0 0111 11112233334565 44556654 567777 4433
Q ss_pred cceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC--ceeEeeCCCCeEE
Q 039705 183 RIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND--RSILLNPETNEIL 233 (539)
Q Consensus 183 ~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~--~~e~yDp~tn~W~ 233 (539)
........ . + ..+ .+...++|++++.++. .+.+||+++++-.
T Consensus 159 ~~~~~i~~-~---~----~V~-SlswspdG~lLat~s~D~~IrIwD~Rsg~~i 202 (568)
T PTZ00420 159 KRAFQINM-P---K----KLS-SLKWNIKGNLLSGTCVGKHMHIIDPRKQEIA 202 (568)
T ss_pred cEEEEEec-C---C----cEE-EEEECCCCCEEEEEecCCEEEEEECCCCcEE
Confidence 11111110 0 0 001 1223468999998863 5889999987644
No 118
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=57.63 E-value=2.7e+02 Score=29.79 Aligned_cols=72 Identities=18% Similarity=0.171 Sum_probs=40.3
Q ss_pred ceeeeccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCC-----ceEecCCCCCCccceee
Q 039705 308 TWQREMMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINE-----RFSELTPTSKPRMCHST 382 (539)
Q Consensus 308 ~W~~~~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~-----~Wt~~a~~~~~R~yhs~ 382 (539)
.|+..+.+.++...++...+||.++++|... . +|- .++.|+ +|+++..........+
T Consensus 271 ~W~~~~~~~~~~l~~v~~~~dg~l~l~g~~G--~--------------l~~-S~d~G~~~~~~~f~~~~~~~~~~~l~~- 332 (398)
T PLN00033 271 YWQPHNRASARRIQNMGWRADGGLWLLTRGG--G--------------LYV-SKGTGLTEEDFDFEEADIKSRGFGILD- 332 (398)
T ss_pred ceEEecCCCccceeeeeEcCCCCEEEEeCCc--e--------------EEE-ecCCCCcccccceeecccCCCCcceEE-
Confidence 3898766666555555667899999988532 1 111 122233 3454432211222233
Q ss_pred EeeCCCCeEEEecCC
Q 039705 383 SVVLPDGKILVAGSN 397 (539)
Q Consensus 383 a~llpdG~V~v~GG~ 397 (539)
+....|+.++++|..
T Consensus 333 v~~~~d~~~~a~G~~ 347 (398)
T PLN00033 333 VGYRSKKEAWAAGGS 347 (398)
T ss_pred EEEcCCCcEEEEECC
Confidence 335679999999886
No 119
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=56.90 E-value=2e+02 Score=29.92 Aligned_cols=136 Identities=15% Similarity=0.185 Sum_probs=74.6
Q ss_pred EeeCCcEEEEEc-CceeEeeCCCCe--EEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccc
Q 039705 208 LSTDGNLFIFAN-DRSILLNPETNE--ILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVL 284 (539)
Q Consensus 208 ~~~~G~Iyv~Gg-~~~e~yDp~tn~--W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~ 284 (539)
+.-||+||+... .....+|+.+.+ |...+.. . .... .+. ++. .+|+||+-.. + +
T Consensus 65 ~~~dg~v~~~~~~G~i~A~d~~~g~~~W~~~~~~--~--~~~~-~~~-~~~-----------~~G~i~~g~~-~-g---- 121 (370)
T COG1520 65 ADGDGTVYVGTRDGNIFALNPDTGLVKWSYPLLG--A--VAQL-SGP-ILG-----------SDGKIYVGSW-D-G---- 121 (370)
T ss_pred EeeCCeEEEecCCCcEEEEeCCCCcEEecccCcC--c--ceec-cCc-eEE-----------eCCeEEEecc-c-c----
Confidence 455888888632 257789999877 7632211 0 0110 111 111 2688766432 2 1
Q ss_pred cCCCcccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCC
Q 039705 285 AGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPI 363 (539)
Q Consensus 285 ~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~ 363 (539)
...++|..+....|+.. +.. .+.... .+..|+.||+.... + .+.+.|+++..
T Consensus 122 -----------~~y~ld~~~G~~~W~~~~~~~-~~~~~~-~v~~~~~v~~~s~~--g------------~~~al~~~tG~ 174 (370)
T COG1520 122 -----------KLYALDASTGTLVWSRNVGGS-PYYASP-PVVGDGTVYVGTDD--G------------HLYALNADTGT 174 (370)
T ss_pred -----------eEEEEECCCCcEEEEEecCCC-eEEecC-cEEcCcEEEEecCC--C------------eEEEEEccCCc
Confidence 35677775445789986 442 455444 45569999986521 1 35677777652
Q ss_pred CCceEecCCC-CCCccceeeEeeCCCCeEEEecC
Q 039705 364 NERFSELTPT-SKPRMCHSTSVVLPDGKILVAGS 396 (539)
Q Consensus 364 g~~Wt~~a~~-~~~R~yhs~a~llpdG~V~v~GG 396 (539)
..|+.-.+. ...+.+-+.+ .-+|.||+..-
T Consensus 175 -~~W~~~~~~~~~~~~~~~~~--~~~~~vy~~~~ 205 (370)
T COG1520 175 -LKWTYETPAPLSLSIYGSPA--IASGTVYVGSD 205 (370)
T ss_pred -EEEEEecCCccccccccCce--eecceEEEecC
Confidence 268743332 2334433333 34888888754
No 120
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=56.89 E-value=91 Score=31.60 Aligned_cols=99 Identities=12% Similarity=0.158 Sum_probs=58.4
Q ss_pred CCCceEEccCCCccc-ce-EEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEeC
Q 039705 18 FKGKWELASENSGIS-AM-HIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRPL 95 (539)
Q Consensus 18 ~~g~W~~~~~~~~~~-~~-~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~l 95 (539)
...+|+.+... +. .+ ++... .+.++|+.|.+..+.. .......||.++++|..+
T Consensus 24 ~~~qW~~~g~~--i~G~V~~l~~~-~~~~Llv~G~ft~~~~---------------------~~~~la~yd~~~~~w~~~ 79 (281)
T PF12768_consen 24 DNSQWSSPGNG--ISGTVTDLQWA-SNNQLLVGGNFTLNGT---------------------NSSNLATYDFKNQTWSSL 79 (281)
T ss_pred CCCEeecCCCC--ceEEEEEEEEe-cCCEEEEEEeeEECCC---------------------CceeEEEEecCCCeeeec
Confidence 45889998543 32 23 33434 6888888887653210 023467899999999988
Q ss_pred ccCC-----CcccccceecCCC-cEEEecCCCCCCCeEEEEeCCCCccceeeccc
Q 039705 96 KILT-----DTWSSSGGLSANG-TIVISGGWSSRGRSVRYLSGCYHACYWKEHHW 144 (539)
Q Consensus 96 ~~~~-----~~~c~~~~~l~dG-~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~ 144 (539)
.... .+.-......-|+ .+++.|....+..-+..|| ..+|..+..
T Consensus 80 ~~~~s~~ipgpv~a~~~~~~d~~~~~~aG~~~~g~~~l~~~d----Gs~W~~i~~ 130 (281)
T PF12768_consen 80 GGGSSNSIPGPVTALTFISNDGSNFWVAGRSANGSTFLMKYD----GSSWSSIGS 130 (281)
T ss_pred CCcccccCCCcEEEEEeeccCCceEEEeceecCCCceEEEEc----CCceEeccc
Confidence 7632 1111111112244 5666666545567777886 457987653
No 121
>cd02849 CGTase_C_term Cgtase (cyclodextrin glycosyltransferase) C-terminus domain. Enzymes such as amylases, cyclomaltodextrinase (CDase), and CGTase degrade starch to smaller oligosaccharides by hydrolyzing the alpha-D-(1,4) linkages between glucose residues present in starch. In the case of CGTases, an additional cyclization reaction is catalyzed yielding mixtures of cyclic oligosaccharides which are referred to as alpha-, beta-, or gamma-cyclodextrins (CDs) (consisting of six, seven, or eight glucoses, respectively). CGTases are characterized as depending on the major product of the cyclization reaction. Besides having similar catalytic site residues, amylases and CGTases contain carbohydrate binding domains that are distant from the active site and which are implicated in attaching the enzyme to raw starch granules and in guiding the amylose chain into the active site. The C-terminus of CGTase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These d
Probab=54.95 E-value=1.1e+02 Score=24.77 Aligned_cols=75 Identities=19% Similarity=0.066 Sum_probs=48.1
Q ss_pred CCceeecCCCceeecCCEEEEEEEecccccccCcEEEEEEcCCccccccCCCceeEeccceeeeecCCceEEEEEEcCCC
Q 039705 432 RPSIVSKFKGKMLKYGQNFVIQFKLDELEVSLNDLKVTMYAPPFTTHGVSMGQRLLVPATKELIDVGSGIFQVSVMAPPT 511 (539)
Q Consensus 432 RP~i~~~~~p~~~~~g~~~~v~~~~~~~~~~~~~~~v~l~~~~~~TH~~n~~QR~~~L~~~~~~~~g~~~~~~~~~~P~~ 511 (539)
-|.|.+.. |..-..|++++|+=+.-. ....+|. +. + .+.++... +. ..+++++|..
T Consensus 2 ~P~I~~i~-P~~g~~G~~VtI~G~gFg----~~~~~V~-~g----------~---~~a~v~s~--sd---t~I~~~vP~~ 57 (81)
T cd02849 2 TPLIGHVG-PMMGKAGNTVTISGEGFG----SAPGTVY-FG----------T---TAATVISW--SD---TRIVVTVPNV 57 (81)
T ss_pred CCEEeeEc-CCCCCCCCEEEEEEECCC----CCCcEEE-EC----------C---EEeEEEEE--CC---CEEEEEeCCC
Confidence 48899987 988889999888744211 1122331 11 1 23334332 22 4789999964
Q ss_pred CCcCCCcceEEEEEc-CCCCCccE
Q 039705 512 AKIAPPSFYLLFVVY-RQVPSPGT 534 (539)
Q Consensus 512 ~~~~ppG~ymlf~~~-~gvPS~~~ 534 (539)
++|.|-++|.. +|.=|.+.
T Consensus 58 ----~aG~~~V~V~~~~G~~Sn~~ 77 (81)
T cd02849 58 ----PAGNYDVTVKTADGATSNGY 77 (81)
T ss_pred ----CCceEEEEEEeCCCcccCcE
Confidence 78999999997 68777644
No 122
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=53.82 E-value=2.6e+02 Score=28.75 Aligned_cols=97 Identities=18% Similarity=0.277 Sum_probs=52.4
Q ss_pred CCcEEEEEc---CceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCC
Q 039705 211 DGNLFIFAN---DRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGK 287 (539)
Q Consensus 211 ~G~Iyv~Gg---~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~ 287 (539)
++.+.+|+- .-..+||+.+.+-.+.+.+ +. .|+| .|.++.-+ +|+.+..==.+ +.
T Consensus 16 ~~~avafaRRPG~~~~v~D~~~g~~~~~~~a-~~-gRHF--yGHg~fs~-----------dG~~LytTEnd---~~---- 73 (305)
T PF07433_consen 16 RPEAVAFARRPGTFALVFDCRTGQLLQRLWA-PP-GRHF--YGHGVFSP-----------DGRLLYTTEND---YE---- 73 (305)
T ss_pred CCeEEEEEeCCCcEEEEEEcCCCceeeEEcC-CC-CCEE--ecCEEEcC-----------CCCEEEEeccc---cC----
Confidence 456666664 3467899999886655544 33 4554 47777543 45544432111 11
Q ss_pred CcccccCCceEEEEeeCCCCceeee-ccC-CCceeceeEEecCC-cEEEEcC
Q 039705 288 GEFMNALQDCGRIEITNKSATWQRE-MMP-SPRVMGEMLLLPTG-DVLIING 336 (539)
Q Consensus 288 ~~~~~a~~s~~~~d~~~~~~~W~~~-~M~-~~R~~~~~vvlpdG-~I~vvGG 336 (539)
..--.+.+||.. ..++.. ..+ .+--=|....+||| .+.|.+|
T Consensus 74 ----~g~G~IgVyd~~---~~~~ri~E~~s~GIGPHel~l~pDG~tLvVANG 118 (305)
T PF07433_consen 74 ----TGRGVIGVYDAA---RGYRRIGEFPSHGIGPHELLLMPDGETLVVANG 118 (305)
T ss_pred ----CCcEEEEEEECc---CCcEEEeEecCCCcChhhEEEcCCCCEEEEEcC
Confidence 111234556653 455543 332 23334677889999 5555555
No 123
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=53.69 E-value=3.5e+02 Score=31.13 Aligned_cols=50 Identities=16% Similarity=0.266 Sum_probs=37.2
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGC 133 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~ 133 (539)
...+|+.+|++-..+-..|..--++-++-++|.+++.|-|+ ++|+++|--
T Consensus 459 ~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWD---kTVRiW~if 508 (893)
T KOG0291|consen 459 EIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWD---KTVRIWDIF 508 (893)
T ss_pred EEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEecccc---ceEEEEEee
Confidence 35678999999888777665433334567899999999884 788888764
No 124
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=50.55 E-value=2e+02 Score=29.51 Aligned_cols=110 Identities=15% Similarity=0.215 Sum_probs=68.2
Q ss_pred cccceEEEecCCCCEEEEEccccCCCCCccCCCcccccCCCccccccccceeEEEEECCCCCEEeC-ccC-CCcccccce
Q 039705 30 GISAMHIILFPNTNKAIMLDAVSLGPSNVRLPVGIYRLNPGAWQKYVDYRALAVEYDAESAAIRPL-KIL-TDTWSSSGG 107 (539)
Q Consensus 30 ~~~~~~~~ll~~~gkv~~~g~~~~~~~~~~~~~g~~~~~~~~~~g~~~~~~~~~~yDp~t~~w~~l-~~~-~~~~c~~~~ 107 (539)
|-+.+.++.-|....+++|.+- | | .+..+||+.+++-... ... ...|..+++
T Consensus 4 P~RgH~~a~~p~~~~avafaRR---P-------G----------------~~~~v~D~~~g~~~~~~~a~~gRHFyGHg~ 57 (305)
T PF07433_consen 4 PARGHGVAAHPTRPEAVAFARR---P-------G----------------TFALVFDCRTGQLLQRLWAPPGRHFYGHGV 57 (305)
T ss_pred CccccceeeCCCCCeEEEEEeC---C-------C----------------cEEEEEEcCCCceeeEEcCCCCCEEecCEE
Confidence 4455555665447888888762 2 2 2578899999886543 333 356888899
Q ss_pred ecCCCcEEEecCC-C-CCCCeEEEEeCCCCccceeeccccccc-ccccceeEEccCCcEEEE--cCc
Q 039705 108 LSANGTIVISGGW-S-SRGRSVRYLSGCYHACYWKEHHWELSA-KRWFSTQHILPDGSFIVV--GGR 169 (539)
Q Consensus 108 ~l~dG~l~v~GG~-~-~g~~~v~~ydP~~~t~~W~~~~~~m~~-~R~y~s~~~L~dG~Vyvv--GG~ 169 (539)
+..||+++.+==. . .+.-.|-+||.. .+...+.. .+. +-.-|-...++||+-+|| ||.
T Consensus 58 fs~dG~~LytTEnd~~~g~G~IgVyd~~---~~~~ri~E-~~s~GIGPHel~l~pDG~tLvVANGGI 120 (305)
T PF07433_consen 58 FSPDGRLLYTTENDYETGRGVIGVYDAA---RGYRRIGE-FPSHGIGPHELLLMPDGETLVVANGGI 120 (305)
T ss_pred EcCCCCEEEEeccccCCCcEEEEEEECc---CCcEEEeE-ecCCCcChhhEEEcCCCCEEEEEcCCC
Confidence 9999987766422 2 234567889985 34544443 332 233344667889955555 665
No 125
>PRK04792 tolB translocation protein TolB; Provisional
Probab=50.28 E-value=3.6e+02 Score=29.09 Aligned_cols=91 Identities=10% Similarity=0.046 Sum_probs=53.8
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...++|..+++.+.+..... ........+||+.+++....++...+..+|.. +.++..+.. ...+..+.+.-+|
T Consensus 287 ~Iy~~dl~tg~~~~lt~~~~-~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~--~g~~~~Lt~---~g~~~~~~~~SpD 360 (448)
T PRK04792 287 EIYVVDIATKALTRITRHRA-IDTEPSWHPDGKSLIFTSERGGKPQIYRVNLA--SGKVSRLTF---EGEQNLGGSITPD 360 (448)
T ss_pred EEEEEECCCCCeEECccCCC-CccceEECCCCCEEEEEECCCCCceEEEEECC--CCCEEEEec---CCCCCcCeeECCC
Confidence 45667999998888764321 11223456899877665443445678888987 677765531 2233334455668
Q ss_pred CcEEEEcCccCCeEEEE
Q 039705 161 GSFIVVGGRREFSYEYI 177 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y 177 (539)
|+.++..........+|
T Consensus 361 G~~l~~~~~~~g~~~I~ 377 (448)
T PRK04792 361 GRSMIMVNRTNGKFNIA 377 (448)
T ss_pred CCEEEEEEecCCceEEE
Confidence 87776655443333343
No 126
>PRK03629 tolB translocation protein TolB; Provisional
Probab=49.92 E-value=3.5e+02 Score=28.91 Aligned_cols=141 Identities=10% Similarity=0.047 Sum_probs=73.2
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...++|..+++-+.+...... ...-...+||+.+++-...++...+.++|.. +.+...+.. -.. ......-.+|
T Consensus 224 ~i~i~dl~~G~~~~l~~~~~~-~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~--tg~~~~lt~-~~~--~~~~~~wSPD 297 (429)
T PRK03629 224 ALVIQTLANGAVRQVASFPRH-NGAPAFSPDGSKLAFALSKTGSLNLYVMDLA--SGQIRQVTD-GRS--NNTEPTWFPD 297 (429)
T ss_pred EEEEEECCCCCeEEccCCCCC-cCCeEECCCCCEEEEEEcCCCCcEEEEEECC--CCCEEEccC-CCC--CcCceEECCC
Confidence 355678888877776543322 1234567899876654333345678889987 666655432 111 1123344568
Q ss_pred CcEEEEcCccCCeEEEE--ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-----ceeEeeCCCCeEE
Q 039705 161 GSFIVVGGRREFSYEYI--LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-----RSILLNPETNEIL 233 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y--P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-----~~e~yDp~tn~W~ 233 (539)
|+.++..........+| .... .....+... .. .+ ......+||+..++... +..++|..++++.
T Consensus 298 G~~I~f~s~~~g~~~Iy~~d~~~-g~~~~lt~~-~~-----~~--~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~ 368 (429)
T PRK03629 298 SQNLAYTSDQAGRPQVYKVNING-GAPQRITWE-GS-----QN--QDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQ 368 (429)
T ss_pred CCEEEEEeCCCCCceEEEEECCC-CCeEEeecC-CC-----Cc--cCEEECCCCCEEEEEEccCCCceEEEEECCCCCeE
Confidence 87555443222223455 2221 122211110 00 01 12245688887666432 3567898888876
Q ss_pred EEcc
Q 039705 234 HVFP 237 (539)
Q Consensus 234 ~~~p 237 (539)
.+.
T Consensus 369 -~Lt 371 (429)
T PRK03629 369 -VLT 371 (429)
T ss_pred -EeC
Confidence 453
No 127
>PF15418 DUF4625: Domain of unknown function (DUF4625)
Probab=49.75 E-value=1.1e+02 Score=27.20 Aligned_cols=90 Identities=17% Similarity=0.162 Sum_probs=46.8
Q ss_pred CCCceeecC---CC---ceeecCCEEEEEEEecccccccCcEEEEEE-cCCccccccCCC----ceeEeccceeeeecCC
Q 039705 431 YRPSIVSKF---KG---KMLKYGQNFVIQFKLDELEVSLNDLKVTMY-APPFTTHGVSMG----QRLLVPATKELIDVGS 499 (539)
Q Consensus 431 ~RP~i~~~~---~p---~~~~~g~~~~v~~~~~~~~~~~~~~~v~l~-~~~~~TH~~n~~----QR~~~L~~~~~~~~g~ 499 (539)
..|+|+... .| ..+..|+.|.+++...+ ...+.++.+- -.-|-.|+-... ..-..+........|.
T Consensus 13 ~~P~I~~~~~~~~p~~~~~~~~G~~ihfe~~i~d---~~~i~si~VeIH~nfd~H~h~~~~~~~~~~~~~~~~~~~~~g~ 89 (132)
T PF15418_consen 13 EKPVITLNEIGAFPENCKVATRGDDIHFEADISD---NSAIKSIKVEIHNNFDHHTHSTEAGECEKPWVFEQDYDIYGGK 89 (132)
T ss_pred CCCEEEeeecccCCCCCeEEecCCcEEEEEEEEc---ccceeEEEEEEecCcCcccccccccccccCcEEEEEEcccCCc
Confidence 578887771 14 34788999999887654 2344444332 222333333221 1111111111001121
Q ss_pred --ceEEEEEEcCCCCCcCCCcceEEEEEc
Q 039705 500 --GIFQVSVMAPPTAKIAPPSFYLLFVVY 526 (539)
Q Consensus 500 --~~~~~~~~~P~~~~~~ppG~ymlf~~~ 526 (539)
-.....+++|++ |+||-|-+++..
T Consensus 90 ~~~~~h~~i~IPa~---a~~G~YH~~i~V 115 (132)
T PF15418_consen 90 KNYDFHEHIDIPAD---APAGDYHFMITV 115 (132)
T ss_pred ccEeEEEeeeCCCC---CCCcceEEEEEE
Confidence 124556789998 899999887753
No 128
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=49.65 E-value=91 Score=30.76 Aligned_cols=89 Identities=11% Similarity=0.048 Sum_probs=55.3
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
.+.+||.+|++....--.+...-....+--+-.|++.|++. .++++||-. ++...++.- +...+-.-+++.+ .
T Consensus 82 ~v~vwDV~TGkv~Rr~rgH~aqVNtV~fNeesSVv~SgsfD---~s~r~wDCR--S~s~ePiQi-ldea~D~V~Si~v-~ 154 (307)
T KOG0316|consen 82 AVQVWDVNTGKVDRRFRGHLAQVNTVRFNEESSVVASGSFD---SSVRLWDCR--SRSFEPIQI-LDEAKDGVSSIDV-A 154 (307)
T ss_pred eEEEEEcccCeeeeecccccceeeEEEecCcceEEEecccc---ceeEEEEcc--cCCCCccch-hhhhcCceeEEEe-c
Confidence 46789999887644322222111111222345677777764 689999988 787777654 7777776667777 3
Q ss_pred CcEEEEcCccCCeEEEE
Q 039705 161 GSFIVVGGRREFSYEYI 177 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y 177 (539)
+ --|++|+...++..|
T Consensus 155 ~-heIvaGS~DGtvRty 170 (307)
T KOG0316|consen 155 E-HEIVAGSVDGTVRTY 170 (307)
T ss_pred c-cEEEeeccCCcEEEE
Confidence 4 345566655667777
No 129
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family are often found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three-subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=48.65 E-value=2.3e+02 Score=31.47 Aligned_cols=121 Identities=14% Similarity=0.168 Sum_probs=60.1
Q ss_pred EeeCCcEEEEEc-CceeEeeCCCCe--EEEEcccCCCCCCc---cC-CCccEEecccccCCCCCCCcccEEEEecCCCCC
Q 039705 208 LSTDGNLFIFAN-DRSILLNPETNE--ILHVFPILRGGSRN---YP-ASATSALLPIKLQDPNSNAIRAEVLICGGAKPE 280 (539)
Q Consensus 208 ~~~~G~Iyv~Gg-~~~e~yDp~tn~--W~~~~p~mp~~~r~---yp-~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~ 280 (539)
++.+|+||+... ..+..+|.++++ |... +..+...+. .. .....++ .+++||+... +
T Consensus 66 vv~~g~vyv~s~~g~v~AlDa~TGk~lW~~~-~~~~~~~~~~~~~~~~~rg~av------------~~~~v~v~t~-d-- 129 (527)
T TIGR03075 66 LVVDGVMYVTTSYSRVYALDAKTGKELWKYD-PKLPDDVIPVMCCDVVNRGVAL------------YDGKVFFGTL-D-- 129 (527)
T ss_pred EEECCEEEEECCCCcEEEEECCCCceeeEec-CCCCcccccccccccccccceE------------ECCEEEEEcC-C--
Confidence 445888998654 457888988754 6532 222210000 00 0011111 3678876432 2
Q ss_pred cccccCCCcccccCCceEEEEeeCCCCceeee--ccCCCceeceeEEecCCcEEEEcCc-CCCCCCcccCCCCCCccEEE
Q 039705 281 AGVLAGKGEFMNALQDCGRIEITNKSATWQRE--MMPSPRVMGEMLLLPTGDVLIINGA-KKGTAGWNFATDPNTTPVLY 357 (539)
Q Consensus 281 ~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~--~M~~~R~~~~~vvlpdG~I~vvGG~-~~g~~g~~~~~~~~~~~e~Y 357 (539)
..+.++|..+....|+.. .+.......++-++.+|+||+.... ..+. --.+..|
T Consensus 130 --------------g~l~ALDa~TGk~~W~~~~~~~~~~~~~tssP~v~~g~Vivg~~~~~~~~---------~G~v~Al 186 (527)
T TIGR03075 130 --------------ARLVALDAKTGKVVWSKKNGDYKAGYTITAAPLVVKGKVITGISGGEFGV---------RGYVTAY 186 (527)
T ss_pred --------------CEEEEEECCCCCEEeecccccccccccccCCcEEECCEEEEeecccccCC---------CcEEEEE
Confidence 135678876555679864 3322212222334558888874321 1111 1146678
Q ss_pred cCCCCCCCceE
Q 039705 358 EPDDPINERFS 368 (539)
Q Consensus 358 dP~t~~g~~Wt 368 (539)
|.++.+ ..|+
T Consensus 187 D~~TG~-~lW~ 196 (527)
T TIGR03075 187 DAKTGK-LVWR 196 (527)
T ss_pred ECCCCc-eeEe
Confidence 887762 2576
No 130
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=48.47 E-value=4.6e+02 Score=29.85 Aligned_cols=61 Identities=13% Similarity=0.057 Sum_probs=31.2
Q ss_pred eeEEccCCcEEEEcCccCCeEEEEecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCc--eeEeeC
Q 039705 154 TQHILPDGSFIVVGGRREFSYEYILKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDR--SILLNP 227 (539)
Q Consensus 154 s~~~L~dG~VyvvGG~~~~~~E~yP~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~--~e~yDp 227 (539)
+++.|+++ .|+.|+.+ +++.+|-.. + .+..+... +.+-+..+.+++.=|+..+++ ..++|.
T Consensus 145 Av~~l~e~-~~vTgsaD-KtIklWk~~---~--~l~tf~gH------tD~VRgL~vl~~~~flScsNDg~Ir~w~~ 207 (745)
T KOG0301|consen 145 AVASLPEN-TYVTGSAD-KTIKLWKGG---T--LLKTFSGH------TDCVRGLAVLDDSHFLSCSNDGSIRLWDL 207 (745)
T ss_pred eeeecCCC-cEEeccCc-ceeeeccCC---c--hhhhhccc------hhheeeeEEecCCCeEeecCCceEEEEec
Confidence 67788777 88888765 444444110 0 11112222 223344555666556666655 344555
No 131
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=47.44 E-value=99 Score=35.11 Aligned_cols=93 Identities=13% Similarity=0.051 Sum_probs=56.1
Q ss_pred cccccceeEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccc-c
Q 039705 74 KYVDYRALAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRW-F 152 (539)
Q Consensus 74 g~~~~~~~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~-y 152 (539)
|..|+ .+.+||..++.-..+-..|...-.+-++.++|+.++.|+.. ..+.+||-. +.+- +.. |..... -
T Consensus 553 GSsD~--tVRlWDv~~G~~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed---~~I~iWDl~--~~~~--v~~-l~~Ht~ti 622 (707)
T KOG0263|consen 553 GSSDR--TVRLWDVSTGNSVRIFTGHKGPVTALAFSPCGRYLASGDED---GLIKIWDLA--NGSL--VKQ-LKGHTGTI 622 (707)
T ss_pred CCCCc--eEEEEEcCCCcEEEEecCCCCceEEEEEcCCCceEeecccC---CcEEEEEcC--CCcc--hhh-hhcccCce
Confidence 44444 47789998887776655544433344556799999999863 567888876 3221 111 222111 1
Q ss_pred ceeEEccCCcEEEEcCccCCeEEEE
Q 039705 153 STQHILPDGSFIVVGGRREFSYEYI 177 (539)
Q Consensus 153 ~s~~~L~dG~VyvvGG~~~~~~E~y 177 (539)
.+...-.||.|+|+||.++ ++.+|
T Consensus 623 ~SlsFS~dg~vLasgg~Dn-sV~lW 646 (707)
T KOG0263|consen 623 YSLSFSRDGNVLASGGADN-SVRLW 646 (707)
T ss_pred eEEEEecCCCEEEecCCCC-eEEEE
Confidence 1223334999999999874 44444
No 132
>TIGR02608 delta_60_rpt delta-60 repeat domain. This domain occurs in tandem repeats, as many as 13, in proteins from Bdellovibrio bacteriovorus, Azotobacter vinelandii, Geobacter sulfurreducens, Pirellula sp. 1, Myxococcus xanthus, and others, many of which are Deltaproteobacteria. The periodicity of the repeat ranges from about 57 to 61 amino acids, and a core region of about 54 is represented by this model and seed alignment.
Probab=46.52 E-value=19 Score=27.03 Aligned_cols=17 Identities=53% Similarity=0.640 Sum_probs=13.1
Q ss_pred EeeCCCCeEEEecCCCC
Q 039705 383 SVVLPDGKILVAGSNPH 399 (539)
Q Consensus 383 a~llpdG~V~v~GG~~~ 399 (539)
..+.|||||+++|....
T Consensus 6 ~~~q~DGkIlv~G~~~~ 22 (55)
T TIGR02608 6 VAVQSDGKILVAGYVDN 22 (55)
T ss_pred EEECCCCcEEEEEEeec
Confidence 34579999999997543
No 133
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=45.65 E-value=3.5e+02 Score=27.71 Aligned_cols=101 Identities=17% Similarity=0.249 Sum_probs=54.4
Q ss_pred CCCcEEEecCCCCCCCeEEEEeCCCCccceeeccc---ccccccccceeEEccCCcEEEEcCccCCeEEEE-ecCCCcce
Q 039705 110 ANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHW---ELSAKRWFSTQHILPDGSFIVVGGRREFSYEYI-LKEGKRII 185 (539)
Q Consensus 110 ~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~---~m~~~R~y~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w 185 (539)
-||..+.+||.+ +.+.+||.. +++=..++. ....-||-... +.-.++=|++.+++.+| ++.. ..
T Consensus 82 ddgskVf~g~~D---k~~k~wDL~--S~Q~~~v~~Hd~pvkt~~wv~~~-----~~~cl~TGSWDKTlKfWD~R~~--~p 149 (347)
T KOG0647|consen 82 DDGSKVFSGGCD---KQAKLWDLA--SGQVSQVAAHDAPVKTCHWVPGM-----NYQCLVTGSWDKTLKFWDTRSS--NP 149 (347)
T ss_pred cCCceEEeeccC---CceEEEEcc--CCCeeeeeecccceeEEEEecCC-----CcceeEecccccceeecccCCC--Ce
Confidence 478888888763 678899998 777665542 01222442221 12244556677888888 6643 33
Q ss_pred e-eccCccccCCCCCCCCcceEEEeeCCcEEEEEcCceeEeeCCCCe
Q 039705 186 Y-DLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRSILLNPETNE 231 (539)
Q Consensus 186 ~-~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~e~yDp~tn~ 231 (539)
. .+. |+++.. .....||-+++. .++++..+|+.+...
T Consensus 150 v~t~~-LPeRvY-a~Dv~~pm~vVa-------ta~r~i~vynL~n~~ 187 (347)
T KOG0647|consen 150 VATLQ-LPERVY-AADVLYPMAVVA-------TAERHIAVYNLENPP 187 (347)
T ss_pred eeeee-ccceee-ehhccCceeEEE-------ecCCcEEEEEcCCCc
Confidence 2 222 223221 112445544443 345667778876554
No 134
>PF13540 RCC1_2: Regulator of chromosome condensation (RCC1) repeat; PDB: 3QI0_D 1JTD_B 3QHY_B.
Probab=44.32 E-value=28 Score=22.29 Aligned_cols=21 Identities=24% Similarity=0.395 Sum_probs=13.9
Q ss_pred cceeeEeeCCCCeEEEecCCCC
Q 039705 378 MCHSTSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 378 ~yhs~a~llpdG~V~v~GG~~~ 399 (539)
.+|++++ +.||+||.-|.+..
T Consensus 8 ~~ht~al-~~~g~v~~wG~n~~ 28 (30)
T PF13540_consen 8 GYHTCAL-TSDGEVYCWGDNNY 28 (30)
T ss_dssp SSEEEEE-E-TTEEEEEE--TT
T ss_pred CCEEEEE-EcCCCEEEEcCCcC
Confidence 5787654 57999999998754
No 135
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-Pal system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanisms are unclear.
Probab=42.30 E-value=4.2e+02 Score=27.63 Aligned_cols=140 Identities=12% Similarity=0.056 Sum_probs=68.4
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...+||..+++.+.+...... .......+||+-+++....++...+.++|.. +.....+.. .. .........+|
T Consensus 215 ~i~v~d~~~g~~~~~~~~~~~-~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~--~~~~~~l~~-~~--~~~~~~~~s~d 288 (417)
T TIGR02800 215 EIYVQDLATGQREKVASFPGM-NGAPAFSPDGSKLAVSLSKDGNPDIYVMDLD--GKQLTRLTN-GP--GIDTEPSWSPD 288 (417)
T ss_pred EEEEEECCCCCEEEeecCCCC-ccceEECCCCCEEEEEECCCCCccEEEEECC--CCCEEECCC-CC--CCCCCEEECCC
Confidence 456789988877666543221 1224567898755443322344678888887 555554432 11 11111233457
Q ss_pred CcEEEEcC-ccC-CeEEEEecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-----ceeEeeCCCCeEE
Q 039705 161 GSFIVVGG-RRE-FSYEYILKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-----RSILLNPETNEIL 233 (539)
Q Consensus 161 G~VyvvGG-~~~-~~~E~yP~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-----~~e~yDp~tn~W~ 233 (539)
|+-+++.. +.+ ..+.++.... ..+..+..- . . .+ ......+||+.+++... ...++|..++.+.
T Consensus 289 g~~l~~~s~~~g~~~iy~~d~~~-~~~~~l~~~-~--~---~~--~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~~ 359 (417)
T TIGR02800 289 GKSIAFTSDRGGSPQIYMMDADG-GEVRRLTFR-G--G---YN--ASPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGER 359 (417)
T ss_pred CCEEEEEECCCCCceEEEEECCC-CCEEEeecC-C--C---Cc--cCeEECCCCCEEEEEEccCCceEEEEEeCCCCCeE
Confidence 86554433 222 2232332221 133222110 0 0 01 11245678887776543 4677888876654
Q ss_pred EEc
Q 039705 234 HVF 236 (539)
Q Consensus 234 ~~~ 236 (539)
.+
T Consensus 360 -~l 361 (417)
T TIGR02800 360 -VL 361 (417)
T ss_pred -Ec
Confidence 44
No 136
>PRK03629 tolB translocation protein TolB; Provisional
Probab=42.27 E-value=4.6e+02 Score=28.03 Aligned_cols=84 Identities=12% Similarity=0.020 Sum_probs=48.3
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...+||.++++.+++..... ........+||+.+++.....+...+..+|.. +..-..+.. ........+..+|
T Consensus 268 ~I~~~d~~tg~~~~lt~~~~-~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~--~g~~~~lt~---~~~~~~~~~~SpD 341 (429)
T PRK03629 268 NLYVMDLASGQIRQVTDGRS-NNTEPTWFPDSQNLAYTSDQAGRPQVYKVNIN--GGAPQRITW---EGSQNQDADVSSD 341 (429)
T ss_pred EEEEEECCCCCEEEccCCCC-CcCceEECCCCCEEEEEeCCCCCceEEEEECC--CCCeEEeec---CCCCccCEEECCC
Confidence 35568999888888764422 22234567899877765543344567777876 544443321 1111223445678
Q ss_pred CcEEEEcCcc
Q 039705 161 GSFIVVGGRR 170 (539)
Q Consensus 161 G~VyvvGG~~ 170 (539)
|+.++.....
T Consensus 342 G~~Ia~~~~~ 351 (429)
T PRK03629 342 GKFMVMVSSN 351 (429)
T ss_pred CCEEEEEEcc
Confidence 8877765543
No 137
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=41.14 E-value=4e+02 Score=27.03 Aligned_cols=88 Identities=18% Similarity=0.207 Sum_probs=50.6
Q ss_pred ecC--CcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCC-ccceeeEeeCCCCeEEEecCCCCCCC
Q 039705 326 LPT--GDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKP-RMCHSTSVVLPDGKILVAGSNPHSRY 402 (539)
Q Consensus 326 lpd--G~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~-R~yhs~a~llpdG~V~v~GG~~~~~~ 402 (539)
.|+ .-++|-+|.++ ++-+||-++- +- ..+-+. -.|-.+..+-|||.+.+.||.++..+
T Consensus 157 sP~~~~p~Ivs~s~Dk-------------tvKvWnl~~~---~l---~~~~~gh~~~v~t~~vSpDGslcasGgkdg~~~ 217 (315)
T KOG0279|consen 157 SPNESNPIIVSASWDK-------------TVKVWNLRNC---QL---RTTFIGHSGYVNTVTVSPDGSLCASGGKDGEAM 217 (315)
T ss_pred cCCCCCcEEEEccCCc-------------eEEEEccCCc---ch---hhccccccccEEEEEECCCCCEEecCCCCceEE
Confidence 455 45666666652 6789998776 33 233322 34556677889999999999876532
Q ss_pred --ccCCCCCCCcceeeEEecCCC-C-CCCcCCCCCceeec
Q 039705 403 --NLTSGSKYPTELRIEKFYPPY-F-DESFASYRPSIVSK 438 (539)
Q Consensus 403 --~~~~~~~~p~~~~vE~y~Ppy-l-~~~~~~~RP~i~~~ 438 (539)
+..-+ +. .+..|.+++-- | |.+ .|+.+..+
T Consensus 218 LwdL~~~-k~--lysl~a~~~v~sl~fsp---nrywL~~a 251 (315)
T KOG0279|consen 218 LWDLNEG-KN--LYSLEAFDIVNSLCFSP---NRYWLCAA 251 (315)
T ss_pred EEEccCC-ce--eEeccCCCeEeeEEecC---CceeEeec
Confidence 11100 11 34445554443 1 333 57777776
No 138
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=39.31 E-value=5.7e+02 Score=28.25 Aligned_cols=144 Identities=15% Similarity=0.127 Sum_probs=0.0
Q ss_pred ceeEEccCCcEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc--CceeEeeCCC
Q 039705 153 STQHILPDGSFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN--DRSILLNPET 229 (539)
Q Consensus 153 ~s~~~L~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--~~~e~yDp~t 229 (539)
.++++-+|+.-.++||.+++ +.+| -..+ .-.....+...+.+.. ...-.+||+.++.|. +.+.+||-.+
T Consensus 447 s~vAv~~~~~~vaVGG~Dgk-vhvysl~g~--~l~ee~~~~~h~a~iT-----~vaySpd~~yla~~Da~rkvv~yd~~s 518 (603)
T KOG0318|consen 447 SAVAVSPDGSEVAVGGQDGK-VHVYSLSGD--ELKEEAKLLEHRAAIT-----DVAYSPDGAYLAAGDASRKVVLYDVAS 518 (603)
T ss_pred ceEEEcCCCCEEEEecccce-EEEEEecCC--cccceeeeecccCCce-----EEEECCCCcEEEEeccCCcEEEEEccc
Q ss_pred CeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCce
Q 039705 230 NEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATW 309 (539)
Q Consensus 230 n~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W 309 (539)
+.... .|.--+++-...+.=.| +.+.++.|..+ .++..|+...|...
T Consensus 519 ~~~~~--------~~w~FHtakI~~~aWsP--------~n~~vATGSlD----------------t~Viiysv~kP~~~- 565 (603)
T KOG0318|consen 519 REVKT--------NRWAFHTAKINCVAWSP--------NNKLVATGSLD----------------TNVIIYSVKKPAKH- 565 (603)
T ss_pred Cceec--------ceeeeeeeeEEEEEeCC--------CceEEEecccc----------------ceEEEEEccChhhh-
Q ss_pred eeeccCCCceeceeEEecCCcEEEEcCcC
Q 039705 310 QREMMPSPRVMGEMLLLPTGDVLIINGAK 338 (539)
Q Consensus 310 ~~~~M~~~R~~~~~vvlpdG~I~vvGG~~ 338 (539)
..-...-+...+.+.-.|..-+|.-|.+
T Consensus 566 -i~iknAH~~gVn~v~wlde~tvvSsG~D 593 (603)
T KOG0318|consen 566 -IIIKNAHLGGVNSVAWLDESTVVSSGQD 593 (603)
T ss_pred -eEeccccccCceeEEEecCceEEeccCc
No 139
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=39.22 E-value=4.8e+02 Score=29.88 Aligned_cols=85 Identities=12% Similarity=0.134 Sum_probs=46.3
Q ss_pred ceEEEEeeCCCCceeeeccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCC
Q 039705 296 DCGRIEITNKSATWQREMMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSK 375 (539)
Q Consensus 296 s~~~~d~~~~~~~W~~~~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~ 375 (539)
+|-..|... ..+-..-.-+..+. .+...-|+|+-++.|+.+ + .+-+||-.+. +- +..+..
T Consensus 558 tVRlWDv~~-G~~VRiF~GH~~~V-~al~~Sp~Gr~LaSg~ed-~------------~I~iWDl~~~---~~--v~~l~~ 617 (707)
T KOG0263|consen 558 TVRLWDVST-GNSVRIFTGHKGPV-TALAFSPCGRYLASGDED-G------------LIKIWDLANG---SL--VKQLKG 617 (707)
T ss_pred eEEEEEcCC-CcEEEEecCCCCce-EEEEEcCCCceEeecccC-C------------cEEEEEcCCC---cc--hhhhhc
Confidence 455566542 22211112233333 344567899999988865 2 5789998775 21 111110
Q ss_pred CccceeeEeeCCCCeEEEecCCCCC
Q 039705 376 PRMCHSTSVVLPDGKILVAGSNPHS 400 (539)
Q Consensus 376 ~R~yhs~a~llpdG~V~v~GG~~~~ 400 (539)
-..--.+...-.||.|||+||.+|.
T Consensus 618 Ht~ti~SlsFS~dg~vLasgg~Dns 642 (707)
T KOG0263|consen 618 HTGTIYSLSFSRDGNVLASGGADNS 642 (707)
T ss_pred ccCceeEEEEecCCCEEEecCCCCe
Confidence 0111112234569999999998665
No 140
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.14 E-value=1.8e+02 Score=29.74 Aligned_cols=88 Identities=23% Similarity=0.423 Sum_probs=50.8
Q ss_pred CCc-eeceeEEecCCcEEEEcCcCCCCCCcccCCCCCC-ccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEE
Q 039705 316 SPR-VMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNT-TPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILV 393 (539)
Q Consensus 316 ~~R-~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~-~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v 393 (539)
..| .++..|--+||+++-.-=.+. ++.. -+-+||-. . .++.++.-+.--+...-.+|++|||.+|
T Consensus 111 ~~RHfyGHGvfs~dG~~LYATEndf---------d~~rGViGvYd~r-~---~fqrvgE~~t~GiGpHev~lm~DGrtlv 177 (366)
T COG3490 111 EGRHFYGHGVFSPDGRLLYATENDF---------DPNRGVIGVYDAR-E---GFQRVGEFSTHGIGPHEVTLMADGRTLV 177 (366)
T ss_pred cCceeecccccCCCCcEEEeecCCC---------CCCCceEEEEecc-c---ccceecccccCCcCcceeEEecCCcEEE
Confidence 344 334447789999875321111 1111 46789977 4 6887776554333333467789999877
Q ss_pred ecCC---CCCCCccCCCCCCCcceeeEEecCCC
Q 039705 394 AGSN---PHSRYNLTSGSKYPTELRIEKFYPPY 423 (539)
Q Consensus 394 ~GG~---~~~~~~~~~~~~~p~~~~vE~y~Ppy 423 (539)
+-++ .+.++ . + +++++|...|.+
T Consensus 178 vanGGIethpdf--g---R--~~lNldsMePSl 203 (366)
T COG3490 178 VANGGIETHPDF--G---R--TELNLDSMEPSL 203 (366)
T ss_pred EeCCceeccccc--C---c--cccchhhcCccE
Confidence 6443 23221 1 1 367788888877
No 141
>PRK04792 tolB translocation protein TolB; Provisional
Probab=37.82 E-value=5.5e+02 Score=27.65 Aligned_cols=137 Identities=13% Similarity=0.063 Sum_probs=69.7
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...++|+.+++-+.+...... -......+||+-+++-...++...+.++|.. +.+.+.+.. -. -.....+.-+|
T Consensus 243 ~L~~~dl~tg~~~~lt~~~g~-~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~--tg~~~~lt~-~~--~~~~~p~wSpD 316 (448)
T PRK04792 243 EIFVQDIYTQVREKVTSFPGI-NGAPRFSPDGKKLALVLSKDGQPEIYVVDIA--TKALTRITR-HR--AIDTEPSWHPD 316 (448)
T ss_pred EEEEEECCCCCeEEecCCCCC-cCCeeECCCCCEEEEEEeCCCCeEEEEEECC--CCCeEECcc-CC--CCccceEECCC
Confidence 355678888877666543211 1133567899866553333445678888987 676665543 11 01112233458
Q ss_pred CcEEEEcCccCCeEEEE---ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-----ceeEeeCCCCeE
Q 039705 161 GSFIVVGGRREFSYEYI---LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-----RSILLNPETNEI 232 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y---P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-----~~e~yDp~tn~W 232 (539)
|+-+++........++| ..++ ++..+.. ... .+. .....+||+.+++.+. +..++|..+++.
T Consensus 317 G~~I~f~s~~~g~~~Iy~~dl~~g--~~~~Lt~-~g~-----~~~--~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~ 386 (448)
T PRK04792 317 GKSLIFTSERGGKPQIYRVNLASG--KVSRLTF-EGE-----QNL--GGSITPDGRSMIMVNRTNGKFNIARQDLETGAM 386 (448)
T ss_pred CCEEEEEECCCCCceEEEEECCCC--CEEEEec-CCC-----CCc--CeeECCCCCEEEEEEecCCceEEEEEECCCCCe
Confidence 86554433222223444 2333 4432211 010 111 1245688876666432 356688888776
Q ss_pred E
Q 039705 233 L 233 (539)
Q Consensus 233 ~ 233 (539)
.
T Consensus 387 ~ 387 (448)
T PRK04792 387 Q 387 (448)
T ss_pred E
Confidence 5
No 142
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=37.54 E-value=3.4e+02 Score=30.64 Aligned_cols=142 Identities=19% Similarity=0.272 Sum_probs=72.1
Q ss_pred EEeeCCcEEEEEcC--ceeEeeCCCCe-EEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCccc
Q 039705 207 FLSTDGNLFIFAND--RSILLNPETNE-ILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGV 283 (539)
Q Consensus 207 ~~~~~G~Iyv~Gg~--~~e~yDp~tn~-W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~ 283 (539)
+++-+|+.++.... ++-++++..+. |- ..-+.. .+.|- .+ +.++. -+..+++.||.+.
T Consensus 80 iL~~~~~tlIS~SsDtTVK~W~~~~~~~~c--~stir~-H~DYV---kc-la~~a--------k~~~lvaSgGLD~---- 140 (735)
T KOG0308|consen 80 ILCGNGKTLISASSDTTVKVWNAHKDNTFC--MSTIRT-HKDYV---KC-LAYIA--------KNNELVASGGLDR---- 140 (735)
T ss_pred HhhcCCCceEEecCCceEEEeecccCcchh--Hhhhhc-ccchh---ee-eeecc--------cCceeEEecCCCc----
Confidence 34457888887764 46778887654 43 223333 45562 11 21111 2678899999873
Q ss_pred ccCCCcccccCCceEEEEeeCC-------CCceeeeccC-CCceece-eEEecCCcEEEEcCcCCCCCCcccCCCCCCcc
Q 039705 284 LAGKGEFMNALQDCGRIEITNK-------SATWQREMMP-SPRVMGE-MLLLPTGDVLIINGAKKGTAGWNFATDPNTTP 354 (539)
Q Consensus 284 ~~~~~~~~~a~~s~~~~d~~~~-------~~~W~~~~M~-~~R~~~~-~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~ 354 (539)
.+..+|+... .+.=+..++. .++..-= .+.-++|.|+|-||..+ ..
T Consensus 141 ------------~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ivsGgtek-------------~l 195 (735)
T KOG0308|consen 141 ------------KIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTIIVSGGTEK-------------DL 195 (735)
T ss_pred ------------cEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceEEEecCccc-------------ce
Confidence 1233443311 1111112333 2332111 12235677888888653 46
Q ss_pred EEEcCCCCCCCceEec-CCCCCCccceeeEeeCCCCeEEEecCCC
Q 039705 355 VLYEPDDPINERFSEL-TPTSKPRMCHSTSVVLPDGKILVAGSNP 398 (539)
Q Consensus 355 e~YdP~t~~g~~Wt~~-a~~~~~R~yhs~a~llpdG~V~v~GG~~ 398 (539)
.+|||.+.. +-..+ +....- -+.++..||+=++.|+.+
T Consensus 196 r~wDprt~~--kimkLrGHTdNV----r~ll~~dDGt~~ls~sSD 234 (735)
T KOG0308|consen 196 RLWDPRTCK--KIMKLRGHTDNV----RVLLVNDDGTRLLSASSD 234 (735)
T ss_pred EEecccccc--ceeeeeccccce----EEEEEcCCCCeEeecCCC
Confidence 799999982 22222 111111 245567899666666643
No 143
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=36.21 E-value=2e+02 Score=29.42 Aligned_cols=79 Identities=13% Similarity=0.170 Sum_probs=41.3
Q ss_pred ceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEec-CCC
Q 039705 296 DCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSEL-TPT 373 (539)
Q Consensus 296 s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~-a~~ 373 (539)
++..+|+.. +-... .+...+.+.- -+ .|.++|+|+.++ .+.+||-.+..- -++.- .++
T Consensus 117 ~ik~wD~R~---~~~~~~~d~~kkVy~~--~v-~g~~LvVg~~~r-------------~v~iyDLRn~~~-~~q~reS~l 176 (323)
T KOG1036|consen 117 TIKFWDPRN---KVVVGTFDQGKKVYCM--DV-SGNRLVVGTSDR-------------KVLIYDLRNLDE-PFQRRESSL 176 (323)
T ss_pred cEEEEeccc---cccccccccCceEEEE--ec-cCCEEEEeecCc-------------eEEEEEcccccc-hhhhccccc
Confidence 467788762 22222 3444455532 23 678888888763 678999876511 12111 111
Q ss_pred CCCccceeeEeeCCCCeEEEecCC
Q 039705 374 SKPRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 374 ~~~R~yhs~a~llpdG~V~v~GG~ 397 (539)
... --+ ..++|++.=|++|+-
T Consensus 177 kyq--tR~-v~~~pn~eGy~~sSi 197 (323)
T KOG1036|consen 177 KYQ--TRC-VALVPNGEGYVVSSI 197 (323)
T ss_pred eeE--EEE-EEEecCCCceEEEee
Confidence 111 012 344577777777764
No 144
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=36.07 E-value=78 Score=29.94 Aligned_cols=56 Identities=14% Similarity=0.281 Sum_probs=37.6
Q ss_pred EecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEeeCCCCeEEEecCC
Q 039705 325 LLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVVLPDGKILVAGSN 397 (539)
Q Consensus 325 vlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~ 397 (539)
--|+|+.++++|.. +.. -.+++||.++ ++.+....... .+....-||||-+++...
T Consensus 108 wsP~G~~l~~~g~~-n~~---------G~l~~wd~~~-----~~~i~~~~~~~--~t~~~WsPdGr~~~ta~t 163 (194)
T PF08662_consen 108 WSPDGRFLVLAGFG-NLN---------GDLEFWDVRK-----KKKISTFEHSD--ATDVEWSPDGRYLATATT 163 (194)
T ss_pred ECCCCCEEEEEEcc-CCC---------cEEEEEECCC-----CEEeeccccCc--EEEEEEcCCCCEEEEEEe
Confidence 45999999999975 221 1689999874 34444443332 344556899999988764
No 145
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=35.95 E-value=2.2e+02 Score=26.82 Aligned_cols=80 Identities=13% Similarity=0.176 Sum_probs=45.1
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
.+.+||.+......+... ........++|+.+++||..+..-.+++||.. + +..+.. ...... ..++=-+|
T Consensus 84 ~v~lyd~~~~~i~~~~~~---~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~--~--~~~i~~-~~~~~~-t~~~WsPd 154 (194)
T PF08662_consen 84 KVTLYDVKGKKIFSFGTQ---PRNTISWSPDGRFLVLAGFGNLNGDLEFWDVR--K--KKKIST-FEHSDA-TDVEWSPD 154 (194)
T ss_pred ccEEEcCcccEeEeecCC---CceEEEECCCCCEEEEEEccCCCcEEEEEECC--C--CEEeec-cccCcE-EEEEEcCC
Confidence 467888875444444322 12223456899999999975433568999976 3 333332 222211 12223457
Q ss_pred CcEEEEcCc
Q 039705 161 GSFIVVGGR 169 (539)
Q Consensus 161 G~VyvvGG~ 169 (539)
|+.++....
T Consensus 155 Gr~~~ta~t 163 (194)
T PF08662_consen 155 GRYLATATT 163 (194)
T ss_pred CCEEEEEEe
Confidence 888886643
No 146
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of the coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family are often found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three-subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=34.41 E-value=1.7e+02 Score=32.52 Aligned_cols=95 Identities=9% Similarity=0.126 Sum_probs=49.5
Q ss_pred ccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCce---e----ceeEEecCCcEEEEcCcCC
Q 039705 268 RAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRV---M----GEMLLLPTGDVLIINGAKK 339 (539)
Q Consensus 268 ~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~---~----~~~vvlpdG~I~vvGG~~~ 339 (539)
+++||++.... .+.++|.......|+.. ..+.... . ...+++-+++||+... +
T Consensus 69 ~g~vyv~s~~g-----------------~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t~-d- 129 (527)
T TIGR03075 69 DGVMYVTTSYS-----------------RVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGTL-D- 129 (527)
T ss_pred CCEEEEECCCC-----------------cEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEcC-C-
Confidence 78999865421 35678876545678864 2221100 0 1123455888887432 2
Q ss_pred CCCCcccCCCCCCccEEEcCCCCCCCceEecC-CCCCCccceeeEeeCCCCeEEEec
Q 039705 340 GTAGWNFATDPNTTPVLYEPDDPINERFSELT-PTSKPRMCHSTSVVLPDGKILVAG 395 (539)
Q Consensus 340 g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a-~~~~~R~yhs~a~llpdG~V~v~G 395 (539)
+ .+.++|.+|.+- .|+.-. ....... ...+-++.+|+||+..
T Consensus 130 g------------~l~ALDa~TGk~-~W~~~~~~~~~~~~-~tssP~v~~g~Vivg~ 172 (527)
T TIGR03075 130 A------------RLVALDAKTGKV-VWSKKNGDYKAGYT-ITAAPLVVKGKVITGI 172 (527)
T ss_pred C------------EEEEEECCCCCE-Eeeccccccccccc-ccCCcEEECCEEEEee
Confidence 1 467899887622 586532 1111111 1122233488988863
No 147
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=34.35 E-value=3e+02 Score=27.86 Aligned_cols=59 Identities=10% Similarity=-0.013 Sum_probs=36.6
Q ss_pred EEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCC--CC--CCeEEEEeCCCCccceeeccc
Q 039705 83 VEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWS--SR--GRSVRYLSGCYHACYWKEHHW 144 (539)
Q Consensus 83 ~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~--~g--~~~v~~ydP~~~t~~W~~~~~ 144 (539)
-.||+.+.+|..+......- -......++.-+++||.. .+ ...+-.||.. +.+|+.+..
T Consensus 19 C~yd~~~~qW~~~g~~i~G~-V~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~--~~~w~~~~~ 81 (281)
T PF12768_consen 19 CLYDTDNSQWSSPGNGISGT-VTDLQWASNNQLLVGGNFTLNGTNSSNLATYDFK--NQTWSSLGG 81 (281)
T ss_pred EEEECCCCEeecCCCCceEE-EEEEEEecCCEEEEEEeeEECCCCceeEEEEecC--CCeeeecCC
Confidence 46999999999876542111 012223355555555543 22 4567789999 899987764
No 148
>KOG1427 consensus Uncharacterized conserved protein, contains RCC1 domain [Function unknown]
Probab=34.21 E-value=89 Score=31.82 Aligned_cols=91 Identities=18% Similarity=0.159 Sum_probs=61.3
Q ss_pred eeeEeeCCCCeEEEecCCCCCCCccCCCCCCCcceeeEEecCCCCCCCcCCCCCceeecCCCceeecCCEEEEEEEeccc
Q 039705 380 HSTSVVLPDGKILVAGSNPHSRYNLTSGSKYPTELRIEKFYPPYFDESFASYRPSIVSKFKGKMLKYGQNFVIQFKLDEL 459 (539)
Q Consensus 380 hs~a~llpdG~V~v~GG~~~~~~~~~~~~~~p~~~~vE~y~Ppyl~~~~~~~RP~i~~~~~p~~~~~g~~~~v~~~~~~~ 459 (539)
|.+.+|.-+|.||..|=|-.+.+... ....|+|+||-..-- -|.|+ .+..|..|+|-++..
T Consensus 120 nHTl~ltdtG~v~afGeNK~GQlGlg-------n~~~~v~s~~~~~~~----~~~v~------~v~cga~ftv~l~~~-- 180 (443)
T KOG1427|consen 120 NHTLVLTDTGQVLAFGENKYGQLGLG-------NAKNEVESTPLPCVV----SDEVT------NVACGADFTVWLSST-- 180 (443)
T ss_pred CcEEEEecCCcEEEeccccccccccc-------ccccccccCCCcccc----Cccce------eeccccceEEEeecc--
Confidence 44667788999999998755433221 234589999876532 23233 357899999998853
Q ss_pred ccccCcEEEEEEcCCcccccc----CCCceeEeccce
Q 039705 460 EVSLNDLKVTMYAPPFTTHGV----SMGQRLLVPATK 492 (539)
Q Consensus 460 ~~~~~~~~v~l~~~~~~TH~~----n~~QR~~~L~~~ 492 (539)
..+..+-|-..|---|.. ||+.--|.|.|.
T Consensus 181 ---~si~t~glp~ygqlgh~td~~~~~~~~~~~~~~e 214 (443)
T KOG1427|consen 181 ---ESILTAGLPQYGQLGHGTDNEFNMKDSSVRLAYE 214 (443)
T ss_pred ---cceeecCCccccccccCcchhhccccccceeeee
Confidence 357777787778777764 566666666554
No 149
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=33.85 E-value=4.2e+02 Score=26.45 Aligned_cols=134 Identities=11% Similarity=0.188 Sum_probs=0.0
Q ss_pred CcEEEEEcCc--eeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCc
Q 039705 212 GNLFIFANDR--SILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGE 289 (539)
Q Consensus 212 G~Iyv~Gg~~--~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~ 289 (539)
|.|.+..-++ +.++...+++|+ .. ...-.+.++.--..-+|+ --|-+|+|+-.+.
T Consensus 70 G~iLAScsYDgkVIiWke~~g~w~-k~------~e~~~h~~SVNsV~waph------eygl~LacasSDG---------- 126 (299)
T KOG1332|consen 70 GTILASCSYDGKVIIWKEENGRWT-KA------YEHAAHSASVNSVAWAPH------EYGLLLACASSDG---------- 126 (299)
T ss_pred CcEeeEeecCceEEEEecCCCchh-hh------hhhhhhcccceeeccccc------ccceEEEEeeCCC----------
Q ss_pred ccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCC---------------cEEEEcCcCCCCCCcccCCCCCCc
Q 039705 290 FMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTG---------------DVLIINGAKKGTAGWNFATDPNTT 353 (539)
Q Consensus 290 ~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG---------------~I~vvGG~~~g~~g~~~~~~~~~~ 353 (539)
++..++... ++.|... --.--+.+.++|.-.-. +=||.||.++ .
T Consensus 127 ------~vsvl~~~~-~g~w~t~ki~~aH~~GvnsVswapa~~~g~~~~~~~~~~~krlvSgGcDn-------------~ 186 (299)
T KOG1332|consen 127 ------KVSVLTYDS-SGGWTTSKIVFAHEIGVNSVSWAPASAPGSLVDQGPAAKVKRLVSGGCDN-------------L 186 (299)
T ss_pred ------cEEEEEEcC-CCCccchhhhhccccccceeeecCcCCCccccccCcccccceeeccCCcc-------------c
Q ss_pred cEEEcCCCCCCCceEe----------------cCCCCCCccceeeEeeCCCCeEEE
Q 039705 354 PVLYEPDDPINERFSE----------------LTPTSKPRMCHSTSVVLPDGKILV 393 (539)
Q Consensus 354 ~e~YdP~t~~g~~Wt~----------------~a~~~~~R~yhs~a~llpdG~V~v 393 (539)
+-+|+-+.+ +|.. .+....+|.+-.++-- ||+|++
T Consensus 187 VkiW~~~~~---~w~~e~~l~~H~dwVRDVAwaP~~gl~~s~iAS~Sq--Dg~viI 237 (299)
T KOG1332|consen 187 VKIWKFDSD---SWKLERTLEGHKDWVRDVAWAPSVGLPKSTIASCSQ--DGTVII 237 (299)
T ss_pred eeeeecCCc---chhhhhhhhhcchhhhhhhhccccCCCceeeEEecC--CCcEEE
No 150
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=33.45 E-value=5.6e+02 Score=26.48 Aligned_cols=175 Identities=17% Similarity=0.122 Sum_probs=85.4
Q ss_pred eecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccCCcE---EEEcCccCCeEEEEecCCCc
Q 039705 107 GLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPDGSF---IVVGGRREFSYEYILKEGKR 183 (539)
Q Consensus 107 ~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~dG~V---yvvGG~~~~~~E~yP~~~~~ 183 (539)
++..+|+.++.||.+ ..|.+||-. ++. ++.. +-..-...++... +... ..+.|.+...+-+|-..
T Consensus 48 avAVs~~~~aSGssD---etI~IYDm~--k~~--qlg~-ll~HagsitaL~F-~~~~S~shLlS~sdDG~i~iw~~~--- 115 (362)
T KOG0294|consen 48 ALAVSGPYVASGSSD---ETIHIYDMR--KRK--QLGI-LLSHAGSITALKF-YPPLSKSHLLSGSDDGHIIIWRVG--- 115 (362)
T ss_pred EEEecceeEeccCCC---CcEEEEecc--chh--hhcc-eeccccceEEEEe-cCCcchhheeeecCCCcEEEEEcC---
Confidence 445689999998874 579999976 322 2222 2221111111111 1111 34455544444455322
Q ss_pred ceeeccCccccCCCCCCCCcceEEEeeCCcEEEE-EcCc-eeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCC
Q 039705 184 IIYDLPILNETTNPSENNLYPFVFLSTDGNLFIF-ANDR-SILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQD 261 (539)
Q Consensus 184 ~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~-Gg~~-~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~ 261 (539)
.|.....+...... --+..+-|.|||-+. ||.. ...||..+++-- .+.++. ++ +..|..-
T Consensus 116 ~W~~~~slK~H~~~-----Vt~lsiHPS~KLALsVg~D~~lr~WNLV~Gr~a-~v~~L~----~~---at~v~w~----- 177 (362)
T KOG0294|consen 116 SWELLKSLKAHKGQ-----VTDLSIHPSGKLALSVGGDQVLRTWNLVRGRVA-FVLNLK----NK---ATLVSWS----- 177 (362)
T ss_pred CeEEeeeecccccc-----cceeEecCCCceEEEEcCCceeeeehhhcCccc-eeeccC----Cc---ceeeEEc-----
Confidence 57544333322111 113455678998765 4433 445666666543 333332 23 1222211
Q ss_pred CCCCCcccEEEEecCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcC
Q 039705 262 PNSNAIRAEVLICGGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAK 338 (539)
Q Consensus 262 ~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~ 338 (539)
..|.-|++++.+ .++.|.+.. .+-..+ .|+ .|. +++..+ ++..+++||.+
T Consensus 178 -----~~Gd~F~v~~~~-----------------~i~i~q~d~--A~v~~~i~~~-~r~-l~~~~l-~~~~L~vG~d~ 228 (362)
T KOG0294|consen 178 -----PQGDHFVVSGRN-----------------KIDIYQLDN--ASVFREIENP-KRI-LCATFL-DGSELLVGGDN 228 (362)
T ss_pred -----CCCCEEEEEecc-----------------EEEEEeccc--HhHhhhhhcc-ccc-eeeeec-CCceEEEecCC
Confidence 156667776654 244555432 221122 455 444 454555 89999999976
No 151
>PRK02889 tolB translocation protein TolB; Provisional
Probab=32.57 E-value=6.4e+02 Score=26.85 Aligned_cols=137 Identities=12% Similarity=0.100 Sum_probs=67.6
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccc-cceeEEcc
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRW-FSTQHILP 159 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~-y~s~~~L~ 159 (539)
...+||..+++-+.+...... -......+||+.+++....++...+..+|.. +.....+.. ... .....-.+
T Consensus 221 ~I~~~dl~~g~~~~l~~~~g~-~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~--~~~~~~lt~----~~~~~~~~~wSp 293 (427)
T PRK02889 221 VVYVHDLATGRRRVVANFKGS-NSAPAWSPDGRTLAVALSRDGNSQIYTVNAD--GSGLRRLTQ----SSGIDTEPFFSP 293 (427)
T ss_pred EEEEEECCCCCEEEeecCCCC-ccceEECCCCCEEEEEEccCCCceEEEEECC--CCCcEECCC----CCCCCcCeEEcC
Confidence 356788888876666433211 1234567899766654333455667777776 444444322 111 11233456
Q ss_pred CCcEEEEcCccCCeEEEE--ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-----ceeEeeCCCCeE
Q 039705 160 DGSFIVVGGRREFSYEYI--LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-----RSILLNPETNEI 232 (539)
Q Consensus 160 dG~VyvvGG~~~~~~E~y--P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-----~~e~yDp~tn~W 232 (539)
||+-+++........++| +... ........ ... .+. .....+||+..++... ...++|..+++.
T Consensus 294 DG~~l~f~s~~~g~~~Iy~~~~~~-g~~~~lt~-~g~-----~~~--~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~ 364 (427)
T PRK02889 294 DGRSIYFTSDRGGAPQIYRMPASG-GAAQRVTF-TGS-----YNT--SPRISPDGKLLAYISRVGGAFKLYVQDLATGQV 364 (427)
T ss_pred CCCEEEEEecCCCCcEEEEEECCC-CceEEEec-CCC-----CcC--ceEECCCCCEEEEEEccCCcEEEEEEECCCCCe
Confidence 887555432222233455 3221 12222211 110 011 1245678876555432 356788887776
Q ss_pred E
Q 039705 233 L 233 (539)
Q Consensus 233 ~ 233 (539)
.
T Consensus 365 ~ 365 (427)
T PRK02889 365 T 365 (427)
T ss_pred E
Confidence 5
No 152
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=31.61 E-value=5.3e+02 Score=27.89 Aligned_cols=57 Identities=14% Similarity=0.084 Sum_probs=33.7
Q ss_pred EEccCCcEEEEcCccCCeEEEE--ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCceeEe
Q 039705 156 HILPDGSFIVVGGRREFSYEYI--LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDRSILL 225 (539)
Q Consensus 156 ~~L~dG~VyvvGG~~~~~~E~y--P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~~e~y 225 (539)
-+|.-+++..+||++. ++.+| |... +-...+. .-.+-++++.|..=|+.|..+..++
T Consensus 293 daL~reR~vtVGgrDr-T~rlwKi~ees--qlifrg~----------~~sidcv~~In~~HfvsGSdnG~Ia 351 (479)
T KOG0299|consen 293 DALSRERCVTVGGRDR-TVRLWKIPEES--QLIFRGG----------EGSIDCVAFINDEHFVSGSDNGSIA 351 (479)
T ss_pred chhcccceEEeccccc-eeEEEeccccc--eeeeeCC----------CCCeeeEEEecccceeeccCCceEE
Confidence 3566689999999984 45555 3321 1111110 0123457778888899998765443
No 153
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=29.49 E-value=3.5e+02 Score=29.67 Aligned_cols=140 Identities=14% Similarity=0.161 Sum_probs=73.2
Q ss_pred CcEEEEEcCceeEeeCCCCeEEEEcccCCCC-CCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcc
Q 039705 212 GNLFIFANDRSILLNPETNEILHVFPILRGG-SRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEF 290 (539)
Q Consensus 212 G~Iyv~Gg~~~e~yDp~tn~W~~~~p~mp~~-~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~ 290 (539)
-++|-.|-..+-+||...-.=...+..|+-. +-+|- -++-++| +|+-+++||..
T Consensus 432 rhVyTgGkgcVKVWdis~pg~k~PvsqLdcl~rdnyi--RSckL~p-----------dgrtLivGGea------------ 486 (705)
T KOG0639|consen 432 RHVYTGGKGCVKVWDISQPGNKSPVSQLDCLNRDNYI--RSCKLLP-----------DGRTLIVGGEA------------ 486 (705)
T ss_pred ceeEecCCCeEEEeeccCCCCCCccccccccCcccce--eeeEecC-----------CCceEEecccc------------
Confidence 3455433345677776542110122222221 33453 3455665 89999999952
Q ss_pred cccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEe
Q 039705 291 MNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSE 369 (539)
Q Consensus 291 ~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~ 369 (539)
.++..+|+..++.+-..+ +-..+-+++- .+-||-||...-=.+ | .+.+||-... +-
T Consensus 487 ----stlsiWDLAapTprikaeltssapaCyAL-a~spDakvcFsccsd-G------------nI~vwDLhnq---~~-- 543 (705)
T KOG0639|consen 487 ----STLSIWDLAAPTPRIKAELTSSAPACYAL-AISPDAKVCFSCCSD-G------------NIAVWDLHNQ---TL-- 543 (705)
T ss_pred ----ceeeeeeccCCCcchhhhcCCcchhhhhh-hcCCccceeeeeccC-C------------cEEEEEcccc---ee--
Confidence 246678887655444433 3222334433 446788887643222 1 4678887665 22
Q ss_pred cCCCCCCccceeeEeeCCCCeEEEecCCCC
Q 039705 370 LTPTSKPRMCHSTSVVLPDGKILVAGSNPH 399 (539)
Q Consensus 370 ~a~~~~~R~yhs~a~llpdG~V~v~GG~~~ 399 (539)
+....----..++-.+-.||.=+=.||-++
T Consensus 544 VrqfqGhtDGascIdis~dGtklWTGGlDn 573 (705)
T KOG0639|consen 544 VRQFQGHTDGASCIDISKDGTKLWTGGLDN 573 (705)
T ss_pred eecccCCCCCceeEEecCCCceeecCCCcc
Confidence 111111112234455667888888888654
No 154
>PF00868 Transglut_N: Transglutaminase family; InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase. Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=29.42 E-value=3.8e+02 Score=23.22 Aligned_cols=22 Identities=41% Similarity=0.455 Sum_probs=13.3
Q ss_pred eEEEEEEcCCCCCcCCCcceEEEEE
Q 039705 501 IFQVSVMAPPTAKIAPPSFYLLFVV 525 (539)
Q Consensus 501 ~~~~~~~~P~~~~~~ppG~ymlf~~ 525 (539)
.-+|.|+.|+|+ |=|.|-|-|-
T Consensus 94 ~~tv~V~spa~A---~VG~y~l~v~ 115 (118)
T PF00868_consen 94 SVTVSVTSPANA---PVGRYKLSVE 115 (118)
T ss_dssp EEEEEEE--TTS-----EEEEEEEE
T ss_pred EEEEEEECCCCC---ceEEEEEEEE
Confidence 467777888865 5699988763
No 155
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=29.12 E-value=5.5e+02 Score=26.93 Aligned_cols=35 Identities=17% Similarity=0.282 Sum_probs=23.8
Q ss_pred cceEEEeeCCc-EEEEEc-----------CceeEeeCCCCeEEEEcc
Q 039705 203 YPFVFLSTDGN-LFIFAN-----------DRSILLNPETNEILHVFP 237 (539)
Q Consensus 203 yp~~~~~~~G~-Iyv~Gg-----------~~~e~yDp~tn~W~~~~p 237 (539)
.|+..+.+||+ ||+... ..+++||..+.+-...++
T Consensus 48 ~P~~~~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~ 94 (352)
T TIGR02658 48 LPNPVVASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIE 94 (352)
T ss_pred CCceeECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEc
Confidence 45556777875 555544 237899999998775553
No 156
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=29.03 E-value=8.1e+02 Score=26.95 Aligned_cols=129 Identities=13% Similarity=0.025 Sum_probs=67.1
Q ss_pred eEEEEECCCCCEEeCccCC-CcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccccccc-ccccceeEEc
Q 039705 81 LAVEYDAESAAIRPLKILT-DTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSA-KRWFSTQHIL 158 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~-~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~-~R~y~s~~~L 158 (539)
.+.+|+-.+...+.+.... +.-|+ -....+|..+++|=. ...+++||.. +++=. .. |.. ...+-++..-
T Consensus 198 ~vylW~~~s~~v~~l~~~~~~~vtS-v~ws~~G~~LavG~~---~g~v~iwD~~--~~k~~--~~-~~~~h~~rvg~laW 268 (484)
T KOG0305|consen 198 SVYLWSASSGSVTELCSFGEELVTS-VKWSPDGSHLAVGTS---DGTVQIWDVK--EQKKT--RT-LRGSHASRVGSLAW 268 (484)
T ss_pred eEEEEecCCCceEEeEecCCCceEE-EEECCCCCEEEEeec---CCeEEEEehh--hcccc--cc-ccCCcCceeEEEec
Confidence 4556777777777766552 22221 233468999999843 3578999987 43322 22 433 2222234444
Q ss_pred cCCcEEEEcCccCCeEEEE-ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCc--eeEeeCC
Q 039705 159 PDGSFIVVGGRREFSYEYI-LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDR--SILLNPE 228 (539)
Q Consensus 159 ~dG~VyvvGG~~~~~~E~y-P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~--~e~yDp~ 228 (539)
++.++..|.++..- -.+ -... +- ....+...... --..--..||+.++.||++ +.+||-.
T Consensus 269 -~~~~lssGsr~~~I-~~~dvR~~--~~-~~~~~~~H~qe-----VCgLkws~d~~~lASGgnDN~~~Iwd~~ 331 (484)
T KOG0305|consen 269 -NSSVLSSGSRDGKI-LNHDVRIS--QH-VVSTLQGHRQE-----VCGLKWSPDGNQLASGGNDNVVFIWDGL 331 (484)
T ss_pred -cCceEEEecCCCcE-EEEEEecc--hh-hhhhhhcccce-----eeeeEECCCCCeeccCCCccceEeccCC
Confidence 68888888877531 111 1110 00 01112111100 0011224699999999976 5666663
No 157
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=29.00 E-value=6.6e+02 Score=25.94 Aligned_cols=32 Identities=19% Similarity=0.306 Sum_probs=25.3
Q ss_pred CCcccccceecCCCcEEEecCCCCCCCeEEEEeCC
Q 039705 99 TDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGC 133 (539)
Q Consensus 99 ~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~ 133 (539)
+..-|..+++.+||.++.+|+-+ .++.++|-.
T Consensus 111 HK~~cR~aafs~DG~lvATGsaD---~SIKildve 142 (430)
T KOG0640|consen 111 HKSPCRAAAFSPDGSLVATGSAD---ASIKILDVE 142 (430)
T ss_pred cccceeeeeeCCCCcEEEccCCc---ceEEEeehh
Confidence 45568888999999999999863 577788753
No 158
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=28.28 E-value=3.1e+02 Score=21.91 Aligned_cols=74 Identities=18% Similarity=0.201 Sum_probs=41.0
Q ss_pred eeecCCCceeecCCEEEEEEEecccc-cccCcEEEEEEcCCccccccCCCceeE-eccceeeeecCCceEEEEEEcCCCC
Q 039705 435 IVSKFKGKMLKYGQNFVIQFKLDELE-VSLNDLKVTMYAPPFTTHGVSMGQRLL-VPATKELIDVGSGIFQVSVMAPPTA 512 (539)
Q Consensus 435 i~~~~~p~~~~~g~~~~v~~~~~~~~-~~~~~~~v~l~~~~~~TH~~n~~QR~~-~L~~~~~~~~g~~~~~~~~~~P~~~ 512 (539)
+... |..+..|+.++|++...-.. .......|.|..-+... +++.| .|. .| .+.+++++..+.
T Consensus 8 ~~~~--~~~~~~g~~~~i~~~V~N~G~~~~~~~~v~~~~~~~~~-----~~~~i~~L~------~g-~~~~v~~~~~~~- 72 (101)
T PF07705_consen 8 ITVS--PSNVVPGEPVTITVTVKNNGTADAENVTVRLYLDGNSV-----STVTIPSLA------PG-ESETVTFTWTPP- 72 (101)
T ss_dssp EEEC---SEEETTSEEEEEEEEEE-SSS-BEEEEEEEEETTEEE-----EEEEESEB-------TT-EEEEEEEEEE-S-
T ss_pred EeeC--CCcccCCCEEEEEEEEEECCCCCCCCEEEEEEECCcee-----ccEEECCcC------CC-cEEEEEEEEEeC-
Confidence 3444 88999999888887752211 12345677777766655 44444 332 23 234555554444
Q ss_pred CcCCCcceEEEEEc
Q 039705 513 KIAPPSFYLLFVVY 526 (539)
Q Consensus 513 ~~~ppG~ymlf~~~ 526 (539)
-||.|-|.++.
T Consensus 73 ---~~G~~~i~~~i 83 (101)
T PF07705_consen 73 ---SPGSYTIRVVI 83 (101)
T ss_dssp ---S-CEEEEEEEE
T ss_pred ---CCCeEEEEEEE
Confidence 67888777764
No 159
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=28.19 E-value=4.3e+02 Score=26.35 Aligned_cols=49 Identities=12% Similarity=0.134 Sum_probs=30.6
Q ss_pred eEEEEECCCCCEEeC-ccCCCccccc-------ceecCCCcEEEecCCCCCCCeEEEEeCC
Q 039705 81 LAVEYDAESAAIRPL-KILTDTWSSS-------GGLSANGTIVISGGWSSRGRSVRYLSGC 133 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l-~~~~~~~c~~-------~~~l~dG~l~v~GG~~~g~~~v~~ydP~ 133 (539)
.+.+||.+|.+-..+ .+..+.-|.- ++...|..+++.||- ....+|+..
T Consensus 179 tvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGgG----p~lslwhLr 235 (325)
T KOG0649|consen 179 TVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGGG----PKLSLWHLR 235 (325)
T ss_pred cEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEecCC----CceeEEecc
Confidence 478999999887654 3433332322 455668889999984 334455543
No 160
>PRK05137 tolB translocation protein TolB; Provisional
Probab=27.95 E-value=7.5e+02 Score=26.26 Aligned_cols=138 Identities=15% Similarity=0.069 Sum_probs=68.6
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...++|+.+++.+.+...... .......+||+.+++....++...+.++|.. +.....+.. -.. .......-+|
T Consensus 227 ~i~~~dl~~g~~~~l~~~~g~-~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~--~~~~~~Lt~-~~~--~~~~~~~spD 300 (435)
T PRK05137 227 RVYLLDLETGQRELVGNFPGM-TFAPRFSPDGRKVVMSLSQGGNTDIYTMDLR--SGTTTRLTD-SPA--IDTSPSYSPD 300 (435)
T ss_pred EEEEEECCCCcEEEeecCCCc-ccCcEECCCCCEEEEEEecCCCceEEEEECC--CCceEEccC-CCC--ccCceeEcCC
Confidence 466789998888777543321 2234567899866554333445678888887 555554432 110 1112334568
Q ss_pred CcEEEEcCccCCeEEEE--ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc-----CceeEeeCCCCeEE
Q 039705 161 GSFIVVGGRREFSYEYI--LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN-----DRSILLNPETNEIL 233 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y--P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg-----~~~e~yDp~tn~W~ 233 (539)
|+-+++........++| .... .....+... . ..+......+||+..++.. ....++|+.++...
T Consensus 301 G~~i~f~s~~~g~~~Iy~~d~~g-~~~~~lt~~--~------~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~~~~ 371 (435)
T PRK05137 301 GSQIVFESDRSGSPQLYVMNADG-SNPRRISFG--G------GRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGSGER 371 (435)
T ss_pred CCEEEEEECCCCCCeEEEEECCC-CCeEEeecC--C------CcccCeEECCCCCEEEEEEcCCCceEEEEEECCCCceE
Confidence 87555433222222344 3222 122221110 0 0111224567887665532 24567787665543
No 161
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=27.19 E-value=2.3e+02 Score=31.03 Aligned_cols=134 Identities=13% Similarity=0.104 Sum_probs=69.9
Q ss_pred EEEEECCCCC----EEeCccC-CCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeeccccccc--ccccce
Q 039705 82 AVEYDAESAA----IRPLKIL-TDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSA--KRWFST 154 (539)
Q Consensus 82 ~~~yDp~t~~----w~~l~~~-~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~--~R~y~s 154 (539)
+.+||..-.. ...|..+ .+.+-...-.++||+-+++||. ..++.++|.. +-+=+--+. |.. +-+|+-
T Consensus 442 VKVWdis~pg~k~PvsqLdcl~rdnyiRSckL~pdgrtLivGGe---astlsiWDLA--apTprikae-ltssapaCyAL 515 (705)
T KOG0639|consen 442 VKVWDISQPGNKSPVSQLDCLNRDNYIRSCKLLPDGRTLIVGGE---ASTLSIWDLA--APTPRIKAE-LTSSAPACYAL 515 (705)
T ss_pred EEEeeccCCCCCCccccccccCcccceeeeEecCCCceEEeccc---cceeeeeecc--CCCcchhhh-cCCcchhhhhh
Confidence 5678865321 2334443 3344444567899999999997 3677888876 333222222 332 344543
Q ss_pred eEEccCCcEEEEcCccCCeEEEEecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc--CceeEeeCCCCe
Q 039705 155 QHILPDGSFIVVGGRREFSYEYILKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN--DRSILLNPETNE 231 (539)
Q Consensus 155 ~~~L~dG~VyvvGG~~~~~~E~yP~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg--~~~e~yDp~tn~ 231 (539)
++ -+|-+|....=+++ .+-+|.-.+ +-.+..+ +...|... ...+..||.=.+.|| +.+.+||.++.+
T Consensus 516 a~-spDakvcFsccsdG-nI~vwDLhn--q~~Vrqf-qGhtDGas-----cIdis~dGtklWTGGlDntvRcWDlregr 584 (705)
T KOG0639|consen 516 AI-SPDAKVCFSCCSDG-NIAVWDLHN--QTLVRQF-QGHTDGAS-----CIDISKDGTKLWTGGLDNTVRCWDLREGR 584 (705)
T ss_pred hc-CCccceeeeeccCC-cEEEEEccc--ceeeecc-cCCCCCce-----eEEecCCCceeecCCCccceeehhhhhhh
Confidence 33 33666655443332 344553333 3322222 22222210 123345787777787 568889888765
No 162
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.18 E-value=3.7e+02 Score=27.52 Aligned_cols=83 Identities=14% Similarity=0.113 Sum_probs=51.5
Q ss_pred eEEEEECCCCCE-EeCccC-CCcccccceecCCCcEE-EecCCCCC-CCeEEEEeCCCCccceeecccccc-ccccccee
Q 039705 81 LAVEYDAESAAI-RPLKIL-TDTWSSSGGLSANGTIV-ISGGWSSR-GRSVRYLSGCYHACYWKEHHWELS-AKRWFSTQ 155 (539)
Q Consensus 81 ~~~~yDp~t~~w-~~l~~~-~~~~c~~~~~l~dG~l~-v~GG~~~g-~~~v~~ydP~~~t~~W~~~~~~m~-~~R~y~s~ 155 (539)
.+.+||++..+- ..+... ...|+.++++.+||+++ .+-+..+. .--+-+||-. +....+.. .+ .+-.-|-+
T Consensus 92 f~~vfD~~~~~~pv~~~s~~~RHfyGHGvfs~dG~~LYATEndfd~~rGViGvYd~r---~~fqrvgE-~~t~GiGpHev 167 (366)
T COG3490 92 FAMVFDPNGAQEPVTLVSQEGRHFYGHGVFSPDGRLLYATENDFDPNRGVIGVYDAR---EGFQRVGE-FSTHGIGPHEV 167 (366)
T ss_pred eEEEECCCCCcCcEEEecccCceeecccccCCCCcEEEeecCCCCCCCceEEEEecc---cccceecc-cccCCcCccee
Confidence 466788886543 233333 45688899999999865 44444343 3456789874 44444443 32 22334567
Q ss_pred EEccCCcEEEEc
Q 039705 156 HILPDGSFIVVG 167 (539)
Q Consensus 156 ~~L~dG~VyvvG 167 (539)
..+.||+.+|+-
T Consensus 168 ~lm~DGrtlvva 179 (366)
T COG3490 168 TLMADGRTLVVA 179 (366)
T ss_pred EEecCCcEEEEe
Confidence 788899988873
No 163
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=26.49 E-value=3.2e+02 Score=30.81 Aligned_cols=98 Identities=13% Similarity=0.227 Sum_probs=54.0
Q ss_pred ceEEEEeeCCCCceeeeccCCCceecee--EEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCC---CceEec
Q 039705 296 DCGRIEITNKSATWQREMMPSPRVMGEM--LLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPIN---ERFSEL 370 (539)
Q Consensus 296 s~~~~d~~~~~~~W~~~~M~~~R~~~~~--vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g---~~Wt~~ 370 (539)
++..+++.. .++|-+..+..-+-+-.. ....+..+++.||.++ .+-+||-++..- -++..+
T Consensus 96 TVK~W~~~~-~~~~c~stir~H~DYVkcla~~ak~~~lvaSgGLD~-------------~IflWDin~~~~~l~~s~n~~ 161 (735)
T KOG0308|consen 96 TVKVWNAHK-DNTFCMSTIRTHKDYVKCLAYIAKNNELVASGGLDR-------------KIFLWDINTGTATLVASFNNV 161 (735)
T ss_pred eEEEeeccc-CcchhHhhhhcccchheeeeecccCceeEEecCCCc-------------cEEEEEccCcchhhhhhcccc
Confidence 455676653 345655433333333332 2256889999999873 567888775410 012221
Q ss_pred C--C---CCCCccceeeEeeCCCCeEEEecCCCCCCCccCCCCCCCcceeeEEecCCCC
Q 039705 371 T--P---TSKPRMCHSTSVVLPDGKILVAGSNPHSRYNLTSGSKYPTELRIEKFYPPYF 424 (539)
Q Consensus 371 a--~---~~~~R~yhs~a~llpdG~V~v~GG~~~~~~~~~~~~~~p~~~~vE~y~Ppyl 424 (539)
. + .+.--.| |.| .-+.|.++|.||. |..+.+|+|-|=
T Consensus 162 t~~sl~sG~k~siY-SLA-~N~t~t~ivsGgt---------------ek~lr~wDprt~ 203 (735)
T KOG0308|consen 162 TVNSLGSGPKDSIY-SLA-MNQTGTIIVSGGT---------------EKDLRLWDPRTC 203 (735)
T ss_pred ccccCCCCCcccee-eee-cCCcceEEEecCc---------------ccceEEeccccc
Confidence 1 1 1222233 222 3467788888883 456788888883
No 164
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=25.73 E-value=8e+02 Score=25.83 Aligned_cols=135 Identities=15% Similarity=0.275 Sum_probs=72.8
Q ss_pred eCCcEEEEEcC--ceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCC
Q 039705 210 TDGNLFIFAND--RSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGK 287 (539)
Q Consensus 210 ~~G~Iyv~Gg~--~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~ 287 (539)
|+.++-+.||. .+.+||..++.|.-. +++ ... +=.++++ .++|.+++.|+.+.
T Consensus 74 P~~~l~aTGGgDD~AflW~~~~ge~~~e---ltg-HKD---SVt~~~F----------shdgtlLATGdmsG-------- 128 (399)
T KOG0296|consen 74 PNNNLVATGGGDDLAFLWDISTGEFAGE---LTG-HKD---SVTCCSF----------SHDGTLLATGDMSG-------- 128 (399)
T ss_pred CCCceEEecCCCceEEEEEccCCcceeE---ecC-CCC---ceEEEEE----------ccCceEEEecCCCc--------
Confidence 46667777774 378899988887532 344 111 1123332 15888999998752
Q ss_pred CcccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCc
Q 039705 288 GEFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINER 366 (539)
Q Consensus 288 ~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~ 366 (539)
.+..++......+|... ++..=-.+ ---|-+.|++.|-.+ | ++.+|...... .
T Consensus 129 --------~v~v~~~stg~~~~~~~~e~~dieWl---~WHp~a~illAG~~D-G------------svWmw~ip~~~--~ 182 (399)
T KOG0296|consen 129 --------KVLVFKVSTGGEQWKLDQEVEDIEWL---KWHPRAHILLAGSTD-G------------SVWMWQIPSQA--L 182 (399)
T ss_pred --------cEEEEEcccCceEEEeecccCceEEE---EecccccEEEeecCC-C------------cEEEEECCCcc--e
Confidence 23344444334566653 33211110 012455666666443 2 56788765530 2
Q ss_pred eEecCCCCCCccceeeEeeCCCCeEEEecCCC
Q 039705 367 FSELTPTSKPRMCHSTSVVLPDGKILVAGSNP 398 (539)
Q Consensus 367 Wt~~a~~~~~R~yhs~a~llpdG~V~v~GG~~ 398 (539)
=+.+.- +..+ ..+.-++||||-++.|-.+
T Consensus 183 ~kv~~G-h~~~--ct~G~f~pdGKr~~tgy~d 211 (399)
T KOG0296|consen 183 CKVMSG-HNSP--CTCGEFIPDGKRILTGYDD 211 (399)
T ss_pred eeEecC-CCCC--cccccccCCCceEEEEecC
Confidence 223322 2223 3456788999998888753
No 165
>PRK01029 tolB translocation protein TolB; Provisional
Probab=25.29 E-value=4e+02 Score=28.53 Aligned_cols=59 Identities=10% Similarity=-0.018 Sum_probs=36.9
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeec
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEH 142 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~ 142 (539)
...+||+.+++.+.+.... .........+||+.+++-....+...+..+|.. +.+...+
T Consensus 352 ~I~v~dl~~g~~~~Lt~~~-~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~--~g~~~~L 410 (428)
T PRK01029 352 QICVYDLATGRDYQLTTSP-ENKESPSWAIDSLHLVYSAGNSNESELYLISLI--TKKTRKI 410 (428)
T ss_pred EEEEEECCCCCeEEccCCC-CCccceEECCCCCEEEEEECCCCCceEEEEECC--CCCEEEe
Confidence 4567899999988876432 122334556798877654433344667777876 5555544
No 166
>PRK00178 tolB translocation protein TolB; Provisional
Probab=24.94 E-value=8.3e+02 Score=25.73 Aligned_cols=139 Identities=7% Similarity=-0.003 Sum_probs=70.6
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...++|+.+++.+.+...... -......+||+.+++.-..++...+.++|.. +...+.+.. -. ........-+|
T Consensus 224 ~l~~~~l~~g~~~~l~~~~g~-~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~--~~~~~~lt~-~~--~~~~~~~~spD 297 (430)
T PRK00178 224 RIFVQNLDTGRREQITNFEGL-NGAPAWSPDGSKLAFVLSKDGNPEIYVMDLA--SRQLSRVTN-HP--AIDTEPFWGKD 297 (430)
T ss_pred EEEEEECCCCCEEEccCCCCC-cCCeEECCCCCEEEEEEccCCCceEEEEECC--CCCeEEccc-CC--CCcCCeEECCC
Confidence 456789988888777543211 1123556899866654333344678889988 666665543 11 11112233457
Q ss_pred CcEEEEcCccCCeEEEE--e-cCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEcC-----ceeEeeCCCCeE
Q 039705 161 GSFIVVGGRREFSYEYI--L-KEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAND-----RSILLNPETNEI 232 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y--P-~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~-----~~e~yDp~tn~W 232 (539)
|+-+++.........+| . .++ ++..+... .. . .......+||+..++... +..++|..++++
T Consensus 298 g~~i~f~s~~~g~~~iy~~d~~~g--~~~~lt~~-~~-----~--~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~ 367 (430)
T PRK00178 298 GRTLYFTSDRGGKPQIYKVNVNGG--RAERVTFV-GN-----Y--NARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSV 367 (430)
T ss_pred CCEEEEEECCCCCceEEEEECCCC--CEEEeecC-CC-----C--ccceEECCCCCEEEEEEccCCceEEEEEECCCCCE
Confidence 76444332222222344 2 222 33322111 10 0 111245678876655432 356788888776
Q ss_pred EEEc
Q 039705 233 LHVF 236 (539)
Q Consensus 233 ~~~~ 236 (539)
. .+
T Consensus 368 ~-~l 370 (430)
T PRK00178 368 R-IL 370 (430)
T ss_pred E-Ec
Confidence 5 44
No 167
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=24.86 E-value=2.8e+02 Score=29.16 Aligned_cols=133 Identities=13% Similarity=0.195 Sum_probs=72.1
Q ss_pred CCcEEEEEcCceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCCcc
Q 039705 211 DGNLFIFANDRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKGEF 290 (539)
Q Consensus 211 ~G~Iyv~Gg~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~~~ 290 (539)
.+.+|+.+|..+++||+..+. .+..|.-+ ++.-.+.-. + ...-.|++++|.+
T Consensus 158 ~~~~FaTcGe~i~IWD~~R~~---Pv~smswG---~Dti~svkf--------N--pvETsILas~~sD------------ 209 (433)
T KOG0268|consen 158 KNSVFATCGEQIDIWDEQRDN---PVSSMSWG---ADSISSVKF--------N--PVETSILASCASD------------ 209 (433)
T ss_pred ccccccccCceeeecccccCC---ccceeecC---CCceeEEec--------C--CCcchheeeeccC------------
Confidence 367899999999999997654 33344321 110011111 1 1256788888765
Q ss_pred cccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEe
Q 039705 291 MNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSE 369 (539)
Q Consensus 291 ~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~ 369 (539)
.+...||.....+.=... .|..--. .--|.+-+|++|-.+. ....||-..-
T Consensus 210 ----rsIvLyD~R~~~Pl~KVi~~mRTN~I----swnPeafnF~~a~ED~-------------nlY~~DmR~l------- 261 (433)
T KOG0268|consen 210 ----RSIVLYDLRQASPLKKVILTMRTNTI----CWNPEAFNFVAANEDH-------------NLYTYDMRNL------- 261 (433)
T ss_pred ----CceEEEecccCCccceeeeeccccce----ecCccccceeeccccc-------------cceehhhhhh-------
Confidence 245678876533332222 4433111 1235667777665432 3455663221
Q ss_pred cCCCCCCccceeeEee----CCCCeEEEecCCCCC
Q 039705 370 LTPTSKPRMCHSTSVV----LPDGKILVAGSNPHS 400 (539)
Q Consensus 370 ~a~~~~~R~yhs~a~l----lpdG~V~v~GG~~~~ 400 (539)
-.++. -...|..|+| -|-|+=+|+|+++-.
T Consensus 262 ~~p~~-v~~dhvsAV~dVdfsptG~EfvsgsyDks 295 (433)
T KOG0268|consen 262 SRPLN-VHKDHVSAVMDVDFSPTGQEFVSGSYDKS 295 (433)
T ss_pred cccch-hhcccceeEEEeccCCCcchhccccccce
Confidence 11222 2445777877 467999999987543
No 168
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=24.38 E-value=1.3e+02 Score=20.29 Aligned_cols=24 Identities=13% Similarity=0.142 Sum_probs=14.2
Q ss_pred EEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCC
Q 039705 324 LLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDD 361 (539)
Q Consensus 324 vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t 361 (539)
.++-||+||+.+... .+.++|++|
T Consensus 17 ~~v~~g~vyv~~~dg--------------~l~ald~~t 40 (40)
T PF13570_consen 17 PAVAGGRVYVGTGDG--------------NLYALDAAT 40 (40)
T ss_dssp -EECTSEEEEE-TTS--------------EEEEEETT-
T ss_pred CEEECCEEEEEcCCC--------------EEEEEeCCC
Confidence 345588888865521 467788764
No 169
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes (EC 2.3.2.5) catalyses the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues, respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=24.09 E-value=7.4e+02 Score=24.87 Aligned_cols=110 Identities=13% Similarity=0.088 Sum_probs=0.0
Q ss_pred CCCcceEEEe-eCCcEEEEEc----CceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEe
Q 039705 200 NNLYPFVFLS-TDGNLFIFAN----DRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLIC 274 (539)
Q Consensus 200 ~~~yp~~~~~-~~G~Iyv~Gg----~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~ 274 (539)
...|-..... .+|.+|-.-| .....||+.+++.. .--++|. .|-..|.+++ +++||..
T Consensus 43 ~~aFTQGL~~~~~g~LyESTG~yG~S~l~~~d~~tg~~~-~~~~l~~---~~FgEGit~~-------------~d~l~qL 105 (264)
T PF05096_consen 43 PTAFTQGLEFLDDGTLYESTGLYGQSSLRKVDLETGKVL-QSVPLPP---RYFGEGITIL-------------GDKLYQL 105 (264)
T ss_dssp TT-EEEEEEEEETTEEEEEECSTTEEEEEEEETTTSSEE-EEEE-TT---T--EEEEEEE-------------TTEEEEE
T ss_pred CcccCccEEecCCCEEEEeCCCCCcEEEEEEECCCCcEE-EEEECCc---cccceeEEEE-------------CCEEEEE
Q ss_pred cCCCCCcccccCCCcccccCCceEEEEeeCCCCceeee-ccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCc
Q 039705 275 GGAKPEAGVLAGKGEFMNALQDCGRIEITNKSATWQRE-MMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTT 353 (539)
Q Consensus 275 GG~~~~~~~~~~~~~~~~a~~s~~~~d~~~~~~~W~~~-~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~ 353 (539)
.+. ...+..||+ ++.+.. ..+.+..+-+.+.- +..+++..|.+ .
T Consensus 106 ------TWk----------~~~~f~yd~----~tl~~~~~~~y~~EGWGLt~d-g~~Li~SDGS~--------------~ 150 (264)
T PF05096_consen 106 ------TWK----------EGTGFVYDP----NTLKKIGTFPYPGEGWGLTSD-GKRLIMSDGSS--------------R 150 (264)
T ss_dssp ------ESS----------SSEEEEEET----TTTEEEEEEE-SSS--EEEEC-SSCEEEE-SSS--------------E
T ss_pred ------Eec----------CCeEEEEcc----ccceEEEEEecCCcceEEEcC-CCEEEEECCcc--------------c
Q ss_pred cEEEcCCC
Q 039705 354 PVLYEPDD 361 (539)
Q Consensus 354 ~e~YdP~t 361 (539)
....||++
T Consensus 151 L~~~dP~~ 158 (264)
T PF05096_consen 151 LYFLDPET 158 (264)
T ss_dssp EEEE-TTT
T ss_pred eEEECCcc
No 170
>PF13895 Ig_2: Immunoglobulin domain; PDB: 2V5R_B 2V5M_A 2V5S_B 2GI7_A 3LAF_A 4DEP_C 3O4O_B 2EC8_A 2E9W_A 1J87_A ....
Probab=23.10 E-value=3e+02 Score=20.57 Aligned_cols=37 Identities=14% Similarity=0.238 Sum_probs=27.3
Q ss_pred CCceeecCCCceeecCCEEEEEEEecccccccCcEEEEEEcCC
Q 039705 432 RPSIVSKFKGKMLKYGQNFVIQFKLDELEVSLNDLKVTMYAPP 474 (539)
Q Consensus 432 RP~i~~~~~p~~~~~g~~~~v~~~~~~~~~~~~~~~v~l~~~~ 474 (539)
.|+|+.- |..+..|+.++|+=...+ ....++.+.+.+
T Consensus 1 kP~l~~~--~~~v~~g~~~~l~C~~~~----~p~~~~~w~~~~ 37 (80)
T PF13895_consen 1 KPVLSSS--PQSVEEGDSVTLTCSVSG----NPPPQVQWYKNG 37 (80)
T ss_dssp --EEEEE--SSEEETTSEEEEEEEEES----SSSSEEEEEETT
T ss_pred CcEEEcc--ceEEeCCCcEEEEEEEEc----ccceeeeeeeee
Confidence 3888888 889999999999976543 123678898865
No 171
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family, which catalyse the C-terminal alpha-amidation of biological peptides. In many proteins it occurs in tandem arrays, for example in the RING finger B-box coiled-coil (RBCC) eukaryotic growth regulators. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a six-bladed NHL-repeat beta-propeller. NHL repeats are also found in serine/threonine protein kinases (STPKs) in a diverse range of pathogenic bacteria. These STPKs are transmembrane receptors with an intracellular N-terminal kinase domain and an extracellular C-terminal sensor domain. In the STPK PknD from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed beta-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=22.86 E-value=96 Score=19.39 Aligned_cols=15 Identities=47% Similarity=0.676 Sum_probs=12.0
Q ss_pred eeeEeeCCCCeEEEec
Q 039705 380 HSTSVVLPDGKILVAG 395 (539)
Q Consensus 380 hs~a~llpdG~V~v~G 395 (539)
|++|+- ++|.|||+=
T Consensus 5 ~gvav~-~~g~i~VaD 19 (28)
T PF01436_consen 5 HGVAVD-SDGNIYVAD 19 (28)
T ss_dssp EEEEEE-TTSEEEEEE
T ss_pred cEEEEe-CCCCEEEEE
Confidence 567766 899999974
No 172
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=22.69 E-value=8.7e+02 Score=25.13 Aligned_cols=131 Identities=17% Similarity=0.161 Sum_probs=64.2
Q ss_pred EEEEECCCCCEEeCccCCCc---ccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEc
Q 039705 82 AVEYDAESAAIRPLKILTDT---WSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHIL 158 (539)
Q Consensus 82 ~~~yDp~t~~w~~l~~~~~~---~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L 158 (539)
+-+||.++++....-..++. .|+. -+..+++++.-.+ ...+.+|-. ..-..++- .+..----+.++.
T Consensus 296 AnlwDVEtge~v~~LtGHd~ELtHcst---HptQrLVvTsSrD---tTFRLWDFR---eaI~sV~V-FQGHtdtVTS~vF 365 (481)
T KOG0300|consen 296 ANLWDVETGEVVNILTGHDSELTHCST---HPTQRLVVTSSRD---TTFRLWDFR---EAIQSVAV-FQGHTDTVTSVVF 365 (481)
T ss_pred ceeeeeccCceeccccCcchhcccccc---CCcceEEEEeccC---ceeEeccch---hhcceeee-ecccccceeEEEE
Confidence 45677777766555444432 3544 3788999987542 344555532 11111111 1111111122222
Q ss_pred -cCCcEEEEcCccCCeEEEEecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEE--cCceeEeeCCCCeE
Q 039705 159 -PDGSFIVVGGRREFSYEYILKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFA--NDRSILLNPETNEI 232 (539)
Q Consensus 159 -~dG~VyvvGG~~~~~~E~yP~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~G--g~~~e~yDp~tn~W 232 (539)
-|.+ |+.|++..++.+|...|... |+..-+.+ .+.| ...+...++|.++= ++++.+||...++-
T Consensus 366 ~~dd~--vVSgSDDrTvKvWdLrNMRs----plATIRtd-S~~N---Rvavs~g~~iIAiPhDNRqvRlfDlnG~Rl 432 (481)
T KOG0300|consen 366 NTDDR--VVSGSDDRTVKVWDLRNMRS----PLATIRTD-SPAN---RVAVSKGHPIIAIPHDNRQVRLFDLNGNRL 432 (481)
T ss_pred ecCCc--eeecCCCceEEEeeeccccC----cceeeecC-Cccc---eeEeecCCceEEeccCCceEEEEecCCCcc
Confidence 1333 56788888888873333111 11111112 1112 23455566677764 46789999988753
No 173
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=21.70 E-value=80 Score=31.64 Aligned_cols=53 Identities=26% Similarity=0.438 Sum_probs=37.3
Q ss_pred EEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEe----eCCCCeEEEecCCC
Q 039705 324 LLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSV----VLPDGKILVAGSNP 398 (539)
Q Consensus 324 vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~----llpdG~V~v~GG~~ 398 (539)
-+=||+||++.-|.+. .+.+| .|+.+.++.+- .||+.++ ..||-.|+.+++.+
T Consensus 258 rIRpD~KIlATAGWD~-------------RiRVy--------swrtl~pLAVL-kyHsagvn~vAfspd~~lmAaaskD 314 (323)
T KOG0322|consen 258 RIRPDGKILATAGWDH-------------RIRVY--------SWRTLNPLAVL-KYHSAGVNAVAFSPDCELMAAASKD 314 (323)
T ss_pred EEccCCcEEeecccCC-------------cEEEE--------EeccCCchhhh-hhhhcceeEEEeCCCCchhhhccCC
Confidence 4568999999999872 45677 58888887653 4677443 35777777777653
No 174
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=21.47 E-value=9.9e+02 Score=25.37 Aligned_cols=111 Identities=14% Similarity=0.147 Sum_probs=66.8
Q ss_pred CCcEEEecCCCCCCCeEEEEeCCCCccceeec--cc---ccccccccceeEEccC--CcEEEEcCccCCeEEEE-ecCCC
Q 039705 111 NGTIVISGGWSSRGRSVRYLSGCYHACYWKEH--HW---ELSAKRWFSTQHILPD--GSFIVVGGRREFSYEYI-LKEGK 182 (539)
Q Consensus 111 dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~--~~---~m~~~R~y~s~~~L~d--G~VyvvGG~~~~~~E~y-P~~~~ 182 (539)
+-.|+++||... .+.+++||..+....|+.- ++ .|..|=|.-....|.+ ...|+.+=.. ..+.+| |+..
T Consensus 160 ~p~Iva~GGke~-~n~lkiwdle~~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~-hqvR~YDt~~q- 236 (412)
T KOG3881|consen 160 DPYIVATGGKEN-INELKIWDLEQSKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRY-HQVRLYDTRHQ- 236 (412)
T ss_pred CCceEecCchhc-ccceeeeecccceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEecc-eeEEEecCccc-
Confidence 456888898642 4667788876324457642 22 3667889777777765 5666665432 346788 7653
Q ss_pred cce-eeccCccccCCCCCCCCcceEEEeeCCcEEEEEcCc--eeEeeCCCCeE
Q 039705 183 RII-YDLPILNETTNPSENNLYPFVFLSTDGNLFIFANDR--SILLNPETNEI 232 (539)
Q Consensus 183 ~~w-~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg~~--~e~yDp~tn~W 232 (539)
++. ...+++.. --.+..+.++|+..++|+.. ...||.++..-
T Consensus 237 RRPV~~fd~~E~--------~is~~~l~p~gn~Iy~gn~~g~l~~FD~r~~kl 281 (412)
T KOG3881|consen 237 RRPVAQFDFLEN--------PISSTGLTPSGNFIYTGNTKGQLAKFDLRGGKL 281 (412)
T ss_pred CcceeEeccccC--------cceeeeecCCCcEEEEecccchhheecccCcee
Confidence 122 12233311 11244567899988888864 45689887753
No 175
>PRK04922 tolB translocation protein TolB; Provisional
Probab=21.11 E-value=1e+03 Score=25.30 Aligned_cols=138 Identities=12% Similarity=0.022 Sum_probs=69.1
Q ss_pred eEEEEECCCCCEEeCccCCCcccccceecCCCcEEEecCCCCCCCeEEEEeCCCCccceeecccccccccccceeEEccC
Q 039705 81 LAVEYDAESAAIRPLKILTDTWSSSGGLSANGTIVISGGWSSRGRSVRYLSGCYHACYWKEHHWELSAKRWFSTQHILPD 160 (539)
Q Consensus 81 ~~~~yDp~t~~w~~l~~~~~~~c~~~~~l~dG~l~v~GG~~~g~~~v~~ydP~~~t~~W~~~~~~m~~~R~y~s~~~L~d 160 (539)
...++|..+++.+.+...... .......+||+-+++....++...+.++|.. +.....+.. -. .. .......+|
T Consensus 229 ~l~~~dl~~g~~~~l~~~~g~-~~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~--~g~~~~lt~-~~-~~-~~~~~~spD 302 (433)
T PRK04922 229 AIYVQDLATGQRELVASFRGI-NGAPSFSPDGRRLALTLSRDGNPEIYVMDLG--SRQLTRLTN-HF-GI-DTEPTWAPD 302 (433)
T ss_pred EEEEEECCCCCEEEeccCCCC-ccCceECCCCCEEEEEEeCCCCceEEEEECC--CCCeEECcc-CC-CC-ccceEECCC
Confidence 455789888887776543221 1234567899765544333345678889987 655544432 10 01 112344568
Q ss_pred CcEEEEcCccCCeEEEE--ecCCCcceeeccCccccCCCCCCCCcceEEEeeCCcEEEEEc-----CceeEeeCCCCeEE
Q 039705 161 GSFIVVGGRREFSYEYI--LKEGKRIIYDLPILNETTNPSENNLYPFVFLSTDGNLFIFAN-----DRSILLNPETNEIL 233 (539)
Q Consensus 161 G~VyvvGG~~~~~~E~y--P~~~~~~w~~~~~l~~~~~~~~~~~yp~~~~~~~G~Iyv~Gg-----~~~e~yDp~tn~W~ 233 (539)
|+-+++.........+| .... .+...+.. ... .+. .....+||+..++.. .+..++|..+++..
T Consensus 303 G~~l~f~sd~~g~~~iy~~dl~~-g~~~~lt~-~g~-----~~~--~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~ 373 (433)
T PRK04922 303 GKSIYFTSDRGGRPQIYRVAASG-GSAERLTF-QGN-----YNA--RASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVR 373 (433)
T ss_pred CCEEEEEECCCCCceEEEEECCC-CCeEEeec-CCC-----Ccc--CEEECCCCCEEEEEECCCCceeEEEEECCCCCeE
Confidence 87555443221122344 2211 13322211 010 011 124567887655532 23678888887765
No 176
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=21.09 E-value=1.2e+03 Score=28.49 Aligned_cols=140 Identities=14% Similarity=0.184 Sum_probs=72.3
Q ss_pred eCCcEEEEEc-CceeEeeCCCCeEEEEcccCCCCCCccCCCccEEecccccCCCCCCCcccEEEEecCCCCCcccccCCC
Q 039705 210 TDGNLFIFAN-DRSILLNPETNEILHVFPILRGGSRNYPASATSALLPIKLQDPNSNAIRAEVLICGGAKPEAGVLAGKG 288 (539)
Q Consensus 210 ~~G~Iyv~Gg-~~~e~yDp~tn~W~~~~p~mp~~~r~yp~~g~av~lpl~~~~~~~~~~~g~Iyv~GG~~~~~~~~~~~~ 288 (539)
..|+|++.|+ +.+.+||-....-. .++|.+. + +...+|- .+. .+|.|++.|=.+ |
T Consensus 1176 ~~G~Ll~tGd~r~IRIWDa~~E~~~---~diP~~s-~---t~vTaLS-------~~~-~~gn~i~AGfaD-G-------- 1231 (1387)
T KOG1517|consen 1176 QSGHLLVTGDVRSIRIWDAHKEQVV---ADIPYGS-S---TLVTALS-------ADL-VHGNIIAAGFAD-G-------- 1231 (1387)
T ss_pred hCCeEEecCCeeEEEEEecccceeE---eecccCC-C---ccceeec-------ccc-cCCceEEEeecC-C--------
Confidence 3799999987 45788998877644 4556421 1 2233332 122 368888887554 2
Q ss_pred cccccCCceEEEEeeCCC-----CceeeeccCCCceeceeEEecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCC
Q 039705 289 EFMNALQDCGRIEITNKS-----ATWQREMMPSPRVMGEMLLLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPI 363 (539)
Q Consensus 289 ~~~~a~~s~~~~d~~~~~-----~~W~~~~M~~~R~~~~~vvlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~ 363 (539)
++-.||-.-+. ..|..-.-.. +..+..+ =+.|..-++.|.+.| .++++|+..+.
T Consensus 1232 -------svRvyD~R~a~~ds~v~~~R~h~~~~-~Iv~~sl-q~~G~~elvSgs~~G------------~I~~~DlR~~~ 1290 (1387)
T KOG1517|consen 1232 -------SVRVYDRRMAPPDSLVCVYREHNDVE-PIVHLSL-QRQGLGELVSGSQDG------------DIQLLDLRMSS 1290 (1387)
T ss_pred -------ceEEeecccCCccccceeecccCCcc-cceeEEe-ecCCCcceeeeccCC------------eEEEEecccCc
Confidence 23345533211 2333321111 1333322 234444445555432 58899988751
Q ss_pred CCceEecCCCCCCccce---eeEeeCCCCeEEEecCC
Q 039705 364 NERFSELTPTSKPRMCH---STSVVLPDGKILVAGSN 397 (539)
Q Consensus 364 g~~Wt~~a~~~~~R~yh---s~a~llpdG~V~v~GG~ 397 (539)
-+.+ -.....|-|- .+-.+..+..|+..|+.
T Consensus 1291 ~e~~---~~iv~~~~yGs~lTal~VH~hapiiAsGs~ 1324 (1387)
T KOG1517|consen 1291 KETF---LTIVAHWEYGSALTALTVHEHAPIIASGSA 1324 (1387)
T ss_pred cccc---ceeeeccccCccceeeeeccCCCeeeecCc
Confidence 1122 2223344443 33334678899999885
No 177
>smart00120 HX Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and in some members of the matrix metalloproteinase family (matrixins). The HX repeats of some matrixins bind tissue inhibitors of metalloproteinases (TIMPs).
Probab=21.02 E-value=1.5e+02 Score=20.23 Aligned_cols=22 Identities=18% Similarity=0.465 Sum_probs=18.4
Q ss_pred EeeCCcEEEEEcCceeEeeCCC
Q 039705 208 LSTDGNLFIFANDRSILLNPET 229 (539)
Q Consensus 208 ~~~~G~Iyv~Gg~~~e~yDp~t 229 (539)
...+|++|++-|+..++||..+
T Consensus 6 ~~~~~~~yfFkg~~yw~~~~~~ 27 (45)
T smart00120 6 ELRNGKTYFFKGDKYWRFDPKR 27 (45)
T ss_pred EeCCCeEEEEeCCEEEEEcCCc
Confidence 3457899999999999999765
No 178
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=20.99 E-value=5.2e+02 Score=24.12 Aligned_cols=68 Identities=19% Similarity=0.186 Sum_probs=43.0
Q ss_pred CceeecCCEEEEEEEecccccccCcEEEEEEcCCccccccCCCceeEeccceeeeecCCceEEEEEEcCCCCCcCCCcce
Q 039705 441 GKMLKYGQNFVIQFKLDELEVSLNDLKVTMYAPPFTTHGVSMGQRLLVPATKELIDVGSGIFQVSVMAPPTAKIAPPSFY 520 (539)
Q Consensus 441 p~~~~~g~~~~v~~~~~~~~~~~~~~~v~l~~~~~~TH~~n~~QR~~~L~~~~~~~~g~~~~~~~~~~P~~~~~~ppG~y 520 (539)
|..+..|+.|++++-..+.. ....+|.+...+........ ...++. ..+| .+++++| -||.|
T Consensus 144 P~~l~~g~~~~~~vl~~GkP--l~~a~V~~~~~~~~~~~~~~-----~~~~~T-D~~G----~~~~~~~------~~G~w 205 (215)
T PF10670_consen 144 PYKLKAGDPLPFQVLFDGKP--LAGAEVEAFSPGGWYDVEHE-----AKTLKT-DANG----RATFTLP------RPGLW 205 (215)
T ss_pred cccccCCCEEEEEEEECCeE--cccEEEEEEECCCccccccc-----eEEEEE-CCCC----EEEEecC------CCEEE
Confidence 77788999998887754432 23478888888876544333 222221 1234 5666654 57999
Q ss_pred EEEEEc
Q 039705 521 LLFVVY 526 (539)
Q Consensus 521 mlf~~~ 526 (539)
||-+..
T Consensus 206 li~a~~ 211 (215)
T PF10670_consen 206 LIRASH 211 (215)
T ss_pred EEEEEE
Confidence 998864
No 179
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=20.52 E-value=1.9e+02 Score=29.51 Aligned_cols=53 Identities=19% Similarity=0.329 Sum_probs=35.0
Q ss_pred EecCCcEEEEcCcCCCCCCcccCCCCCCccEEEcCCCCCCCceEecCCCCCCccceeeEee----CCCCeEEEecCCC
Q 039705 325 LLPTGDVLIINGAKKGTAGWNFATDPNTTPVLYEPDDPINERFSELTPTSKPRMCHSTSVV----LPDGKILVAGSNP 398 (539)
Q Consensus 325 vlpdG~I~vvGG~~~g~~g~~~~~~~~~~~e~YdP~t~~g~~Wt~~a~~~~~R~yhs~a~l----lpdG~V~v~GG~~ 398 (539)
-.|||..|+.||.++ .+.+|+-.-+...-|..- .|+.|++ ..|+++++.-|.+
T Consensus 55 F~P~gs~~aSgG~Dr-------------~I~LWnv~gdceN~~~lk--------gHsgAVM~l~~~~d~s~i~S~gtD 111 (338)
T KOG0265|consen 55 FHPDGSCFASGGSDR-------------AIVLWNVYGDCENFWVLK--------GHSGAVMELHGMRDGSHILSCGTD 111 (338)
T ss_pred ECCCCCeEeecCCcc-------------eEEEEeccccccceeeec--------cccceeEeeeeccCCCEEEEecCC
Confidence 358999999999874 456665433211167542 5887776 5688877776654
No 180
>PF11090 DUF2833: Protein of unknown function (DUF2833); InterPro: IPR020335 This entry contains proteins with no known function.
Probab=20.04 E-value=47 Score=27.27 Aligned_cols=35 Identities=14% Similarity=0.159 Sum_probs=23.4
Q ss_pred EeeCCCCeEEEecCCCCCCCccCCCCCCCcceeeEEecCCC
Q 039705 383 SVVLPDGKILVAGSNPHSRYNLTSGSKYPTELRIEKFYPPY 423 (539)
Q Consensus 383 a~llpdG~V~v~GG~~~~~~~~~~~~~~p~~~~vE~y~Ppy 423 (539)
.++..+|++++.||+..+.+.+- +...++.+++++
T Consensus 3 v~~~~~g~~lAiGG~~g~~~Wfv------tt~~v~~~~~~~ 37 (86)
T PF11090_consen 3 VTIEHKGRPLAIGGNNGGCLWFV------TTNKVKSLTKKE 37 (86)
T ss_pred EEEecCCeEEEEccccCCeEEEE------ECcHHhhcCHhh
Confidence 45567999999999974433332 345667777764
Done!