Query 045821
Match_columns 352
No_of_seqs 190 out of 2431
Neff 10.2
Searched_HMMs 46136
Date Fri Mar 29 05:50:20 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/045821.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/045821hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4441 Proteins containing BT 100.0 1.4E-42 3E-47 327.5 31.0 258 66-330 282-555 (571)
2 PHA02713 hypothetical protein; 100.0 1.9E-41 4.1E-46 321.2 29.8 255 69-330 258-542 (557)
3 KOG4441 Proteins containing BT 100.0 5.5E-39 1.2E-43 303.1 25.8 254 36-300 301-566 (571)
4 PLN02153 epithiospecifier prot 100.0 6.5E-37 1.4E-41 275.9 28.6 264 47-313 8-324 (341)
5 PHA02713 hypothetical protein; 100.0 1.1E-37 2.5E-42 295.4 22.3 236 44-288 280-543 (557)
6 PLN02193 nitrile-specifier pro 100.0 2.6E-35 5.7E-40 274.9 35.6 261 68-330 120-419 (470)
7 PLN02193 nitrile-specifier pro 100.0 6.2E-36 1.3E-40 279.1 30.3 264 46-313 151-453 (470)
8 TIGR03548 mutarot_permut cycli 100.0 1.2E-35 2.7E-40 265.7 29.2 240 67-314 12-314 (323)
9 PHA03098 kelch-like protein; P 100.0 6.6E-36 1.4E-40 285.4 29.1 253 69-330 251-520 (534)
10 TIGR03547 muta_rot_YjhT mutatr 100.0 2E-35 4.4E-40 267.1 27.5 240 67-313 16-331 (346)
11 PLN02153 epithiospecifier prot 100.0 6.9E-35 1.5E-39 262.8 30.4 235 93-327 5-290 (341)
12 PRK14131 N-acetylneuraminic ac 100.0 1.2E-34 2.6E-39 263.9 28.1 239 68-313 38-353 (376)
13 PHA02790 Kelch-like protein; P 100.0 2.1E-34 4.6E-39 269.5 27.6 203 115-326 267-475 (480)
14 PHA03098 kelch-like protein; P 100.0 4.8E-34 1E-38 272.6 26.2 217 68-290 294-523 (534)
15 PHA02790 Kelch-like protein; P 100.0 1E-33 2.2E-38 264.9 26.5 203 67-286 270-478 (480)
16 KOG4693 Uncharacterized conser 100.0 1.4E-33 3E-38 228.9 21.7 251 61-313 14-312 (392)
17 TIGR03548 mutarot_permut cycli 100.0 6.1E-32 1.3E-36 241.9 27.4 222 109-330 3-288 (323)
18 TIGR03547 muta_rot_YjhT mutatr 100.0 8.8E-32 1.9E-36 243.4 26.8 223 101-331 2-308 (346)
19 PRK14131 N-acetylneuraminic ac 100.0 2.2E-30 4.8E-35 236.0 26.3 230 94-331 16-330 (376)
20 KOG4693 Uncharacterized conser 100.0 3.7E-28 8.1E-33 197.4 21.8 229 96-327 3-282 (392)
21 KOG0379 Kelch repeat-containin 99.9 4.7E-25 1E-29 205.5 23.3 234 54-289 54-312 (482)
22 KOG0379 Kelch repeat-containin 99.9 7.4E-24 1.6E-28 197.5 25.1 225 106-332 57-312 (482)
23 KOG1230 Protein containing rep 99.9 7.7E-23 1.7E-27 175.9 18.8 223 108-330 65-349 (521)
24 KOG4152 Host cell transcriptio 99.9 6.6E-23 1.4E-27 180.6 16.4 264 47-313 18-342 (830)
25 KOG1230 Protein containing rep 99.9 1.9E-22 4.2E-27 173.4 17.6 257 18-286 33-348 (521)
26 KOG4152 Host cell transcriptio 99.8 3.1E-18 6.7E-23 151.4 18.5 218 93-312 15-273 (830)
27 COG3055 Uncharacterized protei 99.8 1.2E-17 2.7E-22 142.1 20.6 237 68-313 46-359 (381)
28 COG3055 Uncharacterized protei 99.7 3.6E-15 7.8E-20 127.2 18.6 207 99-313 29-295 (381)
29 KOG2437 Muskelin [Signal trans 99.5 1.3E-14 2.8E-19 128.4 7.4 234 93-327 237-540 (723)
30 PF13964 Kelch_6: Kelch motif 99.3 1.4E-11 3.1E-16 77.7 6.6 50 109-158 1-50 (50)
31 KOG2437 Muskelin [Signal trans 99.2 5.9E-12 1.3E-16 111.8 4.9 239 46-286 239-542 (723)
32 PF13964 Kelch_6: Kelch motif 99.1 1.1E-10 2.5E-15 73.5 5.9 46 157-202 1-50 (50)
33 TIGR01640 F_box_assoc_1 F-box 99.1 8.4E-08 1.8E-12 81.7 22.7 188 136-323 14-230 (230)
34 PF13415 Kelch_3: Galactose ox 99.0 8.4E-10 1.8E-14 69.1 5.8 48 119-166 1-49 (49)
35 PF01344 Kelch_1: Kelch motif; 99.0 4.8E-10 1E-14 69.7 4.0 47 109-155 1-47 (47)
36 PF01344 Kelch_1: Kelch motif; 99.0 7.4E-10 1.6E-14 68.8 4.0 43 157-199 1-47 (47)
37 PF07646 Kelch_2: Kelch motif; 98.9 1.3E-08 2.7E-13 63.7 6.7 47 109-155 1-49 (49)
38 PF13418 Kelch_4: Galactose ox 98.8 3.6E-09 7.8E-14 66.3 3.8 47 109-155 1-48 (49)
39 PF07646 Kelch_2: Kelch motif; 98.8 1.2E-08 2.7E-13 63.8 5.7 43 157-199 1-49 (49)
40 TIGR01640 F_box_assoc_1 F-box 98.8 5.7E-06 1.2E-10 70.4 23.8 194 83-280 14-230 (230)
41 smart00612 Kelch Kelch domain. 98.7 1.9E-08 4.2E-13 62.3 4.9 47 121-168 1-47 (47)
42 PF13418 Kelch_4: Galactose ox 98.7 1.3E-08 2.8E-13 63.8 3.9 43 157-199 1-48 (49)
43 PF07250 Glyoxal_oxid_N: Glyox 98.7 3.5E-06 7.7E-11 71.0 17.7 149 136-291 46-211 (243)
44 PF13415 Kelch_3: Galactose ox 98.6 1.4E-07 3E-12 58.9 5.7 43 167-209 1-48 (49)
45 smart00612 Kelch Kelch domain. 98.5 1.5E-07 3.3E-12 58.2 4.8 44 169-212 1-47 (47)
46 PLN02772 guanylate kinase 98.5 1.6E-06 3.5E-11 77.6 10.7 83 108-190 23-110 (398)
47 PF07250 Glyoxal_oxid_N: Glyox 98.4 3.9E-05 8.5E-10 64.7 16.1 141 183-327 49-203 (243)
48 PLN03215 ascorbic acid mannose 98.3 0.0016 3.4E-08 58.5 25.1 38 16-53 2-40 (373)
49 PF13854 Kelch_5: Kelch motif 98.2 3.1E-06 6.8E-11 50.8 5.3 38 108-145 3-41 (42)
50 PLN02772 guanylate kinase 98.2 9.8E-06 2.1E-10 72.7 10.1 80 156-237 23-110 (398)
51 PRK11138 outer membrane biogen 98.1 0.0069 1.5E-07 56.0 28.2 228 67-327 68-320 (394)
52 PF12937 F-box-like: F-box-lik 98.1 9.2E-07 2E-11 54.7 1.3 40 18-57 1-40 (47)
53 PF13854 Kelch_5: Kelch motif 98.1 8.1E-06 1.8E-10 48.9 5.2 36 154-189 1-41 (42)
54 PF07893 DUF1668: Protein of u 98.0 0.0066 1.4E-07 54.7 24.7 119 114-244 70-216 (342)
55 PRK11138 outer membrane biogen 98.0 0.012 2.6E-07 54.4 27.8 206 84-319 171-392 (394)
56 PF00646 F-box: F-box domain; 98.0 1.1E-06 2.4E-11 54.6 0.0 43 17-59 2-44 (48)
57 smart00256 FBOX A Receptor for 97.9 6.2E-06 1.4E-10 49.2 1.5 37 21-57 1-37 (41)
58 PF13360 PQQ_2: PQQ-like domai 97.9 0.011 2.4E-07 50.3 21.9 173 136-324 3-194 (238)
59 KOG0281 Beta-TrCP (transducin 97.8 0.00087 1.9E-08 58.0 14.1 279 18-322 75-389 (499)
60 TIGR03300 assembly_YfgL outer 97.8 0.03 6.6E-07 51.3 27.7 173 116-317 187-375 (377)
61 PF13360 PQQ_2: PQQ-like domai 97.5 0.041 9E-07 46.7 27.0 182 116-324 33-233 (238)
62 TIGR03300 assembly_YfgL outer 97.5 0.069 1.5E-06 49.0 28.8 221 68-327 65-305 (377)
63 PF03089 RAG2: Recombination a 97.5 0.004 8.7E-08 52.7 13.1 101 121-221 39-174 (337)
64 PF08450 SGL: SMP-30/Gluconola 97.4 0.02 4.4E-07 49.1 17.7 186 119-322 11-214 (246)
65 PF14870 PSII_BNR: Photosynthe 97.4 0.066 1.4E-06 47.1 20.7 236 68-328 27-268 (302)
66 PF12768 Rax2: Cortical protei 97.3 0.018 3.8E-07 50.1 15.5 105 180-287 16-130 (281)
67 PF07893 DUF1668: Protein of u 97.2 0.018 4E-07 51.9 15.2 119 166-288 75-217 (342)
68 PRK13684 Ycf48-like protein; P 97.1 0.21 4.5E-06 45.0 20.8 206 93-321 117-331 (334)
69 PF08450 SGL: SMP-30/Gluconola 96.9 0.1 2.2E-06 44.7 16.6 190 68-286 11-221 (246)
70 TIGR03866 PQQ_ABC_repeats PQQ- 96.9 0.27 5.8E-06 42.9 19.9 182 121-324 2-190 (300)
71 PF05096 Glu_cyclase_2: Glutam 96.8 0.14 3E-06 43.8 16.1 156 160-327 47-209 (264)
72 PRK04792 tolB translocation pr 96.8 0.45 9.7E-06 44.8 26.3 148 135-287 241-391 (448)
73 TIGR03866 PQQ_ABC_repeats PQQ- 96.7 0.35 7.6E-06 42.2 24.6 224 70-322 2-238 (300)
74 KOG2055 WD40 repeat protein [G 96.6 0.058 1.3E-06 48.8 12.7 148 167-324 224-377 (514)
75 PF12768 Rax2: Cortical protei 96.6 0.19 4.1E-06 43.8 15.6 106 134-244 14-130 (281)
76 KOG2055 WD40 repeat protein [G 96.6 0.12 2.6E-06 46.9 14.4 188 68-279 224-418 (514)
77 KOG0274 Cdc4 and related F-box 96.6 0.37 8.1E-06 46.2 18.8 285 14-323 104-402 (537)
78 cd00200 WD40 WD40 domain, foun 96.5 0.42 9.1E-06 40.6 18.2 144 167-324 62-210 (289)
79 PRK13684 Ycf48-like protein; P 96.5 0.57 1.2E-05 42.2 21.7 212 93-327 74-294 (334)
80 cd00200 WD40 WD40 domain, foun 96.5 0.43 9.3E-06 40.6 22.6 183 119-322 62-250 (289)
81 PF09910 DUF2139: Uncharacteri 96.5 0.48 1E-05 40.9 21.6 169 100-279 28-231 (339)
82 KOG0291 WD40-repeat-containing 96.5 0.95 2.1E-05 43.9 21.0 154 117-288 359-517 (893)
83 PF03089 RAG2: Recombination a 96.3 0.59 1.3E-05 40.1 16.3 70 108-177 86-174 (337)
84 cd00094 HX Hemopexin-like repe 96.3 0.5 1.1E-05 38.9 17.6 145 115-281 12-178 (194)
85 PLN02919 haloacid dehalogenase 96.3 1.9 4.1E-05 45.3 27.9 192 119-324 635-891 (1057)
86 PRK00178 tolB translocation pr 96.2 1.1 2.3E-05 42.0 22.6 182 82-280 222-408 (430)
87 PRK04792 tolB translocation pr 96.2 1.1 2.4E-05 42.2 23.5 183 81-280 240-427 (448)
88 PRK11028 6-phosphogluconolacto 96.2 0.88 1.9E-05 40.8 19.4 189 121-322 3-206 (330)
89 PF05096 Glu_cyclase_2: Glutam 96.1 0.31 6.7E-06 41.7 13.9 115 210-332 54-168 (264)
90 PRK11028 6-phosphogluconolacto 96.1 1 2.2E-05 40.4 22.7 191 119-323 46-260 (330)
91 PRK04922 tolB translocation pr 96.0 1.4 3E-05 41.4 24.9 180 135-322 227-412 (433)
92 KOG0310 Conserved WD40 repeat- 96.0 0.44 9.4E-06 43.6 14.8 177 135-323 47-227 (487)
93 PF10282 Lactonase: Lactonase, 96.0 1.2 2.5E-05 40.4 22.1 238 68-322 48-323 (345)
94 PRK00178 tolB translocation pr 95.9 1.5 3.2E-05 41.0 26.7 146 135-287 222-372 (430)
95 PRK05137 tolB translocation pr 95.9 1.6 3.4E-05 41.0 23.4 188 82-286 225-420 (435)
96 smart00284 OLF Olfactomedin-li 95.9 1 2.2E-05 38.6 20.5 193 119-321 34-252 (255)
97 PF10282 Lactonase: Lactonase, 95.8 1.4 3E-05 40.0 22.0 234 74-321 4-275 (345)
98 TIGR02800 propeller_TolB tol-p 95.8 1.6 3.6E-05 40.4 23.5 182 82-279 213-398 (417)
99 COG4257 Vgb Streptogramin lyas 95.7 1.1 2.5E-05 38.3 18.3 218 86-325 86-309 (353)
100 KOG0310 Conserved WD40 repeat- 95.7 1.3 2.8E-05 40.7 16.6 223 68-319 79-306 (487)
101 TIGR02800 propeller_TolB tol-p 95.7 1.8 3.8E-05 40.2 26.3 146 136-287 214-363 (417)
102 PF03178 CPSF_A: CPSF A subuni 95.6 0.83 1.8E-05 40.9 15.6 133 136-277 62-203 (321)
103 TIGR03075 PQQ_enz_alc_DH PQQ-d 95.6 1.4 3.1E-05 42.3 18.0 115 161-285 63-198 (527)
104 cd00216 PQQ_DH Dehydrogenases 95.5 2.4 5.2E-05 40.5 24.4 116 115-241 57-191 (488)
105 PRK04922 tolB translocation pr 95.5 2.2 4.8E-05 40.0 23.7 187 82-285 227-418 (433)
106 PF02191 OLF: Olfactomedin-lik 95.5 1.5 3.1E-05 37.7 19.4 191 119-321 30-247 (250)
107 COG4257 Vgb Streptogramin lyas 95.4 1.5 3.3E-05 37.5 17.0 184 84-287 125-314 (353)
108 cd00094 HX Hemopexin-like repe 95.3 1.3 2.9E-05 36.3 17.9 145 162-324 11-178 (194)
109 PF08268 FBA_3: F-box associat 95.2 0.69 1.5E-05 35.2 11.9 79 164-242 2-87 (129)
110 PF08268 FBA_3: F-box associat 95.2 0.42 9.2E-06 36.4 10.6 79 208-287 2-89 (129)
111 PRK04043 tolB translocation pr 95.1 2.9 6.3E-05 39.0 20.4 177 136-323 213-402 (419)
112 KOG2120 SCF ubiquitin ligase, 95.0 0.016 3.6E-07 49.8 2.5 38 18-55 98-135 (419)
113 PLN00181 protein SPA1-RELATED; 95.0 4.8 0.0001 41.0 23.6 172 136-322 555-739 (793)
114 TIGR03075 PQQ_enz_alc_DH PQQ-d 94.7 2.5 5.5E-05 40.7 16.7 117 115-241 65-197 (527)
115 PRK05137 tolB translocation pr 94.7 3.9 8.4E-05 38.4 27.6 181 135-323 225-414 (435)
116 PF12217 End_beta_propel: Cata 94.6 2.5 5.4E-05 35.9 14.0 175 93-267 53-258 (367)
117 PF02191 OLF: Olfactomedin-lik 94.5 2.7 5.9E-05 36.0 18.6 192 68-278 30-247 (250)
118 PRK03629 tolB translocation pr 94.3 4.7 0.0001 37.7 26.9 148 135-287 222-372 (429)
119 PRK03629 tolB translocation pr 94.2 4.8 0.00011 37.7 23.6 188 83-286 223-414 (429)
120 PRK04043 tolB translocation pr 94.1 5.1 0.00011 37.4 23.0 190 82-286 212-408 (419)
121 KOG0316 Conserved WD40 repeat- 94.1 3.1 6.7E-05 34.9 17.3 176 135-325 80-261 (307)
122 KOG0286 G-protein beta subunit 94.0 3.7 8E-05 35.5 19.7 204 119-348 108-321 (343)
123 PRK02889 tolB translocation pr 93.8 5.8 0.00013 37.1 24.8 146 135-286 219-368 (427)
124 KOG0266 WD40 repeat-containing 93.6 6.8 0.00015 37.0 17.5 178 135-322 224-410 (456)
125 PF03178 CPSF_A: CPSF A subuni 93.5 3 6.5E-05 37.3 13.9 122 180-310 62-190 (321)
126 PLN02919 haloacid dehalogenase 93.2 13 0.00028 39.2 22.9 150 119-282 694-892 (1057)
127 PLN00033 photosystem II stabil 93.2 7.1 0.00015 36.1 22.8 216 93-328 118-363 (398)
128 KOG0646 WD40 repeat protein [G 93.1 3.8 8.1E-05 37.6 13.2 199 113-330 85-316 (476)
129 smart00284 OLF Olfactomedin-li 93.0 5.4 0.00012 34.2 20.2 192 68-278 34-252 (255)
130 PTZ00421 coronin; Provisional 92.9 9 0.0002 36.6 20.3 147 168-326 138-295 (493)
131 KOG0649 WD40 repeat protein [G 92.9 5.2 0.00011 33.7 16.0 185 147-352 100-303 (325)
132 COG2706 3-carboxymuconate cycl 92.6 7 0.00015 34.7 25.4 245 68-324 51-324 (346)
133 KOG0316 Conserved WD40 repeat- 92.6 5.6 0.00012 33.4 17.2 189 68-282 70-261 (307)
134 PLN00033 photosystem II stabil 92.5 8.8 0.00019 35.5 21.9 199 93-312 164-390 (398)
135 KOG2997 F-box protein FBX9 [Ge 92.3 0.058 1.3E-06 46.7 1.0 43 18-60 107-154 (366)
136 PF07734 FBA_1: F-box associat 92.1 5.1 0.00011 31.8 13.3 80 208-287 2-92 (164)
137 PLN03215 ascorbic acid mannose 92.0 8.3 0.00018 35.1 14.2 132 145-285 189-353 (373)
138 PF14870 PSII_BNR: Photosynthe 91.7 9 0.0002 33.9 24.9 217 68-312 71-295 (302)
139 PTZ00421 coronin; Provisional 91.7 13 0.00028 35.6 22.1 152 120-285 138-297 (493)
140 KOG0296 Angio-associated migra 91.4 10 0.00022 33.9 14.9 141 167-324 75-223 (399)
141 COG4946 Uncharacterized protei 91.3 12 0.00027 34.7 20.4 164 55-243 264-439 (668)
142 KOG0315 G-protein beta subunit 91.2 8.4 0.00018 32.6 16.6 165 167-347 51-221 (311)
143 PF07734 FBA_1: F-box associat 91.2 6.5 0.00014 31.2 11.9 78 164-242 2-90 (164)
144 PRK02889 tolB translocation pr 91.1 13 0.00029 34.7 23.4 183 81-279 218-404 (427)
145 PTZ00420 coronin; Provisional 90.9 16 0.00036 35.4 19.8 141 169-323 139-295 (568)
146 PF02897 Peptidase_S9_N: Proly 90.8 14 0.00029 34.4 17.3 161 68-242 238-411 (414)
147 COG1520 FOG: WD40-like repeat 90.8 13 0.00028 34.0 20.2 156 163-331 64-229 (370)
148 KOG0296 Angio-associated migra 90.8 12 0.00025 33.5 21.5 147 119-281 75-223 (399)
149 KOG1036 Mitotic spindle checkp 90.6 7.6 0.00017 33.8 11.8 132 180-324 35-166 (323)
150 COG4447 Uncharacterized protei 90.5 6.9 0.00015 33.8 11.4 244 68-339 54-307 (339)
151 TIGR03074 PQQ_membr_DH membran 90.0 24 0.00051 35.8 19.3 35 160-197 187-223 (764)
152 COG1520 FOG: WD40-like repeat 89.8 16 0.00034 33.4 25.2 153 116-285 65-226 (370)
153 KOG0315 G-protein beta subunit 89.3 12 0.00027 31.7 16.4 177 136-323 61-247 (311)
154 KOG1332 Vesicle coat complex C 89.2 13 0.00027 31.6 14.1 103 169-288 176-297 (299)
155 TIGR03074 PQQ_membr_DH membran 89.0 28 0.0006 35.3 18.5 35 113-154 188-224 (764)
156 KOG0266 WD40 repeat-containing 88.8 22 0.00047 33.7 18.5 146 119-279 257-410 (456)
157 PF02897 Peptidase_S9_N: Proly 88.4 21 0.00046 33.1 19.9 147 135-286 251-412 (414)
158 KOG1446 Histone H3 (Lys4) meth 88.1 17 0.00037 31.7 22.5 101 84-195 37-137 (311)
159 cd00216 PQQ_DH Dehydrogenases 88.0 25 0.00055 33.5 25.9 162 135-309 255-454 (488)
160 PRK01742 tolB translocation pr 87.2 26 0.00056 32.8 22.1 180 82-285 227-411 (429)
161 KOG2321 WD40 repeat protein [G 86.7 17 0.00038 34.6 12.2 62 212-281 146-208 (703)
162 COG4946 Uncharacterized protei 86.7 27 0.00059 32.5 16.9 184 134-327 105-301 (668)
163 KOG4649 PQQ (pyrrolo-quinoline 85.1 24 0.00051 30.3 12.7 90 180-281 33-126 (354)
164 KOG2106 Uncharacterized conser 84.8 35 0.00076 32.1 18.4 185 84-289 223-409 (626)
165 KOG0289 mRNA splicing factor [ 84.4 34 0.00073 31.5 17.8 119 160-290 350-474 (506)
166 KOG0278 Serine/threonine kinas 84.2 25 0.00054 29.9 11.9 125 134-275 163-294 (334)
167 KOG1332 Vesicle coat complex C 84.2 25 0.00054 29.9 14.1 74 156-244 221-296 (299)
168 PF07433 DUF1513: Protein of u 83.4 31 0.00068 30.5 21.6 237 68-321 16-285 (305)
169 KOG0274 Cdc4 and related F-box 83.2 46 0.001 32.2 20.0 177 134-327 309-488 (537)
170 PRK01742 tolB translocation pr 82.7 42 0.00091 31.4 25.4 139 135-286 227-369 (429)
171 PF06433 Me-amine-dh_H: Methyl 82.7 36 0.00078 30.6 12.2 69 211-282 249-324 (342)
172 KOG0291 WD40-repeat-containing 82.4 55 0.0012 32.5 23.1 157 115-285 312-473 (893)
173 PF15525 DUF4652: Domain of un 82.2 25 0.00055 28.5 11.2 86 127-212 79-172 (200)
174 PRK10115 protease 2; Provision 82.2 59 0.0013 32.7 25.4 149 135-286 198-354 (686)
175 PLN00181 protein SPA1-RELATED; 82.0 64 0.0014 33.0 22.2 143 119-280 587-740 (793)
176 KOG3881 Uncharacterized conser 81.5 38 0.00082 30.7 11.6 151 168-326 161-325 (412)
177 COG2706 3-carboxymuconate cycl 81.3 40 0.00086 30.2 24.8 210 120-336 52-289 (346)
178 TIGR02658 TTQ_MADH_Hv methylam 80.3 46 0.001 30.2 23.5 120 68-194 12-142 (352)
179 COG3386 Gluconolactonase [Carb 77.6 51 0.0011 29.3 18.2 183 136-332 47-253 (307)
180 KOG0282 mRNA splicing factor [ 77.6 29 0.00062 32.3 9.9 62 167-238 269-333 (503)
181 KOG2111 Uncharacterized conser 75.5 58 0.0013 28.8 17.6 145 119-282 58-216 (346)
182 TIGR02658 TTQ_MADH_Hv methylam 74.9 67 0.0014 29.2 22.4 68 258-327 259-336 (352)
183 PF14583 Pectate_lyase22: Olig 73.8 41 0.00089 30.8 10.0 223 68-309 46-299 (386)
184 PRK10115 protease 2; Provision 73.8 1E+02 0.0022 30.9 21.8 150 135-288 246-404 (686)
185 KOG0263 Transcription initiati 71.7 43 0.00093 33.0 10.0 63 210-279 587-650 (707)
186 KOG0643 Translation initiation 71.6 67 0.0014 27.8 20.2 171 133-313 71-253 (327)
187 COG0823 TolB Periplasmic compo 70.4 95 0.0021 29.1 16.9 108 134-244 260-368 (425)
188 KOG0647 mRNA export protein (c 69.9 78 0.0017 27.9 16.2 129 135-275 93-225 (347)
189 KOG1036 Mitotic spindle checkp 69.9 77 0.0017 27.8 15.6 132 136-281 35-166 (323)
190 KOG0263 Transcription initiati 69.4 60 0.0013 32.1 10.4 59 167-234 588-648 (707)
191 KOG0305 Anaphase promoting com 69.2 1.1E+02 0.0023 29.2 12.4 173 137-323 198-378 (484)
192 KOG2321 WD40 repeat protein [G 69.0 65 0.0014 31.0 10.2 105 167-281 145-261 (703)
193 PF12217 End_beta_propel: Cata 68.6 77 0.0017 27.3 13.1 194 116-310 22-256 (367)
194 COG4447 Uncharacterized protei 68.0 53 0.0011 28.6 8.7 113 163-286 50-164 (339)
195 PF14583 Pectate_lyase22: Olig 68.0 1E+02 0.0022 28.4 11.7 129 85-221 170-303 (386)
196 KOG0286 G-protein beta subunit 67.5 86 0.0019 27.5 19.7 134 135-277 165-302 (343)
197 KOG0281 Beta-TrCP (transducin 67.1 30 0.00065 30.9 7.3 102 166-279 328-429 (499)
198 KOG1539 WD repeat protein [Gen 66.7 1.5E+02 0.0033 30.0 18.6 71 113-194 39-109 (910)
199 KOG0285 Pleiotropic regulator 66.5 1E+02 0.0022 27.9 15.7 204 119-349 203-416 (460)
200 COG0823 TolB Periplasmic compo 65.7 1.2E+02 0.0026 28.5 14.2 103 180-286 262-367 (425)
201 KOG0318 WD40 repeat stress pro 65.2 1.3E+02 0.0028 28.7 20.6 86 256-344 329-416 (603)
202 PF02239 Cytochrom_D1: Cytochr 65.0 1.1E+02 0.0024 28.0 11.2 149 119-283 48-207 (369)
203 KOG0649 WD40 repeat protein [G 63.4 95 0.0021 26.5 16.5 129 68-218 126-263 (325)
204 COG4880 Secreted protein conta 62.6 1.3E+02 0.0029 27.9 13.8 184 83-288 406-600 (603)
205 KOG0647 mRNA export protein (c 62.0 1.1E+02 0.0025 26.9 10.4 63 211-279 83-146 (347)
206 PTZ00420 coronin; Provisional 61.2 1.7E+02 0.0037 28.7 18.0 148 121-286 139-301 (568)
207 KOG3545 Olfactomedin and relat 60.4 1.1E+02 0.0023 26.2 16.9 190 68-278 30-246 (249)
208 KOG0295 WD40 repeat-containing 59.8 1.4E+02 0.0029 27.1 15.6 53 257-313 344-396 (406)
209 PF13088 BNR_2: BNR repeat-lik 59.7 1.1E+02 0.0025 26.2 16.2 214 93-308 28-275 (275)
210 KOG0640 mRNA cleavage stimulat 59.6 79 0.0017 27.9 8.4 104 168-282 184-295 (430)
211 PF02239 Cytochrom_D1: Cytochr 59.4 1.4E+02 0.0031 27.3 13.0 134 135-280 15-160 (369)
212 PF06433 Me-amine-dh_H: Methyl 58.5 42 0.0009 30.2 6.8 70 119-192 249-324 (342)
213 COG3391 Uncharacterized conser 58.2 1.5E+02 0.0033 27.2 18.8 150 167-324 85-242 (381)
214 KOG4378 Nuclear protein COP1 [ 57.4 1.8E+02 0.0038 27.7 12.4 90 180-279 187-281 (673)
215 KOG0640 mRNA cleavage stimulat 55.5 1.2E+02 0.0026 26.9 8.7 108 167-282 227-339 (430)
216 KOG1520 Predicted alkaloid syn 55.4 1.7E+02 0.0036 26.8 12.7 69 211-285 126-214 (376)
217 KOG0289 mRNA splicing factor [ 54.7 1.8E+02 0.004 27.0 17.7 119 112-246 350-473 (506)
218 PF14298 DUF4374: Domain of un 54.4 1.9E+02 0.0041 27.1 11.9 59 180-239 367-428 (435)
219 KOG1274 WD40 repeat protein [G 54.0 2.6E+02 0.0057 28.6 17.6 149 170-330 68-225 (933)
220 KOG4378 Nuclear protein COP1 [ 53.1 2.1E+02 0.0045 27.2 14.9 61 257-322 219-281 (673)
221 PF14781 BBS2_N: Ciliary BBSom 52.5 1E+02 0.0023 23.6 7.5 63 168-242 64-134 (136)
222 KOG3881 Uncharacterized conser 51.8 1.9E+02 0.0042 26.4 12.2 148 120-280 161-322 (412)
223 KOG0282 mRNA splicing factor [ 51.3 70 0.0015 29.9 7.1 63 257-322 268-331 (503)
224 PF07433 DUF1513: Protein of u 51.2 1.8E+02 0.0039 25.9 17.7 214 113-334 8-260 (305)
225 PF13859 BNR_3: BNR repeat-lik 51.0 1.8E+02 0.0039 25.9 12.7 166 161-327 2-211 (310)
226 KOG0772 Uncharacterized conser 50.7 2.3E+02 0.005 27.1 15.8 154 156-324 315-491 (641)
227 KOG1517 Guanine nucleotide bin 50.4 3.3E+02 0.0072 28.8 13.1 163 148-324 1154-1336(1387)
228 PF11768 DUF3312: Protein of u 50.1 1.8E+02 0.0038 28.1 9.8 107 211-324 218-332 (545)
229 KOG0321 WD40 repeat-containing 49.1 90 0.0019 30.4 7.7 97 228-324 75-178 (720)
230 PF08662 eIF2A: Eukaryotic tra 48.8 1.5E+02 0.0032 24.2 10.2 64 167-236 71-134 (194)
231 PF13570 PQQ_3: PQQ-like domai 48.5 42 0.00091 19.1 3.7 25 298-322 16-40 (40)
232 KOG2048 WD40 repeat protein [G 47.6 2.9E+02 0.0063 27.3 21.9 121 195-324 421-551 (691)
233 PRK01029 tolB translocation pr 47.2 2.5E+02 0.0054 26.3 21.8 59 227-286 351-411 (428)
234 PF11134 Phage_stabilise: Phag 46.8 2.5E+02 0.0054 26.3 12.0 112 160-279 234-364 (469)
235 PF14781 BBS2_N: Ciliary BBSom 44.1 46 0.00099 25.5 4.1 64 258-323 63-127 (136)
236 KOG1897 Damage-specific DNA bi 43.7 4E+02 0.0088 27.8 19.4 148 120-278 787-944 (1096)
237 KOG1428 Inhibitor of type V ad 43.3 3.8E+02 0.0081 29.8 11.3 49 210-267 496-544 (3738)
238 KOG0294 WD40 repeat-containing 43.2 2.4E+02 0.0053 25.1 14.1 44 146-191 29-74 (362)
239 KOG2048 WD40 repeat protein [G 42.5 3.5E+02 0.0076 26.7 21.4 147 163-318 159-315 (691)
240 KOG0272 U4/U6 small nuclear ri 42.3 2.8E+02 0.0062 25.7 11.1 111 166-289 313-427 (459)
241 KOG0650 WD40 repeat nucleolar 40.7 2E+02 0.0043 28.1 8.4 135 168-312 534-680 (733)
242 KOG1538 Uncharacterized conser 40.4 3.9E+02 0.0084 26.7 12.5 170 147-323 2-253 (1081)
243 COG4101 Predicted mannose-6-ph 40.3 98 0.0021 23.0 5.1 48 302-350 67-119 (142)
244 COG3391 Uncharacterized conser 39.8 3E+02 0.0066 25.3 19.9 153 118-285 84-246 (381)
245 PRK01029 tolB translocation pr 39.7 3.2E+02 0.007 25.6 25.8 185 135-325 210-407 (428)
246 KOG1446 Histone H3 (Lys4) meth 39.6 2.7E+02 0.0059 24.6 15.9 147 180-336 36-183 (311)
247 PF13088 BNR_2: BNR repeat-lik 39.1 2.5E+02 0.0054 24.0 12.6 143 68-216 118-275 (275)
248 KOG1963 WD40 repeat protein [G 38.6 4.4E+02 0.0095 26.8 16.5 95 228-322 433-539 (792)
249 PF13013 F-box-like_2: F-box-l 37.8 32 0.00069 25.3 2.4 29 18-46 22-50 (109)
250 KOG0639 Transducin-like enhanc 37.3 3.4E+02 0.0073 26.0 9.2 34 256-290 474-507 (705)
251 COG3823 Glutamine cyclotransfe 36.9 2.5E+02 0.0055 23.5 9.7 116 207-331 51-168 (262)
252 KOG0322 G-protein beta subunit 36.8 2.9E+02 0.0062 24.1 9.2 113 167-289 163-287 (323)
253 KOG0639 Transducin-like enhanc 36.6 1.1E+02 0.0024 28.9 6.1 101 167-278 476-581 (705)
254 KOG4649 PQQ (pyrrolo-quinoline 35.5 3E+02 0.0065 24.0 19.5 71 115-196 16-91 (354)
255 COG3386 Gluconolactonase [Carb 35.2 3.3E+02 0.0071 24.3 22.8 205 82-312 46-276 (307)
256 PF08662 eIF2A: Eukaryotic tra 34.7 2.5E+02 0.0054 22.8 11.7 68 119-196 71-139 (194)
257 PF09910 DUF2139: Uncharacteri 34.5 3.3E+02 0.0072 24.2 16.0 112 68-195 117-238 (339)
258 KOG2106 Uncharacterized conser 34.1 4.3E+02 0.0093 25.3 13.1 98 211-319 212-314 (626)
259 TIGR03032 conserved hypothetic 33.8 3.5E+02 0.0076 24.3 22.1 33 257-290 211-243 (335)
260 KOG0299 U3 snoRNP-associated p 32.6 4.3E+02 0.0092 24.9 14.9 141 116-281 210-359 (479)
261 KOG0308 Conserved WD40 repeat- 32.0 3.2E+02 0.007 26.9 8.5 95 180-281 95-204 (735)
262 PF06977 SdiA-regulated: SdiA- 31.8 2.1E+02 0.0046 24.5 6.9 59 258-318 182-247 (248)
263 KOG0265 U5 snRNP-specific prot 31.5 3.7E+02 0.008 23.8 10.3 96 167-280 58-165 (338)
264 PF00780 CNH: CNH domain; Int 30.9 3.4E+02 0.0074 23.2 18.7 83 256-341 146-236 (275)
265 KOG3545 Olfactomedin and relat 30.4 3.5E+02 0.0076 23.2 19.6 190 119-320 30-245 (249)
266 KOG0306 WD40-repeat-containing 30.4 5.9E+02 0.013 25.8 15.4 169 135-320 42-218 (888)
267 KOG2315 Predicted translation 29.8 5.2E+02 0.011 25.0 13.3 64 119-190 282-346 (566)
268 KOG1188 WD40 repeat protein [G 29.7 4.2E+02 0.0092 23.9 12.4 176 135-324 49-245 (376)
269 KOG1274 WD40 repeat protein [G 29.7 6.5E+02 0.014 26.0 21.0 60 258-321 200-262 (933)
270 PF11768 DUF3312: Protein of u 28.3 5.6E+02 0.012 24.9 10.6 96 82-191 235-332 (545)
271 KOG4341 F-box protein containi 28.1 32 0.0007 31.7 1.4 38 16-53 70-107 (483)
272 smart00564 PQQ beta-propeller 27.9 1E+02 0.0022 16.2 3.4 24 256-281 4-27 (33)
273 KOG0293 WD40 repeat-containing 27.5 5.1E+02 0.011 24.1 16.5 130 137-279 292-426 (519)
274 PRK02888 nitrous-oxide reducta 27.4 6.3E+02 0.014 25.1 12.7 54 271-324 297-354 (635)
275 KOG0308 Conserved WD40 repeat- 27.1 6.3E+02 0.014 25.1 12.9 66 118-191 128-204 (735)
276 KOG0306 WD40-repeat-containing 27.1 4.9E+02 0.011 26.3 8.9 99 166-276 382-482 (888)
277 KOG2314 Translation initiation 26.5 4.8E+02 0.01 25.4 8.5 70 68-145 457-526 (698)
278 KOG0301 Phospholipase A2-activ 26.2 6.7E+02 0.014 25.1 13.6 66 256-326 228-293 (745)
279 COG3490 Uncharacterized protei 26.1 4.6E+02 0.01 23.2 15.9 56 258-318 290-345 (366)
280 KOG0646 WD40 repeat protein [G 25.4 5.8E+02 0.012 24.1 11.0 59 160-222 84-145 (476)
281 KOG1898 Splicing factor 3b, su 25.1 8.5E+02 0.018 25.9 13.3 160 134-302 851-1028(1205)
282 KOG0270 WD40 repeat-containing 24.7 5.8E+02 0.013 23.9 14.5 28 297-324 424-452 (463)
283 PF14298 DUF4374: Domain of un 24.2 6.1E+02 0.013 23.9 12.8 61 270-330 367-433 (435)
284 PF15525 DUF4652: Domain of un 23.8 4.1E+02 0.0089 21.8 10.5 69 84-156 89-160 (200)
285 KOG3669 Uncharacterized conser 23.7 7E+02 0.015 24.4 12.2 54 141-200 213-273 (705)
286 KOG0279 G protein beta subunit 23.4 5.1E+02 0.011 22.7 18.4 186 119-327 74-268 (315)
287 KOG0319 WD40-repeat-containing 22.9 7.9E+02 0.017 24.8 16.1 169 138-324 86-271 (775)
288 smart00120 HX Hemopexin-like r 22.6 1.7E+02 0.0036 16.8 4.7 19 303-322 9-27 (45)
289 COG3823 Glutamine cyclotransfe 22.5 4.7E+02 0.01 22.0 16.2 186 113-313 49-250 (262)
290 KOG1963 WD40 repeat protein [G 22.4 8.5E+02 0.018 24.9 10.4 59 254-313 258-323 (792)
291 KOG0294 WD40 repeat-containing 21.9 5.8E+02 0.013 22.8 19.1 27 115-147 48-74 (362)
292 KOG0303 Actin-binding protein 21.4 6.6E+02 0.014 23.3 11.6 135 180-324 154-297 (472)
293 KOG0265 U5 snRNP-specific prot 21.1 5.9E+02 0.013 22.6 14.8 148 68-239 58-208 (338)
No 1
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=100.00 E-value=1.4e-42 Score=327.48 Aligned_cols=258 Identities=24% Similarity=0.386 Sum_probs=230.8
Q ss_pred CCCCEEEEEEecCC--CCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEEC
Q 045821 66 LDETWIYAFCRDNK--LERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDA 143 (352)
Q Consensus 66 ~~~~~l~~~gg~~~--~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~ 143 (352)
...+.|+++||... .....+..||| .++.|..+++|+ .+|..+++++++|.||++||.+.....++++++||+
T Consensus 282 ~~~~~l~~vGG~~~~~~~~~~ve~yd~--~~~~w~~~a~m~---~~r~~~~~~~~~~~lYv~GG~~~~~~~l~~ve~YD~ 356 (571)
T KOG4441|consen 282 SVSGKLVAVGGYNRQGQSLRSVECYDP--KTNEWSSLAPMP---SPRCRVGVAVLNGKLYVVGGYDSGSDRLSSVERYDP 356 (571)
T ss_pred CCCCeEEEECCCCCCCcccceeEEecC--CcCcEeecCCCC---cccccccEEEECCEEEEEccccCCCcccceEEEecC
Confidence 34688999999864 33456889999 999999999999 778899999999999999999843377899999999
Q ss_pred CCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccC
Q 045821 144 SMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSA 220 (352)
Q Consensus 144 ~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~ 220 (352)
.+++|..+++|+.+|..+++++++|.||++||.+.. +++|+|||.+++|..+++|+..+..+++++++|+||++||.
T Consensus 357 ~~~~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg~~~l~svE~YDp~~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~ 436 (571)
T KOG4441|consen 357 RTNQWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDGEKSLNSVECYDPVTNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGG 436 (571)
T ss_pred CCCceeccCCccCccccceeEEECCEEEEEeccccccccccEEEecCCCCcccccCCCCcceeeeEEEEECCEEEEEcCc
Confidence 999999999999999999999999999999999865 99999999999999999999988899999999999999998
Q ss_pred CCCC-CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC----CeEEEEeCCCCceEEccCCCCCcCCC
Q 045821 221 SAAT-SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG----TKLMMWQKESREWSPVGRLSTLLTRP 295 (352)
Q Consensus 221 ~~~~-~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~----~~v~~yd~~~~~W~~~~~~~~~~~~~ 295 (352)
.... ....+++|||.+++|+.++++..++....+++.+++||++||+++ ..+.+|||.+++|+.++.++. .+.
T Consensus 437 ~~~~~~l~sve~YDP~t~~W~~~~~M~~~R~~~g~a~~~~~iYvvGG~~~~~~~~~VE~ydp~~~~W~~v~~m~~--~rs 514 (571)
T KOG4441|consen 437 DGSSNCLNSVECYDPETNTWTLIAPMNTRRSGFGVAVLNGKIYVVGGFDGTSALSSVERYDPETNQWTMVAPMTS--PRS 514 (571)
T ss_pred CCCccccceEEEEcCCCCceeecCCcccccccceEEEECCEEEEECCccCCCccceEEEEcCCCCceeEcccCcc--ccc
Confidence 8766 667999999999999999987777777778899999999999876 468999999999999988876 588
Q ss_pred CceEEEeCCEEEEEcCCc------eEEEEEcCCcccccceE
Q 045821 296 PCKLVAIGKTIFVIGKGC------SAVVIDVGNIGNIGGIM 330 (352)
Q Consensus 296 ~~~~~~~~~~i~v~gG~~------~~~~~d~~~~~w~~~~~ 330 (352)
..+++..+++||++||.+ .+..||+++++|.....
T Consensus 515 ~~g~~~~~~~ly~vGG~~~~~~l~~ve~ydp~~d~W~~~~~ 555 (571)
T KOG4441|consen 515 AVGVVVLGGKLYAVGGFDGNNNLNTVECYDPETDTWTEVTE 555 (571)
T ss_pred cccEEEECCEEEEEecccCccccceeEEcCCCCCceeeCCC
Confidence 889999999999999966 35579999999987544
No 2
>PHA02713 hypothetical protein; Provisional
Probab=100.00 E-value=1.9e-41 Score=321.20 Aligned_cols=255 Identities=17% Similarity=0.265 Sum_probs=215.0
Q ss_pred CEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCee
Q 045821 69 TWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTW 148 (352)
Q Consensus 69 ~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W 148 (352)
..+++.||.........+.||+ .+++|..++++| .++.+++++++++.||++||.+......+++++||+.+++|
T Consensus 258 ~~l~~~~g~~~~~~~~v~~yd~--~~~~W~~l~~mp---~~r~~~~~a~l~~~IYviGG~~~~~~~~~~v~~Yd~~~n~W 332 (557)
T PHA02713 258 LCLVCHDTKYNVCNPCILVYNI--NTMEYSVISTIP---NHIINYASAIVDNEIIIAGGYNFNNPSLNKVYKINIENKIH 332 (557)
T ss_pred eEEEEecCccccCCCCEEEEeC--CCCeEEECCCCC---ccccceEEEEECCEEEEEcCCCCCCCccceEEEEECCCCeE
Confidence 3455544421111235678999 999999999998 56788899999999999999754334578999999999999
Q ss_pred EecCCCCCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCC-
Q 045821 149 TDAAPMSTARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAAT- 224 (352)
Q Consensus 149 ~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~- 224 (352)
..+++|+.+|..++++.++++||++||.+.. +++++||+.+++|+.+++|+.++..+++++++|+||++||.....
T Consensus 333 ~~~~~m~~~R~~~~~~~~~g~IYviGG~~~~~~~~sve~Ydp~~~~W~~~~~mp~~r~~~~~~~~~g~IYviGG~~~~~~ 412 (557)
T PHA02713 333 VELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHID 412 (557)
T ss_pred eeCCCCcchhhceeEEEECCEEEEECCcCCCCCCceEEEEECCCCeEEECCCCCcccccccEEEECCEEEEEeCCCcccc
Confidence 9999999999999999999999999998643 789999999999999999999999999999999999999976421
Q ss_pred -----------------CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-----CeEEEEeCCC-Cc
Q 045821 225 -----------------SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-----TKLMMWQKES-RE 281 (352)
Q Consensus 225 -----------------~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-----~~v~~yd~~~-~~ 281 (352)
....+++|||.+++|+.+++.+..+....+++++|+||++||..+ +.+.+|||++ ++
T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~ve~YDP~td~W~~v~~m~~~r~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~ 492 (557)
T PHA02713 413 YTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYNG 492 (557)
T ss_pred cccccccccccccccccccceEEEECCCCCeEeecCCCCcccccCcEEEECCEEEEEeCCCCCCccceeEEEecCCCCCC
Confidence 135799999999999999876666656667889999999999753 3578999999 89
Q ss_pred eEEccCCCCCcCCCCceEEEeCCEEEEEcCCce---EEEEEcCCcccccceE
Q 045821 282 WSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCS---AVVIDVGNIGNIGGIM 330 (352)
Q Consensus 282 W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~---~~~~d~~~~~w~~~~~ 330 (352)
|+.++.+|. .|..+++++++++||++||.+. +..||+++++|...++
T Consensus 493 W~~~~~m~~--~r~~~~~~~~~~~iyv~Gg~~~~~~~e~yd~~~~~W~~~~~ 542 (557)
T PHA02713 493 WELITTTES--RLSALHTILHDNTIMMLHCYESYMLQDTFNVYTYEWNHICH 542 (557)
T ss_pred eeEccccCc--ccccceeEEECCEEEEEeeecceeehhhcCcccccccchhh
Confidence 999999987 5888999999999999999775 5579999999986444
No 3
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=100.00 E-value=5.5e-39 Score=303.11 Aligned_cols=254 Identities=24% Similarity=0.400 Sum_probs=216.2
Q ss_pred ccchhhhhhhhhhcccccChhhHHHHhhCCCC--CCEEEEEEecCC-CC-ceeEEEecCCCCCCCeeEeccCCCcccCCc
Q 045821 36 RYHAVLKCVSSRWRGLLHSEEWCAYRRKHNLD--ETWIYAFCRDNK-LE-RVCCYVLDPNSTRRSWKMIHELPARTLRRK 111 (352)
Q Consensus 36 ~~~~~l~~v~k~W~~l~~s~~~~~~~~~~~~~--~~~l~~~gg~~~-~~-~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~ 111 (352)
.....+....+.|..+...| ..|.+++.. ++.||++||.+. .. ....+.||| .+++|..+++|. .+|.
T Consensus 301 ~~ve~yd~~~~~w~~~a~m~---~~r~~~~~~~~~~~lYv~GG~~~~~~~l~~ve~YD~--~~~~W~~~a~M~---~~R~ 372 (571)
T KOG4441|consen 301 RSVECYDPKTNEWSSLAPMP---SPRCRVGVAVLNGKLYVVGGYDSGSDRLSSVERYDP--RTNQWTPVAPMN---TKRS 372 (571)
T ss_pred ceeEEecCCcCcEeecCCCC---cccccccEEEECCEEEEEccccCCCcccceEEEecC--CCCceeccCCcc---Cccc
Confidence 33444455556798886555 445555544 999999999873 33 456899999 999999999999 7899
Q ss_pred ceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC----CceEEEeC
Q 045821 112 GMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDP 187 (352)
Q Consensus 112 ~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~ 187 (352)
.+++++++|.||++||.++. ...+++++|||.+++|..+++|+.+|+.|++++++++||++||.+.. +++++|||
T Consensus 373 ~~~v~~l~g~iYavGG~dg~-~~l~svE~YDp~~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP 451 (571)
T KOG4441|consen 373 DFGVAVLDGKLYAVGGFDGE-KSLNSVECYDPVTNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDP 451 (571)
T ss_pred cceeEEECCEEEEEeccccc-cccccEEEecCCCCcccccCCCCcceeeeEEEEECCEEEEEcCcCCCccccceEEEEcC
Confidence 99999999999999999975 77899999999999999999999999999999999999999997654 89999999
Q ss_pred CCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecC
Q 045821 188 RTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQS 267 (352)
Q Consensus 188 ~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~ 267 (352)
.|++|+.+++|+.++..+++++++++||++||.++......+++||+.+++|+.+.++...+.....++.+++||++||+
T Consensus 452 ~t~~W~~~~~M~~~R~~~g~a~~~~~iYvvGG~~~~~~~~~VE~ydp~~~~W~~v~~m~~~rs~~g~~~~~~~ly~vGG~ 531 (571)
T KOG4441|consen 452 ETNTWTLIAPMNTRRSGFGVAVLNGKIYVVGGFDGTSALSSVERYDPETNQWTMVAPMTSPRSAVGVVVLGGKLYAVGGF 531 (571)
T ss_pred CCCceeecCCcccccccceEEEECCEEEEECCccCCCccceEEEEcCCCCceeEcccCccccccccEEEECCEEEEEecc
Confidence 99999999999999999999999999999999998555568999999999999997666666666678899999999998
Q ss_pred CC----CeEEEEeCCCCceEEccCCCCCcCCCCceEE
Q 045821 268 SG----TKLMMWQKESREWSPVGRLSTLLTRPPCKLV 300 (352)
Q Consensus 268 ~~----~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~ 300 (352)
.+ +.+..|||++++|+.... +.. .+...+++
T Consensus 532 ~~~~~l~~ve~ydp~~d~W~~~~~-~~~-~~~~~~~~ 566 (571)
T KOG4441|consen 532 DGNNNLNTVECYDPETDTWTEVTE-PES-GRGGAGVA 566 (571)
T ss_pred cCccccceeEEcCCCCCceeeCCC-ccc-cccCcceE
Confidence 77 689999999999999998 442 34444443
No 4
>PLN02153 epithiospecifier protein
Probab=100.00 E-value=6.5e-37 Score=275.92 Aligned_cols=264 Identities=17% Similarity=0.304 Sum_probs=198.4
Q ss_pred hhcccccCh-hhHHHHhhCCCC--CCEEEEEEecCCC-C--ceeEEEecCCCCCCCeeEeccCCCccc-CCcceEEEEEC
Q 045821 47 RWRGLLHSE-EWCAYRRKHNLD--ETWIYAFCRDNKL-E--RVCCYVLDPNSTRRSWKMIHELPARTL-RRKGMGFEVLG 119 (352)
Q Consensus 47 ~W~~l~~s~-~~~~~~~~~~~~--~~~l~~~gg~~~~-~--~~~~~~~d~~~~~~~w~~~~~~~~~~~-~~~~~~~~~~~ 119 (352)
.|..+.... ..+.+|..|++. ++.||++||.... . ...++.||+ .+++|..++++...+. .+.++++++++
T Consensus 8 ~W~~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~~~~yd~--~~~~W~~~~~~~~~p~~~~~~~~~~~~~ 85 (341)
T PLN02153 8 GWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDF--NTHTWSIAPANGDVPRISCLGVRMVAVG 85 (341)
T ss_pred eEEEecCCCCCCCCCCCcceEEEECCEEEEECCccCCCCceeCcEEEEEC--CCCEEEEcCccCCCCCCccCceEEEEEC
Confidence 488776532 234456666544 8999999997432 2 236899999 9999999987653212 24478889999
Q ss_pred CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCC-----CCcceeeeeEEECCEEEEEeCCCCC---------CceEEE
Q 045821 120 KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPM-----STARCYFPCGVLNQKIYCIGGLGDT---------HSWDVY 185 (352)
Q Consensus 120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~-----~~~r~~~~~~~~~~~iyv~gG~~~~---------~~v~~y 185 (352)
++||++||.... ...+++++||+.+++|+.++++ |.+|..|+++.++++|||+||.... +++++|
T Consensus 86 ~~iyv~GG~~~~-~~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~~y 164 (341)
T PLN02153 86 TKLYIFGGRDEK-REFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAY 164 (341)
T ss_pred CEEEEECCCCCC-CccCcEEEEECCCCEEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCcccceEEEE
Confidence 999999998654 4578999999999999999877 7889999999999999999997531 578999
Q ss_pred eCCCCcEEEecCCC---CcccceeEEEECCEEEEEccCCCC--------CCceEEEEEeCCCCceeecCC---CCCCCCC
Q 045821 186 DPRTNNWKLHTEPN---IFTEIEDSFVMDGKIYIRCSASAA--------TSHVCALVYEPSTDSWLHADA---NMASGWR 251 (352)
Q Consensus 186 d~~t~~W~~~~~~~---~~~~~~~~~~~~g~iyv~GG~~~~--------~~~~~i~~yd~~~~~W~~~~~---~~~~~~~ 251 (352)
|+.+++|+.++++. .++..+.+++++++||++||.... ...+.+++||+.+++|++++. .+..+..
T Consensus 165 d~~~~~W~~l~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~~W~~~~~~g~~P~~r~~ 244 (341)
T PLN02153 165 NIADGKWVQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSV 244 (341)
T ss_pred ECCCCeEeeCCCCCCCCCCCCcceEEEECCeEEEEeccccccccCCccceecCceEEEEcCCCcEEeccccCCCCCCcce
Confidence 99999999998653 566667788899999999886421 113589999999999999864 2333334
Q ss_pred CcEEEECCEEEEEecCC----------C---CeEEEEeCCCCceEEccCC---CCCcCCCC-ceEEEe-CCEEEEEcCCc
Q 045821 252 GPAVVVDDALYVLDQSS----------G---TKLMMWQKESREWSPVGRL---STLLTRPP-CKLVAI-GKTIFVIGKGC 313 (352)
Q Consensus 252 ~~~~~~~~~l~~~gg~~----------~---~~v~~yd~~~~~W~~~~~~---~~~~~~~~-~~~~~~-~~~i~v~gG~~ 313 (352)
+.+++++++||++||.. + +++++||+++++|+.+... +.+..+.. +++++. +++||++||..
T Consensus 245 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~~~W~~~~~~~~~~~pr~~~~~~~~~v~~~~~~~~~gG~~ 324 (341)
T PLN02153 245 FAHAVVGKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTETLVWEKLGECGEPAMPRGWTAYTTATVYGKNGLLMHGGKL 324 (341)
T ss_pred eeeEEECCEEEEECcccCCccccccccccccccEEEEEcCccEEEeccCCCCCCCCCccccccccccCCcceEEEEcCcC
Confidence 45678899999999962 1 4899999999999998632 22212322 233333 45899999974
No 5
>PHA02713 hypothetical protein; Provisional
Probab=100.00 E-value=1.1e-37 Score=295.41 Aligned_cols=236 Identities=16% Similarity=0.212 Sum_probs=198.1
Q ss_pred hhhhhcccccChhhHHHHhhCC--CCCCEEEEEEecCC--CCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEEC
Q 045821 44 VSSRWRGLLHSEEWCAYRRKHN--LDETWIYAFCRDNK--LERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLG 119 (352)
Q Consensus 44 v~k~W~~l~~s~~~~~~~~~~~--~~~~~l~~~gg~~~--~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~ 119 (352)
..+.|..+.+.|. .+..++ ..++.||++||.+. ......+.||| .++.|..+++|+ .+|..+++++++
T Consensus 280 ~~~~W~~l~~mp~---~r~~~~~a~l~~~IYviGG~~~~~~~~~~v~~Yd~--~~n~W~~~~~m~---~~R~~~~~~~~~ 351 (557)
T PHA02713 280 NTMEYSVISTIPN---HIINYASAIVDNEIIIAGGYNFNNPSLNKVYKINI--ENKIHVELPPMI---KNRCRFSLAVID 351 (557)
T ss_pred CCCeEEECCCCCc---cccceEEEEECCEEEEEcCCCCCCCccceEEEEEC--CCCeEeeCCCCc---chhhceeEEEEC
Confidence 3457888755443 333333 33999999999642 22346889999 999999999998 689999999999
Q ss_pred CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCC---------------------
Q 045821 120 KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGD--------------------- 178 (352)
Q Consensus 120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~--------------------- 178 (352)
++||++||.++. ...+++++|||.+++|..+++||.+|..+++++++++||++||...
T Consensus 352 g~IYviGG~~~~-~~~~sve~Ydp~~~~W~~~~~mp~~r~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~ 430 (557)
T PHA02713 352 DTIYAIGGQNGT-NVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHS 430 (557)
T ss_pred CEEEEECCcCCC-CCCceEEEEECCCCeEEECCCCCcccccccEEEECCEEEEEeCCCcccccccccccccccccccccc
Confidence 999999998654 4578899999999999999999999999999999999999999752
Q ss_pred CCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCc-eEEEEEeCCC-CceeecCCCCCCCCCCcEEE
Q 045821 179 THSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSH-VCALVYEPST-DSWLHADANMASGWRGPAVV 256 (352)
Q Consensus 179 ~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~-~~i~~yd~~~-~~W~~~~~~~~~~~~~~~~~ 256 (352)
.+++++|||.+++|+.+++|+.++..+++++++|+||++||..+.... ..+++||+.+ ++|+.+++.+.++....+++
T Consensus 431 ~~~ve~YDP~td~W~~v~~m~~~r~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~~r~~~~~~~ 510 (557)
T PHA02713 431 SNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTIL 510 (557)
T ss_pred cceEEEECCCCCeEeecCCCCcccccCcEEEECCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCcccccceeEE
Confidence 157999999999999999999999989999999999999987643333 3689999999 89999988666666667788
Q ss_pred ECCEEEEEecCCC-CeEEEEeCCCCceEEccCC
Q 045821 257 VDDALYVLDQSSG-TKLMMWQKESREWSPVGRL 288 (352)
Q Consensus 257 ~~~~l~~~gg~~~-~~v~~yd~~~~~W~~~~~~ 288 (352)
++|+||++||+.+ ..+.+||+.+++|+.++..
T Consensus 511 ~~~~iyv~Gg~~~~~~~e~yd~~~~~W~~~~~~ 543 (557)
T PHA02713 511 HDNTIMMLHCYESYMLQDTFNVYTYEWNHICHQ 543 (557)
T ss_pred ECCEEEEEeeecceeehhhcCcccccccchhhh
Confidence 9999999999876 5689999999999988744
No 6
>PLN02193 nitrile-specifier protein
Probab=100.00 E-value=2.6e-35 Score=274.88 Aligned_cols=261 Identities=17% Similarity=0.264 Sum_probs=207.1
Q ss_pred CCEEEEEEecCCCC--ceeEEEecCCCCC----CCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCC-CCCeEEE
Q 045821 68 ETWIYAFCRDNKLE--RVCCYVLDPNSTR----RSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSED-ATSEVYC 140 (352)
Q Consensus 68 ~~~l~~~gg~~~~~--~~~~~~~d~~~~~----~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~-~~~~~~~ 140 (352)
++.|+.++|..... ..-.+.+++ .+ ++|.++.++...|.+|..|+++.+++.||++||...... ..+++++
T Consensus 120 ~~~ivgf~G~~~~~~~~ig~y~~~~--~~~~~~~~W~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~v~~ 197 (470)
T PLN02193 120 GGKIVGFHGRSTDVLHSLGAYISLP--STPKLLGKWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYV 197 (470)
T ss_pred CCeEEEEeccCCCcEEeeEEEEecC--CChhhhceEEEcccCCCCCCCccccEEEEECCEEEEECCcCCCCCCeeCcEEE
Confidence 77888888864333 233566676 44 899999875433478999999999999999999754322 3468999
Q ss_pred EECCCCeeEecCCC---CC-cceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCC---CCcccceeEEEE
Q 045821 141 YDASMNTWTDAAPM---ST-ARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEP---NIFTEIEDSFVM 210 (352)
Q Consensus 141 ~d~~t~~W~~~~~~---~~-~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~---~~~~~~~~~~~~ 210 (352)
||+.+++|..++.+ |. +|..++++.++++||++||.... +++++||+.+++|++++++ |.++..++++.+
T Consensus 198 yD~~~~~W~~~~~~g~~P~~~~~~~~~v~~~~~lYvfGG~~~~~~~ndv~~yD~~t~~W~~l~~~~~~P~~R~~h~~~~~ 277 (470)
T PLN02193 198 FDLETRTWSISPATGDVPHLSCLGVRMVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAAD 277 (470)
T ss_pred EECCCCEEEeCCCCCCCCCCcccceEEEEECCEEEEECCCCCCCCCccEEEEECCCCEEEEcCcCCCCCCCccceEEEEE
Confidence 99999999988653 33 25678888999999999998654 8999999999999999876 677888888889
Q ss_pred CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC---CCCCCCCcEEEECCEEEEEecCCC---CeEEEEeCCCCceEE
Q 045821 211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN---MASGWRGPAVVVDDALYVLDQSSG---TKLMMWQKESREWSP 284 (352)
Q Consensus 211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~---~~~~~~~~~~~~~~~l~~~gg~~~---~~v~~yd~~~~~W~~ 284 (352)
+++||++||.........+++||+.+++|+.++.+ +..+..+.+++++++||++||..+ +++++||+++++|++
T Consensus 278 ~~~iYv~GG~~~~~~~~~~~~yd~~t~~W~~~~~~~~~~~~R~~~~~~~~~gkiyviGG~~g~~~~dv~~yD~~t~~W~~ 357 (470)
T PLN02193 278 EENVYVFGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQ 357 (470)
T ss_pred CCEEEEECCCCCCCCcceEEEEECCCCEEEeCCCCCCCCCCCCCcEEEEECCcEEEEECCCCCccCceEEEECCCCEEEE
Confidence 99999999987766667899999999999998652 223333445678999999999764 689999999999999
Q ss_pred ccCCC-CCcCCCCceEEEeCCEEEEEcCCc---------------eEEEEEcCCcccccceE
Q 045821 285 VGRLS-TLLTRPPCKLVAIGKTIFVIGKGC---------------SAVVIDVGNIGNIGGIM 330 (352)
Q Consensus 285 ~~~~~-~~~~~~~~~~~~~~~~i~v~gG~~---------------~~~~~d~~~~~w~~~~~ 330 (352)
+..+. .+..|..++++.++++|||+||.. ++++||+++++|.+...
T Consensus 358 ~~~~g~~P~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~ndv~~~D~~t~~W~~~~~ 419 (470)
T PLN02193 358 VETFGVRPSERSVFASAAVGKHIVIFGGEIAMDPLAHVGPGQLTDGTFALDTETLQWERLDK 419 (470)
T ss_pred eccCCCCCCCcceeEEEEECCEEEEECCccCCccccccCccceeccEEEEEcCcCEEEEccc
Confidence 97652 223577888889999999999953 36789999999986443
No 7
>PLN02193 nitrile-specifier protein
Probab=100.00 E-value=6.2e-36 Score=279.11 Aligned_cols=264 Identities=17% Similarity=0.312 Sum_probs=204.1
Q ss_pred hhhcccccChhhHHHHhhCCCC--CCEEEEEEecCCC-C-c-eeEEEecCCCCCCCeeEeccCCCccc-CCcceEEEEEC
Q 045821 46 SRWRGLLHSEEWCAYRRKHNLD--ETWIYAFCRDNKL-E-R-VCCYVLDPNSTRRSWKMIHELPARTL-RRKGMGFEVLG 119 (352)
Q Consensus 46 k~W~~l~~s~~~~~~~~~~~~~--~~~l~~~gg~~~~-~-~-~~~~~~d~~~~~~~w~~~~~~~~~~~-~~~~~~~~~~~ 119 (352)
..|..+....+.+.+|..|+.. ++.||++||.... . . ..++.||+ .+++|..++.+...|. .+..+++++++
T Consensus 151 ~~W~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~v~~yD~--~~~~W~~~~~~g~~P~~~~~~~~~v~~~ 228 (470)
T PLN02193 151 GKWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYVFDL--ETRTWSISPATGDVPHLSCLGVRMVSIG 228 (470)
T ss_pred ceEEEcccCCCCCCCccccEEEEECCEEEEECCcCCCCCCeeCcEEEEEC--CCCEEEeCCCCCCCCCCcccceEEEEEC
Confidence 5798876554555567767654 8999999996422 2 2 35899999 9999998876532222 35678899999
Q ss_pred CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCC---CCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEE
Q 045821 120 KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPM---STARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWK 193 (352)
Q Consensus 120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~---~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~ 193 (352)
++||++||.+.. ...+++++||+.+++|+.++++ |.||..|+++.++++|||+||.... +++++||+.+++|+
T Consensus 229 ~~lYvfGG~~~~-~~~ndv~~yD~~t~~W~~l~~~~~~P~~R~~h~~~~~~~~iYv~GG~~~~~~~~~~~~yd~~t~~W~ 307 (470)
T PLN02193 229 STLYVFGGRDAS-RQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSATARLKTLDSYNIVDKKWF 307 (470)
T ss_pred CEEEEECCCCCC-CCCccEEEEECCCCEEEEcCcCCCCCCCccceEEEEECCEEEEECCCCCCCCcceEEEEECCCCEEE
Confidence 999999998754 5678999999999999999887 7899999999999999999998654 78999999999999
Q ss_pred EecC---CCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC---CCCCCCCcEEEECCEEEEEecC
Q 045821 194 LHTE---PNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN---MASGWRGPAVVVDDALYVLDQS 267 (352)
Q Consensus 194 ~~~~---~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~---~~~~~~~~~~~~~~~l~~~gg~ 267 (352)
.++. ++.++..+.+++++++||++||..+ ...+++++||+.+++|++++.. +..+..+.+++++++||++||.
T Consensus 308 ~~~~~~~~~~~R~~~~~~~~~gkiyviGG~~g-~~~~dv~~yD~~t~~W~~~~~~g~~P~~R~~~~~~~~~~~iyv~GG~ 386 (470)
T PLN02193 308 HCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNG-CEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASAAVGKHIVIFGGE 386 (470)
T ss_pred eCCCCCCCCCCCCCcEEEEECCcEEEEECCCC-CccCceEEEECCCCEEEEeccCCCCCCCcceeEEEEECCEEEEECCc
Confidence 9874 4456667778889999999998764 2356899999999999998652 3344445567889999999996
Q ss_pred CC-------------CeEEEEeCCCCceEEccCCCC----CcCCCCce--EEEe--CCEEEEEcCCc
Q 045821 268 SG-------------TKLMMWQKESREWSPVGRLST----LLTRPPCK--LVAI--GKTIFVIGKGC 313 (352)
Q Consensus 268 ~~-------------~~v~~yd~~~~~W~~~~~~~~----~~~~~~~~--~~~~--~~~i~v~gG~~ 313 (352)
.. +++++||+.+++|+.+..++. +..|..++ .... ++.|+++||..
T Consensus 387 ~~~~~~~~~~~~~~~ndv~~~D~~t~~W~~~~~~~~~~~~P~~R~~~~~~~~~~~~~~~~~~fGG~~ 453 (470)
T PLN02193 387 IAMDPLAHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSRGWTASTTGTIDGKKGLVMHGGKA 453 (470)
T ss_pred cCCccccccCccceeccEEEEEcCcCEEEEcccCCCCCCCCCCCccccceeeEEcCCceEEEEcCCC
Confidence 31 479999999999999975431 22444442 2222 34599999974
No 8
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=100.00 E-value=1.2e-35 Score=265.72 Aligned_cols=240 Identities=14% Similarity=0.188 Sum_probs=192.3
Q ss_pred CCCEEEEEEecCCCCc-----------eeEEEec-CCCC-CCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCC
Q 045821 67 DETWIYAFCRDNKLER-----------VCCYVLD-PNST-RRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSED 133 (352)
Q Consensus 67 ~~~~l~~~gg~~~~~~-----------~~~~~~d-~~~~-~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~ 133 (352)
.++.||++||.+..+. ..++.|+ + . +.+|..+++|| .+|..++++++++.||++||.+.. .
T Consensus 12 ~~~~l~v~GG~~~~~~~~~~~g~~~~~~~v~~~~~~--~~~~~W~~~~~lp---~~r~~~~~~~~~~~lyviGG~~~~-~ 85 (323)
T TIGR03548 12 IGDYILVAGGCNFPEDPLAEGGKKKNYKGIYIAKDE--NSNLKWVKDGQLP---YEAAYGASVSVENGIYYIGGSNSS-E 85 (323)
T ss_pred ECCEEEEeeccCCCCCchhhCCcEEeeeeeEEEecC--CCceeEEEcccCC---ccccceEEEEECCEEEEEcCCCCC-C
Confidence 4899999999864321 2466664 3 3 34799999998 678778888999999999998654 4
Q ss_pred CCCeEEEEECCCCee----EecCCCCCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCC-cccce
Q 045821 134 ATSEVYCYDASMNTW----TDAAPMSTARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNI-FTEIE 205 (352)
Q Consensus 134 ~~~~~~~~d~~t~~W----~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~-~~~~~ 205 (352)
..+++++||+.+++| ..++++|.+|..|++++++++||++||.... +++++||+.+++|+.+++++. ++..+
T Consensus 86 ~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~p~~~r~~~ 165 (323)
T TIGR03548 86 RFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRNGKPSNKSYLFNLETQEWFELPDFPGEPRVQP 165 (323)
T ss_pred CceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCCCccCceEEEEcCCCCCeeECCCCCCCCCCcc
Confidence 578999999999987 7889999999999999999999999997433 889999999999999987764 56666
Q ss_pred eEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCC----CCCCC-CcE-EEECCEEEEEecCCC----------
Q 045821 206 DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANM----ASGWR-GPA-VVVDDALYVLDQSSG---------- 269 (352)
Q Consensus 206 ~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~----~~~~~-~~~-~~~~~~l~~~gg~~~---------- 269 (352)
.+++++++||++||.... ...++++||+.+++|+.++++. +.... ..+ +..+++||++||.++
T Consensus 166 ~~~~~~~~iYv~GG~~~~-~~~~~~~yd~~~~~W~~~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~ 244 (323)
T TIGR03548 166 VCVKLQNELYVFGGGSNI-AYTDGYKYSPKKNQWQKVADPTTDSEPISLLGAASIKINESLLLCIGGFNKDVYNDAVIDL 244 (323)
T ss_pred eEEEECCEEEEEcCCCCc-cccceEEEecCCCeeEECCCCCCCCCceeccceeEEEECCCEEEEECCcCHHHHHHHHhhh
Confidence 677899999999987642 2346899999999999997632 22222 223 345799999999752
Q ss_pred --------------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCce
Q 045821 270 --------------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCS 314 (352)
Q Consensus 270 --------------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~ 314 (352)
+.+++||+.+++|+.++.+|. ..|..++++.++++||++||..+
T Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~yd~~~~~W~~~~~~p~-~~r~~~~~~~~~~~iyv~GG~~~ 314 (323)
T TIGR03548 245 ATMKDESLKGYKKEYFLKPPEWYNWNRKILIYNVRTGKWKSIGNSPF-FARCGAALLLTGNNIFSINGELK 314 (323)
T ss_pred hhccchhhhhhHHHHhCCCccccCcCceEEEEECCCCeeeEcccccc-cccCchheEEECCEEEEEecccc
Confidence 369999999999999997764 25778889999999999999643
No 9
>PHA03098 kelch-like protein; Provisional
Probab=100.00 E-value=6.6e-36 Score=285.42 Aligned_cols=253 Identities=17% Similarity=0.235 Sum_probs=207.2
Q ss_pred CEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCee
Q 045821 69 TWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTW 148 (352)
Q Consensus 69 ~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W 148 (352)
..+++.||.. ........|++ .++.|..++..+ .+..++++++++.||++||........+++++||+.+++|
T Consensus 251 ~~~~~~~g~~-~~~~~~~~~~~--~~~~~~~~~~~~----~~~~~~~~~~~~~lyv~GG~~~~~~~~~~v~~yd~~~~~W 323 (534)
T PHA03098 251 SIIYIHITMS-IFTYNYITNYS--PLSEINTIIDIH----YVYCFGSVVLNNVIYFIGGMNKNNLSVNSVVSYDTKTKSW 323 (534)
T ss_pred cceEeecccc-hhhceeeecch--hhhhcccccCcc----ccccceEEEECCEEEEECCCcCCCCeeccEEEEeCCCCee
Confidence 4455545543 22334556788 888898887654 2344688999999999999876555667999999999999
Q ss_pred EecCCCCCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCC-
Q 045821 149 TDAAPMSTARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAAT- 224 (352)
Q Consensus 149 ~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~- 224 (352)
..+++|+.+|..++++.++++||++||.... +++++||+.+++|+.++++|.++..+++++++|+||++||.....
T Consensus 324 ~~~~~~~~~R~~~~~~~~~~~lyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~ 403 (534)
T PHA03098 324 NKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDE 403 (534)
T ss_pred eECCCCCcccccceEEEECCEEEEEeCCCCCEecceEEEEcCCCCceeeCCCcCcCCccceEEEECCEEEEECCcCCCCc
Confidence 9999999999999999999999999998744 889999999999999999999888888899999999999965432
Q ss_pred CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-------CeEEEEeCCCCceEEccCCCCCcCCCCc
Q 045821 225 SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-------TKLMMWQKESREWSPVGRLSTLLTRPPC 297 (352)
Q Consensus 225 ~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-------~~v~~yd~~~~~W~~~~~~~~~~~~~~~ 297 (352)
....+++||+.+++|+.+++.+..+..+.++..+++||++||... +.+++||+++++|+.++.++. .+..+
T Consensus 404 ~~~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~v~~yd~~~~~W~~~~~~~~--~r~~~ 481 (534)
T PHA03098 404 LLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNF--PRINA 481 (534)
T ss_pred ccceEEEEeCCCCeeeecCCCCccccCceEEEECCEEEEECCccCCCCCcccceEEEecCCCCceeeCCCCCc--ccccc
Confidence 346899999999999999875555555566788999999999653 359999999999999998775 46777
Q ss_pred eEEEeCCEEEEEcCCc------eEEEEEcCCcccccceE
Q 045821 298 KLVAIGKTIFVIGKGC------SAVVIDVGNIGNIGGIM 330 (352)
Q Consensus 298 ~~~~~~~~i~v~gG~~------~~~~~d~~~~~w~~~~~ 330 (352)
+++.++++||++||.. .+++||+++++|.....
T Consensus 482 ~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~ 520 (534)
T PHA03098 482 SLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNTWTLFCK 520 (534)
T ss_pred eEEEECCEEEEEcCCcCCcccceeEEEeCCCCEEEecCC
Confidence 7888899999999954 56789999999965433
No 10
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=100.00 E-value=2e-35 Score=267.11 Aligned_cols=240 Identities=19% Similarity=0.248 Sum_probs=186.3
Q ss_pred CCCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCC-----CCCCeEEEE
Q 045821 67 DETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSE-----DATSEVYCY 141 (352)
Q Consensus 67 ~~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~-----~~~~~~~~~ 141 (352)
.++.||++||.. ...++.||++..+++|.++++||. .+|..+++++++++||++||..... ...+++++|
T Consensus 16 ~~~~vyv~GG~~---~~~~~~~d~~~~~~~W~~l~~~p~--~~R~~~~~~~~~~~iYv~GG~~~~~~~~~~~~~~~v~~Y 90 (346)
T TIGR03547 16 IGDKVYVGLGSA---GTSWYKLDLKKPSKGWQKIADFPG--GPRNQAVAAAIDGKLYVFGGIGKANSEGSPQVFDDVYRY 90 (346)
T ss_pred ECCEEEEEcccc---CCeeEEEECCCCCCCceECCCCCC--CCcccceEEEECCEEEEEeCCCCCCCCCcceecccEEEE
Confidence 389999999973 245778885336789999999873 3688899999999999999975421 246789999
Q ss_pred ECCCCeeEecC-CCCCcceeeeeE-EECCEEEEEeCCCCC-------------------------------------Cce
Q 045821 142 DASMNTWTDAA-PMSTARCYFPCG-VLNQKIYCIGGLGDT-------------------------------------HSW 182 (352)
Q Consensus 142 d~~t~~W~~~~-~~~~~r~~~~~~-~~~~~iyv~gG~~~~-------------------------------------~~v 182 (352)
|+.+++|+.++ .+|.+|..++++ .++++||++||.... +++
T Consensus 91 d~~~~~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 170 (346)
T TIGR03547 91 DPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYFSQPPEDYFWNKNV 170 (346)
T ss_pred ECCCCEEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHhCCChhHcCccceE
Confidence 99999999997 455666666666 689999999997521 579
Q ss_pred EEEeCCCCcEEEecCCCC-cccceeEEEECCEEEEEccCCCCCC-ceEEEEE--eCCCCceeecCCCCCCCC------CC
Q 045821 183 DVYDPRTNNWKLHTEPNI-FTEIEDSFVMDGKIYIRCSASAATS-HVCALVY--EPSTDSWLHADANMASGW------RG 252 (352)
Q Consensus 183 ~~yd~~t~~W~~~~~~~~-~~~~~~~~~~~g~iyv~GG~~~~~~-~~~i~~y--d~~~~~W~~~~~~~~~~~------~~ 252 (352)
++||+.+++|+.+++||. ++..+++++++++||++||...... ...++.| |+.+++|+.+++.+.++. .+
T Consensus 171 ~~YDp~t~~W~~~~~~p~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~y~~~~~~~~W~~~~~m~~~r~~~~~~~~~ 250 (346)
T TIGR03547 171 LSYDPSTNQWRNLGENPFLGTAGSAIVHKGNKLLLINGEIKPGLRTAEVKQYLFTGGKLEWNKLPPLPPPKSSSQEGLAG 250 (346)
T ss_pred EEEECCCCceeECccCCCCcCCCceEEEECCEEEEEeeeeCCCccchheEEEEecCCCceeeecCCCCCCCCCccccccE
Confidence 999999999999998876 4566677889999999999754332 2245555 457789999977543321 11
Q ss_pred -cEEEECCEEEEEecCCC---------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc
Q 045821 253 -PAVVVDDALYVLDQSSG---------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG 310 (352)
Q Consensus 253 -~~~~~~~~l~~~gg~~~---------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g 310 (352)
.+++++++||++||... ..+.+||+++++|+.++.+|. .+..+++++++++|||+|
T Consensus 251 ~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~lp~--~~~~~~~~~~~~~iyv~G 328 (346)
T TIGR03547 251 AFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSEVYALDNGKWSKVGKLPQ--GLAYGVSVSWNNGVLLIG 328 (346)
T ss_pred EeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEeeEEEecCCcccccCCCCC--CceeeEEEEcCCEEEEEe
Confidence 24678999999999641 257899999999999999987 466677788999999999
Q ss_pred CCc
Q 045821 311 KGC 313 (352)
Q Consensus 311 G~~ 313 (352)
|..
T Consensus 329 G~~ 331 (346)
T TIGR03547 329 GEN 331 (346)
T ss_pred ccC
Confidence 975
No 11
>PLN02153 epithiospecifier protein
Probab=100.00 E-value=6.9e-35 Score=262.75 Aligned_cols=235 Identities=17% Similarity=0.282 Sum_probs=187.3
Q ss_pred CCCCeeEeccCCC-cccCCcceEEEEECCEEEEEeCCCCC-CCCCCeEEEEECCCCeeEecCCCC-Ccc---eeeeeEEE
Q 045821 93 TRRSWKMIHELPA-RTLRRKGMGFEVLGKNAYLLGGCGWS-EDATSEVYCYDASMNTWTDAAPMS-TAR---CYFPCGVL 166 (352)
Q Consensus 93 ~~~~w~~~~~~~~-~~~~~~~~~~~~~~~~iyv~GG~~~~-~~~~~~~~~~d~~t~~W~~~~~~~-~~r---~~~~~~~~ 166 (352)
....|.++..... .|.+|..|++++++++||++||.... ....+++++||+.+++|..+++++ .|| ..|+++.+
T Consensus 5 ~~~~W~~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~~p~~~~~~~~~~~~ 84 (341)
T PLN02153 5 LQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWSIAPANGDVPRISCLGVRMVAV 84 (341)
T ss_pred cCCeEEEecCCCCCCCCCCCcceEEEECCEEEEECCccCCCCceeCcEEEEECCCCEEEEcCccCCCCCCccCceEEEEE
Confidence 6788999987321 23689999999999999999998543 234579999999999999988764 344 36788899
Q ss_pred CCEEEEEeCCCCC---CceEEEeCCCCcEEEecCC-----CCcccceeEEEECCEEEEEccCCCCC------CceEEEEE
Q 045821 167 NQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEP-----NIFTEIEDSFVMDGKIYIRCSASAAT------SHVCALVY 232 (352)
Q Consensus 167 ~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~-----~~~~~~~~~~~~~g~iyv~GG~~~~~------~~~~i~~y 232 (352)
+++||++||.... +++++||+.+++|+.++++ |.++..+++++++++|||+||..... ...++++|
T Consensus 85 ~~~iyv~GG~~~~~~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~~y 164 (341)
T PLN02153 85 GTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAY 164 (341)
T ss_pred CCEEEEECCCCCCCccCcEEEEECCCCEEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCcccceEEEE
Confidence 9999999997654 7899999999999998866 66777888889999999999986432 23479999
Q ss_pred eCCCCceeecCCCC---CCCCCCcEEEECCEEEEEecCC------------CCeEEEEeCCCCceEEccCCC-CCcCCCC
Q 045821 233 EPSTDSWLHADANM---ASGWRGPAVVVDDALYVLDQSS------------GTKLMMWQKESREWSPVGRLS-TLLTRPP 296 (352)
Q Consensus 233 d~~~~~W~~~~~~~---~~~~~~~~~~~~~~l~~~gg~~------------~~~v~~yd~~~~~W~~~~~~~-~~~~~~~ 296 (352)
|+.+++|+.++... ..+..+.+++++++||++||.. .+++++||+++++|+++.... .+..|..
T Consensus 165 d~~~~~W~~l~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~~W~~~~~~g~~P~~r~~ 244 (341)
T PLN02153 165 NIADGKWVQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSV 244 (341)
T ss_pred ECCCCeEeeCCCCCCCCCCCCcceEEEECCeEEEEeccccccccCCccceecCceEEEEcCCCcEEeccccCCCCCCcce
Confidence 99999999987632 3333444577899999998853 147999999999999997532 1225777
Q ss_pred ceEEEeCCEEEEEcCCc---------------eEEEEEcCCccccc
Q 045821 297 CKLVAIGKTIFVIGKGC---------------SAVVIDVGNIGNIG 327 (352)
Q Consensus 297 ~~~~~~~~~i~v~gG~~---------------~~~~~d~~~~~w~~ 327 (352)
++++.++++|||+||.. +++.||+++++|.+
T Consensus 245 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~~~W~~ 290 (341)
T PLN02153 245 FAHAVVGKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTETLVWEK 290 (341)
T ss_pred eeeEEECCEEEEECcccCCccccccccccccccEEEEEcCccEEEe
Confidence 88889999999999952 56789999999975
No 12
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=100.00 E-value=1.2e-34 Score=263.95 Aligned_cols=239 Identities=19% Similarity=0.262 Sum_probs=184.6
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCC-C----CCCCCeEEEEE
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGW-S----EDATSEVYCYD 142 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~-~----~~~~~~~~~~d 142 (352)
++.||++||... ..++.||.+..++.|.+++++|. .+|..++++.++++||++||... . ....+++++||
T Consensus 38 ~~~iyv~gG~~~---~~~~~~d~~~~~~~W~~l~~~p~--~~r~~~~~v~~~~~IYV~GG~~~~~~~~~~~~~~~v~~YD 112 (376)
T PRK14131 38 NNTVYVGLGSAG---TSWYKLDLNAPSKGWTKIAAFPG--GPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYD 112 (376)
T ss_pred CCEEEEEeCCCC---CeEEEEECCCCCCCeEECCcCCC--CCcccceEEEECCEEEEEcCCCCCCCCCceeEcccEEEEe
Confidence 899999998632 34778887224589999998873 36888899999999999999764 1 12367899999
Q ss_pred CCCCeeEecCC-CCCcceeeeeEE-ECCEEEEEeCCCC-------------------------------------CCceE
Q 045821 143 ASMNTWTDAAP-MSTARCYFPCGV-LNQKIYCIGGLGD-------------------------------------THSWD 183 (352)
Q Consensus 143 ~~t~~W~~~~~-~~~~r~~~~~~~-~~~~iyv~gG~~~-------------------------------------~~~v~ 183 (352)
+.+++|+.+++ +|.++..|+++. .+++||++||... .++++
T Consensus 113 ~~~n~W~~~~~~~p~~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~ 192 (376)
T PRK14131 113 PKTNSWQKLDTRSPVGLAGHVAVSLHNGKAYITGGVNKNIFDGYFEDLAAAGKDKTPKDKINDAYFDKKPEDYFFNKEVL 192 (376)
T ss_pred CCCCEEEeCCCCCCCcccceEEEEeeCCEEEEECCCCHHHHHHHHhhhhhcccchhhhhhhHHHHhcCChhhcCcCceEE
Confidence 99999999985 456667777766 7999999999752 15799
Q ss_pred EEeCCCCcEEEecCCCC-cccceeEEEECCEEEEEccCCCCC-Cce--EEEEEeCCCCceeecCCCCCCCC-------CC
Q 045821 184 VYDPRTNNWKLHTEPNI-FTEIEDSFVMDGKIYIRCSASAAT-SHV--CALVYEPSTDSWLHADANMASGW-------RG 252 (352)
Q Consensus 184 ~yd~~t~~W~~~~~~~~-~~~~~~~~~~~g~iyv~GG~~~~~-~~~--~i~~yd~~~~~W~~~~~~~~~~~-------~~ 252 (352)
+||+.+++|+.++++|. ++..++++.++++||++||....+ ... ..+.||+.+++|+.+++.+.++. .+
T Consensus 193 ~YD~~t~~W~~~~~~p~~~~~~~a~v~~~~~iYv~GG~~~~~~~~~~~~~~~~~~~~~~W~~~~~~p~~~~~~~~~~~~~ 272 (376)
T PRK14131 193 SYDPSTNQWKNAGESPFLGTAGSAVVIKGNKLWLINGEIKPGLRTDAVKQGKFTGNNLKWQKLPDLPPAPGGSSQEGVAG 272 (376)
T ss_pred EEECCCCeeeECCcCCCCCCCcceEEEECCEEEEEeeeECCCcCChhheEEEecCCCcceeecCCCCCCCcCCcCCccce
Confidence 99999999999988876 455666778899999999865332 222 34456788999999976543321 11
Q ss_pred c-EEEECCEEEEEecCCC---------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc
Q 045821 253 P-AVVVDDALYVLDQSSG---------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG 310 (352)
Q Consensus 253 ~-~~~~~~~l~~~gg~~~---------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g 310 (352)
. +++.+++||++||... ..+.+||+++++|+.++.+|. .+..++++.++++|||+|
T Consensus 273 ~~a~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~lp~--~r~~~~av~~~~~iyv~G 350 (376)
T PRK14131 273 AFAGYSNGVLLVAGGANFPGARENYQNGKLYAHEGLKKSWSDEIYALVNGKWQKVGELPQ--GLAYGVSVSWNNGVLLIG 350 (376)
T ss_pred EeceeECCEEEEeeccCCCCChhhhhcCCcccccCCcceeehheEEecCCcccccCcCCC--CccceEEEEeCCEEEEEc
Confidence 1 4568999999999641 135689999999999998887 466677888999999999
Q ss_pred CCc
Q 045821 311 KGC 313 (352)
Q Consensus 311 G~~ 313 (352)
|..
T Consensus 351 G~~ 353 (376)
T PRK14131 351 GET 353 (376)
T ss_pred CCC
Confidence 964
No 13
>PHA02790 Kelch-like protein; Provisional
Probab=100.00 E-value=2.1e-34 Score=269.51 Aligned_cols=203 Identities=17% Similarity=0.276 Sum_probs=178.1
Q ss_pred EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEE
Q 045821 115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKL 194 (352)
Q Consensus 115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~ 194 (352)
.+..++.||++||.+.. ...+++++|||.+++|..+++|+.+|..++++.++++||++||.+...++++||+.+++|+.
T Consensus 267 ~~~~~~~lyviGG~~~~-~~~~~v~~Ydp~~~~W~~~~~m~~~r~~~~~v~~~~~iYviGG~~~~~sve~ydp~~n~W~~ 345 (480)
T PHA02790 267 STHVGEVVYLIGGWMNN-EIHNNAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLYVVGGLPNPTSVERWFHGDAAWVN 345 (480)
T ss_pred eEEECCEEEEEcCCCCC-CcCCeEEEEECCCCEEEECCCCCchhhcceEEEECCEEEEECCcCCCCceEEEECCCCeEEE
Confidence 35589999999998653 45788999999999999999999999999999999999999997654789999999999999
Q ss_pred ecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEE
Q 045821 195 HTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMM 274 (352)
Q Consensus 195 ~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~ 274 (352)
+++|+.++..+++++++|+||++||.... ...+++||+.+++|+.+++++.++..+.+++++|+||++|| .+.+
T Consensus 346 ~~~l~~~r~~~~~~~~~g~IYviGG~~~~--~~~ve~ydp~~~~W~~~~~m~~~r~~~~~~~~~~~IYv~GG----~~e~ 419 (480)
T PHA02790 346 MPSLLKPRCNPAVASINNVIYVIGGHSET--DTTTEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVGR----NAEF 419 (480)
T ss_pred CCCCCCCCcccEEEEECCEEEEecCcCCC--CccEEEEeCCCCEEEeCCCCCCccccceEEEECCEEEEECC----ceEE
Confidence 99999999888899999999999997532 35789999999999999876655555566789999999998 5789
Q ss_pred EeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc------eEEEEEcCCcccc
Q 045821 275 WQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC------SAVVIDVGNIGNI 326 (352)
Q Consensus 275 yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~------~~~~~d~~~~~w~ 326 (352)
||+++++|+.++.++. .|..+++++++++||++||.+ .+..||+++++|.
T Consensus 420 ydp~~~~W~~~~~m~~--~r~~~~~~v~~~~IYviGG~~~~~~~~~ve~Yd~~~~~W~ 475 (480)
T PHA02790 420 YCESSNTWTLIDDPIY--PRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSWN 475 (480)
T ss_pred ecCCCCcEeEcCCCCC--CccccEEEEECCEEEEECCcCCCcccceEEEEECCCCeEE
Confidence 9999999999998876 588889999999999999953 4668999999995
No 14
>PHA03098 kelch-like protein; Provisional
Probab=100.00 E-value=4.8e-34 Score=272.60 Aligned_cols=217 Identities=20% Similarity=0.340 Sum_probs=188.2
Q ss_pred CCEEEEEEecCCCCc--eeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC
Q 045821 68 ETWIYAFCRDNKLER--VCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM 145 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~--~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t 145 (352)
++.||++||.+.... ...+.||+ .+++|..+++++ .+|.++++++++++||++||.+.. ...+++++||+.+
T Consensus 294 ~~~lyv~GG~~~~~~~~~~v~~yd~--~~~~W~~~~~~~---~~R~~~~~~~~~~~lyv~GG~~~~-~~~~~v~~yd~~~ 367 (534)
T PHA03098 294 NNVIYFIGGMNKNNLSVNSVVSYDT--KTKSWNKVPELI---YPRKNPGVTVFNNRIYVIGGIYNS-ISLNTVESWKPGE 367 (534)
T ss_pred CCEEEEECCCcCCCCeeccEEEEeC--CCCeeeECCCCC---cccccceEEEECCEEEEEeCCCCC-EecceEEEEcCCC
Confidence 899999999754332 36889999 999999999988 678999999999999999998743 5578899999999
Q ss_pred CeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCC
Q 045821 146 NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSAS 221 (352)
Q Consensus 146 ~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~ 221 (352)
++|+.++++|.||..++++.++++||++||.... +++++||+.+++|+.++++|.++..+++++++++||++||..
T Consensus 368 ~~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~~~~~iyv~GG~~ 447 (534)
T PHA03098 368 SKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGIS 447 (534)
T ss_pred CceeeCCCcCcCCccceEEEECCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCCccccCceEEEECCEEEEECCcc
Confidence 9999999999999999999999999999996432 789999999999999999988888888899999999999876
Q ss_pred CCCC---ceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC----CeEEEEeCCCCceEEccCCCC
Q 045821 222 AATS---HVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG----TKLMMWQKESREWSPVGRLST 290 (352)
Q Consensus 222 ~~~~---~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~----~~v~~yd~~~~~W~~~~~~~~ 290 (352)
.... ...+++||+.+++|+.++..+.++....++..+++||++||..+ +.+++||+++++|+.++.+|+
T Consensus 448 ~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~p~ 523 (534)
T PHA03098 448 YIDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNTWTLFCKFPK 523 (534)
T ss_pred CCCCCcccceEEEecCCCCceeeCCCCCcccccceEEEECCEEEEEcCCcCCcccceeEEEeCCCCEEEecCCCcc
Confidence 4332 34799999999999999875555555556778999999999763 589999999999999998776
No 15
>PHA02790 Kelch-like protein; Provisional
Probab=100.00 E-value=1e-33 Score=264.92 Aligned_cols=203 Identities=15% Similarity=0.251 Sum_probs=177.3
Q ss_pred CCCEEEEEEecCCCC-ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC
Q 045821 67 DETWIYAFCRDNKLE-RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM 145 (352)
Q Consensus 67 ~~~~l~~~gg~~~~~-~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t 145 (352)
.++.||++||.+... ....+.||| .+++|..+++|+ .+|..++++.++++||++||.+. .+++++||+.+
T Consensus 270 ~~~~lyviGG~~~~~~~~~v~~Ydp--~~~~W~~~~~m~---~~r~~~~~v~~~~~iYviGG~~~----~~sve~ydp~~ 340 (480)
T PHA02790 270 VGEVVYLIGGWMNNEIHNNAIAVNY--ISNNWIPIPPMN---SPRLYASGVPANNKLYVVGGLPN----PTSVERWFHGD 340 (480)
T ss_pred ECCEEEEEcCCCCCCcCCeEEEEEC--CCCEEEECCCCC---chhhcceEEEECCEEEEECCcCC----CCceEEEECCC
Confidence 488999999975433 345788999 999999999998 67888999999999999999753 25689999999
Q ss_pred CeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCC
Q 045821 146 NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAAT 224 (352)
Q Consensus 146 ~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~ 224 (352)
++|..+++||.+|..+++++++++||++||.... +.+++|||.+++|+.+++|+.++..+++++++|+||++||.
T Consensus 341 n~W~~~~~l~~~r~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~~W~~~~~m~~~r~~~~~~~~~~~IYv~GG~---- 416 (480)
T PHA02790 341 AAWVNMPSLLKPRCNPAVASINNVIYVIGGHSETDTTTEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVGRN---- 416 (480)
T ss_pred CeEEECCCCCCCCcccEEEEECCEEEEecCcCCCCccEEEEeCCCCEEEeCCCCCCccccceEEEECCEEEEECCc----
Confidence 9999999999999999999999999999998654 78999999999999999999988888889999999999863
Q ss_pred CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC----CeEEEEeCCCCceEEcc
Q 045821 225 SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG----TKLMMWQKESREWSPVG 286 (352)
Q Consensus 225 ~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~----~~v~~yd~~~~~W~~~~ 286 (352)
+++||+.+++|+.+++.+.++....+++++|+||++||..+ +.+.+||+++++|+...
T Consensus 417 ----~e~ydp~~~~W~~~~~m~~~r~~~~~~v~~~~IYviGG~~~~~~~~~ve~Yd~~~~~W~~~~ 478 (480)
T PHA02790 417 ----AEFYCESSNTWTLIDDPIYPRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSWNIWD 478 (480)
T ss_pred ----eEEecCCCCcEeEcCCCCCCccccEEEEECCEEEEECCcCCCcccceEEEEECCCCeEEecC
Confidence 57899999999999876555656667889999999999753 47999999999998654
No 16
>KOG4693 consensus Uncharacterized conserved protein, contains kelch repeat [General function prediction only]
Probab=100.00 E-value=1.4e-33 Score=228.91 Aligned_cols=251 Identities=20% Similarity=0.328 Sum_probs=202.7
Q ss_pred HhhCCCC--CCEEEEEEec------CCCCceeEEEecCCCCCCCeeEeccCC----------CcccCCcceEEEEECCEE
Q 045821 61 RRKHNLD--ETWIYAFCRD------NKLERVCCYVLDPNSTRRSWKMIHELP----------ARTLRRKGMGFEVLGKNA 122 (352)
Q Consensus 61 ~~~~~~~--~~~l~~~gg~------~~~~~~~~~~~d~~~~~~~w~~~~~~~----------~~~~~~~~~~~~~~~~~i 122 (352)
|..|+.. +++||-+||. .........++|. .+-+|.++++-- .-|-.|++|+++...+++
T Consensus 14 RVNHAavaVG~riYSFGGYCsGedy~~~~piDVH~lNa--~~~RWtk~pp~~~ka~i~~~yp~VPyqRYGHtvV~y~d~~ 91 (392)
T KOG4693|consen 14 RVNHAAVAVGSRIYSFGGYCSGEDYDAKDPIDVHVLNA--ENYRWTKMPPGITKATIESPYPAVPYQRYGHTVVEYQDKA 91 (392)
T ss_pred cccceeeeecceEEecCCcccccccccCCcceeEEeec--cceeEEecCcccccccccCCCCccchhhcCceEEEEcceE
Confidence 4444433 8999999985 1233567889999 999999998711 112458999999999999
Q ss_pred EEEeCCCCCCCCCCeEEEEECCCCeeEecC---CCCCcceeeeeEEECCEEEEEeCCCCC-----CceEEEeCCCCcEEE
Q 045821 123 YLLGGCGWSEDATSEVYCYDASMNTWTDAA---PMSTARCYFPCGVLNQKIYCIGGLGDT-----HSWDVYDPRTNNWKL 194 (352)
Q Consensus 123 yv~GG~~~~~~~~~~~~~~d~~t~~W~~~~---~~~~~r~~~~~~~~~~~iyv~gG~~~~-----~~v~~yd~~t~~W~~ 194 (352)
|+.||.+.....-+.++.|||.+++|.... .+|.+|-+|+++++++.+|++||+..+ +++..+|..|.+|+.
T Consensus 92 yvWGGRND~egaCN~Ly~fDp~t~~W~~p~v~G~vPgaRDGHsAcV~gn~MyiFGGye~~a~~FS~d~h~ld~~TmtWr~ 171 (392)
T KOG4693|consen 92 YVWGGRNDDEGACNLLYEFDPETNVWKKPEVEGFVPGARDGHSACVWGNQMYIFGGYEEDAQRFSQDTHVLDFATMTWRE 171 (392)
T ss_pred EEEcCccCcccccceeeeeccccccccccceeeecCCccCCceeeEECcEEEEecChHHHHHhhhccceeEeccceeeee
Confidence 999999887677788999999999998653 468899999999999999999998765 899999999999999
Q ss_pred ecC---CCCcccceeEEEECCEEEEEccCCCCCCc---------eEEEEEeCCCCceeecCC---CCCCCCCCcEEEECC
Q 045821 195 HTE---PNIFTEIEDSFVMDGKIYIRCSASAATSH---------VCALVYEPSTDSWLHADA---NMASGWRGPAVVVDD 259 (352)
Q Consensus 195 ~~~---~~~~~~~~~~~~~~g~iyv~GG~~~~~~~---------~~i~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~ 259 (352)
+.. +|.=+..++++++++.+|++||....... ..|..+|..++.|...+. .+..++++++.+.++
T Consensus 172 ~~Tkg~PprwRDFH~a~~~~~~MYiFGGR~D~~gpfHs~~e~Yc~~i~~ld~~T~aW~r~p~~~~~P~GRRSHS~fvYng 251 (392)
T KOG4693|consen 172 MHTKGDPPRWRDFHTASVIDGMMYIFGGRSDESGPFHSIHEQYCDTIMALDLATGAWTRTPENTMKPGGRRSHSTFVYNG 251 (392)
T ss_pred hhccCCCchhhhhhhhhhccceEEEeccccccCCCccchhhhhcceeEEEeccccccccCCCCCcCCCcccccceEEEcc
Confidence 863 55667778889999999999987643221 278899999999998754 344566777789999
Q ss_pred EEEEEecCCC------CeEEEEeCCCCceEEccCCC-CCcCCCCceEEEeCCEEEEEcCCc
Q 045821 260 ALYVLDQSSG------TKLMMWQKESREWSPVGRLS-TLLTRPPCKLVAIGKTIFVIGKGC 313 (352)
Q Consensus 260 ~l~~~gg~~~------~~v~~yd~~~~~W~~~~~~~-~~~~~~~~~~~~~~~~i~v~gG~~ 313 (352)
++|++||+++ ++++.|||.+..|+.+..-. .+..|+..+.++.++|+|+|||..
T Consensus 252 ~~Y~FGGYng~ln~HfndLy~FdP~t~~W~~I~~~Gk~P~aRRRqC~~v~g~kv~LFGGTs 312 (392)
T KOG4693|consen 252 KMYMFGGYNGTLNVHFNDLYCFDPKTSMWSVISVRGKYPSARRRQCSVVSGGKVYLFGGTS 312 (392)
T ss_pred eEEEecccchhhhhhhcceeecccccchheeeeccCCCCCcccceeEEEECCEEEEecCCC
Confidence 9999999987 79999999999999886321 112466667777899999999975
No 17
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=100.00 E-value=6.1e-32 Score=241.90 Aligned_cols=222 Identities=15% Similarity=0.205 Sum_probs=176.5
Q ss_pred CCcceEEEEECCEEEEEeCCCCCC---------CCCCeEEEEE-CCC-CeeEecCCCCCcceeeeeEEECCEEEEEeCCC
Q 045821 109 RRKGMGFEVLGKNAYLLGGCGWSE---------DATSEVYCYD-ASM-NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLG 177 (352)
Q Consensus 109 ~~~~~~~~~~~~~iyv~GG~~~~~---------~~~~~~~~~d-~~t-~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~ 177 (352)
.++++.++++++.||++||.+... ...+++++|+ +.. .+|..+++||.+|..++++.++++||++||..
T Consensus 3 ~~~g~~~~~~~~~l~v~GG~~~~~~~~~~~g~~~~~~~v~~~~~~~~~~~W~~~~~lp~~r~~~~~~~~~~~lyviGG~~ 82 (323)
T TIGR03548 3 GVAGCYAGIIGDYILVAGGCNFPEDPLAEGGKKKNYKGIYIAKDENSNLKWVKDGQLPYEAAYGASVSVENGIYYIGGSN 82 (323)
T ss_pred ceeeEeeeEECCEEEEeeccCCCCCchhhCCcEEeeeeeEEEecCCCceeEEEcccCCccccceEEEEECCEEEEEcCCC
Confidence 466788899999999999986542 1345788885 332 37999999999999888899999999999976
Q ss_pred CC---CceEEEeCCCCcE----EEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCC-CC
Q 045821 178 DT---HSWDVYDPRTNNW----KLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMA-SG 249 (352)
Q Consensus 178 ~~---~~v~~yd~~t~~W----~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~-~~ 249 (352)
.. +++++||+.+++| +.++++|.++..+++++++++||++||.......+++++||+.+++|+++++.+. .+
T Consensus 83 ~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~p~~~r 162 (323)
T TIGR03548 83 SSERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRNGKPSNKSYLFNLETQEWFELPDFPGEPR 162 (323)
T ss_pred CCCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCCCccCceEEEEcCCCCCeeECCCCCCCCC
Confidence 54 8899999999988 7788888888888889999999999997655556799999999999999976443 33
Q ss_pred CCCcEEEECCEEEEEecCCC---CeEEEEeCCCCceEEccCCCC---CcCC-CCceEEEeCCEEEEEcCCc---------
Q 045821 250 WRGPAVVVDDALYVLDQSSG---TKLMMWQKESREWSPVGRLST---LLTR-PPCKLVAIGKTIFVIGKGC--------- 313 (352)
Q Consensus 250 ~~~~~~~~~~~l~~~gg~~~---~~v~~yd~~~~~W~~~~~~~~---~~~~-~~~~~~~~~~~i~v~gG~~--------- 313 (352)
..+.+++.+++||++||..+ .++++||+++++|+.++.++. +..+ ..++++..+++||++||.+
T Consensus 163 ~~~~~~~~~~~iYv~GG~~~~~~~~~~~yd~~~~~W~~~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~ 242 (323)
T TIGR03548 163 VQPVCVKLQNELYVFGGGSNIAYTDGYKYSPKKNQWQKVADPTTDSEPISLLGAASIKINESLLLCIGGFNKDVYNDAVI 242 (323)
T ss_pred CcceEEEECCEEEEEcCCCCccccceEEEecCCCeeEECCCCCCCCCceeccceeEEEECCCEEEEECCcCHHHHHHHHh
Confidence 34445678999999999764 468999999999999987642 1122 2334455689999999964
Q ss_pred -----------------------------eEEEEEcCCcccccceE
Q 045821 314 -----------------------------SAVVIDVGNIGNIGGIM 330 (352)
Q Consensus 314 -----------------------------~~~~~d~~~~~w~~~~~ 330 (352)
++++||+++++|.....
T Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~yd~~~~~W~~~~~ 288 (323)
T TIGR03548 243 DLATMKDESLKGYKKEYFLKPPEWYNWNRKILIYNVRTGKWKSIGN 288 (323)
T ss_pred hhhhccchhhhhhHHHHhCCCccccCcCceEEEEECCCCeeeEccc
Confidence 46789999999977543
No 18
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=100.00 E-value=8.8e-32 Score=243.37 Aligned_cols=223 Identities=18% Similarity=0.264 Sum_probs=171.0
Q ss_pred ccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEEC--CCCeeEecCCCC-CcceeeeeEEECCEEEEEeCCC
Q 045821 101 HELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDA--SMNTWTDAAPMS-TARCYFPCGVLNQKIYCIGGLG 177 (352)
Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~--~t~~W~~~~~~~-~~r~~~~~~~~~~~iyv~gG~~ 177 (352)
++|| .++...++++++++|||+||.. .+++++||+ .+++|..+++|| .+|..++++.++++|||+||..
T Consensus 2 ~~lp---~~~~~~~~~~~~~~vyv~GG~~-----~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~~~~iYv~GG~~ 73 (346)
T TIGR03547 2 PDLP---VGFKNGTGAIIGDKVYVGLGSA-----GTSWYKLDLKKPSKGWQKIADFPGGPRNQAVAAAIDGKLYVFGGIG 73 (346)
T ss_pred CCCC---ccccCceEEEECCEEEEEcccc-----CCeeEEEECCCCCCCceECCCCCCCCcccceEEEECCEEEEEeCCC
Confidence 5566 5677777889999999999963 267899996 578899999998 5899999999999999999975
Q ss_pred C--------C-CceEEEeCCCCcEEEecC-CCCcccceeEE-EECCEEEEEccCCCCC----------------------
Q 045821 178 D--------T-HSWDVYDPRTNNWKLHTE-PNIFTEIEDSF-VMDGKIYIRCSASAAT---------------------- 224 (352)
Q Consensus 178 ~--------~-~~v~~yd~~t~~W~~~~~-~~~~~~~~~~~-~~~g~iyv~GG~~~~~---------------------- 224 (352)
. . +++++||+.+++|++++. ++..+..+.++ +++++||++||.....
T Consensus 74 ~~~~~~~~~~~~~v~~Yd~~~~~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (346)
T TIGR03547 74 KANSEGSPQVFDDVYRYDPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLI 153 (346)
T ss_pred CCCCCCcceecccEEEEECCCCEEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhH
Confidence 2 1 689999999999999974 34444445455 6899999999976321
Q ss_pred ------------CceEEEEEeCCCCceeecCCCCCC-CCCCcEEEECCEEEEEecCCC-----CeEEEEe--CCCCceEE
Q 045821 225 ------------SHVCALVYEPSTDSWLHADANMAS-GWRGPAVVVDDALYVLDQSSG-----TKLMMWQ--KESREWSP 284 (352)
Q Consensus 225 ------------~~~~i~~yd~~~~~W~~~~~~~~~-~~~~~~~~~~~~l~~~gg~~~-----~~v~~yd--~~~~~W~~ 284 (352)
....+++||+.+++|+.+++.+.. +....++..+++||++||... ..++.|| +++++|++
T Consensus 154 ~~~~~~~~~~~~~~~~v~~YDp~t~~W~~~~~~p~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~y~~~~~~~~W~~ 233 (346)
T TIGR03547 154 AAYFSQPPEDYFWNKNVLSYDPSTNQWRNLGENPFLGTAGSAIVHKGNKLLLINGEIKPGLRTAEVKQYLFTGGKLEWNK 233 (346)
T ss_pred HHHhCCChhHcCccceEEEEECCCCceeECccCCCCcCCCceEEEECCEEEEEeeeeCCCccchheEEEEecCCCceeee
Confidence 125799999999999999875543 334445678999999999643 2355554 57789999
Q ss_pred ccCCCCCcC-----CCCceEEEeCCEEEEEcCCc-----------------------eEEEEEcCCcccccceEE
Q 045821 285 VGRLSTLLT-----RPPCKLVAIGKTIFVIGKGC-----------------------SAVVIDVGNIGNIGGIMV 331 (352)
Q Consensus 285 ~~~~~~~~~-----~~~~~~~~~~~~i~v~gG~~-----------------------~~~~~d~~~~~w~~~~~~ 331 (352)
++.+|.... +..+.++.++++||++||.. .+.+||+++++|.....|
T Consensus 234 ~~~m~~~r~~~~~~~~~~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~l 308 (346)
T TIGR03547 234 LPPLPPPKSSSQEGLAGAFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSEVYALDNGKWSKVGKL 308 (346)
T ss_pred cCCCCCCCCCccccccEEeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEeeEEEecCCcccccCCC
Confidence 998876421 12334677899999999963 356899999999876544
No 19
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.98 E-value=2.2e-30 Score=236.00 Aligned_cols=230 Identities=18% Similarity=0.243 Sum_probs=173.5
Q ss_pred CCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECC--CCeeEecCCCC-CcceeeeeEEECCEE
Q 045821 94 RRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDAS--MNTWTDAAPMS-TARCYFPCGVLNQKI 170 (352)
Q Consensus 94 ~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~--t~~W~~~~~~~-~~r~~~~~~~~~~~i 170 (352)
.-.++.+++|| .++..++++.++++||++||... +.+++||+. +++|..++++| .+|..++++.++++|
T Consensus 16 ~~~~~~l~~lP---~~~~~~~~~~~~~~iyv~gG~~~-----~~~~~~d~~~~~~~W~~l~~~p~~~r~~~~~v~~~~~I 87 (376)
T PRK14131 16 AANAEQLPDLP---VPFKNGTGAIDNNTVYVGLGSAG-----TSWYKLDLNAPSKGWTKIAAFPGGPREQAVAAFIDGKL 87 (376)
T ss_pred ceecccCCCCC---cCccCCeEEEECCEEEEEeCCCC-----CeEEEEECCCCCCCeEECCcCCCCCcccceEEEECCEE
Confidence 34566778888 56666678889999999999633 458999986 47899999987 589999999999999
Q ss_pred EEEeCCCC-------C--CceEEEeCCCCcEEEecCC-CCcccceeEEE-ECCEEEEEccCCCCC---------------
Q 045821 171 YCIGGLGD-------T--HSWDVYDPRTNNWKLHTEP-NIFTEIEDSFV-MDGKIYIRCSASAAT--------------- 224 (352)
Q Consensus 171 yv~gG~~~-------~--~~v~~yd~~t~~W~~~~~~-~~~~~~~~~~~-~~g~iyv~GG~~~~~--------------- 224 (352)
||+||... . +++++||+.+++|+.++++ +..+..+++++ .+++||++||.....
T Consensus 88 YV~GG~~~~~~~~~~~~~~~v~~YD~~~n~W~~~~~~~p~~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~~d~~~~~~~~ 167 (376)
T PRK14131 88 YVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGLAGHVAVSLHNGKAYITGGVNKNIFDGYFEDLAAAGKDK 167 (376)
T ss_pred EEEcCCCCCCCCCceeEcccEEEEeCCCCEEEeCCCCCCCcccceEEEEeeCCEEEEECCCCHHHHHHHHhhhhhcccch
Confidence 99999754 1 6899999999999999853 44444555555 799999999975310
Q ss_pred -------------------CceEEEEEeCCCCceeecCCCCCCCC-CCcEEEECCEEEEEecCCC-----CeE--EEEeC
Q 045821 225 -------------------SHVCALVYEPSTDSWLHADANMASGW-RGPAVVVDDALYVLDQSSG-----TKL--MMWQK 277 (352)
Q Consensus 225 -------------------~~~~i~~yd~~~~~W~~~~~~~~~~~-~~~~~~~~~~l~~~gg~~~-----~~v--~~yd~ 277 (352)
....+++||+.+++|+.+++.+..++ ..+++..+++||++||... ..+ +.||+
T Consensus 168 ~~~~~i~~~~~~~~~~~~~~~~~v~~YD~~t~~W~~~~~~p~~~~~~~a~v~~~~~iYv~GG~~~~~~~~~~~~~~~~~~ 247 (376)
T PRK14131 168 TPKDKINDAYFDKKPEDYFFNKEVLSYDPSTNQWKNAGESPFLGTAGSAVVIKGNKLWLINGEIKPGLRTDAVKQGKFTG 247 (376)
T ss_pred hhhhhhHHHHhcCChhhcCcCceEEEEECCCCeeeECCcCCCCCCCcceEEEECCEEEEEeeeECCCcCChhheEEEecC
Confidence 13579999999999999987554333 4445678999999999532 233 45688
Q ss_pred CCCceEEccCCCCCcC----C--CCceEEEeCCEEEEEcCCc-----------------------eEEEEEcCCcccccc
Q 045821 278 ESREWSPVGRLSTLLT----R--PPCKLVAIGKTIFVIGKGC-----------------------SAVVIDVGNIGNIGG 328 (352)
Q Consensus 278 ~~~~W~~~~~~~~~~~----~--~~~~~~~~~~~i~v~gG~~-----------------------~~~~~d~~~~~w~~~ 328 (352)
++++|+.++.+|.... + ..+..+..+++||++||.. .+.+||+++++|...
T Consensus 248 ~~~~W~~~~~~p~~~~~~~~~~~~~~~a~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~ 327 (376)
T PRK14131 248 NNLKWQKLPDLPPAPGGSSQEGVAGAFAGYSNGVLLVAGGANFPGARENYQNGKLYAHEGLKKSWSDEIYALVNGKWQKV 327 (376)
T ss_pred CCcceeecCCCCCCCcCCcCCccceEeceeECCEEEEeeccCCCCChhhhhcCCcccccCCcceeehheEEecCCccccc
Confidence 9999999998876321 1 1122467899999999953 234799999999875
Q ss_pred eEE
Q 045821 329 IMV 331 (352)
Q Consensus 329 ~~~ 331 (352)
..|
T Consensus 328 ~~l 330 (376)
T PRK14131 328 GEL 330 (376)
T ss_pred CcC
Confidence 444
No 20
>KOG4693 consensus Uncharacterized conserved protein, contains kelch repeat [General function prediction only]
Probab=99.96 E-value=3.7e-28 Score=197.42 Aligned_cols=229 Identities=18% Similarity=0.202 Sum_probs=183.2
Q ss_pred CeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCC----CCCCCeEEEEECCCCeeEecCC-------------CCCcc
Q 045821 96 SWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWS----EDATSEVYCYDASMNTWTDAAP-------------MSTAR 158 (352)
Q Consensus 96 ~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~----~~~~~~~~~~d~~t~~W~~~~~-------------~~~~r 158 (352)
.|+..-.-. +.|-+|+++.++..||-|||+-.. ....-++.++|..+-+|..+++ .|..|
T Consensus 3 ~WTVHLeGG---PrRVNHAavaVG~riYSFGGYCsGedy~~~~piDVH~lNa~~~RWtk~pp~~~ka~i~~~yp~VPyqR 79 (392)
T KOG4693|consen 3 TWTVHLEGG---PRRVNHAAVAVGSRIYSFGGYCSGEDYDAKDPIDVHVLNAENYRWTKMPPGITKATIESPYPAVPYQR 79 (392)
T ss_pred eEEEEecCC---cccccceeeeecceEEecCCcccccccccCCcceeEEeeccceeEEecCcccccccccCCCCccchhh
Confidence 355443333 468899999999999999996432 1335589999999999999886 23459
Q ss_pred eeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEec---CCCCcccceeEEEECCEEEEEccCCCCC--CceEE
Q 045821 159 CYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHT---EPNIFTEIEDSFVMDGKIYIRCSASAAT--SHVCA 229 (352)
Q Consensus 159 ~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~---~~~~~~~~~~~~~~~g~iyv~GG~~~~~--~~~~i 229 (352)
++|+.+.+++++|+.||.++. +.++.||++|+.|.+.. -.|.++.++++|+.++.+|++||..... ..+++
T Consensus 80 YGHtvV~y~d~~yvWGGRND~egaCN~Ly~fDp~t~~W~~p~v~G~vPgaRDGHsAcV~gn~MyiFGGye~~a~~FS~d~ 159 (392)
T KOG4693|consen 80 YGHTVVEYQDKAYVWGGRNDDEGACNLLYEFDPETNVWKKPEVEGFVPGARDGHSACVWGNQMYIFGGYEEDAQRFSQDT 159 (392)
T ss_pred cCceEEEEcceEEEEcCccCcccccceeeeeccccccccccceeeecCCccCCceeeEECcEEEEecChHHHHHhhhccc
Confidence 999999999999999998875 88899999999998764 3688899999999999999999987543 34589
Q ss_pred EEEeCCCCceeecCC---CCCCCCCCcEEEECCEEEEEecCCC-------------CeEEEEeCCCCceEEccCCCC-Cc
Q 045821 230 LVYEPSTDSWLHADA---NMASGWRGPAVVVDDALYVLDQSSG-------------TKLMMWQKESREWSPVGRLST-LL 292 (352)
Q Consensus 230 ~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~~l~~~gg~~~-------------~~v~~yd~~~~~W~~~~~~~~-~~ 292 (352)
.++|..+.+|+++.. ++.-+..+.+.++++.+|++||... +.+..+|..++.|..-+..+. +.
T Consensus 160 h~ld~~TmtWr~~~Tkg~PprwRDFH~a~~~~~~MYiFGGR~D~~gpfHs~~e~Yc~~i~~ld~~T~aW~r~p~~~~~P~ 239 (392)
T KOG4693|consen 160 HVLDFATMTWREMHTKGDPPRWRDFHTASVIDGMMYIFGGRSDESGPFHSIHEQYCDTIMALDLATGAWTRTPENTMKPG 239 (392)
T ss_pred eeEeccceeeeehhccCCCchhhhhhhhhhccceEEEeccccccCCCccchhhhhcceeEEEeccccccccCCCCCcCCC
Confidence 999999999999854 2222334566788999999999543 578899999999998764332 22
Q ss_pred CCCCceEEEeCCEEEEEcCCc--------eEEEEEcCCccccc
Q 045821 293 TRPPCKLVAIGKTIFVIGKGC--------SAVVIDVGNIGNIG 327 (352)
Q Consensus 293 ~~~~~~~~~~~~~i~v~gG~~--------~~~~~d~~~~~w~~ 327 (352)
.|+.++..++++++|+|||++ +.+.||+.+..|..
T Consensus 240 GRRSHS~fvYng~~Y~FGGYng~ln~HfndLy~FdP~t~~W~~ 282 (392)
T KOG4693|consen 240 GRRSHSTFVYNGKMYMFGGYNGTLNVHFNDLYCFDPKTSMWSV 282 (392)
T ss_pred cccccceEEEcceEEEecccchhhhhhhcceeecccccchhee
Confidence 688889999999999999976 46679999999954
No 21
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=99.94 E-value=4.7e-25 Score=205.53 Aligned_cols=234 Identities=20% Similarity=0.246 Sum_probs=191.6
Q ss_pred ChhhHHHHhhCCCC--CCEEEEEEecCCCC---ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCC
Q 045821 54 SEEWCAYRRKHNLD--ETWIYAFCRDNKLE---RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGC 128 (352)
Q Consensus 54 s~~~~~~~~~~~~~--~~~l~~~gg~~~~~---~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~ 128 (352)
....+..|..|.+. ++.+|++||..... ...+++||. .+..|.........+.++.+++++.++++||+|||.
T Consensus 54 ~~~~p~~R~~hs~~~~~~~~~vfGG~~~~~~~~~~dl~~~d~--~~~~w~~~~~~g~~p~~r~g~~~~~~~~~l~lfGG~ 131 (482)
T KOG0379|consen 54 LGVGPIPRAGHSAVLIGNKLYVFGGYGSGDRLTDLDLYVLDL--ESQLWTKPAATGDEPSPRYGHSLSAVGDKLYLFGGT 131 (482)
T ss_pred CCCCcchhhccceeEECCEEEEECCCCCCCccccceeEEeec--CCcccccccccCCCCCcccceeEEEECCeEEEEccc
Confidence 34556677777655 89999999964222 225999999 999998887766555889999999999999999998
Q ss_pred CCCCCCCCeEEEEECCCCeeEecCC---CCCcceeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEec---CC
Q 045821 129 GWSEDATSEVYCYDASMNTWTDAAP---MSTARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHT---EP 198 (352)
Q Consensus 129 ~~~~~~~~~~~~~d~~t~~W~~~~~---~~~~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~---~~ 198 (352)
+......++++.||+.|++|+.+.. +|.+|.+|+++.++.++||+||.... +++++||+.+.+|.++. ..
T Consensus 132 ~~~~~~~~~l~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~~~~~~g~~ 211 (482)
T KOG0379|consen 132 DKKYRNLNELHSLDLSTRTWSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETSTWSELDTQGEA 211 (482)
T ss_pred cCCCCChhheEeccCCCCcEEEecCcCCCCCCcccceEEEECCEEEEECCccCcccceeeeeeeccccccceecccCCCC
Confidence 8644668899999999999998754 57889999999999999999998754 99999999999999986 35
Q ss_pred CCcccceeEEEECCEEEEEccCC-CCCCceEEEEEeCCCCceeecCC---CCCCCCCCcEEEECCEEEEEecCCC-----
Q 045821 199 NIFTEIEDSFVMDGKIYIRCSAS-AATSHVCALVYEPSTDSWLHADA---NMASGWRGPAVVVDDALYVLDQSSG----- 269 (352)
Q Consensus 199 ~~~~~~~~~~~~~g~iyv~GG~~-~~~~~~~i~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~~l~~~gg~~~----- 269 (352)
|.++..|++++++++++++||.. .....++++.+|+.+.+|.++.. .+.++..+..+..+..++++||...
T Consensus 212 P~pR~gH~~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~~~W~~~~~~g~~p~~R~~h~~~~~~~~~~l~gG~~~~~~~~ 291 (482)
T KOG0379|consen 212 PSPRYGHAMVVVGNKLLVFGGGDDGDVYLNDVHILDLSTWEWKLLPTGGDLPSPRSGHSLTVSGDHLLLFGGGTDPKQEP 291 (482)
T ss_pred CCCCCCceEEEECCeEEEEeccccCCceecceEeeecccceeeeccccCCCCCCcceeeeEEECCEEEEEcCCccccccc
Confidence 66788889999999999999988 44556699999999999997653 2333444445688999999998544
Q ss_pred -CeEEEEeCCCCceEEccCCC
Q 045821 270 -TKLMMWQKESREWSPVGRLS 289 (352)
Q Consensus 270 -~~v~~yd~~~~~W~~~~~~~ 289 (352)
.+++.||.+++.|+++....
T Consensus 292 l~~~~~l~~~~~~w~~~~~~~ 312 (482)
T KOG0379|consen 292 LGDLYGLDLETLVWSKVESVG 312 (482)
T ss_pred ccccccccccccceeeeeccc
Confidence 46889999999999997554
No 22
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=99.93 E-value=7.4e-24 Score=197.52 Aligned_cols=225 Identities=20% Similarity=0.357 Sum_probs=184.4
Q ss_pred cccCCcceEEEEECCEEEEEeCCCCCCCCCC-eEEEEECCCCeeEecC---CCCCcceeeeeEEECCEEEEEeCCCCC--
Q 045821 106 RTLRRKGMGFEVLGKNAYLLGGCGWSEDATS-EVYCYDASMNTWTDAA---PMSTARCYFPCGVLNQKIYCIGGLGDT-- 179 (352)
Q Consensus 106 ~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~-~~~~~d~~t~~W~~~~---~~~~~r~~~~~~~~~~~iyv~gG~~~~-- 179 (352)
.+.+|..|+++.+++++|+|||......... +++++|..+..|.... ..|.+|++|.++.++++||++||....
T Consensus 57 ~p~~R~~hs~~~~~~~~~vfGG~~~~~~~~~~dl~~~d~~~~~w~~~~~~g~~p~~r~g~~~~~~~~~l~lfGG~~~~~~ 136 (482)
T KOG0379|consen 57 GPIPRAGHSAVLIGNKLYVFGGYGSGDRLTDLDLYVLDLESQLWTKPAATGDEPSPRYGHSLSAVGDKLYLFGGTDKKYR 136 (482)
T ss_pred CcchhhccceeEECCEEEEECCCCCCCccccceeEEeecCCcccccccccCCCCCcccceeEEEECCeEEEEccccCCCC
Confidence 3378999999999999999999765522222 6999999999997654 456889999999999999999998843
Q ss_pred --CceEEEeCCCCcEEEec---CCCCcccceeEEEECCEEEEEccCCCCC-CceEEEEEeCCCCceeecCC---CCCCCC
Q 045821 180 --HSWDVYDPRTNNWKLHT---EPNIFTEIEDSFVMDGKIYIRCSASAAT-SHVCALVYEPSTDSWLHADA---NMASGW 250 (352)
Q Consensus 180 --~~v~~yd~~t~~W~~~~---~~~~~~~~~~~~~~~g~iyv~GG~~~~~-~~~~i~~yd~~~~~W~~~~~---~~~~~~ 250 (352)
++++.||+.|++|+.+. .+|+++..|+++++++++||+||..... ..+++++||+.+.+|.++.. .+.++.
T Consensus 137 ~~~~l~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~~~~~~g~~P~pR~ 216 (482)
T KOG0379|consen 137 NLNELHSLDLSTRTWSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETSTWSELDTQGEAPSPRY 216 (482)
T ss_pred ChhheEeccCCCCcEEEecCcCCCCCCcccceEEEECCEEEEECCccCcccceeeeeeeccccccceecccCCCCCCCCC
Confidence 79999999999999886 3688889999999999999999988766 67799999999999998854 233344
Q ss_pred CCcEEEECCEEEEEecCC-C----CeEEEEeCCCCceEEcc---CCCCCcCCCCceEEEeCCEEEEEcCCce--------
Q 045821 251 RGPAVVVDDALYVLDQSS-G----TKLMMWQKESREWSPVG---RLSTLLTRPPCKLVAIGKTIFVIGKGCS-------- 314 (352)
Q Consensus 251 ~~~~~~~~~~l~~~gg~~-~----~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~~~i~v~gG~~~-------- 314 (352)
.+..++.+++++++||.. + ++++.+|+.+.+|..+. ..|. +|..+..+..+++++++||...
T Consensus 217 gH~~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~~~W~~~~~~g~~p~--~R~~h~~~~~~~~~~l~gG~~~~~~~~l~~ 294 (482)
T KOG0379|consen 217 GHAMVVVGNKLLVFGGGDDGDVYLNDVHILDLSTWEWKLLPTGGDLPS--PRSGHSLTVSGDHLLLFGGGTDPKQEPLGD 294 (482)
T ss_pred CceEEEECCeEEEEeccccCCceecceEeeecccceeeeccccCCCCC--CcceeeeEEECCEEEEEcCCcccccccccc
Confidence 455578899999998876 3 79999999999999665 3343 6888888899999999998653
Q ss_pred EEEEEcCCcccccceEEe
Q 045821 315 AVVIDVGNIGNIGGIMVS 332 (352)
Q Consensus 315 ~~~~d~~~~~w~~~~~~~ 332 (352)
.+.+|.+...|.+.....
T Consensus 295 ~~~l~~~~~~w~~~~~~~ 312 (482)
T KOG0379|consen 295 LYGLDLETLVWSKVESVG 312 (482)
T ss_pred cccccccccceeeeeccc
Confidence 345888888898755554
No 23
>KOG1230 consensus Protein containing repeated kelch motifs [General function prediction only]
Probab=99.91 E-value=7.7e-23 Score=175.86 Aligned_cols=223 Identities=13% Similarity=0.191 Sum_probs=174.2
Q ss_pred cCCcceEEEEE--CCEEEEEeCC--CCC-CCCCCeEEEEECCCCeeEec--CCCCCcceeeeeEEE-CCEEEEEeCCCCC
Q 045821 108 LRRKGMGFEVL--GKNAYLLGGC--GWS-EDATSEVYCYDASMNTWTDA--APMSTARCYFPCGVL-NQKIYCIGGLGDT 179 (352)
Q Consensus 108 ~~~~~~~~~~~--~~~iyv~GG~--~~~-~~~~~~~~~~d~~t~~W~~~--~~~~~~r~~~~~~~~-~~~iyv~gG~~~~ 179 (352)
.+|.++++.+. .+.+++|||. ++. ....++++.||..+++|+.+ |+.|.||+.|.++++ .+.+|++||.-..
T Consensus 65 spRsn~sl~~nPekeELilfGGEf~ngqkT~vYndLy~Yn~k~~eWkk~~spn~P~pRsshq~va~~s~~l~~fGGEfaS 144 (521)
T KOG1230|consen 65 SPRSNPSLFANPEKEELILFGGEFYNGQKTHVYNDLYSYNTKKNEWKKVVSPNAPPPRSSHQAVAVPSNILWLFGGEFAS 144 (521)
T ss_pred CCCCCcceeeccCcceeEEecceeecceeEEEeeeeeEEeccccceeEeccCCCcCCCccceeEEeccCeEEEeccccCC
Confidence 67888777666 5689999994 322 24578999999999999976 456789999988888 4899999995432
Q ss_pred ---------CceEEEeCCCCcEEEecC--CCCcccceeEEEECCEEEEEccCCCCC----CceEEEEEeCCCCceeecCC
Q 045821 180 ---------HSWDVYDPRTNNWKLHTE--PNIFTEIEDSFVMDGKIYIRCSASAAT----SHVCALVYEPSTDSWLHADA 244 (352)
Q Consensus 180 ---------~~v~~yd~~t~~W~~~~~--~~~~~~~~~~~~~~g~iyv~GG~~~~~----~~~~i~~yd~~~~~W~~~~~ 244 (352)
.++|.||..|++|+++.. -|.+|.+|-+++...+|++|||..... +.+++++||+.+-+|+++.+
T Consensus 145 Pnq~qF~HYkD~W~fd~~trkweql~~~g~PS~RSGHRMvawK~~lilFGGFhd~nr~y~YyNDvy~FdLdtykW~Klep 224 (521)
T KOG1230|consen 145 PNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSPRSGHRMVAWKRQLILFGGFHDSNRDYIYYNDVYAFDLDTYKWSKLEP 224 (521)
T ss_pred cchhhhhhhhheeeeeeccchheeeccCCCCCCCccceeEEeeeeEEEEcceecCCCceEEeeeeEEEeccceeeeeccC
Confidence 999999999999999964 466788888999999999999988643 34599999999999999966
Q ss_pred CC--CCCCCCc--EEEECCEEEEEecCCC-------------CeEEEEeCCC-----CceEEccCCC-CCcCCCCceEEE
Q 045821 245 NM--ASGWRGP--AVVVDDALYVLDQSSG-------------TKLMMWQKES-----REWSPVGRLS-TLLTRPPCKLVA 301 (352)
Q Consensus 245 ~~--~~~~~~~--~~~~~~~l~~~gg~~~-------------~~v~~yd~~~-----~~W~~~~~~~-~~~~~~~~~~~~ 301 (352)
.- +.++++. ++.-+|.||+.||+.. ++++.++++. =+|+.+.... .+.+|.++++++
T Consensus 225 sga~PtpRSGcq~~vtpqg~i~vyGGYsK~~~kK~~dKG~~hsDmf~L~p~~~~~dKw~W~kvkp~g~kPspRsgfsv~v 304 (521)
T KOG1230|consen 225 SGAGPTPRSGCQFSVTPQGGIVVYGGYSKQRVKKDVDKGTRHSDMFLLKPEDGREDKWVWTKVKPSGVKPSPRSGFSVAV 304 (521)
T ss_pred CCCCCCCCCcceEEecCCCcEEEEcchhHhhhhhhhhcCceeeeeeeecCCcCCCcceeEeeccCCCCCCCCCCceeEEE
Confidence 32 3444444 3455999999999764 6899999988 4588886432 223688888777
Q ss_pred e-CCEEEEEcCCc---------------eEEEEEcCCcccccceE
Q 045821 302 I-GKTIFVIGKGC---------------SAVVIDVGNIGNIGGIM 330 (352)
Q Consensus 302 ~-~~~i~v~gG~~---------------~~~~~d~~~~~w~~~~~ 330 (352)
. +++-|.|||.. +.+.||...++|.++..
T Consensus 305 a~n~kal~FGGV~D~eeeeEsl~g~F~NDLy~fdlt~nrW~~~ql 349 (521)
T KOG1230|consen 305 AKNHKALFFGGVCDLEEEEESLSGEFFNDLYFFDLTRNRWSEGQL 349 (521)
T ss_pred ecCCceEEecceecccccchhhhhhhhhhhhheecccchhhHhhh
Confidence 6 55899999955 36679999999987633
No 24
>KOG4152 consensus Host cell transcription factor HCFC1 [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.90 E-value=6.6e-23 Score=180.60 Aligned_cols=264 Identities=17% Similarity=0.199 Sum_probs=196.9
Q ss_pred hhcccccC-hhhHHHHhhCCCC--CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEE
Q 045821 47 RWRGLLHS-EEWCAYRRKHNLD--ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAY 123 (352)
Q Consensus 47 ~W~~l~~s-~~~~~~~~~~~~~--~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iy 123 (352)
+|+.+..+ ...+++|+.|... ...|.+|||.|..-..++++||. .++.|..-......+.+.+.|..+..+.+||
T Consensus 18 rWrrV~~~tGPvPrpRHGHRAVaikELiviFGGGNEGiiDELHvYNT--atnqWf~PavrGDiPpgcAA~GfvcdGtril 95 (830)
T KOG4152|consen 18 RWRRVQQSTGPVPRPRHGHRAVAIKELIVIFGGGNEGIIDELHVYNT--ATNQWFAPAVRGDIPPGCAAFGFVCDGTRIL 95 (830)
T ss_pred ceEEEecccCCCCCccccchheeeeeeEEEecCCcccchhhhhhhcc--ccceeecchhcCCCCCchhhcceEecCceEE
Confidence 57766543 2345556555433 88899999986655677899999 9999976544333336677888888999999
Q ss_pred EEeCCCCCCCCCCeEEEEECCCCeeEecCC-------CCCcceeeeeEEECCEEEEEeCCCCC------------CceEE
Q 045821 124 LLGGCGWSEDATSEVYCYDASMNTWTDAAP-------MSTARCYFPCGVLNQKIYCIGGLGDT------------HSWDV 184 (352)
Q Consensus 124 v~GG~~~~~~~~~~~~~~d~~t~~W~~~~~-------~~~~r~~~~~~~~~~~iyv~gG~~~~------------~~v~~ 184 (352)
+|||..+....+++++-.....-.|+++.+ +|-||.+|+...++++-|+|||...+ +++++
T Consensus 96 vFGGMvEYGkYsNdLYELQasRWeWkrlkp~~p~nG~pPCPRlGHSFsl~gnKcYlFGGLaNdseDpknNvPrYLnDlY~ 175 (830)
T KOG4152|consen 96 VFGGMVEYGKYSNDLYELQASRWEWKRLKPKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNVPRYLNDLYI 175 (830)
T ss_pred EEccEeeeccccchHHHhhhhhhhHhhcCCCCCCCCCCCCCccCceeEEeccEeEEeccccccccCcccccchhhcceEE
Confidence 999988777888888766666666777753 35689999999999999999996432 77777
Q ss_pred EeCCCC----cEEEec---CCCCcccceeEEEE------CCEEEEEccCCCCCCceEEEEEeCCCCceeecCC---CCCC
Q 045821 185 YDPRTN----NWKLHT---EPNIFTEIEDSFVM------DGKIYIRCSASAATSHVCALVYEPSTDSWLHADA---NMAS 248 (352)
Q Consensus 185 yd~~t~----~W~~~~---~~~~~~~~~~~~~~------~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~---~~~~ 248 (352)
.+..-+ .|...- ..|.++..|.++++ ..++||+||+.+ -...++|..|+.+-+|++..- .+-+
T Consensus 176 leL~~Gsgvv~W~ip~t~Gv~P~pRESHTAViY~eKDs~~skmvvyGGM~G-~RLgDLW~Ldl~Tl~W~kp~~~G~~PlP 254 (830)
T KOG4152|consen 176 LELRPGSGVVAWDIPITYGVLPPPRESHTAVIYTEKDSKKSKMVVYGGMSG-CRLGDLWTLDLDTLTWNKPSLSGVAPLP 254 (830)
T ss_pred EEeccCCceEEEecccccCCCCCCcccceeEEEEeccCCcceEEEEccccc-ccccceeEEecceeecccccccCCCCCC
Confidence 777644 486543 36777888888777 348999998874 344589999999999998743 2333
Q ss_pred CCCCcEEEECCEEEEEecCCC------------------CeEEEEeCCCCceEEcc-----CCCCCcCCCCceEEEeCCE
Q 045821 249 GWRGPAVVVDDALYVLDQSSG------------------TKLMMWQKESREWSPVG-----RLSTLLTRPPCKLVAIGKT 305 (352)
Q Consensus 249 ~~~~~~~~~~~~l~~~gg~~~------------------~~v~~yd~~~~~W~~~~-----~~~~~~~~~~~~~~~~~~~ 305 (352)
+.-+.+..+++|+|++||.-- +.+-++|++++.|..+- +-..+..|.+++.+.++.+
T Consensus 255 RSLHsa~~IGnKMyvfGGWVPl~~~~~~~~~hekEWkCTssl~clNldt~~W~tl~~d~~ed~tiPR~RAGHCAvAigtR 334 (830)
T KOG4152|consen 255 RSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTSSLACLNLDTMAWETLLMDTLEDNTIPRARAGHCAVAIGTR 334 (830)
T ss_pred cccccceeecceeEEecceeeeeccccccccccceeeeccceeeeeecchheeeeeeccccccccccccccceeEEeccE
Confidence 334556788999999999311 57889999999999874 1223346888999999999
Q ss_pred EEEEcCCc
Q 045821 306 IFVIGKGC 313 (352)
Q Consensus 306 i~v~gG~~ 313 (352)
+|+-.|++
T Consensus 335 lYiWSGRD 342 (830)
T KOG4152|consen 335 LYIWSGRD 342 (830)
T ss_pred EEEEeccc
Confidence 99999876
No 25
>KOG1230 consensus Protein containing repeated kelch motifs [General function prediction only]
Probab=99.90 E-value=1.9e-22 Score=173.43 Aligned_cols=257 Identities=14% Similarity=0.147 Sum_probs=183.7
Q ss_pred cCCCcHHHHHHhhccCccccchhhhhhhhhhcccccChhhHHHHhhCC----CCCCEEEEEEecCCCCc-----eeEEEe
Q 045821 18 ISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEWCAYRRKHN----LDETWIYAFCRDNKLER-----VCCYVL 88 (352)
Q Consensus 18 ~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~~~~~~~~~----~~~~~l~~~gg~~~~~~-----~~~~~~ 88 (352)
...|-++.+.+|+..+.++......-.+ .-.++ +|.... ...+.|+++||.-..+. ..+|.|
T Consensus 33 ~~e~de~~i~~~iq~~eaK~~e~~~e~~----~~~Ps-----pRsn~sl~~nPekeELilfGGEf~ngqkT~vYndLy~Y 103 (521)
T KOG1230|consen 33 NEELDEADIAEIIQSLEAKQIEHVVETS----VPPPS-----PRSNPSLFANPEKEELILFGGEFYNGQKTHVYNDLYSY 103 (521)
T ss_pred CcccchHHHHHHHHhhhhhccceeeecc----CCCCC-----CCCCcceeeccCcceeEEecceeecceeEEEeeeeeEE
Confidence 3445577888888888776542211000 00111 122111 11557999998522221 247899
Q ss_pred cCCCCCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCC-----CCCCeEEEEECCCCeeEecCC--CCCccee
Q 045821 89 DPNSTRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSE-----DATSEVYCYDASMNTWTDAAP--MSTARCY 160 (352)
Q Consensus 89 d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~-----~~~~~~~~~d~~t~~W~~~~~--~~~~r~~ 160 (352)
|. .+++|+++..... |.+|..|.++++- |.+|+|||.-... ...+++|.||..+++|..+.. -|.||++
T Consensus 104 n~--k~~eWkk~~spn~-P~pRsshq~va~~s~~l~~fGGEfaSPnq~qF~HYkD~W~fd~~trkweql~~~g~PS~RSG 180 (521)
T KOG1230|consen 104 NT--KKNEWKKVVSPNA-PPPRSSHQAVAVPSNILWLFGGEFASPNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSPRSG 180 (521)
T ss_pred ec--cccceeEeccCCC-cCCCccceeEEeccCeEEEeccccCCcchhhhhhhhheeeeeeccchheeeccCCCCCCCcc
Confidence 99 9999999875433 3678888887774 8999999964321 246799999999999998854 5789999
Q ss_pred eeeEEECCEEEEEeCCCCC-------CceEEEeCCCCcEEEecCC---CCcccceeEEEE-CCEEEEEccCCC-------
Q 045821 161 FPCGVLNQKIYCIGGLGDT-------HSWDVYDPRTNNWKLHTEP---NIFTEIEDSFVM-DGKIYIRCSASA------- 222 (352)
Q Consensus 161 ~~~~~~~~~iyv~gG~~~~-------~~v~~yd~~t~~W~~~~~~---~~~~~~~~~~~~-~g~iyv~GG~~~------- 222 (352)
|.++....++++|||..+. +++++||+.|-+|.++.+. |.+|.+++..+. +|.|||.||+..
T Consensus 181 HRMvawK~~lilFGGFhd~nr~y~YyNDvy~FdLdtykW~Klepsga~PtpRSGcq~~vtpqg~i~vyGGYsK~~~kK~~ 260 (521)
T KOG1230|consen 181 HRMVAWKRQLILFGGFHDSNRDYIYYNDVYAFDLDTYKWSKLEPSGAGPTPRSGCQFSVTPQGGIVVYGGYSKQRVKKDV 260 (521)
T ss_pred ceeEEeeeeEEEEcceecCCCceEEeeeeEEEeccceeeeeccCCCCCCCCCCcceEEecCCCcEEEEcchhHhhhhhhh
Confidence 9999999999999997654 9999999999999999753 456667766666 999999999763
Q ss_pred --CCCceEEEEEeCCC-----CceeecCC--CCCCCCCCcE--EEECCEEEEEecCCC-------------CeEEEEeCC
Q 045821 223 --ATSHVCALVYEPST-----DSWLHADA--NMASGWRGPA--VVVDDALYVLDQSSG-------------TKLMMWQKE 278 (352)
Q Consensus 223 --~~~~~~i~~yd~~~-----~~W~~~~~--~~~~~~~~~~--~~~~~~l~~~gg~~~-------------~~v~~yd~~ 278 (352)
+...++++..++.+ -.|+.+.+ ..+.++++.+ +.-+++-+.+||.-. ++++.||..
T Consensus 261 dKG~~hsDmf~L~p~~~~~dKw~W~kvkp~g~kPspRsgfsv~va~n~kal~FGGV~D~eeeeEsl~g~F~NDLy~fdlt 340 (521)
T KOG1230|consen 261 DKGTRHSDMFLLKPEDGREDKWVWTKVKPSGVKPSPRSGFSVAVAKNHKALFFGGVCDLEEEEESLSGEFFNDLYFFDLT 340 (521)
T ss_pred hcCceeeeeeeecCCcCCCcceeEeeccCCCCCCCCCCceeEEEecCCceEEecceecccccchhhhhhhhhhhhheecc
Confidence 22344899999877 56888865 2334444444 345779999999432 789999999
Q ss_pred CCceEEcc
Q 045821 279 SREWSPVG 286 (352)
Q Consensus 279 ~~~W~~~~ 286 (352)
.++|+...
T Consensus 341 ~nrW~~~q 348 (521)
T KOG1230|consen 341 RNRWSEGQ 348 (521)
T ss_pred cchhhHhh
Confidence 99998663
No 26
>KOG4152 consensus Host cell transcription factor HCFC1 [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.80 E-value=3.1e-18 Score=151.37 Aligned_cols=218 Identities=15% Similarity=0.201 Sum_probs=157.9
Q ss_pred CCCCeeEeccCCC-cccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEec---CCCCCcceeeeeEEECC
Q 045821 93 TRRSWKMIHELPA-RTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDA---APMSTARCYFPCGVLNQ 168 (352)
Q Consensus 93 ~~~~w~~~~~~~~-~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~---~~~~~~r~~~~~~~~~~ 168 (352)
....|+.+..... -+.+|++|.++++...|.+|||-++ ...+++++||..+++|..- .+.|.+.+.|..+..+.
T Consensus 15 ~~~rWrrV~~~tGPvPrpRHGHRAVaikELiviFGGGNE--GiiDELHvYNTatnqWf~PavrGDiPpgcAA~GfvcdGt 92 (830)
T KOG4152|consen 15 NVVRWRRVQQSTGPVPRPRHGHRAVAIKELIVIFGGGNE--GIIDELHVYNTATNQWFAPAVRGDIPPGCAAFGFVCDGT 92 (830)
T ss_pred cccceEEEecccCCCCCccccchheeeeeeEEEecCCcc--cchhhhhhhccccceeecchhcCCCCCchhhcceEecCc
Confidence 5567887765432 2368999999999999999999765 4578899999999999753 46788888888888899
Q ss_pred EEEEEeCCCCC----CceEEEeCCCCcEEEecC-------CCCcccceeEEEECCEEEEEccCCCCC---------CceE
Q 045821 169 KIYCIGGLGDT----HSWDVYDPRTNNWKLHTE-------PNIFTEIEDSFVMDGKIYIRCSASAAT---------SHVC 228 (352)
Q Consensus 169 ~iyv~gG~~~~----~~v~~yd~~t~~W~~~~~-------~~~~~~~~~~~~~~g~iyv~GG~~~~~---------~~~~ 228 (352)
+||+|||.-.. ++++......-.|+++.+ +|-++-+|+...++++-|+|||..... +.++
T Consensus 93 rilvFGGMvEYGkYsNdLYELQasRWeWkrlkp~~p~nG~pPCPRlGHSFsl~gnKcYlFGGLaNdseDpknNvPrYLnD 172 (830)
T KOG4152|consen 93 RILVFGGMVEYGKYSNDLYELQASRWEWKRLKPKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNVPRYLND 172 (830)
T ss_pred eEEEEccEeeeccccchHHHhhhhhhhHhhcCCCCCCCCCCCCCccCceeEEeccEeEEeccccccccCcccccchhhcc
Confidence 99999995432 555444444456777753 233455666677799999999975422 2347
Q ss_pred EEEEeCCCC----ceeecC---CCCCCCCCCcEEEE------CCEEEEEecCCC---CeEEEEeCCCCceEEccCC-CCC
Q 045821 229 ALVYEPSTD----SWLHAD---ANMASGWRGPAVVV------DDALYVLDQSSG---TKLMMWQKESREWSPVGRL-STL 291 (352)
Q Consensus 229 i~~yd~~~~----~W~~~~---~~~~~~~~~~~~~~------~~~l~~~gg~~~---~~v~~yd~~~~~W~~~~~~-~~~ 291 (352)
++..++..+ .|...- ..++.+.++.++.. ..++|++||..+ .++|.+|+++.+|++...- -.+
T Consensus 173 lY~leL~~Gsgvv~W~ip~t~Gv~P~pRESHTAViY~eKDs~~skmvvyGGM~G~RLgDLW~Ldl~Tl~W~kp~~~G~~P 252 (830)
T KOG4152|consen 173 LYILELRPGSGVVAWDIPITYGVLPPPRESHTAVIYTEKDSKKSKMVVYGGMSGCRLGDLWTLDLDTLTWNKPSLSGVAP 252 (830)
T ss_pred eEEEEeccCCceEEEecccccCCCCCCcccceeEEEEeccCCcceEEEEcccccccccceeEEecceeecccccccCCCC
Confidence 777776543 386542 23344444455443 347999999877 7999999999999977521 122
Q ss_pred cCCCCceEEEeCCEEEEEcCC
Q 045821 292 LTRPPCKLVAIGKTIFVIGKG 312 (352)
Q Consensus 292 ~~~~~~~~~~~~~~i~v~gG~ 312 (352)
++|..+....+++|+|||||-
T Consensus 253 lPRSLHsa~~IGnKMyvfGGW 273 (830)
T KOG4152|consen 253 LPRSLHSATTIGNKMYVFGGW 273 (830)
T ss_pred CCcccccceeecceeEEecce
Confidence 468888999999999999983
No 27
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=99.80 E-value=1.2e-17 Score=142.05 Aligned_cols=237 Identities=21% Similarity=0.323 Sum_probs=172.9
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCC----CCCCCeEEEEEC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWS----EDATSEVYCYDA 143 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~----~~~~~~~~~~d~ 143 (352)
++.+|+--|. ....+|.+|.++....|+++...|. .+|.+...++++++||+|||.... ....+++++|||
T Consensus 46 g~~~YVGLGs---~G~afy~ldL~~~~k~W~~~a~FpG--~~rnqa~~a~~~~kLyvFgG~Gk~~~~~~~~~nd~Y~y~p 120 (381)
T COG3055 46 GDTVYVGLGS---AGTAFYVLDLKKPGKGWTKIADFPG--GARNQAVAAVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDP 120 (381)
T ss_pred cceEEEEecc---CCccceehhhhcCCCCceEcccCCC--cccccchheeeCCeEEEeeccccCCCCCceEeeeeEEecC
Confidence 6788986553 3457888888667789999999986 578899999999999999996432 345789999999
Q ss_pred CCCeeEecCC-CCCcceeeeeEEECC-EEEEEeCCCCC-------------------------------------CceEE
Q 045821 144 SMNTWTDAAP-MSTARCYFPCGVLNQ-KIYCIGGLGDT-------------------------------------HSWDV 184 (352)
Q Consensus 144 ~t~~W~~~~~-~~~~r~~~~~~~~~~-~iyv~gG~~~~-------------------------------------~~v~~ 184 (352)
.+++|..+.. .|+....+..+.+++ +||++||+... ..+..
T Consensus 121 ~~nsW~kl~t~sP~gl~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~~~i~~~yf~~~~~dy~~n~ev~s 200 (381)
T COG3055 121 STNSWHKLDTRSPTGLVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAVDKIIAHYFDKKAEDYFFNKEVLS 200 (381)
T ss_pred CCChhheeccccccccccceeEecCCceEEEEccccHHhhhhhHHhhhhhcccHHHHHHHHHHHhCCCHHHhcccccccc
Confidence 9999998865 355566777888876 99999997421 78899
Q ss_pred EeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCC-CceEEEEEeC--CCCceeecCCCCCCCCCC---cE---
Q 045821 185 YDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAAT-SHVCALVYEP--STDSWLHADANMASGWRG---PA--- 254 (352)
Q Consensus 185 yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~-~~~~i~~yd~--~~~~W~~~~~~~~~~~~~---~~--- 254 (352)
|||++++|+.+...|......++++. ++++.++-|....+ ....+..++. ...+|..+++.+...... .+
T Consensus 201 y~p~~n~W~~~G~~pf~~~aGsa~~~~~n~~~lInGEiKpGLRt~~~k~~~~~~~~~~w~~l~~lp~~~~~~~eGvAGaf 280 (381)
T COG3055 201 YDPSTNQWRNLGENPFYGNAGSAVVIKGNKLTLINGEIKPGLRTAEVKQADFGGDNLKWLKLSDLPAPIGSNKEGVAGAF 280 (381)
T ss_pred cccccchhhhcCcCcccCccCcceeecCCeEEEEcceecCCccccceeEEEeccCceeeeeccCCCCCCCCCccccceec
Confidence 99999999998866655544455554 56577775654333 2335556655 456899987643332221 11
Q ss_pred -EEECCEEEEEecCCC-----------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc
Q 045821 255 -VVVDDALYVLDQSSG-----------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG 310 (352)
Q Consensus 255 -~~~~~~l~~~gg~~~-----------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g 310 (352)
-..++.+.+.||..- ++|+.+| +++|+.++.+|. .......+..+++||+||
T Consensus 281 ~G~s~~~~lv~GGAnF~Ga~~~y~~Gk~~AH~Gl~K~w~~~Vy~~d--~g~Wk~~GeLp~--~l~YG~s~~~nn~vl~IG 356 (381)
T COG3055 281 SGKSNGEVLVAGGANFPGALKAYKNGKFYAHEGLSKSWNSEVYIFD--NGSWKIVGELPQ--GLAYGVSLSYNNKVLLIG 356 (381)
T ss_pred cceeCCeEEEecCCCChhHHHHHHhcccccccchhhhhhceEEEEc--CCceeeecccCC--CccceEEEecCCcEEEEc
Confidence 134677777776321 4788888 899999999998 455566677899999999
Q ss_pred CCc
Q 045821 311 KGC 313 (352)
Q Consensus 311 G~~ 313 (352)
|..
T Consensus 357 GE~ 359 (381)
T COG3055 357 GET 359 (381)
T ss_pred ccc
Confidence 965
No 28
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=99.69 E-value=3.6e-15 Score=127.21 Aligned_cols=207 Identities=23% Similarity=0.357 Sum_probs=148.3
Q ss_pred EeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC--CeeEecCCCC-CcceeeeeEEECCEEEEEeC
Q 045821 99 MIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM--NTWTDAAPMS-TARCYFPCGVLNQKIYCIGG 175 (352)
Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t--~~W~~~~~~~-~~r~~~~~~~~~~~iyv~gG 175 (352)
.+|.+| .+..+-+.+.+++.+||-=|..+ .+.+..|... ..|+.++..| .+|.....++++++||++||
T Consensus 29 ~lPdlP---vg~KnG~Ga~ig~~~YVGLGs~G-----~afy~ldL~~~~k~W~~~a~FpG~~rnqa~~a~~~~kLyvFgG 100 (381)
T COG3055 29 QLPDLP---VGFKNGAGALIGDTVYVGLGSAG-----TAFYVLDLKKPGKGWTKIADFPGGARNQAVAAVIGGKLYVFGG 100 (381)
T ss_pred cCCCCC---ccccccccceecceEEEEeccCC-----ccceehhhhcCCCCceEcccCCCcccccchheeeCCeEEEeec
Confidence 345555 34444467788889999866333 4567777763 5699999988 57888899999999999999
Q ss_pred CCCC--------CceEEEeCCCCcEEEecCC-CCcccceeEEEECC-EEEEEccCCCCC---------------------
Q 045821 176 LGDT--------HSWDVYDPRTNNWKLHTEP-NIFTEIEDSFVMDG-KIYIRCSASAAT--------------------- 224 (352)
Q Consensus 176 ~~~~--------~~v~~yd~~t~~W~~~~~~-~~~~~~~~~~~~~g-~iyv~GG~~~~~--------------------- 224 (352)
.... +++++|||.+++|.++... |..-..+.++.+++ +||++||.+..-
T Consensus 101 ~Gk~~~~~~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~~~i 180 (381)
T COG3055 101 YGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAVDKI 180 (381)
T ss_pred cccCCCCCceEeeeeEEecCCCChhheeccccccccccceeEecCCceEEEEccccHHhhhhhHHhhhhhcccHHHHHHH
Confidence 7643 8999999999999998753 33344456677777 999999977311
Q ss_pred -------------CceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEecCCC-----CeEEEEeCC--CCceE
Q 045821 225 -------------SHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLDQSSG-----TKLMMWQKE--SREWS 283 (352)
Q Consensus 225 -------------~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~gg~~~-----~~v~~yd~~--~~~W~ 283 (352)
....++.||+++++|+.+...+-...++++++. ++++.++.|+-. ..+.+++.. ..+|.
T Consensus 181 ~~~yf~~~~~dy~~n~ev~sy~p~~n~W~~~G~~pf~~~aGsa~~~~~n~~~lInGEiKpGLRt~~~k~~~~~~~~~~w~ 260 (381)
T COG3055 181 IAHYFDKKAEDYFFNKEVLSYDPSTNQWRNLGENPFYGNAGSAVVIKGNKLTLINGEIKPGLRTAEVKQADFGGDNLKWL 260 (381)
T ss_pred HHHHhCCCHHHhcccccccccccccchhhhcCcCcccCccCcceeecCCeEEEEcceecCCccccceeEEEeccCceeee
Confidence 112789999999999999876767777777654 666888887543 345666555 56799
Q ss_pred EccCCCCCcCCCCceEE-----EeCCEEEEEcCCc
Q 045821 284 PVGRLSTLLTRPPCKLV-----AIGKTIFVIGKGC 313 (352)
Q Consensus 284 ~~~~~~~~~~~~~~~~~-----~~~~~i~v~gG~~ 313 (352)
.++++|.+.....-.++ ..++++.+.||.+
T Consensus 261 ~l~~lp~~~~~~~eGvAGaf~G~s~~~~lv~GGAn 295 (381)
T COG3055 261 KLSDLPAPIGSNKEGVAGAFSGKSNGEVLVAGGAN 295 (381)
T ss_pred eccCCCCCCCCCccccceeccceeCCeEEEecCCC
Confidence 99988865433322222 2467788888755
No 29
>KOG2437 consensus Muskelin [Signal transduction mechanisms]
Probab=99.54 E-value=1.3e-14 Score=128.44 Aligned_cols=234 Identities=16% Similarity=0.172 Sum_probs=155.0
Q ss_pred CCCCeeEeccCC-------CcccCCcceEEEEECC--EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCC---CCCccee
Q 045821 93 TRRSWKMIHELP-------ARTLRRKGMGFEVLGK--NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAP---MSTARCY 160 (352)
Q Consensus 93 ~~~~w~~~~~~~-------~~~~~~~~~~~~~~~~--~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~---~~~~r~~ 160 (352)
-+..|.++++.. ..++.|.+|..+...+ .||++||+++. +.+.++|+|+...+.|..+.. .|..|..
T Consensus 237 y~~~W~~i~~~~~~~~~~~~~p~~RgGHQMV~~~~~~CiYLYGGWdG~-~~l~DFW~Y~v~e~~W~~iN~~t~~PG~RsC 315 (723)
T KOG2437|consen 237 YKPRWSQIIPKSTKGDGEDNRPGMRGGHQMVIDVQTECVYLYGGWDGT-QDLADFWAYSVKENQWTCINRDTEGPGARSC 315 (723)
T ss_pred ccccccccCchhhcccccccCccccCcceEEEeCCCcEEEEecCcccc-hhHHHHHhhcCCcceeEEeecCCCCCcchhh
Confidence 567788777654 3457788998888865 89999999987 778999999999999998753 5778999
Q ss_pred eeeEEECC--EEEEEeCCCCC---------CceEEEeCCCCcEEEecC------CCCcccceeEEEECCE--EEEEccCC
Q 045821 161 FPCGVLNQ--KIYCIGGLGDT---------HSWDVYDPRTNNWKLHTE------PNIFTEIEDSFVMDGK--IYIRCSAS 221 (352)
Q Consensus 161 ~~~~~~~~--~iyv~gG~~~~---------~~v~~yd~~t~~W~~~~~------~~~~~~~~~~~~~~g~--iyv~GG~~ 221 (352)
|.++..-. ++|++|-+-.. +++|+||..++.|.-++. -|.....|..++.+.+ |||+||..
T Consensus 316 HRMVid~S~~KLYLlG~Y~~sS~r~~~s~RsDfW~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd~~k~~iyVfGGr~ 395 (723)
T KOG2437|consen 316 HRMVIDISRRKLYLLGRYLDSSVRNSKSLRSDFWRFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVDSEKHMIYVFGGRI 395 (723)
T ss_pred hhhhhhhhHhHHhhhhhccccccccccccccceEEEecCCceeEEecccccccCCcceeecceeeEecCcceEEEecCee
Confidence 98887644 89999986432 899999999999998862 1223344566666655 99999976
Q ss_pred CCCC---ceEEEEEeCCCCceeecCCCCCC----------CCCC-c-EEEECCEEEEEecCCC----CeEEEEeCCCCce
Q 045821 222 AATS---HVCALVYEPSTDSWLHADANMAS----------GWRG-P-AVVVDDALYVLDQSSG----TKLMMWQKESREW 282 (352)
Q Consensus 222 ~~~~---~~~i~~yd~~~~~W~~~~~~~~~----------~~~~-~-~~~~~~~l~~~gg~~~----~~v~~yd~~~~~W 282 (352)
-... ...++.||.....|..+...... +..+ + .+..++.+|++||... .-.+.||+..+.=
T Consensus 396 ~~~~e~~f~GLYaf~~~~~~w~~l~e~~~~~~~vvE~~~sR~ghcmE~~~~n~~ly~fggq~s~~El~L~f~y~I~~E~~ 475 (723)
T KOG2437|consen 396 LTCNEPQFSGLYAFNCQCQTWKLLREDSCNAGPVVEDIQSRIGHCMEFHSKNRCLYVFGGQRSKTELNLFFSYDIDSEHV 475 (723)
T ss_pred ccCCCccccceEEEecCCccHHHHHHHHhhcCcchhHHHHHHHHHHHhcCCCCeEEeccCcccceEEeehhcceeccccc
Confidence 4333 34899999999999877431111 0000 1 2345778999998665 2235565554443
Q ss_pred EEccCC--------CCCcCCCCceEEEeCCEEEEEcC------------CceEEEEEcCCccccc
Q 045821 283 SPVGRL--------STLLTRPPCKLVAIGKTIFVIGK------------GCSAVVIDVGNIGNIG 327 (352)
Q Consensus 283 ~~~~~~--------~~~~~~~~~~~~~~~~~i~v~gG------------~~~~~~~d~~~~~w~~ 327 (352)
..+... |.........+-.-...|.+.-| +++++||++.++.|..
T Consensus 476 ~~~s~~~k~dsS~~pS~~f~qRs~~dp~~~~i~~~~G~~~~~~~~e~~~rns~wi~~i~~~~w~c 540 (723)
T KOG2437|consen 476 DIISDGTKKDSSMVPSTGFTQRATIDPELNEIHVLSGLSKDKEKREENVRNSFWIYDIVRNSWSC 540 (723)
T ss_pred hhhhccCcCccccCCCcchhhhcccCCCCcchhhhcccchhccCccccccCcEEEEEecccchhh
Confidence 333211 11001111111122344554433 3368889999998863
No 30
>PF13964 Kelch_6: Kelch motif
Probab=99.28 E-value=1.4e-11 Score=77.67 Aligned_cols=50 Identities=36% Similarity=0.605 Sum_probs=45.6
Q ss_pred CCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcc
Q 045821 109 RRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTAR 158 (352)
Q Consensus 109 ~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r 158 (352)
+|.++++++++++||++||........+++++||+.+++|+.+++||.||
T Consensus 1 pR~~~s~v~~~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR 50 (50)
T PF13964_consen 1 PRYGHSAVVVGGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR 50 (50)
T ss_pred CCccCEEEEECCEEEEECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence 47889999999999999998775577899999999999999999999887
No 31
>KOG2437 consensus Muskelin [Signal transduction mechanisms]
Probab=99.25 E-value=5.9e-12 Score=111.83 Aligned_cols=239 Identities=16% Similarity=0.174 Sum_probs=156.9
Q ss_pred hhhcccccCh-------hhHHHHhhCCCC----CCEEEEEEecCCCCce-eEEEecCCCCCCCeeEeccCCCcccCCcce
Q 045821 46 SRWRGLLHSE-------EWCAYRRKHNLD----ETWIYAFCRDNKLERV-CCYVLDPNSTRRSWKMIHELPARTLRRKGM 113 (352)
Q Consensus 46 k~W~~l~~s~-------~~~~~~~~~~~~----~~~l~~~gg~~~~~~~-~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~ 113 (352)
-+|..++... ..+..|..|.+. ++-||+.||.+..... .+|.|+- ..+.|..+..-...|..|.+|
T Consensus 239 ~~W~~i~~~~~~~~~~~~~p~~RgGHQMV~~~~~~CiYLYGGWdG~~~l~DFW~Y~v--~e~~W~~iN~~t~~PG~RsCH 316 (723)
T KOG2437|consen 239 PRWSQIIPKSTKGDGEDNRPGMRGGHQMVIDVQTECVYLYGGWDGTQDLADFWAYSV--KENQWTCINRDTEGPGARSCH 316 (723)
T ss_pred ccccccCchhhcccccccCccccCcceEEEeCCCcEEEEecCcccchhHHHHHhhcC--CcceeEEeecCCCCCcchhhh
Confidence 3577665433 234456666655 5689999999766654 5889988 889999886544334778888
Q ss_pred EEEEECC--EEEEEeCCCCC-----CCCCCeEEEEECCCCeeEecCC------CCCcceeeeeEEECCE--EEEEeCCCC
Q 045821 114 GFEVLGK--NAYLLGGCGWS-----EDATSEVYCYDASMNTWTDAAP------MSTARCYFPCGVLNQK--IYCIGGLGD 178 (352)
Q Consensus 114 ~~~~~~~--~iyv~GG~~~~-----~~~~~~~~~~d~~t~~W~~~~~------~~~~r~~~~~~~~~~~--iyv~gG~~~ 178 (352)
..+.... ++|+.|-+-+. ....+++|.||..++.|..+.- -|...+.|.+++.+.+ |||+||+.-
T Consensus 317 RMVid~S~~KLYLlG~Y~~sS~r~~~s~RsDfW~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd~~k~~iyVfGGr~~ 396 (723)
T KOG2437|consen 317 RMVIDISRRKLYLLGRYLDSSVRNSKSLRSDFWRFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVDSEKHMIYVFGGRIL 396 (723)
T ss_pred hhhhhhhHhHHhhhhhccccccccccccccceEEEecCCceeEEecccccccCCcceeecceeeEecCcceEEEecCeec
Confidence 8887765 89999976432 2456799999999999998852 2344667888888766 999999753
Q ss_pred C------CceEEEeCCCCcEEEecCC----------CCcccce--eEEEECCEEEEEccCCCCCCceEEEEEeCCCCcee
Q 045821 179 T------HSWDVYDPRTNNWKLHTEP----------NIFTEIE--DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWL 240 (352)
Q Consensus 179 ~------~~v~~yd~~t~~W~~~~~~----------~~~~~~~--~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~ 240 (352)
. ...+.||.....|..+..- ...+-++ -++.-+..+|++||.......+-...||+....=.
T Consensus 397 ~~~e~~f~GLYaf~~~~~~w~~l~e~~~~~~~vvE~~~sR~ghcmE~~~~n~~ly~fggq~s~~El~L~f~y~I~~E~~~ 476 (723)
T KOG2437|consen 397 TCNEPQFSGLYAFNCQCQTWKLLREDSCNAGPVVEDIQSRIGHCMEFHSKNRCLYVFGGQRSKTELNLFFSYDIDSEHVD 476 (723)
T ss_pred cCCCccccceEEEecCCccHHHHHHHHhhcCcchhHHHHHHHHHHHhcCCCCeEEeccCcccceEEeehhcceeccccch
Confidence 3 7889999999999876521 0111111 13445778999988776666666677766444332
Q ss_pred ecCC-------CCCCCCCC-cEE--EECCEEEEEecCCC----------CeEEEEeCCCCceEEcc
Q 045821 241 HADA-------NMASGWRG-PAV--VVDDALYVLDQSSG----------TKLMMWQKESREWSPVG 286 (352)
Q Consensus 241 ~~~~-------~~~~~~~~-~~~--~~~~~l~~~gg~~~----------~~v~~yd~~~~~W~~~~ 286 (352)
.+.. ..+.++.. -++ .-...|...-|... +..|+|+..++.|..+.
T Consensus 477 ~~s~~~k~dsS~~pS~~f~qRs~~dp~~~~i~~~~G~~~~~~~~e~~~rns~wi~~i~~~~w~cI~ 542 (723)
T KOG2437|consen 477 IISDGTKKDSSMVPSTGFTQRATIDPELNEIHVLSGLSKDKEKREENVRNSFWIYDIVRNSWSCIY 542 (723)
T ss_pred hhhccCcCccccCCCcchhhhcccCCCCcchhhhcccchhccCccccccCcEEEEEecccchhhHh
Confidence 2211 01111110 011 12345555555322 57899999999998774
No 32
>PF13964 Kelch_6: Kelch motif
Probab=99.15 E-value=1.1e-10 Score=73.51 Aligned_cols=46 Identities=28% Similarity=0.463 Sum_probs=41.7
Q ss_pred cceeeeeEEECCEEEEEeCCCC-C---CceEEEeCCCCcEEEecCCCCcc
Q 045821 157 ARCYFPCGVLNQKIYCIGGLGD-T---HSWDVYDPRTNNWKLHTEPNIFT 202 (352)
Q Consensus 157 ~r~~~~~~~~~~~iyv~gG~~~-~---~~v~~yd~~t~~W~~~~~~~~~~ 202 (352)
||..|+++.++++|||+||... . +++++||++|++|+++++||.+|
T Consensus 1 pR~~~s~v~~~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR 50 (50)
T PF13964_consen 1 PRYGHSAVVVGGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR 50 (50)
T ss_pred CCccCEEEEECCEEEEECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence 6889999999999999999877 2 99999999999999999888654
No 33
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=99.08 E-value=8.4e-08 Score=81.68 Aligned_cols=188 Identities=12% Similarity=0.109 Sum_probs=113.5
Q ss_pred CeEEEEECCCCeeEecCCCCCcce---e-eeeEEEC-----CEEEEEeCCC---CCCceEEEeCCCCcEEEecCCCCcc-
Q 045821 136 SEVYCYDASMNTWTDAAPMSTARC---Y-FPCGVLN-----QKIYCIGGLG---DTHSWDVYDPRTNNWKLHTEPNIFT- 202 (352)
Q Consensus 136 ~~~~~~d~~t~~W~~~~~~~~~r~---~-~~~~~~~-----~~iyv~gG~~---~~~~v~~yd~~t~~W~~~~~~~~~~- 202 (352)
..+.++||.|++|..+|+.+.++. . .....++ -+|..+.... ....+++|+..+++|+.+...+...
T Consensus 14 ~~~~V~NP~T~~~~~LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~~~~~~~~~~~Vys~~~~~Wr~~~~~~~~~~ 93 (230)
T TIGR01640 14 KRLVVWNPSTGQSRWLPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDRSGNRNQSEHQVYTLGSNSWRTIECSPPHHP 93 (230)
T ss_pred CcEEEECCCCCCEEecCCCCCcccccccceEEEeecccCCcEEEEEEEeecCCCCCccEEEEEeCCCCccccccCCCCcc
Confidence 568999999999999986543211 1 0111122 2555554321 1168899999999999987433221
Q ss_pred cceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceee-cCCCCCCCC---CCcEEEECCEEEEEecCCC---CeEEEE
Q 045821 203 EIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLH-ADANMASGW---RGPAVVVDDALYVLDQSSG---TKLMMW 275 (352)
Q Consensus 203 ~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~-~~~~~~~~~---~~~~~~~~~~l~~~gg~~~---~~v~~y 275 (352)
.....+.++|.||-+...........|.+||..+++|.+ ++.+..... ....+..+|+|.++..... -+||+.
T Consensus 94 ~~~~~v~~~G~lyw~~~~~~~~~~~~IvsFDl~~E~f~~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~~~~IWvl 173 (230)
T TIGR01640 94 LKSRGVCINGVLYYLAYTLKTNPDYFIVSFDVSSERFKEFIPLPCGNSDSVDYLSLINYKGKLAVLKQKKDTNNFDLWVL 173 (230)
T ss_pred ccCCeEEECCEEEEEEEECCCCCcEEEEEEEcccceEeeeeecCccccccccceEEEEECCEEEEEEecCCCCcEEEEEE
Confidence 122367889999998643321222379999999999995 654332211 1223567899998865432 256666
Q ss_pred e-CCCCceEEccCCCCC-c--CCC--CceEEEeCCEEEEEcCC--ce-EEEEEcCCc
Q 045821 276 Q-KESREWSPVGRLSTL-L--TRP--PCKLVAIGKTIFVIGKG--CS-AVVIDVGNI 323 (352)
Q Consensus 276 d-~~~~~W~~~~~~~~~-~--~~~--~~~~~~~~~~i~v~gG~--~~-~~~~d~~~~ 323 (352)
+ .+.+.|+++-.++.. . ... ....+..+++|++.... .. ..+||++++
T Consensus 174 ~d~~~~~W~k~~~i~~~~~~~~~~~~~~~~~~~~g~I~~~~~~~~~~~~~~y~~~~~ 230 (230)
T TIGR01640 174 NDAGKQEWSKLFTVPIPPLPDLVDDNFLSGFTDKGEIVLCCEDENPFYIFYYNVGEN 230 (230)
T ss_pred CCCCCCceeEEEEEcCcchhhhhhheeEeEEeeCCEEEEEeCCCCceEEEEEeccCC
Confidence 4 446679976544421 1 111 12345567888887653 23 667999874
No 34
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=99.02 E-value=8.4e-10 Score=69.12 Aligned_cols=48 Identities=25% Similarity=0.523 Sum_probs=42.9
Q ss_pred CCEEEEEeCCC-CCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE
Q 045821 119 GKNAYLLGGCG-WSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL 166 (352)
Q Consensus 119 ~~~iyv~GG~~-~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~ 166 (352)
++++||+||.+ ......+++|+||+.+++|+.++++|.||..|+++++
T Consensus 1 g~~~~vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~i 49 (49)
T PF13415_consen 1 GNKLYVFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATVI 49 (49)
T ss_pred CCEEEEECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEEC
Confidence 57899999988 4457789999999999999999999999999998864
No 35
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=99.00 E-value=4.8e-10 Score=69.71 Aligned_cols=47 Identities=34% Similarity=0.663 Sum_probs=42.6
Q ss_pred CCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC
Q 045821 109 RRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS 155 (352)
Q Consensus 109 ~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~ 155 (352)
+|.++++++++++||++||.+......+++++||+.+++|+.+++||
T Consensus 1 pR~~~~~~~~~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp 47 (47)
T PF01344_consen 1 PRSGHAAVVVGNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPPMP 47 (47)
T ss_dssp -BBSEEEEEETTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred CCccCEEEEECCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence 47889999999999999999886688999999999999999999876
No 36
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=98.96 E-value=7.4e-10 Score=68.84 Aligned_cols=43 Identities=37% Similarity=0.612 Sum_probs=38.8
Q ss_pred cceeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEecCCC
Q 045821 157 ARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHTEPN 199 (352)
Q Consensus 157 ~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~~~~ 199 (352)
||..|++++++++||++||.... +++++||+.+++|+.+++||
T Consensus 1 pR~~~~~~~~~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp 47 (47)
T PF01344_consen 1 PRSGHAAVVVGNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPPMP 47 (47)
T ss_dssp -BBSEEEEEETTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred CCccCEEEEECCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence 68899999999999999998873 99999999999999998765
No 37
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=98.85 E-value=1.3e-08 Score=63.73 Aligned_cols=47 Identities=30% Similarity=0.636 Sum_probs=41.1
Q ss_pred CCcceEEEEECCEEEEEeCC--CCCCCCCCeEEEEECCCCeeEecCCCC
Q 045821 109 RRKGMGFEVLGKNAYLLGGC--GWSEDATSEVYCYDASMNTWTDAAPMS 155 (352)
Q Consensus 109 ~~~~~~~~~~~~~iyv~GG~--~~~~~~~~~~~~~d~~t~~W~~~~~~~ 155 (352)
+|.+|++++++++||++||. .......+++++||+.+++|+.+++++
T Consensus 1 ~r~~hs~~~~~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g 49 (49)
T PF07646_consen 1 PRYGHSAVVLDGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG 49 (49)
T ss_pred CccceEEEEECCEEEEECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence 47889999999999999999 444467899999999999999998875
No 38
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=98.83 E-value=3.6e-09 Score=66.32 Aligned_cols=47 Identities=26% Similarity=0.499 Sum_probs=31.5
Q ss_pred CCcceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC
Q 045821 109 RRKGMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS 155 (352)
Q Consensus 109 ~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~ 155 (352)
+|.+|+++.+ ++.|||+||.+......+++|+||+.+++|+.++++|
T Consensus 1 pR~~h~~~~~~~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P 48 (49)
T PF13418_consen 1 PRYGHSAVSIGDNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP 48 (49)
T ss_dssp --BS-EEEEE-TTEEEEE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred CcceEEEEEEeCCeEEEECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence 4788999988 5899999998876568899999999999999998876
No 39
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=98.81 E-value=1.2e-08 Score=63.76 Aligned_cols=43 Identities=30% Similarity=0.468 Sum_probs=37.7
Q ss_pred cceeeeeEEECCEEEEEeCC--CCC----CceEEEeCCCCcEEEecCCC
Q 045821 157 ARCYFPCGVLNQKIYCIGGL--GDT----HSWDVYDPRTNNWKLHTEPN 199 (352)
Q Consensus 157 ~r~~~~~~~~~~~iyv~gG~--~~~----~~v~~yd~~t~~W~~~~~~~ 199 (352)
||..|++++++++|||+||+ ... +++++||+++++|+.+++++
T Consensus 1 ~r~~hs~~~~~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g 49 (49)
T PF07646_consen 1 PRYGHSAVVLDGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG 49 (49)
T ss_pred CccceEEEEECCEEEEECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence 68899999999999999999 222 99999999999999998654
No 40
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=98.80 E-value=5.7e-06 Score=70.39 Aligned_cols=194 Identities=9% Similarity=0.057 Sum_probs=114.4
Q ss_pred eeEEEecCCCCCCCeeEeccCCCcccCCcc--eEEEE--E-CC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCC
Q 045821 83 VCCYVLDPNSTRRSWKMIHELPARTLRRKG--MGFEV--L-GK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMST 156 (352)
Q Consensus 83 ~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~--~~~~~--~-~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~ 156 (352)
...+++|| .|+.|..+|+.+........ .+... . +. +|..+...... .....+++|+..+++|+.+...+.
T Consensus 14 ~~~~V~NP--~T~~~~~LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~~~~-~~~~~~~Vys~~~~~Wr~~~~~~~ 90 (230)
T TIGR01640 14 KRLVVWNP--STGQSRWLPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDRSGN-RNQSEHQVYTLGSNSWRTIECSPP 90 (230)
T ss_pred CcEEEECC--CCCCEEecCCCCCcccccccceEEEeecccCCcEEEEEEEeecCC-CCCccEEEEEeCCCCccccccCCC
Confidence 35778899 99999999865421010111 11111 1 11 44444332111 234578999999999999874332
Q ss_pred c-ceeeeeEEECCEEEEEeCCCC-C--CceEEEeCCCCcEEE-ecCCCCcc---cceeEEEECCEEEEEccCCCCCCceE
Q 045821 157 A-RCYFPCGVLNQKIYCIGGLGD-T--HSWDVYDPRTNNWKL-HTEPNIFT---EIEDSFVMDGKIYIRCSASAATSHVC 228 (352)
Q Consensus 157 ~-r~~~~~~~~~~~iyv~gG~~~-~--~~v~~yd~~t~~W~~-~~~~~~~~---~~~~~~~~~g~iyv~GG~~~~~~~~~ 228 (352)
. ......+.++|.+|.+..... . ..+..||..+++|+. ++.|.... .....+.++|+|.++.... ......
T Consensus 91 ~~~~~~~~v~~~G~lyw~~~~~~~~~~~~IvsFDl~~E~f~~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~-~~~~~~ 169 (230)
T TIGR01640 91 HHPLKSRGVCINGVLYYLAYTLKTNPDYFIVSFDVSSERFKEFIPLPCGNSDSVDYLSLINYKGKLAVLKQKK-DTNNFD 169 (230)
T ss_pred CccccCCeEEECCEEEEEEEECCCCCcEEEEEEEcccceEeeeeecCccccccccceEEEEECCEEEEEEecC-CCCcEE
Confidence 2 112236778999999875432 1 268999999999995 66433221 1234677799999885432 223357
Q ss_pred EEEEe-CCCCceeec---CCCCCCCCC----CcEEEECCEEEEEecCC-CCeEEEEeCCCC
Q 045821 229 ALVYE-PSTDSWLHA---DANMASGWR----GPAVVVDDALYVLDQSS-GTKLMMWQKESR 280 (352)
Q Consensus 229 i~~yd-~~~~~W~~~---~~~~~~~~~----~~~~~~~~~l~~~gg~~-~~~v~~yd~~~~ 280 (352)
||+.+ .....|+++ +.+...... ...+..+++|++..... +..+..||++++
T Consensus 170 IWvl~d~~~~~W~k~~~i~~~~~~~~~~~~~~~~~~~~g~I~~~~~~~~~~~~~~y~~~~~ 230 (230)
T TIGR01640 170 LWVLNDAGKQEWSKLFTVPIPPLPDLVDDNFLSGFTDKGEIVLCCEDENPFYIFYYNVGEN 230 (230)
T ss_pred EEEECCCCCCceeEEEEEcCcchhhhhhheeEeEEeeCCEEEEEeCCCCceEEEEEeccCC
Confidence 88875 345679864 211111111 12356688888876532 224999999875
No 41
>smart00612 Kelch Kelch domain.
Probab=98.74 E-value=1.9e-08 Score=62.32 Aligned_cols=47 Identities=30% Similarity=0.485 Sum_probs=41.1
Q ss_pred EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECC
Q 045821 121 NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ 168 (352)
Q Consensus 121 ~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~ 168 (352)
+||++||.... ...+++++||+.+++|+.+++|+.+|..++++.+++
T Consensus 1 ~iyv~GG~~~~-~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~g 47 (47)
T smart00612 1 KIYVVGGFDGG-QRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47 (47)
T ss_pred CEEEEeCCCCC-ceeeeEEEECCCCCeEccCCCCCCccccceEEEeCC
Confidence 48999998653 567899999999999999999999999999888764
No 42
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=98.74 E-value=1.3e-08 Score=63.80 Aligned_cols=43 Identities=21% Similarity=0.383 Sum_probs=29.3
Q ss_pred cceeeeeEEE-CCEEEEEeCCCCC----CceEEEeCCCCcEEEecCCC
Q 045821 157 ARCYFPCGVL-NQKIYCIGGLGDT----HSWDVYDPRTNNWKLHTEPN 199 (352)
Q Consensus 157 ~r~~~~~~~~-~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~~~~ 199 (352)
||..|+++.+ +++||++||.... +++++||+++++|++++++|
T Consensus 1 pR~~h~~~~~~~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P 48 (49)
T PF13418_consen 1 PRYGHSAVSIGDNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP 48 (49)
T ss_dssp --BS-EEEEE-TTEEEEE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred CcceEEEEEEeCCeEEEECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence 6899999998 5899999998764 89999999999999997665
No 43
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=98.66 E-value=3.5e-06 Score=70.98 Aligned_cols=149 Identities=12% Similarity=0.195 Sum_probs=97.2
Q ss_pred CeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCC----CcEEEec-CCCCcccceeEEE
Q 045821 136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRT----NNWKLHT-EPNIFTEIEDSFV 209 (352)
Q Consensus 136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t----~~W~~~~-~~~~~~~~~~~~~ 209 (352)
.....||+.+++++.+.-....-+...+..-||++.+.||.... ..+-.|++.+ ..|.+.+ .|...|.+..+..
T Consensus 46 a~s~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~G~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~ 125 (243)
T PF07250_consen 46 AHSVEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDNDGNKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATT 125 (243)
T ss_pred EEEEEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCccccceEEEecCCCCCCCCceECcccccCCCccccceE
Confidence 34567999999999886432222222222238999999998665 7788898876 5798776 4777888887776
Q ss_pred E-CCEEEEEccCCCCCCceEEEEEeCCC-----CceeecCC---CCCCCCCCcE-EEECCEEEEEecCCCCeEEEEeCCC
Q 045821 210 M-DGKIYIRCSASAATSHVCALVYEPST-----DSWLHADA---NMASGWRGPA-VVVDDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 210 ~-~g~iyv~GG~~~~~~~~~i~~yd~~~-----~~W~~~~~---~~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
+ ||+++|+||.... ..+.+.... ..|..+.. ..+...+... +.-+|+||+++.. .-.+||+.+
T Consensus 126 L~DG~vlIvGG~~~~----t~E~~P~~~~~~~~~~~~~l~~~~~~~~~nlYP~~~llPdG~lFi~an~---~s~i~d~~~ 198 (243)
T PF07250_consen 126 LPDGRVLIVGGSNNP----TYEFWPPKGPGPGPVTLPFLSQTSDTLPNNLYPFVHLLPDGNLFIFANR---GSIIYDYKT 198 (243)
T ss_pred CCCCCEEEEeCcCCC----cccccCCccCCCCceeeecchhhhccCccccCceEEEcCCCCEEEEEcC---CcEEEeCCC
Confidence 6 8999999987721 223333211 11222211 1112222222 4569999999986 678889999
Q ss_pred Cce-EEccCCCCC
Q 045821 280 REW-SPVGRLSTL 291 (352)
Q Consensus 280 ~~W-~~~~~~~~~ 291 (352)
+++ .+++.+|..
T Consensus 199 n~v~~~lP~lPg~ 211 (243)
T PF07250_consen 199 NTVVRTLPDLPGG 211 (243)
T ss_pred CeEEeeCCCCCCC
Confidence 987 788888864
No 44
>PF13415 Kelch_3: Galactose oxidase, central domain
Probab=98.60 E-value=1.4e-07 Score=58.94 Aligned_cols=43 Identities=23% Similarity=0.309 Sum_probs=37.8
Q ss_pred CCEEEEEeCCC--CC---CceEEEeCCCCcEEEecCCCCcccceeEEE
Q 045821 167 NQKIYCIGGLG--DT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFV 209 (352)
Q Consensus 167 ~~~iyv~gG~~--~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~ 209 (352)
+++|||+||.. .. +++++||+.+++|++++++|.+|..|++++
T Consensus 1 g~~~~vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~ 48 (49)
T PF13415_consen 1 GNKLYVFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATV 48 (49)
T ss_pred CCEEEEECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEE
Confidence 57899999988 22 999999999999999998999888887765
No 45
>smart00612 Kelch Kelch domain.
Probab=98.54 E-value=1.5e-07 Score=58.19 Aligned_cols=44 Identities=34% Similarity=0.579 Sum_probs=38.0
Q ss_pred EEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821 169 KIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG 212 (352)
Q Consensus 169 ~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g 212 (352)
+||++||.... +++++||+.+++|+.+++|+.++..+++++++|
T Consensus 1 ~iyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~g 47 (47)
T smart00612 1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING 47 (47)
T ss_pred CEEEEeCCCCCceeeeEEEECCCCCeEccCCCCCCccccceEEEeCC
Confidence 48999998653 889999999999999999999888888777764
No 46
>PLN02772 guanylate kinase
Probab=98.46 E-value=1.6e-06 Score=77.63 Aligned_cols=83 Identities=17% Similarity=0.177 Sum_probs=68.8
Q ss_pred cCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecC---CCCCcceeeeeEEE-CCEEEEEeCCCCC-Cce
Q 045821 108 LRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAA---PMSTARCYFPCGVL-NQKIYCIGGLGDT-HSW 182 (352)
Q Consensus 108 ~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~---~~~~~r~~~~~~~~-~~~iyv~gG~~~~-~~v 182 (352)
.++..++++.+++++||+||.+......+.+++||..|++|..-. ..|.||.+|+++++ +++|+|+++.... .++
T Consensus 23 ~~~~~~tav~igdk~yv~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~~~~~~~~~~ 102 (398)
T PLN02772 23 KPKNRETSVTIGDKTYVIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIKKGSAPDDSI 102 (398)
T ss_pred CCCCcceeEEECCEEEEEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEECCceEEEEeCCCCCccce
Confidence 367889999999999999998775446789999999999998654 56789999999999 5799999976654 777
Q ss_pred EEEeCCCC
Q 045821 183 DVYDPRTN 190 (352)
Q Consensus 183 ~~yd~~t~ 190 (352)
|.....|.
T Consensus 103 w~l~~~t~ 110 (398)
T PLN02772 103 WFLEVDTP 110 (398)
T ss_pred EEEEcCCH
Confidence 77776654
No 47
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=98.35 E-value=3.9e-05 Score=64.72 Aligned_cols=141 Identities=14% Similarity=0.213 Sum_probs=91.1
Q ss_pred EEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCC----CceeecCC-CCCCCCCCcEE-E
Q 045821 183 DVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPST----DSWLHADA-NMASGWRGPAV-V 256 (352)
Q Consensus 183 ~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~----~~W~~~~~-~~~~~~~~~~~-~ 256 (352)
..||+.|++++.+......-+..++..-||++.+.||... ....+-.|++.+ ..|.+... ....+|+..+. .
T Consensus 49 ~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~--G~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~L 126 (243)
T PF07250_consen 49 VEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDND--GNKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATTL 126 (243)
T ss_pred EEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCc--cccceEEEecCCCCCCCCceECcccccCCCccccceEC
Confidence 5799999999988643333333344555899999988764 334566787765 67988865 44556666654 4
Q ss_pred ECCEEEEEecCCCCeEEEEeCC-C--C---ceEEccCCCC--CcCCCCceEEEeCCEEEEEcCCceEEEEEcCCccccc
Q 045821 257 VDDALYVLDQSSGTKLMMWQKE-S--R---EWSPVGRLST--LLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIG 327 (352)
Q Consensus 257 ~~~~l~~~gg~~~~~v~~yd~~-~--~---~W~~~~~~~~--~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~ 327 (352)
-+|+++++||..+ ..+.|-|. . . .|..+..... ....+-+..+.-+++|++++.. +..+||...+.+.+
T Consensus 127 ~DG~vlIvGG~~~-~t~E~~P~~~~~~~~~~~~~l~~~~~~~~~nlYP~~~llPdG~lFi~an~-~s~i~d~~~n~v~~ 203 (243)
T PF07250_consen 127 PDGRVLIVGGSNN-PTYEFWPPKGPGPGPVTLPFLSQTSDTLPNNLYPFVHLLPDGNLFIFANR-GSIIYDYKTNTVVR 203 (243)
T ss_pred CCCCEEEEeCcCC-CcccccCCccCCCCceeeecchhhhccCccccCceEEEcCCCCEEEEEcC-CcEEEeCCCCeEEe
Confidence 5899999999876 33444443 2 1 1222222111 1123334555679999999976 57789999986533
No 48
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=98.27 E-value=0.0016 Score=58.49 Aligned_cols=38 Identities=16% Similarity=0.287 Sum_probs=32.9
Q ss_pred CccCCCcHHHHHHhhccCc-cccchhhhhhhhhhccccc
Q 045821 16 PLISGLPDDIALFCLARVP-RRYHAVLKCVSSRWRGLLH 53 (352)
Q Consensus 16 ~~~~~lp~dl~~~il~r~p-~~~~~~l~~v~k~W~~l~~ 53 (352)
..|++||+||+..|..|+| ..++.++++||+.|++-..
T Consensus 2 ~~Ws~Lp~dll~~i~~~l~~~~d~~~~~~vC~sWr~a~~ 40 (373)
T PLN03215 2 ADWSTLPEELLHMIAGRLFSNVELKRFRSICRSWRSSVS 40 (373)
T ss_pred CChhhCCHHHHHHHHhhCCcHHHHHHHHhhhhhHHHhcc
Confidence 3689999999999999995 5679999999999996544
No 49
>PF13854 Kelch_5: Kelch motif
Probab=98.25 E-value=3.1e-06 Score=50.76 Aligned_cols=38 Identities=26% Similarity=0.489 Sum_probs=33.9
Q ss_pred cCCcceEEEEECCEEEEEeCCCC-CCCCCCeEEEEECCC
Q 045821 108 LRRKGMGFEVLGKNAYLLGGCGW-SEDATSEVYCYDASM 145 (352)
Q Consensus 108 ~~~~~~~~~~~~~~iyv~GG~~~-~~~~~~~~~~~d~~t 145 (352)
.+|..|+++.++++||++||... .....+++|++|..+
T Consensus 3 ~~R~~hs~~~~~~~iyi~GG~~~~~~~~~~d~~~l~l~s 41 (42)
T PF13854_consen 3 SPRYGHSAVVVGNNIYIFGGYSGNNNSYSNDLYVLDLPS 41 (42)
T ss_pred CCccceEEEEECCEEEEEcCccCCCCCEECcEEEEECCC
Confidence 68999999999999999999984 557789999999876
No 50
>PLN02772 guanylate kinase
Probab=98.22 E-value=9.8e-06 Score=72.68 Aligned_cols=80 Identities=20% Similarity=0.191 Sum_probs=62.7
Q ss_pred CcceeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEec---CCCCcccceeEEEE-CCEEEEEccCCCCCCce
Q 045821 156 TARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHT---EPNIFTEIEDSFVM-DGKIYIRCSASAATSHV 227 (352)
Q Consensus 156 ~~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~---~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~ 227 (352)
.|+..++++.+++++||+||..+. +.+++||..|++|.... ..|.++.+|+++++ +++|+|++++... ..
T Consensus 23 ~~~~~~tav~igdk~yv~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~~~~~~--~~ 100 (398)
T PLN02772 23 KPKNRETSVTIGDKTYVIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIKKGSAP--DD 100 (398)
T ss_pred CCCCcceeEEECCEEEEEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEECCceEEEEeCCCCC--cc
Confidence 478889999999999999997654 78999999999998764 46777888888877 6899999755432 24
Q ss_pred EEEEEeCCCC
Q 045821 228 CALVYEPSTD 237 (352)
Q Consensus 228 ~i~~yd~~~~ 237 (352)
++|.....|.
T Consensus 101 ~~w~l~~~t~ 110 (398)
T PLN02772 101 SIWFLEVDTP 110 (398)
T ss_pred ceEEEEcCCH
Confidence 6776665443
No 51
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=98.14 E-value=0.0069 Score=55.97 Aligned_cols=228 Identities=13% Similarity=0.150 Sum_probs=130.7
Q ss_pred CCCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcc-----cCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEE
Q 045821 67 DETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPART-----LRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCY 141 (352)
Q Consensus 67 ~~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~-----~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~ 141 (352)
.++.+|+.... -.++.||..+-...|+.-..-.... ......+.+..++.||+.+. ...++.+
T Consensus 68 ~~~~vy~~~~~-----g~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~~~~v~v~~~-------~g~l~al 135 (394)
T PRK11138 68 AYNKVYAADRA-----GLVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVAGGKVYIGSE-------KGQVYAL 135 (394)
T ss_pred ECCEEEEECCC-----CeEEEEECCCCcEeeEEcCCCcccccccccccccccccEEECCEEEEEcC-------CCEEEEE
Confidence 38888887432 2467778733345576432211000 01122334566888887532 3568999
Q ss_pred ECCCCe--eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc--EEEecCCCCc--ccceeEEEECCEEE
Q 045821 142 DASMNT--WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN--WKLHTEPNIF--TEIEDSFVMDGKIY 215 (352)
Q Consensus 142 d~~t~~--W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~~~--~~~~~~~~~~g~iy 215 (352)
|..+++ |+.-.. .+ ...+.++.++.+|+..+. ..++.+|+.+++ |+.-...+.. +....-++.++.+|
T Consensus 136 d~~tG~~~W~~~~~--~~-~~ssP~v~~~~v~v~~~~---g~l~ald~~tG~~~W~~~~~~~~~~~~~~~sP~v~~~~v~ 209 (394)
T PRK11138 136 NAEDGEVAWQTKVA--GE-ALSRPVVSDGLVLVHTSN---GMLQALNESDGAVKWTVNLDVPSLTLRGESAPATAFGGAI 209 (394)
T ss_pred ECCCCCCcccccCC--Cc-eecCCEEECCEEEEECCC---CEEEEEEccCCCEeeeecCCCCcccccCCCCCEEECCEEE
Confidence 998876 765322 11 223445568888886542 568999998875 8764332211 11123345577777
Q ss_pred EEccCCCCCCceEEEEEeCCCC--ceeec-CCCCCC-------CCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc--eE
Q 045821 216 IRCSASAATSHVCALVYEPSTD--SWLHA-DANMAS-------GWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE--WS 283 (352)
Q Consensus 216 v~GG~~~~~~~~~i~~yd~~~~--~W~~~-~~~~~~-------~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~--W~ 283 (352)
+..+ . ..+..+|..++ .|+.- ..+... ......++.++.+|+.+. .+ .++.+|+.+++ |+
T Consensus 210 ~~~~-~-----g~v~a~d~~~G~~~W~~~~~~~~~~~~~~~~~~~~~sP~v~~~~vy~~~~-~g-~l~ald~~tG~~~W~ 281 (394)
T PRK11138 210 VGGD-N-----GRVSAVLMEQGQLIWQQRISQPTGATEIDRLVDVDTTPVVVGGVVYALAY-NG-NLVALDLRSGQIVWK 281 (394)
T ss_pred EEcC-C-----CEEEEEEccCChhhheeccccCCCccchhcccccCCCcEEECCEEEEEEc-CC-eEEEEECCCCCEEEe
Confidence 7532 2 24777887765 48642 111100 011223566888887663 33 89999998765 86
Q ss_pred EccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCc--cccc
Q 045821 284 PVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNI--GNIG 327 (352)
Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~--~w~~ 327 (352)
.- ... ...++..++.||+.......+.+|+++. .|..
T Consensus 282 ~~--~~~-----~~~~~~~~~~vy~~~~~g~l~ald~~tG~~~W~~ 320 (394)
T PRK11138 282 RE--YGS-----VNDFAVDGGRIYLVDQNDRVYALDTRGGVELWSQ 320 (394)
T ss_pred ec--CCC-----ccCcEEECCEEEEEcCCCeEEEEECCCCcEEEcc
Confidence 52 111 1134567899999887767778888643 3643
No 52
>PF12937 F-box-like: F-box-like; PDB: 1P22_A 2OVP_B 2OVR_B 2OVQ_B 1FS1_A 1FS2_C 1FQV_I 1LDK_E 2AST_B 2ASS_B.
Probab=98.13 E-value=9.2e-07 Score=54.65 Aligned_cols=40 Identities=33% Similarity=0.483 Sum_probs=34.4
Q ss_pred cCCCcHHHHHHhhccCccccchhhhhhhhhhcccccChhh
Q 045821 18 ISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEW 57 (352)
Q Consensus 18 ~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~ 57 (352)
|..||+|++.+|++.++..++.++..|||+|+.++.++.+
T Consensus 1 i~~LP~Eil~~If~~L~~~dl~~~~~vcr~w~~~~~~~~l 40 (47)
T PF12937_consen 1 ISSLPDEILLEIFSYLDPRDLLRLSLVCRRWRRIANDNSL 40 (47)
T ss_dssp CCCS-HHHHHHHHTTS-HHHHHHHTTSSHHHHHHHTCCCH
T ss_pred ChHhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHCChhh
Confidence 5689999999999999999999999999999999876644
No 53
>PF13854 Kelch_5: Kelch motif
Probab=98.12 E-value=8.1e-06 Score=48.93 Aligned_cols=36 Identities=22% Similarity=0.372 Sum_probs=31.9
Q ss_pred CCCcceeeeeEEECCEEEEEeCCCC--C---CceEEEeCCC
Q 045821 154 MSTARCYFPCGVLNQKIYCIGGLGD--T---HSWDVYDPRT 189 (352)
Q Consensus 154 ~~~~r~~~~~~~~~~~iyv~gG~~~--~---~~v~~yd~~t 189 (352)
+|.+|..|+++.++++||++||... . +++++||+.+
T Consensus 1 ~P~~R~~hs~~~~~~~iyi~GG~~~~~~~~~~d~~~l~l~s 41 (42)
T PF13854_consen 1 IPSPRYGHSAVVVGNNIYIFGGYSGNNNSYSNDLYVLDLPS 41 (42)
T ss_pred CCCCccceEEEEECCEEEEEcCccCCCCCEECcEEEEECCC
Confidence 4789999999999999999999883 2 9999999876
No 54
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=98.04 E-value=0.0066 Score=54.75 Aligned_cols=119 Identities=11% Similarity=0.253 Sum_probs=82.9
Q ss_pred EEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC--C------ceEE
Q 045821 114 GFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT--H------SWDV 184 (352)
Q Consensus 114 ~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~--~------~v~~ 184 (352)
....+ +++|+.++.. ....+||..+..-...|.+..+.....++.++++||++...... . .+++
T Consensus 70 ~F~al~gskIv~~d~~-------~~t~vyDt~t~av~~~P~l~~pk~~pisv~VG~~LY~m~~~~~~~~~~~~~~~~FE~ 142 (342)
T PF07893_consen 70 DFFALHGSKIVAVDQS-------GRTLVYDTDTRAVATGPRLHSPKRCPISVSVGDKLYAMDRSPFPEPAGRPDFPCFEA 142 (342)
T ss_pred EEEEecCCeEEEEcCC-------CCeEEEECCCCeEeccCCCCCCCcceEEEEeCCeEEEeeccCccccccCccceeEEE
Confidence 34444 8889888653 44789999999988888888887777788889999999886443 1 5555
Q ss_pred E--eC--------CCCcEEEecCCCCcccc-------eeEEEE-CCEEEEE-ccCCCCCCceEEEEEeCCCCceeecCC
Q 045821 185 Y--DP--------RTNNWKLHTEPNIFTEI-------EDSFVM-DGKIYIR-CSASAATSHVCALVYEPSTDSWLHADA 244 (352)
Q Consensus 185 y--d~--------~t~~W~~~~~~~~~~~~-------~~~~~~-~g~iyv~-GG~~~~~~~~~i~~yd~~~~~W~~~~~ 244 (352)
+ ++ ..-.|+.++++|..... .+-+++ +..|+|. -+.. ...++||+.+.+|+...+
T Consensus 143 l~~~~~~~~~~~~~~w~W~~LP~PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~~-----~GTysfDt~~~~W~~~Gd 216 (342)
T PF07893_consen 143 LVYRPPPDDPSPEESWSWRSLPPPPFVRDRRYSDYRITSYAVVDGRTIFVSVNGRR-----WGTYSFDTESHEWRKHGD 216 (342)
T ss_pred eccccccccccCCCcceEEcCCCCCccccCCcccceEEEEEEecCCeEEEEecCCc-----eEEEEEEcCCcceeeccc
Confidence 5 42 23368888876543332 233455 5688884 2211 248999999999999976
No 55
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=98.02 E-value=0.012 Score=54.41 Aligned_cols=206 Identities=15% Similarity=0.178 Sum_probs=112.9
Q ss_pred eEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCc----
Q 045821 84 CCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTA---- 157 (352)
Q Consensus 84 ~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~---- 157 (352)
.++.||+.+-...|+.-...+.. ..+...+-+..++.+|+..+ ...++.+|+.+++ |+.-...+..
T Consensus 171 ~l~ald~~tG~~~W~~~~~~~~~-~~~~~~sP~v~~~~v~~~~~-------~g~v~a~d~~~G~~~W~~~~~~~~~~~~~ 242 (394)
T PRK11138 171 MLQALNESDGAVKWTVNLDVPSL-TLRGESAPATAFGGAIVGGD-------NGRVSAVLMEQGQLIWQQRISQPTGATEI 242 (394)
T ss_pred EEEEEEccCCCEeeeecCCCCcc-cccCCCCCEEECCEEEEEcC-------CCEEEEEEccCChhhheeccccCCCccch
Confidence 35677772223346654332210 01111223344666666432 2457888888775 7642111110
Q ss_pred ----ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc--EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEE
Q 045821 158 ----RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN--WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALV 231 (352)
Q Consensus 158 ----r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~ 231 (352)
....+.++.++.+|+.+.. ..+.++|+.+++ |+.-- .. ....+..+++||+.... ..+++
T Consensus 243 ~~~~~~~~sP~v~~~~vy~~~~~---g~l~ald~~tG~~~W~~~~--~~---~~~~~~~~~~vy~~~~~------g~l~a 308 (394)
T PRK11138 243 DRLVDVDTTPVVVGGVVYALAYN---GNLVALDLRSGQIVWKREY--GS---VNDFAVDGGRIYLVDQN------DRVYA 308 (394)
T ss_pred hcccccCCCcEEECCEEEEEEcC---CeEEEEECCCCCEEEeecC--CC---ccCcEEECCEEEEEcCC------CeEEE
Confidence 1123445668999987642 568999998874 87521 11 11245678999997422 25888
Q ss_pred EeCCCC--ceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEE
Q 045821 232 YEPSTD--SWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIF 307 (352)
Q Consensus 232 yd~~~~--~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~ 307 (352)
+|..++ .|+.-.. ........+..++.||+... ++ .++.+|..+++ |+. .++. .......+..+++||
T Consensus 309 ld~~tG~~~W~~~~~--~~~~~~sp~v~~g~l~v~~~-~G-~l~~ld~~tG~~~~~~--~~~~--~~~~s~P~~~~~~l~ 380 (394)
T PRK11138 309 LDTRGGVELWSQSDL--LHRLLTAPVLYNGYLVVGDS-EG-YLHWINREDGRFVAQQ--KVDS--SGFLSEPVVADDKLL 380 (394)
T ss_pred EECCCCcEEEccccc--CCCcccCCEEECCEEEEEeC-CC-EEEEEECCCCCEEEEE--EcCC--CcceeCCEEECCEEE
Confidence 998765 4864321 11122233557888886543 34 79999998876 443 1211 112234455788888
Q ss_pred EEcCCceEEEEE
Q 045821 308 VIGKGCSAVVID 319 (352)
Q Consensus 308 v~gG~~~~~~~d 319 (352)
+-...-..+.++
T Consensus 381 v~t~~G~l~~~~ 392 (394)
T PRK11138 381 IQARDGTVYAIT 392 (394)
T ss_pred EEeCCceEEEEe
Confidence 876544454444
No 56
>PF00646 F-box: F-box domain; InterPro: IPR001810 The F-box domain was first described as a sequence motif found in cyclin-F that interacts with the protein SKP1 [, ]. This relatively conserved structural motif is present in numerous proteins and serves as a link between a target protein and a ubiquitin-conjugating enzyme. The SCF complex (e.g., Skp1-Cullin-F-box) plays a similar role as an E3 ligase in the ubiquitin protein degradation pathway [, ]. Different F-box proteins as a part of SCF complex recruit particular substrates for ubiquitination through specific protein-protein interaction domains. Many mammalian F-box domains contain leucine-rich or WD-40 repeats (IPR001680 from INTERPRO). However, several F-box proteins either have other previously described domains such as Sec7 domain found in FBS protein or do not contain defined protein-protein interaction domains or motifs.; GO: 0005515 protein binding; PDB: 2E32_A 2E31_A 3V7D_B 1NEX_B 3MKS_D 3L2O_B.
Probab=98.02 E-value=1.1e-06 Score=54.56 Aligned_cols=43 Identities=33% Similarity=0.426 Sum_probs=35.8
Q ss_pred ccCCCcHHHHHHhhccCccccchhhhhhhhhhcccccChhhHH
Q 045821 17 LISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEWCA 59 (352)
Q Consensus 17 ~~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~~~ 59 (352)
.+.+||+|++.+|+.+++..++..++.|||+|+.++.++.+..
T Consensus 2 ~~~~LP~~il~~Il~~l~~~~~~~l~~vsk~~~~~~~~~~~~~ 44 (48)
T PF00646_consen 2 PLSDLPDEILQEILSYLDPKDLLRLSLVSKRWRSLVDSPRLWK 44 (48)
T ss_dssp HHHHS-HHHHHHHHHTS-HHHHHHHCTT-HHHHHHHTTHHHHH
T ss_pred CHHHCCHHHHHHHHHHCcHHHHHHHHHHhhHHHHHHcCCCccH
Confidence 4568999999999999999999999999999999998876643
No 57
>smart00256 FBOX A Receptor for Ubiquitination Targets.
Probab=97.87 E-value=6.2e-06 Score=49.18 Aligned_cols=37 Identities=38% Similarity=0.671 Sum_probs=34.3
Q ss_pred CcHHHHHHhhccCccccchhhhhhhhhhcccccChhh
Q 045821 21 LPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEW 57 (352)
Q Consensus 21 lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~ 57 (352)
||+|++.+|+++++..++.+++.|||+|+.++..+.+
T Consensus 1 lP~~ll~~I~~~l~~~d~~~~~~vc~~~~~~~~~~~~ 37 (41)
T smart00256 1 LPDEILEEILSKLPPKDLLRLRKVSRRWRSLIDSHDF 37 (41)
T ss_pred CCHHHHHHHHHcCCHHHHHHHHHHHHHHHHHhcChhh
Confidence 7999999999999999999999999999999877654
No 58
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=97.85 E-value=0.011 Score=50.26 Aligned_cols=173 Identities=16% Similarity=0.175 Sum_probs=100.7
Q ss_pred CeEEEEECCCCe--eEecCCCCCcceeee--eEEECCEEEEEeCCCCCCceEEEeCCCCc--EEEecCCCCcccceeEEE
Q 045821 136 SEVYCYDASMNT--WTDAAPMSTARCYFP--CGVLNQKIYCIGGLGDTHSWDVYDPRTNN--WKLHTEPNIFTEIEDSFV 209 (352)
Q Consensus 136 ~~~~~~d~~t~~--W~~~~~~~~~r~~~~--~~~~~~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~~~~~~~~~~~ 209 (352)
..+..+|+.+++ |+.-. ..+..... .+..++.+|+..+. ..+.++|..+++ |+.-. +..... ....
T Consensus 3 g~l~~~d~~tG~~~W~~~~--~~~~~~~~~~~~~~~~~v~~~~~~---~~l~~~d~~tG~~~W~~~~--~~~~~~-~~~~ 74 (238)
T PF13360_consen 3 GTLSALDPRTGKELWSYDL--GPGIGGPVATAVPDGGRVYVASGD---GNLYALDAKTGKVLWRFDL--PGPISG-APVV 74 (238)
T ss_dssp SEEEEEETTTTEEEEEEEC--SSSCSSEEETEEEETTEEEEEETT---SEEEEEETTTSEEEEEEEC--SSCGGS-GEEE
T ss_pred CEEEEEECCCCCEEEEEEC--CCCCCCccceEEEeCCEEEEEcCC---CEEEEEECCCCCEEEEeec--cccccc-eeee
Confidence 457788888776 76622 11122222 34468899998432 779999998774 77643 222222 2477
Q ss_pred ECCEEEEEccCCCCCCceEEEEEeCCCC--cee-ecCCCCC-CCCCCcEE-EECCEEEEEecCCCCeEEEEeCCCCc--e
Q 045821 210 MDGKIYIRCSASAATSHVCALVYEPSTD--SWL-HADANMA-SGWRGPAV-VVDDALYVLDQSSGTKLMMWQKESRE--W 282 (352)
Q Consensus 210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~--~W~-~~~~~~~-~~~~~~~~-~~~~~l~~~gg~~~~~v~~yd~~~~~--W 282 (352)
.++.+|+.... ..+.++|..++ .|+ .....+. ........ ..++.+|+... .+ .++.+|+++++ |
T Consensus 75 ~~~~v~v~~~~------~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g-~l~~~d~~tG~~~w 146 (238)
T PF13360_consen 75 DGGRVYVGTSD------GSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTS-SG-KLVALDPKTGKLLW 146 (238)
T ss_dssp ETTEEEEEETT------SEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEET-CS-EEEEEETTTTEEEE
T ss_pred cccccccccce------eeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEec-cC-cEEEEecCCCcEEE
Confidence 89999988421 16899997665 598 4433222 22222333 34666665543 33 89999999765 6
Q ss_pred EEccCCCCCcC------CCCceEEEeCCEEEEEcCCceEEEEEcCCcc
Q 045821 283 SPVGRLSTLLT------RPPCKLVAIGKTIFVIGKGCSAVVIDVGNIG 324 (352)
Q Consensus 283 ~~~~~~~~~~~------~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~ 324 (352)
+.-...+.... ......+..++.||+..+......+|.++..
T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~g~~~~~d~~tg~ 194 (238)
T PF13360_consen 147 KYPVGEPRGSSPISSFSDINGSPVISDGRVYVSSGDGRVVAVDLATGE 194 (238)
T ss_dssp EEESSTT-SS--EEEETTEEEEEECCTTEEEEECCTSSEEEEETTTTE
T ss_pred EeecCCCCCCcceeeecccccceEEECCEEEEEcCCCeEEEEECCCCC
Confidence 65433332110 1123444456799998877655545888776
No 59
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=97.83 E-value=0.00087 Score=58.02 Aligned_cols=279 Identities=15% Similarity=0.163 Sum_probs=133.2
Q ss_pred cCCCc----HHHHHHhhccCccccchhhhhhhhhhcccccChhhH-HH-----------HhhCCCCCCEEEEEEecCCCC
Q 045821 18 ISGLP----DDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEWC-AY-----------RRKHNLDETWIYAFCRDNKLE 81 (352)
Q Consensus 18 ~~~lp----~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~~-~~-----------~~~~~~~~~~l~~~gg~~~~~ 81 (352)
+..|| +++++.||+.|...+++....|||+|..++..+..- +. |.-+...+-..|++--.....
T Consensus 75 i~~lP~~gl~hi~e~ilsyld~~sLc~celv~k~W~r~l~dg~~WKkLie~~vr~dslWrgl~e~rqw~~~lf~~r~~~~ 154 (499)
T KOG0281|consen 75 ITALPEQGLDHIAENILSYLDALSLCACELVCKEWKRVLSDGMLWKKLIERMVRTDSLWRGLSERRQWDQYLFKNRPNDG 154 (499)
T ss_pred HHhcccccHHHHHHHHHHhcchhhhhHHHHHHHHHHHHhccchHHHHHHHHhcchHHHHhhhhhccCcchhhccCCCCcC
Confidence 45788 999999999999999999999999999988777431 11 111111122333331110000
Q ss_pred -----ceeEEEe-----cCCCCCCCeeEeccCCCcc---c-CCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 82 -----RVCCYVL-----DPNSTRRSWKMIHELPART---L-RRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 82 -----~~~~~~~-----d~~~~~~~w~~~~~~~~~~---~-~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
.+....| |.....+.|..-..+-... + ...+.-+...++...|-|- ..+++-++|..+..
T Consensus 155 ~~~~n~f~~~l~pki~~di~~idsNWr~Gr~~~~rinc~Se~skgVYClQYDD~kiVSGl------rDnTikiWD~n~~~ 228 (499)
T KOG0281|consen 155 GFPPNSFYRLLYPKIIQDIETIESNWRCGRHLLQRINCRSENSKGVYCLQYDDEKIVSGL------RDNTIKIWDKNSLE 228 (499)
T ss_pred CcCCCcchhhhhHHHHHHHhhhhcchhccceeeeeecCCcccCCceEEEEecchhhhccc------ccCceEEeccccHH
Confidence 0000001 1111334453222111000 0 0111112233444444332 23456666765543
Q ss_pred eEecCCCCCcceeeee-EEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce-eEEEECCEEEEEccCCCCCC
Q 045821 148 WTDAAPMSTARCYFPC-GVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE-DSFVMDGKIYIRCSASAATS 225 (352)
Q Consensus 148 W~~~~~~~~~r~~~~~-~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~-~~~~~~g~iyv~GG~~~~~~ 225 (352)
-..+ -..+.+... -.+++++.+.|..+ +.+-++|.+|++--+. ...+... ....+++.+.|....+ .
T Consensus 229 c~~~---L~GHtGSVLCLqyd~rviisGSSD--sTvrvWDv~tge~l~t---lihHceaVLhlrf~ng~mvtcSkD---r 297 (499)
T KOG0281|consen 229 CLKI---LTGHTGSVLCLQYDERVIVSGSSD--STVRVWDVNTGEPLNT---LIHHCEAVLHLRFSNGYMVTCSKD---R 297 (499)
T ss_pred HHHh---hhcCCCcEEeeeccceEEEecCCC--ceEEEEeccCCchhhH---HhhhcceeEEEEEeCCEEEEecCC---c
Confidence 2211 112222222 22477877777655 7888999988753211 1112211 1122333333432222 1
Q ss_pred ceEEEEEeCCCCceeecCCCCCCCCCCc---EEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEe
Q 045821 226 HVCALVYEPSTDSWLHADANMASGWRGP---AVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI 302 (352)
Q Consensus 226 ~~~i~~yd~~~~~W~~~~~~~~~~~~~~---~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~ 302 (352)
.+-++|...-+ .+.-..--..+.+ ++-++++ |++.......+-+|++.+..+... +.. -+.+-++.-+
T Consensus 298 --siaVWdm~sps--~it~rrVLvGHrAaVNvVdfd~k-yIVsASgDRTikvW~~st~efvRt--l~g--HkRGIAClQY 368 (499)
T KOG0281|consen 298 --SIAVWDMASPT--DITLRRVLVGHRAAVNVVDFDDK-YIVSASGDRTIKVWSTSTCEFVRT--LNG--HKRGIACLQY 368 (499)
T ss_pred --eeEEEeccCch--HHHHHHHHhhhhhheeeeccccc-eEEEecCCceEEEEeccceeeehh--hhc--ccccceehhc
Confidence 23333332211 1100000011222 2446777 666655555899999887666433 322 2455667778
Q ss_pred CCEEEEEcCCceEE-EEEcCC
Q 045821 303 GKTIFVIGKGCSAV-VIDVGN 322 (352)
Q Consensus 303 ~~~i~v~gG~~~~~-~~d~~~ 322 (352)
.+++.|-|..+... ++|++.
T Consensus 369 r~rlvVSGSSDntIRlwdi~~ 389 (499)
T KOG0281|consen 369 RDRLVVSGSSDNTIRLWDIEC 389 (499)
T ss_pred cCeEEEecCCCceEEEEeccc
Confidence 88888888665322 455543
No 60
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=97.77 E-value=0.03 Score=51.34 Aligned_cols=173 Identities=14% Similarity=0.187 Sum_probs=96.0
Q ss_pred EEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCc--------ceeeeeEEECCEEEEEeCCCCCCceEEE
Q 045821 116 EVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTA--------RCYFPCGVLNQKIYCIGGLGDTHSWDVY 185 (352)
Q Consensus 116 ~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~--------r~~~~~~~~~~~iyv~gG~~~~~~v~~y 185 (352)
+..++.+|+ |.. ...+..+|+.+++ |+.-...+.. ......+..++.+|+.... ..+.+|
T Consensus 187 ~~~~~~v~~-~~~------~g~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~~~~~vy~~~~~---g~l~a~ 256 (377)
T TIGR03300 187 VIADGGVLV-GFA------GGKLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVVDGGQVYAVSYQ---GRVAAL 256 (377)
T ss_pred EEECCEEEE-ECC------CCEEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEEECCEEEEEEcC---CEEEEE
Confidence 445665554 321 2468889988775 7542211111 1122344557888886542 568999
Q ss_pred eCCCCc--EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCC--ceeecCCCCCCCCCCcEEEECCEE
Q 045821 186 DPRTNN--WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTD--SWLHADANMASGWRGPAVVVDDAL 261 (352)
Q Consensus 186 d~~t~~--W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~--~W~~~~~~~~~~~~~~~~~~~~~l 261 (352)
|+.+++ |+.-. ......+..+++||+.... ..+.++|..++ .|+.-.. ........+..++.|
T Consensus 257 d~~tG~~~W~~~~-----~~~~~p~~~~~~vyv~~~~------G~l~~~d~~tG~~~W~~~~~--~~~~~ssp~i~g~~l 323 (377)
T TIGR03300 257 DLRSGRVLWKRDA-----SSYQGPAVDDNRLYVTDAD------GVVVALDRRSGSELWKNDEL--KYRQLTAPAVVGGYL 323 (377)
T ss_pred ECCCCcEEEeecc-----CCccCceEeCCEEEEECCC------CeEEEEECCCCcEEEccccc--cCCccccCEEECCEE
Confidence 998764 76531 1122334568899987321 25888888765 4865321 111122224467777
Q ss_pred EEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEE
Q 045821 262 YVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVV 317 (352)
Q Consensus 262 ~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~ 317 (352)
|+.+ .++ .++.+|..+++ |+ .. ++. .......+..+++||+.+..-.++.
T Consensus 324 ~~~~-~~G-~l~~~d~~tG~~~~~-~~-~~~--~~~~~sp~~~~~~l~v~~~dG~l~~ 375 (377)
T TIGR03300 324 VVGD-FEG-YLHWLSREDGSFVAR-LK-TDG--SGIASPPVVVGDGLLVQTRDGDLYA 375 (377)
T ss_pred EEEe-CCC-EEEEEECCCCCEEEE-EE-cCC--CccccCCEEECCEEEEEeCCceEEE
Confidence 7654 344 89999987665 42 21 111 1123445667888887765434443
No 61
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=97.54 E-value=0.041 Score=46.65 Aligned_cols=182 Identities=18% Similarity=0.228 Sum_probs=108.4
Q ss_pred EEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC--c
Q 045821 116 EVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--N 191 (352)
Q Consensus 116 ~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~ 191 (352)
+..++.+|+..+ ...++++|..+++ |+.-. +.+ .....+..++.+|+..+. ..+..+|..+. .
T Consensus 33 ~~~~~~v~~~~~-------~~~l~~~d~~tG~~~W~~~~--~~~-~~~~~~~~~~~v~v~~~~---~~l~~~d~~tG~~~ 99 (238)
T PF13360_consen 33 VPDGGRVYVASG-------DGNLYALDAKTGKVLWRFDL--PGP-ISGAPVVDGGRVYVGTSD---GSLYALDAKTGKVL 99 (238)
T ss_dssp EEETTEEEEEET-------TSEEEEEETTTSEEEEEEEC--SSC-GGSGEEEETTEEEEEETT---SEEEEEETTTSCEE
T ss_pred EEeCCEEEEEcC-------CCEEEEEECCCCCEEEEeec--ccc-ccceeeecccccccccce---eeeEecccCCccee
Confidence 447888998843 4788999998887 65443 222 112246778999988742 47899998877 4
Q ss_pred EE-EecCCCCcc--cceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCC--------CCcEEEEC
Q 045821 192 WK-LHTEPNIFT--EIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGW--------RGPAVVVD 258 (352)
Q Consensus 192 W~-~~~~~~~~~--~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~--------~~~~~~~~ 258 (352)
|+ .....+... ......+.++.+|+... ...+.++|+.+++ |+.-...+.... ....+..+
T Consensus 100 W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~g~l~~~d~~tG~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~ 173 (238)
T PF13360_consen 100 WSIYLTSSPPAGVRSSSSPAVDGDRLYVGTS------SGKLVALDPKTGKLLWKYPVGEPRGSSPISSFSDINGSPVISD 173 (238)
T ss_dssp EEEEE-SSCTCSTB--SEEEEETTEEEEEET------CSEEEEEETTTTEEEEEEESSTT-SS--EEEETTEEEEEECCT
T ss_pred eeeccccccccccccccCceEecCEEEEEec------cCcEEEEecCCCcEEEEeecCCCCCCcceeeecccccceEEEC
Confidence 98 443322222 22234445777777642 1268999988764 776432211111 11123346
Q ss_pred CEEEEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcc
Q 045821 259 DALYVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIG 324 (352)
Q Consensus 259 ~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~ 324 (352)
+.+|+..+.. .+..+|..++. |+.. ... ........++.||+......++.+|+++..
T Consensus 174 ~~v~~~~~~g--~~~~~d~~tg~~~w~~~--~~~----~~~~~~~~~~~l~~~~~~~~l~~~d~~tG~ 233 (238)
T PF13360_consen 174 GRVYVSSGDG--RVVAVDLATGEKLWSKP--ISG----IYSLPSVDGGTLYVTSSDGRLYALDLKTGK 233 (238)
T ss_dssp TEEEEECCTS--SEEEEETTTTEEEEEEC--SS-----ECECEECCCTEEEEEETTTEEEEEETTTTE
T ss_pred CEEEEEcCCC--eEEEEECCCCCEEEEec--CCC----ccCCceeeCCEEEEEeCCCEEEEEECCCCC
Confidence 7788776553 36666999987 8333 211 112245668888888855567789988654
No 62
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=97.53 E-value=0.069 Score=48.97 Aligned_cols=221 Identities=16% Similarity=0.175 Sum_probs=122.4
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
++.+|+.+.. -.++.||+.+....|+.-..-. ...+.+..++.+|+.+. ...++.+|..+++
T Consensus 65 ~~~v~v~~~~-----g~v~a~d~~tG~~~W~~~~~~~------~~~~p~v~~~~v~v~~~-------~g~l~ald~~tG~ 126 (377)
T TIGR03300 65 GGKVYAADAD-----GTVVALDAETGKRLWRVDLDER------LSGGVGADGGLVFVGTE-------KGEVIALDAEDGK 126 (377)
T ss_pred CCEEEEECCC-----CeEEEEEccCCcEeeeecCCCC------cccceEEcCCEEEEEcC-------CCEEEEEECCCCc
Confidence 7777765322 2477778722334575433221 11123444677776432 3578999998775
Q ss_pred --eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEecCCCCc--ccceeEEEECCEEEEEccCC
Q 045821 148 --WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHTEPNIF--TEIEDSFVMDGKIYIRCSAS 221 (352)
Q Consensus 148 --W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~~~~~--~~~~~~~~~~g~iyv~GG~~ 221 (352)
|+.-.. .+ .....+..++.+|+..+. ..+..+|+.++ .|+.-...+.. ......+..++.+|+. ...
T Consensus 127 ~~W~~~~~--~~-~~~~p~v~~~~v~v~~~~---g~l~a~d~~tG~~~W~~~~~~~~~~~~~~~sp~~~~~~v~~~-~~~ 199 (377)
T TIGR03300 127 ELWRAKLS--SE-VLSPPLVANGLVVVRTND---GRLTALDAATGERLWTYSRVTPALTLRGSASPVIADGGVLVG-FAG 199 (377)
T ss_pred EeeeeccC--ce-eecCCEEECCEEEEECCC---CeEEEEEcCCCceeeEEccCCCceeecCCCCCEEECCEEEEE-CCC
Confidence 764322 11 223344567888886442 56899999876 48754322211 1122334567766653 222
Q ss_pred CCCCceEEEEEeCCCC--ceeec-CCCCCC-------CCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc--eEEccCCC
Q 045821 222 AATSHVCALVYEPSTD--SWLHA-DANMAS-------GWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE--WSPVGRLS 289 (352)
Q Consensus 222 ~~~~~~~i~~yd~~~~--~W~~~-~~~~~~-------~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~ 289 (352)
..+..+|+.++ .|+.- ..+... ......+..++.+|+.+. .+ .++.||+++++ |+.- .+
T Consensus 200 -----g~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~~~~~vy~~~~-~g-~l~a~d~~tG~~~W~~~--~~ 270 (377)
T TIGR03300 200 -----GKLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVVDGGQVYAVSY-QG-RVAALDLRSGRVLWKRD--AS 270 (377)
T ss_pred -----CEEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEEECCEEEEEEc-CC-EEEEEECCCCcEEEeec--cC
Confidence 25888888665 48642 111100 011223456788887653 33 79999998754 7543 11
Q ss_pred CCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCc--cccc
Q 045821 290 TLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNI--GNIG 327 (352)
Q Consensus 290 ~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~--~w~~ 327 (352)
.....+..+++||+......++.+|.... .|+.
T Consensus 271 -----~~~~p~~~~~~vyv~~~~G~l~~~d~~tG~~~W~~ 305 (377)
T TIGR03300 271 -----SYQGPAVDDNRLYVTDADGVVVALDRRSGSELWKN 305 (377)
T ss_pred -----CccCceEeCCEEEEECCCCeEEEEECCCCcEEEcc
Confidence 12234567899999876656777887543 3654
No 63
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=97.49 E-value=0.004 Score=52.68 Aligned_cols=101 Identities=17% Similarity=0.126 Sum_probs=66.4
Q ss_pred EEEEE-eCCCCCCCCCCeEEEEECCCCe--------e---EecCCCCCcceeeeeEEE--CC--EEEEEeCCCCC-----
Q 045821 121 NAYLL-GGCGWSEDATSEVYCYDASMNT--------W---TDAAPMSTARCYFPCGVL--NQ--KIYCIGGLGDT----- 179 (352)
Q Consensus 121 ~iyv~-GG~~~~~~~~~~~~~~d~~t~~--------W---~~~~~~~~~r~~~~~~~~--~~--~iyv~gG~~~~----- 179 (352)
..|++ ||.+.++.-+..+++....++. . ..+.+.|.+|++|++.++ .| ...+|||...-
T Consensus 39 ~~YlIHGGrTPNNElS~~LY~ls~~s~~cNkK~tl~C~EKeLvGdvP~aRYGHt~~vV~SrGKta~VlFGGRSY~P~~qR 118 (337)
T PF03089_consen 39 EQYLIHGGRTPNNELSSSLYILSVDSRGCNKKVTLCCQEKELVGDVPEARYGHTINVVHSRGKTACVLFGGRSYMPPGQR 118 (337)
T ss_pred eeEEecCCcCCCcccccceEEEEeecCCCCceeEEEEecceecCCCCcccccceEEEEEECCcEEEEEECCcccCCcccc
Confidence 45665 6777766677788887665432 1 124578999999987776 23 46777885421
Q ss_pred ------------CceEEEeCCCCcEEE--ecCCCCcccceeEEEECCEEEEEccCC
Q 045821 180 ------------HSWDVYDPRTNNWKL--HTEPNIFTEIEDSFVMDGKIYIRCSAS 221 (352)
Q Consensus 180 ------------~~v~~yd~~t~~W~~--~~~~~~~~~~~~~~~~~g~iyv~GG~~ 221 (352)
..|+..|++-+-.+. ++.+.....+|.+..-++.+|++||..
T Consensus 119 TTenWNsVvDC~P~VfLiDleFGC~tah~lpEl~dG~SFHvslar~D~VYilGGHs 174 (337)
T PF03089_consen 119 TTENWNSVVDCPPQVFLIDLEFGCCTAHTLPELQDGQSFHVSLARNDCVYILGGHS 174 (337)
T ss_pred chhhcceeccCCCeEEEEeccccccccccchhhcCCeEEEEEEecCceEEEEccEE
Confidence 556666766655442 333444556667777899999999875
No 64
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=97.45 E-value=0.02 Score=49.08 Aligned_cols=186 Identities=16% Similarity=0.152 Sum_probs=104.2
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCcEEEec
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNNWKLHT 196 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~ 196 (352)
++.+|... .....++++|+.+++-..+.... ..+++.. ++.+|+... ....++|+.+++++.+.
T Consensus 11 ~g~l~~~D------~~~~~i~~~~~~~~~~~~~~~~~----~~G~~~~~~~g~l~v~~~----~~~~~~d~~~g~~~~~~ 76 (246)
T PF08450_consen 11 DGRLYWVD------IPGGRIYRVDPDTGEVEVIDLPG----PNGMAFDRPDGRLYVADS----GGIAVVDPDTGKVTVLA 76 (246)
T ss_dssp TTEEEEEE------TTTTEEEEEETTTTEEEEEESSS----EEEEEEECTTSEEEEEET----TCEEEEETTTTEEEEEE
T ss_pred CCEEEEEE------cCCCEEEEEECCCCeEEEEecCC----CceEEEEccCCEEEEEEc----CceEEEecCCCcEEEEe
Confidence 57788774 23468999999998766543222 2233333 689999887 55678899999998886
Q ss_pred CCC-----CcccceeEEEECCEEEEEccCCCC-CCc--eEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecC
Q 045821 197 EPN-----IFTEIEDSFVMDGKIYIRCSASAA-TSH--VCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQS 267 (352)
Q Consensus 197 ~~~-----~~~~~~~~~~~~g~iyv~GG~~~~-~~~--~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~ 267 (352)
..+ ..+....++.-+|+||+-.-.... ... ..++++++. .+.+.+...... -.+.++..++ .||+....
T Consensus 77 ~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~~~~-pNGi~~s~dg~~lyv~ds~ 154 (246)
T PF08450_consen 77 DLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADGLGF-PNGIAFSPDGKTLYVADSF 154 (246)
T ss_dssp EEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEEESS-EEEEEEETTSSEEEEEETT
T ss_pred eccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecCccc-ccceEECCcchheeecccc
Confidence 542 111222334447899986322211 111 479999998 666555332111 1122334455 46665544
Q ss_pred CCCeEEEEeCCCCc--eEEc---cCCCCCcCCCCceEEE-eCCEEEEEc-CCceEEEEEcCC
Q 045821 268 SGTKLMMWQKESRE--WSPV---GRLSTLLTRPPCKLVA-IGKTIFVIG-KGCSAVVIDVGN 322 (352)
Q Consensus 268 ~~~~v~~yd~~~~~--W~~~---~~~~~~~~~~~~~~~~-~~~~i~v~g-G~~~~~~~d~~~ 322 (352)
.+ .|++||+.... +... ..++... ...-++++ -++.||+.. +...+.+||++.
T Consensus 155 ~~-~i~~~~~~~~~~~~~~~~~~~~~~~~~-g~pDG~~vD~~G~l~va~~~~~~I~~~~p~G 214 (246)
T PF08450_consen 155 NG-RIWRFDLDADGGELSNRRVFIDFPGGP-GYPDGLAVDSDGNLWVADWGGGRIVVFDPDG 214 (246)
T ss_dssp TT-EEEEEEEETTTCCEEEEEEEEE-SSSS-CEEEEEEEBTTS-EEEEEETTTEEEEEETTS
T ss_pred cc-eeEEEeccccccceeeeeeEEEcCCCC-cCCCcceEcCCCCEEEEEcCCCEEEEECCCc
Confidence 44 89999886433 4322 2233211 01123333 367888863 455788999884
No 65
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=97.44 E-value=0.066 Score=47.12 Aligned_cols=236 Identities=11% Similarity=0.113 Sum_probs=106.4
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
++.-|++|-. . .++.=+- ...+|+....-...+......++...++..|++|-. .-++.-.=.-.+
T Consensus 27 ~~~G~~VG~~----g-~il~T~D--GG~tW~~~~~~~~~~~~~~l~~I~f~~~~g~ivG~~-------g~ll~T~DgG~t 92 (302)
T PF14870_consen 27 PNHGWAVGAY----G-TILKTTD--GGKTWQPVSLDLDNPFDYHLNSISFDGNEGWIVGEP-------GLLLHTTDGGKT 92 (302)
T ss_dssp SS-EEEEETT----T-EEEEESS--TTSS-EE-----S-----EEEEEEEETTEEEEEEET-------TEEEEESSTTSS
T ss_pred CCEEEEEecC----C-EEEEECC--CCccccccccCCCccceeeEEEEEecCCceEEEcCC-------ceEEEecCCCCC
Confidence 5667777542 1 1222222 578999886432211122233445557889988731 122222223457
Q ss_pred eEecCC-CCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCC
Q 045821 148 WTDAAP-MSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATS 225 (352)
Q Consensus 148 W~~~~~-~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~ 225 (352)
|+.++. .+.|...+....+ ++.+.+++.. ..+++=.-.-.+|+.+..............-+|++++++ ..+
T Consensus 93 W~~v~l~~~lpgs~~~i~~l~~~~~~l~~~~---G~iy~T~DgG~tW~~~~~~~~gs~~~~~r~~dG~~vavs-~~G--- 165 (302)
T PF14870_consen 93 WERVPLSSKLPGSPFGITALGDGSAELAGDR---GAIYRTTDGGKTWQAVVSETSGSINDITRSSDGRYVAVS-SRG--- 165 (302)
T ss_dssp -EE----TT-SS-EEEEEEEETTEEEEEETT-----EEEESSTTSSEEEEE-S----EEEEEE-TTS-EEEEE-TTS---
T ss_pred cEEeecCCCCCCCeeEEEEcCCCcEEEEcCC---CcEEEeCCCCCCeeEcccCCcceeEeEEECCCCcEEEEE-Ccc---
Confidence 999862 2344444444444 5677776653 334444444568998764333222222233467766664 221
Q ss_pred ceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEe--CCCCceEEccCCCCCcCCCC-ceEEE-
Q 045821 226 HVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQ--KESREWSPVGRLSTLLTRPP-CKLVA- 301 (352)
Q Consensus 226 ~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd--~~~~~W~~~~~~~~~~~~~~-~~~~~- 301 (352)
.-....|+....|+.......++...+...-++.|++++ .+..+..=| -..++|.+-. .|....... ..++.
T Consensus 166 -~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~~~lw~~~--~Gg~~~~s~~~~~~~~w~~~~-~~~~~~~~~~ld~a~~ 241 (302)
T PF14870_consen 166 -NFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPDGNLWMLA--RGGQIQFSDDPDDGETWSEPI-IPIKTNGYGILDLAYR 241 (302)
T ss_dssp -SEEEEE-TT-SS-EEEE--SSS-EEEEEE-TTS-EEEEE--TTTEEEEEE-TTEEEEE---B--TTSS--S-EEEEEES
T ss_pred -cEEEEecCCCccceEEccCccceehhceecCCCCEEEEe--CCcEEEEccCCCCcccccccc-CCcccCceeeEEEEec
Confidence 234567888889998876555555555566678888876 222666666 4457787732 222111111 22222
Q ss_pred eCCEEEEEcCCceEEEEEcCCcccccc
Q 045821 302 IGKTIFVIGKGCSAVVIDVGNIGNIGG 328 (352)
Q Consensus 302 ~~~~i~v~gG~~~~~~~d~~~~~w~~~ 328 (352)
.++.+++.||.-..+.-.-.-.+|++.
T Consensus 242 ~~~~~wa~gg~G~l~~S~DgGktW~~~ 268 (302)
T PF14870_consen 242 PPNEIWAVGGSGTLLVSTDGGKTWQKD 268 (302)
T ss_dssp SSS-EEEEESTT-EEEESSTTSS-EE-
T ss_pred CCCCEEEEeCCccEEEeCCCCccceEC
Confidence 368899999987666656667789874
No 66
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=97.32 E-value=0.018 Score=50.14 Aligned_cols=105 Identities=14% Similarity=0.278 Sum_probs=65.3
Q ss_pred CceEEEeCCCCcEEEecCCCCcccceeEEE-ECCEEEEEccCCCCC-CceEEEEEeCCCCceeecCCCC----CCCCCCc
Q 045821 180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFV-MDGKIYIRCSASAAT-SHVCALVYEPSTDSWLHADANM----ASGWRGP 253 (352)
Q Consensus 180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~-~~g~iyv~GG~~~~~-~~~~i~~yd~~~~~W~~~~~~~----~~~~~~~ 253 (352)
..+-.||..+.+|..+..- ....-..... -++++|+.|-..-.+ ....+..||..+.+|+.++... +.+....
T Consensus 16 ~~lC~yd~~~~qW~~~g~~-i~G~V~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~~~~~~~s~~ipgpv~a~ 94 (281)
T PF12768_consen 16 PGLCLYDTDNSQWSSPGNG-ISGTVTDLQWASNNQLLVGGNFTLNGTNSSNLATYDFKNQTWSSLGGGSSNSIPGPVTAL 94 (281)
T ss_pred CEEEEEECCCCEeecCCCC-ceEEEEEEEEecCCEEEEEEeeEECCCCceeEEEEecCCCeeeecCCcccccCCCcEEEE
Confidence 7788999999999987643 2222222333 377888886544434 3458899999999999886622 2221111
Q ss_pred EEE--ECCEEEEEecCCC--CeEEEEeCCCCceEEccC
Q 045821 254 AVV--VDDALYVLDQSSG--TKLMMWQKESREWSPVGR 287 (352)
Q Consensus 254 ~~~--~~~~l~~~gg~~~--~~v~~yd~~~~~W~~~~~ 287 (352)
.+. ...++++.|.... ..+..|| ..+|..+..
T Consensus 95 ~~~~~d~~~~~~aG~~~~g~~~l~~~d--Gs~W~~i~~ 130 (281)
T PF12768_consen 95 TFISNDGSNFWVAGRSANGSTFLMKYD--GSSWSSIGS 130 (281)
T ss_pred EeeccCCceEEEeceecCCCceEEEEc--CCceEeccc
Confidence 111 2345666665322 4577775 778999975
No 67
>PF07893 DUF1668: Protein of unknown function (DUF1668); InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function.
Probab=97.22 E-value=0.018 Score=51.89 Aligned_cols=119 Identities=21% Similarity=0.199 Sum_probs=77.6
Q ss_pred ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCC-------ceEEEEEeC----
Q 045821 166 LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATS-------HVCALVYEP---- 234 (352)
Q Consensus 166 ~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~-------~~~i~~yd~---- 234 (352)
.+.+|+.++.. ....+||..|..-...|.+..+.....++.++++||++........ ...+..|+.
T Consensus 75 ~gskIv~~d~~---~~t~vyDt~t~av~~~P~l~~pk~~pisv~VG~~LY~m~~~~~~~~~~~~~~~~FE~l~~~~~~~~ 151 (342)
T PF07893_consen 75 HGSKIVAVDQS---GRTLVYDTDTRAVATGPRLHSPKRCPISVSVGDKLYAMDRSPFPEPAGRPDFPCFEALVYRPPPDD 151 (342)
T ss_pred cCCeEEEEcCC---CCeEEEECCCCeEeccCCCCCCCcceEEEEeCCeEEEeeccCccccccCccceeEEEecccccccc
Confidence 47888888654 4478999999988777766666666677788999999966543322 123444442
Q ss_pred ----CCCceeecCCCCCCCCCC-----c--EEEE-CCEEEE-EecCCCCeEEEEeCCCCceEEccCC
Q 045821 235 ----STDSWLHADANMASGWRG-----P--AVVV-DDALYV-LDQSSGTKLMMWQKESREWSPVGRL 288 (352)
Q Consensus 235 ----~~~~W~~~~~~~~~~~~~-----~--~~~~-~~~l~~-~gg~~~~~v~~yd~~~~~W~~~~~~ 288 (352)
..-.|+.+++++-..... . -+++ +..|+| +.+.. ...+.||+++.+|+.+++-
T Consensus 152 ~~~~~~w~W~~LP~PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~~-~GTysfDt~~~~W~~~GdW 217 (342)
T PF07893_consen 152 PSPEESWSWRSLPPPPFVRDRRYSDYRITSYAVVDGRTIFVSVNGRR-WGTYSFDTESHEWRKHGDW 217 (342)
T ss_pred ccCCCcceEEcCCCCCccccCCcccceEEEEEEecCCeEEEEecCCc-eEEEEEEcCCcceeeccce
Confidence 233688887644322221 1 1344 567777 43321 1489999999999999853
No 68
>PRK13684 Ycf48-like protein; Provisional
Probab=97.08 E-value=0.21 Score=45.03 Aligned_cols=206 Identities=13% Similarity=0.190 Sum_probs=102.0
Q ss_pred CCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEE
Q 045821 93 TRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIY 171 (352)
Q Consensus 93 ~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iy 171 (352)
...+|+++..... .+...+.+..++ +.+++.|. ...+++=+-.-.+|+.+.... .-..+.+....+..+
T Consensus 117 gG~tW~~~~~~~~--~~~~~~~i~~~~~~~~~~~g~-------~G~i~~S~DgG~tW~~~~~~~-~g~~~~i~~~~~g~~ 186 (334)
T PRK13684 117 GGKNWTRIPLSEK--LPGSPYLITALGPGTAEMATN-------VGAIYRTTDGGKNWEALVEDA-AGVVRNLRRSPDGKY 186 (334)
T ss_pred CCCCCeEccCCcC--CCCCceEEEEECCCcceeeec-------cceEEEECCCCCCceeCcCCC-cceEEEEEECCCCeE
Confidence 4678988863211 111222333343 34555543 123444333456799886533 223344444444344
Q ss_pred EEeCCCCCCceEE-EeCCCCcEEEecCCCCccccee-EEEECCEEEEEccCCCCCCceEEEEE--eCCCCceeecCCCCC
Q 045821 172 CIGGLGDTHSWDV-YDPRTNNWKLHTEPNIFTEIED-SFVMDGKIYIRCSASAATSHVCALVY--EPSTDSWLHADANMA 247 (352)
Q Consensus 172 v~gG~~~~~~v~~-yd~~t~~W~~~~~~~~~~~~~~-~~~~~g~iyv~GG~~~~~~~~~i~~y--d~~~~~W~~~~~~~~ 247 (352)
++.|.. ..++. .|....+|+.+... ....... +..-++.++++|... ...+ +-...+|+.+..+..
T Consensus 187 v~~g~~--G~i~~s~~~gg~tW~~~~~~-~~~~l~~i~~~~~g~~~~vg~~G-------~~~~~s~d~G~sW~~~~~~~~ 256 (334)
T PRK13684 187 VAVSSR--GNFYSTWEPGQTAWTPHQRN-SSRRLQSMGFQPDGNLWMLARGG-------QIRFNDPDDLESWSKPIIPEI 256 (334)
T ss_pred EEEeCC--ceEEEEcCCCCCeEEEeeCC-CcccceeeeEcCCCCEEEEecCC-------EEEEccCCCCCccccccCCcc
Confidence 443322 12222 24445679988643 2233233 333478898885332 2334 233458998654322
Q ss_pred -CCCCCcE--EEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEE-eCCEEEEEcCCceEEEEEcC
Q 045821 248 -SGWRGPA--VVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVA-IGKTIFVIGKGCSAVVIDVG 321 (352)
Q Consensus 248 -~~~~~~~--~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~-~~~~i~v~gG~~~~~~~d~~ 321 (352)
......+ +..++.++++|.. + .++.-.....+|+.+...+. .......++. .++++|+.|..-.+..++..
T Consensus 257 ~~~~~l~~v~~~~~~~~~~~G~~-G-~v~~S~d~G~tW~~~~~~~~-~~~~~~~~~~~~~~~~~~~G~~G~il~~~~~ 331 (334)
T PRK13684 257 TNGYGYLDLAYRTPGEIWAGGGN-G-TLLVSKDGGKTWEKDPVGEE-VPSNFYKIVFLDPEKGFVLGQRGVLLRYVGS 331 (334)
T ss_pred ccccceeeEEEcCCCCEEEEcCC-C-eEEEeCCCCCCCeECCcCCC-CCcceEEEEEeCCCceEEECCCceEEEecCC
Confidence 1122222 2336678877754 3 45555556789999863222 1223344443 46778888765444445543
No 69
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=96.91 E-value=0.1 Score=44.69 Aligned_cols=190 Identities=16% Similarity=0.155 Sum_probs=101.5
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEE--CCEEEEEeCCCCCCCCCCeEEEEECCC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL--GKNAYLLGGCGWSEDATSEVYCYDASM 145 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~--~~~iyv~GG~~~~~~~~~~~~~~d~~t 145 (352)
++.||.+. ...-.++.+|+ .++.-..+. .+ . ..+++.. ++.+|+... ....++|+.+
T Consensus 11 ~g~l~~~D----~~~~~i~~~~~--~~~~~~~~~-~~---~---~~G~~~~~~~g~l~v~~~--------~~~~~~d~~~ 69 (246)
T PF08450_consen 11 DGRLYWVD----IPGGRIYRVDP--DTGEVEVID-LP---G---PNGMAFDRPDGRLYVADS--------GGIAVVDPDT 69 (246)
T ss_dssp TTEEEEEE----TTTTEEEEEET--TTTEEEEEE-SS---S---EEEEEEECTTSEEEEEET--------TCEEEEETTT
T ss_pred CCEEEEEE----cCCCEEEEEEC--CCCeEEEEe-cC---C---CceEEEEccCCEEEEEEc--------CceEEEecCC
Confidence 56788772 23346788898 776554433 22 1 1333443 678888764 3346669999
Q ss_pred CeeEecCCCC--C-cceee--eeEEECCEEEEEeCCCCC------CceEEEeCCCCcEEEecCCCCcccceeEEEECC-E
Q 045821 146 NTWTDAAPMS--T-ARCYF--PCGVLNQKIYCIGGLGDT------HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-K 213 (352)
Q Consensus 146 ~~W~~~~~~~--~-~r~~~--~~~~~~~~iyv~gG~~~~------~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~ 213 (352)
++++.+...+ . +.... .++--+|.+|+..-.... ..++++++. .+.+.+... ......-+..-++ .
T Consensus 70 g~~~~~~~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~-~~~pNGi~~s~dg~~ 147 (246)
T PF08450_consen 70 GKVTVLADLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADG-LGFPNGIAFSPDGKT 147 (246)
T ss_dssp TEEEEEEEEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEE-ESSEEEEEEETTSSE
T ss_pred CcEEEEeeccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecC-cccccceEECCcchh
Confidence 9998876653 1 22333 333336888886432221 358899998 665554321 1111222233355 5
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCc--eeecC---CCCCCCCC--CcEEEECCEEEEEecCCCCeEEEEeCCCCceEEcc
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDS--WLHAD---ANMASGWR--GPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVG 286 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~---~~~~~~~~--~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~ 286 (352)
||+.-. ....|++|+..... +.... ........ +.++-.+|.||+..- .+..|++||++...-..+.
T Consensus 148 lyv~ds-----~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~-~~~~I~~~~p~G~~~~~i~ 221 (246)
T PF08450_consen 148 LYVADS-----FNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADW-GGGRIVVFDPDGKLLREIE 221 (246)
T ss_dssp EEEEET-----TTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEE-TTTEEEEEETTSCEEEEEE
T ss_pred eeeccc-----ccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEc-CCCEEEEECCCccEEEEEc
Confidence 777632 22368888875433 33221 11111111 122334788888643 2339999999966666564
No 70
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=96.87 E-value=0.27 Score=42.92 Aligned_cols=182 Identities=15% Similarity=0.079 Sum_probs=87.3
Q ss_pred EEEEEeCCCCCCCCCCeEEEEECCCCeeEe-cCCCCCcceeeeeEEE-CC-EEEEEeCCCCCCceEEEeCCCCcEEEecC
Q 045821 121 NAYLLGGCGWSEDATSEVYCYDASMNTWTD-AAPMSTARCYFPCGVL-NQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTE 197 (352)
Q Consensus 121 ~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~-~~~~~~~r~~~~~~~~-~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~ 197 (352)
.+|+.++. ...+.+||+.+++-.. +.....++ .++.. ++ .+|+.++.+ ..+.+||..+.+....-.
T Consensus 2 ~~~~s~~~------d~~v~~~d~~t~~~~~~~~~~~~~~---~l~~~~dg~~l~~~~~~~--~~v~~~d~~~~~~~~~~~ 70 (300)
T TIGR03866 2 KAYVSNEK------DNTISVIDTATLEVTRTFPVGQRPR---GITLSKDGKLLYVCASDS--DTIQVIDLATGEVIGTLP 70 (300)
T ss_pred cEEEEecC------CCEEEEEECCCCceEEEEECCCCCC---ceEECCCCCEEEEEECCC--CeEEEEECCCCcEEEecc
Confidence 45555543 3468888888775322 22111122 12222 33 577776543 678899998876543211
Q ss_pred CCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCCce-eecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEE
Q 045821 198 PNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTDSW-LHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMW 275 (352)
Q Consensus 198 ~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~y 275 (352)
..... ...++.-++ .+|+.++.. ..+..||+.+.+- ..++.. .......+..++++++++...+..+..|
T Consensus 71 ~~~~~-~~~~~~~~g~~l~~~~~~~-----~~l~~~d~~~~~~~~~~~~~--~~~~~~~~~~dg~~l~~~~~~~~~~~~~ 142 (300)
T TIGR03866 71 SGPDP-ELFALHPNGKILYIANEDD-----NLVTVIDIETRKVLAEIPVG--VEPEGMAVSPDGKIVVNTSETTNMAHFI 142 (300)
T ss_pred CCCCc-cEEEECCCCCEEEEEcCCC-----CeEEEEECCCCeEEeEeeCC--CCcceEEECCCCCEEEEEecCCCeEEEE
Confidence 11111 111222244 566664322 2588888876542 222211 1112223344677777766554467778
Q ss_pred eCCCCceEEccCCCCCcCCCCceEEEeCCE-EEEEcC-CceEEEEEcCCcc
Q 045821 276 QKESREWSPVGRLSTLLTRPPCKLVAIGKT-IFVIGK-GCSAVVIDVGNIG 324 (352)
Q Consensus 276 d~~~~~W~~~~~~~~~~~~~~~~~~~~~~~-i~v~gG-~~~~~~~d~~~~~ 324 (352)
|..+..-...-.... +..+.....+++ +++-+. ...+.+||.++..
T Consensus 143 d~~~~~~~~~~~~~~---~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~ 190 (300)
T TIGR03866 143 DTKTYEIVDNVLVDQ---RPRFAEFTADGKELWVSSEIGGTVSVIDVATRK 190 (300)
T ss_pred eCCCCeEEEEEEcCC---CccEEEECCCCCEEEEEcCCCCEEEEEEcCcce
Confidence 887654322111111 111122223444 444432 2346678887654
No 71
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=96.83 E-value=0.14 Score=43.79 Aligned_cols=156 Identities=18% Similarity=0.081 Sum_probs=93.1
Q ss_pred eeeeEE-ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc
Q 045821 160 YFPCGV-LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS 238 (352)
Q Consensus 160 ~~~~~~-~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~ 238 (352)
..+... .++.+|.-.|....+.+..||+.|++-....+++......+++.++++||.+. -.....+.||..+
T Consensus 47 TQGL~~~~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d~l~qLT-----Wk~~~~f~yd~~t-- 119 (264)
T PF05096_consen 47 TQGLEFLDDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGITILGDKLYQLT-----WKEGTGFVYDPNT-- 119 (264)
T ss_dssp EEEEEEEETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEEEEETTEEEEEE-----SSSSEEEEEETTT--
T ss_pred CccEEecCCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeEEEECCEEEEEE-----ecCCeEEEEcccc--
Confidence 344555 57899999887776899999999998766666777777778899999999993 1123688999975
Q ss_pred eeecCCCCCCCCCCcEE-EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCc----CCCCceEEEeCCEEEE-EcCC
Q 045821 239 WLHADANMASGWRGPAV-VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLL----TRPPCKLVAIGKTIFV-IGKG 312 (352)
Q Consensus 239 W~~~~~~~~~~~~~~~~-~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~----~~~~~~~~~~~~~i~v-~gG~ 312 (352)
.+.+... .....+-.. ..+..|++-.|.+ .++.+||++-+ .+..+.... ....--+-..+++||. +--.
T Consensus 120 l~~~~~~-~y~~EGWGLt~dg~~Li~SDGS~--~L~~~dP~~f~--~~~~i~V~~~g~pv~~LNELE~i~G~IyANVW~t 194 (264)
T PF05096_consen 120 LKKIGTF-PYPGEGWGLTSDGKRLIMSDGSS--RLYFLDPETFK--EVRTIQVTDNGRPVSNLNELEYINGKIYANVWQT 194 (264)
T ss_dssp TEEEEEE-E-SSS--EEEECSSCEEEE-SSS--EEEEE-TTT-S--EEEEEE-EETTEE---EEEEEEETTEEEEEETTS
T ss_pred ceEEEEE-ecCCcceEEEcCCCEEEEECCcc--ceEEECCcccc--eEEEEEEEECCEECCCcEeEEEEcCEEEEEeCCC
Confidence 3333321 111233344 4455677777754 89999998633 222221100 0111123345788885 5566
Q ss_pred ceEEEEEcCCccccc
Q 045821 313 CSAVVIDVGNIGNIG 327 (352)
Q Consensus 313 ~~~~~~d~~~~~w~~ 327 (352)
+.+..+|+++..-..
T Consensus 195 d~I~~Idp~tG~V~~ 209 (264)
T PF05096_consen 195 DRIVRIDPETGKVVG 209 (264)
T ss_dssp SEEEEEETTT-BEEE
T ss_pred CeEEEEeCCCCeEEE
Confidence 677789999876433
No 72
>PRK04792 tolB translocation protein TolB; Provisional
Probab=96.83 E-value=0.45 Score=44.82 Aligned_cols=148 Identities=11% Similarity=0.065 Sum_probs=82.3
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE-
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK- 213 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~- 213 (352)
...++++|..+++-+.+...+..........-+..|++....+....++.+|..+++.+++..... ........-+|+
T Consensus 241 ~~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~~~lt~~~~-~~~~p~wSpDG~~ 319 (448)
T PRK04792 241 KAEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKALTRITRHRA-IDTEPSWHPDGKS 319 (448)
T ss_pred CcEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCeEECccCCC-CccceEECCCCCE
Confidence 367999999988776665544221111111113356555433333679999999998887754211 111122333554
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEEccC
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSPVGR 287 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~~~~ 287 (352)
|++. ... .....++.+|..+++++.+..... ........-++ .|++.+...+ ..++++|+.++....+..
T Consensus 320 I~f~-s~~--~g~~~Iy~~dl~~g~~~~Lt~~g~-~~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~~~lt~ 391 (448)
T PRK04792 320 LIFT-SER--GGKPQIYRVNLASGKVSRLTFEGE-QNLGGSITPDGRSMIMVNRTNGKFNIARQDLETGAMQVLTS 391 (448)
T ss_pred EEEE-ECC--CCCceEEEEECCCCCEEEEecCCC-CCcCeeECCCCCEEEEEEecCCceEEEEEECCCCCeEEccC
Confidence 4443 221 122478999999888887742111 11122333344 5555543322 478999999988877653
No 73
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=96.72 E-value=0.35 Score=42.16 Aligned_cols=224 Identities=15% Similarity=0.076 Sum_probs=102.9
Q ss_pred EEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCee
Q 045821 70 WIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTW 148 (352)
Q Consensus 70 ~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W 148 (352)
.+|+.++.+ -.+.++|+ .++.-...-.... ..+ ......++ .+|+.++. ...+.+||..+++.
T Consensus 2 ~~~~s~~~d----~~v~~~d~--~t~~~~~~~~~~~--~~~--~l~~~~dg~~l~~~~~~------~~~v~~~d~~~~~~ 65 (300)
T TIGR03866 2 KAYVSNEKD----NTISVIDT--ATLEVTRTFPVGQ--RPR--GITLSKDGKLLYVCASD------SDTIQVIDLATGEV 65 (300)
T ss_pred cEEEEecCC----CEEEEEEC--CCCceEEEEECCC--CCC--ceEECCCCCEEEEEECC------CCeEEEEECCCCcE
Confidence 466665531 25556687 6655332222211 111 11122233 56776642 35688999988765
Q ss_pred Ee-cCCCCCcceeeeeEEE-C-CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCccc-ceeEEEECCEEEEEccCCCCC
Q 045821 149 TD-AAPMSTARCYFPCGVL-N-QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTE-IEDSFVMDGKIYIRCSASAAT 224 (352)
Q Consensus 149 ~~-~~~~~~~r~~~~~~~~-~-~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~-~~~~~~~~g~iyv~GG~~~~~ 224 (352)
.. ++....+. ..+.. + +.+|+.++.+ ..+.+||..+.+- +...+.... ...+..-+|++++++....
T Consensus 66 ~~~~~~~~~~~---~~~~~~~g~~l~~~~~~~--~~l~~~d~~~~~~--~~~~~~~~~~~~~~~~~dg~~l~~~~~~~-- 136 (300)
T TIGR03866 66 IGTLPSGPDPE---LFALHPNGKILYIANEDD--NLVTVIDIETRKV--LAEIPVGVEPEGMAVSPDGKIVVNTSETT-- 136 (300)
T ss_pred EEeccCCCCcc---EEEECCCCCEEEEEcCCC--CeEEEEECCCCeE--EeEeeCCCCcceEEECCCCCEEEEEecCC--
Confidence 43 32211221 12222 3 4566665433 5789999987642 211111111 1122333677766643321
Q ss_pred CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceE-Ecc----CCCCCcCCCCceE
Q 045821 225 SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWS-PVG----RLSTLLTRPPCKL 299 (352)
Q Consensus 225 ~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~-~~~----~~~~~~~~~~~~~ 299 (352)
..+..||..+.+-..... .........+..+++.+++++..+..+.+||.++.+.. .+. ..+.. ......+
T Consensus 137 --~~~~~~d~~~~~~~~~~~-~~~~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~-~~~~~~i 212 (300)
T TIGR03866 137 --NMAHFIDTKTYEIVDNVL-VDQRPRFAEFTADGKELWVSSEIGGTVSVIDVATRKVIKKITFEIPGVHPE-AVQPVGI 212 (300)
T ss_pred --CeEEEEeCCCCeEEEEEE-cCCCccEEEECCCCCEEEEEcCCCCEEEEEEcCcceeeeeeeecccccccc-cCCccce
Confidence 235556776543322100 11111112233456555555443348999999876532 221 11110 0111122
Q ss_pred --EEeCCEEEEE-cCCceEEEEEcCC
Q 045821 300 --VAIGKTIFVI-GKGCSAVVIDVGN 322 (352)
Q Consensus 300 --~~~~~~i~v~-gG~~~~~~~d~~~ 322 (352)
..-+..+|+. ++.+...++|.++
T Consensus 213 ~~s~dg~~~~~~~~~~~~i~v~d~~~ 238 (300)
T TIGR03866 213 KLTKDGKTAFVALGPANRVAVVDAKT 238 (300)
T ss_pred EECCCCCEEEEEcCCCCeEEEEECCC
Confidence 2224455664 4455677898864
No 74
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=96.61 E-value=0.058 Score=48.78 Aligned_cols=148 Identities=11% Similarity=0.144 Sum_probs=91.6
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC--cccceeEEEECCE-EEEEccCCCCCCceEEEEEeCCCCceeecC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI--FTEIEDSFVMDGK-IYIRCSASAATSHVCALVYEPSTDSWLHAD 243 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~--~~~~~~~~~~~g~-iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~ 243 (352)
...+.+++|.+..-.++.-|-+++. .+..+.. .+-..+...-+|. ..+.+|.. ..++.||..+.+-+++.
T Consensus 224 ~~plllvaG~d~~lrifqvDGk~N~--~lqS~~l~~fPi~~a~f~p~G~~~i~~s~rr-----ky~ysyDle~ak~~k~~ 296 (514)
T KOG2055|consen 224 TAPLLLVAGLDGTLRIFQVDGKVNP--KLQSIHLEKFPIQKAEFAPNGHSVIFTSGRR-----KYLYSYDLETAKVTKLK 296 (514)
T ss_pred CCceEEEecCCCcEEEEEecCccCh--hheeeeeccCccceeeecCCCceEEEecccc-----eEEEEeecccccccccc
Confidence 3568888888754555555666665 3333222 1222222223555 44454444 36899999999998886
Q ss_pred CCCCC---CCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEc
Q 045821 244 ANMAS---GWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDV 320 (352)
Q Consensus 244 ~~~~~---~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~ 320 (352)
++... ......+..++...++.|..+ .|+..-..+++|..--.++. ....+++...+.+|++.||.-.++++|+
T Consensus 297 ~~~g~e~~~~e~FeVShd~~fia~~G~~G-~I~lLhakT~eli~s~KieG--~v~~~~fsSdsk~l~~~~~~GeV~v~nl 373 (514)
T KOG2055|consen 297 PPYGVEEKSMERFEVSHDSNFIAIAGNNG-HIHLLHAKTKELITSFKIEG--VVSDFTFSSDSKELLASGGTGEVYVWNL 373 (514)
T ss_pred CCCCcccchhheeEecCCCCeEEEcccCc-eEEeehhhhhhhhheeeecc--EEeeEEEecCCcEEEEEcCCceEEEEec
Confidence 54222 222334556777777777777 88888888888853322332 1233444455678999999889999999
Q ss_pred CCcc
Q 045821 321 GNIG 324 (352)
Q Consensus 321 ~~~~ 324 (352)
.++.
T Consensus 374 ~~~~ 377 (514)
T KOG2055|consen 374 RQNS 377 (514)
T ss_pred CCcc
Confidence 9875
No 75
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=96.58 E-value=0.19 Score=43.79 Aligned_cols=106 Identities=11% Similarity=0.234 Sum_probs=65.1
Q ss_pred CCCeEEEEECCCCeeEecCCCCCcceeeeeEEE-CCEEEEEeCCCC----CCceEEEeCCCCcEEEecCCC--Cccccee
Q 045821 134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQKIYCIGGLGD----THSWDVYDPRTNNWKLHTEPN--IFTEIED 206 (352)
Q Consensus 134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~----~~~v~~yd~~t~~W~~~~~~~--~~~~~~~ 206 (352)
.-..+..||..+.+|..+..--.... .++... ++++|+.|-... ...+-.||..+.+|..++.-. .-.....
T Consensus 14 ~C~~lC~yd~~~~qW~~~g~~i~G~V-~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~~~~~~~s~~ipgpv~ 92 (281)
T PF12768_consen 14 PCPGLCLYDTDNSQWSSPGNGISGTV-TDLQWASNNQLLVGGNFTLNGTNSSNLATYDFKNQTWSSLGGGSSNSIPGPVT 92 (281)
T ss_pred CCCEEEEEECCCCEeecCCCCceEEE-EEEEEecCCEEEEEEeeEECCCCceeEEEEecCCCeeeecCCcccccCCCcEE
Confidence 35678899999999999875422221 233333 677888775322 167889999999998887521 1111112
Q ss_pred EE-E---ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCC
Q 045821 207 SF-V---MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADA 244 (352)
Q Consensus 207 ~~-~---~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~ 244 (352)
+. + -.+++++.|.. ......+..|| ..+|+.+..
T Consensus 93 a~~~~~~d~~~~~~aG~~--~~g~~~l~~~d--Gs~W~~i~~ 130 (281)
T PF12768_consen 93 ALTFISNDGSNFWVAGRS--ANGSTFLMKYD--GSSWSSIGS 130 (281)
T ss_pred EEEeeccCCceEEEecee--cCCCceEEEEc--CCceEeccc
Confidence 22 2 13468887654 23344677774 458988865
No 76
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=96.57 E-value=0.12 Score=46.86 Aligned_cols=188 Identities=12% Similarity=0.141 Sum_probs=100.4
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
..-+++++| ..+...+|..|- .++. .+..+.-.-.|......+..+....+++|. ..-++.||..+.+
T Consensus 224 ~~plllvaG--~d~~lrifqvDG--k~N~--~lqS~~l~~fPi~~a~f~p~G~~~i~~s~r------rky~ysyDle~ak 291 (514)
T KOG2055|consen 224 TAPLLLVAG--LDGTLRIFQVDG--KVNP--KLQSIHLEKFPIQKAEFAPNGHSVIFTSGR------RKYLYSYDLETAK 291 (514)
T ss_pred CCceEEEec--CCCcEEEEEecC--ccCh--hheeeeeccCccceeeecCCCceEEEeccc------ceEEEEeeccccc
Confidence 345666666 345677888887 6665 333332110222333333334436666654 3457899999998
Q ss_pred eEecCCCCC-c---ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCC
Q 045821 148 WTDAAPMST-A---RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASA 222 (352)
Q Consensus 148 W~~~~~~~~-~---r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~ 222 (352)
-..+.++.. + ...|.+. .++.+.++-|.. .-+......|+.|-.--.++..... .....++ .|++.||..
T Consensus 292 ~~k~~~~~g~e~~~~e~FeVS-hd~~fia~~G~~--G~I~lLhakT~eli~s~KieG~v~~-~~fsSdsk~l~~~~~~G- 366 (514)
T KOG2055|consen 292 VTKLKPPYGVEEKSMERFEVS-HDSNFIAIAGNN--GHIHLLHAKTKELITSFKIEGVVSD-FTFSSDSKELLASGGTG- 366 (514)
T ss_pred cccccCCCCcccchhheeEec-CCCCeEEEcccC--ceEEeehhhhhhhhheeeeccEEee-EEEecCCcEEEEEcCCc-
Confidence 888876531 1 1223333 344556666644 3455666677777432222222111 2223344 567775544
Q ss_pred CCCceEEEEEeCCCCceeecCCCCCCCCCCcEE--EECCEEEEEecCCCCeEEEEeCCC
Q 045821 223 ATSHVCALVYEPSTDSWLHADANMASGWRGPAV--VVDDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 223 ~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~--~~~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
.||++|..++.-...-. -....++... ..++.++.+|...| -|-+||.++
T Consensus 367 -----eV~v~nl~~~~~~~rf~-D~G~v~gts~~~S~ng~ylA~GS~~G-iVNIYd~~s 418 (514)
T KOG2055|consen 367 -----EVYVWNLRQNSCLHRFV-DDGSVHGTSLCISLNGSYLATGSDSG-IVNIYDGNS 418 (514)
T ss_pred -----eEEEEecCCcceEEEEe-ecCccceeeeeecCCCceEEeccCcc-eEEEeccch
Confidence 59999998875332211 1112233332 36778777776665 677887554
No 77
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=96.57 E-value=0.37 Score=46.19 Aligned_cols=285 Identities=14% Similarity=0.101 Sum_probs=137.3
Q ss_pred CCCccCCCcHHHHHHhhccCccccchhhhhhhhhhcccccChhh-HHH-HhhCCCC----CCEEEEEEecCCCCceeEEE
Q 045821 14 VVPLISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEW-CAY-RRKHNLD----ETWIYAFCRDNKLERVCCYV 87 (352)
Q Consensus 14 ~~~~~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~-~~~-~~~~~~~----~~~l~~~gg~~~~~~~~~~~ 87 (352)
....+..||.|+...||..++.+.+..++.+|+.|+.+...... .+. +...... +..+-...+.........+.
T Consensus 104 ~~dfi~~lp~el~~~il~~Ld~~~l~~~~~v~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ks~~~ 183 (537)
T KOG0274|consen 104 QRDFLSLLPSELSLHILSFLDGRDLLAVRQVCRNWNKLLDDDKVWWRMCRELIGRLPPKCEKGLPLKSGFKGRPWKSFYR 183 (537)
T ss_pred ccchhhcccchhcccccccCCHHHhhhhhhhcchhhhhhhccchhhhhhhhhcccCCcccCcccccccccccchhhhhhh
Confidence 35567889999999999999999999999999999988754432 211 1111111 11111111100000000000
Q ss_pred -ecCCCCCCCeeEeccCCCc-ccC--CcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeee
Q 045821 88 -LDPNSTRRSWKMIHELPAR-TLR--RKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPC 163 (352)
Q Consensus 88 -~d~~~~~~~w~~~~~~~~~-~~~--~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~ 163 (352)
..- ....|......... ..+ ........+.+.. +.-| .....+.+||..+..-...+.....-.-.+.
T Consensus 184 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~-~~~~-----s~~~tl~~~~~~~~~~i~~~l~GH~g~V~~l 255 (537)
T KOG0274|consen 184 RRFR--LSKNWRKLFRRGYKVLLGTDDHVVLCLQLHDGF-FKSG-----SDDSTLHLWDLNNGYLILTRLVGHFGGVWGL 255 (537)
T ss_pred hhhh--ccccccccccccceeecccCcchhhhheeecCe-EEec-----CCCceeEEeecccceEEEeeccCCCCCceeE
Confidence 011 23344443322100 000 0111111111111 1122 2334556777777654433211111111222
Q ss_pred EEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCceee
Q 045821 164 GVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWLH 241 (352)
Q Consensus 164 ~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~ 241 (352)
... ++.+++.|..+ ..+-+.|..++....+- . .+.....+.. .+...+.|+.+ +.+.++|..+.+-..
T Consensus 256 ~~~~~~~~lvsgS~D--~t~rvWd~~sg~C~~~l--~-gh~stv~~~~~~~~~~~sgs~D-----~tVkVW~v~n~~~l~ 325 (537)
T KOG0274|consen 256 AFPSGGDKLVSGSTD--KTERVWDCSTGECTHSL--Q-GHTSSVRCLTIDPFLLVSGSRD-----NTVKVWDVTNGACLN 325 (537)
T ss_pred EEecCCCEEEEEecC--CcEEeEecCCCcEEEEe--c-CCCceEEEEEccCceEeeccCC-----ceEEEEeccCcceEE
Confidence 222 35666666655 67778888888776542 2 2222222333 33333333333 367788887666554
Q ss_pred cCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCC-EEEEEcCCce-EEEEE
Q 045821 242 ADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGK-TIFVIGKGCS-AVVID 319 (352)
Q Consensus 242 ~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~-~i~v~gG~~~-~~~~d 319 (352)
+-.. +...--.+..++.+.+.|.+++ .|-+||+.+.+. +..+.. ....-.++ .+++ ..++=|+.++ +.+.|
T Consensus 326 l~~~--h~~~V~~v~~~~~~lvsgs~d~-~v~VW~~~~~~c--l~sl~g-H~~~V~sl-~~~~~~~~~Sgs~D~~IkvWd 398 (537)
T KOG0274|consen 326 LLRG--HTGPVNCVQLDEPLLVSGSYDG-TVKVWDPRTGKC--LKSLSG-HTGRVYSL-IVDSENRLLSGSLDTTIKVWD 398 (537)
T ss_pred Eecc--ccccEEEEEecCCEEEEEecCc-eEEEEEhhhcee--eeeecC-CcceEEEE-EecCcceEEeeeeccceEeec
Confidence 4221 1111112456788888888887 899999985444 333332 12222233 3444 4444444443 44678
Q ss_pred cCCc
Q 045821 320 VGNI 323 (352)
Q Consensus 320 ~~~~ 323 (352)
+++.
T Consensus 399 l~~~ 402 (537)
T KOG0274|consen 399 LRTK 402 (537)
T ss_pred CCch
Confidence 7776
No 78
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.53 E-value=0.42 Score=40.65 Aligned_cols=144 Identities=11% Similarity=0.107 Sum_probs=64.5
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCcee-ecCC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWL-HADA 244 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~ 244 (352)
++..+++++.+ ..+.+||..++.-...-..... ........ ++++++.++.. ..+..||..+.+-. .+..
T Consensus 62 ~~~~l~~~~~~--~~i~i~~~~~~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~~-----~~i~~~~~~~~~~~~~~~~ 133 (289)
T cd00200 62 DGTYLASGSSD--KTIRLWDLETGECVRTLTGHTS-YVSSVAFSPDGRILSSSSRD-----KTIKVWDVETGKCLTTLRG 133 (289)
T ss_pred CCCEEEEEcCC--CeEEEEEcCcccceEEEeccCC-cEEEEEEcCCCCEEEEecCC-----CeEEEEECCCcEEEEEecc
Confidence 34455555544 6788999887532211111111 11112222 34666665423 25778888744332 2221
Q ss_pred CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceE-EccCCCCCcCCCCceEEE-eCC-EEEEEcCCceEEEEEcC
Q 045821 245 NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWS-PVGRLSTLLTRPPCKLVA-IGK-TIFVIGKGCSAVVIDVG 321 (352)
Q Consensus 245 ~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~-~~~-~i~v~gG~~~~~~~d~~ 321 (352)
............++.+++.+...+ .+.+||..+.+-. .+.. . .....++.. .++ .+++.+....+.+||..
T Consensus 134 -~~~~i~~~~~~~~~~~l~~~~~~~-~i~i~d~~~~~~~~~~~~-~---~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~ 207 (289)
T cd00200 134 -HTDWVNSVAFSPDGTFVASSSQDG-TIKLWDLRTGKCVATLTG-H---TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS 207 (289)
T ss_pred -CCCcEEEEEEcCcCCEEEEEcCCC-cEEEEEccccccceeEec-C---ccccceEEECCCcCEEEEecCCCcEEEEECC
Confidence 111111111222345555554344 8999998754321 2211 1 111122222 233 45554444456688887
Q ss_pred Ccc
Q 045821 322 NIG 324 (352)
Q Consensus 322 ~~~ 324 (352)
...
T Consensus 208 ~~~ 210 (289)
T cd00200 208 TGK 210 (289)
T ss_pred CCc
Confidence 533
No 79
>PRK13684 Ycf48-like protein; Provisional
Probab=96.53 E-value=0.57 Score=42.20 Aligned_cols=212 Identities=15% Similarity=0.162 Sum_probs=104.1
Q ss_pred CCCCeeEecc-CCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCC-CCcceeeeeEEE-CCE
Q 045821 93 TRRSWKMIHE-LPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPM-STARCYFPCGVL-NQK 169 (352)
Q Consensus 93 ~~~~w~~~~~-~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~-~~~r~~~~~~~~-~~~ 169 (352)
...+|++... ++. ....-..+...++..|+.|.. ..++.=+=.-.+|+.+... ..+........+ ++.
T Consensus 74 gG~tW~~~~~~~~~--~~~~l~~v~~~~~~~~~~G~~-------g~i~~S~DgG~tW~~~~~~~~~~~~~~~i~~~~~~~ 144 (334)
T PRK13684 74 GGETWEERSLDLPE--ENFRLISISFKGDEGWIVGQP-------SLLLHTTDGGKNWTRIPLSEKLPGSPYLITALGPGT 144 (334)
T ss_pred CCCCceECccCCcc--cccceeeeEEcCCcEEEeCCC-------ceEEEECCCCCCCeEccCCcCCCCCceEEEEECCCc
Confidence 4678998753 221 111222333345556665421 2233311123479888532 122222333333 345
Q ss_pred EEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEE-EeCCCCceeecCCCCC
Q 045821 170 IYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALV-YEPSTDSWLHADANMA 247 (352)
Q Consensus 170 iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~-yd~~~~~W~~~~~~~~ 247 (352)
+|+.|.. ..+++=+-.-.+|+.+...... ....+... ++.++++ |..+ .++. .|....+|+.+.....
T Consensus 145 ~~~~g~~---G~i~~S~DgG~tW~~~~~~~~g-~~~~i~~~~~g~~v~~-g~~G-----~i~~s~~~gg~tW~~~~~~~~ 214 (334)
T PRK13684 145 AEMATNV---GAIYRTTDGGKNWEALVEDAAG-VVRNLRRSPDGKYVAV-SSRG-----NFYSTWEPGQTAWTPHQRNSS 214 (334)
T ss_pred ceeeecc---ceEEEECCCCCCceeCcCCCcc-eEEEEEECCCCeEEEE-eCCc-----eEEEEcCCCCCeEEEeeCCCc
Confidence 6666643 2344444446789988653322 22233333 4444444 3332 2333 2445568998865433
Q ss_pred CCCCCcEEEECCEEEEEecCCCCeEEEE--eCCCCceEEccCCCCC-cCCCCceEEEe-CCEEEEEcCCceEEEEEcCCc
Q 045821 248 SGWRGPAVVVDDALYVLDQSSGTKLMMW--QKESREWSPVGRLSTL-LTRPPCKLVAI-GKTIFVIGKGCSAVVIDVGNI 323 (352)
Q Consensus 248 ~~~~~~~~~~~~~l~~~gg~~~~~v~~y--d~~~~~W~~~~~~~~~-~~~~~~~~~~~-~~~i~v~gG~~~~~~~d~~~~ 323 (352)
......+...++.++++|.. + .+ ++ +-...+|+.+.. |.. .....+++... ++.+++.|..-.++.-...-.
T Consensus 215 ~~l~~i~~~~~g~~~~vg~~-G-~~-~~~s~d~G~sW~~~~~-~~~~~~~~l~~v~~~~~~~~~~~G~~G~v~~S~d~G~ 290 (334)
T PRK13684 215 RRLQSMGFQPDGNLWMLARG-G-QI-RFNDPDDLESWSKPII-PEITNGYGYLDLAYRTPGEIWAGGGNGTLLVSKDGGK 290 (334)
T ss_pred ccceeeeEcCCCCEEEEecC-C-EE-EEccCCCCCccccccC-CccccccceeeEEEcCCCCEEEEcCCCeEEEeCCCCC
Confidence 33344444557888888754 3 33 34 344568997642 211 11222333333 667888876544444344455
Q ss_pred cccc
Q 045821 324 GNIG 327 (352)
Q Consensus 324 ~w~~ 327 (352)
+|..
T Consensus 291 tW~~ 294 (334)
T PRK13684 291 TWEK 294 (334)
T ss_pred CCeE
Confidence 7865
No 80
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.51 E-value=0.43 Score=40.57 Aligned_cols=183 Identities=10% Similarity=0.083 Sum_probs=82.7
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeE-ecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEec
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWT-DAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHT 196 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~-~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~ 196 (352)
++..+++++. ...+.+||..+++.. .+..... .-..+... ++.+++.++.+ ..+.+||..+..-...-
T Consensus 62 ~~~~l~~~~~------~~~i~i~~~~~~~~~~~~~~~~~--~i~~~~~~~~~~~~~~~~~~--~~i~~~~~~~~~~~~~~ 131 (289)
T cd00200 62 DGTYLASGSS------DKTIRLWDLETGECVRTLTGHTS--YVSSVAFSPDGRILSSSSRD--KTIKVWDVETGKCLTTL 131 (289)
T ss_pred CCCEEEEEcC------CCeEEEEEcCcccceEEEeccCC--cEEEEEEcCCCCEEEEecCC--CeEEEEECCCcEEEEEe
Confidence 3345555553 356788888775321 1211111 11112222 34566666533 67889999865432221
Q ss_pred CCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCce-eecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEE
Q 045821 197 EPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSW-LHADANMASGWRGPAVVVDDALYVLDQSSGTKLMM 274 (352)
Q Consensus 197 ~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~ 274 (352)
.. ........... ++.+++.|+.. ..+..||..+.+- ..+.. ...........-+++.+++++..+ .+.+
T Consensus 132 ~~-~~~~i~~~~~~~~~~~l~~~~~~-----~~i~i~d~~~~~~~~~~~~-~~~~i~~~~~~~~~~~l~~~~~~~-~i~i 203 (289)
T cd00200 132 RG-HTDWVNSVAFSPDGTFVASSSQD-----GTIKLWDLRTGKCVATLTG-HTGEVNSVAFSPDGEKLLSSSSDG-TIKL 203 (289)
T ss_pred cc-CCCcEEEEEEcCcCCEEEEEcCC-----CcEEEEEccccccceeEec-CccccceEEECCCcCEEEEecCCC-cEEE
Confidence 11 11111122223 34555553312 2477788764332 22211 111111112223444555665544 8999
Q ss_pred EeCCCCceEEccCCCCCcCCCCceEEEe-CCEEEEEcC-CceEEEEEcCC
Q 045821 275 WQKESREWSPVGRLSTLLTRPPCKLVAI-GKTIFVIGK-GCSAVVIDVGN 322 (352)
Q Consensus 275 yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~~~i~v~gG-~~~~~~~d~~~ 322 (352)
||..+.+.... +.. ......++... ++.+++.++ ...+.+||..+
T Consensus 204 ~d~~~~~~~~~--~~~-~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~ 250 (289)
T cd00200 204 WDLSTGKCLGT--LRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250 (289)
T ss_pred EECCCCceecc--hhh-cCCceEEEEEcCCCcEEEEEcCCCcEEEEEcCC
Confidence 99976443222 211 01112223332 355666665 44567888875
No 81
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=96.49 E-value=0.48 Score=40.89 Aligned_cols=169 Identities=12% Similarity=0.119 Sum_probs=95.9
Q ss_pred eccCCCcccCCcceEEEEECCEEEEEeCCCCC----------------CCCCCeEEEEECCCCe----eEecCCCCCcce
Q 045821 100 IHELPARTLRRKGMGFEVLGKNAYLLGGCGWS----------------EDATSEVYCYDASMNT----WTDAAPMSTARC 159 (352)
Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~----------------~~~~~~~~~~d~~t~~----W~~~~~~~~~r~ 159 (352)
+.+.|.. ....+-++..+++.|| |||+--. ....+.+..||..+++ |++--.-+...+
T Consensus 28 vG~~P~S-GGDTYNAV~~vDd~Iy-FGGWVHAPa~y~gk~~g~~~IdF~NKYSHVH~yd~e~~~VrLLWkesih~~~~Wa 105 (339)
T PF09910_consen 28 VGPPPTS-GGDTYNAVEWVDDFIY-FGGWVHAPAVYEGKGDGRATIDFRNKYSHVHEYDTENDSVRLLWKESIHDKTKWA 105 (339)
T ss_pred ccCCCCC-CCccceeeeeecceEE-EeeeecCCceeeeccCCceEEEEeeccceEEEEEcCCCeEEEEEecccCCccccc
Confidence 4444432 3455566777777777 5775211 1235678999988887 654433332222
Q ss_pred eeeeEE----ECCEEEEEeCCCCC-CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeC
Q 045821 160 YFPCGV----LNQKIYCIGGLGDT-HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEP 234 (352)
Q Consensus 160 ~~~~~~----~~~~iyv~gG~~~~-~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~ 234 (352)
+-.+-. +++++++.=+-... -.++..|..++.=+++..-|.. -.+.++|...|-+ .+.......+.+||+
T Consensus 106 GEVSdIlYdP~~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~~ps~---KG~~~~D~a~F~i--~~~~~g~~~i~~~Dl 180 (339)
T PF09910_consen 106 GEVSDILYDPYEDRLLLARADGHANLGVYSLDRRTGKAEKLSSNPSL---KGTLVHDYACFGI--NNFHKGVSGIHCLDL 180 (339)
T ss_pred cchhheeeCCCcCEEEEEecCCcceeeeEEEcccCCceeeccCCCCc---CceEeeeeEEEec--cccccCCceEEEEEc
Confidence 211111 25677776443322 6778888888887777643332 2233344444433 333344558999999
Q ss_pred CCCce--eecCCC-------CCCCCCCcEEEECCEEEEE-ecCCCCeEEEEeCCC
Q 045821 235 STDSW--LHADAN-------MASGWRGPAVVVDDALYVL-DQSSGTKLMMWQKES 279 (352)
Q Consensus 235 ~~~~W--~~~~~~-------~~~~~~~~~~~~~~~l~~~-gg~~~~~v~~yd~~~ 279 (352)
.+++| ...+.. ...+..+.++...+++|.+ +| -+.+.||..
T Consensus 181 i~~~~~~e~f~~~~s~Dg~~~~~~~~G~~~s~ynR~faF~rG----Gi~vgnP~~ 231 (339)
T PF09910_consen 181 ISGKWVIESFDVSLSVDGGPVIRPELGAMASAYNRLFAFVRG----GIFVGNPYN 231 (339)
T ss_pred cCCeEEEEecccccCCCCCceEeeccccEEEEeeeEEEEEec----cEEEeCCCC
Confidence 99999 444321 1123334456677777755 44 588888883
No 82
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.46 E-value=0.95 Score=43.92 Aligned_cols=154 Identities=12% Similarity=0.112 Sum_probs=86.9
Q ss_pred EECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCc-EE
Q 045821 117 VLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNN-WK 193 (352)
Q Consensus 117 ~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~-W~ 193 (352)
..++.+.+.|+.++ .+-+||..++-.... +...-..++++.+ .++..+....+ ..|-.+|...-. ++
T Consensus 359 SpDgq~iaTG~eDg------KVKvWn~~SgfC~vT--FteHts~Vt~v~f~~~g~~llssSLD--GtVRAwDlkRYrNfR 428 (893)
T KOG0291|consen 359 SPDGQLIATGAEDG------KVKVWNTQSGFCFVT--FTEHTSGVTAVQFTARGNVLLSSSLD--GTVRAWDLKRYRNFR 428 (893)
T ss_pred CCCCcEEEeccCCC------cEEEEeccCceEEEE--eccCCCceEEEEEEecCCEEEEeecC--CeEEeeeecccceee
Confidence 33667777776544 477788876642221 1122223333332 55666655554 567777776543 55
Q ss_pred EecCCCCcccceeEEEEC--CEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCe
Q 045821 194 LHTEPNIFTEIEDSFVMD--GKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTK 271 (352)
Q Consensus 194 ~~~~~~~~~~~~~~~~~~--g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~ 271 (352)
....|.+... .++.+| |.|.+.|+.+ ...|++++.++++--.+=.....+-++.++...+.+++-|..+. .
T Consensus 429 Tft~P~p~Qf--scvavD~sGelV~AG~~d----~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWDk-T 501 (893)
T KOG0291|consen 429 TFTSPEPIQF--SCVAVDPSGELVCAGAQD----SFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWDK-T 501 (893)
T ss_pred eecCCCceee--eEEEEcCCCCEEEeeccc----eEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEeccccc-e
Confidence 5543333222 334444 8999987554 24688888888877666443444444444555666777776665 7
Q ss_pred EEEEeCCCCceEEccCC
Q 045821 272 LMMWQKESREWSPVGRL 288 (352)
Q Consensus 272 v~~yd~~~~~W~~~~~~ 288 (352)
|-+||.-. +|.++..+
T Consensus 502 VRiW~if~-s~~~vEtl 517 (893)
T KOG0291|consen 502 VRIWDIFS-SSGTVETL 517 (893)
T ss_pred EEEEEeec-cCceeeeE
Confidence 88888764 34444433
No 83
>PF03089 RAG2: Recombination activating protein 2; InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end. The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events. The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=96.32 E-value=0.59 Score=40.05 Aligned_cols=70 Identities=20% Similarity=0.346 Sum_probs=51.3
Q ss_pred cCCcceEEEEEC--C--EEEEEeCCCCC-------------CCCCCeEEEEECCCCeeE--ecCCCCCcceeeeeEEECC
Q 045821 108 LRRKGMGFEVLG--K--NAYLLGGCGWS-------------EDATSEVYCYDASMNTWT--DAAPMSTARCYFPCGVLNQ 168 (352)
Q Consensus 108 ~~~~~~~~~~~~--~--~iyv~GG~~~~-------------~~~~~~~~~~d~~t~~W~--~~~~~~~~r~~~~~~~~~~ 168 (352)
.+|++|++.++. | .+.+|||+.-. ......++.+|+.-+..+ .++.+....++|.+..-++
T Consensus 86 ~aRYGHt~~vV~SrGKta~VlFGGRSY~P~~qRTTenWNsVvDC~P~VfLiDleFGC~tah~lpEl~dG~SFHvslar~D 165 (337)
T PF03089_consen 86 EARYGHTINVVHSRGKTACVLFGGRSYMPPGQRTTENWNSVVDCPPQVFLIDLEFGCCTAHTLPELQDGQSFHVSLARND 165 (337)
T ss_pred cccccceEEEEEECCcEEEEEECCcccCCccccchhhcceeccCCCeEEEEeccccccccccchhhcCCeEEEEEEecCc
Confidence 789999988872 2 47888995321 124457788888877544 4566677788888888899
Q ss_pred EEEEEeCCC
Q 045821 169 KIYCIGGLG 177 (352)
Q Consensus 169 ~iyv~gG~~ 177 (352)
.+|++||..
T Consensus 166 ~VYilGGHs 174 (337)
T PF03089_consen 166 CVYILGGHS 174 (337)
T ss_pred eEEEEccEE
Confidence 999999954
No 84
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=96.29 E-value=0.5 Score=38.89 Aligned_cols=145 Identities=13% Similarity=0.112 Sum_probs=77.9
Q ss_pred EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCC-CC-CcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCC
Q 045821 115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAP-MS-TARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPR 188 (352)
Q Consensus 115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~-~~-~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~ 188 (352)
+....+.+|+|-| ..+|+++..... -+.+.. .+ .|..--++... ++++|++-| +.+++|+..
T Consensus 12 ~~~~~g~~y~FkG--------~~~w~~~~~~~~~~p~~I~~~w~~~p~~IDAa~~~~~~~~~yfFkg----~~yw~~~~~ 79 (194)
T cd00094 12 VTTLRGELYFFKG--------RYFWRLSPGKPPGSPFLISSFWPSLPSPVDAAFERPDTGKIYFFKG----DKYWVYTGK 79 (194)
T ss_pred EEEeCCEEEEEeC--------CEEEEEeCCCCCCCCeEhhhhCCCCCCCccEEEEECCCCEEEEECC----CEEEEEcCc
Confidence 3444689999976 466777764211 111211 11 11111122222 389999988 778999877
Q ss_pred CCcEE---EecCCCCc---ccceeEEEE--CCEEEEEccCCCCCCceEEEEEeCCCCceeec-CCCCCCCC------CCc
Q 045821 189 TNNWK---LHTEPNIF---TEIEDSFVM--DGKIYIRCSASAATSHVCALVYEPSTDSWLHA-DANMASGW------RGP 253 (352)
Q Consensus 189 t~~W~---~~~~~~~~---~~~~~~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~-~~~~~~~~------~~~ 253 (352)
+.... .+...... ....++... ++++|++-|.. .++||...++...- +......+ .-+
T Consensus 80 ~~~~~~Pk~i~~~~~~~~~~~iDAA~~~~~~~~~yfFkg~~-------y~ry~~~~~~v~~~yP~~i~~~w~g~p~~ida 152 (194)
T cd00094 80 NLEPGYPKPISDLGFPPTVKQIDAALRWPDNGKTYFFKGDK-------YWRYDEKTQKMDPGYPKLIETDFPGVPDKVDA 152 (194)
T ss_pred ccccCCCcchhhcCCCCCCCCccEEEEEcCCCEEEEEeCCE-------EEEEeCCCccccCCCCcchhhcCCCcCCCcce
Confidence 52221 12111111 111233333 68999997654 78888765543211 11111011 122
Q ss_pred EEEEC-CEEEEEecCCCCeEEEEeCCCCc
Q 045821 254 AVVVD-DALYVLDQSSGTKLMMWQKESRE 281 (352)
Q Consensus 254 ~~~~~-~~l~~~gg~~~~~v~~yd~~~~~ 281 (352)
+.... +++|++.|. ..++||..+++
T Consensus 153 a~~~~~~~~yfF~g~---~y~~~d~~~~~ 178 (194)
T cd00094 153 AFRWLDGYYYFFKGD---QYWRFDPRSKE 178 (194)
T ss_pred eEEeCCCcEEEEECC---EEEEEeCccce
Confidence 33344 899999886 89999998766
No 85
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=96.26 E-value=1.9 Score=45.26 Aligned_cols=192 Identities=14% Similarity=0.065 Sum_probs=103.8
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC-------C------c--ceeeeeEEE--CCEEEEEeCCCCCCc
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS-------T------A--RCYFPCGVL--NQKIYCIGGLGDTHS 181 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~-------~------~--r~~~~~~~~--~~~iyv~gG~~~~~~ 181 (352)
++.||+.... ...+.++|+.++.-+.+..-. . . ...+.+++- ++.+||..... ..
T Consensus 635 gn~LYVaDt~------n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~--~~ 706 (1057)
T PLN02919 635 KNLLYVADTE------NHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQ--HQ 706 (1057)
T ss_pred CCEEEEEeCC------CceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCC--Ce
Confidence 4678987542 245778888877655442210 0 0 011223333 57899876543 67
Q ss_pred eEEEeCCCCcEEEecCCC----------C--cccce-eEEEE-C-CEEEEEccCCCCCCceEEEEEeCCCCceeecC--C
Q 045821 182 WDVYDPRTNNWKLHTEPN----------I--FTEIE-DSFVM-D-GKIYIRCSASAATSHVCALVYEPSTDSWLHAD--A 244 (352)
Q Consensus 182 v~~yd~~t~~W~~~~~~~----------~--~~~~~-~~~~~-~-g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~--~ 244 (352)
+++||+.++....+..-. . ..... +.++. + +.||+....+ ..|.+||+.++....+. .
T Consensus 707 I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n-----~~Irv~D~~tg~~~~~~gg~ 781 (1057)
T PLN02919 707 IWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSES-----SSIRALDLKTGGSRLLAGGD 781 (1057)
T ss_pred EEEEECCCCeEEEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCC-----CeEEEEECCCCcEEEEEecc
Confidence 899998877654432100 0 00011 12222 3 4599985332 36889998766533211 0
Q ss_pred C-----C-----------CCCC---CCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCC----------CcCCC
Q 045821 245 N-----M-----------ASGW---RGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLST----------LLTRP 295 (352)
Q Consensus 245 ~-----~-----------~~~~---~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~----------~~~~~ 295 (352)
+ . .... .+.++.-+|.||+....++ .|.+||++++....+..... .....
T Consensus 782 ~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~-rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~ 860 (1057)
T PLN02919 782 PTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNH-KIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSE 860 (1057)
T ss_pred cccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCC-EEEEEECCCCeEEEEeccCCcCCCCCcccccccCC
Confidence 0 0 0000 1122344678888876554 99999999988876653211 00111
Q ss_pred CceEE-EeCCEEEEEcCCc-eEEEEEcCCcc
Q 045821 296 PCKLV-AIGKTIFVIGKGC-SAVVIDVGNIG 324 (352)
Q Consensus 296 ~~~~~-~~~~~i~v~gG~~-~~~~~d~~~~~ 324 (352)
-.+++ ..+++|||....+ .+.++|+++..
T Consensus 861 P~GIavd~dG~lyVaDt~Nn~Irvid~~~~~ 891 (1057)
T PLN02919 861 PAGLALGENGRLFVADTNNSLIRYLDLNKGE 891 (1057)
T ss_pred ceEEEEeCCCCEEEEECCCCEEEEEECCCCc
Confidence 22333 3467899886544 56689988754
No 86
>PRK00178 tolB translocation protein TolB; Provisional
Probab=96.22 E-value=1.1 Score=42.02 Aligned_cols=182 Identities=10% Similarity=0.040 Sum_probs=95.3
Q ss_pred ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821 82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY 160 (352)
Q Consensus 82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~ 160 (352)
...++++|. .++.-+.+...+. .........++ +|++....+ ...+++++|..+++.+.+......-..
T Consensus 222 ~~~l~~~~l--~~g~~~~l~~~~g----~~~~~~~SpDG~~la~~~~~~----g~~~Iy~~d~~~~~~~~lt~~~~~~~~ 291 (430)
T PRK00178 222 RPRIFVQNL--DTGRREQITNFEG----LNGAPAWSPDGSKLAFVLSKD----GNPEIYVMDLASRQLSRVTNHPAIDTE 291 (430)
T ss_pred CCEEEEEEC--CCCCEEEccCCCC----CcCCeEECCCCCEEEEEEccC----CCceEEEEECCCCCeEEcccCCCCcCC
Confidence 456788888 7777666654431 11111222233 454433221 136899999999988777643221111
Q ss_pred eeeEEEC-CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCCc
Q 045821 161 FPCGVLN-QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTDS 238 (352)
Q Consensus 161 ~~~~~~~-~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~~ 238 (352)
+. ..-+ ..|++.........++.+|..++.++++.... .........-+| .|+.. .... ....++.+|+.++.
T Consensus 292 ~~-~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~-~~~~~~~~Spdg~~i~~~-~~~~--~~~~l~~~dl~tg~ 366 (430)
T PRK00178 292 PF-WGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVG-NYNARPRLSADGKTLVMV-HRQD--GNFHVAAQDLQRGS 366 (430)
T ss_pred eE-ECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCC-CCccceEECCCCCEEEEE-EccC--CceEEEEEECCCCC
Confidence 11 1113 35655543322367899999988888775221 111111222344 45444 3221 23468999999988
Q ss_pred eeecCCCCCCCCCCcEEEECCEEEEEecCC-C-CeEEEEeCCCC
Q 045821 239 WLHADANMASGWRGPAVVVDDALYVLDQSS-G-TKLMMWQKESR 280 (352)
Q Consensus 239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~-~-~~v~~yd~~~~ 280 (352)
.+.+.... ........-+|+.+++.... + ..++..+...+
T Consensus 367 ~~~lt~~~--~~~~p~~spdg~~i~~~~~~~g~~~l~~~~~~g~ 408 (430)
T PRK00178 367 VRILTDTS--LDESPSVAPNGTMLIYATRQQGRGVLMLVSINGR 408 (430)
T ss_pred EEEccCCC--CCCCceECCCCCEEEEEEecCCceEEEEEECCCC
Confidence 88775421 11223444466555554322 2 35777777544
No 87
>PRK04792 tolB translocation protein TolB; Provisional
Probab=96.22 E-value=1.1 Score=42.20 Aligned_cols=183 Identities=11% Similarity=0.036 Sum_probs=95.8
Q ss_pred CceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcce
Q 045821 81 ERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARC 159 (352)
Q Consensus 81 ~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~ 159 (352)
+...++.+|. .++.-..+...+. ... ......+ ..|++....+ ...+++.+|..+++.+.+........
T Consensus 240 g~~~L~~~dl--~tg~~~~lt~~~g---~~~-~~~wSPDG~~La~~~~~~----g~~~Iy~~dl~tg~~~~lt~~~~~~~ 309 (448)
T PRK04792 240 RKAEIFVQDI--YTQVREKVTSFPG---ING-APRFSPDGKKLALVLSKD----GQPEIYVVDIATKALTRITRHRAIDT 309 (448)
T ss_pred CCcEEEEEEC--CCCCeEEecCCCC---CcC-CeeECCCCCEEEEEEeCC----CCeEEEEEECCCCCeEECccCCCCcc
Confidence 3456888888 7777666655441 111 1122223 4465544322 23679999999998877764321111
Q ss_pred eeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCC
Q 045821 160 YFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTD 237 (352)
Q Consensus 160 ~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~ 237 (352)
... ..-++ .|++.........++.+|..+++++++.... .........-+| .|+.. +.. .....++.+|+.++
T Consensus 310 ~p~-wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g-~~~~~~~~SpDG~~l~~~-~~~--~g~~~I~~~dl~~g 384 (448)
T PRK04792 310 EPS-WHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTFEG-EQNLGGSITPDGRSMIMV-NRT--NGKFNIARQDLETG 384 (448)
T ss_pred ceE-ECCCCCEEEEEECCCCCceEEEEECCCCCEEEEecCC-CCCcCeeECCCCCEEEEE-Eec--CCceEEEEEECCCC
Confidence 111 11244 4555443332378899999999988875211 111112333355 44444 332 22347889999988
Q ss_pred ceeecCCCCCCCCCCcEEEECCE-EEEEecCCC-CeEEEEeCCCC
Q 045821 238 SWLHADANMASGWRGPAVVVDDA-LYVLDQSSG-TKLMMWQKESR 280 (352)
Q Consensus 238 ~W~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~-~~v~~yd~~~~ 280 (352)
..+.+.... . -......-+++ |++.....+ ..++.+|.+..
T Consensus 385 ~~~~lt~~~-~-d~~ps~spdG~~I~~~~~~~g~~~l~~~~~~G~ 427 (448)
T PRK04792 385 AMQVLTSTR-L-DESPSVAPNGTMVIYSTTYQGKQVLAAVSIDGR 427 (448)
T ss_pred CeEEccCCC-C-CCCceECCCCCEEEEEEecCCceEEEEEECCCC
Confidence 887765421 1 11223444454 444333333 35777787433
No 88
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.19 E-value=0.88 Score=40.79 Aligned_cols=189 Identities=7% Similarity=0.017 Sum_probs=88.1
Q ss_pred EEEEEeCCCCCCCCCCeEEEEECCC-CeeEecCCCCCcceeeeeEEE-C-CEEEEEeCCCCCCceEEEeCC-CCcEEEec
Q 045821 121 NAYLLGGCGWSEDATSEVYCYDASM-NTWTDAAPMSTARCYFPCGVL-N-QKIYCIGGLGDTHSWDVYDPR-TNNWKLHT 196 (352)
Q Consensus 121 ~iyv~GG~~~~~~~~~~~~~~d~~t-~~W~~~~~~~~~r~~~~~~~~-~-~~iyv~gG~~~~~~v~~yd~~-t~~W~~~~ 196 (352)
.+|+..+. .+.+.+||..+ ++++.+...+.......++.- + ..+|+.+... ..+..|+.. ++.++.+.
T Consensus 3 ~~y~~~~~------~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~--~~i~~~~~~~~g~l~~~~ 74 (330)
T PRK11028 3 IVYIASPE------SQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPE--FRVLSYRIADDGALTFAA 74 (330)
T ss_pred EEEEEcCC------CCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCC--CcEEEEEECCCCceEEee
Confidence 46666432 35677788754 466665544432222222222 3 4577755422 667777775 45666554
Q ss_pred CCCCcccceeEEEE-CC-EEEEEccCCCCCCceEEEEEeCCCC-ce-eecCCCCC-CCCCCcEEEECCEEEEEecCCCCe
Q 045821 197 EPNIFTEIEDSFVM-DG-KIYIRCSASAATSHVCALVYEPSTD-SW-LHADANMA-SGWRGPAVVVDDALYVLDQSSGTK 271 (352)
Q Consensus 197 ~~~~~~~~~~~~~~-~g-~iyv~GG~~~~~~~~~i~~yd~~~~-~W-~~~~~~~~-~~~~~~~~~~~~~l~~~gg~~~~~ 271 (352)
..+.....++.+.. ++ .+|+.. .. ...+.+||..++ .- ..+..... ..-+..++.-+++.+++.....+.
T Consensus 75 ~~~~~~~p~~i~~~~~g~~l~v~~-~~----~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~~ 149 (330)
T PRK11028 75 ESPLPGSPTHISTDHQGRFLFSAS-YN----ANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKEDR 149 (330)
T ss_pred eecCCCCceEEEECCCCCEEEEEE-cC----CCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCCE
Confidence 33222222233333 44 566653 21 135666776532 11 11211111 111222233355444444444458
Q ss_pred EEEEeCCCC-ceEEcc----CCCCCcCCCCceEEEeCCEEEEEcC-CceEEEEEcCC
Q 045821 272 LMMWQKESR-EWSPVG----RLSTLLTRPPCKLVAIGKTIFVIGK-GCSAVVIDVGN 322 (352)
Q Consensus 272 v~~yd~~~~-~W~~~~----~~~~~~~~~~~~~~~~~~~i~v~gG-~~~~~~~d~~~ 322 (352)
|.+||..++ ...... ..+....-+...+..-+..+|+... ...+.+||+..
T Consensus 150 v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~ 206 (330)
T PRK11028 150 IRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKD 206 (330)
T ss_pred EEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeC
Confidence 999998763 232110 1111111111222233456888764 55677888863
No 89
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=96.13 E-value=0.31 Score=41.74 Aligned_cols=115 Identities=15% Similarity=0.193 Sum_probs=77.8
Q ss_pred ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821 210 MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLS 289 (352)
Q Consensus 210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~ 289 (352)
.+|.+|...|..+ .+.+..||+.+++-......+...+.-..+..+++||.+-=.++ ..++||+++ .+.+...+
T Consensus 54 ~~g~LyESTG~yG---~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d~l~qLTWk~~-~~f~yd~~t--l~~~~~~~ 127 (264)
T PF05096_consen 54 DDGTLYESTGLYG---QSSLRKVDLETGKVLQSVPLPPRYFGEGITILGDKLYQLTWKEG-TGFVYDPNT--LKKIGTFP 127 (264)
T ss_dssp ETTEEEEEECSTT---EEEEEEEETTTSSEEEEEE-TTT--EEEEEEETTEEEEEESSSS-EEEEEETTT--TEEEEEEE
T ss_pred CCCEEEEeCCCCC---cEEEEEEECCCCcEEEEEECCccccceeEEEECCEEEEEEecCC-eEEEEcccc--ceEEEEEe
Confidence 5789999877654 35788999999886654443444444445678999999975444 899999975 66666555
Q ss_pred CCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcccccceEEe
Q 045821 290 TLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIGGIMVS 332 (352)
Q Consensus 290 ~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~~~~~~ 332 (352)
. ..-+-+++..++.+++--|+...+..|++...-.+...++
T Consensus 128 y--~~EGWGLt~dg~~Li~SDGS~~L~~~dP~~f~~~~~i~V~ 168 (264)
T PF05096_consen 128 Y--PGEGWGLTSDGKRLIMSDGSSRLYFLDPETFKEVRTIQVT 168 (264)
T ss_dssp ---SSS--EEEECSSCEEEE-SSSEEEEE-TTT-SEEEEEE-E
T ss_pred c--CCcceEEEcCCCEEEEECCccceEEECCcccceEEEEEEE
Confidence 4 2456678877888888889999999999987755544443
No 90
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.06 E-value=1 Score=40.36 Aligned_cols=191 Identities=12% Similarity=0.098 Sum_probs=85.6
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECC-CCeeEecCCCCCcceeeeeEEE-CC-EEEEEeCCCCCCceEEEeCCCCc--EE
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDAS-MNTWTDAAPMSTARCYFPCGVL-NQ-KIYCIGGLGDTHSWDVYDPRTNN--WK 193 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~-t~~W~~~~~~~~~r~~~~~~~~-~~-~iyv~gG~~~~~~v~~yd~~t~~--W~ 193 (352)
+..+|+.+. ....+..|+.. ++++..+...+.+.....++.. ++ .+|+..-.+ ..+.+||..++. .+
T Consensus 46 ~~~lyv~~~------~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~~g~~l~v~~~~~--~~v~v~~~~~~g~~~~ 117 (330)
T PRK11028 46 KRHLYVGVR------PEFRVLSYRIADDGALTFAAESPLPGSPTHISTDHQGRFLFSASYNA--NCVSVSPLDKDGIPVA 117 (330)
T ss_pred CCEEEEEEC------CCCcEEEEEECCCCceEEeeeecCCCCceEEEECCCCCEEEEEEcCC--CeEEEEEECCCCCCCC
Confidence 345676543 12556667765 4556554433322111222222 34 566654322 677788876431 11
Q ss_pred EecCCCCcccceeEEEE-CC-EEEEEccCCCCCCceEEEEEeCCCC-ceeec-----CCCCCCCCCCcEEEECC-EEEEE
Q 045821 194 LHTEPNIFTEIEDSFVM-DG-KIYIRCSASAATSHVCALVYEPSTD-SWLHA-----DANMASGWRGPAVVVDD-ALYVL 264 (352)
Q Consensus 194 ~~~~~~~~~~~~~~~~~-~g-~iyv~GG~~~~~~~~~i~~yd~~~~-~W~~~-----~~~~~~~~~~~~~~~~~-~l~~~ 264 (352)
.+...+.....+.++.. ++ .+|+..- ....+..||..++ ..... ..+....-++.++.-++ .+|+.
T Consensus 118 ~~~~~~~~~~~~~~~~~p~g~~l~v~~~-----~~~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~ 192 (330)
T PRK11028 118 PIQIIEGLEGCHSANIDPDNRTLWVPCL-----KEDRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCV 192 (330)
T ss_pred ceeeccCCCcccEeEeCCCCCEEEEeeC-----CCCEEEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEE
Confidence 22211111111222222 34 6777632 2236888887653 22211 00111111112233344 56666
Q ss_pred ecCCCCeEEEEeCC--CCceEEc---cCCCCCc--CCCCceEE--EeCCEEEEEcC-CceEEEEEcCCc
Q 045821 265 DQSSGTKLMMWQKE--SREWSPV---GRLSTLL--TRPPCKLV--AIGKTIFVIGK-GCSAVVIDVGNI 323 (352)
Q Consensus 265 gg~~~~~v~~yd~~--~~~W~~~---~~~~~~~--~~~~~~~~--~~~~~i~v~gG-~~~~~~~d~~~~ 323 (352)
.... +.+.+||.+ +++.+.+ ..+|... .+....+. .-+..+|+... .....+|+++.+
T Consensus 193 ~~~~-~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~ 260 (330)
T PRK11028 193 NELN-SSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSED 260 (330)
T ss_pred ecCC-CEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCC
Confidence 5433 388888776 4454433 3333321 12222222 33446888653 445667887654
No 91
>PRK04922 tolB translocation protein TolB; Provisional
Probab=96.01 E-value=1.4 Score=41.37 Aligned_cols=180 Identities=12% Similarity=0.007 Sum_probs=93.4
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK 213 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~ 213 (352)
...++++|..+++-..+...+...... ...-++ +|++....+....++++|..++.-+++..-.. ........-+|+
T Consensus 227 ~~~l~~~dl~~g~~~~l~~~~g~~~~~-~~SpDG~~l~~~~s~~g~~~Iy~~d~~~g~~~~lt~~~~-~~~~~~~spDG~ 304 (433)
T PRK04922 227 RSAIYVQDLATGQRELVASFRGINGAP-SFSPDGRRLALTLSRDGNPEIYVMDLGSRQLTRLTNHFG-IDTEPTWAPDGK 304 (433)
T ss_pred CcEEEEEECCCCCEEEeccCCCCccCc-eECCCCCEEEEEEeCCCCceEEEEECCCCCeEECccCCC-CccceEECCCCC
Confidence 467899999988877776544211111 111134 56554333323689999999888766643211 111122333555
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEEC-CEEEEEecCCC-CeEEEEeCCCCceEEccCCCCC
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVD-DALYVLDQSSG-TKLMMWQKESREWSPVGRLSTL 291 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~-~~l~~~gg~~~-~~v~~yd~~~~~W~~~~~~~~~ 291 (352)
-.++.... .....++.+|..+++.+.+.... .........-+ ..|++..+..+ ..++++|+.++....+..-+.
T Consensus 305 ~l~f~sd~--~g~~~iy~~dl~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt~~~~- 380 (433)
T PRK04922 305 SIYFTSDR--GGRPQIYRVAASGGSAERLTFQG-NYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTLTPGSL- 380 (433)
T ss_pred EEEEEECC--CCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeEECCCCCC-
Confidence 33332222 11236888898888887764211 11112233334 45665544332 479999999888877653221
Q ss_pred cCCCCceEEEeCCEEEEE---cCCceEEEEEcCC
Q 045821 292 LTRPPCKLVAIGKTIFVI---GKGCSAVVIDVGN 322 (352)
Q Consensus 292 ~~~~~~~~~~~~~~i~v~---gG~~~~~~~d~~~ 322 (352)
.........+..|+.. +|....+++|...
T Consensus 381 --~~~p~~spdG~~i~~~s~~~g~~~L~~~~~~g 412 (433)
T PRK04922 381 --DESPSFAPNGSMVLYATREGGRGVLAAVSTDG 412 (433)
T ss_pred --CCCceECCCCCEEEEEEecCCceEEEEEECCC
Confidence 1122333333344433 2344566677753
No 92
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.99 E-value=0.44 Score=43.59 Aligned_cols=177 Identities=15% Similarity=0.119 Sum_probs=91.5
Q ss_pred CCeEEEEECCCCeeEe-cCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc-EEEecCCCCcccceeEEEECC
Q 045821 135 TSEVYCYDASMNTWTD-AAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN-WKLHTEPNIFTEIEDSFVMDG 212 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~-~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~-W~~~~~~~~~~~~~~~~~~~g 212 (352)
+..+..|+..+..-.. +..+.. -.+......||++..+|+.. .-|.+||..+.. -+.+..-..+.........++
T Consensus 47 S~rvqly~~~~~~~~k~~srFk~-~v~s~~fR~DG~LlaaGD~s--G~V~vfD~k~r~iLR~~~ah~apv~~~~f~~~d~ 123 (487)
T KOG0310|consen 47 SVRVQLYSSVTRSVRKTFSRFKD-VVYSVDFRSDGRLLAAGDES--GHVKVFDMKSRVILRQLYAHQAPVHVTKFSPQDN 123 (487)
T ss_pred ccEEEEEecchhhhhhhHHhhcc-ceeEEEeecCCeEEEccCCc--CcEEEeccccHHHHHHHhhccCceeEEEecccCC
Confidence 3567778776653221 221111 11222223379999999876 578889955521 111111111111112334688
Q ss_pred EEEEEccCCCCCCceEEEEEeCCCCcee-ecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCC
Q 045821 213 KIYIRCSASAATSHVCALVYEPSTDSWL-HADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTL 291 (352)
Q Consensus 213 ~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~ 291 (352)
.+++.|+-+. .+..+|.++..-+ ++......-+++.....++.|++.||+++ .|-.||..+.+ ..+-.+...
T Consensus 124 t~l~s~sDd~-----v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg-~vrl~DtR~~~-~~v~elnhg 196 (487)
T KOG0310|consen 124 TMLVSGSDDK-----VVKYWDLSTAYVQAELSGHTDYVRCGDISPANDHIVVTGSYDG-KVRLWDTRSLT-SRVVELNHG 196 (487)
T ss_pred eEEEecCCCc-----eEEEEEcCCcEEEEEecCCcceeEeeccccCCCeEEEecCCCc-eEEEEEeccCC-ceeEEecCC
Confidence 9999975442 3444555555422 22222222233344456888999999998 99999999874 333333221
Q ss_pred cCCCCceEEE-eCCEEEEEcCCceEEEEEcCCc
Q 045821 292 LTRPPCKLVA-IGKTIFVIGKGCSAVVIDVGNI 323 (352)
Q Consensus 292 ~~~~~~~~~~-~~~~i~v~gG~~~~~~~d~~~~ 323 (352)
..--.++. .++.+.+..|.+.+-++|.-+.
T Consensus 197 --~pVe~vl~lpsgs~iasAgGn~vkVWDl~~G 227 (487)
T KOG0310|consen 197 --CPVESVLALPSGSLIASAGGNSVKVWDLTTG 227 (487)
T ss_pred --CceeeEEEcCCCCEEEEcCCCeEEEEEecCC
Confidence 11112233 3445555444446667776643
No 93
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=95.99 E-value=1.2 Score=40.40 Aligned_cols=238 Identities=14% Similarity=0.202 Sum_probs=117.9
Q ss_pred CCEEEEEEecC-CCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEE---CCEEEEEeCCCCCCCCCCeEEEEEC
Q 045821 68 ETWIYAFCRDN-KLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL---GKNAYLLGGCGWSEDATSEVYCYDA 143 (352)
Q Consensus 68 ~~~l~~~gg~~-~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~---~~~iyv~GG~~~~~~~~~~~~~~d~ 143 (352)
++.||+..... .......|..++ .+++.+.+...+. .-..-+...+ +..+|+.-- ....+.+|+.
T Consensus 48 ~~~LY~~~e~~~~~g~v~~~~i~~--~~g~L~~~~~~~~---~g~~p~~i~~~~~g~~l~vany------~~g~v~v~~l 116 (345)
T PF10282_consen 48 GRRLYVVNEGSGDSGGVSSYRIDP--DTGTLTLLNSVPS---GGSSPCHIAVDPDGRFLYVANY------GGGSVSVFPL 116 (345)
T ss_dssp SSEEEEEETTSSTTTEEEEEEEET--TTTEEEEEEEEEE---SSSCEEEEEECTTSSEEEEEET------TTTEEEEEEE
T ss_pred CCEEEEEEccccCCCCEEEEEECC--CcceeEEeeeecc---CCCCcEEEEEecCCCEEEEEEc------cCCeEEEEEc
Confidence 78899886543 445667777777 7678877766541 1112222333 445666531 1356777777
Q ss_pred CCC-eeEecC---------CC---CCcceeeeeEEE-C-CEEEEEeCCCCCCceEEEeCCCCc--EEEecC--CCCcccc
Q 045821 144 SMN-TWTDAA---------PM---STARCYFPCGVL-N-QKIYCIGGLGDTHSWDVYDPRTNN--WKLHTE--PNIFTEI 204 (352)
Q Consensus 144 ~t~-~W~~~~---------~~---~~~r~~~~~~~~-~-~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~--~~~~~~~ 204 (352)
..+ +-.... +- ...-..|.+... + ..+|+..= .. ..+.+|+...+. ...... .+....-
T Consensus 117 ~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dl-G~-D~v~~~~~~~~~~~l~~~~~~~~~~G~GP 194 (345)
T PF10282_consen 117 DDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDL-GA-DRVYVYDIDDDTGKLTPVDSIKVPPGSGP 194 (345)
T ss_dssp CTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEET-TT-TEEEEEEE-TTS-TEEEEEEEECSTTSSE
T ss_pred cCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEec-CC-CEEEEEEEeCCCceEEEeeccccccCCCC
Confidence 653 222211 00 111223444444 3 46777531 11 678888887665 544221 2222222
Q ss_pred eeEEEE-C-CEEEEEccCCCCCCceEEEEEeCCCCceeecCC--CCCCCC----CCcEEE--ECC-EEEEEecCCCCeEE
Q 045821 205 EDSFVM-D-GKIYIRCSASAATSHVCALVYEPSTDSWLHADA--NMASGW----RGPAVV--VDD-ALYVLDQSSGTKLM 273 (352)
Q Consensus 205 ~~~~~~-~-g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~--~~~~~~----~~~~~~--~~~-~l~~~gg~~~~~v~ 273 (352)
.++++. + ..+|++... .....++.++..+..++.+.. ..+... ..+.+. -++ .||+.... .+.|.
T Consensus 195 Rh~~f~pdg~~~Yv~~e~---s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~-~~sI~ 270 (345)
T PF10282_consen 195 RHLAFSPDGKYAYVVNEL---SNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRG-SNSIS 270 (345)
T ss_dssp EEEEE-TTSSEEEEEETT---TTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECT-TTEEE
T ss_pred cEEEEcCCcCEEEEecCC---CCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEecc-CCEEE
Confidence 244444 3 478998532 222345556655777765532 111111 122232 245 45655433 34888
Q ss_pred EEeC--CCCceEEccCCCCC-cCCCCceEEEeCCEEEEEc-CCceEEEEEcCC
Q 045821 274 MWQK--ESREWSPVGRLSTL-LTRPPCKLVAIGKTIFVIG-KGCSAVVIDVGN 322 (352)
Q Consensus 274 ~yd~--~~~~W~~~~~~~~~-~~~~~~~~~~~~~~i~v~g-G~~~~~~~d~~~ 322 (352)
+|+. .+++.+.+...+.. ..-+.+.+..-++.||+.+ +.+.+.+|++..
T Consensus 271 vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~ 323 (345)
T PF10282_consen 271 VFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDIDP 323 (345)
T ss_dssp EEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEEET
T ss_pred EEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEEeC
Confidence 8876 45677766644431 1112233333566777776 344566787653
No 94
>PRK00178 tolB translocation protein TolB; Provisional
Probab=95.91 E-value=1.5 Score=41.01 Aligned_cols=146 Identities=8% Similarity=0.016 Sum_probs=81.4
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEEE-CC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG 212 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g 212 (352)
...++++|..+++-+.+...+..-. ..... ++ +|++....+....++++|..++..+++..... ........-+|
T Consensus 222 ~~~l~~~~l~~g~~~~l~~~~g~~~--~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~-~~~~~~~spDg 298 (430)
T PRK00178 222 RPRIFVQNLDTGRREQITNFEGLNG--APAWSPDGSKLAFVLSKDGNPEIYVMDLASRQLSRVTNHPA-IDTEPFWGKDG 298 (430)
T ss_pred CCEEEEEECCCCCEEEccCCCCCcC--CeEECCCCCEEEEEEccCCCceEEEEECCCCCeEEcccCCC-CcCCeEECCCC
Confidence 3579999999988777765432111 11111 33 45544332222689999999998887754221 11111223354
Q ss_pred -EEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEecCCC-CeEEEEeCCCCceEEccC
Q 045821 213 -KIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLDQSSG-TKLMMWQKESREWSPVGR 287 (352)
Q Consensus 213 -~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~gg~~~-~~v~~yd~~~~~W~~~~~ 287 (352)
+|+.. ... .....++.+|..+++++.+.... .........- ++.|++.....+ ..++++|..++..+.+..
T Consensus 299 ~~i~f~-s~~--~g~~~iy~~d~~~g~~~~lt~~~-~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~lt~ 372 (430)
T PRK00178 299 RTLYFT-SDR--GGKPQIYKVNVNGGRAERVTFVG-NYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRILTD 372 (430)
T ss_pred CEEEEE-ECC--CCCceEEEEECCCCCEEEeecCC-CCccceEECCCCCEEEEEEccCCceEEEEEECCCCCEEEccC
Confidence 45544 221 12236888999888887764311 1111122233 445555543322 468999999988887764
No 95
>PRK05137 tolB translocation protein TolB; Provisional
Probab=95.88 E-value=1.6 Score=40.99 Aligned_cols=188 Identities=13% Similarity=0.065 Sum_probs=97.5
Q ss_pred ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821 82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY 160 (352)
Q Consensus 82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~ 160 (352)
...++.+|. .++....+...+. ... ......++ .|++....+ ...+++.+|..+++.+.+...+.....
T Consensus 225 ~~~i~~~dl--~~g~~~~l~~~~g---~~~-~~~~SPDG~~la~~~~~~----g~~~Iy~~d~~~~~~~~Lt~~~~~~~~ 294 (435)
T PRK05137 225 RPRVYLLDL--ETGQRELVGNFPG---MTF-APRFSPDGRKVVMSLSQG----GNTDIYTMDLRSGTTTRLTDSPAIDTS 294 (435)
T ss_pred CCEEEEEEC--CCCcEEEeecCCC---ccc-CcEECCCCCEEEEEEecC----CCceEEEEECCCCceEEccCCCCccCc
Confidence 356788888 8877777665442 111 11222344 454443322 246799999998887777543321111
Q ss_pred eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCCc
Q 045821 161 FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTDS 238 (352)
Q Consensus 161 ~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~~ 238 (352)
. ...-++ .|+..........++++|..+...+++.... .........-+| .|++. ... .....++.+|+.+..
T Consensus 295 ~-~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~-~~~~~~~~SpdG~~ia~~-~~~--~~~~~i~~~d~~~~~ 369 (435)
T PRK05137 295 P-SYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFGG-GRYSTPVWSPRGDLIAFT-KQG--GGQFSIGVMKPDGSG 369 (435)
T ss_pred e-eEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecCC-CcccCeEECCCCCEEEEE-EcC--CCceEEEEEECCCCc
Confidence 1 111244 4444332222267899998888777765321 111112233355 44443 222 123468888987766
Q ss_pred eeecCCCCCCCCCCcEEEECCE-EEEEecCCC----CeEEEEeCCCCceEEcc
Q 045821 239 WLHADANMASGWRGPAVVVDDA-LYVLDQSSG----TKLMMWQKESREWSPVG 286 (352)
Q Consensus 239 W~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~----~~v~~yd~~~~~W~~~~ 286 (352)
...+.... ........-+|+ |++.....+ ..++.+|..+..-..+.
T Consensus 370 ~~~lt~~~--~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~~~~~l~ 420 (435)
T PRK05137 370 ERILTSGF--LVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGRNEREVP 420 (435)
T ss_pred eEeccCCC--CCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCCCceEEcc
Confidence 66554321 222334444554 444433222 37899998876655554
No 96
>smart00284 OLF Olfactomedin-like domains.
Probab=95.86 E-value=1 Score=38.56 Aligned_cols=193 Identities=13% Similarity=0.131 Sum_probs=105.7
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEEC----CCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEE
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDA----SMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKL 194 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~----~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~ 194 (352)
++++|++.+... ....+..|.- ..++....-.+|.+-.+-+.++++|.+|.--... ..+..||..+++-..
T Consensus 34 ~~~~wv~~~~~~---~~~~v~ey~~~~~f~~~~~~~~~~Lp~~~~GtG~VVYngslYY~~~~s--~~iiKydL~t~~v~~ 108 (255)
T smart00284 34 KSLYWYMPLNTR---VLRSVREYSSMSDFQMGKNPTDHPLPHAGQGTGVVVYNGSLYFNKFNS--HDICRFDLTTETYQK 108 (255)
T ss_pred CceEEEEccccC---CCcEEEEecCHHHHhccCCceEEECCCccccccEEEECceEEEEecCC--ccEEEEECCCCcEEE
Confidence 467787765421 1234445532 2333322223566666777888899999865433 789999999997643
Q ss_pred ecCCCCcccc------------eeEEEECCEEEEEccCCCCCCceEEEEEeCCC----CceeecCCCCCCCCCCcEEEEC
Q 045821 195 HTEPNIFTEI------------EDSFVMDGKIYIRCSASAATSHVCALVYEPST----DSWLHADANMASGWRGPAVVVD 258 (352)
Q Consensus 195 ~~~~~~~~~~------------~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~----~~W~~~~~~~~~~~~~~~~~~~ 258 (352)
...+|.+... --.++-++-|+++=..........+-..|+.+ .+|.. ..+.+..+.++++-
T Consensus 109 ~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~~g~ivvSkLnp~tL~ve~tW~T---~~~k~sa~naFmvC 185 (255)
T smart00284 109 EPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATEQNAGKIVISKLNPATLTIENTWIT---TYNKRSASNAFMIC 185 (255)
T ss_pred EEecCccccccccccccCCCccEEEEEcCCceEEEEeccCCCCCEEEEeeCcccceEEEEEEc---CCCcccccccEEEe
Confidence 3333332111 11233445566653333323334566777765 34654 23444455567777
Q ss_pred CEEEEEecCC--CCe-EEEEeCCCCceEEccCCCCCcCCCCceEEE---eCCEEEEEcCCceEEEEEcC
Q 045821 259 DALYVLDQSS--GTK-LMMWQKESREWSPVGRLSTLLTRPPCKLVA---IGKTIFVIGKGCSAVVIDVG 321 (352)
Q Consensus 259 ~~l~~~gg~~--~~~-v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~---~~~~i~v~gG~~~~~~~d~~ 321 (352)
|.||++.... ... .+.||+.+++=.. ..+|-.......+++. -+.+||+.--. ....||+.
T Consensus 186 GvLY~~~s~~~~~~~I~yayDt~t~~~~~-~~i~f~n~y~~~s~l~YNP~d~~LY~wdng-~~l~Y~v~ 252 (255)
T smart00284 186 GILYVTRSLGSKGEKVFYAYDTNTGKEGH-LDIPFENMYEYISMLDYNPNDRKLYAWNNG-HLVHYDIA 252 (255)
T ss_pred eEEEEEccCCCCCcEEEEEEECCCCccce-eeeeeccccccceeceeCCCCCeEEEEeCC-eEEEEEEE
Confidence 9999996422 234 4899999876332 2233221222333333 36789988643 35556653
No 97
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=95.84 E-value=1.4 Score=39.97 Aligned_cols=234 Identities=15% Similarity=0.194 Sum_probs=116.0
Q ss_pred EEecC--CCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEE--ECCCCeeE
Q 045821 74 FCRDN--KLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCY--DASMNTWT 149 (352)
Q Consensus 74 ~gg~~--~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~--d~~t~~W~ 149 (352)
+|+.. ..+....+.||. .++++..+...... ......+...-++.||+..... .....+..| +..+++.+
T Consensus 4 vgsy~~~~~~gI~~~~~d~--~~g~l~~~~~~~~~-~~Ps~l~~~~~~~~LY~~~e~~---~~~g~v~~~~i~~~~g~L~ 77 (345)
T PF10282_consen 4 VGSYTNGKGGGIYVFRFDE--ETGTLTLVQTVAEG-ENPSWLAVSPDGRRLYVVNEGS---GDSGGVSSYRIDPDTGTLT 77 (345)
T ss_dssp EEECCSSSSTEEEEEEEET--TTTEEEEEEEEEES-SSECCEEE-TTSSEEEEEETTS---STTTEEEEEEEETTTTEEE
T ss_pred EEcCCCCCCCcEEEEEEcC--CCCCceEeeeecCC-CCCceEEEEeCCCEEEEEEccc---cCCCCEEEEEECCCcceeE
Confidence 45443 234567788888 99999877753311 1112222222356788886432 123445444 45556777
Q ss_pred ecCCCCC-cceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCC-cEEEec----------CC-CCcccceeEEEE--C-
Q 045821 150 DAAPMST-ARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTN-NWKLHT----------EP-NIFTEIEDSFVM--D- 211 (352)
Q Consensus 150 ~~~~~~~-~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~-~W~~~~----------~~-~~~~~~~~~~~~--~- 211 (352)
.+...+. +...+.++.- +..+|+.- +.. ..+.+|+...+ .=.... .+ .......+.+.+ +
T Consensus 78 ~~~~~~~~g~~p~~i~~~~~g~~l~van-y~~-g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg 155 (345)
T PF10282_consen 78 LLNSVPSGGSSPCHIAVDPDGRFLYVAN-YGG-GSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDG 155 (345)
T ss_dssp EEEEEEESSSCEEEEEECTTSSEEEEEE-TTT-TEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTS
T ss_pred EeeeeccCCCCcEEEEEecCCCEEEEEE-ccC-CeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCC
Confidence 7766553 2222222222 34566653 221 67888888764 211111 00 111222233333 3
Q ss_pred CEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCC---cEEEE-CCEEEEEecCCCCeEEEEeCC--CCceE
Q 045821 212 GKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRG---PAVVV-DDALYVLDQSSGTKLMMWQKE--SREWS 283 (352)
Q Consensus 212 g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~---~~~~~-~~~l~~~gg~~~~~v~~yd~~--~~~W~ 283 (352)
..+|+.. .....|+.|+...+. .............+ .++.- +..+|++....+ .|.+|+.. +..++
T Consensus 156 ~~v~v~d-----lG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~-~v~v~~~~~~~g~~~ 229 (345)
T PF10282_consen 156 RFVYVPD-----LGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSN-TVSVFDYDPSDGSLT 229 (345)
T ss_dssp SEEEEEE-----TTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTT-EEEEEEEETTTTEEE
T ss_pred CEEEEEe-----cCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCC-cEEEEeecccCCcee
Confidence 3577763 122367777776554 54422111111111 12222 346888877655 77666555 66776
Q ss_pred Ecc---CCCCCcCC--CCceEEEe--CCEEEEEc-CCceEEEEEcC
Q 045821 284 PVG---RLSTLLTR--PPCKLVAI--GKTIFVIG-KGCSAVVIDVG 321 (352)
Q Consensus 284 ~~~---~~~~~~~~--~~~~~~~~--~~~i~v~g-G~~~~~~~d~~ 321 (352)
.+. .+|..... ....+..- +..||+-. |.+.+.+|++.
T Consensus 230 ~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d 275 (345)
T PF10282_consen 230 EIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLD 275 (345)
T ss_dssp EEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEEC
T ss_pred EEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEe
Confidence 654 44432211 22333333 55688865 56667789984
No 98
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=95.78 E-value=1.6 Score=40.43 Aligned_cols=182 Identities=10% Similarity=-0.007 Sum_probs=93.1
Q ss_pred ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821 82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY 160 (352)
Q Consensus 82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~ 160 (352)
...++++|. .++....+..... .... .....++ .+++..... ...+++.+|..++..+.+.........
T Consensus 213 ~~~i~v~d~--~~g~~~~~~~~~~---~~~~-~~~spDg~~l~~~~~~~----~~~~i~~~d~~~~~~~~l~~~~~~~~~ 282 (417)
T TIGR02800 213 KPEIYVQDL--ATGQREKVASFPG---MNGA-PAFSPDGSKLAVSLSKD----GNPDIYVMDLDGKQLTRLTNGPGIDTE 282 (417)
T ss_pred CcEEEEEEC--CCCCEEEeecCCC---Cccc-eEECCCCCEEEEEECCC----CCccEEEEECCCCCEEECCCCCCCCCC
Confidence 356788888 7776665554431 1111 1222244 465543322 236789999998877776543321111
Q ss_pred eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCce
Q 045821 161 FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSW 239 (352)
Q Consensus 161 ~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W 239 (352)
.. ..-++ +|++.........++.+|..+..++++.... .........-+|+.+++..... ....++.+|+.+..+
T Consensus 283 ~~-~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~~~-~~~~~~~~spdg~~i~~~~~~~--~~~~i~~~d~~~~~~ 358 (417)
T TIGR02800 283 PS-WSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTFRG-GYNASPSWSPDGDLIAFVHREG--GGFNIAVMDLDGGGE 358 (417)
T ss_pred EE-ECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCC-CCccCeEECCCCCEEEEEEccC--CceEEEEEeCCCCCe
Confidence 11 11144 4554443322267899999988887765221 1111122333566555544332 234789999988777
Q ss_pred eecCCCCCCCCCCcEEEECCE-EEEEecCCC-CeEEEEeCCC
Q 045821 240 LHADANMASGWRGPAVVVDDA-LYVLDQSSG-TKLMMWQKES 279 (352)
Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~-~~v~~yd~~~ 279 (352)
+.+.... .........+++ |++.....+ ..+++++.+.
T Consensus 359 ~~l~~~~--~~~~p~~spdg~~l~~~~~~~~~~~l~~~~~~g 398 (417)
T TIGR02800 359 RVLTDTG--LDESPSFAPNGRMILYATTRGGRGVLGLVSTDG 398 (417)
T ss_pred EEccCCC--CCCCceECCCCCEEEEEEeCCCcEEEEEEECCC
Confidence 7665321 112233444444 444433322 3456666544
No 99
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=95.73 E-value=1.1 Score=38.30 Aligned_cols=218 Identities=12% Similarity=0.046 Sum_probs=120.5
Q ss_pred EEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCC-CCCcceeeeeE
Q 045821 86 YVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAP-MSTARCYFPCG 164 (352)
Q Consensus 86 ~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~-~~~~r~~~~~~ 164 (352)
=.+|| .+++-...+--. ..+.+..+.--++..++..+ ...+-++|+.+..-++.+- ...+-.....+
T Consensus 86 GhLdP--~tGev~~ypLg~---Ga~Phgiv~gpdg~~Witd~-------~~aI~R~dpkt~evt~f~lp~~~a~~nlet~ 153 (353)
T COG4257 86 GHLDP--ATGEVETYPLGS---GASPHGIVVGPDGSAWITDT-------GLAIGRLDPKTLEVTRFPLPLEHADANLETA 153 (353)
T ss_pred eecCC--CCCceEEEecCC---CCCCceEEECCCCCeeEecC-------cceeEEecCcccceEEeecccccCCCcccce
Confidence 35688 888887776433 12222222222455665532 2367889998887766642 12233445556
Q ss_pred EEC--CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeec
Q 045821 165 VLN--QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHA 242 (352)
Q Consensus 165 ~~~--~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~ 242 (352)
++| |.++..|-... --+.||.++.-+..+.+....-+..++.-+|.+|+.- . .-+.|-..|+.+..=+.+
T Consensus 154 vfD~~G~lWFt~q~G~---yGrLdPa~~~i~vfpaPqG~gpyGi~atpdGsvwyas-l----agnaiaridp~~~~aev~ 225 (353)
T COG4257 154 VFDPWGNLWFTGQIGA---YGRLDPARNVISVFPAPQGGGPYGICATPDGSVWYAS-L----AGNAIARIDPFAGHAEVV 225 (353)
T ss_pred eeCCCccEEEeecccc---ceecCcccCceeeeccCCCCCCcceEECCCCcEEEEe-c----cccceEEcccccCCccee
Confidence 664 56666654221 1267888776665554433333444566688888872 1 123577788887765555
Q ss_pred CCCCC--CCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEE-cCCceEEEEE
Q 045821 243 DANMA--SGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVI-GKGCSAVVID 319 (352)
Q Consensus 243 ~~~~~--~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~-gG~~~~~~~d 319 (352)
+.+-. .........-.+.+.+..-.. ..+++|||.+.+|.+-+ ||....+...-.+--.+.+++- -+.....-||
T Consensus 226 p~P~~~~~gsRriwsdpig~~wittwg~-g~l~rfdPs~~sW~eyp-LPgs~arpys~rVD~~grVW~sea~agai~rfd 303 (353)
T COG4257 226 PQPNALKAGSRRIWSDPIGRAWITTWGT-GSLHRFDPSVTSWIEYP-LPGSKARPYSMRVDRHGRVWLSEADAGAIGRFD 303 (353)
T ss_pred cCCCcccccccccccCccCcEEEeccCC-ceeeEeCcccccceeee-CCCCCCCcceeeeccCCcEEeeccccCceeecC
Confidence 44222 111122233456777663222 38999999999999876 4443334444344445666652 2232333477
Q ss_pred cCCccc
Q 045821 320 VGNIGN 325 (352)
Q Consensus 320 ~~~~~w 325 (352)
+++.+.
T Consensus 304 peta~f 309 (353)
T COG4257 304 PETARF 309 (353)
T ss_pred cccceE
Confidence 776554
No 100
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.73 E-value=1.3 Score=40.70 Aligned_cols=223 Identities=13% Similarity=0.090 Sum_probs=107.6
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCC-eeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRS-WKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMN 146 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~-w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~ 146 (352)
++.|++.|+. .+. ..+||. .+.. ...+...+ .+-+.--....++.+++.|+-+ ..+..+|..+.
T Consensus 79 DG~LlaaGD~--sG~--V~vfD~--k~r~iLR~~~ah~---apv~~~~f~~~d~t~l~s~sDd------~v~k~~d~s~a 143 (487)
T KOG0310|consen 79 DGRLLAAGDE--SGH--VKVFDM--KSRVILRQLYAHQ---APVHVTKFSPQDNTMLVSGSDD------KVVKYWDLSTA 143 (487)
T ss_pred CCeEEEccCC--cCc--EEEecc--ccHHHHHHHhhcc---CceeEEEecccCCeEEEecCCC------ceEEEEEcCCc
Confidence 7888888753 333 344565 4421 11121111 1222222344578888887622 23344555555
Q ss_pred eeE-ecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCC
Q 045821 147 TWT-DAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAAT 224 (352)
Q Consensus 147 ~W~-~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~ 224 (352)
.-. .+..-...-...+....++.|.+.|||+ ..+-.||..+.+ ..+-.+.....-..++.+ .|.+.+..|++
T Consensus 144 ~v~~~l~~htDYVR~g~~~~~~~hivvtGsYD--g~vrl~DtR~~~-~~v~elnhg~pVe~vl~lpsgs~iasAgGn--- 217 (487)
T KOG0310|consen 144 YVQAELSGHTDYVRCGDISPANDHIVVTGSYD--GKVRLWDTRSLT-SRVVELNHGCPVESVLALPSGSLIASAGGN--- 217 (487)
T ss_pred EEEEEecCCcceeEeeccccCCCeEEEecCCC--ceEEEEEeccCC-ceeEEecCCCceeeEEEcCCCCEEEEcCCC---
Confidence 421 1111111111223333478899999998 678889988774 223223322222233444 33555554444
Q ss_pred CceEEEEEeCCCCceeecCCCCCCCCCCcE--EEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEe
Q 045821 225 SHVCALVYEPSTDSWLHADANMASGWRGPA--VVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI 302 (352)
Q Consensus 225 ~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~--~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~ 302 (352)
.+-++|..++.=. +.....+...-.. ..-++.=++-||.++ .+-+|| +..|..+-.+..+-+-- ...+..
T Consensus 218 ---~vkVWDl~~G~ql-l~~~~~H~KtVTcL~l~s~~~rLlS~sLD~-~VKVfd--~t~~Kvv~s~~~~~pvL-siavs~ 289 (487)
T KOG0310|consen 218 ---SVKVWDLTTGGQL-LTSMFNHNKTVTCLRLASDSTRLLSGSLDR-HVKVFD--TTNYKVVHSWKYPGPVL-SIAVSP 289 (487)
T ss_pred ---eEEEEEecCCcee-hhhhhcccceEEEEEeecCCceEeeccccc-ceEEEE--ccceEEEEeeeccccee-eEEecC
Confidence 3667776533211 1111111100001 112445566677766 899999 44566665432211111 122234
Q ss_pred CCEEEEEcCCceEEEEE
Q 045821 303 GKTIFVIGKGCSAVVID 319 (352)
Q Consensus 303 ~~~i~v~gG~~~~~~~d 319 (352)
++.-.++|..+....+.
T Consensus 290 dd~t~viGmsnGlv~~r 306 (487)
T KOG0310|consen 290 DDQTVVIGMSNGLVSIR 306 (487)
T ss_pred CCceEEEecccceeeee
Confidence 77888888877666554
No 101
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=95.70 E-value=1.8 Score=40.23 Aligned_cols=146 Identities=13% Similarity=0.029 Sum_probs=79.5
Q ss_pred CeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE-
Q 045821 136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK- 213 (352)
Q Consensus 136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~- 213 (352)
..++++|..+++-..+........... ..-++ .+++.........++.+|..++..+.+....... ......-+|+
T Consensus 214 ~~i~v~d~~~g~~~~~~~~~~~~~~~~-~spDg~~l~~~~~~~~~~~i~~~d~~~~~~~~l~~~~~~~-~~~~~s~dg~~ 291 (417)
T TIGR02800 214 PEIYVQDLATGQREKVASFPGMNGAPA-FSPDGSKLAVSLSKDGNPDIYVMDLDGKQLTRLTNGPGID-TEPSWSPDGKS 291 (417)
T ss_pred cEEEEEECCCCCEEEeecCCCCccceE-ECCCCCEEEEEECCCCCccEEEEECCCCCEEECCCCCCCC-CCEEECCCCCE
Confidence 678999999887666554332222111 11244 5665544333367999999988877765322111 1112223554
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC--CeEEEEeCCCCceEEccC
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG--TKLMMWQKESREWSPVGR 287 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~--~~v~~yd~~~~~W~~~~~ 287 (352)
|+.. ... .....++.+|..+..+..+.... .........-+++.+++....+ ..++.+|+.++.+..+..
T Consensus 292 l~~~-s~~--~g~~~iy~~d~~~~~~~~l~~~~-~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~~~l~~ 363 (417)
T TIGR02800 292 IAFT-SDR--GGSPQIYMMDADGGEVRRLTFRG-GYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGERVLTD 363 (417)
T ss_pred EEEE-ECC--CCCceEEEEECCCCCEEEeecCC-CCccCeEECCCCCEEEEEEccCCceEEEEEeCCCCCeEEccC
Confidence 4443 222 11236888999888887664321 1111223334555444443322 479999999877776653
No 102
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=95.62 E-value=0.83 Score=40.88 Aligned_cols=133 Identities=13% Similarity=0.169 Sum_probs=81.6
Q ss_pred CeEEEEECCCC-----eeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc-EEEecCCCCcccceeEEE
Q 045821 136 SEVYCYDASMN-----TWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN-WKLHTEPNIFTEIEDSFV 209 (352)
Q Consensus 136 ~~~~~~d~~t~-----~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~-W~~~~~~~~~~~~~~~~~ 209 (352)
..+.+|+.... +.+.+.....+-.-.+.+.+++++.+..| +.+.+|+...+. +.+.+............+
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~~~~lv~~~g----~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~ 137 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICSFNGRLVVAVG----NKLYVYDLDNSKTLLKKAFYDSPFYITSLSV 137 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEETTEEEEEET----TEEEEEEEETTSSEEEEEEE-BSSSEEEEEE
T ss_pred cEEEEEEEEcccccceEEEEEEEEeecCcceEhhhhCCEEEEeec----CEEEEEEccCcccchhhheecceEEEEEEec
Confidence 67889998884 56666544444445677778999888777 789999999888 887775444444445667
Q ss_pred ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEecCCCCeE--EEEeC
Q 045821 210 MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLDQSSGTKL--MMWQK 277 (352)
Q Consensus 210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~gg~~~~~v--~~yd~ 277 (352)
.++.|++..-. ....+..|+....+-..+.......+..++..+ ++..++.+...+ .+ ..|++
T Consensus 138 ~~~~I~vgD~~----~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~g-nl~~l~~~~ 203 (321)
T PF03178_consen 138 FKNYILVGDAM----KSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKDG-NLFVLRYNP 203 (321)
T ss_dssp ETTEEEEEESS----SSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETTS-EEEEEEE-S
T ss_pred cccEEEEEEcc----cCEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCCC-eEEEEEECC
Confidence 78877764222 223566788867777777654444443344444 555444444444 54 45554
No 103
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=95.62 E-value=1.4 Score=42.32 Aligned_cols=115 Identities=12% Similarity=0.193 Sum_probs=68.4
Q ss_pred eeeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEecCCCCcc--------cceeEEEECCEEEEEccCCCCCCceEEE
Q 045821 161 FPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHTEPNIFT--------EIEDSFVMDGKIYIRCSASAATSHVCAL 230 (352)
Q Consensus 161 ~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~~~~~~--------~~~~~~~~~g~iyv~GG~~~~~~~~~i~ 230 (352)
...++.++.||+.... ..++.+|..|. .|+.-...+... .....++.+++||+.. .. ..+.
T Consensus 63 stPvv~~g~vyv~s~~---g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t-~d-----g~l~ 133 (527)
T TIGR03075 63 SQPLVVDGVMYVTTSY---SRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGT-LD-----ARLV 133 (527)
T ss_pred cCCEEECCEEEEECCC---CcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEc-CC-----CEEE
Confidence 3456679999996653 46899999876 487643222111 1123456788888753 22 2589
Q ss_pred EEeCCCCc--eeecCCCCCC--CCCCcEEEECCEEEEEec-----CCCCeEEEEeCCCCc--eEEc
Q 045821 231 VYEPSTDS--WLHADANMAS--GWRGPAVVVDDALYVLDQ-----SSGTKLMMWQKESRE--WSPV 285 (352)
Q Consensus 231 ~yd~~~~~--W~~~~~~~~~--~~~~~~~~~~~~l~~~gg-----~~~~~v~~yd~~~~~--W~~~ 285 (352)
++|..+++ |+.-...... ....+.++.++.+|+-.. .. ..|..||.++++ |+.-
T Consensus 134 ALDa~TGk~~W~~~~~~~~~~~~~tssP~v~~g~Vivg~~~~~~~~~-G~v~AlD~~TG~~lW~~~ 198 (527)
T TIGR03075 134 ALDAKTGKVVWSKKNGDYKAGYTITAAPLVVKGKVITGISGGEFGVR-GYVTAYDAKTGKLVWRRY 198 (527)
T ss_pred EEECCCCCEEeecccccccccccccCCcEEECCEEEEeecccccCCC-cEEEEEECCCCceeEecc
Confidence 99987764 8754221211 122233567888776532 12 379999998865 7533
No 104
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=95.51 E-value=2.4 Score=40.46 Aligned_cols=116 Identities=15% Similarity=0.203 Sum_probs=65.7
Q ss_pred EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCC----Ccc-eeeeeEEEC-CEEEEEeCCCCCCceEEEe
Q 045821 115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMS----TAR-CYFPCGVLN-QKIYCIGGLGDTHSWDVYD 186 (352)
Q Consensus 115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~----~~r-~~~~~~~~~-~~iyv~gG~~~~~~v~~yd 186 (352)
-++.++.+|+... ...++.+|..+++ |+.-...+ .+. .....+..+ ++||+.... ..+..+|
T Consensus 57 Pvv~~g~vy~~~~-------~g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~v~~~~---g~v~AlD 126 (488)
T cd00216 57 PLVVDGDMYFTTS-------HSALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVFFGTFD---GRLVALD 126 (488)
T ss_pred CEEECCEEEEeCC-------CCcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEEEecCC---CeEEEEE
Confidence 3566899998643 2568899998775 87543222 111 111234446 888875432 5789999
Q ss_pred CCCC--cEEEecCCCC-cc--cceeEEEECCEEEEEccCCCC----CCceEEEEEeCCCC--ceee
Q 045821 187 PRTN--NWKLHTEPNI-FT--EIEDSFVMDGKIYIRCSASAA----TSHVCALVYEPSTD--SWLH 241 (352)
Q Consensus 187 ~~t~--~W~~~~~~~~-~~--~~~~~~~~~g~iyv~GG~~~~----~~~~~i~~yd~~~~--~W~~ 241 (352)
..|. .|+.-..... .. .....++.++.+|+- ..... .....++++|..++ .|+.
T Consensus 127 ~~TG~~~W~~~~~~~~~~~~~i~ssP~v~~~~v~vg-~~~~~~~~~~~~g~v~alD~~TG~~~W~~ 191 (488)
T cd00216 127 AETGKQVWKFGNNDQVPPGYTMTGAPTIVKKLVIIG-SSGAEFFACGVRGALRAYDVETGKLLWRF 191 (488)
T ss_pred CCCCCEeeeecCCCCcCcceEecCCCEEECCEEEEe-ccccccccCCCCcEEEEEECCCCceeeEe
Confidence 9876 4876432221 01 122335566776653 22111 11236889998765 4875
No 105
>PRK04922 tolB translocation protein TolB; Provisional
Probab=95.49 E-value=2.2 Score=39.97 Aligned_cols=187 Identities=15% Similarity=0.059 Sum_probs=96.1
Q ss_pred ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821 82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY 160 (352)
Q Consensus 82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~ 160 (352)
...++.+|. .++.-..+...+. .........++ +|++....+ ...+++++|..+++.+.+.........
T Consensus 227 ~~~l~~~dl--~~g~~~~l~~~~g----~~~~~~~SpDG~~l~~~~s~~----g~~~Iy~~d~~~g~~~~lt~~~~~~~~ 296 (433)
T PRK04922 227 RSAIYVQDL--ATGQRELVASFRG----INGAPSFSPDGRRLALTLSRD----GNPEIYVMDLGSRQLTRLTNHFGIDTE 296 (433)
T ss_pred CcEEEEEEC--CCCCEEEeccCCC----CccCceECCCCCEEEEEEeCC----CCceEEEEECCCCCeEECccCCCCccc
Confidence 456778888 7777666655442 11111222234 455443222 236799999998887666543211111
Q ss_pred eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCCc
Q 045821 161 FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTDS 238 (352)
Q Consensus 161 ~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~~ 238 (352)
.+ ..-++ .|+..........++.+|..++..+++.... .........-+| .|++..+ . .....++.+|+.++.
T Consensus 297 ~~-~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~~~-~--~~~~~I~v~d~~~g~ 371 (433)
T PRK04922 297 PT-WAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQG-NYNARASVSPDGKKIAMVHG-S--GGQYRIAVMDLSTGS 371 (433)
T ss_pred eE-ECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEEC-C--CCceeEEEEECCCCC
Confidence 11 11144 4544433222257888999888887775211 111112233355 4544422 2 222378899998888
Q ss_pred eeecCCCCCCCCCCcEEEECCEEEEEec-CCC-CeEEEEeCCCCceEEc
Q 045821 239 WLHADANMASGWRGPAVVVDDALYVLDQ-SSG-TKLMMWQKESREWSPV 285 (352)
Q Consensus 239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg-~~~-~~v~~yd~~~~~W~~~ 285 (352)
.+.+.... .........+++.+++.. ..+ ..++.+|.+...=..+
T Consensus 372 ~~~Lt~~~--~~~~p~~spdG~~i~~~s~~~g~~~L~~~~~~g~~~~~l 418 (433)
T PRK04922 372 VRTLTPGS--LDESPSFAPNGSMVLYATREGGRGVLAAVSTDGRVRQRL 418 (433)
T ss_pred eEECCCCC--CCCCceECCCCCEEEEEEecCCceEEEEEECCCCceEEc
Confidence 88775421 112234444555444433 222 4688888865443334
No 106
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=95.47 E-value=1.5 Score=37.70 Aligned_cols=191 Identities=16% Similarity=0.149 Sum_probs=106.7
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECC-----CCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEE
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDAS-----MNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWK 193 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~-----t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~ 193 (352)
.+++|++.+..+. .+..|.-. .++....-.+|.+-.+-+-++++|.+|.--... ..+..||+.++.-.
T Consensus 30 ~~~iy~~~~~~~~-----~v~ey~~~~~f~~~~~~~~~~~Lp~~~~GtG~vVYngslYY~~~~s--~~IvkydL~t~~v~ 102 (250)
T PF02191_consen 30 SEKIYVTSGFSGN-----TVYEYRNYEDFLRNGRSSRTYKLPYPWQGTGHVVYNGSLYYNKYNS--RNIVKYDLTTRSVV 102 (250)
T ss_pred CCCEEEECccCCC-----EEEEEcCHhHHhhcCCCceEEEEeceeccCCeEEECCcEEEEecCC--ceEEEEECcCCcEE
Confidence 4567888774332 44444322 222222223455555666777888887765422 78999999998644
Q ss_pred EecCCCCcccc------------eeEEEECCEEEEEccCCCCCCceEEEEEeCCC----CceeecCCCCCCCCCCcEEEE
Q 045821 194 LHTEPNIFTEI------------EDSFVMDGKIYIRCSASAATSHVCALVYEPST----DSWLHADANMASGWRGPAVVV 257 (352)
Q Consensus 194 ~~~~~~~~~~~------------~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~----~~W~~~~~~~~~~~~~~~~~~ 257 (352)
.-..+|.+... --.++-++-|+|+-..........+-..|+.+ .+|.. ....+..+.++++
T Consensus 103 ~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g~ivvskld~~tL~v~~tw~T---~~~k~~~~naFmv 179 (250)
T PF02191_consen 103 ARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNGNIVVSKLDPETLSVEQTWNT---SYPKRSAGNAFMV 179 (250)
T ss_pred EEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCCcEEEEeeCcccCceEEEEEe---ccCchhhcceeeE
Confidence 11112222211 11234455677764444333334666777754 45653 2344555566777
Q ss_pred CCEEEEEecCCC--CeE-EEEeCCCCceEEccCCCCCcCCCCceEEE---eCCEEEEEcCCceEEEEEcC
Q 045821 258 DDALYVLDQSSG--TKL-MMWQKESREWSPVGRLSTLLTRPPCKLVA---IGKTIFVIGKGCSAVVIDVG 321 (352)
Q Consensus 258 ~~~l~~~gg~~~--~~v-~~yd~~~~~W~~~~~~~~~~~~~~~~~~~---~~~~i~v~gG~~~~~~~d~~ 321 (352)
-|.||++..... ..| +.||..+++=..+. ++-........++. .+.+||+.-.. ..+.|++.
T Consensus 180 CGvLY~~~s~~~~~~~I~yafDt~t~~~~~~~-i~f~~~~~~~~~l~YNP~dk~LY~wd~G-~~v~Y~v~ 247 (250)
T PF02191_consen 180 CGVLYATDSYDTRDTEIFYAFDTYTGKEEDVS-IPFPNPYGNISMLSYNPRDKKLYAWDNG-YQVTYDVR 247 (250)
T ss_pred eeEEEEEEECCCCCcEEEEEEECCCCceecee-eeeccccCceEeeeECCCCCeEEEEECC-eEEEEEEE
Confidence 899999987653 234 89999987765443 32211222333443 36789998644 35556653
No 107
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=95.40 E-value=1.5 Score=37.55 Aligned_cols=184 Identities=13% Similarity=0.112 Sum_probs=99.5
Q ss_pred eEEEecCCCCCCCeeEeccCCCcccCCcceEEEEEC--CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcce-e
Q 045821 84 CCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLG--KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARC-Y 160 (352)
Q Consensus 84 ~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~--~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~-~ 160 (352)
.+-.+|+ ++...++.+-... ....+.-.++++ +.++..|-.... =..||.++.-+..+.. .... +
T Consensus 125 aI~R~dp--kt~evt~f~lp~~--~a~~nlet~vfD~~G~lWFt~q~G~y-------GrLdPa~~~i~vfpaP-qG~gpy 192 (353)
T COG4257 125 AIGRLDP--KTLEVTRFPLPLE--HADANLETAVFDPWGNLWFTGQIGAY-------GRLDPARNVISVFPAP-QGGGPY 192 (353)
T ss_pred eeEEecC--cccceEEeecccc--cCCCcccceeeCCCccEEEeeccccc-------eecCcccCceeeeccC-CCCCCc
Confidence 3445666 6666655542211 233344445553 456666532111 1567777765554432 2222 2
Q ss_pred eeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce--eEEEECCEEEEEccCCCCCCceEEEEEeCCCCc
Q 045821 161 FPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE--DSFVMDGKIYIRCSASAATSHVCALVYEPSTDS 238 (352)
Q Consensus 161 ~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~--~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~ 238 (352)
-.++.-+|.+|+..=.. +-+-..|+.+..=+.++.+....... ..+--.|++++.. ...-.+.+||+++.+
T Consensus 193 Gi~atpdGsvwyaslag--naiaridp~~~~aev~p~P~~~~~gsRriwsdpig~~witt-----wg~g~l~rfdPs~~s 265 (353)
T COG4257 193 GICATPDGSVWYASLAG--NAIARIDPFAGHAEVVPQPNALKAGSRRIWSDPIGRAWITT-----WGTGSLHRFDPSVTS 265 (353)
T ss_pred ceEECCCCcEEEEeccc--cceEEcccccCCcceecCCCcccccccccccCccCcEEEec-----cCCceeeEeCccccc
Confidence 22333478888762111 66778888887655565433312111 1233357777762 112268999999999
Q ss_pred eeecCCCCCCCCCCcE-EEECCEEEEEecCCCCeEEEEeCCCCceEEccC
Q 045821 239 WLHADANMASGWRGPA-VVVDDALYVLDQSSGTKLMMWQKESREWSPVGR 287 (352)
Q Consensus 239 W~~~~~~~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~ 287 (352)
|.+-+-+-...+-... |-..+++++-. ...+.+.+||+++.+++.++.
T Consensus 266 W~eypLPgs~arpys~rVD~~grVW~se-a~agai~rfdpeta~ftv~p~ 314 (353)
T COG4257 266 WIEYPLPGSKARPYSMRVDRHGRVWLSE-ADAGAIGRFDPETARFTVLPI 314 (353)
T ss_pred ceeeeCCCCCCCcceeeeccCCcEEeec-cccCceeecCcccceEEEecC
Confidence 9987543222222222 33345555533 222389999999999998863
No 108
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=95.31 E-value=1.3 Score=36.34 Aligned_cols=145 Identities=12% Similarity=0.116 Sum_probs=78.3
Q ss_pred eeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEecC-CCC-cccceeEEEE--CCEEEEEccCCCCCCceEEEEEeCC
Q 045821 162 PCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHTE-PNI-FTEIEDSFVM--DGKIYIRCSASAATSHVCALVYEPS 235 (352)
Q Consensus 162 ~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~-~~~-~~~~~~~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~ 235 (352)
+++..++++|++-| ..+++++.... .-+.|.. .+. +..-.++... +|++|++-|.. .|+|+..
T Consensus 11 A~~~~~g~~y~FkG----~~~w~~~~~~~~~~p~~I~~~w~~~p~~IDAa~~~~~~~~~yfFkg~~-------yw~~~~~ 79 (194)
T cd00094 11 AVTTLRGELYFFKG----RYFWRLSPGKPPGSPFLISSFWPSLPSPVDAAFERPDTGKIYFFKGDK-------YWVYTGK 79 (194)
T ss_pred eEEEeCCEEEEEeC----CEEEEEeCCCCCCCCeEhhhhCCCCCCCccEEEEECCCCEEEEECCCE-------EEEEcCc
Confidence 34445689999988 67888876522 1222222 111 1122233333 38999995543 7888866
Q ss_pred CCcee---ecCC-CCCC--CCCCcEEEE--CCEEEEEecCCCCeEEEEeCCCCceEE-----c----cCCCCCcCCCCce
Q 045821 236 TDSWL---HADA-NMAS--GWRGPAVVV--DDALYVLDQSSGTKLMMWQKESREWSP-----V----GRLSTLLTRPPCK 298 (352)
Q Consensus 236 ~~~W~---~~~~-~~~~--~~~~~~~~~--~~~l~~~gg~~~~~v~~yd~~~~~W~~-----~----~~~~~~~~~~~~~ 298 (352)
+..+. .+.. ..+. ..--++... ++++|++.|. ..++||...++... + ..+|. .-.++
T Consensus 80 ~~~~~~Pk~i~~~~~~~~~~~iDAA~~~~~~~~~yfFkg~---~y~ry~~~~~~v~~~yP~~i~~~w~g~p~---~idaa 153 (194)
T cd00094 80 NLEPGYPKPISDLGFPPTVKQIDAALRWPDNGKTYFFKGD---KYWRYDEKTQKMDPGYPKLIETDFPGVPD---KVDAA 153 (194)
T ss_pred ccccCCCcchhhcCCCCCCCCccEEEEEcCCCEEEEEeCC---EEEEEeCCCccccCCCCcchhhcCCCcCC---Cccee
Confidence 42221 1111 0010 111123333 6899999886 89999987655421 1 11221 12233
Q ss_pred EEEeCCEEEEEcCCceEEEEEcCCcc
Q 045821 299 LVAIGKTIFVIGKGCSAVVIDVGNIG 324 (352)
Q Consensus 299 ~~~~~~~i~v~gG~~~~~~~d~~~~~ 324 (352)
+...++++|+|-|. ..+.||..+.+
T Consensus 154 ~~~~~~~~yfF~g~-~y~~~d~~~~~ 178 (194)
T cd00094 154 FRWLDGYYYFFKGD-QYWRFDPRSKE 178 (194)
T ss_pred EEeCCCcEEEEECC-EEEEEeCccce
Confidence 33344899999876 56678887654
No 109
>PF08268 FBA_3: F-box associated domain; InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=95.22 E-value=0.69 Score=35.18 Aligned_cols=79 Identities=13% Similarity=0.187 Sum_probs=54.3
Q ss_pred EEECCEEEEEeCCCC--CCceEEEeCCCCcEEEecCC---CCcccceeEEEECCEEEEEccCCCCC-CceEEEEE-eCCC
Q 045821 164 GVLNQKIYCIGGLGD--THSWDVYDPRTNNWKLHTEP---NIFTEIEDSFVMDGKIYIRCSASAAT-SHVCALVY-EPST 236 (352)
Q Consensus 164 ~~~~~~iyv~gG~~~--~~~v~~yd~~t~~W~~~~~~---~~~~~~~~~~~~~g~iyv~GG~~~~~-~~~~i~~y-d~~~ 236 (352)
+.+||-+|....... ...+..||..+++|+.+..| .........+.++|+|-++.-..... ....+|+. |..+
T Consensus 2 icinGvly~~a~~~~~~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~~~~~~iWvLeD~~k 81 (129)
T PF08268_consen 2 ICINGVLYWLAWSEDSDNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEYKGKLALVSYNDQGEPDSIDIWVLEDYEK 81 (129)
T ss_pred EEECcEEEeEEEECCCCCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEeCCeEEEEEecCCCCcceEEEEEeecccc
Confidence 457888888876522 28889999999999998754 12233345678899998884333222 23578877 4567
Q ss_pred Cceeec
Q 045821 237 DSWLHA 242 (352)
Q Consensus 237 ~~W~~~ 242 (352)
.+|.+.
T Consensus 82 ~~Wsk~ 87 (129)
T PF08268_consen 82 QEWSKK 87 (129)
T ss_pred ceEEEE
Confidence 789866
No 110
>PF08268 FBA_3: F-box associated domain; InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=95.20 E-value=0.42 Score=36.36 Aligned_cols=79 Identities=10% Similarity=0.181 Sum_probs=53.6
Q ss_pred EEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC--CCCCC-CCcEEEECCEEEEEecCCC-----CeEEEE-eCC
Q 045821 208 FVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN--MASGW-RGPAVVVDDALYVLDQSSG-----TKLMMW-QKE 278 (352)
Q Consensus 208 ~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~--~~~~~-~~~~~~~~~~l~~~gg~~~-----~~v~~y-d~~ 278 (352)
..+||.+|.+.-. .......|.+||..+++|+.+..| ..... ....+..+|+|.++.-... -++|+. |.+
T Consensus 2 icinGvly~~a~~-~~~~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~~~~~~iWvLeD~~ 80 (129)
T PF08268_consen 2 ICINGVLYWLAWS-EDSDNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEYKGKLALVSYNDQGEPDSIDIWVLEDYE 80 (129)
T ss_pred EEECcEEEeEEEE-CCCCCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEeCCeEEEEEecCCCCcceEEEEEeeccc
Confidence 3578999888544 334455899999999999998764 22222 2234678999998865432 256666 666
Q ss_pred CCceEEccC
Q 045821 279 SREWSPVGR 287 (352)
Q Consensus 279 ~~~W~~~~~ 287 (352)
.++|++...
T Consensus 81 k~~Wsk~~~ 89 (129)
T PF08268_consen 81 KQEWSKKHI 89 (129)
T ss_pred cceEEEEEE
Confidence 789997753
No 111
>PRK04043 tolB translocation protein TolB; Provisional
Probab=95.09 E-value=2.9 Score=38.99 Aligned_cols=177 Identities=11% Similarity=0.036 Sum_probs=96.8
Q ss_pred CeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-E
Q 045821 136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-K 213 (352)
Q Consensus 136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~ 213 (352)
.++|++|..+++=+.+...+....... ..-|+ ++.+........+++.+|..++.++++...+. ........-+| +
T Consensus 213 ~~Iyv~dl~tg~~~~lt~~~g~~~~~~-~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~-~d~~p~~SPDG~~ 290 (419)
T PRK04043 213 PTLYKYNLYTGKKEKIASSQGMLVVSD-VSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITNYPG-IDVNGNFVEDDKR 290 (419)
T ss_pred CEEEEEECCCCcEEEEecCCCcEEeeE-ECCCCCEEEEEEccCCCcEEEEEECCCCcEEEcccCCC-ccCccEECCCCCE
Confidence 478888888887666654322111111 12244 55554433333789999999999998864432 11112233345 5
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCC------C-CeEEEEeCCCCceEEc
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSS------G-TKLMMWQKESREWSPV 285 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~------~-~~v~~yd~~~~~W~~~ 285 (352)
||..-.. .....++.+|..+++.+++..... . .....-+| .|.+..... + .+++++|.+++.++.+
T Consensus 291 I~F~Sdr---~g~~~Iy~~dl~~g~~~rlt~~g~--~-~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~L 364 (419)
T PRK04043 291 IVFVSDR---LGYPNIFMKKLNSGSVEQVVFHGK--N-NSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRL 364 (419)
T ss_pred EEEEECC---CCCceEEEEECCCCCeEeCccCCC--c-CceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEEC
Confidence 6655322 223479999999888877643211 1 22333344 454444322 2 4799999999999888
Q ss_pred cCCCCCcCCCCceEEEeCCEEEEEc---CCceEEEEEcCCc
Q 045821 286 GRLSTLLTRPPCKLVAIGKTIFVIG---KGCSAVVIDVGNI 323 (352)
Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~i~v~g---G~~~~~~~d~~~~ 323 (352)
..... .........+..|+... |.....++++..+
T Consensus 365 T~~~~---~~~p~~SPDG~~I~f~~~~~~~~~L~~~~l~g~ 402 (419)
T PRK04043 365 TANGV---NQFPRFSSDGGSIMFIKYLGNQSALGIIRLNYN 402 (419)
T ss_pred CCCCC---cCCeEECCCCCEEEEEEccCCcEEEEEEecCCC
Confidence 65321 11233333344455443 3334556666553
No 112
>KOG2120 consensus SCF ubiquitin ligase, Skp2 component [Posttranslational modification, protein turnover, chaperones]
Probab=95.04 E-value=0.016 Score=49.83 Aligned_cols=38 Identities=24% Similarity=0.281 Sum_probs=34.8
Q ss_pred cCCCcHHHHHHhhccCccccchhhhhhhhhhcccccCh
Q 045821 18 ISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSE 55 (352)
Q Consensus 18 ~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~ 55 (352)
|.+||||+++.|++-++.+.+.+...|||+|..+....
T Consensus 98 ~~slpDEill~IFs~L~kk~LL~~~~VC~Rfyr~~~de 135 (419)
T KOG2120|consen 98 WDSLPDEILLGIFSCLCKKELLKVSGVCKRFYRLASDE 135 (419)
T ss_pred cccCCHHHHHHHHHhccHHHHHHHHHHHHHHhhccccc
Confidence 67899999999999999999999999999999886543
No 113
>PLN00181 protein SPA1-RELATED; Provisional
Probab=95.00 E-value=4.8 Score=41.04 Aligned_cols=172 Identities=10% Similarity=0.047 Sum_probs=81.9
Q ss_pred CeEEEEECCCCeeEecCCCCCcceeeeeEE--ECCEEEEEeCCCCCCceEEEeCCCCcEE-EecCCCCcccceeEEE--E
Q 045821 136 SEVYCYDASMNTWTDAAPMSTARCYFPCGV--LNQKIYCIGGLGDTHSWDVYDPRTNNWK-LHTEPNIFTEIEDSFV--M 210 (352)
Q Consensus 136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~--~~~~iyv~gG~~~~~~v~~yd~~t~~W~-~~~~~~~~~~~~~~~~--~ 210 (352)
..+.+||..+++-...-..... .-.+++. .++.+++.|+.+ ..+.+||..+..-. .+. .......+.. -
T Consensus 555 g~v~lWd~~~~~~~~~~~~H~~-~V~~l~~~p~~~~~L~Sgs~D--g~v~iWd~~~~~~~~~~~---~~~~v~~v~~~~~ 628 (793)
T PLN00181 555 GVVQVWDVARSQLVTEMKEHEK-RVWSIDYSSADPTLLASGSDD--GSVKLWSINQGVSIGTIK---TKANICCVQFPSE 628 (793)
T ss_pred CeEEEEECCCCeEEEEecCCCC-CEEEEEEcCCCCCEEEEEcCC--CEEEEEECCCCcEEEEEe---cCCCeEEEEEeCC
Confidence 4678889877653221111111 1122222 246677777765 67889998765422 221 1111111211 2
Q ss_pred CCEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcEE-EECCEEEEEecCCCCeEEEEeCCCC----ceE
Q 045821 211 DGKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPAV-VVDDALYVLDQSSGTKLMMWQKESR----EWS 283 (352)
Q Consensus 211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~~-~~~~~l~~~gg~~~~~v~~yd~~~~----~W~ 283 (352)
++.+++.|+.++ .+..||..+.. ...+.. +...-..+ ..++..++.++.++ .+.+||.... .|.
T Consensus 629 ~g~~latgs~dg-----~I~iwD~~~~~~~~~~~~~---h~~~V~~v~f~~~~~lvs~s~D~-~ikiWd~~~~~~~~~~~ 699 (793)
T PLN00181 629 SGRSLAFGSADH-----KVYYYDLRNPKLPLCTMIG---HSKTVSYVRFVDSSTLVSSSTDN-TLKLWDLSMSISGINET 699 (793)
T ss_pred CCCEEEEEeCCC-----eEEEEECCCCCccceEecC---CCCCEEEEEEeCCCEEEEEECCC-EEEEEeCCCCccccCCc
Confidence 567777765542 68888886542 112111 11011112 23566666776665 7999998643 233
Q ss_pred EccCCCCCcCCCCceEEEeCCEEEEEcCCce-EEEEEcCC
Q 045821 284 PVGRLSTLLTRPPCKLVAIGKTIFVIGKGCS-AVVIDVGN 322 (352)
Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~-~~~~d~~~ 322 (352)
.+..+........+.....++.+++.|+.++ +.+|+...
T Consensus 700 ~l~~~~gh~~~i~~v~~s~~~~~lasgs~D~~v~iw~~~~ 739 (793)
T PLN00181 700 PLHSFMGHTNVKNFVGLSVSDGYIATGSETNEVFVYHKAF 739 (793)
T ss_pred ceEEEcCCCCCeeEEEEcCCCCEEEEEeCCCEEEEEECCC
Confidence 3322211001111122223455666665544 55677653
No 114
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=94.72 E-value=2.5 Score=40.70 Aligned_cols=117 Identities=17% Similarity=0.239 Sum_probs=68.2
Q ss_pred EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCc--------ceeeeeEEECCEEEEEeCCCCCCceEE
Q 045821 115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTA--------RCYFPCGVLNQKIYCIGGLGDTHSWDV 184 (352)
Q Consensus 115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~--------r~~~~~~~~~~~iyv~gG~~~~~~v~~ 184 (352)
-++.++.||+... ...++.+|..|++ |+.-...+.. ......++.+++||+... + ..+..
T Consensus 65 Pvv~~g~vyv~s~-------~g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t~-d--g~l~A 134 (527)
T TIGR03075 65 PLVVDGVMYVTTS-------YSRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGTL-D--ARLVA 134 (527)
T ss_pred CEEECCEEEEECC-------CCcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEcC-C--CEEEE
Confidence 3566899998643 2468899998875 8754432211 111234566888887543 2 56899
Q ss_pred EeCCCCc--EEEec-CCCCc-ccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc--eee
Q 045821 185 YDPRTNN--WKLHT-EPNIF-TEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS--WLH 241 (352)
Q Consensus 185 yd~~t~~--W~~~~-~~~~~-~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~ 241 (352)
+|..|.+ |+.-. .+... .....-++.+++||+-....+......+..||..+++ |+.
T Consensus 135 LDa~TGk~~W~~~~~~~~~~~~~tssP~v~~g~Vivg~~~~~~~~~G~v~AlD~~TG~~lW~~ 197 (527)
T TIGR03075 135 LDAKTGKVVWSKKNGDYKAGYTITAAPLVVKGKVITGISGGEFGVRGYVTAYDAKTGKLVWRR 197 (527)
T ss_pred EECCCCCEEeecccccccccccccCCcEEECCEEEEeecccccCCCcEEEEEECCCCceeEec
Confidence 9998875 86532 11111 1122345678888875322211222368888887764 763
No 115
>PRK05137 tolB translocation protein TolB; Provisional
Probab=94.66 E-value=3.9 Score=38.37 Aligned_cols=181 Identities=10% Similarity=0.028 Sum_probs=93.0
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK 213 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~ 213 (352)
...++++|+.+++.+.+...+......... -++ +|++....+....++.+|..+..-+++...+. ........-+|+
T Consensus 225 ~~~i~~~dl~~g~~~~l~~~~g~~~~~~~S-PDG~~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~~~~-~~~~~~~spDG~ 302 (435)
T PRK05137 225 RPRVYLLDLETGQRELVGNFPGMTFAPRFS-PDGRKVVMSLSQGGNTDIYTMDLRSGTTTRLTDSPA-IDTSPSYSPDGS 302 (435)
T ss_pred CCEEEEEECCCCcEEEeecCCCcccCcEEC-CCCCEEEEEEecCCCceEEEEECCCCceEEccCCCC-ccCceeEcCCCC
Confidence 368999999998887776544322211111 144 55444433333778999999888777653221 111122333554
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEEccCCCCC
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSPVGRLSTL 291 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~~~~~~~~ 291 (352)
-.++.... .....++.+|..+...+.+... ..........-+| .|++.....+ ..++.+|+.++....+....
T Consensus 303 ~i~f~s~~--~g~~~Iy~~d~~g~~~~~lt~~-~~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~~~~~lt~~~-- 377 (435)
T PRK05137 303 QIVFESDR--SGSPQLYVMNADGSNPRRISFG-GGRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGSGERILTSGF-- 377 (435)
T ss_pred EEEEEECC--CCCCeEEEEECCCCCeEEeecC-CCcccCeEECCCCCEEEEEEcCCCceEEEEEECCCCceEeccCCC--
Confidence 33332221 1123688899887777766432 1111222233344 5555443222 47899998877665554211
Q ss_pred cCCCCceEEEeCCEEEEEc---CC---ceEEEEEcCCc
Q 045821 292 LTRPPCKLVAIGKTIFVIG---KG---CSAVVIDVGNI 323 (352)
Q Consensus 292 ~~~~~~~~~~~~~~i~v~g---G~---~~~~~~d~~~~ 323 (352)
..........+..|+... |. ...+++|....
T Consensus 378 -~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~ 414 (435)
T PRK05137 378 -LVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGR 414 (435)
T ss_pred -CCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCCC
Confidence 111223333344455443 22 24566777543
No 116
>PF12217 End_beta_propel: Catalytic beta propeller domain of bacteriophage endosialidase; InterPro: IPR024428 This entry represents the beta propeller domain of endosialidases, which consists of catalytically active part of the enzymes. This core domain forms stable SDS-resistant trimers. There is a nested beta barrel domain in this domain. This domain is typically between 443 and 460 amino acids in length [].; PDB: 1V0E_B 1V0F_E 3JU4_A 3GVL_A 3GVK_B 3GVJ_A.
Probab=94.61 E-value=2.5 Score=35.89 Aligned_cols=175 Identities=10% Similarity=0.160 Sum_probs=80.8
Q ss_pred CCCCeeEec---cCCC--cccCCcceEEEEECCEEEEEeC-CCCCCCCCCeEEEEE---CCCCeeEe--cCCCCC-----
Q 045821 93 TRRSWKMIH---ELPA--RTLRRKGMGFEVLGKNAYLLGG-CGWSEDATSEVYCYD---ASMNTWTD--AAPMST----- 156 (352)
Q Consensus 93 ~~~~w~~~~---~~~~--~~~~~~~~~~~~~~~~iyv~GG-~~~~~~~~~~~~~~d---~~t~~W~~--~~~~~~----- 156 (352)
..++|+.-. ++.. +....++++..++++++|.+=- ..-........+.|| ...+.|+. ++..+.
T Consensus 53 dG~TWttPEwLtd~H~~yptvnyHCmSMGv~~NRLfa~iEtR~~a~~km~~~~Lw~RpMF~~spW~~teL~~~~~~~~a~ 132 (367)
T PF12217_consen 53 DGQTWTTPEWLTDLHPDYPTVNYHCMSMGVVGNRLFAVIETRTVASNKMVRAELWSRPMFHDSPWRITELGTIASFTSAG 132 (367)
T ss_dssp TTSS----EESS---TTTTTEEEE-B-EEEETTEEEEEEEEEETTT--EEEEEEEEEE-STTS--EEEEEES-TT-----
T ss_pred CCCcccCchhhhhcCCCCCccceeeeeeeeecceeeEEEeehhhhhhhhhhhhhhcccccccCCceeeeccccccccccc
Confidence 578886432 2211 1134567788899999986632 222223344455555 33566764 443333
Q ss_pred --cceeeeeEEECCEEEEEeCCCCC---Cc--eEEEeCC----CCcE-EEecC-CCCcccceeEEEECCEEEEEccCCCC
Q 045821 157 --ARCYFPCGVLNQKIYCIGGLGDT---HS--WDVYDPR----TNNW-KLHTE-PNIFTEIEDSFVMDGKIYIRCSASAA 223 (352)
Q Consensus 157 --~r~~~~~~~~~~~iyv~gG~~~~---~~--v~~yd~~----t~~W-~~~~~-~~~~~~~~~~~~~~g~iyv~GG~~~~ 223 (352)
.-.-|+.+.+++.-|.+|=...+ .. +..|... ...- +.++. ........+.-.++|.||+..-....
T Consensus 133 ~~vTe~HSFa~i~~~~fA~GyHnGD~sPRe~G~~yfs~~~~sp~~~vrr~i~sey~~~AsEPCvkyY~g~LyLtTRgt~~ 212 (367)
T PF12217_consen 133 VAVTELHSFATIDDNQFAVGYHNGDVSPRELGFLYFSDAFASPGVFVRRIIPSEYERNASEPCVKYYDGVLYLTTRGTLP 212 (367)
T ss_dssp ---SEEEEEEE-SSS-EEEEEEE-SSSS-EEEEEEETTTTT-TT--EEEE--GGG-TTEEEEEEEEETTEEEEEEEES-T
T ss_pred ceeeeeeeeeEecCCceeEEeccCCCCcceeeEEEecccccCCcceeeeechhhhccccccchhhhhCCEEEEEEcCcCC
Confidence 23467888888877777743332 22 2223221 1111 22221 11222334556789999998533322
Q ss_pred -CCceEEEEEeCCCCceeecCCCCCCCCCC-cEEEECCEEEEEecC
Q 045821 224 -TSHVCALVYEPSTDSWLHADANMASGWRG-PAVVVDDALYVLDQS 267 (352)
Q Consensus 224 -~~~~~i~~yd~~~~~W~~~~~~~~~~~~~-~~~~~~~~l~~~gg~ 267 (352)
.....+.+-+.....|+.+.-+....... +....++.||++|..
T Consensus 213 ~~~GS~L~rs~d~G~~w~slrfp~nvHhtnlPFakvgD~l~mFgsE 258 (367)
T PF12217_consen 213 TNPGSSLHRSDDNGQNWSSLRFPNNVHHTNLPFAKVGDVLYMFGSE 258 (367)
T ss_dssp TS---EEEEESSTTSS-EEEE-TT---SS---EEEETTEEEEEEE-
T ss_pred CCCcceeeeecccCCchhhccccccccccCCCceeeCCEEEEEecc
Confidence 33346778888889999886542222221 235679999999863
No 117
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=94.54 E-value=2.7 Score=36.05 Aligned_cols=192 Identities=14% Similarity=0.160 Sum_probs=105.6
Q ss_pred CCEEEEEEecCCCCceeEEEecCCC---CCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNS---TRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDAS 144 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~---~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~ 144 (352)
.+.+|++.+.... .++.|.-.. ..+.....-.+| .+-.+.+.++.+|.+|.-- .....+.+||..
T Consensus 30 ~~~iy~~~~~~~~---~v~ey~~~~~f~~~~~~~~~~~Lp---~~~~GtG~vVYngslYY~~------~~s~~IvkydL~ 97 (250)
T PF02191_consen 30 SEKIYVTSGFSGN---TVYEYRNYEDFLRNGRSSRTYKLP---YPWQGTGHVVYNGSLYYNK------YNSRNIVKYDLT 97 (250)
T ss_pred CCCEEEECccCCC---EEEEEcCHhHHhhcCCCceEEEEe---ceeccCCeEEECCcEEEEe------cCCceEEEEECc
Confidence 6778888665222 444442200 112222222333 3455566677788877653 346889999999
Q ss_pred CCeeE---ecCCCC---------CcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCC----CcEEEecCCCCcccce
Q 045821 145 MNTWT---DAAPMS---------TARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRT----NNWKLHTEPNIFTEIE 205 (352)
Q Consensus 145 t~~W~---~~~~~~---------~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t----~~W~~~~~~~~~~~~~ 205 (352)
+++-. .++... .+......++-++-++|+-..... -.+-..|+.+ .+|..- . ..+...
T Consensus 98 t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g~ivvskld~~tL~v~~tw~T~--~-~k~~~~ 174 (250)
T PF02191_consen 98 TRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNGNIVVSKLDPETLSVEQTWNTS--Y-PKRSAG 174 (250)
T ss_pred CCcEEEEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCCcEEEEeeCcccCceEEEEEec--c-Cchhhc
Confidence 98744 444321 112223444445556666443322 3334556654 357642 2 223333
Q ss_pred eEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcE-E---EECCEEEEEecCCCCeEEEEeCC
Q 045821 206 DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPA-V---VVDDALYVLDQSSGTKLMMWQKE 278 (352)
Q Consensus 206 ~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~-~---~~~~~l~~~gg~~~~~v~~yd~~ 278 (352)
.++.+=|.||++....... ..-.++||+.+++=..+..+...+....+ + ..+.+||+...- .+..|+..
T Consensus 175 naFmvCGvLY~~~s~~~~~-~~I~yafDt~t~~~~~~~i~f~~~~~~~~~l~YNP~dk~LY~wd~G---~~v~Y~v~ 247 (250)
T PF02191_consen 175 NAFMVCGVLYATDSYDTRD-TEIFYAFDTYTGKEEDVSIPFPNPYGNISMLSYNPRDKKLYAWDNG---YQVTYDVR 247 (250)
T ss_pred ceeeEeeEEEEEEECCCCC-cEEEEEEECCCCceeceeeeeccccCceEeeeECCCCCeEEEEECC---eEEEEEEE
Confidence 4677789999996554433 44567899998876655444444433333 2 247889988643 67777653
No 118
>PRK03629 tolB translocation protein TolB; Provisional
Probab=94.28 E-value=4.7 Score=37.75 Aligned_cols=148 Identities=9% Similarity=-0.030 Sum_probs=80.0
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK 213 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~ 213 (352)
...++++|..+++-+.+...+..-.... ..-++ +|++.........++.+|..++..+++..... ........-+|+
T Consensus 222 ~~~i~i~dl~~G~~~~l~~~~~~~~~~~-~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~~~~-~~~~~~wSPDG~ 299 (429)
T PRK03629 222 RSALVIQTLANGAVRQVASFPRHNGAPA-FSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDGRS-NNTEPTWFPDSQ 299 (429)
T ss_pred CcEEEEEECCCCCeEEccCCCCCcCCeE-ECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccCCCC-CcCceEECCCCC
Confidence 3578999998887766665433211111 11144 56554333322578999999988877653321 111122233555
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEEccC
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSPVGR 287 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~~~~ 287 (352)
..++.... .....++.+|..+..-+.+.... .........-+| .|++.+...+ ..++++|++++.+..+..
T Consensus 300 ~I~f~s~~--~g~~~Iy~~d~~~g~~~~lt~~~-~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt~ 372 (429)
T PRK03629 300 NLAYTSDQ--AGRPQVYKVNINGGAPQRITWEG-SQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLTD 372 (429)
T ss_pred EEEEEeCC--CCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEEccCCCceEEEEECCCCCeEEeCC
Confidence 33332222 11236888888877666653211 111122233344 4545444332 478999999999887763
No 119
>PRK03629 tolB translocation protein TolB; Provisional
Probab=94.23 E-value=4.8 Score=37.67 Aligned_cols=188 Identities=8% Similarity=0.008 Sum_probs=94.9
Q ss_pred eeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceee
Q 045821 83 VCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYF 161 (352)
Q Consensus 83 ~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~ 161 (352)
..++.+|. .++.-..+...+. .........++ .|++..... ...+++.+|..+++.+.+..........
T Consensus 223 ~~i~i~dl--~~G~~~~l~~~~~----~~~~~~~SPDG~~La~~~~~~----g~~~I~~~d~~tg~~~~lt~~~~~~~~~ 292 (429)
T PRK03629 223 SALVIQTL--ANGAVRQVASFPR----HNGAPAFSPDGSKLAFALSKT----GSLNLYVMDLASGQIRQVTDGRSNNTEP 292 (429)
T ss_pred cEEEEEEC--CCCCeEEccCCCC----CcCCeEECCCCCEEEEEEcCC----CCcEEEEEECCCCCEEEccCCCCCcCce
Confidence 45666677 6666555554431 11111222244 455543321 2346999999988877765433211111
Q ss_pred eeEEECCE-EEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCcee
Q 045821 162 PCGVLNQK-IYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWL 240 (352)
Q Consensus 162 ~~~~~~~~-iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~ 240 (352)
...-+++ |+..........++.+|..+..-+++.... .........-+|+..++.+... ....++.+|+.++.++
T Consensus 293 -~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~~~-~~~~~~~~SpDG~~Ia~~~~~~--g~~~I~~~dl~~g~~~ 368 (429)
T PRK03629 293 -TWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQRITWEG-SQNQDADVSSDGKFMVMVSSNG--GQQHIAKQDLATGGVQ 368 (429)
T ss_pred -EECCCCCEEEEEeCCCCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEEccC--CCceEEEEECCCCCeE
Confidence 1111444 443332222257888898887766664221 1111222333554444433322 1236888999999888
Q ss_pred ecCCCCCCCCCCcEEEECCEEEEEecCCC--CeEEEEeCCCCceEEcc
Q 045821 241 HADANMASGWRGPAVVVDDALYVLDQSSG--TKLMMWQKESREWSPVG 286 (352)
Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~l~~~gg~~~--~~v~~yd~~~~~W~~~~ 286 (352)
.+.... ........-+|+.+++.+..+ ..++..+.+...=..+.
T Consensus 369 ~Lt~~~--~~~~p~~SpDG~~i~~~s~~~~~~~l~~~~~~G~~~~~l~ 414 (429)
T PRK03629 369 VLTDTF--LDETPSIAPNGTMVIYSSSQGMGSVLNLVSTDGRFKARLP 414 (429)
T ss_pred EeCCCC--CCCCceECCCCCEEEEEEcCCCceEEEEEECCCCCeEECc
Confidence 876421 112334555666555544333 34667777655444443
No 120
>PRK04043 tolB translocation protein TolB; Provisional
Probab=94.09 E-value=5.1 Score=37.40 Aligned_cols=190 Identities=9% Similarity=-0.037 Sum_probs=106.8
Q ss_pred ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821 82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY 160 (352)
Q Consensus 82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~ 160 (352)
...+|.+|. .++.=+.+...+. .........++ +|.+..... ...++|.+|..+++++.+...+..-..
T Consensus 212 ~~~Iyv~dl--~tg~~~~lt~~~g----~~~~~~~SPDG~~la~~~~~~----g~~~Iy~~dl~~g~~~~LT~~~~~d~~ 281 (419)
T PRK04043 212 KPTLYKYNL--YTGKKEKIASSQG----MLVVSDVSKDGSKLLLTMAPK----GQPDIYLYDTNTKTLTQITNYPGIDVN 281 (419)
T ss_pred CCEEEEEEC--CCCcEEEEecCCC----cEEeeEECCCCCEEEEEEccC----CCcEEEEEECCCCcEEEcccCCCccCc
Confidence 458899999 8887777765431 11111223344 555544322 247899999999999888754431111
Q ss_pred eeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCC----CceEEEEEeCCC
Q 045821 161 FPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAAT----SHVCALVYEPST 236 (352)
Q Consensus 161 ~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~----~~~~i~~yd~~~ 236 (352)
.....-+.+|++.........++++|..+++.+++..-.. . .....-+|+..++-...... ....++.+|+.+
T Consensus 282 p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~--~-~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~ 358 (419)
T PRK04043 282 GNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGK--N-NSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNS 358 (419)
T ss_pred cEECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccCCC--c-CceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCC
Confidence 1111113467776654433789999999998877653211 1 12333355433332322211 224789999999
Q ss_pred CceeecCCCCCCCCCCcEEEECCE-EEEEecCCC-CeEEEEeCCCCceEEcc
Q 045821 237 DSWLHADANMASGWRGPAVVVDDA-LYVLDQSSG-TKLMMWQKESREWSPVG 286 (352)
Q Consensus 237 ~~W~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~-~~v~~yd~~~~~W~~~~ 286 (352)
+.++.+.... ........-+|+ |++.....+ ..++.++.+.+.=..++
T Consensus 359 g~~~~LT~~~--~~~~p~~SPDG~~I~f~~~~~~~~~L~~~~l~g~~~~~l~ 408 (419)
T PRK04043 359 DYIRRLTANG--VNQFPRFSSDGGSIMFIKYLGNQSALGIIRLNYNKSFLFP 408 (419)
T ss_pred CCeEECCCCC--CcCCeEECCCCCEEEEEEccCCcEEEEEEecCCCeeEEee
Confidence 9998886531 222334444555 544443322 35788888765544444
No 121
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=94.09 E-value=3.1 Score=34.87 Aligned_cols=176 Identities=10% Similarity=0.037 Sum_probs=101.2
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEEEC--CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLN--QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG 212 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~--~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g 212 (352)
.+.+.+||..|++-.+--. .--..-..+.+| ..+.+.|+.+ .++.++|..+++.+++..+..+.....++.+.+
T Consensus 80 Dk~v~vwDV~TGkv~Rr~r--gH~aqVNtV~fNeesSVv~SgsfD--~s~r~wDCRS~s~ePiQildea~D~V~Si~v~~ 155 (307)
T KOG0316|consen 80 DKAVQVWDVNTGKVDRRFR--GHLAQVNTVRFNEESSVVASGSFD--SSVRLWDCRSRSFEPIQILDEAKDGVSSIDVAE 155 (307)
T ss_pred CceEEEEEcccCeeeeecc--cccceeeEEEecCcceEEEecccc--ceeEEEEcccCCCCccchhhhhcCceeEEEecc
Confidence 4678999999986322110 000111223343 3566666666 789999999999988887777777777888888
Q ss_pred EEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEc--cCCCC
Q 045821 213 KIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPV--GRLST 290 (352)
Q Consensus 213 ~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~--~~~~~ 290 (352)
...+.|..++ .+-.||...++-.. +-+..+.....+..++...++|..++ .+...|-+++.--+. +....
T Consensus 156 heIvaGS~DG-----tvRtydiR~G~l~s--Dy~g~pit~vs~s~d~nc~La~~l~s-tlrLlDk~tGklL~sYkGhkn~ 227 (307)
T KOG0316|consen 156 HEIVAGSVDG-----TVRTYDIRKGTLSS--DYFGHPITSVSFSKDGNCSLASSLDS-TLRLLDKETGKLLKSYKGHKNM 227 (307)
T ss_pred cEEEeeccCC-----cEEEEEeecceeeh--hhcCCcceeEEecCCCCEEEEeeccc-eeeecccchhHHHHHhcccccc
Confidence 8888865553 46678876554321 11111222222344666666665544 788888877653211 11111
Q ss_pred CcCCCCceEEEeCCEEEEEcCCc--eEEEEEcCCccc
Q 045821 291 LLTRPPCKLVAIGKTIFVIGKGC--SAVVIDVGNIGN 325 (352)
Q Consensus 291 ~~~~~~~~~~~~~~~i~v~gG~~--~~~~~d~~~~~w 325 (352)
. .+ ..+...+..-.|++|.. .++++|.....-
T Consensus 228 e-yk--ldc~l~qsdthV~sgSEDG~Vy~wdLvd~~~ 261 (307)
T KOG0316|consen 228 E-YK--LDCCLNQSDTHVFSGSEDGKVYFWDLVDETQ 261 (307)
T ss_pred e-ee--eeeeecccceeEEeccCCceEEEEEecccee
Confidence 0 11 12223344456677765 366788776553
No 122
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=94.00 E-value=3.7 Score=35.48 Aligned_cols=204 Identities=12% Similarity=0.141 Sum_probs=100.4
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCC----CCCcceeeeeEEE-C-CEEEEEeCCCCCCceEEEeCCCCcE
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAP----MSTARCYFPCGVL-N-QKIYCIGGLGDTHSWDVYDPRTNNW 192 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~----~~~~r~~~~~~~~-~-~~iyv~gG~~~~~~v~~yd~~t~~W 192 (352)
.+.....||.+ +.+-+|+..++.=+.... ++.-..+.+++.+ + ++|.--.| + ..+-.+|+++.+=
T Consensus 108 Sg~~VAcGGLd------N~Csiy~ls~~d~~g~~~v~r~l~gHtgylScC~f~dD~~ilT~SG-D--~TCalWDie~g~~ 178 (343)
T KOG0286|consen 108 SGNFVACGGLD------NKCSIYPLSTRDAEGNVRVSRELAGHTGYLSCCRFLDDNHILTGSG-D--MTCALWDIETGQQ 178 (343)
T ss_pred CCCeEEecCcC------ceeEEEecccccccccceeeeeecCccceeEEEEEcCCCceEecCC-C--ceEEEEEcccceE
Confidence 56666778764 346788887653332222 2222334444444 3 34433333 2 6778899998753
Q ss_pred EEecCCCCcccceeEEE---ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC
Q 045821 193 KLHTEPNIFTEIEDSFV---MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG 269 (352)
Q Consensus 193 ~~~~~~~~~~~~~~~~~---~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~ 269 (352)
.. ..........+.. .+++.||.||.+. ....+|.....-.+.=.-...........-+|.-+..|..++
T Consensus 179 ~~--~f~GH~gDV~slsl~p~~~ntFvSg~cD~-----~aklWD~R~~~c~qtF~ghesDINsv~ffP~G~afatGSDD~ 251 (343)
T KOG0286|consen 179 TQ--VFHGHTGDVMSLSLSPSDGNTFVSGGCDK-----SAKLWDVRSGQCVQTFEGHESDINSVRFFPSGDAFATGSDDA 251 (343)
T ss_pred EE--EecCCcccEEEEecCCCCCCeEEeccccc-----ceeeeeccCcceeEeecccccccceEEEccCCCeeeecCCCc
Confidence 22 1211111111111 2789999988774 234445544432221000000001111233566666665554
Q ss_pred CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc-eEEEEEcCCcccccceEEeccCCCCCCCCcceeeE
Q 045821 270 TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC-SAVVIDVGNIGNIGGIMVSSSIPKLNDNDDIISCK 348 (352)
Q Consensus 270 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~-~~~~~d~~~~~w~~~~~~~~~~~~~~~~~~~~~~~ 348 (352)
..-.||+..++=-.+=.-+..........+...|+++..|..+ ++.+.|.-+ -.++-.|..|+|-++|-
T Consensus 252 -tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlLfagy~d~~c~vWDtlk---------~e~vg~L~GHeNRvScl 321 (343)
T KOG0286|consen 252 -TCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLLFAGYDDFTCNVWDTLK---------GERVGVLAGHENRVSCL 321 (343)
T ss_pred -eeEEEeecCCcEEeeeccCcccCCceeEEEcccccEEEeeecCCceeEeeccc---------cceEEEeeccCCeeEEE
Confidence 7788888876533222222111122223344578888877543 344455443 33455667888888763
No 123
>PRK02889 tolB translocation protein TolB; Provisional
Probab=93.82 E-value=5.8 Score=37.12 Aligned_cols=146 Identities=12% Similarity=0.027 Sum_probs=77.0
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEE-ECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGV-LNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG 212 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~-~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g 212 (352)
...++++|..+++-..+...+... .+.+. -++ +|++....+....++.+|..+...+++.... .........-+|
T Consensus 219 ~~~I~~~dl~~g~~~~l~~~~g~~--~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~-~~~~~~~wSpDG 295 (427)
T PRK02889 219 KPVVYVHDLATGRRRVVANFKGSN--SAPAWSPDGRTLAVALSRDGNSQIYTVNADGSGLRRLTQSS-GIDTEPFFSPDG 295 (427)
T ss_pred CcEEEEEECCCCCEEEeecCCCCc--cceEECCCCCEEEEEEccCCCceEEEEECCCCCcEECCCCC-CCCcCeEEcCCC
Confidence 357999999988766655433211 11111 144 5554444333367888898877766664321 111122233356
Q ss_pred EEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEEcc
Q 045821 213 KIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSPVG 286 (352)
Q Consensus 213 ~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~~~ 286 (352)
+..++.... .....++.++..+...+.+.... .........-+| .|++.....+ ..++++|..++....+.
T Consensus 296 ~~l~f~s~~--~g~~~Iy~~~~~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt 368 (427)
T PRK02889 296 RSIYFTSDR--GGAPQIYRMPASGGAAQRVTFTG-SYNTSPRISPDGKLLAYISRVGGAFKLYVQDLATGQVTALT 368 (427)
T ss_pred CEEEEEecC--CCCcEEEEEECCCCceEEEecCC-CCcCceEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEcc
Confidence 533332222 12246888888777776664211 111122333344 4554443322 36899999988877764
No 124
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=93.55 E-value=6.8 Score=37.04 Aligned_cols=178 Identities=17% Similarity=0.119 Sum_probs=89.4
Q ss_pred CCeEEEEECCCC-e-eEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEEC
Q 045821 135 TSEVYCYDASMN-T-WTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMD 211 (352)
Q Consensus 135 ~~~~~~~d~~t~-~-W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~ 211 (352)
...+.+||...+ + -+.+....... ++++.. .+++++.|+.+ ..|.++|..+.+-.+.-......-...++.-+
T Consensus 224 D~tiriwd~~~~~~~~~~l~gH~~~v--~~~~f~p~g~~i~Sgs~D--~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d 299 (456)
T KOG0266|consen 224 DKTLRIWDLKDDGRNLKTLKGHSTYV--TSVAFSPDGNLLVSGSDD--GTVRIWDVRTGECVRKLKGHSDGISGLAFSPD 299 (456)
T ss_pred CceEEEeeccCCCeEEEEecCCCCce--EEEEecCCCCEEEEecCC--CcEEEEeccCCeEEEeeeccCCceEEEEECCC
Confidence 356788888433 2 34444434333 232222 45888888877 78999999986544332111111112233347
Q ss_pred CEEEEEccCCCCCCceEEEEEeCCCCcee---ecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceE-EccC
Q 045821 212 GKIYIRCSASAATSHVCALVYEPSTDSWL---HADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWS-PVGR 287 (352)
Q Consensus 212 g~iyv~GG~~~~~~~~~i~~yd~~~~~W~---~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~-~~~~ 287 (352)
+.+++.+..+ ..+..||..+..-. .+................+..|++.+..+..+-.||.....-- +...
T Consensus 300 ~~~l~s~s~d-----~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d~~~~~w~l~~~~~~~~~~~ 374 (456)
T KOG0266|consen 300 GNLLVSASYD-----GTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLDRTLKLWDLRSGKSVGTYTG 374 (456)
T ss_pred CCEEEEcCCC-----ccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCCCeEEEEEccCCcceeeecc
Confidence 7888876443 25778888887743 2222111111111222244455555555557888888754322 1111
Q ss_pred CCCCcCCCCceEE-EeCCEEEEEcCCc-eEEEEEcCC
Q 045821 288 LSTLLTRPPCKLV-AIGKTIFVIGKGC-SAVVIDVGN 322 (352)
Q Consensus 288 ~~~~~~~~~~~~~-~~~~~i~v~gG~~-~~~~~d~~~ 322 (352)
.... .+..+..+ ..+++..+.|+.+ .+.++|+..
T Consensus 375 ~~~~-~~~~~~~~~~~~~~~i~sg~~d~~v~~~~~~s 410 (456)
T KOG0266|consen 375 HSNL-VRCIFSPTLSTGGKLIYSGSEDGSVYVWDSSS 410 (456)
T ss_pred cCCc-ceeEecccccCCCCeEEEEeCCceEEEEeCCc
Confidence 1111 11222222 2355555555444 355788775
No 125
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=93.53 E-value=3 Score=37.27 Aligned_cols=122 Identities=13% Similarity=0.158 Sum_probs=69.0
Q ss_pred CceEEEeCCCC-----cEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc-eeecCCCCCCCCCCc
Q 045821 180 HSWDVYDPRTN-----NWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS-WLHADANMASGWRGP 253 (352)
Q Consensus 180 ~~v~~yd~~t~-----~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~-W~~~~~~~~~~~~~~ 253 (352)
..+.+|+.... +.+.+........-.+.+.++|+|.+..|. .+..|+...++ +..............
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~~~~lv~~~g~-------~l~v~~l~~~~~l~~~~~~~~~~~i~s 134 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICSFNGRLVVAVGN-------KLYVYDLDNSKTLLKKAFYDSPFYITS 134 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEETTEEEEEETT-------EEEEEEEETTSSEEEEEEE-BSSSEEE
T ss_pred cEEEEEEEEcccccceEEEEEEEEeecCcceEhhhhCCEEEEeecC-------EEEEEEccCcccchhhheecceEEEEE
Confidence 45788888874 555554332223334566779996665442 47788877777 766643222222222
Q ss_pred EEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEe-CCEEEEEc
Q 045821 254 AVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI-GKTIFVIG 310 (352)
Q Consensus 254 ~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~~~i~v~g 310 (352)
..+.++.|++-.-..+-.++.|+.+..+...++.-.. .+...++..+ ++..++.+
T Consensus 135 l~~~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~~--~~~v~~~~~l~d~~~~i~~ 190 (321)
T PF03178_consen 135 LSVFKNYILVGDAMKSVSLLRYDEENNKLILVARDYQ--PRWVTAAEFLVDEDTIIVG 190 (321)
T ss_dssp EEEETTEEEEEESSSSEEEEEEETTTE-EEEEEEESS---BEEEEEEEE-SSSEEEEE
T ss_pred EeccccEEEEEEcccCEEEEEEEccCCEEEEEEecCC--CccEEEEEEecCCcEEEEE
Confidence 3456887776655555567788987777888875433 2334455555 54444444
No 126
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=93.20 E-value=13 Score=39.24 Aligned_cols=150 Identities=15% Similarity=0.149 Sum_probs=83.4
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCC----------CC-c--ceeeeeEEE-C-CEEEEEeCCCCCCceE
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPM----------ST-A--RCYFPCGVL-N-QKIYCIGGLGDTHSWD 183 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~----------~~-~--r~~~~~~~~-~-~~iyv~gG~~~~~~v~ 183 (352)
++.+|+... ....+++||+.++....+..- .. . .....++.. + +.|||....+ +.+.
T Consensus 694 ~g~LyVad~------~~~~I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n--~~Ir 765 (1057)
T PLN02919 694 NEKVYIAMA------GQHQIWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSES--SSIR 765 (1057)
T ss_pred CCeEEEEEC------CCCeEEEEECCCCeEEEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCC--CeEE
Confidence 578888753 245688888887765433210 00 0 111123332 3 4699987654 7899
Q ss_pred EEeCCCCcEEEec--CC--CC--------------cc-cc--eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeec
Q 045821 184 VYDPRTNNWKLHT--EP--NI--------------FT-EI--EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHA 242 (352)
Q Consensus 184 ~yd~~t~~W~~~~--~~--~~--------------~~-~~--~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~ 242 (352)
+||+.++.-+.+. .. +. .. .. ..++.-+|.|||....+ ..|.+||+.++....+
T Consensus 766 v~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N-----~rIrviD~~tg~v~ti 840 (1057)
T PLN02919 766 ALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYN-----HKIKKLDPATKRVTTL 840 (1057)
T ss_pred EEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCC-----CEEEEEECCCCeEEEE
Confidence 9999876543221 00 00 00 01 11233467899985332 3689999988877655
Q ss_pred CCCC----------CCCCCCc---EEEECCEEEEEecCCCCeEEEEeCCCCce
Q 045821 243 DANM----------ASGWRGP---AVVVDDALYVLDQSSGTKLMMWQKESREW 282 (352)
Q Consensus 243 ~~~~----------~~~~~~~---~~~~~~~l~~~gg~~~~~v~~yd~~~~~W 282 (352)
.... ......+ ++.-+|++|+....++ .|.++|..+++-
T Consensus 841 aG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn-~Irvid~~~~~~ 892 (1057)
T PLN02919 841 AGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNS-LIRYLDLNKGEA 892 (1057)
T ss_pred eccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECCCC-EEEEEECCCCcc
Confidence 3211 0111111 2334688988876554 899999988753
No 127
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=93.15 E-value=7.1 Score=36.09 Aligned_cols=216 Identities=12% Similarity=0.074 Sum_probs=106.4
Q ss_pred CCCCeeEecc-CCCcc-cCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC-CcceeeeeEEE-CC
Q 045821 93 TRRSWKMIHE-LPART-LRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS-TARCYFPCGVL-NQ 168 (352)
Q Consensus 93 ~~~~w~~~~~-~~~~~-~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~-~~r~~~~~~~~-~~ 168 (352)
...+|.+... ..... ...+...+...++..|++|-. ..++.=.-.-.+|+.++..+ .+......... ++
T Consensus 118 GG~tW~~~~~~~~~~~~~~~~l~~v~f~~~~g~~vG~~-------G~il~T~DgG~tW~~~~~~~~~p~~~~~i~~~~~~ 190 (398)
T PLN00033 118 GGKTWVPRSIPSAEDEDFNYRFNSISFKGKEGWIIGKP-------AILLHTSDGGETWERIPLSPKLPGEPVLIKATGPK 190 (398)
T ss_pred CCCCceECccCcccccccccceeeeEEECCEEEEEcCc-------eEEEEEcCCCCCceECccccCCCCCceEEEEECCC
Confidence 5788988642 11100 112234445557778887531 12222122235799876432 12112223334 45
Q ss_pred EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCc------cc--------c----ee-EEEECCEEEEEccCCCCCCceEE
Q 045821 169 KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIF------TE--------I----ED-SFVMDGKIYIRCSASAATSHVCA 229 (352)
Q Consensus 169 ~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~------~~--------~----~~-~~~~~g~iyv~GG~~~~~~~~~i 229 (352)
.++++|.. ..+++=+-.-.+|+.+...+.. .. + .. ...-++.++++|-.. .+
T Consensus 191 ~~~ivg~~---G~v~~S~D~G~tW~~~~~~t~~~~l~~~~~s~~~g~~~y~Gsf~~v~~~~dG~~~~vg~~G------~~ 261 (398)
T PLN00033 191 SAEMVTDE---GAIYVTSNAGRNWKAAVEETVSATLNRTVSSGISGASYYTGTFSTVNRSPDGDYVAVSSRG------NF 261 (398)
T ss_pred ceEEEecc---ceEEEECCCCCCceEcccccccccccccccccccccceeccceeeEEEcCCCCEEEEECCc------cE
Confidence 67777743 3344444445689987322110 00 0 00 112245566663211 23
Q ss_pred E-EEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCce-----EEccCCCCCcCCCCceEEE-e
Q 045821 230 L-VYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREW-----SPVGRLSTLLTRPPCKLVA-I 302 (352)
Q Consensus 230 ~-~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W-----~~~~~~~~~~~~~~~~~~~-~ 302 (352)
+ ..|.....|+.+..+............++.+++++... .+..-+.....| .++..... .....++.. -
T Consensus 262 ~~s~d~G~~~W~~~~~~~~~~l~~v~~~~dg~l~l~g~~G--~l~~S~d~G~~~~~~~f~~~~~~~~--~~~l~~v~~~~ 337 (398)
T PLN00033 262 YLTWEPGQPYWQPHNRASARRIQNMGWRADGGLWLLTRGG--GLYVSKGTGLTEEDFDFEEADIKSR--GFGILDVGYRS 337 (398)
T ss_pred EEecCCCCcceEEecCCCccceeeeeEcCCCCEEEEeCCc--eEEEecCCCCcccccceeecccCCC--CcceEEEEEcC
Confidence 3 34444445999877655555555556688888887543 455555555545 44432211 111223332 3
Q ss_pred CCEEEEEcCCceEEEEEcCCcccccc
Q 045821 303 GKTIFVIGKGCSAVVIDVGNIGNIGG 328 (352)
Q Consensus 303 ~~~i~v~gG~~~~~~~d~~~~~w~~~ 328 (352)
++.+++.|..--.+.-...-.+|.+.
T Consensus 338 d~~~~a~G~~G~v~~s~D~G~tW~~~ 363 (398)
T PLN00033 338 KKEAWAAGGSGILLRSTDGGKSWKRD 363 (398)
T ss_pred CCcEEEEECCCcEEEeCCCCcceeEc
Confidence 66788888654444445556678774
No 128
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=93.05 E-value=3.8 Score=37.58 Aligned_cols=199 Identities=12% Similarity=0.087 Sum_probs=92.6
Q ss_pred eEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEE----ECCEEEEEeCCCCCCceEEEeCC
Q 045821 113 MGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGV----LNQKIYCIGGLGDTHSWDVYDPR 188 (352)
Q Consensus 113 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~----~~~~iyv~gG~~~~~~v~~yd~~ 188 (352)
++++..+...|++||. ...+++++...++.--.+- .+++ ...+. -|+..++.||.+ ..|.+|+..
T Consensus 85 ~al~s~n~G~~l~ag~-----i~g~lYlWelssG~LL~v~---~aHY-Q~ITcL~fs~dgs~iiTgskD--g~V~vW~l~ 153 (476)
T KOG0646|consen 85 HALASSNLGYFLLAGT-----ISGNLYLWELSSGILLNVL---SAHY-QSITCLKFSDDGSHIITGSKD--GAVLVWLLT 153 (476)
T ss_pred eeeecCCCceEEEeec-----ccCcEEEEEeccccHHHHH---Hhhc-cceeEEEEeCCCcEEEecCCC--ccEEEEEEE
Confidence 5566677778888872 3456888888887522211 2222 22222 267777777766 445554432
Q ss_pred CC--cEEEecCCCCcccceeEEE----------ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEE
Q 045821 189 TN--NWKLHTEPNIFTEIEDSFV----------MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVV 256 (352)
Q Consensus 189 t~--~W~~~~~~~~~~~~~~~~~----------~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~ 256 (352)
+- .-..-.+.|......+... .+.++|-.+ .+ ..+-+||.+.+.--.-- ..+......++-
T Consensus 154 ~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS-~D-----~t~k~wdlS~g~LLlti-~fp~si~av~lD 226 (476)
T KOG0646|consen 154 DLVSADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTAS-ED-----RTIKLWDLSLGVLLLTI-TFPSSIKAVALD 226 (476)
T ss_pred eecccccCCCccceeeeccCcceeEEEEecCCCccceEEEec-CC-----ceEEEEEeccceeeEEE-ecCCcceeEEEc
Confidence 10 0000011122222222211 133455542 11 25667787776432110 112222222222
Q ss_pred ECCEEEEEecCCCCeEEEEeCCCCceEE--------------ccCCCCCcC-CCCceE-EEeCCEEEEEcCCceE-EEEE
Q 045821 257 VDDALYVLDQSSGTKLMMWQKESREWSP--------------VGRLSTLLT-RPPCKL-VAIGKTIFVIGKGCSA-VVID 319 (352)
Q Consensus 257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~--------------~~~~~~~~~-~~~~~~-~~~~~~i~v~gG~~~~-~~~d 319 (352)
..+..+++|+..| .++..+..+-.=.. +..+-.... ..-.++ +..++.+++.|+.+.. .+.|
T Consensus 227 pae~~~yiGt~~G-~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh~~~~~ITcLais~DgtlLlSGd~dg~VcvWd 305 (476)
T KOG0646|consen 227 PAERVVYIGTEEG-KIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGHENESAITCLAISTDGTLLLSGDEDGKVCVWD 305 (476)
T ss_pred ccccEEEecCCcc-eEEeeehhcCCcccccccccccccccceeeeeccccCCcceeEEEEecCccEEEeeCCCCCEEEEe
Confidence 3445555666555 55555443211000 111111001 111222 3458999999988764 4799
Q ss_pred cCCcccccceE
Q 045821 320 VGNIGNIGGIM 330 (352)
Q Consensus 320 ~~~~~w~~~~~ 330 (352)
+...+-.+...
T Consensus 306 i~S~Q~iRtl~ 316 (476)
T KOG0646|consen 306 IYSKQCIRTLQ 316 (476)
T ss_pred cchHHHHHHHh
Confidence 98887666433
No 129
>smart00284 OLF Olfactomedin-like domains.
Probab=92.95 E-value=5.4 Score=34.19 Aligned_cols=192 Identities=16% Similarity=0.200 Sum_probs=103.7
Q ss_pred CCEEEEEEecCCCCceeEEEe----cCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEEC
Q 045821 68 ETWIYAFCRDNKLERVCCYVL----DPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDA 143 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~----d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~ 143 (352)
++.+|++.+.. .....++.| |. ......+.-.+| .+..+.+.++.+|.+|.-- .....+.+||.
T Consensus 34 ~~~~wv~~~~~-~~~~~v~ey~~~~~f--~~~~~~~~~~Lp---~~~~GtG~VVYngslYY~~------~~s~~iiKydL 101 (255)
T smart00284 34 KSLYWYMPLNT-RVLRSVREYSSMSDF--QMGKNPTDHPLP---HAGQGTGVVVYNGSLYFNK------FNSHDICRFDL 101 (255)
T ss_pred CceEEEEcccc-CCCcEEEEecCHHHH--hccCCceEEECC---CccccccEEEECceEEEEe------cCCccEEEEEC
Confidence 46788875532 122334444 22 333443333455 4566777888899998853 23577999999
Q ss_pred CCCeeEecCCCCCc------------ceeeeeEEECCEEEEEeCCCCC---CceEEEeCCC----CcEEEecCCCCcccc
Q 045821 144 SMNTWTDAAPMSTA------------RCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRT----NNWKLHTEPNIFTEI 204 (352)
Q Consensus 144 ~t~~W~~~~~~~~~------------r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t----~~W~~~~~~~~~~~~ 204 (352)
.+++-.....+|.+ ......++-++-++|+=..... -.+-..|+.| .+|..- .+. +..
T Consensus 102 ~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~~g~ivvSkLnp~tL~ve~tW~T~--~~k-~sa 178 (255)
T smart00284 102 TTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATEQNAGKIVISKLNPATLTIENTWITT--YNK-RSA 178 (255)
T ss_pred CCCcEEEEEecCccccccccccccCCCccEEEEEcCCceEEEEeccCCCCCEEEEeeCcccceEEEEEEcC--CCc-ccc
Confidence 99986433333322 1223344444445555332211 2234566654 357652 222 222
Q ss_pred eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcE-E---EECCEEEEEecCCCCeEEEEeCC
Q 045821 205 EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPA-V---VVDDALYVLDQSSGTKLMMWQKE 278 (352)
Q Consensus 205 ~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~-~---~~~~~l~~~gg~~~~~v~~yd~~ 278 (352)
..++.+=|.||++-... .....-.+.||+.+++=..+..+.+.+....+ + ..+.+||+.... .+..||..
T Consensus 179 ~naFmvCGvLY~~~s~~-~~~~~I~yayDt~t~~~~~~~i~f~n~y~~~s~l~YNP~d~~LY~wdng---~~l~Y~v~ 252 (255)
T smart00284 179 SNAFMICGILYVTRSLG-SKGEKVFYAYDTNTGKEGHLDIPFENMYEYISMLDYNPNDRKLYAWNNG---HLVHYDIA 252 (255)
T ss_pred cccEEEeeEEEEEccCC-CCCcEEEEEEECCCCccceeeeeeccccccceeceeCCCCCeEEEEeCC---eEEEEEEE
Confidence 24566678999995322 12223567899988764333333333333322 2 247889988543 67777753
No 130
>PTZ00421 coronin; Provisional
Probab=92.89 E-value=9 Score=36.59 Aligned_cols=147 Identities=14% Similarity=0.154 Sum_probs=68.8
Q ss_pred CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc--cceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCcee-ecCC
Q 045821 168 QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT--EIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWL-HADA 244 (352)
Q Consensus 168 ~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~--~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~ 244 (352)
+.+++.||.+ ..+.+||..+.+-... +.... .......-++.+.+.|+.++ .+..||+.+++-. .+..
T Consensus 138 ~~iLaSgs~D--gtVrIWDl~tg~~~~~--l~~h~~~V~sla~spdG~lLatgs~Dg-----~IrIwD~rsg~~v~tl~~ 208 (493)
T PTZ00421 138 MNVLASAGAD--MVVNVWDVERGKAVEV--IKCHSDQITSLEWNLDGSLLCTTSKDK-----KLNIIDPRDGTIVSSVEA 208 (493)
T ss_pred CCEEEEEeCC--CEEEEEECCCCeEEEE--EcCCCCceEEEEEECCCCEEEEecCCC-----EEEEEECCCCcEEEEEec
Confidence 3567777765 6788999987753221 11111 11122233677777766553 5788998766422 2211
Q ss_pred CCCCCCCCcEE-EECCEEEEEecCC---CCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEEcC--CceEE
Q 045821 245 NMASGWRGPAV-VVDDALYVLDQSS---GTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGK--GCSAV 316 (352)
Q Consensus 245 ~~~~~~~~~~~-~~~~~l~~~gg~~---~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG--~~~~~ 316 (352)
.........+ ..++..++..|.. ...+.+||..+.. .... ..... ..........++.++++|| ...+.
T Consensus 209 -H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~-~~d~~-~~~~~~~~d~d~~~L~lggkgDg~Ir 285 (493)
T PTZ00421 209 -HASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTV-DLDQS-SALFIPFFDEDTNLLYIGSKGEGNIR 285 (493)
T ss_pred -CCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEeCCCCCCceeEe-ccCCC-CceEEEEEcCCCCEEEEEEeCCCeEE
Confidence 1111111111 2233344444432 2479999987533 1111 11110 0000111122445555544 33566
Q ss_pred EEEcCCcccc
Q 045821 317 VIDVGNIGNI 326 (352)
Q Consensus 317 ~~d~~~~~w~ 326 (352)
+||+......
T Consensus 286 iwdl~~~~~~ 295 (493)
T PTZ00421 286 CFELMNERLT 295 (493)
T ss_pred EEEeeCCceE
Confidence 8998876543
No 131
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=92.87 E-value=5.2 Score=33.75 Aligned_cols=185 Identities=17% Similarity=0.204 Sum_probs=95.1
Q ss_pred eeEecCCCCC-----ccee-eeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE---CCEEEEE
Q 045821 147 TWTDAAPMST-----ARCY-FPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM---DGKIYIR 217 (352)
Q Consensus 147 ~W~~~~~~~~-----~r~~-~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~---~g~iyv~ 217 (352)
.|+..+++.. |-.. +....-.+.|+..||. ..++..|.++++.+..- .....+-++++. +++|+-
T Consensus 100 lwe~~~P~~~~~~evPeINam~ldP~enSi~~AgGD---~~~y~~dlE~G~i~r~~--rGHtDYvH~vv~R~~~~qils- 173 (325)
T KOG0649|consen 100 LWEVKIPMQVDAVEVPEINAMWLDPSENSILFAGGD---GVIYQVDLEDGRIQREY--RGHTDYVHSVVGRNANGQILS- 173 (325)
T ss_pred hhhhcCccccCcccCCccceeEeccCCCcEEEecCC---eEEEEEEecCCEEEEEE--cCCcceeeeeeecccCcceee-
Confidence 4777766543 3222 2233345788888874 56788999999876542 222333344433 345443
Q ss_pred ccCCCCCCceEEEEEeCCCCceeec-CC----CCCCC-CCC--cEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821 218 CSASAATSHVCALVYEPSTDSWLHA-DA----NMASG-WRG--PAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLS 289 (352)
Q Consensus 218 GG~~~~~~~~~i~~yd~~~~~W~~~-~~----~~~~~-~~~--~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~ 289 (352)
|+.++ .+-.+|..+.+-.++ .+ ...++ +.. .+...+...++.||-. .+..|.....+-+.+-..|
T Consensus 174 G~EDG-----tvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGgGp--~lslwhLrsse~t~vfpip 246 (325)
T KOG0649|consen 174 GAEDG-----TVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGGGP--KLSLWHLRSSESTCVFPIP 246 (325)
T ss_pred cCCCc-----cEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEecCCC--ceeEEeccCCCceEEEecc
Confidence 33332 466788888776554 22 11111 222 2345566777777654 5777777776666665444
Q ss_pred CCcCCCCceEEEeCCEEEEEcC-CceEEEEEcCCcccccceEEeccCCCCCCCC-cceeeEEeeC
Q 045821 290 TLLTRPPCKLVAIGKTIFVIGK-GCSAVVIDVGNIGNIGGIMVSSSIPKLNDND-DIISCKCLAI 352 (352)
Q Consensus 290 ~~~~~~~~~~~~~~~~i~v~gG-~~~~~~~d~~~~~w~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 352 (352)
.. ...+.+.++..+++| .+.+.-|..... -.+-++.+-|...... +.-+||+|.+
T Consensus 247 a~-----v~~v~F~~d~vl~~G~g~~v~~~~l~Gv---l~a~ip~~s~~c~s~s~~~~p~k~~s~ 303 (325)
T KOG0649|consen 247 AR-----VHLVDFVDDCVLIGGEGNHVQSYTLNGV---LQANIPVESTACYSASWQTSPIKFISI 303 (325)
T ss_pred cc-----eeEeeeecceEEEeccccceeeeeeccE---EEEeccCCccceeeecccCCceEEEEe
Confidence 31 233444455555554 333322322210 1222333334444443 5556787764
No 132
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=92.64 E-value=7 Score=34.72 Aligned_cols=245 Identities=11% Similarity=0.097 Sum_probs=119.3
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC-C
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM-N 146 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t-~ 146 (352)
+..||++......+....|..|+ ..++.+.+...... ..-..+..+.-++...+..-+. ...+-+|-..+ +
T Consensus 51 ~~~LY~v~~~~~~ggvaay~iD~--~~G~Lt~ln~~~~~-g~~p~yvsvd~~g~~vf~AnY~-----~g~v~v~p~~~dG 122 (346)
T COG2706 51 QRHLYVVNEPGEEGGVAAYRIDP--DDGRLTFLNRQTLP-GSPPCYVSVDEDGRFVFVANYH-----SGSVSVYPLQADG 122 (346)
T ss_pred CCEEEEEEecCCcCcEEEEEEcC--CCCeEEEeeccccC-CCCCeEEEECCCCCEEEEEEcc-----CceEEEEEcccCC
Confidence 55799887665566788888999 88888877654421 1111332333333322222221 23444444433 1
Q ss_pred e-eEec------CCCCCcc-----eeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecC--CCCcccceeEEEE-C
Q 045821 147 T-WTDA------APMSTAR-----CYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTE--PNIFTEIEDSFVM-D 211 (352)
Q Consensus 147 ~-W~~~------~~~~~~r-----~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~--~~~~~~~~~~~~~-~ 211 (352)
. |..+ ..-|.+| ..++-..-++++.+.--... ..+..|+...+.-....+ .++...-.+.+++ +
T Consensus 123 ~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~-Dri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn 201 (346)
T COG2706 123 SLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGT-DRIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPN 201 (346)
T ss_pred ccccceeeeecCCCCCCccccCCccceeeeCCCCCEEEEeecCC-ceEEEEEcccCccccccccccCCCCCcceEEEcCC
Confidence 1 2211 1112233 21222222443333221111 678888888665443322 2222222245555 4
Q ss_pred C-EEEEEccCCCCCCceEEEEEeCCCCceeecCCC--CCCCCC----CcE--EEECCEEEEEecCCCCeEEEE--eCCCC
Q 045821 212 G-KIYIRCSASAATSHVCALVYEPSTDSWLHADAN--MASGWR----GPA--VVVDDALYVLDQSSGTKLMMW--QKESR 280 (352)
Q Consensus 212 g-~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~--~~~~~~----~~~--~~~~~~l~~~gg~~~~~v~~y--d~~~~ 280 (352)
+ ..|++.. -.....++.||....+.++++.. ++..+. .++ +..+|+.+.......+.|.+| |+.++
T Consensus 202 ~k~aY~v~E---L~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g 278 (346)
T COG2706 202 GKYAYLVNE---LNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGG 278 (346)
T ss_pred CcEEEEEec---cCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCC
Confidence 4 5688842 23344788888888888877542 222222 222 233555443433222366555 77777
Q ss_pred ceEEccCCCC-CcCCCCceEEEeCCEEEEEcCC-ceEEEEEcCCcc
Q 045821 281 EWSPVGRLST-LLTRPPCKLVAIGKTIFVIGKG-CSAVVIDVGNIG 324 (352)
Q Consensus 281 ~W~~~~~~~~-~~~~~~~~~~~~~~~i~v~gG~-~~~~~~d~~~~~ 324 (352)
.-..+..-+. ...-+.|.+..-++.|++.+-. +.+.+|..++.+
T Consensus 279 ~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q~sd~i~vf~~d~~T 324 (346)
T COG2706 279 KLELVGITPTEGQFPRDFNINPSGRFLIAANQKSDNITVFERDKET 324 (346)
T ss_pred EEEEEEEeccCCcCCccceeCCCCCEEEEEccCCCcEEEEEEcCCC
Confidence 6555543221 1122345555556666666643 345567766655
No 133
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=92.56 E-value=5.6 Score=33.38 Aligned_cols=189 Identities=13% Similarity=0.093 Sum_probs=103.5
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC--EEEEEeCCCCCCCCCCeEEEEECCC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK--NAYLLGGCGWSEDATSEVYCYDASM 145 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~--~iyv~GG~~~~~~~~~~~~~~d~~t 145 (352)
++.-++.||. ...++++|- ++++-.+. +. ........+..++ .+.+-|+. ..++..||..+
T Consensus 70 Dnskf~s~Gg----Dk~v~vwDV--~TGkv~Rr--~r---gH~aqVNtV~fNeesSVv~Sgsf------D~s~r~wDCRS 132 (307)
T KOG0316|consen 70 DNSKFASCGG----DKAVQVWDV--NTGKVDRR--FR---GHLAQVNTVRFNEESSVVASGSF------DSSVRLWDCRS 132 (307)
T ss_pred cccccccCCC----CceEEEEEc--ccCeeeee--cc---cccceeeEEEecCcceEEEeccc------cceeEEEEccc
Confidence 4444555543 234666688 77653222 11 1122333444543 35544543 45688899999
Q ss_pred CeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCC
Q 045821 146 NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATS 225 (352)
Q Consensus 146 ~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~ 225 (352)
++.+.++.+..++-.-.++.+.+..++.|..+ ..+-.||+..++- ..+-...+-.+....-+++-.++|..++
T Consensus 133 ~s~ePiQildea~D~V~Si~v~~heIvaGS~D--GtvRtydiR~G~l--~sDy~g~pit~vs~s~d~nc~La~~l~s--- 205 (307)
T KOG0316|consen 133 RSFEPIQILDEAKDGVSSIDVAEHEIVAGSVD--GTVRTYDIRKGTL--SSDYFGHPITSVSFSKDGNCSLASSLDS--- 205 (307)
T ss_pred CCCCccchhhhhcCceeEEEecccEEEeeccC--CcEEEEEeeccee--ehhhcCCcceeEEecCCCCEEEEeeccc---
Confidence 99888888888888777888888888877766 6788899987642 2221112222223334555555543332
Q ss_pred ceEEEEEeCCCCceeecCCCCCCCCCC-cEEEECCEEEEEecCCCCeEEEEeCCCCce
Q 045821 226 HVCALVYEPSTDSWLHADANMASGWRG-PAVVVDDALYVLDQSSGTKLMMWQKESREW 282 (352)
Q Consensus 226 ~~~i~~yd~~~~~W~~~~~~~~~~~~~-~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W 282 (352)
.+...|..+++--..-.......+. -.+..+..-.+++|.....++.||....+-
T Consensus 206 --tlrLlDk~tGklL~sYkGhkn~eykldc~l~qsdthV~sgSEDG~Vy~wdLvd~~~ 261 (307)
T KOG0316|consen 206 --TLRLLDKETGKLLKSYKGHKNMEYKLDCCLNQSDTHVFSGSEDGKVYFWDLVDETQ 261 (307)
T ss_pred --eeeecccchhHHHHHhcccccceeeeeeeecccceeEEeccCCceEEEEEecccee
Confidence 3444455554432211101111111 123344455667776555899999987654
No 134
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=92.48 E-value=8.8 Score=35.47 Aligned_cols=199 Identities=13% Similarity=0.188 Sum_probs=98.5
Q ss_pred CCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC----Cc----------
Q 045821 93 TRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS----TA---------- 157 (352)
Q Consensus 93 ~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~----~~---------- 157 (352)
...+|+.++..+.. +.....+..++ +.+++.|.. ..+++-+-.-.+|+.+...+ ..
T Consensus 164 gG~tW~~~~~~~~~--p~~~~~i~~~~~~~~~ivg~~-------G~v~~S~D~G~tW~~~~~~t~~~~l~~~~~s~~~g~ 234 (398)
T PLN00033 164 GGETWERIPLSPKL--PGEPVLIKATGPKSAEMVTDE-------GAIYVTSNAGRNWKAAVEETVSATLNRTVSSGISGA 234 (398)
T ss_pred CCCCceECccccCC--CCCceEEEEECCCceEEEecc-------ceEEEECCCCCCceEccccccccccccccccccccc
Confidence 57899888653211 11223334443 457777631 23444443445798762111 00
Q ss_pred ----ceeeeeEEE-CCEEEEEeCCCCCCceEE-EeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEE
Q 045821 158 ----RCYFPCGVL-NQKIYCIGGLGDTHSWDV-YDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALV 231 (352)
Q Consensus 158 ----r~~~~~~~~-~~~iyv~gG~~~~~~v~~-yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~ 231 (352)
-..+.+... ++.++++|-.. .+++ .|.....|+.+...............++.++++|... .+..
T Consensus 235 ~~y~Gsf~~v~~~~dG~~~~vg~~G---~~~~s~d~G~~~W~~~~~~~~~~l~~v~~~~dg~l~l~g~~G------~l~~ 305 (398)
T PLN00033 235 SYYTGTFSTVNRSPDGDYVAVSSRG---NFYLTWEPGQPYWQPHNRASARRIQNMGWRADGGLWLLTRGG------GLYV 305 (398)
T ss_pred ceeccceeeEEEcCCCCEEEEECCc---cEEEecCCCCcceEEecCCCccceeeeeEcCCCCEEEEeCCc------eEEE
Confidence 011122222 45566665432 2222 3333345998875444443333445688899885332 2333
Q ss_pred EeCCCC-----ceeecCCCCCCC-CCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEE-eCC
Q 045821 232 YEPSTD-----SWLHADANMASG-WRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVA-IGK 304 (352)
Q Consensus 232 yd~~~~-----~W~~~~~~~~~~-~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~-~~~ 304 (352)
-+.... +|.+++...... ........++.+++.|... .+.+-....++|+.....+. .....+.+.. .++
T Consensus 306 S~d~G~~~~~~~f~~~~~~~~~~~l~~v~~~~d~~~~a~G~~G--~v~~s~D~G~tW~~~~~~~~-~~~~ly~v~f~~~~ 382 (398)
T PLN00033 306 SKGTGLTEEDFDFEEADIKSRGFGILDVGYRSKKEAWAAGGSG--ILLRSTDGGKSWKRDKGADN-IAANLYSVKFFDDK 382 (398)
T ss_pred ecCCCCcccccceeecccCCCCcceEEEEEcCCCcEEEEECCC--cEEEeCCCCcceeEccccCC-CCcceeEEEEcCCC
Confidence 333334 455543321111 1111223466788777643 56777778899999763221 1223345553 457
Q ss_pred EEEEEcCC
Q 045821 305 TIFVIGKG 312 (352)
Q Consensus 305 ~i~v~gG~ 312 (352)
+.|+.|-.
T Consensus 383 ~g~~~G~~ 390 (398)
T PLN00033 383 KGFVLGND 390 (398)
T ss_pred ceEEEeCC
Confidence 89998844
No 135
>KOG2997 consensus F-box protein FBX9 [General function prediction only]
Probab=92.33 E-value=0.058 Score=46.68 Aligned_cols=43 Identities=23% Similarity=0.216 Sum_probs=37.6
Q ss_pred cCCCcHHHHHHhhccCcc-----ccchhhhhhhhhhcccccChhhHHH
Q 045821 18 ISGLPDDIALFCLARVPR-----RYHAVLKCVSSRWRGLLHSEEWCAY 60 (352)
Q Consensus 18 ~~~lp~dl~~~il~r~p~-----~~~~~l~~v~k~W~~l~~s~~~~~~ 60 (352)
+..||+|++.+|+.++=. +++.++.+||+.|.-....|++.+.
T Consensus 107 ~~~LPdEvLm~I~~~vv~~~~d~rsL~~~s~vCr~F~~~~R~~~lwR~ 154 (366)
T KOG2997|consen 107 ISVLPDEVLMRIFRWVVSSLLDLRSLEQLSLVCRGFYKCARDPELWRL 154 (366)
T ss_pred hhhCCHHHHHHHHHHHHhhhcchhhHHHhHhhHHHHHHHHcChHHHHH
Confidence 468999999999998775 7899999999999999888887654
No 136
>PF07734 FBA_1: F-box associated; InterPro: IPR006527 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=92.06 E-value=5.1 Score=31.82 Aligned_cols=80 Identities=15% Similarity=0.196 Sum_probs=49.3
Q ss_pred EEECCEEEEEccCCCCCCceEEEEEeCCCCce-eecCCCCCCCCCCcE----EEECCEEEEEecCC-C--CeEEEEe---
Q 045821 208 FVMDGKIYIRCSASAATSHVCALVYEPSTDSW-LHADANMASGWRGPA----VVVDDALYVLDQSS-G--TKLMMWQ--- 276 (352)
Q Consensus 208 ~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~~~~~~~~~~~~----~~~~~~l~~~gg~~-~--~~v~~yd--- 276 (352)
+.++|.+|-++..........|.+||..+.+. ..++.|......... ++.++.|.++-... . -+||+-+
T Consensus 2 V~vnG~~hW~~~~~~~~~~~~IlsFDl~~E~F~~~~~lP~~~~~~~~~~~L~~v~~~~L~~~~~~~~~~~~~IWvm~~~~ 81 (164)
T PF07734_consen 2 VFVNGALHWLAYDENNDEKDFILSFDLSTEKFGRSLPLPFCNDDDDDSVSLSVVRGDCLCVLYQCDETSKIEIWVMKKYG 81 (164)
T ss_pred EEECCEEEeeEEecCCCCceEEEEEeccccccCCEECCCCccCccCCEEEEEEecCCEEEEEEeccCCccEEEEEEeeec
Confidence 57899999986554433333799999999999 666554333311121 23477888774322 1 3566554
Q ss_pred CCCCceEEccC
Q 045821 277 KESREWSPVGR 287 (352)
Q Consensus 277 ~~~~~W~~~~~ 287 (352)
....+|+++-.
T Consensus 82 ~~~~SWtK~~~ 92 (164)
T PF07734_consen 82 YGKESWTKLFT 92 (164)
T ss_pred cCcceEEEEEE
Confidence 23678998853
No 137
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=91.97 E-value=8.3 Score=35.11 Aligned_cols=132 Identities=8% Similarity=0.080 Sum_probs=73.9
Q ss_pred CCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecC-----CCCc--ccceeEEEECCEEEEE
Q 045821 145 MNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTE-----PNIF--TEIEDSFVMDGKIYIR 217 (352)
Q Consensus 145 t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~-----~~~~--~~~~~~~~~~g~iyv~ 217 (352)
-++|+.+.. .....-.++.++|++|++.-. ..++.++.+-. =.++.+ +... ......+...|.|+++
T Consensus 189 ~~~Wt~l~~--~~~~~~DIi~~kGkfYAvD~~---G~l~~i~~~l~-i~~v~~~i~~~~~~g~~~~~~yLVEs~GdLLmV 262 (373)
T PLN03215 189 GNVLKALKQ--MGYHFSDIIVHKGQTYALDSI---GIVYWINSDLE-FSRFGTSLDENITDGCWTGDRRFVECCGELYIV 262 (373)
T ss_pred CCeeeEccC--CCceeeEEEEECCEEEEEcCC---CeEEEEecCCc-eeeecceecccccCCcccCceeEEEECCEEEEE
Confidence 389999974 223355678889999998432 34666664321 112221 1001 1223456677889988
Q ss_pred ccCCCCC--------------CceEEEEEeCCCCceeecCCCCCCCC-----CCcEE-------EECCEEEEEecCCCCe
Q 045821 218 CSASAAT--------------SHVCALVYEPSTDSWLHADANMASGW-----RGPAV-------VVDDALYVLDQSSGTK 271 (352)
Q Consensus 218 GG~~~~~--------------~~~~i~~yd~~~~~W~~~~~~~~~~~-----~~~~~-------~~~~~l~~~gg~~~~~ 271 (352)
....... ....++..|....+|.++........ ....+ ..+|.||++... .
T Consensus 263 ~R~~~~~~~~~~~~~~~~~~t~~f~VfklD~~~~~WveV~sLgd~aLFlG~~~s~sv~a~e~pG~k~NcIYFtdd~---~ 339 (373)
T PLN03215 263 ERLPKESTWKRKADGFEYSRTVGFKVYKFDDELAKWMEVKTLGDNAFVMATDTCFSVLAHEFYGCLPNSIYFTEDT---M 339 (373)
T ss_pred EEEccCcccccccccccccceeEEEEEEEcCCCCcEEEecccCCeEEEEECCccEEEecCCCCCccCCEEEEECCC---c
Confidence 6532110 11266677888999999876322110 00001 125677877554 5
Q ss_pred EEEEeCCCCceEEc
Q 045821 272 LMMWQKESREWSPV 285 (352)
Q Consensus 272 v~~yd~~~~~W~~~ 285 (352)
..+||++.++-+.+
T Consensus 340 ~~v~~~~dg~~~~~ 353 (373)
T PLN03215 340 PKVFKLDNGNGSSI 353 (373)
T ss_pred ceEEECCCCCccce
Confidence 67888887764444
No 138
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=91.70 E-value=9 Score=33.90 Aligned_cols=217 Identities=12% Similarity=0.221 Sum_probs=97.3
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMN 146 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~ 146 (352)
++..+++|.. .. ++.-.- ...+|++++.... .+-..+.+..+ ++.+.+++.. ..+++=.=.-.
T Consensus 71 ~~~g~ivG~~----g~-ll~T~D--gG~tW~~v~l~~~--lpgs~~~i~~l~~~~~~l~~~~-------G~iy~T~DgG~ 134 (302)
T PF14870_consen 71 GNEGWIVGEP----GL-LLHTTD--GGKTWERVPLSSK--LPGSPFGITALGDGSAELAGDR-------GAIYRTTDGGK 134 (302)
T ss_dssp TTEEEEEEET----TE-EEEESS--TTSS-EE----TT---SS-EEEEEEEETTEEEEEETT---------EEEESSTTS
T ss_pred CCceEEEcCC----ce-EEEecC--CCCCcEEeecCCC--CCCCeeEEEEcCCCcEEEEcCC-------CcEEEeCCCCC
Confidence 5567776532 12 222222 6889999863211 12223344444 5667777642 33444333345
Q ss_pred eeEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCC
Q 045821 147 TWTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATS 225 (352)
Q Consensus 147 ~W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~ 225 (352)
+|+.+..-... .-..+... ++++.+++... +-....|+....|+........+-......-++.|+++. ..+
T Consensus 135 tW~~~~~~~~g-s~~~~~r~~dG~~vavs~~G--~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~~~lw~~~-~Gg--- 207 (302)
T PF14870_consen 135 TWQAVVSETSG-SINDITRSSDGRYVAVSSRG--NFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPDGNLWMLA-RGG--- 207 (302)
T ss_dssp SEEEEE-S-----EEEEEE-TTS-EEEEETTS--SEEEEE-TT-SS-EEEE--SSS-EEEEEE-TTS-EEEEE-TTT---
T ss_pred CeeEcccCCcc-eeEeEEECCCCcEEEEECcc--cEEEEecCCCccceEEccCccceehhceecCCCCEEEEe-CCc---
Confidence 79887543222 22222333 56766666443 334466888888998875434333334444578888874 221
Q ss_pred ceEEEEEe--CCCCceeecCCCCCCCCCC-cEE--EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEE
Q 045821 226 HVCALVYE--PSTDSWLHADANMASGWRG-PAV--VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLV 300 (352)
Q Consensus 226 ~~~i~~yd--~~~~~W~~~~~~~~~~~~~-~~~--~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~ 300 (352)
.+..-+ ....+|++-..+......+ ..+ .-++.+++.||.. .+++=.-..++|++....... +...+.++
T Consensus 208 --~~~~s~~~~~~~~w~~~~~~~~~~~~~~ld~a~~~~~~~wa~gg~G--~l~~S~DgGktW~~~~~~~~~-~~n~~~i~ 282 (302)
T PF14870_consen 208 --QIQFSDDPDDGETWSEPIIPIKTNGYGILDLAYRPPNEIWAVGGSG--TLLVSTDGGKTWQKDRVGENV-PSNLYRIV 282 (302)
T ss_dssp --EEEEEE-TTEEEEE---B-TTSS--S-EEEEEESSSS-EEEEESTT---EEEESSTTSS-EE-GGGTTS-SS---EEE
T ss_pred --EEEEccCCCCccccccccCCcccCceeeEEEEecCCCCEEEEeCCc--cEEEeCCCCccceECccccCC-CCceEEEE
Confidence 233333 3456788733323222222 222 2368899988864 577777778999998754321 23334444
Q ss_pred E-eCCEEEEEcCC
Q 045821 301 A-IGKTIFVIGKG 312 (352)
Q Consensus 301 ~-~~~~i~v~gG~ 312 (352)
. ..++-+++|..
T Consensus 283 f~~~~~gf~lG~~ 295 (302)
T PF14870_consen 283 FVNPDKGFVLGQD 295 (302)
T ss_dssp EEETTEEEEE-ST
T ss_pred EcCCCceEEECCC
Confidence 4 45799999854
No 139
>PTZ00421 coronin; Provisional
Probab=91.69 E-value=13 Score=35.59 Aligned_cols=152 Identities=10% Similarity=0.036 Sum_probs=71.8
Q ss_pred CEEEEEeCCCCCCCCCCeEEEEECCCCeeE-ecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEE-EecC
Q 045821 120 KNAYLLGGCGWSEDATSEVYCYDASMNTWT-DAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWK-LHTE 197 (352)
Q Consensus 120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~-~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~-~~~~ 197 (352)
+.+++.||.+ ..+.+||..+++-. .+.....+ .......-++.+++.|+.+ ..+.+||+.+..-. .+..
T Consensus 138 ~~iLaSgs~D------gtVrIWDl~tg~~~~~l~~h~~~-V~sla~spdG~lLatgs~D--g~IrIwD~rsg~~v~tl~~ 208 (493)
T PTZ00421 138 MNVLASAGAD------MVVNVWDVERGKAVEVIKCHSDQ-ITSLEWNLDGSLLCTTSKD--KKLNIIDPRDGTIVSSVEA 208 (493)
T ss_pred CCEEEEEeCC------CEEEEEECCCCeEEEEEcCCCCc-eEEEEEECCCCEEEEecCC--CEEEEEECCCCcEEEEEec
Confidence 3466666543 45788898876532 22211111 1111112267777777765 67889999876521 1111
Q ss_pred CCCcccceeEEEEC-CEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcE-E--EECCEEEEEecCCCCe
Q 045821 198 PNIFTEIEDSFVMD-GKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPA-V--VVDDALYVLDQSSGTK 271 (352)
Q Consensus 198 ~~~~~~~~~~~~~~-g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~-~--~~~~~l~~~gg~~~~~ 271 (352)
.............+ +.|... |... .....+..||..+.. ...... . ...... . ..++.++++||.....
T Consensus 209 H~~~~~~~~~w~~~~~~ivt~-G~s~-s~Dr~VklWDlr~~~~p~~~~~~--d-~~~~~~~~~~d~d~~~L~lggkgDg~ 283 (493)
T PTZ00421 209 HASAKSQRCLWAKRKDLIITL-GCSK-SQQRQIMLWDTRKMASPYSTVDL--D-QSSALFIPFFDEDTNLLYIGSKGEGN 283 (493)
T ss_pred CCCCcceEEEEcCCCCeEEEE-ecCC-CCCCeEEEEeCCCCCCceeEecc--C-CCCceEEEEEcCCCCEEEEEEeCCCe
Confidence 11111111111223 344444 4321 112257777875432 111110 0 001111 1 2256666666643348
Q ss_pred EEEEeCCCCceEEc
Q 045821 272 LMMWQKESREWSPV 285 (352)
Q Consensus 272 v~~yd~~~~~W~~~ 285 (352)
|..||..++.....
T Consensus 284 Iriwdl~~~~~~~~ 297 (493)
T PTZ00421 284 IRCFELMNERLTFC 297 (493)
T ss_pred EEEEEeeCCceEEE
Confidence 99999988776544
No 140
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=91.40 E-value=10 Score=33.88 Aligned_cols=141 Identities=17% Similarity=0.191 Sum_probs=78.4
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccc--eeEEEECCEEEEEccCCCCCCceEEEEEeC--CCCceeec
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEI--EDSFVMDGKIYIRCSASAATSHVCALVYEP--STDSWLHA 242 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~--~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~--~~~~W~~~ 242 (352)
+..+.+.||.+ ...+.++..+..|--- ++..... +..+.++|.+..-|+..+ .+.++.. ...+|...
T Consensus 75 ~~~l~aTGGgD--D~AflW~~~~ge~~~e--ltgHKDSVt~~~FshdgtlLATGdmsG-----~v~v~~~stg~~~~~~~ 145 (399)
T KOG0296|consen 75 NNNLVATGGGD--DLAFLWDISTGEFAGE--LTGHKDSVTCCSFSHDGTLLATGDMSG-----KVLVFKVSTGGEQWKLD 145 (399)
T ss_pred CCceEEecCCC--ceEEEEEccCCcceeE--ecCCCCceEEEEEccCceEEEecCCCc-----cEEEEEcccCceEEEee
Confidence 56777778866 6778899999886432 2222222 244667899888887664 3555544 44567654
Q ss_pred CCCC---CCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCce-EEEE
Q 045821 243 DANM---ASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCS-AVVI 318 (352)
Q Consensus 243 ~~~~---~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~-~~~~ 318 (352)
...- --.||. ...+++.|..++ .+|.|...++.-.++- +.......++-+..+||-.+.|-.+. +.++
T Consensus 146 ~e~~dieWl~WHp-----~a~illAG~~DG-svWmw~ip~~~~~kv~--~Gh~~~ct~G~f~pdGKr~~tgy~dgti~~W 217 (399)
T KOG0296|consen 146 QEVEDIEWLKWHP-----RAHILLAGSTDG-SVWMWQIPSQALCKVM--SGHNSPCTCGEFIPDGKRILTGYDDGTIIVW 217 (399)
T ss_pred cccCceEEEEecc-----cccEEEeecCCC-cEEEEECCCcceeeEe--cCCCCCcccccccCCCceEEEEecCceEEEE
Confidence 2110 011111 345677777666 8999988875433332 11112333344444566555554443 3346
Q ss_pred EcCCcc
Q 045821 319 DVGNIG 324 (352)
Q Consensus 319 d~~~~~ 324 (352)
++++-+
T Consensus 218 n~ktg~ 223 (399)
T KOG0296|consen 218 NPKTGQ 223 (399)
T ss_pred ecCCCc
Confidence 666543
No 141
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=91.29 E-value=12 Score=34.69 Aligned_cols=164 Identities=13% Similarity=0.098 Sum_probs=85.6
Q ss_pred hhhHHHHhhCCCC-CCEEEEEEecCCCCceeEEEecCCCCCCCeeEecc-CCCc-ccCCcceE--------EEEECC-EE
Q 045821 55 EEWCAYRRKHNLD-ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHE-LPAR-TLRRKGMG--------FEVLGK-NA 122 (352)
Q Consensus 55 ~~~~~~~~~~~~~-~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~-~~~~-~~~~~~~~--------~~~~~~-~i 122 (352)
..|...-.++... +.+|...-|. .+|.||| .+.+.+++.- +|.. ......+. .+..+| .|
T Consensus 264 TnFtdYY~R~~nsDGkrIvFq~~G------dIylydP--~td~lekldI~lpl~rk~k~~k~~~pskyledfa~~~Gd~i 335 (668)
T COG4946 264 TNFTDYYPRNANSDGKRIVFQNAG------DIYLYDP--ETDSLEKLDIGLPLDRKKKQPKFVNPSKYLEDFAVVNGDYI 335 (668)
T ss_pred CCchhccccccCCCCcEEEEecCC------cEEEeCC--CcCcceeeecCCccccccccccccCHHHhhhhhccCCCcEE
Confidence 3444444455544 4444443222 6889999 8888877752 2311 00000000 122333 33
Q ss_pred EEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc
Q 045821 123 YLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT 202 (352)
Q Consensus 123 yv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~ 202 (352)
-++. ...++++++..+--..++.-..-|+.+ ...++.-.++|-.+. ..+.+||..+..-+++.. +...
T Consensus 336 a~VS--------RGkaFi~~~~~~~~iqv~~~~~VrY~r--~~~~~e~~vigt~dg-D~l~iyd~~~~e~kr~e~-~lg~ 403 (668)
T COG4946 336 ALVS--------RGKAFIMRPWDGYSIQVGKKGGVRYRR--IQVDPEGDVIGTNDG-DKLGIYDKDGGEVKRIEK-DLGN 403 (668)
T ss_pred EEEe--------cCcEEEECCCCCeeEEcCCCCceEEEE--EccCCcceEEeccCC-ceEEEEecCCceEEEeeC-Cccc
Confidence 3321 234566666554333343322223333 223445667776554 688999999998777652 3333
Q ss_pred cceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecC
Q 045821 203 EIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHAD 243 (352)
Q Consensus 203 ~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~ 243 (352)
-....+.-+|+..+++ +....+|.+|..++.=+.+.
T Consensus 404 I~av~vs~dGK~~vva-----Ndr~el~vididngnv~~id 439 (668)
T COG4946 404 IEAVKVSPDGKKVVVA-----NDRFELWVIDIDNGNVRLID 439 (668)
T ss_pred eEEEEEcCCCcEEEEE-----cCceEEEEEEecCCCeeEec
Confidence 3333344466755553 23346888898888777664
No 142
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=91.21 E-value=8.4 Score=32.63 Aligned_cols=165 Identities=12% Similarity=0.107 Sum_probs=78.0
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeE--EEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDS--FVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADA 244 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~--~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~ 244 (352)
+++..+.+|. ..|-.||..++.=..+...-.......+ +-.+|+-...||.++ .+-++|...-.-++.
T Consensus 51 dk~~LAaa~~---qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseDg-----t~kIWdlR~~~~qR~-- 120 (311)
T KOG0315|consen 51 DKKDLAAAGN---QHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSEDG-----TVKIWDLRSLSCQRN-- 120 (311)
T ss_pred CcchhhhccC---CeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCCc-----eEEEEeccCcccchh--
Confidence 4444555554 5688999998742111111111111222 234788777765543 344555544222111
Q ss_pred CCCCCC-CCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEE-EeCCEEEEE-cCCceEEEEEcC
Q 045821 245 NMASGW-RGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLV-AIGKTIFVI-GKGCSAVVIDVG 321 (352)
Q Consensus 245 ~~~~~~-~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~~~i~v~-gG~~~~~~~d~~ 321 (352)
..... ....+..-++--++.|..+..|+++|..++..+...-.... ....++. ..+++.++. ...-++++.+.-
T Consensus 121 -~~~~spVn~vvlhpnQteLis~dqsg~irvWDl~~~~c~~~liPe~~--~~i~sl~v~~dgsml~a~nnkG~cyvW~l~ 197 (311)
T KOG0315|consen 121 -YQHNSPVNTVVLHPNQTELISGDQSGNIRVWDLGENSCTHELIPEDD--TSIQSLTVMPDGSMLAAANNKGNCYVWRLL 197 (311)
T ss_pred -ccCCCCcceEEecCCcceEEeecCCCcEEEEEccCCccccccCCCCC--cceeeEEEcCCCcEEEEecCCccEEEEEcc
Confidence 11110 01112222333333333333899999999988765422221 2222233 334544433 323357777776
Q ss_pred CcccccceEEeccCCCCCCCC-cceee
Q 045821 322 NIGNIGGIMVSSSIPKLNDND-DIISC 347 (352)
Q Consensus 322 ~~~w~~~~~~~~~~~~~~~~~-~~~~~ 347 (352)
. ....+...-+.++.++. +|-.|
T Consensus 198 ~---~~~~s~l~P~~k~~ah~~~il~C 221 (311)
T KOG0315|consen 198 N---HQTASELEPVHKFQAHNGHILRC 221 (311)
T ss_pred C---CCccccceEhhheecccceEEEE
Confidence 6 23344455566677666 44443
No 143
>PF07734 FBA_1: F-box associated; InterPro: IPR006527 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=91.17 E-value=6.5 Score=31.24 Aligned_cols=78 Identities=13% Similarity=0.108 Sum_probs=48.8
Q ss_pred EEECCEEEEEeCCCCC-C--ceEEEeCCCCcE-EEecCCCCcccce---eE-EEECCEEEEEccCCCCCCceEEEEEe--
Q 045821 164 GVLNQKIYCIGGLGDT-H--SWDVYDPRTNNW-KLHTEPNIFTEIE---DS-FVMDGKIYIRCSASAATSHVCALVYE-- 233 (352)
Q Consensus 164 ~~~~~~iyv~gG~~~~-~--~v~~yd~~t~~W-~~~~~~~~~~~~~---~~-~~~~g~iyv~GG~~~~~~~~~i~~yd-- 233 (352)
+.++|.+|.++..... . .+..||..+++. +.++.|....... .. ++.++.|-++-- ........||+-+
T Consensus 2 V~vnG~~hW~~~~~~~~~~~~IlsFDl~~E~F~~~~~lP~~~~~~~~~~~L~~v~~~~L~~~~~-~~~~~~~~IWvm~~~ 80 (164)
T PF07734_consen 2 VFVNGALHWLAYDENNDEKDFILSFDLSTEKFGRSLPLPFCNDDDDDSVSLSVVRGDCLCVLYQ-CDETSKIEIWVMKKY 80 (164)
T ss_pred EEECCEEEeeEEecCCCCceEEEEEeccccccCCEECCCCccCccCCEEEEEEecCCEEEEEEe-ccCCccEEEEEEeee
Confidence 5679999998876554 2 689999999999 6665433333111 22 333778888732 2222234677655
Q ss_pred -CCCCceeec
Q 045821 234 -PSTDSWLHA 242 (352)
Q Consensus 234 -~~~~~W~~~ 242 (352)
....+|+++
T Consensus 81 ~~~~~SWtK~ 90 (164)
T PF07734_consen 81 GYGKESWTKL 90 (164)
T ss_pred ccCcceEEEE
Confidence 247899876
No 144
>PRK02889 tolB translocation protein TolB; Provisional
Probab=91.14 E-value=13 Score=34.73 Aligned_cols=183 Identities=11% Similarity=0.019 Sum_probs=90.5
Q ss_pred CceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcce
Q 045821 81 ERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARC 159 (352)
Q Consensus 81 ~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~ 159 (352)
+...++.+|. .++.-..+...+. .........++ .|++....+ ...++|.+|..++..+.+..-.....
T Consensus 218 ~~~~I~~~dl--~~g~~~~l~~~~g----~~~~~~~SPDG~~la~~~~~~----g~~~Iy~~d~~~~~~~~lt~~~~~~~ 287 (427)
T PRK02889 218 KKPVVYVHDL--ATGRRRVVANFKG----SNSAPAWSPDGRTLAVALSRD----GNSQIYTVNADGSGLRRLTQSSGIDT 287 (427)
T ss_pred CCcEEEEEEC--CCCCEEEeecCCC----CccceEECCCCCEEEEEEccC----CCceEEEEECCCCCcEECCCCCCCCc
Confidence 3456888888 7776555554431 11112222244 455443322 24688999988777666643221111
Q ss_pred eeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc
Q 045821 160 YFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS 238 (352)
Q Consensus 160 ~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~ 238 (352)
.. ...-++ .|+..+.......++.+|..++..+.+... ..........-+|+..++..... ....++.+|..+++
T Consensus 288 ~~-~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~-g~~~~~~~~SpDG~~Ia~~s~~~--g~~~I~v~d~~~g~ 363 (427)
T PRK02889 288 EP-FFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRVTFT-GSYNTSPRISPDGKLLAYISRVG--GAFKLYVQDLATGQ 363 (427)
T ss_pred Ce-EEcCCCCEEEEEecCCCCcEEEEEECCCCceEEEecC-CCCcCceEECCCCCEEEEEEccC--CcEEEEEEECCCCC
Confidence 11 111244 455443322226788888887777776521 11111122333554333323221 22368889998888
Q ss_pred eeecCCCCCCCCCCcEEEECCEEEEEec-CCC-CeEEEEeCCC
Q 045821 239 WLHADANMASGWRGPAVVVDDALYVLDQ-SSG-TKLMMWQKES 279 (352)
Q Consensus 239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg-~~~-~~v~~yd~~~ 279 (352)
.+.+.... ........-+++.+++.. ..+ ..++.++...
T Consensus 364 ~~~lt~~~--~~~~p~~spdg~~l~~~~~~~g~~~l~~~~~~g 404 (427)
T PRK02889 364 VTALTDTT--RDESPSFAPNGRYILYATQQGGRSVLAAVSSDG 404 (427)
T ss_pred eEEccCCC--CccCceECCCCCEEEEEEecCCCEEEEEEECCC
Confidence 77764321 112233444555444433 222 3567777754
No 145
>PTZ00420 coronin; Provisional
Probab=90.90 E-value=16 Score=35.45 Aligned_cols=141 Identities=12% Similarity=0.144 Sum_probs=68.0
Q ss_pred EEEEEeCCCCCCceEEEeCCCCcEE-EecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCcee-ecCCCC
Q 045821 169 KIYCIGGLGDTHSWDVYDPRTNNWK-LHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWL-HADANM 246 (352)
Q Consensus 169 ~iyv~gG~~~~~~v~~yd~~t~~W~-~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~~~ 246 (352)
.+++.+|.+ ..+.+||..+.+=. .+. .+ ......+...+|.+++.++.++ .+..||+.+.+=. .+.. .
T Consensus 139 ~iLaSgS~D--gtIrIWDl~tg~~~~~i~-~~-~~V~SlswspdG~lLat~s~D~-----~IrIwD~Rsg~~i~tl~g-H 208 (568)
T PTZ00420 139 YIMCSSGFD--SFVNIWDIENEKRAFQIN-MP-KKLSSLKWNIKGNLLSGTCVGK-----HMHIIDPRKQEIASSFHI-H 208 (568)
T ss_pred eEEEEEeCC--CeEEEEECCCCcEEEEEe-cC-CcEEEEEECCCCCEEEEEecCC-----EEEEEECCCCcEEEEEec-c
Confidence 455566655 67889999876511 111 11 1111122234778777755432 5888898765422 1211 1
Q ss_pred CCCCCCcEEE-----ECCEEEEEecCCC---CeEEEEeCCC-CceEEccCCCCCcCCCCceEE----EeCCEEEEEcCCc
Q 045821 247 ASGWRGPAVV-----VDDALYVLDQSSG---TKLMMWQKES-REWSPVGRLSTLLTRPPCKLV----AIGKTIFVIGKGC 313 (352)
Q Consensus 247 ~~~~~~~~~~-----~~~~l~~~gg~~~---~~v~~yd~~~-~~W~~~~~~~~~~~~~~~~~~----~~~~~i~v~gG~~ 313 (352)
........+. .++..++.+|.++ ..+.+||..+ ..-...-.+.. ....+. ...+.+|+.|..+
T Consensus 209 ~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~~~pl~~~~ld~----~~~~L~p~~D~~tg~l~lsGkGD 284 (568)
T PTZ00420 209 DGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLKNTTSALVTMSIDN----ASAPLIPHYDESTGLIYLIGKGD 284 (568)
T ss_pred cCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECCCCCCceEEEEecC----CccceEEeeeCCCCCEEEEEECC
Confidence 1100011111 2445555556543 3699999874 22111111111 111111 2247788888444
Q ss_pred -eEEEEEcCCc
Q 045821 314 -SAVVIDVGNI 323 (352)
Q Consensus 314 -~~~~~d~~~~ 323 (352)
.+.+|++...
T Consensus 285 ~tIr~~e~~~~ 295 (568)
T PTZ00420 285 GNCRYYQHSLG 295 (568)
T ss_pred CeEEEEEccCC
Confidence 4668887654
No 146
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=90.84 E-value=14 Score=34.36 Aligned_cols=161 Identities=11% Similarity=0.082 Sum_probs=86.2
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCC-----CCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEE
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTR-----RSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYD 142 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~-----~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d 142 (352)
+.+|++.-..... ...++.+|. .. ..|..+.+-. .-..+.+...++.+|+....+ .....+..++
T Consensus 238 ~~~l~i~~~~~~~-~s~v~~~d~--~~~~~~~~~~~~l~~~~----~~~~~~v~~~~~~~yi~Tn~~---a~~~~l~~~~ 307 (414)
T PF02897_consen 238 GRYLFISSSSGTS-ESEVYLLDL--DDGGSPDAKPKLLSPRE----DGVEYYVDHHGDRLYILTNDD---APNGRLVAVD 307 (414)
T ss_dssp SSEEEEEEESSSS-EEEEEEEEC--CCTTTSS-SEEEEEESS----SS-EEEEEEETTEEEEEE-TT----TT-EEEEEE
T ss_pred ccEEEEEEEcccc-CCeEEEEec--cccCCCcCCcEEEeCCC----CceEEEEEccCCEEEEeeCCC---CCCcEEEEec
Confidence 4555554333222 466777887 54 3777776532 122334455588999887633 3356788888
Q ss_pred CCCCe---eE-ecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCC-CCcEEEecCCCCcccceeEEE--E-CCEE
Q 045821 143 ASMNT---WT-DAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPR-TNNWKLHTEPNIFTEIEDSFV--M-DGKI 214 (352)
Q Consensus 143 ~~t~~---W~-~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~-t~~W~~~~~~~~~~~~~~~~~--~-~g~i 214 (352)
..... |. .+.+-.....--.+...++.+++..-.+....+.+||.. +..-..++ ++.... ..... . .+.+
T Consensus 308 l~~~~~~~~~~~l~~~~~~~~l~~~~~~~~~Lvl~~~~~~~~~l~v~~~~~~~~~~~~~-~p~~g~-v~~~~~~~~~~~~ 385 (414)
T PF02897_consen 308 LADPSPAEWWTVLIPEDEDVSLEDVSLFKDYLVLSYRENGSSRLRVYDLDDGKESREIP-LPEAGS-VSGVSGDFDSDEL 385 (414)
T ss_dssp TTSTSGGGEEEEEE--SSSEEEEEEEEETTEEEEEEEETTEEEEEEEETT-TEEEEEEE-SSSSSE-EEEEES-TT-SEE
T ss_pred ccccccccceeEEcCCCCceeEEEEEEECCEEEEEEEECCccEEEEEECCCCcEEeeec-CCcceE-EeccCCCCCCCEE
Confidence 87665 65 444322223444555568887776544333788999998 44434444 222221 11111 1 3344
Q ss_pred EEEccCCCCCCceEEEEEeCCCCceeec
Q 045821 215 YIRCSASAATSHVCALVYEPSTDSWLHA 242 (352)
Q Consensus 215 yv~GG~~~~~~~~~i~~yd~~~~~W~~~ 242 (352)
++.-. +-.....++.||+.+++.+.+
T Consensus 386 ~~~~s--s~~~P~~~y~~d~~t~~~~~~ 411 (414)
T PF02897_consen 386 RFSYS--SFTTPPTVYRYDLATGELTLL 411 (414)
T ss_dssp EEEEE--ETTEEEEEEEEETTTTCEEEE
T ss_pred EEEEe--CCCCCCEEEEEECCCCCEEEE
Confidence 44311 112233899999999987654
No 147
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=90.78 E-value=13 Score=33.99 Aligned_cols=156 Identities=16% Similarity=0.134 Sum_probs=87.4
Q ss_pred eEEECCEEEEEeCCCCCCceEEEeCCCCc--EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCC--CCc
Q 045821 163 CGVLNQKIYCIGGLGDTHSWDVYDPRTNN--WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPS--TDS 238 (352)
Q Consensus 163 ~~~~~~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~--~~~ 238 (352)
.+..++++|+.... ..+..+|+.+.. |+.................+|+||+- ...+ .+++||.. +..
T Consensus 64 ~~~~dg~v~~~~~~---G~i~A~d~~~g~~~W~~~~~~~~~~~~~~~~~~~G~i~~g-~~~g-----~~y~ld~~~G~~~ 134 (370)
T COG1520 64 PADGDGTVYVGTRD---GNIFALNPDTGLVKWSYPLLGAVAQLSGPILGSDGKIYVG-SWDG-----KLYALDASTGTLV 134 (370)
T ss_pred cEeeCCeEEEecCC---CcEEEEeCCCCcEEecccCcCcceeccCceEEeCCeEEEe-cccc-----eEEEEECCCCcEE
Confidence 36678999997221 468999999886 97654220112222334448998885 3322 68899984 456
Q ss_pred eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEEcC--Cce
Q 045821 239 WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGK--GCS 314 (352)
Q Consensus 239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG--~~~ 314 (352)
|+.-.... ..+....+..++.+|+.. ..+ .++.+|.++.+ |+.-...+ ...+.......-++.+|+-.. ...
T Consensus 135 W~~~~~~~-~~~~~~~v~~~~~v~~~s-~~g-~~~al~~~tG~~~W~~~~~~~-~~~~~~~~~~~~~~~vy~~~~~~~~~ 210 (370)
T COG1520 135 WSRNVGGS-PYYASPPVVGDGTVYVGT-DDG-HLYALNADTGTLKWTYETPAP-LSLSIYGSPAIASGTVYVGSDGYDGI 210 (370)
T ss_pred EEEecCCC-eEEecCcEEcCcEEEEec-CCC-eEEEEEccCCcEEEEEecCCc-cccccccCceeecceEEEecCCCcce
Confidence 88654331 233344455666766654 222 78888887544 76332221 111222223355677777644 234
Q ss_pred EEEEEcCCcc--cccceEE
Q 045821 315 AVVIDVGNIG--NIGGIMV 331 (352)
Q Consensus 315 ~~~~d~~~~~--w~~~~~~ 331 (352)
.+.+|++..+ |+.....
T Consensus 211 ~~a~~~~~G~~~w~~~~~~ 229 (370)
T COG1520 211 LYALNAEDGTLKWSQKVSQ 229 (370)
T ss_pred EEEEEccCCcEeeeeeeec
Confidence 5668876544 6643333
No 148
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=90.77 E-value=12 Score=33.50 Aligned_cols=147 Identities=12% Similarity=0.170 Sum_probs=75.8
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEec
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHT 196 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~ 196 (352)
++.+.+.||.+. ..+.++..++.|-..-.-...-.......+++.+.+.|+.+ ..+.++...++ +|....
T Consensus 75 ~~~l~aTGGgDD------~AflW~~~~ge~~~eltgHKDSVt~~~FshdgtlLATGdms--G~v~v~~~stg~~~~~~~~ 146 (399)
T KOG0296|consen 75 NNNLVATGGGDD------LAFLWDISTGEFAGELTGHKDSVTCCSFSHDGTLLATGDMS--GKVLVFKVSTGGEQWKLDQ 146 (399)
T ss_pred CCceEEecCCCc------eEEEEEccCCcceeEecCCCCceEEEEEccCceEEEecCCC--ccEEEEEcccCceEEEeec
Confidence 566777787543 46788888877432211111112223334577777777766 56666666655 465431
Q ss_pred CCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEe
Q 045821 197 EPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQ 276 (352)
Q Consensus 197 ~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd 276 (352)
....-.. ..-.-.+.|+++|..++ .+|.|...+..=.++-.-...+-...-+.-+||..+.|-.++ .+.+||
T Consensus 147 e~~dieW--l~WHp~a~illAG~~DG-----svWmw~ip~~~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~dg-ti~~Wn 218 (399)
T KOG0296|consen 147 EVEDIEW--LKWHPRAHILLAGSTDG-----SVWMWQIPSQALCKVMSGHNSPCTCGEFIPDGKRILTGYDDG-TIIVWN 218 (399)
T ss_pred ccCceEE--EEecccccEEEeecCCC-----cEEEEECCCcceeeEecCCCCCcccccccCCCceEEEEecCc-eEEEEe
Confidence 1111000 00001345677754332 578887766433332111111111111344677777765555 899999
Q ss_pred CCCCc
Q 045821 277 KESRE 281 (352)
Q Consensus 277 ~~~~~ 281 (352)
+++.+
T Consensus 219 ~ktg~ 223 (399)
T KOG0296|consen 219 PKTGQ 223 (399)
T ss_pred cCCCc
Confidence 99875
No 149
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=90.61 E-value=7.6 Score=33.78 Aligned_cols=132 Identities=11% Similarity=0.048 Sum_probs=69.1
Q ss_pred CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC
Q 045821 180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD 259 (352)
Q Consensus 180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~ 259 (352)
.++-.||..++.=+.. ......--.+++.+..-.+.||.++ .+-.||..+..=..+...-..-+. ..-....
T Consensus 35 gslrlYdv~~~~l~~~--~~~~~plL~c~F~d~~~~~~G~~dg-----~vr~~Dln~~~~~~igth~~~i~c-i~~~~~~ 106 (323)
T KOG1036|consen 35 GSLRLYDVPANSLKLK--FKHGAPLLDCAFADESTIVTGGLDG-----QVRRYDLNTGNEDQIGTHDEGIRC-IEYSYEV 106 (323)
T ss_pred CcEEEEeccchhhhhh--eecCCceeeeeccCCceEEEeccCc-----eEEEEEecCCcceeeccCCCceEE-EEeeccC
Confidence 5677888887732111 1111111123344444344455553 588999988877666542111100 0011234
Q ss_pred EEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcc
Q 045821 260 ALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIG 324 (352)
Q Consensus 260 ~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~ 324 (352)
..++.||.++ .+-.+|+...+ ..........-+++.+.++.|+|-+-..++.+||..+..
T Consensus 107 ~~vIsgsWD~-~ik~wD~R~~~----~~~~~d~~kkVy~~~v~g~~LvVg~~~r~v~iyDLRn~~ 166 (323)
T KOG1036|consen 107 GCVISGSWDK-TIKFWDPRNKV----VVGTFDQGKKVYCMDVSGNRLVVGTSDRKVLIYDLRNLD 166 (323)
T ss_pred CeEEEcccCc-cEEEEeccccc----cccccccCceEEEEeccCCEEEEeecCceEEEEEccccc
Confidence 4566777776 89999998622 111111133456666666666654444467789976543
No 150
>COG4447 Uncharacterized protein related to plant photosystem II stability/assembly factor [General function prediction only]
Probab=90.54 E-value=6.9 Score=33.76 Aligned_cols=244 Identities=14% Similarity=0.146 Sum_probs=123.9
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
+.+.+++|+. . .+..-|- ...+|.+..... .....++....+.+-++.| ..+.++.=+-.-.+
T Consensus 54 g~~gwlVg~r---g--tiletdd--~g~tw~qal~~~---gr~~f~sv~f~~~egw~vG-------e~sqll~T~DgGqs 116 (339)
T COG4447 54 GSHGWLVGGR---G--TILETDD--GGITWAQALDFL---GRHAFHSVSFLGMEGWIVG-------EPSQLLHTTDGGQS 116 (339)
T ss_pred CcceEEEcCc---c--eEEEecC--Ccccchhhhchh---hhhheeeeeeecccccccC-------CcceEEEecCCCcc
Confidence 6778888764 1 2232344 678898775443 1222333333343333333 12344444444557
Q ss_pred eEecCCC-CCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCc--ccceeEEEECCEEEEEccCCCC
Q 045821 148 WTDAAPM-STARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIF--TEIEDSFVMDGKIYIRCSASAA 223 (352)
Q Consensus 148 W~~~~~~-~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~--~~~~~~~~~~g~iyv~GG~~~~ 223 (352)
|.+++.- ..+-...+...++. +-|++|-+. .++.=+-.-+.|+.+.+-... .-...+.+.+|...++|-..
T Consensus 117 WARi~~~e~~eg~~~sI~f~d~q~g~m~gd~G---ail~T~DgGk~Wk~l~e~~v~~~~~n~ia~s~dng~vaVg~rG-- 191 (339)
T COG4447 117 WARIPLSEKLEGFPDSITFLDDQRGEMLGDQG---AILKTTDGGKNWKALVEKAVGLAVPNEIARSADNGYVAVGARG-- 191 (339)
T ss_pred hhhchhhcCCCCCcceeEEecchhhhhhcccc---eEEEecCCcccHhHhcccccchhhhhhhhhhccCCeEEEecCc--
Confidence 9888743 23333344555543 445555432 233333334579887642222 22333455667766664222
Q ss_pred CCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC--EEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEE-
Q 045821 224 TSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD--ALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLV- 300 (352)
Q Consensus 224 ~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~--~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~- 300 (352)
+..-..+.....|..-.....++...+....++ -++..|+.. ..+..+...+.|+.++.... ..+...++.
T Consensus 192 ---s~f~T~~aGqt~~~~~g~~s~~~letmg~adag~~g~la~g~qg--~~f~~~~~gD~wsd~~~~~~-~g~~~~Gl~d 265 (339)
T COG4447 192 ---SFFSTWGAGQTVWLPHGRNSSRRLETMGLADAGSKGLLARGGQG--DQFSWVCGGDEWSDQGEPVN-LGRRSWGLLD 265 (339)
T ss_pred ---ceEecCCCCccEEeccCCCccchhcccccccCCccceEEEcccc--ceeecCCCcccccccccchh-cccCCCcccc
Confidence 111123333444444333222333333344444 355556543 57778888999998863322 223333222
Q ss_pred ---EeCCEEEEEcCCceEEEEEcCCcccccceEEeccCCCCC
Q 045821 301 ---AIGKTIFVIGKGCSAVVIDVGNIGNIGGIMVSSSIPKLN 339 (352)
Q Consensus 301 ---~~~~~i~v~gG~~~~~~~d~~~~~w~~~~~~~~~~~~~~ 339 (352)
...+.+||.|..-+++.---.-.+|.++.-+..++-.++
T Consensus 266 ~a~~a~~~v~v~G~gGnvl~StdgG~t~skd~g~~er~s~l~ 307 (339)
T COG4447 266 FAPRAPPEVWVSGIGGNVLASTDGGTTWSKDGGVEERVSNLY 307 (339)
T ss_pred ccccCCCCeEEeccCccEEEecCCCeeEeccCChhhhhhhhh
Confidence 236789998876667765566667887665554444333
No 151
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=89.97 E-value=24 Score=35.77 Aligned_cols=35 Identities=14% Similarity=0.171 Sum_probs=25.3
Q ss_pred eeeeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEecC
Q 045821 160 YFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHTE 197 (352)
Q Consensus 160 ~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~ 197 (352)
....+++++.+|+.... ..++.+|..|+ .|+.-+.
T Consensus 187 e~TPlvvgg~lYv~t~~---~~V~ALDa~TGk~lW~~d~~ 223 (764)
T TIGR03074 187 QATPLKVGDTLYLCTPH---NKVIALDAATGKEKWKFDPK 223 (764)
T ss_pred ccCCEEECCEEEEECCC---CeEEEEECCCCcEEEEEcCC
Confidence 35567789999998654 56888888876 4876543
No 152
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=89.76 E-value=16 Score=33.41 Aligned_cols=153 Identities=15% Similarity=0.170 Sum_probs=85.3
Q ss_pred EEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC--c
Q 045821 116 EVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--N 191 (352)
Q Consensus 116 ~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~ 191 (352)
+..++.+|+... ...++.+|+.+.+ |+................-+|+||+-... ..+++||..+. .
T Consensus 65 ~~~dg~v~~~~~-------~G~i~A~d~~~g~~~W~~~~~~~~~~~~~~~~~~~G~i~~g~~~---g~~y~ld~~~G~~~ 134 (370)
T COG1520 65 ADGDGTVYVGTR-------DGNIFALNPDTGLVKWSYPLLGAVAQLSGPILGSDGKIYVGSWD---GKLYALDASTGTLV 134 (370)
T ss_pred EeeCCeEEEecC-------CCcEEEEeCCCCcEEecccCcCcceeccCceEEeCCeEEEeccc---ceEEEEECCCCcEE
Confidence 666888998611 2278999999887 86543320111111112227887775542 25889999644 5
Q ss_pred EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCC--CceeecCCC-CCCCCCCcEEEECCEEEEEecCC
Q 045821 192 WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPST--DSWLHADAN-MASGWRGPAVVVDDALYVLDQSS 268 (352)
Q Consensus 192 W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~--~~W~~~~~~-~~~~~~~~~~~~~~~l~~~gg~~ 268 (352)
|+.-..- ........++.++.+|+.. ....++++|..+ ..|+.-... ......+.....++.+|+.....
T Consensus 135 W~~~~~~-~~~~~~~~v~~~~~v~~~s------~~g~~~al~~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~ 207 (370)
T COG1520 135 WSRNVGG-SPYYASPPVVGDGTVYVGT------DDGHLYALNADTGTLKWTYETPAPLSLSIYGSPAIASGTVYVGSDGY 207 (370)
T ss_pred EEEecCC-CeEEecCcEEcCcEEEEec------CCCeEEEEEccCCcEEEEEecCCccccccccCceeecceEEEecCCC
Confidence 8765433 1222234556677888762 112577777764 458744322 22233333335566666653310
Q ss_pred CCeEEEEeCCCCc--eEEc
Q 045821 269 GTKLMMWQKESRE--WSPV 285 (352)
Q Consensus 269 ~~~v~~yd~~~~~--W~~~ 285 (352)
...++.+|+++++ |++.
T Consensus 208 ~~~~~a~~~~~G~~~w~~~ 226 (370)
T COG1520 208 DGILYALNAEDGTLKWSQK 226 (370)
T ss_pred cceEEEEEccCCcEeeeee
Confidence 2269999997654 7753
No 153
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=89.33 E-value=12 Score=31.66 Aligned_cols=177 Identities=10% Similarity=0.109 Sum_probs=89.7
Q ss_pred CeEEEEECCCCeeEecCCCCCcceeeeeEE--ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE--C
Q 045821 136 SEVYCYDASMNTWTDAAPMSTARCYFPCGV--LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM--D 211 (352)
Q Consensus 136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~--~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~--~ 211 (352)
..+-.||..++.=..+...-..+..-.++. .+|+....||.+ ..+-++|...-.-.+.-..+. . -..++.. +
T Consensus 61 qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseD--gt~kIWdlR~~~~qR~~~~~s-p-Vn~vvlhpnQ 136 (311)
T KOG0315|consen 61 QHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSED--GTVKIWDLRSLSCQRNYQHNS-P-VNTVVLHPNQ 136 (311)
T ss_pred CeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCC--ceEEEEeccCcccchhccCCC-C-cceEEecCCc
Confidence 567889998874222222222223333333 378888888866 566777777633332211111 1 1123333 3
Q ss_pred CEEEEEccCCCCCCceEEEEEeCCCCceeecCCCC-CCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCce-EEccCCC
Q 045821 212 GKIYIRCSASAATSHVCALVYEPSTDSWLHADANM-ASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREW-SPVGRLS 289 (352)
Q Consensus 212 g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~-~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W-~~~~~~~ 289 (352)
+.|++ |.+ ...|+++|+.++.......+- .......++.-+|+.++.+...| ..++++.-+.+- +++..+.
T Consensus 137 teLis--~dq----sg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnkG-~cyvW~l~~~~~~s~l~P~~ 209 (311)
T KOG0315|consen 137 TELIS--GDQ----SGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNKG-NCYVWRLLNHQTASELEPVH 209 (311)
T ss_pred ceEEe--ecC----CCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCCc-cEEEEEccCCCccccceEhh
Confidence 34443 222 235999999999876553322 22223334555777555544444 778887765432 2222222
Q ss_pred CCcCCCCc---eEEEeCCEEEEEcCCce-EEEEEcCCc
Q 045821 290 TLLTRPPC---KLVAIGKTIFVIGKGCS-AVVIDVGNI 323 (352)
Q Consensus 290 ~~~~~~~~---~~~~~~~~i~v~gG~~~-~~~~d~~~~ 323 (352)
....+..+ +....++|.++..+.++ +.|+..+.+
T Consensus 210 k~~ah~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~ 247 (311)
T KOG0315|consen 210 KFQAHNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDF 247 (311)
T ss_pred heecccceEEEEEECCCCcEEEeecCCceEEEEecCCc
Confidence 21122232 23344677666666554 446666655
No 154
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=89.22 E-value=13 Score=31.55 Aligned_cols=103 Identities=19% Similarity=0.402 Sum_probs=52.5
Q ss_pred EEEEEeCCCCCCceEEEeCCCCcEEEecC----------------CCCcccceeEEEECCEEEEEccCCCCCCceEEEEE
Q 045821 169 KIYCIGGLGDTHSWDVYDPRTNNWKLHTE----------------PNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVY 232 (352)
Q Consensus 169 ~iyv~gG~~~~~~v~~yd~~t~~W~~~~~----------------~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~y 232 (352)
+-++.||++ +.|-+++...++|..-.. ...++...+++.-++++++. .-
T Consensus 176 krlvSgGcD--n~VkiW~~~~~~w~~e~~l~~H~dwVRDVAwaP~~gl~~s~iAS~SqDg~viIw-------------t~ 240 (299)
T KOG1332|consen 176 KRLVSGGCD--NLVKIWKFDSDSWKLERTLEGHKDWVRDVAWAPSVGLPKSTIASCSQDGTVIIW-------------TK 240 (299)
T ss_pred ceeeccCCc--cceeeeecCCcchhhhhhhhhcchhhhhhhhccccCCCceeeEEecCCCcEEEE-------------Ee
Confidence 457888888 455555555557753221 11122222344445555444 44
Q ss_pred eCCCCceeec--CCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCC-CCceEEccCC
Q 045821 233 EPSTDSWLHA--DANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKE-SREWSPVGRL 288 (352)
Q Consensus 233 d~~~~~W~~~--~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~-~~~W~~~~~~ 288 (352)
+...+.|+.. ...+...++..--..++.|-+-||.+ .+.+|-.. .++|.+++..
T Consensus 241 ~~e~e~wk~tll~~f~~~~w~vSWS~sGn~LaVs~GdN--kvtlwke~~~Gkw~~v~~~ 297 (299)
T KOG1332|consen 241 DEEYEPWKKTLLEEFPDVVWRVSWSLSGNILAVSGGDN--KVTLWKENVDGKWEEVGEV 297 (299)
T ss_pred cCccCcccccccccCCcceEEEEEeccccEEEEecCCc--EEEEEEeCCCCcEEEcccc
Confidence 4556778643 33233333322223344555555544 66666555 4599998754
No 155
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=89.04 E-value=28 Score=35.28 Aligned_cols=35 Identities=20% Similarity=0.295 Sum_probs=24.4
Q ss_pred eEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCC
Q 045821 113 MGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPM 154 (352)
Q Consensus 113 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~ 154 (352)
.+-.++++.+|+... .+.++.+|..|++ |+.-+..
T Consensus 188 ~TPlvvgg~lYv~t~-------~~~V~ALDa~TGk~lW~~d~~~ 224 (764)
T TIGR03074 188 ATPLKVGDTLYLCTP-------HNKVIALDAATGKEKWKFDPKL 224 (764)
T ss_pred cCCEEECCEEEEECC-------CCeEEEEECCCCcEEEEEcCCC
Confidence 445667999999843 3568888888775 7765543
No 156
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=88.78 E-value=22 Score=33.68 Aligned_cols=146 Identities=11% Similarity=0.100 Sum_probs=72.1
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEE---Ee
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWK---LH 195 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~---~~ 195 (352)
.+.+++.|+.+ ..+.++|..+.+-...-.............-++..++.+.++ ..+.+||..+..-+ .+
T Consensus 257 ~g~~i~Sgs~D------~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d~~~l~s~s~d--~~i~vwd~~~~~~~~~~~~ 328 (456)
T KOG0266|consen 257 DGNLLVSGSDD------GTVRIWDVRTGECVRKLKGHSDGISGLAFSPDGNLLVSASYD--GTIRVWDLETGSKLCLKLL 328 (456)
T ss_pred CCCEEEEecCC------CcEEEEeccCCeEEEeeeccCCceEEEEECCCCCEEEEcCCC--ccEEEEECCCCceeeeecc
Confidence 44677777653 458888998855333222222222222333367777777664 78899999988743 22
Q ss_pred cCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCc----eeecCCCCCCCCCCcEEEECCEEEEEecCCCC
Q 045821 196 TEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDS----WLHADANMASGWRGPAVVVDDALYVLDQSSGT 270 (352)
Q Consensus 196 ~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~----W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~ 270 (352)
...........+... +++..+.+..+ ..+-.+|..... |...... ...........++++.+.|...+
T Consensus 329 ~~~~~~~~~~~~~fsp~~~~ll~~~~d-----~~~~~w~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~i~sg~~d~- 401 (456)
T KOG0266|consen 329 SGAENSAPVTSVQFSPNGKYLLSASLD-----RTLKLWDLRSGKSVGTYTGHSNL-VRCIFSPTLSTGGKLIYSGSEDG- 401 (456)
T ss_pred cCCCCCCceeEEEECCCCcEEEEecCC-----CeEEEEEccCCcceeeecccCCc-ceeEecccccCCCCeEEEEeCCc-
Confidence 211111111222332 44444443222 124444444332 2222111 00111111234666666666555
Q ss_pred eEEEEeCCC
Q 045821 271 KLMMWQKES 279 (352)
Q Consensus 271 ~v~~yd~~~ 279 (352)
.|..||+.+
T Consensus 402 ~v~~~~~~s 410 (456)
T KOG0266|consen 402 SVYVWDSSS 410 (456)
T ss_pred eEEEEeCCc
Confidence 899999987
No 157
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=88.38 E-value=21 Score=33.08 Aligned_cols=147 Identities=12% Similarity=0.104 Sum_probs=83.0
Q ss_pred CCeEEEEECCCC-----eeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCCCc---EE-EecCCCCcccc
Q 045821 135 TSEVYCYDASMN-----TWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRTNN---WK-LHTEPNIFTEI 204 (352)
Q Consensus 135 ~~~~~~~d~~t~-----~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t~~---W~-~~~~~~~~~~~ 204 (352)
.++++..|.... .|..+..-. .-....+...++.+|+....... ..+..++..+.. |. .+.+.......
T Consensus 251 ~s~v~~~d~~~~~~~~~~~~~l~~~~-~~~~~~v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~~~~~~l 329 (414)
T PF02897_consen 251 ESEVYLLDLDDGGSPDAKPKLLSPRE-DGVEYYVDHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPEDEDVSL 329 (414)
T ss_dssp EEEEEEEECCCTTTSS-SEEEEEESS-SS-EEEEEEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--SSSEEE
T ss_pred CCeEEEEeccccCCCcCCcEEEeCCC-CceEEEEEccCCEEEEeeCCCCCCcEEEEecccccccccceeEEcCCCCceeE
Confidence 378999999875 677775422 22333344458899998875544 788888888665 66 44433333344
Q ss_pred eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEE---ECCEEE-EEecCCC-CeEEEEeCCC
Q 045821 205 EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVV---VDDALY-VLDQSSG-TKLMMWQKES 279 (352)
Q Consensus 205 ~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~---~~~~l~-~~gg~~~-~~v~~yd~~~ 279 (352)
..+...++.|++.. .. +....+.++|+. ..|.....+.+......... .++.++ .+.+... ..++.||+.+
T Consensus 330 ~~~~~~~~~Lvl~~-~~--~~~~~l~v~~~~-~~~~~~~~~~p~~g~v~~~~~~~~~~~~~~~~ss~~~P~~~y~~d~~t 405 (414)
T PF02897_consen 330 EDVSLFKDYLVLSY-RE--NGSSRLRVYDLD-DGKESREIPLPEAGSVSGVSGDFDSDELRFSYSSFTTPPTVYRYDLAT 405 (414)
T ss_dssp EEEEEETTEEEEEE-EE--TTEEEEEEEETT--TEEEEEEESSSSSEEEEEES-TT-SEEEEEEEETTEEEEEEEEETTT
T ss_pred EEEEEECCEEEEEE-EE--CCccEEEEEECC-CCcEEeeecCCcceEEeccCCCCCCCEEEEEEeCCCCCCEEEEEECCC
Confidence 55666788888873 22 334478889988 33433322222211111111 133443 4455443 5899999999
Q ss_pred CceEEcc
Q 045821 280 REWSPVG 286 (352)
Q Consensus 280 ~~W~~~~ 286 (352)
++.+.+.
T Consensus 406 ~~~~~~k 412 (414)
T PF02897_consen 406 GELTLLK 412 (414)
T ss_dssp TCEEEEE
T ss_pred CCEEEEE
Confidence 9877653
No 158
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=88.08 E-value=17 Score=31.72 Aligned_cols=101 Identities=8% Similarity=-0.012 Sum_probs=53.5
Q ss_pred eEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeee
Q 045821 84 CCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPC 163 (352)
Q Consensus 84 ~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~ 163 (352)
.+..||. ..++-.+.-... ..-.....-.......+.+-. ....++...+..+|+.-+.-+-...+...-.
T Consensus 37 sl~LYd~--~~g~~~~ti~sk---kyG~~~~~Fth~~~~~i~sSt----k~d~tIryLsl~dNkylRYF~GH~~~V~sL~ 107 (311)
T KOG1446|consen 37 SLRLYDS--LSGKQVKTINSK---KYGVDLACFTHHSNTVIHSST----KEDDTIRYLSLHDNKYLRYFPGHKKRVNSLS 107 (311)
T ss_pred eEEEEEc--CCCceeeEeecc---cccccEEEEecCCceEEEccC----CCCCceEEEEeecCceEEEcCCCCceEEEEE
Confidence 4566788 776655444333 111112222223333333322 2346778888888887554332333332222
Q ss_pred EEECCEEEEEeCCCCCCceEEEeCCCCcEEEe
Q 045821 164 GVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLH 195 (352)
Q Consensus 164 ~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~ 195 (352)
+.=.+..|+-++.+ ..+..+|..+...+-+
T Consensus 108 ~sP~~d~FlS~S~D--~tvrLWDlR~~~cqg~ 137 (311)
T KOG1446|consen 108 VSPKDDTFLSSSLD--KTVRLWDLRVKKCQGL 137 (311)
T ss_pred ecCCCCeEEecccC--CeEEeeEecCCCCceE
Confidence 22234778888876 6788888887665543
No 159
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=87.98 E-value=25 Score=33.54 Aligned_cols=162 Identities=12% Similarity=0.152 Sum_probs=79.7
Q ss_pred CCeEEEEECCCCe--eEecCCCCC---cceee--eeE---EECCE---EEEEeCCCCCCceEEEeCCCCc--EEEecCCC
Q 045821 135 TSEVYCYDASMNT--WTDAAPMST---ARCYF--PCG---VLNQK---IYCIGGLGDTHSWDVYDPRTNN--WKLHTEPN 199 (352)
Q Consensus 135 ~~~~~~~d~~t~~--W~~~~~~~~---~r~~~--~~~---~~~~~---iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~ 199 (352)
.+.++.+|..+++ |+.-...+. -+... ... .+++. +.++|..+ ..++.+|..+.+ |+.-...
T Consensus 255 ~~~l~Ald~~tG~~~W~~~~~~~~~~~~~~~s~p~~~~~~~~~g~~~~~V~~g~~~--G~l~ald~~tG~~~W~~~~~~- 331 (488)
T cd00216 255 TDSIVALDADTGKVKWFYQTTPHDLWDYDGPNQPSLADIKPKDGKPVPAIVHAPKN--GFFYVLDRTTGKLISARPEVE- 331 (488)
T ss_pred eeeEEEEcCCCCCEEEEeeCCCCCCcccccCCCCeEEeccccCCCeeEEEEEECCC--ceEEEEECCCCcEeeEeEeec-
Confidence 4579999999886 874321110 01111 111 12332 33334333 568999999875 8753210
Q ss_pred CcccceeEEEECCEEEEEccCCC------------CCCceEEEEEeCCCC--ceeecCCCC-------CCCCCCcEEEEC
Q 045821 200 IFTEIEDSFVMDGKIYIRCSASA------------ATSHVCALVYEPSTD--SWLHADANM-------ASGWRGPAVVVD 258 (352)
Q Consensus 200 ~~~~~~~~~~~~g~iyv~GG~~~------------~~~~~~i~~yd~~~~--~W~~~~~~~-------~~~~~~~~~~~~ 258 (352)
.....-.+.+|+-..... ......+.++|..+. .|+.-.... ........+..+
T Consensus 332 -----~~~~~~~~~vyv~~~~~~~~~~~~~~~~~~~~~~G~l~AlD~~tG~~~W~~~~~~~~~~~~~g~~~~~~~~~~~g 406 (488)
T cd00216 332 -----QPMAYDPGLVYLGAFHIPLGLPPQKKKRCKKPGKGGLAALDPKTGKVVWEKREGTIRDSWNIGFPHWGGSLATAG 406 (488)
T ss_pred -----cccccCCceEEEccccccccCcccccCCCCCCCceEEEEEeCCCCcEeeEeeCCccccccccCCcccCcceEecC
Confidence 001111266666421100 011236888888765 487642200 011122233445
Q ss_pred CEEEEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEE
Q 045821 259 DALYVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVI 309 (352)
Q Consensus 259 ~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~ 309 (352)
+.|| +|..++ .++.+|.++++ |+. .++.. ..........++++||.
T Consensus 407 ~~v~-~g~~dG-~l~ald~~tG~~lW~~--~~~~~-~~a~P~~~~~~g~~yv~ 454 (488)
T cd00216 407 NLVF-AGAADG-YFRAFDATTGKELWKF--RTPSG-IQATPMTYEVNGKQYVG 454 (488)
T ss_pred CeEE-EECCCC-eEEEEECCCCceeeEE--ECCCC-ceEcCEEEEeCCEEEEE
Confidence 5544 454455 89999998765 763 23321 12222333558999986
No 160
>PRK01742 tolB translocation protein TolB; Provisional
Probab=87.23 E-value=26 Score=32.81 Aligned_cols=180 Identities=9% Similarity=0.003 Sum_probs=86.1
Q ss_pred ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCE-EEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821 82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKN-AYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY 160 (352)
Q Consensus 82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~-iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~ 160 (352)
...++.+|. .++.-..+...+. .........+++ |++....++ ..++|.+|..+++.+.+..-...-..
T Consensus 227 ~~~i~i~dl--~tg~~~~l~~~~g----~~~~~~wSPDG~~La~~~~~~g----~~~Iy~~d~~~~~~~~lt~~~~~~~~ 296 (429)
T PRK01742 227 KSQLVVHDL--RSGARKVVASFRG----HNGAPAFSPDGSRLAFASSKDG----VLNIYVMGANGGTPSQLTSGAGNNTE 296 (429)
T ss_pred CcEEEEEeC--CCCceEEEecCCC----ccCceeECCCCCEEEEEEecCC----cEEEEEEECCCCCeEeeccCCCCcCC
Confidence 456777888 7766555554431 111122223444 444332222 24688899988876666542211111
Q ss_pred eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE-EEEEccCCCCCCceEEEEEeCCCCc
Q 045821 161 FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK-IYIRCSASAATSHVCALVYEPSTDS 238 (352)
Q Consensus 161 ~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~-iyv~GG~~~~~~~~~i~~yd~~~~~ 238 (352)
. ...-++ .|+..........++.+|..+..-+.+. . .. ......-+|+ |++.++ ..+..+|..+++
T Consensus 297 ~-~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~~~l~-~--~~-~~~~~SpDG~~ia~~~~-------~~i~~~Dl~~g~ 364 (429)
T PRK01742 297 P-SWSPDGQSILFTSDRSGSPQVYRMSASGGGASLVG-G--RG-YSAQISADGKTLVMING-------DNVVKQDLTSGS 364 (429)
T ss_pred E-EECCCCCEEEEEECCCCCceEEEEECCCCCeEEec-C--CC-CCccCCCCCCEEEEEcC-------CCEEEEECCCCC
Confidence 1 111244 4554443332256777777655433332 1 11 1222223554 444432 236778999888
Q ss_pred eeecCCCCCCCCCCcEEEECCEEEEEecCCC-C-eEEEEeCCCCceEEc
Q 045821 239 WLHADANMASGWRGPAVVVDDALYVLDQSSG-T-KLMMWQKESREWSPV 285 (352)
Q Consensus 239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-~-~v~~yd~~~~~W~~~ 285 (352)
+..+.... ........-+++.++++...+ . .++..+.....=.++
T Consensus 365 ~~~lt~~~--~~~~~~~sPdG~~i~~~s~~g~~~~l~~~~~~G~~~~~l 411 (429)
T PRK01742 365 TEVLSSTF--LDESPSISPNGIMIIYSSTQGLGKVLQLVSADGRFKARL 411 (429)
T ss_pred eEEecCCC--CCCCceECCCCCEEEEEEcCCCceEEEEEECCCCceEEc
Confidence 87764321 112334455676666655433 2 233445444433334
No 161
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=86.72 E-value=17 Score=34.58 Aligned_cols=62 Identities=11% Similarity=0.247 Sum_probs=38.2
Q ss_pred CEEEEEccCCCCCCceEEEEEeCCCCceeec-CCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc
Q 045821 212 GKIYIRCSASAATSHVCALVYEPSTDSWLHA-DANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE 281 (352)
Q Consensus 212 g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~-~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~ 281 (352)
--||+.|... .++++|+..+.|-.- .... .......+.--..|+.+|+..+ .|..||+.+.+
T Consensus 146 cDly~~gsg~------evYRlNLEqGrfL~P~~~~~-~~lN~v~in~~hgLla~Gt~~g-~VEfwDpR~ks 208 (703)
T KOG2321|consen 146 CDLYLVGSGS------EVYRLNLEQGRFLNPFETDS-GELNVVSINEEHGLLACGTEDG-VVEFWDPRDKS 208 (703)
T ss_pred ccEEEeecCc------ceEEEEcccccccccccccc-ccceeeeecCccceEEecccCc-eEEEecchhhh
Confidence 3577775333 699999999988542 1111 1111111223456788888777 89999998765
No 162
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=86.69 E-value=27 Score=32.55 Aligned_cols=184 Identities=10% Similarity=0.072 Sum_probs=99.1
Q ss_pred CCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC--CceEEEeCCCCcEEEecCCCCcccceeEEEEC
Q 045821 134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT--HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMD 211 (352)
Q Consensus 134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~--~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~ 211 (352)
...++++++...++-+++.-+...-..-+.-.-++.|.|..-...+ +-.+.|-...+-=.. .+++.... .+.+.-+
T Consensus 105 ~taDly~v~~e~Ge~kRiTyfGr~fT~VaG~~~dg~iiV~TD~~tPF~q~~~lYkv~~dg~~~-e~LnlGpa-thiv~~d 182 (668)
T COG4946 105 QTADLYVVPSEDGEAKRITYFGRRFTRVAGWIPDGEIIVSTDFHTPFSQWTELYKVNVDGIKT-EPLNLGPA-THIVIKD 182 (668)
T ss_pred ccccEEEEeCCCCcEEEEEEeccccceeeccCCCCCEEEEeccCCCcccceeeeEEccCCcee-eeccCCce-eeEEEeC
Confidence 3567888888888877776553221222222236777777654433 233444444332111 11222222 2345557
Q ss_pred CEEEEEccCCCC----------CCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-CeEEEEeCCCC
Q 045821 212 GKIYIRCSASAA----------TSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-TKLMMWQKESR 280 (352)
Q Consensus 212 g~iyv~GG~~~~----------~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-~~v~~yd~~~~ 280 (352)
|.+++ |.+.. +..-.+|.-.....++.++-+.. ... ..-+.+++++|.+....+ .+++.-|.+.+
T Consensus 183 g~ivi--gRntydLP~WK~YkGGtrGklWis~d~g~tFeK~vdl~-~~v-S~PmIV~~RvYFlsD~eG~GnlYSvdldGk 258 (668)
T COG4946 183 GIIVI--GRNTYDLPHWKGYKGGTRGKLWISSDGGKTFEKFVDLD-GNV-SSPMIVGERVYFLSDHEGVGNLYSVDLDGK 258 (668)
T ss_pred CEEEE--ccCcccCcccccccCCccceEEEEecCCcceeeeeecC-CCc-CCceEEcceEEEEecccCccceEEeccCCc
Confidence 75554 33311 11125665544444555553321 111 223567999999987766 57888888887
Q ss_pred ceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCccccc
Q 045821 281 EWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIG 327 (352)
Q Consensus 281 ~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~ 327 (352)
--.+..+.....+|+ +.-+++-.||.-.-+.++|||+++.-++
T Consensus 259 DlrrHTnFtdYY~R~----~nsDGkrIvFq~~GdIylydP~td~lek 301 (668)
T COG4946 259 DLRRHTNFTDYYPRN----ANSDGKRIVFQNAGDIYLYDPETDSLEK 301 (668)
T ss_pred hhhhcCCchhccccc----cCCCCcEEEEecCCcEEEeCCCcCccee
Confidence 766665554322222 2235665555444479999999988665
No 163
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=85.06 E-value=24 Score=30.33 Aligned_cols=90 Identities=11% Similarity=0.027 Sum_probs=55.5
Q ss_pred CceEEEeCCCC--cEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCC--CceeecCCCCCCCCCCcEE
Q 045821 180 HSWDVYDPRTN--NWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPST--DSWLHADANMASGWRGPAV 255 (352)
Q Consensus 180 ~~v~~yd~~t~--~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~--~~W~~~~~~~~~~~~~~~~ 255 (352)
..+..-|+.++ .|+.+- ..|..+.+.++++. .|+|.+.+ .++..+.++ .-|..+.. ..-...+.+
T Consensus 33 ~~~~avd~~sG~~~We~il---g~RiE~sa~vvgdf-VV~GCy~g-----~lYfl~~~tGs~~w~f~~~--~~vk~~a~~ 101 (354)
T KOG4649|consen 33 GIVIAVDPQSGNLIWEAIL---GVRIECSAIVVGDF-VVLGCYSG-----GLYFLCVKTGSQIWNFVIL--ETVKVRAQC 101 (354)
T ss_pred ceEEEecCCCCcEEeehhh---CceeeeeeEEECCE-EEEEEccC-----cEEEEEecchhheeeeeeh--hhhccceEE
Confidence 56778888877 498864 45556667777777 56655543 244445444 46776532 112233345
Q ss_pred EECCEEEEEecCCCCeEEEEeCCCCc
Q 045821 256 VVDDALYVLDQSSGTKLMMWQKESRE 281 (352)
Q Consensus 256 ~~~~~l~~~gg~~~~~v~~yd~~~~~ 281 (352)
..++.+...|..++ .++..|+.+..
T Consensus 102 d~~~glIycgshd~-~~yalD~~~~~ 126 (354)
T KOG4649|consen 102 DFDGGLIYCGSHDG-NFYALDPKTYG 126 (354)
T ss_pred cCCCceEEEecCCC-cEEEecccccc
Confidence 55677666776666 88888888754
No 164
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=84.81 E-value=35 Score=32.09 Aligned_cols=185 Identities=10% Similarity=0.090 Sum_probs=91.3
Q ss_pred eEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeee
Q 045821 84 CCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPC 163 (352)
Q Consensus 84 ~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~ 163 (352)
.++.+++ .++...+...+-+......-.+++..+|.-.+.|- ....+.+|++.+++...-.. -.+..-++.
T Consensus 223 H~~Fw~~--~~~~l~k~~~~fek~ekk~Vl~v~F~engdviTgD------S~G~i~Iw~~~~~~~~k~~~-aH~ggv~~L 293 (626)
T KOG2106|consen 223 HLYFWTL--RGGSLVKRQGIFEKREKKFVLCVTFLENGDVITGD------SGGNILIWSKGTNRISKQVH-AHDGGVFSL 293 (626)
T ss_pred eEEEEEc--cCCceEEEeeccccccceEEEEEEEcCCCCEEeec------CCceEEEEeCCCceEEeEee-ecCCceEEE
Confidence 4666699 99888777665433233222333333444444443 33468899998887655433 233344666
Q ss_pred EEEC-CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccc-eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceee
Q 045821 164 GVLN-QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEI-EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLH 241 (352)
Q Consensus 164 ~~~~-~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~-~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~ 241 (352)
+.+. |.+.- ||.+ ..+..+|-.-++=+++. +|..... ..++.-.+.||| | .. .+.+..=..++. .+.
T Consensus 294 ~~lr~GtllS-GgKD--Rki~~Wd~~y~k~r~~e-lPe~~G~iRtv~e~~~di~v-G-Tt----rN~iL~Gt~~~~-f~~ 362 (626)
T KOG2106|consen 294 CMLRDGTLLS-GGKD--RKIILWDDNYRKLRETE-LPEQFGPIRTVAEGKGDILV-G-TT----RNFILQGTLENG-FTL 362 (626)
T ss_pred EEecCccEee-cCcc--ceEEecccccccccccc-CchhcCCeeEEecCCCcEEE-e-ec----cceEEEeeecCC-ceE
Confidence 6664 55555 8876 56677773322222222 2222111 122222334555 2 21 122333222222 221
Q ss_pred cCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821 242 ADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLS 289 (352)
Q Consensus 242 ~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~ 289 (352)
+........-+.+......+|+..|..+ .+..|+-..-.|+++-.-|
T Consensus 363 ~v~gh~delwgla~hps~~q~~T~gqdk-~v~lW~~~k~~wt~~~~d~ 409 (626)
T KOG2106|consen 363 TVQGHGDELWGLATHPSKNQLLTCGQDK-HVRLWNDHKLEWTKIIEDP 409 (626)
T ss_pred EEEecccceeeEEcCCChhheeeccCcc-eEEEccCCceeEEEEecCc
Confidence 1110111111222334566777777766 7888886677899886444
No 165
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=84.40 E-value=34 Score=31.53 Aligned_cols=119 Identities=9% Similarity=0.116 Sum_probs=65.5
Q ss_pred eeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce-eEEEE-CCEEEEEccCCCCCCceEEEEEeCCC
Q 045821 160 YFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE-DSFVM-DGKIYIRCSASAATSHVCALVYEPST 236 (352)
Q Consensus 160 ~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~-~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~ 236 (352)
..+++.. ||.||..|-.+ ..+.+||..+.. .+...|....-- ...+. ||+..+. +.+. ..+..+|+..
T Consensus 350 ~ts~~fHpDgLifgtgt~d--~~vkiwdlks~~--~~a~Fpght~~vk~i~FsENGY~Lat-~add----~~V~lwDLRK 420 (506)
T KOG0289|consen 350 YTSAAFHPDGLIFGTGTPD--GVVKIWDLKSQT--NVAKFPGHTGPVKAISFSENGYWLAT-AADD----GSVKLWDLRK 420 (506)
T ss_pred eEEeeEcCCceEEeccCCC--ceEEEEEcCCcc--ccccCCCCCCceeEEEeccCceEEEE-EecC----CeEEEEEehh
Confidence 4444444 78888877655 678889998776 444444422211 22333 4554444 3332 1377788755
Q ss_pred Cc-eeecCCCCCCCCCCcEEEE--CCEEEEEecCCCCeEEEEeCCCCceEEccCCCC
Q 045821 237 DS-WLHADANMASGWRGPAVVV--DDALYVLDQSSGTKLMMWQKESREWSPVGRLST 290 (352)
Q Consensus 237 ~~-W~~~~~~~~~~~~~~~~~~--~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~ 290 (352)
.+ ...+. .+......++.+ .|..+.++|..= .|+.|+-.+.+|+++..++.
T Consensus 421 l~n~kt~~--l~~~~~v~s~~fD~SGt~L~~~g~~l-~Vy~~~k~~k~W~~~~~~~~ 474 (506)
T KOG0289|consen 421 LKNFKTIQ--LDEKKEVNSLSFDQSGTYLGIAGSDL-QVYICKKKTKSWTEIKELAD 474 (506)
T ss_pred hcccceee--ccccccceeEEEcCCCCeEEeeccee-EEEEEecccccceeeehhhh
Confidence 43 22221 111111122333 456666665432 56777888999999987765
No 166
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=84.24 E-value=25 Score=29.94 Aligned_cols=125 Identities=15% Similarity=0.241 Sum_probs=64.7
Q ss_pred CCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEE-EeCCCCCCceEEEeCCCCcEEEec--CCCCcccceeEEEE
Q 045821 134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYC-IGGLGDTHSWDVYDPRTNNWKLHT--EPNIFTEIEDSFVM 210 (352)
Q Consensus 134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv-~gG~~~~~~v~~yd~~t~~W~~~~--~~~~~~~~~~~~~~ 210 (352)
...++-.+|..|++-..--.++.+-...-.. .+|+|.. .-| ++|-.+|+.+-. .+. .||-... .++..-
T Consensus 163 dd~tVRLWD~rTgt~v~sL~~~s~VtSlEvs-~dG~ilTia~g----ssV~Fwdaksf~--~lKs~k~P~nV~-SASL~P 234 (334)
T KOG0278|consen 163 DDKTVRLWDHRTGTEVQSLEFNSPVTSLEVS-QDGRILTIAYG----SSVKFWDAKSFG--LLKSYKMPCNVE-SASLHP 234 (334)
T ss_pred cCCceEEEEeccCcEEEEEecCCCCcceeec-cCCCEEEEecC----ceeEEecccccc--ceeeccCccccc-cccccC
Confidence 3456777888877643222233332222222 2444443 334 567777776532 332 2222111 112223
Q ss_pred CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcE----EEECCEEEEEecCCCCeEEEE
Q 045821 211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPA----VVVDDALYVLDQSSGTKLMMW 275 (352)
Q Consensus 211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~----~~~~~~l~~~gg~~~~~v~~y 275 (352)
+..+||.||.. ..++.||..++.=... .....+++. ..-+|.+|..|..++ .+..|
T Consensus 235 ~k~~fVaGged-----~~~~kfDy~TgeEi~~---~nkgh~gpVhcVrFSPdGE~yAsGSEDG-TirlW 294 (334)
T KOG0278|consen 235 KKEFFVAGGED-----FKVYKFDYNTGEEIGS---YNKGHFGPVHCVRFSPDGELYASGSEDG-TIRLW 294 (334)
T ss_pred CCceEEecCcc-----eEEEEEeccCCceeee---cccCCCCceEEEEECCCCceeeccCCCc-eEEEE
Confidence 56899998665 3678888877642221 112222222 345999999998877 55554
No 167
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=84.23 E-value=25 Score=29.87 Aligned_cols=74 Identities=9% Similarity=0.109 Sum_probs=43.5
Q ss_pred CcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecC-CCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeC
Q 045821 156 TARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTE-PNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEP 234 (352)
Q Consensus 156 ~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~-~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~ 234 (352)
.++...+++..++++++. .-+.+.++|+...- .-+....+.+-.+-|.+..++|.+ +.+..+-.
T Consensus 221 l~~s~iAS~SqDg~viIw----------t~~~e~e~wk~tll~~f~~~~w~vSWS~sGn~LaVs~Gd-----Nkvtlwke 285 (299)
T KOG1332|consen 221 LPKSTIASCSQDGTVIIW----------TKDEEYEPWKKTLLEEFPDVVWRVSWSLSGNILAVSGGD-----NKVTLWKE 285 (299)
T ss_pred CCceeeEEecCCCcEEEE----------EecCccCcccccccccCCcceEEEEEeccccEEEEecCC-----cEEEEEEe
Confidence 466777777777776663 34456678875431 112233445556667766665544 24556655
Q ss_pred C-CCceeecCC
Q 045821 235 S-TDSWLHADA 244 (352)
Q Consensus 235 ~-~~~W~~~~~ 244 (352)
+ .++|.+++.
T Consensus 286 ~~~Gkw~~v~~ 296 (299)
T KOG1332|consen 286 NVDGKWEEVGE 296 (299)
T ss_pred CCCCcEEEccc
Confidence 4 559998854
No 168
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=83.43 E-value=31 Score=30.45 Aligned_cols=237 Identities=12% Similarity=0.060 Sum_probs=120.3
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcc--eEEEEECC-EEEEEeCCCCCCCCCCeEEEEECC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKG--MGFEVLGK-NAYLLGGCGWSEDATSEVYCYDAS 144 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~--~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~ 144 (352)
...+.+|+.. ......+||+ .+..-...-..+ ..|+. |++..-++ .+|..-+.. .....-+-+||..
T Consensus 16 ~~~avafaRR---PG~~~~v~D~--~~g~~~~~~~a~---~gRHFyGHg~fs~dG~~LytTEnd~--~~g~G~IgVyd~~ 85 (305)
T PF07433_consen 16 RPEAVAFARR---PGTFALVFDC--RTGQLLQRLWAP---PGRHFYGHGVFSPDGRLLYTTENDY--ETGRGVIGVYDAA 85 (305)
T ss_pred CCeEEEEEeC---CCcEEEEEEc--CCCceeeEEcCC---CCCEEecCEEEcCCCCEEEEecccc--CCCcEEEEEEECc
Confidence 4566667554 3345667798 777654333333 34544 44444444 566654422 2345667889998
Q ss_pred CCeeEecCCCCCc-ceeeeeEEE-CC-EEEEE-eCCCCC--------------CceEEEeCCCCcEEEecCCCCc----c
Q 045821 145 MNTWTDAAPMSTA-RCYFPCGVL-NQ-KIYCI-GGLGDT--------------HSWDVYDPRTNNWKLHTEPNIF----T 202 (352)
Q Consensus 145 t~~W~~~~~~~~~-r~~~~~~~~-~~-~iyv~-gG~~~~--------------~~v~~yd~~t~~W~~~~~~~~~----~ 202 (352)
+...++...+.. .-.|-+... ++ .+.|. ||...+ .++-..|..+.+=..--.+++. .
T Consensus 86 -~~~~ri~E~~s~GIGPHel~l~pDG~tLvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q~~Lp~~~~~lS 164 (305)
T PF07433_consen 86 -RGYRRIGEFPSHGIGPHELLLMPDGETLVVANGGIETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQVELPPDLHQLS 164 (305)
T ss_pred -CCcEEEeEecCCCcChhhEEEcCCCCEEEEEcCCCccCcccCceecChhhcCCceEEEecCCCceeeeeecCccccccc
Confidence 667777766643 334555554 44 34443 553211 5666677887753222122221 1
Q ss_pred cceeEEEECCEEEEEccCCCCCCce--EEEEEeCCCCceeecCCCC--CCC--CCCcEEEE--CCEEEEEecCCCCeEEE
Q 045821 203 EIEDSFVMDGKIYIRCSASAATSHV--CALVYEPSTDSWLHADANM--ASG--WRGPAVVV--DDALYVLDQSSGTKLMM 274 (352)
Q Consensus 203 ~~~~~~~~~g~iyv~GG~~~~~~~~--~i~~yd~~~~~W~~~~~~~--~~~--~~~~~~~~--~~~l~~~gg~~~~~v~~ 274 (352)
-.|.++.-+|.+.+-.-..+..... -+..++.... -+.+..+. ... -+..++.+ ++.++.+..=.++.+.+
T Consensus 165 iRHLa~~~~G~V~~a~Q~qg~~~~~~PLva~~~~g~~-~~~~~~p~~~~~~l~~Y~gSIa~~~~g~~ia~tsPrGg~~~~ 243 (305)
T PF07433_consen 165 IRHLAVDGDGTVAFAMQYQGDPGDAPPLVALHRRGGA-LRLLPAPEEQWRRLNGYIGSIAADRDGRLIAVTSPRGGRVAV 243 (305)
T ss_pred eeeEEecCCCcEEEEEecCCCCCccCCeEEEEcCCCc-ceeccCChHHHHhhCCceEEEEEeCCCCEEEEECCCCCEEEE
Confidence 2233333455555432222221111 3333333221 22222111 111 12223333 44455555444558999
Q ss_pred EeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcC
Q 045821 275 WQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVG 321 (352)
Q Consensus 275 yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~ 321 (352)
||..++.|.....++. .|+++..++...+-.|.-.+...+..
T Consensus 244 ~d~~tg~~~~~~~l~D-----~cGva~~~~~f~~ssG~G~~~~~~~~ 285 (305)
T PF07433_consen 244 WDAATGRLLGSVPLPD-----ACGVAPTDDGFLVSSGQGQLIRLSPD 285 (305)
T ss_pred EECCCCCEeeccccCc-----eeeeeecCCceEEeCCCccEEEccCc
Confidence 9999999988877765 36777777775555565444444433
No 169
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=83.23 E-value=46 Score=32.22 Aligned_cols=177 Identities=16% Similarity=0.159 Sum_probs=94.2
Q ss_pred CCCeEEEEECCCCeeEecCCCCCccee-eeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821 134 ATSEVYCYDASMNTWTDAAPMSTARCY-FPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG 212 (352)
Q Consensus 134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~-~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g 212 (352)
..+.+.++|..+++--.+-. +... -..+.+++.+.|.|.++ ..|-+||+.+.+.-.. +.......-+..+++
T Consensus 309 ~D~tVkVW~v~n~~~l~l~~---~h~~~V~~v~~~~~~lvsgs~d--~~v~VW~~~~~~cl~s--l~gH~~~V~sl~~~~ 381 (537)
T KOG0274|consen 309 RDNTVKVWDVTNGACLNLLR---GHTGPVNCVQLDEPLLVSGSYD--GTVKVWDPRTGKCLKS--LSGHTGRVYSLIVDS 381 (537)
T ss_pred CCceEEEEeccCcceEEEec---cccccEEEEEecCCEEEEEecC--ceEEEEEhhhceeeee--ecCCcceEEEEEecC
Confidence 34678888888665333221 1111 12333466777777766 5788999997665332 222111112345666
Q ss_pred -EEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCC
Q 045821 213 -KIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTL 291 (352)
Q Consensus 213 -~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~ 291 (352)
..++-|+.+ ..|.++|+.+.. ..+.....+...-....+.+++++-+..++ .+..||.+++.-.++-.-+.
T Consensus 382 ~~~~~Sgs~D-----~~IkvWdl~~~~-~c~~tl~~h~~~v~~l~~~~~~Lvs~~aD~-~Ik~WD~~~~~~~~~~~~~~- 453 (537)
T KOG0274|consen 382 ENRLLSGSLD-----TTIKVWDLRTKR-KCIHTLQGHTSLVSSLLLRDNFLVSSSADG-TIKLWDAEEGECLRTLEGRH- 453 (537)
T ss_pred cceEEeeeec-----cceEeecCCchh-hhhhhhcCCcccccccccccceeEeccccc-cEEEeecccCceeeeeccCC-
Confidence 666666555 246777776664 211111111111122345666676666665 89999998887765543321
Q ss_pred cCCCCceEEEeCCEEEEEcCCc-eEEEEEcCCccccc
Q 045821 292 LTRPPCKLVAIGKTIFVIGKGC-SAVVIDVGNIGNIG 327 (352)
Q Consensus 292 ~~~~~~~~~~~~~~i~v~gG~~-~~~~~d~~~~~w~~ 327 (352)
...-.++..+...++..+.+ ...++|.....-.+
T Consensus 454 --~~~v~~l~~~~~~il~s~~~~~~~l~dl~~~~~~~ 488 (537)
T KOG0274|consen 454 --VGGVSALALGKEEILCSSDDGSVKLWDLRSGTLIR 488 (537)
T ss_pred --cccEEEeecCcceEEEEecCCeeEEEecccCchhh
Confidence 12222333333455555544 45578888776554
No 170
>PRK01742 tolB translocation protein TolB; Provisional
Probab=82.72 E-value=42 Score=31.39 Aligned_cols=139 Identities=12% Similarity=0.029 Sum_probs=71.3
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEE-ECCE-EEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGV-LNQK-IYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG 212 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~-~~~~-iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g 212 (352)
...++++|..+++-+.+...+.. . ...+. -+++ |++....+...+++.+|..++..+++..... ........-+|
T Consensus 227 ~~~i~i~dl~tg~~~~l~~~~g~-~-~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~-~~~~~~wSpDG 303 (429)
T PRK01742 227 KSQLVVHDLRSGARKVVASFRGH-N-GAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTSGAG-NNTEPSWSPDG 303 (429)
T ss_pred CcEEEEEeCCCCceEEEecCCCc-c-CceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeeccCCC-CcCCEEECCCC
Confidence 35789999988766555544321 1 11121 2454 4443333322567888988887776653211 11122333456
Q ss_pred E-EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCCCeEEEEeCCCCceEEcc
Q 045821 213 K-IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSGTKLMMWQKESREWSPVG 286 (352)
Q Consensus 213 ~-iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~~~v~~yd~~~~~W~~~~ 286 (352)
+ |+.. ... .....++.++..+..-+.+.. . . ......-++ .|++.++ ..++.+|..++.+..+.
T Consensus 304 ~~i~f~-s~~--~g~~~I~~~~~~~~~~~~l~~--~-~-~~~~~SpDG~~ia~~~~---~~i~~~Dl~~g~~~~lt 369 (429)
T PRK01742 304 QSILFT-SDR--SGSPQVYRMSASGGGASLVGG--R-G-YSAQISADGKTLVMING---DNVVKQDLTSGSTEVLS 369 (429)
T ss_pred CEEEEE-ECC--CCCceEEEEECCCCCeEEecC--C-C-CCccCCCCCCEEEEEcC---CCEEEEECCCCCeEEec
Confidence 5 4443 221 112367777776554333321 1 1 122233344 4555544 26888999998887664
No 171
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=82.68 E-value=36 Score=30.58 Aligned_cols=69 Identities=20% Similarity=0.327 Sum_probs=37.0
Q ss_pred CCEEEEEcc---CCCC-CCceEEEEEeCCCCcee-ecCCCCCCCCCCcEEEECCE--EEEEecCCCCeEEEEeCCCCce
Q 045821 211 DGKIYIRCS---ASAA-TSHVCALVYEPSTDSWL-HADANMASGWRGPAVVVDDA--LYVLDQSSGTKLMMWQKESREW 282 (352)
Q Consensus 211 ~g~iyv~GG---~~~~-~~~~~i~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~~~--l~~~gg~~~~~v~~yd~~~~~W 282 (352)
.++|||+-= ...+ .....||+||+.+.+=. +++- ....-+..+..+.+ ||.+.+..+ .+.+||..+++-
T Consensus 249 ~~rlyvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l--~~~~~Si~Vsqd~~P~L~~~~~~~~-~l~v~D~~tGk~ 324 (342)
T PF06433_consen 249 SGRLYVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPL--EHPIDSIAVSQDDKPLLYALSAGDG-TLDVYDAATGKL 324 (342)
T ss_dssp TTEEEEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEEE--EEEESEEEEESSSS-EEEEEETTTT-EEEEEETTT--E
T ss_pred cCeEEEEecCCCCCCccCCceEEEEEECCCCeEEEEEeC--CCccceEEEccCCCcEEEEEcCCCC-eEEEEeCcCCcE
Confidence 678998741 1111 12238999999988643 2221 11111233444443 555554333 899999998754
No 172
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=82.37 E-value=55 Score=32.50 Aligned_cols=157 Identities=8% Similarity=0.083 Sum_probs=87.1
Q ss_pred EEEEC--CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcE
Q 045821 115 FEVLG--KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNW 192 (352)
Q Consensus 115 ~~~~~--~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W 192 (352)
.+.++ |.-..+|+. .+..+.+|+-.++++..-.+-..++.....-.-||.+.+.|+.+ ..|-+||..+.-.
T Consensus 312 t~~~N~tGDWiA~g~~-----klgQLlVweWqsEsYVlKQQgH~~~i~~l~YSpDgq~iaTG~eD--gKVKvWn~~SgfC 384 (893)
T KOG0291|consen 312 TVSFNSTGDWIAFGCS-----KLGQLLVWEWQSESYVLKQQGHSDRITSLAYSPDGQLIATGAED--GKVKVWNTQSGFC 384 (893)
T ss_pred EEEecccCCEEEEcCC-----ccceEEEEEeeccceeeeccccccceeeEEECCCCcEEEeccCC--CcEEEEeccCceE
Confidence 34444 555556652 24577888777776654444333333222222378888888866 7888999888754
Q ss_pred EEecCCCCcccceeE--EEECCEEEEEccCCCCCCceEEEEEeCCCC-ceeecCCCCCCCCCCcEEEECCEEEEEecCCC
Q 045821 193 KLHTEPNIFTEIEDS--FVMDGKIYIRCSASAATSHVCALVYEPSTD-SWLHADANMASGWRGPAVVVDDALYVLDQSSG 269 (352)
Q Consensus 193 ~~~~~~~~~~~~~~~--~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~-~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~ 269 (352)
... .........+ +...|+..+.-..++ .+-.+|...- ..+....|-+..+...++-..|.|.+.|+.+.
T Consensus 385 ~vT--FteHts~Vt~v~f~~~g~~llssSLDG-----tVRAwDlkRYrNfRTft~P~p~QfscvavD~sGelV~AG~~d~ 457 (893)
T KOG0291|consen 385 FVT--FTEHTSGVTAVQFTARGNVLLSSSLDG-----TVRAWDLKRYRNFRTFTSPEPIQFSCVAVDPSGELVCAGAQDS 457 (893)
T ss_pred EEE--eccCCCceEEEEEEecCCEEEEeecCC-----eEEeeeecccceeeeecCCCceeeeEEEEcCCCCEEEeeccce
Confidence 322 1222222222 223455444423332 3555665433 24444433333333223333499999999887
Q ss_pred CeEEEEeCCCCceEEc
Q 045821 270 TKLMMWQKESREWSPV 285 (352)
Q Consensus 270 ~~v~~yd~~~~~W~~~ 285 (352)
-.|++|+.++++--.+
T Consensus 458 F~IfvWS~qTGqllDi 473 (893)
T KOG0291|consen 458 FEIFVWSVQTGQLLDI 473 (893)
T ss_pred EEEEEEEeecCeeeeh
Confidence 7899999998886544
No 173
>PF15525 DUF4652: Domain of unknown function (DUF4652)
Probab=82.18 E-value=25 Score=28.46 Aligned_cols=86 Identities=15% Similarity=0.078 Sum_probs=48.5
Q ss_pred CCCCCCCCCCeEEEEECCCCeeEecCCCCCc-ceeeeeEE-E-CCEEEEEeCCCCC-----CceEEEeCCCCcEEEecCC
Q 045821 127 GCGWSEDATSEVYCYDASMNTWTDAAPMSTA-RCYFPCGV-L-NQKIYCIGGLGDT-----HSWDVYDPRTNNWKLHTEP 198 (352)
Q Consensus 127 G~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~-r~~~~~~~-~-~~~iyv~gG~~~~-----~~v~~yd~~t~~W~~~~~~ 198 (352)
|.++.......+|++|..++.|..+..-+.. .+.+--+. + +..|.|+-|.... ..+++|++.|+.=..+-+.
T Consensus 79 g~~a~eEgiGkIYIkn~~~~~~~~L~i~~~~~k~sPK~i~WiDD~~L~vIIG~a~GTvS~GGnLy~~nl~tg~~~~ly~~ 158 (200)
T PF15525_consen 79 GPEAEEEGIGKIYIKNLNNNNWWSLQIDQNEEKYSPKYIEWIDDNNLAVIIGYAHGTVSKGGNLYKYNLNTGNLTELYEW 158 (200)
T ss_pred CCccccccceeEEEEecCCCceEEEEecCcccccCCceeEEecCCcEEEEEccccceEccCCeEEEEEccCCceeEeeec
Confidence 4444456688999999999987655322211 22222222 2 3444444443221 8899999999987777654
Q ss_pred CCcccceeEEEECC
Q 045821 199 NIFTEIEDSFVMDG 212 (352)
Q Consensus 199 ~~~~~~~~~~~~~g 212 (352)
..-......+...+
T Consensus 159 ~dkkqQVis~e~~g 172 (200)
T PF15525_consen 159 KDKKQQVISAEKNG 172 (200)
T ss_pred cccceeEEEEEEeC
Confidence 33333333333333
No 174
>PRK10115 protease 2; Provisional
Probab=82.17 E-value=59 Score=32.67 Aligned_cols=149 Identities=11% Similarity=0.012 Sum_probs=77.4
Q ss_pred CCeEEEEECCCCe--eEecCCCCCcceeeeeEEE-CCEEEEEeCCCCC-CceEEEeC--CCCcEEEecCCCCcccceeEE
Q 045821 135 TSEVYCYDASMNT--WTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDT-HSWDVYDP--RTNNWKLHTEPNIFTEIEDSF 208 (352)
Q Consensus 135 ~~~~~~~d~~t~~--W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~-~~v~~yd~--~t~~W~~~~~~~~~~~~~~~~ 208 (352)
..++++++..|.. =..+-.-+........... +++..++...... +.++.|+. .+..|..+.+.+..... ...
T Consensus 198 ~~~v~~h~lgt~~~~d~lv~~e~~~~~~~~~~~s~d~~~l~i~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~ 276 (686)
T PRK10115 198 PYQVWRHTIGTPASQDELVYEEKDDTFYVSLHKTTSKHYVVIHLASATTSEVLLLDAELADAEPFVFLPRRKDHEY-SLD 276 (686)
T ss_pred CCEEEEEECCCChhHCeEEEeeCCCCEEEEEEEcCCCCEEEEEEECCccccEEEEECcCCCCCceEEEECCCCCEE-EEE
Confidence 4789999999883 2223221222222222223 4443334443333 77888883 23444433322222222 233
Q ss_pred EECCEEEEEccCCCCCCceEEEEEeCC-CCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-CeEEEEeCCCCceEEcc
Q 045821 209 VMDGKIYIRCSASAATSHVCALVYEPS-TDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-TKLMMWQKESREWSPVG 286 (352)
Q Consensus 209 ~~~g~iyv~GG~~~~~~~~~i~~yd~~-~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-~~v~~yd~~~~~W~~~~ 286 (352)
..++.+|+.--.. .....+...+.. ..+|+.+-+......-......++.|++..-..+ ..++++|..++....+.
T Consensus 277 ~~~~~ly~~tn~~--~~~~~l~~~~~~~~~~~~~l~~~~~~~~i~~~~~~~~~l~~~~~~~g~~~l~~~~~~~~~~~~l~ 354 (686)
T PRK10115 277 HYQHRFYLRSNRH--GKNFGLYRTRVRDEQQWEELIPPRENIMLEGFTLFTDWLVVEERQRGLTSLRQINRKTREVIGIA 354 (686)
T ss_pred eCCCEEEEEEcCC--CCCceEEEecCCCcccCeEEECCCCCCEEEEEEEECCEEEEEEEeCCEEEEEEEcCCCCceEEec
Confidence 4567888874222 223356777776 5789887543221111222345777777765444 46888887766655554
No 175
>PLN00181 protein SPA1-RELATED; Provisional
Probab=81.95 E-value=64 Score=32.99 Aligned_cols=143 Identities=10% Similarity=0.051 Sum_probs=69.0
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCee-EecCCCCCcceeeeeEE--ECCEEEEEeCCCCCCceEEEeCCCCc--EE
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTW-TDAAPMSTARCYFPCGV--LNQKIYCIGGLGDTHSWDVYDPRTNN--WK 193 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W-~~~~~~~~~r~~~~~~~--~~~~iyv~gG~~~~~~v~~yd~~t~~--W~ 193 (352)
++.+++.|+.+ ..+.+||..+.+- ..+.. ... -..+.. -++..++.|+.+ ..+.+||..+.. ..
T Consensus 587 ~~~~L~Sgs~D------g~v~iWd~~~~~~~~~~~~-~~~--v~~v~~~~~~g~~latgs~d--g~I~iwD~~~~~~~~~ 655 (793)
T PLN00181 587 DPTLLASGSDD------GSVKLWSINQGVSIGTIKT-KAN--ICCVQFPSESGRSLAFGSAD--HKVYYYDLRNPKLPLC 655 (793)
T ss_pred CCCEEEEEcCC------CEEEEEECCCCcEEEEEec-CCC--eEEEEEeCCCCCEEEEEeCC--CeEEEEECCCCCccce
Confidence 45666667643 4578888876532 12211 111 111111 146777777755 688999987643 11
Q ss_pred EecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCC----ceeecCCCCCCCC--CCcEEEECCEEEEEecC
Q 045821 194 LHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTD----SWLHADANMASGW--RGPAVVVDDALYVLDQS 267 (352)
Q Consensus 194 ~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~----~W~~~~~~~~~~~--~~~~~~~~~~l~~~gg~ 267 (352)
.+.. ....-..+...++..++.|+.++ .+..||.... .|..+.....+.. ....+..++.+++.|+.
T Consensus 656 ~~~~--h~~~V~~v~f~~~~~lvs~s~D~-----~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs~ 728 (793)
T PLN00181 656 TMIG--HSKTVSYVRFVDSSTLVSSSTDN-----TLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGSE 728 (793)
T ss_pred EecC--CCCCEEEEEEeCCCEEEEEECCC-----EEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEeC
Confidence 2110 11111123334666666655442 4666776432 2322211111110 11122345666777776
Q ss_pred CCCeEEEEeCCCC
Q 045821 268 SGTKLMMWQKESR 280 (352)
Q Consensus 268 ~~~~v~~yd~~~~ 280 (352)
++ .+.+|+....
T Consensus 729 D~-~v~iw~~~~~ 740 (793)
T PLN00181 729 TN-EVFVYHKAFP 740 (793)
T ss_pred CC-EEEEEECCCC
Confidence 65 8888887643
No 176
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=81.53 E-value=38 Score=30.71 Aligned_cols=151 Identities=13% Similarity=0.132 Sum_probs=79.8
Q ss_pred CEEEEEeCCCCCCceEEEeCCCC--cEEEecCCCC------cccceeEEEECC---EEEEEccCCCCCCceEEEEEeCCC
Q 045821 168 QKIYCIGGLGDTHSWDVYDPRTN--NWKLHTEPNI------FTEIEDSFVMDG---KIYIRCSASAATSHVCALVYEPST 236 (352)
Q Consensus 168 ~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~~~~------~~~~~~~~~~~g---~iyv~GG~~~~~~~~~i~~yd~~~ 236 (352)
-.|+.+||....+.+.+||.++. .|+.-..++. +.....+.++.+ +-++-+-. .-.+-.||+..
T Consensus 161 p~Iva~GGke~~n~lkiwdle~~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~-----~hqvR~YDt~~ 235 (412)
T KOG3881|consen 161 PYIVATGGKENINELKIWDLEQSKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITR-----YHQVRLYDTRH 235 (412)
T ss_pred CceEecCchhcccceeeeecccceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEec-----ceeEEEecCcc
Confidence 36777788654466778887754 5776543222 222334445544 33333211 12577899875
Q ss_pred CceeecCC-CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCC-EEEEEcCCce
Q 045821 237 DSWLHADA-NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGK-TIFVIGKGCS 314 (352)
Q Consensus 237 ~~W~~~~~-~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~-~i~v~gG~~~ 314 (352)
+.=-...- .......+.+...++...++|...+ .+..||..++.---.. + .........+..+.+ .+...+|-+.
T Consensus 236 qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~~g-~l~~FD~r~~kl~g~~-~-kg~tGsirsih~hp~~~~las~GLDR 312 (412)
T KOG3881|consen 236 QRRPVAQFDFLENPISSTGLTPSGNFIYTGNTKG-QLAKFDLRGGKLLGCG-L-KGITGSIRSIHCHPTHPVLASCGLDR 312 (412)
T ss_pred cCcceeEeccccCcceeeeecCCCcEEEEecccc-hhheecccCceeeccc-c-CCccCCcceEEEcCCCceEEeeccce
Confidence 43211111 1122222333344566666666655 8999998876542221 0 111122234555555 5667777776
Q ss_pred EE-EEEcCCcccc
Q 045821 315 AV-VIDVGNIGNI 326 (352)
Q Consensus 315 ~~-~~d~~~~~w~ 326 (352)
.+ |||++++.-.
T Consensus 313 yvRIhD~ktrkll 325 (412)
T KOG3881|consen 313 YVRIHDIKTRKLL 325 (412)
T ss_pred eEEEeecccchhh
Confidence 55 8999986544
No 177
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=81.30 E-value=40 Score=30.16 Aligned_cols=210 Identities=10% Similarity=0.062 Sum_probs=105.1
Q ss_pred CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCC--cEEEe
Q 045821 120 KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTN--NWKLH 195 (352)
Q Consensus 120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~ 195 (352)
..+|++..-+. ..--..+.+|+..++-+.+...+.+-..++-+.+ +++..+..-+.. ..+-+|-..++ -|..+
T Consensus 52 ~~LY~v~~~~~--~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~~g~~vf~AnY~~-g~v~v~p~~~dG~l~~~v 128 (346)
T COG2706 52 RHLYVVNEPGE--EGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHS-GSVSVYPLQADGSLQPVV 128 (346)
T ss_pred CEEEEEEecCC--cCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEECCCCCEEEEEEccC-ceEEEEEcccCCccccce
Confidence 35887754322 2234556778877887777654433333232333 343333333332 66677766554 13222
Q ss_pred c------CCCCcc---cceeEEE--ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCC---CcEEEECCE-
Q 045821 196 T------EPNIFT---EIEDSFV--MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWR---GPAVVVDDA- 260 (352)
Q Consensus 196 ~------~~~~~~---~~~~~~~--~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~---~~~~~~~~~- 260 (352)
. .-|..+ ...+.+. -++++.+... -....+..|+...+.-+......-.... +.++--+++
T Consensus 129 ~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~D----LG~Dri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~ 204 (346)
T COG2706 129 QVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPD----LGTDRIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKY 204 (346)
T ss_pred eeeecCCCCCCccccCCccceeeeCCCCCEEEEee----cCCceEEEEEcccCccccccccccCCCCCcceEEEcCCCcE
Confidence 1 111111 1122333 3554444321 2234688888887766554332111112 222333444
Q ss_pred EEEEecCCC-CeEEEEeCCCCceEEcc---CCCCCcCCCCc--eEEE--eCCEEEEEc-CCceEEEEEcCCcccccceEE
Q 045821 261 LYVLDQSSG-TKLMMWQKESREWSPVG---RLSTLLTRPPC--KLVA--IGKTIFVIG-KGCSAVVIDVGNIGNIGGIMV 331 (352)
Q Consensus 261 l~~~gg~~~-~~v~~yd~~~~~W~~~~---~~~~~~~~~~~--~~~~--~~~~i~v~g-G~~~~~~~d~~~~~w~~~~~~ 331 (352)
.|++...++ -.++.||....+.+++. .+|..+..... ++.. .+.-+|+-. |.+...+|.+..++=.-....
T Consensus 205 aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~ 284 (346)
T COG2706 205 AYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGGKLELVG 284 (346)
T ss_pred EEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCCEEEEEE
Confidence 688887766 34567777777777775 56665533322 2222 244577776 455677788887764333444
Q ss_pred eccCC
Q 045821 332 SSSIP 336 (352)
Q Consensus 332 ~~~~~ 336 (352)
+....
T Consensus 285 ~~~te 289 (346)
T COG2706 285 ITPTE 289 (346)
T ss_pred EeccC
Confidence 44433
No 178
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=80.28 E-value=46 Score=30.24 Aligned_cols=120 Identities=9% Similarity=-0.081 Sum_probs=65.4
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCC---CCCCCCCeEEEEECC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCG---WSEDATSEVYCYDAS 144 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~---~~~~~~~~~~~~d~~ 144 (352)
..++|+.-.......-..+++|. .+.+- +...+. ..+....+..-+..+|+...+. ........+.+||+.
T Consensus 12 ~~~v~V~d~~~~~~~~~v~ViD~--~~~~v--~g~i~~--G~~P~~~~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~ 85 (352)
T TIGR02658 12 ARRVYVLDPGHFAATTQVYTIDG--EAGRV--LGMTDG--GFLPNPVVASDGSFFAHASTVYSRIARGKRTDYVEVIDPQ 85 (352)
T ss_pred CCEEEEECCcccccCceEEEEEC--CCCEE--EEEEEc--cCCCceeECCCCCEEEEEeccccccccCCCCCEEEEEECc
Confidence 56688874331111245677788 66443 333332 1222232333355799887632 222346789999999
Q ss_pred CCeeE-ecCCCCCccee------eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEE
Q 045821 145 MNTWT-DAAPMSTARCY------FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKL 194 (352)
Q Consensus 145 t~~W~-~~~~~~~~r~~------~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~ 194 (352)
|.+-. +++..+.||.. .....-+| .+||.--. ..+.|-+.|..+++--.
T Consensus 86 t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~-p~~~V~VvD~~~~kvv~ 142 (352)
T TIGR02658 86 THLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFS-PSPAVGVVDLEGKAFVR 142 (352)
T ss_pred cCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCC-CCCEEEEEECCCCcEEE
Confidence 98743 44433444421 11222244 57776432 12889999999987543
No 179
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=77.64 E-value=51 Score=29.31 Aligned_cols=183 Identities=12% Similarity=0.138 Sum_probs=90.2
Q ss_pred CeEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCc-EEEecCC----CCcccceeEE
Q 045821 136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNN-WKLHTEP----NIFTEIEDSF 208 (352)
Q Consensus 136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~-W~~~~~~----~~~~~~~~~~ 208 (352)
..++.||+.+++-+.... |... .+...+ ++.|.++. ..+..++++++. |+.+.+. +..+..-..+
T Consensus 47 ~~i~r~~~~~g~~~~~~~-p~~~--~~~~~~d~~g~Lv~~~-----~g~~~~~~~~~~~~t~~~~~~~~~~~~r~ND~~v 118 (307)
T COG3386 47 GRIHRLDPETGKKRVFPS-PGGF--SSGALIDAGGRLIACE-----HGVRLLDPDTGGKITLLAEPEDGLPLNRPNDGVV 118 (307)
T ss_pred CeEEEecCCcCceEEEEC-CCCc--ccceeecCCCeEEEEc-----cccEEEeccCCceeEEeccccCCCCcCCCCceeE
Confidence 567888887665443322 2111 222333 34555543 446777776443 3555532 2223333445
Q ss_pred EECCEEEEEccCC-----CC--CCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCC--
Q 045821 209 VMDGKIYIRCSAS-----AA--TSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKES-- 279 (352)
Q Consensus 209 ~~~g~iyv~GG~~-----~~--~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~-- 279 (352)
.-+|.+|+- ... .. ...-.++++|+ .....++-........+.+..-+++.+++.....+.+++|+...
T Consensus 119 ~pdG~~wfg-t~~~~~~~~~~~~~~G~lyr~~p-~g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d~~~ 196 (307)
T COG3386 119 DPDGRIWFG-DMGYFDLGKSEERPTGSLYRVDP-DGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPAT 196 (307)
T ss_pred cCCCCEEEe-CCCccccCccccCCcceEEEEcC-CCCEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEecCccc
Confidence 557888774 322 11 11127999998 45565553321222233344556654444444444899997763
Q ss_pred ----Cc--eEEccCCCCCcCCCCceEEEeCCEEEEEcC--CceEEEEEcCCcccccceEEe
Q 045821 280 ----RE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGK--GCSAVVIDVGNIGNIGGIMVS 332 (352)
Q Consensus 280 ----~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG--~~~~~~~d~~~~~w~~~~~~~ 332 (352)
+. +..... .. ...-...+--++.||+... ...+..|+++ ..-.+...++
T Consensus 197 g~~~~~~~~~~~~~-~~--G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd-G~l~~~i~lP 253 (307)
T COG3386 197 GPIGGRRGFVDFDE-EP--GLPDGMAVDADGNLWVAAVWGGGRVVRFNPD-GKLLGEIKLP 253 (307)
T ss_pred CccCCcceEEEccC-CC--CCCCceEEeCCCCEEEecccCCceEEEECCC-CcEEEEEECC
Confidence 11 111111 00 1111223334677885432 2367789998 4444555555
No 180
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=77.56 E-value=29 Score=32.31 Aligned_cols=62 Identities=6% Similarity=0.134 Sum_probs=36.2
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE--C-CEEEEEccCCCCCCceEEEEEeCCCCc
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM--D-GKIYIRCSASAATSHVCALVYEPSTDS 238 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~--~-g~iyv~GG~~~~~~~~~i~~yd~~~~~ 238 (352)
++.-|+.+|++ ..+-.+|.+|++-..-- .. ..-..++.+ + .+++++||.++ .|..+|..+++
T Consensus 269 ~g~~fLS~sfD--~~lKlwDtETG~~~~~f--~~-~~~~~cvkf~pd~~n~fl~G~sd~-----ki~~wDiRs~k 333 (503)
T KOG0282|consen 269 CGTSFLSASFD--RFLKLWDTETGQVLSRF--HL-DKVPTCVKFHPDNQNIFLVGGSDK-----KIRQWDIRSGK 333 (503)
T ss_pred cCCeeeeeecc--eeeeeeccccceEEEEE--ec-CCCceeeecCCCCCcEEEEecCCC-----cEEEEeccchH
Confidence 34556666665 56677899998754321 11 111112222 3 48999987764 57778877665
No 181
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=75.46 E-value=58 Score=28.79 Aligned_cols=145 Identities=12% Similarity=0.094 Sum_probs=72.6
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCee-EecCCCCCc------ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTW-TDAAPMSTA------RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN 191 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W-~~~~~~~~~------r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~ 191 (352)
.+.+-++||.+.+..+.+.+.+||-...+- .++. +..+ |..+-+++..++||| |...++-
T Consensus 58 ~N~laLVGGg~~pky~pNkviIWDD~k~~~i~el~-f~~~I~~V~l~r~riVvvl~~~I~V------------ytF~~n~ 124 (346)
T KOG2111|consen 58 SNYLALVGGGSRPKYPPNKVIIWDDLKERCIIELS-FNSEIKAVKLRRDRIVVVLENKIYV------------YTFPDNP 124 (346)
T ss_pred hceEEEecCCCCCCCCCceEEEEecccCcEEEEEE-eccceeeEEEcCCeEEEEecCeEEE------------EEcCCCh
Confidence 466677787765557789999999444332 2221 1111 233445555666655 4333321
Q ss_pred --EEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCceeecCC-CCCCCCCCc---EEEECCEEEEE
Q 045821 192 --WKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWLHADA-NMASGWRGP---AVVVDDALYVL 264 (352)
Q Consensus 192 --W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~-~~~~~~~~~---~~~~~~~l~~~ 264 (352)
=..+....-+..-+..+.. +..+.++-|...+ .+..-|+...+=. ++ ......... +...+|.+...
T Consensus 125 k~l~~~et~~NPkGlC~~~~~~~k~~LafPg~k~G----qvQi~dL~~~~~~--~p~~I~AH~s~Iacv~Ln~~Gt~vAT 198 (346)
T KOG2111|consen 125 KLLHVIETRSNPKGLCSLCPTSNKSLLAFPGFKTG----QVQIVDLASTKPN--APSIINAHDSDIACVALNLQGTLVAT 198 (346)
T ss_pred hheeeeecccCCCceEeecCCCCceEEEcCCCccc----eEEEEEhhhcCcC--CceEEEcccCceeEEEEcCCccEEEE
Confidence 1111111111112222222 4456666555431 3555555433321 11 111111222 23468888888
Q ss_pred ecCCCCeEEEEeCCCCce
Q 045821 265 DQSSGTKLMMWQKESREW 282 (352)
Q Consensus 265 gg~~~~~v~~yd~~~~~W 282 (352)
+...|.-|-+||..+++=
T Consensus 199 aStkGTLIRIFdt~~g~~ 216 (346)
T KOG2111|consen 199 ASTKGTLIRIFDTEDGTL 216 (346)
T ss_pred eccCcEEEEEEEcCCCcE
Confidence 888887788999998763
No 182
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=74.87 E-value=67 Score=29.23 Aligned_cols=68 Identities=9% Similarity=0.166 Sum_probs=38.4
Q ss_pred CCEEEEEe--cC------CCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCC-EEEEEcC-CceEEEEEcCCccccc
Q 045821 258 DDALYVLD--QS------SGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGK-TIFVIGK-GCSAVVIDVGNIGNIG 327 (352)
Q Consensus 258 ~~~l~~~g--g~------~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~-~i~v~gG-~~~~~~~d~~~~~w~~ 327 (352)
++++|+.. +- .++.++++|..+ ++.+..++....-...++..-+. .+|+..+ ...+.++|+++..-.+
T Consensus 259 g~~lyV~~~~~~~~thk~~~~~V~ViD~~t--~kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t~k~i~ 336 (352)
T TIGR02658 259 RDRIYLLADQRAKWTHKTASRFLFVVDAKT--GKRLRKIELGHEIDSINVSQDAKPLLYALSTGDKTLYIFDAETGKELS 336 (352)
T ss_pred CCEEEEEecCCccccccCCCCEEEEEECCC--CeEEEEEeCCCceeeEEECCCCCeEEEEeCCCCCcEEEEECcCCeEEe
Confidence 57788742 11 115899999765 44454444321122223333345 6787775 4457799988765433
No 183
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=73.82 E-value=41 Score=30.79 Aligned_cols=223 Identities=11% Similarity=0.005 Sum_probs=92.5
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
++.-+++.+. ..+....|.+|. .++.-.+|...+.. ...+..++.-+..+|.+-. ...++..|..|.+
T Consensus 46 dG~kllF~s~-~dg~~nly~lDL--~t~~i~QLTdg~g~--~~~g~~~s~~~~~~~Yv~~-------~~~l~~vdL~T~e 113 (386)
T PF14583_consen 46 DGRKLLFASD-FDGNRNLYLLDL--ATGEITQLTDGPGD--NTFGGFLSPDDRALYYVKN-------GRSLRRVDLDTLE 113 (386)
T ss_dssp TS-EEEEEE--TTSS-EEEEEET--TT-EEEE---SS-B---TTT-EE-TTSSEEEEEET-------TTEEEEEETTT--
T ss_pred CCCEEEEEec-cCCCcceEEEEc--ccCEEEECccCCCC--CccceEEecCCCeEEEEEC-------CCeEEEEECCcCc
Confidence 4545556554 566778899999 99999999886631 1223333333556654421 2578889999887
Q ss_pred eEecCCCCCcceeeeeEEE--CCEEEEEeCC---C----CC-------------CceEEEeCCCCcEEEecCCCCcccce
Q 045821 148 WTDAAPMSTARCYFPCGVL--NQKIYCIGGL---G----DT-------------HSWDVYDPRTNNWKLHTEPNIFTEIE 205 (352)
Q Consensus 148 W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~---~----~~-------------~~v~~yd~~t~~W~~~~~~~~~~~~~ 205 (352)
=+.+-..|..-......+. ++..++.--. + .. ..+..-|..|++.+.+-. ......
T Consensus 114 ~~~vy~~p~~~~g~gt~v~n~d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v~~--~~~wlg 191 (386)
T PF14583_consen 114 ERVVYEVPDDWKGYGTWVANSDCTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVVFE--DTDWLG 191 (386)
T ss_dssp EEEEEE--TTEEEEEEEEE-TTSSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEEEE--ESS-EE
T ss_pred EEEEEECCcccccccceeeCCCccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEEEe--cCcccc
Confidence 6666666654443333322 2322221100 0 00 556667777777665532 111111
Q ss_pred eE--EEECCEEEEEccCCCC-CC-ceEEEEEeCCCCceeecCCCCCCCCCCc-EEEEC-CEEEEEecCCC---CeEEEEe
Q 045821 206 DS--FVMDGKIYIRCSASAA-TS-HVCALVYEPSTDSWLHADANMASGWRGP-AVVVD-DALYVLDQSSG---TKLMMWQ 276 (352)
Q Consensus 206 ~~--~~~~g~iyv~GG~~~~-~~-~~~i~~yd~~~~~W~~~~~~~~~~~~~~-~~~~~-~~l~~~gg~~~---~~v~~yd 276 (352)
+. .-.+..+.+++ ..+. .. ...||..|........+....+.-..+. --..+ ..|+..+...+ ..+..||
T Consensus 192 H~~fsP~dp~li~fC-HEGpw~~Vd~RiW~i~~dg~~~~~v~~~~~~e~~gHEfw~~DG~~i~y~~~~~~~~~~~i~~~d 270 (386)
T PF14583_consen 192 HVQFSPTDPTLIMFC-HEGPWDLVDQRIWTINTDGSNVKKVHRRMEGESVGHEFWVPDGSTIWYDSYTPGGQDFWIAGYD 270 (386)
T ss_dssp EEEEETTEEEEEEEE-E-S-TTTSS-SEEEEETTS---EESS---TTEEEEEEEE-TTSS-EEEEEEETTT--EEEEEE-
T ss_pred CcccCCCCCCEEEEe-ccCCcceeceEEEEEEcCCCcceeeecCCCCcccccccccCCCCEEEEEeecCCCCceEEEeeC
Confidence 11 11244454542 1111 11 1267888876554444433221111111 11223 34544443232 3578889
Q ss_pred CCCCceEEccCCCCCcCCCCceEEEeCCEEEEE
Q 045821 277 KESREWSPVGRLSTLLTRPPCKLVAIGKTIFVI 309 (352)
Q Consensus 277 ~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~ 309 (352)
+++..=+.+..+|. ..+-....++++++=
T Consensus 271 ~~t~~~~~~~~~p~----~~H~~ss~Dg~L~vG 299 (386)
T PF14583_consen 271 PDTGERRRLMEMPW----CSHFMSSPDGKLFVG 299 (386)
T ss_dssp TTT--EEEEEEE-S----EEEEEE-TTSSEEEE
T ss_pred CCCCCceEEEeCCc----eeeeEEcCCCCEEEe
Confidence 99865444444432 122333445666554
No 184
>PRK10115 protease 2; Provisional
Probab=73.78 E-value=1e+02 Score=30.95 Aligned_cols=150 Identities=11% Similarity=0.017 Sum_probs=79.5
Q ss_pred CCeEEEEEC--CCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCC-CCcEEEecCCCCcccceeEEEE
Q 045821 135 TSEVYCYDA--SMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPR-TNNWKLHTEPNIFTEIEDSFVM 210 (352)
Q Consensus 135 ~~~~~~~d~--~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~-t~~W~~~~~~~~~~~~~~~~~~ 210 (352)
.+.++.++. .+..|..+...+.. ........++.+|+..-.+.. ..+...+.. ++.|+.+-+......-......
T Consensus 246 ~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ly~~tn~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~i~~~~~~ 324 (686)
T PRK10115 246 TSEVLLLDAELADAEPFVFLPRRKD-HEYSLDHYQHRFYLRSNRHGKNFGLYRTRVRDEQQWEELIPPRENIMLEGFTLF 324 (686)
T ss_pred cccEEEEECcCCCCCceEEEECCCC-CEEEEEeCCCEEEEEEcCCCCCceEEEecCCCcccCeEEECCCCCCEEEEEEEE
Confidence 456777773 23444333222222 222333456788888755433 556666766 5789888644232333344555
Q ss_pred CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEE--EC-CEEEE-EecCCC-CeEEEEeCCCCceEEc
Q 045821 211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVV--VD-DALYV-LDQSSG-TKLMMWQKESREWSPV 285 (352)
Q Consensus 211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~--~~-~~l~~-~gg~~~-~~v~~yd~~~~~W~~~ 285 (352)
++.|++.. .. .....++.+|..+.....+.-..+......... .+ +.+++ +.+... ..++.||+.+++|+.+
T Consensus 325 ~~~l~~~~-~~--~g~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ss~~~P~~~y~~d~~~~~~~~l 401 (686)
T PRK10115 325 TDWLVVEE-RQ--RGLTSLRQINRKTREVIGIAFDDPAYVTWIAYNPEPETSRLRYGYSSMTTPDTLFELDMDTGERRVL 401 (686)
T ss_pred CCEEEEEE-Ee--CCEEEEEEEcCCCCceEEecCCCCceEeeecccCCCCCceEEEEEecCCCCCEEEEEECCCCcEEEE
Confidence 78887773 22 233357788876655555531111110000011 12 33433 333332 6899999999998877
Q ss_pred cCC
Q 045821 286 GRL 288 (352)
Q Consensus 286 ~~~ 288 (352)
...
T Consensus 402 ~~~ 404 (686)
T PRK10115 402 KQT 404 (686)
T ss_pred Eec
Confidence 643
No 185
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=71.66 E-value=43 Score=33.04 Aligned_cols=63 Identities=6% Similarity=0.139 Sum_probs=36.9
Q ss_pred ECCEEEEEccCCCCCCceEEEEEeCCCCcee-ecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCC
Q 045821 210 MDGKIYIRCSASAATSHVCALVYEPSTDSWL-HADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
-.|+..+.|+..+ .|..||..+.+=. ++... ........+..+|.+++.||-++ +|-.||...
T Consensus 587 p~Gr~LaSg~ed~-----~I~iWDl~~~~~v~~l~~H-t~ti~SlsFS~dg~vLasgg~Dn-sV~lWD~~~ 650 (707)
T KOG0263|consen 587 PCGRYLASGDEDG-----LIKIWDLANGSLVKQLKGH-TGTIYSLSFSRDGNVLASGGADN-SVRLWDLTK 650 (707)
T ss_pred CCCceEeecccCC-----cEEEEEcCCCcchhhhhcc-cCceeEEEEecCCCEEEecCCCC-eEEEEEchh
Confidence 3677666654442 5777888765322 22111 11111223567999999998766 888888764
No 186
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=71.60 E-value=67 Score=27.78 Aligned_cols=171 Identities=9% Similarity=0.005 Sum_probs=80.2
Q ss_pred CCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCC-CCC--CceEEEeCCCC-------c-EEEecCCCCc
Q 045821 133 DATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGL-GDT--HSWDVYDPRTN-------N-WKLHTEPNIF 201 (352)
Q Consensus 133 ~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~-~~~--~~v~~yd~~t~-------~-W~~~~~~~~~ 201 (352)
....++..+|..+++-...-..+.+-... -...++.+.++.-. ... ..+.+||.... + ..+++ .+..
T Consensus 71 SAD~t~kLWDv~tGk~la~~k~~~~Vk~~-~F~~~gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~-t~~s 148 (327)
T KOG0643|consen 71 SADQTAKLWDVETGKQLATWKTNSPVKRV-DFSFGGNLILASTDKQMGYTCFVSVFDIRDDSSDIDSEEPYLKIP-TPDS 148 (327)
T ss_pred cccceeEEEEcCCCcEEEEeecCCeeEEE-eeccCCcEEEEEehhhcCcceEEEEEEccCChhhhcccCceEEec-CCcc
Confidence 34566788899998743333333332111 11234444444321 111 67788888733 2 44444 2221
Q ss_pred ccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc-eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCC
Q 045821 202 TEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS-WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESR 280 (352)
Q Consensus 202 ~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~-W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~ 280 (352)
....+.-.--++..+.|...+ .|-.||..++. -.+-...........-..-+.-.|+.|..+. .--.+|..+-
T Consensus 149 kit~a~Wg~l~~~ii~Ghe~G-----~is~~da~~g~~~v~s~~~h~~~Ind~q~s~d~T~FiT~s~Dt-takl~D~~tl 222 (327)
T KOG0643|consen 149 KITSALWGPLGETIIAGHEDG-----SISIYDARTGKELVDSDEEHSSKINDLQFSRDRTYFITGSKDT-TAKLVDVRTL 222 (327)
T ss_pred ceeeeeecccCCEEEEecCCC-----cEEEEEcccCceeeechhhhccccccccccCCcceEEecccCc-cceeeeccce
Confidence 111111112345555543332 58889988753 2221111111101111223445555555443 5566666553
Q ss_pred ceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc
Q 045821 281 EWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC 313 (352)
Q Consensus 281 ~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~ 313 (352)
+-.+- .....+....++..+.+++.+-||.+
T Consensus 223 ~v~Kt--y~te~PvN~aaisP~~d~VilgGGqe 253 (327)
T KOG0643|consen 223 EVLKT--YTTERPVNTAAISPLLDHVILGGGQE 253 (327)
T ss_pred eeEEE--eeecccccceecccccceEEecCCce
Confidence 33221 11112344567777888888888865
No 187
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=70.44 E-value=95 Score=29.10 Aligned_cols=108 Identities=11% Similarity=-0.032 Sum_probs=60.6
Q ss_pred CCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE
Q 045821 134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK 213 (352)
Q Consensus 134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~ 213 (352)
...+++.+|..+++-..+.....--...+...-+.+|+..........+++||++..+=+++..-..... .....-+|+
T Consensus 260 g~~~iy~~dl~~~~~~~Lt~~~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~~riT~~~~~~~-~p~~SpdG~ 338 (425)
T COG0823 260 GSPDIYLMDLDGKNLPRLTNGFGINTSPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQVTRLTFSGGGNS-NPVWSPDGD 338 (425)
T ss_pred CCccEEEEcCCCCcceecccCCccccCccCCCCCCEEEEEeCCCCCcceEEECCCCCceeEeeccCCCCc-CccCCCCCC
Confidence 3578999999988744443332211222222223445544443333899999999887666653222221 223334565
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCc-eeecCC
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDS-WLHADA 244 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~-W~~~~~ 244 (352)
.+++-+... +. ..+..+|+.++. |+.+..
T Consensus 339 ~i~~~~~~~-g~-~~i~~~~~~~~~~~~~lt~ 368 (425)
T COG0823 339 KIVFESSSG-GQ-WDIDKNDLASGGKIRILTS 368 (425)
T ss_pred EEEEEeccC-Cc-eeeEEeccCCCCcEEEccc
Confidence 555544321 11 578889987776 887754
No 188
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=69.90 E-value=78 Score=27.86 Aligned_cols=129 Identities=12% Similarity=0.099 Sum_probs=67.4
Q ss_pred CCeEEEEECCCCeeEecCCCCCc-ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCC-CCcccceeEEEECC
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTA-RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEP-NIFTEIEDSFVMDG 212 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~-r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~-~~~~~~~~~~~~~g 212 (352)
.+.+-.||+.+++-..++.--.| |..|-.-..+..+.+.|..+ ..+-.+|+.... .+..+ .+.+.++ +-+...
T Consensus 93 Dk~~k~wDL~S~Q~~~v~~Hd~pvkt~~wv~~~~~~cl~TGSWD--KTlKfWD~R~~~--pv~t~~LPeRvYa-~Dv~~p 167 (347)
T KOG0647|consen 93 DKQAKLWDLASGQVSQVAAHDAPVKTCHWVPGMNYQCLVTGSWD--KTLKFWDTRSSN--PVATLQLPERVYA-ADVLYP 167 (347)
T ss_pred CCceEEEEccCCCeeeeeecccceeEEEEecCCCcceeEecccc--cceeecccCCCC--eeeeeeccceeee-hhccCc
Confidence 45678899999987777643332 33232222233455555554 556666666332 12111 1222222 222233
Q ss_pred EEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEE
Q 045821 213 KIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMW 275 (352)
Q Consensus 213 ~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~y 275 (352)
.+.|..+. ..|.+|++.+.. ...+..+........++..+++.|.+|+.+| .+.+.
T Consensus 168 m~vVata~------r~i~vynL~n~~te~k~~~SpLk~Q~R~va~f~d~~~~alGsiEG-rv~iq 225 (347)
T KOG0647|consen 168 MAVVATAE------RHIAVYNLENPPTEFKRIESPLKWQTRCVACFQDKDGFALGSIEG-RVAIQ 225 (347)
T ss_pred eeEEEecC------CcEEEEEcCCCcchhhhhcCcccceeeEEEEEecCCceEeeeecc-eEEEE
Confidence 33333221 248888886553 3444444444444456777888999999988 44443
No 189
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=69.90 E-value=77 Score=27.85 Aligned_cols=132 Identities=14% Similarity=0.119 Sum_probs=63.4
Q ss_pred CeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEE
Q 045821 136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIY 215 (352)
Q Consensus 136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iy 215 (352)
.++-.||..++.-+..-....| --.++..+..=.+.||.+ ..+-+||..+..=.++..-..+-..-.-. ......
T Consensus 35 gslrlYdv~~~~l~~~~~~~~p--lL~c~F~d~~~~~~G~~d--g~vr~~Dln~~~~~~igth~~~i~ci~~~-~~~~~v 109 (323)
T KOG1036|consen 35 GSLRLYDVPANSLKLKFKHGAP--LLDCAFADESTIVTGGLD--GQVRRYDLNTGNEDQIGTHDEGIRCIEYS-YEVGCV 109 (323)
T ss_pred CcEEEEeccchhhhhheecCCc--eeeeeccCCceEEEeccC--ceEEEEEecCCcceeeccCCCceEEEEee-ccCCeE
Confidence 3466777776632211111111 123333444444556666 67889999988755554222211111111 223334
Q ss_pred EEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc
Q 045821 216 IRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE 281 (352)
Q Consensus 216 v~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~ 281 (352)
|.||-+ ..|..+|+.+..=.. ...... .-.+..+.+..+++|+... .+..||..+..
T Consensus 110 IsgsWD-----~~ik~wD~R~~~~~~--~~d~~k-kVy~~~v~g~~LvVg~~~r-~v~iyDLRn~~ 166 (323)
T KOG1036|consen 110 ISGSWD-----KTIKFWDPRNKVVVG--TFDQGK-KVYCMDVSGNRLVVGTSDR-KVLIYDLRNLD 166 (323)
T ss_pred EEcccC-----ccEEEEecccccccc--ccccCc-eEEEEeccCCEEEEeecCc-eEEEEEccccc
Confidence 555444 356777775511110 101111 1112345566666766444 89999987643
No 190
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=69.42 E-value=60 Score=32.06 Aligned_cols=59 Identities=15% Similarity=0.154 Sum_probs=34.1
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc--cceeEEEECCEEEEEccCCCCCCceEEEEEeC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT--EIEDSFVMDGKIYIRCSASAATSHVCALVYEP 234 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~--~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~ 234 (352)
+|+-.+.|+.+ ..+..||..+.+ .+..+.... -....+..+|.+.+.||.+. .+..+|.
T Consensus 588 ~Gr~LaSg~ed--~~I~iWDl~~~~--~v~~l~~Ht~ti~SlsFS~dg~vLasgg~Dn-----sV~lWD~ 648 (707)
T KOG0263|consen 588 CGRYLASGDED--GLIKIWDLANGS--LVKQLKGHTGTIYSLSFSRDGNVLASGGADN-----SVRLWDL 648 (707)
T ss_pred CCceEeecccC--CcEEEEEcCCCc--chhhhhcccCceeEEEEecCCCEEEecCCCC-----eEEEEEc
Confidence 45555555444 678889988764 222222221 11234566999999987763 4555555
No 191
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=69.18 E-value=1.1e+02 Score=29.19 Aligned_cols=173 Identities=10% Similarity=0.058 Sum_probs=81.4
Q ss_pred eEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc-cceeEEEECCE
Q 045821 137 EVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT-EIEDSFVMDGK 213 (352)
Q Consensus 137 ~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~-~~~~~~~~~g~ 213 (352)
.++.++-.+..-..+.++. +...+++.. +|...++|-.+ ..|++||..+.. .+..+...+ ...++..-++.
T Consensus 198 ~vylW~~~s~~v~~l~~~~--~~~vtSv~ws~~G~~LavG~~~--g~v~iwD~~~~k--~~~~~~~~h~~rvg~laW~~~ 271 (484)
T KOG0305|consen 198 SVYLWSASSGSVTELCSFG--EELVTSVKWSPDGSHLAVGTSD--GTVQIWDVKEQK--KTRTLRGSHASRVGSLAWNSS 271 (484)
T ss_pred eEEEEecCCCceEEeEecC--CCceEEEEECCCCCEEEEeecC--CeEEEEehhhcc--ccccccCCcCceeEEEeccCc
Confidence 4455555555544444443 233333333 46666666544 678899988663 444444422 22334455777
Q ss_pred EEEEccCCCCCCceEEEEEeCCCCcee-ecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEE-ccCCCCC
Q 045821 214 IYIRCSASAATSHVCALVYEPSTDSWL-HADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSP-VGRLSTL 291 (352)
Q Consensus 214 iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~-~~~~~~~ 291 (352)
+...|.... .|..+|....+=. ..-.......++.....++..+.-||.++ .+.+||.....+.. +.....
T Consensus 272 ~lssGsr~~-----~I~~~dvR~~~~~~~~~~~H~qeVCgLkws~d~~~lASGgnDN-~~~Iwd~~~~~p~~~~~~H~a- 344 (484)
T KOG0305|consen 272 VLSSGSRDG-----KILNHDVRISQHVVSTLQGHRQEVCGLKWSPDGNQLASGGNDN-VVFIWDGLSPEPKFTFTEHTA- 344 (484)
T ss_pred eEEEecCCC-----cEEEEEEecchhhhhhhhcccceeeeeEECCCCCeeccCCCcc-ceEeccCCCccccEEEeccce-
Confidence 788765553 4556655322110 00000000011111233555555566555 88999883322211 100000
Q ss_pred cCCCCceEEEeCCEEEEEcCCce---EEEEEcCCc
Q 045821 292 LTRPPCKLVAIGKTIFVIGKGCS---AVVIDVGNI 323 (352)
Q Consensus 292 ~~~~~~~~~~~~~~i~v~gG~~~---~~~~d~~~~ 323 (352)
.....+.+.....|+..||... .-++|+.+.
T Consensus 345 -AVKA~awcP~q~~lLAsGGGs~D~~i~fwn~~~g 378 (484)
T KOG0305|consen 345 -AVKALAWCPWQSGLLATGGGSADRCIKFWNTNTG 378 (484)
T ss_pred -eeeEeeeCCCccCceEEcCCCcccEEEEEEcCCC
Confidence 1112233445667777776542 224555543
No 192
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=68.98 E-value=65 Score=30.98 Aligned_cols=105 Identities=11% Similarity=0.091 Sum_probs=56.0
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE--CCEEEEEccCCCCCCceEEEEEeCCCCce-eecC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM--DGKIYIRCSASAATSHVCALVYEPSTDSW-LHAD 243 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~ 243 (352)
+-.+|+.|.. .+|+++|.+.+.|-. +.......--++.+ -+.|..+|+..+ .++.+|+.+.+- ..+.
T Consensus 145 scDly~~gsg---~evYRlNLEqGrfL~--P~~~~~~~lN~v~in~~hgLla~Gt~~g-----~VEfwDpR~ksrv~~l~ 214 (703)
T KOG2321|consen 145 SCDLYLVGSG---SEVYRLNLEQGRFLN--PFETDSGELNVVSINEEHGLLACGTEDG-----VVEFWDPRDKSRVGTLD 214 (703)
T ss_pred CccEEEeecC---cceEEEEcccccccc--ccccccccceeeeecCccceEEecccCc-----eEEEecchhhhhheeee
Confidence 3457776643 789999999999843 22222111122333 346777765453 688888866542 2221
Q ss_pred C------CCCCC--CCCcEEEE-CCEEEEEecCCCCeEEEEeCCCCc
Q 045821 244 A------NMASG--WRGPAVVV-DDALYVLDQSSGTKLMMWQKESRE 281 (352)
Q Consensus 244 ~------~~~~~--~~~~~~~~-~~~l~~~gg~~~~~v~~yd~~~~~ 281 (352)
. .+... ..-.++.+ ++.|-+--|.....+++||+.+.+
T Consensus 215 ~~~~v~s~pg~~~~~svTal~F~d~gL~~aVGts~G~v~iyDLRa~~ 261 (703)
T KOG2321|consen 215 AASSVNSHPGGDAAPSVTALKFRDDGLHVAVGTSTGSVLIYDLRASK 261 (703)
T ss_pred cccccCCCccccccCcceEEEecCCceeEEeeccCCcEEEEEcccCC
Confidence 1 11111 11123334 435655545444489999987654
No 193
>PF12217 End_beta_propel: Catalytic beta propeller domain of bacteriophage endosialidase; InterPro: IPR024428 This entry represents the beta propeller domain of endosialidases, which consists of catalytically active part of the enzymes. This core domain forms stable SDS-resistant trimers. There is a nested beta barrel domain in this domain. This domain is typically between 443 and 460 amino acids in length [].; PDB: 1V0E_B 1V0F_E 3JU4_A 3GVL_A 3GVK_B 3GVJ_A.
Probab=68.55 E-value=77 Score=27.29 Aligned_cols=194 Identities=13% Similarity=0.257 Sum_probs=86.6
Q ss_pred EEECCEEEE--EeCCCCCCCCCCeEEEEECCCC-eeEe---cCCC----CC-cceeeeeEEECCEEEEEeCCCCC-----
Q 045821 116 EVLGKNAYL--LGGCGWSEDATSEVYCYDASMN-TWTD---AAPM----ST-ARCYFPCGVLNQKIYCIGGLGDT----- 179 (352)
Q Consensus 116 ~~~~~~iyv--~GG~~~~~~~~~~~~~~d~~t~-~W~~---~~~~----~~-~r~~~~~~~~~~~iyv~gG~~~~----- 179 (352)
.+.++-||. ++|....-..+.-.|+=....+ +|.. +.++ |. .-..+++.++++++|.+=....-
T Consensus 22 FVy~~VIYAPfM~~~RHGv~~LhvaWVkSgDdG~TWttPEwLtd~H~~yptvnyHCmSMGv~~NRLfa~iEtR~~a~~km 101 (367)
T PF12217_consen 22 FVYDNVIYAPFMAGDRHGVDNLHVAWVKSGDDGQTWTTPEWLTDLHPDYPTVNYHCMSMGVVGNRLFAVIETRTVASNKM 101 (367)
T ss_dssp EEETTEEEEEEEEESSSSSTT-EEEEEEESSTTSS----EESS---TTTTTEEEE-B-EEEETTEEEEEEEEEETTT--E
T ss_pred eeecCeeecccccccccCccceEEEEEEecCCCCcccCchhhhhcCCCCCccceeeeeeeeecceeeEEEeehhhhhhhh
Confidence 445666652 3333222233445566555543 4743 2222 22 23356777889999887553211
Q ss_pred CceEEEe---CCCCcEEEec--CCCC-------cccceeEEEECCEEEEEccCCCCCC--ceEEEEEeCCC---Cce--e
Q 045821 180 HSWDVYD---PRTNNWKLHT--EPNI-------FTEIEDSFVMDGKIYIRCSASAATS--HVCALVYEPST---DSW--L 240 (352)
Q Consensus 180 ~~v~~yd---~~t~~W~~~~--~~~~-------~~~~~~~~~~~g~iyv~GG~~~~~~--~~~i~~yd~~~---~~W--~ 240 (352)
...+.|| .....|+... ..+. ....|+.+.+++.-|.+|=.++.-. ...+..|...- ... +
T Consensus 102 ~~~~Lw~RpMF~~spW~~teL~~~~~~~~a~~~vTe~HSFa~i~~~~fA~GyHnGD~sPRe~G~~yfs~~~~sp~~~vrr 181 (367)
T PF12217_consen 102 VRAELWSRPMFHDSPWRITELGTIASFTSAGVAVTELHSFATIDDNQFAVGYHNGDVSPRELGFLYFSDAFASPGVFVRR 181 (367)
T ss_dssp EEEEEEEEE-STTS--EEEEEES-TT--------SEEEEEEE-SSS-EEEEEEE-SSSS-EEEEEEETTTTT-TT--EEE
T ss_pred hhhhhhcccccccCCceeeecccccccccccceeeeeeeeeEecCCceeEEeccCCCCcceeeEEEecccccCCcceeee
Confidence 3334444 4577887643 2222 1223455777887788854333222 22444442211 111 1
Q ss_pred ecCCCCCCCCCCcE-EEECCEEEEEe-cC----CCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc
Q 045821 241 HADANMASGWRGPA-VVVDDALYVLD-QS----SGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG 310 (352)
Q Consensus 241 ~~~~~~~~~~~~~~-~~~~~~l~~~g-g~----~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g 310 (352)
.++.......+.+. -..+|+||++- |. -++.+.+-+.....|+.+.- |...-....-++..++.||+||
T Consensus 182 ~i~sey~~~AsEPCvkyY~g~LyLtTRgt~~~~~GS~L~rs~d~G~~w~slrf-p~nvHhtnlPFakvgD~l~mFg 256 (367)
T PF12217_consen 182 IIPSEYERNASEPCVKYYDGVLYLTTRGTLPTNPGSSLHRSDDNGQNWSSLRF-PNNVHHTNLPFAKVGDVLYMFG 256 (367)
T ss_dssp E--GGG-TTEEEEEEEEETTEEEEEEEES-TTS---EEEEESSTTSS-EEEE--TT---SS---EEEETTEEEEEE
T ss_pred echhhhccccccchhhhhCCEEEEEEcCcCCCCCcceeeeecccCCchhhccc-cccccccCCCceeeCCEEEEEe
Confidence 22211111111111 24699999884 22 23678888888999998863 3333333445667899999998
No 194
>COG4447 Uncharacterized protein related to plant photosystem II stability/assembly factor [General function prediction only]
Probab=67.98 E-value=53 Score=28.59 Aligned_cols=113 Identities=11% Similarity=0.056 Sum_probs=66.9
Q ss_pred eEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeec
Q 045821 163 CGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHA 242 (352)
Q Consensus 163 ~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~ 242 (352)
.+.-+.+.+++|+.. .+..=|-.-++|.+...+...+..+...+.+.+=++.| .. ..++.-+-...+|.++
T Consensus 50 ~~~~g~~gwlVg~rg---tiletdd~g~tw~qal~~~gr~~f~sv~f~~~egw~vG-e~-----sqll~T~DgGqsWARi 120 (339)
T COG4447 50 FTESGSHGWLVGGRG---TILETDDGGITWAQALDFLGRHAFHSVSFLGMEGWIVG-EP-----SQLLHTTDGGQSWARI 120 (339)
T ss_pred EeecCcceEEEcCcc---eEEEecCCcccchhhhchhhhhheeeeeeecccccccC-Cc-----ceEEEecCCCcchhhc
Confidence 333467899999863 34455666778998765544444554444454455552 22 3466666778899998
Q ss_pred CCCCCCCCCCc--EEEECCEEEEEecCCCCeEEEEeCCCCceEEcc
Q 045821 243 DANMASGWRGP--AVVVDDALYVLDQSSGTKLMMWQKESREWSPVG 286 (352)
Q Consensus 243 ~~~~~~~~~~~--~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~ 286 (352)
+.......... .+..+..-+++|.+. .|+.-+-..+.|+.+.
T Consensus 121 ~~~e~~eg~~~sI~f~d~q~g~m~gd~G--ail~T~DgGk~Wk~l~ 164 (339)
T COG4447 121 PLSEKLEGFPDSITFLDDQRGEMLGDQG--AILKTTDGGKNWKALV 164 (339)
T ss_pred hhhcCCCCCcceeEEecchhhhhhcccc--eEEEecCCcccHhHhc
Confidence 65332222222 233344456666643 5666666778898775
No 195
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=67.95 E-value=1e+02 Score=28.39 Aligned_cols=129 Identities=12% Similarity=0.007 Sum_probs=57.7
Q ss_pred EEEecCCCCCCCeeEeccCCCcccCCcceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCCe-eEecCCCCCcceeee
Q 045821 85 CYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMNT-WTDAAPMSTARCYFP 162 (352)
Q Consensus 85 ~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~~-W~~~~~~~~~r~~~~ 162 (352)
++..|. .++..+.+-.-. ...+|..... +..+.+|--...-......+|..|..... |.....++.....|-
T Consensus 170 i~~idl--~tG~~~~v~~~~----~wlgH~~fsP~dp~li~fCHEGpw~~Vd~RiW~i~~dg~~~~~v~~~~~~e~~gHE 243 (386)
T PF14583_consen 170 IFTIDL--KTGERKVVFEDT----DWLGHVQFSPTDPTLIMFCHEGPWDLVDQRIWTINTDGSNVKKVHRRMEGESVGHE 243 (386)
T ss_dssp EEEEET--TT--EEEEEEES----S-EEEEEEETTEEEEEEEEE-S-TTTSS-SEEEEETTS---EESS---TTEEEEEE
T ss_pred EEEEEC--CCCceeEEEecC----ccccCcccCCCCCCEEEEeccCCcceeceEEEEEEcCCCcceeeecCCCCcccccc
Confidence 334455 666655554322 1223333333 33445553222222234589999987555 444444444555665
Q ss_pred eEEECCE-EEEEeCCCCC--CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCC
Q 045821 163 CGVLNQK-IYCIGGLGDT--HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSAS 221 (352)
Q Consensus 163 ~~~~~~~-iyv~gG~~~~--~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~ 221 (352)
--..+|. |+..+..... .-+..||+.|..=+.+..+| ...|.....+|++++--|.+
T Consensus 244 fw~~DG~~i~y~~~~~~~~~~~i~~~d~~t~~~~~~~~~p--~~~H~~ss~Dg~L~vGDG~d 303 (386)
T PF14583_consen 244 FWVPDGSTIWYDSYTPGGQDFWIAGYDPDTGERRRLMEMP--WCSHFMSSPDGKLFVGDGGD 303 (386)
T ss_dssp EE-TTSS-EEEEEEETTT--EEEEEE-TTT--EEEEEEE---SEEEEEE-TTSSEEEEEE--
T ss_pred cccCCCCEEEEEeecCCCCceEEEeeCCCCCCceEEEeCC--ceeeeEEcCCCCEEEecCCC
Confidence 5555553 3332322222 55677999987544443343 35566667788988866654
No 196
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=67.45 E-value=86 Score=27.47 Aligned_cols=134 Identities=14% Similarity=0.152 Sum_probs=69.0
Q ss_pred CCeEEEEECCCCeeEecCCCCCc-ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CC
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTA-RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DG 212 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~-r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g 212 (352)
..++-.+|..+++-...-.-... ....+...-+.+.||.||++ .....+|.....-.+.-.-. .....++.++ +|
T Consensus 165 D~TCalWDie~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD--~~aklWD~R~~~c~qtF~gh-esDINsv~ffP~G 241 (343)
T KOG0286|consen 165 DMTCALWDIETGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCD--KSAKLWDVRSGQCVQTFEGH-ESDINSVRFFPSG 241 (343)
T ss_pred CceEEEEEcccceEEEEecCCcccEEEEecCCCCCCeEEecccc--cceeeeeccCcceeEeeccc-ccccceEEEccCC
Confidence 45677788888764332110111 11111111267899999988 56677888877554432111 1112233333 67
Q ss_pred EEEEEccCCCCCCceEEEEEeCCCCceeecC--CCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeC
Q 045821 213 KIYIRCSASAATSHVCALVYEPSTDSWLHAD--ANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQK 277 (352)
Q Consensus 213 ~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~--~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~ 277 (352)
.-++.|.-+ ...-.||+..++=-.+- .....+..+.++...|+|++.|..+. ...+||.
T Consensus 242 ~afatGSDD-----~tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlLfagy~d~-~c~vWDt 302 (343)
T KOG0286|consen 242 DAFATGSDD-----ATCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLLFAGYDDF-TCNVWDT 302 (343)
T ss_pred CeeeecCCC-----ceeEEEeecCCcEEeeeccCcccCCceeEEEcccccEEEeeecCC-ceeEeec
Confidence 777775333 24667887766433331 11222223334556888887775433 5566553
No 197
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=67.09 E-value=30 Score=30.88 Aligned_cols=102 Identities=14% Similarity=0.214 Sum_probs=64.3
Q ss_pred ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC
Q 045821 166 LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN 245 (352)
Q Consensus 166 ~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~ 245 (352)
++++ |++....+ ..+-+++..|..+-+ .+....++-++.-++|++.|.|..+ +.|-.+|...+..-.+-..
T Consensus 328 fd~k-yIVsASgD-RTikvW~~st~efvR--tl~gHkRGIAClQYr~rlvVSGSSD-----ntIRlwdi~~G~cLRvLeG 398 (499)
T KOG0281|consen 328 FDDK-YIVSASGD-RTIKVWSTSTCEFVR--TLNGHKRGIACLQYRDRLVVSGSSD-----NTIRLWDIECGACLRVLEG 398 (499)
T ss_pred cccc-eEEEecCC-ceEEEEeccceeeeh--hhhcccccceehhccCeEEEecCCC-----ceEEEEeccccHHHHHHhc
Confidence 4666 44333222 677888888876543 3455556666777899999997443 4677888877665443110
Q ss_pred CCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCC
Q 045821 246 MASGWRGPAVVVDDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 246 ~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
... ---.+-++++=.+-|+++| .+-+||..+
T Consensus 399 HEe--LvRciRFd~krIVSGaYDG-kikvWdl~a 429 (499)
T KOG0281|consen 399 HEE--LVRCIRFDNKRIVSGAYDG-KIKVWDLQA 429 (499)
T ss_pred hHH--hhhheeecCceeeeccccc-eEEEEeccc
Confidence 000 0112567888888999988 777777654
No 198
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=66.75 E-value=1.5e+02 Score=29.98 Aligned_cols=71 Identities=11% Similarity=0.080 Sum_probs=40.5
Q ss_pred eEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcE
Q 045821 113 MGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNW 192 (352)
Q Consensus 113 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W 192 (352)
+.....+...||.-. .-+.+.+||..+-.--.+. -+.|..-.+++.....+|+..| +.+.+|--....|
T Consensus 39 ~~~~~~~~~~~vtt~------vgksfqvYd~~kl~ll~vs-~~lp~~I~alas~~~~vy~A~g----~~i~~~~rgk~i~ 107 (910)
T KOG1539|consen 39 FRVVALGSTFYVTTC------VGKSFQVYDVNKLNLLFVS-KPLPDKITALASDKDYVYVASG----NKIYAYARGKHIR 107 (910)
T ss_pred eeeeecCceEEEEEe------cCceEEEEeccceEEEEec-CCCCCceEEEEecCceEEEecC----cEEEEEEccceEE
Confidence 344555666666632 2356788888765544444 2333344555666667777777 5666665554444
Q ss_pred EE
Q 045821 193 KL 194 (352)
Q Consensus 193 ~~ 194 (352)
-.
T Consensus 108 ~~ 109 (910)
T KOG1539|consen 108 HT 109 (910)
T ss_pred EE
Confidence 33
No 199
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=66.46 E-value=1e+02 Score=27.88 Aligned_cols=204 Identities=15% Similarity=0.137 Sum_probs=101.1
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE------CCEEEEEeCCCCCCceEEEeCCCCcE
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL------NQKIYCIGGLGDTHSWDVYDPRTNNW 192 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~------~~~iyv~gG~~~~~~v~~yd~~t~~W 192 (352)
...=|+|... ..+.+-+||...|+..+ ..++|-++++ --.+.+.||.+ ..+-++|+.|..
T Consensus 203 ~rHpYlFs~g-----edk~VKCwDLe~nkvIR------~YhGHlS~V~~L~lhPTldvl~t~grD--st~RvWDiRtr~- 268 (460)
T KOG0285|consen 203 KRHPYLFSAG-----EDKQVKCWDLEYNKVIR------HYHGHLSGVYCLDLHPTLDVLVTGGRD--STIRVWDIRTRA- 268 (460)
T ss_pred ccCceEEEec-----CCCeeEEEechhhhhHH------HhccccceeEEEeccccceeEEecCCc--ceEEEeeecccc-
Confidence 4445666542 23568889998775322 1122322222 13456667766 667788888764
Q ss_pred EEecCCCCcccceeEEE---ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC
Q 045821 193 KLHTEPNIFTEIEDSFV---MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG 269 (352)
Q Consensus 193 ~~~~~~~~~~~~~~~~~---~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~ 269 (352)
.+..+.....-...+. .+.+||-- ..++ .+-.+|...++-...- ..+.....+.+..-+.+++.....
T Consensus 269 -~V~~l~GH~~~V~~V~~~~~dpqvit~-S~D~-----tvrlWDl~agkt~~tl--t~hkksvral~lhP~e~~fASas~ 339 (460)
T KOG0285|consen 269 -SVHVLSGHTNPVASVMCQPTDPQVITG-SHDS-----TVRLWDLRAGKTMITL--THHKKSVRALCLHPKENLFASASP 339 (460)
T ss_pred -eEEEecCCCCcceeEEeecCCCceEEe-cCCc-----eEEEeeeccCceeEee--ecccceeeEEecCCchhhhhccCC
Confidence 2221222111112222 25666552 3331 3445555544322211 111222233455666666654333
Q ss_pred CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEE-EEcCCcccccceEEeccCCCCCCCCcceeeE
Q 045821 270 TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVV-IDVGNIGNIGGIMVSSSIPKLNDNDDIISCK 348 (352)
Q Consensus 270 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~-~d~~~~~w~~~~~~~~~~~~~~~~~~~~~~~ 348 (352)
+.+-.++.-.+.... ++.. .....-++....+-+|+.||.+..++ +|-+..- .=....+.-.|++...+..+.|.
T Consensus 340 dnik~w~~p~g~f~~--nlsg-h~~iintl~~nsD~v~~~G~dng~~~fwdwksg~-nyQ~~~t~vqpGSl~sEagI~as 415 (460)
T KOG0285|consen 340 DNIKQWKLPEGEFLQ--NLSG-HNAIINTLSVNSDGVLVSGGDNGSIMFWDWKSGH-NYQRGQTIVQPGSLESEAGIFAS 415 (460)
T ss_pred ccceeccCCccchhh--cccc-ccceeeeeeeccCceEEEcCCceEEEEEecCcCc-ccccccccccCCccccccceeEE
Confidence 356666655544322 1221 11222356667788999998776554 6655422 22233344558888888666666
Q ss_pred E
Q 045821 349 C 349 (352)
Q Consensus 349 ~ 349 (352)
|
T Consensus 416 ~ 416 (460)
T KOG0285|consen 416 C 416 (460)
T ss_pred e
Confidence 5
No 200
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=65.71 E-value=1.2e+02 Score=28.46 Aligned_cols=103 Identities=15% Similarity=0.023 Sum_probs=56.7
Q ss_pred CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC
Q 045821 180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD 259 (352)
Q Consensus 180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~ 259 (352)
.+++.+|..+.+=.++..... ...+-+..-+|+-+++.... .....|+.+|....+=+++........ .....-+|
T Consensus 262 ~~iy~~dl~~~~~~~Lt~~~g-i~~~Ps~spdG~~ivf~Sdr--~G~p~I~~~~~~g~~~~riT~~~~~~~-~p~~SpdG 337 (425)
T COG0823 262 PDIYLMDLDGKNLPRLTNGFG-INTSPSWSPDGSKIVFTSDR--GGRPQIYLYDLEGSQVTRLTFSGGGNS-NPVWSPDG 337 (425)
T ss_pred ccEEEEcCCCCcceecccCCc-cccCccCCCCCCEEEEEeCC--CCCcceEEECCCCCceeEeeccCCCCc-CccCCCCC
Confidence 889999999876333332211 11122333455544442211 222378999998877676654332222 33344455
Q ss_pred EEEEEecCC-C-CeEEEEeCCCCc-eEEcc
Q 045821 260 ALYVLDQSS-G-TKLMMWQKESRE-WSPVG 286 (352)
Q Consensus 260 ~l~~~gg~~-~-~~v~~yd~~~~~-W~~~~ 286 (352)
+.+++-+.. + ..+..+|+.++. |+.+.
T Consensus 338 ~~i~~~~~~~g~~~i~~~~~~~~~~~~~lt 367 (425)
T COG0823 338 DKIVFESSSGGQWDIDKNDLASGGKIRILT 367 (425)
T ss_pred CEEEEEeccCCceeeEEeccCCCCcEEEcc
Confidence 544443322 2 368899998777 87775
No 201
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=65.24 E-value=1.3e+02 Score=28.68 Aligned_cols=86 Identities=17% Similarity=0.095 Sum_probs=52.4
Q ss_pred EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeC-CEEEEEcCCceEEEEEcCCcccccce-EEec
Q 045821 256 VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIG-KTIFVIGKGCSAVVIDVGNIGNIGGI-MVSS 333 (352)
Q Consensus 256 ~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~-~~i~v~gG~~~~~~~d~~~~~w~~~~-~~~~ 333 (352)
.-+++-++-|.++| .|..||..++.=.++..-+. ...-.++...+ +.++.+|=-+...+++.+.+...+.. .-..
T Consensus 329 ~~d~~~i~SgsyDG-~I~~W~~~~g~~~~~~g~~h--~nqI~~~~~~~~~~~~t~g~Dd~l~~~~~~~~~~t~~~~~~lg 405 (603)
T KOG0318|consen 329 SPDGKTIYSGSYDG-HINSWDSGSGTSDRLAGKGH--TNQIKGMAASESGELFTIGWDDTLRVISLKDNGYTKSEVVKLG 405 (603)
T ss_pred cCCCCEEEeeccCc-eEEEEecCCccccccccccc--cceEEEEeecCCCcEEEEecCCeEEEEecccCcccccceeecC
Confidence 34565555555666 88888888877665542221 22233455555 77888876666667888777766543 3455
Q ss_pred cCCCCCCCCcc
Q 045821 334 SIPKLNDNDDI 344 (352)
Q Consensus 334 ~~~~~~~~~~~ 344 (352)
+.|.-.++.+.
T Consensus 406 ~QP~~lav~~d 416 (603)
T KOG0318|consen 406 SQPKGLAVLSD 416 (603)
T ss_pred CCceeEEEcCC
Confidence 66766666543
No 202
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=65.00 E-value=1.1e+02 Score=27.96 Aligned_cols=149 Identities=10% Similarity=0.018 Sum_probs=70.2
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEE-ECCEEEEEeCCCCCCceEEEeCCCCcE-EEec
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGV-LNQKIYCIGGLGDTHSWDVYDPRTNNW-KLHT 196 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~-~~~~iyv~gG~~~~~~v~~yd~~t~~W-~~~~ 196 (352)
+..+|+.+. ...+.++|+.+++ .+...+........+. -+|+..+++.+.. ..+.++|.+|.+= +.++
T Consensus 48 gr~~yv~~r-------dg~vsviD~~~~~--~v~~i~~G~~~~~i~~s~DG~~~~v~n~~~-~~v~v~D~~tle~v~~I~ 117 (369)
T PF02239_consen 48 GRYLYVANR-------DGTVSVIDLATGK--VVATIKVGGNPRGIAVSPDGKYVYVANYEP-GTVSVIDAETLEPVKTIP 117 (369)
T ss_dssp SSEEEEEET-------TSEEEEEETTSSS--EEEEEE-SSEEEEEEE--TTTEEEEEEEET-TEEEEEETTT--EEEEEE
T ss_pred CCEEEEEcC-------CCeEEEEECCccc--EEEEEecCCCcceEEEcCCCCEEEEEecCC-CceeEeccccccceeecc
Confidence 457898853 2468899999887 3333333333333333 3554333333322 7888899887642 2333
Q ss_pred C--CCC---cccceeE-EEECCEEEEEccCCCCCCceEEEEEeCCCCc--e-eecCCCCCCCCCCcEEEECCEEEEEecC
Q 045821 197 E--PNI---FTEIEDS-FVMDGKIYIRCSASAATSHVCALVYEPSTDS--W-LHADANMASGWRGPAVVVDDALYVLDQS 267 (352)
Q Consensus 197 ~--~~~---~~~~~~~-~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W-~~~~~~~~~~~~~~~~~~~~~l~~~gg~ 267 (352)
. ++. ..+..+. ....+..|++.-.+ ...++..|....+ . +.+.. ....+.....-+++-|+++..
T Consensus 118 ~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd----~~~I~vVdy~d~~~~~~~~i~~--g~~~~D~~~dpdgry~~va~~ 191 (369)
T PF02239_consen 118 TGGMPVDGPESRVAAIVASPGRPEFVVNLKD----TGEIWVVDYSDPKNLKVTTIKV--GRFPHDGGFDPDGRYFLVAAN 191 (369)
T ss_dssp --EE-TTTS---EEEEEE-SSSSEEEEEETT----TTEEEEEETTTSSCEEEEEEE----TTEEEEEE-TTSSEEEEEEG
T ss_pred cccccccccCCCceeEEecCCCCEEEEEEcc----CCeEEEEEeccccccceeeecc--cccccccccCcccceeeeccc
Confidence 1 111 1111222 22344545552222 2257777765432 1 11111 111122233345565666555
Q ss_pred CCCeEEEEeCCCCceE
Q 045821 268 SGTKLMMWQKESREWS 283 (352)
Q Consensus 268 ~~~~v~~yd~~~~~W~ 283 (352)
..+.+.+.|.+++.-.
T Consensus 192 ~sn~i~viD~~~~k~v 207 (369)
T PF02239_consen 192 GSNKIAVIDTKTGKLV 207 (369)
T ss_dssp GGTEEEEEETTTTEEE
T ss_pred ccceeEEEeeccceEE
Confidence 5568888888876544
No 203
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=63.36 E-value=95 Score=26.50 Aligned_cols=129 Identities=13% Similarity=0.041 Sum_probs=69.5
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSEDATSEVYCYDASMN 146 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~ 146 (352)
++.|+..||. -..|..|. ++++.+..-.-. ..+-|+++.-+ +.-.+.|+.+ ..+-++|..|.
T Consensus 126 enSi~~AgGD-----~~~y~~dl--E~G~i~r~~rGH----tDYvH~vv~R~~~~qilsG~ED------GtvRvWd~kt~ 188 (325)
T KOG0649|consen 126 ENSILFAGGD-----GVIYQVDL--EDGRIQREYRGH----TDYVHSVVGRNANGQILSGAED------GTVRVWDTKTQ 188 (325)
T ss_pred CCcEEEecCC-----eEEEEEEe--cCCEEEEEEcCC----cceeeeeeecccCcceeecCCC------ccEEEEecccc
Confidence 6777777764 34666788 888776554332 12334444422 2223345443 34677888887
Q ss_pred eeEec-CC-----CCCccee-e-eeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEc
Q 045821 147 TWTDA-AP-----MSTARCY-F-PCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRC 218 (352)
Q Consensus 147 ~W~~~-~~-----~~~~r~~-~-~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~G 218 (352)
+-..+ .+ ...|..+ + .+...+....|+||- ...-.++..+.+-..+ .|.+...+.+.++++.+.+.|
T Consensus 189 k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGgG---p~lslwhLrsse~t~v--fpipa~v~~v~F~~d~vl~~G 263 (325)
T KOG0649|consen 189 KHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGGG---PKLSLWHLRSSESTCV--FPIPARVHLVDFVDDCVLIGG 263 (325)
T ss_pred ceeEEeccccChhhcCcccCceeEEEeccCceEEecCC---CceeEEeccCCCceEE--EecccceeEeeeecceEEEec
Confidence 64332 22 1222222 2 233335566677664 4455666666554444 344455555666777777665
No 204
>COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only]
Probab=62.58 E-value=1.3e+02 Score=27.93 Aligned_cols=184 Identities=13% Similarity=0.105 Sum_probs=88.5
Q ss_pred eeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCee-EecCCCCCcceee
Q 045821 83 VCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTW-TDAAPMSTARCYF 161 (352)
Q Consensus 83 ~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W-~~~~~~~~~r~~~ 161 (352)
...+++|. .-+.--++..+.. ..| -+++-..++..|++-= ...+-+++.|...-+= +.+..+.-|-+.-
T Consensus 406 N~vYilDe--~lnvvGkltGl~~--gER-IYAvRf~gdv~yiVTf-----rqtDPlfviDlsNPenPkvlGeLKIPGfS~ 475 (603)
T COG4880 406 NAVYILDE--NLNVVGKLTGLAP--GER-IYAVRFVGDVLYIVTF-----RQTDPLFVIDLSNPENPKVLGELKIPGFSE 475 (603)
T ss_pred ceeEEEcC--CCcEEEEEeccCC--Cce-EEEEEEeCceEEEEEE-----eccCceEEEEcCCCCCCceeEEEecCCchh
Confidence 45788887 7777777776651 123 2445556777777642 2234466666653221 1122222222221
Q ss_pred eeEEEC-CEEEEEeCCCCCCceEEEeCCCCc-EEEec------CCCCcccceeEEEECC--EEEEEccCCCCCCceEEEE
Q 045821 162 PCGVLN-QKIYCIGGLGDTHSWDVYDPRTNN-WKLHT------EPNIFTEIEDSFVMDG--KIYIRCSASAATSHVCALV 231 (352)
Q Consensus 162 ~~~~~~-~~iyv~gG~~~~~~v~~yd~~t~~-W~~~~------~~~~~~~~~~~~~~~g--~iyv~GG~~~~~~~~~i~~ 231 (352)
-...++ +++.=+|.+++.-.+-.||.+.-. =+.+. -..+.-+.|+++..|. .|+.+-.+.+ ..++.
T Consensus 476 YLHpigen~~lGvG~~~g~vKiSLFdiSdl~~PkEv~~y~l~~~wspvf~dhHAFl~d~~~~ifFlPay~~----gyif~ 551 (603)
T COG4880 476 YLHPIGENRLLGVGAYQGGVKISLFDISDLAAPKEVSNYTLSNAWSPVFYDHHAFLYDPEAEIFFLPAYLG----GYIFF 551 (603)
T ss_pred hccccCCCcEEEeecccCCceEEEEeccCCCCchhhhheehhhhcchhhhccceeecCCcccEEEecccCc----cEEEE
Confidence 222233 455555555544555666664311 00010 0112234456666653 4554422111 02333
Q ss_pred EeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCC
Q 045821 232 YEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRL 288 (352)
Q Consensus 232 yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~ 288 (352)
+.. + -..+...-...-.-.+...++-+|++||. .+|+|| .|+|+.++..
T Consensus 552 ied--g-~kl~k~~e~k~na~RA~fi~dylY~vg~~---ev~~ld--enswe~Vge~ 600 (603)
T COG4880 552 IED--G-SKLRKRAERKLNADRAFFIKDYLYLVGGN---EVWKLD--ENSWEVVGEA 600 (603)
T ss_pred Eec--C-ceeeehhhhcccceeeEEecceEEEeccc---eeEEec--cchHhhhhhe
Confidence 322 2 11111100000112246789999999997 899988 5779887643
No 205
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=61.95 E-value=1.1e+02 Score=26.90 Aligned_cols=63 Identities=16% Similarity=0.096 Sum_probs=34.2
Q ss_pred CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEecCCCCeEEEEeCCC
Q 045821 211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
+|.....||.+. .+-.+|+.+++=.++...-.+-+...-+.. +-.+++.|..+. .+-.||+..
T Consensus 83 dgskVf~g~~Dk-----~~k~wDL~S~Q~~~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDK-TlKfWD~R~ 146 (347)
T KOG0647|consen 83 DGSKVFSGGCDK-----QAKLWDLASGQVSQVAAHDAPVKTCHWVPGMNYQCLVTGSWDK-TLKFWDTRS 146 (347)
T ss_pred CCceEEeeccCC-----ceEEEEccCCCeeeeeecccceeEEEEecCCCcceeEeccccc-ceeecccCC
Confidence 454444445553 477899999988887652111111111111 123556666655 777778774
No 206
>PTZ00420 coronin; Provisional
Probab=61.18 E-value=1.7e+02 Score=28.67 Aligned_cols=148 Identities=12% Similarity=0.128 Sum_probs=70.0
Q ss_pred EEEEEeCCCCCCCCCCeEEEEECCCCeeE-ecCCCCCcceeeeeE-EECCEEEEEeCCCCCCceEEEeCCCCcEEEecCC
Q 045821 121 NAYLLGGCGWSEDATSEVYCYDASMNTWT-DAAPMSTARCYFPCG-VLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEP 198 (352)
Q Consensus 121 ~iyv~GG~~~~~~~~~~~~~~d~~t~~W~-~~~~~~~~r~~~~~~-~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~ 198 (352)
.+++.||.+ ..+.+||..+.+-. .+. .+. .-.++. ..++.+++.++.+ ..+.+||+.+.+= +...
T Consensus 139 ~iLaSgS~D------gtIrIWDl~tg~~~~~i~-~~~--~V~SlswspdG~lLat~s~D--~~IrIwD~Rsg~~--i~tl 205 (568)
T PTZ00420 139 YIMCSSGFD------SFVNIWDIENEKRAFQIN-MPK--KLSSLKWNIKGNLLSGTCVG--KHMHIIDPRKQEI--ASSF 205 (568)
T ss_pred eEEEEEeCC------CeEEEEECCCCcEEEEEe-cCC--cEEEEEECCCCCEEEEEecC--CEEEEEECCCCcE--EEEE
Confidence 445555543 45778898876521 121 111 111121 1267777776644 6789999987642 2111
Q ss_pred CCcccc---eeEEE-----ECCEEEEEccCCCCCCceEEEEEeCCC-CceeecCCCCCCCCCCcEE-E---ECCEEEEEe
Q 045821 199 NIFTEI---EDSFV-----MDGKIYIRCSASAATSHVCALVYEPST-DSWLHADANMASGWRGPAV-V---VDDALYVLD 265 (352)
Q Consensus 199 ~~~~~~---~~~~~-----~~g~iyv~GG~~~~~~~~~i~~yd~~~-~~W~~~~~~~~~~~~~~~~-~---~~~~l~~~g 265 (352)
. .+.. ..++. -++...+.+|.... ....+..||..+ ..-..... .. ...+... . ..+.+|+.|
T Consensus 206 ~-gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~-~~R~VkLWDlr~~~~pl~~~~-ld-~~~~~L~p~~D~~tg~l~lsG 281 (568)
T PTZ00420 206 H-IHDGGKNTKNIWIDGLGGDDNYILSTGFSKN-NMREMKLWDLKNTTSALVTMS-ID-NASAPLIPHYDESTGLIYLIG 281 (568)
T ss_pred e-cccCCceeEEEEeeeEcCCCCEEEEEEcCCC-CccEEEEEECCCCCCceEEEE-ec-CCccceEEeeeCCCCCEEEEE
Confidence 1 1111 11111 24455555455432 222467777753 22111100 00 0111111 1 146677776
Q ss_pred cCCCCeEEEEeCCCCceEEcc
Q 045821 266 QSSGTKLMMWQKESREWSPVG 286 (352)
Q Consensus 266 g~~~~~v~~yd~~~~~W~~~~ 286 (352)
..++ .+..|+...+.-..+.
T Consensus 282 kGD~-tIr~~e~~~~~~~~l~ 301 (568)
T PTZ00420 282 KGDG-NCRYYQHSLGSIRKVN 301 (568)
T ss_pred ECCC-eEEEEEccCCcEEeec
Confidence 4444 8999998777655554
No 207
>KOG3545 consensus Olfactomedin and related extracellular matrix glycoproteins [Extracellular structures]
Probab=60.42 E-value=1.1e+02 Score=26.17 Aligned_cols=190 Identities=16% Similarity=0.221 Sum_probs=106.7
Q ss_pred CCEEEEEEecCCCCceeEEEe----cCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEEC
Q 045821 68 ETWIYAFCRDNKLERVCCYVL----DPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDA 143 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~----d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~ 143 (352)
++++|++. .......+.| |. ....+...-.+| .+..+-..++.+|.+|.-. .....+.+||.
T Consensus 30 ~~r~~~~~---~~~~~~l~E~~~~~~~--~~~~~~~~~~lp---~~~~gTg~VVynGs~yynk------~~t~~ivky~l 95 (249)
T KOG3545|consen 30 DDRIYVMN---YFDGLMLTEYTNLEDF--KRGRKAEKYRLP---YSWDGTGHVVYNGSLYYNK------AGTRNIIKYDL 95 (249)
T ss_pred cCceEEec---cccCceEEEeccHHHh--hccCcceEEeCC---CCccccceEEEcceEEeec------cCCcceEEEEe
Confidence 56788883 2233344434 23 445555555566 4455566678888888764 33567889999
Q ss_pred CCCe---eEecCCCCC---------cceeeeeEEECCEEEEEeCCCCC--Cce-EEEeCCC----CcEEEecCCCCcccc
Q 045821 144 SMNT---WTDAAPMST---------ARCYFPCGVLNQKIYCIGGLGDT--HSW-DVYDPRT----NNWKLHTEPNIFTEI 204 (352)
Q Consensus 144 ~t~~---W~~~~~~~~---------~r~~~~~~~~~~~iyv~gG~~~~--~~v-~~yd~~t----~~W~~~~~~~~~~~~ 204 (352)
.++. +..++.+.. +......++...-++++--.... ..| -..|+.| .+|..-- ..+..
T Consensus 96 ~~~~~~~~~~lp~a~y~~~~~y~~~g~sdiD~avDE~GLWviYat~~~~g~iv~skLdp~tl~~e~tW~T~~---~k~~~ 172 (249)
T KOG3545|consen 96 ETRTVAGSAALPYAGYHNPSPYYWGGHSDIDLAVDENGLWVIYATPENAGTIVLSKLDPETLEVERTWNTTL---PKRSA 172 (249)
T ss_pred ecceeeeeeeccccccCCCcccccCCCccccceecccceeEEecccccCCcEEeeccCHHHhheeeeecccc---CCCCc
Confidence 8854 555554321 12233455554555555443322 333 5667743 3575422 22333
Q ss_pred eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEE----EECCEEEEEecCCCCeEEEEeCC
Q 045821 205 EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAV----VVDDALYVLDQSSGTKLMMWQKE 278 (352)
Q Consensus 205 ~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~----~~~~~l~~~gg~~~~~v~~yd~~ 278 (352)
..++.+=|.||++-....... .--++||..+++=..+.-+.+......+. ..+.+||+.... .+..|++.
T Consensus 173 ~~aF~iCGvLY~v~S~~~~~~-~i~yaydt~~~~~~~~~ipf~N~y~~~~~idYNP~D~~LY~wdng---~~l~y~l~ 246 (249)
T KOG3545|consen 173 GNAFMICGVLYVVHSYNCTHT-QISYAYDTTTGTQERIDLPFPNPYSYATMIDYNPRDRRLYAWDNG---HQLTYNLT 246 (249)
T ss_pred CceEEEeeeeEEEeccccCCc-eEEEEEEcCCCceecccccccchhhhhhccCCCcccceeeEecCC---cEEEEEeE
Confidence 345666788899854443221 12378999988776665555554444432 347889988543 56777653
No 208
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=59.84 E-value=1.4e+02 Score=27.11 Aligned_cols=53 Identities=11% Similarity=0.134 Sum_probs=34.2
Q ss_pred ECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc
Q 045821 257 VDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC 313 (352)
Q Consensus 257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~ 313 (352)
-+|+ |++..-++..+.+||..+.+....-..+. ..-.++....++.||+.|+-
T Consensus 344 p~Gk-yi~ScaDDktlrvwdl~~~~cmk~~~ah~---hfvt~lDfh~~~p~VvTGsV 396 (406)
T KOG0295|consen 344 PGGK-YILSCADDKTLRVWDLKNLQCMKTLEAHE---HFVTSLDFHKTAPYVVTGSV 396 (406)
T ss_pred CCCe-EEEEEecCCcEEEEEeccceeeeccCCCc---ceeEEEecCCCCceEEeccc
Confidence 3455 55554444589999999988766544332 33334555677889988764
No 209
>PF13088 BNR_2: BNR repeat-like domain; PDB: 2F11_A 2F0Z_A 1VCU_B 2F25_B 1SO7_A 2F29_A 1SNT_A 2F13_A 2F28_A 2F27_A ....
Probab=59.67 E-value=1.1e+02 Score=26.17 Aligned_cols=214 Identities=12% Similarity=0.158 Sum_probs=98.2
Q ss_pred CCCCeeEeccCCCcc--cCCcceEEEE--ECCEEEEEe--CCCCC-CCCCCeEEEEECC-CCeeEecCCCCCc------c
Q 045821 93 TRRSWKMIHELPART--LRRKGMGFEV--LGKNAYLLG--GCGWS-EDATSEVYCYDAS-MNTWTDAAPMSTA------R 158 (352)
Q Consensus 93 ~~~~w~~~~~~~~~~--~~~~~~~~~~--~~~~iyv~G--G~~~~-~~~~~~~~~~d~~-t~~W~~~~~~~~~------r 158 (352)
...+|.....+.... .......+.. .++.|+++- +.... .....-.+..... -.+|.....++.. .
T Consensus 28 ~G~tWs~~~~v~~~~~~~~~~~~p~~~~~~~g~l~l~~~~~~~~~~~~~~~~~~~~S~D~G~TWs~~~~l~~~~~~~~~~ 107 (275)
T PF13088_consen 28 GGKTWSEPRIVADGPKPGRRYGNPSLVVDPDGRLWLFYSAGSSGGGWSGSRIYYSRSTDGGKTWSEPTDLPPGWFGNFSG 107 (275)
T ss_dssp CTTEEEEEEEEETSTBTTCEEEEEEEEEETTSEEEEEEEEEETTESCCTCEEEEEEESSTTSS-EEEEEEHHHCCCSCEE
T ss_pred CCCeeCCCEEEeeccccCCcccCcEEEEeCCCCEEEEEEEccCCCCCCceeEEEEEECCCCCCCCCccccccccccceec
Confidence 567898765433221 1122222222 278888885 22111 1111112355555 3468876543311 1
Q ss_pred --eeeeeEEECCEEEEEeCCC-C-C-CceEEEeCC-CCcEEEecCCCCc-ccce-eEEE-ECCEEEEEccCCCCCCceEE
Q 045821 159 --CYFPCGVLNQKIYCIGGLG-D-T-HSWDVYDPR-TNNWKLHTEPNIF-TEIE-DSFV-MDGKIYIRCSASAATSHVCA 229 (352)
Q Consensus 159 --~~~~~~~~~~~iyv~gG~~-~-~-~~v~~yd~~-t~~W~~~~~~~~~-~~~~-~~~~-~~g~iyv~GG~~~~~~~~~i 229 (352)
.......-++++++..-.. . . ..+..|... -.+|+....+... .... ..+. -+|+|+++.-.. ... ...
T Consensus 108 ~~~~~~i~~~~G~l~~~~~~~~~~~~~~~~~~S~D~G~tW~~~~~~~~~~~~~e~~~~~~~dG~l~~~~R~~-~~~-~~~ 185 (275)
T PF13088_consen 108 PGRGPPIQLPDGRLIAPYYHESGGSFSAFVYYSDDGGKTWSSGSPIPDGQGECEPSIVELPDGRLLAVFRTE-GND-DIY 185 (275)
T ss_dssp CSEEEEEEECTTEEEEEEEEESSCEEEEEEEEESSTTSSEEEEEECECSEEEEEEEEEEETTSEEEEEEEEC-SST-EEE
T ss_pred cceeeeeEecCCCEEEEEeeccccCcceEEEEeCCCCceeeccccccccCCcceeEEEECCCCcEEEEEEcc-CCC-cEE
Confidence 1111233378887763111 1 1 334444444 4579887654332 2222 2222 377898884332 111 333
Q ss_pred EEEeC-CCCceeecCC-CCCCCCCCcE-EE-ECCEEEEEecCC--CCeE--EEEeCCCCceEEccCCCCCc--CCCCceE
Q 045821 230 LVYEP-STDSWLHADA-NMASGWRGPA-VV-VDDALYVLDQSS--GTKL--MMWQKESREWSPVGRLSTLL--TRPPCKL 299 (352)
Q Consensus 230 ~~yd~-~~~~W~~~~~-~~~~~~~~~~-~~-~~~~l~~~gg~~--~~~v--~~yd~~~~~W~~~~~~~~~~--~~~~~~~ 299 (352)
..+.. ...+|+.... ..+....... +. -+++++++.... ...+ ..-..+..+|.....+.... .....++
T Consensus 186 ~~~S~D~G~TWs~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~~l~l~~S~D~g~tW~~~~~i~~~~~~~~~Y~~~ 265 (275)
T PF13088_consen 186 ISRSTDGGRTWSPPQPTNLPNPNSSISLVRLSDGRLLLVYNNPDGRSNLSLYVSEDGGKTWSRPKTIDDGPNGDSGYPSL 265 (275)
T ss_dssp EEEESSTTSS-EEEEEEECSSCCEEEEEEECTTSEEEEEEECSSTSEEEEEEEECTTCEEEEEEEEEEEEE-CCEEEEEE
T ss_pred EEEECCCCCcCCCceecccCcccCCceEEEcCCCCEEEEEECCCCCCceEEEEEeCCCCcCCccEEEeCCCCCcEECCee
Confidence 33333 3568987532 1222222222 22 367888777631 1233 33344478999776554422 1223344
Q ss_pred EEe-CCEEEE
Q 045821 300 VAI-GKTIFV 308 (352)
Q Consensus 300 ~~~-~~~i~v 308 (352)
+.. +++|||
T Consensus 266 ~~~~dg~l~i 275 (275)
T PF13088_consen 266 TQLPDGKLYI 275 (275)
T ss_dssp EEEETTEEEE
T ss_pred EEeCCCcCCC
Confidence 444 568886
No 210
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=59.60 E-value=79 Score=27.94 Aligned_cols=104 Identities=16% Similarity=0.251 Sum_probs=55.6
Q ss_pred CEEEEEeCCCCCCceEEEeCCCCcE----EEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecC
Q 045821 168 QKIYCIGGLGDTHSWDVYDPRTNNW----KLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHAD 243 (352)
Q Consensus 168 ~~iyv~gG~~~~~~v~~yd~~t~~W----~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~ 243 (352)
..|.+.|..+ +.+-.||...-+= +.+.+--+ ..+-++.-.|...++|- ....+..||..+-+.-.-.
T Consensus 184 e~ILiS~srD--~tvKlFDfsK~saKrA~K~~qd~~~--vrsiSfHPsGefllvgT-----dHp~~rlYdv~T~Qcfvsa 254 (430)
T KOG0640|consen 184 ETILISGSRD--NTVKLFDFSKTSAKRAFKVFQDTEP--VRSISFHPSGEFLLVGT-----DHPTLRLYDVNTYQCFVSA 254 (430)
T ss_pred hheEEeccCC--CeEEEEecccHHHHHHHHHhhccce--eeeEeecCCCceEEEec-----CCCceeEEeccceeEeeec
Confidence 3455555444 7788888864321 11111000 01112222566666652 2345778888765543222
Q ss_pred CCCCCCCCCcEE----EECCEEEEEecCCCCeEEEEeCCCCce
Q 045821 244 ANMASGWRGPAV----VVDDALYVLDQSSGTKLMMWQKESREW 282 (352)
Q Consensus 244 ~~~~~~~~~~~~----~~~~~l~~~gg~~~~~v~~yd~~~~~W 282 (352)
. |......+.. .-.++||+.|..+| .|-.||-.++..
T Consensus 255 n-Pd~qht~ai~~V~Ys~t~~lYvTaSkDG-~IklwDGVS~rC 295 (430)
T KOG0640|consen 255 N-PDDQHTGAITQVRYSSTGSLYVTASKDG-AIKLWDGVSNRC 295 (430)
T ss_pred C-cccccccceeEEEecCCccEEEEeccCC-cEEeeccccHHH
Confidence 2 2222223322 23789999999888 888988877665
No 211
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=59.37 E-value=1.4e+02 Score=27.27 Aligned_cols=134 Identities=10% Similarity=-0.000 Sum_probs=67.4
Q ss_pred CCeEEEEECCCCe-eEecCCCCCcceeeeeEEE---CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce-eEEE
Q 045821 135 TSEVYCYDASMNT-WTDAAPMSTARCYFPCGVL---NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE-DSFV 209 (352)
Q Consensus 135 ~~~~~~~d~~t~~-W~~~~~~~~~r~~~~~~~~---~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~-~~~~ 209 (352)
.+++.+.|..+++ -..++.... .|....+ +..+|+.+. + ..+-++|+.+.+ .+...+...... .+..
T Consensus 15 ~~~v~viD~~t~~~~~~i~~~~~---~h~~~~~s~Dgr~~yv~~r-d--g~vsviD~~~~~--~v~~i~~G~~~~~i~~s 86 (369)
T PF02239_consen 15 SGSVAVIDGATNKVVARIPTGGA---PHAGLKFSPDGRYLYVANR-D--GTVSVIDLATGK--VVATIKVGGNPRGIAVS 86 (369)
T ss_dssp GTEEEEEETTT-SEEEEEE-STT---EEEEEE-TT-SSEEEEEET-T--SEEEEEETTSSS--EEEEEE-SSEEEEEEE-
T ss_pred CCEEEEEECCCCeEEEEEcCCCC---ceeEEEecCCCCEEEEEcC-C--CeEEEEECCccc--EEEEEecCCCcceEEEc
Confidence 5678888988875 344443222 2444433 347899863 3 678999999887 232222222222 2344
Q ss_pred ECCEEEEEccCCCCCCceEEEEEeCCCCce-eecCCC-----CCCCCCCcE-EEECCEEEEEecCCCCeEEEEeCCCC
Q 045821 210 MDGKIYIRCSASAATSHVCALVYEPSTDSW-LHADAN-----MASGWRGPA-VVVDDALYVLDQSSGTKLMMWQKESR 280 (352)
Q Consensus 210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~~~-----~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~yd~~~~ 280 (352)
-+|+..+++... ...+..+|..+.+= ..++.. ....+..+. ....+..|++.-.+...++..|....
T Consensus 87 ~DG~~~~v~n~~----~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVdy~d~ 160 (369)
T PF02239_consen 87 PDGKYVYVANYE----PGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVVDYSDP 160 (369)
T ss_dssp -TTTEEEEEEEE----TTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEEETTTS
T ss_pred CCCCEEEEEecC----CCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEEEeccc
Confidence 567654443322 12678889877643 233221 111111111 22355667776554458999887654
No 212
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=58.48 E-value=42 Score=30.19 Aligned_cols=70 Identities=23% Similarity=0.356 Sum_probs=39.0
Q ss_pred CCEEEEEeC---CCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE-CC--EEEEEeCCCCCCceEEEeCCCCcE
Q 045821 119 GKNAYLLGG---CGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQ--KIYCIGGLGDTHSWDVYDPRTNNW 192 (352)
Q Consensus 119 ~~~iyv~GG---~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~--~iyv~gG~~~~~~v~~yd~~t~~W 192 (352)
.+.+|+..- ......+..++|.||+.+++--.--++..+ -.+..+- +. .+|.+.+.+ ..+.+||..|++-
T Consensus 249 ~~rlyvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l~~~--~~Si~Vsqd~~P~L~~~~~~~--~~l~v~D~~tGk~ 324 (342)
T PF06433_consen 249 SGRLYVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPLEHP--IDSIAVSQDDKPLLYALSAGD--GTLDVYDAATGKL 324 (342)
T ss_dssp TTEEEEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEEEEEE--ESEEEEESSSS-EEEEEETTT--TEEEEEETTT--E
T ss_pred cCeEEEEecCCCCCCccCCceEEEEEECCCCeEEEEEeCCCc--cceEEEccCCCcEEEEEcCCC--CeEEEEeCcCCcE
Confidence 678998742 222235678999999999873222122222 2123333 33 566654422 5789999999853
No 213
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=58.17 E-value=1.5e+02 Score=27.19 Aligned_cols=150 Identities=14% Similarity=0.110 Sum_probs=82.1
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANM 246 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~ 246 (352)
+.++|+..+.. +.+.++|..+++-............-.....++.+|+..... ....+...|..+++=.....--
T Consensus 85 ~~~vyv~~~~~--~~v~vid~~~~~~~~~~~vG~~P~~~~~~~~~~~vYV~n~~~---~~~~vsvid~~t~~~~~~~~vG 159 (381)
T COG3391 85 GNKVYVTTGDS--NTVSVIDTATNTVLGSIPVGLGPVGLAVDPDGKYVYVANAGN---GNNTVSVIDAATNKVTATIPVG 159 (381)
T ss_pred CCeEEEecCCC--CeEEEEcCcccceeeEeeeccCCceEEECCCCCEEEEEeccc---CCceEEEEeCCCCeEEEEEecC
Confidence 56899998754 788899977765433222222222212223356999984321 2347888999888755441111
Q ss_pred CCCCCCcEEE-ECCEEEEEecCCCCeEEEEeCCCCceEEccC----CCCCcCCCCceEEEeCCEEEEEcCCc---eEEEE
Q 045821 247 ASGWRGPAVV-VDDALYVLDQSSGTKLMMWQKESREWSPVGR----LSTLLTRPPCKLVAIGKTIFVIGKGC---SAVVI 318 (352)
Q Consensus 247 ~~~~~~~~~~-~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~----~~~~~~~~~~~~~~~~~~i~v~gG~~---~~~~~ 318 (352)
..+ .+.++. .++++|+..... +.+..+|+.++.-.+ +. .+.......-.+..-+.++|+.-... ....+
T Consensus 160 ~~P-~~~a~~p~g~~vyv~~~~~-~~v~vi~~~~~~v~~-~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~~~~~~v~~i 236 (381)
T COG3391 160 NTP-TGVAVDPDGNKVYVTNSDD-NTVSVIDTSGNSVVR-GSVGSLVGVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKI 236 (381)
T ss_pred CCc-ceEEECCCCCeEEEEecCC-CeEEEEeCCCcceec-cccccccccCCCCceEEECCCCCEEEEEeccCCCceEEEE
Confidence 111 333443 356688887333 489999988776554 32 22211111122223466788876544 45566
Q ss_pred EcCCcc
Q 045821 319 DVGNIG 324 (352)
Q Consensus 319 d~~~~~ 324 (352)
|...+.
T Consensus 237 d~~~~~ 242 (381)
T COG3391 237 DTATGN 242 (381)
T ss_pred eCCCce
Confidence 666543
No 214
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=57.44 E-value=1.8e+02 Score=27.68 Aligned_cols=90 Identities=8% Similarity=-0.015 Sum_probs=46.7
Q ss_pred CceEEEeCCCCc----EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceee-cCCCCCCCCCCcE
Q 045821 180 HSWDVYDPRTNN----WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLH-ADANMASGWRGPA 254 (352)
Q Consensus 180 ~~v~~yd~~t~~----W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~-~~~~~~~~~~~~~ 254 (352)
..|..||..... |.+... .+.++-+.+..+..|++.-|.+. .|..||.....-+. +. ...+....+
T Consensus 187 G~VtlwDv~g~sp~~~~~~~Hs--AP~~gicfspsne~l~vsVG~Dk-----ki~~yD~~s~~s~~~l~--y~~Plstva 257 (673)
T KOG4378|consen 187 GAVTLWDVQGMSPIFHASEAHS--APCRGICFSPSNEALLVSVGYDK-----KINIYDIRSQASTDRLT--YSHPLSTVA 257 (673)
T ss_pred CeEEEEeccCCCcccchhhhcc--CCcCcceecCCccceEEEecccc-----eEEEeecccccccceee--ecCCcceee
Confidence 456667776542 433321 11122223345677777666663 68889987544322 11 122222223
Q ss_pred EEECCEEEEEecCCCCeEEEEeCCC
Q 045821 255 VVVDDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 255 ~~~~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
+.-+|.+++.|...| .++.||+..
T Consensus 258 f~~~G~~L~aG~s~G-~~i~YD~R~ 281 (673)
T KOG4378|consen 258 FSECGTYLCAGNSKG-ELIAYDMRS 281 (673)
T ss_pred ecCCceEEEeecCCc-eEEEEeccc
Confidence 333555555555555 899999874
No 215
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=55.47 E-value=1.2e+02 Score=26.92 Aligned_cols=108 Identities=9% Similarity=0.159 Sum_probs=55.6
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCccccee-EEEE--CCEEEEEccCCCCCCceEEEEEeCCCCcee-ec
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIED-SFVM--DGKIYIRCSASAATSHVCALVYEPSTDSWL-HA 242 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~-~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~ 242 (352)
.|...++|... ..+..||..|-+.-.-..|...+.... .+-+ .++||+-|..++ .|-.+|--.++.. .+
T Consensus 227 sGefllvgTdH--p~~rlYdv~T~QcfvsanPd~qht~ai~~V~Ys~t~~lYvTaSkDG-----~IklwDGVS~rCv~t~ 299 (430)
T KOG0640|consen 227 SGEFLLVGTDH--PTLRLYDVNTYQCFVSANPDDQHTGAITQVRYSSTGSLYVTASKDG-----AIKLWDGVSNRCVRTI 299 (430)
T ss_pred CCceEEEecCC--CceeEEeccceeEeeecCcccccccceeEEEecCCccEEEEeccCC-----cEEeeccccHHHHHHH
Confidence 34444544322 778899999887654333222222221 1222 689999976654 4666666555432 23
Q ss_pred CCCCCCCCCCcE-EEECCEEEEEecCCCCeEEEEeCCCCce
Q 045821 243 DANMASGWRGPA-VVVDDALYVLDQSSGTKLMMWQKESREW 282 (352)
Q Consensus 243 ~~~~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~yd~~~~~W 282 (352)
...........+ +.-+++-.+-.|.+. .+..|.+.++.-
T Consensus 300 ~~AH~gsevcSa~Ftkn~kyiLsSG~DS-~vkLWEi~t~R~ 339 (430)
T KOG0640|consen 300 GNAHGGSEVCSAVFTKNGKYILSSGKDS-TVKLWEISTGRM 339 (430)
T ss_pred HhhcCCceeeeEEEccCCeEEeecCCcc-eeeeeeecCCce
Confidence 221211111122 234666555556544 566666766654
No 216
>KOG1520 consensus Predicted alkaloid synthase/Surface mucin Hemomucin [General function prediction only]
Probab=55.45 E-value=1.7e+02 Score=26.80 Aligned_cols=69 Identities=13% Similarity=0.161 Sum_probs=39.6
Q ss_pred CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCC---CCCCc-EEEECCEEEEEecCC----------------CC
Q 045821 211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMAS---GWRGP-AVVVDDALYVLDQSS----------------GT 270 (352)
Q Consensus 211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~---~~~~~-~~~~~~~l~~~gg~~----------------~~ 270 (352)
+|.+||.-++- .++.-++..+.=+.+.+.... .+... .+.-+|.+|...... ..
T Consensus 126 ggdL~VaDAYl------GL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~~g~vyFTDSSsk~~~rd~~~a~l~g~~~G 199 (376)
T KOG1520|consen 126 GGDLYVADAYL------GLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDPEGVVYFTDSSSKYDRRDFVFAALEGDPTG 199 (376)
T ss_pred CCeEEEEecce------eeEEECCCCCcceeccccccCeeeeecCceeEcCCCeEEEeccccccchhheEEeeecCCCcc
Confidence 55899984433 578888887775554432211 11111 122355666554322 13
Q ss_pred eEEEEeCCCCceEEc
Q 045821 271 KLMMWQKESREWSPV 285 (352)
Q Consensus 271 ~v~~yd~~~~~W~~~ 285 (352)
.+.+||+.++.=+.+
T Consensus 200 Rl~~YD~~tK~~~VL 214 (376)
T KOG1520|consen 200 RLFRYDPSTKVTKVL 214 (376)
T ss_pred ceEEecCcccchhhh
Confidence 799999999876555
No 217
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=54.70 E-value=1.8e+02 Score=27.02 Aligned_cols=119 Identities=13% Similarity=0.099 Sum_probs=63.2
Q ss_pred ceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCC
Q 045821 112 GMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRT 189 (352)
Q Consensus 112 ~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t 189 (352)
..+++.+ +|.||..| .....+-+||..+.. .++.+|.--..-....+ ++--|++-+.++ .+|..+|...
T Consensus 350 ~ts~~fHpDgLifgtg------t~d~~vkiwdlks~~--~~a~Fpght~~vk~i~FsENGY~Lat~add-~~V~lwDLRK 420 (506)
T KOG0289|consen 350 YTSAAFHPDGLIFGTG------TPDGVVKIWDLKSQT--NVAKFPGHTGPVKAISFSENGYWLATAADD-GSVKLWDLRK 420 (506)
T ss_pred eEEeeEcCCceEEecc------CCCceEEEEEcCCcc--ccccCCCCCCceeEEEeccCceEEEEEecC-CeEEEEEehh
Confidence 4444444 44555544 334567788988775 55555541111222333 233333344343 4589999986
Q ss_pred Cc-EEEecCCCCcccceeEEEE--CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCC
Q 045821 190 NN-WKLHTEPNIFTEIEDSFVM--DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANM 246 (352)
Q Consensus 190 ~~-W~~~~~~~~~~~~~~~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~ 246 (352)
.. ...+. ++. .....+..+ .|+..+++|.. ..++.|+-.+..|+++....
T Consensus 421 l~n~kt~~-l~~-~~~v~s~~fD~SGt~L~~~g~~-----l~Vy~~~k~~k~W~~~~~~~ 473 (506)
T KOG0289|consen 421 LKNFKTIQ-LDE-KKEVNSLSFDQSGTYLGIAGSD-----LQVYICKKKTKSWTEIKELA 473 (506)
T ss_pred hcccceee-ccc-cccceeEEEcCCCCeEEeecce-----eEEEEEecccccceeeehhh
Confidence 54 33332 111 112233334 46666665332 46788888999999986543
No 218
>PF14298 DUF4374: Domain of unknown function (DUF4374)
Probab=54.40 E-value=1.9e+02 Score=27.11 Aligned_cols=59 Identities=17% Similarity=0.210 Sum_probs=38.3
Q ss_pred CceEEEeCCCCcEEEecCCCCc---ccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCce
Q 045821 180 HSWDVYDPRTNNWKLHTEPNIF---TEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSW 239 (352)
Q Consensus 180 ~~v~~yd~~t~~W~~~~~~~~~---~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W 239 (352)
..+-+||..+.+...+..+|.. .......+-+|++|+-=. ...+....|+.+|+.+.+=
T Consensus 367 ~~laI~d~~~kt~t~V~glP~~~is~~~~~~~ve~G~aYi~Vt-t~~g~~~~IY~iDp~TatA 428 (435)
T PF14298_consen 367 KKLAIFDVSNKTFTWVTGLPADLISGFGNAPYVENGKAYIPVT-TEDGSDPYIYKIDPATATA 428 (435)
T ss_pred ceEEEEEccCceeEEeccCChhhccccccceEeeCCEEEEEEe-ecCCCceeEEEEcCccccc
Confidence 7778899999887777766654 111234556888888422 1222245799999988753
No 219
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=53.97 E-value=2.6e+02 Score=28.64 Aligned_cols=149 Identities=11% Similarity=0.037 Sum_probs=64.6
Q ss_pred EEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeC--CCCceeecCCCCC
Q 045821 170 IYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEP--STDSWLHADANMA 247 (352)
Q Consensus 170 iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~--~~~~W~~~~~~~~ 247 (352)
-++.|..+ +.|.+|...+.+-..+-..-..+..+.++..+|+..++||.+. .|-+.+. .+..-...+. .
T Consensus 68 ~f~~~s~~--~tv~~y~fps~~~~~iL~Rftlp~r~~~v~g~g~~iaagsdD~-----~vK~~~~~D~s~~~~lrgh--~ 138 (933)
T KOG1274|consen 68 HFLTGSEQ--NTVLRYKFPSGEEDTILARFTLPIRDLAVSGSGKMIAAGSDDT-----AVKLLNLDDSSQEKVLRGH--D 138 (933)
T ss_pred ceEEeecc--ceEEEeeCCCCCccceeeeeeccceEEEEecCCcEEEeecCce-----eEEEEeccccchheeeccc--C
Confidence 44444333 7888887776654433211112223345556777777765441 2222222 2222222111 1
Q ss_pred CCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc----eEEccCCCCCc-CC--CCceEEEeCCEEEEEcCCceEEEEEc
Q 045821 248 SGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE----WSPVGRLSTLL-TR--PPCKLVAIGKTIFVIGKGCSAVVIDV 320 (352)
Q Consensus 248 ~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~----W~~~~~~~~~~-~~--~~~~~~~~~~~i~v~gG~~~~~~~d~ 320 (352)
..-.+....-++.++.+...+| .|.+||+.+.. |..+..-.... .+ ...+...-++++.+.+-.+.+.+|+.
T Consensus 139 apVl~l~~~p~~~fLAvss~dG-~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~Vkvy~r 217 (933)
T KOG1274|consen 139 APVLQLSYDPKGNFLAVSSCDG-KVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNTVKVYSR 217 (933)
T ss_pred CceeeeeEcCCCCEEEEEecCc-eEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCCeEEEEcc
Confidence 1111111223555555544444 78888877654 33332111100 01 11111223466666665555556654
Q ss_pred CCcccccceE
Q 045821 321 GNIGNIGGIM 330 (352)
Q Consensus 321 ~~~~w~~~~~ 330 (352)
+ .|...+.
T Consensus 218 ~--~we~~f~ 225 (933)
T KOG1274|consen 218 K--GWELQFK 225 (933)
T ss_pred C--Cceehee
Confidence 4 3554433
No 220
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=53.09 E-value=2.1e+02 Score=27.22 Aligned_cols=61 Identities=18% Similarity=0.248 Sum_probs=34.1
Q ss_pred ECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEE-EeCCEEEEEcCCce-EEEEEcCC
Q 045821 257 VDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLV-AIGKTIFVIGKGCS-AVVIDVGN 322 (352)
Q Consensus 257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~~~i~v~gG~~~-~~~~d~~~ 322 (352)
.+..|++-=|++. .+..||.....-.. .+.. .....+++ ..+|.++++|-... ++.||+..
T Consensus 219 sne~l~vsVG~Dk-ki~~yD~~s~~s~~--~l~y--~~Plstvaf~~~G~~L~aG~s~G~~i~YD~R~ 281 (673)
T KOG4378|consen 219 SNEALLVSVGYDK-KINIYDIRSQASTD--RLTY--SHPLSTVAFSECGTYLCAGNSKGELIAYDMRS 281 (673)
T ss_pred CccceEEEecccc-eEEEeecccccccc--eeee--cCCcceeeecCCceEEEeecCCceEEEEeccc
Confidence 4677777777776 89999998543321 1111 11112232 23566777775554 44588653
No 221
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=52.48 E-value=1e+02 Score=23.57 Aligned_cols=63 Identities=14% Similarity=0.071 Sum_probs=35.4
Q ss_pred CEEEEEeCCCCCCceEEEeCCCCc---EEEecCCCCcccceeEEE---ECCEEEEEccCCCCCCceEEEEEeCCCC--ce
Q 045821 168 QKIYCIGGLGDTHSWDVYDPRTNN---WKLHTEPNIFTEIEDSFV---MDGKIYIRCSASAATSHVCALVYEPSTD--SW 239 (352)
Q Consensus 168 ~~iyv~gG~~~~~~v~~yd~~t~~---W~~~~~~~~~~~~~~~~~---~~g~iyv~GG~~~~~~~~~i~~yd~~~~--~W 239 (352)
..+.++|.. +.+.+||...|. ++.++ .......... ....+.++||.. .+.-||-..+ -|
T Consensus 64 ~D~LliGt~---t~llaYDV~~N~d~Fyke~~---DGvn~i~~g~~~~~~~~l~ivGGnc------si~Gfd~~G~e~fW 131 (136)
T PF14781_consen 64 RDCLLIGTQ---TSLLAYDVENNSDLFYKEVP---DGVNAIVIGKLGDIPSPLVIVGGNC------SIQGFDYEGNEIFW 131 (136)
T ss_pred cCEEEEecc---ceEEEEEcccCchhhhhhCc---cceeEEEEEecCCCCCcEEEECceE------EEEEeCCCCcEEEE
Confidence 345666654 679999999885 44443 2221111112 245677777654 5777776543 36
Q ss_pred eec
Q 045821 240 LHA 242 (352)
Q Consensus 240 ~~~ 242 (352)
+..
T Consensus 132 tVt 134 (136)
T PF14781_consen 132 TVT 134 (136)
T ss_pred Eec
Confidence 643
No 222
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=51.81 E-value=1.9e+02 Score=26.42 Aligned_cols=148 Identities=9% Similarity=0.071 Sum_probs=72.0
Q ss_pred CEEEEEeCCCCCCCCCCeEEEEECCCC--eeEecCC------CCCcceeeeeEEECC---EEEEEeCCCCCCceEEEeCC
Q 045821 120 KNAYLLGGCGWSEDATSEVYCYDASMN--TWTDAAP------MSTARCYFPCGVLNQ---KIYCIGGLGDTHSWDVYDPR 188 (352)
Q Consensus 120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~--~W~~~~~------~~~~r~~~~~~~~~~---~iyv~gG~~~~~~v~~yd~~ 188 (352)
..|+.+||... .+.+.+||.... .|+.-.. |..|-....+..+.+ .-|+.+... ..+-.||+.
T Consensus 161 p~Iva~GGke~----~n~lkiwdle~~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~--hqvR~YDt~ 234 (412)
T KOG3881|consen 161 PYIVATGGKEN----INELKIWDLEQSKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRY--HQVRLYDTR 234 (412)
T ss_pred CceEecCchhc----ccceeeeecccceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEecc--eeEEEecCc
Confidence 45777788532 466677777654 3764321 223433344444433 233322211 678899998
Q ss_pred CCcEEEecCCCCccc--ceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEe
Q 045821 189 TNNWKLHTEPNIFTE--IEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLD 265 (352)
Q Consensus 189 t~~W~~~~~~~~~~~--~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~g 265 (352)
.++ +.+........ .+.+-.-+++..++| .....+..||..+.+---..-.-.........+. +..+...+
T Consensus 235 ~qR-RPV~~fd~~E~~is~~~l~p~gn~Iy~g-----n~~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~~~~las~ 308 (412)
T KOG3881|consen 235 HQR-RPVAQFDFLENPISSTGLTPSGNFIYTG-----NTKGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPTHPVLASC 308 (412)
T ss_pred ccC-cceeEeccccCcceeeeecCCCcEEEEe-----cccchhheecccCceeeccccCCccCCcceEEEcCCCceEEee
Confidence 654 34432222211 112222344444443 2223577888877653322110111111111222 34677777
Q ss_pred cCCCCeEEEEeCCCC
Q 045821 266 QSSGTKLMMWQKESR 280 (352)
Q Consensus 266 g~~~~~v~~yd~~~~ 280 (352)
|.+- .+-+||.++.
T Consensus 309 GLDR-yvRIhD~ktr 322 (412)
T KOG3881|consen 309 GLDR-YVRIHDIKTR 322 (412)
T ss_pred ccce-eEEEeecccc
Confidence 7654 7889998873
No 223
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=51.32 E-value=70 Score=29.91 Aligned_cols=63 Identities=14% Similarity=0.199 Sum_probs=36.7
Q ss_pred ECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEE-EEEcCC
Q 045821 257 VDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAV-VIDVGN 322 (352)
Q Consensus 257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~-~~d~~~ 322 (352)
.++.=++.++++. .+-.||.++++-..--.+... ......-.-+..+++.||.++-. .+|+..
T Consensus 268 ~~g~~fLS~sfD~-~lKlwDtETG~~~~~f~~~~~--~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs 331 (503)
T KOG0282|consen 268 NCGTSFLSASFDR-FLKLWDTETGQVLSRFHLDKV--PTCVKFHPDNQNIFLVGGSDKKIRQWDIRS 331 (503)
T ss_pred ccCCeeeeeecce-eeeeeccccceEEEEEecCCC--ceeeecCCCCCcEEEEecCCCcEEEEeccc
Confidence 3555667777776 788999999987655332210 00011112245899999887533 355543
No 224
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=51.21 E-value=1.8e+02 Score=25.87 Aligned_cols=214 Identities=12% Similarity=0.043 Sum_probs=103.2
Q ss_pred eEEEEE--CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcce--eeeeEEECCEEEEEeCCCCC---CceEEE
Q 045821 113 MGFEVL--GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARC--YFPCGVLNQKIYCIGGLGDT---HSWDVY 185 (352)
Q Consensus 113 ~~~~~~--~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~--~~~~~~~~~~iyv~gG~~~~---~~v~~y 185 (352)
|.++.. ...+.+|+- .+-.-+.+||+.+++-...-..+..|. +|++..-+|+.......+.. ..+-+|
T Consensus 8 H~~a~~p~~~~avafaR-----RPG~~~~v~D~~~g~~~~~~~a~~gRHFyGHg~fs~dG~~LytTEnd~~~g~G~IgVy 82 (305)
T PF07433_consen 8 HGVAAHPTRPEAVAFAR-----RPGTFALVFDCRTGQLLQRLWAPPGRHFYGHGVFSPDGRLLYTTENDYETGRGVIGVY 82 (305)
T ss_pred cceeeCCCCCeEEEEEe-----CCCcEEEEEEcCCCceeeEEcCCCCCEEecCEEEcCCCCEEEEeccccCCCcEEEEEE
Confidence 444444 456677764 334567899999987543333355554 46666666654333332222 677899
Q ss_pred eCCCCcEEEecCCCCccccee-EEEE-CC-EEEEE-ccCCCCCCc-----e------EEEEEeCCCCceeec---C-CCC
Q 045821 186 DPRTNNWKLHTEPNIFTEIED-SFVM-DG-KIYIR-CSASAATSH-----V------CALVYEPSTDSWLHA---D-ANM 246 (352)
Q Consensus 186 d~~t~~W~~~~~~~~~~~~~~-~~~~-~g-~iyv~-GG~~~~~~~-----~------~i~~yd~~~~~W~~~---~-~~~ 246 (352)
|.. +..+++...+...-..+ ...+ +| .|.|. ||...+... + ++...|..+++-.+. + ...
T Consensus 83 d~~-~~~~ri~E~~s~GIGPHel~l~pDG~tLvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q~~Lp~~~~ 161 (305)
T PF07433_consen 83 DAA-RGYRRIGEFPSHGIGPHELLLMPDGETLVVANGGIETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQVELPPDLH 161 (305)
T ss_pred ECc-CCcEEEeEecCCCcChhhEEEcCCCCEEEEEcCCCccCcccCceecChhhcCCceEEEecCCCceeeeeecCcccc
Confidence 998 67888776554433332 3333 55 45554 444433111 1 555666776654322 1 111
Q ss_pred CCCCCCcEEEECCEEEEEecCCC------CeEEEEeCCCCceEEccCCCC----CcCCCCceEEEeC-C-EEEEEcCCc-
Q 045821 247 ASGWRGPAVVVDDALYVLDQSSG------TKLMMWQKESREWSPVGRLST----LLTRPPCKLVAIG-K-TIFVIGKGC- 313 (352)
Q Consensus 247 ~~~~~~~~~~~~~~l~~~gg~~~------~~v~~yd~~~~~W~~~~~~~~----~~~~~~~~~~~~~-~-~i~v~gG~~- 313 (352)
.....+.++.-+|.+++-.-+.+ --+..++.... -..+ .+|. .+..+..+++... + .|.+-.-.-
T Consensus 162 ~lSiRHLa~~~~G~V~~a~Q~qg~~~~~~PLva~~~~g~~-~~~~-~~p~~~~~~l~~Y~gSIa~~~~g~~ia~tsPrGg 239 (305)
T PF07433_consen 162 QLSIRHLAVDGDGTVAFAMQYQGDPGDAPPLVALHRRGGA-LRLL-PAPEEQWRRLNGYIGSIAADRDGRLIAVTSPRGG 239 (305)
T ss_pred ccceeeEEecCCCcEEEEEecCCCCCccCCeEEEEcCCCc-ceec-cCChHHHHhhCCceEEEEEeCCCCEEEEECCCCC
Confidence 11222233333444433322322 12334443331 2111 1211 1112223344432 2 344544332
Q ss_pred eEEEEEcCCcccccceEEecc
Q 045821 314 SAVVIDVGNIGNIGGIMVSSS 334 (352)
Q Consensus 314 ~~~~~d~~~~~w~~~~~~~~~ 334 (352)
.+.+||..+..|.....++..
T Consensus 240 ~~~~~d~~tg~~~~~~~l~D~ 260 (305)
T PF07433_consen 240 RVAVWDAATGRLLGSVPLPDA 260 (305)
T ss_pred EEEEEECCCCCEeeccccCce
Confidence 455789999888776655443
No 225
>PF13859 BNR_3: BNR repeat-like domain; PDB: 3B69_A.
Probab=51.04 E-value=1.8e+02 Score=25.92 Aligned_cols=166 Identities=10% Similarity=0.115 Sum_probs=74.8
Q ss_pred eeeEEECCEEEEEeCCCCC--------CceE-EEeCCCCcEEEecCCC---C--c---ccceeEEEECCEEEEEccCCCC
Q 045821 161 FPCGVLNQKIYCIGGLGDT--------HSWD-VYDPRTNNWKLHTEPN---I--F---TEIEDSFVMDGKIYIRCSASAA 223 (352)
Q Consensus 161 ~~~~~~~~~iyv~gG~~~~--------~~v~-~yd~~t~~W~~~~~~~---~--~---~~~~~~~~~~g~iyv~GG~~~~ 223 (352)
++++.+++.|+++...... ..+. .......+|+...... . . ......++-++.||++-|....
T Consensus 2 PSLV~vgGvv~AvAEa~~~~~~~~~~~~ias~~~~~~g~tw~~~~~~~~~~~~~~~v~v~rPTtvvkgn~IymLvG~y~~ 81 (310)
T PF13859_consen 2 PSLVEVGGVVFAVAEAQCKKSNDSGFTDIASEYSTDNGETWKAEVAVLNDDGSKKRVDVSRPTTVVKGNKIYMLVGSYSR 81 (310)
T ss_dssp EEEEEETTEEEEEEEEESS-S-SSS-EEEEEEEESSSSSS-EEEEEE----SS-TT-EEEEEEEEEETTEEEEEEEEESS
T ss_pred CCEEEECCEEEEEEEEEEccCCCCCceeEEEeEeeccccccccceeeecccccccccccceeeeeecceeEEEEEEEEec
Confidence 4567778998888764311 1122 2334455797654211 1 1 1123456679999998766542
Q ss_pred C---CceEEEEE--eCCCCceeecCCCCCCCCC---------CcEEEE-CCEEEEE--ecCCC----CeEEEEeCC-CCc
Q 045821 224 T---SHVCALVY--EPSTDSWLHADANMASGWR---------GPAVVV-DDALYVL--DQSSG----TKLMMWQKE-SRE 281 (352)
Q Consensus 224 ~---~~~~i~~y--d~~~~~W~~~~~~~~~~~~---------~~~~~~-~~~l~~~--gg~~~----~~v~~yd~~-~~~ 281 (352)
. ....+..+ +....+|......+..... +..+.. +|-|.+- +-..+ ..+.+|..+ ..+
T Consensus 82 ~~~~~~~~llLvks~~~g~~W~~~~~l~~~~~~~~~~figgGGSGV~m~dGTLVFPv~a~~~~~~~~~SlIiYS~d~g~~ 161 (310)
T PF13859_consen 82 SAGADDWGLLLVKSTDGGIKWGDTKSLPSTSFQSWKQFIGGGGSGVVMEDGTLVFPVQATKKNGDGTVSLIIYSTDDGKT 161 (310)
T ss_dssp --SSTTEEEEEEEEESSSSEE---EE-GGGS-EEEEEEEE-SEE-EE-TTS-EEEEEEEEETT---EEEEEEEESSTTSS
T ss_pred cccccccceeeeeccCCcceeeecccCCchhccccceeecCCCCceEEcCCCEEEEEeeeccCccceEEEEEEECCCccc
Confidence 1 11233333 3344469776432221111 111332 3333211 21111 256788777 788
Q ss_pred eEEccCCCCCcCCCCceEEEe-CCEEEEEc----CCceEEEEEcCCccccc
Q 045821 282 WSPVGRLSTLLTRPPCKLVAI-GKTIFVIG----KGCSAVVIDVGNIGNIG 327 (352)
Q Consensus 282 W~~~~~~~~~~~~~~~~~~~~-~~~i~v~g----G~~~~~~~d~~~~~w~~ 327 (352)
|.--..+++. ......++.. +++|+++. |+.+++.-.-.-.+|.+
T Consensus 162 W~lskg~s~~-gC~~psv~EWe~gkLlM~~~c~~g~rrVYeS~DmG~tWte 211 (310)
T PF13859_consen 162 WKLSKGMSPA-GCSDPSVVEWEDGKLLMMTACDDGRRRVYESGDMGTTWTE 211 (310)
T ss_dssp -EE-S----T-T-EEEEEEEE-TTEEEEEEE-TTS---EEEESSTTSS-EE
T ss_pred eEeccccCCC-CcceEEEEeccCCeeEEEEecccceEEEEEEcccceehhh
Confidence 9876666542 3344567777 78999885 44445544344557865
No 226
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=50.70 E-value=2.3e+02 Score=27.06 Aligned_cols=154 Identities=14% Similarity=0.108 Sum_probs=76.3
Q ss_pred CcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC--cc-----cceeEEEECCEEEEEccCCCCCCc
Q 045821 156 TARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI--FT-----EIEDSFVMDGKIYIRCSASAATSH 226 (352)
Q Consensus 156 ~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~--~~-----~~~~~~~~~g~iyv~GG~~~~~~~ 226 (352)
..|.....+.+ ++++ +.+|+.+ .++..++. ..|..-+.+.. ++ -.+.++..+|++.+.-|.+.
T Consensus 315 g~Rv~~tsC~~nrdg~~-iAagc~D-GSIQ~W~~--~~~~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~---- 386 (641)
T KOG0772|consen 315 GKRVPVTSCAWNRDGKL-IAAGCLD-GSIQIWDK--GSRTVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDD---- 386 (641)
T ss_pred CcccCceeeecCCCcch-hhhcccC-Cceeeeec--CCcccccceEeeeccCCCCceeEEEeccccchhhhccCCC----
Confidence 34666666666 4566 5556554 56666665 33433332211 11 11234556888887766553
Q ss_pred eEEEEEeCC-----CCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-----CeEEEEeCCCCceEEccCCCCCcCCCC
Q 045821 227 VCALVYEPS-----TDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-----TKLMMWQKESREWSPVGRLSTLLTRPP 296 (352)
Q Consensus 227 ~~i~~yd~~-----~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-----~~v~~yd~~~~~W~~~~~~~~~~~~~~ 296 (352)
.+-.+|+. -+.|+-++...+.. -..+.-+.+|.+.|-..- ..++.||.. +...+..++.. ...
T Consensus 387 -tLKvWDLrq~kkpL~~~tgL~t~~~~t--dc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~--t~d~v~ki~i~--~aS 459 (641)
T KOG0772|consen 387 -TLKVWDLRQFKKPLNVRTGLPTPFPGT--DCCFSPDDKLILTGTSAPNGMTAGTLFFFDRM--TLDTVYKIDIS--TAS 459 (641)
T ss_pred -ceeeeeccccccchhhhcCCCccCCCC--ccccCCCceEEEecccccCCCCCceEEEEecc--ceeeEEEecCC--Cce
Confidence 23333432 23455554422211 112344778887775221 368888864 45555444331 111
Q ss_pred ceEEEe---CCEEEEEcCCceEEE-EEcCCcc
Q 045821 297 CKLVAI---GKTIFVIGKGCSAVV-IDVGNIG 324 (352)
Q Consensus 297 ~~~~~~---~~~i~v~gG~~~~~~-~d~~~~~ 324 (352)
..-+.. -++|++-.|.-...+ ||+.+..
T Consensus 460 vv~~~WhpkLNQi~~gsgdG~~~vyYdp~~S~ 491 (641)
T KOG0772|consen 460 VVRCLWHPKLNQIFAGSGDGTAHVYYDPNESI 491 (641)
T ss_pred EEEEeecchhhheeeecCCCceEEEECccccc
Confidence 111122 356776555545554 7776654
No 227
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=50.39 E-value=3.3e+02 Score=28.78 Aligned_cols=163 Identities=10% Similarity=0.062 Sum_probs=82.1
Q ss_pred eEecCCC-CCcceeeeeEEE---CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce-eE---EEECCEEEEEcc
Q 045821 148 WTDAAPM-STARCYFPCGVL---NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE-DS---FVMDGKIYIRCS 219 (352)
Q Consensus 148 W~~~~~~-~~~r~~~~~~~~---~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~-~~---~~~~g~iyv~GG 219 (352)
|..+.++ +.+|..-.++-. .|++++.|+. ..+-++|..+++ .+.++|...... .+ --.+|.+.+.|-
T Consensus 1154 w~~Ls~~~~~~r~~~~v~dWqQ~~G~Ll~tGd~---r~IRIWDa~~E~--~~~diP~~s~t~vTaLS~~~~~gn~i~AGf 1228 (1387)
T KOG1517|consen 1154 WSSLSDQLPGARGTGLVVDWQQQSGHLLVTGDV---RSIRIWDAHKEQ--VVADIPYGSSTLVTALSADLVHGNIIAAGF 1228 (1387)
T ss_pred eccccccCccCCCCCeeeehhhhCCeEEecCCe---eEEEEEecccce--eEeecccCCCccceeecccccCCceEEEee
Confidence 6666554 455554433333 5889998874 567889988775 344454442222 11 123578888854
Q ss_pred CCCCCCceEEEEEeCCC-------CceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCc
Q 045821 220 ASAATSHVCALVYEPST-------DSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLL 292 (352)
Q Consensus 220 ~~~~~~~~~i~~yd~~~-------~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~ 292 (352)
.++ .+-.||... +.|++-....+ ..+..+.-+|.--++.|..+.+|..+|+... ....-+....
T Consensus 1229 aDG-----svRvyD~R~a~~ds~v~~~R~h~~~~~--Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~--~~e~~~~iv~ 1299 (1387)
T KOG1517|consen 1229 ADG-----SVRVYDRRMAPPDSLVCVYREHNDVEP--IVHLSLQRQGLGELVSGSQDGDIQLLDLRMS--SKETFLTIVA 1299 (1387)
T ss_pred cCC-----ceEEeecccCCccccceeecccCCccc--ceeEEeecCCCcceeeeccCCeEEEEecccC--cccccceeee
Confidence 443 466777522 23443322111 1111122223223444433338999998875 2222111111
Q ss_pred CC----CCceEEEeCC-EEEEEcCCceEEEEEcCCcc
Q 045821 293 TR----PPCKLVAIGK-TIFVIGKGCSAVVIDVGNIG 324 (352)
Q Consensus 293 ~~----~~~~~~~~~~-~i~v~gG~~~~~~~d~~~~~ 324 (352)
.+ ...++.++.. -|+.-|+.....||+...+.
T Consensus 1300 ~~~yGs~lTal~VH~hapiiAsGs~q~ikIy~~~G~~ 1336 (1387)
T KOG1517|consen 1300 HWEYGSALTALTVHEHAPIIASGSAQLIKIYSLSGEQ 1336 (1387)
T ss_pred ccccCccceeeeeccCCCeeeecCcceEEEEecChhh
Confidence 11 1234555543 46666666556678876554
No 228
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=50.14 E-value=1.8e+02 Score=28.09 Aligned_cols=107 Identities=14% Similarity=0.076 Sum_probs=56.7
Q ss_pred CCEEEEEccCCCC--CCceEEEEEeCCCCceeecCC---CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceE--
Q 045821 211 DGKIYIRCSASAA--TSHVCALVYEPSTDSWLHADA---NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWS-- 283 (352)
Q Consensus 211 ~g~iyv~GG~~~~--~~~~~i~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~-- 283 (352)
+.+|+.+.-.-+. ....+.-.|+...++-+.+.- +.+......+...+.+.+++|..++ .|..||...++=.
T Consensus 218 ~~qi~tVE~s~s~~g~~~~d~ciYE~~r~klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~Dg-SiiLyD~~~~~t~~~ 296 (545)
T PF11768_consen 218 PYQIHTVEQSISVKGEPSADSCIYECSRNKLQRVSVTSIPLPSQVICCARSPSEDKLVLGCEDG-SIILYDTTRGVTLLA 296 (545)
T ss_pred CcEEEEEEEecCCCCCceeEEEEEEeecCceeEEEEEEEecCCcceEEecCcccceEEEEecCC-eEEEEEcCCCeeeee
Confidence 4456666322211 122255567777666554422 1221111122234667788888877 9999998876433
Q ss_pred EccCCCCCcCCCCceEEEeCCEEEEEcCCc-eEEEEEcCCcc
Q 045821 284 PVGRLSTLLTRPPCKLVAIGKTIFVIGKGC-SAVVIDVGNIG 324 (352)
Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~-~~~~~d~~~~~ 324 (352)
+.+-.|. +..-..++.|+++|+.. ...+||..-+.
T Consensus 297 ka~~~P~------~iaWHp~gai~~V~s~qGelQ~FD~ALsp 332 (545)
T PF11768_consen 297 KAEFIPT------LIAWHPDGAIFVVGSEQGELQCFDMALSP 332 (545)
T ss_pred eecccce------EEEEcCCCcEEEEEcCCceEEEEEeecCc
Confidence 2222222 11122377788777654 45578876544
No 229
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=49.12 E-value=90 Score=30.42 Aligned_cols=97 Identities=8% Similarity=0.085 Sum_probs=52.2
Q ss_pred EEEEEeCCCCceeecCC-CCCCCCCCcEEE----ECCEEEEEecCCCCeEEEEeCCCCceEEccC-CCCCcCCCCceEEE
Q 045821 228 CALVYEPSTDSWLHADA-NMASGWRGPAVV----VDDALYVLDQSSGTKLMMWQKESREWSPVGR-LSTLLTRPPCKLVA 301 (352)
Q Consensus 228 ~i~~yd~~~~~W~~~~~-~~~~~~~~~~~~----~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~-~~~~~~~~~~~~~~ 301 (352)
.+-.||......+.... ......|..++. +.++..++....+..+..||.++..-.-... +...-.....++..
T Consensus 75 ~i~l~dt~~~~fr~ee~~lk~~~aH~nAifDl~wapge~~lVsasGDsT~r~Wdvk~s~l~G~~~~~GH~~SvkS~cf~~ 154 (720)
T KOG0321|consen 75 GIILFDTKSIVFRLEERQLKKPLAHKNAIFDLKWAPGESLLVSASGDSTIRPWDVKTSRLVGGRLNLGHTGSVKSECFMP 154 (720)
T ss_pred ceeeecchhhhcchhhhhhcccccccceeEeeccCCCceeEEEccCCceeeeeeeccceeecceeecccccccchhhhcc
Confidence 46677776666651111 111111222221 2355555543333489999999877655431 11111233445666
Q ss_pred eCCEEEEEcCCce-EEEEEcCCcc
Q 045821 302 IGKTIFVIGKGCS-AVVIDVGNIG 324 (352)
Q Consensus 302 ~~~~i~v~gG~~~-~~~~d~~~~~ 324 (352)
.+..+++.||++. +.+.|...+.
T Consensus 155 ~n~~vF~tGgRDg~illWD~R~n~ 178 (720)
T KOG0321|consen 155 TNPAVFCTGGRDGEILLWDCRCNG 178 (720)
T ss_pred CCCcceeeccCCCcEEEEEEeccc
Confidence 7888999999886 4467766554
No 230
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=48.82 E-value=1.5e+02 Score=24.23 Aligned_cols=64 Identities=11% Similarity=0.031 Sum_probs=36.2
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPST 236 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~ 236 (352)
+.++.++-|... ..+..||.+.. .+...+..........-+|++.++||.... ...+..||..+
T Consensus 71 g~~favi~g~~~-~~v~lyd~~~~---~i~~~~~~~~n~i~wsP~G~~l~~~g~~n~--~G~l~~wd~~~ 134 (194)
T PF08662_consen 71 GNEFAVIYGSMP-AKVTLYDVKGK---KIFSFGTQPRNTISWSPDGRFLVLAGFGNL--NGDLEFWDVRK 134 (194)
T ss_pred CCEEEEEEccCC-cccEEEcCccc---EeEeecCCCceEEEECCCCCEEEEEEccCC--CcEEEEEECCC
Confidence 456666656333 68899999633 333333323332233347888888776422 23577888873
No 231
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=48.50 E-value=42 Score=19.09 Aligned_cols=25 Identities=12% Similarity=0.173 Sum_probs=17.1
Q ss_pred eEEEeCCEEEEEcCCceEEEEEcCC
Q 045821 298 KLVAIGKTIFVIGKGCSAVVIDVGN 322 (352)
Q Consensus 298 ~~~~~~~~i~v~gG~~~~~~~d~~~ 322 (352)
+.+..++.+|+.+....++.+|+++
T Consensus 16 ~~~v~~g~vyv~~~dg~l~ald~~t 40 (40)
T PF13570_consen 16 SPAVAGGRVYVGTGDGNLYALDAAT 40 (40)
T ss_dssp --EECTSEEEEE-TTSEEEEEETT-
T ss_pred CCEEECCEEEEEcCCCEEEEEeCCC
Confidence 4466789999988877788888763
No 232
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=47.55 E-value=2.9e+02 Score=27.26 Aligned_cols=121 Identities=11% Similarity=0.129 Sum_probs=68.2
Q ss_pred ecCCCCccccee--EEEE-CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCC----CCCCcEE-EECCEEEEEec
Q 045821 195 HTEPNIFTEIED--SFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMAS----GWRGPAV-VVDDALYVLDQ 266 (352)
Q Consensus 195 ~~~~~~~~~~~~--~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~----~~~~~~~-~~~~~l~~~gg 266 (352)
+.+++....... +... ++++++.- .....++.++..+.+..++....+. ..+..++ ..++-|.++++
T Consensus 421 v~~~~~~~~~a~~i~ftid~~k~~~~s-----~~~~~le~~el~~ps~kel~~~~~~~~~~~I~~l~~SsdG~yiaa~~t 495 (691)
T KOG2048|consen 421 VDDVPLALLDASAISFTIDKNKLFLVS-----KNIFSLEEFELETPSFKELKSIQSQAKCPSISRLVVSSDGNYIAAIST 495 (691)
T ss_pred eccchhhhccceeeEEEecCceEEEEe-----cccceeEEEEecCcchhhhhccccccCCCcceeEEEcCCCCEEEEEec
Confidence 344555443332 2334 45555552 1223678888888887776442222 1222223 24566666664
Q ss_pred CCCCeEEEEeCCCCceEEcc-CCCCCcCCCCceEE-EeCCEEEEEcCCceEEEEEcCCcc
Q 045821 267 SSGTKLMMWQKESREWSPVG-RLSTLLTRPPCKLV-AIGKTIFVIGKGCSAVVIDVGNIG 324 (352)
Q Consensus 267 ~~~~~v~~yd~~~~~W~~~~-~~~~~~~~~~~~~~-~~~~~i~v~gG~~~~~~~d~~~~~ 324 (352)
.. .|++||.++.+-..+. .++. .....+.. ...+.|.+.-..+.++.||++.+.
T Consensus 496 ~g--~I~v~nl~~~~~~~l~~rln~--~vTa~~~~~~~~~~lvvats~nQv~efdi~~~~ 551 (691)
T KOG2048|consen 496 RG--QIFVYNLETLESHLLKVRLNI--DVTAAAFSPFVRNRLVVATSNNQVFEFDIEARN 551 (691)
T ss_pred cc--eEEEEEcccceeecchhccCc--ceeeeeccccccCcEEEEecCCeEEEEecchhh
Confidence 43 8999999999887776 3432 12222222 235677777767678889996544
No 233
>PRK01029 tolB translocation protein TolB; Provisional
Probab=47.15 E-value=2.5e+02 Score=26.34 Aligned_cols=59 Identities=14% Similarity=0.111 Sum_probs=37.2
Q ss_pred eEEEEEeCCCCceeecCCCCCCCCCCcEEEECCE-EEEEecCCC-CeEEEEeCCCCceEEcc
Q 045821 227 VCALVYEPSTDSWLHADANMASGWRGPAVVVDDA-LYVLDQSSG-TKLMMWQKESREWSPVG 286 (352)
Q Consensus 227 ~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~-~~v~~yd~~~~~W~~~~ 286 (352)
..++.||+.+++.+.+.... .........-+++ |++.....+ ..++.+|..++....+.
T Consensus 351 ~~I~v~dl~~g~~~~Lt~~~-~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~~~Lt 411 (428)
T PRK01029 351 RQICVYDLATGRDYQLTTSP-ENKESPSWAIDSLHLVYSAGNSNESELYLISLITKKTRKIV 411 (428)
T ss_pred cEEEEEECCCCCeEEccCCC-CCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEee
Confidence 47899999999888875431 1112233333554 554443322 57999999988887775
No 234
>PF11134 Phage_stabilise: Phage stabilisation protein; InterPro: IPR021098 This entry represents the Bacteriophage P22, Gp10, DNA-stabilising protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Members of this family are phage proteins involved with stabilising the head assembly unit and condensed DNA within the capsid [].
Probab=46.75 E-value=2.5e+02 Score=26.29 Aligned_cols=112 Identities=9% Similarity=0.051 Sum_probs=57.3
Q ss_pred eeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc---cc----eeE-----EEE-CCEEEEEccCCCCCCc
Q 045821 160 YFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT---EI----EDS-----FVM-DGKIYIRCSASAATSH 226 (352)
Q Consensus 160 ~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~---~~----~~~-----~~~-~g~iyv~GG~~~~~~~ 226 (352)
.++.+.+++.++.+|..... .-.+|-....+=.+|+..+... .+ -+. ... +..+|++ .-.
T Consensus 234 ~~s~~~~~~t~~wlg~~~~G-~~sVy~~~gyq~~RIST~~IE~~l~~ya~~ela~af~et~~f~~h~~l~i------hlp 306 (469)
T PF11134_consen 234 KHSKTKFGNTVAWLGHDATG-APSVYRINGYQASRISTHAIEKALRSYAHDELAIAFMETYQFDGHEFLLI------HLP 306 (469)
T ss_pred cceeeecCCEEEEeccCCCC-CceEEEecCCceeeeccHHHHHHHHhhccHHHHHHHHHHhhcCceEEEEE------EcC
Confidence 35566678888888874432 2234444444444555322211 11 011 122 3467777 222
Q ss_pred eEEEEEeCCCCce----eecCCCCCC--CCCCcEEEECCEEEEEecCCCCeEEEEeCCC
Q 045821 227 VCALVYEPSTDSW----LHADANMAS--GWRGPAVVVDDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 227 ~~i~~yd~~~~~W----~~~~~~~~~--~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
...++||..+++| ..+...... .+....+..++++. +|...+..+..+++..
T Consensus 307 ~~tlcyD~at~~~~~qw~~l~tg~~~~~~R~~~~~f~~~q~~-VGD~~~g~lg~L~~n~ 364 (469)
T PF11134_consen 307 RKTLCYDAATSQWGEQWFILKTGFYDEPYRAIDFMFFDNQIT-VGDKQNGLLGALDFNA 364 (469)
T ss_pred CceEEEEcccCCcccceEEEeccccCCcceeeeeEEeCCeeE-ecccccceeEEecccc
Confidence 4789999998855 444443322 22233455566654 4554444666666654
No 235
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=44.15 E-value=46 Score=25.46 Aligned_cols=64 Identities=14% Similarity=0.162 Sum_probs=37.8
Q ss_pred CCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCc-eEEEeCCEEEEEcCCceEEEEEcCCc
Q 045821 258 DDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPC-KLVAIGKTIFVIGKGCSAVVIDVGNI 323 (352)
Q Consensus 258 ~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~-~~~~~~~~i~v~gG~~~~~~~d~~~~ 323 (352)
+..++++|-.. .+..||.+.|+=..-.+++.....-.. .+....+.+.++||.....=||-+-+
T Consensus 63 ~~D~LliGt~t--~llaYDV~~N~d~Fyke~~DGvn~i~~g~~~~~~~~l~ivGGncsi~Gfd~~G~ 127 (136)
T PF14781_consen 63 GRDCLLIGTQT--SLLAYDVENNSDLFYKEVPDGVNAIVIGKLGDIPSPLVIVGGNCSIQGFDYEGN 127 (136)
T ss_pred CcCEEEEeccc--eEEEEEcccCchhhhhhCccceeEEEEEecCCCCCcEEEECceEEEEEeCCCCc
Confidence 45677887655 799999998873222333331100000 11112467999999877777887644
No 236
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=43.72 E-value=4e+02 Score=27.83 Aligned_cols=148 Identities=14% Similarity=0.170 Sum_probs=91.9
Q ss_pred CEEEEEeC-CCCC---CCCCCeEEEEECCC-CeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEE
Q 045821 120 KNAYLLGG-CGWS---EDATSEVYCYDASM-NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKL 194 (352)
Q Consensus 120 ~~iyv~GG-~~~~---~~~~~~~~~~d~~t-~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~ 194 (352)
+..|++|- ...+ ......+.+|.... ++-+.+..+...-+-.+.+.++|++.+.=| ..+-.|+-.+++ .
T Consensus 787 ~t~~vVGT~~v~Pde~ep~~GRIivfe~~e~~~L~~v~e~~v~Gav~aL~~fngkllA~In----~~vrLye~t~~~--e 860 (1096)
T KOG1897|consen 787 NTYYVVGTGLVYPDENEPVNGRIIVFEFEELNSLELVAETVVKGAVYALVEFNGKLLAGIN----QSVRLYEWTTER--E 860 (1096)
T ss_pred ceEEEEEEEeeccCCCCcccceEEEEEEecCCceeeeeeeeeccceeehhhhCCeEEEecC----cEEEEEEccccc--e
Confidence 45777764 2111 11234455555554 667777777666666778888998877656 778889777662 2
Q ss_pred ecCCCCcccce----eEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCC
Q 045821 195 HTEPNIFTEIE----DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGT 270 (352)
Q Consensus 195 ~~~~~~~~~~~----~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~ 270 (352)
+. +--....+ ..-+.++.|+|.-- .....+..|+...+...+++....+.|-.++-.+++..|+.+...++
T Consensus 861 Lr-~e~~~~~~~~aL~l~v~gdeI~VgDl----m~Sitll~y~~~eg~f~evArD~~p~Wmtaveil~~d~ylgae~~gN 935 (1096)
T KOG1897|consen 861 LR-IECNISNPIIALDLQVKGDEIAVGDL----MRSITLLQYKGDEGNFEEVARDYNPNWMTAVEILDDDTYLGAENSGN 935 (1096)
T ss_pred eh-hhhcccCCeEEEEEEecCcEEEEeec----cceEEEEEEeccCCceEEeehhhCccceeeEEEecCceEEeeccccc
Confidence 11 00111112 12345778887521 22347888999999999998888888877777788888877765552
Q ss_pred -eEEEEeCC
Q 045821 271 -KLMMWQKE 278 (352)
Q Consensus 271 -~v~~yd~~ 278 (352)
.+.++|-.
T Consensus 936 lf~v~~d~~ 944 (1096)
T KOG1897|consen 936 LFTVRKDSD 944 (1096)
T ss_pred EEEEEecCC
Confidence 23455543
No 237
>KOG1428 consensus Inhibitor of type V adenylyl cyclases/Neuronal presynaptic protein Highwire/PAM/RPM-1 [Signal transduction mechanisms]
Probab=43.31 E-value=3.8e+02 Score=29.80 Aligned_cols=49 Identities=14% Similarity=0.264 Sum_probs=27.0
Q ss_pred ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecC
Q 045821 210 MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQS 267 (352)
Q Consensus 210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~ 267 (352)
-+|++|..|-+. .+=.|. ..+.|.++.-+ .+....++-.++..+..|..
T Consensus 496 ~sGKvYYaGn~t------~~Gl~e-~G~nWmEL~l~--~~IVq~SVG~D~~~~~~~A~ 544 (3738)
T KOG1428|consen 496 RSGKVYYAGNGT------RFGLFE-TGNNWMELCLP--EPIVQISVGIDTIMFRSGAG 544 (3738)
T ss_pred cCccEEEecCcc------EEeEEc-cCCceEEecCC--CceEEEEeccchhheeeccC
Confidence 478999996433 122332 35889998653 22233334445555555544
No 238
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=43.16 E-value=2.4e+02 Score=25.11 Aligned_cols=44 Identities=14% Similarity=0.041 Sum_probs=24.5
Q ss_pred CeeEecCCCCC-cceee-eeEEECCEEEEEeCCCCCCceEEEeCCCCc
Q 045821 146 NTWTDAAPMST-ARCYF-PCGVLNQKIYCIGGLGDTHSWDVYDPRTNN 191 (352)
Q Consensus 146 ~~W~~~~~~~~-~r~~~-~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~ 191 (352)
.+|...|.+.. ++... .++.+++...+.||.+ ..+.+||..++.
T Consensus 29 ~~~~l~~lF~~~aH~~sitavAVs~~~~aSGssD--etI~IYDm~k~~ 74 (362)
T KOG0294|consen 29 VKPTLKPLFAFSAHAGSITALAVSGPYVASGSSD--ETIHIYDMRKRK 74 (362)
T ss_pred cceeeeccccccccccceeEEEecceeEeccCCC--CcEEEEeccchh
Confidence 34655554432 33322 3444455544445544 788999998764
No 239
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=42.49 E-value=3.5e+02 Score=26.74 Aligned_cols=147 Identities=10% Similarity=0.061 Sum_probs=66.5
Q ss_pred eEEECCEEE-EEeCCCCCCceEEEeCCCCcEEEecCCCCc---ccce----eEEEECCEEEEEccCCCCCCceEEEEEeC
Q 045821 163 CGVLNQKIY-CIGGLGDTHSWDVYDPRTNNWKLHTEPNIF---TEIE----DSFVMDGKIYIRCSASAATSHVCALVYEP 234 (352)
Q Consensus 163 ~~~~~~~iy-v~gG~~~~~~v~~yd~~t~~W~~~~~~~~~---~~~~----~~~~~~g~iyv~GG~~~~~~~~~i~~yd~ 234 (352)
....+.... ++||..+ ..+-++|..+++=-.+-.|... .+.. ++.++.+...+.|- ..-.+..+|.
T Consensus 159 slsw~~~~~~i~~Gs~D-g~Iriwd~~~~~t~~~~~~~~d~l~k~~~~iVWSv~~Lrd~tI~sgD-----S~G~V~FWd~ 232 (691)
T KOG2048|consen 159 SLSWNPTGTKIAGGSID-GVIRIWDVKSGQTLHIITMQLDRLSKREPTIVWSVLFLRDSTIASGD-----SAGTVTFWDS 232 (691)
T ss_pred EEEecCCccEEEecccC-ceEEEEEcCCCceEEEeeecccccccCCceEEEEEEEeecCcEEEec-----CCceEEEEcc
Confidence 334444444 6666654 3456677776643222222221 1111 23344444445542 2225666776
Q ss_pred CCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCC--CCceEEccCCCCCcCCCCceEEEeCCEEEEEcCC
Q 045821 235 STDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKE--SREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKG 312 (352)
Q Consensus 235 ~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~--~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~ 312 (352)
...+=.+--..........++..++.-++.+|.+. .+..|-.. .++|.....-.. ..+..-+++++++ +.+-||.
T Consensus 233 ~~gTLiqS~~~h~adVl~Lav~~~~d~vfsaGvd~-~ii~~~~~~~~~~wv~~~~r~~-h~hdvrs~av~~~-~l~sgG~ 309 (691)
T KOG2048|consen 233 IFGTLIQSHSCHDADVLALAVADNEDRVFSAGVDP-KIIQYSLTTNKSEWVINSRRDL-HAHDVRSMAVIEN-ALISGGR 309 (691)
T ss_pred cCcchhhhhhhhhcceeEEEEcCCCCeEEEccCCC-ceEEEEecCCccceeeeccccC-Ccccceeeeeecc-eEEecce
Confidence 55543221001111111122333434555566665 67666444 446876653211 1233346677777 5555666
Q ss_pred ceEEEE
Q 045821 313 CSAVVI 318 (352)
Q Consensus 313 ~~~~~~ 318 (352)
+....+
T Consensus 310 d~~l~i 315 (691)
T KOG2048|consen 310 DFTLAI 315 (691)
T ss_pred eeEEEE
Confidence 544433
No 240
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=42.33 E-value=2.8e+02 Score=25.68 Aligned_cols=111 Identities=19% Similarity=0.251 Sum_probs=55.1
Q ss_pred ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC
Q 045821 166 LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN 245 (352)
Q Consensus 166 ~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~ 245 (352)
.+|.+...||.+. -.-++|..|..-.-+-.-....-......-+|+...-||.+ +.+-++|+...+= +-.
T Consensus 313 ~DGSL~~tGGlD~--~~RvWDlRtgr~im~L~gH~k~I~~V~fsPNGy~lATgs~D-----nt~kVWDLR~r~~--ly~- 382 (459)
T KOG0272|consen 313 PDGSLAATGGLDS--LGRVWDLRTGRCIMFLAGHIKEILSVAFSPNGYHLATGSSD-----NTCKVWDLRMRSE--LYT- 382 (459)
T ss_pred CCCceeeccCccc--hhheeecccCcEEEEecccccceeeEeECCCceEEeecCCC-----CcEEEeeeccccc--cee-
Confidence 4899999999763 33457777776433211111111111223377777776554 2344444432221 111
Q ss_pred CCCCCCCcE-EE---ECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821 246 MASGWRGPA-VV---VDDALYVLDQSSGTKLMMWQKESREWSPVGRLS 289 (352)
Q Consensus 246 ~~~~~~~~~-~~---~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~ 289 (352)
++....-.+ |. ..|..++.+++++ .+-+|. +..|+.+..|.
T Consensus 383 ipAH~nlVS~Vk~~p~~g~fL~TasyD~-t~kiWs--~~~~~~~ksLa 427 (459)
T KOG0272|consen 383 IPAHSNLVSQVKYSPQEGYFLVTASYDN-TVKIWS--TRTWSPLKSLA 427 (459)
T ss_pred cccccchhhheEecccCCeEEEEcccCc-ceeeec--CCCcccchhhc
Confidence 111111111 22 2577777788766 555554 45577776553
No 241
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=40.66 E-value=2e+02 Score=28.06 Aligned_cols=135 Identities=6% Similarity=-0.023 Sum_probs=68.4
Q ss_pred CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEEC---CEEEEEccCCCCCCceEEEEEeCCCCceeecCC
Q 045821 168 QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMD---GKIYIRCSASAATSHVCALVYEPSTDSWLHADA 244 (352)
Q Consensus 168 ~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~---g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~ 244 (352)
+.+-++.+.+....|.+++.....=+ .+..........+.+. -.++|..- ..+-.||+....-.+---
T Consensus 534 DYlatV~~~~~~~~VliHQLSK~~sQ--~PF~kskG~vq~v~FHPs~p~lfVaTq-------~~vRiYdL~kqelvKkL~ 604 (733)
T KOG0650|consen 534 DYLATVMPDSGNKSVLIHQLSKRKSQ--SPFRKSKGLVQRVKFHPSKPYLFVATQ-------RSVRIYDLSKQELVKKLL 604 (733)
T ss_pred ceEEEeccCCCcceEEEEeccccccc--CchhhcCCceeEEEecCCCceEEEEec-------cceEEEehhHHHHHHHHh
Confidence 34444444444488889988765421 2222222233334443 24555421 258889987643322111
Q ss_pred CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCC--ceEEccCCC-------CCcCCCCceEEEeCCEEEEEcCC
Q 045821 245 NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESR--EWSPVGRLS-------TLLTRPPCKLVAIGKTIFVIGKG 312 (352)
Q Consensus 245 ~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~--~W~~~~~~~-------~~~~~~~~~~~~~~~~i~v~gG~ 312 (352)
....-.+..++.-+|.=+++|++.+ .+..||.+-+ -|..+.--+ .-.....++.+.-++.++||-|.
T Consensus 605 tg~kwiS~msihp~GDnli~gs~d~-k~~WfDldlsskPyk~lr~H~~avr~Va~H~ryPLfas~sdDgtv~Vfhg~ 680 (733)
T KOG0650|consen 605 TGSKWISSMSIHPNGDNLILGSYDK-KMCWFDLDLSSKPYKTLRLHEKAVRSVAFHKRYPLFASGSDDGTVIVFHGM 680 (733)
T ss_pred cCCeeeeeeeecCCCCeEEEecCCC-eeEEEEcccCcchhHHhhhhhhhhhhhhhccccceeeeecCCCcEEEEeee
Confidence 1111112333444555566777776 8888877644 576654211 11122335555667888888763
No 242
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=40.43 E-value=3.9e+02 Score=26.66 Aligned_cols=170 Identities=14% Similarity=0.163 Sum_probs=0.0
Q ss_pred eeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCC---
Q 045821 147 TWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSAS--- 221 (352)
Q Consensus 147 ~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~--- 221 (352)
+|...-..+..++-+..+.- +..+.+..| +.+++||+..++--+.-.-.....++.+-..+|+.|..||.+
T Consensus 2 ~~~~~~r~~~~hci~d~afkPDGsqL~lAAg----~rlliyD~ndG~llqtLKgHKDtVycVAys~dGkrFASG~aDK~V 77 (1081)
T KOG1538|consen 2 RAVLTWRDKAEHCINDIAFKPDGTQLILAAG----SRLLVYDTSDGTLLQPLKGHKDTVYCVAYAKDGKRFASGSADKSV 77 (1081)
T ss_pred chhhhhhcccccchheeEECCCCceEEEecC----CEEEEEeCCCcccccccccccceEEEEEEccCCceeccCCCceeE
Q ss_pred -------------CCCCceEEEEEeCCC-----------CceeecCCCCCCCCCCcEEEE-----CCEEEEEecCCC---
Q 045821 222 -------------AATSHVCALVYEPST-----------DSWLHADANMASGWRGPAVVV-----DDALYVLDQSSG--- 269 (352)
Q Consensus 222 -------------~~~~~~~i~~yd~~~-----------~~W~~~~~~~~~~~~~~~~~~-----~~~l~~~gg~~~--- 269 (352)
+++.......|+|-+ .-|+.-...........-+.. +|.++++|-.+|
T Consensus 78 I~W~~klEG~LkYSH~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V~K~kss~R~~~CsWtnDGqylalG~~nGTIs 157 (1081)
T KOG1538|consen 78 IIWTSKLEGILKYSHNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSVSKHKSSSRIICCSWTNDGQYLALGMFNGTIS 157 (1081)
T ss_pred EEecccccceeeeccCCeeeEeecCchHHHhhhcchhhccccChhhhhHHhhhhheeEEEeeecCCCcEEEEeccCceEE
Q ss_pred ---------------------------------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCC
Q 045821 270 ---------------------------------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGK 304 (352)
Q Consensus 270 ---------------------------------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~ 304 (352)
..+..|..+...-.+-..+. ....+---..+|
T Consensus 158 iRNk~gEek~~I~Rpgg~Nspiwsi~~~p~sg~G~~di~aV~DW~qTLSFy~LsG~~Igk~r~L~---FdP~CisYf~NG 234 (1081)
T KOG1538|consen 158 IRNKNGEEKVKIERPGGSNSPIWSICWNPSSGEGRNDILAVADWGQTLSFYQLSGKQIGKDRALN---FDPCCISYFTNG 234 (1081)
T ss_pred eecCCCCcceEEeCCCCCCCCceEEEecCCCCCCccceEEEEeccceeEEEEecceeecccccCC---CCchhheeccCC
Q ss_pred EEEEEcCCceEEEEEcCCc
Q 045821 305 TIFVIGKGCSAVVIDVGNI 323 (352)
Q Consensus 305 ~i~v~gG~~~~~~~d~~~~ 323 (352)
+..++||.++..-+-+.+.
T Consensus 235 Ey~LiGGsdk~L~~fTR~G 253 (1081)
T KOG1538|consen 235 EYILLGGSDKQLSLFTRDG 253 (1081)
T ss_pred cEEEEccCCCceEEEeecC
No 243
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=40.25 E-value=98 Score=23.01 Aligned_cols=48 Identities=10% Similarity=0.082 Sum_probs=32.0
Q ss_pred eCCEEEEEcCCceEEEEE-----cCCcccccceEEeccCCCCCCCCcceeeEEe
Q 045821 302 IGKTIFVIGKGCSAVVID-----VGNIGNIGGIMVSSSIPKLNDNDDIISCKCL 350 (352)
Q Consensus 302 ~~~~i~v~gG~~~~~~~d-----~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (352)
+...||++.|....++=+ .+. .=.+-+-++..+|+++.|.+-.+|.|+
T Consensus 67 hEtaIYvlsG~ah~w~G~rLE~ha~~-~pGDf~YiPpgVPHqp~N~S~ep~s~v 119 (142)
T COG4101 67 HETAIYVLSGEAHTWYGNRLEEHAEV-GPGDFFYIPPGVPHQPANLSTEPLSAV 119 (142)
T ss_pred ccEEEEEEeceeeeeeccceeeeEEe-cCCCeEEcCCCCCCcccccCCCCeEEE
Confidence 355699999977665522 221 123345568999999999977777765
No 244
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=39.81 E-value=3e+02 Score=25.25 Aligned_cols=153 Identities=16% Similarity=0.175 Sum_probs=83.7
Q ss_pred ECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEE--ECCEEEEEeCCCCCCceEEEeCCCCcEEEe
Q 045821 118 LGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGV--LNQKIYCIGGLGDTHSWDVYDPRTNNWKLH 195 (352)
Q Consensus 118 ~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~--~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~ 195 (352)
.++++|+..+. .+.+.++|..+++-... .+..+..+..+. .++.+||.......+.+.+.|..+++=...
T Consensus 84 ~~~~vyv~~~~------~~~v~vid~~~~~~~~~--~~vG~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~ 155 (381)
T COG3391 84 AGNKVYVTTGD------SNTVSVIDTATNTVLGS--IPVGLGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTAT 155 (381)
T ss_pred CCCeEEEecCC------CCeEEEEcCcccceeeE--eeeccCCceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEE
Confidence 46779998653 56788888666653322 222223233332 356899987643238889999998864433
Q ss_pred cCCCCcccceeEEE-ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCC---CCCCCCCCc--EEE-ECCEEEEEecCC
Q 045821 196 TEPNIFTEIEDSFV-MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADA---NMASGWRGP--AVV-VDDALYVLDQSS 268 (352)
Q Consensus 196 ~~~~~~~~~~~~~~-~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~---~~~~~~~~~--~~~-~~~~l~~~gg~~ 268 (352)
.+-.... ...++. .++++|+.- ...+.+..+|.+++.=.. +. ......... .+. .+.++|+....+
T Consensus 156 ~~vG~~P-~~~a~~p~g~~vyv~~-----~~~~~v~vi~~~~~~v~~-~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~~ 228 (381)
T COG3391 156 IPVGNTP-TGVAVDPDGNKVYVTN-----SDDNTVSVIDTSGNSVVR-GSVGSLVGVGTGPAGIAVDPDGNRVYVANDGS 228 (381)
T ss_pred EecCCCc-ceEEECCCCCeEEEEe-----cCCCeEEEEeCCCcceec-cccccccccCCCCceEEECCCCCEEEEEeccC
Confidence 1111111 222222 345689884 223467888887664332 21 111111111 232 355688886655
Q ss_pred C-CeEEEEeCCCCceEEc
Q 045821 269 G-TKLMMWQKESREWSPV 285 (352)
Q Consensus 269 ~-~~v~~yd~~~~~W~~~ 285 (352)
. ..+.++|..++.-...
T Consensus 229 ~~~~v~~id~~~~~v~~~ 246 (381)
T COG3391 229 GSNNVLKIDTATGNVTAT 246 (381)
T ss_pred CCceEEEEeCCCceEEEe
Confidence 2 4788888888665444
No 245
>PRK01029 tolB translocation protein TolB; Provisional
Probab=39.71 E-value=3.2e+02 Score=25.56 Aligned_cols=185 Identities=8% Similarity=-0.015 Sum_probs=82.5
Q ss_pred CCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEE--EeCCCC---cEEEecCCCCcccceeEEE
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDV--YDPRTN---NWKLHTEPNIFTEIEDSFV 209 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~--yd~~t~---~W~~~~~~~~~~~~~~~~~ 209 (352)
...+|++|..+++-+.+...+..........-+.+|..........+++. ||..++ .=+++..............
T Consensus 210 ~~~I~~~~l~~g~~~~lt~~~g~~~~p~wSPDG~~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wS 289 (428)
T PRK01029 210 VPKIFLGSLENPAGKKILALQGNQLMPTFSPRKKLLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFS 289 (428)
T ss_pred CceEEEEECCCCCceEeecCCCCccceEECCCCCEEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEEC
Confidence 36899999998887777665443222211111335544432211133333 555442 1122221111111112333
Q ss_pred ECCE-EEEEccCCCCCCceEEEEEeCCC--CceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEE
Q 045821 210 MDGK-IYIRCSASAATSHVCALVYEPST--DSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSP 284 (352)
Q Consensus 210 ~~g~-iyv~GG~~~~~~~~~i~~yd~~~--~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~ 284 (352)
-||+ |+.. .... ....++.++... ..=+.+... ..........-+| .|++.....+ ..+++||+++++.+.
T Consensus 290 PDG~~Laf~-s~~~--g~~~ly~~~~~~~g~~~~~lt~~-~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~ 365 (428)
T PRK01029 290 PDGTRLVFV-SNKD--GRPRIYIMQIDPEGQSPRLLTKK-YRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQ 365 (428)
T ss_pred CCCCEEEEE-ECCC--CCceEEEEECcccccceEEeccC-CCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEE
Confidence 4665 4433 2211 123566666532 211222211 1111122233344 4555543332 489999999999887
Q ss_pred ccCCCCCcCCCCceEEEeCCEEEEEc---CCceEEEEEcCCccc
Q 045821 285 VGRLSTLLTRPPCKLVAIGKTIFVIG---KGCSAVVIDVGNIGN 325 (352)
Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~i~v~g---G~~~~~~~d~~~~~w 325 (352)
+...+. ..........+..|+... |....+++|+.....
T Consensus 366 Lt~~~~--~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~ 407 (428)
T PRK01029 366 LTTSPE--NKESPSWAIDSLHLVYSAGNSNESELYLISLITKKT 407 (428)
T ss_pred ccCCCC--CccceEECCCCCEEEEEECCCCCceEEEEECCCCCE
Confidence 764321 111222222334455443 334566678766543
No 246
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=39.60 E-value=2.7e+02 Score=24.63 Aligned_cols=147 Identities=14% Similarity=0.146 Sum_probs=75.7
Q ss_pred CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC
Q 045821 180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD 259 (352)
Q Consensus 180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~ 259 (352)
..+..||..+..-.+.-....-..............+.+. . ..-..|...+..+|+--+.=......-....+.-.+
T Consensus 36 Dsl~LYd~~~g~~~~ti~skkyG~~~~~Fth~~~~~i~sS-t--k~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~ 112 (311)
T KOG1446|consen 36 DSLRLYDSLSGKQVKTINSKKYGVDLACFTHHSNTVIHSS-T--KEDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKD 112 (311)
T ss_pred CeEEEEEcCCCceeeEeecccccccEEEEecCCceEEEcc-C--CCCCceEEEEeecCceEEEcCCCCceEEEEEecCCC
Confidence 6789999998875544323322222233444444444422 2 111345555666666544311111111111222345
Q ss_pred EEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc-CCceEEEEEcCCcccccceEEeccCC
Q 045821 260 ALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG-KGCSAVVIDVGNIGNIGGIMVSSSIP 336 (352)
Q Consensus 260 ~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g-G~~~~~~~d~~~~~w~~~~~~~~~~~ 336 (352)
..++-++.+. .+..||+....-.-+-.+.. + ..+..-..|-|+.++ |.+.+-+||..... ++.+.+..++
T Consensus 113 d~FlS~S~D~-tvrLWDlR~~~cqg~l~~~~---~-pi~AfDp~GLifA~~~~~~~IkLyD~Rs~d--kgPF~tf~i~ 183 (311)
T KOG1446|consen 113 DTFLSSSLDK-TVRLWDLRVKKCQGLLNLSG---R-PIAAFDPEGLIFALANGSELIKLYDLRSFD--KGPFTTFSIT 183 (311)
T ss_pred CeEEecccCC-eEEeeEecCCCCceEEecCC---C-cceeECCCCcEEEEecCCCeEEEEEecccC--CCCceeEccC
Confidence 7777777665 88999988665544432221 1 122222345566555 34356678887654 5667766666
No 247
>PF13088 BNR_2: BNR repeat-like domain; PDB: 2F11_A 2F0Z_A 1VCU_B 2F25_B 1SO7_A 2F29_A 1SNT_A 2F13_A 2F28_A 2F27_A ....
Probab=39.06 E-value=2.5e+02 Score=24.02 Aligned_cols=143 Identities=13% Similarity=0.088 Sum_probs=70.2
Q ss_pred CCEEEEEEecCCCC-ceeEEEe-cCCCCCCCeeEeccCCCcccCCcceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECC
Q 045821 68 ETWIYAFCRDNKLE-RVCCYVL-DPNSTRRSWKMIHELPARTLRRKGMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDAS 144 (352)
Q Consensus 68 ~~~l~~~gg~~~~~-~~~~~~~-d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~ 144 (352)
++.+++..-..... ......+ +- ...+|+.....+.. ......+++.. ++.|+++--.... . ...+.+...
T Consensus 118 ~G~l~~~~~~~~~~~~~~~~~~S~D--~G~tW~~~~~~~~~-~~~~e~~~~~~~dG~l~~~~R~~~~-~--~~~~~~S~D 191 (275)
T PF13088_consen 118 DGRLIAPYYHESGGSFSAFVYYSDD--GGKTWSSGSPIPDG-QGECEPSIVELPDGRLLAVFRTEGN-D--DIYISRSTD 191 (275)
T ss_dssp TTEEEEEEEEESSCEEEEEEEEESS--TTSSEEEEEECECS-EEEEEEEEEEETTSEEEEEEEECSS-T--EEEEEEESS
T ss_pred CCCEEEEEeeccccCcceEEEEeCC--CCceeecccccccc-CCcceeEEEECCCCcEEEEEEccCC-C--cEEEEEECC
Confidence 56666653222112 2223333 33 56789888766310 11222333333 6678877543211 1 333444555
Q ss_pred C-CeeEecC--CCCCcceeeeeEEE-CCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcc----cceeEEEE-CC
Q 045821 145 M-NTWTDAA--PMSTARCYFPCGVL-NQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFT----EIEDSFVM-DG 212 (352)
Q Consensus 145 t-~~W~~~~--~~~~~r~~~~~~~~-~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~----~~~~~~~~-~g 212 (352)
. .+|.... .+|.+........+ +++++++...... ..+..-.....+|.....+.... .+..++.. +|
T Consensus 192 ~G~TWs~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~~l~l~~S~D~g~tW~~~~~i~~~~~~~~~Y~~~~~~~dg 271 (275)
T PF13088_consen 192 GGRTWSPPQPTNLPNPNSSISLVRLSDGRLLLVYNNPDGRSNLSLYVSEDGGKTWSRPKTIDDGPNGDSGYPSLTQLPDG 271 (275)
T ss_dssp TTSS-EEEEEEECSSCCEEEEEEECTTSEEEEEEECSSTSEEEEEEEECTTCEEEEEEEEEEEEE-CCEEEEEEEEEETT
T ss_pred CCCcCCCceecccCcccCCceEEEcCCCCEEEEEECCCCCCceEEEEEeCCCCcCCccEEEeCCCCCcEECCeeEEeCCC
Confidence 3 3598754 34555555555554 5688877773222 22222233366898765433322 44455555 67
Q ss_pred EEEE
Q 045821 213 KIYI 216 (352)
Q Consensus 213 ~iyv 216 (352)
+|||
T Consensus 272 ~l~i 275 (275)
T PF13088_consen 272 KLYI 275 (275)
T ss_dssp EEEE
T ss_pred cCCC
Confidence 9886
No 248
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=38.64 E-value=4.4e+02 Score=26.80 Aligned_cols=95 Identities=15% Similarity=0.163 Sum_probs=51.4
Q ss_pred EEEEEeCCCCceeecCC-CCCCCCCCcE-EEE---CCEEEEEecCCC-CeEEEE------eCCCCceEEccCCCCCcCCC
Q 045821 228 CALVYEPSTDSWLHADA-NMASGWRGPA-VVV---DDALYVLDQSSG-TKLMMW------QKESREWSPVGRLSTLLTRP 295 (352)
Q Consensus 228 ~i~~yd~~~~~W~~~~~-~~~~~~~~~~-~~~---~~~l~~~gg~~~-~~v~~y------d~~~~~W~~~~~~~~~~~~~ 295 (352)
..|.|++...+|..... ..|+.-...+ +.. ....++..|.++ -.+|++ ++....|+..+--.-.....
T Consensus 433 KFW~~n~~~kt~~L~T~I~~PH~~~~vat~~~~~~rs~~~vta~~dg~~KiW~~~~~~n~~k~~s~W~c~~i~sy~k~~i 512 (792)
T KOG1963|consen 433 KFWQYNPNSKTFILNTKINNPHGNAFVATIFLNPTRSVRCVTASVDGDFKIWVFTDDSNIYKKSSNWTCKAIGSYHKTPI 512 (792)
T ss_pred EEEEEcCCcceeEEEEEEecCCCceeEEEEEecCcccceeEEeccCCeEEEEEEecccccCcCccceEEeeeeccccCcc
Confidence 78999999999976533 1222222222 111 111444444443 367887 66677898776322111111
Q ss_pred CceEEEeCCEEEEEcCCceEEEEEcCC
Q 045821 296 PCKLVAIGKTIFVIGKGCSAVVIDVGN 322 (352)
Q Consensus 296 ~~~~~~~~~~i~v~gG~~~~~~~d~~~ 322 (352)
...+..-+|+++..|-.+...++|..+
T Consensus 513 ~a~~fs~dGslla~s~~~~Itiwd~~~ 539 (792)
T KOG1963|consen 513 TALCFSQDGSLLAVSFDDTITIWDYDT 539 (792)
T ss_pred cchhhcCCCcEEEEecCCEEEEecCCC
Confidence 223334466777777666666777766
No 249
>PF13013 F-box-like_2: F-box-like domain
Probab=37.83 E-value=32 Score=25.27 Aligned_cols=29 Identities=10% Similarity=0.079 Sum_probs=23.8
Q ss_pred cCCCcHHHHHHhhccCccccchhhhhhhh
Q 045821 18 ISGLPDDIALFCLARVPRRYHAVLKCVSS 46 (352)
Q Consensus 18 ~~~lp~dl~~~il~r~p~~~~~~l~~v~k 46 (352)
+.+||+||...|+..=....+..+...++
T Consensus 22 l~DLP~ELl~~I~~~C~~~~l~~l~~~~~ 50 (109)
T PF13013_consen 22 LLDLPWELLQLIFDYCNDPILLALSRTCR 50 (109)
T ss_pred hhhChHHHHHHHHhhcCcHHHHHHHHHHH
Confidence 77899999999999888887766666655
No 250
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=37.27 E-value=3.4e+02 Score=25.95 Aligned_cols=34 Identities=12% Similarity=0.250 Sum_probs=26.3
Q ss_pred EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCC
Q 045821 256 VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLST 290 (352)
Q Consensus 256 ~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~ 290 (352)
.-+++-+++||+.. .+.+||...-+=.....++.
T Consensus 474 ~pdgrtLivGGeas-tlsiWDLAapTprikaelts 507 (705)
T KOG0639|consen 474 LPDGRTLIVGGEAS-TLSIWDLAAPTPRIKAELTS 507 (705)
T ss_pred cCCCceEEeccccc-eeeeeeccCCCcchhhhcCC
Confidence 34888899999854 89999999887766666654
No 251
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=36.93 E-value=2.5e+02 Score=23.53 Aligned_cols=116 Identities=15% Similarity=0.146 Sum_probs=69.7
Q ss_pred EEEECCEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEE
Q 045821 207 SFVMDGKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSP 284 (352)
Q Consensus 207 ~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~ 284 (352)
....+|+||...|..+ ...+.++|..+.+ |++--+ ++..+.-.....++.+|.+-=..+ .-+.||+. +.++
T Consensus 51 L~~~~g~i~esTG~yg---~S~ir~~~L~~gq~~~s~~l~-~~~~FgEGit~~gd~~y~LTw~eg-vaf~~d~~--t~~~ 123 (262)
T COG3823 51 LEYLDGHILESTGLYG---FSKIRVSDLTTGQEIFSEKLA-PDTVFGEGITKLGDYFYQLTWKEG-VAFKYDAD--TLEE 123 (262)
T ss_pred eeeeCCEEEEeccccc---cceeEEEeccCceEEEEeecC-CccccccceeeccceEEEEEeccc-eeEEEChH--Hhhh
Confidence 3455788888766543 3468888887554 554322 233344445678899998865443 56778764 4566
Q ss_pred ccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcccccceEE
Q 045821 285 VGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIGGIMV 331 (352)
Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~~~~~ 331 (352)
++..+- ...+-+++.-+..|.+-.|.....+-||+...=.+..++
T Consensus 124 lg~~~y--~GeGWgLt~d~~~LimsdGsatL~frdP~tfa~~~~v~V 168 (262)
T COG3823 124 LGRFSY--EGEGWGLTSDDKNLIMSDGSATLQFRDPKTFAELDTVQV 168 (262)
T ss_pred hccccc--CCcceeeecCCcceEeeCCceEEEecCHHHhhhcceEEE
Confidence 665543 244456666666666666766555667766654443333
No 252
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=36.78 E-value=2.9e+02 Score=24.08 Aligned_cols=113 Identities=13% Similarity=0.138 Sum_probs=57.0
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCC--------cEEEecCCCCcccce-eEEEECCEEEEEccCCCC-CCceEEEEEeCCC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTN--------NWKLHTEPNIFTEIE-DSFVMDGKIYIRCSASAA-TSHVCALVYEPST 236 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~--------~W~~~~~~~~~~~~~-~~~~~~g~iyv~GG~~~~-~~~~~i~~yd~~~ 236 (352)
+..+++..|+.. ..|..||..++ .|..+.+ ...+... ..+.++ ...- +|..+. .....++.|+-++
T Consensus 163 ~s~~lllaGyEs-ghvv~wd~S~~~~~~~~~~~~kv~~~-~ash~qpvlsldya-s~~~-rGisgga~dkl~~~Sl~~s~ 238 (323)
T KOG0322|consen 163 GSTFLLLAGYES-GHVVIWDLSTGDKIIQLPQSSKVESP-NASHKQPVLSLDYA-SSCD-RGISGGADDKLVMYSLNHST 238 (323)
T ss_pred cceEEEEEeccC-CeEEEEEccCCceeeccccccccccc-hhhccCcceeeeec-hhhc-CCcCCCccccceeeeecccc
Confidence 457888888877 78899999886 2322221 1111111 112222 1111 122211 1112455555444
Q ss_pred CceeecCC--CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821 237 DSWLHADA--NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLS 289 (352)
Q Consensus 237 ~~W~~~~~--~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~ 289 (352)
++-+.-.. ...+.-.+..+-.++||+...|.++ .+-+|+ |+.+..+.
T Consensus 239 gslq~~~e~~lknpGv~gvrIRpD~KIlATAGWD~-RiRVys-----wrtl~pLA 287 (323)
T KOG0322|consen 239 GSLQIRKEITLKNPGVSGVRIRPDGKILATAGWDH-RIRVYS-----WRTLNPLA 287 (323)
T ss_pred CcccccceEEecCCCccceEEccCCcEEeecccCC-cEEEEE-----eccCCchh
Confidence 43321111 1112223334667999999999988 788876 55665544
No 253
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=36.63 E-value=1.1e+02 Score=28.94 Aligned_cols=101 Identities=11% Similarity=0.216 Sum_probs=50.9
Q ss_pred CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC--cccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCC
Q 045821 167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI--FTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADA 244 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~--~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~ 244 (352)
|++-.++||+. ..+-++|..+-+=+.-..++. ..++..++.-|.++......++ .|.+||+.+.+-..-
T Consensus 476 dgrtLivGGea--stlsiWDLAapTprikaeltssapaCyALa~spDakvcFsccsdG-----nI~vwDLhnq~~Vrq-- 546 (705)
T KOG0639|consen 476 DGRTLIVGGEA--STLSIWDLAAPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDG-----NIAVWDLHNQTLVRQ-- 546 (705)
T ss_pred CCceEEecccc--ceeeeeeccCCCcchhhhcCCcchhhhhhhcCCccceeeeeccCC-----cEEEEEcccceeeec--
Confidence 78888889975 667777777655332222333 2233334445666655533222 366777766543221
Q ss_pred CCCCCCCCcE---EEECCEEEEEecCCCCeEEEEeCC
Q 045821 245 NMASGWRGPA---VVVDDALYVLDQSSGTKLMMWQKE 278 (352)
Q Consensus 245 ~~~~~~~~~~---~~~~~~l~~~gg~~~~~v~~yd~~ 278 (352)
.+...-++. +..+|.=+..||.++ .+-+||..
T Consensus 547 -fqGhtDGascIdis~dGtklWTGGlDn-tvRcWDlr 581 (705)
T KOG0639|consen 547 -FQGHTDGASCIDISKDGTKLWTGGLDN-TVRCWDLR 581 (705)
T ss_pred -ccCCCCCceeEEecCCCceeecCCCcc-ceeehhhh
Confidence 111111222 222454455677665 44444443
No 254
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=35.52 E-value=3e+02 Score=23.95 Aligned_cols=71 Identities=14% Similarity=0.191 Sum_probs=42.9
Q ss_pred EEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC-
Q 045821 115 FEVLG-KNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN- 190 (352)
Q Consensus 115 ~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~- 190 (352)
+++++ .+-||+=| .....+...|+.++. |+.+- ..|...++.++++. .|+|-+. ..++..+.+|+
T Consensus 16 LVV~~dskT~v~ig-----SHs~~~~avd~~sG~~~We~il---g~RiE~sa~vvgdf-VV~GCy~--g~lYfl~~~tGs 84 (354)
T KOG4649|consen 16 LVVCNDSKTLVVIG-----SHSGIVIAVDPQSGNLIWEAIL---GVRIECSAIVVGDF-VVLGCYS--GGLYFLCVKTGS 84 (354)
T ss_pred EEEecCCceEEEEe-----cCCceEEEecCCCCcEEeehhh---CceeeeeeEEECCE-EEEEEcc--CcEEEEEecchh
Confidence 34443 34444444 334567788999886 77654 24666666777776 4556554 45666777766
Q ss_pred -cEEEec
Q 045821 191 -NWKLHT 196 (352)
Q Consensus 191 -~W~~~~ 196 (352)
-|....
T Consensus 85 ~~w~f~~ 91 (354)
T KOG4649|consen 85 QIWNFVI 91 (354)
T ss_pred heeeeee
Confidence 476543
No 255
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=35.20 E-value=3.3e+02 Score=24.28 Aligned_cols=205 Identities=15% Similarity=0.072 Sum_probs=96.4
Q ss_pred ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEE--CCEEEEEeCCCCCCCCCCeEEEEECCCCe-eEecCCC----
Q 045821 82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL--GKNAYLLGGCGWSEDATSEVYCYDASMNT-WTDAAPM---- 154 (352)
Q Consensus 82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~--~~~iyv~GG~~~~~~~~~~~~~~d~~t~~-W~~~~~~---- 154 (352)
...++.||+ .++.-+...... .. .+...+ ++.|.... ..+..+++.+.. ++.+...
T Consensus 46 ~~~i~r~~~--~~g~~~~~~~p~----~~--~~~~~~d~~g~Lv~~~---------~g~~~~~~~~~~~~t~~~~~~~~~ 108 (307)
T COG3386 46 GGRIHRLDP--ETGKKRVFPSPG----GF--SSGALIDAGGRLIACE---------HGVRLLDPDTGGKITLLAEPEDGL 108 (307)
T ss_pred CCeEEEecC--CcCceEEEECCC----Cc--ccceeecCCCeEEEEc---------cccEEEeccCCceeEEeccccCCC
Confidence 346777788 656544444322 11 223333 33444332 234556665443 3444432
Q ss_pred CCcceeeeeEEECCEEEEEeCC----CCC-----CceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCC
Q 045821 155 STARCYFPCGVLNQKIYCIGGL----GDT-----HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAAT 224 (352)
Q Consensus 155 ~~~r~~~~~~~~~~~iyv~gG~----~~~-----~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~ 224 (352)
+.-|..-..+--+|.+|+-.-. ... ..+++||+. ....++-.-........+..-++ .+|+.- .
T Consensus 109 ~~~r~ND~~v~pdG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~-g~~~~l~~~~~~~~NGla~SpDg~tly~aD-----T 182 (307)
T COG3386 109 PLNRPNDGVVDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPD-GGVVRLLDDDLTIPNGLAFSPDGKTLYVAD-----T 182 (307)
T ss_pred CcCCCCceeEcCCCCEEEeCCCccccCccccCCcceEEEEcCC-CCEEEeecCcEEecCceEECCCCCEEEEEe-----C
Confidence 2223333444445666664332 011 678999984 55555432222333333444466 677762 2
Q ss_pred CceEEEEEeCCC------Cc--eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCC
Q 045821 225 SHVCALVYEPST------DS--WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPP 296 (352)
Q Consensus 225 ~~~~i~~yd~~~------~~--W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~ 296 (352)
....+++|+... ++ +..... .+..-=+..+-.+|.||+.....+..|.+|+++...-.++ .+|.. .-..
T Consensus 183 ~~~~i~r~~~d~~~g~~~~~~~~~~~~~-~~G~PDG~~vDadG~lw~~a~~~g~~v~~~~pdG~l~~~i-~lP~~-~~t~ 259 (307)
T COG3386 183 PANRIHRYDLDPATGPIGGRRGFVDFDE-EPGLPDGMAVDADGNLWVAAVWGGGRVVRFNPDGKLLGEI-KLPVK-RPTN 259 (307)
T ss_pred CCCeEEEEecCcccCccCCcceEEEccC-CCCCCCceEEeCCCCEEEecccCCceEEEECCCCcEEEEE-ECCCC-CCcc
Confidence 224677776542 11 111111 0011112234457888875554445899999994444433 45531 1111
Q ss_pred ceEEE-eCCEEEEEcCC
Q 045821 297 CKLVA-IGKTIFVIGKG 312 (352)
Q Consensus 297 ~~~~~-~~~~i~v~gG~ 312 (352)
+++.. -.+.|||..-.
T Consensus 260 ~~FgG~~~~~L~iTs~~ 276 (307)
T COG3386 260 PAFGGPDLNTLYITSAR 276 (307)
T ss_pred ceEeCCCcCEEEEEecC
Confidence 11111 13678887643
No 256
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=34.75 E-value=2.5e+02 Score=22.84 Aligned_cols=68 Identities=16% Similarity=0.186 Sum_probs=37.8
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCCCcEEEec
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRTNNWKLHT 196 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t~~W~~~~ 196 (352)
++++.++-|. ....+..||.....-..+ +........-.-+|+..+++|.... ..+..||..+ .+.+.
T Consensus 71 g~~favi~g~-----~~~~v~lyd~~~~~i~~~---~~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~--~~~i~ 139 (194)
T PF08662_consen 71 GNEFAVIYGS-----MPAKVTLYDVKGKKIFSF---GTQPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVRK--KKKIS 139 (194)
T ss_pred CCEEEEEEcc-----CCcccEEEcCcccEeEee---cCCCceEEEECCCCCEEEEEEccCCCcEEEEEECCC--CEEee
Confidence 5566666552 123788899864333333 2211111111126788888886543 6789999984 44454
No 257
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=34.48 E-value=3.3e+02 Score=24.16 Aligned_cols=112 Identities=13% Similarity=0.141 Sum_probs=66.2
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
++.|++.-+ +.....-+|..|. .++.-+.+..-|. .. .....+..+| += +.-......+.++|..+++
T Consensus 117 ~D~LLlAR~-DGh~nLGvy~ldr--~~g~~~~L~~~ps---~K---G~~~~D~a~F--~i-~~~~~g~~~i~~~Dli~~~ 184 (339)
T PF09910_consen 117 EDRLLLARA-DGHANLGVYSLDR--RTGKAEKLSSNPS---LK---GTLVHDYACF--GI-NNFHKGVSGIHCLDLISGK 184 (339)
T ss_pred cCEEEEEec-CCcceeeeEEEcc--cCCceeeccCCCC---cC---ceEeeeeEEE--ec-cccccCCceEEEEEccCCe
Confidence 677776533 3445666888898 8998888877662 11 1222232333 22 3333567889999999999
Q ss_pred e--EecCCCCC-------cceeeeeEEECCEEEEEeCCCCCCceEEEeCCC-CcEEEe
Q 045821 148 W--TDAAPMST-------ARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRT-NNWKLH 195 (352)
Q Consensus 148 W--~~~~~~~~-------~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t-~~W~~~ 195 (352)
| +..+.-.. -+..-.++...+++|++-+ ..+.+.||.. +..+..
T Consensus 185 ~~~e~f~~~~s~Dg~~~~~~~~G~~~s~ynR~faF~r----GGi~vgnP~~~e~~~f~ 238 (339)
T PF09910_consen 185 WVIESFDVSLSVDGGPVIRPELGAMASAYNRLFAFVR----GGIFVGNPYNGEEFRFY 238 (339)
T ss_pred EEEEecccccCCCCCceEeeccccEEEEeeeEEEEEe----ccEEEeCCCCCCceeEE
Confidence 9 44432111 1223345556677777654 3478899984 344333
No 258
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=34.09 E-value=4.3e+02 Score=25.32 Aligned_cols=98 Identities=13% Similarity=0.229 Sum_probs=55.2
Q ss_pred CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCC-CCc--EEEECCEEEEEecCCCCeEEEEeCCCCceEEccC
Q 045821 211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGW-RGP--AVVVDDALYVLDQSSGTKLMMWQKESREWSPVGR 287 (352)
Q Consensus 211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~-~~~--~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~ 287 (352)
+..+.+..|.. .++.+++..+.-.+-......+. ... .+..+|.=.+.|...+ .|.+|++.++..++...
T Consensus 212 d~nliit~Gk~------H~~Fw~~~~~~l~k~~~~fek~ekk~Vl~v~F~engdviTgDS~G-~i~Iw~~~~~~~~k~~~ 284 (626)
T KOG2106|consen 212 DPNLIITCGKG------HLYFWTLRGGSLVKRQGIFEKREKKFVLCVTFLENGDVITGDSGG-NILIWSKGTNRISKQVH 284 (626)
T ss_pred CCcEEEEeCCc------eEEEEEccCCceEEEeeccccccceEEEEEEEcCCCCEEeecCCc-eEEEEeCCCceEEeEee
Confidence 44555554543 37777888887765543222221 111 1233444445554444 99999999888776543
Q ss_pred CCCCcCCCCceEEEe-CCEEEEEcCCceEEE-EE
Q 045821 288 LSTLLTRPPCKLVAI-GKTIFVIGKGCSAVV-ID 319 (352)
Q Consensus 288 ~~~~~~~~~~~~~~~-~~~i~v~gG~~~~~~-~d 319 (352)
. ....-++++.+ +++|+- ||.+...+ .|
T Consensus 285 -a--H~ggv~~L~~lr~GtllS-GgKDRki~~Wd 314 (626)
T KOG2106|consen 285 -A--HDGGVFSLCMLRDGTLLS-GGKDRKIILWD 314 (626)
T ss_pred -e--cCCceEEEEEecCccEee-cCccceEEecc
Confidence 1 23445666666 455666 88775443 44
No 259
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=33.79 E-value=3.5e+02 Score=24.25 Aligned_cols=33 Identities=21% Similarity=0.407 Sum_probs=28.9
Q ss_pred ECCEEEEEecCCCCeEEEEeCCCCceEEccCCCC
Q 045821 257 VDDALYVLDQSSGTKLMMWQKESREWSPVGRLST 290 (352)
Q Consensus 257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~ 290 (352)
.+|+||+..-..+ .+..+|+++++.+.+..+|.
T Consensus 211 hdgrLwvldsgtG-ev~~vD~~~G~~e~Va~vpG 243 (335)
T TIGR03032 211 YQGKLWLLNSGRG-ELGYVDPQAGKFQPVAFLPG 243 (335)
T ss_pred eCCeEEEEECCCC-EEEEEcCCCCcEEEEEECCC
Confidence 4899999987666 99999999999999988876
No 260
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=32.62 E-value=4.3e+02 Score=24.85 Aligned_cols=141 Identities=9% Similarity=0.080 Sum_probs=69.5
Q ss_pred EEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee-eeeEEEC--CEEEEEeCCCCCCceEEEeCCCCcE
Q 045821 116 EVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY-FPCGVLN--QKIYCIGGLGDTHSWDVYDPRTNNW 192 (352)
Q Consensus 116 ~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~-~~~~~~~--~~iyv~gG~~~~~~v~~yd~~t~~W 192 (352)
+..+++.+++||.+ ..+.++|+.|.+= +..++.-|-. .+.+... +.+|..+- + .++-+|+......
T Consensus 210 vS~Dgkylatgg~d------~~v~Iw~~~t~eh--v~~~~ghr~~V~~L~fr~gt~~lys~s~-D--rsvkvw~~~~~s~ 278 (479)
T KOG0299|consen 210 VSSDGKYLATGGRD------RHVQIWDCDTLEH--VKVFKGHRGAVSSLAFRKGTSELYSASA-D--RSVKVWSIDQLSY 278 (479)
T ss_pred EcCCCcEEEecCCC------ceEEEecCcccch--hhcccccccceeeeeeecCccceeeeec-C--CceEEEehhHhHH
Confidence 33477888888853 4467888887642 2222222221 2222222 35666442 1 4455555544332
Q ss_pred EEecCCCCcccceeE------EEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEec
Q 045821 193 KLHTEPNIFTEIEDS------FVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQ 266 (352)
Q Consensus 193 ~~~~~~~~~~~~~~~------~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg 266 (352)
+. .-++|+. +.-.++...+||.+. .+..|+... .-+.+-.. .....-.+++++..-|+.|+
T Consensus 279 --ve----tlyGHqd~v~~IdaL~reR~vtVGgrDr-----T~rlwKi~e-esqlifrg-~~~sidcv~~In~~HfvsGS 345 (479)
T KOG0299|consen 279 --VE----TLYGHQDGVLGIDALSRERCVTVGGRDR-----TVRLWKIPE-ESQLIFRG-GEGSIDCVAFINDEHFVSGS 345 (479)
T ss_pred --HH----HHhCCccceeeechhcccceEEeccccc-----eeEEEeccc-cceeeeeC-CCCCeeeEEEecccceeecc
Confidence 21 1222221 122467777877664 344444411 11111110 01111224567888899998
Q ss_pred CCCCeEEEEeCCCCc
Q 045821 267 SSGTKLMMWQKESRE 281 (352)
Q Consensus 267 ~~~~~v~~yd~~~~~ 281 (352)
.+| .|..++.-.+.
T Consensus 346 dnG-~IaLWs~~KKk 359 (479)
T KOG0299|consen 346 DNG-SIALWSLLKKK 359 (479)
T ss_pred CCc-eEEEeeecccC
Confidence 877 88888776654
No 261
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=32.02 E-value=3.2e+02 Score=26.94 Aligned_cols=95 Identities=7% Similarity=0.125 Sum_probs=48.6
Q ss_pred CceEEEeCCCCc-EEEecCCCCcccceeEEEE---CCEEEEEccCCCCCCceEEEEEeCCCCce------eecCC--CC-
Q 045821 180 HSWDVYDPRTNN-WKLHTEPNIFTEIEDSFVM---DGKIYIRCSASAATSHVCALVYEPSTDSW------LHADA--NM- 246 (352)
Q Consensus 180 ~~v~~yd~~t~~-W~~~~~~~~~~~~~~~~~~---~g~iyv~GG~~~~~~~~~i~~yd~~~~~W------~~~~~--~~- 246 (352)
..|-++++..+- |- +..+.....+..+.++ ++.+++.||.+. .|..||.++..= ..++. ..
T Consensus 95 tTVK~W~~~~~~~~c-~stir~H~DYVkcla~~ak~~~lvaSgGLD~-----~IflWDin~~~~~l~~s~n~~t~~sl~s 168 (735)
T KOG0308|consen 95 TTVKVWNAHKDNTFC-MSTIRTHKDYVKCLAYIAKNNELVASGGLDR-----KIFLWDINTGTATLVASFNNVTVNSLGS 168 (735)
T ss_pred ceEEEeecccCcchh-HhhhhcccchheeeeecccCceeEEecCCCc-----cEEEEEccCcchhhhhhccccccccCCC
Confidence 677777777653 31 1112333333333333 677888887764 466666664422 22211 11
Q ss_pred CCC--CCCcEEEECCEEEEEecCCCCeEEEEeCCCCc
Q 045821 247 ASG--WRGPAVVVDDALYVLDQSSGTKLMMWQKESRE 281 (352)
Q Consensus 247 ~~~--~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~ 281 (352)
.+. .+..+....+.+++-||..+ .+..||+.+..
T Consensus 169 G~k~siYSLA~N~t~t~ivsGgtek-~lr~wDprt~~ 204 (735)
T KOG0308|consen 169 GPKDSIYSLAMNQTGTIIVSGGTEK-DLRLWDPRTCK 204 (735)
T ss_pred CCccceeeeecCCcceEEEecCccc-ceEEecccccc
Confidence 111 11112223344666677665 88999998743
No 262
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=31.79 E-value=2.1e+02 Score=24.53 Aligned_cols=59 Identities=12% Similarity=0.119 Sum_probs=33.0
Q ss_pred CCEEEEEecCCCCeEEEEeCCCCceEEccCCC------CCcCCCCceEEE-eCCEEEEEcCCceEEEE
Q 045821 258 DDALYVLDQSSGTKLMMWQKESREWSPVGRLS------TLLTRPPCKLVA-IGKTIFVIGKGCSAVVI 318 (352)
Q Consensus 258 ~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~------~~~~~~~~~~~~-~~~~i~v~gG~~~~~~~ 318 (352)
.+.||++...+. .+.++|.+.+.-..+.-.. .. ..+.-+++. -+|.|||.+.-+-++.|
T Consensus 182 t~~lliLS~es~-~l~~~d~~G~~~~~~~L~~g~~gl~~~-~~QpEGIa~d~~G~LYIvsEpNlfy~f 247 (248)
T PF06977_consen 182 TGHLLILSDESR-LLLELDRQGRVVSSLSLDRGFHGLSKD-IPQPEGIAFDPDGNLYIVSEPNLFYRF 247 (248)
T ss_dssp TTEEEEEETTTT-EEEEE-TT--EEEEEE-STTGGG-SS----SEEEEEE-TT--EEEEETTTEEEEE
T ss_pred CCeEEEEECCCC-eEEEECCCCCEEEEEEeCCcccCcccc-cCCccEEEECCCCCEEEEcCCceEEEe
Confidence 678999987665 8999997766554443211 11 222334443 47899999987666665
No 263
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=31.50 E-value=3.7e+02 Score=23.82 Aligned_cols=96 Identities=16% Similarity=0.205 Sum_probs=46.5
Q ss_pred CCEEEEEeCCCCCCceEE---EeCCCCcEEEecCCCCcccceeEEEE------C-CEEEEEccCCCCCCceEEEEEeCCC
Q 045821 167 NQKIYCIGGLGDTHSWDV---YDPRTNNWKLHTEPNIFTEIEDSFVM------D-GKIYIRCSASAATSHVCALVYEPST 236 (352)
Q Consensus 167 ~~~iyv~gG~~~~~~v~~---yd~~t~~W~~~~~~~~~~~~~~~~~~------~-g~iyv~GG~~~~~~~~~i~~yd~~~ 236 (352)
+|..++.||++. .+.. |.-..|-|..- .|..+++ + +.|+-. |.+. .+..+|..+
T Consensus 58 ~gs~~aSgG~Dr--~I~LWnv~gdceN~~~lk--------gHsgAVM~l~~~~d~s~i~S~-gtDk-----~v~~wD~~t 121 (338)
T KOG0265|consen 58 DGSCFASGGSDR--AIVLWNVYGDCENFWVLK--------GHSGAVMELHGMRDGSHILSC-GTDK-----TVRGWDAET 121 (338)
T ss_pred CCCeEeecCCcc--eEEEEeccccccceeeec--------cccceeEeeeeccCCCEEEEe-cCCc-----eEEEEeccc
Confidence 566777777763 3333 55556667532 2222222 3 345555 4442 578888887
Q ss_pred CceeecCCCCCCCCCCcE--EEECCEEEEEecCCCCeEEEEeCCCC
Q 045821 237 DSWLHADANMASGWRGPA--VVVDDALYVLDQSSGTKLMMWQKESR 280 (352)
Q Consensus 237 ~~W~~~~~~~~~~~~~~~--~~~~~~l~~~gg~~~~~v~~yd~~~~ 280 (352)
++-..--. .....-.+ ..--|..++..|.....+-+||..++
T Consensus 122 G~~~rk~k--~h~~~vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~k 165 (338)
T KOG0265|consen 122 GKRIRKHK--GHTSFVNSLDPSRRGPQLVCSGSDDGTLKLWDIRKK 165 (338)
T ss_pred ceeeehhc--cccceeeecCccccCCeEEEecCCCceEEEEeeccc
Confidence 76443211 00000001 11123333444444448888988754
No 264
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=30.88 E-value=3.4e+02 Score=23.17 Aligned_cols=83 Identities=11% Similarity=0.069 Sum_probs=42.0
Q ss_pred EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCc--------CCCCceEEEeCCEEEEEcCCceEEEEEcCCccccc
Q 045821 256 VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLL--------TRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIG 327 (352)
Q Consensus 256 ~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~--------~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~ 327 (352)
..++.|++.... ....+|+.+.+=+.+..+.... ......+..+++.-+++...+-....|.......+
T Consensus 146 ~~~~~i~v~~~~---~f~~idl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~Ll~~~~~g~fv~~~G~~~r~ 222 (275)
T PF00780_consen 146 FLGNKICVGTSK---GFYLIDLNTGSPSELLDPSDSSSSFKSRNSSSKPLGIFQLSDNEFLLCYDNIGVFVNKNGEPSRK 222 (275)
T ss_pred EeCCEEEEEeCC---ceEEEecCCCCceEEeCccCCcchhhhcccCCCceEEEEeCCceEEEEecceEEEEcCCCCcCcc
Confidence 345555544322 6788888877666665433211 11123444555334444333333345555544445
Q ss_pred ceEEeccCCCCCCC
Q 045821 328 GIMVSSSIPKLNDN 341 (352)
Q Consensus 328 ~~~~~~~~~~~~~~ 341 (352)
....|.+.|.....
T Consensus 223 ~~i~W~~~p~~~~~ 236 (275)
T PF00780_consen 223 STIQWSSAPQSVAY 236 (275)
T ss_pred cEEEcCCchhEEEE
Confidence 66667776654443
No 265
>KOG3545 consensus Olfactomedin and related extracellular matrix glycoproteins [Extracellular structures]
Probab=30.44 E-value=3.5e+02 Score=23.20 Aligned_cols=190 Identities=11% Similarity=0.122 Sum_probs=103.6
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEEC----CCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC---c
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDA----SMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN---N 191 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~----~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~---~ 191 (352)
++++|++.+... ..+.-|.. ..+.+...=.+|.+-.+-.-++++|.+|.-.+.. ..+..||..+. .
T Consensus 30 ~~r~~~~~~~~~-----~~l~E~~~~~~~~~~~~~~~~~lp~~~~gTg~VVynGs~yynk~~t--~~ivky~l~~~~~~~ 102 (249)
T KOG3545|consen 30 DDRIYVMNYFDG-----LMLTEYTNLEDFKRGRKAEKYRLPYSWDGTGHVVYNGSLYYNKAGT--RNIIKYDLETRTVAG 102 (249)
T ss_pred cCceEEeccccC-----ceEEEeccHHHhhccCcceEEeCCCCccccceEEEcceEEeeccCC--cceEEEEeecceeee
Confidence 667788744322 23333433 2334444445666666677788888888765433 67889999884 3
Q ss_pred EEEecCCCCccc---------ceeEEEECCEEEEEccCCCCCCceEEEEEeCC----CCceeecCCCCCCCCCCcEEEEC
Q 045821 192 WKLHTEPNIFTE---------IEDSFVMDGKIYIRCSASAATSHVCALVYEPS----TDSWLHADANMASGWRGPAVVVD 258 (352)
Q Consensus 192 W~~~~~~~~~~~---------~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~----~~~W~~~~~~~~~~~~~~~~~~~ 258 (352)
|+.++.+..... ..-.++-.+-|+++=-..+......+...|+. ..+|..- .+++..+.++++-
T Consensus 103 ~~~lp~a~y~~~~~y~~~g~sdiD~avDE~GLWviYat~~~~g~iv~skLdp~tl~~e~tW~T~---~~k~~~~~aF~iC 179 (249)
T KOG3545|consen 103 SAALPYAGYHNPSPYYWGGHSDIDLAVDENGLWVIYATPENAGTIVLSKLDPETLEVERTWNTT---LPKRSAGNAFMIC 179 (249)
T ss_pred eeeccccccCCCcccccCCCccccceecccceeEEecccccCCcEEeeccCHHHhheeeeeccc---cCCCCcCceEEEe
Confidence 555553221111 11234445556666333333333355677773 3456432 3344455566677
Q ss_pred CEEEEEecCCC--CeE-EEEeCCCCceEEccCCCCCcCCCCceEE---EeCCEEEEEcCCceEEEEEc
Q 045821 259 DALYVLDQSSG--TKL-MMWQKESREWSPVGRLSTLLTRPPCKLV---AIGKTIFVIGKGCSAVVIDV 320 (352)
Q Consensus 259 ~~l~~~gg~~~--~~v-~~yd~~~~~W~~~~~~~~~~~~~~~~~~---~~~~~i~v~gG~~~~~~~d~ 320 (352)
|-||++-.... ..| +.||..+++=+.+. +|-........+. ..+.++|+..-.. ...|++
T Consensus 180 GvLY~v~S~~~~~~~i~yaydt~~~~~~~~~-ipf~N~y~~~~~idYNP~D~~LY~wdng~-~l~y~l 245 (249)
T KOG3545|consen 180 GVLYVVHSYNCTHTQISYAYDTTTGTQERID-LPFPNPYSYATMIDYNPRDRRLYAWDNGH-QLTYNL 245 (249)
T ss_pred eeeEEEeccccCCceEEEEEEcCCCceeccc-ccccchhhhhhccCCCcccceeeEecCCc-EEEEEe
Confidence 88999976554 334 78999988775443 3321122222333 3467899886432 344554
No 266
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=30.41 E-value=5.9e+02 Score=25.80 Aligned_cols=169 Identities=10% Similarity=0.045 Sum_probs=81.4
Q ss_pred CCeEEEEECCCCeeEecCCCCCcc-eeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEE--EE-
Q 045821 135 TSEVYCYDASMNTWTDAAPMSTAR-CYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSF--VM- 210 (352)
Q Consensus 135 ~~~~~~~d~~t~~W~~~~~~~~~r-~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~--~~- 210 (352)
...+..||..++.-...-.....+ --+.....++.+.+.=||.+ ..+..|+..++. +.-.+..+.. ++. -+
T Consensus 42 ~E~vn~WdlRtge~~~~l~~~~~k~evt~l~~~~d~l~lAVGYaD-GsVqif~~~s~~---~~~tfngHK~-AVt~l~fd 116 (888)
T KOG0306|consen 42 LEQVNIWDLRTGEIEKKLILLKKKAEVTCLRSSDDILLLAVGYAD-GSVQIFSLESEE---ILITFNGHKA-AVTTLKFD 116 (888)
T ss_pred cccEeEEeeecchhhhhhhhhcccceEEEeeccCCcceEEEEecC-ceEEeeccCCCc---eeeeeccccc-ceEEEEEc
Confidence 456778888877432221122223 22333334555555555554 678888888772 2111111111 111 12
Q ss_pred -CCEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccC
Q 045821 211 -DGKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGR 287 (352)
Q Consensus 211 -~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~ 287 (352)
.|.-++.||.++ .|..||.-... .+.-+. ...-...-+..+..+.+--+.++ .+-.||.++....+.-
T Consensus 117 ~~G~rlaSGskDt-----~IIvwDlV~E~Gl~rL~GH--kd~iT~~~F~~~~~~lvS~sKDs-~iK~WdL~tqhCf~Th- 187 (888)
T KOG0306|consen 117 KIGTRLASGSKDT-----DIIVWDLVGEEGLFRLRGH--KDSITQALFLNGDSFLVSVSKDS-MIKFWDLETQHCFETH- 187 (888)
T ss_pred ccCceEeecCCCc-----cEEEEEeccceeeEEeecc--hHHHhHHhccCCCeEEEEeccCc-eEEEEecccceeeeEE-
Confidence 466777777664 57777764332 111110 00000001222456666555554 7888999988765442
Q ss_pred CCCCcCCCCceEEEeCCEEEEEcCCce-EEEEEc
Q 045821 288 LSTLLTRPPCKLVAIGKTIFVIGKGCS-AVVIDV 320 (352)
Q Consensus 288 ~~~~~~~~~~~~~~~~~~i~v~gG~~~-~~~~d~ 320 (352)
+.. ..-..+++..+ ++.|.+|.+. .-+|++
T Consensus 188 vd~--r~Eiw~l~~~~-~~lvt~~~dse~~v~~L 218 (888)
T KOG0306|consen 188 VDH--RGEIWALVLDE-KLLVTAGTDSELKVWEL 218 (888)
T ss_pred ecc--cceEEEEEEec-ceEEEEecCCceEEEEe
Confidence 111 01123455545 6666655543 445665
No 267
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=29.80 E-value=5.2e+02 Score=24.98 Aligned_cols=64 Identities=16% Similarity=0.270 Sum_probs=37.5
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCCC
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRTN 190 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t~ 190 (352)
+...-|+-|+ .-..+-+||+.-+--..++.-|.-+.++ .-.|+|.++.|.+.- .+|+++|..+.
T Consensus 282 ~~EF~VvyGf-----MPAkvtifnlr~~~v~df~egpRN~~~f---np~g~ii~lAGFGNL~G~mEvwDv~n~ 346 (566)
T KOG2315|consen 282 GREFAVVYGF-----MPAKVTIFNLRGKPVFDFPEGPRNTAFF---NPHGNIILLAGFGNLPGDMEVWDVPNR 346 (566)
T ss_pred CCEEEEEEec-----ccceEEEEcCCCCEeEeCCCCCccceEE---CCCCCEEEEeecCCCCCceEEEeccch
Confidence 3345555553 2355677888765444444333222222 124678888887655 89999999873
No 268
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=29.68 E-value=4.2e+02 Score=23.91 Aligned_cols=176 Identities=9% Similarity=0.072 Sum_probs=83.8
Q ss_pred CCeEEEEECCCCee-EecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCc------EEEecCCCCcccce
Q 045821 135 TSEVYCYDASMNTW-TDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNN------WKLHTEPNIFTEIE 205 (352)
Q Consensus 135 ~~~~~~~d~~t~~W-~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~------W~~~~~~~~~~~~~ 205 (352)
..++.+||+.|.+- +.+.-.|..-........ ...++..+ .+ ..|-.||..+.. |+.-+ .....+
T Consensus 49 ngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~ds~h~v~s~s-sD--G~Vr~wD~Rs~~e~a~~~~~~~~---~~~f~~ 122 (376)
T KOG1188|consen 49 NGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCDSPHGVISCS-SD--GTVRLWDIRSQAESARISWTQQS---GTPFIC 122 (376)
T ss_pred CCeEEEEeccchhhhheecCCCCcccceEEecCCCCCeeEEec-cC--CeEEEEEeecchhhhheeccCCC---CCcceE
Confidence 35677888888653 233222221111111111 12333332 22 468889988763 32211 112222
Q ss_pred eEEEECCEEEEEccCCCCCCceEEEEEeCCCCceee-cCCCCCCCCCC---cE--EEECCEEEEEecCCCCeEEEEeCCC
Q 045821 206 DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLH-ADANMASGWRG---PA--VVVDDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 206 ~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~-~~~~~~~~~~~---~~--~~~~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
...-.+++|...| .........+..||... |++ +.. ......- .. ..-+..+++.|..+| -+-+||.+.
T Consensus 123 ld~nck~~ii~~G-tE~~~s~A~v~lwDvR~--~qq~l~~-~~eSH~DDVT~lrFHP~~pnlLlSGSvDG-LvnlfD~~~ 197 (376)
T KOG1188|consen 123 LDLNCKKNIIACG-TELTRSDASVVLWDVRS--EQQLLRQ-LNESHNDDVTQLRFHPSDPNLLLSGSVDG-LVNLFDTKK 197 (376)
T ss_pred eeccCcCCeEEec-cccccCceEEEEEEecc--ccchhhh-hhhhccCcceeEEecCCCCCeEEeecccc-eEEeeecCC
Confidence 2222367777774 33223334677888643 222 211 0000000 01 123667888887777 788999887
Q ss_pred CceEEccCCCCCcCCCCceEEE---e-CC--EEEEEcCCceEEEEEcCCcc
Q 045821 280 REWSPVGRLSTLLTRPPCKLVA---I-GK--TIFVIGKGCSAVVIDVGNIG 324 (352)
Q Consensus 280 ~~W~~~~~~~~~~~~~~~~~~~---~-~~--~i~v~gG~~~~~~~d~~~~~ 324 (352)
+ .+=..|-. ......++.. + ++ +|+++.-.....+|+.+...
T Consensus 198 d--~EeDaL~~-viN~~sSI~~igw~~~~ykrI~clTH~Etf~~~ele~~~ 245 (376)
T KOG1188|consen 198 D--NEEDALLH-VINHGSSIHLIGWLSKKYKRIMCLTHMETFAIYELEDGS 245 (376)
T ss_pred C--cchhhHHH-hhcccceeeeeeeecCCcceEEEEEccCceeEEEccCCC
Confidence 6 22222211 1122222221 2 23 79999877778888877655
No 269
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=29.67 E-value=6.5e+02 Score=26.04 Aligned_cols=60 Identities=12% Similarity=0.130 Sum_probs=31.9
Q ss_pred CCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEe--CCEEEEEcCCce-EEEEEcC
Q 045821 258 DDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI--GKTIFVIGKGCS-AVVIDVG 321 (352)
Q Consensus 258 ~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~--~~~i~v~gG~~~-~~~~d~~ 321 (352)
+|.+.+.+-.+ .|.+|+.+ .|+....+........+..+.. +|+-...++.++ +.++|++
T Consensus 200 ~g~la~~~~d~--~Vkvy~r~--~we~~f~Lr~~~~ss~~~~~~wsPnG~YiAAs~~~g~I~vWnv~ 262 (933)
T KOG1274|consen 200 GGTLAVPPVDN--TVKVYSRK--GWELQFKLRDKLSSSKFSDLQWSPNGKYIAASTLDGQILVWNVD 262 (933)
T ss_pred CCeEEeeccCC--eEEEEccC--CceeheeecccccccceEEEEEcCCCcEEeeeccCCcEEEEecc
Confidence 46666665433 78888865 4887776654333332333322 344333444332 4467776
No 270
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=28.27 E-value=5.6e+02 Score=24.87 Aligned_cols=96 Identities=10% Similarity=0.062 Sum_probs=50.5
Q ss_pred ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceee
Q 045821 82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYF 161 (352)
Q Consensus 82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~ 161 (352)
...+.+|+. ..+.-+.+.....+.....-++.......-+++|..+ .++..||..++.-.... ...-.
T Consensus 235 ~~d~ciYE~--~r~klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~D------gSiiLyD~~~~~t~~~k----a~~~P 302 (545)
T PF11768_consen 235 SADSCIYEC--SRNKLQRVSVTSIPLPSQVICCARSPSEDKLVLGCED------GSIILYDTTRGVTLLAK----AEFIP 302 (545)
T ss_pred eeEEEEEEe--ecCceeEEEEEEEecCCcceEEecCcccceEEEEecC------CeEEEEEcCCCeeeeee----ecccc
Confidence 345666766 5555544432211101122222233345677777654 45888998776422221 11111
Q ss_pred eeEEE--CCEEEEEeCCCCCCceEEEeCCCCc
Q 045821 162 PCGVL--NQKIYCIGGLGDTHSWDVYDPRTNN 191 (352)
Q Consensus 162 ~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~ 191 (352)
..+.. +|.++++|+.. ..+.+||..-+.
T Consensus 303 ~~iaWHp~gai~~V~s~q--GelQ~FD~ALsp 332 (545)
T PF11768_consen 303 TLIAWHPDGAIFVVGSEQ--GELQCFDMALSP 332 (545)
T ss_pred eEEEEcCCCcEEEEEcCC--ceEEEEEeecCc
Confidence 22222 77888888865 678899987664
No 271
>KOG4341 consensus F-box protein containing LRR [General function prediction only]
Probab=28.12 E-value=32 Score=31.73 Aligned_cols=38 Identities=13% Similarity=0.242 Sum_probs=33.1
Q ss_pred CccCCCcHHHHHHhhccCccccchhhhhhhhhhccccc
Q 045821 16 PLISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLH 53 (352)
Q Consensus 16 ~~~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~ 53 (352)
...-.||+|++..+.+.+.-.++.+.+.+|+.|+.+..
T Consensus 70 ~~~~~LPpEl~lkvFS~LDtksl~r~a~~c~~~n~~Al 107 (483)
T KOG4341|consen 70 SISRSLPPELLLKVFSMLDTKSLCRAAQCCTMWNKLAL 107 (483)
T ss_pred cccccCCHHHHHHHHHHHhHHHHHHHHHHHHHhhhhhh
Confidence 33447999999999999999999999999999997643
No 272
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=27.91 E-value=1e+02 Score=16.21 Aligned_cols=24 Identities=25% Similarity=0.444 Sum_probs=14.5
Q ss_pred EECCEEEEEecCCCCeEEEEeCCCCc
Q 045821 256 VVDDALYVLDQSSGTKLMMWQKESRE 281 (352)
Q Consensus 256 ~~~~~l~~~gg~~~~~v~~yd~~~~~ 281 (352)
..++.+|+. ..++ .++.+|.++++
T Consensus 4 ~~~~~v~~~-~~~g-~l~a~d~~~G~ 27 (33)
T smart00564 4 LSDGTVYVG-STDG-TLYALDAKTGE 27 (33)
T ss_pred EECCEEEEE-cCCC-EEEEEEcccCc
Confidence 345555544 3333 88899887654
No 273
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=27.50 E-value=5.1e+02 Score=24.14 Aligned_cols=130 Identities=12% Similarity=0.168 Sum_probs=69.6
Q ss_pred eEEEEECCCCeeEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCC---cEEEecCCCCcccceeEEEECC
Q 045821 137 EVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTN---NWKLHTEPNIFTEIEDSFVMDG 212 (352)
Q Consensus 137 ~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~---~W~~~~~~~~~~~~~~~~~~~g 212 (352)
.++.+|..|+.-...-+-..+-+..+++-. ++.-+|.|+.+ ..+..+|...+ .|+.+.. +.-...+...||
T Consensus 292 ~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~pDg~~~V~Gs~d--r~i~~wdlDgn~~~~W~gvr~---~~v~dlait~Dg 366 (519)
T KOG0293|consen 292 VLSLWDVDTGDLRHLYPSGLGFSVSSCAWCPDGFRFVTGSPD--RTIIMWDLDGNILGNWEGVRD---PKVHDLAITYDG 366 (519)
T ss_pred heeeccCCcchhhhhcccCcCCCcceeEEccCCceeEecCCC--CcEEEecCCcchhhccccccc---ceeEEEEEcCCC
Confidence 367778877753332211111122222222 67778888876 56677777644 6887764 233344556666
Q ss_pred E-EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCC
Q 045821 213 K-IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKES 279 (352)
Q Consensus 213 ~-iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~ 279 (352)
+ ++.+ +.+. .+..|+..+..=..+-. ...+.....+.-++++.++.=... .++.||.+.
T Consensus 367 k~vl~v-~~d~-----~i~l~~~e~~~dr~lis-e~~~its~~iS~d~k~~LvnL~~q-ei~LWDl~e 426 (519)
T KOG0293|consen 367 KYVLLV-TVDK-----KIRLYNREARVDRGLIS-EEQPITSFSISKDGKLALVNLQDQ-EIHLWDLEE 426 (519)
T ss_pred cEEEEE-eccc-----ceeeechhhhhhhcccc-ccCceeEEEEcCCCcEEEEEcccC-eeEEeecch
Confidence 5 5555 3332 45666665443331111 111222233455788777765443 899999983
No 274
>PRK02888 nitrous-oxide reductase; Validated
Probab=27.35 E-value=6.3e+02 Score=25.15 Aligned_cols=54 Identities=17% Similarity=0.097 Sum_probs=34.7
Q ss_pred eEEEEeCCC---CceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc-eEEEEEcCCcc
Q 045821 271 KLMMWQKES---REWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC-SAVVIDVGNIG 324 (352)
Q Consensus 271 ~v~~yd~~~---~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~-~~~~~d~~~~~ 324 (352)
.|.+.|..+ ..+..+..+|....-++..+..-+..+|+-|+.. .+.++|+++..
T Consensus 297 ~V~VID~~t~~~~~~~v~~yIPVGKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k 354 (635)
T PRK02888 297 KVPVVDGRKAANAGSALTRYVPVPKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLD 354 (635)
T ss_pred EEEEEECCccccCCcceEEEEECCCCccceEECCCCCEEEEeCCCCCcEEEEEChhhh
Confidence 788999876 1244455455543444555555667788888754 46689998744
No 275
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=27.10 E-value=6.3e+02 Score=25.07 Aligned_cols=66 Identities=18% Similarity=0.305 Sum_probs=39.8
Q ss_pred ECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEe------c--CCCC-Cccee-eeeEE-ECCEEEEEeCCCCCCceEEEe
Q 045821 118 LGKNAYLLGGCGWSEDATSEVYCYDASMNTWTD------A--APMS-TARCY-FPCGV-LNQKIYCIGGLGDTHSWDVYD 186 (352)
Q Consensus 118 ~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~------~--~~~~-~~r~~-~~~~~-~~~~iyv~gG~~~~~~v~~yd 186 (352)
-++.+++-||.+. .++.||..+..-+. + .+++ .++.. ++.+. -.+.++|.||.. ..+..||
T Consensus 128 k~~~lvaSgGLD~------~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ivsGgte--k~lr~wD 199 (735)
T KOG0308|consen 128 KNNELVASGGLDR------KIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTIIVSGGTE--KDLRLWD 199 (735)
T ss_pred cCceeEEecCCCc------cEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceEEEecCcc--cceEEec
Confidence 3677888888644 46777777553211 1 2233 23322 22222 235688888877 7888999
Q ss_pred CCCCc
Q 045821 187 PRTNN 191 (352)
Q Consensus 187 ~~t~~ 191 (352)
+.|..
T Consensus 200 prt~~ 204 (735)
T KOG0308|consen 200 PRTCK 204 (735)
T ss_pred ccccc
Confidence 99874
No 276
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=27.10 E-value=4.9e+02 Score=26.34 Aligned_cols=99 Identities=13% Similarity=0.160 Sum_probs=48.1
Q ss_pred ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCceee-cC
Q 045821 166 LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWLH-AD 243 (352)
Q Consensus 166 ~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~-~~ 243 (352)
.++.+.++.|.. .++-+++..|.+..+. ++- .+..++.++ +++..++|+..+ .+..||.....--+ ++
T Consensus 382 S~d~~~~~Sga~--~SikiWn~~t~kciRT--i~~-~y~l~~~Fvpgd~~Iv~G~k~G-----el~vfdlaS~~l~Eti~ 451 (888)
T KOG0306|consen 382 SSDSILLASGAG--ESIKIWNRDTLKCIRT--ITC-GYILASKFVPGDRYIVLGTKNG-----ELQVFDLASASLVETIR 451 (888)
T ss_pred ecCceeeeecCC--CcEEEEEccCcceeEE--ecc-ccEEEEEecCCCceEEEeccCC-----ceEEEEeehhhhhhhhh
Confidence 345555555544 5678888887665332 111 233344445 455555654443 47777775433222 21
Q ss_pred CCCCCCCCCcEEEECCEEEEEecCCCCeEEEEe
Q 045821 244 ANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQ 276 (352)
Q Consensus 244 ~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd 276 (352)
. .....-..+...+++-++.|+-+. .+-.||
T Consensus 452 A-HdgaIWsi~~~pD~~g~vT~saDk-tVkfWd 482 (888)
T KOG0306|consen 452 A-HDGAIWSISLSPDNKGFVTGSADK-TVKFWD 482 (888)
T ss_pred c-cccceeeeeecCCCCceEEecCCc-EEEEEe
Confidence 1 111111112344667677776554 444443
No 277
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=26.50 E-value=4.8e+02 Score=25.36 Aligned_cols=70 Identities=10% Similarity=0.083 Sum_probs=44.0
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM 145 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t 145 (352)
+++..++-|......+++|.... ...+|..+..++ ....+...-+..|...++++..+ ...+++.||...
T Consensus 457 gdkF~vi~g~~~k~tvsfY~~e~--~~~~~~lVk~~d---k~~~N~vfwsPkG~fvvva~l~s---~~g~l~F~D~~~ 526 (698)
T KOG2314|consen 457 GDKFAVISGNTVKNTVSFYAVET--NIKKPSLVKELD---KKFANTVFWSPKGRFVVVAALVS---RRGDLEFYDTDY 526 (698)
T ss_pred CCeEEEEEccccccceeEEEeec--CCCchhhhhhhc---ccccceEEEcCCCcEEEEEEecc---cccceEEEecch
Confidence 56666666665666677776665 667777777666 33344444444667666666543 346788888764
No 278
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=26.22 E-value=6.7e+02 Score=25.07 Aligned_cols=66 Identities=26% Similarity=0.257 Sum_probs=35.6
Q ss_pred EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcccc
Q 045821 256 VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNI 326 (352)
Q Consensus 256 ~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~ 326 (352)
..++.+.+..|.++ .+.+|+.+ ...+.=.+|. .-...+-+..++.|++-|.-..+.+|...+++|-
T Consensus 228 ~~~~~~Ivs~gEDr-tlriW~~~--e~~q~I~lPt--tsiWsa~~L~NgDIvvg~SDG~VrVfT~~k~R~A 293 (745)
T KOG0301|consen 228 ALSDGLIVSTGEDR-TLRIWKKD--ECVQVITLPT--TSIWSAKVLLNGDIVVGGSDGRVRVFTVDKDRKA 293 (745)
T ss_pred cCCCCeEEEecCCc-eEEEeecC--ceEEEEecCc--cceEEEEEeeCCCEEEeccCceEEEEEecccccC
Confidence 45677777777665 67777655 2222223443 1122333334666665443334558888877764
No 279
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.15 E-value=4.6e+02 Score=23.21 Aligned_cols=56 Identities=14% Similarity=0.224 Sum_probs=34.6
Q ss_pred CCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEE
Q 045821 258 DDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVI 318 (352)
Q Consensus 258 ~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~ 318 (352)
...+..+..-.++...++|.+++.-.....++. .++++.-.+.+.+-.|.-.+..|
T Consensus 290 ~~glV~lTSP~GN~~vi~da~tG~vv~~a~l~d-----aaGva~~~~gf~vssg~G~~~~~ 345 (366)
T COG3490 290 RDGLVALTSPRGNRAVIWDAATGAVVSEAALPD-----AAGVAAAKGGFAVSSGQGRIIFY 345 (366)
T ss_pred cCCeEEEecCCCCeEEEEEcCCCcEEecccccc-----cccceeccCceEEecCCceEEec
Confidence 345555544455689999999998877776665 23444445555555555445444
No 280
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=25.44 E-value=5.8e+02 Score=24.06 Aligned_cols=59 Identities=12% Similarity=-0.007 Sum_probs=32.5
Q ss_pred eeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCccccee---EEEECCEEEEEccCCC
Q 045821 160 YFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIED---SFVMDGKIYIRCSASA 222 (352)
Q Consensus 160 ~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~---~~~~~g~iyv~GG~~~ 222 (352)
-++.+..+.--|+++|... .+++.+...|+.--.+ . .+++..- .+..||..++.||.++
T Consensus 84 v~al~s~n~G~~l~ag~i~-g~lYlWelssG~LL~v--~-~aHYQ~ITcL~fs~dgs~iiTgskDg 145 (476)
T KOG0646|consen 84 VHALASSNLGYFLLAGTIS-GNLYLWELSSGILLNV--L-SAHYQSITCLKFSDDGSHIITGSKDG 145 (476)
T ss_pred eeeeecCCCceEEEeeccc-CcEEEEEeccccHHHH--H-HhhccceeEEEEeCCCcEEEecCCCc
Confidence 3455555555666666322 5678877777752111 1 2333332 2334778888877664
No 281
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=25.09 E-value=8.5e+02 Score=25.91 Aligned_cols=160 Identities=11% Similarity=0.080 Sum_probs=79.8
Q ss_pred CCCeEEEEECCCCeeEecCCCCCcceeeeeEE--E---CCE-EEEEeCCCCC---------CceEEEeCCCC--cEEEec
Q 045821 134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGV--L---NQK-IYCIGGLGDT---------HSWDVYDPRTN--NWKLHT 196 (352)
Q Consensus 134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~--~---~~~-iyv~gG~~~~---------~~v~~yd~~t~--~W~~~~ 196 (352)
....+..+|+.+++-...-.++..-..++++. + +.. +..+|+.... ..++.|+...+ +-+.+.
T Consensus 851 w~s~I~~~d~~s~~~~~~~~l~~ne~a~~v~~~~fs~~~~~~~~~v~~~~~~~l~~~~~~~g~~ytyk~~~~g~~lellh 930 (1205)
T KOG1898|consen 851 WVSSIRVFDPKSGKIICLVELGQNEAAFSVCAVDFSSSEYQPFVAVGVATTEQLDSKSISSGFVYTYKFVRNGDKLELLH 930 (1205)
T ss_pred ccceEEEEcCCCCceEEEEeecCCcchhheeeeeeccCCCceEEEEEeeccccccccccCCCceEEEEEEecCceeeeee
Confidence 44567788888776544433333222222222 2 222 4444443221 44677766533 333333
Q ss_pred CCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcE-EEECCEEEEEecCCCCeEEEE
Q 045821 197 EPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPA-VVVDDALYVLDQSSGTKLMMW 275 (352)
Q Consensus 197 ~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~y 275 (352)
.-..+..-++.+.+.|+++.--|. .+..||...++--........+..-.. .....+|++-.-..+-.+++|
T Consensus 931 ~T~~~~~v~Ai~~f~~~~LagvG~-------~l~~YdlG~K~lLRk~e~k~~p~~Is~iqt~~~RI~VgD~qeSV~~~~y 1003 (1205)
T KOG1898|consen 931 KTEIPGPVGAICPFQGRVLAGVGR-------FLRLYDLGKKKLLRKCELKFIPNRISSIQTYGARIVVGDIQESVHFVRY 1003 (1205)
T ss_pred ccCCCccceEEeccCCEEEEeccc-------EEEEeeCChHHHHhhhhhccCceEEEEEeecceEEEEeeccceEEEEEE
Confidence 323333334567778866553332 488899876654332111111111112 234455555444444456888
Q ss_pred eCCCCceEEccCCCCCcCCCCceEEEe
Q 045821 276 QKESREWSPVGRLSTLLTRPPCKLVAI 302 (352)
Q Consensus 276 d~~~~~W~~~~~~~~~~~~~~~~~~~~ 302 (352)
+++.|+......-|. .|+-.++..+
T Consensus 1004 ~~~~n~l~~fadD~~--pR~Vt~~~~l 1028 (1205)
T KOG1898|consen 1004 RREDNQLIVFADDPV--PRHVTALELL 1028 (1205)
T ss_pred ecCCCeEEEEeCCCc--cceeeEEEEe
Confidence 999998888876554 3544444443
No 282
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=24.75 E-value=5.8e+02 Score=23.89 Aligned_cols=28 Identities=14% Similarity=-0.056 Sum_probs=19.3
Q ss_pred ceEEEeCCEEEEEcCCce-EEEEEcCCcc
Q 045821 297 CKLVAIGKTIFVIGKGCS-AVVIDVGNIG 324 (352)
Q Consensus 297 ~~~~~~~~~i~v~gG~~~-~~~~d~~~~~ 324 (352)
++.......+|++||... ..++|+.++.
T Consensus 424 ~~~~~~~a~~la~GG~k~~~~vwd~~~~~ 452 (463)
T KOG0270|consen 424 FALDPDVAFTLAFGGEKAVLRVWDIFTNS 452 (463)
T ss_pred cccCCCcceEEEecCccceEEEeecccCh
Confidence 344445567899999887 4478877654
No 283
>PF14298 DUF4374: Domain of unknown function (DUF4374)
Probab=24.19 E-value=6.1e+02 Score=23.90 Aligned_cols=61 Identities=13% Similarity=0.081 Sum_probs=40.5
Q ss_pred CeEEEEeCCCCceEEccCCCCCc-CCCCceEEEeCCEEEEE-c---C-CceEEEEEcCCcccccceE
Q 045821 270 TKLMMWQKESREWSPVGRLSTLL-TRPPCKLVAIGKTIFVI-G---K-GCSAVVIDVGNIGNIGGIM 330 (352)
Q Consensus 270 ~~v~~yd~~~~~W~~~~~~~~~~-~~~~~~~~~~~~~i~v~-g---G-~~~~~~~d~~~~~w~~~~~ 330 (352)
..+.++|..+++.+.+..+|... ....-...+.++++|+- . | ..-++.+|+.+.+=.++..
T Consensus 367 ~~laI~d~~~kt~t~V~glP~~~is~~~~~~~ve~G~aYi~Vtt~~g~~~~IY~iDp~TatAtKGl~ 433 (435)
T PF14298_consen 367 KKLAIFDVSNKTFTWVTGLPADLISGFGNAPYVENGKAYIPVTTEDGSDPYIYKIDPATATATKGLK 433 (435)
T ss_pred ceEEEEEccCceeEEeccCChhhccccccceEeeCCEEEEEEeecCCCceeEEEEcCcccccccceE
Confidence 57889999999999998888651 22222344568888863 2 2 1235568888877666554
No 284
>PF15525 DUF4652: Domain of unknown function (DUF4652)
Probab=23.80 E-value=4.1e+02 Score=21.78 Aligned_cols=69 Identities=12% Similarity=0.126 Sum_probs=41.8
Q ss_pred eEEEecCCCCCCCeeEeccCC--CcccCCcceEEEEECCE-EEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCC
Q 045821 84 CCYVLDPNSTRRSWKMIHELP--ARTLRRKGMGFEVLGKN-AYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMST 156 (352)
Q Consensus 84 ~~~~~d~~~~~~~w~~~~~~~--~~~~~~~~~~~~~~~~~-iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~ 156 (352)
.+|++|. .++.|..+.--+ ....|. ...-.-+.. ++++|...+....-..+++|++.+++-..+-+...
T Consensus 89 kIYIkn~--~~~~~~~L~i~~~~~k~sPK--~i~WiDD~~L~vIIG~a~GTvS~GGnLy~~nl~tg~~~~ly~~~d 160 (200)
T PF15525_consen 89 KIYIKNL--NNNNWWSLQIDQNEEKYSPK--YIEWIDDNNLAVIIGYAHGTVSKGGNLYKYNLNTGNLTELYEWKD 160 (200)
T ss_pred eEEEEec--CCCceEEEEecCcccccCCc--eeEEecCCcEEEEEccccceEccCCeEEEEEccCCceeEeeeccc
Confidence 5788888 888887664322 122233 222223444 55556443433445789999999999888876644
No 285
>KOG3669 consensus Uncharacterized conserved protein, contains dysferlin, TECPR and PH domains [General function prediction only]
Probab=23.68 E-value=7e+02 Score=24.43 Aligned_cols=54 Identities=15% Similarity=0.241 Sum_probs=33.8
Q ss_pred EECCCCeeEecCCCC-Cc----c-eeee-eEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC
Q 045821 141 YDASMNTWTDAAPMS-TA----R-CYFP-CGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI 200 (352)
Q Consensus 141 ~d~~t~~W~~~~~~~-~~----r-~~~~-~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~ 200 (352)
-.|.-+.|+.+++.+ .. + ..-. ++.-++++|.=-| |-+.++.-++|+.+..+..
T Consensus 213 ~~P~GraW~~i~~~t~L~qISagPtg~VwAvt~nG~vf~R~G------VsRqNp~GdsWkdI~tP~~ 273 (705)
T KOG3669|consen 213 DRPCGRAWKVICPYTDLSQISAGPTGVVWAVTENGAVFYREG------VSRQNPEGDSWKDIVTPRQ 273 (705)
T ss_pred CCCCCceeeecCCCCccceEeecCcceEEEEeeCCcEEEEec------ccccCCCCchhhhccCccc
Confidence 357777888887644 11 1 1111 2223777777666 5679999999998875443
No 286
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=23.43 E-value=5.1e+02 Score=22.75 Aligned_cols=186 Identities=10% Similarity=0.050 Sum_probs=86.7
Q ss_pred CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEE-ECCEEEEEeCCCCCCceEEEeCCCCcEEEecC
Q 045821 119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGV-LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTE 197 (352)
Q Consensus 119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~-~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~ 197 (352)
++...+-++++ ..+..+|..+++=+..-- ...+--.++++ -+++-.|.|..+ +.+..||...+.=-++..
T Consensus 74 dg~~alS~swD------~~lrlWDl~~g~~t~~f~-GH~~dVlsva~s~dn~qivSGSrD--kTiklwnt~g~ck~t~~~ 144 (315)
T KOG0279|consen 74 DGNFALSASWD------GTLRLWDLATGESTRRFV-GHTKDVLSVAFSTDNRQIVSGSRD--KTIKLWNTLGVCKYTIHE 144 (315)
T ss_pred CCceEEecccc------ceEEEEEecCCcEEEEEE-ecCCceEEEEecCCCceeecCCCc--ceeeeeeecccEEEEEec
Confidence 55555555543 345667777653221110 01111112222 255555656555 667777776654323322
Q ss_pred CCCcccc-eeEEEE-C--CEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCC--cEEEECCEEEEEecCCCCe
Q 045821 198 PNIFTEI-EDSFVM-D--GKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRG--PAVVVDDALYVLDQSSGTK 271 (352)
Q Consensus 198 ~~~~~~~-~~~~~~-~--g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~--~~~~~~~~l~~~gg~~~~~ 271 (352)
-.. +.- .+..+. + +-+.+.+|.+. .+-++|..+.+-... .+.+...- .++..+|.++.-||.++ .
T Consensus 145 ~~~-~~WVscvrfsP~~~~p~Ivs~s~Dk-----tvKvWnl~~~~l~~~--~~gh~~~v~t~~vSpDGslcasGgkdg-~ 215 (315)
T KOG0279|consen 145 DSH-REWVSCVRFSPNESNPIIVSASWDK-----TVKVWNLRNCQLRTT--FIGHSGYVNTVTVSPDGSLCASGGKDG-E 215 (315)
T ss_pred CCC-cCcEEEEEEcCCCCCcEEEEccCCc-----eEEEEccCCcchhhc--cccccccEEEEEECCCCCEEecCCCCc-e
Confidence 111 111 122222 2 33444443332 355566655433221 11111111 23456999999999888 8
Q ss_pred EEEEeCCCCce-EEccCCCCCcCCCCceEEEeCCEEEEEcC-CceEEEEEcCCccccc
Q 045821 272 LMMWQKESREW-SPVGRLSTLLTRPPCKLVAIGKTIFVIGK-GCSAVVIDVGNIGNIG 327 (352)
Q Consensus 272 v~~yd~~~~~W-~~~~~~~~~~~~~~~~~~~~~~~i~v~gG-~~~~~~~d~~~~~w~~ 327 (352)
+..+|....+= ..++.. ..-.+++..-++.++.-+ ..++-|+|.+...-..
T Consensus 216 ~~LwdL~~~k~lysl~a~-----~~v~sl~fspnrywL~~at~~sIkIwdl~~~~~v~ 268 (315)
T KOG0279|consen 216 AMLWDLNEGKNLYSLEAF-----DIVNSLCFSPNRYWLCAATATSIKIWDLESKAVVE 268 (315)
T ss_pred EEEEEccCCceeEeccCC-----CeEeeEEecCCceeEeeccCCceEEEeccchhhhh
Confidence 88888876543 222221 112344444555544433 2245567776655433
No 287
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=22.93 E-value=7.9e+02 Score=24.77 Aligned_cols=169 Identities=9% Similarity=0.062 Sum_probs=77.4
Q ss_pred EEEEECCCCe----eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC-cccceeEEEECC
Q 045821 138 VYCYDASMNT----WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI-FTEIEDSFVMDG 212 (352)
Q Consensus 138 ~~~~d~~t~~----W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~-~~~~~~~~~~~g 212 (352)
+.+|+..+++ |..+-.-|.-+..+. . .+.+...||.+ ..+-++|...+...-- +.. +..-+++.+..+
T Consensus 86 lrv~~L~tgk~irswKa~He~Pvi~ma~~--~-~g~LlAtggaD--~~v~VWdi~~~~~th~--fkG~gGvVssl~F~~~ 158 (775)
T KOG0319|consen 86 LRVWSLPTGKLIRSWKAIHEAPVITMAFD--P-TGTLLATGGAD--GRVKVWDIKNGYCTHS--FKGHGGVVSSLLFHPH 158 (775)
T ss_pred EEEEEcccchHhHhHhhccCCCeEEEEEc--C-CCceEEecccc--ceEEEEEeeCCEEEEE--ecCCCceEEEEEeCCc
Confidence 4556666653 544333333222111 1 23677778877 5677777766543321 111 111122333333
Q ss_pred E---EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCC--CCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccC
Q 045821 213 K---IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGW--RGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGR 287 (352)
Q Consensus 213 ~---iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~--~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~ 287 (352)
. +...|+.+ ..+..||..+..= .+.....+.. .+.+...++.-.+.+|.+. -+.+||. .+.+.+..
T Consensus 159 ~~~~lL~sg~~D-----~~v~vwnl~~~~t-cl~~~~~H~S~vtsL~~~~d~~~~ls~~RDk-vi~vwd~--~~~~~l~~ 229 (775)
T KOG0319|consen 159 WNRWLLASGATD-----GTVRVWNLNDKRT-CLHTMILHKSAVTSLAFSEDSLELLSVGRDK-VIIVWDL--VQYKKLKT 229 (775)
T ss_pred cchhheeecCCC-----ceEEEEEcccCch-HHHHHHhhhhheeeeeeccCCceEEEeccCc-EEEEeeh--hhhhhhhe
Confidence 3 33554333 2577777764322 1111111111 1112222333333344332 7889998 44445555
Q ss_pred CCCCcCCCCceEEEeC-------CEEEEEcCCceEEEEEcCCcc
Q 045821 288 LSTLLTRPPCKLVAIG-------KTIFVIGKGCSAVVIDVGNIG 324 (352)
Q Consensus 288 ~~~~~~~~~~~~~~~~-------~~i~v~gG~~~~~~~d~~~~~ 324 (352)
+|.. ...-+++... +-++..||.....++|.+.-.
T Consensus 230 lp~y--e~~E~vv~l~~~~~~~~~~~~TaG~~g~~~~~d~es~~ 271 (775)
T KOG0319|consen 230 LPLY--ESLESVVRLREELGGKGEYIITAGGSGVVQYWDSESGK 271 (775)
T ss_pred echh--hheeeEEEechhcCCcceEEEEecCCceEEEEecccch
Confidence 5532 1112233222 246667877777788887644
No 288
>smart00120 HX Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs).
Probab=22.61 E-value=1.7e+02 Score=16.80 Aligned_cols=19 Identities=5% Similarity=-0.010 Sum_probs=14.3
Q ss_pred CCEEEEEcCCceEEEEEcCC
Q 045821 303 GKTIFVIGKGCSAVVIDVGN 322 (352)
Q Consensus 303 ~~~i~v~gG~~~~~~~d~~~ 322 (352)
++++|+|-|. ..+.||..+
T Consensus 9 ~~~~yfFkg~-~yw~~~~~~ 27 (45)
T smart00120 9 NGKTYFFKGD-KYWRFDPKR 27 (45)
T ss_pred CCeEEEEeCC-EEEEEcCCc
Confidence 5689999876 566788654
No 289
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=22.50 E-value=4.7e+02 Score=22.01 Aligned_cols=186 Identities=14% Similarity=0.007 Sum_probs=95.1
Q ss_pred eEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcc-eeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc
Q 045821 113 MGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTAR-CYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN 191 (352)
Q Consensus 113 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r-~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~ 191 (352)
+.+...+|++|.--|..+ .+.+.++|..+++-..-..++.++ ++-..+..++.+|...=.+ ...+.||+.|-
T Consensus 49 QGL~~~~g~i~esTG~yg----~S~ir~~~L~~gq~~~s~~l~~~~~FgEGit~~gd~~y~LTw~e--gvaf~~d~~t~- 121 (262)
T COG3823 49 QGLEYLDGHILESTGLYG----FSKIRVSDLTTGQEIFSEKLAPDTVFGEGITKLGDYFYQLTWKE--GVAFKYDADTL- 121 (262)
T ss_pred cceeeeCCEEEEeccccc----cceeEEEeccCceEEEEeecCCccccccceeeccceEEEEEecc--ceeEEEChHHh-
Confidence 456777888888877644 377889999977633222233233 3456777788999886543 44567776643
Q ss_pred EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCce---eecCCC-CCCCCCCcEEEECCEEEEEecC
Q 045821 192 WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSW---LHADAN-MASGWRGPAVVVDDALYVLDQS 267 (352)
Q Consensus 192 W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W---~~~~~~-~~~~~~~~~~~~~~~l~~~gg~ 267 (352)
+.+...+....+-+.+.-+.+|.+.-|. ..+..-||.+-.= .++... .+...-.-.-.++|.+|.== .
T Consensus 122 -~~lg~~~y~GeGWgLt~d~~~LimsdGs------atL~frdP~tfa~~~~v~VT~~g~pv~~LNELE~VdG~lyANV-w 193 (262)
T COG3823 122 -EELGRFSYEGEGWGLTSDDKNLIMSDGS------ATLQFRDPKTFAELDTVQVTDDGVPVSKLNELEWVDGELYANV-W 193 (262)
T ss_pred -hhhcccccCCcceeeecCCcceEeeCCc------eEEEecCHHHhhhcceEEEEECCeecccccceeeeccEEEEee-e
Confidence 3333333333333444444455444332 2344445543211 111110 00000000112344444210 1
Q ss_pred CCCeEEEEeCCCCc---eEEccCCCCCcCCC--------CceEEEeCCEEEEEcCCc
Q 045821 268 SGTKLMMWQKESRE---WSPVGRLSTLLTRP--------PCKLVAIGKTIFVIGKGC 313 (352)
Q Consensus 268 ~~~~v~~yd~~~~~---W~~~~~~~~~~~~~--------~~~~~~~~~~i~v~gG~~ 313 (352)
..+.+.+.||+++. |..+..++...... +-+...-++++|+-|-..
T Consensus 194 ~t~~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~w 250 (262)
T COG3823 194 QTTRIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKLW 250 (262)
T ss_pred eecceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecCcC
Confidence 11478899999875 88888765422111 112222356888888544
No 290
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=22.36 E-value=8.5e+02 Score=24.91 Aligned_cols=59 Identities=22% Similarity=0.278 Sum_probs=38.1
Q ss_pred EEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCc-------CCCCceEEEeCCEEEEEcCCc
Q 045821 254 AVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLL-------TRPPCKLVAIGKTIFVIGKGC 313 (352)
Q Consensus 254 ~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~-------~~~~~~~~~~~~~i~v~gG~~ 313 (352)
+...+|.-++-||..+ .+.+|-.+++.=+-++.+.... .-..++++.-+++|.++-+.+
T Consensus 258 ~fS~~G~~LlSGG~E~-VLv~Wq~~T~~kqfLPRLgs~I~~i~vS~ds~~~sl~~~DNqI~li~~~d 323 (792)
T KOG1963|consen 258 SFSSDGAYLLSGGREG-VLVLWQLETGKKQFLPRLGSPILHIVVSPDSDLYSLVLEDNQIHLIKASD 323 (792)
T ss_pred EEecCCceEeecccce-EEEEEeecCCCcccccccCCeeEEEEEcCCCCeEEEEecCceEEEEeccc
Confidence 3455777667788776 7888888887744455544321 233456666788999887643
No 291
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=21.86 E-value=5.8e+02 Score=22.84 Aligned_cols=27 Identities=15% Similarity=0.292 Sum_probs=15.9
Q ss_pred EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
++++++ .|+..|. ....+.+||..++.
T Consensus 48 avAVs~-~~~aSGs-----sDetI~IYDm~k~~ 74 (362)
T KOG0294|consen 48 ALAVSG-PYVASGS-----SDETIHIYDMRKRK 74 (362)
T ss_pred EEEecc-eeEeccC-----CCCcEEEEeccchh
Confidence 344444 4555442 23568999998764
No 292
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=21.39 E-value=6.6e+02 Score=23.29 Aligned_cols=135 Identities=14% Similarity=0.192 Sum_probs=66.4
Q ss_pred CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-C
Q 045821 180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-D 258 (352)
Q Consensus 180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~ 258 (352)
+.|..+|..|+.=-.--..| ..........+|.+++-...+. .+-++|+.+++-.............-++.+ +
T Consensus 154 n~v~iWnv~tgeali~l~hp-d~i~S~sfn~dGs~l~TtckDK-----kvRv~dpr~~~~v~e~~~heG~k~~Raifl~~ 227 (472)
T KOG0303|consen 154 NTVSIWNVGTGEALITLDHP-DMVYSMSFNRDGSLLCTTCKDK-----KVRVIDPRRGTVVSEGVAHEGAKPARAIFLAS 227 (472)
T ss_pred ceEEEEeccCCceeeecCCC-CeEEEEEeccCCceeeeecccc-----eeEEEcCCCCcEeeecccccCCCcceeEEecc
Confidence 77888888887521111111 1122234445777777644432 688899988876555432222222223444 4
Q ss_pred CEEEEEecCCC---CeEEEEeCCCCceEEccCCCCCcCCCCceEEEe----CCEEEEEcCCce-EEEEEcCCcc
Q 045821 259 DALYVLDQSSG---TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI----GKTIFVIGKGCS-AVVIDVGNIG 324 (352)
Q Consensus 259 ~~l~~~gg~~~---~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~----~~~i~v~gG~~~-~~~~d~~~~~ 324 (352)
++ ++..|++. ..+..+|+++-+= .+. -..+......+..+ .+-||+.|-.+. +-.|.+..+.
T Consensus 228 g~-i~tTGfsr~seRq~aLwdp~nl~e-P~~--~~elDtSnGvl~PFyD~dt~ivYl~GKGD~~IRYyEit~d~ 297 (472)
T KOG0303|consen 228 GK-IFTTGFSRMSERQIALWDPNNLEE-PIA--LQELDTSNGVLLPFYDPDTSIVYLCGKGDSSIRYFEITNEP 297 (472)
T ss_pred Cc-eeeeccccccccceeccCcccccC-cce--eEEeccCCceEEeeecCCCCEEEEEecCCcceEEEEecCCC
Confidence 55 55555543 5677777765221 010 00011222222222 245899996554 3355555443
No 293
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=21.12 E-value=5.9e+02 Score=22.62 Aligned_cols=148 Identities=11% Similarity=-0.028 Sum_probs=68.5
Q ss_pred CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821 68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT 147 (352)
Q Consensus 68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~ 147 (352)
++..++.||. ...-+.|-+|+- ..|.|.....-. +........-++.|+-.| ..+.+..||..+++
T Consensus 58 ~gs~~aSgG~-Dr~I~LWnv~gd--ceN~~~lkgHsg----AVM~l~~~~d~s~i~S~g-------tDk~v~~wD~~tG~ 123 (338)
T KOG0265|consen 58 DGSCFASGGS-DRAIVLWNVYGD--CENFWVLKGHSG----AVMELHGMRDGSHILSCG-------TDKTVRGWDAETGK 123 (338)
T ss_pred CCCeEeecCC-cceEEEEecccc--ccceeeeccccc----eeEeeeeccCCCEEEEec-------CCceEEEEecccce
Confidence 5667777775 333344555766 677776542111 000011111134455543 35778899999886
Q ss_pred eEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC--EEEEEccCCCCC
Q 045821 148 WTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG--KIYIRCSASAAT 224 (352)
Q Consensus 148 W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g--~iyv~GG~~~~~ 224 (352)
-..--..-. +.--++... -|...|..|.++ ..+-+||..+..--+ ..+..+.-.++.+++ .=.+.||.+
T Consensus 124 ~~rk~k~h~-~~vNs~~p~rrg~~lv~SgsdD-~t~kl~D~R~k~~~~---t~~~kyqltAv~f~d~s~qv~sggId--- 195 (338)
T KOG0265|consen 124 RIRKHKGHT-SFVNSLDPSRRGPQLVCSGSDD-GTLKLWDIRKKEAIK---TFENKYQLTAVGFKDTSDQVISGGID--- 195 (338)
T ss_pred eeehhcccc-ceeeecCccccCCeEEEecCCC-ceEEEEeecccchhh---ccccceeEEEEEecccccceeecccc---
Confidence 432111000 000000011 133444455444 778889998664222 223344444554432 223344554
Q ss_pred CceEEEEEeCCCCce
Q 045821 225 SHVCALVYEPSTDSW 239 (352)
Q Consensus 225 ~~~~i~~yd~~~~~W 239 (352)
+.+.++|+..+.-
T Consensus 196 --n~ikvWd~r~~d~ 208 (338)
T KOG0265|consen 196 --NDIKVWDLRKNDG 208 (338)
T ss_pred --CceeeeccccCcc
Confidence 2466666654443
Done!