Query 003800
Match_columns 794
No_of_seqs 292 out of 1142
Neff 7.3
Searched_HMMs 46136
Date Thu Mar 28 12:16:11 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/003800.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/003800hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2103 Uncharacterized conser 100.0 1E-113 3E-118 960.3 56.7 702 12-792 7-730 (910)
2 PRK11138 outer membrane biogen 99.9 2.1E-20 4.5E-25 210.6 34.8 241 1-258 1-278 (394)
3 TIGR03300 assembly_YfgL outer 99.8 4.5E-18 9.8E-23 190.3 34.9 216 24-258 35-263 (377)
4 PRK11138 outer membrane biogen 99.8 1.6E-16 3.4E-21 179.1 29.3 213 26-258 86-316 (394)
5 PF13360 PQQ_2: PQQ-like domai 99.7 9.3E-16 2E-20 159.6 27.7 216 24-258 8-234 (238)
6 TIGR03300 assembly_YfgL outer 99.7 6.8E-15 1.5E-19 164.7 30.6 209 26-258 82-301 (377)
7 cd00216 PQQ_DH Dehydrogenases 99.7 1.1E-14 2.4E-19 168.4 24.2 220 27-257 37-322 (488)
8 cd00216 PQQ_DH Dehydrogenases 99.6 1.9E-14 4E-19 166.6 25.1 230 25-265 126-434 (488)
9 PF13360 PQQ_2: PQQ-like domai 99.6 5.6E-14 1.2E-18 146.2 22.1 181 61-257 1-194 (238)
10 TIGR03075 PQQ_enz_alc_DH PQQ-d 99.5 1E-12 2.3E-17 152.8 23.1 219 29-257 47-336 (527)
11 COG1520 FOG: WD40-like repeat 99.5 1.4E-12 3E-17 145.9 23.0 216 26-257 40-271 (370)
12 TIGR03074 PQQ_membr_DH membran 99.5 1.7E-12 3.6E-17 155.5 24.6 202 49-258 190-481 (764)
13 COG1520 FOG: WD40-like repeat 99.3 2.4E-10 5.2E-15 127.9 23.3 186 24-221 83-279 (370)
14 TIGR03074 PQQ_membr_DH membran 99.3 9.8E-11 2.1E-15 140.4 21.3 191 25-220 210-487 (764)
15 TIGR03075 PQQ_enz_alc_DH PQQ-d 99.3 1.8E-10 3.9E-15 134.2 19.9 187 25-218 85-341 (527)
16 KOG4649 PQQ (pyrrolo-quinoline 99.0 2E-08 4.4E-13 102.4 19.0 183 53-257 23-209 (354)
17 KOG4649 PQQ (pyrrolo-quinoline 98.9 4.8E-07 1E-11 92.5 23.7 178 25-223 39-219 (354)
18 COG4993 Gcd Glucose dehydrogen 98.7 5.1E-07 1.1E-11 101.8 17.3 205 52-257 213-491 (773)
19 TIGR03866 PQQ_ABC_repeats PQQ- 97.8 0.071 1.5E-06 56.4 34.6 183 56-257 4-190 (300)
20 PF02239 Cytochrom_D1: Cytochr 97.7 0.067 1.5E-06 60.1 31.6 187 21-220 18-212 (369)
21 COG4993 Gcd Glucose dehydrogen 97.7 0.00079 1.7E-08 76.7 15.1 165 52-218 271-496 (773)
22 PF01011 PQQ: PQQ enzyme repea 97.6 0.0001 2.3E-09 54.3 4.6 31 54-84 1-31 (38)
23 TIGR03866 PQQ_ABC_repeats PQQ- 97.6 0.16 3.4E-06 53.7 31.8 189 52-257 41-240 (300)
24 cd00200 WD40 WD40 domain, foun 97.4 0.11 2.4E-06 53.1 26.7 186 53-257 21-210 (289)
25 cd00200 WD40 WD40 domain, foun 97.4 0.069 1.5E-06 54.6 25.1 187 53-257 63-252 (289)
26 PF05096 Glu_cyclase_2: Glutam 97.4 0.024 5.2E-07 59.9 21.0 155 52-221 54-214 (264)
27 TIGR02658 TTQ_MADH_Hv methylam 97.4 0.35 7.7E-06 53.7 33.9 191 51-257 10-226 (352)
28 PF02239 Cytochrom_D1: Cytochr 97.3 0.23 5E-06 55.8 29.5 190 54-257 6-205 (369)
29 PF10282 Lactonase: Lactonase, 96.9 1 2.2E-05 50.1 32.7 222 25-253 21-274 (345)
30 PF13570 PQQ_3: PQQ-like domai 96.9 0.0019 4.1E-08 48.1 4.6 40 73-115 1-40 (40)
31 PTZ00421 coronin; Provisional 96.8 0.23 5E-06 57.9 23.9 195 53-257 88-293 (493)
32 smart00564 PQQ beta-propeller 96.8 0.0015 3.3E-08 46.1 3.8 27 53-79 6-32 (33)
33 PF13570 PQQ_3: PQQ-like domai 96.8 0.0025 5.5E-08 47.4 5.0 40 29-72 1-40 (40)
34 PF10282 Lactonase: Lactonase, 96.8 1.3 2.7E-05 49.3 37.9 195 56-257 2-227 (345)
35 TIGR02658 TTQ_MADH_Hv methylam 96.8 0.8 1.7E-05 51.0 26.4 79 49-127 53-149 (352)
36 PF01011 PQQ: PQQ enzyme repea 96.7 0.0039 8.6E-08 45.9 5.1 31 98-128 2-32 (38)
37 KOG0296 Angio-associated migra 96.4 0.17 3.7E-06 54.9 17.5 156 51-219 200-365 (399)
38 KOG2103 Uncharacterized conser 96.4 0.089 1.9E-06 62.3 16.4 192 26-246 64-267 (910)
39 KOG2048 WD40 repeat protein [G 96.3 1.1 2.4E-05 52.3 24.0 188 53-257 37-236 (691)
40 KOG1539 WD repeat protein [Gen 96.3 1.7 3.8E-05 51.9 25.7 186 53-251 124-315 (910)
41 PF05935 Arylsulfotrans: Aryls 96.1 0.24 5.3E-06 57.6 18.2 151 53-219 113-310 (477)
42 PTZ00420 coronin; Provisional 96.0 2.4 5.2E-05 50.3 26.1 191 53-257 87-296 (568)
43 smart00564 PQQ beta-propeller 95.9 0.014 3E-07 41.1 4.2 29 94-122 4-32 (33)
44 KOG2055 WD40 repeat protein [G 95.9 1.3 2.8E-05 49.6 21.1 199 39-255 214-418 (514)
45 KOG0318 WD40 repeat stress pro 95.2 7.6 0.00016 44.5 33.2 151 51-214 200-354 (603)
46 KOG0316 Conserved WD40 repeat- 95.2 0.78 1.7E-05 47.4 15.3 190 56-264 74-266 (307)
47 KOG0296 Angio-associated migra 95.1 6.6 0.00014 43.1 28.9 141 52-219 75-229 (399)
48 KOG0316 Conserved WD40 repeat- 95.1 5.1 0.00011 41.6 22.6 146 94-257 28-176 (307)
49 PHA02790 Kelch-like protein; P 94.9 2.6 5.7E-05 49.1 21.1 167 53-241 271-453 (480)
50 PHA02713 hypothetical protein; 94.8 1.9 4E-05 51.3 19.8 173 53-241 303-519 (557)
51 PTZ00421 coronin; Provisional 94.7 3.2 7E-05 48.5 21.3 154 53-218 138-298 (493)
52 KOG0278 Serine/threonine kinas 94.7 2 4.2E-05 44.8 16.6 106 53-170 155-262 (334)
53 PLN02919 haloacid dehalogenase 94.4 3.9 8.4E-05 52.4 22.6 200 53-257 635-891 (1057)
54 PLN00181 protein SPA1-RELATED; 94.2 20 0.00043 44.6 28.1 190 52-256 494-692 (793)
55 KOG0291 WD40-repeat-containing 94.2 17 0.00036 43.6 34.4 66 186-257 402-469 (893)
56 KOG0319 WD40-repeat-containing 94.0 17 0.00038 43.3 25.2 193 52-257 73-314 (775)
57 PRK11028 6-phosphogluconolacto 94.0 11 0.00025 41.1 28.3 197 51-255 44-259 (330)
58 PRK11028 6-phosphogluconolacto 93.9 12 0.00025 40.9 29.8 188 55-255 3-206 (330)
59 KOG2048 WD40 repeat protein [G 93.7 5.2 0.00011 47.0 19.5 153 52-218 121-283 (691)
60 PHA03098 kelch-like protein; P 93.7 3.8 8.1E-05 48.3 19.5 189 53-257 294-514 (534)
61 PF05935 Arylsulfotrans: Aryls 93.6 3.4 7.3E-05 48.2 18.3 115 94-220 112-241 (477)
62 PF05096 Glu_cyclase_2: Glutam 93.5 3.8 8.1E-05 43.6 16.8 154 95-268 54-216 (264)
63 TIGR03548 mutarot_permut cycli 93.5 13 0.00028 40.6 22.0 162 25-196 45-231 (323)
64 KOG0291 WD40-repeat-containing 93.4 13 0.00029 44.3 22.2 110 95-218 361-474 (893)
65 KOG1446 Histone H3 (Lys4) meth 93.4 13 0.00029 40.0 23.2 217 20-257 37-265 (311)
66 KOG0315 G-protein beta subunit 93.4 12 0.00025 39.4 21.6 143 98-257 11-157 (311)
67 COG3823 Glutamine cyclotransfe 93.2 2.8 6.1E-05 42.8 14.3 147 52-218 54-212 (262)
68 KOG0310 Conserved WD40 repeat- 93.0 4.1 8.8E-05 46.1 16.7 151 52-218 121-276 (487)
69 TIGR03548 mutarot_permut cycli 92.8 17 0.00037 39.7 21.8 145 64-218 40-200 (323)
70 COG4257 Vgb Streptogramin lyas 92.8 6.9 0.00015 41.6 16.9 193 53-266 72-272 (353)
71 PRK14131 N-acetylneuraminic ac 92.6 19 0.00042 40.4 22.1 70 53-124 38-122 (376)
72 KOG0266 WD40 repeat-containing 92.5 15 0.00032 42.5 21.5 193 52-257 214-412 (456)
73 KOG0278 Serine/threonine kinas 92.3 3.4 7.3E-05 43.2 13.7 108 95-218 154-262 (334)
74 PHA02713 hypothetical protein; 92.1 3.5 7.5E-05 49.1 16.0 148 53-218 351-539 (557)
75 PF14269 Arylsulfotran_2: Aryl 92.0 5.1 0.00011 43.7 15.9 147 62-219 95-297 (299)
76 PRK05137 tolB translocation pr 91.7 29 0.00063 39.7 24.1 188 51-257 211-415 (435)
77 KOG3881 Uncharacterized conser 91.6 7.5 0.00016 43.0 16.2 188 51-257 113-323 (412)
78 PTZ00420 coronin; Provisional 91.2 21 0.00045 42.6 20.9 69 56-126 141-209 (568)
79 PLN00181 protein SPA1-RELATED; 90.7 52 0.0011 40.9 29.7 106 53-166 545-652 (793)
80 TIGR03547 muta_rot_YjhT mutatr 90.0 29 0.00064 38.2 20.0 160 53-223 17-238 (346)
81 KOG1445 Tumor-specific antigen 89.9 2.4 5.1E-05 49.3 10.9 150 51-218 638-808 (1012)
82 KOG1539 WD repeat protein [Gen 89.9 24 0.00053 42.7 19.4 155 49-215 168-325 (910)
83 PLN02919 haloacid dehalogenase 89.9 19 0.0004 46.3 20.3 157 53-215 694-893 (1057)
84 PF06433 Me-amine-dh_H: Methyl 89.9 36 0.00078 37.7 27.9 195 51-257 104-323 (342)
85 KOG0274 Cdc4 and related F-box 89.8 30 0.00064 41.0 20.5 180 53-256 218-402 (537)
86 PF08450 SGL: SMP-30/Gluconola 89.7 28 0.00061 36.2 26.9 142 53-210 11-164 (246)
87 KOG0285 Pleiotropic regulator 89.6 28 0.00061 38.3 18.0 232 55-314 207-441 (460)
88 PF14269 Arylsulfotran_2: Aryl 89.6 6 0.00013 43.2 13.7 112 52-171 153-297 (299)
89 PRK04922 tolB translocation pr 88.0 55 0.0012 37.5 22.9 150 52-215 214-373 (433)
90 PRK04792 tolB translocation pr 88.0 57 0.0012 37.6 23.7 148 51-214 227-386 (448)
91 COG4257 Vgb Streptogramin lyas 87.3 27 0.00059 37.3 15.8 194 53-266 114-315 (353)
92 PRK03629 tolB translocation pr 87.1 62 0.0014 37.0 23.5 151 51-215 208-368 (429)
93 KOG0275 Conserved WD40 repeat- 86.6 5.9 0.00013 42.6 10.7 184 62-257 274-470 (508)
94 KOG4441 Proteins containing BT 86.6 14 0.00031 44.1 15.3 173 53-241 284-482 (571)
95 PRK00178 tolB translocation pr 86.4 65 0.0014 36.6 23.5 148 51-214 208-367 (430)
96 KOG0310 Conserved WD40 repeat- 86.0 14 0.0003 42.0 13.6 113 55-177 168-283 (487)
97 KOG4547 WD40 repeat-containing 85.9 20 0.00043 41.7 15.1 106 145-258 69-176 (541)
98 KOG0649 WD40 repeat protein [G 84.7 56 0.0012 34.3 17.0 105 58-170 76-194 (325)
99 KOG0286 G-protein beta subunit 84.6 40 0.00087 36.3 15.5 152 52-216 155-309 (343)
100 KOG0270 WD40 repeat-containing 84.4 29 0.00064 39.1 15.1 119 97-231 257-381 (463)
101 KOG0266 WD40 repeat-containing 84.2 44 0.00095 38.7 17.7 158 53-218 258-417 (456)
102 KOG0315 G-protein beta subunit 83.8 63 0.0014 34.2 18.3 60 53-114 95-154 (311)
103 TIGR03547 muta_rot_YjhT mutatr 83.1 79 0.0017 34.8 19.8 178 53-241 63-328 (346)
104 KOG1274 WD40 repeat protein [G 83.0 63 0.0014 39.7 18.1 186 52-254 65-262 (933)
105 KOG0303 Actin-binding protein 83.0 21 0.00047 39.7 13.2 71 53-126 144-215 (472)
106 PHA03098 kelch-like protein; P 82.1 47 0.001 39.1 17.2 135 93-240 292-443 (534)
107 PHA02790 Kelch-like protein; P 82.1 57 0.0012 38.0 17.6 147 94-257 270-426 (480)
108 KOG2321 WD40 repeat protein [G 81.3 36 0.00077 39.7 14.6 176 56-241 148-331 (703)
109 KOG0293 WD40 repeat-containing 81.2 36 0.00079 38.2 14.1 212 31-257 259-473 (519)
110 PRK00178 tolB translocation pr 80.1 1.1E+02 0.0025 34.7 23.2 187 54-257 164-366 (430)
111 COG3391 Uncharacterized conser 79.5 1.1E+02 0.0025 34.3 22.7 191 52-257 84-286 (381)
112 KOG0279 G protein beta subunit 79.4 78 0.0017 34.0 15.3 70 52-121 116-187 (315)
113 KOG4441 Proteins containing BT 79.1 68 0.0015 38.4 17.0 172 53-241 332-529 (571)
114 PRK04043 tolB translocation pr 78.9 1.3E+02 0.0028 34.5 23.5 148 51-214 197-361 (419)
115 COG4946 Uncharacterized protei 78.5 1.3E+02 0.0029 34.5 20.4 190 49-257 231-434 (668)
116 PF14727 PHTB1_N: PTHB1 N-term 78.3 41 0.00088 38.5 14.2 93 31-128 231-330 (418)
117 KOG4499 Ca2+-binding protein R 77.6 43 0.00093 35.1 12.5 115 105-236 138-265 (310)
118 PLN02193 nitrile-specifier pro 77.6 1.5E+02 0.0032 34.5 23.0 198 53-263 175-417 (470)
119 KOG0270 WD40 repeat-containing 77.6 54 0.0012 37.1 14.2 94 30-129 268-376 (463)
120 PLN02153 epithiospecifier prot 77.6 1.2E+02 0.0026 33.4 23.0 196 53-257 32-287 (341)
121 KOG1036 Mitotic spindle checkp 77.0 79 0.0017 34.3 14.7 109 86-211 15-125 (323)
122 COG3391 Uncharacterized conser 76.9 1.3E+02 0.0029 33.8 19.7 157 51-218 125-291 (381)
123 PRK04792 tolB translocation pr 76.9 1.5E+02 0.0032 34.2 22.8 150 94-257 228-385 (448)
124 PRK14131 N-acetylneuraminic ac 76.4 1.4E+02 0.003 33.5 20.2 36 204-241 314-350 (376)
125 TIGR02800 propeller_TolB tol-p 75.7 1.4E+02 0.0031 33.4 23.6 149 51-214 199-358 (417)
126 PLN02193 nitrile-specifier pro 75.6 1.3E+02 0.0029 34.9 17.9 152 53-218 228-416 (470)
127 KOG0282 mRNA splicing factor [ 74.4 32 0.0007 39.2 11.6 73 52-125 269-341 (503)
128 KOG1274 WD40 repeat protein [G 74.0 74 0.0016 39.1 15.1 119 52-174 107-230 (933)
129 KOG0275 Conserved WD40 repeat- 73.9 89 0.0019 34.0 14.1 181 26-220 282-477 (508)
130 TIGR02800 propeller_TolB tol-p 73.5 1.6E+02 0.0035 33.0 22.9 149 94-257 200-357 (417)
131 COG2706 3-carboxymuconate cycl 72.8 1.6E+02 0.0034 32.6 24.0 69 187-257 50-123 (346)
132 KOG0643 Translation initiation 72.8 1.4E+02 0.003 32.0 19.4 103 102-218 70-185 (327)
133 KOG2106 Uncharacterized conser 71.2 1.7E+02 0.0037 33.9 16.2 147 53-218 339-486 (626)
134 PRK05137 tolB translocation pr 69.1 2.1E+02 0.0046 32.6 23.3 137 64-214 183-326 (435)
135 PF14870 PSII_BNR: Photosynthe 68.9 1.6E+02 0.0034 32.3 15.3 170 19-198 122-296 (302)
136 KOG4547 WD40 repeat-containing 68.8 2.4E+02 0.0052 33.1 17.7 114 53-174 70-184 (541)
137 PRK02888 nitrous-oxide reducta 68.7 1.3E+02 0.0028 36.1 15.5 150 46-215 196-356 (635)
138 KOG0646 WD40 repeat protein [G 68.5 2.2E+02 0.0048 32.6 18.9 60 188-255 188-248 (476)
139 KOG0282 mRNA splicing factor [ 67.3 45 0.00097 38.1 10.7 141 97-255 227-373 (503)
140 KOG0295 WD40 repeat-containing 67.2 2.1E+02 0.0046 31.9 17.6 52 203-257 315-367 (406)
141 PLN00033 photosystem II stabil 67.0 2.3E+02 0.005 32.3 20.4 129 29-171 73-214 (398)
142 PRK13684 Ycf48-like protein; P 67.0 2.1E+02 0.0045 31.7 18.8 179 21-218 109-294 (334)
143 KOG0318 WD40 repeat stress pro 66.4 2.6E+02 0.0057 32.6 28.1 182 53-257 290-476 (603)
144 COG3823 Glutamine cyclotransfe 66.2 69 0.0015 33.1 10.8 110 53-170 100-213 (262)
145 KOG1036 Mitotic spindle checkp 65.9 1.5E+02 0.0032 32.2 13.8 61 53-115 65-125 (323)
146 KOG0649 WD40 repeat protein [G 65.8 1.9E+02 0.004 30.7 24.3 159 96-265 72-245 (325)
147 PLN02153 epithiospecifier prot 65.5 2.2E+02 0.0047 31.3 18.2 152 53-218 85-290 (341)
148 KOG0647 mRNA export protein (c 65.2 2.1E+02 0.0046 31.1 16.6 154 52-222 83-240 (347)
149 cd00028 B_lectin Bulb-type man 64.5 48 0.0011 30.4 9.1 71 73-170 41-112 (116)
150 KOG1027 Serine/threonine prote 63.4 30 0.00065 42.3 9.0 109 52-177 106-216 (903)
151 PF08450 SGL: SMP-30/Gluconola 63.3 1.9E+02 0.0041 29.9 27.5 145 95-255 11-165 (246)
152 KOG0306 WD40-repeat-containing 63.3 3.5E+02 0.0076 33.0 26.2 101 56-167 339-447 (888)
153 PRK03629 tolB translocation pr 63.0 2.8E+02 0.006 31.7 23.6 149 94-257 209-366 (429)
154 KOG0303 Actin-binding protein 62.9 1E+02 0.0022 34.6 12.1 92 96-198 144-237 (472)
155 smart00108 B_lectin Bulb-type 62.7 63 0.0014 29.5 9.4 81 63-170 30-111 (114)
156 PF14583 Pectate_lyase22: Olig 62.3 2.3E+02 0.005 32.1 15.2 102 67-176 14-124 (386)
157 KOG0288 WD40 repeat protein Ti 61.8 73 0.0016 35.8 10.9 108 100-219 315-426 (459)
158 KOG1446 Histone H3 (Lys4) meth 61.1 2.5E+02 0.0054 30.6 26.4 202 37-265 13-227 (311)
159 KOG0271 Notchless-like WD40 re 61.1 40 0.00087 37.5 8.7 109 50-166 166-280 (480)
160 KOG2106 Uncharacterized conser 60.4 3.3E+02 0.0071 31.7 23.8 220 53-312 257-489 (626)
161 KOG2055 WD40 repeat protein [G 60.4 1E+02 0.0022 35.3 11.8 77 50-128 312-388 (514)
162 KOG0639 Transducin-like enhanc 59.5 1.3E+02 0.0028 34.7 12.5 112 51-172 519-631 (705)
163 PF05262 Borrelia_P83: Borreli 59.1 59 0.0013 37.9 10.2 98 153-256 374-472 (489)
164 PF14870 PSII_BNR: Photosynthe 58.9 2.8E+02 0.006 30.4 19.7 180 23-219 83-268 (302)
165 KOG0285 Pleiotropic regulator 58.7 3E+02 0.0065 30.7 21.8 146 56-217 166-314 (460)
166 KOG0646 WD40 repeat protein [G 58.4 3.4E+02 0.0073 31.2 17.0 132 56-198 96-239 (476)
167 KOG1272 WD40-repeat-containing 58.2 28 0.0006 39.6 7.0 179 53-255 141-324 (545)
168 KOG4378 Nuclear protein COP1 [ 58.1 2.6E+02 0.0056 32.4 14.5 139 56-210 136-280 (673)
169 COG3386 Gluconolactonase [Carb 57.3 2.7E+02 0.0058 30.6 14.6 105 98-215 39-155 (307)
170 COG2706 3-carboxymuconate cycl 57.0 3.1E+02 0.0068 30.4 30.8 192 55-255 4-222 (346)
171 KOG0265 U5 snRNP-specific prot 55.2 3.1E+02 0.0068 29.9 14.3 33 95-127 101-133 (338)
172 KOG0295 WD40 repeat-containing 54.9 1.6E+02 0.0034 32.8 11.8 65 97-169 305-371 (406)
173 KOG0288 WD40 repeat protein Ti 54.9 3.3E+02 0.0072 30.9 14.5 185 53-258 231-421 (459)
174 PF04841 Vps16_N: Vps16, N-ter 54.6 3.7E+02 0.0081 30.6 16.5 98 65-170 63-163 (410)
175 KOG0274 Cdc4 and related F-box 53.8 4.5E+02 0.0097 31.3 22.7 180 53-257 261-444 (537)
176 KOG3881 Uncharacterized conser 53.1 24 0.00052 39.2 5.5 73 52-124 258-330 (412)
177 KOG0280 Uncharacterized conser 53.0 28 0.00061 37.4 5.8 73 53-127 178-255 (339)
178 KOG0294 WD40 repeat-containing 52.7 3.5E+02 0.0076 29.7 16.8 186 45-256 90-283 (362)
179 PF06977 SdiA-regulated: SdiA- 50.4 3.4E+02 0.0073 28.8 22.2 187 52-252 32-239 (248)
180 COG3419 PilY1 Tfp pilus assemb 49.9 2.4E+02 0.0053 35.7 13.7 27 97-123 583-609 (1036)
181 PF09910 DUF2139: Uncharacteri 48.8 1.2E+02 0.0027 32.9 9.7 98 153-257 77-184 (339)
182 PF01453 B_lectin: D-mannose b 48.7 1.2E+02 0.0027 27.8 8.9 60 95-170 19-78 (114)
183 KOG0263 Transcription initiati 48.6 1.4E+02 0.0031 36.1 11.2 63 102-172 553-617 (707)
184 PRK01742 tolB translocation pr 48.4 4.6E+02 0.01 29.8 20.7 144 51-215 213-366 (429)
185 KOG4499 Ca2+-binding protein R 48.1 92 0.002 32.8 8.4 83 95-177 169-256 (310)
186 smart00108 B_lectin Bulb-type 46.8 1.9E+02 0.0042 26.3 9.9 52 107-173 31-82 (114)
187 KOG1188 WD40 repeat protein [G 46.8 2.2E+02 0.0047 31.5 11.3 64 148-215 43-107 (376)
188 PF14727 PHTB1_N: PTHB1 N-term 46.5 5.1E+02 0.011 29.8 20.9 187 53-257 145-363 (418)
189 PRK13684 Ycf48-like protein; P 43.8 4.8E+02 0.01 28.7 22.6 168 29-219 33-209 (334)
190 PF08553 VID27: VID27 cytoplas 43.5 3.9E+02 0.0085 33.3 14.2 110 96-213 493-608 (794)
191 TIGR02276 beta_rpt_yvtn 40-res 43.2 68 0.0015 23.1 5.1 31 52-82 2-33 (42)
192 cd00028 B_lectin Bulb-type man 42.6 2E+02 0.0043 26.3 9.3 22 151-173 62-83 (116)
193 KOG1517 Guanine nucleotide bin 42.3 4.2E+02 0.0092 33.8 13.9 147 96-254 1177-1333(1387)
194 PF06433 Me-amine-dh_H: Methyl 42.1 5.3E+02 0.012 28.7 30.5 189 54-258 3-217 (342)
195 KOG0308 Conserved WD40 repeat- 42.1 2.2E+02 0.0047 34.1 11.1 101 55-163 184-286 (735)
196 PRK04922 tolB translocation pr 42.0 5.7E+02 0.012 29.1 22.7 149 94-257 214-371 (433)
197 PF15525 DUF4652: Domain of un 40.8 3.8E+02 0.0083 27.3 11.2 65 497-573 88-153 (200)
198 PF03178 CPSF_A: CPSF A subuni 40.4 5.1E+02 0.011 28.0 22.0 175 64-254 3-202 (321)
199 KOG0265 U5 snRNP-specific prot 38.6 82 0.0018 34.1 6.6 63 52-115 101-164 (338)
200 PF14339 DUF4394: Domain of un 38.5 5E+02 0.011 27.4 14.6 165 50-221 35-224 (236)
201 PF09910 DUF2139: Uncharacteri 38.2 5.7E+02 0.012 28.0 18.9 157 26-216 16-187 (339)
202 PRK02889 tolB translocation pr 37.8 6.6E+02 0.014 28.6 23.6 149 51-215 205-365 (427)
203 COG3045 CreA Uncharacterized p 37.5 95 0.0021 30.2 6.2 58 1-61 3-62 (165)
204 KOG0283 WD40 repeat-containing 36.7 4.9E+02 0.011 31.8 13.3 110 96-218 421-540 (712)
205 KOG0379 Kelch repeat-containin 36.7 6.2E+02 0.013 29.6 14.2 155 94-257 69-252 (482)
206 KOG1188 WD40 repeat protein [G 36.4 4.8E+02 0.01 29.0 11.9 172 69-255 17-197 (376)
207 KOG2321 WD40 repeat protein [G 36.3 7.5E+02 0.016 29.5 14.0 157 94-257 62-261 (703)
208 KOG3914 WD repeat protein WDR4 35.6 83 0.0018 35.2 6.3 72 52-125 162-234 (390)
209 KOG0292 Vesicle coat complex C 35.3 1E+03 0.022 30.0 21.2 107 75-211 239-349 (1202)
210 PF02897 Peptidase_S9_N: Proly 34.5 7E+02 0.015 27.9 19.1 146 63-215 252-409 (414)
211 PF05567 Neisseria_PilC: Neiss 34.4 6.6E+02 0.014 27.8 13.4 55 201-257 180-242 (335)
212 KOG0281 Beta-TrCP (transducin 33.8 1.3E+02 0.0028 33.3 7.2 98 96-212 331-430 (499)
213 PF14783 BBS2_Mid: Ciliary BBS 33.6 3.9E+02 0.0086 24.8 11.8 68 53-127 15-82 (111)
214 KOG0771 Prolactin regulatory e 32.3 5.6E+02 0.012 29.0 12.0 19 237-255 294-312 (398)
215 KOG2395 Protein involved in va 31.6 7.1E+02 0.015 29.4 12.8 116 95-217 344-465 (644)
216 KOG0639 Transducin-like enhanc 30.9 2.2E+02 0.0048 33.0 8.7 75 50-127 560-634 (705)
217 TIGR02276 beta_rpt_yvtn 40-res 30.8 1.1E+02 0.0025 21.8 4.6 30 188-220 3-32 (42)
218 PF08596 Lgl_C: Lethal giant l 30.1 8.6E+02 0.019 27.6 13.6 182 53-265 97-300 (395)
219 KOG0281 Beta-TrCP (transducin 30.1 1.1E+02 0.0025 33.6 6.1 72 52-126 329-400 (499)
220 KOG1912 WD40 repeat protein [G 29.9 5.4E+02 0.012 31.8 11.8 76 98-177 81-158 (1062)
221 KOG0263 Transcription initiati 29.6 2.1E+02 0.0045 34.7 8.6 69 56-125 550-618 (707)
222 KOG0271 Notchless-like WD40 re 29.6 8.7E+02 0.019 27.5 18.9 64 95-167 127-192 (480)
223 KOG0289 mRNA splicing factor [ 29.5 9.2E+02 0.02 27.7 19.7 75 51-126 313-389 (506)
224 PRK01742 tolB translocation pr 28.9 9E+02 0.02 27.4 22.2 183 53-257 168-364 (429)
225 COG3386 Gluconolactonase [Carb 28.2 8.1E+02 0.018 26.8 12.6 75 51-127 172-255 (307)
226 PF11589 DUF3244: Domain of un 27.0 1.2E+02 0.0026 27.4 5.0 24 731-755 48-71 (106)
227 PF08553 VID27: VID27 cytoplas 27.0 2.6E+02 0.0056 34.8 9.1 63 53-117 542-608 (794)
228 PRK02889 tolB translocation pr 26.7 9.8E+02 0.021 27.2 27.0 149 94-257 206-363 (427)
229 PF01456 Mucin: Mucin-like gly 26.1 49 0.0011 31.7 2.4 27 1-27 1-27 (143)
230 COG4946 Uncharacterized protei 25.8 1.1E+03 0.024 27.5 24.2 195 92-312 232-441 (668)
231 PF02897 Peptidase_S9_N: Proly 25.3 9.8E+02 0.021 26.7 27.5 65 154-220 252-320 (414)
232 KOG0650 WD40 repeat nucleolar 25.3 2E+02 0.0044 34.0 7.3 31 98-128 414-444 (733)
233 COG4447 Uncharacterized protei 24.5 6.6E+02 0.014 27.3 10.3 174 54-242 139-322 (339)
234 COG3292 Predicted periplasmic 24.3 3.3E+02 0.0071 32.4 8.7 70 53-129 175-244 (671)
235 KOG0279 G protein beta subunit 24.3 9.4E+02 0.02 26.1 22.1 57 69-127 49-106 (315)
236 TIGR00548 lolB outer membrane 22.4 1.4E+02 0.0031 30.4 5.1 58 53-118 51-108 (202)
237 KOG1240 Protein kinase contain 21.4 4.8E+02 0.01 33.8 9.8 70 55-124 1165-1235(1431)
238 KOG0319 WD40-repeat-containing 21.2 1.6E+03 0.034 27.6 22.0 72 53-124 30-102 (775)
239 PRK13861 type IV secretion sys 20.9 3.6E+02 0.0077 29.4 8.0 33 3-35 2-34 (292)
240 PF14583 Pectate_lyase22: Olig 20.7 3.3E+02 0.0072 30.9 7.8 68 53-123 47-119 (386)
241 KOG0301 Phospholipase A2-activ 20.5 8.2E+02 0.018 29.7 11.0 94 56-161 193-287 (745)
242 KOG1445 Tumor-specific antigen 20.4 1.1E+03 0.024 28.5 11.8 60 94-162 139-200 (1012)
243 PF01453 B_lectin: D-mannose b 20.4 6.6E+02 0.014 22.9 8.7 60 53-122 19-78 (114)
244 KOG2110 Uncharacterized conser 20.2 1.3E+03 0.027 26.1 19.9 176 63-255 68-249 (391)
245 PF08894 DUF1838: Protein of u 20.2 74 0.0016 33.3 2.4 67 700-769 24-90 (238)
No 1
>KOG2103 consensus Uncharacterized conserved protein [Function unknown]
Probab=100.00 E-value=1.4e-113 Score=960.33 Aligned_cols=702 Identities=34% Similarity=0.487 Sum_probs=551.9
Q ss_pred HHHhccccccceeecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeee
Q 003800 12 FLSSCTIPSLSLYEDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGID 91 (794)
Q Consensus 12 ~l~~~~~~~~Al~edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~ 91 (794)
+|+++..+|+|+||||+||+|||++++| ++...|+.-.+..+++||+|++|+||+||.+||+++|||.++.+....+..
T Consensus 7 ~~~~~~~~~aav~edq~gkfdwr~~~vG-~~k~~~~~~~t~~~rlivsT~~~vlAsL~~~tGei~WRqvl~~~~~~~~~~ 85 (910)
T KOG2103|consen 7 ALALLLYRAAAVYEDQAGKFDWRQQLVG-VKKVNFLVYDTKSKRLIVSTEKGVLASLNLRTGEIIWRQVLEPKTSGLGVP 85 (910)
T ss_pred HHHHHHHHHHHHHHHHhhhcchhhhccc-ceeEEEEeecCCCceEEEEeccchhheecccCCcEEEEEeccCCCcccCcc
Confidence 3333445667999999999999999999 555556666667899999999999999999999999999998874432331
Q ss_pred eeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEe
Q 003800 92 IALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRD 171 (794)
Q Consensus 92 ~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~ 171 (794)
. .-++|.+|..+|+||.++|.+.|+..+..+ . ....+.. ...+.|+.+ .....|+..|...
T Consensus 86 ~-----~~~iS~dg~~lr~wn~~~g~l~~~i~l~~g-~-~~~~~~v-------~~~i~v~~g-----~~~~~g~l~w~~~ 146 (910)
T KOG2103|consen 86 L-----TNTISVDGRYLRSWNTNNGILDWEIELADG-F-KGLLLEV-------NKGIAVLNG-----HTRKFGELKWVES 146 (910)
T ss_pred e-----eEEEccCCcEEEeecCCCceeeeecccccc-c-ceeEEEE-------ccceEEEcc-----eeccccceeehhh
Confidence 1 115788889999999999999999999876 3 1111111 222333333 5667899999998
Q ss_pred ccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCcee-eeeeeecccCccCceEEEcCcEEEEEECCCCeEEE
Q 003800 172 FAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELL-NHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVT 250 (794)
Q Consensus 172 ~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~-w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v 250 (794)
.+.......|.+.+...+.+|++++--.++..|.+++..+|+.. |+.++..|+.-...|.-+.+.+++|.+ |.+..
T Consensus 147 ~~~~~~~~~q~~~~~~t~vvy~~~~l~~s~~~V~~~~~~~g~v~~~~~~v~~pw~~~~~c~~~k~~vl~~s~---g~l~s 223 (910)
T KOG2103|consen 147 FSISIEEDLQDAKIYGTDVVYVLGLLKRSGSCVQQVFSDDGEVTGPQSTVLGPWFKVLSCSTDKEVVLVCSN---GTLIS 223 (910)
T ss_pred ccccchhHHHHhhhccCcEEEEEEEEecCCceEEEEEccCCcEecceeeeecCcccccccccccceEEEcCC---CCeEE
Confidence 87654434454334578889999887667779999999999999 888888886555566555666788885 57888
Q ss_pred EEeecceeeeEEEeecccCCCCCCceEEeecCCcc-eeEEEecCcEEEEEEecCCcEEEEEeecCcceeeeeeeecCCce
Q 003800 251 VSFKNRKIAFQETHLSNLGEDSSGMVEILPSSLTG-MFTVKINNYKLFIRLTSEDKLEVVHKVDHETVVSDALVFSEGKE 329 (794)
Q Consensus 251 ~~l~sg~~~~~~~~l~~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~v~~~~~~~~~~s~~~~~~~~~~ 329 (794)
.|+..++....+... +++-. +.| ...+..++|..++.+++.|...++......-..+.+++..++..
T Consensus 224 ~di~~~~~~~~q~~~-----------e~l~~-l~g~~i~~~g~~~~~~V~V~s~~~~~v~~~~~~e~~lsdsl~~~~d~e 291 (910)
T KOG2103|consen 224 LDISSQKVQISQLLA-----------EILLP-LTGDLILLDGNKHTAMVSVNSSSNHWVYLFCRSEVDLSDSLEAGGDTE 291 (910)
T ss_pred EEEEeeccchhhhhh-----------hhhhc-cCCceEEecCCCceeEEEEecCCCeEEEeecccceeeccccccccccc
Confidence 888776521111111 11110 111 44455556778899987776666544332223344455556666
Q ss_pred EEEEEEEcCce----EEEEEeeeeeeecCccceeeeeccCCceeEEEEEEEEEecCCcceEEEEEEEcCCcEEEEECCeE
Q 003800 330 AFAVVEHGGSK----VDITVKPGQDWNNNLVQESIEMDHQRGLVHKVFINNYLRTDRSHGFRALIVMEDHSLLLVQQGKI 405 (794)
Q Consensus 330 ~~~~~~~~~~~----v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~r~l~~t~d~~~~l~~~g~~ 405 (794)
++.++.+..+. |+.......+.. +....++...+.|+.+.. +..++++.+||++++++|+.+.+.|||.+
T Consensus 292 ~~~si~~~ss~~~~~V~~vn~l~~~~~----~~~~~~~~~l~~p~~F~~--~~~~~~e~~~~al~~~~d~~~~~~qng~i 365 (910)
T KOG2103|consen 292 ASKSIHPESSYLFDQVFIVNNLYLVLD----AQSILLEQKLSRPEVFGT--FEYFDREIGALALVVNDDHSLLFLQNGLI 365 (910)
T ss_pred cceeeecccchhhheeeehhhhhhcch----hhhhhhhcccCcchhcce--eEEeccccceEEEEEecCceEEEEeCcce
Confidence 66665555433 222222222222 223344444555644322 34455566999999999999999999887
Q ss_pred E-EEeccccccceeEEEEeCCCCcccchhhhhhhhhc----hhHHHHHH-hhhcccccCChhhHHHHhh-------cc-c
Q 003800 406 V-WNREDALASIIDVTTSELPVEKEGVSVAKVEHSLF----EWLKGHML-KLKGTLMLASPEDVAAIQA-------IR-L 471 (794)
Q Consensus 406 ~-W~ReEsLa~i~~~~~vdlp~~~~~~~~~~le~e~~----~~~~~~~~-Rl~~~~~~~~~~~~~~l~~-------~~-~ 471 (794)
. |+|||+||++++++|+|||++++ ++.+|.||. +++++||+ |+.+ |+.+|++ .+ .
T Consensus 366 ~~WsREEsLa~vvd~~~vdlpLs~~---~~~~e~e~~~~~~~~l~~afl~R~~t--------q~~ql~~~~~h~~~~~~~ 434 (910)
T KOG2103|consen 366 LVWSREESLANVVDVEMVDLPLSRD---QGLLEDEFEDKESNSLWGAFLKRLTT--------QFNQLINLLKHNQGLPTP 434 (910)
T ss_pred EEeehhhhhhhhccceeeccccccc---hhhHHHHhhccccchHHHHHHHHHHH--------HHHHHHHHHHhhhccCCC
Confidence 7 99999999999999999999998 667777763 36999999 9999 8888766 22 4
Q ss_pred cccCccc-ccccCCCceEEEEEEecCceEEEEECCCCcEEEEEecccCCCCCCCceee-EEeeecCcccCCCCCCeEEEE
Q 003800 472 KSSEKSK-MTRDHNGFRKLLIVLTKARKIFALHSGDGRVVWSLLLHKSEACDSPTELN-LYQWQTPHHHAMDENPSVLVV 549 (794)
Q Consensus 472 ~~~~~~~-~~rD~FGf~Klivv~T~~Gkl~alds~~G~i~W~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~vv 549 (794)
+++.+++ +.||.||||||||++|++|||||||+.+|+++|++.+++... +++.++ ++|+..+||| +++.|.|+
T Consensus 435 ~s~~~n~~l~rD~Fgl~K~iIvlT~tGkiFglds~~G~i~Wkl~L~~~~~--~~e~v~l~vqr~~~H~~---~d~~~svl 509 (910)
T KOG2103|consen 435 LSALKNKDLSRDKFGLRKMIIVLTSTGKIFGLDSVDGQIHWKLWLPNVQQ--NPEGVKLFVQRTTAHFP---LDEDPSVL 509 (910)
T ss_pred cccccccceeecccCceeEEEEEecCceEEEEEcCCCeEEEEEecCcccC--CcccceEEEEeccccCC---CCCCCeEE
Confidence 4555666 999999999999999999999999999999999999997432 356899 7888888998 78888888
Q ss_pred EEecCCCCCCcEEEEEEccCCceecccccccceeEEEeecccCCccceEEEEEcCCCceEEccCChhhhhhhhhcccceE
Q 003800 550 GRCGVSSKAPAILSFVDTYTGKELNSFDLVHSAVQVMPLPFTDSTEQRLHLLVDDDRRIHLYPKTSEAISIFQQEFSNIY 629 (794)
Q Consensus 550 ~~~~~~~~~~~~~~~~d~~tG~~~~~~~l~~~~~~~~~lp~~~~~~~~~~~~~d~~~~v~~~P~~~~~~~~~~~~~~~~~ 629 (794)
++++ .+++++++.|||++|++.++.+++++++|.++||.++.++++.++++|+.+.+++||.+.+.+..++++++++|
T Consensus 510 f~~k--~s~~gvly~fn~~~Gkv~s~~~l~~~v~q~sllp~~~~d~~~~illidd~~~v~l~P~~~~~l~~~~~~a~s~y 587 (910)
T KOG2103|consen 510 FVHK--GSGNGVLYEFNPITGKVISRSPLDYRVKQLSLLPVTEHDHQYLILLIDDHLKVKLYPGTSTDLEIVANEASSIY 587 (910)
T ss_pred EEec--cCCCeEEEEEecCcceeeecCccCCceeeEEeccccccccceeEEEecccceEEecCCCcccchhhhhccCccE
Confidence 8876 57899999999999999998889999999999999999999999999999999999999999999999999999
Q ss_pred EEEEEccCCeEEEEEEeecCCCcccccccceeeEeEEEEcCCCCceEEEEeeccCCcccccceeeecCCeeEeeccCCce
Q 003800 630 WYSVEADNGIIKGHAVKSKCAGEVLDDFCFETRVLWSIIFPMESEKIIAAVSRKQNEVVHTQAKVTSEQDVMYKYISKNL 709 (794)
Q Consensus 630 ~~~~d~~~~~l~G~~~~~~~~~~~~~~~~~~~~~~W~~~~~~~~e~Iv~~~~r~~~e~v~S~g~VLgDRsVLYKYLNPNl 709 (794)
+|++|.++|.|+||.++.+ ++..++|+.++|++.|+||++..|+++|+|||+|||||||+||||||||||
T Consensus 588 ~Yt~e~~~~~i~Gy~i~~~----------lT~~~~W~~~l~~e~e~IIav~~r~p~e~VhSqGrVlgdrsVlYKYlnPNL 657 (910)
T KOG2103|consen 588 LYTVEADTGGIYGYIIKAD----------LTTTQTWKKNLPSEKEKIIAVKGRNPNEHVHSQGRVLGDRSVLYKYLNPNL 657 (910)
T ss_pred EEEEEcccCcEEEEEEecc----------cceeeeeeeccCchhheeeEeccCCcchheeecceecccceeeeeccCcch
Confidence 9999999999999999844 578899999999777999999999999999999999999999999999999
Q ss_pred EEEEEEcCCCCCCcCCCCCCCcEEEEEEEEceeeeEEEEEEecCCCCCceEEEEecEEEEEEEeCCcceEEEEEEEEecC
Q 003800 710 LFVATVAPKASGHIGSADPDEAWLVVYLIDTITGRILHRMTHHGAQGPVHAVLSENWVVYHYFNLRAHRYEMSVTEIYDQ 789 (794)
Q Consensus 710 ~~v~t~~~~~~~~~~~~~~~~~~l~v~liD~VTG~il~s~~h~~~~~pi~~v~~ENWvvYsy~~~~~~~~~i~vvELyE~ 789 (794)
+||+|.++++ ++ ..++||||+|||+|+|+++|+++++|||+||||||+||||||++.+|+||+|+|||||
T Consensus 658 ~A~~t~~~~~-------~~---~~~~~LiD~VTG~Ivht~~h~k~~~PvhiVfSENWvvYsYfs~k~~rteltvvELYEg 727 (910)
T KOG2103|consen 658 AAVATANPDD-------HH---ETFLYLIDTVTGSIVHTQSHQKARGPVHIVFSENWVVYSYFSDKARRTELTVVELYEG 727 (910)
T ss_pred hheeecCcCC-------ce---eEEEEEEeeeeeEEEEeeehhhhcCceEEEEecceEEEEEeccccccceEEEEEEecC
Confidence 9999999983 21 1256999999999999999999999999999999999999999999999999999999
Q ss_pred Ccc
Q 003800 790 SRA 792 (794)
Q Consensus 790 ~~~ 792 (794)
++.
T Consensus 728 s~~ 730 (910)
T KOG2103|consen 728 SEQ 730 (910)
T ss_pred Ccc
Confidence 864
No 2
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=99.89 E-value=2.1e-20 Score=210.56 Aligned_cols=241 Identities=20% Similarity=0.304 Sum_probs=168.0
Q ss_pred ChHHHHHHHHHHHHhccccccceeec---------------ccccEeeEEeccCceeeeee--eeeccCCCEEEEEeCCC
Q 003800 1 MAIRFIILTLLFLSSCTIPSLSLYED---------------QVGLMDWHQQYIGKVKHAVF--HTQKTGRKRVVVSTEEN 63 (794)
Q Consensus 1 ~~~~~~l~~l~~l~~~~~~~~Al~ed---------------qvG~~dW~~~~vG~~~~~~f--~~~~~~~~~Vyv~t~~g 63 (794)
|-+|.+++..|++++|++.|+.++.. ..++..|+.++ |......+ ..|...+++||+++.+|
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~W~~~~-g~g~~~~~~~~sPvv~~~~vy~~~~~g 79 (394)
T PRK11138 1 MQLRKTLLPGLLSVTLLSGCSSFNSEEDVVKMSPLPQVENQFTPTTVWSTSV-GDGVGDYYSRLHPAVAYNKVYAADRAG 79 (394)
T ss_pred CcHHHHHHHHHHHHHHhhhcCCCCCCccccCCCCcccccccCCcceeeEEEc-CCCCccceeeeccEEECCEEEEECCCC
Confidence 56788777777777777777765421 25678999986 43321111 13555689999999999
Q ss_pred EEEEEECcCCccceEEEcCcccc---------eeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800 64 VIASLDLRHGEIFWRHVLGINDV---------VDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL 134 (794)
Q Consensus 64 ~l~ALn~~tG~ivWR~~l~~~~~---------i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~ 134 (794)
.|+|||++||+++|++.+..... +.+. +...++.|++++.++.++|+|++||+++|+.++.++.. +.+
T Consensus 80 ~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~-~~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~--ssP 156 (394)
T PRK11138 80 LVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGG-VTVAGGKVYIGSEKGQVYALNAEDGEVAWQTKVAGEAL--SRP 156 (394)
T ss_pred eEEEEECCCCcEeeEEcCCCcccccccccccccccc-cEEECCEEEEEcCCCEEEEEECCCCCCcccccCCCcee--cCC
Confidence 99999999999999999876211 1111 24456677777766799999999999999999876543 223
Q ss_pred ccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeee-eEEEEecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800 135 LVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQ-QVIQLDESDQIYVVGYAGSSQFHAYQINAMNG 212 (794)
Q Consensus 135 ~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~-~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG 212 (794)
++. ++.+++. .+|.|+++|.+||+++|+++...+..... ...+...++.+|+.+..| .++++|+++|
T Consensus 157 ~v~-------~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~sP~v~~~~v~~~~~~g----~v~a~d~~~G 225 (394)
T PRK11138 157 VVS-------DGLVLVHTSNGMLQALNESDGAVKWTVNLDVPSLTLRGESAPATAFGGAIVGGDNG----RVSAVLMEQG 225 (394)
T ss_pred EEE-------CCEEEEECCCCEEEEEEccCCCEeeeecCCCCcccccCCCCCEEECCEEEEEcCCC----EEEEEEccCC
Confidence 332 5677776 48999999999999999998754322100 011123567888766555 8999999999
Q ss_pred ceeeeeeeecccC---------ccCceEEEcCcEEEEEECCCCeEEEEEeeccee
Q 003800 213 ELLNHETAAFSGG---------FVGDVALVSSDTLVTLDTTRSILVTVSFKNRKI 258 (794)
Q Consensus 213 ~~~w~~~v~~~~~---------~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~ 258 (794)
+.+|+.++..+.+ +..++++.++.++ +.+ ..|.++++|+.+|++
T Consensus 226 ~~~W~~~~~~~~~~~~~~~~~~~~~sP~v~~~~vy-~~~-~~g~l~ald~~tG~~ 278 (394)
T PRK11138 226 QLIWQQRISQPTGATEIDRLVDVDTTPVVVGGVVY-ALA-YNGNLVALDLRSGQI 278 (394)
T ss_pred hhhheeccccCCCccchhcccccCCCcEEECCEEE-EEE-cCCeEEEEECCCCCE
Confidence 9999987655422 2234555454444 444 358999999999984
No 3
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=99.84 E-value=4.5e-18 Score=190.28 Aligned_cols=216 Identities=16% Similarity=0.275 Sum_probs=151.6
Q ss_pred eecccccEeeEEeccCceee-e-eeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE
Q 003800 24 YEDQVGLMDWHQQYIGKVKH-A-VFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL 101 (794)
Q Consensus 24 ~edqvG~~dW~~~~vG~~~~-~-~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V 101 (794)
..++.|++.|+.++ |.... . .-..|...+++||+++.+|.|+|+|++||+++|++.+... +.+. +..+++.+++
T Consensus 35 ~~~~~~~~~W~~~~-~~~~~~~~~~~~p~v~~~~v~v~~~~g~v~a~d~~tG~~~W~~~~~~~--~~~~-p~v~~~~v~v 110 (377)
T TIGR03300 35 QPTVKVDQVWSASV-GDGVGHYYLRLQPAVAGGKVYAADADGTVVALDAETGKRLWRVDLDER--LSGG-VGADGGLVFV 110 (377)
T ss_pred cccCcceeeeEEEc-CCCcCccccccceEEECCEEEEECCCCeEEEEEccCCcEeeeecCCCC--cccc-eEEcCCEEEE
Confidence 45678999999987 44321 1 1123555689999999999999999999999999999765 3322 3456677878
Q ss_pred EccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeee
Q 003800 102 SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQ 180 (794)
Q Consensus 102 s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~ 180 (794)
++.++.+++||+.+|+++|+..+.++.. ..+++ . ++.+++. .+|.|+++|.++|+++|+++...+.....
T Consensus 111 ~~~~g~l~ald~~tG~~~W~~~~~~~~~--~~p~v------~-~~~v~v~~~~g~l~a~d~~tG~~~W~~~~~~~~~~~~ 181 (377)
T TIGR03300 111 GTEKGEVIALDAEDGKELWRAKLSSEVL--SPPLV------A-NGLVVVRTNDGRLTALDAATGERLWTYSRVTPALTLR 181 (377)
T ss_pred EcCCCEEEEEECCCCcEeeeeccCceee--cCCEE------E-CCEEEEECCCCeEEEEEcCCCceeeEEccCCCceeec
Confidence 7766799999999999999998876543 22222 2 5667776 58999999999999999998765432110
Q ss_pred e-EEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccC---------ccCceEEEcCcEEEEEECCCCeEEE
Q 003800 181 Q-VIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGG---------FVGDVALVSSDTLVTLDTTRSILVT 250 (794)
Q Consensus 181 ~-~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~---------~s~~~~~vg~~~lv~~d~~~g~L~v 250 (794)
. ..+...++.+|+....| +++++|+++|+.+|+..+..+.+ ....+.+ .++.+++.+ ..|.+++
T Consensus 182 ~~~sp~~~~~~v~~~~~~g----~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~-~~~~vy~~~-~~g~l~a 255 (377)
T TIGR03300 182 GSASPVIADGGVLVGFAGG----KLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVV-DGGQVYAVS-YQGRVAA 255 (377)
T ss_pred CCCCCEEECCEEEEECCCC----EEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEE-ECCEEEEEE-cCCEEEE
Confidence 0 00113456777544334 89999999999999987654422 1223443 334444444 3588999
Q ss_pred EEeeccee
Q 003800 251 VSFKNRKI 258 (794)
Q Consensus 251 ~~l~sg~~ 258 (794)
+|+++|++
T Consensus 256 ~d~~tG~~ 263 (377)
T TIGR03300 256 LDLRSGRV 263 (377)
T ss_pred EECCCCcE
Confidence 99999874
No 4
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=99.77 E-value=1.6e-16 Score=179.15 Aligned_cols=213 Identities=15% Similarity=0.244 Sum_probs=146.7
Q ss_pred cccccEeeEEeccCceee------ee-eeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEE
Q 003800 26 DQVGLMDWHQQYIGKVKH------AV-FHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYV 98 (794)
Q Consensus 26 dqvG~~dW~~~~vG~~~~------~~-f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~ 98 (794)
.+.|+..|++++-+.... .. ...|...+++||+++.+|.|+|||++||+++|++.+... +... +...++.
T Consensus 86 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~--~~ss-P~v~~~~ 162 (394)
T PRK11138 86 ADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVAGGKVYIGSEKGQVYALNAEDGEVAWQTKVAGE--ALSR-PVVSDGL 162 (394)
T ss_pred CCCCcEeeEEcCCCcccccccccccccccccEEECCEEEEEcCCCEEEEEECCCCCCcccccCCCc--eecC-CEEECCE
Confidence 458999999987542110 01 112445688999999999999999999999999988654 3332 2344566
Q ss_pred EEEEccCCeEEEEeCCCCcEeEEEeccCcccc---CCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccC
Q 003800 99 ITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS---KPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAA 174 (794)
Q Consensus 99 V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s---~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~ 174 (794)
++++..++.++|+|++||+++|+.....+... ...|.+ . ++.+++. .+|.++++|..+|+++|+.+...
T Consensus 163 v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~sP~v------~-~~~v~~~~~~g~v~a~d~~~G~~~W~~~~~~ 235 (394)
T PRK11138 163 VLVHTSNGMLQALNESDGAVKWTVNLDVPSLTLRGESAPAT------A-FGGAIVGGDNGRVSAVLMEQGQLIWQQRISQ 235 (394)
T ss_pred EEEECCCCEEEEEEccCCCEeeeecCCCCcccccCCCCCEE------E-CCEEEEEcCCCEEEEEEccCChhhheecccc
Confidence 77766567999999999999999987643220 111222 2 4566666 58999999999999999987543
Q ss_pred cce--ee-----eeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCe
Q 003800 175 ESV--EV-----QQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSI 247 (794)
Q Consensus 175 ~~~--~~-----~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~ 247 (794)
+.. .. ....+...++.+|+.+..| .++|+|++||+.+|+.....+. .+.+.++.+++ .+ .+|.
T Consensus 236 ~~~~~~~~~~~~~~~sP~v~~~~vy~~~~~g----~l~ald~~tG~~~W~~~~~~~~----~~~~~~~~vy~-~~-~~g~ 305 (394)
T PRK11138 236 PTGATEIDRLVDVDTTPVVVGGVVYALAYNG----NLVALDLRSGQIVWKREYGSVN----DFAVDGGRIYL-VD-QNDR 305 (394)
T ss_pred CCCccchhcccccCCCcEEECCEEEEEEcCC----eEEEEECCCCCEEEeecCCCcc----CcEEECCEEEE-Ec-CCCe
Confidence 310 00 0011124688999887666 8999999999999998654321 23333444444 43 3689
Q ss_pred EEEEEeeccee
Q 003800 248 LVTVSFKNRKI 258 (794)
Q Consensus 248 L~v~~l~sg~~ 258 (794)
++++|..+|++
T Consensus 306 l~ald~~tG~~ 316 (394)
T PRK11138 306 VYALDTRGGVE 316 (394)
T ss_pred EEEEECCCCcE
Confidence 99999999873
No 5
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=99.74 E-value=9.3e-16 Score=159.57 Aligned_cols=216 Identities=19% Similarity=0.301 Sum_probs=145.0
Q ss_pred eecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEc
Q 003800 24 YEDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSS 103 (794)
Q Consensus 24 ~edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~ 103 (794)
+..+.|+..|+.++ +.........+...++++|+++.++.|+|+|++||+++|++.++.. +...+...++.+++.+.
T Consensus 8 ~d~~tG~~~W~~~~-~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~~~--~~~~~~~~~~~v~v~~~ 84 (238)
T PF13360_consen 8 LDPRTGKELWSYDL-GPGIGGPVATAVPDGGRVYVASGDGNLYALDAKTGKVLWRFDLPGP--ISGAPVVDGGRVYVGTS 84 (238)
T ss_dssp EETTTTEEEEEEEC-SSSCSSEEETEEEETTEEEEEETTSEEEEEETTTSEEEEEEECSSC--GGSGEEEETTEEEEEET
T ss_pred EECCCCCEEEEEEC-CCCCCCccceEEEeCCEEEEEcCCCEEEEEECCCCCEEEEeecccc--ccceeeecccccccccc
Confidence 45569999999987 4322111211223488999999999999999999999999999655 22222345555555454
Q ss_pred cCCeEEEEeCCCCcEeEEE-eccCccccCCccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcce-e--
Q 003800 104 DGSTLRAWNLPDGQMVWES-FLRGSKHSKPLLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESV-E-- 178 (794)
Q Consensus 104 ~g~~v~A~d~~tG~llWe~-~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~-~-- 178 (794)
++.++++|+.||+++|+. ....+.. .. ......... ++.+++.. ++.|+++|++||+++|+++...+.. .
T Consensus 85 -~~~l~~~d~~tG~~~W~~~~~~~~~~--~~-~~~~~~~~~-~~~~~~~~~~g~l~~~d~~tG~~~w~~~~~~~~~~~~~ 159 (238)
T PF13360_consen 85 -DGSLYALDAKTGKVLWSIYLTSSPPA--GV-RSSSSPAVD-GDRLYVGTSSGKLVALDPKTGKLLWKYPVGEPRGSSPI 159 (238)
T ss_dssp -TSEEEEEETTTSCEEEEEEE-SSCTC--ST-B--SEEEEE-TTEEEEEETCSEEEEEETTTTEEEEEEESSTT-SS--E
T ss_pred -eeeeEecccCCcceeeeecccccccc--cc-ccccCceEe-cCEEEEEeccCcEEEEecCCCcEEEEeecCCCCCCcce
Confidence 459999999999999995 4332221 10 000000222 56677765 9999999999999999998865331 1
Q ss_pred ------eeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEE
Q 003800 179 ------VQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVS 252 (794)
Q Consensus 179 ------~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~ 252 (794)
..++ ...++.+|+.+..| .+.++|..+|+.+|+..... .. ..+...++.+++.+ ..+.++++|
T Consensus 160 ~~~~~~~~~~--~~~~~~v~~~~~~g----~~~~~d~~tg~~~w~~~~~~---~~-~~~~~~~~~l~~~~-~~~~l~~~d 228 (238)
T PF13360_consen 160 SSFSDINGSP--VISDGRVYVSSGDG----RVVAVDLATGEKLWSKPISG---IY-SLPSVDGGTLYVTS-SDGRLYALD 228 (238)
T ss_dssp EEETTEEEEE--ECCTTEEEEECCTS----SEEEEETTTTEEEEEECSS----EC-ECEECCCTEEEEEE-TTTEEEEEE
T ss_pred eeecccccce--EEECCEEEEEcCCC----eEEEEECCCCCEEEEecCCC---cc-CCceeeCCEEEEEe-CCCEEEEEE
Confidence 1122 24567899876666 48888999999999664222 11 22334556777777 579999999
Q ss_pred eeccee
Q 003800 253 FKNRKI 258 (794)
Q Consensus 253 l~sg~~ 258 (794)
+.+|++
T Consensus 229 ~~tG~~ 234 (238)
T PF13360_consen 229 LKTGKV 234 (238)
T ss_dssp TTTTEE
T ss_pred CCCCCE
Confidence 999984
No 6
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=99.71 E-value=6.8e-15 Score=164.66 Aligned_cols=209 Identities=19% Similarity=0.283 Sum_probs=144.4
Q ss_pred cccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccC
Q 003800 26 DQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDG 105 (794)
Q Consensus 26 dqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g 105 (794)
.+.|+..|++++-+... ..|..+++++|+++.+|.|+|||++||+++|+..+... +... +...++.+++...+
T Consensus 82 ~~tG~~~W~~~~~~~~~----~~p~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~--~~~~-p~v~~~~v~v~~~~ 154 (377)
T TIGR03300 82 AETGKRLWRVDLDERLS----GGVGADGGLVFVGTEKGEVIALDAEDGKELWRAKLSSE--VLSP-PLVANGLVVVRTND 154 (377)
T ss_pred ccCCcEeeeecCCCCcc----cceEEcCCEEEEEcCCCEEEEEECCCCcEeeeeccCce--eecC-CEEECCEEEEECCC
Confidence 46899999998755432 23445688999999999999999999999999988654 3322 23445567766656
Q ss_pred CeEEEEeCCCCcEeEEEeccCcccc---CCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcce--ee
Q 003800 106 STLRAWNLPDGQMVWESFLRGSKHS---KPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESV--EV 179 (794)
Q Consensus 106 ~~v~A~d~~tG~llWe~~l~~~~~s---~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~--~~ 179 (794)
+.+++||+++|+++|+.....+... ...+. .. ++.+++. .+|.++++|..+|+.+|+.+...+.. ..
T Consensus 155 g~l~a~d~~tG~~~W~~~~~~~~~~~~~~~sp~------~~-~~~v~~~~~~g~v~ald~~tG~~~W~~~~~~~~g~~~~ 227 (377)
T TIGR03300 155 GRLTALDAATGERLWTYSRVTPALTLRGSASPV------IA-DGGVLVGFAGGKLVALDLQTGQPLWEQRVALPKGRTEL 227 (377)
T ss_pred CeEEEEEcCCCceeeEEccCCCceeecCCCCCE------EE-CCEEEEECCCCEEEEEEccCCCEeeeeccccCCCCCch
Confidence 7999999999999999987654320 01111 12 4556555 47999999999999999986543210 00
Q ss_pred -----eeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEee
Q 003800 180 -----QQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFK 254 (794)
Q Consensus 180 -----~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~ 254 (794)
....+...++.+|+.+..| .++|+|++||+.+|+...... ..+.+.++.+++ .+ .+|.++++|..
T Consensus 228 ~~~~~~~~~p~~~~~~vy~~~~~g----~l~a~d~~tG~~~W~~~~~~~----~~p~~~~~~vyv-~~-~~G~l~~~d~~ 297 (377)
T TIGR03300 228 ERLVDVDGDPVVDGGQVYAVSYQG----RVAALDLRSGRVLWKRDASSY----QGPAVDDNRLYV-TD-ADGVVVALDRR 297 (377)
T ss_pred hhhhccCCccEEECCEEEEEEcCC----EEEEEECCCCcEEEeeccCCc----cCceEeCCEEEE-EC-CCCeEEEEECC
Confidence 0001123578999877666 799999999999999863221 123333434444 43 46899999998
Q ss_pred ccee
Q 003800 255 NRKI 258 (794)
Q Consensus 255 sg~~ 258 (794)
+|++
T Consensus 298 tG~~ 301 (377)
T TIGR03300 298 SGSE 301 (377)
T ss_pred CCcE
Confidence 8873
No 7
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=99.65 E-value=1.1e-14 Score=168.43 Aligned_cols=220 Identities=15% Similarity=0.158 Sum_probs=141.6
Q ss_pred ccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCccc-----ceeeeeeeeCC-EEEE
Q 003800 27 QVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND-----VVDGIDIALGK-YVIT 100 (794)
Q Consensus 27 qvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~-----~i~~l~~~~g~-~~V~ 100 (794)
+.+++.|+.+. |.. ......|...+++||+++.++.|+|||++||+++|++.+.... .+..-.+...+ +.|+
T Consensus 37 ~~~~~~W~~~~-~~~-~~~~~sPvv~~g~vy~~~~~g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~ 114 (488)
T cd00216 37 KKLKVAWTFST-GDE-RGQEGTPLVVDGDMYFTTSHSALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVF 114 (488)
T ss_pred hcceeeEEEEC-CCC-CCcccCCEEECCEEEEeCCCCcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEE
Confidence 45779999987 310 0112234445899999999999999999999999999886541 00000112334 7788
Q ss_pred EEccCCeEEEEeCCCCcEeEEEeccCcc-----ccCCccccccccccccCCeEEEEE----------CCEEEEEECCCCc
Q 003800 101 LSSDGSTLRAWNLPDGQMVWESFLRGSK-----HSKPLLLVPTNLKVDKDSLILVSS----------KGCLHAVSSIDGE 165 (794)
Q Consensus 101 Vs~~g~~v~A~d~~tG~llWe~~l~~~~-----~s~~~~~~~~~~~~~~~~~V~V~~----------~g~l~ald~~tG~ 165 (794)
++..++.|+|+|++||+++|+....... . ...+.+ . ++.+++.+ +|.|+|||+.||+
T Consensus 115 v~~~~g~v~AlD~~TG~~~W~~~~~~~~~~~~~i-~ssP~v------~-~~~v~vg~~~~~~~~~~~~g~v~alD~~TG~ 186 (488)
T cd00216 115 FGTFDGRLVALDAETGKQVWKFGNNDQVPPGYTM-TGAPTI------V-KKLVIIGSSGAEFFACGVRGALRAYDVETGK 186 (488)
T ss_pred EecCCCeEEEEECCCCCEeeeecCCCCcCcceEe-cCCCEE------E-CCEEEEeccccccccCCCCcEEEEEECCCCc
Confidence 8776789999999999999999987642 1 112222 2 46666643 4789999999999
Q ss_pred EEEEEeccCcc-eee------------------eeEEEEecCCEEEEEEecCC--------------ceeEEEEEEcCCC
Q 003800 166 ILWTRDFAAES-VEV------------------QQVIQLDESDQIYVVGYAGS--------------SQFHAYQINAMNG 212 (794)
Q Consensus 166 ~~W~~~~~~~~-~~~------------------~~~v~s~~~~~vyv~~~~g~--------------~~~~v~ald~~tG 212 (794)
++|+++...+. ... .........+.||+.+..+. ..-.++|||++||
T Consensus 187 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~g~~vw~~pa~d~~~g~V~vg~~~g~~~~~~~~~~~~~~~~~~~l~Ald~~tG 266 (488)
T cd00216 187 LLWRFYTTEPDPNAFPTWGPDRQMWGPGGGTSWASPTYDPKTNLVYVGTGNGSPWNWGGRRTPGDNLYTDSIVALDADTG 266 (488)
T ss_pred eeeEeeccCCCcCCCCCCCCCcceecCCCCCccCCeeEeCCCCEEEEECCCCCCCccCCccCCCCCCceeeEEEEcCCCC
Confidence 99999774221 000 01111124678887643320 1237999999999
Q ss_pred ceeeeeeeeccc----CccCceEEE-----cCc---EEEEEECCCCeEEEEEeecce
Q 003800 213 ELLNHETAAFSG----GFVGDVALV-----SSD---TLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 213 ~~~w~~~v~~~~----~~s~~~~~v-----g~~---~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+++|+.+...+. .....+.+. .+. ++++.. .+|.++++|..+|+
T Consensus 267 ~~~W~~~~~~~~~~~~~~~s~p~~~~~~~~~g~~~~~V~~g~-~~G~l~ald~~tG~ 322 (488)
T cd00216 267 KVKWFYQTTPHDLWDYDGPNQPSLADIKPKDGKPVPAIVHAP-KNGFFYVLDRTTGK 322 (488)
T ss_pred CEEEEeeCCCCCCcccccCCCCeEEeccccCCCeeEEEEEEC-CCceEEEEECCCCc
Confidence 999999764331 111122222 111 334443 56889999999998
No 8
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=99.65 E-value=1.9e-14 Score=166.58 Aligned_cols=230 Identities=10% Similarity=0.158 Sum_probs=143.9
Q ss_pred ecccccEeeEEeccCcee--eeeeeeeccCCCEEEEEeC---------CCEEEEEECcCCccceEEEcCcccc-------
Q 003800 25 EDQVGLMDWHQQYIGKVK--HAVFHTQKTGRKRVVVSTE---------ENVIASLDLRHGEIFWRHVLGINDV------- 86 (794)
Q Consensus 25 edqvG~~dW~~~~vG~~~--~~~f~~~~~~~~~Vyv~t~---------~g~l~ALn~~tG~ivWR~~l~~~~~------- 86 (794)
..+.|+..|++++-+... ...-..|...++.+|+++. .|.|+|||++||+++|++.+.....
T Consensus 126 D~~TG~~~W~~~~~~~~~~~~~i~ssP~v~~~~v~vg~~~~~~~~~~~~g~v~alD~~TG~~~W~~~~~~~~~~~~~~~~ 205 (488)
T cd00216 126 DAETGKQVWKFGNNDQVPPGYTMTGAPTIVKKLVIIGSSGAEFFACGVRGALRAYDVETGKLLWRFYTTEPDPNAFPTWG 205 (488)
T ss_pred ECCCCCEeeeecCCCCcCcceEecCCCEEECCEEEEeccccccccCCCCcEEEEEECCCCceeeEeeccCCCcCCCCCCC
Confidence 456899999998755421 1111224444788998874 5789999999999999998853210
Q ss_pred ------------eeeeeeee--CCEEEEEEccC------------------CeEEEEeCCCCcEeEEEeccCccc----c
Q 003800 87 ------------VDGIDIAL--GKYVITLSSDG------------------STLRAWNLPDGQMVWESFLRGSKH----S 130 (794)
Q Consensus 87 ------------i~~l~~~~--g~~~V~Vs~~g------------------~~v~A~d~~tG~llWe~~l~~~~~----s 130 (794)
+-.. ++. .+++|+++..+ +.|+|+|++||+++|+.+...... .
T Consensus 206 ~~~~~~~~~g~~vw~~-pa~d~~~g~V~vg~~~g~~~~~~~~~~~~~~~~~~~l~Ald~~tG~~~W~~~~~~~~~~~~~~ 284 (488)
T cd00216 206 PDRQMWGPGGGTSWAS-PTYDPKTNLVYVGTGNGSPWNWGGRRTPGDNLYTDSIVALDADTGKVKWFYQTTPHDLWDYDG 284 (488)
T ss_pred CCcceecCCCCCccCC-eeEeCCCCEEEEECCCCCCCccCCccCCCCCCceeeEEEEcCCCCCEEEEeeCCCCCCccccc
Confidence 0011 122 45778886533 279999999999999998653211 0
Q ss_pred CCccccccccc-cccC--CeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEec---------
Q 003800 131 KPLLLVPTNLK-VDKD--SLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYA--------- 197 (794)
Q Consensus 131 ~~~~~~~~~~~-~~~~--~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~--------- 197 (794)
...+.+. ... .++. ..|++. .+|.|+|||++||+++|+.+...... +..++.||+.+..
T Consensus 285 ~s~p~~~-~~~~~~g~~~~~V~~g~~~G~l~ald~~tG~~~W~~~~~~~~~-------~~~~~~vyv~~~~~~~~~~~~~ 356 (488)
T cd00216 285 PNQPSLA-DIKPKDGKPVPAIVHAPKNGFFYVLDRTTGKLISARPEVEQPM-------AYDPGLVYLGAFHIPLGLPPQK 356 (488)
T ss_pred CCCCeEE-eccccCCCeeEEEEEECCCceEEEEECCCCcEeeEeEeecccc-------ccCCceEEEccccccccCcccc
Confidence 1111111 000 1111 124444 48999999999999999987642111 2345778874321
Q ss_pred -----CCceeEEEEEEcCCCceeeeeeeecc-------cCccCceEEEcCcEEEEEECCCCeEEEEEeecceeeeEEEee
Q 003800 198 -----GSSQFHAYQINAMNGELLNHETAAFS-------GGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKIAFQETHL 265 (794)
Q Consensus 198 -----g~~~~~v~ald~~tG~~~w~~~v~~~-------~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l 265 (794)
......++|||+.||+.+|+...... .......+.+.++.+++.+ .+|.|+++|..+|++ +-+..+
T Consensus 357 ~~~~~~~~~G~l~AlD~~tG~~~W~~~~~~~~~~~~~g~~~~~~~~~~~g~~v~~g~-~dG~l~ald~~tG~~-lW~~~~ 434 (488)
T cd00216 357 KKRCKKPGKGGLAALDPKTGKVVWEKREGTIRDSWNIGFPHWGGSLATAGNLVFAGA-ADGYFRAFDATTGKE-LWKFRT 434 (488)
T ss_pred cCCCCCCCceEEEEEeCCCCcEeeEeeCCccccccccCCcccCcceEecCCeEEEEC-CCCeEEEEECCCCce-eeEEEC
Confidence 01234899999999999999976511 1111223334556665665 468999999999984 333444
No 9
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=99.61 E-value=5.6e-14 Score=146.20 Aligned_cols=181 Identities=19% Similarity=0.285 Sum_probs=119.4
Q ss_pred CCCEEEEEECcCCccceEEEcCccc-ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccc
Q 003800 61 EENVIASLDLRHGEIFWRHVLGIND-VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTN 139 (794)
Q Consensus 61 ~~g~l~ALn~~tG~ivWR~~l~~~~-~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~ 139 (794)
++|.|.|+|++||+++|+..++... ..... +...++.++++..++.|++||+.||+++|+..+..+.. . .+..
T Consensus 1 ~~g~l~~~d~~tG~~~W~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~~~~~-~-~~~~--- 74 (238)
T PF13360_consen 1 DDGTLSALDPRTGKELWSYDLGPGIGGPVAT-AVPDGGRVYVASGDGNLYALDAKTGKVLWRFDLPGPIS-G-APVV--- 74 (238)
T ss_dssp -TSEEEEEETTTTEEEEEEECSSSCSSEEET-EEEETTEEEEEETTSEEEEEETTTSEEEEEEECSSCGG-S-GEEE---
T ss_pred CCCEEEEEECCCCCEEEEEECCCCCCCccce-EEEeCCEEEEEcCCCEEEEEECCCCCEEEEeecccccc-c-eeee---
Confidence 4789999999999999999995431 11111 23355566666556799999999999999999965433 1 1222
Q ss_pred cccccCCeEEEEE-CCEEEEEECCCCcEEEEE-eccCccee-eeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceee
Q 003800 140 LKVDKDSLILVSS-KGCLHAVSSIDGEILWTR-DFAAESVE-VQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLN 216 (794)
Q Consensus 140 ~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~-~~~~~~~~-~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w 216 (794)
. ++.+++.. ++.|+++|..||+++|+. ....+... .........++.+|+....| .++++|++||+++|
T Consensus 75 ---~-~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g----~l~~~d~~tG~~~w 146 (238)
T PF13360_consen 75 ---D-GGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTSSG----KLVALDPKTGKLLW 146 (238)
T ss_dssp ---E-TTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEETCS----EEEEEETTTTEEEE
T ss_pred ---c-ccccccccceeeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEeccC----cEEEEecCCCcEEE
Confidence 2 67788875 789999999999999994 54422211 11111123577787655455 89999999999999
Q ss_pred eeeeecccCcc---------CceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800 217 HETAAFSGGFV---------GDVALVSSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 217 ~~~v~~~~~~s---------~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+..+..+.... +.+++.++ .++..+ ..+.+..+|+.+|+
T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~-~~g~~~~~d~~tg~ 194 (238)
T PF13360_consen 147 KYPVGEPRGSSPISSFSDINGSPVISDG-RVYVSS-GDGRVVAVDLATGE 194 (238)
T ss_dssp EEESSTT-SS--EEEETTEEEEEECCTT-EEEEEC-CTSSEEEEETTTTE
T ss_pred EeecCCCCCCcceeeecccccceEEECC-EEEEEc-CCCeEEEEECCCCC
Confidence 99885543221 23333333 333333 34555666999887
No 10
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=99.51 E-value=1e-12 Score=152.81 Aligned_cols=219 Identities=17% Similarity=0.194 Sum_probs=139.4
Q ss_pred ccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceee---e-----eeeeCCEEEE
Q 003800 29 GLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDG---I-----DIALGKYVIT 100 (794)
Q Consensus 29 G~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~---l-----~~~~g~~~V~ 100 (794)
.++.|+.++ |... ....+|...+++||+++..|.|+|||++||+++|++.......+.. . .++..++.|+
T Consensus 47 L~~~W~~~~-g~~~-g~~stPvv~~g~vyv~s~~g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~ 124 (527)
T TIGR03075 47 LQPAWTFSL-GKLR-GQESQPLVVDGVMYVTTSYSRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVF 124 (527)
T ss_pred ceEEEEEEC-CCCC-CcccCCEEECCEEEEECCCCcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEE
Confidence 347799887 4221 1122344558999999999999999999999999998754321110 0 0234456777
Q ss_pred EEccCCeEEEEeCCCCcEeEEEeccCccc---cCCccccccccccccCCeEEEEE-------CCEEEEEECCCCcEEEEE
Q 003800 101 LSSDGSTLRAWNLPDGQMVWESFLRGSKH---SKPLLLVPTNLKVDKDSLILVSS-------KGCLHAVSSIDGEILWTR 170 (794)
Q Consensus 101 Vs~~g~~v~A~d~~tG~llWe~~l~~~~~---s~~~~~~~~~~~~~~~~~V~V~~-------~g~l~ald~~tG~~~W~~ 170 (794)
+++.++.|+|+|+.||+++|+........ ..+.+++ . ++.|++.. +|.|+|+|++||+++|++
T Consensus 125 v~t~dg~l~ALDa~TGk~~W~~~~~~~~~~~~~tssP~v------~-~g~Vivg~~~~~~~~~G~v~AlD~~TG~~lW~~ 197 (527)
T TIGR03075 125 FGTLDARLVALDAKTGKVVWSKKNGDYKAGYTITAAPLV------V-KGKVITGISGGEFGVRGYVTAYDAKTGKLVWRR 197 (527)
T ss_pred EEcCCCEEEEEECCCCCEEeecccccccccccccCCcEE------E-CCEEEEeecccccCCCcEEEEEECCCCceeEec
Confidence 77666799999999999999998743211 0112222 2 56777753 589999999999999998
Q ss_pred eccCcce------------ee------------------eeEEEEecCCEEEEEEec-----CC-------ceeEEEEEE
Q 003800 171 DFAAESV------------EV------------------QQVIQLDESDQIYVVGYA-----GS-------SQFHAYQIN 208 (794)
Q Consensus 171 ~~~~~~~------------~~------------------~~~v~s~~~~~vyv~~~~-----g~-------~~~~v~ald 208 (794)
....+.- .+ ..+..-...+.||+.... +. +.-.++|||
T Consensus 198 ~~~p~~~~~~~~~~~~~~~~~~~~tw~~~~~~~gg~~~W~~~s~D~~~~lvy~~tGnp~p~~~~~r~gdnl~~~s~vAld 277 (527)
T TIGR03075 198 YTVPGDMGYLDKADKPVGGEPGAKTWPGDAWKTGGGATWGTGSYDPETNLIYFGTGNPSPWNSHLRPGDNLYTSSIVARD 277 (527)
T ss_pred cCcCCCcccccccccccccccccCCCCCCccccCCCCccCceeEcCCCCeEEEeCCCCCCCCCCCCCCCCccceeEEEEc
Confidence 6632110 00 001100124577765422 11 123799999
Q ss_pred cCCCceeeeeeeecc--cCc--cCceEEE----cCc---EEEEEECCCCeEEEEEeecce
Q 003800 209 AMNGELLNHETAAFS--GGF--VGDVALV----SSD---TLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 209 ~~tG~~~w~~~v~~~--~~~--s~~~~~v----g~~---~lv~~d~~~g~L~v~~l~sg~ 257 (794)
++||+.+|.++..-. ++. ...++++ ++. .++..+ .+|.++++|-.+|+
T Consensus 278 ~~TG~~~W~~Q~~~~D~wD~d~~~~p~l~d~~~~G~~~~~v~~~~-K~G~~~vlDr~tG~ 336 (527)
T TIGR03075 278 PDTGKIKWHYQTTPHDEWDYDGVNEMILFDLKKDGKPRKLLAHAD-RNGFFYVLDRTNGK 336 (527)
T ss_pred cccCCEEEeeeCCCCCCccccCCCCcEEEEeccCCcEEEEEEEeC-CCceEEEEECCCCc
Confidence 999999999985221 222 2233433 222 444554 57999999999987
No 11
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=99.51 E-value=1.4e-12 Score=145.89 Aligned_cols=216 Identities=21% Similarity=0.249 Sum_probs=148.3
Q ss_pred cccccEeeEEeccCceeeeeeeee--ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCc-ccceeeeeeeeCCEEEEEE
Q 003800 26 DQVGLMDWHQQYIGKVKHAVFHTQ--KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGI-NDVVDGIDIALGKYVITLS 102 (794)
Q Consensus 26 dqvG~~dW~~~~vG~~~~~~f~~~--~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~-~~~i~~l~~~~g~~~V~Vs 102 (794)
...|...|.... +......+..| ...+++||+.+.+|.|.|+|+.+|+++|+..+.. ...+.+. +...++.++++
T Consensus 40 ~~~g~~~W~~~~-~~~~~~~~~~~~~~~~dg~v~~~~~~G~i~A~d~~~g~~~W~~~~~~~~~~~~~~-~~~~~G~i~~g 117 (370)
T COG1520 40 NTSGTLLWSVSL-GSGGGGIYAGPAPADGDGTVYVGTRDGNIFALNPDTGLVKWSYPLLGAVAQLSGP-ILGSDGKIYVG 117 (370)
T ss_pred ccCcceeeeeec-ccCccceEeccccEeeCCeEEEecCCCcEEEEeCCCCcEEecccCcCcceeccCc-eEEeCCeEEEe
Confidence 445888897653 22222233334 5669999999999999999999999999998875 2112222 23446678888
Q ss_pred ccCCeEEEEeCCCCcEeEEEeccC-ccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCc-cee-
Q 003800 103 SDGSTLRAWNLPDGQMVWESFLRG-SKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAE-SVE- 178 (794)
Q Consensus 103 ~~g~~v~A~d~~tG~llWe~~l~~-~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~-~~~- 178 (794)
...+.++++|+.||+++|+..... ... ...+++ .++.|++. .+|.++++|+.||.++|+++.+.+ ...
T Consensus 118 ~~~g~~y~ld~~~G~~~W~~~~~~~~~~-~~~~v~-------~~~~v~~~s~~g~~~al~~~tG~~~W~~~~~~~~~~~~ 189 (370)
T COG1520 118 SWDGKLYALDASTGTLVWSRNVGGSPYY-ASPPVV-------GDGTVYVGTDDGHLYALNADTGTLKWTYETPAPLSLSI 189 (370)
T ss_pred cccceEEEEECCCCcEEEEEecCCCeEE-ecCcEE-------cCcEEEEecCCCeEEEEEccCCcEEEEEecCCcccccc
Confidence 776799999999999999999987 222 122222 26777777 489999999999999999988763 111
Q ss_pred eeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccC---------ccCceEEEcCcEEEEEECCCCeEE
Q 003800 179 VQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGG---------FVGDVALVSSDTLVTLDTTRSILV 249 (794)
Q Consensus 179 ~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~---------~s~~~~~vg~~~lv~~d~~~g~L~ 249 (794)
.... ...++.+|+.... . ...++++|+.+|+..|+.+...+.+ +....+++++++ |.-..++.+.
T Consensus 190 ~~~~--~~~~~~vy~~~~~-~-~~~~~a~~~~~G~~~w~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~--~~~~~~g~~~ 263 (370)
T COG1520 190 YGSP--AIASGTVYVGSDG-Y-DGILYALNAEDGTLKWSQKVSQTIGRTAISTTPAVDGGPVYVDGGV--YAGSYGGKLL 263 (370)
T ss_pred ccCc--eeecceEEEecCC-C-cceEEEEEccCCcEeeeeeeecccCcccccccccccCceEEECCcE--EEEecCCeEE
Confidence 1111 2468888875542 1 2289999999999999975544322 222344455554 2333456788
Q ss_pred EEEeecce
Q 003800 250 TVSFKNRK 257 (794)
Q Consensus 250 v~~l~sg~ 257 (794)
.++..+|+
T Consensus 264 ~l~~~~G~ 271 (370)
T COG1520 264 CLDADTGE 271 (370)
T ss_pred EEEcCCCc
Confidence 88888887
No 12
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=99.51 E-value=1.7e-12 Score=155.46 Aligned_cols=202 Identities=14% Similarity=0.160 Sum_probs=130.1
Q ss_pred eccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccce-------eeee----------------eeeCCEEEEEEccC
Q 003800 49 QKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVV-------DGID----------------IALGKYVITLSSDG 105 (794)
Q Consensus 49 ~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i-------~~l~----------------~~~g~~~V~Vs~~g 105 (794)
|...+++||+.|..|.|+|||++||+++||+........ .++. +...++.|++++.+
T Consensus 190 Plvvgg~lYv~t~~~~V~ALDa~TGk~lW~~d~~~~~~~~~~~~~cRGvay~~~p~~~~~~~~~~~p~~~~~rV~~~T~D 269 (764)
T TIGR03074 190 PLKVGDTLYLCTPHNKVIALDAATGKEKWKFDPKLKTEAGRQHQTCRGVSYYDAPAAAAGPAAPAAPADCARRIILPTSD 269 (764)
T ss_pred CEEECCEEEEECCCCeEEEEECCCCcEEEEEcCCCCcccccccccccceEEecCCcccccccccccccccCCEEEEecCC
Confidence 445589999999999999999999999999988654210 0110 11234577777767
Q ss_pred CeEEEEeCCCCcEeEEEeccCccc--------------cCCccccccccccccCCeEEEEE-----------CCEEEEEE
Q 003800 106 STLRAWNLPDGQMVWESFLRGSKH--------------SKPLLLVPTNLKVDKDSLILVSS-----------KGCLHAVS 160 (794)
Q Consensus 106 ~~v~A~d~~tG~llWe~~l~~~~~--------------s~~~~~~~~~~~~~~~~~V~V~~-----------~g~l~ald 160 (794)
++|+|+|+.||+++|++...+... ..+.+++ . ++.|++.. +|.|+|+|
T Consensus 270 g~LiALDA~TGk~~W~fg~~G~vdl~~~~g~~~~g~~~~ts~P~V------~-~g~VIvG~~v~d~~~~~~~~G~I~A~D 342 (764)
T TIGR03074 270 ARLIALDADTGKLCEDFGNNGTVDLTAGMGTTPPGYYYPTSPPLV------A-GTTVVIGGRVADNYSTDEPSGVIRAFD 342 (764)
T ss_pred CeEEEEECCCCCEEEEecCCCceeeecccCcCCCcccccccCCEE------E-CCEEEEEecccccccccCCCcEEEEEE
Confidence 899999999999999876543210 0111222 2 56777752 58899999
Q ss_pred CCCCcEEEEEeccCccee--------e--------eeEEEEecCCEEEEEEec------C--------CceeEEEEEEcC
Q 003800 161 SIDGEILWTRDFAAESVE--------V--------QQVIQLDESDQIYVVGYA------G--------SSQFHAYQINAM 210 (794)
Q Consensus 161 ~~tG~~~W~~~~~~~~~~--------~--------~~~v~s~~~~~vyv~~~~------g--------~~~~~v~ald~~ 210 (794)
+.||+++|++....+... . .....-...+.+|+-... | .+.-.++|||++
T Consensus 343 a~TGkl~W~~~~g~p~~~~~~~~g~~~~~gg~n~W~~~s~D~~~glvy~ptGn~~pd~~g~~r~~~~n~y~~slvALD~~ 422 (764)
T TIGR03074 343 VNTGALVWAWDPGNPDPTAPPAPGETYTRNTPNSWSVASYDEKLGLVYLPMGNQTPDQWGGDRTPADEKYSSSLVALDAT 422 (764)
T ss_pred CCCCcEeeEEecCCCCcccCCCCCCEeccCCCCccCceEEcCCCCeEEEeCCCccccccCCccccCcccccceEEEEeCC
Confidence 999999999986422110 0 001101223556652210 1 123579999999
Q ss_pred CCceeeeeeeecc----cCccCceEEEc----Cc----EEEEEECCCCeEEEEEeeccee
Q 003800 211 NGELLNHETAAFS----GGFVGDVALVS----SD----TLVTLDTTRSILVTVSFKNRKI 258 (794)
Q Consensus 211 tG~~~w~~~v~~~----~~~s~~~~~vg----~~----~lv~~d~~~g~L~v~~l~sg~~ 258 (794)
||+.+|+++..-. .++...++++. ++ .++..+ .+|.++++|-++|+.
T Consensus 423 TGk~~W~~Q~~~hD~WD~D~~~~p~L~d~~~~~G~~~~~v~~~~-K~G~~~vlDr~tG~~ 481 (764)
T TIGR03074 423 TGKERWVFQTVHHDLWDMDVPAQPSLVDLPDADGTTVPALVAPT-KQGQIYVLDRRTGEP 481 (764)
T ss_pred CCceEEEecccCCccccccccCCceEEeeecCCCcEeeEEEEEC-CCCEEEEEECCCCCE
Confidence 9999999975221 12222344331 22 555555 579999999999883
No 13
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=99.30 E-value=2.4e-10 Score=127.86 Aligned_cols=186 Identities=19% Similarity=0.334 Sum_probs=124.1
Q ss_pred eecccccEeeEEeccCceeeeeeeee-ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEE
Q 003800 24 YEDQVGLMDWHQQYIGKVKHAVFHTQ-KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLS 102 (794)
Q Consensus 24 ~edqvG~~dW~~~~vG~~~~~~f~~~-~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs 102 (794)
+..+.|+..|+..+.+.. ..+..| ...+++||+++.+|.++|||++||+++|++..+....... ++..+++.|++.
T Consensus 83 ~d~~~g~~~W~~~~~~~~--~~~~~~~~~~~G~i~~g~~~g~~y~ld~~~G~~~W~~~~~~~~~~~~-~~v~~~~~v~~~ 159 (370)
T COG1520 83 LNPDTGLVKWSYPLLGAV--AQLSGPILGSDGKIYVGSWDGKLYALDASTGTLVWSRNVGGSPYYAS-PPVVGDGTVYVG 159 (370)
T ss_pred EeCCCCcEEecccCcCcc--eeccCceEEeCCeEEEecccceEEEEECCCCcEEEEEecCCCeEEec-CcEEcCcEEEEe
Confidence 446678888999987611 112222 1237889999999999999999999999999987100112 235677888877
Q ss_pred ccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE---CCEEEEEECCCCcEEEEEeccCccee-
Q 003800 103 SDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS---KGCLHAVSSIDGEILWTRDFAAESVE- 178 (794)
Q Consensus 103 ~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~---~g~l~ald~~tG~~~W~~~~~~~~~~- 178 (794)
+..++++++|+.||+++|+.....+ ... ..... ....++.+++.. ++.++|+|+.+|..+|+.+...+...
T Consensus 160 s~~g~~~al~~~tG~~~W~~~~~~~-~~~--~~~~~--~~~~~~~vy~~~~~~~~~~~a~~~~~G~~~w~~~~~~~~~~~ 234 (370)
T COG1520 160 TDDGHLYALNADTGTLKWTYETPAP-LSL--SIYGS--PAIASGTVYVGSDGYDGILYALNAEDGTLKWSQKVSQTIGRT 234 (370)
T ss_pred cCCCeEEEEEccCCcEEEEEecCCc-ccc--ccccC--ceeecceEEEecCCCcceEEEEEccCCcEeeeeeeecccCcc
Confidence 5557999999999999999888653 201 11110 112256667763 45899999999999999643221110
Q ss_pred -e--eeEE---EEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeee
Q 003800 179 -V--QQVI---QLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAA 221 (794)
Q Consensus 179 -~--~~~v---~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~ 221 (794)
. .+.+ ....++.+|..+..| +++|+|+.+|+.+|+....
T Consensus 235 ~~~~~~~~~~~~v~v~~~~~~~~~~g----~~~~l~~~~G~~~W~~~~~ 279 (370)
T COG1520 235 AISTTPAVDGGPVYVDGGVYAGSYGG----KLLCLDADTGELIWSFPAG 279 (370)
T ss_pred cccccccccCceEEECCcEEEEecCC----eEEEEEcCCCceEEEEecc
Confidence 0 0110 012344555544444 7999999999999999754
No 14
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=99.30 E-value=9.8e-11 Score=140.38 Aligned_cols=191 Identities=14% Similarity=0.213 Sum_probs=117.1
Q ss_pred ecccccEeeEEeccCceee----------eeee------------eeccCCCEEEEEeCCCEEEEEECcCCccceEEEcC
Q 003800 25 EDQVGLMDWHQQYIGKVKH----------AVFH------------TQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLG 82 (794)
Q Consensus 25 edqvG~~dW~~~~vG~~~~----------~~f~------------~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~ 82 (794)
..+.|+..|++..-..... +.+. .|...+++||+.|.++.|+|||++||+++|++..+
T Consensus 210 Da~TGk~lW~~d~~~~~~~~~~~~~cRGvay~~~p~~~~~~~~~~~p~~~~~rV~~~T~Dg~LiALDA~TGk~~W~fg~~ 289 (764)
T TIGR03074 210 DAATGKEKWKFDPKLKTEAGRQHQTCRGVSYYDAPAAAAGPAAPAAPADCARRIILPTSDARLIALDADTGKLCEDFGNN 289 (764)
T ss_pred ECCCCcEEEEEcCCCCcccccccccccceEEecCCcccccccccccccccCCEEEEecCCCeEEEEECCCCCEEEEecCC
Confidence 3568999999986332211 0111 12345779999999999999999999999987543
Q ss_pred ccc--------------ceeeeeeeeCCEEEEEEcc----------CCeEEEEeCCCCcEeEEEeccCccccC-----Cc
Q 003800 83 IND--------------VVDGIDIALGKYVITLSSD----------GSTLRAWNLPDGQMVWESFLRGSKHSK-----PL 133 (794)
Q Consensus 83 ~~~--------------~i~~l~~~~g~~~V~Vs~~----------g~~v~A~d~~tG~llWe~~l~~~~~s~-----~~ 133 (794)
... .+.+. +.+.+++|++++. .+.|+|+|++||+++|++....+.... ..
T Consensus 290 G~vdl~~~~g~~~~g~~~~ts~-P~V~~g~VIvG~~v~d~~~~~~~~G~I~A~Da~TGkl~W~~~~g~p~~~~~~~~g~~ 368 (764)
T TIGR03074 290 GTVDLTAGMGTTPPGYYYPTSP-PLVAGTTVVIGGRVADNYSTDEPSGVIRAFDVNTGALVWAWDPGNPDPTAPPAPGET 368 (764)
T ss_pred CceeeecccCcCCCcccccccC-CEEECCEEEEEecccccccccCCCcEEEEEECCCCcEeeEEecCCCCcccCCCCCCE
Confidence 210 01122 3455667777632 468999999999999999864322100 00
Q ss_pred ccccc-----cccccc-CCeEEE-------------------EECCEEEEEECCCCcEEEEEeccCcce----eeee--E
Q 003800 134 LLVPT-----NLKVDK-DSLILV-------------------SSKGCLHAVSSIDGEILWTRDFAAESV----EVQQ--V 182 (794)
Q Consensus 134 ~~~~~-----~~~~~~-~~~V~V-------------------~~~g~l~ald~~tG~~~W~~~~~~~~~----~~~~--~ 182 (794)
...+. ..+.+. .+.+|+ ...+.|.|||++||+++|.++.....+ .+.+ +
T Consensus 369 ~~~gg~n~W~~~s~D~~~glvy~ptGn~~pd~~g~~r~~~~n~y~~slvALD~~TGk~~W~~Q~~~hD~WD~D~~~~p~L 448 (764)
T TIGR03074 369 YTRNTPNSWSVASYDEKLGLVYLPMGNQTPDQWGGDRTPADEKYSSSLVALDATTGKERWVFQTVHHDLWDMDVPAQPSL 448 (764)
T ss_pred eccCCCCccCceEEcCCCCeEEEeCCCccccccCCccccCcccccceEEEEeCCCCceEEEecccCCccccccccCCceE
Confidence 00000 001111 133433 125789999999999999997732211 1112 2
Q ss_pred EEEec-CC----EEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800 183 IQLDE-SD----QIYVVGYAGSSQFHAYQINAMNGELLNHETA 220 (794)
Q Consensus 183 v~s~~-~~----~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v 220 (794)
++... ++ .||..+-+| .+++||.+||+++|..+.
T Consensus 449 ~d~~~~~G~~~~~v~~~~K~G----~~~vlDr~tG~~l~~~~e 487 (764)
T TIGR03074 449 VDLPDADGTTVPALVAPTKQG----QIYVLDRRTGEPIVPVEE 487 (764)
T ss_pred EeeecCCCcEeeEEEEECCCC----EEEEEECCCCCEEeecee
Confidence 22112 44 456555455 899999999999998753
No 15
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=99.26 E-value=1.8e-10 Score=134.23 Aligned_cols=187 Identities=16% Similarity=0.229 Sum_probs=115.9
Q ss_pred ecccccEeeEEeccCcee-ee-----ee-eeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCccc---ceeeeeeee
Q 003800 25 EDQVGLMDWHQQYIGKVK-HA-----VF-HTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND---VVDGIDIAL 94 (794)
Q Consensus 25 edqvG~~dW~~~~vG~~~-~~-----~f-~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~---~i~~l~~~~ 94 (794)
..+.|+..|++..-.... .. .. ..++..+++||+++.++.|+|||++||+++|++.+.... .+.+. +..
T Consensus 85 Da~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t~dg~l~ALDa~TGk~~W~~~~~~~~~~~~~tss-P~v 163 (527)
T TIGR03075 85 DAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGTLDARLVALDAKTGKVVWSKKNGDYKAGYTITAA-PLV 163 (527)
T ss_pred ECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEcCCCEEEEEECCCCCEEeecccccccccccccCC-cEE
Confidence 457899999998622111 01 00 113445789999999999999999999999999875321 12223 234
Q ss_pred CCEEEEEEcc------CCeEEEEeCCCCcEeEEEeccCcccc---------------------------CCccccccccc
Q 003800 95 GKYVITLSSD------GSTLRAWNLPDGQMVWESFLRGSKHS---------------------------KPLLLVPTNLK 141 (794)
Q Consensus 95 g~~~V~Vs~~------g~~v~A~d~~tG~llWe~~l~~~~~s---------------------------~~~~~~~~~~~ 141 (794)
.++.|+++.. .+.|+|+|++||+++|++....+... ...+... .
T Consensus 164 ~~g~Vivg~~~~~~~~~G~v~AlD~~TG~~lW~~~~~p~~~~~~~~~~~~~~~~~~~~tw~~~~~~~gg~~~W~~~---s 240 (527)
T TIGR03075 164 VKGKVITGISGGEFGVRGYVTAYDAKTGKLVWRRYTVPGDMGYLDKADKPVGGEPGAKTWPGDAWKTGGGATWGTG---S 240 (527)
T ss_pred ECCEEEEeecccccCCCcEEEEEECCCCceeEeccCcCCCcccccccccccccccccCCCCCCccccCCCCccCce---e
Confidence 4556666532 36899999999999999887533200 0011111 2
Q ss_pred ccc-CCeEEEEE------CC-----------EEEEEECCCCcEEEEEeccCcce------eeeeEEEEecCCE---EEEE
Q 003800 142 VDK-DSLILVSS------KG-----------CLHAVSSIDGEILWTRDFAAESV------EVQQVIQLDESDQ---IYVV 194 (794)
Q Consensus 142 ~~~-~~~V~V~~------~g-----------~l~ald~~tG~~~W~~~~~~~~~------~~~~~v~s~~~~~---vyv~ 194 (794)
.|. .+.||+.. ++ .|.|||++||+.+|.++...... ....+++...+++ +++.
T Consensus 241 ~D~~~~lvy~~tGnp~p~~~~~r~gdnl~~~s~vAld~~TG~~~W~~Q~~~~D~wD~d~~~~p~l~d~~~~G~~~~~v~~ 320 (527)
T TIGR03075 241 YDPETNLIYFGTGNPSPWNSHLRPGDNLYTSSIVARDPDTGKIKWHYQTTPHDEWDYDGVNEMILFDLKKDGKPRKLLAH 320 (527)
T ss_pred EcCCCCeEEEeCCCCCCCCCCCCCCCCccceeEEEEccccCCEEEeeeCCCCCCccccCCCCcEEEEeccCCcEEEEEEE
Confidence 222 34566643 12 79999999999999998743321 1112232212333 4432
Q ss_pred EecCCceeEEEEEEcCCCceeeee
Q 003800 195 GYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 195 ~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
+. .+..+++||..||+++|..
T Consensus 321 ~~---K~G~~~vlDr~tG~~i~~~ 341 (527)
T TIGR03075 321 AD---RNGFFYVLDRTNGKLLSAE 341 (527)
T ss_pred eC---CCceEEEEECCCCceeccc
Confidence 22 2238999999999998754
No 16
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=99.02 E-value=2e-08 Score=102.37 Aligned_cols=183 Identities=17% Similarity=0.316 Sum_probs=131.3
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
.-.||.++..+.+.|+|+.+|+..|++.++.. +.+.+...|+. |+++-..+.++-++-+||.+.|....-+..-.+
T Consensus 23 kT~v~igSHs~~~~avd~~sG~~~We~ilg~R--iE~sa~vvgdf-VV~GCy~g~lYfl~~~tGs~~w~f~~~~~vk~~- 98 (354)
T KOG4649|consen 23 KTLVVIGSHSGIVIAVDPQSGNLIWEAILGVR--IECSAIVVGDF-VVLGCYSGGLYFLCVKTGSQIWNFVILETVKVR- 98 (354)
T ss_pred ceEEEEecCCceEEEecCCCCcEEeehhhCce--eeeeeEEECCE-EEEEEccCcEEEEEecchhheeeeeehhhhccc-
Confidence 44699999999999999999999999999876 55544455654 666776778999999999999999887654311
Q ss_pred ccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCC
Q 003800 133 LLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMN 211 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~t 211 (794)
+.+ +.+ .+.++..+ |+++||||..+-.-+|+.+-+.... ...++ ...++.+|+....| .|.+.+.++
T Consensus 99 -a~~----d~~-~glIycgshd~~~yalD~~~~~cVykskcgG~~f-~sP~i-~~g~~sly~a~t~G----~vlavt~~~ 166 (354)
T KOG4649|consen 99 -AQC----DFD-GGLIYCGSHDGNFYALDPKTYGCVYKSKCGGGTF-VSPVI-APGDGSLYAAITAG----AVLAVTKNP 166 (354)
T ss_pred -eEE----cCC-CceEEEecCCCcEEEecccccceEEecccCCcee-cccee-cCCCceEEEEeccc----eEEEEccCC
Confidence 122 112 45667764 9999999999999999987776543 22232 34578899887777 899999999
Q ss_pred C--ceeeeeeeecccCccCceEEEcCc-EEEEEECCCCeEEEEEeecce
Q 003800 212 G--ELLNHETAAFSGGFVGDVALVSSD-TLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 212 G--~~~w~~~v~~~~~~s~~~~~vg~~-~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+ ..+|......| +-+++..++.. ++-|+| |.|...+ ++|+
T Consensus 167 ~~~~~~w~~~~~~P--iF~splcv~~sv~i~~Vd---G~l~~f~-~sG~ 209 (354)
T KOG4649|consen 167 YSSTEFWAATRFGP--IFASPLCVGSSVIITTVD---GVLTSFD-ESGR 209 (354)
T ss_pred CCcceehhhhcCCc--cccCceeccceEEEEEec---cEEEEEc-CCCc
Confidence 9 88898865554 22333444433 233443 5666666 6665
No 17
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=98.90 E-value=4.8e-07 Score=92.50 Aligned_cols=178 Identities=13% Similarity=0.137 Sum_probs=131.7
Q ss_pred ecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc
Q 003800 25 EDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD 104 (794)
Q Consensus 25 edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~ 104 (794)
..|.|+..|++-+-++....+.. -++-|+++-.+|.|+-|+-+||+..|..+..+....... ..-..++++.++.
T Consensus 39 d~~sG~~~We~ilg~RiE~sa~v----vgdfVV~GCy~g~lYfl~~~tGs~~w~f~~~~~vk~~a~-~d~~~glIycgsh 113 (354)
T KOG4649|consen 39 DPQSGNLIWEAILGVRIECSAIV----VGDFVVLGCYSGGLYFLCVKTGSQIWNFVILETVKVRAQ-CDFDGGLIYCGSH 113 (354)
T ss_pred cCCCCcEEeehhhCceeeeeeEE----ECCEEEEEEccCcEEEEEecchhheeeeeehhhhccceE-EcCCCceEEEecC
Confidence 47899999999886666533222 267799999999999999999999999988665211111 2346678998988
Q ss_pred CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCC--cEEEEEeccCcceeeee
Q 003800 105 GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDG--EILWTRDFAAESVEVQQ 181 (794)
Q Consensus 105 g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG--~~~W~~~~~~~~~~~~~ 181 (794)
+++.+|+|..+=.-+|+.+-.+...+ + |.+. .+++.+|+. ..|.|.|++.+++ ...|.+....|-..-.+
T Consensus 114 d~~~yalD~~~~~cVykskcgG~~f~-s-P~i~-----~g~~sly~a~t~G~vlavt~~~~~~~~~w~~~~~~PiF~spl 186 (354)
T KOG4649|consen 114 DGNFYALDPKTYGCVYKSKCGGGTFV-S-PVIA-----PGDGSLYAAITAGAVLAVTKNPYSSTEFWAATRFGPIFASPL 186 (354)
T ss_pred CCcEEEecccccceEEecccCCceec-c-ceec-----CCCceEEEEeccceEEEEccCCCCcceehhhhcCCccccCce
Confidence 88999999999999999888776552 2 2221 125678887 5999999999999 89999988777542223
Q ss_pred EEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecc
Q 003800 182 VIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFS 223 (794)
Q Consensus 182 ~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~ 223 (794)
++ ...+.....+| .+.++| .+|+.+|+.+...|
T Consensus 187 cv----~~sv~i~~VdG----~l~~f~-~sG~qvwr~~t~Gp 219 (354)
T KOG4649|consen 187 CV----GSSVIITTVDG----VLTSFD-ESGRQVWRPATKGP 219 (354)
T ss_pred ec----cceEEEEEecc----EEEEEc-CCCcEEEeecCCCc
Confidence 32 23344445566 899999 79999998865443
No 18
>COG4993 Gcd Glucose dehydrogenase [Carbohydrate transport and metabolism]
Probab=98.69 E-value=5.1e-07 Score=101.80 Aligned_cols=205 Identities=18% Similarity=0.214 Sum_probs=124.7
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCccccee-------eee--eeeCC------EEEEEEccCCeEEEEeCCCC
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVD-------GID--IALGK------YVITLSSDGSTLRAWNLPDG 116 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~-------~l~--~~~g~------~~V~Vs~~g~~v~A~d~~tG 116 (794)
.++.+|+.|.-|.+.|||++||++.||.+-..+.++. ++. ..... ..|++...+.+|.|+|++||
T Consensus 213 vgdtlYvcTphn~v~ALDa~TGkekWkydp~~~~nv~~~~~tCrgVsy~~a~a~~k~pc~~rIflpt~DarlIALdA~tG 292 (773)
T COG4993 213 VGDTLYVCTPHNRVFALDAATGKEKWKYDPNLKSNVDPQHQTCRGVSYGAAKADAKSPCPRRIFLPTADARLIALDADTG 292 (773)
T ss_pred ECCEEEEecCcceeEEeeccCCceeeecCCCCCCCcccccccccceecccccccccCCCceeEEeecCCceEEEEeCCCC
Confidence 3778999999999999999999999999876553222 110 01112 34777766789999999999
Q ss_pred cEeEEEeccCccc-------cCCccccccccccccCCeEEEE-E----------CCEEEEEECCCCcEEEEEeccCccee
Q 003800 117 QMVWESFLRGSKH-------SKPLLLVPTNLKVDKDSLILVS-S----------KGCLHAVSSIDGEILWTRDFAAESVE 178 (794)
Q Consensus 117 ~llWe~~l~~~~~-------s~~~~~~~~~~~~~~~~~V~V~-~----------~g~l~ald~~tG~~~W~~~~~~~~~~ 178 (794)
+..|.+.-.+... ..+-...+.+...-....+++. + .|.+.++|..+|+..|.++...+...
T Consensus 293 kvc~~Fa~~Ga~~l~tgm~~~k~g~y~~tS~p~~~~~~~v~~g~v~Dn~st~e~sgVir~fdv~tG~l~w~~D~gnpD~t 372 (773)
T COG4993 293 KVCWSFANKGALNLETGMKDTKDGLYYGTSPPEFGVKGIVIAGSVADNESTWEPSGVIRGFDVLTGKLTWAGDPGNPDPT 372 (773)
T ss_pred cEeheeccCceeeeeccCCCCCCCeEeecCCCcccceeEEEeeccCCCceeeccCccccccccccCceEEccCCCCCCCC
Confidence 9999976443210 0111111111011112333332 1 57888999999999999987655421
Q ss_pred ----eeeE----------EEE--ecCCEEEEEEec------C--------CceeEEEEEEcCCCceeeeeeeecc--cCc
Q 003800 179 ----VQQV----------IQL--DESDQIYVVGYA------G--------SSQFHAYQINAMNGELLNHETAAFS--GGF 226 (794)
Q Consensus 179 ----~~~~----------v~s--~~~~~vyv~~~~------g--------~~~~~v~ald~~tG~~~w~~~v~~~--~~~ 226 (794)
+.+- ..+ ..-+.||+-.-. | .++-.++|+|+.||+..|-++..-. ++.
T Consensus 373 ~p~~~g~tyt~nspn~W~~~SyD~~lnlVy~p~Gn~~pd~wg~trtp~dekysssivAlD~~TG~~kW~yQtvhhDlWDm 452 (773)
T COG4993 373 APTAPGQTYTRNSPNSWASASYDAKLNLVYVPMGNQTPDTWGGTRTPGDEKYSSSIVALDATTGKLKWVYQTVHHDLWDM 452 (773)
T ss_pred CCCCCCceeecCCCCcccccccCCCCCeEEEeCCCCChhhccCCCCcccccccceeEEecCCCcceeeeeeccCcchhcc
Confidence 1010 001 234567763221 1 1245789999999999998864221 222
Q ss_pred cC--ceEEE----cC---cEEEEEECCCCeEEEEEeecce
Q 003800 227 VG--DVALV----SS---DTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 227 s~--~~~~v----g~---~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+. .+.+. .+ ..++..+ .+|.++++|-.+|+
T Consensus 453 Dvp~qp~L~D~~~DG~~vpalv~pt-k~G~~YVlDRrtGe 491 (773)
T COG4993 453 DVPAQPTLLDITKDGKVVPALVHPT-KNGFIYVLDRRTGE 491 (773)
T ss_pred cCCCCceEEEeecCCcEeeeeeccc-ccCcEEEEEcCCCc
Confidence 22 22222 11 1455555 46899999999988
No 19
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=97.80 E-value=0.071 Score=56.41 Aligned_cols=183 Identities=13% Similarity=0.148 Sum_probs=102.7
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-EccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL 134 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~ 134 (794)
++..+.+|.|..+|.++|+.+.+...... ..++....++..+++ ++.++.++.||..+|+.+.+........ ...
T Consensus 4 ~~s~~~d~~v~~~d~~t~~~~~~~~~~~~--~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~~~--~~~ 79 (300)
T TIGR03866 4 YVSNEKDNTISVIDTATLEVTRTFPVGQR--PRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPDPE--LFA 79 (300)
T ss_pred EEEecCCCEEEEEECCCCceEEEEECCCC--CCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCCCcc--EEE
Confidence 44556789999999999998777654332 223322333444544 4456799999999999876654332211 111
Q ss_pred ccccccccccCCeEEEEE--CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800 135 LVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNG 212 (794)
Q Consensus 135 ~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG 212 (794)
+ ..+ ++.+++.. ++.+..+|..+++.+...+.... +..+.. ..++..++++..++ ..+..+|..+|
T Consensus 80 ~-----~~~-g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~~~~---~~~~~~-~~dg~~l~~~~~~~--~~~~~~d~~~~ 147 (300)
T TIGR03866 80 L-----HPN-GKILYIANEDDNLVTVIDIETRKVLAEIPVGVE---PEGMAV-SPDGKIVVNTSETT--NMAHFIDTKTY 147 (300)
T ss_pred E-----CCC-CCEEEEEcCCCCeEEEEECCCCeEEeEeeCCCC---cceEEE-CCCCCEEEEEecCC--CeEEEEeCCCC
Confidence 1 112 34465552 78999999999888777654321 122221 23444444443322 13555788888
Q ss_pred ceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 213 ELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 213 ~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+........ ... ..+.+- .+..++......+.+.+.|+++++
T Consensus 148 ~~~~~~~~~--~~~-~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~ 190 (300)
T TIGR03866 148 EIVDNVLVD--QRP-RFAEFTADGKELWVSSEIGGTVSVIDVATRK 190 (300)
T ss_pred eEEEEEEcC--CCc-cEEEECCCCCEEEEEcCCCCEEEEEEcCcce
Confidence 776543211 111 112222 223333333345789999999876
No 20
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.71 E-value=0.067 Score=60.06 Aligned_cols=187 Identities=15% Similarity=0.148 Sum_probs=101.8
Q ss_pred cceeecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEE
Q 003800 21 LSLYEDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVIT 100 (794)
Q Consensus 21 ~Al~edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~ 100 (794)
.++.+.+..++.-+.+..|.+ +... ..+.+++.+|+++.+|.|.-+|+.+++++-+...... ..++....++..++
T Consensus 18 v~viD~~t~~~~~~i~~~~~~-h~~~-~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~--~~~i~~s~DG~~~~ 93 (369)
T PF02239_consen 18 VAVIDGATNKVVARIPTGGAP-HAGL-KFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGN--PRGIAVSPDGKYVY 93 (369)
T ss_dssp EEEEETTT-SEEEEEE-STTE-EEEE-E-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSE--EEEEEE--TTTEEE
T ss_pred EEEEECCCCeEEEEEcCCCCc-eeEE-EecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCC--cceEEEcCCCCEEE
Confidence 466777777777777765544 2111 1123456799999999999999999999999887654 33443344555666
Q ss_pred EEc-cCCeEEEEeCCCCcEeEEEeccCccc----cCCccccccccccccCCeEEEE--E-CCEEEEEECCCCcEEEEEec
Q 003800 101 LSS-DGSTLRAWNLPDGQMVWESFLRGSKH----SKPLLLVPTNLKVDKDSLILVS--S-KGCLHAVSSIDGEILWTRDF 172 (794)
Q Consensus 101 Vs~-~g~~v~A~d~~tG~llWe~~l~~~~~----s~~~~~~~~~~~~~~~~~V~V~--~-~g~l~ald~~tG~~~W~~~~ 172 (794)
++. ..+.+..+|++|.+++=+....+... +....++. .. .+.-++. . .+++.-+|-.+.+.+.....
T Consensus 94 v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~----s~-~~~~fVv~lkd~~~I~vVdy~d~~~~~~~~i 168 (369)
T PF02239_consen 94 VANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVA----SP-GRPEFVVNLKDTGEIWVVDYSDPKNLKVTTI 168 (369)
T ss_dssp EEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-----S-SSSEEEEEETTTTEEEEEETTTSSCEEEEEE
T ss_pred EEecCCCceeEeccccccceeecccccccccccCCCceeEEe----cC-CCCEEEEEEccCCeEEEEEeccccccceeee
Confidence 665 46799999999999998877653221 00001111 01 2222222 2 46666666655554443322
Q ss_pred cCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800 173 AAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETA 220 (794)
Q Consensus 173 ~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v 220 (794)
..... +.-.. ...+++.|+++..++. ++..+|.++++.++....
T Consensus 169 ~~g~~-~~D~~-~dpdgry~~va~~~sn--~i~viD~~~~k~v~~i~~ 212 (369)
T PF02239_consen 169 KVGRF-PHDGG-FDPDGRYFLVAANGSN--KIAVIDTKTGKLVALIDT 212 (369)
T ss_dssp E--TT-EEEEE-E-TTSSEEEEEEGGGT--EEEEEETTTTEEEEEEE-
T ss_pred ccccc-ccccc-cCcccceeeecccccc--eeEEEeeccceEEEEeec
Confidence 22111 11111 1234555555544322 888999999999876543
No 21
>COG4993 Gcd Glucose dehydrogenase [Carbohydrate transport and metabolism]
Probab=97.67 E-value=0.00079 Score=76.75 Aligned_cols=165 Identities=15% Similarity=0.282 Sum_probs=95.5
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCccc--------c----eeee-eeeeCCEEEEEEc----------cCCeE
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND--------V----VDGI-DIALGKYVITLSS----------DGSTL 108 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~--------~----i~~l-~~~~g~~~V~Vs~----------~g~~v 108 (794)
...|||.-|.+..|.|||++||++.|.+.-.... . ..+. ++..+...+++++ ..+.+
T Consensus 271 c~~rIflpt~DarlIALdA~tGkvc~~Fa~~Ga~~l~tgm~~~k~g~y~~tS~p~~~~~~~v~~g~v~Dn~st~e~sgVi 350 (773)
T COG4993 271 CPRRIFLPTADARLIALDADTGKVCWSFANKGALNLETGMKDTKDGLYYGTSPPEFGVKGIVIAGSVADNESTWEPSGVI 350 (773)
T ss_pred CceeEEeecCCceEEEEeCCCCcEeheeccCceeeeeccCCCCCCCeEeecCCCcccceeEEEeeccCCCceeeccCccc
Confidence 3567999999999999999999999995432210 0 0000 0112222222222 13578
Q ss_pred EEEeCCCCcEeEEEeccCcccc------------CCcccccccccccc-CCeEEEE-E------------------CCEE
Q 003800 109 RAWNLPDGQMVWESFLRGSKHS------------KPLLLVPTNLKVDK-DSLILVS-S------------------KGCL 156 (794)
Q Consensus 109 ~A~d~~tG~llWe~~l~~~~~s------------~~~~~~~~~~~~~~-~~~V~V~-~------------------~g~l 156 (794)
|++|..+|+++|...-..+..- .+.....+ ..|. -+.||+- . ...+
T Consensus 351 r~fdv~tG~l~w~~D~gnpD~t~p~~~g~tyt~nspn~W~~~--SyD~~lnlVy~p~Gn~~pd~wg~trtp~dekysssi 428 (773)
T COG4993 351 RGFDVLTGKLTWAGDPGNPDPTAPTAPGQTYTRNSPNSWASA--SYDAKLNLVYVPMGNQTPDTWGGTRTPGDEKYSSSI 428 (773)
T ss_pred cccccccCceEEccCCCCCCCCCCCCCCceeecCCCCccccc--ccCCCCCeEEEeCCCCChhhccCCCCccccccccee
Confidence 9999999999999876543210 00000000 1111 2456652 1 3479
Q ss_pred EEEECCCCcEEEEEeccCcce----eeee--EEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800 157 HAVSSIDGEILWTRDFAAESV----EVQQ--VIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 157 ~ald~~tG~~~W~~~~~~~~~----~~~~--~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
.|+|+.||+.+|.++..-..+ .+.| +.+...++++.=+-.....+..++.+|..||+++-..
T Consensus 429 vAlD~~TG~~kW~yQtvhhDlWDmDvp~qp~L~D~~~DG~~vpalv~ptk~G~~YVlDRrtGe~lv~~ 496 (773)
T COG4993 429 VALDATTGKLKWVYQTVHHDLWDMDVPAQPTLLDITKDGKVVPALVHPTKNGFIYVLDRRTGELLVPI 496 (773)
T ss_pred EEecCCCcceeeeeeccCcchhcccCCCCceEEEeecCCcEeeeeecccccCcEEEEEcCCCcccccc
Confidence 999999999999987754322 1233 2223345544322222222347999999999987544
No 22
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=97.59 E-value=0.0001 Score=54.31 Aligned_cols=31 Identities=29% Similarity=0.595 Sum_probs=28.7
Q ss_pred CEEEEEeCCCEEEEEECcCCccceEEEcCcc
Q 003800 54 KRVVVSTEENVIASLDLRHGEIFWRHVLGIN 84 (794)
Q Consensus 54 ~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~ 84 (794)
++||+++.+|.|+|||++||+++|+++.+..
T Consensus 1 ~~v~~~~~~g~l~AlD~~TG~~~W~~~~~~~ 31 (38)
T PF01011_consen 1 GRVYVGTPDGYLYALDAKTGKVLWKFQTGPP 31 (38)
T ss_dssp TEEEEETTTSEEEEEETTTTSEEEEEESSSG
T ss_pred CEEEEeCCCCEEEEEECCCCCEEEeeeCCCC
Confidence 5799999999999999999999999998765
No 23
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=97.56 E-value=0.16 Score=53.71 Aligned_cols=189 Identities=14% Similarity=0.156 Sum_probs=102.8
Q ss_pred CCCEEEEE-eCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEE-ccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800 52 GRKRVVVS-TEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLS-SDGSTLRAWNLPDGQMVWESFLRGSKH 129 (794)
Q Consensus 52 ~~~~Vyv~-t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~tG~llWe~~l~~~~~ 129 (794)
+++.+|++ +.++.|..+|.++|+...+...... ...+....++..++++ ..++.++.||..+++.+.+........
T Consensus 41 dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~--~~~~~~~~~g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~~~~~~ 118 (300)
T TIGR03866 41 DGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPD--PELFALHPNGKILYIANEDDNLVTVIDIETRKVLAEIPVGVEPE 118 (300)
T ss_pred CCCEEEEEECCCCeEEEEECCCCcEEEeccCCCC--ccEEEECCCCCEEEEEcCCCCeEEEEECCCCeEEeEeeCCCCcc
Confidence 34557654 5678999999999987654433222 2222122334455555 345799999999998887766432211
Q ss_pred cCCccccccccccccCCeEEE-EE-C-CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800 130 SKPLLLVPTNLKVDKDSLILV-SS-K-GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQ 206 (794)
Q Consensus 130 s~~~~~~~~~~~~~~~~~V~V-~~-~-g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~a 206 (794)
...+ ..++..++ .. + ..++.+|..+|+.......... +..+..+..+..+++.+..++ .+..
T Consensus 119 --~~~~-------~~dg~~l~~~~~~~~~~~~~d~~~~~~~~~~~~~~~---~~~~~~s~dg~~l~~~~~~~~---~v~i 183 (300)
T TIGR03866 119 --GMAV-------SPDGKIVVNTSETTNMAHFIDTKTYEIVDNVLVDQR---PRFAEFTADGKELWVSSEIGG---TVSV 183 (300)
T ss_pred --eEEE-------CCCCCEEEEEecCCCeEEEEeCCCCeEEEEEEcCCC---ccEEEECCCCCEEEEEcCCCC---EEEE
Confidence 1111 11344444 33 2 3567778888877655433221 222222234445655443233 7888
Q ss_pred EEcCCCceeeeeeeeccc----CccC-ceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 207 INAMNGELLNHETAAFSG----GFVG-DVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 207 ld~~tG~~~w~~~v~~~~----~~s~-~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+|..+|+.+.+.....+. .... .+.+- .+..+++.....+.+++.|+++++
T Consensus 184 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~ 240 (300)
T TIGR03866 184 IDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANRVAVVDAKTYE 240 (300)
T ss_pred EEcCcceeeeeeeecccccccccCCccceEECCCCCEEEEEcCCCCeEEEEECCCCc
Confidence 999999876554322211 1111 12221 233433433345678888988776
No 24
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.44 E-value=0.11 Score=53.13 Aligned_cols=186 Identities=18% Similarity=0.182 Sum_probs=110.5
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
++.+++++.+|.+...|..+++...+...... .+..+.....+..+++++.++.++.||..+++...+........ ..
T Consensus 21 ~~~l~~~~~~g~i~i~~~~~~~~~~~~~~~~~-~i~~~~~~~~~~~l~~~~~~~~i~i~~~~~~~~~~~~~~~~~~i-~~ 98 (289)
T cd00200 21 GKLLATGSGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYV-SS 98 (289)
T ss_pred CCEEEEeecCcEEEEEEeeCCCcEEEEecCCc-ceeEEEECCCCCEEEEEcCCCeEEEEEcCcccceEEEeccCCcE-EE
Confidence 56788888899999999999987777654332 23222222233355556656799999999998888877544222 11
Q ss_pred ccccccccccccCCeEEE-EE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEe-cCCceeEEEEEEc
Q 003800 133 LLLVPTNLKVDKDSLILV-SS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGY-AGSSQFHAYQINA 209 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V-~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~-~g~~~~~v~ald~ 209 (794)
... ..++.+++ .. +|.+..+|..+++........... ...+.. ...+.+++.+. +| .+..+|.
T Consensus 99 ~~~-------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--i~~~~~-~~~~~~l~~~~~~~----~i~i~d~ 164 (289)
T cd00200 99 VAF-------SPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDW--VNSVAF-SPDGTFVASSSQDG----TIKLWDL 164 (289)
T ss_pred EEE-------cCCCCEEEEecCCCeEEEEECCCcEEEEEeccCCCc--EEEEEE-cCcCCEEEEEcCCC----cEEEEEc
Confidence 111 11334444 44 899999999999888877633222 122221 12244444444 44 6888899
Q ss_pred CCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 210 MNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 210 ~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
.+++.+...... ...+.. +.+. .++.+++... .+.+.+.++.+++
T Consensus 165 ~~~~~~~~~~~~-~~~i~~-~~~~~~~~~l~~~~~-~~~i~i~d~~~~~ 210 (289)
T cd00200 165 RTGKCVATLTGH-TGEVNS-VAFSPDGEKLLSSSS-DGTIKLWDLSTGK 210 (289)
T ss_pred cccccceeEecC-ccccce-EEECCCcCEEEEecC-CCcEEEEECCCCc
Confidence 888887665411 111211 2222 2224444432 6888888888765
No 25
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.43 E-value=0.069 Score=54.64 Aligned_cols=187 Identities=14% Similarity=0.156 Sum_probs=111.7
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
++.+++++.+|.+...|..+++...+...... .+..+.....+.+++.++.++.++.||..+++............ ..
T Consensus 63 ~~~l~~~~~~~~i~i~~~~~~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i-~~ 140 (289)
T cd00200 63 GTYLASGSSDKTIRLWDLETGECVRTLTGHTS-YVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWV-NS 140 (289)
T ss_pred CCEEEEEcCCCeEEEEEcCcccceEEEeccCC-cEEEEEEcCCCCEEEEecCCCeEEEEECCCcEEEEEeccCCCcE-EE
Confidence 45789999999999999999988777654332 23333222233455555546799999999999988877433222 11
Q ss_pred ccccccccccccCCeEEE-EE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC
Q 003800 133 LLLVPTNLKVDKDSLILV-SS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM 210 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V-~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~ 210 (794)
+. ...++.+++ .. ++.+..+|..+++....+....... ..+.....+..+++.+.+| .+..+|..
T Consensus 141 ~~-------~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i--~~~~~~~~~~~l~~~~~~~----~i~i~d~~ 207 (289)
T cd00200 141 VA-------FSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEV--NSVAFSPDGEKLLSSSSDG----TIKLWDLS 207 (289)
T ss_pred EE-------EcCcCCEEEEEcCCCcEEEEEccccccceeEecCcccc--ceEEECCCcCEEEEecCCC----cEEEEECC
Confidence 11 121233444 45 8999999999998877776433221 2222112233566554444 68888998
Q ss_pred CCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 211 NGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 211 tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+|+.+.+... .+..+.. +.+- .+.++++.+ .++.+++.++.+++
T Consensus 208 ~~~~~~~~~~-~~~~i~~-~~~~~~~~~~~~~~-~~~~i~i~~~~~~~ 252 (289)
T cd00200 208 TGKCLGTLRG-HENGVNS-VAFSPDGYLLASGS-EDGTIRVWDLRTGE 252 (289)
T ss_pred CCceecchhh-cCCceEE-EEEcCCCcEEEEEc-CCCcEEEEEcCCce
Confidence 8887765521 1111211 1111 223444443 46889888888765
No 26
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes (EC 2.3.2.5) catalyses the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=97.41 E-value=0.024 Score=59.93 Aligned_cols=155 Identities=15% Similarity=0.102 Sum_probs=99.0
Q ss_pred CCCEEEEEeCC---CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 52 GRKRVVVSTEE---NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 52 ~~~~Vyv~t~~---g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
.++.+|-+|.. ..|..+|++||++..++.++...-..|+ ...++.+.-++=..+....||+++-+++=+.+..++.
T Consensus 54 ~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGi-t~~~d~l~qLTWk~~~~f~yd~~tl~~~~~~~y~~EG 132 (264)
T PF05096_consen 54 DDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGI-TILGDKLYQLTWKEGTGFVYDPNTLKKIGTFPYPGEG 132 (264)
T ss_dssp ETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEE-EEETTEEEEEESSSSEEEEEETTTTEEEEEEE-SSS-
T ss_pred CCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeE-EEECCEEEEEEecCCeEEEEccccceEEEEEecCCcc
Confidence 37899999973 4899999999999999999876322244 2356667776765679999999999999888877665
Q ss_pred ccCCccccccccccccCCeEEEEE--CCEEEEEECCCCcEEEEEeccCcceeeeeEE-EEecCCEEEEEEecCCceeEEE
Q 003800 129 HSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWTRDFAAESVEVQQVI-QLDESDQIYVVGYAGSSQFHAY 205 (794)
Q Consensus 129 ~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v-~s~~~~~vyv~~~~g~~~~~v~ 205 (794)
. + + .- ++.-++.+ ..+|+-+|+++-+..=+.+.........++- ....+|.+|+=-.... .++
T Consensus 133 W--G--L-----t~--dg~~Li~SDGS~~L~~~dP~~f~~~~~i~V~~~g~pv~~LNELE~i~G~IyANVW~td---~I~ 198 (264)
T PF05096_consen 133 W--G--L-----TS--DGKRLIMSDGSSRLYFLDPETFKEVRTIQVTDNGRPVSNLNELEYINGKIYANVWQTD---RIV 198 (264)
T ss_dssp ---E--E-----EE--CSSCEEEE-SSSEEEEE-TTT-SEEEEEE-EETTEE---EEEEEEETTEEEEEETTSS---EEE
T ss_pred e--E--E-----Ec--CCCEEEEECCccceEEECCcccceEEEEEEEECCEECCCcEeEEEEcCEEEEEeCCCC---eEE
Confidence 4 1 1 11 34444444 5689999999887665554432221111110 0124889997444433 789
Q ss_pred EEEcCCCceeeeeeee
Q 003800 206 QINAMNGELLNHETAA 221 (794)
Q Consensus 206 ald~~tG~~~w~~~v~ 221 (794)
.+|++||++.-...++
T Consensus 199 ~Idp~tG~V~~~iDls 214 (264)
T PF05096_consen 199 RIDPETGKVVGWIDLS 214 (264)
T ss_dssp EEETTT-BEEEEEE-H
T ss_pred EEeCCCCeEEEEEEhh
Confidence 9999999999777653
No 27
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=97.39 E-value=0.35 Score=53.73 Aligned_cols=191 Identities=12% Similarity=0.072 Sum_probs=115.9
Q ss_pred cCCCEEEEEeCC-----CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-Ec---------cCCeEEEEeCCC
Q 003800 51 TGRKRVVVSTEE-----NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SS---------DGSTLRAWNLPD 115 (794)
Q Consensus 51 ~~~~~Vyv~t~~-----g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~---------~g~~v~A~d~~t 115 (794)
.+..++||.... |.|..+|.++++++=......... +. ++.++..+|+ .+ ....|..||++|
T Consensus 10 ~~~~~v~V~d~~~~~~~~~v~ViD~~~~~v~g~i~~G~~P~--~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t 86 (352)
T TIGR02658 10 SDARRVYVLDPGHFAATTQVYTIDGEAGRVLGMTDGGFLPN--PV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQT 86 (352)
T ss_pred CCCCEEEEECCcccccCceEEEEECCCCEEEEEEEccCCCc--ee-ECCCCCEEEEEeccccccccCCCCCEEEEEECcc
Confidence 346789999886 899999999998875555543321 22 3445555655 44 457999999999
Q ss_pred CcEeEEEeccCc-cc--cCCccccccccccccCCeEEEE--E-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCC
Q 003800 116 GQMVWESFLRGS-KH--SKPLLLVPTNLKVDKDSLILVS--S-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESD 189 (794)
Q Consensus 116 G~llWe~~l~~~-~~--s~~~~~~~~~~~~~~~~~V~V~--~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~ 189 (794)
++++.+..+... .. ........ ...+ ++.+||. + +..|..+|..+++++=+.+.+.... +. ...++
T Consensus 87 ~~~~~~i~~p~~p~~~~~~~~~~~~--ls~d-gk~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp~~~~----vy-~t~e~ 158 (352)
T TIGR02658 87 HLPIADIELPEGPRFLVGTYPWMTS--LTPD-NKTLLFYQFSPSPAVGVVDLEGKAFVRMMDVPDCYH----IF-PTAND 158 (352)
T ss_pred CcEEeEEccCCCchhhccCccceEE--ECCC-CCEEEEecCCCCCEEEEEECCCCcEEEEEeCCCCcE----EE-EecCC
Confidence 999999998643 10 00011111 0222 4567775 3 7899999999999999988876433 22 24455
Q ss_pred EEEEEEecCCceeEEEEEEcCCCceeeeeeeec--c--cCc-cCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800 190 QIYVVGYAGSSQFHAYQINAMNGELLNHETAAF--S--GGF-VGDVALVSSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 190 ~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~--~--~~~-s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
.-++.+.+|. ...+.+| .+|+.. ..+... + -.+ ..+.+...++..+|.+.. |.++++|+....
T Consensus 159 ~~~~~~~Dg~--~~~v~~d-~~g~~~-~~~~~vf~~~~~~v~~rP~~~~~dg~~~~vs~e-G~V~~id~~~~~ 226 (352)
T TIGR02658 159 TFFMHCRDGS--LAKVGYG-TKGNPK-IKPTEVFHPEDEYLINHPAYSNKSGRLVWPTYT-GKIFQIDLSSGD 226 (352)
T ss_pred ccEEEeecCc--eEEEEec-CCCceE-EeeeeeecCCccccccCCceEcCCCcEEEEecC-CeEEEEecCCCc
Confidence 5556677764 2334455 356633 222111 1 011 112122224556667654 999999986644
No 28
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.34 E-value=0.23 Score=55.79 Aligned_cols=190 Identities=12% Similarity=0.107 Sum_probs=105.8
Q ss_pred CEEEEEe-CCCEEEEEECcCCccceEEEcCcccceee-eeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 54 KRVVVST-EENVIASLDLRHGEIFWRHVLGINDVVDG-IDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 54 ~~Vyv~t-~~g~l~ALn~~tG~ivWR~~l~~~~~i~~-l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
+..||.. +.|.|+.+|.+|.+++-+...... ..+ +....++..+++++.++.|.-+|+.+++++-+.+......
T Consensus 6 ~l~~V~~~~~~~v~viD~~t~~~~~~i~~~~~--~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~~~-- 81 (369)
T PF02239_consen 6 NLFYVVERGSGSVAVIDGATNKVVARIPTGGA--PHAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGNPR-- 81 (369)
T ss_dssp GEEEEEEGGGTEEEEEETTT-SEEEEEE-STT--EEEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSEEE--
T ss_pred cEEEEEecCCCEEEEEECCCCeEEEEEcCCCC--ceeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCCcc--
Confidence 4455544 579999999999999999877543 221 1112334456666556799999999999999998875432
Q ss_pred CccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCc-----ceeeeeEEEEecCCEEEEEEecCCceeEE
Q 003800 132 PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAE-----SVEVQQVIQLDESDQIYVVGYAGSSQFHA 204 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~-----~~~~~~~v~s~~~~~vyv~~~~g~~~~~v 204 (794)
...+ ..+ ++.+++. ..+.+..+|.+|.+++=+.+.... ......++ .......|+++.... .++
T Consensus 82 ~i~~-----s~D-G~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv-~s~~~~~fVv~lkd~--~~I 152 (369)
T PF02239_consen 82 GIAV-----SPD-GKYVYVANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIV-ASPGRPEFVVNLKDT--GEI 152 (369)
T ss_dssp EEEE-------T-TTEEEEEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEE-E-SSSSEEEEEETTT--TEE
T ss_pred eEEE-----cCC-CCEEEEEecCCCceeEeccccccceeecccccccccccCCCceeEE-ecCCCCEEEEEEccC--CeE
Confidence 1111 122 4567775 389999999999998877654321 11122233 234555576666531 178
Q ss_pred EEEEcCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 205 YQINAMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 205 ~ald~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
..+|..+.+.+....+.....+.+ ..+- .+.+++......+.+.++|.++++
T Consensus 153 ~vVdy~d~~~~~~~~i~~g~~~~D-~~~dpdgry~~va~~~sn~i~viD~~~~k 205 (369)
T PF02239_consen 153 WVVDYSDPKNLKVTTIKVGRFPHD-GGFDPDGRYFLVAANGSNKIAVIDTKTGK 205 (369)
T ss_dssp EEEETTTSSCEEEEEEE--TTEEE-EEE-TTSSEEEEEEGGGTEEEEEETTTTE
T ss_pred EEEEeccccccceeeecccccccc-cccCcccceeeecccccceeEEEeeccce
Confidence 888988877766555544332222 1111 122332222233456666666654
No 29
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL, EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE (InterPro IPR022528) and the Pseudomonas aeruginosa DevB (InterPro IPR005900) types. This entry contains bacterial YbhE-type 6-phosphogluconolactonases (EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=96.91 E-value=1 Score=50.08 Aligned_cols=222 Identities=14% Similarity=0.207 Sum_probs=115.7
Q ss_pred ecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCC----CEEEEE--ECcCCccceEEEcCccccee-eeeeeeCCE
Q 003800 25 EDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEE----NVIASL--DLRHGEIFWRHVLGINDVVD-GIDIALGKY 97 (794)
Q Consensus 25 edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~----g~l~AL--n~~tG~ivWR~~l~~~~~i~-~l~~~~g~~ 97 (794)
.++.|++...+.. .....+.+.....+++.+|++++. |.|.++ +.++|+..-.......+... .+.+...+.
T Consensus 21 d~~~g~l~~~~~~-~~~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i~~~~~g~ 99 (345)
T PF10282_consen 21 DEETGTLTLVQTV-AEGENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNSVPSGGSSPCHIAVDPDGR 99 (345)
T ss_dssp ETTTTEEEEEEEE-EESSSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEEEEESSSCEEEEEECTTSS
T ss_pred cCCCCCceEeeee-cCCCCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeeeeccCCCCcEEEEEecCCC
Confidence 3455665544432 111123333334568889999984 566555 55557776665554322221 221223566
Q ss_pred EEEEEc-cCCeEEEEeCCC-CcEeEEEecc-----Cccc-----cCCccccccccccccCCeEEEEE--CCEEEEEECCC
Q 003800 98 VITLSS-DGSTLRAWNLPD-GQMVWESFLR-----GSKH-----SKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSID 163 (794)
Q Consensus 98 ~V~Vs~-~g~~v~A~d~~t-G~llWe~~l~-----~~~~-----s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~t 163 (794)
.++++. .++.+..++..+ |++.-..... ++.. +.+-.+.. ..+ ++.++|.. ..+|+.++...
T Consensus 100 ~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~---~pd-g~~v~v~dlG~D~v~~~~~~~ 175 (345)
T PF10282_consen 100 FLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVF---SPD-GRFVYVPDLGADRVYVYDIDD 175 (345)
T ss_dssp EEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE----TT-SSEEEEEETTTTEEEEEEE-T
T ss_pred EEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEE---CCC-CCEEEEEecCCCEEEEEEEeC
Confidence 777665 367898888875 8777664321 1110 00001110 112 34566643 55666666554
Q ss_pred Cc--EEEEE--eccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeec-ccCccC-----ceEEE
Q 003800 164 GE--ILWTR--DFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAF-SGGFVG-----DVALV 233 (794)
Q Consensus 164 G~--~~W~~--~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~-~~~~s~-----~~~~v 233 (794)
+. ..-.. ..+.+. .|+.+....++..+|++.-. +..+.++.++..+|+......+.. |.+..+ .+.+-
T Consensus 176 ~~~~l~~~~~~~~~~G~-GPRh~~f~pdg~~~Yv~~e~-s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~is 253 (345)
T PF10282_consen 176 DTGKLTPVDSIKVPPGS-GPRHLAFSPDGKYAYVVNEL-SNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAIS 253 (345)
T ss_dssp TS-TEEEEEEEECSTTS-SEEEEEE-TTSSEEEEEETT-TTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-
T ss_pred CCceEEEeeccccccCC-CCcEEEEcCCcCEEEEecCC-CCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEe
Confidence 43 33211 222222 36677655566678887644 445666777766897766555543 333322 22222
Q ss_pred -cCcEEEEEECCCCeEEEEEe
Q 003800 234 -SSDTLVTLDTTRSILVTVSF 253 (794)
Q Consensus 234 -g~~~lv~~d~~~g~L~v~~l 253 (794)
.++.+++.+...+++.+.++
T Consensus 254 pdg~~lyvsnr~~~sI~vf~~ 274 (345)
T PF10282_consen 254 PDGRFLYVSNRGSNSISVFDL 274 (345)
T ss_dssp TTSSEEEEEECTTTEEEEEEE
T ss_pred cCCCEEEEEeccCCEEEEEEE
Confidence 35577778877788888888
No 30
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=96.86 E-value=0.0019 Score=48.07 Aligned_cols=40 Identities=15% Similarity=0.205 Sum_probs=26.1
Q ss_pred CccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCC
Q 003800 73 GEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPD 115 (794)
Q Consensus 73 G~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~t 115 (794)
|+++|++.++.. +.+. ++..++.||+++.+++++|+|++|
T Consensus 1 G~~~W~~~~~~~--~~~~-~~v~~g~vyv~~~dg~l~ald~~t 40 (40)
T PF13570_consen 1 GKVLWSYDTGGP--IWSS-PAVAGGRVYVGTGDGNLYALDAAT 40 (40)
T ss_dssp S-EEEEEE-SS-----S---EECTSEEEEE-TTSEEEEEETT-
T ss_pred CceeEEEECCCC--cCcC-CEEECCEEEEEcCCCEEEEEeCCC
Confidence 899999999764 3333 356677888888778999999975
No 31
>PTZ00421 coronin; Provisional
Probab=96.83 E-value=0.23 Score=57.94 Aligned_cols=195 Identities=13% Similarity=0.111 Sum_probs=105.6
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceE-----EEcCc-ccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEecc
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWR-----HVLGI-NDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFLR 125 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR-----~~l~~-~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~ 125 (794)
++.+++++++|.|...|..++..... ..+.. ...+..+... .++.+++.++.++.|+.||..+|+.+=.....
T Consensus 88 ~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h 167 (493)
T PTZ00421 88 PQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCH 167 (493)
T ss_pred CCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEEcCC
Confidence 55788999999999999877643211 11211 1123333222 23345555566789999999999876555433
Q ss_pred CccccCCccccccccccccCCeEEE-E-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeE
Q 003800 126 GSKHSKPLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFH 203 (794)
Q Consensus 126 ~~~~s~~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~ 203 (794)
...+ ..+ ....++.+++ . .|+.+...|..+|+...+........ ...+......+.++.+++.++....
T Consensus 168 ~~~V-~sl-------a~spdG~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~-~~~~~w~~~~~~ivt~G~s~s~Dr~ 238 (493)
T PTZ00421 168 SDQI-TSL-------EWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAK-SQRCLWAKRKDLIITLGCSKSQQRQ 238 (493)
T ss_pred CCce-EEE-------EEECCCCEEEEecCCCEEEEEECCCCcEEEEEecCCCCc-ceEEEEcCCCCeEEEEecCCCCCCe
Confidence 3222 111 1122344444 3 38999999999999887765433221 1122222344566655654333346
Q ss_pred EEEEEcCCCceeee-eeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 204 AYQINAMNGELLNH-ETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 204 v~ald~~tG~~~w~-~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+..+|..+.....+ ..+...... ..+.+- ..++++......+.+++.++.+++
T Consensus 239 VklWDlr~~~~p~~~~~~d~~~~~-~~~~~d~d~~~L~lggkgDg~Iriwdl~~~~ 293 (493)
T PTZ00421 239 IMLWDTRKMASPYSTVDLDQSSAL-FIPFFDEDTNLLYIGSKGEGNIRCFELMNER 293 (493)
T ss_pred EEEEeCCCCCCceeEeccCCCCce-EEEEEcCCCCEEEEEEeCCCeEEEEEeeCCc
Confidence 77778776543221 111111111 011121 334554444346789999998877
No 32
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=96.83 E-value=0.0015 Score=46.13 Aligned_cols=27 Identities=30% Similarity=0.601 Sum_probs=25.4
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEE
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRH 79 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~ 79 (794)
++.+|+++.+|.|.|+|++||+++|++
T Consensus 6 ~~~v~~~~~~g~l~a~d~~~G~~~W~~ 32 (33)
T smart00564 6 DGTVYVGSTDGTLYALDAKTGEILWTY 32 (33)
T ss_pred CCEEEEEcCCCEEEEEEcccCcEEEEc
Confidence 668999999999999999999999985
No 33
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=96.80 E-value=0.0025 Score=47.35 Aligned_cols=40 Identities=23% Similarity=0.355 Sum_probs=27.2
Q ss_pred ccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcC
Q 003800 29 GLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRH 72 (794)
Q Consensus 29 G~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~t 72 (794)
|+..|++++-|.. ...|...+++||+++.+|.|+|||++|
T Consensus 1 G~~~W~~~~~~~~----~~~~~v~~g~vyv~~~dg~l~ald~~t 40 (40)
T PF13570_consen 1 GKVLWSYDTGGPI----WSSPAVAGGRVYVGTGDGNLYALDAAT 40 (40)
T ss_dssp S-EEEEEE-SS-------S--EECTSEEEEE-TTSEEEEEETT-
T ss_pred CceeEEEECCCCc----CcCCEEECCEEEEEcCCCEEEEEeCCC
Confidence 7889999885522 344556699999999999999999986
No 34
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL, EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE (InterPro IPR022528) and the Pseudomonas aeruginosa DevB (InterPro IPR005900) types. This entry contains bacterial YbhE-type 6-phosphogluconolactonases (EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=96.78 E-value=1.3 Score=49.27 Aligned_cols=195 Identities=10% Similarity=0.150 Sum_probs=107.9
Q ss_pred EEEEeCC----CE--EEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc----CCeEEEEeCCC--CcEeEEEe
Q 003800 56 VVVSTEE----NV--IASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD----GSTLRAWNLPD--GQMVWESF 123 (794)
Q Consensus 56 Vyv~t~~----g~--l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~----g~~v~A~d~~t--G~llWe~~ 123 (794)
+|+++.. +- ++.+|.++|++--.+..........+.....+..+|+... .+.|.+|+..+ |++.--..
T Consensus 2 ~~vgsy~~~~~~gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~ 81 (345)
T PF10282_consen 2 LYVGSYTNGKGGGIYVFRFDEETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNS 81 (345)
T ss_dssp EEEEECCSSSSTEEEEEEEETTTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEE
T ss_pred EEEEcCCCCCCCcEEEEEEcCCCCCceEeeeecCCCCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeee
Confidence 5677765 33 5566779998877766543332333322335566666432 45777766553 88877666
Q ss_pred ccCccccCCcccccccccccc-CCeEEEEE--CCEEEEEECCC-CcEEEE-----Eec--cCcc----eeeeeEEEEecC
Q 003800 124 LRGSKHSKPLLLVPTNLKVDK-DSLILVSS--KGCLHAVSSID-GEILWT-----RDF--AAES----VEVQQVIQLDES 188 (794)
Q Consensus 124 l~~~~~s~~~~~~~~~~~~~~-~~~V~V~~--~g~l~ald~~t-G~~~W~-----~~~--~~~~----~~~~~~v~s~~~ 188 (794)
..... ..+..+ ..+. ++.+++.. +|.+..++..+ |++.-. ... +.+. .-+-++....++
T Consensus 82 ~~~~g--~~p~~i----~~~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg 155 (345)
T PF10282_consen 82 VPSGG--SSPCHI----AVDPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDG 155 (345)
T ss_dssp EEESS--SCEEEE----EECTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTS
T ss_pred eccCC--CCcEEE----EEecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCC
Confidence 54221 122222 2222 45677763 88888888764 765433 211 1110 112334333345
Q ss_pred CEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEE--cCcEEEEEECCCCeEEEEEee--cce
Q 003800 189 DQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALV--SSDTLVTLDTTRSILVTVSFK--NRK 257 (794)
Q Consensus 189 ~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v--g~~~lv~~d~~~g~L~v~~l~--sg~ 257 (794)
..+|+. .-|...+.++.+|..+|+......+..+.+-....+.. .++++++++...+.+.++++. +|+
T Consensus 156 ~~v~v~-dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~ 227 (345)
T PF10282_consen 156 RFVYVP-DLGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGS 227 (345)
T ss_dssp SEEEEE-ETTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTE
T ss_pred CEEEEE-ecCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeecccCCc
Confidence 567764 45667788888888888766545454443322222322 445777888788899999998 565
No 35
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=96.77 E-value=0.8 Score=50.96 Aligned_cols=79 Identities=5% Similarity=-0.052 Sum_probs=59.9
Q ss_pred eccCCCEEEEEeC----------CCEEEEEECcCCccceEEEcCcccc-----ee-eeeeeeCCEEEEEEc-c-CCeEEE
Q 003800 49 QKTGRKRVVVSTE----------ENVIASLDLRHGEIFWRHVLGINDV-----VD-GIDIALGKYVITLSS-D-GSTLRA 110 (794)
Q Consensus 49 ~~~~~~~Vyv~t~----------~g~l~ALn~~tG~ivWR~~l~~~~~-----i~-~l~~~~g~~~V~Vs~-~-g~~v~A 110 (794)
.+.+++.+|+++. .+.|..+|++|++++.+..++.... .. .+.+..++..++|+. . .+.|..
T Consensus 53 ~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~V 132 (352)
T TIGR02658 53 VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGV 132 (352)
T ss_pred ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEE
Confidence 4556778998776 7899999999999999999865511 01 222345666777765 3 578999
Q ss_pred EeCCCCcEeEEEeccCc
Q 003800 111 WNLPDGQMVWESFLRGS 127 (794)
Q Consensus 111 ~d~~tG~llWe~~l~~~ 127 (794)
+|.++|+.+=+....+.
T Consensus 133 vD~~~~kvv~ei~vp~~ 149 (352)
T TIGR02658 133 VDLEGKAFVRMMDVPDC 149 (352)
T ss_pred EECCCCcEEEEEeCCCC
Confidence 99999999999998653
No 36
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=96.66 E-value=0.0039 Score=45.87 Aligned_cols=31 Identities=13% Similarity=0.232 Sum_probs=26.6
Q ss_pred EEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 98 VITLSSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 98 ~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
.|+++..++.++|+|+.||+++|+++.....
T Consensus 2 ~v~~~~~~g~l~AlD~~TG~~~W~~~~~~~~ 32 (38)
T PF01011_consen 2 RVYVGTPDGYLYALDAKTGKVLWKFQTGPPV 32 (38)
T ss_dssp EEEEETTTSEEEEEETTTTSEEEEEESSSGG
T ss_pred EEEEeCCCCEEEEEECCCCCEEEeeeCCCCC
Confidence 5677777789999999999999999987654
No 37
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=96.43 E-value=0.17 Score=54.89 Aligned_cols=156 Identities=13% Similarity=0.125 Sum_probs=97.9
Q ss_pred cCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc-
Q 003800 51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH- 129 (794)
Q Consensus 51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~- 129 (794)
++++|++++.++|.|.+.|++||+++-+..-.+.............-+++-+..++.+...+..+|+++--..-..+.+
T Consensus 200 pdGKr~~tgy~dgti~~Wn~ktg~p~~~~~~~e~~~~~~~~~~~~~~~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~ 279 (399)
T KOG0296|consen 200 PDGKRILTGYDDGTIIVWNPKTGQPLHKITQAEGLELPCISLNLAGSTLTKGNSEGVACGVNNGSGKVVNCNNGTVPELK 279 (399)
T ss_pred CCCceEEEEecCceEEEEecCCCceeEEecccccCcCCccccccccceeEeccCCccEEEEccccceEEEecCCCCcccc
Confidence 3588899999999999999999999988764443222222222333344434456788888888999887766322211
Q ss_pred ------cCCccccccccccccCCe-EEE-EE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800 130 ------SKPLLLVPTNLKVDKDSL-ILV-SS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSS 200 (794)
Q Consensus 130 ------s~~~~~~~~~~~~~~~~~-V~V-~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~ 200 (794)
.......+. .... +.. .+ +|++.-+|.+.-+++-..+.+.+. .++.. .....+|..+.+|
T Consensus 280 ~~~e~~~esve~~~~-----ss~lpL~A~G~vdG~i~iyD~a~~~~R~~c~he~~V---~~l~w-~~t~~l~t~c~~g-- 348 (399)
T KOG0296|consen 280 PSQEELDESVESIPS-----SSKLPLAACGSVDGTIAIYDLAASTLRHICEHEDGV---TKLKW-LNTDYLLTACANG-- 348 (399)
T ss_pred ccchhhhhhhhhccc-----ccccchhhcccccceEEEEecccchhheeccCCCce---EEEEE-cCcchheeeccCc--
Confidence 011111110 0111 111 22 888888888766666555555542 23332 1256778777777
Q ss_pred eeEEEEEEcCCCceeeeee
Q 003800 201 QFHAYQINAMNGELLNHET 219 (794)
Q Consensus 201 ~~~v~ald~~tG~~~w~~~ 219 (794)
+|..+|+.||+.+..++
T Consensus 349 --~v~~wDaRtG~l~~~y~ 365 (399)
T KOG0296|consen 349 --KVRQWDARTGQLKFTYT 365 (399)
T ss_pred --eEEeeeccccceEEEEe
Confidence 89999999999998885
No 38
>KOG2103 consensus Uncharacterized conserved protein [Function unknown]
Probab=96.41 E-value=0.089 Score=62.25 Aligned_cols=192 Identities=16% Similarity=0.185 Sum_probs=111.7
Q ss_pred cccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccC
Q 003800 26 DQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDG 105 (794)
Q Consensus 26 dqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g 105 (794)
-..|.+-|||.+-+++... +-+- .-++..+.-.+.+-|.++|...|...+..+.....+ ....++.++++
T Consensus 64 ~~tGei~WRqvl~~~~~~~--~~~~----~~~iS~dg~~lr~wn~~~g~l~~~i~l~~g~~~~~~--~v~~~i~v~~g-- 133 (910)
T KOG2103|consen 64 LRTGEIIWRQVLEPKTSGL--GVPL----TNTISVDGRYLRSWNTNNGILDWEIELADGFKGLLL--EVNKGIAVLNG-- 133 (910)
T ss_pred ccCCcEEEEEeccCCCccc--Ccce----eEEEccCCcEEEeecCCCceeeeecccccccceeEE--EEccceEEEcc--
Confidence 4478999999774443322 1111 125555566799999999999999988766222233 34444445444
Q ss_pred CeEEEEeCCCCcEeEEEeccCccc--cCCccccccccccccCCeEEEE-----ECCEEEEEECCCCcEE-EEEeccCcce
Q 003800 106 STLRAWNLPDGQMVWESFLRGSKH--SKPLLLVPTNLKVDKDSLILVS-----SKGCLHAVSSIDGEIL-WTRDFAAESV 177 (794)
Q Consensus 106 ~~v~A~d~~tG~llWe~~l~~~~~--s~~~~~~~~~~~~~~~~~V~V~-----~~g~l~ald~~tG~~~-W~~~~~~~~~ 177 (794)
|....|.+.|+..+..... .+++.+.+ .+.++++ ++..+++++..+|++. |+...-.|+.
T Consensus 134 -----~~~~~g~l~w~~~~~~~~~~~~q~~~~~~-------t~vvy~~~~l~~s~~~V~~~~~~~g~v~~~~~~v~~pw~ 201 (910)
T KOG2103|consen 134 -----HTRKFGELKWVESFSISIEEDLQDAKIYG-------TDVVYVLGLLKRSGSCVQQVFSDDGEVTGPQSTVLGPWF 201 (910)
T ss_pred -----eeccccceeehhhccccchhHHHHhhhcc-------CcEEEEEEEEecCCceEEEEEccCCcEecceeeeecCcc
Confidence 7899999999998875432 01122221 3444443 2668999999999988 9888777776
Q ss_pred eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee-eeecccCccCceEEE-cC--cEEEEEECCCC
Q 003800 178 EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE-TAAFSGGFVGDVALV-SS--DTLVTLDTTRS 246 (794)
Q Consensus 178 ~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~-~v~~~~~~s~~~~~v-g~--~~lv~~d~~~g 246 (794)
.+..| + .+..++.++..| .+..+|...++.--.+ ....-..+.+..+++ |+ ++++|+++.++
T Consensus 202 ~~~~c--~-~~k~~vl~~s~g----~l~s~di~~~~~~~~q~~~e~l~~l~g~~i~~~g~~~~~~V~V~s~~~ 267 (910)
T KOG2103|consen 202 KVLSC--S-TDKEVVLVCSNG----TLISLDISSQKVQISQLLAEILLPLTGDLILLDGNKHTAMVSVNSSSN 267 (910)
T ss_pred ccccc--c-cccceEEEcCCC----CeEEEEEEeeccchhhhhhhhhhccCCceEEecCCCceeEEEEecCCC
Confidence 55444 2 233444456666 3555555433322111 111112334444444 32 37888886433
No 39
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=96.29 E-value=1.1 Score=52.28 Aligned_cols=188 Identities=13% Similarity=0.163 Sum_probs=118.4
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCc----ccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGI----NDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~----~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
++.+-++-.+|.|---|++.+ |-+...- ...+.++.-+.++.+..++. .+.+.-||+.+|+.+-+....+..
T Consensus 37 S~~lAvsRt~g~IEiwN~~~~---w~~~~vi~g~~drsIE~L~W~e~~RLFS~g~-sg~i~EwDl~~lk~~~~~d~~gg~ 112 (691)
T KOG2048|consen 37 SNQLAVSRTDGNIEIWNLSNN---WFLEPVIHGPEDRSIESLAWAEGGRLFSSGL-SGSITEWDLHTLKQKYNIDSNGGA 112 (691)
T ss_pred CCceeeeccCCcEEEEccCCC---ceeeEEEecCCCCceeeEEEccCCeEEeecC-CceEEEEecccCceeEEecCCCcc
Confidence 555666666788888888874 8776532 22455552223444444344 569999999999999988876653
Q ss_pred ccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800 129 HSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQI 207 (794)
Q Consensus 129 ~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al 207 (794)
. +.+. .-.....+.|. .+|.++-++...|+...+..++........+.-..++-+++..+.+| .+.+.
T Consensus 113 I----Wsia---i~p~~~~l~IgcddGvl~~~s~~p~~I~~~r~l~rq~sRvLslsw~~~~~~i~~Gs~Dg----~Iriw 181 (691)
T KOG2048|consen 113 I----WSIA---INPENTILAIGCDDGVLYDFSIGPDKITYKRSLMRQKSRVLSLSWNPTGTKIAGGSIDG----VIRIW 181 (691)
T ss_pred e----eEEE---eCCccceEEeecCCceEEEEecCCceEEEEeecccccceEEEEEecCCccEEEecccCc----eEEEE
Confidence 3 3332 11113455566 48899999999999888877765432122222112233355444444 89999
Q ss_pred EcCCCceeeeeeeecccCccC-------ceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800 208 NAMNGELLNHETAAFSGGFVG-------DVALVSSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 208 d~~tG~~~w~~~v~~~~~~s~-------~~~~vg~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
|+++|+.+.-.+.... .+.. ++.++..+.++|.|+ +|.+..=|-..|+
T Consensus 182 d~~~~~t~~~~~~~~d-~l~k~~~~iVWSv~~Lrd~tI~sgDS-~G~V~FWd~~~gT 236 (691)
T KOG2048|consen 182 DVKSGQTLHIITMQLD-RLSKREPTIVWSVLFLRDSTIASGDS-AGTVTFWDSIFGT 236 (691)
T ss_pred EcCCCceEEEeeeccc-ccccCCceEEEEEEEeecCcEEEecC-CceEEEEcccCcc
Confidence 9999998872222111 1111 344567889999995 6988888888887
No 40
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=96.25 E-value=1.7 Score=51.89 Aligned_cols=186 Identities=14% Similarity=0.129 Sum_probs=105.8
Q ss_pred CCEEEEEeCCCEEEEEECcCC-ccceEE--EcCccc-cee-eeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800 53 RKRVVVSTEENVIASLDLRHG-EIFWRH--VLGIND-VVD-GIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS 127 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG-~ivWR~--~l~~~~-~i~-~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~ 127 (794)
+..++..+.+|.+.-.+..++ +++--+ .++..+ .|. .+++..-=+-++++..+|.+.-||..+|+++.+++....
T Consensus 124 Ge~lia~d~~~~l~vw~~s~~~~e~~l~~~~~~~~~~~Ital~HP~TYLNKIvvGs~~G~lql~Nvrt~K~v~~f~~~~s 203 (910)
T KOG1539|consen 124 GEHLIAVDISNILFVWKTSSIQEELYLQSTFLKVEGDFITALLHPSTYLNKIVVGSSQGRLQLWNVRTGKVVYTFQEFFS 203 (910)
T ss_pred cceEEEEEccCcEEEEEeccccccccccceeeeccCCceeeEecchhheeeEEEeecCCcEEEEEeccCcEEEEeccccc
Confidence 345666666666666665554 221111 000011 122 122222223345555567999999999999999987654
Q ss_pred cccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800 128 KHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQ 206 (794)
Q Consensus 128 ~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~a 206 (794)
.. .. +.+ +++ -+.|.+. .+|++.-+|.+.|+.+-+++.+.+.. ..+.-..++..+-+.+-. .+.+.-
T Consensus 204 ~I-T~--ieq---sPa-LDVVaiG~~~G~ViifNlK~dkil~sFk~d~g~V--tslSFrtDG~p~las~~~---~G~m~~ 271 (910)
T KOG1539|consen 204 RI-TA--IEQ---SPA-LDVVAIGLENGTVIIFNLKFDKILMSFKQDWGRV--TSLSFRTDGNPLLASGRS---NGDMAF 271 (910)
T ss_pred ce-eE--ecc---CCc-ceEEEEeccCceEEEEEcccCcEEEEEEccccce--eEEEeccCCCeeEEeccC---CceEEE
Confidence 33 11 111 111 1233343 49999999999999999998874332 122212345555554433 237888
Q ss_pred EEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEE
Q 003800 207 INAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTV 251 (794)
Q Consensus 207 ld~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~ 251 (794)
.|+..-+.+|+.+-+...++.+...+.|..+++...++ .+|++-
T Consensus 272 wDLe~kkl~~v~~nah~~sv~~~~fl~~epVl~ta~~D-nSlk~~ 315 (910)
T KOG1539|consen 272 WDLEKKKLINVTRNAHYGSVTGATFLPGEPVLVTAGAD-NSLKVW 315 (910)
T ss_pred EEcCCCeeeeeeeccccCCcccceecCCCceEeeccCC-CceeEE
Confidence 99988888888874444555555555566666655443 444433
No 41
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate.; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=96.09 E-value=0.24 Score=57.59 Aligned_cols=151 Identities=18% Similarity=0.236 Sum_probs=80.4
Q ss_pred CCEEEEEeC-----CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800 53 RKRVVVSTE-----ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS 127 (794)
Q Consensus 53 ~~~Vyv~t~-----~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~ 127 (794)
.+.+|+.+. ....+++| .+|.++|...+....... +....++.+++.++ ..++.+|. .|+++|++.+...
T Consensus 113 ~~gl~~~~~~~~~~~~~~~~iD-~~G~Vrw~~~~~~~~~~~-~~~l~nG~ll~~~~--~~~~e~D~-~G~v~~~~~l~~~ 187 (477)
T PF05935_consen 113 EDGLYFVNGNDWDSSSYTYLID-NNGDVRWYLPLDSGSDNS-FKQLPNGNLLIGSG--NRLYEIDL-LGKVIWEYDLPGG 187 (477)
T ss_dssp TT-EEEEEETT--BEEEEEEEE-TTS-EEEEE-GGGT--SS-EEE-TTS-EEEEEB--TEEEEE-T-T--EEEEEE--TT
T ss_pred CCcEEEEeCCCCCCCceEEEEC-CCccEEEEEccCccccce-eeEcCCCCEEEecC--CceEEEcC-CCCEEEeeecCCc
Confidence 555777666 67899999 589999999887653211 21223344444333 68999998 6999999999874
Q ss_pred c--ccCCccccccccccccCCeEEEE-E--------------CCEEEEEECCCCcEEEEEeccCcc---ee---------
Q 003800 128 K--HSKPLLLVPTNLKVDKDSLILVS-S--------------KGCLHAVSSIDGEILWTRDFAAES---VE--------- 178 (794)
Q Consensus 128 ~--~s~~~~~~~~~~~~~~~~~V~V~-~--------------~g~l~ald~~tG~~~W~~~~~~~~---~~--------- 178 (794)
. ..-+....+ ++.++++ . ...+.-+| .+|+++|+|+....- ..
T Consensus 188 ~~~~HHD~~~l~-------nGn~L~l~~~~~~~~~~~~~~~~~D~Ivevd-~tG~vv~~wd~~d~ld~~~~~~~~~~~~~ 259 (477)
T PF05935_consen 188 YYDFHHDIDELP-------NGNLLILASETKYVDEDKDVDTVEDVIVEVD-PTGEVVWEWDFFDHLDPYRDTVLKPYPYG 259 (477)
T ss_dssp EE-B-S-EEE-T-------TS-EEEEEEETTEE-TS-EE---S-EEEEE--TTS-EEEEEEGGGTS-TT--TTGGT--SS
T ss_pred ccccccccEECC-------CCCEEEEEeecccccCCCCccEecCEEEEEC-CCCCEEEEEehHHhCCccccccccccccc
Confidence 3 111222222 3444442 3 45799999 999999999774311 00
Q ss_pred ----------ee---eEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800 179 ----------VQ---QVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET 219 (794)
Q Consensus 179 ----------~~---~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~ 219 (794)
.. .+.+...++.+.+.+-.-+ .|..+|..||++.|-.-
T Consensus 260 ~~~~~~~~~DW~H~Nsi~yd~~dd~iivSsR~~s---~V~~Id~~t~~i~Wilg 310 (477)
T PF05935_consen 260 DISGSGGGRDWLHINSIDYDPSDDSIIVSSRHQS---AVIKIDYRTGKIKWILG 310 (477)
T ss_dssp SSS-SSTTSBS--EEEEEEETTTTEEEEEETTT----EEEEEE-TTS-EEEEES
T ss_pred ccccCCCCCCccccCccEEeCCCCeEEEEcCcce---EEEEEECCCCcEEEEeC
Confidence 00 0111123566665443222 68999999999999873
No 42
>PTZ00420 coronin; Provisional
Probab=96.03 E-value=2.4 Score=50.28 Aligned_cols=191 Identities=12% Similarity=0.086 Sum_probs=103.3
Q ss_pred CCEEEEEeCCCEEEEEECcCCccce------EEEcCc-ccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFW------RHVLGI-NDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivW------R~~l~~-~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l 124 (794)
++.++.++.+|.|.-.|..++...- ...+.. ...+..+... .+..+++.++.++.++.||..+|+.+++...
T Consensus 87 ~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~~~~~i~~ 166 (568)
T PTZ00420 87 SEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQINM 166 (568)
T ss_pred CCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEEEECCCCcEEEEEec
Confidence 4578889999999999988764311 111211 1223332212 2444444455568999999999999888764
Q ss_pred cCccccCCccccccccccccCCeEEEE-E-CCEEEEEECCCCcEEEEEeccCcceeeeeEEE----EecCCEEEEEEecC
Q 003800 125 RGSKHSKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQ----LDESDQIYVVGYAG 198 (794)
Q Consensus 125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~----s~~~~~vyv~~~~g 198 (794)
..... .+ ....++.+++. + ++.+...|..+|+.+-++....... ....+. +..++.+...++++
T Consensus 167 ~~~V~--Sl-------swspdG~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~g~~-~s~~v~~~~fs~d~~~IlTtG~d~ 236 (568)
T PTZ00420 167 PKKLS--SL-------KWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHDGGK-NTKNIWIDGLGGDDNYILSTGFSK 236 (568)
T ss_pred CCcEE--EE-------EECCCCCEEEEEecCCEEEEEECCCCcEEEEEecccCCc-eeEEEEeeeEcCCCCEEEEEEcCC
Confidence 33221 11 12224555554 3 8899999999999876655433221 111111 12334555555554
Q ss_pred CceeEEEEEEcCC-CceeeeeeeecccCccCce-EEE--c-CcEEEEEECCCCeEEEEEeecce
Q 003800 199 SSQFHAYQINAMN-GELLNHETAAFSGGFVGDV-ALV--S-SDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 199 ~~~~~v~ald~~t-G~~~w~~~v~~~~~~s~~~-~~v--g-~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
.....+.-.|+.+ ++++-...+.... +.+ .+. . +.++++.. ..+.+++.++..+.
T Consensus 237 ~~~R~VkLWDlr~~~~pl~~~~ld~~~---~~L~p~~D~~tg~l~lsGk-GD~tIr~~e~~~~~ 296 (568)
T PTZ00420 237 NNMREMKLWDLKNTTSALVTMSIDNAS---APLIPHYDESTGLIYLIGK-GDGNCRYYQHSLGS 296 (568)
T ss_pred CCccEEEEEECCCCCCceEEEEecCCc---cceEEeeeCCCCCEEEEEE-CCCeEEEEEccCCc
Confidence 3323566677774 5555443221111 111 111 1 23455553 45778888877665
No 43
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=95.87 E-value=0.014 Score=41.12 Aligned_cols=29 Identities=24% Similarity=0.445 Sum_probs=24.0
Q ss_pred eCCEEEEEEccCCeEEEEeCCCCcEeEEE
Q 003800 94 LGKYVITLSSDGSTLRAWNLPDGQMVWES 122 (794)
Q Consensus 94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~ 122 (794)
..++.+++++.++.++|+|+++|+++|+.
T Consensus 4 ~~~~~v~~~~~~g~l~a~d~~~G~~~W~~ 32 (33)
T smart00564 4 LSDGTVYVGSTDGTLYALDAKTGEILWTY 32 (33)
T ss_pred EECCEEEEEcCCCEEEEEEcccCcEEEEc
Confidence 34556777776789999999999999986
No 44
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=95.85 E-value=1.3 Score=49.65 Aligned_cols=199 Identities=16% Similarity=0.165 Sum_probs=112.5
Q ss_pred CceeeeeeeeeccCCCEEEEEeCCC--EEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEccCCeEEEEeCCC
Q 003800 39 GKVKHAVFHTQKTGRKRVVVSTEEN--VIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSDGSTLRAWNLPD 115 (794)
Q Consensus 39 G~~~~~~f~~~~~~~~~Vyv~t~~g--~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~g~~v~A~d~~t 115 (794)
|......||. ....+.|+.-+| .|+.+|-++-..+=...|+.- +|..... ..|...++.++....++.||..+
T Consensus 214 ~~I~sv~FHp---~~plllvaG~d~~lrifqvDGk~N~~lqS~~l~~f-Pi~~a~f~p~G~~~i~~s~rrky~ysyDle~ 289 (514)
T KOG2055|consen 214 GGITSVQFHP---TAPLLLVAGLDGTLRIFQVDGKVNPKLQSIHLEKF-PIQKAEFAPNGHSVIFTSGRRKYLYSYDLET 289 (514)
T ss_pred CCceEEEecC---CCceEEEecCCCcEEEEEecCccChhheeeeeccC-ccceeeecCCCceEEEecccceEEEEeeccc
Confidence 3334456773 244678887776 477888777665544444332 2332222 34555777787777899999999
Q ss_pred CcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEE
Q 003800 116 GQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVV 194 (794)
Q Consensus 116 G~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~ 194 (794)
+++.=-.+..+-. .+....... ..+ +..+++. ..|.++.|.++||+..=+.+.++... .+..+..+..+++.
T Consensus 290 ak~~k~~~~~g~e-~~~~e~FeV--Shd-~~fia~~G~~G~I~lLhakT~eli~s~KieG~v~---~~~fsSdsk~l~~~ 362 (514)
T KOG2055|consen 290 AKVTKLKPPYGVE-EKSMERFEV--SHD-SNFIAIAGNNGHIHLLHAKTKELITSFKIEGVVS---DFTFSSDSKELLAS 362 (514)
T ss_pred cccccccCCCCcc-cchhheeEe--cCC-CCeEEEcccCceEEeehhhhhhhhheeeeccEEe---eEEEecCCcEEEEE
Confidence 8875322222211 112222210 111 2333333 48999999999998877777654321 12223455667776
Q ss_pred EecCCceeEEEEEEcCCCceeeeeeeecccCccCc--eEEEcCcEEEEEECCCCeEEEEEeec
Q 003800 195 GYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGD--VALVSSDTLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 195 ~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~--~~~vg~~~lv~~d~~~g~L~v~~l~s 255 (794)
+..| .|+.+|+..-..+... .-...+.+. |.-..+.+++|.. +.|.+-+.|.++
T Consensus 363 ~~~G----eV~v~nl~~~~~~~rf--~D~G~v~gts~~~S~ng~ylA~GS-~~GiVNIYd~~s 418 (514)
T KOG2055|consen 363 GGTG----EVYVWNLRQNSCLHRF--VDDGSVHGTSLCISLNGSYLATGS-DSGIVNIYDGNS 418 (514)
T ss_pred cCCc----eEEEEecCCcceEEEE--eecCccceeeeeecCCCceEEecc-CcceEEEeccch
Confidence 6666 7888888766444322 223344442 3223445666664 567777777665
No 45
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=95.22 E-value=7.6 Score=44.50 Aligned_cols=151 Identities=15% Similarity=0.183 Sum_probs=97.5
Q ss_pred cCCCEEEEEeCCCEEEEEECcCCccceEEEcCcc--cceeeee-eeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800 51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN--DVVDGID-IALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS 127 (794)
Q Consensus 51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~--~~i~~l~-~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~ 127 (794)
+++.+...++.+|.++..|-+||+.+=...-+.. +.|-++. -..+..++++|++ ..++-||.+++++.=++..+..
T Consensus 200 PDG~~Fat~gsDgki~iyDGktge~vg~l~~~~aHkGsIfalsWsPDs~~~~T~SaD-kt~KIWdVs~~slv~t~~~~~~ 278 (603)
T KOG0318|consen 200 PDGSRFATAGSDGKIYIYDGKTGEKVGELEDSDAHKGSIFALSWSPDSTQFLTVSAD-KTIKIWDVSTNSLVSTWPMGST 278 (603)
T ss_pred CCCCeEEEecCCccEEEEcCCCccEEEEecCCCCccccEEEEEECCCCceEEEecCC-ceEEEEEeeccceEEEeecCCc
Confidence 4566777788899999999999999876543221 2333332 1256778888875 6899999999999988887654
Q ss_pred cccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800 128 KHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQ 206 (794)
Q Consensus 128 ~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~a 206 (794)
...+ .++. ..- .+.++.. -+|.+.-|++.++.+.=...--...+....+ +.++..+|-.+.+| .+..
T Consensus 279 v~dq---qvG~--lWq-kd~lItVSl~G~in~ln~~d~~~~~~i~GHnK~ITaLtv--~~d~~~i~SgsyDG----~I~~ 346 (603)
T KOG0318|consen 279 VEDQ---QVGC--LWQ-KDHLITVSLSGTINYLNPSDPSVLKVISGHNKSITALTV--SPDGKTIYSGSYDG----HINS 346 (603)
T ss_pred hhce---EEEE--EEe-CCeEEEEEcCcEEEEecccCCChhheecccccceeEEEE--cCCCCEEEeeccCc----eEEE
Confidence 2211 1210 112 3344444 4999999999999866555444333322222 23455566655565 7888
Q ss_pred EEcCCCce
Q 003800 207 INAMNGEL 214 (794)
Q Consensus 207 ld~~tG~~ 214 (794)
.|..+|.-
T Consensus 347 W~~~~g~~ 354 (603)
T KOG0318|consen 347 WDSGSGTS 354 (603)
T ss_pred EecCCccc
Confidence 88777753
No 46
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.19 E-value=0.78 Score=47.35 Aligned_cols=190 Identities=13% Similarity=0.113 Sum_probs=100.8
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEE-ccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLS-SDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL 134 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~ 134 (794)
+-....+-.+...|..||++.=|..--. ..+...+. ..+-.|++| +.+..+|+||..+-...=-.-+. +.. .+..
T Consensus 74 f~s~GgDk~v~vwDV~TGkv~Rr~rgH~-aqVNtV~f-NeesSVv~SgsfD~s~r~wDCRS~s~ePiQild-ea~-D~V~ 149 (307)
T KOG0316|consen 74 FASCGGDKAVQVWDVNTGKVDRRFRGHL-AQVNTVRF-NEESSVVASGSFDSSVRLWDCRSRSFEPIQILD-EAK-DGVS 149 (307)
T ss_pred cccCCCCceEEEEEcccCeeeeeccccc-ceeeEEEe-cCcceEEEeccccceeEEEEcccCCCCccchhh-hhc-Ccee
Confidence 3344457789999999999875543221 22333322 233344445 45889999998654322111111 100 0000
Q ss_pred ccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEE-EecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800 135 LVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQ-LDESDQIYVVGYAGSSQFHAYQINAMNG 212 (794)
Q Consensus 135 ~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~-s~~~~~vyv~~~~g~~~~~v~ald~~tG 212 (794)
.+ .+. +..++..+ ||+++.+|...|..---+- .. |..++- +.+++-+.+.++++ .+.-||-.||
T Consensus 150 Si----~v~-~heIvaGS~DGtvRtydiR~G~l~sDy~-g~----pit~vs~s~d~nc~La~~l~s----tlrLlDk~tG 215 (307)
T KOG0316|consen 150 SI----DVA-EHEIVAGSVDGTVRTYDIRKGTLSSDYF-GH----PITSVSFSKDGNCSLASSLDS----TLRLLDKETG 215 (307)
T ss_pred EE----Eec-ccEEEeeccCCcEEEEEeecceeehhhc-CC----cceeEEecCCCCEEEEeeccc----eeeecccchh
Confidence 01 111 34444444 9999999999886543331 11 112221 22344445545554 7888999999
Q ss_pred ceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecceeeeEEEe
Q 003800 213 ELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKIAFQETH 264 (794)
Q Consensus 213 ~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~ 264 (794)
+++..+.-......--.|-+-.....|..-+..|.++.-||..+.+ +..++
T Consensus 216 klL~sYkGhkn~eykldc~l~qsdthV~sgSEDG~Vy~wdLvd~~~-~sk~~ 266 (307)
T KOG0316|consen 216 KLLKSYKGHKNMEYKLDCCLNQSDTHVFSGSEDGKVYFWDLVDETQ-ISKLS 266 (307)
T ss_pred HHHHHhcccccceeeeeeeecccceeEEeccCCceEEEEEecccee-eeeec
Confidence 9998775322211111333323333334444678899999988773 33333
No 47
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=95.09 E-value=6.6 Score=43.08 Aligned_cols=141 Identities=15% Similarity=0.231 Sum_probs=80.4
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEc-cCCeEEEEeCCCCcEeEEEeccCccc
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSS-DGSTLRAWNLPDGQMVWESFLRGSKH 129 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~tG~llWe~~l~~~~~ 129 (794)
..+.+.++.++..-+-.+..||+ |-..+... +++..... ..++.+.++| -.+.|+.|...+|...|...-..
T Consensus 75 ~~~l~aTGGgDD~AflW~~~~ge--~~~eltgHKDSVt~~~F-shdgtlLATGdmsG~v~v~~~stg~~~~~~~~e~--- 148 (399)
T KOG0296|consen 75 NNNLVATGGGDDLAFLWDISTGE--FAGELTGHKDSVTCCSF-SHDGTLLATGDMSGKVLVFKVSTGGEQWKLDQEV--- 148 (399)
T ss_pred CCceEEecCCCceEEEEEccCCc--ceeEecCCCCceEEEEE-ccCceEEEecCCCccEEEEEcccCceEEEeeccc---
Confidence 34556677778877778888888 66666543 45544422 3334444455 47899999999999999986222
Q ss_pred cCCccccccccccccCCeEEEE-E-CCEEEEEECCCCcEEEEEeccCcceeeeeEEE----------EecCCEEEEEEec
Q 003800 130 SKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQ----------LDESDQIYVVGYA 197 (794)
Q Consensus 130 s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~----------s~~~~~vyv~~~~ 197 (794)
.++..+- .+....++.. + +|. +|-|+.++... .++.. -..+|+-.+.++.
T Consensus 149 -~dieWl~----WHp~a~illAG~~DGs-----------vWmw~ip~~~~--~kv~~Gh~~~ct~G~f~pdGKr~~tgy~ 210 (399)
T KOG0296|consen 149 -EDIEWLK----WHPRAHILLAGSTDGS-----------VWMWQIPSQAL--CKVMSGHNSPCTCGEFIPDGKRILTGYD 210 (399)
T ss_pred -CceEEEE----ecccccEEEeecCCCc-----------EEEEECCCcce--eeEecCCCCCcccccccCCCceEEEEec
Confidence 1222331 2222333332 2 454 45555554211 11110 0123443334444
Q ss_pred CCceeEEEEEEcCCCceeeeee
Q 003800 198 GSSQFHAYQINAMNGELLNHET 219 (794)
Q Consensus 198 g~~~~~v~ald~~tG~~~w~~~ 219 (794)
.+ .+...|++||+++-...
T Consensus 211 dg---ti~~Wn~ktg~p~~~~~ 229 (399)
T KOG0296|consen 211 DG---TIIVWNPKTGQPLHKIT 229 (399)
T ss_pred Cc---eEEEEecCCCceeEEec
Confidence 33 79999999999997664
No 48
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.06 E-value=5.1 Score=41.62 Aligned_cols=146 Identities=14% Similarity=0.121 Sum_probs=78.9
Q ss_pred eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEe
Q 003800 94 LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRD 171 (794)
Q Consensus 94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~ 171 (794)
.|+..++ .|.+.+|+.||+..|.++-++...+... .++... .++.=+.. .|..++..|..||++.-++.
T Consensus 28 dGnY~lt-cGsdrtvrLWNp~rg~liktYsghG~EV-lD~~~s-------~Dnskf~s~GgDk~v~vwDV~TGkv~Rr~r 98 (307)
T KOG0316|consen 28 DGNYCLT-CGSDRTVRLWNPLRGALIKTYSGHGHEV-LDAALS-------SDNSKFASCGGDKAVQVWDVNTGKVDRRFR 98 (307)
T ss_pred CCCEEEE-cCCCceEEeecccccceeeeecCCCcee-eecccc-------ccccccccCCCCceEEEEEcccCeeeeecc
Confidence 3555555 4456899999999999999998876433 122111 13333333 36678899999999876665
Q ss_pred ccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecc-cCccCceEEEcCcEEEEEECCCCeEEE
Q 003800 172 FAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFS-GGFVGDVALVSSDTLVTLDTTRSILVT 250 (794)
Q Consensus 172 ~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~-~~~s~~~~~vg~~~lv~~d~~~g~L~v 250 (794)
--...... +........++-.+++. .+.++|-.+-..---+.+... -++ ..+-+.+..++.... .|.+..
T Consensus 99 gH~aqVNt--V~fNeesSVv~SgsfD~----s~r~wDCRS~s~ePiQildea~D~V--~Si~v~~heIvaGS~-DGtvRt 169 (307)
T KOG0316|consen 99 GHLAQVNT--VRFNEESSVVASGSFDS----SVRLWDCRSRSFEPIQILDEAKDGV--SSIDVAEHEIVAGSV-DGTVRT 169 (307)
T ss_pred cccceeeE--EEecCcceEEEeccccc----eeEEEEcccCCCCccchhhhhcCce--eEEEecccEEEeecc-CCcEEE
Confidence 44332211 11111122223223332 566666554322111111100 011 112234556666653 699999
Q ss_pred EEeecce
Q 003800 251 VSFKNRK 257 (794)
Q Consensus 251 ~~l~sg~ 257 (794)
.|+..|+
T Consensus 170 ydiR~G~ 176 (307)
T KOG0316|consen 170 YDIRKGT 176 (307)
T ss_pred EEeecce
Confidence 9999988
No 49
>PHA02790 Kelch-like protein; Provisional
Probab=94.86 E-value=2.6 Score=49.08 Aligned_cols=167 Identities=10% Similarity=0.066 Sum_probs=91.8
Q ss_pred CCEEEEEeCC------CEEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEcc--CCeEEEEeCCCCcEeEEE
Q 003800 53 RKRVVVSTEE------NVIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSD--GSTLRAWNLPDGQMVWES 122 (794)
Q Consensus 53 ~~~Vyv~t~~------g~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~ 122 (794)
++.||+..+. ..+...|+++++ |+..-+-+. .-.+. +..++.+.++||. ...+..||+.++ .|+.
T Consensus 271 ~~~lyviGG~~~~~~~~~v~~Ydp~~~~--W~~~~~m~~~r~~~~~-v~~~~~iYviGG~~~~~sve~ydp~~n--~W~~ 345 (480)
T PHA02790 271 GEVVYLIGGWMNNEIHNNAIAVNYISNN--WIPIPPMNSPRLYASG-VPANNKLYVVGGLPNPTSVERWFHGDA--AWVN 345 (480)
T ss_pred CCEEEEEcCCCCCCcCCeEEEEECCCCE--EEECCCCCchhhcceE-EEECCEEEEECCcCCCCceEEEECCCC--eEEE
Confidence 6678887653 357788999875 987543321 11122 3467767777764 246888987655 5875
Q ss_pred eccCccccCCccccccccccccCCeEEEEEC-----CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEec
Q 003800 123 FLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-----GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYA 197 (794)
Q Consensus 123 ~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-----g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~ 197 (794)
-..-+.. . .-.. ...-++.+||.++ ..+..+|+.++ .|+...+.+.-...... ..-++.+|++|
T Consensus 346 ~~~l~~~-r--~~~~---~~~~~g~IYviGG~~~~~~~ve~ydp~~~--~W~~~~~m~~~r~~~~~-~~~~~~IYv~G-- 414 (480)
T PHA02790 346 MPSLLKP-R--CNPA---VASINNVIYVIGGHSETDTTTEYLLPNHD--QWQFGPSTYYPHYKSCA-LVFGRRLFLVG-- 414 (480)
T ss_pred CCCCCCC-C--cccE---EEEECCEEEEecCcCCCCccEEEEeCCCC--EEEeCCCCCCccccceE-EEECCEEEEEC--
Confidence 3221111 1 0000 1222677888632 34667887754 79875543321111111 24688999976
Q ss_pred CCceeEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800 198 GSSQFHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL 241 (794)
Q Consensus 198 g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~ 241 (794)
| .+.++|+.++ .|+.--..+....+ .+.++++.++++.
T Consensus 415 G----~~e~ydp~~~--~W~~~~~m~~~r~~~~~~v~~~~IYviG 453 (480)
T PHA02790 415 R----NAEFYCESSN--TWTLIDDPIYPRDNPELIIVDNKLLLIG 453 (480)
T ss_pred C----ceEEecCCCC--cEeEcCCCCCCccccEEEEECCEEEEEC
Confidence 3 4677888765 68764333332322 2333465566554
No 50
>PHA02713 hypothetical protein; Provisional
Probab=94.77 E-value=1.9 Score=51.33 Aligned_cols=173 Identities=9% Similarity=0.101 Sum_probs=96.4
Q ss_pred CCEEEEEeCC-------CEEEEEECcCCccceEEEcCcccce--eeeeeeeCCEEEEEEccC-----CeEEEEeCCCCcE
Q 003800 53 RKRVVVSTEE-------NVIASLDLRHGEIFWRHVLGINDVV--DGIDIALGKYVITLSSDG-----STLRAWNLPDGQM 118 (794)
Q Consensus 53 ~~~Vyv~t~~-------g~l~ALn~~tG~ivWR~~l~~~~~i--~~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~l 118 (794)
++.||+..+. +.+..+|+++.. |+..-+-+..- .+. +..++.+.++||.+ ..+..||+.+.
T Consensus 303 ~~~IYviGG~~~~~~~~~~v~~Yd~~~n~--W~~~~~m~~~R~~~~~-~~~~g~IYviGG~~~~~~~~sve~Ydp~~~-- 377 (557)
T PHA02713 303 DNEIIIAGGYNFNNPSLNKVYKINIENKI--HVELPPMIKNRCRFSL-AVIDDTIYAIGGQNGTNVERTIECYTMGDD-- 377 (557)
T ss_pred CCEEEEEcCCCCCCCccceEEEEECCCCe--EeeCCCCcchhhceeE-EEECCEEEEECCcCCCCCCceEEEEECCCC--
Confidence 7789988763 358889999874 97644322111 122 34577777777742 34889999876
Q ss_pred eEEEeccCccccCCccccccccccccCCeEEEEEC-------------------------CEEEEEECCCCcEEEEEecc
Q 003800 119 VWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-------------------------GCLHAVSSIDGEILWTRDFA 173 (794)
Q Consensus 119 lWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-------------------------g~l~ald~~tG~~~W~~~~~ 173 (794)
.|+.-..-+.. ....+ ...-++.+||.++ ..+.++|+.+. .|+.-.+
T Consensus 378 ~W~~~~~mp~~---r~~~~---~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td--~W~~v~~ 449 (557)
T PHA02713 378 KWKMLPDMPIA---LSSYG---MCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNN--IWETLPN 449 (557)
T ss_pred eEEECCCCCcc---ccccc---EEEECCEEEEEeCCCcccccccccccccccccccccccceEEEECCCCC--eEeecCC
Confidence 58864321111 00011 1122577777642 24778888775 5886554
Q ss_pred Ccce-eeeeEEEEecCCEEEEEEecCC-c--eeEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800 174 AESV-EVQQVIQLDESDQIYVVGYAGS-S--QFHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL 241 (794)
Q Consensus 174 ~~~~-~~~~~v~s~~~~~vyv~~~~g~-~--~~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~ 241 (794)
.+.. ....+ ..-++.+|++|...+ . .-.+.++|+.+ .-.|+.--..|..... .+..+++.+++..
T Consensus 450 m~~~r~~~~~--~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~-~~~W~~~~~m~~~r~~~~~~~~~~~iyv~G 519 (557)
T PHA02713 450 FWTGTIRPGV--VSHKDDIYVVCDIKDEKNVKTCIFRYNTNT-YNGWELITTTESRLSALHTILHDNTIMMLH 519 (557)
T ss_pred CCcccccCcE--EEECCEEEEEeCCCCCCccceeEEEecCCC-CCCeeEccccCcccccceeEEECCEEEEEe
Confidence 3221 11112 246889999874321 1 12467899987 1248875455544443 3333465565544
No 51
>PTZ00421 coronin; Provisional
Probab=94.74 E-value=3.2 Score=48.52 Aligned_cols=154 Identities=15% Similarity=0.135 Sum_probs=82.7
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
.+.++.++.++.|.-.|.++|+.+=.... ....+..+.....+..++.++.++.++.||+.+|+.+.+...........
T Consensus 138 ~~iLaSgs~DgtVrIWDl~tg~~~~~l~~-h~~~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~ 216 (493)
T PTZ00421 138 MNVLASAGADMVVNVWDVERGKAVEVIKC-HSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQR 216 (493)
T ss_pred CCEEEEEeCCCEEEEEECCCCeEEEEEcC-CCCceEEEEEECCCCEEEEecCCCEEEEEECCCCcEEEEEecCCCCcceE
Confidence 35677888999999999999976432211 12234444222334455546667899999999999988766543221000
Q ss_pred ccccccccccccCCeEEEEE-----CCEEEEEECCCCcE-EEEEeccCcceeeeeEEEEecCCEEEEEEe-cCCceeEEE
Q 003800 133 LLLVPTNLKVDKDSLILVSS-----KGCLHAVSSIDGEI-LWTRDFAAESVEVQQVIQLDESDQIYVVGY-AGSSQFHAY 205 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~~-----~g~l~ald~~tG~~-~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~-~g~~~~~v~ 205 (794)
....+ . .+.++... ++.+...|..+... .-......... ...+....+++.+|+.+. +| .+.
T Consensus 217 ~~w~~-----~-~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~~~d~~~~-~~~~~~d~d~~~L~lggkgDg----~Ir 285 (493)
T PTZ00421 217 CLWAK-----R-KDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSA-LFIPFFDEDTNLLYIGSKGEG----NIR 285 (493)
T ss_pred EEEcC-----C-CCeEEEEecCCCCCCeEEEEeCCCCCCceeEeccCCCCc-eEEEEEcCCCCEEEEEEeCCC----eEE
Confidence 11111 1 23333321 46777777765442 22222211111 111111234455665543 33 677
Q ss_pred EEEcCCCceeeee
Q 003800 206 QINAMNGELLNHE 218 (794)
Q Consensus 206 ald~~tG~~~w~~ 218 (794)
.+|..+|++....
T Consensus 286 iwdl~~~~~~~~~ 298 (493)
T PTZ00421 286 CFELMNERLTFCS 298 (493)
T ss_pred EEEeeCCceEEEe
Confidence 8888888876554
No 52
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=94.66 E-value=2 Score=44.85 Aligned_cols=106 Identities=14% Similarity=0.187 Sum_probs=77.5
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
++.++-+++++.|---|-+||.++=+..++.+ +..+.+...+++++++ +|+.|.-||+.+=.++=++.+.-.+. +
T Consensus 155 D~~iLSSadd~tVRLWD~rTgt~v~sL~~~s~--VtSlEvs~dG~ilTia-~gssV~Fwdaksf~~lKs~k~P~nV~--S 229 (334)
T KOG0278|consen 155 DKCILSSADDKTVRLWDHRTGTEVQSLEFNSP--VTSLEVSQDGRILTIA-YGSSVKFWDAKSFGLLKSYKMPCNVE--S 229 (334)
T ss_pred CceEEeeccCCceEEEEeccCcEEEEEecCCC--CcceeeccCCCEEEEe-cCceeEEeccccccceeeccCccccc--c
Confidence 44566667888888899999999988777665 4455445566677644 56789999999999998888765443 2
Q ss_pred ccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEE
Q 003800 133 LLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTR 170 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~ 170 (794)
+.+-| .+.+||. .+..++.+|-.||+.+=.+
T Consensus 230 ASL~P-------~k~~fVaGged~~~~kfDy~TgeEi~~~ 262 (334)
T KOG0278|consen 230 ASLHP-------KKEFFVAGGEDFKVYKFDYNTGEEIGSY 262 (334)
T ss_pred ccccC-------CCceEEecCcceEEEEEeccCCceeeec
Confidence 22222 4567775 3889999999999877665
No 53
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=94.43 E-value=3.9 Score=52.36 Aligned_cols=200 Identities=11% Similarity=0.112 Sum_probs=102.4
Q ss_pred CCEEEEEeCC-CEEEEEECcCCccceEEE-------cCcc--c------ceeeeeeeeCCEEEEEEc-cCCeEEEEeCCC
Q 003800 53 RKRVVVSTEE-NVIASLDLRHGEIFWRHV-------LGIN--D------VVDGIDIALGKYVITLSS-DGSTLRAWNLPD 115 (794)
Q Consensus 53 ~~~Vyv~t~~-g~l~ALn~~tG~ivWR~~-------l~~~--~------~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~t 115 (794)
++.|||+... +.|.-+|..+|.+.=-.. .... . ...++.....++.++|+. .+++|+-||..+
T Consensus 635 gn~LYVaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~~I~v~d~~~ 714 (1057)
T PLN02919 635 KNLLYVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQHQIWEYNISD 714 (1057)
T ss_pred CCEEEEEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCCeEEEEECCC
Confidence 4568887764 568888887775310000 0000 0 001221222245566653 457899999999
Q ss_pred CcEeEEEeccCcc------ccC-CccccccccccccC-CeEEEEE--CCEEEEEECCCCcEEEEEecc------------
Q 003800 116 GQMVWESFLRGSK------HSK-PLLLVPTNLKVDKD-SLILVSS--KGCLHAVSSIDGEILWTRDFA------------ 173 (794)
Q Consensus 116 G~llWe~~l~~~~------~s~-~~~~~~~~~~~~~~-~~V~V~~--~g~l~ald~~tG~~~W~~~~~------------ 173 (794)
|...- ....+.. ... .....|..+..+.+ +.+||.. +++|+.+|..+|...|.....
T Consensus 715 g~v~~-~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~ 793 (1057)
T PLN02919 715 GVTRV-FSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGD 793 (1057)
T ss_pred CeEEE-EecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEEecccccCcccccccC
Confidence 87541 1111000 000 00001111122323 3477763 789999999988866543100
Q ss_pred --Cc----ce-eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeec---------ccCccC--ceEEEcC
Q 003800 174 --AE----SV-EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAF---------SGGFVG--DVALVSS 235 (794)
Q Consensus 174 --~~----~~-~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~---------~~~~s~--~~~~vg~ 235 (794)
.. .+ .|..+. ...++.+|+....++ ++..+|+.+|....-...+. ...+.. .+.+..+
T Consensus 794 ~dG~g~~~~l~~P~Gva-vd~dG~LYVADs~N~---rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~d 869 (1057)
T PLN02919 794 HDGVGSEVLLQHPLGVL-CAKDGQIYVADSYNH---KIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGEN 869 (1057)
T ss_pred CCCchhhhhccCCceee-EeCCCcEEEEECCCC---EEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCC
Confidence 00 00 133332 234567888665444 78889998887764332111 111111 1222233
Q ss_pred cEEEEEECCCCeEEEEEeecce
Q 003800 236 DTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 236 ~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+.++.+|..++.++++|+.+++
T Consensus 870 G~lyVaDt~Nn~Irvid~~~~~ 891 (1057)
T PLN02919 870 GRLFVADTNNSLIRYLDLNKGE 891 (1057)
T ss_pred CCEEEEECCCCEEEEEECCCCc
Confidence 4456778888889999998876
No 54
>PLN00181 protein SPA1-RELATED; Provisional
Probab=94.21 E-value=20 Score=44.61 Aligned_cols=190 Identities=18% Similarity=0.108 Sum_probs=96.6
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEE------EcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRH------VLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~------~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l 124 (794)
+++.+.+++.++.|.-.|..+...-++. .+.....+..+... ..+..++.++.++.|+.||..+|+.+.+...
T Consensus 494 dg~~latgg~D~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~Dg~v~lWd~~~~~~~~~~~~ 573 (793)
T PLN00181 494 DGEFFATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKE 573 (793)
T ss_pred CCCEEEEEeCCCEEEEEECCcccccccccccceEEecccCceeeEEeccCCCCEEEEEeCCCeEEEEECCCCeEEEEecC
Confidence 3556778888898888886542111110 01111112222111 1234455466678999999999999988765
Q ss_pred cCccccCCccccccccccccCCeEEE-E-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCcee
Q 003800 125 RGSKHSKPLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQF 202 (794)
Q Consensus 125 ~~~~~s~~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~ 202 (794)
....+ ..+.+.+ . ++.+++ . .+|.+...|..+|...-+...... ...+.....++..++.+...+
T Consensus 574 H~~~V-~~l~~~p-----~-~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~~~~~---v~~v~~~~~~g~~latgs~dg--- 640 (793)
T PLN00181 574 HEKRV-WSIDYSS-----A-DPTLLASGSDDGSVKLWSINQGVSIGTIKTKAN---ICCVQFPSESGRSLAFGSADH--- 640 (793)
T ss_pred CCCCE-EEEEEcC-----C-CCCEEEEEcCCCEEEEEECCCCcEEEEEecCCC---eEEEEEeCCCCCEEEEEeCCC---
Confidence 54322 1111111 1 334444 3 389999999998876655443221 111111123344444443322
Q ss_pred EEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecc
Q 003800 203 HAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNR 256 (794)
Q Consensus 203 ~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg 256 (794)
.+..+|..+++.......+-...+. .+.+..+..+++.. ..+.+.+-|+..+
T Consensus 641 ~I~iwD~~~~~~~~~~~~~h~~~V~-~v~f~~~~~lvs~s-~D~~ikiWd~~~~ 692 (793)
T PLN00181 641 KVYYYDLRNPKLPLCTMIGHSKTVS-YVRFVDSSTLVSSS-TDNTLKLWDLSMS 692 (793)
T ss_pred eEEEEECCCCCccceEecCCCCCEE-EEEEeCCCEEEEEE-CCCEEEEEeCCCC
Confidence 7888898877532211111111121 12223344555554 3577888777643
No 55
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=94.16 E-value=17 Score=43.59 Aligned_cols=66 Identities=11% Similarity=-0.003 Sum_probs=45.7
Q ss_pred ecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEc-CcEEEEEE-CCCCeEEEEEeecce
Q 003800 186 DESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVS-SDTLVTLD-TTRSILVTVSFKNRK 257 (794)
Q Consensus 186 ~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg-~~~lv~~d-~~~g~L~v~~l~sg~ 257 (794)
..+..++-.+++| .|.|.|.+.++--.+.+ +|..++-+|+-+. .+.+||+- .+.=.+++-++++|+
T Consensus 402 ~~g~~llssSLDG----tVRAwDlkRYrNfRTft--~P~p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGq 469 (893)
T KOG0291|consen 402 ARGNVLLSSSLDG----TVRAWDLKRYRNFRTFT--SPEPIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQ 469 (893)
T ss_pred ecCCEEEEeecCC----eEEeeeecccceeeeec--CCCceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCe
Confidence 4566777778888 89999999988776664 4555555777763 34555653 222257888888888
No 56
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=94.01 E-value=17 Score=43.26 Aligned_cols=193 Identities=14% Similarity=0.234 Sum_probs=107.9
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccc--eEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIF--WRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH 129 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~iv--WR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~ 129 (794)
++..+|++.....+.-.+..+|+.+ |+..-+.+ +..+.....+.++..+|.++.++.||...|...=..+..++..
T Consensus 73 d~~~L~~a~rs~llrv~~L~tgk~irswKa~He~P--vi~ma~~~~g~LlAtggaD~~v~VWdi~~~~~th~fkG~gGvV 150 (775)
T KOG0319|consen 73 DEEVLVTASRSQLLRVWSLPTGKLIRSWKAIHEAP--VITMAFDPTGTLLATGGADGRVKVWDIKNGYCTHSFKGHGGVV 150 (775)
T ss_pred CccEEEEeeccceEEEEEcccchHhHhHhhccCCC--eEEEEEcCCCceEEeccccceEEEEEeeCCEEEEEecCCCceE
Confidence 4677899999998888888999654 55433333 3233223344566656667899999999998888877765554
Q ss_pred cCCccccccccccccCCeEEE-E-ECCEEEEEECCCCcE----------------------------------EEEEecc
Q 003800 130 SKPLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGEI----------------------------------LWTRDFA 173 (794)
Q Consensus 130 s~~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~~----------------------------------~W~~~~~ 173 (794)
..+.+-+ +....+++ . .|+.+++.|..++.. +|.+..-
T Consensus 151 -ssl~F~~-----~~~~~lL~sg~~D~~v~vwnl~~~~tcl~~~~~H~S~vtsL~~~~d~~~~ls~~RDkvi~vwd~~~~ 224 (775)
T KOG0319|consen 151 -SSLLFHP-----HWNRWLLASGATDGTVRVWNLNDKRTCLHTMILHKSAVTSLAFSEDSLELLSVGRDKVIIVWDLVQY 224 (775)
T ss_pred -EEEEeCC-----ccchhheeecCCCceEEEEEcccCchHHHHHHhhhhheeeeeeccCCceEEEeccCcEEEEeehhhh
Confidence 2222222 10111222 2 266666666654433 3444211
Q ss_pred Ccc--e----eeeeEEEEec-----CCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEE
Q 003800 174 AES--V----EVQQVIQLDE-----SDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLD 242 (794)
Q Consensus 174 ~~~--~----~~~~~v~s~~-----~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d 242 (794)
... + ..+.++.... +..++.+|..| .+--+|+++|+.+...+.+....+..-..+.+.+.+.++.
T Consensus 225 ~~l~~lp~ye~~E~vv~l~~~~~~~~~~~~TaG~~g----~~~~~d~es~~~~~~~~~~~~~e~~~~~~~~~~~~~l~vt 300 (775)
T KOG0319|consen 225 KKLKTLPLYESLESVVRLREELGGKGEYIITAGGSG----VVQYWDSESGKCVYKQRQSDSEEIDHLLAIESMSQLLLVT 300 (775)
T ss_pred hhhheechhhheeeEEEechhcCCcceEEEEecCCc----eEEEEecccchhhhhhccCCchhhhcceeccccCceEEEE
Confidence 100 0 0111221111 22444444444 7888899999988776544222254444445556666665
Q ss_pred CCCCeEEEEEeecce
Q 003800 243 TTRSILVTVSFKNRK 257 (794)
Q Consensus 243 ~~~g~L~v~~l~sg~ 257 (794)
++ -+|..+|..+.+
T Consensus 301 ae-Qnl~l~d~~~l~ 314 (775)
T KOG0319|consen 301 AE-QNLFLYDEDELT 314 (775)
T ss_pred cc-ceEEEEEccccE
Confidence 43 467777877766
No 57
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=94.00 E-value=11 Score=41.06 Aligned_cols=197 Identities=15% Similarity=0.169 Sum_probs=92.6
Q ss_pred cCCCEEEEEeC-CCEEEEEECc-CCccceEEEcCcccceeeeeeeeCCEEEEEEc-cCCeEEEEeCC-CCcEe-EEEecc
Q 003800 51 TGRKRVVVSTE-ENVIASLDLR-HGEIFWRHVLGINDVVDGIDIALGKYVITLSS-DGSTLRAWNLP-DGQMV-WESFLR 125 (794)
Q Consensus 51 ~~~~~Vyv~t~-~g~l~ALn~~-tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~-tG~ll-We~~l~ 125 (794)
++++.+|+++. ++.|..++.. +|++.=.......+....+.....+..++++. .++.+..||.+ +|.+. -.....
T Consensus 44 pd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~ 123 (330)
T PRK11028 44 PDKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPGSPTHISTDHQGRFLFSASYNANCVSVSPLDKDGIPVAPIQIIE 123 (330)
T ss_pred CCCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCCCceEEEECCCCCEEEEEEcCCCeEEEEEECCCCCCCCceeecc
Confidence 34667898875 5667666554 56531111111111122232223455666654 35789999886 45321 111111
Q ss_pred CccccCCccccccccccccCCeEEEEE--CCEEEEEECCC-CcEE----EEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800 126 GSKHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSID-GEIL----WTRDFAAESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 126 ~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~t-G~~~----W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
... .+..+.- ..+ ++.++|.. ++.|..+|..+ |... .....+.+. .|..+....++..+|++.. +
T Consensus 124 ~~~--~~~~~~~---~p~-g~~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~g~-~p~~~~~~pdg~~lyv~~~-~ 195 (330)
T PRK11028 124 GLE--GCHSANI---DPD-NRTLWVPCLKEDRIRLFTLSDDGHLVAQEPAEVTTVEGA-GPRHMVFHPNQQYAYCVNE-L 195 (330)
T ss_pred CCC--cccEeEe---CCC-CCEEEEeeCCCCEEEEEEECCCCcccccCCCceecCCCC-CCceEEECCCCCEEEEEec-C
Confidence 110 0011100 112 35666653 68888888765 4321 222222111 2444543345557777553 3
Q ss_pred CceeEEEEEEcCCCceeeeeeee-cccCccC-----ceEE-EcCcEEEEEECCCCeEEEEEeec
Q 003800 199 SSQFHAYQINAMNGELLNHETAA-FSGGFVG-----DVAL-VSSDTLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 199 ~~~~~v~ald~~tG~~~w~~~v~-~~~~~s~-----~~~~-vg~~~lv~~d~~~g~L~v~~l~s 255 (794)
+..+.++.++..+|+......+. .|....+ .+.+ ..+..+++.+...+.+.+.++.+
T Consensus 196 ~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~ 259 (330)
T PRK11028 196 NSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSE 259 (330)
T ss_pred CCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeC
Confidence 33455555655567654433332 2322211 1222 13445556666567788888754
No 58
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=93.91 E-value=12 Score=40.94 Aligned_cols=188 Identities=12% Similarity=0.147 Sum_probs=90.7
Q ss_pred EEEEEeC-CCEEEEEECcC-CccceEEEcCcccceeeeeeeeCCEEEEEEc-cCCeEEEEeCC-CCcEeEEEeccCcccc
Q 003800 55 RVVVSTE-ENVIASLDLRH-GEIFWRHVLGINDVVDGIDIALGKYVITLSS-DGSTLRAWNLP-DGQMVWESFLRGSKHS 130 (794)
Q Consensus 55 ~Vyv~t~-~g~l~ALn~~t-G~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~-tG~llWe~~l~~~~~s 130 (794)
++|+++. ++.|..+|..+ |++.=.+.++..+....+....++..+++++ ..+.+..|+.. +|++.=........
T Consensus 3 ~~y~~~~~~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~-- 80 (330)
T PRK11028 3 IVYIASPESQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPG-- 80 (330)
T ss_pred EEEEEcCCCCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCC--
Confidence 4788854 67788888864 5433223343322222332233455666654 35678888875 56542111111110
Q ss_pred CCcccccccccccc-CCeEEEEE--CCEEEEEECC-CCcE---EEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeE
Q 003800 131 KPLLLVPTNLKVDK-DSLILVSS--KGCLHAVSSI-DGEI---LWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFH 203 (794)
Q Consensus 131 ~~~~~~~~~~~~~~-~~~V~V~~--~g~l~ald~~-tG~~---~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~ 203 (794)
.+..+ ..+. ++.+++.. ++.+..++.. +|.. .-.. +... .+..+....++..+|+.+...+ .
T Consensus 81 -~p~~i----~~~~~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~--~~~~-~~~~~~~~p~g~~l~v~~~~~~---~ 149 (330)
T PRK11028 81 -SPTHI----STDHQGRFLFSASYNANCVSVSPLDKDGIPVAPIQII--EGLE-GCHSANIDPDNRTLWVPCLKED---R 149 (330)
T ss_pred -CceEE----EECCCCCEEEEEEcCCCeEEEEEECCCCCCCCceeec--cCCC-cccEeEeCCCCCEEEEeeCCCC---E
Confidence 11111 1122 34566653 7888888775 4422 2211 1110 1223322334557777665433 5
Q ss_pred EEEEEcCC-Cceeee--eeeecccCcc-CceEEE-cCcEEEEEECCCCeEEEEEeec
Q 003800 204 AYQINAMN-GELLNH--ETAAFSGGFV-GDVALV-SSDTLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 204 v~ald~~t-G~~~w~--~~v~~~~~~s-~~~~~v-g~~~lv~~d~~~g~L~v~~l~s 255 (794)
+..+|..+ |...-. ..+..+.+-. ..+.+- ++..+++.+...+.+.+.++..
T Consensus 150 v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~ 206 (330)
T PRK11028 150 IRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKD 206 (330)
T ss_pred EEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeC
Confidence 66666655 543211 1111221111 122222 4456667776678999999873
No 59
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=93.75 E-value=5.2 Score=46.96 Aligned_cols=153 Identities=14% Similarity=0.172 Sum_probs=92.6
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEE-EEEEccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYV-ITLSSDGSTLRAWNLPDGQMVWESFLRGSKH 129 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~-V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~ 129 (794)
..+.+-++.++|++.-++-..|.+.-...|... ..+-.+. -...+. ++.|..++.+|+||+..|..+-.....-..+
T Consensus 121 ~~~~l~IgcddGvl~~~s~~p~~I~~~r~l~rq~sRvLsls-w~~~~~~i~~Gs~Dg~Iriwd~~~~~t~~~~~~~~d~l 199 (691)
T KOG2048|consen 121 ENTILAIGCDDGVLYDFSIGPDKITYKRSLMRQKSRVLSLS-WNPTGTKIAGGSIDGVIRIWDVKSGQTLHIITMQLDRL 199 (691)
T ss_pred ccceEEeecCCceEEEEecCCceEEEEeecccccceEEEEE-ecCCccEEEecccCceEEEEEcCCCceEEEeeeccccc
Confidence 356788899999999999999999999999766 2333331 233344 4545567889999999999887433322222
Q ss_pred cCCccccccccccccCCeEEEEECCEEEEEECCCCcE-EEEEeccCcce-------eeeeEEEEecCCEEEEEEecCCce
Q 003800 130 SKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEI-LWTRDFAAESV-------EVQQVIQLDESDQIYVVGYAGSSQ 201 (794)
Q Consensus 130 s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~-~W~~~~~~~~~-------~~~~~v~s~~~~~vyv~~~~g~~~ 201 (794)
+..-+.+. =.|.++.++.+.+-| .+|.+ -|-.....-.- ....+.-+...+.++..|.++
T Consensus 200 ~k~~~~iV--------WSv~~Lrd~tI~sgD-S~G~V~FWd~~~gTLiqS~~~h~adVl~Lav~~~~d~vfsaGvd~--- 267 (691)
T KOG2048|consen 200 SKREPTIV--------WSVLFLRDSTIASGD-SAGTVTFWDSIFGTLIQSHSCHDADVLALAVADNEDRVFSAGVDP--- 267 (691)
T ss_pred ccCCceEE--------EEEEEeecCcEEEec-CCceEEEEcccCcchhhhhhhhhcceeEEEEcCCCCeEEEccCCC---
Confidence 11011110 113334577777777 45654 36544332100 011121123457888888887
Q ss_pred eEEEEEEcCCCceeeee
Q 003800 202 FHAYQINAMNGELLNHE 218 (794)
Q Consensus 202 ~~v~ald~~tG~~~w~~ 218 (794)
++.-+...++..-|..
T Consensus 268 -~ii~~~~~~~~~~wv~ 283 (691)
T KOG2048|consen 268 -KIIQYSLTTNKSEWVI 283 (691)
T ss_pred -ceEEEEecCCccceee
Confidence 7888888877666766
No 60
>PHA03098 kelch-like protein; Provisional
Probab=93.74 E-value=3.8 Score=48.30 Aligned_cols=189 Identities=12% Similarity=0.122 Sum_probs=97.8
Q ss_pred CCEEEEEeCC-------CEEEEEECcCCccceEEEcCcccc--eeeeeeeeCCEEEEEEccC-----CeEEEEeCCCCcE
Q 003800 53 RKRVVVSTEE-------NVIASLDLRHGEIFWRHVLGINDV--VDGIDIALGKYVITLSSDG-----STLRAWNLPDGQM 118 (794)
Q Consensus 53 ~~~Vyv~t~~-------g~l~ALn~~tG~ivWR~~l~~~~~--i~~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~l 118 (794)
++.||+..+. +.+..+|+.+++ |+..-+-+.. -.+. +..++.++++||.+ ..+..||+.++
T Consensus 294 ~~~lyv~GG~~~~~~~~~~v~~yd~~~~~--W~~~~~~~~~R~~~~~-~~~~~~lyv~GG~~~~~~~~~v~~yd~~~~-- 368 (534)
T PHA03098 294 NNVIYFIGGMNKNNLSVNSVVSYDTKTKS--WNKVPELIYPRKNPGV-TVFNNRIYVIGGIYNSISLNTVESWKPGES-- 368 (534)
T ss_pred CCEEEEECCCcCCCCeeccEEEEeCCCCe--eeECCCCCcccccceE-EEECCEEEEEeCCCCCEecceEEEEcCCCC--
Confidence 6678876642 358899998874 8654322211 1122 34566677777743 35788998766
Q ss_pred eEEEeccCccccCCccccccccccccCCeEEEEEC--------CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCE
Q 003800 119 VWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK--------GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQ 190 (794)
Q Consensus 119 lWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~--------g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~ 190 (794)
.|+....-+.. ..... ....++.+++.++ ..+..+|..++ .|+.-.+.+.-...... ...++.
T Consensus 369 ~W~~~~~lp~~-----r~~~~-~~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~--~W~~~~~~p~~r~~~~~-~~~~~~ 439 (534)
T PHA03098 369 KWREEPPLIFP-----RYNPC-VVNVNNLIYVIGGISKNDELLKTVECFSLNTN--KWSKGSPLPISHYGGCA-IYHDGK 439 (534)
T ss_pred ceeeCCCcCcC-----Cccce-EEEECCEEEEECCcCCCCcccceEEEEeCCCC--eeeecCCCCccccCceE-EEECCE
Confidence 48754322111 11100 1112567777632 45788888775 58765443321001111 245789
Q ss_pred EEEEEecCCc-----eeEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEEECC----CCeEEEEEeecce
Q 003800 191 IYVVGYAGSS-----QFHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTLDTT----RSILVTVSFKNRK 257 (794)
Q Consensus 191 vyv~~~~g~~-----~~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~d~~----~g~L~v~~l~sg~ 257 (794)
+|++|..... --.+.++|+.++ .|+..-..+....+ .....++.++++.-.. .+.+.+.|..+++
T Consensus 440 iyv~GG~~~~~~~~~~~~v~~yd~~~~--~W~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~~~~ 514 (534)
T PHA03098 440 IYVIGGISYIDNIKVYNIVESYNPVTN--KWTELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNT 514 (534)
T ss_pred EEEECCccCCCCCcccceEEEecCCCC--ceeeCCCCCcccccceEEEECCEEEEEcCCcCCcccceeEEEeCCCCE
Confidence 9988743211 124888998876 46653222222222 2222354555443211 2356667766665
No 61
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate.; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=93.56 E-value=3.4 Score=48.15 Aligned_cols=115 Identities=10% Similarity=0.181 Sum_probs=62.1
Q ss_pred eCCEEEEEEc----cCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEE
Q 003800 94 LGKYVITLSS----DGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWT 169 (794)
Q Consensus 94 ~g~~~V~Vs~----~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~ 169 (794)
..+++.++.. .....+++|. +|.++|......... ...... .++.+++.....+..+|. .|++.|+
T Consensus 112 ~~~gl~~~~~~~~~~~~~~~~iD~-~G~Vrw~~~~~~~~~-~~~~~l-------~nG~ll~~~~~~~~e~D~-~G~v~~~ 181 (477)
T PF05935_consen 112 MEDGLYFVNGNDWDSSSYTYLIDN-NGDVRWYLPLDSGSD-NSFKQL-------PNGNLLIGSGNRLYEIDL-LGKVIWE 181 (477)
T ss_dssp -TT-EEEEEETT--BEEEEEEEET-TS-EEEEE-GGGT---SSEEE--------TTS-EEEEEBTEEEEE-T-T--EEEE
T ss_pred cCCcEEEEeCCCCCCCceEEEECC-CccEEEEEccCcccc-ceeeEc-------CCCCEEEecCCceEEEcC-CCCEEEe
Confidence 3566777766 4568999996 799999999876532 111222 267777777899999996 5999999
Q ss_pred EeccCccee-eeeEEEEecCCEEEEEEec-------CC---ceeEEEEEEcCCCceeeeeee
Q 003800 170 RDFAAESVE-VQQVIQLDESDQIYVVGYA-------GS---SQFHAYQINAMNGELLNHETA 220 (794)
Q Consensus 170 ~~~~~~~~~-~~~~v~s~~~~~vyv~~~~-------g~---~~~~v~ald~~tG~~~w~~~v 220 (794)
++.+..... .=.+.. ..+|.+.+++.. .+ ..=.+..+| .+|+++|+...
T Consensus 182 ~~l~~~~~~~HHD~~~-l~nGn~L~l~~~~~~~~~~~~~~~~~D~Ivevd-~tG~vv~~wd~ 241 (477)
T PF05935_consen 182 YDLPGGYYDFHHDIDE-LPNGNLLILASETKYVDEDKDVDTVEDVIVEVD-PTGEVVWEWDF 241 (477)
T ss_dssp EE--TTEE-B-S-EEE--TTS-EEEEEEETTEE-TS-EE---S-EEEEE--TTS-EEEEEEG
T ss_pred eecCCcccccccccEE-CCCCCEEEEEeecccccCCCCccEecCEEEEEC-CCCCEEEEEeh
Confidence 999874310 000111 233344433331 10 011588899 99999999875
No 62
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes (EC 2.3.2.5) catalyses the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues, respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=93.54 E-value=3.8 Score=43.64 Aligned_cols=154 Identities=14% Similarity=0.142 Sum_probs=86.4
Q ss_pred CCEEEEEEc--cC-CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE--CCEEEEEECCCCcEEEE
Q 003800 95 GKYVITLSS--DG-STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWT 169 (794)
Q Consensus 95 g~~~V~Vs~--~g-~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~ 169 (794)
+++.++.|+ .| +.||-+|..+|+.+.+..+.......++ +.. ++.++.++ .+..+.+|+.+-+++=+
T Consensus 54 ~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGi-------t~~-~d~l~qLTWk~~~~f~yd~~tl~~~~~ 125 (264)
T PF05096_consen 54 DDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGI-------TIL-GDKLYQLTWKEGTGFVYDPNTLKKIGT 125 (264)
T ss_dssp ETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEE-------EEE-TTEEEEEESSSSEEEEEETTTTEEEEE
T ss_pred CCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeE-------EEE-CCEEEEEEecCCeEEEEccccceEEEE
Confidence 445566643 23 6899999999999999999875442222 222 56788874 89999999999988877
Q ss_pred EeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeeccc----CccCceEEEcCcEEEEEECCC
Q 003800 170 RDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSG----GFVGDVALVSSDTLVTLDTTR 245 (794)
Q Consensus 170 ~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~----~~s~~~~~vg~~~lv~~d~~~ 245 (794)
++.+...+ .+. .++..+++ + +|++ +++-+|+++-+...+.++.... .+.+ .-++++.+++=+- .+
T Consensus 126 ~~y~~EGW---GLt--~dg~~Li~-S-DGS~--~L~~~dP~~f~~~~~i~V~~~g~pv~~LNE-LE~i~G~IyANVW-~t 194 (264)
T PF05096_consen 126 FPYPGEGW---GLT--SDGKRLIM-S-DGSS--RLYFLDPETFKEVRTIQVTDNGRPVSNLNE-LEYINGKIYANVW-QT 194 (264)
T ss_dssp EE-SSS-----EEE--ECSSCEEE-E--SSS--EEEEE-TTT-SEEEEEE-EETTEE---EEE-EEEETTEEEEEET-TS
T ss_pred EecCCcce---EEE--cCCCEEEE-E-CCcc--ceEEECCcccceEEEEEEEECCEECCCcEe-EEEEcCEEEEEeC-CC
Confidence 77665433 221 34555554 2 4433 7888999998888777654321 1111 1223333333222 23
Q ss_pred CeEEEEEeecceeeeEEEeeccc
Q 003800 246 SILVTVSFKNRKIAFQETHLSNL 268 (794)
Q Consensus 246 g~L~v~~l~sg~~~~~~~~l~~l 268 (794)
..+.++|.++|.+ ...+.++.|
T Consensus 195 d~I~~Idp~tG~V-~~~iDls~L 216 (264)
T PF05096_consen 195 DRIVRIDPETGKV-VGWIDLSGL 216 (264)
T ss_dssp SEEEEEETTT-BE-EEEEE-HHH
T ss_pred CeEEEEeCCCCeE-EEEEEhhHh
Confidence 4566777777763 344444443
No 63
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=93.45 E-value=13 Score=40.63 Aligned_cols=162 Identities=9% Similarity=0.040 Sum_probs=78.2
Q ss_pred ecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCC------CEEEEEECcCCcc--ceEEEcCccccee-eeeeeeC
Q 003800 25 EDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEE------NVIASLDLRHGEI--FWRHVLGINDVVD-GIDIALG 95 (794)
Q Consensus 25 edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~------g~l~ALn~~tG~i--vWR~~l~~~~~i~-~l~~~~g 95 (794)
.++..+..|+..- ..|.....+....-++.||+.... +.+..+|+.+.+- .|+..-+-+.... ......+
T Consensus 45 ~~~~~~~~W~~~~-~lp~~r~~~~~~~~~~~lyviGG~~~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~ 123 (323)
T TIGR03548 45 KDENSNLKWVKDG-QLPYEAAYGASVSVENGIYYIGGSNSSERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKD 123 (323)
T ss_pred ecCCCceeEEEcc-cCCccccceEEEEECCEEEEEcCCCCCCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEEC
Confidence 3445556787621 222211111122226778887653 4688889888752 4554322221111 1112345
Q ss_pred CEEEEEEcc-----CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC------CEEEEEECCCC
Q 003800 96 KYVITLSSD-----GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK------GCLHAVSSIDG 164 (794)
Q Consensus 96 ~~~V~Vs~~-----g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~------g~l~ald~~tG 164 (794)
+.+++++|. -..+..||+.+. .|+..-.-+........ ....++.++|..+ ..+.++|..+.
T Consensus 124 ~~iYv~GG~~~~~~~~~v~~yd~~~~--~W~~~~~~p~~~r~~~~-----~~~~~~~iYv~GG~~~~~~~~~~~yd~~~~ 196 (323)
T TIGR03548 124 GTLYVGGGNRNGKPSNKSYLFNLETQ--EWFELPDFPGEPRVQPV-----CVKLQNELYVFGGGSNIAYTDGYKYSPKKN 196 (323)
T ss_pred CEEEEEeCcCCCccCceEEEEcCCCC--CeeECCCCCCCCCCcce-----EEEECCEEEEEcCCCCccccceEEEecCCC
Confidence 555555653 146899998765 48864321110011001 1112567777642 23568888765
Q ss_pred cEEEEEeccCcce-eeee----EEEEecCCEEEEEEe
Q 003800 165 EILWTRDFAAESV-EVQQ----VIQLDESDQIYVVGY 196 (794)
Q Consensus 165 ~~~W~~~~~~~~~-~~~~----~v~s~~~~~vyv~~~ 196 (794)
.|+.-.+.+.. .|.. ......++.+|++|-
T Consensus 197 --~W~~~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG 231 (323)
T TIGR03548 197 --QWQKVADPTTDSEPISLLGAASIKINESLLLCIGG 231 (323)
T ss_pred --eeEECCCCCCCCCceeccceeEEEECCCEEEEECC
Confidence 58764432110 1111 111234788998764
No 64
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=93.45 E-value=13 Score=44.32 Aligned_cols=110 Identities=14% Similarity=0.143 Sum_probs=71.5
Q ss_pred CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E-CCEEEEEECCCCcEEEEEec
Q 003800 95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEILWTRDF 172 (794)
Q Consensus 95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~W~~~~ 172 (794)
++..++-|+++++|..||...|-=.=.+.-+.... ....+ ...+.+++- + ||+|.|.|...++--=++..
T Consensus 361 Dgq~iaTG~eDgKVKvWn~~SgfC~vTFteHts~V-t~v~f-------~~~g~~llssSLDGtVRAwDlkRYrNfRTft~ 432 (893)
T KOG0291|consen 361 DGQLIATGAEDGKVKVWNTQSGFCFVTFTEHTSGV-TAVQF-------TARGNVLLSSSLDGTVRAWDLKRYRNFRTFTS 432 (893)
T ss_pred CCcEEEeccCCCcEEEEeccCceEEEEeccCCCce-EEEEE-------EecCCEEEEeecCCeEEeeeecccceeeeecC
Confidence 33344446678899999999998777766554332 11111 224555554 4 99999999999988778877
Q ss_pred cCcceeeeeEEEEec--CCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800 173 AAESVEVQQVIQLDE--SDQIYVVGYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 173 ~~~~~~~~~~v~s~~--~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
|.+.. ..++ +.+ +..|++.+. .++.++..+.+||+.+.-.
T Consensus 433 P~p~Q--fscv-avD~sGelV~AG~~---d~F~IfvWS~qTGqllDiL 474 (893)
T KOG0291|consen 433 PEPIQ--FSCV-AVDPSGELVCAGAQ---DSFEIFVWSVQTGQLLDIL 474 (893)
T ss_pred CCcee--eeEE-EEcCCCCEEEeecc---ceEEEEEEEeecCeeeehh
Confidence 76542 2333 222 445554332 2468888999999988544
No 65
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=93.44 E-value=13 Score=40.02 Aligned_cols=217 Identities=17% Similarity=0.199 Sum_probs=118.7
Q ss_pred ccceeecccccE---eeEEeccCceeeeeeeeeccCCCEEEEEeC--CCEEEEEECcCCccceEEEcCcccceeeeeeee
Q 003800 20 SLSLYEDQVGLM---DWHQQYIGKVKHAVFHTQKTGRKRVVVSTE--ENVIASLDLRHGEIFWRHVLGINDVVDGIDIAL 94 (794)
Q Consensus 20 ~~Al~edqvG~~---dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~--~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~ 94 (794)
+.-||....|+. .-+++| |. ....|.. .+..++-+|. +..|--|+..|-+-+ |+=-+....+..+.+.-
T Consensus 37 sl~LYd~~~g~~~~ti~skky-G~-~~~~Fth---~~~~~i~sStk~d~tIryLsl~dNkyl-RYF~GH~~~V~sL~~sP 110 (311)
T KOG1446|consen 37 SLRLYDSLSGKQVKTINSKKY-GV-DLACFTH---HSNTVIHSSTKEDDTIRYLSLHDNKYL-RYFPGHKKRVNSLSVSP 110 (311)
T ss_pred eEEEEEcCCCceeeEeecccc-cc-cEEEEec---CCceEEEccCCCCCceEEEEeecCceE-EEcCCCCceEEEEEecC
Confidence 456787777773 222233 22 2233442 2556666665 678999998886543 22222223344443333
Q ss_pred CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE-CC-EEEEEECC--CCcEEEEE
Q 003800 95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS-KG-CLHAVSSI--DGEILWTR 170 (794)
Q Consensus 95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g-~l~ald~~--tG~~~W~~ 170 (794)
.++.+.-++.+.+||.||...=+-.=-..+.+ .++. +-+..+.+++.. ++ .+.-+|.. ++.+-=++
T Consensus 111 ~~d~FlS~S~D~tvrLWDlR~~~cqg~l~~~~------~pi~----AfDp~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf 180 (311)
T KOG1446|consen 111 KDDTFLSSSLDKTVRLWDLRVKKCQGLLNLSG------RPIA----AFDPEGLIFALANGSELIKLYDLRSFDKGPFTTF 180 (311)
T ss_pred CCCeEEecccCCeEEeeEecCCCCceEEecCC------Ccce----eECCCCcEEEEecCCCeEEEEEecccCCCCceeE
Confidence 44555535567899999986332221112221 1222 234457777764 33 56666665 34444444
Q ss_pred eccCcc-eeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeeccc-CccCceEE-EcCcEEEEEECCCCe
Q 003800 171 DFAAES-VEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSG-GFVGDVAL-VSSDTLVTLDTTRSI 247 (794)
Q Consensus 171 ~~~~~~-~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~-~~s~~~~~-vg~~~lv~~d~~~g~ 247 (794)
..+.+. .+...+-. ..+|+..+++-.++ .++.+|+-+|.++......... .+..++.+ ..++.+.+.+ +.|.
T Consensus 181 ~i~~~~~~ew~~l~F-S~dGK~iLlsT~~s---~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ftPds~Fvl~gs-~dg~ 255 (311)
T KOG1446|consen 181 SITDNDEAEWTDLEF-SPDGKSILLSTNAS---FIYLLDAFDGTVKSTFSGYPNAGNLPLSATFTPDSKFVLSGS-DDGT 255 (311)
T ss_pred ccCCCCccceeeeEE-cCCCCEEEEEeCCC---cEEEEEccCCcEeeeEeeccCCCCcceeEEECCCCcEEEEec-CCCc
Confidence 444221 11222322 35666666666654 7889999999988776543322 23334444 3445555554 5799
Q ss_pred EEEEEeecce
Q 003800 248 LVTVSFKNRK 257 (794)
Q Consensus 248 L~v~~l~sg~ 257 (794)
+++-++++|.
T Consensus 256 i~vw~~~tg~ 265 (311)
T KOG1446|consen 256 IHVWNLETGK 265 (311)
T ss_pred EEEEEcCCCc
Confidence 9999999987
No 66
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=93.42 E-value=12 Score=39.36 Aligned_cols=143 Identities=13% Similarity=0.096 Sum_probs=82.9
Q ss_pred EEEE-EccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCc--EEEEEeccC
Q 003800 98 VITL-SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGE--ILWTRDFAA 174 (794)
Q Consensus 98 ~V~V-s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~--~~W~~~~~~ 174 (794)
++.+ .+.+-++|-|.+.+|.=.-..+...... ..+++.+ + .+++.+.....++.+|..+++ ++=+++...
T Consensus 11 viLvsA~YDhTIRfWqa~tG~C~rTiqh~dsqV-NrLeiTp-----d-k~~LAaa~~qhvRlyD~~S~np~Pv~t~e~h~ 83 (311)
T KOG0315|consen 11 VILVSAGYDHTIRFWQALTGICSRTIQHPDSQV-NRLEITP-----D-KKDLAAAGNQHVRLYDLNSNNPNPVATFEGHT 83 (311)
T ss_pred eEEEeccCcceeeeeehhcCeEEEEEecCccce-eeEEEcC-----C-cchhhhccCCeeEEEEccCCCCCceeEEeccC
Confidence 4444 4568899999999999998888776554 4445544 2 345555577888888888876 466666554
Q ss_pred cceeeeeEEEEecCCE-EEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEe
Q 003800 175 ESVEVQQVIQLDESDQ-IYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSF 253 (794)
Q Consensus 175 ~~~~~~~~v~s~~~~~-vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l 253 (794)
.... .+.-..+++ .|-.+-+| .+-..|+.+ +.-|+....++.+..-++-..+..++..| .+|.+++=||
T Consensus 84 kNVt---aVgF~~dgrWMyTgseDg----t~kIWdlR~--~~~qR~~~~~spVn~vvlhpnQteLis~d-qsg~irvWDl 153 (311)
T KOG0315|consen 84 KNVT---AVGFQCDGRWMYTGSEDG----TVKIWDLRS--LSCQRNYQHNSPVNTVVLHPNQTELISGD-QSGNIRVWDL 153 (311)
T ss_pred CceE---EEEEeecCeEEEecCCCc----eEEEEeccC--cccchhccCCCCcceEEecCCcceEEeec-CCCcEEEEEc
Confidence 4331 111112222 33322222 455555554 22233333334443323333555666666 3689999999
Q ss_pred ecce
Q 003800 254 KNRK 257 (794)
Q Consensus 254 ~sg~ 257 (794)
+...
T Consensus 154 ~~~~ 157 (311)
T KOG0315|consen 154 GENS 157 (311)
T ss_pred cCCc
Confidence 8764
No 67
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=93.20 E-value=2.8 Score=42.77 Aligned_cols=147 Identities=17% Similarity=0.148 Sum_probs=84.4
Q ss_pred CCCEEEEEeC---CCEEEEEECcCCccceEEEcCccccee--eeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccC
Q 003800 52 GRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVD--GIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRG 126 (794)
Q Consensus 52 ~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~--~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~ 126 (794)
.++.+|.+|. ...|...|..+|++.|.+.+..+ .+- |+ ...++.+..++=..+..+-+|+.|=+.+=+++..+
T Consensus 54 ~~g~i~esTG~yg~S~ir~~~L~~gq~~~s~~l~~~-~~FgEGi-t~~gd~~y~LTw~egvaf~~d~~t~~~lg~~~y~G 131 (262)
T COG3823 54 LDGHILESTGLYGFSKIRVSDLTTGQEIFSEKLAPD-TVFGEGI-TKLGDYFYQLTWKEGVAFKYDADTLEELGRFSYEG 131 (262)
T ss_pred eCCEEEEeccccccceeEEEeccCceEEEEeecCCc-cccccce-eeccceEEEEEeccceeEEEChHHhhhhcccccCC
Confidence 3668888886 46899999999999999999842 221 33 12344444445445678888988887777777666
Q ss_pred ccccCCccccccccccccCCeEEEEEC--CEEEEEECCCCcEEEEEeccCcc-----eeeeeEEEEecCCEEEEEEecCC
Q 003800 127 SKHSKPLLLVPTNLKVDKDSLILVSSK--GCLHAVSSIDGEILWTRDFAAES-----VEVQQVIQLDESDQIYVVGYAGS 199 (794)
Q Consensus 127 ~~~s~~~~~~~~~~~~~~~~~V~V~~~--g~l~ald~~tG~~~W~~~~~~~~-----~~~~~~v~s~~~~~vyv~~~~g~ 199 (794)
+.. + +. - ++.-++.++ ..|+-.|++|=+..=+....... +.-..+ -+|.+|+--....
T Consensus 132 eGW--g--Lt-----~--d~~~LimsdGsatL~frdP~tfa~~~~v~VT~~g~pv~~LNELE~----VdG~lyANVw~t~ 196 (262)
T COG3823 132 EGW--G--LT-----S--DDKNLIMSDGSATLQFRDPKTFAELDTVQVTDDGVPVSKLNELEW----VDGELYANVWQTT 196 (262)
T ss_pred cce--e--ee-----c--CCcceEeeCCceEEEecCHHHhhhcceEEEEECCeecccccceee----eccEEEEeeeeec
Confidence 554 1 11 1 222234443 35666666543322222211111 101112 3667776444432
Q ss_pred ceeEEEEEEcCCCceeeee
Q 003800 200 SQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 200 ~~~~v~ald~~tG~~~w~~ 218 (794)
++.-+|+++|+++.-.
T Consensus 197 ---~I~rI~p~sGrV~~wi 212 (262)
T COG3823 197 ---RIARIDPDSGRVVAWI 212 (262)
T ss_pred ---ceEEEcCCCCcEEEEE
Confidence 6778888888877444
No 68
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=93.04 E-value=4.1 Score=46.10 Aligned_cols=151 Identities=17% Similarity=0.191 Sum_probs=83.8
Q ss_pred CCCEEEE-EeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEE-EccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 52 GRKRVVV-STEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITL-SSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 52 ~~~~Vyv-~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
.++.+++ ++++.++--.|+.++.+ ...+... +-+.+.....+.+.+++ |+.++.||.||+.+-. -|...+.-+.
T Consensus 121 ~d~t~l~s~sDd~v~k~~d~s~a~v--~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~-~~v~elnhg~ 197 (487)
T KOG0310|consen 121 QDNTMLVSGSDDKVVKYWDLSTAYV--QAELSGHTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSLT-SRVVELNHGC 197 (487)
T ss_pred cCCeEEEecCCCceEEEEEcCCcEE--EEEecCCcceeEeeccccCCCeEEEecCCCceEEEEEeccCC-ceeEEecCCC
Confidence 3555554 56677777778777764 4445433 22333222334444444 5678999999998765 6666665331
Q ss_pred ccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEec-cCcceeeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800 129 HSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDF-AAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQ 206 (794)
Q Consensus 129 ~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~-~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~a 206 (794)
--.....+| .+..++. ++..+...|..+|.++=.... -... ...+..+..+..++-++++| +|-.
T Consensus 198 pVe~vl~lp-------sgs~iasAgGn~vkVWDl~~G~qll~~~~~H~Kt--VTcL~l~s~~~rLlS~sLD~----~VKV 264 (487)
T KOG0310|consen 198 PVESVLALP-------SGSLIASAGGNSVKVWDLTTGGQLLTSMFNHNKT--VTCLRLASDSTRLLSGSLDR----HVKV 264 (487)
T ss_pred ceeeEEEcC-------CCCEEEEcCCCeEEEEEecCCceehhhhhcccce--EEEEEeecCCceEeeccccc----ceEE
Confidence 101222222 3445554 456677777776655432221 1111 22233334567788888887 7888
Q ss_pred EEcCCCceeeee
Q 003800 207 INAMNGELLNHE 218 (794)
Q Consensus 207 ld~~tG~~~w~~ 218 (794)
+|..+=+.+...
T Consensus 265 fd~t~~Kvv~s~ 276 (487)
T KOG0310|consen 265 FDTTNYKVVHSW 276 (487)
T ss_pred EEccceEEEEee
Confidence 886666666444
No 69
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=92.84 E-value=17 Score=39.70 Aligned_cols=145 Identities=10% Similarity=0.066 Sum_probs=75.2
Q ss_pred EEEEEECcCCccceEEEcCccccee-eeeeeeCCEEEEEEccC-----CeEEEEeCCCCcE--eEEEeccCccccCCccc
Q 003800 64 VIASLDLRHGEIFWRHVLGINDVVD-GIDIALGKYVITLSSDG-----STLRAWNLPDGQM--VWESFLRGSKHSKPLLL 135 (794)
Q Consensus 64 ~l~ALn~~tG~ivWR~~l~~~~~i~-~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~l--lWe~~l~~~~~s~~~~~ 135 (794)
.++.|+..+.+..|+..-+-+..-. +..+..++.++++||.. ..+..+|..+.+- .|+..-.-+. +.
T Consensus 40 ~v~~~~~~~~~~~W~~~~~lp~~r~~~~~~~~~~~lyviGG~~~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~-----~~ 114 (323)
T TIGR03548 40 GIYIAKDENSNLKWVKDGQLPYEAAYGASVSVENGIYYIGGSNSSERFSSVYRITLDESKEELICETIGNLPF-----TF 114 (323)
T ss_pred eeEEEecCCCceeEEEcccCCccccceEEEEECCEEEEEcCCCCCCCceeEEEEEEcCCceeeeeeEcCCCCc-----Cc
Confidence 4666653344567987543332111 11134577777777642 3688888877653 4443211111 01
Q ss_pred cccccccccCCeEEEEEC-------CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecC-CceeEEEEE
Q 003800 136 VPTNLKVDKDSLILVSSK-------GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG-SSQFHAYQI 207 (794)
Q Consensus 136 ~~~~~~~~~~~~V~V~~~-------g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g-~~~~~v~al 207 (794)
.... ....++.+||..+ ..++++|..+. .|+.-.+.+............++.+|+++... .....+.++
T Consensus 115 ~~~~-~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~--~W~~~~~~p~~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~~y 191 (323)
T TIGR03548 115 ENGS-ACYKDGTLYVGGGNRNGKPSNKSYLFNLETQ--EWFELPDFPGEPRVQPVCVKLQNELYVFGGGSNIAYTDGYKY 191 (323)
T ss_pred cCce-EEEECCEEEEEeCcCCCccCceEEEEcCCCC--CeeECCCCCCCCCCcceEEEECCEEEEEcCCCCccccceEEE
Confidence 1000 1122567777642 36888998765 48864432211001111124678999987432 112346789
Q ss_pred EcCCCceeeee
Q 003800 208 NAMNGELLNHE 218 (794)
Q Consensus 208 d~~tG~~~w~~ 218 (794)
|+.+. .|+.
T Consensus 192 d~~~~--~W~~ 200 (323)
T TIGR03548 192 SPKKN--QWQK 200 (323)
T ss_pred ecCCC--eeEE
Confidence 99875 4765
No 70
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=92.79 E-value=6.9 Score=41.64 Aligned_cols=193 Identities=13% Similarity=0.092 Sum_probs=103.0
Q ss_pred CCE-EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEccCCeEEEEeCCCCc-EeEEEeccCccc
Q 003800 53 RKR-VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSDGSTLRAWNLPDGQ-MVWESFLRGSKH 129 (794)
Q Consensus 53 ~~~-Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~g~~v~A~d~~tG~-llWe~~l~~~~~ 129 (794)
++. =|.+...+.+.-||++||+.. ++.|.+...-.++.+ ..+.-.++ ..+.-+.-+|.+++. ..|...+.-...
T Consensus 72 dG~VWft~qg~gaiGhLdP~tGev~-~ypLg~Ga~Phgiv~gpdg~~Wit--d~~~aI~R~dpkt~evt~f~lp~~~a~~ 148 (353)
T COG4257 72 DGAVWFTAQGTGAIGHLDPATGEVE-TYPLGSGASPHGIVVGPDGSAWIT--DTGLAIGRLDPKTLEVTRFPLPLEHADA 148 (353)
T ss_pred CCceEEecCccccceecCCCCCceE-EEecCCCCCCceEEECCCCCeeEe--cCcceeEEecCcccceEEeecccccCCC
Confidence 554 455667899999999999864 556655422222211 12333444 323368888887764 455555332111
Q ss_pred cCCccccccccccccCCeEEEE-ECCEEEEEECCCCcE-EEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800 130 SKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEI-LWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQI 207 (794)
Q Consensus 130 s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~-~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al 207 (794)
-+... ..+..+.+.+. ..|.-=+||+.++.+ +|..... .- ++.+. ...++.||+.++.|+ .+.-+
T Consensus 149 --nlet~----vfD~~G~lWFt~q~G~yGrLdPa~~~i~vfpaPqG--~g-pyGi~-atpdGsvwyaslagn---aiari 215 (353)
T COG4257 149 --NLETA----VFDPWGNLWFTGQIGAYGRLDPARNVISVFPAPQG--GG-PYGIC-ATPDGSVWYASLAGN---AIARI 215 (353)
T ss_pred --cccce----eeCCCccEEEeeccccceecCcccCceeeeccCCC--CC-CcceE-ECCCCcEEEEecccc---ceEEc
Confidence 11111 22334555443 455555889888764 3554432 21 33443 467899999999987 68889
Q ss_pred EcCCCceeeeeeeecccCccC-c-eEEE-cCcEEEEEECCCCeEEEEEeecceeeeEEEeec
Q 003800 208 NAMNGELLNHETAAFSGGFVG-D-VALV-SSDTLVTLDTTRSILVTVSFKNRKIAFQETHLS 266 (794)
Q Consensus 208 d~~tG~~~w~~~v~~~~~~s~-~-~~~v-g~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l~ 266 (794)
|+.+|.. ..+..|..+.. + -+-+ ..+-+-..+-.+++++..|-.+.+ ..+-+|-
T Consensus 216 dp~~~~a---ev~p~P~~~~~gsRriwsdpig~~wittwg~g~l~rfdPs~~s--W~eypLP 272 (353)
T COG4257 216 DPFAGHA---EVVPQPNALKAGSRRIWSDPIGRAWITTWGTGSLHRFDPSVTS--WIEYPLP 272 (353)
T ss_pred ccccCCc---ceecCCCcccccccccccCccCcEEEeccCCceeeEeCccccc--ceeeeCC
Confidence 9999932 22344443222 1 1100 111222233445666666655544 5555653
No 71
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=92.56 E-value=19 Score=40.39 Aligned_cols=70 Identities=11% Similarity=0.197 Sum_probs=42.6
Q ss_pred CCEEEEEeC--CCEEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEccC-----------CeEEEEeCCCCc
Q 003800 53 RKRVVVSTE--ENVIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSDG-----------STLRAWNLPDGQ 117 (794)
Q Consensus 53 ~~~Vyv~t~--~g~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~g-----------~~v~A~d~~tG~ 117 (794)
++.||+... .+.+..+|.++-+-.|+..-+-+. ......+..++.++++||.. ..+..||+.+.
T Consensus 38 ~~~iyv~gG~~~~~~~~~d~~~~~~~W~~l~~~p~~~r~~~~~v~~~~~IYV~GG~~~~~~~~~~~~~~~v~~YD~~~n- 116 (376)
T PRK14131 38 NNTVYVGLGSAGTSWYKLDLNAPSKGWTKIAAFPGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTN- 116 (376)
T ss_pred CCEEEEEeCCCCCeEEEEECCCCCCCeEECCcCCCCCcccceEEEECCEEEEEcCCCCCCCCCceeEcccEEEEeCCCC-
Confidence 678998654 367889998766667986443221 11111134566666667642 24788888764
Q ss_pred EeEEEec
Q 003800 118 MVWESFL 124 (794)
Q Consensus 118 llWe~~l 124 (794)
.|+.-.
T Consensus 117 -~W~~~~ 122 (376)
T PRK14131 117 -SWQKLD 122 (376)
T ss_pred -EEEeCC
Confidence 588753
No 72
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=92.48 E-value=15 Score=42.51 Aligned_cols=193 Identities=12% Similarity=0.106 Sum_probs=98.3
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
++..+..++.+..|...|.+.+...=|...+....+..+.....+..++-++.++.+|.||..+|+.+=......... .
T Consensus 214 d~~~l~s~s~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~~i-s 292 (456)
T KOG0266|consen 214 DGSYLLSGSDDKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKGHSDGI-S 292 (456)
T ss_pred CCcEEEEecCCceEEEeeccCCCeEEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeeccCCce-E
Confidence 344677788899999888844422223222222223222122222444445567899999999999998877766543 1
Q ss_pred CccccccccccccCCeEEE-E-ECCEEEEEECCCCcEE--EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800 132 PLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGEIL--WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQI 207 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~~~--W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al 207 (794)
...+ ..++..++ . .++.+...|..+|..+ =+............+..+..+..++....++ .+.-.
T Consensus 293 ~~~f-------~~d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d~----~~~~w 361 (456)
T KOG0266|consen 293 GLAF-------SPDGNLLVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLDR----TLKLW 361 (456)
T ss_pred EEEE-------CCCCCEEEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCCC----eEEEE
Confidence 1111 11344444 4 3999999999999843 1111111110012222122233333332232 56667
Q ss_pred EcCCCceeeeeeeeccc--CccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800 208 NAMNGELLNHETAAFSG--GFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 208 d~~tG~~~w~~~v~~~~--~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
|..+|...-++...... .+...+...++..++... ..+.++.-++.++.
T Consensus 362 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~sg~-~d~~v~~~~~~s~~ 412 (456)
T KOG0266|consen 362 DLRSGKSVGTYTGHSNLVRCIFSPTLSTGGKLIYSGS-EDGSVYVWDSSSGG 412 (456)
T ss_pred EccCCcceeeecccCCcceeEecccccCCCCeEEEEe-CCceEEEEeCCccc
Confidence 88888877666422211 011111111333333332 34566677766654
No 73
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=92.27 E-value=3.4 Score=43.17 Aligned_cols=108 Identities=18% Similarity=0.240 Sum_probs=74.7
Q ss_pred CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEecc
Q 003800 95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFA 173 (794)
Q Consensus 95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~ 173 (794)
.+..+.-+++++.||.||..+|...=+..+..++- ++++.. ++.++.. .++.+.-.|+++=.++=+++.|
T Consensus 154 eD~~iLSSadd~tVRLWD~rTgt~v~sL~~~s~Vt--SlEvs~-------dG~ilTia~gssV~Fwdaksf~~lKs~k~P 224 (334)
T KOG0278|consen 154 EDKCILSSADDKTVRLWDHRTGTEVQSLEFNSPVT--SLEVSQ-------DGRILTIAYGSSVKFWDAKSFGLLKSYKMP 224 (334)
T ss_pred cCceEEeeccCCceEEEEeccCcEEEEEecCCCCc--ceeecc-------CCCEEEEecCceeEEeccccccceeeccCc
Confidence 33444435677899999999999998888877653 444443 5666665 5788889999988888888877
Q ss_pred CcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800 174 AESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 174 ~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
-... ...+ .-...+||. |+..++++-+|-.||+.+-.+
T Consensus 225 ~nV~-SASL---~P~k~~fVa---Gged~~~~kfDy~TgeEi~~~ 262 (334)
T KOG0278|consen 225 CNVE-SASL---HPKKEFFVA---GGEDFKVYKFDYNTGEEIGSY 262 (334)
T ss_pred cccc-cccc---cCCCceEEe---cCcceEEEEEeccCCceeeec
Confidence 4321 1112 122356654 455579999999999988664
No 74
>PHA02713 hypothetical protein; Provisional
Probab=92.14 E-value=3.5 Score=49.07 Aligned_cols=148 Identities=12% Similarity=0.195 Sum_probs=83.5
Q ss_pred CCEEEEEeCC------CEEEEEECcCCccceEEEcCccccee--eeeeeeCCEEEEEEccC-------------------
Q 003800 53 RKRVVVSTEE------NVIASLDLRHGEIFWRHVLGINDVVD--GIDIALGKYVITLSSDG------------------- 105 (794)
Q Consensus 53 ~~~Vyv~t~~------g~l~ALn~~tG~ivWR~~l~~~~~i~--~l~~~~g~~~V~Vs~~g------------------- 105 (794)
+++||+..+. +.+..+|+++. .|+..-+-+.... +. ++.++.+.++||..
T Consensus 351 ~g~IYviGG~~~~~~~~sve~Ydp~~~--~W~~~~~mp~~r~~~~~-~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~ 427 (557)
T PHA02713 351 DDTIYAIGGQNGTNVERTIECYTMGDD--KWKMLPDMPIALSSYGM-CVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEED 427 (557)
T ss_pred CCEEEEECCcCCCCCCceEEEEECCCC--eEEECCCCCcccccccE-EEECCEEEEEeCCCccccccccccccccccccc
Confidence 7789987763 34888999987 5997443221111 22 24566666667642
Q ss_pred ----CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC--------CEEEEEECCC-CcEEEEEec
Q 003800 106 ----STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK--------GCLHAVSSID-GEILWTRDF 172 (794)
Q Consensus 106 ----~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~--------g~l~ald~~t-G~~~W~~~~ 172 (794)
..+..||+.+. .|+.-..-....... + .+.-++.+||.++ ..+.++|+.+ . .|+.-.
T Consensus 428 ~~~~~~ve~YDP~td--~W~~v~~m~~~r~~~---~---~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~--~W~~~~ 497 (557)
T PHA02713 428 THSSNKVIRYDTVNN--IWETLPNFWTGTIRP---G---VVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYN--GWELIT 497 (557)
T ss_pred ccccceEEEECCCCC--eEeecCCCCcccccC---c---EEEECCEEEEEeCCCCCCccceeEEEecCCCCC--CeeEcc
Confidence 35888999876 587543221110111 1 1222677888642 2467899887 3 498654
Q ss_pred cCcce-eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800 173 AAESV-EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 173 ~~~~~-~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
+.+.- ....+ +.-++.+|++|...+. ..+-++|+.|++ |+.
T Consensus 498 ~m~~~r~~~~~--~~~~~~iyv~Gg~~~~-~~~e~yd~~~~~--W~~ 539 (557)
T PHA02713 498 TTESRLSALHT--ILHDNTIMMLHCYESY-MLQDTFNVYTYE--WNH 539 (557)
T ss_pred ccCccccccee--EEECCEEEEEeeecce-eehhhcCccccc--ccc
Confidence 43321 01112 3468999998754322 246778877653 554
No 75
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=91.97 E-value=5.1 Score=43.68 Aligned_cols=147 Identities=13% Similarity=0.199 Sum_probs=83.0
Q ss_pred CCEEEEEECcCCccceEEEcCccc-----c---------------------eeeeeeeeCCEEEEEEc-cCCeEEEEeCC
Q 003800 62 ENVIASLDLRHGEIFWRHVLGIND-----V---------------------VDGIDIALGKYVITLSS-DGSTLRAWNLP 114 (794)
Q Consensus 62 ~g~l~ALn~~tG~ivWR~~l~~~~-----~---------------------i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~ 114 (794)
++.+.-+|++||+++|+-....-. . +..+. ...++-+.||. .-..|+.+|..
T Consensus 95 d~~~~EiDi~TgevlfeW~a~DH~~~~~~~~~~~~~~~~g~~~~~~~D~~HiNsV~-~~~~G~yLiS~R~~~~i~~I~~~ 173 (299)
T PF14269_consen 95 DDVFQEIDIETGEVLFEWSASDHVDPNDSYDSQDPLPGSGGSSSFPWDYFHINSVD-KDDDGDYLISSRNTSTIYKIDPS 173 (299)
T ss_pred cceeEEeccCCCCEEEEEEhhheecccccccccccccCCCcCCCCCCCccEeeeee-ecCCccEEEEecccCEEEEEECC
Confidence 567889999999999998753210 0 00111 12233345565 45789999999
Q ss_pred CCcEeEEEecc-Ccc-------c--cCCccccccccccccCCeEEEEE------------CCEEEEEECCCCcEEEEEec
Q 003800 115 DGQMVWESFLR-GSK-------H--SKPLLLVPTNLKVDKDSLILVSS------------KGCLHAVSSIDGEILWTRDF 172 (794)
Q Consensus 115 tG~llWe~~l~-~~~-------~--s~~~~~~~~~~~~~~~~~V~V~~------------~g~l~ald~~tG~~~W~~~~ 172 (794)
||+++|+.... ... . +-++.+.+ .-..++.+.++. .+.+..||..+..+.|..+.
T Consensus 174 tG~I~W~lgG~~~~df~~~~~~f~~QHdar~~~---~~~~~~~IslFDN~~~~~~~~~~s~~~v~~ld~~~~~~~~~~~~ 250 (299)
T PF14269_consen 174 TGKIIWRLGGKRNSDFTLPATNFSWQHDARFLN---ESNDDGTISLFDNANSDFNGTEPSRGLVLELDPETMTVTLVREY 250 (299)
T ss_pred CCcEEEEeCCCCCCcccccCCcEeeccCCEEec---cCCCCCEEEEEcCCCCCCCCCcCCCceEEEEECCCCEEEEEEEe
Confidence 99999998654 111 1 11222221 001133444432 46899999997776665543
Q ss_pred c---Ccceee----eeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800 173 A---AESVEV----QQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET 219 (794)
Q Consensus 173 ~---~~~~~~----~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~ 219 (794)
. .+-.++ .|. ..++.+++.=.. ..++.-+++ +|+++|+..
T Consensus 251 ~~~~~~~~s~~~G~~Q~---L~nGn~li~~g~---~g~~~E~~~-~G~vv~~~~ 297 (299)
T PF14269_consen 251 SDHPDGFYSPSQGSAQR---LPNGNVLIGWGN---NGRISEFTP-DGEVVWEAQ 297 (299)
T ss_pred ecCCCcccccCCCcceE---CCCCCEEEecCC---CceEEEECC-CCCEEEEEE
Confidence 3 211111 122 234555542111 227888885 799999985
No 76
>PRK05137 tolB translocation protein TolB; Provisional
Probab=91.67 E-value=29 Score=39.71 Aligned_cols=188 Identities=15% Similarity=0.075 Sum_probs=90.5
Q ss_pred cCCCEEEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEec
Q 003800 51 TGRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSD--GSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 51 ~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l 124 (794)
+++++|+..+. ...|+.+|.++|+. ++.....+.+...... .|+.+++.... ...++.||..+|.+. ++
T Consensus 211 pDG~~lay~s~~~g~~~i~~~dl~~g~~--~~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~---~L 285 (435)
T PRK05137 211 PNRQEITYMSYANGRPRVYLLDLETGQR--ELVGNFPGMTFAPRFSPDGRKVVMSLSQGGNTDIYTMDLRSGTTT---RL 285 (435)
T ss_pred CCCCEEEEEEecCCCCEEEEEECCCCcE--EEeecCCCcccCcEECCCCCEEEEEEecCCCceEEEEECCCCceE---Ec
Confidence 34556655543 46899999999864 3322222222222122 45555554332 246999999988753 23
Q ss_pred cCcc-ccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCC
Q 003800 125 RGSK-HSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGS 199 (794)
Q Consensus 125 ~~~~-~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~ 199 (794)
.... ....+.. ..+ ++.+++.+ ...++.+|..+|++.--..... .. ..+..+.++..+++....++
T Consensus 286 t~~~~~~~~~~~-----spD-G~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~~-~~--~~~~~SpdG~~ia~~~~~~~ 356 (435)
T PRK05137 286 TDSPAIDTSPSY-----SPD-GSQIVFESDRSGSPQLYVMNADGSNPRRISFGGG-RY--STPVWSPRGDLIAFTKQGGG 356 (435)
T ss_pred cCCCCccCceeE-----cCC-CCEEEEEECCCCCCeEEEEECCCCCeEEeecCCC-cc--cCeEECCCCCEEEEEEcCCC
Confidence 2211 1011111 123 23344433 2379999988776543221111 11 11212345566665554332
Q ss_pred ceeEEEEEEcCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCC-----CeEEEEEeecce
Q 003800 200 SQFHAYQINAMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTR-----SILVTVSFKNRK 257 (794)
Q Consensus 200 ~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~-----g~L~v~~l~sg~ 257 (794)
...+..+|+.+|... .+...... +.+.+- .+..+++..... ..|+.+++..+.
T Consensus 357 -~~~i~~~d~~~~~~~---~lt~~~~~-~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~~ 415 (435)
T PRK05137 357 -QFSIGVMKPDGSGER---ILTSGFLV-EGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGRN 415 (435)
T ss_pred -ceEEEEEECCCCceE---eccCCCCC-CCCeECCCCCEEEEEEccCCCCCcceEEEEECCCCc
Confidence 346778887666532 11112122 222222 334444443222 368888887665
No 77
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=91.65 E-value=7.5 Score=43.01 Aligned_cols=188 Identities=11% Similarity=0.175 Sum_probs=107.7
Q ss_pred cCCCEEEEEeCCCEEEEEECcCCc-----cceEEEcCcccceeeeeee-eCCEEEEEEccC--CeEEEEeCCCCcEeEEE
Q 003800 51 TGRKRVVVSTEENVIASLDLRHGE-----IFWRHVLGINDVVDGIDIA-LGKYVITLSSDG--STLRAWNLPDGQMVWES 122 (794)
Q Consensus 51 ~~~~~Vyv~t~~g~l~ALn~~tG~-----ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g--~~v~A~d~~tG~llWe~ 122 (794)
..++.|++...+|.+...+.+.|. .+|-+..+.- ..++-. ....++..||.- ..+--||.+.++.+|+.
T Consensus 113 ~~dg~Litc~~sG~l~~~~~k~~d~hss~l~~la~g~g~---~~~r~~~~~p~Iva~GGke~~n~lkiwdle~~~qiw~a 189 (412)
T KOG3881|consen 113 LADGTLITCVSSGNLQVRHDKSGDLHSSKLIKLATGPGL---YDVRQTDTDPYIVATGGKENINELKIWDLEQSKQIWSA 189 (412)
T ss_pred hcCCEEEEEecCCcEEEEeccCCccccccceeeecCCce---eeeccCCCCCceEecCchhcccceeeeecccceeeeec
Confidence 347789999999998888888554 5555444221 112111 233455546644 57999999999999997
Q ss_pred eccC-ccc-------cCCccccccccccccCCeEEEE--ECCEEEEEECCCCc-EEEEEeccCcceeeeeEEEEecCCEE
Q 003800 123 FLRG-SKH-------SKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGE-ILWTRDFAAESVEVQQVIQLDESDQI 191 (794)
Q Consensus 123 ~l~~-~~~-------s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~-~~W~~~~~~~~~~~~~~v~s~~~~~v 191 (794)
.=-. ..+ -.++.+++ ......|+. .-+.|+-+|...|+ ++=+++.....++...+. ..++.+
T Consensus 190 KNvpnD~L~LrVPvW~tdi~Fl~-----g~~~~~fat~T~~hqvR~YDt~~qRRPV~~fd~~E~~is~~~l~--p~gn~I 262 (412)
T KOG3881|consen 190 KNVPNDRLGLRVPVWITDIRFLE-----GSPNYKFATITRYHQVRLYDTRHQRRPVAQFDFLENPISSTGLT--PSGNFI 262 (412)
T ss_pred cCCCCccccceeeeeeccceecC-----CCCCceEEEEecceeEEEecCcccCcceeEeccccCcceeeeec--CCCcEE
Confidence 6321 111 11222222 112455554 37899999999885 666666654433222222 356778
Q ss_pred EEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCce--EEE--cCcEEEEEECCCCeEEEEEeecce
Q 003800 192 YVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDV--ALV--SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 192 yv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~--~~v--g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
|+....| .+..||..+|+..-... .+++|++ +.+ +..+++..- -...+.+.|+++.+
T Consensus 263 y~gn~~g----~l~~FD~r~~kl~g~~~----kg~tGsirsih~hp~~~~las~G-LDRyvRIhD~ktrk 323 (412)
T KOG3881|consen 263 YTGNTKG----QLAKFDLRGGKLLGCGL----KGITGSIRSIHCHPTHPVLASCG-LDRYVRIHDIKTRK 323 (412)
T ss_pred EEecccc----hhheecccCceeecccc----CCccCCcceEEEcCCCceEEeec-cceeEEEeecccch
Confidence 8755555 79999999998875531 1233311 111 223333221 12457888887743
No 78
>PTZ00420 coronin; Provisional
Probab=91.23 E-value=21 Score=42.57 Aligned_cols=69 Identities=4% Similarity=0.091 Sum_probs=48.2
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccC
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRG 126 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~ 126 (794)
+..++.+|.|.-.|.++|+.+++.... ..+..+.....+.+++.++.++.++.||+.+|+.+-+...+.
T Consensus 141 LaSgS~DgtIrIWDl~tg~~~~~i~~~--~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~ 209 (568)
T PTZ00420 141 MCSSGFDSFVNIWDIENEKRAFQINMP--KKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHD 209 (568)
T ss_pred EEEEeCCCeEEEEECCCCcEEEEEecC--CcEEEEEECCCCCEEEEEecCCEEEEEECCCCcEEEEEeccc
Confidence 346778999999999999988776543 234433223344455546556799999999999986665543
No 79
>PLN00181 protein SPA1-RELATED; Provisional
Probab=90.74 E-value=52 Score=40.91 Aligned_cols=106 Identities=11% Similarity=0.125 Sum_probs=65.4
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
...+++++.+|.|...|..+|+.+....-.. ..+..+... .++..++.++.++.++.||..+|..+-.........
T Consensus 545 ~~~las~~~Dg~v~lWd~~~~~~~~~~~~H~-~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~~~~~v~-- 621 (793)
T PLN00181 545 KSQVASSNFEGVVQVWDVARSQLVTEMKEHE-KRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKTKANIC-- 621 (793)
T ss_pred CCEEEEEeCCCeEEEEECCCCeEEEEecCCC-CCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEEEEecCCCeE--
Confidence 4568888889999999999998887764322 234444222 233455546667899999999998765554332211
Q ss_pred CccccccccccccCCeEEEE-ECCEEEEEECCCCcE
Q 003800 132 PLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEI 166 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~ 166 (794)
...+. ...+..+++. .+|.++..|..+++.
T Consensus 622 ~v~~~-----~~~g~~latgs~dg~I~iwD~~~~~~ 652 (793)
T PLN00181 622 CVQFP-----SESGRSLAFGSADHKVYYYDLRNPKL 652 (793)
T ss_pred EEEEe-----CCCCCEEEEEeCCCeEEEEECCCCCc
Confidence 01110 1112333444 489999999887753
No 80
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=89.96 E-value=29 Score=38.22 Aligned_cols=160 Identities=9% Similarity=0.096 Sum_probs=86.6
Q ss_pred CCEEEEEeCC--CEEEEEECcCCccceEEEcCccc--cee-eeeeeeCCEEEEEEccC-----------CeEEEEeCCCC
Q 003800 53 RKRVVVSTEE--NVIASLDLRHGEIFWRHVLGIND--VVD-GIDIALGKYVITLSSDG-----------STLRAWNLPDG 116 (794)
Q Consensus 53 ~~~Vyv~t~~--g~l~ALn~~tG~ivWR~~l~~~~--~i~-~l~~~~g~~~V~Vs~~g-----------~~v~A~d~~tG 116 (794)
++.||+.... +.+..+|+++.+-.|+...+-+. ... ++ +..++.+.++||.. ..+..||+.+.
T Consensus 17 ~~~vyv~GG~~~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~-~~~~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~ 95 (346)
T TIGR03547 17 GDKVYVGLGSAGTSWYKLDLKKPSKGWQKIADFPGGPRNQAVA-AAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKN 95 (346)
T ss_pred CCEEEEEccccCCeeEEEECCCCCCCceECCCCCCCCcccceE-EEECCEEEEEeCCCCCCCCCcceecccEEEEECCCC
Confidence 6788886653 57888998766778997554321 111 22 34577777777742 24777888654
Q ss_pred cEeEEEeccCccccCCccccccccccccCCeEEEEE--C---------------------------------------CE
Q 003800 117 QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--K---------------------------------------GC 155 (794)
Q Consensus 117 ~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~---------------------------------------g~ 155 (794)
.|+.-...... . ..+.......++.|++.. + ..
T Consensus 96 --~W~~~~~~~p~--~--~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (346)
T TIGR03547 96 --SWQKLDTRSPV--G--LLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYFSQPPEDYFWNKN 169 (346)
T ss_pred --EEecCCCCCCC--c--ccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHhCCChhHcCccce
Confidence 48764321110 0 111000101256777763 1 35
Q ss_pred EEEEECCCCcEEEEEeccCcc--eeeeeEEEEecCCEEEEEEecCCc---eeEEEEEEcCCCceeeeeeeecc
Q 003800 156 LHAVSSIDGEILWTRDFAAES--VEVQQVIQLDESDQIYVVGYAGSS---QFHAYQINAMNGELLNHETAAFS 223 (794)
Q Consensus 156 l~ald~~tG~~~W~~~~~~~~--~~~~~~v~s~~~~~vyv~~~~g~~---~~~v~ald~~tG~~~w~~~v~~~ 223 (794)
+..+|+.+. .|+.-.+.+. ..-..+ ..-++++|+++..... ...+..+|.......|+..-.++
T Consensus 170 v~~YDp~t~--~W~~~~~~p~~~r~~~~~--~~~~~~iyv~GG~~~~~~~~~~~~~y~~~~~~~~W~~~~~m~ 238 (346)
T TIGR03547 170 VLSYDPSTN--QWRNLGENPFLGTAGSAI--VHKGNKLLLINGEIKPGLRTAEVKQYLFTGGKLEWNKLPPLP 238 (346)
T ss_pred EEEEECCCC--ceeECccCCCCcCCCceE--EEECCEEEEEeeeeCCCccchheEEEEecCCCceeeecCCCC
Confidence 777787664 5876544332 111112 2457899998753211 12345566666677798754443
No 81
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=89.90 E-value=2.4 Score=49.29 Aligned_cols=150 Identities=16% Similarity=0.217 Sum_probs=90.9
Q ss_pred cCCCEEEEEeCCCEEEEEECcCCccceEEEcCccc---------------ceeeeee-eeCCEEEEEEccCCeEEEEeCC
Q 003800 51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND---------------VVDGIDI-ALGKYVITLSSDGSTLRAWNLP 114 (794)
Q Consensus 51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~---------------~i~~l~~-~~g~~~V~Vs~~g~~v~A~d~~ 114 (794)
.++.++-|++++|.|- +||...+.-. -|..++. ....+++.++..+.+++.||..
T Consensus 638 FD~~rLAVa~ddg~i~---------lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~asyd~Ti~lWDl~ 708 (1012)
T KOG1445|consen 638 FDDERLAVATDDGQIN---------LWRLTANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAVASYDSTIELWDLA 708 (1012)
T ss_pred CChHHeeecccCceEE---------EEEeccCCCCcccCCcceeeecccceEEEEEecchhhhHhhhhhccceeeeeehh
Confidence 4477899999999863 5776543220 1222221 1344566666777899999999
Q ss_pred CCcEeEEEeccCccccCCccccccccccccCCeEEE-E-ECCEEEEEECCCCc-EEEEEeccCcceeeeeEEEEecCCEE
Q 003800 115 DGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGE-ILWTRDFAAESVEVQQVIQLDESDQI 191 (794)
Q Consensus 115 tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~-~~W~~~~~~~~~~~~~~v~s~~~~~v 191 (794)
++++.=+........ .+. ++..++..+. . .||+|+.+++.+++ ++.+-+-+.+.. -.+++.+.++..+
T Consensus 709 ~~~~~~~l~gHtdqI-f~~-------AWSpdGr~~AtVcKDg~~rVy~Prs~e~pv~Eg~gpvgtR-gARi~wacdgr~v 779 (1012)
T KOG1445|consen 709 NAKLYSRLVGHTDQI-FGI-------AWSPDGRRIATVCKDGTLRVYEPRSREQPVYEGKGPVGTR-GARILWACDGRIV 779 (1012)
T ss_pred hhhhhheeccCcCce-eEE-------EECCCCcceeeeecCceEEEeCCCCCCCccccCCCCccCc-ceeEEEEecCcEE
Confidence 999987776654432 111 2222343333 3 59999999999886 445544444332 3445445567777
Q ss_pred EEEEecCCceeEEEEEEcC--CCceeeee
Q 003800 192 YVVGYAGSSQFHAYQINAM--NGELLNHE 218 (794)
Q Consensus 192 yv~~~~g~~~~~v~ald~~--tG~~~w~~ 218 (794)
.++|++..+...+..+|++ .|.++...
T Consensus 780 iv~Gfdk~SeRQv~~Y~Aq~l~~~pl~t~ 808 (1012)
T KOG1445|consen 780 IVVGFDKSSERQVQMYDAQTLDLRPLYTQ 808 (1012)
T ss_pred EEecccccchhhhhhhhhhhccCCcceee
Confidence 7888876555556556655 34455444
No 82
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=89.89 E-value=24 Score=42.68 Aligned_cols=155 Identities=14% Similarity=0.167 Sum_probs=101.7
Q ss_pred eccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 49 QKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 49 ~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
|++.=++|.+++.+|.+.-+|-+||+++...+--.. .|..+..+-.=++|.+|-.+|+|.-+|...|+.+-+++..-+.
T Consensus 168 P~TYLNKIvvGs~~G~lql~Nvrt~K~v~~f~~~~s-~IT~ieqsPaLDVVaiG~~~G~ViifNlK~dkil~sFk~d~g~ 246 (910)
T KOG1539|consen 168 PSTYLNKIVVGSSQGRLQLWNVRTGKVVYTFQEFFS-RITAIEQSPALDVVAIGLENGTVIIFNLKFDKILMSFKQDWGR 246 (910)
T ss_pred chhheeeEEEeecCCcEEEEEeccCcEEEEeccccc-ceeEeccCCcceEEEEeccCceEEEEEcccCcEEEEEEccccc
Confidence 555578899999999999999999999988754332 2333322223468888887789999999999999999986222
Q ss_pred ccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccC-cceeeeeEEEEecCCEEEEEEecCCceeEEE
Q 003800 129 HSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAA-ESVEVQQVIQLDESDQIYVVGYAGSSQFHAY 205 (794)
Q Consensus 129 ~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~-~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ 205 (794)
. ..+.+. .| +..+++. ..|.+.-.|.+.-+..|...... +...-..+. .+..|.+ +..+...+++.
T Consensus 247 V-tslSFr-----tD-G~p~las~~~~G~m~~wDLe~kkl~~v~~nah~~sv~~~~fl---~~epVl~-ta~~DnSlk~~ 315 (910)
T KOG1539|consen 247 V-TSLSFR-----TD-GNPLLASGRSNGDMAFWDLEKKKLINVTRNAHYGSVTGATFL---PGEPVLV-TAGADNSLKVW 315 (910)
T ss_pred e-eEEEec-----cC-CCeeEEeccCCceEEEEEcCCCeeeeeeeccccCCcccceec---CCCceEe-eccCCCceeEE
Confidence 2 122222 22 3344444 36889899988888888876443 221111221 2333443 22233568999
Q ss_pred EEEcCCCcee
Q 003800 206 QINAMNGELL 215 (794)
Q Consensus 206 ald~~tG~~~ 215 (794)
.+|..+|.++
T Consensus 316 vfD~~dg~pR 325 (910)
T KOG1539|consen 316 VFDSGDGVPR 325 (910)
T ss_pred EeeCCCCcch
Confidence 9998888644
No 83
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=89.86 E-value=19 Score=46.33 Aligned_cols=157 Identities=17% Similarity=0.181 Sum_probs=86.4
Q ss_pred CCEEEEEeC-CCEEEEEECcCCccceEEEcCcc------c---------ceeeeeeeeCCEEEEEE-ccCCeEEEEeCCC
Q 003800 53 RKRVVVSTE-ENVIASLDLRHGEIFWRHVLGIN------D---------VVDGIDIALGKYVITLS-SDGSTLRAWNLPD 115 (794)
Q Consensus 53 ~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~------~---------~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~t 115 (794)
++.+|++.. .+.|.-+|+.+|.+. ...... + ...++.....++.++|+ ..+++|+.||..+
T Consensus 694 ~g~LyVad~~~~~I~v~d~~~g~v~--~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~t 771 (1057)
T PLN02919 694 NEKVYIAMAGQHQIWEYNISDGVTR--VFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKT 771 (1057)
T ss_pred CCeEEEEECCCCeEEEEECCCCeEE--EEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCC
Confidence 567888764 678999999888542 110000 0 01123222334445554 3457999999999
Q ss_pred CcEeEEEeccCc------------cccC-CccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCc-----
Q 003800 116 GQMVWESFLRGS------------KHSK-PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAE----- 175 (794)
Q Consensus 116 G~llWe~~l~~~------------~~s~-~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~----- 175 (794)
|...|-...... .... .....|.....+.++.+||. .++++..+|..+|.+.........
T Consensus 772 g~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG 851 (1057)
T PLN02919 772 GGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDG 851 (1057)
T ss_pred CcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCC
Confidence 887654321100 0000 00001111133445678886 388999999999987754432210
Q ss_pred -----ce-eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCcee
Q 003800 176 -----SV-EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELL 215 (794)
Q Consensus 176 -----~~-~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~ 215 (794)
.+ .|..+. ...++.+|+.....+ .+..+|+.+|+..
T Consensus 852 ~~~~a~l~~P~GIa-vd~dG~lyVaDt~Nn---~Irvid~~~~~~~ 893 (1057)
T PLN02919 852 KALKAQLSEPAGLA-LGENGRLFVADTNNS---LIRYLDLNKGEAA 893 (1057)
T ss_pred cccccccCCceEEE-EeCCCCEEEEECCCC---EEEEEECCCCccc
Confidence 01 233343 234677888654433 7888899998764
No 84
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. MADH is induced when the bacteria are grown on methylamine as a carbon source, and it catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). The reaction is: RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, composed of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=89.86 E-value=36 Score=37.68 Aligned_cols=195 Identities=14% Similarity=0.118 Sum_probs=111.3
Q ss_pred cCCCEEEE--EeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCC-CCcEeEEEeccCc
Q 003800 51 TGRKRVVV--STEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLP-DGQMVWESFLRGS 127 (794)
Q Consensus 51 ~~~~~Vyv--~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~-tG~llWe~~l~~~ 127 (794)
.+++.+|| .|...-|..+|...++.+= .++.++....+ +....+...++++| .+...... +|++. +....
T Consensus 104 ~dgk~~~V~N~TPa~SVtVVDl~~~kvv~--ei~~PGC~~iy-P~~~~~F~~lC~DG-sl~~v~Ld~~Gk~~-~~~t~-- 176 (342)
T PF06433_consen 104 ADGKFLYVQNFTPATSVTVVDLAAKKVVG--EIDTPGCWLIY-PSGNRGFSMLCGDG-SLLTVTLDADGKEA-QKSTK-- 176 (342)
T ss_dssp TTSSEEEEEEESSSEEEEEEETTTTEEEE--EEEGTSEEEEE-EEETTEEEEEETTS-CEEEEEETSTSSEE-EEEEE--
T ss_pred cCCcEEEEEccCCCCeEEEEECCCCceee--eecCCCEEEEE-ecCCCceEEEecCC-ceEEEEECCCCCEe-Eeecc--
Confidence 34555666 4567789999999998863 34444433333 23445666667875 45544444 89997 43321
Q ss_pred cc--cCCcccccccccc-ccCC-eEEEEECCEEEEEECCCCcEEEEEeccC-------cceee--eeEEE-EecCCEEEE
Q 003800 128 KH--SKPLLLVPTNLKV-DKDS-LILVSSKGCLHAVSSIDGEILWTRDFAA-------ESVEV--QQVIQ-LDESDQIYV 193 (794)
Q Consensus 128 ~~--s~~~~~~~~~~~~-~~~~-~V~V~~~g~l~ald~~tG~~~W~~~~~~-------~~~~~--~~~v~-s~~~~~vyv 193 (794)
.. ..++.+.. + .. ..++ .+|+...|.|+.+|.....+.|...... ..+.| .|++. ....+.+|+
T Consensus 177 ~F~~~~dp~f~~-~-~~~~~~~~~~F~Sy~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyv 254 (342)
T PF06433_consen 177 VFDPDDDPLFEH-P-AYSRDGGRLYFVSYEGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYV 254 (342)
T ss_dssp ESSTTTS-B-S----EEETTTTEEEEEBTTSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEE
T ss_pred ccCCCCcccccc-c-ceECCCCeEEEEecCCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEE
Confidence 11 12222211 0 11 1123 3444469999999988777665433211 12222 22221 235789998
Q ss_pred EEecCC------ceeEEEEEEcCCCceeeeeeeecccCccCceEEE--cCcEEEEEECCCCeEEEEEeecce
Q 003800 194 VGYAGS------SQFHAYQINAMNGELLNHETAAFSGGFVGDVALV--SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 194 ~~~~g~------~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v--g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+-..|. ..-.|..+|++|++++-...+..+.. ++-+. ....+++++..++.|.+.|..+|+
T Consensus 255 LMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l~~~~~---Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tGk 323 (342)
T PF06433_consen 255 LMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPLEHPID---SIAVSQDDKPLLYALSAGDGTLDVYDAATGK 323 (342)
T ss_dssp EEEE--TT-TTS-EEEEEEEETTTTEEEEEEEEEEEES---EEEEESSSS-EEEEEETTTTEEEEEETTT--
T ss_pred EecCCCCCCccCCceEEEEEECCCCeEEEEEeCCCccc---eEEEccCCCcEEEEEcCCCCeEEEEeCcCCc
Confidence 765552 24789999999999998776544421 12221 223777888777899999999998
No 85
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=89.78 E-value=30 Score=41.02 Aligned_cols=180 Identities=15% Similarity=0.179 Sum_probs=114.3
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
++.++.++.++.|-..|..+|..+=+.-.+..+.+.++....+++.++-++.+.++|-||..+|.-.=.........
T Consensus 218 ~~~~~~~s~~~tl~~~~~~~~~~i~~~l~GH~g~V~~l~~~~~~~~lvsgS~D~t~rvWd~~sg~C~~~l~gh~stv--- 294 (537)
T KOG0274|consen 218 DGFFKSGSDDSTLHLWDLNNGYLILTRLVGHFGGVWGLAFPSGGDKLVSGSTDKTERVWDCSTGECTHSLQGHTSSV--- 294 (537)
T ss_pred cCeEEecCCCceeEEeecccceEEEeeccCCCCCceeEEEecCCCEEEEEecCCcEEeEecCCCcEEEEecCCCceE---
Confidence 77788999999999999999987766555544555555444456666655557899999999998877766554432
Q ss_pred ccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC
Q 003800 133 LLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM 210 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~ 210 (794)
..+ ...+.+.+. .|.+|.+-+..+|+.+=....... +-..+ ....+.++..+.+| .+-..|+.
T Consensus 295 -~~~------~~~~~~~~sgs~D~tVkVW~v~n~~~l~l~~~h~~---~V~~v-~~~~~~lvsgs~d~----~v~VW~~~ 359 (537)
T KOG0274|consen 295 -RCL------TIDPFLLVSGSRDNTVKVWDVTNGACLNLLRGHTG---PVNCV-QLDEPLLVSGSYDG----TVKVWDPR 359 (537)
T ss_pred -EEE------EccCceEeeccCCceEEEEeccCcceEEEeccccc---cEEEE-EecCCEEEEEecCc----eEEEEEhh
Confidence 111 113344444 377777777777766655542111 11222 23567777777666 68888999
Q ss_pred CCceeeeeeeecccCccC--ceEEEcC-cEEEEEECCCCeEEEEEeecc
Q 003800 211 NGELLNHETAAFSGGFVG--DVALVSS-DTLVTLDTTRSILVTVSFKNR 256 (794)
Q Consensus 211 tG~~~w~~~v~~~~~~s~--~~~~vg~-~~lv~~d~~~g~L~v~~l~sg 256 (794)
+|+.+...+- -++ .++++++ +.++-... ++.+.+=|+.++
T Consensus 360 ~~~cl~sl~g-----H~~~V~sl~~~~~~~~~Sgs~-D~~IkvWdl~~~ 402 (537)
T KOG0274|consen 360 TGKCLKSLSG-----HTGRVYSLIVDSENRLLSGSL-DTTIKVWDLRTK 402 (537)
T ss_pred hceeeeeecC-----CcceEEEEEecCcceEEeeee-ccceEeecCCch
Confidence 9988876642 112 2344565 55554443 366788888776
No 86
>PF08450 SGL: SMP-30/Gluconolactonase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE.; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=89.70 E-value=28 Score=36.22 Aligned_cols=142 Identities=14% Similarity=0.156 Sum_probs=76.4
Q ss_pred CCEEEEEeC-CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 53 RKRVVVSTE-ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 53 ~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
++.+|+.+- .+.|..+|+++|+.. ...++. ..++.....++.++++..+ .++.+|..+|+..--........
T Consensus 11 ~g~l~~~D~~~~~i~~~~~~~~~~~-~~~~~~---~~G~~~~~~~g~l~v~~~~-~~~~~d~~~g~~~~~~~~~~~~~-- 83 (246)
T PF08450_consen 11 DGRLYWVDIPGGRIYRVDPDTGEVE-VIDLPG---PNGMAFDRPDGRLYVADSG-GIAVVDPDTGKVTVLADLPDGGV-- 83 (246)
T ss_dssp TTEEEEEETTTTEEEEEETTTTEEE-EEESSS---EEEEEEECTTSEEEEEETT-CEEEEETTTTEEEEEEEEETTCS--
T ss_pred CCEEEEEEcCCCEEEEEECCCCeEE-EEecCC---CceEEEEccCCEEEEEEcC-ceEEEecCCCcEEEEeeccCCCc--
Confidence 677888874 789999999888652 222332 2233223234666666654 45666999996554444321110
Q ss_pred CccccccccccccCCeEEEEE--C--------CEEEEEECCCCcEEEEEe-ccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800 132 PLLLVPTNLKVDKDSLILVSS--K--------GCLHAVSSIDGEILWTRD-FAAESVEVQQVIQLDESDQIYVVGYAGSS 200 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~~--~--------g~l~ald~~tG~~~W~~~-~~~~~~~~~~~v~s~~~~~vyv~~~~g~~ 200 (794)
....+-+...+.++.+++.. . |.|++++.. |++..... ... +-.+..+.++..+|+.....+
T Consensus 84 -~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~~~~----pNGi~~s~dg~~lyv~ds~~~- 156 (246)
T PF08450_consen 84 -PFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADGLGF----PNGIAFSPDGKTLYVADSFNG- 156 (246)
T ss_dssp -CTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEEESS----EEEEEEETTSSEEEEEETTTT-
T ss_pred -ccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecCccc----ccceEECCcchheeecccccc-
Confidence 01111122344467788853 1 789999988 76443332 222 223332345567887554433
Q ss_pred eeEEEEEEcC
Q 003800 201 QFHAYQINAM 210 (794)
Q Consensus 201 ~~~v~ald~~ 210 (794)
++..++..
T Consensus 157 --~i~~~~~~ 164 (246)
T PF08450_consen 157 --RIWRFDLD 164 (246)
T ss_dssp --EEEEEEEE
T ss_pred --eeEEEecc
Confidence 56666664
No 87
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=89.58 E-value=28 Score=38.34 Aligned_cols=232 Identities=15% Similarity=0.195 Sum_probs=112.9
Q ss_pred EEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800 55 RVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL 134 (794)
Q Consensus 55 ~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~ 134 (794)
-+|-+.+++.|-|-|.+.-+++=.+- +.-..+.++...-..++++-++.+..+|.||..+-..+-......... ....
T Consensus 207 YlFs~gedk~VKCwDLe~nkvIR~Yh-GHlS~V~~L~lhPTldvl~t~grDst~RvWDiRtr~~V~~l~GH~~~V-~~V~ 284 (460)
T KOG0285|consen 207 YLFSAGEDKQVKCWDLEYNKVIRHYH-GHLSGVYCLDLHPTLDVLVTGGRDSTIRVWDIRTRASVHVLSGHTNPV-ASVM 284 (460)
T ss_pred eEEEecCCCeeEEEechhhhhHHHhc-cccceeEEEeccccceeEEecCCcceEEEeeecccceEEEecCCCCcc-eeEE
Confidence 37788888899999888765432210 000112233222234566656667899999998877776666443322 1111
Q ss_pred ccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCc
Q 003800 135 LVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGE 213 (794)
Q Consensus 135 ~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~ 213 (794)
.- ..+..|+-.+ |+++.--|...|+..=+........ .-+. ..-....|+.+... .+-+.+.-.|+
T Consensus 285 ~~------~~dpqvit~S~D~tvrlWDl~agkt~~tlt~hkksv--ral~-lhP~e~~fASas~d----nik~w~~p~g~ 351 (460)
T KOG0285|consen 285 CQ------PTDPQVITGSHDSTVRLWDLRAGKTMITLTHHKKSV--RALC-LHPKENLFASASPD----NIKQWKLPEGE 351 (460)
T ss_pred ee------cCCCceEEecCCceEEEeeeccCceeEeeeccccee--eEEe-cCCchhhhhccCCc----cceeccCCccc
Confidence 11 1145555554 7787777777776654433322111 1111 01111233222111 45566666666
Q ss_pred eeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecceeeeEEEeecccCCCCCCceEEeecCCcceeEEEec
Q 003800 214 LLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRKIAFQETHLSNLGEDSSGMVEILPSSLTGMFTVKIN 292 (794)
Q Consensus 214 ~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 292 (794)
.+-.. +....+-. ++-+ .+++++.. .++|.+..-|-++|. .+|... ...++++. +- -.|.|.--.+
T Consensus 352 f~~nl--sgh~~iin-tl~~nsD~v~~~G-~dng~~~fwdwksg~-nyQ~~~--t~vqpGSl--~s----EagI~as~fD 418 (460)
T KOG0285|consen 352 FLQNL--SGHNAIIN-TLSVNSDGVLVSG-GDNGSIMFWDWKSGH-NYQRGQ--TIVQPGSL--ES----EAGIFASCFD 418 (460)
T ss_pred hhhcc--ccccceee-eeeeccCceEEEc-CCceEEEEEecCcCc-cccccc--ccccCCcc--cc----ccceeEEeec
Confidence 55441 22211111 2222 33444443 357889888888887 444331 11111110 00 0122333222
Q ss_pred C-cEEEEEEecCCcEEEEEeecC
Q 003800 293 N-YKLFIRLTSEDKLEVVHKVDH 314 (794)
Q Consensus 293 ~-~~~l~~~~~~~~~~v~~~~~~ 314 (794)
. +.-|+.-+.+..+++++.++.
T Consensus 419 ktg~rlit~eadKtIk~~keDe~ 441 (460)
T KOG0285|consen 419 KTGSRLITGEADKTIKMYKEDEH 441 (460)
T ss_pred ccCceEEeccCCcceEEEecccc
Confidence 2 334555554556777776653
No 88
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=89.56 E-value=6 Score=43.16 Aligned_cols=112 Identities=15% Similarity=0.240 Sum_probs=66.4
Q ss_pred CCCEEEEEeC-CCEEEEEECcCCccceEEEcCccc-------cee---eeeee---eCCEEEEE-Ec----------cCC
Q 003800 52 GRKRVVVSTE-ENVIASLDLRHGEIFWRHVLGIND-------VVD---GIDIA---LGKYVITL-SS----------DGS 106 (794)
Q Consensus 52 ~~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~-------~i~---~l~~~---~g~~~V~V-s~----------~g~ 106 (794)
.++.+++.++ ...|+.+|++||+++||..=+... ... -.+.. .+++.+.+ =. ..+
T Consensus 153 ~~G~yLiS~R~~~~i~~I~~~tG~I~W~lgG~~~~df~~~~~~f~~QHdar~~~~~~~~~~IslFDN~~~~~~~~~~s~~ 232 (299)
T PF14269_consen 153 DDGDYLISSRNTSTIYKIDPSTGKIIWRLGGKRNSDFTLPATNFSWQHDARFLNESNDDGTISLFDNANSDFNGTEPSRG 232 (299)
T ss_pred CCccEEEEecccCEEEEEECCCCcEEEEeCCCCCCcccccCCcEeeccCCEEeccCCCCCEEEEEcCCCCCCCCCcCCCc
Confidence 3556777776 589999999999999997433110 010 00111 12333332 11 236
Q ss_pred eEEEEeCCCCcEeEEEecc-Cc-cc----cCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEe
Q 003800 107 TLRAWNLPDGQMVWESFLR-GS-KH----SKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRD 171 (794)
Q Consensus 107 ~v~A~d~~tG~llWe~~l~-~~-~~----s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~ 171 (794)
.+..+|..+.+..|..... .+ .. +.....++ .+.++|. ..+++.-++ .+|+++|++.
T Consensus 233 ~v~~ld~~~~~~~~~~~~~~~~~~~~s~~~G~~Q~L~-------nGn~li~~g~~g~~~E~~-~~G~vv~~~~ 297 (299)
T PF14269_consen 233 LVLELDPETMTVTLVREYSDHPDGFYSPSQGSAQRLP-------NGNVLIGWGNNGRISEFT-PDGEVVWEAQ 297 (299)
T ss_pred eEEEEECCCCEEEEEEEeecCCCcccccCCCcceECC-------CCCEEEecCCCceEEEEC-CCCCEEEEEE
Confidence 8999999988776666554 11 11 11122222 4667775 378888887 6899999975
No 89
>PRK04922 tolB translocation protein TolB; Provisional
Probab=88.03 E-value=55 Score=37.46 Aligned_cols=150 Identities=14% Similarity=0.091 Sum_probs=74.3
Q ss_pred CCCEEEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEc-cC-CeEEEEeCCCCcEeEEEecc
Q 003800 52 GRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSS-DG-STLRAWNLPDGQMVWESFLR 125 (794)
Q Consensus 52 ~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~-~g-~~v~A~d~~tG~llWe~~l~ 125 (794)
+++.|++.+. ...|+.+|.++|+..--..+ ++........ .|+.+++... +| ..++.||..+|+.. +....
T Consensus 214 Dg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~~--~g~~~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~~g~~~-~lt~~ 290 (433)
T PRK04922 214 DGKKLAYVSFERGRSAIYVQDLATGQRELVASF--RGINGAPSFSPDGRRLALTLSRDGNPEIYVMDLGSRQLT-RLTNH 290 (433)
T ss_pred CCCEEEEEecCCCCcEEEEEECCCCCEEEeccC--CCCccCceECCCCCEEEEEEeCCCCceEEEEECCCCCeE-ECccC
Confidence 4556666553 34799999999875322112 2111112112 3555555432 22 47999999998753 21111
Q ss_pred CccccCCccccccccccccCCeEEEEE--C--CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCce
Q 003800 126 GSKHSKPLLLVPTNLKVDKDSLILVSS--K--GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQ 201 (794)
Q Consensus 126 ~~~~s~~~~~~~~~~~~~~~~~V~V~~--~--g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~ 201 (794)
.... ..+.+. .+ ++.+++.+ + ..++.+|..+|+..--....... ..+..+.++..+++.+..+ ..
T Consensus 291 ~~~~-~~~~~s-----pD-G~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g~~~---~~~~~SpDG~~Ia~~~~~~-~~ 359 (433)
T PRK04922 291 FGID-TEPTWA-----PD-GKSIYFTSDRGGRPQIYRVAASGGSAERLTFQGNYN---ARASVSPDGKKIAMVHGSG-GQ 359 (433)
T ss_pred CCCc-cceEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCeEEeecCCCCc---cCEEECCCCCEEEEEECCC-Cc
Confidence 1111 111121 22 33444443 2 35889998888654221111111 1122234556666554432 23
Q ss_pred eEEEEEEcCCCcee
Q 003800 202 FHAYQINAMNGELL 215 (794)
Q Consensus 202 ~~v~ald~~tG~~~ 215 (794)
..+..+|+.+|+..
T Consensus 360 ~~I~v~d~~~g~~~ 373 (433)
T PRK04922 360 YRIAVMDLSTGSVR 373 (433)
T ss_pred eeEEEEECCCCCeE
Confidence 46888899888765
No 90
>PRK04792 tolB translocation protein TolB; Provisional
Probab=88.01 E-value=57 Score=37.63 Aligned_cols=148 Identities=11% Similarity=0.099 Sum_probs=74.0
Q ss_pred cCCCEEEEEe-CC--CEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEccC--CeEEEEeCCCCcEeEEEec
Q 003800 51 TGRKRVVVST-EE--NVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSDG--STLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 51 ~~~~~Vyv~t-~~--g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~g--~~v~A~d~~tG~llWe~~l 124 (794)
+++++|+..+ ++ ..|+.+|..+|+.. +....++....... ..|+.+++.+..+ ..++.+|..+|++. .+
T Consensus 227 PDG~~La~~s~~~g~~~L~~~dl~tg~~~--~lt~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~~---~l 301 (448)
T PRK04792 227 PDGRKLAYVSFENRKAEIFVQDIYTQVRE--KVTSFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKALT---RI 301 (448)
T ss_pred CCCCEEEEEEecCCCcEEEEEECCCCCeE--EecCCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCeE---EC
Confidence 3455555544 32 47999999998752 22111111111111 2455566654332 36999999888742 22
Q ss_pred cCcc-ccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEEE-EEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800 125 RGSK-HSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEILW-TRDFAAESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 125 ~~~~-~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~W-~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
.... ....+.+. .+ ++.+++.+ ...++.+|..+|+..- ++.... .. ....+.++..+++.+..+
T Consensus 302 t~~~~~~~~p~wS-----pD-G~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g~~-~~---~~~~SpDG~~l~~~~~~~ 371 (448)
T PRK04792 302 TRHRAIDTEPSWH-----PD-GKSLIFTSERGGKPQIYRVNLASGKVSRLTFEGEQ-NL---GGSITPDGRSMIMVNRTN 371 (448)
T ss_pred ccCCCCccceEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCEEEEecCCCC-Cc---CeeECCCCCEEEEEEecC
Confidence 2111 10111111 22 23444433 3479999998887532 211111 11 111134566676655443
Q ss_pred CceeEEEEEEcCCCce
Q 003800 199 SSQFHAYQINAMNGEL 214 (794)
Q Consensus 199 ~~~~~v~ald~~tG~~ 214 (794)
....++.+|+.+|+.
T Consensus 372 -g~~~I~~~dl~~g~~ 386 (448)
T PRK04792 372 -GKFNIARQDLETGAM 386 (448)
T ss_pred -CceEEEEEECCCCCe
Confidence 235788899999875
No 91
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=87.35 E-value=27 Score=37.34 Aligned_cols=194 Identities=9% Similarity=0.056 Sum_probs=108.9
Q ss_pred CCEEEEEeCCCEEEEEECcCCcc-ceEEEcCccc-ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcE-eEEEeccCccc
Q 003800 53 RKRVVVSTEENVIASLDLRHGEI-FWRHVLGIND-VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQM-VWESFLRGSKH 129 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~i-vWR~~l~~~~-~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~l-lWe~~l~~~~~ 129 (794)
++...+....+.|..||++|++. .|...++... +.... +-...+.+-.++..+.-=-+|+.++.+ +|.........
T Consensus 114 dg~~Witd~~~aI~R~dpkt~evt~f~lp~~~a~~nlet~-vfD~~G~lWFt~q~G~yGrLdPa~~~i~vfpaPqG~gpy 192 (353)
T COG4257 114 DGSAWITDTGLAIGRLDPKTLEVTRFPLPLEHADANLETA-VFDPWGNLWFTGQIGAYGRLDPARNVISVFPAPQGGGPY 192 (353)
T ss_pred CCCeeEecCcceeEEecCcccceEEeecccccCCCcccce-eeCCCccEEEeeccccceecCcccCceeeeccCCCCCCc
Confidence 44455555556899999999864 3433333221 22222 234555554444322333578887754 56666444332
Q ss_pred cCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcce-eeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800 130 SKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESV-EVQQVIQLDESDQIYVVGYAGSSQFHAYQ 206 (794)
Q Consensus 130 s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~-~~~~~v~s~~~~~vyv~~~~g~~~~~v~a 206 (794)
.++..+ ++.|++. .+..+.++|..+|..- ....|.+.- ..+++ .+...+.++.....++ .++.
T Consensus 193 --Gi~atp-------dGsvwyaslagnaiaridp~~~~ae-v~p~P~~~~~gsRri-wsdpig~~wittwg~g---~l~r 258 (353)
T COG4257 193 --GICATP-------DGSVWYASLAGNAIARIDPFAGHAE-VVPQPNALKAGSRRI-WSDPIGRAWITTWGTG---SLHR 258 (353)
T ss_pred --ceEECC-------CCcEEEEeccccceEEcccccCCcc-eecCCCccccccccc-ccCccCcEEEeccCCc---eeeE
Confidence 333333 6778876 4889999999999421 112222100 01112 1234567776433333 7899
Q ss_pred EEcCCCceeeeeeeeccc-CccCceEEEcCcEEEEE-ECCCCeEEEEEeecceeeeEEEeec
Q 003800 207 INAMNGELLNHETAAFSG-GFVGDVALVSSDTLVTL-DTTRSILVTVSFKNRKIAFQETHLS 266 (794)
Q Consensus 207 ld~~tG~~~w~~~v~~~~-~~s~~~~~vg~~~lv~~-d~~~g~L~v~~l~sg~~~~~~~~l~ 266 (794)
+|+.+-. |+.= ..|. ....-.+.|.+.-.||+ |.+.|.|+..|-++.+ +..+|+.
T Consensus 259 fdPs~~s--W~ey-pLPgs~arpys~rVD~~grVW~sea~agai~rfdpeta~--ftv~p~p 315 (353)
T COG4257 259 FDPSVTS--WIEY-PLPGSKARPYSMRVDRHGRVWLSEADAGAIGRFDPETAR--FTVLPIP 315 (353)
T ss_pred eCccccc--ceee-eCCCCCCCcceeeeccCCcEEeeccccCceeecCcccce--EEEecCC
Confidence 9998765 7541 2232 22223455666556777 7778889999888876 7777764
No 92
>PRK03629 tolB translocation protein TolB; Provisional
Probab=87.05 E-value=62 Score=37.04 Aligned_cols=151 Identities=14% Similarity=0.056 Sum_probs=73.0
Q ss_pred cCCCEEEEEe---CCCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEc-cC-CeEEEEeCCCCcEeEEEec
Q 003800 51 TGRKRVVVST---EENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSS-DG-STLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 51 ~~~~~Vyv~t---~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~-~g-~~v~A~d~~tG~llWe~~l 124 (794)
++++++.+.+ ....|+.+|.++|+..--..++.. ...... ..|+.+++++. .+ ..++.||..+|++.=-..
T Consensus 208 PDG~~la~~s~~~g~~~i~i~dl~~G~~~~l~~~~~~--~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~- 284 (429)
T PRK03629 208 PDGSKLAYVTFESGRSALVIQTLANGAVRQVASFPRH--NGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTD- 284 (429)
T ss_pred CCCCEEEEEEecCCCcEEEEEECCCCCeEEccCCCCC--cCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccC-
Confidence 3455555443 245788899988874322122211 111111 24555666533 22 369999999887642111
Q ss_pred cCccccCCccccccccccccCCeEEEEE--C--CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800 125 RGSKHSKPLLLVPTNLKVDKDSLILVSS--K--GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSS 200 (794)
Q Consensus 125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~--g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~ 200 (794)
..... ..+... .+ ++.+++.+ + -.++.+|..+|+..--.... ... .....+.++..+++.+..++
T Consensus 285 ~~~~~-~~~~wS-----PD-G~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~~~-~~~--~~~~~SpDG~~Ia~~~~~~g- 353 (429)
T PRK03629 285 GRSNN-TEPTWF-----PD-SQNLAYTSDQAGRPQVYKVNINGGAPQRITWEG-SQN--QDADVSSDGKFMVMVSSNGG- 353 (429)
T ss_pred CCCCc-CceEEC-----CC-CCEEEEEeCCCCCceEEEEECCCCCeEEeecCC-CCc--cCEEECCCCCEEEEEEccCC-
Confidence 11111 111122 22 23344433 2 27888998888654221111 111 11111334555555444432
Q ss_pred eeEEEEEEcCCCcee
Q 003800 201 QFHAYQINAMNGELL 215 (794)
Q Consensus 201 ~~~v~ald~~tG~~~ 215 (794)
...++.+|+.+|+..
T Consensus 354 ~~~I~~~dl~~g~~~ 368 (429)
T PRK03629 354 QQHIAKQDLATGGVQ 368 (429)
T ss_pred CceEEEEECCCCCeE
Confidence 246788899998743
No 93
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=86.61 E-value=5.9 Score=42.60 Aligned_cols=184 Identities=18% Similarity=0.221 Sum_probs=97.5
Q ss_pred CCEEEEEECcCCcc-ceEEEcCcc---------cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 62 ENVIASLDLRHGEI-FWRHVLGIN---------DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 62 ~g~l~ALn~~tG~i-vWR~~l~~~---------~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
+....|--+.||++ +||...+.- ..+.+++...++..+.-++.+..+|.--..+|+.+=|++..+.-. .
T Consensus 274 DsEMlAsGsqDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsSyv-n 352 (508)
T KOG0275|consen 274 DSEMLASGSQDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKCLKEFRGHSSYV-N 352 (508)
T ss_pred cHHHhhccCcCCcEEEEEEecchHHHHhhhhhccCeeEEEEccCcchhhcccccceEEEeccccchhHHHhcCccccc-c
Confidence 34444444555654 577654321 123344333333334334557789999999999999999876543 2
Q ss_pred CccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC
Q 003800 132 PLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM 210 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~ 210 (794)
.+.+.+ + ++.++-. +||++..-+.+|++-+=+++..........++....+-.-++++...+ .++..+.
T Consensus 353 ~a~ft~-----d-G~~iisaSsDgtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCNrsn---tv~imn~- 422 (508)
T KOG0275|consen 353 EATFTD-----D-GHHIISASSDGTVKVWHGKTTECLSTFKPLGTDYPVNSVILLPKNPEHFIVCNRSN---TVYIMNM- 422 (508)
T ss_pred ceEEcC-----C-CCeEEEecCCccEEEecCcchhhhhhccCCCCcccceeEEEcCCCCceEEEEcCCC---eEEEEec-
Confidence 322222 2 3444444 599999999999888777766554432222222112222233333222 3444443
Q ss_pred CCceeeeeeeecc--cCccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800 211 NGELLNHETAAFS--GGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 211 tG~~~w~~~v~~~--~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
.|+.+....-+-. .++-..++-.-+..++|+- ..+.|+.....+|+
T Consensus 423 qGQvVrsfsSGkREgGdFi~~~lSpkGewiYcig-ED~vlYCF~~~sG~ 470 (508)
T KOG0275|consen 423 QGQVVRSFSSGKREGGDFINAILSPKGEWIYCIG-EDGVLYCFSVLSGK 470 (508)
T ss_pred cceEEeeeccCCccCCceEEEEecCCCcEEEEEc-cCcEEEEEEeecCc
Confidence 4555544421111 1111122222344666775 45788888888887
No 94
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=86.55 E-value=14 Score=44.07 Aligned_cols=173 Identities=12% Similarity=0.124 Sum_probs=94.6
Q ss_pred CCEEEEEeCC-------CEEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEccC------CeEEEEeCCCCc
Q 003800 53 RKRVVVSTEE-------NVIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSDG------STLRAWNLPDGQ 117 (794)
Q Consensus 53 ~~~Vyv~t~~-------g~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~g------~~v~A~d~~tG~ 117 (794)
.+.+|+.... ..+-++|++++ .|+...+-+. .-.+. +..++.+.++||.+ ..+.-+|+.+++
T Consensus 284 ~~~l~~vGG~~~~~~~~~~ve~yd~~~~--~w~~~a~m~~~r~~~~~-~~~~~~lYv~GG~~~~~~~l~~ve~YD~~~~~ 360 (571)
T KOG4441|consen 284 SGKLVAVGGYNRQGQSLRSVECYDPKTN--EWSSLAPMPSPRCRVGV-AVLNGKLYVVGGYDSGSDRLSSVERYDPRTNQ 360 (571)
T ss_pred CCeEEEECCCCCCCcccceeEEecCCcC--cEeecCCCCcccccccE-EEECCEEEEEccccCCCcccceEEEecCCCCc
Confidence 4556665542 46889999999 6877654442 11122 34566666667755 568899999888
Q ss_pred EeEEEeccCccccCCccccccccccccCCeEEEEE--CC-----EEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCE
Q 003800 118 MVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KG-----CLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQ 190 (794)
Q Consensus 118 llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g-----~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~ 190 (794)
|..-..-..... ..+ ...-++.+|+.+ +| .+-++|+.+ -.|+...+.... ....-...-++.
T Consensus 361 --W~~~a~M~~~R~---~~~---v~~l~g~iYavGG~dg~~~l~svE~YDp~~--~~W~~va~m~~~-r~~~gv~~~~g~ 429 (571)
T KOG4441|consen 361 --WTPVAPMNTKRS---DFG---VAVLDGKLYAVGGFDGEKSLNSVECYDPVT--NKWTPVAPMLTR-RSGHGVAVLGGK 429 (571)
T ss_pred --eeccCCccCccc---cce---eEEECCEEEEEeccccccccccEEEecCCC--CcccccCCCCcc-eeeeEEEEECCE
Confidence 886322111101 111 111145666643 22 355555543 468887765432 112111356899
Q ss_pred EEEEEecCCce---eEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800 191 IYVVGYAGSSQ---FHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL 241 (794)
Q Consensus 191 vyv~~~~g~~~---~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~ 241 (794)
+|++|...+.. -.+-++|+.|++ |...-.++....+ .+...++.+|++.
T Consensus 430 iYi~GG~~~~~~~l~sve~YDP~t~~--W~~~~~M~~~R~~~g~a~~~~~iYvvG 482 (571)
T KOG4441|consen 430 LYIIGGGDGSSNCLNSVECYDPETNT--WTLIAPMNTRRSGFGVAVLNGKIYVVG 482 (571)
T ss_pred EEEEcCcCCCccccceEEEEcCCCCc--eeecCCcccccccceEEEECCEEEEEC
Confidence 99987643222 568899998874 6654333322222 2333355555554
No 95
>PRK00178 tolB translocation protein TolB; Provisional
Probab=86.39 E-value=65 Score=36.61 Aligned_cols=148 Identities=14% Similarity=0.093 Sum_probs=74.2
Q ss_pred cCCCEEEEEeCC---CEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEc-c-CCeEEEEeCCCCcEeEEEec
Q 003800 51 TGRKRVVVSTEE---NVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSS-D-GSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 51 ~~~~~Vyv~t~~---g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~-~-g~~v~A~d~~tG~llWe~~l 124 (794)
+++++|++.+.+ ..|+.+|.++|+.. +.....+........ .|+.+++... . ...++.+|..+|+..- +
T Consensus 208 pDG~~la~~s~~~~~~~l~~~~l~~g~~~--~l~~~~g~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~---l 282 (430)
T PRK00178 208 PDGKRIAYVSFEQKRPRIFVQNLDTGRRE--QITNFEGLNGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQLSR---V 282 (430)
T ss_pred CCCCEEEEEEcCCCCCEEEEEECCCCCEE--EccCCCCCcCCeEECCCCCEEEEEEccCCCceEEEEECCCCCeEE---c
Confidence 345566554432 47899999988752 222112111111112 4555555433 2 2479999999887531 2
Q ss_pred cCcc-ccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEE-EEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800 125 RGSK-HSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEIL-WTRDFAAESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 125 ~~~~-~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~-W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
.... ....+.+. .+ ++.+++.+ ...++.+|..+|+.. .+... ... .....+.++..+++....+
T Consensus 283 t~~~~~~~~~~~s-----pD-g~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~--~~~--~~~~~Spdg~~i~~~~~~~ 352 (430)
T PRK00178 283 TNHPAIDTEPFWG-----KD-GRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVG--NYN--ARPRLSADGKTLVMVHRQD 352 (430)
T ss_pred ccCCCCcCCeEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCEEEeecCC--CCc--cceEECCCCCEEEEEEccC
Confidence 2111 10111111 22 34444443 347999999888753 22211 111 1111134566666655433
Q ss_pred CceeEEEEEEcCCCce
Q 003800 199 SSQFHAYQINAMNGEL 214 (794)
Q Consensus 199 ~~~~~v~ald~~tG~~ 214 (794)
+ ...++.+|+.+|+.
T Consensus 353 ~-~~~l~~~dl~tg~~ 367 (430)
T PRK00178 353 G-NFHVAAQDLQRGSV 367 (430)
T ss_pred C-ceEEEEEECCCCCE
Confidence 2 34688899999875
No 96
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=86.03 E-value=14 Score=42.04 Aligned_cols=113 Identities=17% Similarity=0.252 Sum_probs=71.8
Q ss_pred EEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec--cCccccCC
Q 003800 55 RVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL--RGSKHSKP 132 (794)
Q Consensus 55 ~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l--~~~~~s~~ 132 (794)
.++.++.+|.|---|.++=. -|-..++.+.++... .....+..+++..|+.|+.||..+|..+=-... ...+. .
T Consensus 168 ivvtGsYDg~vrl~DtR~~~-~~v~elnhg~pVe~v-l~lpsgs~iasAgGn~vkVWDl~~G~qll~~~~~H~KtVT--c 243 (487)
T KOG0310|consen 168 IVVTGSYDGKVRLWDTRSLT-SRVVELNHGCPVESV-LALPSGSLIASAGGNSVKVWDLTTGGQLLTSMFNHNKTVT--C 243 (487)
T ss_pred EEEecCCCceEEEEEeccCC-ceeEEecCCCceeeE-EEcCCCCEEEEcCCCeEEEEEecCCceehhhhhcccceEE--E
Confidence 47788889999999988865 788888776555533 234444444465578999999997755433322 12111 1
Q ss_pred ccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcce
Q 003800 133 LLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESV 177 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~ 177 (794)
+.+.. + +..++-.+ |+.|-.+|..+=+++-.+..++|-+
T Consensus 244 L~l~s-----~-~~rLlS~sLD~~VKVfd~t~~Kvv~s~~~~~pvL 283 (487)
T KOG0310|consen 244 LRLAS-----D-STRLLSGSLDRHVKVFDTTNYKVVHSWKYPGPVL 283 (487)
T ss_pred EEeec-----C-CceEeecccccceEEEEccceEEEEeeeccccee
Confidence 11111 1 23333344 9999999988888887777777654
No 97
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=85.87 E-value=20 Score=41.66 Aligned_cols=106 Identities=15% Similarity=0.144 Sum_probs=64.2
Q ss_pred CCeEEEEE--CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeec
Q 003800 145 DSLILVSS--KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAF 222 (794)
Q Consensus 145 ~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~ 222 (794)
+-.++|++ .|.+..++...|++.|+...+...-..-.+..+...+-+|-++.+ .++.-++.++++.+-.....-
T Consensus 69 ~t~~lvlgt~~g~v~~ys~~~g~it~~~st~~h~~~v~~~~~~~~~~ciyS~~ad----~~v~~~~~~~~~~~~~~~~~~ 144 (541)
T KOG4547|consen 69 DTSMLVLGTPQGSVLLYSVAGGEITAKLSTDKHYGNVNEILDAQRLGCIYSVGAD----LKVVYILEKEKVIIRIWKEQK 144 (541)
T ss_pred CceEEEeecCCccEEEEEecCCeEEEEEecCCCCCcceeeecccccCceEecCCc----eeEEEEecccceeeeeeccCC
Confidence 34456663 899999999999999999854432101111112234455644333 488889999998874443222
Q ss_pred ccCccCceEEEcCcEEEEEECCCCeEEEEEeeccee
Q 003800 223 SGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKI 258 (794)
Q Consensus 223 ~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~ 258 (794)
+ ..+.-|+...+.+++.+ .+++.++|++++++
T Consensus 145 ~-~~~sl~is~D~~~l~~a---s~~ik~~~~~~kev 176 (541)
T KOG4547|consen 145 P-LVSSLCISPDGKILLTA---SRQIKVLDIETKEV 176 (541)
T ss_pred C-ccceEEEcCCCCEEEec---cceEEEEEccCceE
Confidence 2 22233443333455544 46899999999884
No 98
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=84.67 E-value=56 Score=34.35 Aligned_cols=105 Identities=10% Similarity=0.125 Sum_probs=62.8
Q ss_pred EEeCCCEEEEEEC------cCCccceEEEcCccc------ceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800 58 VSTEENVIASLDL------RHGEIFWRHVLGIND------VVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 58 v~t~~g~l~ALn~------~tG~ivWR~~l~~~~------~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l 124 (794)
....+|.|++.-= .-=+.+|+...+... .|..+.+. ..+-+++.+| ++.++.||.+||+..-+++.
T Consensus 76 ls~gdG~V~gw~W~E~~es~~~K~lwe~~~P~~~~~~evPeINam~ldP~enSi~~AgG-D~~~y~~dlE~G~i~r~~rG 154 (325)
T KOG0649|consen 76 LSGGDGLVYGWEWNEEEESLATKRLWEVKIPMQVDAVEVPEINAMWLDPSENSILFAGG-DGVIYQVDLEDGRIQREYRG 154 (325)
T ss_pred eeccCceEEEeeehhhhhhccchhhhhhcCccccCcccCCccceeEeccCCCcEEEecC-CeEEEEEEecCCEEEEEEcC
Confidence 3334588888731 233567877665441 12222122 2333555455 57999999999999999998
Q ss_pred cCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEE
Q 003800 125 RGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTR 170 (794)
Q Consensus 125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~ 170 (794)
....+ - .+++ -...+.|+-. .||+++.-|.+|++-+=..
T Consensus 155 HtDYv-H--~vv~----R~~~~qilsG~EDGtvRvWd~kt~k~v~~i 194 (325)
T KOG0649|consen 155 HTDYV-H--SVVG----RNANGQILSGAEDGTVRVWDTKTQKHVSMI 194 (325)
T ss_pred Cccee-e--eeee----cccCcceeecCCCccEEEEeccccceeEEe
Confidence 76543 1 1111 1113455555 3899999999988755443
No 99
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=84.59 E-value=40 Score=36.27 Aligned_cols=152 Identities=11% Similarity=0.125 Sum_probs=94.7
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-EccCCeEEEEeCCCCcEeEEEeccCcccc
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SSDGSTLRAWNLPDGQMVWESFLRGSKHS 130 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~tG~llWe~~l~~~~~s 130 (794)
+++.|++++.+..-+--|.++|+..=-+.--. +.+-++.+.-.+.-.|| ++.+...+.||...|.-+=.+.......
T Consensus 155 dD~~ilT~SGD~TCalWDie~g~~~~~f~GH~-gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~c~qtF~ghesDI- 232 (343)
T KOG0286|consen 155 DDNHILTGSGDMTCALWDIETGQQTQVFHGHT-GDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQCVQTFEGHESDI- 232 (343)
T ss_pred CCCceEecCCCceEEEEEcccceEEEEecCCc-ccEEEEecCCCCCCeEEecccccceeeeeccCcceeEeeccccccc-
Confidence 37779999999999999999997654333211 22323322221333344 4557899999999998877777665444
Q ss_pred CCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEE
Q 003800 131 KPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQIN 208 (794)
Q Consensus 131 ~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald 208 (794)
.+..+.| ++.-|+. .|+.-..+|....+.+=.|+.+....-...+-.+.++..+|+ ++.. ......|
T Consensus 233 Nsv~ffP-------~G~afatGSDD~tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlLfa-gy~d---~~c~vWD 301 (343)
T KOG0286|consen 233 NSVRFFP-------SGDAFATGSDDATCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLLFA-GYDD---FTCNVWD 301 (343)
T ss_pred ceEEEcc-------CCCeeeecCCCceeEEEeecCCcEEeeeccCcccCCceeEEEcccccEEEe-eecC---CceeEee
Confidence 3444554 5666665 388889999998888877775543321222322334444553 4433 2677778
Q ss_pred cCCCceee
Q 003800 209 AMNGELLN 216 (794)
Q Consensus 209 ~~tG~~~w 216 (794)
...|++.-
T Consensus 302 tlk~e~vg 309 (343)
T KOG0286|consen 302 TLKGERVG 309 (343)
T ss_pred ccccceEE
Confidence 77776653
No 100
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=84.41 E-value=29 Score=39.10 Aligned_cols=119 Identities=19% Similarity=0.218 Sum_probs=70.6
Q ss_pred EEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E-CCEEEEEECC---CCcEEEEEe
Q 003800 97 YVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSI---DGEILWTRD 171 (794)
Q Consensus 97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~---tG~~~W~~~ 171 (794)
.+++-++.+.+|..||.++|+..=.....+... +.+..-+ . ...+++. + +++|...|.. .-...|++.
T Consensus 257 nVLaSgsaD~TV~lWD~~~g~p~~s~~~~~k~V-q~l~wh~-----~-~p~~LLsGs~D~~V~l~D~R~~~~s~~~wk~~ 329 (463)
T KOG0270|consen 257 NVLASGSADKTVKLWDVDTGKPKSSITHHGKKV-QTLEWHP-----Y-EPSVLLSGSYDGTVALKDCRDPSNSGKEWKFD 329 (463)
T ss_pred eeEEecCCCceEEEEEcCCCCcceehhhcCCce-eEEEecC-----C-CceEEEeccccceEEeeeccCccccCceEEec
Confidence 344434457899999999999998877555444 2333332 1 2334443 2 8888888877 344678886
Q ss_pred ccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC-CCceeeeeeeecccCccCceE
Q 003800 172 FAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM-NGELLNHETAAFSGGFVGDVA 231 (794)
Q Consensus 172 ~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~-tG~~~w~~~v~~~~~~s~~~~ 231 (794)
..-... ..-......|+++.+.| .|+-+|+. .|+++|+....- .++++-++
T Consensus 330 g~VEkv-----~w~~~se~~f~~~tddG---~v~~~D~R~~~~~vwt~~AHd-~~ISgl~~ 381 (463)
T KOG0270|consen 330 GEVEKV-----AWDPHSENSFFVSTDDG---TVYYFDIRNPGKPVWTLKAHD-DEISGLSV 381 (463)
T ss_pred cceEEE-----EecCCCceeEEEecCCc---eEEeeecCCCCCceeEEEecc-CCcceEEe
Confidence 543322 11122334454554433 78888876 579999986432 35555333
No 101
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=84.23 E-value=44 Score=38.68 Aligned_cols=158 Identities=14% Similarity=0.176 Sum_probs=84.4
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
+..|+-++.++.+..-|.++|+.+=....... .+.++.....+..++.++.++.++.||..+|..+ -........ ..
T Consensus 258 g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~-~is~~~f~~d~~~l~s~s~d~~i~vwd~~~~~~~-~~~~~~~~~-~~ 334 (456)
T KOG0266|consen 258 GNLLVSGSDDGTVRIWDVRTGECVRKLKGHSD-GISGLAFSPDGNLLVSASYDGTIRVWDLETGSKL-CLKLLSGAE-NS 334 (456)
T ss_pred CCEEEEecCCCcEEEEeccCCeEEEeeeccCC-ceEEEEECCCCCEEEEcCCCccEEEEECCCCcee-eeecccCCC-CC
Confidence 56788899999999999999876544333332 3444422233344444555789999999999954 111111000 01
Q ss_pred ccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcc-eeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC
Q 003800 133 LLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAES-VEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM 210 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~-~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~ 210 (794)
.++.-.....+ ...+++.. ++.+.-.|..+|...=++...... ........ ..++...+.+... ..+..+|..
T Consensus 335 ~~~~~~~fsp~-~~~ll~~~~d~~~~~w~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~i~sg~~d---~~v~~~~~~ 409 (456)
T KOG0266|consen 335 APVTSVQFSPN-GKYLLSASLDRTLKLWDLRSGKSVGTYTGHSNLVRCIFSPTL-STGGKLIYSGSED---GSVYVWDSS 409 (456)
T ss_pred CceeEEEECCC-CcEEEEecCCCeEEEEEccCCcceeeecccCCcceeEecccc-cCCCCeEEEEeCC---ceEEEEeCC
Confidence 01110000112 33444444 667777777777655444332221 11222322 2334433333332 278999999
Q ss_pred CCceeeee
Q 003800 211 NGELLNHE 218 (794)
Q Consensus 211 tG~~~w~~ 218 (794)
+|..+-..
T Consensus 410 s~~~~~~l 417 (456)
T KOG0266|consen 410 SGGILQRL 417 (456)
T ss_pred ccchhhhh
Confidence 88777554
No 102
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=83.76 E-value=63 Score=34.15 Aligned_cols=60 Identities=15% Similarity=0.296 Sum_probs=38.9
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLP 114 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~ 114 (794)
++-.|.++++|.+---|.+. +.=.+.+....++..+-+.-.+.-++++...+.||.||..
T Consensus 95 grWMyTgseDgt~kIWdlR~--~~~qR~~~~~spVn~vvlhpnQteLis~dqsg~irvWDl~ 154 (311)
T KOG0315|consen 95 GRWMYTGSEDGTVKIWDLRS--LSCQRNYQHNSPVNTVVLHPNQTELISGDQSGNIRVWDLG 154 (311)
T ss_pred CeEEEecCCCceEEEEeccC--cccchhccCCCCcceEEecCCcceEEeecCCCcEEEEEcc
Confidence 44499999999999888887 2222233322223333233456667767777899999984
No 103
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=83.06 E-value=79 Score=34.79 Aligned_cols=178 Identities=10% Similarity=0.072 Sum_probs=88.7
Q ss_pred CCEEEEEeCC------------CEEEEEECcCCccceEEEcC-cccceeeee-e-eeCCEEEEEEccC------------
Q 003800 53 RKRVVVSTEE------------NVIASLDLRHGEIFWRHVLG-INDVVDGID-I-ALGKYVITLSSDG------------ 105 (794)
Q Consensus 53 ~~~Vyv~t~~------------g~l~ALn~~tG~ivWR~~l~-~~~~i~~l~-~-~~g~~~V~Vs~~g------------ 105 (794)
++.||+.... +.+..+|+.+. .|+.... .+....+.. . ..++.+.+++|.+
T Consensus 63 ~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~--~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~ 140 (346)
T TIGR03547 63 DGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKN--SWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLS 140 (346)
T ss_pred CCEEEEEeCCCCCCCCCcceecccEEEEECCCC--EEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHh
Confidence 7789987753 24778888876 4987642 111111111 1 2455566666642
Q ss_pred ---------------------------CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC-----
Q 003800 106 ---------------------------STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK----- 153 (794)
Q Consensus 106 ---------------------------~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~----- 153 (794)
+.+..||+.+. .|+..-.-+.. +..... ....++.++|..+
T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~YDp~t~--~W~~~~~~p~~----~r~~~~-~~~~~~~iyv~GG~~~~~ 213 (346)
T TIGR03547 141 AADKDSEPKDKLIAAYFSQPPEDYFWNKNVLSYDPSTN--QWRNLGENPFL----GTAGSA-IVHKGNKLLLINGEIKPG 213 (346)
T ss_pred hcCccchhhhhhHHHHhCCChhHcCccceEEEEECCCC--ceeECccCCCC----cCCCce-EEEECCEEEEEeeeeCCC
Confidence 46888888765 58764322110 001100 1122567777531
Q ss_pred ---CEEEEEECCCCcEEEEEeccCcce--e-e---eeEEEEecCCEEEEEEecCC-------------------ceeEEE
Q 003800 154 ---GCLHAVSSIDGEILWTRDFAAESV--E-V---QQVIQLDESDQIYVVGYAGS-------------------SQFHAY 205 (794)
Q Consensus 154 ---g~l~ald~~tG~~~W~~~~~~~~~--~-~---~~~v~s~~~~~vyv~~~~g~-------------------~~~~v~ 205 (794)
..++.++.....-.|+.-.+.+.- . + .......-++.+|+++.... ..-.+.
T Consensus 214 ~~~~~~~~y~~~~~~~~W~~~~~m~~~r~~~~~~~~~~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e 293 (346)
T TIGR03547 214 LRTAEVKQYLFTGGKLEWNKLPPLPPPKSSSQEGLAGAFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSE 293 (346)
T ss_pred ccchheEEEEecCCCceeeecCCCCCCCCCccccccEEeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEee
Confidence 124556655566679865443220 0 0 01101245889999875310 001356
Q ss_pred EEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEE
Q 003800 206 QINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTL 241 (794)
Q Consensus 206 ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~ 241 (794)
++|+.+. .|+..-..|........+ +++.++++.
T Consensus 294 ~yd~~~~--~W~~~~~lp~~~~~~~~~~~~~~iyv~G 328 (346)
T TIGR03547 294 VYALDNG--KWSKVGKLPQGLAYGVSVSWNNGVLLIG 328 (346)
T ss_pred EEEecCC--cccccCCCCCCceeeEEEEcCCEEEEEe
Confidence 6777765 487754555444332222 244444433
No 104
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=82.98 E-value=63 Score=39.71 Aligned_cols=186 Identities=15% Similarity=0.110 Sum_probs=107.7
Q ss_pred CCCEEEEEeCCCEEEEEECcCCc---cceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 52 GRKRVVVSTEENVIASLDLRHGE---IFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~---ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
..+.+.++|+++.|.+..--.|+ ++=|+.++-. .+.+..++..+..++++-.|..+|..|+...-..+...+.
T Consensus 65 ~s~~f~~~s~~~tv~~y~fps~~~~~iL~Rftlp~r----~~~v~g~g~~iaagsdD~~vK~~~~~D~s~~~~lrgh~ap 140 (933)
T KOG1274|consen 65 YSNHFLTGSEQNTVLRYKFPSGEEDTILARFTLPIR----DLAVSGSGKMIAAGSDDTAVKLLNLDDSSQEKVLRGHDAP 140 (933)
T ss_pred cccceEEeeccceEEEeeCCCCCccceeeeeeccce----EEEEecCCcEEEeecCceeEEEEeccccchheeecccCCc
Confidence 45678889999988888666554 4455555432 3322334446665777788999999999888777665443
Q ss_pred ccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcce--e----eeeEEEEecCCEEEEEEecCCce
Q 003800 129 HSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESV--E----VQQVIQLDESDQIYVVGYAGSSQ 201 (794)
Q Consensus 129 ~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~--~----~~~~v~s~~~~~vyv~~~~g~~~ 201 (794)
. ..+.+-| . ++.+.+. .+|.|+..|..+|...=++..-.+.. . ..++.....++.+-+.+.++
T Consensus 141 V-l~l~~~p-----~-~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~--- 210 (933)
T KOG1274|consen 141 V-LQLSYDP-----K-GNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDN--- 210 (933)
T ss_pred e-eeeeEcC-----C-CCEEEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCC---
Confidence 2 1111111 1 3344444 49999999999998764443322211 1 11222234567777777676
Q ss_pred eEEEEEEcCCCceeeeeeeeccc-CccCceEE-EcCcEEEEEECCCCeEEEEEee
Q 003800 202 FHAYQINAMNGELLNHETAAFSG-GFVGDVAL-VSSDTLVTLDTTRSILVTVSFK 254 (794)
Q Consensus 202 ~~v~ald~~tG~~~w~~~v~~~~-~~s~~~~~-vg~~~lv~~d~~~g~L~v~~l~ 254 (794)
.|..++..+++.....+....+ .++- +-+ ..+.++++.+. +|.+.+-|.+
T Consensus 211 -~Vkvy~r~~we~~f~Lr~~~~ss~~~~-~~wsPnG~YiAAs~~-~g~I~vWnv~ 262 (933)
T KOG1274|consen 211 -TVKVYSRKGWELQFKLRDKLSSSKFSD-LQWSPNGKYIAASTL-DGQILVWNVD 262 (933)
T ss_pred -eEEEEccCCceeheeecccccccceEE-EEEcCCCcEEeeecc-CCcEEEEecc
Confidence 6888898888877666432211 1111 111 13345555553 4666666655
No 105
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=82.98 E-value=21 Score=39.67 Aligned_cols=71 Identities=15% Similarity=0.222 Sum_probs=49.4
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRG 126 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~ 126 (794)
.+.+..++.+|+|.--|..||+.+=+.. .++-+...... .|..+++ +..+.+||.||+.+|+++|+.....
T Consensus 144 ~NVLlsag~Dn~v~iWnv~tgeali~l~--hpd~i~S~sfn~dGs~l~T-tckDKkvRv~dpr~~~~v~e~~~he 215 (472)
T KOG0303|consen 144 PNVLLSAGSDNTVSIWNVGTGEALITLD--HPDMVYSMSFNRDGSLLCT-TCKDKKVRVIDPRRGTVVSEGVAHE 215 (472)
T ss_pred hhhHhhccCCceEEEEeccCCceeeecC--CCCeEEEEEeccCCceeee-ecccceeEEEcCCCCcEeeeccccc
Confidence 5557778889999999999999877743 44434433222 2333334 3346799999999999999985443
No 106
>PHA03098 kelch-like protein; Provisional
Probab=82.12 E-value=47 Score=39.06 Aligned_cols=135 Identities=9% Similarity=0.080 Sum_probs=69.4
Q ss_pred eeCCEEEEEEccC------CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC-------CEEEEE
Q 003800 93 ALGKYVITLSSDG------STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-------GCLHAV 159 (794)
Q Consensus 93 ~~g~~~V~Vs~~g------~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-------g~l~al 159 (794)
..++.++++||.+ ..++.||+.+++ |+.- ..... +..... ....++.+++.++ ..+..+
T Consensus 292 ~~~~~lyv~GG~~~~~~~~~~v~~yd~~~~~--W~~~-~~~~~--~R~~~~---~~~~~~~lyv~GG~~~~~~~~~v~~y 363 (534)
T PHA03098 292 VLNNVIYFIGGMNKNNLSVNSVVSYDTKTKS--WNKV-PELIY--PRKNPG---VTVFNNRIYVIGGIYNSISLNTVESW 363 (534)
T ss_pred EECCEEEEECCCcCCCCeeccEEEEeCCCCe--eeEC-CCCCc--ccccce---EEEECCEEEEEeCCCCCEecceEEEE
Confidence 4567777777642 258889988764 7532 21110 000010 1122566777542 346777
Q ss_pred ECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCC---ceeEEEEEEcCCCceeeeeeeecccCccCce-EEEcC
Q 003800 160 SSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGS---SQFHAYQINAMNGELLNHETAAFSGGFVGDV-ALVSS 235 (794)
Q Consensus 160 d~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~---~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~-~~vg~ 235 (794)
|..++ .|+.-.+.+.-...... ...++.+|++|.... ..-.+..+|+.++ .|+..-..|....+.+ ...++
T Consensus 364 d~~~~--~W~~~~~lp~~r~~~~~-~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~--~W~~~~~~p~~r~~~~~~~~~~ 438 (534)
T PHA03098 364 KPGES--KWREEPPLIFPRYNPCV-VNVNNLIYVIGGISKNDELLKTVECFSLNTN--KWSKGSPLPISHYGGCAIYHDG 438 (534)
T ss_pred cCCCC--ceeeCCCcCcCCccceE-EEECCEEEEECCcCCCCcccceEEEEeCCCC--eeeecCCCCccccCceEEEECC
Confidence 87765 58764433221000111 245889999875311 1135788998875 4877544454444433 33354
Q ss_pred cEEEE
Q 003800 236 DTLVT 240 (794)
Q Consensus 236 ~~lv~ 240 (794)
.++++
T Consensus 439 ~iyv~ 443 (534)
T PHA03098 439 KIYVI 443 (534)
T ss_pred EEEEE
Confidence 44444
No 107
>PHA02790 Kelch-like protein; Provisional
Probab=82.11 E-value=57 Score=38.01 Aligned_cols=147 Identities=10% Similarity=0.004 Sum_probs=75.6
Q ss_pred eCCEEEEEEccC-----CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC----CEEEEEECCCC
Q 003800 94 LGKYVITLSSDG-----STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK----GCLHAVSSIDG 164 (794)
Q Consensus 94 ~g~~~V~Vs~~g-----~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~----g~l~ald~~tG 164 (794)
.++.++++||.+ ..+..+|+.+++ |..-..-+.. . .... ...-++.+|+.++ ..+.++|+.++
T Consensus 270 ~~~~lyviGG~~~~~~~~~v~~Ydp~~~~--W~~~~~m~~~-r--~~~~---~v~~~~~iYviGG~~~~~sve~ydp~~n 341 (480)
T PHA02790 270 VGEVVYLIGGWMNNEIHNNAIAVNYISNN--WIPIPPMNSP-R--LYAS---GVPANNKLYVVGGLPNPTSVERWFHGDA 341 (480)
T ss_pred ECCEEEEEcCCCCCCcCCeEEEEECCCCE--EEECCCCCch-h--hcce---EEEECCEEEEECCcCCCCceEEEECCCC
Confidence 566666667632 357889998765 7654322111 0 0011 1122567777632 34677776544
Q ss_pred cEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceE-EEcCcEEEEEEC
Q 003800 165 EILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVA-LVSSDTLVTLDT 243 (794)
Q Consensus 165 ~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~-~vg~~~lv~~d~ 243 (794)
.|+.-.+.+.- ........-++.+|++|...+..-.+.++|+.+. .|+..-..+......+. .+++.++++.
T Consensus 342 --~W~~~~~l~~~-r~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~--~W~~~~~m~~~r~~~~~~~~~~~IYv~G-- 414 (480)
T PHA02790 342 --AWVNMPSLLKP-RCNPAVASINNVIYVIGGHSETDTTTEYLLPNHD--QWQFGPSTYYPHYKSCALVFGRRLFLVG-- 414 (480)
T ss_pred --eEEECCCCCCC-CcccEEEEECCEEEEecCcCCCCccEEEEeCCCC--EEEeCCCCCCccccceEEEECCEEEEEC--
Confidence 58764443321 1111113568999998754322235678898765 68874333333333233 2354555443
Q ss_pred CCCeEEEEEeecce
Q 003800 244 TRSILVTVSFKNRK 257 (794)
Q Consensus 244 ~~g~L~v~~l~sg~ 257 (794)
|...+.|.++++
T Consensus 415 --G~~e~ydp~~~~ 426 (480)
T PHA02790 415 --RNAEFYCESSNT 426 (480)
T ss_pred --CceEEecCCCCc
Confidence 334455655554
No 108
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=81.32 E-value=36 Score=39.74 Aligned_cols=176 Identities=13% Similarity=0.131 Sum_probs=92.5
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc----c
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH----S 130 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~----s 130 (794)
+|++.....|+.||++-|+ |=..++.. +.+....+..-.+++.+|+..+.|-+||+.+-...=.......+. .
T Consensus 148 ly~~gsg~evYRlNLEqGr--fL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~ksrv~~l~~~~~v~s~pg~ 225 (703)
T KOG2321|consen 148 LYLVGSGSEVYRLNLEQGR--FLNPFETDSGELNVVSINEEHGLLACGTEDGVVEFWDPRDKSRVGTLDAASSVNSHPGG 225 (703)
T ss_pred EEEeecCcceEEEEccccc--cccccccccccceeeeecCccceEEecccCceEEEecchhhhhheeeecccccCCCccc
Confidence 8888888889999999995 44444433 122223223345677778877899999998876655554433211 0
Q ss_pred CCccccccccccccCC-eEEEE-ECCEEEEEECCCCcEEEEEeccCcc-eeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800 131 KPLLLVPTNLKVDKDS-LILVS-SKGCLHAVSSIDGEILWTRDFAAES-VEVQQVIQLDESDQIYVVGYAGSSQFHAYQI 207 (794)
Q Consensus 131 ~~~~~~~~~~~~~~~~-~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~-~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al 207 (794)
...+.+. ++.-..++ .+-|. +.|.++-+|..+-+++-.-+....- +......+....++|+ +.+.. .+-..
T Consensus 226 ~~~~svT-al~F~d~gL~~aVGts~G~v~iyDLRa~~pl~~kdh~~e~pi~~l~~~~~~~q~~v~--S~Dk~---~~kiW 299 (703)
T KOG2321|consen 226 DAAPSVT-ALKFRDDGLHVAVGTSTGSVLIYDLRASKPLLVKDHGYELPIKKLDWQDTDQQNKVV--SMDKR---ILKIW 299 (703)
T ss_pred cccCcce-EEEecCCceeEEeeccCCcEEEEEcccCCceeecccCCccceeeecccccCCCceEE--ecchH---Hhhhc
Confidence 0111111 00111112 23344 4888888888877776654433211 0000011111112222 33321 34445
Q ss_pred EcCCCceeeeeeeecccCccCceEEEcCcEEEEE
Q 003800 208 NAMNGELLNHETAAFSGGFVGDVALVSSDTLVTL 241 (794)
Q Consensus 208 d~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~ 241 (794)
|..||++.-.. ....++..-|.+.+.+++..+
T Consensus 300 d~~~Gk~~asi--Ept~~lND~C~~p~sGm~f~A 331 (703)
T KOG2321|consen 300 DECTGKPMASI--EPTSDLNDFCFVPGSGMFFTA 331 (703)
T ss_pred ccccCCceeec--cccCCcCceeeecCCceEEEe
Confidence 67777766443 223456667888777765444
No 109
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=81.16 E-value=36 Score=38.16 Aligned_cols=212 Identities=10% Similarity=0.053 Sum_probs=97.1
Q ss_pred EeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceee-eeeeeCCEEEEEEccCCeEE
Q 003800 31 MDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDG-IDIALGKYVITLSSDGSTLR 109 (794)
Q Consensus 31 ~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~-l~~~~g~~~V~Vs~~g~~v~ 109 (794)
+.=.+.++|..+...|..=++++..+.+-..+-.+.--|+.||+.+=...-+.+.+... .-...|..+|+ |+.++.+.
T Consensus 259 ~kl~~tlvgh~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~pDg~~~V~-Gs~dr~i~ 337 (519)
T KOG0293|consen 259 FKLKKTLVGHSQPVSYIMWSPDDRYLLACGFDEVLSLWDVDTGDLRHLYPSGLGFSVSSCAWCPDGFRFVT-GSPDRTII 337 (519)
T ss_pred eeeeeeeecccCceEEEEECCCCCeEEecCchHheeeccCCcchhhhhcccCcCCCcceeEEccCCceeEe-cCCCCcEE
Confidence 33344455554444444434555556665556677788999998754443331112221 11234555444 66678999
Q ss_pred EEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecC
Q 003800 110 AWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDES 188 (794)
Q Consensus 110 A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~ 188 (794)
+||. ||.++=.++.-.. +.+..++.+.+ ++.++.. .+.++..++..+-.-+=......+- .+.. ...+
T Consensus 338 ~wdl-Dgn~~~~W~gvr~-----~~v~dlait~D-gk~vl~v~~d~~i~l~~~e~~~dr~lise~~~i---ts~~-iS~d 406 (519)
T KOG0293|consen 338 MWDL-DGNILGNWEGVRD-----PKVHDLAITYD-GKYVLLVTVDKKIRLYNREARVDRGLISEEQPI---TSFS-ISKD 406 (519)
T ss_pred EecC-Ccchhhccccccc-----ceeEEEEEcCC-CcEEEEEecccceeeechhhhhhhccccccCce---eEEE-EcCC
Confidence 9997 7887643332211 11111111222 3444444 4777777765432111000001110 0111 1345
Q ss_pred CEEEEEEecCCceeEEEEEEcCCCceeeeeeeecc-cCccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800 189 DQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFS-GGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 189 ~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~-~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+++..+.+... .+.-.|.+.-..+-++.-... .-+-++|+-.++..++..-+..+++++=+..+|+
T Consensus 407 ~k~~LvnL~~q---ei~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg~~~~fiaSGSED~kvyIWhr~sgk 473 (519)
T KOG0293|consen 407 GKLALVNLQDQ---EIHLWDLEENKLVRKYFGHKQGHFIIRSCFGGGNDKFIASGSEDSKVYIWHRISGK 473 (519)
T ss_pred CcEEEEEcccC---eeEEeecchhhHHHHhhcccccceEEEeccCCCCcceEEecCCCceEEEEEccCCc
Confidence 55555555432 334444443222222210000 0111244322232455555566788888877777
No 110
>PRK00178 tolB translocation protein TolB; Provisional
Probab=80.10 E-value=1.1e+02 Score=34.65 Aligned_cols=187 Identities=14% Similarity=0.148 Sum_probs=88.6
Q ss_pred CEEEEEeCCC------EEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEcc--CCeEEEEeCCCCcEeEEEec
Q 003800 54 KRVVVSTEEN------VIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSD--GSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 54 ~~Vyv~t~~g------~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l 124 (794)
..+|+.+... .|...|...+. . ++.+.....+..... ..|+.+++++.. ...|+.||..+|+..--...
T Consensus 164 ~ia~v~~~~~~~~~~~~l~~~d~~g~~-~-~~l~~~~~~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~~~~l~~~ 241 (430)
T PRK00178 164 RILYVTAERFSVNTRYTLQRSDYDGAR-A-VTLLQSREPILSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGRREQITNF 241 (430)
T ss_pred eEEEEEeeCCCCCcceEEEEECCCCCC-c-eEEecCCCceeeeeECCCCCEEEEEEcCCCCCEEEEEECCCCCEEEccCC
Confidence 3466654322 47777876443 3 222222222222111 246667776643 35799999999976432222
Q ss_pred cCccccCCccccccccccccCCeEEE-EE-C--CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800 125 RGSKHSKPLLLVPTNLKVDKDSLILV-SS-K--GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSS 200 (794)
Q Consensus 125 ~~~~~s~~~~~~~~~~~~~~~~~V~V-~~-~--g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~ 200 (794)
.+.. ..+.+. .+ ++.+++ .. + ..++.+|..+|+..--........ ....+.++..+++.+..++
T Consensus 242 ~g~~--~~~~~S-----pD-G~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~~~---~~~~spDg~~i~f~s~~~g- 309 (430)
T PRK00178 242 EGLN--GAPAWS-----PD-GSKLAFVLSKDGNPEIYVMDLASRQLSRVTNHPAIDT---EPFWGKDGRTLYFTSDRGG- 309 (430)
T ss_pred CCCc--CCeEEC-----CC-CCEEEEEEccCCCceEEEEECCCCCeEEcccCCCCcC---CeEECCCCCEEEEEECCCC-
Confidence 2111 111121 22 334443 32 3 379999999887532111111111 1111335556665553322
Q ss_pred eeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCCC--eEEEEEeecce
Q 003800 201 QFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTRS--ILVTVSFKNRK 257 (794)
Q Consensus 201 ~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~g--~L~v~~l~sg~ 257 (794)
...++.+|+.+|+... +.........+.+ ..++.+++.....+ .++..|+.+++
T Consensus 310 ~~~iy~~d~~~g~~~~---lt~~~~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~ 366 (430)
T PRK00178 310 KPQIYKVNVNGGRAER---VTFVGNYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGS 366 (430)
T ss_pred CceEEEEECCCCCEEE---eecCCCCccceEECCCCCEEEEEEccCCceEEEEEECCCCC
Confidence 2368888998887431 1111111111222 23345555543333 57788888776
No 111
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=79.53 E-value=1.1e+02 Score=34.35 Aligned_cols=191 Identities=10% Similarity=0.170 Sum_probs=106.6
Q ss_pred CCCEEEEEeCC-CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc---CCeEEEEeCCCCcEeEEEeccCc
Q 003800 52 GRKRVVVSTEE-NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD---GSTLRAWNLPDGQMVWESFLRGS 127 (794)
Q Consensus 52 ~~~~Vyv~t~~-g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~---g~~v~A~d~~tG~llWe~~l~~~ 127 (794)
..+++|+.+.+ +.+..+|.++=.+.=....... ..++.+...+..++|+.. .+.+..+|..++++.=+......
T Consensus 84 ~~~~vyv~~~~~~~v~vid~~~~~~~~~~~vG~~--P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~vG~~ 161 (381)
T COG3391 84 AGNKVYVTTGDSNTVSVIDTATNTVLGSIPVGLG--PVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTATIPVGNT 161 (381)
T ss_pred CCCeEEEecCCCCeEEEEcCcccceeeEeeeccC--CceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEEEecCCC
Confidence 46779998875 8899999554333222222211 113323334556666543 57999999999988766554331
Q ss_pred cccCCccccccccccccCCeEEEEE--CCEEEEEECCCCcEEEEEeccCcc----eeeeeEEEEecCCEEEEEEecCCce
Q 003800 128 KHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWTRDFAAES----VEVQQVIQLDESDQIYVVGYAGSSQ 201 (794)
Q Consensus 128 ~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~----~~~~~~v~s~~~~~vyv~~~~g~~~ 201 (794)
.. . +.. ..+ +..+++.. ++.+..+| .++..+|+ ..+... ..|..+....++..+|+.... ...
T Consensus 162 P~--~--~a~---~p~-g~~vyv~~~~~~~v~vi~-~~~~~v~~-~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~-~~~ 230 (381)
T COG3391 162 PT--G--VAV---DPD-GNKVYVTNSDDNTVSVID-TSGNSVVR-GSVGSLVGVGTGPAGIAVDPDGNRVYVANDG-SGS 230 (381)
T ss_pred cc--e--EEE---CCC-CCeEEEEecCCCeEEEEe-CCCcceec-cccccccccCCCCceEEECCCCCEEEEEecc-CCC
Confidence 11 1 111 122 45577774 89999999 55566665 222111 123444323466678875543 223
Q ss_pred eEEEEEEcCCCceeeee-eeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 202 FHAYQINAMNGELLNHE-TAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 202 ~~v~ald~~tG~~~w~~-~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
..+..+|..+|...+.. ..... ...+ ..+. .+..++..+...+.+.++|..+..
T Consensus 231 ~~v~~id~~~~~v~~~~~~~~~~-~~~~-v~~~p~g~~~yv~~~~~~~V~vid~~~~~ 286 (381)
T COG3391 231 NNVLKIDTATGNVTATDLPVGSG-APRG-VAVDPAGKAAYVANSQGGTVSVIDGATDR 286 (381)
T ss_pred ceEEEEeCCCceEEEeccccccC-CCCc-eeECCCCCEEEEEecCCCeEEEEeCCCCc
Confidence 47899999999999873 22221 1111 1111 223444444445778888877755
No 112
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=79.40 E-value=78 Score=33.96 Aligned_cols=70 Identities=16% Similarity=0.179 Sum_probs=41.9
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeC-CEEEEE-EccCCeEEEEeCCCCcEeEE
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALG-KYVITL-SSDGSTLRAWNLPDGQMVWE 121 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g-~~~V~V-s~~g~~v~A~d~~tG~llWe 121 (794)
++..|+.++.+..+---|...+...=++.-.+.+=+..++..-. ..-+++ ++.+.+|+.||..+=+++=.
T Consensus 116 dn~qivSGSrDkTiklwnt~g~ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~~~~l~~~ 187 (315)
T KOG0279|consen 116 DNRQIVSGSRDKTIKLWNTLGVCKYTIHEDSHREWVSCVRFSPNESNPIIVSASWDKTVKVWNLRNCQLRTT 187 (315)
T ss_pred CCceeecCCCcceeeeeeecccEEEEEecCCCcCcEEEEEEcCCCCCcEEEEccCCceEEEEccCCcchhhc
Confidence 35558888999988888877665444433322222334432222 234444 45678999999976665533
No 113
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=79.06 E-value=68 Score=38.37 Aligned_cols=172 Identities=11% Similarity=0.096 Sum_probs=94.6
Q ss_pred CCEEEEEeCCC-------EEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEccC-----CeEEEEeCCCCcE
Q 003800 53 RKRVVVSTEEN-------VIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSDG-----STLRAWNLPDGQM 118 (794)
Q Consensus 53 ~~~Vyv~t~~g-------~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~l 118 (794)
++.||+..+.+ .+...|+++++ |++.-+-.. .-.++ .+.++.+.+|||.+ ..+--||+. .-
T Consensus 332 ~~~lYv~GG~~~~~~~l~~ve~YD~~~~~--W~~~a~M~~~R~~~~v-~~l~g~iYavGG~dg~~~l~svE~YDp~--~~ 406 (571)
T KOG4441|consen 332 NGKLYVVGGYDSGSDRLSSVERYDPRTNQ--WTPVAPMNTKRSDFGV-AVLDGKLYAVGGFDGEKSLNSVECYDPV--TN 406 (571)
T ss_pred CCEEEEEccccCCCcccceEEEecCCCCc--eeccCCccCcccccee-EEECCEEEEEeccccccccccEEEecCC--CC
Confidence 77899877643 57888999998 998443221 11123 34577777777753 236667664 45
Q ss_pred eEEEeccCccccCCccccccccccccCCeEEEEEC--------CEEEEEECCCCcEEEEEeccCcce-eeeeEEEEecCC
Q 003800 119 VWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK--------GCLHAVSSIDGEILWTRDFAAESV-EVQQVIQLDESD 189 (794)
Q Consensus 119 lWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~--------g~l~ald~~tG~~~W~~~~~~~~~-~~~~~v~s~~~~ 189 (794)
.|+.-..-... .. ..+ ...-++.+|+..+ ..+.++|+.++ .|+...+.... ....+ +.-++
T Consensus 407 ~W~~va~m~~~-r~--~~g---v~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~--~W~~~~~M~~~R~~~g~--a~~~~ 476 (571)
T KOG4441|consen 407 KWTPVAPMLTR-RS--GHG---VAVLGGKLYIIGGGDGSSNCLNSVECYDPETN--TWTLIAPMNTRRSGFGV--AVLNG 476 (571)
T ss_pred cccccCCCCcc-ee--eeE---EEEECCEEEEEcCcCCCccccceEEEEcCCCC--ceeecCCcccccccceE--EEECC
Confidence 67765532211 00 111 1122567777532 46788888775 58876654432 11112 35689
Q ss_pred EEEEEEecCCc--eeEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800 190 QIYVVGYAGSS--QFHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL 241 (794)
Q Consensus 190 ~vyv~~~~g~~--~~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~ 241 (794)
.+|++|...+. --.+.++|+.+- .|..--..+...++ .+..+++.++++.
T Consensus 477 ~iYvvGG~~~~~~~~~VE~ydp~~~--~W~~v~~m~~~rs~~g~~~~~~~ly~vG 529 (571)
T KOG4441|consen 477 KIYVVGGFDGTSALSSVERYDPETN--QWTMVAPMTSPRSAVGVVVLGGKLYAVG 529 (571)
T ss_pred EEEEECCccCCCccceEEEEcCCCC--ceeEcccCccccccccEEEECCEEEEEe
Confidence 99988754321 134788998865 35553223333333 2344454444443
No 114
>PRK04043 tolB translocation protein TolB; Provisional
Probab=78.92 E-value=1.3e+02 Score=34.54 Aligned_cols=148 Identities=10% Similarity=0.059 Sum_probs=74.5
Q ss_pred cCCCE-EEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEc--cCCeEEEEeCCCCcEeEEEe
Q 003800 51 TGRKR-VVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSS--DGSTLRAWNLPDGQMVWESF 123 (794)
Q Consensus 51 ~~~~~-Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~--~g~~v~A~d~~tG~llWe~~ 123 (794)
+++++ +|+.+. ...|+.+|..+|+.. +....++....... ..|+.+++... ....++.+|..+|.. +.-
T Consensus 197 pDG~~~i~y~s~~~~~~~Iyv~dl~tg~~~--~lt~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~--~~L 272 (419)
T PRK04043 197 NKEQTAFYYTSYGERKPTLYKYNLYTGKKE--KIASSQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKTL--TQI 272 (419)
T ss_pred CCCCcEEEEEEccCCCCEEEEEECCCCcEE--EEecCCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCcE--EEc
Confidence 34443 665443 357999999998652 22222221111111 24555665533 235799999988863 222
Q ss_pred ccCccccCCccccccccccccCCeEEEEEC----CEEEEEECCCCcE-EEEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800 124 LRGSKHSKPLLLVPTNLKVDKDSLILVSSK----GCLHAVSSIDGEI-LWTRDFAAESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 124 l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~----g~l~ald~~tG~~-~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
...+.....+... ++ ++.+++.++ ..|+.+|..+|+. +-++.. . ..+ .+ +.++..+.+.+...
T Consensus 273 T~~~~~d~~p~~S-----PD-G~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g-~--~~~-~~--SPDG~~Ia~~~~~~ 340 (419)
T PRK04043 273 TNYPGIDVNGNFV-----ED-DKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHG-K--NNS-SV--STYKNYIVYSSRET 340 (419)
T ss_pred ccCCCccCccEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCeEeCccCC-C--cCc-eE--CCCCCEEEEEEcCC
Confidence 2222110112222 23 344555442 3899999999886 333221 1 111 12 33455554444332
Q ss_pred C-----ceeEEEEEEcCCCce
Q 003800 199 S-----SQFHAYQINAMNGEL 214 (794)
Q Consensus 199 ~-----~~~~v~ald~~tG~~ 214 (794)
. ....++.+|+.+|+.
T Consensus 341 ~~~~~~~~~~I~v~d~~~g~~ 361 (419)
T PRK04043 341 NNEFGKNTFNLYLISTNSDYI 361 (419)
T ss_pred CcccCCCCcEEEEEECCCCCe
Confidence 1 124788889998874
No 115
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=78.45 E-value=1.3e+02 Score=34.51 Aligned_cols=190 Identities=15% Similarity=0.160 Sum_probs=96.0
Q ss_pred eccCCCEEEEEeC---CCEEEEEECcCCccceEEE-cCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800 49 QKTGRKRVVVSTE---ENVIASLDLRHGEIFWRHV-LGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 49 ~~~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~-l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l 124 (794)
|...++|||+.|+ -|.|++.|. +|+-+=||. +..-- ...+ -+.|+.+|+ +. +|.++.+|+++-++. ...+
T Consensus 231 PmIV~~RvYFlsD~eG~GnlYSvdl-dGkDlrrHTnFtdYY-~R~~-nsDGkrIvF-q~-~GdIylydP~td~le-kldI 304 (668)
T COG4946 231 PMIVGERVYFLSDHEGVGNLYSVDL-DGKDLRRHTNFTDYY-PRNA-NSDGKRIVF-QN-AGDIYLYDPETDSLE-KLDI 304 (668)
T ss_pred ceEEcceEEEEecccCccceEEecc-CCchhhhcCCchhcc-cccc-CCCCcEEEE-ec-CCcEEEeCCCcCcce-eeec
Confidence 5556889999997 478999997 576665553 22110 0011 134566666 54 457999999887653 2222
Q ss_pred c--Cc---c---ccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCcc-eeeeeEEEEecCCEEEEEE
Q 003800 125 R--GS---K---HSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAES-VEVQQVIQLDESDQIYVVG 195 (794)
Q Consensus 125 ~--~~---~---~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~-~~~~~~v~s~~~~~vyv~~ 195 (794)
. -. . ...+...+. ..+...++.+...+.|..+-.+.-.|-.+ +.+.+. ....+. ...+..+.+..
T Consensus 305 ~lpl~rk~k~~k~~~pskyle-dfa~~~Gd~ia~VSRGkaFi~~~~~~~~i---qv~~~~~VrY~r~--~~~~e~~vigt 378 (668)
T COG4946 305 GLPLDRKKKQPKFVNPSKYLE-DFAVVNGDYIALVSRGKAFIMRPWDGYSI---QVGKKGGVRYRRI--QVDPEGDVIGT 378 (668)
T ss_pred CCccccccccccccCHHHhhh-hhccCCCcEEEEEecCcEEEECCCCCeeE---EcCCCCceEEEEE--ccCCcceEEec
Confidence 2 11 0 001111111 01222233333347777777776555322 222221 112222 23444444444
Q ss_pred ecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 196 YAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 196 ~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
.+|. .+..+|..+|+...-. .+-+.-..+-+- .+..+++. .++..|.++|+.+|.
T Consensus 379 ~dgD---~l~iyd~~~~e~kr~e---~~lg~I~av~vs~dGK~~vva-Ndr~el~vididngn 434 (668)
T COG4946 379 NDGD---KLGIYDKDGGEVKRIE---KDLGNIEAVKVSPDGKKVVVA-NDRFELWVIDIDNGN 434 (668)
T ss_pred cCCc---eEEEEecCCceEEEee---CCccceEEEEEcCCCcEEEEE-cCceEEEEEEecCCC
Confidence 4554 6788899899854222 111111112211 22334444 357889999999988
No 116
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=78.31 E-value=41 Score=38.53 Aligned_cols=93 Identities=17% Similarity=0.259 Sum_probs=58.6
Q ss_pred EeeEEeccCcee-eeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccc-eeeeee---eeCC--EEEEEEc
Q 003800 31 MDWHQQYIGKVK-HAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDV-VDGIDI---ALGK--YVITLSS 103 (794)
Q Consensus 31 ~dW~~~~vG~~~-~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~-i~~l~~---~~g~--~~V~Vs~ 103 (794)
.||...+ |.+. .-...+-......|+|..+.+ |++|+. +|++.|..+|+.... ...++. ..++ ..+.|++
T Consensus 231 ~dWs~nl-GE~~l~i~v~~~~~~~~~IvvLger~-Lf~l~~-~G~l~~~krLd~~p~~~~~Y~~~~~~~~~~~~~llV~t 307 (418)
T PF14727_consen 231 PDWSFNL-GEQALDIQVVRFSSSESDIVVLGERS-LFCLKD-NGSLRFQKRLDYNPSCFCPYRVPWYNEPSTRLNLLVGT 307 (418)
T ss_pred ceeEEEC-CceeEEEEEEEcCCCCceEEEEecce-EEEEcC-CCeEEEEEecCCceeeEEEEEeecccCCCCceEEEEEe
Confidence 8999865 7654 211111111244577777665 899996 799999999976521 111111 1111 2356677
Q ss_pred cCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 104 DGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 104 ~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
..+++.-| .+.+++|...+....
T Consensus 308 ~t~~LlVy--~d~~L~WsA~l~~~P 330 (418)
T PF14727_consen 308 HTGTLLVY--EDTTLVWSAQLPHVP 330 (418)
T ss_pred cCCeEEEE--eCCeEEEecCCCCCC
Confidence 66799999 489999999986543
No 117
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=77.57 E-value=43 Score=35.11 Aligned_cols=115 Identities=16% Similarity=0.299 Sum_probs=66.1
Q ss_pred CCeEEEEeCCCC-cEeEEEeccCccccCCccccccccccccCCeEEE-E-E-CCEEEEEE--CCCCcE-----EEEEecc
Q 003800 105 GSTLRAWNLPDG-QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-S-S-KGCLHAVS--SIDGEI-----LWTRDFA 173 (794)
Q Consensus 105 g~~v~A~d~~tG-~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~-~-~g~l~ald--~~tG~~-----~W~~~~~ 173 (794)
++.+|.|-+..- +++|..-.-+..+ +++.+...+. . + +-++-|+| ..+|.. +...+..
T Consensus 138 ~g~Ly~~~~~h~v~~i~~~v~IsNgl-----------~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~ 206 (310)
T KOG4499|consen 138 GGELYSWLAGHQVELIWNCVGISNGL-----------AWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKS 206 (310)
T ss_pred ccEEEEeccCCCceeeehhccCCccc-----------cccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccC
Confidence 456777765322 3445443322211 3343444433 3 2 56775555 777753 3333221
Q ss_pred C--cceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCc
Q 003800 174 A--ESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSD 236 (794)
Q Consensus 174 ~--~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~ 236 (794)
. ....|..+. .-..|.+|+..+.|+ +|.-+|+.||+++-+..+..+. + .+|-++|.|
T Consensus 207 ~~~e~~~PDGm~-ID~eG~L~Va~~ng~---~V~~~dp~tGK~L~eiklPt~q-i-tsccFgGkn 265 (310)
T KOG4499|consen 207 QPFESLEPDGMT-IDTEGNLYVATFNGG---TVQKVDPTTGKILLEIKLPTPQ-I-TSCCFGGKN 265 (310)
T ss_pred CCcCCCCCCcce-EccCCcEEEEEecCc---EEEEECCCCCcEEEEEEcCCCc-e-EEEEecCCC
Confidence 1 112233332 236889999999987 9999999999999999776542 2 356666664
No 118
>PLN02193 nitrile-specifier protein
Probab=77.57 E-value=1.5e+02 Score=34.52 Aligned_cols=198 Identities=10% Similarity=0.129 Sum_probs=99.3
Q ss_pred CCEEEEEeCC--------CEEEEEECcCCccceEEEcCccc--ce--eeee-eeeCCEEEEEEccC-----CeEEEEeCC
Q 003800 53 RKRVVVSTEE--------NVIASLDLRHGEIFWRHVLGIND--VV--DGID-IALGKYVITLSSDG-----STLRAWNLP 114 (794)
Q Consensus 53 ~~~Vyv~t~~--------g~l~ALn~~tG~ivWR~~l~~~~--~i--~~l~-~~~g~~~V~Vs~~g-----~~v~A~d~~ 114 (794)
++.||+.... +.+..+|+++. .|+..-.... .. .+.. +..++.+++++|.+ +.++.||+.
T Consensus 175 ~~~iyv~GG~~~~~~~~~~~v~~yD~~~~--~W~~~~~~g~~P~~~~~~~~~v~~~~~lYvfGG~~~~~~~ndv~~yD~~ 252 (470)
T PLN02193 175 GNKIYSFGGEFTPNQPIDKHLYVFDLETR--TWSISPATGDVPHLSCLGVRMVSIGSTLYVFGGRDASRQYNGFYSFDTT 252 (470)
T ss_pred CCEEEEECCcCCCCCCeeCcEEEEECCCC--EEEeCCCCCCCCCCcccceEEEEECCEEEEECCCCCCCCCccEEEEECC
Confidence 6778886552 35889999885 5986322110 00 0111 23566666667642 468999998
Q ss_pred CCcEeEEEeccCccccCCccccccccccccCCeEEEEEC-------CEEEEEECCCCcEEEEEeccCcce-eee-eEEEE
Q 003800 115 DGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-------GCLHAVSSIDGEILWTRDFAAESV-EVQ-QVIQL 185 (794)
Q Consensus 115 tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-------g~l~ald~~tG~~~W~~~~~~~~~-~~~-~~v~s 185 (794)
+. .|+.-...... +.+......... ++.++|+.+ ..+.++|..+. .|+.-.+.... .++ .....
T Consensus 253 t~--~W~~l~~~~~~--P~~R~~h~~~~~-~~~iYv~GG~~~~~~~~~~~~yd~~t~--~W~~~~~~~~~~~~R~~~~~~ 325 (470)
T PLN02193 253 TN--EWKLLTPVEEG--PTPRSFHSMAAD-EENVYVFGGVSATARLKTLDSYNIVDK--KWFHCSTPGDSFSIRGGAGLE 325 (470)
T ss_pred CC--EEEEcCcCCCC--CCCccceEEEEE-CCEEEEECCCCCCCCcceEEEEECCCC--EEEeCCCCCCCCCCCCCcEEE
Confidence 64 58764321100 111111111122 566777531 34778888764 58753321100 000 00002
Q ss_pred ecCCEEEEEEec-CCceeEEEEEEcCCCceeeeeeeec---ccCccC-ceEEEcCcEEEEEECC-------------CCe
Q 003800 186 DESDQIYVVGYA-GSSQFHAYQINAMNGELLNHETAAF---SGGFVG-DVALVSSDTLVTLDTT-------------RSI 247 (794)
Q Consensus 186 ~~~~~vyv~~~~-g~~~~~v~ald~~tG~~~w~~~v~~---~~~~s~-~~~~vg~~~lv~~d~~-------------~g~ 247 (794)
.-++.+|+++.. |...-.+.++|+.+.+ |+..-.. |..... .+..+++.+++..-.. .+.
T Consensus 326 ~~~gkiyviGG~~g~~~~dv~~yD~~t~~--W~~~~~~g~~P~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~nd 403 (470)
T PLN02193 326 VVQGKVWVVYGFNGCEVDDVHYYDPVQDK--WTQVETFGVRPSERSVFASAAVGKHIVIFGGEIAMDPLAHVGPGQLTDG 403 (470)
T ss_pred EECCcEEEEECCCCCccCceEEEECCCCE--EEEeccCCCCCCCcceeEEEEECCEEEEECCccCCccccccCccceecc
Confidence 346788877643 2112368899998764 8764221 322222 3334455555443211 124
Q ss_pred EEEEEeecceeeeEEE
Q 003800 248 LVTVSFKNRKIAFQET 263 (794)
Q Consensus 248 L~v~~l~sg~~~~~~~ 263 (794)
++++|+.+.+ ...+
T Consensus 404 v~~~D~~t~~--W~~~ 417 (470)
T PLN02193 404 TFALDTETLQ--WERL 417 (470)
T ss_pred EEEEEcCcCE--EEEc
Confidence 7788887776 5443
No 119
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=77.56 E-value=54 Score=37.08 Aligned_cols=94 Identities=17% Similarity=0.265 Sum_probs=62.6
Q ss_pred cEeeEEeccCceeee-----------eeeeeccCCCEEEEEeCCCEEEEEECc---CCccceEEEcCcccceeeeeeeeC
Q 003800 30 LMDWHQQYIGKVKHA-----------VFHTQKTGRKRVVVSTEENVIASLDLR---HGEIFWRHVLGINDVVDGIDIALG 95 (794)
Q Consensus 30 ~~dW~~~~vG~~~~~-----------~f~~~~~~~~~Vyv~t~~g~l~ALn~~---tG~ivWR~~l~~~~~i~~l~~~~g 95 (794)
.+.|--.. |+|+.. .++ | ...-.++.++.++.|+-.|-| .-...|+..-+-.. + .. -...
T Consensus 268 V~lWD~~~-g~p~~s~~~~~k~Vq~l~wh-~-~~p~~LLsGs~D~~V~l~D~R~~~~s~~~wk~~g~VEk-v-~w-~~~s 341 (463)
T KOG0270|consen 268 VKLWDVDT-GKPKSSITHHGKKVQTLEWH-P-YEPSVLLSGSYDGTVALKDCRDPSNSGKEWKFDGEVEK-V-AW-DPHS 341 (463)
T ss_pred EEEEEcCC-CCcceehhhcCCceeEEEec-C-CCceEEEeccccceEEeeeccCccccCceEEeccceEE-E-Ee-cCCC
Confidence 37788765 666532 222 1 113347778889999999888 55677886554331 1 11 0234
Q ss_pred CEEEEEEccCCeEEEEeCC-CCcEeEEEeccCccc
Q 003800 96 KYVITLSSDGSTLRAWNLP-DGQMVWESFLRGSKH 129 (794)
Q Consensus 96 ~~~V~Vs~~g~~v~A~d~~-tG~llWe~~l~~~~~ 129 (794)
....++|.++|.||.+|+. .|+++|+...+....
T Consensus 342 e~~f~~~tddG~v~~~D~R~~~~~vwt~~AHd~~I 376 (463)
T KOG0270|consen 342 ENSFFVSTDDGTVYYFDIRNPGKPVWTLKAHDDEI 376 (463)
T ss_pred ceeEEEecCCceEEeeecCCCCCceeEEEeccCCc
Confidence 5667778888899999987 679999999887654
No 120
>PLN02153 epithiospecifier protein
Probab=77.55 E-value=1.2e+02 Score=33.42 Aligned_cols=196 Identities=12% Similarity=0.099 Sum_probs=97.9
Q ss_pred CCEEEEEeCC--------CEEEEEECcCCccceEEEcCccc--ce--eeee-eeeCCEEEEEEccC-----CeEEEEeCC
Q 003800 53 RKRVVVSTEE--------NVIASLDLRHGEIFWRHVLGIND--VV--DGID-IALGKYVITLSSDG-----STLRAWNLP 114 (794)
Q Consensus 53 ~~~Vyv~t~~--------g~l~ALn~~tG~ivWR~~l~~~~--~i--~~l~-~~~g~~~V~Vs~~g-----~~v~A~d~~ 114 (794)
++.||+.... +.+..+|+.+. .|+..-.... .. .+.. +..++.++++||.. ..+..||+.
T Consensus 32 ~~~iyv~GG~~~~~~~~~~~~~~yd~~~~--~W~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~ 109 (341)
T PLN02153 32 GDKLYSFGGELKPNEHIDKDLYVFDFNTH--TWSIAPANGDVPRISCLGVRMVAVGTKLYIFGGRDEKREFSDFYSYDTV 109 (341)
T ss_pred CCEEEEECCccCCCCceeCcEEEEECCCC--EEEEcCccCCCCCCccCceEEEEECCEEEEECCCCCCCccCcEEEEECC
Confidence 6788886542 46899999886 5986542211 11 0111 34577777777631 358889987
Q ss_pred CCcEeEEEeccCccccCCccccccccccccCCeEEEEEC-------------CEEEEEECCCCcEEEEEeccCcce-eee
Q 003800 115 DGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-------------GCLHAVSSIDGEILWTRDFAAESV-EVQ 180 (794)
Q Consensus 115 tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-------------g~l~ald~~tG~~~W~~~~~~~~~-~~~ 180 (794)
+ ..|+.--.-.....+.+..... ....++.++|+.+ ..+.++|.++. .|+.-.+.... .++
T Consensus 110 t--~~W~~~~~~~~~~~p~~R~~~~-~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~~yd~~~~--~W~~l~~~~~~~~~r 184 (341)
T PLN02153 110 K--NEWTFLTKLDEEGGPEARTFHS-MASDENHVYVFGGVSKGGLMKTPERFRTIEAYNIADG--KWVQLPDPGENFEKR 184 (341)
T ss_pred C--CEEEEeccCCCCCCCCCceeeE-EEEECCEEEEECCccCCCccCCCcccceEEEEECCCC--eEeeCCCCCCCCCCC
Confidence 5 4587532110000010111100 1222566777631 14778888765 58853322110 000
Q ss_pred -eEEEEecCCEEEEEEec------CCc----eeEEEEEEcCCCceeeeeeee---cccCccC-ceEEEcCcEEEEEECC-
Q 003800 181 -QVIQLDESDQIYVVGYA------GSS----QFHAYQINAMNGELLNHETAA---FSGGFVG-DVALVSSDTLVTLDTT- 244 (794)
Q Consensus 181 -~~v~s~~~~~vyv~~~~------g~~----~~~v~ald~~tG~~~w~~~v~---~~~~~s~-~~~~vg~~~lv~~d~~- 244 (794)
......-++.+|+++.. |+. .-.+.++|+.+. .|+..-. .|..... .++++++.++++.-..
T Consensus 185 ~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~--~W~~~~~~g~~P~~r~~~~~~~~~~~iyv~GG~~~ 262 (341)
T PLN02153 185 GGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASG--KWTEVETTGAKPSARSVFAHAVVGKYIIIFGGEVW 262 (341)
T ss_pred CcceEEEECCeEEEEeccccccccCCccceecCceEEEEcCCC--cEEeccccCCCCCCcceeeeEEECCEEEEECcccC
Confidence 01112457888886532 110 125788998764 4776321 2333222 3444465555554210
Q ss_pred ------------CCeEEEEEeecce
Q 003800 245 ------------RSILVTVSFKNRK 257 (794)
Q Consensus 245 ------------~g~L~v~~l~sg~ 257 (794)
...+++.|+.+.+
T Consensus 263 ~~~~~~~~~~~~~n~v~~~d~~~~~ 287 (341)
T PLN02153 263 PDLKGHLGPGTLSNEGYALDTETLV 287 (341)
T ss_pred CccccccccccccccEEEEEcCccE
Confidence 1257777877665
No 121
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=77.03 E-value=79 Score=34.26 Aligned_cols=109 Identities=11% Similarity=0.048 Sum_probs=64.3
Q ss_pred ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCC
Q 003800 86 VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDG 164 (794)
Q Consensus 86 ~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG 164 (794)
.|..+.....++.+.++..++.+|.+|...-.++=+.....+.+ +..+.+ ...+++. -+|.|..+|..+|
T Consensus 15 ~IS~v~f~~~~~~LLvssWDgslrlYdv~~~~l~~~~~~~~plL--~c~F~d-------~~~~~~G~~dg~vr~~Dln~~ 85 (323)
T KOG1036|consen 15 GISSVKFSPSSSDLLVSSWDGSLRLYDVPANSLKLKFKHGAPLL--DCAFAD-------ESTIVTGGLDGQVRRYDLNTG 85 (323)
T ss_pred ceeeEEEcCcCCcEEEEeccCcEEEEeccchhhhhheecCCcee--eeeccC-------CceEEEeccCceEEEEEecCC
Confidence 44444333333445557788899999998777776666665544 112221 3456666 4999999999998
Q ss_pred cEEEEEeccCcceeeeeEE-EEecCCEEEEEEecCCceeEEEEEEcCC
Q 003800 165 EILWTRDFAAESVEVQQVI-QLDESDQIYVVGYAGSSQFHAYQINAMN 211 (794)
Q Consensus 165 ~~~W~~~~~~~~~~~~~~v-~s~~~~~vyv~~~~g~~~~~v~ald~~t 211 (794)
+..=--....+. +++ ..-..+.+...+.++ .+-.+|+.+
T Consensus 86 ~~~~igth~~~i----~ci~~~~~~~~vIsgsWD~----~ik~wD~R~ 125 (323)
T KOG1036|consen 86 NEDQIGTHDEGI----RCIEYSYEVGCVISGSWDK----TIKFWDPRN 125 (323)
T ss_pred cceeeccCCCce----EEEEeeccCCeEEEcccCc----cEEEEeccc
Confidence 754333333222 232 122356666555554 677778765
No 122
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=76.93 E-value=1.3e+02 Score=33.77 Aligned_cols=157 Identities=15% Similarity=0.188 Sum_probs=91.8
Q ss_pred cCCCEEEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEc-cCCeEEEEeCCCCcEeEEEeccC
Q 003800 51 TGRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSS-DGSTLRAWNLPDGQMVWESFLRG 126 (794)
Q Consensus 51 ~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~tG~llWe~~l~~ 126 (794)
.+++.+||+.. .+.+..+|+.++++.=....+.. . .+..+...+..+++.. ..+.+..+|. ++..+|+ ....
T Consensus 125 ~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~vG~~-P-~~~a~~p~g~~vyv~~~~~~~v~vi~~-~~~~v~~-~~~~ 200 (381)
T COG3391 125 PDGKYVYVANAGNGNNTVSVIDAATNKVTATIPVGNT-P-TGVAVDPDGNKVYVTNSDDNTVSVIDT-SGNSVVR-GSVG 200 (381)
T ss_pred CCCCEEEEEecccCCceEEEEeCCCCeEEEEEecCCC-c-ceEEECCCCCeEEEEecCCCeEEEEeC-CCcceec-cccc
Confidence 34778999988 68999999999987655433322 1 2222233444455543 4579999994 6777776 3211
Q ss_pred ccccCCcccccccccccc-CCeEEEEE--C--CEEEEEECCCCcEEEE-EeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800 127 SKHSKPLLLVPTNLKVDK-DSLILVSS--K--GCLHAVSSIDGEILWT-RDFAAESVEVQQVIQLDESDQIYVVGYAGSS 200 (794)
Q Consensus 127 ~~~s~~~~~~~~~~~~~~-~~~V~V~~--~--g~l~ald~~tG~~~W~-~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~ 200 (794)
... ...-.+....++. +..++|.. + +.+..+|..+|.+.|. ...... .+..+.....+..+|+....++
T Consensus 201 ~~~--~~~~~P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~~~~v~~~~~~~~~~--~~~~v~~~p~g~~~yv~~~~~~- 275 (381)
T COG3391 201 SLV--GVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKIDTATGNVTATDLPVGSG--APRGVAVDPAGKAAYVANSQGG- 275 (381)
T ss_pred ccc--ccCCCCceEEECCCCCEEEEEeccCCCceEEEEeCCCceEEEeccccccC--CCCceeECCCCCEEEEEecCCC-
Confidence 111 0000000001222 34577753 3 6999999999999887 333332 1222222345667777654433
Q ss_pred eeEEEEEEcCCCceeeee
Q 003800 201 QFHAYQINAMNGELLNHE 218 (794)
Q Consensus 201 ~~~v~ald~~tG~~~w~~ 218 (794)
.+..+|..+.+.....
T Consensus 276 --~V~vid~~~~~v~~~~ 291 (381)
T COG3391 276 --TVSVIDGATDRVVKTG 291 (381)
T ss_pred --eEEEEeCCCCceeeee
Confidence 7888998888777655
No 123
>PRK04792 tolB translocation protein TolB; Provisional
Probab=76.93 E-value=1.5e+02 Score=34.23 Aligned_cols=150 Identities=9% Similarity=0.071 Sum_probs=71.9
Q ss_pred eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEE-EE-CC--EEEEEECCCCcEE
Q 003800 94 LGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-SS-KG--CLHAVSSIDGEIL 167 (794)
Q Consensus 94 ~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~-~g--~l~ald~~tG~~~ 167 (794)
.|+.+++++.. ...|+.+|..+|+..--....+.. ..+.+. .+ ++.+++ .. +| .|+.+|..+|+..
T Consensus 228 DG~~La~~s~~~g~~~L~~~dl~tg~~~~lt~~~g~~--~~~~wS-----PD-G~~La~~~~~~g~~~Iy~~dl~tg~~~ 299 (448)
T PRK04792 228 DGRKLAYVSFENRKAEIFVQDIYTQVREKVTSFPGIN--GAPRFS-----PD-GKKLALVLSKDGQPEIYVVDIATKALT 299 (448)
T ss_pred CCCEEEEEEecCCCcEEEEEECCCCCeEEecCCCCCc--CCeeEC-----CC-CCEEEEEEeCCCCeEEEEEECCCCCeE
Confidence 46667776532 247999999999764322222211 111121 23 334444 33 44 5999999888642
Q ss_pred EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCC-
Q 003800 168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRS- 246 (794)
Q Consensus 168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g- 246 (794)
+..........+..+.++..+++.+..++ ...++.+|+.+|+...-. ..... .........++.+++.....+
T Consensus 300 ---~lt~~~~~~~~p~wSpDG~~I~f~s~~~g-~~~Iy~~dl~~g~~~~Lt-~~g~~-~~~~~~SpDG~~l~~~~~~~g~ 373 (448)
T PRK04792 300 ---RITRHRAIDTEPSWHPDGKSLIFTSERGG-KPQIYRVNLASGKVSRLT-FEGEQ-NLGGSITPDGRSMIMVNRTNGK 373 (448)
T ss_pred ---ECccCCCCccceEECCCCCEEEEEECCCC-CceEEEEECCCCCEEEEe-cCCCC-CcCeeECCCCCEEEEEEecCCc
Confidence 11111100111211334555655443322 247889999988753211 11111 111122223345555443333
Q ss_pred -eEEEEEeecce
Q 003800 247 -ILVTVSFKNRK 257 (794)
Q Consensus 247 -~L~v~~l~sg~ 257 (794)
.++.+++.++.
T Consensus 374 ~~I~~~dl~~g~ 385 (448)
T PRK04792 374 FNIARQDLETGA 385 (448)
T ss_pred eEEEEEECCCCC
Confidence 56777887776
No 124
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=76.37 E-value=1.4e+02 Score=33.54 Aligned_cols=36 Identities=11% Similarity=0.025 Sum_probs=20.4
Q ss_pred EEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800 204 AYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL 241 (794)
Q Consensus 204 v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~ 241 (794)
+.++|+.++ .|+..-..|..... .++.+++.++++.
T Consensus 314 ~e~yd~~~~--~W~~~~~lp~~r~~~~av~~~~~iyv~G 350 (376)
T PRK14131 314 DEIYALVNG--KWQKVGELPQGLAYGVSVSWNNGVLLIG 350 (376)
T ss_pred hheEEecCC--cccccCcCCCCccceEEEEeCCEEEEEc
Confidence 456888875 48765445544433 2333466666655
No 125
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=75.66 E-value=1.4e+02 Score=33.41 Aligned_cols=149 Identities=13% Similarity=0.014 Sum_probs=72.2
Q ss_pred cCCCEEEEEeCC---CEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEcc--CCeEEEEeCCCCcEeEEEec
Q 003800 51 TGRKRVVVSTEE---NVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSD--GSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 51 ~~~~~Vyv~t~~---g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l 124 (794)
++++.|++.+.. ..|+.+|.++|+..--...... ...... ..++.+++.... ...++.||..+|... .+
T Consensus 199 pdg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~~~~--~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~~~~~~---~l 273 (417)
T TIGR02800 199 PDGQKLAYVSFESGKPEIYVQDLATGQREKVASFPGM--NGAPAFSPDGSKLAVSLSKDGNPDIYVMDLDGKQLT---RL 273 (417)
T ss_pred CCCCEEEEEEcCCCCcEEEEEECCCCCEEEeecCCCC--ccceEECCCCCEEEEEECCCCCccEEEEECCCCCEE---EC
Confidence 445556555532 5799999999975432222211 111111 234455554332 246999999888642 22
Q ss_pred cC-ccccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCC
Q 003800 125 RG-SKHSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGS 199 (794)
Q Consensus 125 ~~-~~~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~ 199 (794)
.. ......+.+. .+ ++.+++.+ ...++.+|..+|+..--.. .... ...+..+..+..+++.+.. +
T Consensus 274 ~~~~~~~~~~~~s-----~d-g~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~-~~~~--~~~~~~spdg~~i~~~~~~-~ 343 (417)
T TIGR02800 274 TNGPGIDTEPSWS-----PD-GKSIAFTSDRGGSPQIYMMDADGGEVRRLTF-RGGY--NASPSWSPDGDLIAFVHRE-G 343 (417)
T ss_pred CCCCCCCCCEEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCEEEeec-CCCC--ccCeEECCCCCEEEEEEcc-C
Confidence 11 1110111111 12 33444433 2379999988887432111 1111 1112112344455544333 2
Q ss_pred ceeEEEEEEcCCCce
Q 003800 200 SQFHAYQINAMNGEL 214 (794)
Q Consensus 200 ~~~~v~ald~~tG~~ 214 (794)
...+++.+|+.+|..
T Consensus 344 ~~~~i~~~d~~~~~~ 358 (417)
T TIGR02800 344 GGFNIAVMDLDGGGE 358 (417)
T ss_pred CceEEEEEeCCCCCe
Confidence 345788899988754
No 126
>PLN02193 nitrile-specifier protein
Probab=75.62 E-value=1.3e+02 Score=34.90 Aligned_cols=152 Identities=14% Similarity=0.103 Sum_probs=79.8
Q ss_pred CCEEEEEeCC------CEEEEEECcCCccceEEEcCcc---ccee-eeeeeeCCEEEEEEccC-----CeEEEEeCCCCc
Q 003800 53 RKRVVVSTEE------NVIASLDLRHGEIFWRHVLGIN---DVVD-GIDIALGKYVITLSSDG-----STLRAWNLPDGQ 117 (794)
Q Consensus 53 ~~~Vyv~t~~------g~l~ALn~~tG~ivWR~~l~~~---~~i~-~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~ 117 (794)
++.||+.... +.+.++|+++. .|++..+.. ..-. ......++.+++++|.+ ..+..||+.+.
T Consensus 228 ~~~lYvfGG~~~~~~~ndv~~yD~~t~--~W~~l~~~~~~P~~R~~h~~~~~~~~iYv~GG~~~~~~~~~~~~yd~~t~- 304 (470)
T PLN02193 228 GSTLYVFGGRDASRQYNGFYSFDTTTN--EWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSATARLKTLDSYNIVDK- 304 (470)
T ss_pred CCEEEEECCCCCCCCCccEEEEECCCC--EEEEcCcCCCCCCCccceEEEEECCEEEEECCCCCCCCcceEEEEECCCC-
Confidence 6778887652 57999999986 699854321 1111 11123455666666642 34788998865
Q ss_pred EeEEEeccCccccCCccccccccccccCCeEEEEE--C----CEEEEEECCCCcEEEEEeccC-----cceeeeeEEEEe
Q 003800 118 MVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--K----GCLHAVSSIDGEILWTRDFAA-----ESVEVQQVIQLD 186 (794)
Q Consensus 118 llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~----g~l~ald~~tG~~~W~~~~~~-----~~~~~~~~v~s~ 186 (794)
.|+.--..... ..+-........ ++.+++.. + ..++.+|..+. .|+.-.+. +.. ...+ +.
T Consensus 305 -~W~~~~~~~~~--~~~R~~~~~~~~-~gkiyviGG~~g~~~~dv~~yD~~t~--~W~~~~~~g~~P~~R~-~~~~--~~ 375 (470)
T PLN02193 305 -KWFHCSTPGDS--FSIRGGAGLEVV-QGKVWVVYGFNGCEVDDVHYYDPVQD--KWTQVETFGVRPSERS-VFAS--AA 375 (470)
T ss_pred -EEEeCCCCCCC--CCCCCCcEEEEE-CCcEEEEECCCCCccCceEEEECCCC--EEEEeccCCCCCCCcc-eeEE--EE
Confidence 58754221110 000000000112 45566653 2 56889998875 48765432 111 1112 24
Q ss_pred cCCEEEEEEecCC-----------ceeEEEEEEcCCCceeeee
Q 003800 187 ESDQIYVVGYAGS-----------SQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 187 ~~~~vyv~~~~g~-----------~~~~v~ald~~tG~~~w~~ 218 (794)
-++.+|+.+-... ..-.+.+||+.|. .|+.
T Consensus 376 ~~~~iyv~GG~~~~~~~~~~~~~~~~ndv~~~D~~t~--~W~~ 416 (470)
T PLN02193 376 VGKHIVIFGGEIAMDPLAHVGPGQLTDGTFALDTETL--QWER 416 (470)
T ss_pred ECCEEEEECCccCCccccccCccceeccEEEEEcCcC--EEEE
Confidence 5778888764210 0013678887755 4664
No 127
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=74.38 E-value=32 Score=39.18 Aligned_cols=73 Identities=12% Similarity=0.216 Sum_probs=54.6
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEecc
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLR 125 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~ 125 (794)
.+-+...++-+..|--.|.+||+..=|..+.......-++ +.+..++++|+.+++++.||..+|+++=++.-.
T Consensus 269 ~g~~fLS~sfD~~lKlwDtETG~~~~~f~~~~~~~cvkf~-pd~~n~fl~G~sd~ki~~wDiRs~kvvqeYd~h 341 (503)
T KOG0282|consen 269 CGTSFLSASFDRFLKLWDTETGQVLSRFHLDKVPTCVKFH-PDNQNIFLVGGSDKKIRQWDIRSGKVVQEYDRH 341 (503)
T ss_pred cCCeeeeeecceeeeeeccccceEEEEEecCCCceeeecC-CCCCcEEEEecCCCcEEEEeccchHHHHHHHhh
Confidence 3556888888999999999999999998886552211222 234577787887889999999999977665543
No 128
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=74.00 E-value=74 Score=39.15 Aligned_cols=119 Identities=12% Similarity=0.086 Sum_probs=74.1
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcccc-
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS- 130 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s- 130 (794)
++..+.+++++-.|-.+|..|+...=... +....+.++.....+..+.++..+|.|+-||..+|.+.-....-.....
T Consensus 107 ~g~~iaagsdD~~vK~~~~~D~s~~~~lr-gh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~ 185 (933)
T KOG1274|consen 107 SGKMIAAGSDDTAVKLLNLDDSSQEKVLR-GHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEF 185 (933)
T ss_pred CCcEEEeecCceeEEEEeccccchheeec-ccCCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccCCccccc
Confidence 45568889999999999999987643321 1122343443334555666666668999999999998866554322110
Q ss_pred --CCccccccccccccC-CeEEEE-ECCEEEEEECCCCcEEEEEeccC
Q 003800 131 --KPLLLVPTNLKVDKD-SLILVS-SKGCLHAVSSIDGEILWTRDFAA 174 (794)
Q Consensus 131 --~~~~~~~~~~~~~~~-~~V~V~-~~g~l~ald~~tG~~~W~~~~~~ 174 (794)
..+...+ ++..+ +...+. .++.|..++..+++.....+...
T Consensus 186 ~~s~i~~~~---aW~Pk~g~la~~~~d~~Vkvy~r~~we~~f~Lr~~~ 230 (933)
T KOG1274|consen 186 ILSRICTRL---AWHPKGGTLAVPPVDNTVKVYSRKGWELQFKLRDKL 230 (933)
T ss_pred cccceeeee---eecCCCCeEEeeccCCeEEEEccCCceeheeecccc
Confidence 0011111 22222 444444 58999999999888887776543
No 129
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=73.87 E-value=89 Score=34.02 Aligned_cols=181 Identities=15% Similarity=0.250 Sum_probs=100.4
Q ss_pred ccccc-EeeEEeccCceeeeeeeee----------ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeee-e
Q 003800 26 DQVGL-MDWHQQYIGKVKHAVFHTQ----------KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDI-A 93 (794)
Q Consensus 26 dqvG~-~dW~~~~vG~~~~~~f~~~----------~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~ 93 (794)
.|.|+ ..|+...--..+ .|++. +.+...|.-++-+..+----.++|+.+=...--.. -+.-... .
T Consensus 282 sqDGkIKvWri~tG~ClR--rFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsS-yvn~a~ft~ 358 (508)
T KOG0275|consen 282 SQDGKIKVWRIETGQCLR--RFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKCLKEFRGHSS-YVNEATFTD 358 (508)
T ss_pred CcCCcEEEEEEecchHHH--HhhhhhccCeeEEEEccCcchhhcccccceEEEeccccchhHHHhcCccc-cccceEEcC
Confidence 57788 779987622221 12211 11233466666666776667788876533221111 0111111 2
Q ss_pred eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcccc-CCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEe
Q 003800 94 LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS-KPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRD 171 (794)
Q Consensus 94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s-~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~ 171 (794)
.|..++..|++ ++|+.|+..++.-+=.+.-.+...+ ......| -.....+|- ..++++-++ -.|+++-++.
T Consensus 359 dG~~iisaSsD-gtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~P-----Knpeh~iVCNrsntv~imn-~qGQvVrsfs 431 (508)
T KOG0275|consen 359 DGHHIISASSD-GTVKVWHGKTTECLSTFKPLGTDYPVNSVILLP-----KNPEHFIVCNRSNTVYIMN-MQGQVVRSFS 431 (508)
T ss_pred CCCeEEEecCC-ccEEEecCcchhhhhhccCCCCcccceeEEEcC-----CCCceEEEEcCCCeEEEEe-ccceEEeeec
Confidence 45556665654 6999999999987766654443221 1111222 112233333 467788777 4577777765
Q ss_pred ccCcc-eeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800 172 FAAES-VEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETA 220 (794)
Q Consensus 172 ~~~~~-~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v 220 (794)
...-. -....+..+..+.-+|.++-++ .+||+...+|..-....+
T Consensus 432 SGkREgGdFi~~~lSpkGewiYcigED~----vlYCF~~~sG~LE~tl~V 477 (508)
T KOG0275|consen 432 SGKREGGDFINAILSPKGEWIYCIGEDG----VLYCFSVLSGKLERTLPV 477 (508)
T ss_pred cCCccCCceEEEEecCCCcEEEEEccCc----EEEEEEeecCceeeeeec
Confidence 54211 0122233356677888887776 899999999987665543
No 130
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=73.50 E-value=1.6e+02 Score=33.00 Aligned_cols=149 Identities=14% Similarity=0.131 Sum_probs=71.2
Q ss_pred eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E---CCEEEEEECCCCcEE
Q 003800 94 LGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S---KGCLHAVSSIDGEIL 167 (794)
Q Consensus 94 ~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~---~g~l~ald~~tG~~~ 167 (794)
.|+.+++++.. ...++.||..+|+..-.....+... .+.+. .+ ++.+++. . ...++.+|..+|...
T Consensus 200 dg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~~~~~~--~~~~s-----pD-g~~l~~~~~~~~~~~i~~~d~~~~~~~ 271 (417)
T TIGR02800 200 DGQKLAYVSFESGKPEIYVQDLATGQREKVASFPGMNG--APAFS-----PD-GSKLAVSLSKDGNPDIYVMDLDGKQLT 271 (417)
T ss_pred CCCEEEEEEcCCCCcEEEEEECCCCCEEEeecCCCCcc--ceEEC-----CC-CCEEEEEECCCCCccEEEEECCCCCEE
Confidence 45556665432 2579999999997654433332211 11111 22 2345443 2 346899998887542
Q ss_pred EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCC-
Q 003800 168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTR- 245 (794)
Q Consensus 168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~- 245 (794)
=-........ ....+.++..+++.+..++ ...++.+|+.+|+.. ++.....-...+.+ ..+..+++.+...
T Consensus 272 ~l~~~~~~~~---~~~~s~dg~~l~~~s~~~g-~~~iy~~d~~~~~~~---~l~~~~~~~~~~~~spdg~~i~~~~~~~~ 344 (417)
T TIGR02800 272 RLTNGPGIDT---EPSWSPDGKSIAFTSDRGG-SPQIYMMDADGGEVR---RLTFRGGYNASPSWSPDGDLIAFVHREGG 344 (417)
T ss_pred ECCCCCCCCC---CEEECCCCCEEEEEECCCC-CceEEEEECCCCCEE---EeecCCCCccCeEECCCCCEEEEEEccCC
Confidence 1111111111 1111234455655444332 236888898888743 11111111111222 2334555554322
Q ss_pred -CeEEEEEeecce
Q 003800 246 -SILVTVSFKNRK 257 (794)
Q Consensus 246 -g~L~v~~l~sg~ 257 (794)
..++..++.++.
T Consensus 345 ~~~i~~~d~~~~~ 357 (417)
T TIGR02800 345 GFNIAVMDLDGGG 357 (417)
T ss_pred ceEEEEEeCCCCC
Confidence 267777877765
No 131
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=72.85 E-value=1.6e+02 Score=32.65 Aligned_cols=69 Identities=14% Similarity=0.253 Sum_probs=37.4
Q ss_pred cCCEEEEEEecCC-ceeEEEEEEcCCCceeeeeeeecccCccCceEEE---cCcEEEEEECCCCeEEEEEeec-ce
Q 003800 187 ESDQIYVVGYAGS-SQFHAYQINAMNGELLNHETAAFSGGFVGDVALV---SSDTLVTLDTTRSILVTVSFKN-RK 257 (794)
Q Consensus 187 ~~~~vyv~~~~g~-~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v---g~~~lv~~d~~~g~L~v~~l~s-g~ 257 (794)
....+|++...|. -.+..+.+|..+|+.-.-.+...+. +..|.+. .+.++++++...|.+.+.-+.. |.
T Consensus 50 ~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~~~~~g--~~p~yvsvd~~g~~vf~AnY~~g~v~v~p~~~dG~ 123 (346)
T COG2706 50 DQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNRQTLPG--SPPCYVSVDEDGRFVFVANYHSGSVSVYPLQADGS 123 (346)
T ss_pred CCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeeccccCC--CCCeEEEECCCCCEEEEEEccCceEEEEEcccCCc
Confidence 4446888776642 2356677888889876444322221 1124332 2235566665556666666644 44
No 132
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=72.81 E-value=1.4e+02 Score=31.99 Aligned_cols=103 Identities=14% Similarity=0.075 Sum_probs=55.0
Q ss_pred EccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE------CCEEEEEECCCC-------cEEE
Q 003800 102 SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS------KGCLHAVSSIDG-------EILW 168 (794)
Q Consensus 102 s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~------~g~l~ald~~tG-------~~~W 168 (794)
++.+...+.||.++|+.+-.+....++- ...+.. + ++.+++.. .+.|..+|..+- ++.-
T Consensus 70 GSAD~t~kLWDv~tGk~la~~k~~~~Vk--~~~F~~-----~-gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~~s~ep~~ 141 (327)
T KOG0643|consen 70 GSADQTAKLWDVETGKQLATWKTNSPVK--RVDFSF-----G-GNLILASTDKQMGYTCFVSVFDIRDDSSDIDSEEPYL 141 (327)
T ss_pred ccccceeEEEEcCCCcEEEEeecCCeeE--EEeecc-----C-CcEEEEEehhhcCcceEEEEEEccCChhhhcccCceE
Confidence 4446789999999999999988776542 222221 1 23333322 455666665421 2222
Q ss_pred EEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800 169 TRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 169 ~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
....+.. .+...+....+..++...-+ ..+..+|+.+|+.+-+.
T Consensus 142 kI~t~~s--kit~a~Wg~l~~~ii~Ghe~----G~is~~da~~g~~~v~s 185 (327)
T KOG0643|consen 142 KIPTPDS--KITSALWGPLGETIIAGHED----GSISIYDARTGKELVDS 185 (327)
T ss_pred EecCCcc--ceeeeeecccCCEEEEecCC----CcEEEEEcccCceeeec
Confidence 2222221 11222222234444432223 38999999999776554
No 133
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=71.17 E-value=1.7e+02 Score=33.89 Aligned_cols=147 Identities=18% Similarity=0.292 Sum_probs=78.7
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
.+-+|++|..|.|--=+..+|=..=- +...+..-++.....+..+.-++.++.|+.|| +-++.|...+..+.. .
T Consensus 339 ~~di~vGTtrN~iL~Gt~~~~f~~~v--~gh~delwgla~hps~~q~~T~gqdk~v~lW~--~~k~~wt~~~~d~~~--~ 412 (626)
T KOG2106|consen 339 KGDILVGTTRNFILQGTLENGFTLTV--QGHGDELWGLATHPSKNQLLTCGQDKHVRLWN--DHKLEWTKIIEDPAE--C 412 (626)
T ss_pred CCcEEEeeccceEEEeeecCCceEEE--EecccceeeEEcCCChhheeeccCcceEEEcc--CCceeEEEEecCcee--E
Confidence 33399999888765544444421111 11111222342233444444366678999999 889999999877643 1
Q ss_pred ccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCC
Q 003800 133 LLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMN 211 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~t 211 (794)
+.+ +..+.+.+. ..|+...+|.++-. +=+...... +..++.-..+|..++++... .++.++-+|. +
T Consensus 413 ~~f-------hpsg~va~Gt~~G~w~V~d~e~~~-lv~~~~d~~---~ls~v~ysp~G~~lAvgs~d-~~iyiy~Vs~-~ 479 (626)
T KOG2106|consen 413 ADF-------HPSGVVAVGTATGRWFVLDTETQD-LVTIHTDNE---QLSVVRYSPDGAFLAVGSHD-NHIYIYRVSA-N 479 (626)
T ss_pred eec-------cCcceEEEeeccceEEEEecccce-eEEEEecCC---ceEEEEEcCCCCEEEEecCC-CeEEEEEECC-C
Confidence 112 224545555 48999999998843 333333332 22333212344444444432 2456666663 5
Q ss_pred Cceeeee
Q 003800 212 GELLNHE 218 (794)
Q Consensus 212 G~~~w~~ 218 (794)
|+.....
T Consensus 480 g~~y~r~ 486 (626)
T KOG2106|consen 480 GRKYSRV 486 (626)
T ss_pred CcEEEEe
Confidence 5554433
No 134
>PRK05137 tolB translocation protein TolB; Provisional
Probab=69.06 E-value=2.1e+02 Score=32.61 Aligned_cols=137 Identities=14% Similarity=0.060 Sum_probs=65.0
Q ss_pred EEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEc--cCCeEEEEeCCCCcEeEEEeccCccccCCcccccccc
Q 003800 64 VIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSS--DGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNL 140 (794)
Q Consensus 64 ~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~--~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~ 140 (794)
.|...|...+.. ++.......+..... ..|+.+++++. ....|+.||..+|+..=-....+.. ..+.+
T Consensus 183 ~l~~~d~dg~~~--~~lt~~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~~l~~~~g~~--~~~~~----- 253 (435)
T PRK05137 183 RLAIMDQDGANV--RYLTDGSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRELVGNFPGMT--FAPRF----- 253 (435)
T ss_pred EEEEECCCCCCc--EEEecCCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEEEeecCCCcc--cCcEE-----
Confidence 677777754433 222222212222211 25666777763 2368999999999753111111111 11111
Q ss_pred ccccCCeEEE-EE---CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCce
Q 003800 141 KVDKDSLILV-SS---KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGEL 214 (794)
Q Consensus 141 ~~~~~~~V~V-~~---~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~ 214 (794)
..+ ++.+++ .. ...++.+|..+|+..=-...+... .....+.++..+++.+..++ ...++.+|+.+|+.
T Consensus 254 SPD-G~~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~~~~~~---~~~~~spDG~~i~f~s~~~g-~~~Iy~~d~~g~~~ 326 (435)
T PRK05137 254 SPD-GRKVVMSLSQGGNTDIYTMDLRSGTTTRLTDSPAID---TSPSYSPDGSQIVFESDRSG-SPQLYVMNADGSNP 326 (435)
T ss_pred CCC-CCEEEEEEecCCCceEEEEECCCCceEEccCCCCcc---CceeEcCCCCEEEEEECCCC-CCeEEEEECCCCCe
Confidence 123 334443 33 346999999888653211111111 11111334555554443221 23678888877765
No 135
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=68.87 E-value=1.6e+02 Score=32.28 Aligned_cols=170 Identities=18% Similarity=0.280 Sum_probs=78.4
Q ss_pred cccceeecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEE-EEECcCCccceEEEcCc-ccceeeeeeeeCC
Q 003800 19 PSLSLYEDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIA-SLDLRHGEIFWRHVLGI-NDVVDGIDIALGK 96 (794)
Q Consensus 19 ~~~Al~edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~-ALn~~tG~ivWR~~l~~-~~~i~~l~~~~g~ 96 (794)
...++|....|-..|+...-+... ....-....++++++.+..|.++ ..| .|+-.|+..-.. ...+..+....++
T Consensus 122 ~~G~iy~T~DgG~tW~~~~~~~~g-s~~~~~r~~dG~~vavs~~G~~~~s~~--~G~~~w~~~~r~~~~riq~~gf~~~~ 198 (302)
T PF14870_consen 122 DRGAIYRTTDGGKTWQAVVSETSG-SINDITRSSDGRYVAVSSRGNFYSSWD--PGQTTWQPHNRNSSRRIQSMGFSPDG 198 (302)
T ss_dssp TT--EEEESSTTSSEEEEE-S-----EEEEEE-TTS-EEEEETTSSEEEEE---TT-SS-EEEE--SSS-EEEEEE-TTS
T ss_pred CCCcEEEeCCCCCCeeEcccCCcc-eeEeEEECCCCcEEEEECcccEEEEec--CCCccceEEccCccceehhceecCCC
Confidence 446889888888899986643332 22221222466766666666554 555 688999975432 3334433222333
Q ss_pred EEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCc
Q 003800 97 YVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAE 175 (794)
Q Consensus 97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~ 175 (794)
.+..++ .|+.++-=+..+...-|+........ ..-.++. .+...++.+++.. +|.|+. ..+|-.-|+......
T Consensus 199 ~lw~~~-~Gg~~~~s~~~~~~~~w~~~~~~~~~-~~~~~ld--~a~~~~~~~wa~gg~G~l~~--S~DgGktW~~~~~~~ 272 (302)
T PF14870_consen 199 NLWMLA-RGGQIQFSDDPDDGETWSEPIIPIKT-NGYGILD--LAYRPPNEIWAVGGSGTLLV--STDGGKTWQKDRVGE 272 (302)
T ss_dssp -EEEEE-TTTEEEEEE-TTEEEEE---B-TTSS---S-EEE--EEESSSS-EEEEESTT-EEE--ESSTTSS-EE-GGGT
T ss_pred CEEEEe-CCcEEEEccCCCCccccccccCCccc-CceeeEE--EEecCCCCEEEEeCCccEEE--eCCCCccceECcccc
Confidence 444434 57788888867788889886544311 1111111 1222246677764 554432 356667899876533
Q ss_pred ce--eeeeEEEEecCCEEEEEEecC
Q 003800 176 SV--EVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 176 ~~--~~~~~v~s~~~~~vyv~~~~g 198 (794)
.. .++.++. ..+++-|+++..|
T Consensus 273 ~~~~n~~~i~f-~~~~~gf~lG~~G 296 (302)
T PF14870_consen 273 NVPSNLYRIVF-VNPDKGFVLGQDG 296 (302)
T ss_dssp TSSS---EEEE-EETTEEEEE-STT
T ss_pred CCCCceEEEEE-cCCCceEEECCCc
Confidence 22 2445542 4667888887666
No 136
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=68.85 E-value=2.4e+02 Score=33.12 Aligned_cols=114 Identities=14% Similarity=0.017 Sum_probs=78.7
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCccc-ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGIND-VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~-~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
...+..++..|.+...+..-|++-|+...+... .+....-...-+.++-++.+.++--|+..+++..-.+....+.. .
T Consensus 70 t~~lvlgt~~g~v~~ys~~~g~it~~~st~~h~~~v~~~~~~~~~~ciyS~~ad~~v~~~~~~~~~~~~~~~~~~~~~-~ 148 (541)
T KOG4547|consen 70 TSMLVLGTPQGSVLLYSVAGGEITAKLSTDKHYGNVNEILDAQRLGCIYSVGADLKVVYILEKEKVIIRIWKEQKPLV-S 148 (541)
T ss_pred ceEEEeecCCccEEEEEecCCeEEEEEecCCCCCcceeeecccccCceEecCCceeEEEEecccceeeeeeccCCCcc-c
Confidence 344788899999999999999999998754432 22222111222345434445799999999999988877766544 2
Q ss_pred CccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccC
Q 003800 132 PLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAA 174 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~ 174 (794)
+..+.+ ++.+.+...+.+-.+|..+++++=++..-.
T Consensus 149 sl~is~-------D~~~l~~as~~ik~~~~~~kevv~~ftgh~ 184 (541)
T KOG4547|consen 149 SLCISP-------DGKILLTASRQIKVLDIETKEVVITFTGHG 184 (541)
T ss_pred eEEEcC-------CCCEEEeccceEEEEEccCceEEEEecCCC
Confidence 333332 455666678899999999999998886543
No 137
>PRK02888 nitrous-oxide reductase; Validated
Probab=68.65 E-value=1.3e+02 Score=36.13 Aligned_cols=150 Identities=12% Similarity=0.127 Sum_probs=81.9
Q ss_pred eeeeccCCC-EEEEEeC-CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEc----cCCeEEEEeCCCCcEe
Q 003800 46 FHTQKTGRK-RVVVSTE-ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSS----DGSTLRAWNLPDGQMV 119 (794)
Q Consensus 46 f~~~~~~~~-~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~----~g~~v~A~d~~tG~ll 119 (794)
|.-|-..++ .++..++ .|.+.++|+++-++.|+..++.. .+.....-++..++++. .+..+...++.+-.
T Consensus 196 ~~~PlpnDGk~l~~~~ey~~~vSvID~etmeV~~qV~Vdgn--pd~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d-- 271 (635)
T PRK02888 196 FRIPLPNDGKDLDDPKKYRSLFTAVDAETMEVAWQVMVDGN--LDNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERD-- 271 (635)
T ss_pred cccccCCCCCEeecccceeEEEEEEECccceEEEEEEeCCC--cccceECCCCCEEEEeccCcccCcceeeeccccCc--
Confidence 333433244 4555544 58999999999999999988764 22222233556676664 24556666654433
Q ss_pred EEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCC----C-cEEEEEeccCcceeeeeEEEEecCCEEEEE
Q 003800 120 WESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSID----G-EILWTRDFAAESVEVQQVIQLDESDQIYVV 194 (794)
Q Consensus 120 We~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~t----G-~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~ 194 (794)
|-..+.-... ...+ .+ ++..++ .+++|..+|..+ | +++=....+. . |-.+-.+.++..+|+.
T Consensus 272 ~~vvfni~~i---ea~v-----kd-GK~~~V-~gn~V~VID~~t~~~~~~~v~~yIPVGK--s-PHGV~vSPDGkylyVa 338 (635)
T PRK02888 272 WVVVFNIARI---EEAV-----KA-GKFKTI-GGSKVPVVDGRKAANAGSALTRYVPVPK--N-PHGVNTSPDGKYFIAN 338 (635)
T ss_pred eEEEEchHHH---HHhh-----hC-CCEEEE-CCCEEEEEECCccccCCcceEEEEECCC--C-ccceEECCCCCEEEEe
Confidence 4433332211 0111 11 333443 577899999988 4 3333333332 2 3344323445556654
Q ss_pred EecCCceeEEEEEEcCCCcee
Q 003800 195 GYAGSSQFHAYQINAMNGELL 215 (794)
Q Consensus 195 ~~~g~~~~~v~ald~~tG~~~ 215 (794)
+--.. .+..+|.++-+..
T Consensus 339 nklS~---tVSVIDv~k~k~~ 356 (635)
T PRK02888 339 GKLSP---TVTVIDVRKLDDL 356 (635)
T ss_pred CCCCC---cEEEEEChhhhhh
Confidence 43222 6888888876654
No 138
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=68.47 E-value=2.2e+02 Score=32.58 Aligned_cols=60 Identities=15% Similarity=0.101 Sum_probs=33.8
Q ss_pred CCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCCCeEEEEEeec
Q 003800 188 SDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 188 ~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~g~L~v~~l~s 255 (794)
..++|-++.+. .+-+.|...|.++-... .|..+.. +.+ .+...++|... .|.++..++..
T Consensus 188 ~~rl~TaS~D~----t~k~wdlS~g~LLlti~--fp~si~a-v~lDpae~~~yiGt~-~G~I~~~~~~~ 248 (476)
T KOG0646|consen 188 NARLYTASEDR----TIKLWDLSLGVLLLTIT--FPSSIKA-VALDPAERVVYIGTE-EGKIFQNLLFK 248 (476)
T ss_pred cceEEEecCCc----eEEEEEeccceeeEEEe--cCCccee-EEEcccccEEEecCC-cceEEeeehhc
Confidence 45667665553 56667888898887663 4443322 211 13334445543 47777777654
No 139
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=67.25 E-value=45 Score=38.08 Aligned_cols=141 Identities=11% Similarity=0.141 Sum_probs=84.5
Q ss_pred EEEEEE-ccCCeEEEEeCCC-CcEeEEEeccCccccCCccccccccccccCCeEEE--EECCEEEEEECCCCcEEEEEec
Q 003800 97 YVITLS-SDGSTLRAWNLPD-GQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV--SSKGCLHAVSSIDGEILWTRDF 172 (794)
Q Consensus 97 ~~V~Vs-~~g~~v~A~d~~t-G~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V--~~~g~l~ald~~tG~~~W~~~~ 172 (794)
+.+++| +.++.|..||.-+ |+.+-.+....... .++..- ..+.=|. ..|+.+.--|.+||+++=++..
T Consensus 227 ~hLlLS~gmD~~vklW~vy~~~~~lrtf~gH~k~V-rd~~~s-------~~g~~fLS~sfD~~lKlwDtETG~~~~~f~~ 298 (503)
T KOG0282|consen 227 GHLLLSGGMDGLVKLWNVYDDRRCLRTFKGHRKPV-RDASFN-------NCGTSFLSASFDRFLKLWDTETGQVLSRFHL 298 (503)
T ss_pred eeEEEecCCCceEEEEEEecCcceehhhhcchhhh-hhhhcc-------ccCCeeeeeecceeeeeeccccceEEEEEec
Confidence 345554 5578999999987 88887777766544 222221 1333333 2499999999999999998877
Q ss_pred cCcceeeeeEEE-EecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEE
Q 003800 173 AAESVEVQQVIQ-LDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVT 250 (794)
Q Consensus 173 ~~~~~~~~~~v~-s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v 250 (794)
..... ++- -.++..+|++|...+ ++...|..+|+++-++.-.+.... +..|+ ++..++.. ++.+++.+
T Consensus 299 ~~~~~----cvkf~pd~~n~fl~G~sd~---ki~~wDiRs~kvvqeYd~hLg~i~--~i~F~~~g~rFiss-SDdks~ri 368 (503)
T KOG0282|consen 299 DKVPT----CVKFHPDNQNIFLVGGSDK---KIRQWDIRSGKVVQEYDRHLGAIL--DITFVDEGRRFISS-SDDKSVRI 368 (503)
T ss_pred CCCce----eeecCCCCCcEEEEecCCC---cEEEEeccchHHHHHHHhhhhhee--eeEEccCCceEeee-ccCccEEE
Confidence 65321 221 123446666554433 899999999999887742222211 33344 33344333 23455555
Q ss_pred EEeec
Q 003800 251 VSFKN 255 (794)
Q Consensus 251 ~~l~s 255 (794)
-+...
T Consensus 369 We~~~ 373 (503)
T KOG0282|consen 369 WENRI 373 (503)
T ss_pred EEcCC
Confidence 44443
No 140
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=67.19 E-value=2.1e+02 Score=31.85 Aligned_cols=52 Identities=15% Similarity=0.019 Sum_probs=34.2
Q ss_pred EEEEEEcCCCceeeeeeeecccCccCceEEEcCc-EEEEEECCCCeEEEEEeecce
Q 003800 203 HAYQINAMNGELLNHETAAFSGGFVGDVALVSSD-TLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 203 ~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~-~lv~~d~~~g~L~v~~l~sg~ 257 (794)
.+-..|..||..+-+.. +-...+.+..+-.|+. ++-|.| +++|++-|+++++
T Consensus 315 tIk~wdv~tg~cL~tL~-ghdnwVr~~af~p~Gkyi~ScaD--Dktlrvwdl~~~~ 367 (406)
T KOG0295|consen 315 TIKIWDVSTGMCLFTLV-GHDNWVRGVAFSPGGKYILSCAD--DKTLRVWDLKNLQ 367 (406)
T ss_pred eEEEEeccCCeEEEEEe-cccceeeeeEEcCCCeEEEEEec--CCcEEEEEeccce
Confidence 67888999998876652 2233444433323444 444665 7899999999987
No 141
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=67.05 E-value=2.3e+02 Score=32.27 Aligned_cols=129 Identities=14% Similarity=0.160 Sum_probs=64.4
Q ss_pred ccEeeEEeccCceeee--eeeeecc---CCCEEEEEeCCCEEEEEECcCCccceEEEcCcc----c---ceeeeeeeeCC
Q 003800 29 GLMDWHQQYIGKVKHA--VFHTQKT---GRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN----D---VVDGIDIALGK 96 (794)
Q Consensus 29 G~~dW~~~~vG~~~~~--~f~~~~~---~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~----~---~i~~l~~~~g~ 96 (794)
+-.-|++.. .|..+ .+..... +.++-++....|.|. -.+||-.-|++..... + ....+.. .++
T Consensus 73 ~G~~W~q~~--~p~~~~~~L~~V~F~~~d~~~GwAVG~~G~IL--~T~DGG~tW~~~~~~~~~~~~~~~~l~~v~f-~~~ 147 (398)
T PLN00033 73 QSSEWEQVD--LPIDPGVVLLDIAFVPDDPTHGFLLGTRQTLL--ETKDGGKTWVPRSIPSAEDEDFNYRFNSISF-KGK 147 (398)
T ss_pred CCCccEEee--cCCCCCCceEEEEeccCCCCEEEEEcCCCEEE--EEcCCCCCceECccCcccccccccceeeeEE-ECC
Confidence 334599876 34322 2222222 355677777788764 4458999999854211 1 1122212 344
Q ss_pred EEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEe
Q 003800 97 YVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRD 171 (794)
Q Consensus 97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~ 171 (794)
..++++.. -+.+-..||-.-|+..............+ ....++..++. ..|.+++- .+|-..|+..
T Consensus 148 ~g~~vG~~---G~il~T~DgG~tW~~~~~~~~~p~~~~~i----~~~~~~~~~ivg~~G~v~~S--~D~G~tW~~~ 214 (398)
T PLN00033 148 EGWIIGKP---AILLHTSDGGETWERIPLSPKLPGEPVLI----KATGPKSAEMVTDEGAIYVT--SNAGRNWKAA 214 (398)
T ss_pred EEEEEcCc---eEEEEEcCCCCCceECccccCCCCCceEE----EEECCCceEEEeccceEEEE--CCCCCCceEc
Confidence 44443332 36666779999998754321110111111 11113333333 45654444 4666788864
No 142
>PRK13684 Ycf48-like protein; Provisional
Probab=67.01 E-value=2.1e+02 Score=31.67 Aligned_cols=179 Identities=15% Similarity=0.219 Sum_probs=88.2
Q ss_pred cceeecccccEeeEEeccCc--eeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCE
Q 003800 21 LSLYEDQVGLMDWHQQYIGK--VKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKY 97 (794)
Q Consensus 21 ~Al~edqvG~~dW~~~~vG~--~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~ 97 (794)
..+|..+.|-..|+....+. +.+. +.-.....+.++++++.|.|+.- .||-.-|+...... ..+..+....++.
T Consensus 109 g~i~~S~DgG~tW~~~~~~~~~~~~~-~~i~~~~~~~~~~~g~~G~i~~S--~DgG~tW~~~~~~~~g~~~~i~~~~~g~ 185 (334)
T PRK13684 109 SLLLHTTDGGKNWTRIPLSEKLPGSP-YLITALGPGTAEMATNVGAIYRT--TDGGKNWEALVEDAAGVVRNLRRSPDGK 185 (334)
T ss_pred ceEEEECCCCCCCeEccCCcCCCCCc-eEEEEECCCcceeeeccceEEEE--CCCCCCceeCcCCCcceEEEEEECCCCe
Confidence 45788777777898765431 1111 11111234557777877766554 47888899755432 2233332223334
Q ss_pred EEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEE-eccCc
Q 003800 98 VITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTR-DFAAE 175 (794)
Q Consensus 98 ~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~-~~~~~ 175 (794)
.++++..| .++.- ..+|..-|+..-.... ..+..+ ....++.+++. .+|.+. +...+|-..|+. ..+..
T Consensus 186 ~v~~g~~G-~i~~s-~~~gg~tW~~~~~~~~--~~l~~i----~~~~~g~~~~vg~~G~~~-~~s~d~G~sW~~~~~~~~ 256 (334)
T PRK13684 186 YVAVSSRG-NFYST-WEPGQTAWTPHQRNSS--RRLQSM----GFQPDGNLWMLARGGQIR-FNDPDDLESWSKPIIPEI 256 (334)
T ss_pred EEEEeCCc-eEEEE-cCCCCCeEEEeeCCCc--ccceee----eEcCCCCEEEEecCCEEE-EccCCCCCccccccCCcc
Confidence 44445544 44432 2467788986533221 111111 11113444444 466543 434566678885 22211
Q ss_pred --ceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800 176 --SVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 176 --~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
......+. ...++.+|+++..| .++ .. .+|-.-|+.
T Consensus 257 ~~~~~l~~v~-~~~~~~~~~~G~~G----~v~-~S-~d~G~tW~~ 294 (334)
T PRK13684 257 TNGYGYLDLA-YRTPGEIWAGGGNG----TLL-VS-KDGGKTWEK 294 (334)
T ss_pred ccccceeeEE-EcCCCCEEEEcCCC----eEE-Ee-CCCCCCCeE
Confidence 11122222 13466788877666 222 22 355556776
No 143
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=66.38 E-value=2.6e+02 Score=32.63 Aligned_cols=182 Identities=10% Similarity=0.141 Sum_probs=96.9
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
.++|++.+-.|.|--||+.++++ =++.-+...+|..+.+..++..++-++.+|.+..||..+|.-- ++.+... +
T Consensus 290 kd~lItVSl~G~in~ln~~d~~~-~~~i~GHnK~ITaLtv~~d~~~i~SgsyDG~I~~W~~~~g~~~---~~~g~~h--~ 363 (603)
T KOG0318|consen 290 KDHLITVSLSGTINYLNPSDPSV-LKVISGHNKSITALTVSPDGKTIYSGSYDGHINSWDSGSGTSD---RLAGKGH--T 363 (603)
T ss_pred CCeEEEEEcCcEEEEecccCCCh-hheecccccceeEEEEcCCCCEEEeeccCceEEEEecCCcccc---ccccccc--c
Confidence 67899999999999999999994 3433333345665644444455664556789999999988643 2222111 1
Q ss_pred ccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEE--EeccCcceeeeeEEEEecC-CEEEEEEecCCceeEEEEEE
Q 003800 133 LLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWT--RDFAAESVEVQQVIQLDES-DQIYVVGYAGSSQFHAYQIN 208 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~--~~~~~~~~~~~~~v~s~~~-~~vyv~~~~g~~~~~v~ald 208 (794)
..+..+ +....+.++-. .|.+|..++...+..-=. .+.+.. |..+- ...+ +.+.+++.. .++-|.
T Consensus 364 nqI~~~--~~~~~~~~~t~g~Dd~l~~~~~~~~~~t~~~~~~lg~Q---P~~la-v~~d~~~avv~~~~-----~iv~l~ 432 (603)
T KOG0318|consen 364 NQIKGM--AASESGELFTIGWDDTLRVISLKDNGYTKSEVVKLGSQ---PKGLA-VLSDGGTAVVACIS-----DIVLLQ 432 (603)
T ss_pred ceEEEE--eecCCCcEEEEecCCeEEEEecccCcccccceeecCCC---ceeEE-EcCCCCEEEEEecC-----cEEEEe
Confidence 122221 22223455554 588999998764322111 222221 22221 1233 344444433 244444
Q ss_pred cCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 209 AMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 209 ~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
-.++ +.+. |-+...+++.+ -++-.+|+-...+.+|+..|..+.
T Consensus 433 ~~~~--~~~~----~~~y~~s~vAv~~~~~~vaVGG~Dgkvhvysl~g~~ 476 (603)
T KOG0318|consen 433 DQTK--VSSI----PIGYESSAVAVSPDGSEVAVGGQDGKVHVYSLSGDE 476 (603)
T ss_pred cCCc--ceee----ccccccceEEEcCCCCEEEEecccceEEEEEecCCc
Confidence 3222 2222 22333344444 223344554456788888887655
No 144
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=66.16 E-value=69 Score=33.10 Aligned_cols=110 Identities=13% Similarity=0.020 Sum_probs=74.5
Q ss_pred CCEEEEEeC-CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 53 RKRVVVSTE-ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 53 ~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
++.+|..|- +|+-.-.|++|=+++=|+..+..+ -++ ..++.-+.+|.....++-.|++|=.+.=+........
T Consensus 100 gd~~y~LTw~egvaf~~d~~t~~~lg~~~y~GeG--WgL--t~d~~~LimsdGsatL~frdP~tfa~~~~v~VT~~g~-- 173 (262)
T COG3823 100 GDYFYQLTWKEGVAFKYDADTLEELGRFSYEGEG--WGL--TSDDKNLIMSDGSATLQFRDPKTFAELDTVQVTDDGV-- 173 (262)
T ss_pred cceEEEEEeccceeEEEChHHhhhhcccccCCcc--eee--ecCCcceEeeCCceEEEecCHHHhhhcceEEEEECCe--
Confidence 677999885 688888999998888888776652 234 3444445556555789999999887777766654321
Q ss_pred CccccccccccccCCeEEEE--ECCEEEEEECCCCcEE-EEE
Q 003800 132 PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEIL-WTR 170 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~-W~~ 170 (794)
|+.-.+...-.++.++.- ...++.++++++|+++ |-.
T Consensus 174 --pv~~LNELE~VdG~lyANVw~t~~I~rI~p~sGrV~~wid 213 (262)
T COG3823 174 --PVSKLNELEWVDGELYANVWQTTRIARIDPDSGRVVAWID 213 (262)
T ss_pred --ecccccceeeeccEEEEeeeeecceEEEcCCCCcEEEEEE
Confidence 222222223336777773 4888999999999976 543
No 145
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=65.88 E-value=1.5e+02 Score=32.24 Aligned_cols=61 Identities=13% Similarity=0.142 Sum_probs=41.7
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPD 115 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~t 115 (794)
...+++++-+|.|--+|..+|...== ....+.+.++...-..+.|+-++.++++..||+..
T Consensus 65 ~~~~~~G~~dg~vr~~Dln~~~~~~i--gth~~~i~ci~~~~~~~~vIsgsWD~~ik~wD~R~ 125 (323)
T KOG1036|consen 65 ESTIVTGGLDGQVRRYDLNTGNEDQI--GTHDEGIRCIEYSYEVGCVISGSWDKTIKFWDPRN 125 (323)
T ss_pred CceEEEeccCceEEEEEecCCcceee--ccCCCceEEEEeeccCCeEEEcccCccEEEEeccc
Confidence 56799999999999999999865321 11222344443233445555577889999999976
No 146
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=65.81 E-value=1.9e+02 Score=30.68 Aligned_cols=159 Identities=14% Similarity=0.131 Sum_probs=86.3
Q ss_pred CEEEEEEccCCeEEEEeCC------CCcEeEEEeccCccccCCccccccccccc-cCCeEEEE-ECCEEEEEECCCCcEE
Q 003800 96 KYVITLSSDGSTLRAWNLP------DGQMVWESFLRGSKHSKPLLLVPTNLKVD-KDSLILVS-SKGCLHAVSSIDGEIL 167 (794)
Q Consensus 96 ~~~V~Vs~~g~~v~A~d~~------tG~llWe~~l~~~~~s~~~~~~~~~~~~~-~~~~V~V~-~~g~l~ald~~tG~~~ 167 (794)
++.+..+++ |.|++|.=+ -=+.+||....-...+...|-+. ..-.+ ..+.++.. .|+.+|..|.++|+..
T Consensus 72 d~~Lls~gd-G~V~gw~W~E~~es~~~K~lwe~~~P~~~~~~evPeIN-am~ldP~enSi~~AgGD~~~y~~dlE~G~i~ 149 (325)
T KOG0649|consen 72 DDFLLSGGD-GLVYGWEWNEEEESLATKRLWEVKIPMQVDAVEVPEIN-AMWLDPSENSILFAGGDGVIYQVDLEDGRIQ 149 (325)
T ss_pred hhheeeccC-ceEEEeeehhhhhhccchhhhhhcCccccCcccCCccc-eeEeccCCCcEEEecCCeEEEEEEecCCEEE
Confidence 445554554 799999632 33678988764322101111110 00111 13444444 6999999999999999
Q ss_pred EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeec-c---c-CccC--ceEEEcCcEEEE
Q 003800 168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAF-S---G-GFVG--DVALVSSDTLVT 240 (794)
Q Consensus 168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~-~---~-~~s~--~~~~vg~~~lv~ 240 (794)
=+++--...+ -.++.-...+.++-.+-+| .+...|.+|++-+.....-- + + .... .++-++..-++|
T Consensus 150 r~~rGHtDYv--H~vv~R~~~~qilsG~EDG----tvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvC 223 (325)
T KOG0649|consen 150 REYRGHTDYV--HSVVGRNANGQILSGAEDG----TVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVC 223 (325)
T ss_pred EEEcCCccee--eeeeecccCcceeecCCCc----cEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEe
Confidence 8887654432 1122113455666544444 78889999998764432111 1 0 1111 244445667788
Q ss_pred EECCCCeEEEEEeecceeeeEEEee
Q 003800 241 LDTTRSILVTVSFKNRKIAFQETHL 265 (794)
Q Consensus 241 ~d~~~g~L~v~~l~sg~~~~~~~~l 265 (794)
.- ...|..-.|.+-+ ....+|+
T Consensus 224 Gg--Gp~lslwhLrsse-~t~vfpi 245 (325)
T KOG0649|consen 224 GG--GPKLSLWHLRSSE-STCVFPI 245 (325)
T ss_pred cC--CCceeEEeccCCC-ceEEEec
Confidence 73 3355555666544 3555565
No 147
>PLN02153 epithiospecifier protein
Probab=65.49 E-value=2.2e+02 Score=31.33 Aligned_cols=152 Identities=13% Similarity=0.096 Sum_probs=77.4
Q ss_pred CCEEEEEeCC------CEEEEEECcCCccceEEEcCc-----ccc-eeeeeeeeCCEEEEEEccC-----------CeEE
Q 003800 53 RKRVVVSTEE------NVIASLDLRHGEIFWRHVLGI-----NDV-VDGIDIALGKYVITLSSDG-----------STLR 109 (794)
Q Consensus 53 ~~~Vyv~t~~------g~l~ALn~~tG~ivWR~~l~~-----~~~-i~~l~~~~g~~~V~Vs~~g-----------~~v~ 109 (794)
+++||+.... +.+..+|+++. .|+..-.- +.. .....+..++.+++++|.. ..+.
T Consensus 85 ~~~iyv~GG~~~~~~~~~v~~yd~~t~--~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~ 162 (341)
T PLN02153 85 GTKLYIFGGRDEKREFSDFYSYDTVKN--EWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIE 162 (341)
T ss_pred CCEEEEECCCCCCCccCcEEEEECCCC--EEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCcccceEE
Confidence 6778887652 46889999875 59864321 100 1111123455566666632 2578
Q ss_pred EEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC---------------CEEEEEECCCCcEEEEEecc-
Q 003800 110 AWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK---------------GCLHAVSSIDGEILWTRDFA- 173 (794)
Q Consensus 110 A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~---------------g~l~ald~~tG~~~W~~~~~- 173 (794)
.||+.+. .|+..-..... ..+-.... ....++.+++..+ ..+.++|..+. .|+.-..
T Consensus 163 ~yd~~~~--~W~~l~~~~~~--~~~r~~~~-~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~--~W~~~~~~ 235 (341)
T PLN02153 163 AYNIADG--KWVQLPDPGEN--FEKRGGAG-FAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASG--KWTEVETT 235 (341)
T ss_pred EEECCCC--eEeeCCCCCCC--CCCCCcce-EEEECCeEEEEeccccccccCCccceecCceEEEEcCCC--cEEecccc
Confidence 8998765 58853221100 00000000 1112456666421 35788887754 4876432
Q ss_pred ----CcceeeeeEEEEecCCEEEEEEecC---------C--ceeEEEEEEcCCCceeeee
Q 003800 174 ----AESVEVQQVIQLDESDQIYVVGYAG---------S--SQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 174 ----~~~~~~~~~v~s~~~~~vyv~~~~g---------~--~~~~v~ald~~tG~~~w~~ 218 (794)
.+.. ...+ ..-++.+|+.+... . ..-.++++|+.+. .|+.
T Consensus 236 g~~P~~r~-~~~~--~~~~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~~--~W~~ 290 (341)
T PLN02153 236 GAKPSARS-VFAH--AVVGKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTETL--VWEK 290 (341)
T ss_pred CCCCCCcc-eeee--EEECCEEEEECcccCCccccccccccccccEEEEEcCcc--EEEe
Confidence 1111 1111 23578899876531 0 0125788887644 5764
No 148
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=65.24 E-value=2.1e+02 Score=31.15 Aligned_cols=154 Identities=11% Similarity=0.058 Sum_probs=86.9
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCE--EEEEEccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKY--VITLSSDGSTLRAWNLPDGQMVWESFLRGSKH 129 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~--~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~ 129 (794)
++.+||.++-++.+--.|..+|++. +.-.....+...+-..+.. .++-|+.+.+|+-||...-.++=+..+..-.+
T Consensus 83 dgskVf~g~~Dk~~k~wDL~S~Q~~--~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R~~~pv~t~~LPeRvY 160 (347)
T KOG0647|consen 83 DGSKVFSGGCDKQAKLWDLASGQVS--QVAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTRSSNPVATLQLPERVY 160 (347)
T ss_pred CCceEEeeccCCceEEEEccCCCee--eeeecccceeEEEEecCCCcceeEecccccceeecccCCCCeeeeeeccceee
Confidence 4667999999999999999999652 2222222344333222222 33325568899999999999998888876544
Q ss_pred cCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEE-eccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800 130 SKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTR-DFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQI 207 (794)
Q Consensus 130 s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~-~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al 207 (794)
+.+ + . -...+|. .+..+..+++.+|-..-+. +.|... ..+++....+..-|++|..-| ++..-
T Consensus 161 a~D--v-------~-~pm~vVata~r~i~vynL~n~~te~k~~~SpLk~--Q~R~va~f~d~~~~alGsiEG---rv~iq 225 (347)
T KOG0647|consen 161 AAD--V-------L-YPMAVVATAERHIAVYNLENPPTEFKRIESPLKW--QTRCVACFQDKDGFALGSIEG---RVAIQ 225 (347)
T ss_pred ehh--c-------c-CceeEEEecCCcEEEEEcCCCcchhhhhcCcccc--eeeEEEEEecCCceEeeeecc---eEEEE
Confidence 111 1 1 2233443 5788999998887543221 111111 112222233444455543322 66666
Q ss_pred EcCCCceeeeeeeec
Q 003800 208 NAMNGELLNHETAAF 222 (794)
Q Consensus 208 d~~tG~~~w~~~v~~ 222 (794)
....|.+.....+.+
T Consensus 226 ~id~~~~~~nFtFkC 240 (347)
T KOG0647|consen 226 YIDDPNPKDNFTFKC 240 (347)
T ss_pred ecCCCCccCceeEEE
Confidence 666666544444333
No 149
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=64.52 E-value=48 Score=30.42 Aligned_cols=71 Identities=27% Similarity=0.442 Sum_probs=42.3
Q ss_pred CccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEE-E
Q 003800 73 GEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-S 151 (794)
Q Consensus 73 G~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~ 151 (794)
+.++|......+ ......+.+..+ |.++..|. +|..+|...... .. ...+++ .
T Consensus 41 ~~~vW~snt~~~--------~~~~~~l~l~~d-GnLvl~~~-~g~~vW~S~~~~-~~---------------~~~~~~L~ 94 (116)
T cd00028 41 RTVVWVANRDNP--------SGSSCTLTLQSD-GNLVIYDG-SGTVVWSSNTTR-VN---------------GNYVLVLL 94 (116)
T ss_pred CeEEEECCCCCC--------CCCCEEEEEecC-CCeEEEcC-CCcEEEEecccC-CC---------------CceEEEEe
Confidence 678898655332 112223444554 46777776 689999866543 10 122333 3
Q ss_pred ECCEEEEEECCCCcEEEEE
Q 003800 152 SKGCLHAVSSIDGEILWTR 170 (794)
Q Consensus 152 ~~g~l~ald~~tG~~~W~~ 170 (794)
.+|.|.-++. +|+++|+-
T Consensus 95 ddGnlvl~~~-~~~~~W~S 112 (116)
T cd00028 95 DDGNLVLYDS-DGNFLWQS 112 (116)
T ss_pred CCCCEEEECC-CCCEEEcC
Confidence 6788777775 58999974
No 150
>KOG1027 consensus Serine/threonine protein kinase and endoribonuclease ERN1/IRE1, sensor of the unfolded protein response pathway [Signal transduction mechanisms]
Probab=63.42 E-value=30 Score=42.30 Aligned_cols=109 Identities=17% Similarity=0.246 Sum_probs=66.5
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
.++.+|.++.++.-+-+|++||+..|.....++ +..+ +..+..- .+|.-.|..+=...|......... .
T Consensus 106 sdGi~ysg~k~d~~~lvD~~tg~~~~tf~~~~~--~~~~-v~~grt~-------ytv~m~d~~~~~~~wn~t~~dy~a-~ 174 (903)
T KOG1027|consen 106 SDGILYSGSKQDIWYLVDPKTGEIDYTFNTAEP--IKQL-VYLGRTN-------YTVTMYDKNVRGKTWNTTFGDYSA-Q 174 (903)
T ss_pred CCCeEEecccccceEEecCCccceeEEEecCCc--chhh-eecccce-------eEEecccCcccCceeeccccchhc-c
Confidence 477799999999999999999999999887664 3322 1222222 233333444445556555443221 1
Q ss_pred CccccccccccccCCeEEE--EECCEEEEEECCCCcEEEEEeccCcce
Q 003800 132 PLLLVPTNLKVDKDSLILV--SSKGCLHAVSSIDGEILWTRDFAAESV 177 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V--~~~g~l~ald~~tG~~~W~~~~~~~~~ 177 (794)
.++-. .+.....+ .++|-+.-+|.++|+.+|..+...+..
T Consensus 175 ~~~~~------~~~~~~~~~~~~~g~i~t~D~~~g~~~~~q~~~spvv 216 (903)
T KOG1027|consen 175 YPSGV------RGEKMSHFHSLGNGYIVTVDSESGEKLWLQDLLSPVV 216 (903)
T ss_pred CCCcc------CCceeEEEeecCCccEEeccCcccceeeccccCCceE
Confidence 11111 11122222 247777789999999999998876643
No 151
>PF08450 SGL: SMP-30/Gluconolactonase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=63.25 E-value=1.9e+02 Score=29.91 Aligned_cols=145 Identities=17% Similarity=0.234 Sum_probs=82.1
Q ss_pred CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccc-cCCeEEEEECCEEEEEECCCCcEEEEEecc
Q 003800 95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVD-KDSLILVSSKGCLHAVSSIDGEILWTRDFA 173 (794)
Q Consensus 95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~-~~~~V~V~~~g~l~ald~~tG~~~W~~~~~ 173 (794)
.+.++++...++.|+.||+.+|+.. ...+..+. +. ... .++.+++...+.+..+|..+|+..--.+..
T Consensus 11 ~g~l~~~D~~~~~i~~~~~~~~~~~-~~~~~~~~---G~-------~~~~~~g~l~v~~~~~~~~~d~~~g~~~~~~~~~ 79 (246)
T PF08450_consen 11 DGRLYWVDIPGGRIYRVDPDTGEVE-VIDLPGPN---GM-------AFDRPDGRLYVADSGGIAVVDPDTGKVTVLADLP 79 (246)
T ss_dssp TTEEEEEETTTTEEEEEETTTTEEE-EEESSSEE---EE-------EEECTTSEEEEEETTCEEEEETTTTEEEEEEEEE
T ss_pred CCEEEEEEcCCCEEEEEECCCCeEE-EEecCCCc---eE-------EEEccCCEEEEEEcCceEEEecCCCcEEEEeecc
Confidence 3445554445789999999888653 22322211 11 112 257777877666677799999766544442
Q ss_pred --C-cceeeeeEEEEecCCEEEEEEecCC---ce--eEEEEEEcCCCceee-eeeeecccCccCceEEEcCcEEEEEECC
Q 003800 174 --A-ESVEVQQVIQLDESDQIYVVGYAGS---SQ--FHAYQINAMNGELLN-HETAAFSGGFVGDVALVSSDTLVTLDTT 244 (794)
Q Consensus 174 --~-~~~~~~~~v~s~~~~~vyv~~~~g~---~~--~~v~ald~~tG~~~w-~~~v~~~~~~s~~~~~vg~~~lv~~d~~ 244 (794)
. +...+--+. ...++.+|+...... .. ..++.+++. |+... ...+..|.++ ++-..++.+++.|+.
T Consensus 80 ~~~~~~~~~ND~~-vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~~~~pNGi---~~s~dg~~lyv~ds~ 154 (246)
T PF08450_consen 80 DGGVPFNRPNDVA-VDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADGLGFPNGI---AFSPDGKTLYVADSF 154 (246)
T ss_dssp TTCSCTEEEEEEE-E-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEEESSEEEE---EEETTSSEEEEEETT
T ss_pred CCCcccCCCceEE-EcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecCcccccce---EECCcchheeecccc
Confidence 1 222232232 245777998655321 11 579999998 66432 2223333322 222234567778888
Q ss_pred CCeEEEEEeec
Q 003800 245 RSILVTVSFKN 255 (794)
Q Consensus 245 ~g~L~v~~l~s 255 (794)
++.++..++..
T Consensus 155 ~~~i~~~~~~~ 165 (246)
T PF08450_consen 155 NGRIWRFDLDA 165 (246)
T ss_dssp TTEEEEEEEET
T ss_pred cceeEEEeccc
Confidence 88899999874
No 152
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=63.25 E-value=3.5e+02 Score=33.02 Aligned_cols=101 Identities=16% Similarity=0.175 Sum_probs=61.8
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcc-------cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGIN-------DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-------~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
|+.+.....+++|+. +|+..=+-..... ..+.++. ...+..++.|+.|+.+.-||..+++-+-...-. ..
T Consensus 339 v~l~nNtv~~ysl~~-s~~~~p~~~~~~~i~~~GHR~dVRsl~-vS~d~~~~~Sga~~SikiWn~~t~kciRTi~~~-y~ 415 (888)
T KOG0306|consen 339 VLLANNTVEWYSLEN-SGKTSPEADRTSNIEIGGHRSDVRSLC-VSSDSILLASGAGESIKIWNRDTLKCIRTITCG-YI 415 (888)
T ss_pred EEeecCceEEEEecc-CCCCCccccccceeeeccchhheeEEE-eecCceeeeecCCCcEEEEEccCcceeEEeccc-cE
Confidence 556666778999998 6665411110000 0122332 234455666777789999999999988776643 22
Q ss_pred ccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEE
Q 003800 129 HSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEIL 167 (794)
Q Consensus 129 ~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~ 167 (794)
+ ...++| ++..|++. .+|+|..+|..++..+
T Consensus 416 l--~~~Fvp------gd~~Iv~G~k~Gel~vfdlaS~~l~ 447 (888)
T KOG0306|consen 416 L--ASKFVP------GDRYIVLGTKNGELQVFDLASASLV 447 (888)
T ss_pred E--EEEecC------CCceEEEeccCCceEEEEeehhhhh
Confidence 2 223444 24555555 4999999999887644
No 153
>PRK03629 tolB translocation protein TolB; Provisional
Probab=63.02 E-value=2.8e+02 Score=31.74 Aligned_cols=149 Identities=13% Similarity=0.139 Sum_probs=72.0
Q ss_pred eCCEEEEEEc--cCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E-CC--EEEEEECCCCcEE
Q 003800 94 LGKYVITLSS--DGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KG--CLHAVSSIDGEIL 167 (794)
Q Consensus 94 ~g~~~V~Vs~--~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g--~l~ald~~tG~~~ 167 (794)
.|+.+++++. .+..++.||..+|+..--....+.. ..+.+. ++ +..+++. . +| .|+.+|.++|+..
T Consensus 209 DG~~la~~s~~~g~~~i~i~dl~~G~~~~l~~~~~~~--~~~~~S-----PD-G~~La~~~~~~g~~~I~~~d~~tg~~~ 280 (429)
T PRK03629 209 DGSKLAYVTFESGRSALVIQTLANGAVRQVASFPRHN--GAPAFS-----PD-GSKLAFALSKTGSLNLYVMDLASGQIR 280 (429)
T ss_pred CCCEEEEEEecCCCcEEEEEECCCCCeEEccCCCCCc--CCeEEC-----CC-CCEEEEEEcCCCCcEEEEEECCCCCEE
Confidence 4666777653 2357999999999754333222211 112222 23 3344443 2 33 6888998888653
Q ss_pred EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCC-
Q 003800 168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTR- 245 (794)
Q Consensus 168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~- 245 (794)
=-...... ...+..+.++..+++.+..++ ...++.+|+.+|+... +.........+.+ ..+..++......
T Consensus 281 ~lt~~~~~---~~~~~wSPDG~~I~f~s~~~g-~~~Iy~~d~~~g~~~~---lt~~~~~~~~~~~SpDG~~Ia~~~~~~g 353 (429)
T PRK03629 281 QVTDGRSN---NTEPTWFPDSQNLAYTSDQAG-RPQVYKVNINGGAPQR---ITWEGSQNQDADVSSDGKFMVMVSSNGG 353 (429)
T ss_pred EccCCCCC---cCceEECCCCCEEEEEeCCCC-CceEEEEECCCCCeEE---eecCCCCccCEEECCCCCEEEEEEccCC
Confidence 21111111 111221334555554443332 2478888998886531 1111111111222 2334444443322
Q ss_pred -CeEEEEEeecce
Q 003800 246 -SILVTVSFKNRK 257 (794)
Q Consensus 246 -g~L~v~~l~sg~ 257 (794)
..+++.|+.+|.
T Consensus 354 ~~~I~~~dl~~g~ 366 (429)
T PRK03629 354 QQHIAKQDLATGG 366 (429)
T ss_pred CceEEEEECCCCC
Confidence 357778887776
No 154
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=62.93 E-value=1e+02 Score=34.64 Aligned_cols=92 Identities=13% Similarity=0.204 Sum_probs=55.8
Q ss_pred CEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEecc
Q 003800 96 KYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFA 173 (794)
Q Consensus 96 ~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~ 173 (794)
.+++.-+|.++.|..||..||+-+-+........ .+ ....++..++. .|..++.+|+.+|+++|+-..-
T Consensus 144 ~NVLlsag~Dn~v~iWnv~tgeali~l~hpd~i~-----S~----sfn~dGs~l~TtckDKkvRv~dpr~~~~v~e~~~h 214 (472)
T KOG0303|consen 144 PNVLLSAGSDNTVSIWNVGTGEALITLDHPDMVY-----SM----SFNRDGSLLCTTCKDKKVRVIDPRRGTVVSEGVAH 214 (472)
T ss_pred hhhHhhccCCceEEEEeccCCceeeecCCCCeEE-----EE----EeccCCceeeeecccceeEEEcCCCCcEeeecccc
Confidence 3444435557899999999999887766333221 11 22235666665 3899999999999999997333
Q ss_pred CcceeeeeEEEEecCCEEEEEEecC
Q 003800 174 AESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 174 ~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
.+.. +.+.+. ..++.++..|+..
T Consensus 215 eG~k-~~Raif-l~~g~i~tTGfsr 237 (472)
T KOG0303|consen 215 EGAK-PARAIF-LASGKIFTTGFSR 237 (472)
T ss_pred cCCC-cceeEE-eccCceeeecccc
Confidence 3222 333332 2344455544443
No 155
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=62.66 E-value=63 Score=29.52 Aligned_cols=81 Identities=26% Similarity=0.435 Sum_probs=45.0
Q ss_pred CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcccccccccc
Q 003800 63 NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKV 142 (794)
Q Consensus 63 g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~ 142 (794)
+.+.-.+..++.++|......+ ......+.+..+ |.+...|. +|..+|+...... .
T Consensus 30 gnlV~~~~~~~~~vW~snt~~~--------~~~~~~l~l~~d-GnLvl~~~-~g~~vW~S~t~~~-~------------- 85 (114)
T smart00108 30 YNLILYKSSSRTVVWVANRDNP--------VSDSCTLTLQSD-GNLVLYDG-DGRVVWSSNTTGA-N------------- 85 (114)
T ss_pred EEEEEEECCCCcEEEECCCCCC--------CCCCEEEEEeCC-CCEEEEeC-CCCEEEEecccCC-C-------------
Confidence 3333334333678898544322 111134444554 46777775 5899999754311 0
Q ss_pred ccCCeEEEE-ECCEEEEEECCCCcEEEEE
Q 003800 143 DKDSLILVS-SKGCLHAVSSIDGEILWTR 170 (794)
Q Consensus 143 ~~~~~V~V~-~~g~l~ald~~tG~~~W~~ 170 (794)
....+++ .+|.|.-++. .|+++|+-
T Consensus 86 --~~~~~~L~ddGnlvl~~~-~~~~~W~S 111 (114)
T smart00108 86 --GNYVLVLLDDGNLVIYDS-DGNFLWQS 111 (114)
T ss_pred --CceEEEEeCCCCEEEECC-CCCEEeCC
Confidence 1223333 5788877774 67899973
No 156
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=62.33 E-value=2.3e+02 Score=32.08 Aligned_cols=102 Identities=15% Similarity=0.024 Sum_probs=46.6
Q ss_pred EEECcCCccceEEEcCcccc------eeeeeeeeCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCcccccc
Q 003800 67 SLDLRHGEIFWRHVLGINDV------VDGIDIALGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPT 138 (794)
Q Consensus 67 ALn~~tG~ivWR~~l~~~~~------i~~l~~~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~ 138 (794)
-.|+.||..+=|-.-..... -.+. -..|..++|.|.. ..+++.+|.++|+..==....+... .+..+.
T Consensus 14 ~~D~~TG~~VtrLT~~~~~~h~~YF~~~~f-t~dG~kllF~s~~dg~~nly~lDL~t~~i~QLTdg~g~~~-~g~~~s-- 89 (386)
T PF14583_consen 14 WIDPDTGHRVTRLTPPDGHSHRLYFYQNCF-TDDGRKLLFASDFDGNRNLYLLDLATGEITQLTDGPGDNT-FGGFLS-- 89 (386)
T ss_dssp EE-TTT--EEEE-S-TTS-EE---TTS--B--TTS-EEEEEE-TTSS-EEEEEETTT-EEEE---SS-B-T-TT-EE---
T ss_pred EeCCCCCceEEEecCCCCcccceeecCCCc-CCCCCEEEEEeccCCCcceEEEEcccCEEEECccCCCCCc-cceEEe--
Confidence 35778887766532221100 1122 1346677886642 4789999999999873222222111 111111
Q ss_pred ccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcc
Q 003800 139 NLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAES 176 (794)
Q Consensus 139 ~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~ 176 (794)
.. ++.++.. .+..|.++|..|++..=-+..|...
T Consensus 90 ---~~-~~~~~Yv~~~~~l~~vdL~T~e~~~vy~~p~~~ 124 (386)
T PF14583_consen 90 ---PD-DRALYYVKNGRSLRRVDLDTLEERVVYEVPDDW 124 (386)
T ss_dssp ---TT-SSEEEEEETTTEEEEEETTT--EEEEEE--TTE
T ss_pred ---cC-CCeEEEEECCCeEEEEECCcCcEEEEEECCccc
Confidence 22 4454444 5679999999999877666666544
No 157
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=61.78 E-value=73 Score=35.83 Aligned_cols=108 Identities=19% Similarity=0.205 Sum_probs=66.4
Q ss_pred EEEcc-CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcce
Q 003800 100 TLSSD-GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESV 177 (794)
Q Consensus 100 ~Vs~~-g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~ 177 (794)
++||. +.+||.||..++...-+.++.+-.. ++.+.. + +..+...+ +..+-.+|..+-+++=.+..+.-..
T Consensus 315 ~~SgH~DkkvRfwD~Rs~~~~~sv~~gg~vt--Sl~ls~-----~-g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~ 386 (459)
T KOG0288|consen 315 VISGHFDKKVRFWDIRSADKTRSVPLGGRVT--SLDLSM-----D-GLELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKC 386 (459)
T ss_pred eeecccccceEEEeccCCceeeEeecCccee--eEeecc-----C-CeEEeeecCCCceeeeecccccEEEEeecccccc
Confidence 44663 6789999999999999999887432 221111 1 33444443 8888888888877776665543211
Q ss_pred --eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800 178 --EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET 219 (794)
Q Consensus 178 --~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~ 219 (794)
....++.+.++..|-+ |+.+..|+..+..+|+......
T Consensus 387 asDwtrvvfSpd~~YvaA----GS~dgsv~iW~v~tgKlE~~l~ 426 (459)
T KOG0288|consen 387 ASDWTRVVFSPDGSYVAA----GSADGSVYIWSVFTGKLEKVLS 426 (459)
T ss_pred ccccceeEECCCCceeee----ccCCCcEEEEEccCceEEEEec
Confidence 1223332333333333 3333479999999998876654
No 158
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=61.09 E-value=2.5e+02 Score=30.60 Aligned_cols=202 Identities=13% Similarity=0.110 Sum_probs=104.8
Q ss_pred ccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc--CCeEEEEeCC
Q 003800 37 YIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD--GSTLRAWNLP 114 (794)
Q Consensus 37 ~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~--g~~v~A~d~~ 114 (794)
+-|++..-.|+. ++..+++.+++..|.-.|..+|+.+=...-...+ +...........+.-|+. +..+|-++..
T Consensus 13 ~~~~i~sl~fs~---~G~~litss~dDsl~LYd~~~g~~~~ti~skkyG-~~~~~Fth~~~~~i~sStk~d~tIryLsl~ 88 (311)
T KOG1446|consen 13 TNGKINSLDFSD---DGLLLITSSEDDSLRLYDSLSGKQVKTINSKKYG-VDLACFTHHSNTVIHSSTKEDDTIRYLSLH 88 (311)
T ss_pred CCCceeEEEecC---CCCEEEEecCCCeEEEEEcCCCceeeEeeccccc-ccEEEEecCCceEEEccCCCCCceEEEEee
Confidence 445554445652 3566888889999999999999877655444332 222222333333333432 5789999999
Q ss_pred CCcEeEEEeccCccccCCccccccccccccCCeEEEEE--CCEEEEEECCCCcEEEEEeccCcce----eeeeEEEEec-
Q 003800 115 DGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWTRDFAAESV----EVQQVIQLDE- 187 (794)
Q Consensus 115 tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~~----~~~~~v~s~~- 187 (794)
|-+-+--+......+ .++.+.| .++.|+.+ |.+++ +|-.+.+...- ....+. +-+
T Consensus 89 dNkylRYF~GH~~~V-~sL~~sP-------~~d~FlS~S~D~tvr---------LWDlR~~~cqg~l~~~~~pi~-AfDp 150 (311)
T KOG1446|consen 89 DNKYLRYFPGHKKRV-NSLSVSP-------KDDTFLSSSLDKTVR---------LWDLRVKKCQGLLNLSGRPIA-AFDP 150 (311)
T ss_pred cCceEEEcCCCCceE-EEEEecC-------CCCeEEecccCCeEE---------eeEecCCCCceEEecCCCcce-eECC
Confidence 998887777665443 3333333 34566642 55543 46555332210 011111 223
Q ss_pred CCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccC--ceEEEcCc--EEEEEECCCCeEEEEEeecceeeeEEE
Q 003800 188 SDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVG--DVALVSSD--TLVTLDTTRSILVTVSFKNRKIAFQET 263 (794)
Q Consensus 188 ~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~--~~~~vg~~--~lv~~d~~~g~L~v~~l~sg~~~~~~~ 263 (794)
.|.+|+++..+ ..++++-+-.-.+.+--...+..+ ...+ ..-+-.++ ++++. ..+..+++|--+|.+ .+.+
T Consensus 151 ~GLifA~~~~~-~~IkLyD~Rs~dkgPF~tf~i~~~-~~~ew~~l~FS~dGK~iLlsT--~~s~~~~lDAf~G~~-~~tf 225 (311)
T KOG1446|consen 151 EGLIFALANGS-ELIKLYDLRSFDKGPFTTFSITDN-DEAEWTDLEFSPDGKSILLST--NASFIYLLDAFDGTV-KSTF 225 (311)
T ss_pred CCcEEEEecCC-CeEEEEEecccCCCCceeEccCCC-CccceeeeEEcCCCCEEEEEe--CCCcEEEEEccCCcE-eeeE
Confidence 45566655443 356666555444555444433321 1111 11122122 33333 356777788777773 3434
Q ss_pred ee
Q 003800 264 HL 265 (794)
Q Consensus 264 ~l 265 (794)
..
T Consensus 226 s~ 227 (311)
T KOG1446|consen 226 SG 227 (311)
T ss_pred ee
Confidence 43
No 159
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=61.06 E-value=40 Score=37.47 Aligned_cols=109 Identities=17% Similarity=0.246 Sum_probs=66.2
Q ss_pred ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeee-----eeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800 50 KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGI-----DIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 50 ~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l-----~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l 124 (794)
+++++.|..++-+|.|-.-||++|+..=|.--....-|.++ +.......+.-++.++.+|-||..-|+.+-....
T Consensus 166 sPDgk~iASG~~dg~I~lwdpktg~~~g~~l~gH~K~It~Lawep~hl~p~~r~las~skDg~vrIWd~~~~~~~~~lsg 245 (480)
T KOG0271|consen 166 SPDGKKIASGSKDGSIRLWDPKTGQQIGRALRGHKKWITALAWEPLHLVPPCRRLASSSKDGSVRIWDTKLGTCVRTLSG 245 (480)
T ss_pred CCCcchhhccccCCeEEEecCCCCCcccccccCcccceeEEeecccccCCCccceecccCCCCEEEEEccCceEEEEecc
Confidence 34566677788899999999999998766544433223322 1122333344345567999999999998877665
Q ss_pred cCccccCCccccccccccccCCeEEEEE-CCEEEEEECCCCcE
Q 003800 125 RGSKHSKPLLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEI 166 (794)
Q Consensus 125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~ 166 (794)
..... ..+ ...+++.+|-.+ |+++...++.+|..
T Consensus 246 HT~~V----TCv----rwGG~gliySgS~DrtIkvw~a~dG~~ 280 (480)
T KOG0271|consen 246 HTASV----TCV----RWGGEGLIYSGSQDRTIKVWRALDGKL 280 (480)
T ss_pred Cccce----EEE----EEcCCceEEecCCCceEEEEEccchhH
Confidence 54332 111 223234444443 77777666666543
No 160
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=60.42 E-value=3.3e+02 Score=31.74 Aligned_cols=220 Identities=13% Similarity=0.118 Sum_probs=111.4
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP 132 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~ 132 (794)
++.|+++...|.+.--++.+- ..=|+....++.+-++ -...++.+.-++.++++.+|| .+=+.+-+.+++.+.- +
T Consensus 257 ngdviTgDS~G~i~Iw~~~~~-~~~k~~~aH~ggv~~L-~~lr~GtllSGgKDRki~~Wd-~~y~k~r~~elPe~~G--~ 331 (626)
T KOG2106|consen 257 NGDVITGDSGGNILIWSKGTN-RISKQVHAHDGGVFSL-CMLRDGTLLSGGKDRKIILWD-DNYRKLRETELPEQFG--P 331 (626)
T ss_pred CCCEEeecCCceEEEEeCCCc-eEEeEeeecCCceEEE-EEecCccEeecCccceEEecc-ccccccccccCchhcC--C
Confidence 556888888898888887544 4445555444455555 234555554366789999999 4555555666654321 1
Q ss_pred ccccccccccccCCeEEEEE-CCEEE---------EEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCcee
Q 003800 133 LLLVPTNLKVDKDSLILVSS-KGCLH---------AVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQF 202 (794)
Q Consensus 133 ~~~~~~~~~~~~~~~V~V~~-~g~l~---------ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~ 202 (794)
+..+ . .+.++++|.. .+.+. -.-..-|+.+|.....- ....|+.+.+..
T Consensus 332 iRtv----~-e~~~di~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~hp-------------s~~q~~T~gqdk--- 390 (626)
T KOG2106|consen 332 IRTV----A-EGKGDILVGTTRNFILQGTLENGFTLTVQGHGDELWGLATHP-------------SKNQLLTCGQDK--- 390 (626)
T ss_pred eeEE----e-cCCCcEEEeeccceEEEeeecCCceEEEEecccceeeEEcCC-------------ChhheeeccCcc---
Confidence 1111 1 2234566652 22222 22223345667654321 122233333321
Q ss_pred EEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecceeeeEEEeecccCCCCCCceEEeecC
Q 003800 203 HAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKIAFQETHLSNLGEDSSGMVEILPSS 282 (794)
Q Consensus 203 ~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l~~l~~~~~~~~~~~~~~ 282 (794)
.+.-.+ .-++.|...+.-|..-.+ +-+.+ .++... ..|...++|.++.. +-++.-+ ..........
T Consensus 391 ~v~lW~--~~k~~wt~~~~d~~~~~~--fhpsg-~va~Gt-~~G~w~V~d~e~~~--lv~~~~d------~~~ls~v~ys 456 (626)
T KOG2106|consen 391 HVRLWN--DHKLEWTKIIEDPAECAD--FHPSG-VVAVGT-ATGRWFVLDTETQD--LVTIHTD------NEQLSVVRYS 456 (626)
T ss_pred eEEEcc--CCceeEEEEecCceeEee--ccCcc-eEEEee-ccceEEEEecccce--eEEEEec------CCceEEEEEc
Confidence 344445 667889997765532211 11122 344333 46888899888855 2222221 1222233333
Q ss_pred Ccc-eeEEEecC-cEEEEEEecCC-cEEEEEee
Q 003800 283 LTG-MFTVKINN-YKLFIRLTSED-KLEVVHKV 312 (794)
Q Consensus 283 ~~~-~~~~~~~~-~~~l~~~~~~~-~~~v~~~~ 312 (794)
+.| .+.+.+.+ +..+++++.+| +...+..-
T Consensus 457 p~G~~lAvgs~d~~iyiy~Vs~~g~~y~r~~k~ 489 (626)
T KOG2106|consen 457 PDGAFLAVGSHDNHIYIYRVSANGRKYSRVGKC 489 (626)
T ss_pred CCCCEEEEecCCCeEEEEEECCCCcEEEEeeee
Confidence 344 33344444 55677777555 44444433
No 161
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=60.39 E-value=1e+02 Score=35.28 Aligned_cols=77 Identities=13% Similarity=0.146 Sum_probs=53.3
Q ss_pred ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 50 KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 50 ~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
+++++.|.++...|.|.-|.++||+.+=...++.. +..+.....+..+++++..|.|+-||...-..+-++.-.+..
T Consensus 312 Shd~~fia~~G~~G~I~lLhakT~eli~s~KieG~--v~~~~fsSdsk~l~~~~~~GeV~v~nl~~~~~~~rf~D~G~v 388 (514)
T KOG2055|consen 312 SHDSNFIAIAGNNGHIHLLHAKTKELITSFKIEGV--VSDFTFSSDSKELLASGGTGEVYVWNLRQNSCLHRFVDDGSV 388 (514)
T ss_pred cCCCCeEEEcccCceEEeehhhhhhhhheeeeccE--EeeEEEecCCcEEEEEcCCceEEEEecCCcceEEEEeecCcc
Confidence 45577788888899999999999998877777655 433322333345554544459999999887777666655543
No 162
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=59.52 E-value=1.3e+02 Score=34.75 Aligned_cols=112 Identities=18% Similarity=0.234 Sum_probs=68.5
Q ss_pred cCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcccc
Q 003800 51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS 130 (794)
Q Consensus 51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s 130 (794)
.+.+..|..-.+|.|+.-|..+-. +=|+--+..+...++-+..++.-+--||-++.||.||...|+.+=+..+.+...
T Consensus 519 pDakvcFsccsdGnI~vwDLhnq~-~VrqfqGhtDGascIdis~dGtklWTGGlDntvRcWDlregrqlqqhdF~SQIf- 596 (705)
T KOG0639|consen 519 PDAKVCFSCCSDGNIAVWDLHNQT-LVRQFQGHTDGASCIDISKDGTKLWTGGLDNTVRCWDLREGRQLQQHDFSSQIF- 596 (705)
T ss_pred CccceeeeeccCCcEEEEEcccce-eeecccCCCCCceeEEecCCCceeecCCCccceeehhhhhhhhhhhhhhhhhhe-
Confidence 345556666678999999987643 344433333333444333344455546767899999999999999999887665
Q ss_pred CCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEec
Q 003800 131 KPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDF 172 (794)
Q Consensus 131 ~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~ 172 (794)
++...+. ++-+.|. .++.+-.+. .+|..+.....
T Consensus 597 -SLg~cP~------~dWlavGMens~vevlh-~skp~kyqlhl 631 (705)
T KOG0639|consen 597 -SLGYCPT------GDWLAVGMENSNVEVLH-TSKPEKYQLHL 631 (705)
T ss_pred -ecccCCC------ccceeeecccCcEEEEe-cCCccceeecc
Confidence 3334441 3444444 466666665 45555555433
No 163
>PF05262 Borrelia_P83: Borrelia P83/100 protein; InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=59.09 E-value=59 Score=37.88 Aligned_cols=98 Identities=13% Similarity=0.101 Sum_probs=59.6
Q ss_pred CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE
Q 003800 153 KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL 232 (794)
Q Consensus 153 ~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~ 232 (794)
-+.|..||+.+|.++-+-....-.. +-++...++.|-+.+..|...++++-||+.|=++..+......+ .++++
T Consensus 374 ls~LvllD~~tg~~l~~S~~~~Ir~---r~~~~~~~~~vaI~g~~G~~~ikLvlid~~tLev~kes~~~i~~---~S~l~ 447 (489)
T PF05262_consen 374 LSELVLLDSDTGDTLKRSPVNGIRG---RTFYEREDDLVAIAGCSGNAAIKLVLIDPETLEVKKESEDEISW---QSSLI 447 (489)
T ss_pred ceeEEEEeCCCCceecccccceecc---ceeEEcCCCEEEEeccCCchheEEEecCcccceeeeeccccccc---cCceE
Confidence 4789999999999887654432221 11222334444433344556789999999998888777432221 24555
Q ss_pred E-cCcEEEEEECCCCeEEEEEeecc
Q 003800 233 V-SSDTLVTLDTTRSILVTVSFKNR 256 (794)
Q Consensus 233 v-g~~~lv~~d~~~g~L~v~~l~sg 256 (794)
+ |+.+|+++...+|..+..-..++
T Consensus 448 ~~~~~iyaVv~~~~g~~~L~rF~~~ 472 (489)
T PF05262_consen 448 VDGQMIYAVVKKDNGKWYLGRFDSN 472 (489)
T ss_pred EcCCeEEEEEEcCCCeEEEeecCcc
Confidence 5 55566666345677666655543
No 164
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=58.93 E-value=2.8e+02 Score=30.40 Aligned_cols=180 Identities=16% Similarity=0.296 Sum_probs=83.4
Q ss_pred eeecccccEeeEEeccCceee-eeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEE
Q 003800 23 LYEDQVGLMDWHQQYIGKVKH-AVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVIT 100 (794)
Q Consensus 23 l~edqvG~~dW~~~~vG~~~~-~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~ 100 (794)
++....|-..|..-.+..+.. ..+.-....++.+++++..|.|+.= .||-.-|+...... +.+.......++..|.
T Consensus 83 ll~T~DgG~tW~~v~l~~~lpgs~~~i~~l~~~~~~l~~~~G~iy~T--~DgG~tW~~~~~~~~gs~~~~~r~~dG~~va 160 (302)
T PF14870_consen 83 LLHTTDGGKTWERVPLSSKLPGSPFGITALGDGSAELAGDRGAIYRT--TDGGKTWQAVVSETSGSINDITRSSDGRYVA 160 (302)
T ss_dssp EEEESSTTSS-EE----TT-SS-EEEEEEEETTEEEEEETT--EEEE--SSTTSSEEEEE-S----EEEEEE-TTS-EEE
T ss_pred EEEecCCCCCcEEeecCCCCCCCeeEEEEcCCCcEEEEcCCCcEEEe--CCCCCCeeEcccCCcceeEeEEECCCCcEEE
Confidence 455555666687643221111 1111112235567777877766544 57888999877654 2333332234556777
Q ss_pred EEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCccee-
Q 003800 101 LSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVE- 178 (794)
Q Consensus 101 Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~- 178 (794)
|+..|.-+..||. |+--|+..-.... ..+..++ ...++.+.+. .+|.++.-+..+.-..|..........
T Consensus 161 vs~~G~~~~s~~~--G~~~w~~~~r~~~--~riq~~g----f~~~~~lw~~~~Gg~~~~s~~~~~~~~w~~~~~~~~~~~ 232 (302)
T PF14870_consen 161 VSSRGNFYSSWDP--GQTTWQPHNRNSS--RRIQSMG----FSPDGNLWMLARGGQIQFSDDPDDGETWSEPIIPIKTNG 232 (302)
T ss_dssp EETTSSEEEEE-T--T-SS-EEEE--SS--S-EEEEE----E-TTS-EEEEETTTEEEEEE-TTEEEEE---B-TTSS--
T ss_pred EECcccEEEEecC--CCccceEEccCcc--ceehhce----ecCCCCEEEEeCCcEEEEccCCCCccccccccCCcccCc
Confidence 7888877889975 9999986644221 1222222 1224556555 478888877555567788744322111
Q ss_pred --eeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800 179 --VQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET 219 (794)
Q Consensus 179 --~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~ 219 (794)
...+.. ..++.+|+++..|. ++ .. .+|=.-|+..
T Consensus 233 ~~~ld~a~-~~~~~~wa~gg~G~----l~-~S-~DgGktW~~~ 268 (302)
T PF14870_consen 233 YGILDLAY-RPPNEIWAVGGSGT----LL-VS-TDGGKTWQKD 268 (302)
T ss_dssp S-EEEEEE-SSSS-EEEEESTT-----EE-EE-SSTTSS-EE-
T ss_pred eeeEEEEe-cCCCCEEEEeCCcc----EE-Ee-CCCCccceEC
Confidence 122221 35788998876662 22 12 3555668874
No 165
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=58.70 E-value=3e+02 Score=30.72 Aligned_cols=146 Identities=13% Similarity=0.126 Sum_probs=79.0
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL 134 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~ 134 (794)
...++.++.+.-.|..||++.=. +..- ..+.++.+..-.--+|-.+.+++|-.||+++-+.+-++...-..+ ..+.
T Consensus 166 f~tgs~DrtikIwDlatg~Lklt--ltGhi~~vr~vavS~rHpYlFs~gedk~VKCwDLe~nkvIR~YhGHlS~V-~~L~ 242 (460)
T KOG0285|consen 166 FATGSADRTIKIWDLATGQLKLT--LTGHIETVRGVAVSKRHPYLFSAGEDKQVKCWDLEYNKVIRHYHGHLSGV-YCLD 242 (460)
T ss_pred EEecCCCceeEEEEcccCeEEEe--ecchhheeeeeeecccCceEEEecCCCeeEEEechhhhhHHHhcccccee-EEEe
Confidence 55667789999999999976433 3221 123344222222234435667899999999999988877653322 1122
Q ss_pred ccccccccccCCeEEEE-E-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800 135 LVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNG 212 (794)
Q Consensus 135 ~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG 212 (794)
+.| .-++++. + |.....-|..+-..+-...-- .....+++....+..||-.+.++ .+--.|...|
T Consensus 243 lhP-------Tldvl~t~grDst~RvWDiRtr~~V~~l~GH--~~~V~~V~~~~~dpqvit~S~D~----tvrlWDl~ag 309 (460)
T KOG0285|consen 243 LHP-------TLDVLVTGGRDSTIRVWDIRTRASVHVLSGH--TNPVASVMCQPTDPQVITGSHDS----TVRLWDLRAG 309 (460)
T ss_pred ccc-------cceeEEecCCcceEEEeeecccceEEEecCC--CCcceeEEeecCCCceEEecCCc----eEEEeeeccC
Confidence 222 2344443 2 444444444444433333211 11123333223466777655555 5666687777
Q ss_pred ceeee
Q 003800 213 ELLNH 217 (794)
Q Consensus 213 ~~~w~ 217 (794)
+.+-.
T Consensus 310 kt~~t 314 (460)
T KOG0285|consen 310 KTMIT 314 (460)
T ss_pred ceeEe
Confidence 76543
No 166
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=58.45 E-value=3.4e+02 Score=31.22 Aligned_cols=132 Identities=12% Similarity=0.115 Sum_probs=68.0
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-EccCCeEEEEeCCC-------CcEeEEEeccCc
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SSDGSTLRAWNLPD-------GQMVWESFLRGS 127 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~t-------G~llWe~~l~~~ 127 (794)
|+.+|.+|.||.--..||+.+=-..- ..-++..+. ..+++.+++ ++.++.|++|...+ +...=...+.+-
T Consensus 96 l~ag~i~g~lYlWelssG~LL~v~~a-HYQ~ITcL~-fs~dgs~iiTgskDg~V~vW~l~~lv~a~~~~~~~p~~~f~~H 173 (476)
T KOG0646|consen 96 LLAGTISGNLYLWELSSGILLNVLSA-HYQSITCLK-FSDDGSHIITGSKDGAVLVWLLTDLVSADNDHSVKPLHIFSDH 173 (476)
T ss_pred EEeecccCcEEEEEeccccHHHHHHh-hccceeEEE-EeCCCcEEEecCCCccEEEEEEEeecccccCCCccceeeeccC
Confidence 55666899999999999987643311 112355553 345555555 45578899997632 111111111110
Q ss_pred cccCCcccccccccccc---CCeEEEEE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800 128 KHSKPLLLVPTNLKVDK---DSLILVSS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 128 ~~s~~~~~~~~~~~~~~---~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
.+++... .... +..++-.+ |.++...|...|..+=+...|.+-. .+....++..+|+.+-.|
T Consensus 174 ----tlsITDl--~ig~Gg~~~rl~TaS~D~t~k~wdlS~g~LLlti~fp~si~---av~lDpae~~~yiGt~~G 239 (476)
T KOG0646|consen 174 ----TLSITDL--QIGSGGTNARLYTASEDRTIKLWDLSLGVLLLTITFPSSIK---AVALDPAERVVYIGTEEG 239 (476)
T ss_pred ----cceeEEE--EecCCCccceEEEecCCceEEEEEeccceeeEEEecCCcce---eEEEcccccEEEecCCcc
Confidence 1111110 1111 12333333 7777777888888777776665321 222123455666654444
No 167
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=58.17 E-value=28 Score=39.60 Aligned_cols=179 Identities=11% Similarity=0.101 Sum_probs=102.7
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCccc-ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGIND-VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~-~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
+..+..+..+|.|+|||-.|+++.-...+.+.. .+.-+ ..+..+.|.. ...++-+| ..|.++=-..-..++.
T Consensus 141 GrhlllgGrKGHlAa~Dw~t~~L~~Ei~v~Etv~Dv~~L---Hneq~~AVAQ-K~y~yvYD-~~GtElHClk~~~~v~-- 213 (545)
T KOG1272|consen 141 GRHLLLGGRKGHLAAFDWVTKKLHFEINVMETVRDVTFL---HNEQFFAVAQ-KKYVYVYD-NNGTELHCLKRHIRVA-- 213 (545)
T ss_pred ccEEEecCCccceeeeecccceeeeeeehhhhhhhhhhh---cchHHHHhhh-hceEEEec-CCCcEEeehhhcCchh--
Confidence 334888889999999999999998888776551 11112 2233333333 45777777 4687776666555442
Q ss_pred CccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEec-CCEEEEEEecCCceeEEEEEE
Q 003800 132 PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDE-SDQIYVVGYAGSSQFHAYQIN 208 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~-~~~vyv~~~~g~~~~~v~ald 208 (794)
-+.++| -..+++. ..|-|.-.|..+|+.+=+.....+.+ .++ ... -+.|.-+|.. ++.|.-..
T Consensus 214 rLeFLP-------yHfLL~~~~~~G~L~Y~DVS~GklVa~~~t~~G~~---~vm-~qNP~NaVih~Ghs---nGtVSlWS 279 (545)
T KOG1272|consen 214 RLEFLP-------YHFLLVAASEAGFLKYQDVSTGKLVASIRTGAGRT---DVM-KQNPYNAVIHLGHS---NGTVSLWS 279 (545)
T ss_pred hhcccc-------hhheeeecccCCceEEEeechhhhhHHHHccCCcc---chh-hcCCccceEEEcCC---CceEEecC
Confidence 345555 3556664 37899999999999887776655443 111 011 1222223322 23676667
Q ss_pred cCCCceeeeeeeecc-cCccCceEEEcCcEEEEEECCCCeEEEEEeec
Q 003800 209 AMNGELLNHETAAFS-GGFVGDVALVSSDTLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 209 ~~tG~~~w~~~v~~~-~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~s 255 (794)
+.+-+++-+. -+. +.++ ++.+-.++.|.++......+.+-||..
T Consensus 280 P~skePLvKi--LcH~g~V~-siAv~~~G~YMaTtG~Dr~~kIWDlR~ 324 (545)
T KOG1272|consen 280 PNSKEPLVKI--LCHRGPVS-SIAVDRGGRYMATTGLDRKVKIWDLRN 324 (545)
T ss_pred CCCcchHHHH--HhcCCCcc-eEEECCCCcEEeecccccceeEeeecc
Confidence 7666655333 111 2222 233323334444444456788888876
No 168
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=58.10 E-value=2.6e+02 Score=32.44 Aligned_cols=139 Identities=10% Similarity=0.132 Sum_probs=73.7
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEE-EEEccCCeEEEEeCCCCcEeEEEecc-CccccCCc
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVI-TLSSDGSTLRAWNLPDGQMVWESFLR-GSKHSKPL 133 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V-~Vs~~g~~v~A~d~~tG~llWe~~l~-~~~~s~~~ 133 (794)
|-.++..|.|.-.+.+||..-=.+..+.+..+..++....+..+ ...+++|.|..||...-.+...+.-. .... .++
T Consensus 136 iAsvs~gGdiiih~~~t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~G~VtlwDv~g~sp~~~~~~~HsAP~-~gi 214 (673)
T KOG4378|consen 136 IASVSDGGDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQGMSPIFHASEAHSAPC-RGI 214 (673)
T ss_pred eEEeccCCcEEEEecccCccccceecCCCCeEEEeecccccceeeEeeccCCeEEEEeccCCCcccchhhhccCCc-Ccc
Confidence 44455678888888888866544444433333344333333333 33456689999998655555443321 2122 444
Q ss_pred cccccccccccCCeEEEE--ECCEEEEEECCCCcEE--EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEc
Q 003800 134 LLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEIL--WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINA 209 (794)
Q Consensus 134 ~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~--W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~ 209 (794)
.+.+. ...++|. .|.+++-+|..+-+.. -.|+.| ...+.-...|.+.++| .++++++++|.
T Consensus 215 cfsps------ne~l~vsVG~Dkki~~yD~~s~~s~~~l~y~~P------lstvaf~~~G~~L~aG---~s~G~~i~YD~ 279 (673)
T KOG4378|consen 215 CFSPS------NEALLVSVGYDKKINIYDIRSQASTDRLTYSHP------LSTVAFSECGTYLCAG---NSKGELIAYDM 279 (673)
T ss_pred eecCC------ccceEEEecccceEEEeecccccccceeeecCC------cceeeecCCceEEEee---cCCceEEEEec
Confidence 55541 3445553 4899999986543221 122222 1222112344444443 33448999997
Q ss_pred C
Q 003800 210 M 210 (794)
Q Consensus 210 ~ 210 (794)
.
T Consensus 280 R 280 (673)
T KOG4378|consen 280 R 280 (673)
T ss_pred c
Confidence 5
No 169
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=57.26 E-value=2.7e+02 Score=30.58 Aligned_cols=105 Identities=13% Similarity=0.160 Sum_probs=52.8
Q ss_pred EEEEEccCCeEEEEeCCCC-cEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcE--EEEEeccC
Q 003800 98 VITLSSDGSTLRAWNLPDG-QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEI--LWTRDFAA 174 (794)
Q Consensus 98 ~V~Vs~~g~~v~A~d~~tG-~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~--~W~~~~~~ 174 (794)
++++--.+++++.||+.+| ...|...-.-... . ..+ .+..++.....++.++.++|.. .+....+.
T Consensus 39 L~w~DI~~~~i~r~~~~~g~~~~~~~p~~~~~~----~------~~d-~~g~Lv~~~~g~~~~~~~~~~~~t~~~~~~~~ 107 (307)
T COG3386 39 LLWVDILGGRIHRLDPETGKKRVFPSPGGFSSG----A------LID-AGGRLIACEHGVRLLDPDTGGKITLLAEPEDG 107 (307)
T ss_pred EEEEeCCCCeEEEecCCcCceEEEECCCCcccc----e------eec-CCCeEEEEccccEEEeccCCceeEEeccccCC
Confidence 4555445789999999988 6667655432111 1 223 3334444444456666565654 44333222
Q ss_pred ccee-eeeEEEEecCCEEEEEEec----C----CceeEEEEEEcCCCcee
Q 003800 175 ESVE-VQQVIQLDESDQIYVVGYA----G----SSQFHAYQINAMNGELL 215 (794)
Q Consensus 175 ~~~~-~~~~v~s~~~~~vyv~~~~----g----~~~~~v~ald~~tG~~~ 215 (794)
.... +--.+ ...++.+|+.... + .....|+-+|+. |...
T Consensus 108 ~~~~r~ND~~-v~pdG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~-g~~~ 155 (307)
T COG3386 108 LPLNRPNDGV-VDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPD-GGVV 155 (307)
T ss_pred CCcCCCCcee-EcCCCCEEEeCCCccccCccccCCcceEEEEcCC-CCEE
Confidence 1110 10111 2356777765444 1 112467888873 4433
No 170
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=56.98 E-value=3.1e+02 Score=30.42 Aligned_cols=192 Identities=13% Similarity=0.171 Sum_probs=92.8
Q ss_pred EEEEEeCC-----C-EEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEc-c--CC--eEEEEeCCCCcEeEEEe
Q 003800 55 RVVVSTEE-----N-VIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSS-D--GS--TLRAWNLPDGQMVWESF 123 (794)
Q Consensus 55 ~Vyv~t~~-----g-~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~-~--g~--~v~A~d~~tG~llWe~~ 123 (794)
.+|++|.. | .+.-||.++|++-=-+.....++..-+...-.+..+|+.. . .+ ..+.||..+|++---.+
T Consensus 4 ~~YiGtyT~~~s~gI~v~~ld~~~g~l~~~~~v~~~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~ 83 (346)
T COG2706 4 TVYIGTYTKRESQGIYVFNLDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNR 83 (346)
T ss_pred EEEEeeecccCCCceEEEEEeCcccccchhhhccccCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeec
Confidence 46777753 2 3666777777653333333333333332223444555421 2 23 35667777898876655
Q ss_pred ccCccccCCccccccccccccCC-eEEEE--ECCEEEEEECC-CCcEEEE---EeccCcceeeee------EEEEe-cCC
Q 003800 124 LRGSKHSKPLLLVPTNLKVDKDS-LILVS--SKGCLHAVSSI-DGEILWT---RDFAAESVEVQQ------VIQLD-ESD 189 (794)
Q Consensus 124 l~~~~~s~~~~~~~~~~~~~~~~-~V~V~--~~g~l~ald~~-tG~~~W~---~~~~~~~~~~~~------~v~s~-~~~ 189 (794)
...+. .++..+ .++.++ .|++. ..|.+..+-.. +|.+.=. .....+.--++| ..... .+.
T Consensus 84 ~~~~g--~~p~yv----svd~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~ 157 (346)
T COG2706 84 QTLPG--SPPCYV----SVDEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGR 157 (346)
T ss_pred cccCC--CCCeEE----EECCCCCEEEEEEccCceEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeCCCCC
Confidence 44322 122233 334344 56665 36777777664 4654321 111111000111 11112 233
Q ss_pred EEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE--EcCcEEEEEECCCCeEEEEEeec
Q 003800 190 QIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL--VSSDTLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 190 ~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~--vg~~~lv~~d~~~g~L~v~~l~s 255 (794)
.+++..+ |.+ +++.+++..|...-......+.+--...++ ..+.+.+|+..-++.+-+.....
T Consensus 158 ~l~v~DL-G~D--ri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~EL~stV~v~~y~~ 222 (346)
T COG2706 158 YLVVPDL-GTD--RIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNELNSTVDVLEYNP 222 (346)
T ss_pred EEEEeec-CCc--eEEEEEcccCccccccccccCCCCCcceEEEcCCCcEEEEEeccCCEEEEEEEcC
Confidence 4454333 333 566666668887754443333222112222 25567777766677777777766
No 171
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=55.16 E-value=3.1e+02 Score=29.86 Aligned_cols=33 Identities=18% Similarity=0.239 Sum_probs=24.8
Q ss_pred CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800 95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS 127 (794)
Q Consensus 95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~ 127 (794)
++..++-.+.+.+|++||+++|+..-+.+....
T Consensus 101 d~s~i~S~gtDk~v~~wD~~tG~~~rk~k~h~~ 133 (338)
T KOG0265|consen 101 DGSHILSCGTDKTVRGWDAETGKRIRKHKGHTS 133 (338)
T ss_pred CCCEEEEecCCceEEEEecccceeeehhccccc
Confidence 334444344567999999999999999888764
No 172
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=54.94 E-value=1.6e+02 Score=32.83 Aligned_cols=65 Identities=14% Similarity=0.218 Sum_probs=42.4
Q ss_pred EEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEE
Q 003800 97 YVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWT 169 (794)
Q Consensus 97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~ 169 (794)
...+.++.++.++.||..+|..+-+......-. .+..+-+ ++..++. .|+.|...|.++++..=.
T Consensus 305 ~~l~s~SrDktIk~wdv~tg~cL~tL~ghdnwV-r~~af~p-------~Gkyi~ScaDDktlrvwdl~~~~cmk~ 371 (406)
T KOG0295|consen 305 QVLGSGSRDKTIKIWDVSTGMCLFTLVGHDNWV-RGVAFSP-------GGKYILSCADDKTLRVWDLKNLQCMKT 371 (406)
T ss_pred cEEEeecccceEEEEeccCCeEEEEEeccccee-eeeEEcC-------CCeEEEEEecCCcEEEEEeccceeeec
Confidence 344545567899999999999999887765433 2222222 3433332 488888888877765433
No 173
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=54.87 E-value=3.3e+02 Score=30.85 Aligned_cols=185 Identities=11% Similarity=0.064 Sum_probs=90.3
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
...++.+|.++.+.--|..++++. +.|... +.+..+....+...|+-++.+.++--||...+.-.=+.. ..+.+ .
T Consensus 231 ~~~~iAas~d~~~r~Wnvd~~r~~--~TLsGHtdkVt~ak~~~~~~~vVsgs~DRtiK~WDl~k~~C~kt~l-~~S~c-n 306 (459)
T KOG0288|consen 231 NKHVIAASNDKNLRLWNVDSLRLR--HTLSGHTDKVTAAKFKLSHSRVVSGSADRTIKLWDLQKAYCSKTVL-PGSQC-N 306 (459)
T ss_pred CceEEeecCCCceeeeeccchhhh--hhhcccccceeeehhhccccceeeccccchhhhhhhhhhheecccc-ccccc-c
Confidence 444777777777666665555443 223221 112222112233322213346778888876533221111 11110 0
Q ss_pred CccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEc
Q 003800 132 PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINA 209 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~ 209 (794)
+ +......++. .+++|...|..++..+-+.+..+.. ..+-.+.++..+...+-+. .+-.+|.
T Consensus 307 D---------I~~~~~~~~SgH~DkkvRfwD~Rs~~~~~sv~~gg~v---tSl~ls~~g~~lLsssRDd----tl~viDl 370 (459)
T KOG0288|consen 307 D---------IVCSISDVISGHFDKKVRFWDIRSADKTRSVPLGGRV---TSLDLSMDGLELLSSSRDD----TLKVIDL 370 (459)
T ss_pred c---------eEecceeeeecccccceEEEeccCCceeeEeecCcce---eeEeeccCCeEEeeecCCC----ceeeeec
Confidence 0 0101111111 2788999998888888887765521 1221112233333222222 4556677
Q ss_pred CCCceeeeeeeec---ccCccCceEEEcCcEEEEEECCCCeEEEEEeeccee
Q 003800 210 MNGELLNHETAAF---SGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKI 258 (794)
Q Consensus 210 ~tG~~~w~~~v~~---~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~ 258 (794)
.+-++.-.++... .++.+..++-.++.++++. +.+|++++=++.+|++
T Consensus 371 Rt~eI~~~~sA~g~k~asDwtrvvfSpd~~YvaAG-S~dgsv~iW~v~tgKl 421 (459)
T KOG0288|consen 371 RTKEIRQTFSAEGFKCASDWTRVVFSPDGSYVAAG-SADGSVYIWSVFTGKL 421 (459)
T ss_pred ccccEEEEeeccccccccccceeEECCCCceeeec-cCCCcEEEEEccCceE
Confidence 6666665554222 2333333333354555555 4679999999999884
No 174
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=54.65 E-value=3.7e+02 Score=30.62 Aligned_cols=98 Identities=16% Similarity=0.165 Sum_probs=59.1
Q ss_pred EEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccc--ccc
Q 003800 65 IASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTN--LKV 142 (794)
Q Consensus 65 l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~--~~~ 142 (794)
|.-.| .+|+++|+...+. +.+.++.-...+.+|+|..+ |.++-+|.. |.. ++.+..+.. ...+.... ...
T Consensus 63 I~iys-~sG~ll~~i~w~~-~~iv~~~wt~~e~LvvV~~d-G~v~vy~~~-G~~--~fsl~~~i~--~~~v~e~~i~~~~ 134 (410)
T PF04841_consen 63 IQIYS-SSGKLLSSIPWDS-GRIVGMGWTDDEELVVVQSD-GTVRVYDLF-GEF--QFSLGEEIE--EEKVLECRIFAIW 134 (410)
T ss_pred EEEEC-CCCCEeEEEEECC-CCEEEEEECCCCeEEEEEcC-CEEEEEeCC-Cce--eechhhhcc--ccCcccccccccc
Confidence 55555 4799999988877 34444433567788888876 589999975 777 666554321 11111100 011
Q ss_pred ccCCeEEEE-ECCEEEEEECCCCcEEEEE
Q 003800 143 DKDSLILVS-SKGCLHAVSSIDGEILWTR 170 (794)
Q Consensus 143 ~~~~~V~V~-~~g~l~ald~~tG~~~W~~ 170 (794)
..+..++++ .+++++.++.-+...+|+.
T Consensus 135 ~~~~GivvLt~~~~~~~v~n~~~~~~~~~ 163 (410)
T PF04841_consen 135 FYKNGIVVLTGNNRFYVVNNIDEPVKLRR 163 (410)
T ss_pred cCCCCEEEECCCCeEEEEeCccccchhhc
Confidence 222446665 5888999976665555553
No 175
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=53.80 E-value=4.5e+02 Score=31.27 Aligned_cols=180 Identities=13% Similarity=0.135 Sum_probs=98.8
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEE-ccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLS-SDGSTLRAWNLPDGQMVWESFLRGSKHSK 131 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~tG~llWe~~l~~~~~s~ 131 (794)
++.++.++.+..+--=|.+||+-.=-...- ...+..+ ...+ .+.+| +.+.+|++||..+|+.+=-.......+
T Consensus 261 ~~~lvsgS~D~t~rvWd~~sg~C~~~l~gh-~stv~~~--~~~~-~~~~sgs~D~tVkVW~v~n~~~l~l~~~h~~~V-- 334 (537)
T KOG0274|consen 261 GDKLVSGSTDKTERVWDCSTGECTHSLQGH-TSSVRCL--TIDP-FLLVSGSRDNTVKVWDVTNGACLNLLRGHTGPV-- 334 (537)
T ss_pred CCEEEEEecCCcEEeEecCCCcEEEEecCC-CceEEEE--EccC-ceEeeccCCceEEEEeccCcceEEEeccccccE--
Confidence 556666776777766666676543222211 1112111 2333 34444 457899999999999886665433221
Q ss_pred CccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecC-CEEEEEEecCCceeEEEEEEc
Q 003800 132 PLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDES-DQIYVVGYAGSSQFHAYQINA 209 (794)
Q Consensus 132 ~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~-~~vyv~~~~g~~~~~v~ald~ 209 (794)
-.+ ..+ .+.++.. .+|.+..-|..+|+.+=+.+.-... .+.+. .++ ..+|-.+.++ .+-+.|+
T Consensus 335 --~~v----~~~-~~~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~~--V~sl~--~~~~~~~~Sgs~D~----~IkvWdl 399 (537)
T KOG0274|consen 335 --NCV----QLD-EPLLVSGSYDGTVKVWDPRTGKCLKSLSGHTGR--VYSLI--VDSENRLLSGSLDT----TIKVWDL 399 (537)
T ss_pred --EEE----Eec-CCEEEEEecCceEEEEEhhhceeeeeecCCcce--EEEEE--ecCcceEEeeeecc----ceEeecC
Confidence 111 111 3334444 3888888888888877666543222 22222 234 6666656664 6778888
Q ss_pred CCC-ceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800 210 MNG-ELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 210 ~tG-~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
.++ +.+-... .+..+. ..+..-++.+++... .+.+++-|.++++
T Consensus 400 ~~~~~c~~tl~--~h~~~v-~~l~~~~~~Lvs~~a-D~~Ik~WD~~~~~ 444 (537)
T KOG0274|consen 400 RTKRKCIHTLQ--GHTSLV-SSLLLRDNFLVSSSA-DGTIKLWDAEEGE 444 (537)
T ss_pred Cchhhhhhhhc--CCcccc-cccccccceeEeccc-cccEEEeecccCc
Confidence 887 3332221 122222 122234566776654 4678888888877
No 176
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=53.06 E-value=24 Score=39.22 Aligned_cols=73 Identities=14% Similarity=0.247 Sum_probs=50.3
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l 124 (794)
+.+.||+++..|.|+.+|.++|+..=+.-=+-.+++.+++...+..++.-+|-++.||-+|..+-+++=...+
T Consensus 258 ~gn~Iy~gn~~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~~~~las~GLDRyvRIhD~ktrkll~kvYv 330 (412)
T KOG3881|consen 258 SGNFIYTGNTKGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPTHPVLASCGLDRYVRIHDIKTRKLLHKVYV 330 (412)
T ss_pred CCcEEEEecccchhheecccCceeeccccCCccCCcceEEEcCCCceEEeeccceeEEEeecccchhhhhhhh
Confidence 4677999999999999999999876553222234455554333444555456679999999998666544433
No 177
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=53.03 E-value=28 Score=37.42 Aligned_cols=73 Identities=12% Similarity=0.199 Sum_probs=48.9
Q ss_pred CCEEEEEeCCCEEEEEECc-CCccceEEEcCcc-c--ceeeeeeeeCCEEEEEEccCCeEEEEeCC-CCcEeEEEeccCc
Q 003800 53 RKRVVVSTEENVIASLDLR-HGEIFWRHVLGIN-D--VVDGIDIALGKYVITLSSDGSTLRAWNLP-DGQMVWESFLRGS 127 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~-tG~ivWR~~l~~~-~--~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~-tG~llWe~~l~~~ 127 (794)
.+.||.+++++.+.+.|.| -++-+|+...-.. + .|..- ......++.|+.+..++.||.. -|+++.+....++
T Consensus 178 pnlvytGgDD~~l~~~D~R~p~~~i~~n~kvH~~GV~SI~ss--~~~~~~I~TGsYDe~i~~~DtRnm~kPl~~~~v~GG 255 (339)
T KOG0280|consen 178 PNLVYTGGDDGSLSCWDIRIPKTFIWHNSKVHTSGVVSIYSS--PPKPTYIATGSYDECIRVLDTRNMGKPLFKAKVGGG 255 (339)
T ss_pred CceEEecCCCceEEEEEecCCcceeeecceeeecceEEEecC--CCCCceEEEeccccceeeeehhcccCccccCccccc
Confidence 4679999999999999999 8888998433221 1 12111 1123355657788899999987 5666655554443
No 178
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=52.73 E-value=3.5e+02 Score=29.71 Aligned_cols=186 Identities=14% Similarity=0.208 Sum_probs=95.5
Q ss_pred eeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcC---cccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeE
Q 003800 45 VFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLG---INDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVW 120 (794)
Q Consensus 45 ~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~---~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llW 120 (794)
.|..|.. ...++-++++|.+...+..+ |.-.=. ..+.+..+.+. .++-.+.|+++ ..+|.||.-+|+.-.
T Consensus 90 ~F~~~~S-~shLlS~sdDG~i~iw~~~~----W~~~~slK~H~~~Vt~lsiHPS~KLALsVg~D-~~lr~WNLV~Gr~a~ 163 (362)
T KOG0294|consen 90 KFYPPLS-KSHLLSGSDDGHIIIWRVGS----WELLKSLKAHKGQVTDLSIHPSGKLALSVGGD-QVLRTWNLVRGRVAF 163 (362)
T ss_pred EecCCcc-hhheeeecCCCcEEEEEcCC----eEEeeeecccccccceeEecCCCceEEEEcCC-ceeeeehhhcCccce
Confidence 4554432 34689999999999988766 632211 11223333222 46667787875 599999999999998
Q ss_pred EEeccCccccCCccccccccccccCCeEEE-EECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCC
Q 003800 121 ESFLRGSKHSKPLLLVPTNLKVDKDSLILV-SSKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGS 199 (794)
Q Consensus 121 e~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~ 199 (794)
-.++..... -....+ .++-|+ .....+-.+-..+-++.=+...+... -++.-...+.+++.+-++
T Consensus 164 v~~L~~~at--~v~w~~-------~Gd~F~v~~~~~i~i~q~d~A~v~~~i~~~~r~----l~~~~l~~~~L~vG~d~~- 229 (362)
T KOG0294|consen 164 VLNLKNKAT--LVSWSP-------QGDHFVVSGRNKIDIYQLDNASVFREIENPKRI----LCATFLDGSELLVGGDNE- 229 (362)
T ss_pred eeccCCcce--eeEEcC-------CCCEEEEEeccEEEEEecccHhHhhhhhccccc----eeeeecCCceEEEecCCc-
Confidence 888876432 111111 233222 23333322222222222111111101 111112455566533232
Q ss_pred ceeEEEEEEcCCCceeeeeeeecccCccCceEEEcC---cEEEEEECCCCeEEEEEeecc
Q 003800 200 SQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSS---DTLVTLDTTRSILVTVSFKNR 256 (794)
Q Consensus 200 ~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~---~~lv~~d~~~g~L~v~~l~sg 256 (794)
.+..+|..++.+...... -+..+-+ ++.+.+ .+++.+. +.|.+.+=|+...
T Consensus 230 ---~i~~~D~ds~~~~~~~~A-H~~RVK~-i~~~~~~~~~~lvTaS-SDG~I~vWd~~~~ 283 (362)
T KOG0294|consen 230 ---WISLKDTDSDTPLTEFLA-HENRVKD-IASYTNPEHEYLVTAS-SDGFIKVWDIDME 283 (362)
T ss_pred ---eEEEeccCCCccceeeec-chhheee-eEEEecCCceEEEEec-cCceEEEEEcccc
Confidence 788889888777665531 1222222 222222 2555554 4688877777654
No 179
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the repeat described by InterPro entry IPR001258; PDB: 3QQZ_A.
Probab=50.41 E-value=3.4e+02 Score=28.83 Aligned_cols=187 Identities=15% Similarity=0.177 Sum_probs=81.1
Q ss_pred CCCEEEEEeC-CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc-CCeEEEEeCCC--CcEeE----EEe
Q 003800 52 GRKRVVVSTE-ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD-GSTLRAWNLPD--GQMVW----ESF 123 (794)
Q Consensus 52 ~~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~-g~~v~A~d~~t--G~llW----e~~ 123 (794)
+.+++|+.++ .+.|+.||. +|+++-|..+...+..-++. ..+++.++++.. .+.++.++..+ ..+-= +..
T Consensus 32 d~~tLfaV~d~~~~i~els~-~G~vlr~i~l~g~~D~EgI~-y~g~~~~vl~~Er~~~L~~~~~~~~~~~~~~~~~~~~~ 109 (248)
T PF06977_consen 32 DTGTLFAVQDEPGEIYELSL-DGKVLRRIPLDGFGDYEGIT-YLGNGRYVLSEERDQRLYIFTIDDDTTSLDRADVQKIS 109 (248)
T ss_dssp TTTEEEEEETTTTEEEEEET-T--EEEEEE-SS-SSEEEEE-E-STTEEEEEETTTTEEEEEEE----TT--EEEEEEEE
T ss_pred CCCeEEEEECCCCEEEEEcC-CCCEEEEEeCCCCCCceeEE-EECCCEEEEEEcCCCcEEEEEEeccccccchhhceEEe
Confidence 4677888776 589999996 79999999997654444553 356666666553 56787777632 22111 111
Q ss_pred ccCcc-ccCCcccccccccccc-CCeEEEEE---CCEEEEEEC--CCCcEEEEEec--cCcce---eeeeEEEEecCCEE
Q 003800 124 LRGSK-HSKPLLLVPTNLKVDK-DSLILVSS---KGCLHAVSS--IDGEILWTRDF--AAESV---EVQQVIQLDESDQI 191 (794)
Q Consensus 124 l~~~~-~s~~~~~~~~~~~~~~-~~~V~V~~---~g~l~ald~--~tG~~~W~~~~--~~~~~---~~~~~v~s~~~~~v 191 (794)
+.... .-.+.+-+ +.+. .+.+++.. -..++.++. ........... ..... .+..+..-...+.+
T Consensus 110 l~~~~~~N~G~EGl----a~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~l 185 (248)
T PF06977_consen 110 LGFPNKGNKGFEGL----AYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHL 185 (248)
T ss_dssp ---S---SS--EEE----EEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEE
T ss_pred cccccCCCcceEEE----EEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccceEEcCCCCeE
Confidence 11110 00111111 1222 34555553 345777775 22222222211 11110 11122212346678
Q ss_pred EEEEecCCceeEEEEEEcCCCceeeeeeeecc-cCccCceEEEcCcEEEEEECCCCeEEEEE
Q 003800 192 YVVGYAGSSQFHAYQINAMNGELLNHETAAFS-GGFVGDVALVSSDTLVTLDTTRSILVTVS 252 (794)
Q Consensus 192 yv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~-~~~s~~~~~vg~~~lv~~d~~~g~L~v~~ 252 (794)
|+++.... .+..+| .+|+++....+... .++... +-+..=+|.|. +|.|++..
T Consensus 186 liLS~es~---~l~~~d-~~G~~~~~~~L~~g~~gl~~~---~~QpEGIa~d~-~G~LYIvs 239 (248)
T PF06977_consen 186 LILSDESR---LLLELD-RQGRVVSSLSLDRGFHGLSKD---IPQPEGIAFDP-DGNLYIVS 239 (248)
T ss_dssp EEEETTTT---EEEEE--TT--EEEEEE-STTGGG-SS------SEEEEEE-T-T--EEEEE
T ss_pred EEEECCCC---eEEEEC-CCCCEEEEEEeCCcccCcccc---cCCccEEEECC-CCCEEEEc
Confidence 88775554 778888 67887766654332 121111 11223356664 46665543
No 180
>COG3419 PilY1 Tfp pilus assembly protein, tip-associated adhesin PilY1 [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=49.91 E-value=2.4e+02 Score=35.65 Aligned_cols=27 Identities=15% Similarity=0.276 Sum_probs=24.0
Q ss_pred EEEEEEccCCeEEEEeCCCCcEeEEEe
Q 003800 97 YVITLSSDGSTLRAWNLPDGQMVWESF 123 (794)
Q Consensus 97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~ 123 (794)
-+|+|+..++++.+||+.+|.++.-+-
T Consensus 583 ~~VyvgandGmLhaFd~~tG~E~fA~~ 609 (1036)
T COG3419 583 PVVYVGANDGMLHAFDANTGSERFAYV 609 (1036)
T ss_pred ceEEEecCCceeeeccCCccceeeecC
Confidence 478889888999999999999998765
No 181
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=48.84 E-value=1.2e+02 Score=32.90 Aligned_cols=98 Identities=12% Similarity=0.232 Sum_probs=64.2
Q ss_pred CCEEEEEECCCCcE--EEEEeccCcce---eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCcc
Q 003800 153 KGCLHAVSSIDGEI--LWTRDFAAESV---EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFV 227 (794)
Q Consensus 153 ~g~l~ald~~tG~~--~W~~~~~~~~~---~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s 227 (794)
=..|+.+|.+++++ +|+.....+.- +...+++-.-.+.+++.=.+|..++-|+.+|..+|+..+-....++.
T Consensus 77 YSHVH~yd~e~~~VrLLWkesih~~~~WaGEVSdIlYdP~~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~~ps~K--- 153 (339)
T PF09910_consen 77 YSHVHEYDTENDSVRLLWKESIHDKTKWAGEVSDILYDPYEDRLLLARADGHANLGVYSLDRRTGKAEKLSSNPSLK--- 153 (339)
T ss_pred cceEEEEEcCCCeEEEEEecccCCccccccchhheeeCCCcCEEEEEecCCcceeeeEEEcccCCceeeccCCCCcC---
Confidence 56899999999974 69876654321 23345544567888887778888899999999999998766433332
Q ss_pred CceEEEcCcEEEEEEC-----CCCeEEEEEeecce
Q 003800 228 GDVALVSSDTLVTLDT-----TRSILVTVSFKNRK 257 (794)
Q Consensus 228 ~~~~~vg~~~lv~~d~-----~~g~L~v~~l~sg~ 257 (794)
+..+. -.+|.+- ....++++||.+|+
T Consensus 154 G~~~~----D~a~F~i~~~~~g~~~i~~~Dli~~~ 184 (339)
T PF09910_consen 154 GTLVH----DYACFGINNFHKGVSGIHCLDLISGK 184 (339)
T ss_pred ceEee----eeEEEeccccccCCceEEEEEccCCe
Confidence 21111 1223322 12358888888887
No 182
>PF01453 B_lectin: D-mannose binding lectin; InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Alliaceae) contains a ~115-residue-long domain whose overall three-dimensional fold is very similar to that of Dictyostelium discoideum comitin, an actin-binding protein, and Curculigo latifolia curculin, a sweet-tasting and taste-modifying protein. This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity. Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat, four-stranded, antiparallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo three-fold axis; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=48.75 E-value=1.2e+02 Score=27.80 Aligned_cols=60 Identities=23% Similarity=0.528 Sum_probs=38.0
Q ss_pred CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEE
Q 003800 95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTR 170 (794)
Q Consensus 95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~ 170 (794)
+...+.+..+ +.+..+|.. |+.+|......... . . ...+....+|.|..+| .+|+++|+-
T Consensus 19 ~~~~L~l~~d-GnLvl~~~~-~~~iWss~~t~~~~-----~-~-------~~~~~L~~~GNlvl~d-~~~~~lW~S 78 (114)
T PF01453_consen 19 GNYTLILQSD-GNLVLYDSN-GSVIWSSNNTSGRG-----N-S-------GCYLVLQDDGNLVLYD-SSGNVLWQS 78 (114)
T ss_dssp TTEEEEEETT-SEEEEEETT-TEEEEE--S-TTSS-------S-------SEEEEEETTSEEEEEE-TTSEEEEES
T ss_pred ccccceECCC-CeEEEEcCC-CCEEEEecccCCcc-----c-c-------CeEEEEeCCCCEEEEe-ecceEEEee
Confidence 5667777876 478888865 88899983222110 0 0 1122233588888888 699999986
No 183
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=48.64 E-value=1.4e+02 Score=36.07 Aligned_cols=63 Identities=16% Similarity=0.156 Sum_probs=37.7
Q ss_pred EccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEec
Q 003800 102 SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDF 172 (794)
Q Consensus 102 s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~ 172 (794)
|+.+.+||.||..+|..+--+......+ ..+. ....+.-++. .+|.+---|..+|+++=+...
T Consensus 553 GSsD~tVRlWDv~~G~~VRiF~GH~~~V----~al~----~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~ 617 (707)
T KOG0263|consen 553 GSSDRTVRLWDVSTGNSVRIFTGHKGPV----TALA----FSPCGRYLASGDEDGLIKIWDLANGSLVKQLKG 617 (707)
T ss_pred CCCCceEEEEEcCCCcEEEEecCCCCce----EEEE----EcCCCceEeecccCCcEEEEEcCCCcchhhhhc
Confidence 5557899999999999987776554332 1111 1112333332 267777777777766655433
No 184
>PRK01742 tolB translocation protein TolB; Provisional
Probab=48.41 E-value=4.6e+02 Score=29.84 Aligned_cols=144 Identities=11% Similarity=0.055 Sum_probs=67.2
Q ss_pred cCCCEEEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEccCC--eEEEEeCCCCcEeEEEec
Q 003800 51 TGRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSDGS--TLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 51 ~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~g~--~v~A~d~~tG~llWe~~l 124 (794)
+++++++.++. ...|+.+|.++|+..--..+... ...... ..|+.+++.+..++ .++.||..+|.+. +...
T Consensus 213 PDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~~~~g~--~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~-~lt~ 289 (429)
T PRK01742 213 PDGSKLAYVSFENKKSQLVVHDLRSGARKVVASFRGH--NGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPS-QLTS 289 (429)
T ss_pred CCCCEEEEEEecCCCcEEEEEeCCCCceEEEecCCCc--cCceeECCCCCEEEEEEecCCcEEEEEEECCCCCeE-eecc
Confidence 34555655543 24799999999975322222221 111111 23445555443222 5788888777643 1111
Q ss_pred cCccccCCccccccccccccCCeEEEEE--CC--EEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800 125 RGSKHSKPLLLVPTNLKVDKDSLILVSS--KG--CLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSS 200 (794)
Q Consensus 125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g--~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~ 200 (794)
..... ..+... .+ +..+++.+ +| .++.++..+|..... . ... . ....+.++..+++.+..
T Consensus 290 ~~~~~-~~~~wS-----pD-G~~i~f~s~~~g~~~I~~~~~~~~~~~~l-~-~~~-~---~~~~SpDG~~ia~~~~~--- 353 (429)
T PRK01742 290 GAGNN-TEPSWS-----PD-GQSILFTSDRSGSPQVYRMSASGGGASLV-G-GRG-Y---SAQISADGKTLVMINGD--- 353 (429)
T ss_pred CCCCc-CCEEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCeEEe-c-CCC-C---CccCCCCCCEEEEEcCC---
Confidence 11111 111121 22 23344332 22 677777666654432 1 111 1 11112345555554432
Q ss_pred eeEEEEEEcCCCcee
Q 003800 201 QFHAYQINAMNGELL 215 (794)
Q Consensus 201 ~~~v~ald~~tG~~~ 215 (794)
.+..+|+.+|+..
T Consensus 354 --~i~~~Dl~~g~~~ 366 (429)
T PRK01742 354 --NVVKQDLTSGSTE 366 (429)
T ss_pred --CEEEEECCCCCeE
Confidence 3666899999754
No 185
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=48.10 E-value=92 Score=32.75 Aligned_cols=83 Identities=19% Similarity=0.356 Sum_probs=52.1
Q ss_pred CCEEEEEEccCCeEEEEe--CCCCcEeEEEeccCccccCC-ccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEE
Q 003800 95 GKYVITLSSDGSTLRAWN--LPDGQMVWESFLRGSKHSKP-LLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWT 169 (794)
Q Consensus 95 g~~~V~Vs~~g~~v~A~d--~~tG~llWe~~l~~~~~s~~-~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~ 169 (794)
.+...++-+.+-.|-||| ..+|.+.=+..+-.-.-+++ -+..|..+.++..+.++|. ++|+++.+|+.||+.+=+
T Consensus 169 ~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng~~V~~~dp~tGK~L~e 248 (310)
T KOG4499|consen 169 AKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNGGTVQKVDPTTGKILLE 248 (310)
T ss_pred CcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEccCCcEEEEEecCcEEEEECCCCCcEEEE
Confidence 344455544456898888 88887654433321000000 0122223355667778885 589999999999999999
Q ss_pred EeccCcce
Q 003800 170 RDFAAESV 177 (794)
Q Consensus 170 ~~~~~~~~ 177 (794)
...|.+..
T Consensus 249 iklPt~qi 256 (310)
T KOG4499|consen 249 IKLPTPQI 256 (310)
T ss_pred EEcCCCce
Confidence 99997654
No 186
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=46.81 E-value=1.9e+02 Score=26.27 Aligned_cols=52 Identities=19% Similarity=0.409 Sum_probs=28.9
Q ss_pred eEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEecc
Q 003800 107 TLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFA 173 (794)
Q Consensus 107 ~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~ 173 (794)
.+..++...+..+|......+.. . ...+.+..+|.|.-+|. +|.++|.-...
T Consensus 31 nlV~~~~~~~~~vW~snt~~~~~-------------~-~~~l~l~~dGnLvl~~~-~g~~vW~S~t~ 82 (114)
T smart00108 31 NLILYKSSSRTVVWVANRDNPVS-------------D-SCTLTLQSDGNLVLYDG-DGRVVWSSNTT 82 (114)
T ss_pred EEEEEECCCCcEEEECCCCCCCC-------------C-CEEEEEeCCCCEEEEeC-CCCEEEEeccc
Confidence 44444443367888865433211 0 11222335888887774 48899986443
No 187
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=46.80 E-value=2.2e+02 Score=31.54 Aligned_cols=64 Identities=13% Similarity=0.078 Sum_probs=44.4
Q ss_pred EEE-EECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCcee
Q 003800 148 ILV-SSKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELL 215 (794)
Q Consensus 148 V~V-~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~ 215 (794)
|.| +++|.+..+|..||+.+=+++.+.+.....+++...++..|+..+.+| .|.++|+.+-...
T Consensus 43 vav~lSngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~ds~h~v~s~ssDG----~Vr~wD~Rs~~e~ 107 (376)
T KOG1188|consen 43 VAVSLSNGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCDSPHGVISCSSDG----TVRLWDIRSQAES 107 (376)
T ss_pred EEEEecCCeEEEEeccchhhhheecCCCCcccceEEecCCCCCeeEEeccCC----eEEEEEeecchhh
Confidence 555 479999999999999988887766554333333212567788777777 7888887765444
No 188
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=46.53 E-value=5.1e+02 Score=29.78 Aligned_cols=187 Identities=14% Similarity=0.144 Sum_probs=106.4
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceee-eee-eeCCEEEEEEccCCeEEEEeC-----C-----------
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDG-IDI-ALGKYVITLSSDGSTLRAWNL-----P----------- 114 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~-l~~-~~g~~~V~Vs~~g~~v~A~d~-----~----------- 114 (794)
.+.|.|-+-+|.|.-++.+. ..-++.++.-. +++ +.. ...+-.|+.++ ...+.++.- .
T Consensus 145 ~~~IcVQS~DG~L~~feqe~--~~f~~~lp~~l-lPgPl~Y~~~tDsfvt~ss-s~~l~~Yky~~La~~s~~~~~~~~~~ 220 (418)
T PF14727_consen 145 RDFICVQSMDGSLSFFEQES--FAFSRFLPDFL-LPGPLCYCPRTDSFVTASS-SWTLECYKYQDLASASEASSRQSGTE 220 (418)
T ss_pred ceEEEEEecCceEEEEeCCc--EEEEEEcCCCC-CCcCeEEeecCCEEEEecC-ceeEEEecHHHhhhcccccccccccc
Confidence 56699999999999998654 45566665531 111 111 12333444333 235555431 0
Q ss_pred ----CC---cEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCcce--eeeeEEEE
Q 003800 115 ----DG---QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAESV--EVQQVIQL 185 (794)
Q Consensus 115 ----tG---~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~~--~~~~~v~s 185 (794)
+| ..-|.+.++.+.+ ++.++.. ......++|++...|++++. +|.++|..++.-... -++.+...
T Consensus 221 ~~~~~~k~l~~dWs~nlGE~~l--~i~v~~~---~~~~~~IvvLger~Lf~l~~-~G~l~~~krLd~~p~~~~~Y~~~~~ 294 (418)
T PF14727_consen 221 QDISSGKKLNPDWSFNLGEQAL--DIQVVRF---SSSESDIVVLGERSLFCLKD-NGSLRFQKRLDYNPSCFCPYRVPWY 294 (418)
T ss_pred ccccccccccceeEEECCceeE--EEEEEEc---CCCCceEEEEecceEEEEcC-CCeEEEEEecCCceeeEEEEEeecc
Confidence 22 4679999887664 3333321 11245789999999999995 799999999865432 22333111
Q ss_pred ecCC---EEEEEEecCCceeEEEEEEcCCCceeeeeeeecc-cCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800 186 DESD---QIYVVGYAGSSQFHAYQINAMNGELLNHETAAFS-GGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 186 ~~~~---~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~-~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
..++ .+.+.+..+ .+..+ ++.+.+|..++... -.++- +-+- -.+.+|.++. +|.|.+.=|+|..
T Consensus 295 ~~~~~~~~llV~t~t~----~LlVy--~d~~L~WsA~l~~~PVal~v-~~~~~~~G~IV~Ls~-~G~L~v~YLGTdP 363 (418)
T PF14727_consen 295 NEPSTRLNLLVGTHTG----TLLVY--EDTTLVWSAQLPHVPVALSV-ANFNGLKGLIVSLSD-EGQLSVSYLGTDP 363 (418)
T ss_pred cCCCCceEEEEEecCC----eEEEE--eCCeEEEecCCCCCCEEEEe-cccCCCCceEEEEcC-CCcEEEEEeCCCC
Confidence 1222 234333333 34443 37788999976321 11110 0000 1468888874 6999999999865
No 189
>PRK13684 Ycf48-like protein; Provisional
Probab=43.82 E-value=4.8e+02 Score=28.74 Aligned_cols=168 Identities=13% Similarity=0.114 Sum_probs=76.6
Q ss_pred ccEeeEEeccCcee---eeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcc--c--ceeeeeeeeCCEEEEE
Q 003800 29 GLMDWHQQYIGKVK---HAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN--D--VVDGIDIALGKYVITL 101 (794)
Q Consensus 29 G~~dW~~~~vG~~~---~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~--~--~i~~l~~~~g~~~V~V 101 (794)
....|++...+... .-.|. +++..|+....|.|.. ..||-.-|++..... . .+..+.. .++..+++
T Consensus 33 ~~~~W~~~~~~~~~~l~~v~F~----d~~~g~avG~~G~il~--T~DgG~tW~~~~~~~~~~~~~l~~v~~-~~~~~~~~ 105 (334)
T PRK13684 33 SSSPWQVIDLPTEANLLDIAFT----DPNHGWLVGSNRTLLE--TNDGGETWEERSLDLPEENFRLISISF-KGDEGWIV 105 (334)
T ss_pred cCCCcEEEecCCCCceEEEEEe----CCCcEEEEECCCEEEE--EcCCCCCceECccCCcccccceeeeEE-cCCcEEEe
Confidence 33459988754322 12333 2445555555665543 346788899864321 1 1112211 23333333
Q ss_pred EccCCeEEEEeCCCCcEeEEEeccCccccCC-ccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceee
Q 003800 102 SSDGSTLRAWNLPDGQMVWESFLRGSKHSKP-LLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEV 179 (794)
Q Consensus 102 s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~-~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~ 179 (794)
+ ..+. .|-..||-.-|+........... ..+. ...++.+++. ..|.+++- .+|-..|+..........
T Consensus 106 G-~~g~--i~~S~DgG~tW~~~~~~~~~~~~~~~i~-----~~~~~~~~~~g~~G~i~~S--~DgG~tW~~~~~~~~g~~ 175 (334)
T PRK13684 106 G-QPSL--LLHTTDGGKNWTRIPLSEKLPGSPYLIT-----ALGPGTAEMATNVGAIYRT--TDGGKNWEALVEDAAGVV 175 (334)
T ss_pred C-CCce--EEEECCCCCCCeEccCCcCCCCCceEEE-----EECCCcceeeeccceEEEE--CCCCCCceeCcCCCcceE
Confidence 3 3333 34467899999876422111001 1111 1112333333 34544333 456678886443221112
Q ss_pred eeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800 180 QQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET 219 (794)
Q Consensus 180 ~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~ 219 (794)
..+. ...++.+++++..| .++.. ...|..-|+..
T Consensus 176 ~~i~-~~~~g~~v~~g~~G----~i~~s-~~~gg~tW~~~ 209 (334)
T PRK13684 176 RNLR-RSPDGKYVAVSSRG----NFYST-WEPGQTAWTPH 209 (334)
T ss_pred EEEE-ECCCCeEEEEeCCc----eEEEE-cCCCCCeEEEe
Confidence 2222 12445555555555 34432 23566677663
No 190
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function; it possibly regulates import of fructose-1,6-bisphosphatase (FBPase) into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of FBPase.
Probab=43.53 E-value=3.9e+02 Score=33.30 Aligned_cols=110 Identities=15% Similarity=0.163 Sum_probs=63.0
Q ss_pred CEEEEEEc-cCCeEEEEeCCCCcEeEEEeccCcc-ccCCcccccc-ccccccCCeEEEE-ECCEEEEEECC-CC-cEEEE
Q 003800 96 KYVITLSS-DGSTLRAWNLPDGQMVWESFLRGSK-HSKPLLLVPT-NLKVDKDSLILVS-SKGCLHAVSSI-DG-EILWT 169 (794)
Q Consensus 96 ~~~V~Vs~-~g~~v~A~d~~tG~llWe~~l~~~~-~s~~~~~~~~-~~~~~~~~~V~V~-~~g~l~ald~~-tG-~~~W~ 169 (794)
..++.... +...|+-+|.+.|+++=+|...... . ..+.+. ..+.-.....|+. ++..|+++|+. .| +++|.
T Consensus 493 ~~mil~~~~~~~~ly~mDLe~GKVV~eW~~~~~~~v---~~~~p~~K~aqlt~e~tflGls~n~lfriDpR~~~~k~v~~ 569 (794)
T PF08553_consen 493 RNMILLDPNNPNKLYKMDLERGKVVEEWKVHDDIPV---VDIAPDSKFAQLTNEQTFLGLSDNSLFRIDPRLSGNKLVDS 569 (794)
T ss_pred cceEeecCCCCCceEEEecCCCcEEEEeecCCCcce---eEecccccccccCCCceEEEECCCceEEeccCCCCCceeec
Confidence 34555543 4578999999999998888776432 1 011110 0000012345554 79999999998 45 46775
Q ss_pred EeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCc
Q 003800 170 RDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGE 213 (794)
Q Consensus 170 ~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~ 213 (794)
....-..-...++.....+|.+.+++..| .+.-+| ..|.
T Consensus 570 ~~k~Y~~~~~Fs~~aTt~~G~iavgs~~G----~IRLyd-~~g~ 608 (794)
T PF08553_consen 570 QSKQYSSKNNFSCFATTEDGYIAVGSNKG----DIRLYD-RLGK 608 (794)
T ss_pred cccccccCCCceEEEecCCceEEEEeCCC----cEEeec-ccch
Confidence 43221111234565455677777766666 344445 4564
No 191
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. The archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=43.17 E-value=68 Score=23.08 Aligned_cols=31 Identities=13% Similarity=0.292 Sum_probs=24.3
Q ss_pred CCCEEEEEeC-CCEEEEEECcCCccceEEEcC
Q 003800 52 GRKRVVVSTE-ENVIASLDLRHGEIFWRHVLG 82 (794)
Q Consensus 52 ~~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~ 82 (794)
+++++|++.. .+.|..+|+++|+++=+....
T Consensus 2 d~~~lyv~~~~~~~v~~id~~~~~~~~~i~vg 33 (42)
T TIGR02276 2 DGTKLYVTNSGSNTVSVIDTATNKVIATIPVG 33 (42)
T ss_pred CCCEEEEEeCCCCEEEEEECCCCeEEEEEECC
Confidence 3677999886 689999999999776665553
No 192
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=42.59 E-value=2e+02 Score=26.29 Aligned_cols=22 Identities=23% Similarity=0.557 Sum_probs=15.8
Q ss_pred EECCEEEEEECCCCcEEEEEecc
Q 003800 151 SSKGCLHAVSSIDGEILWTRDFA 173 (794)
Q Consensus 151 ~~~g~l~ald~~tG~~~W~~~~~ 173 (794)
..+|.|+..|. +|.++|.-...
T Consensus 62 ~~dGnLvl~~~-~g~~vW~S~~~ 83 (116)
T cd00028 62 QSDGNLVIYDG-SGTVVWSSNTT 83 (116)
T ss_pred ecCCCeEEEcC-CCcEEEEeccc
Confidence 35788877774 67899986543
No 193
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=42.28 E-value=4.2e+02 Score=33.78 Aligned_cols=147 Identities=9% Similarity=0.068 Sum_probs=73.4
Q ss_pred CEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCc-----EEEE
Q 003800 96 KYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGE-----ILWT 169 (794)
Q Consensus 96 ~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~-----~~W~ 169 (794)
.+.++++|+-..||-||+..-...=.....++.+ +..+. .....++.+++. .||.|..+|...-. -.|+
T Consensus 1177 ~G~Ll~tGd~r~IRIWDa~~E~~~~diP~~s~t~---vTaLS--~~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R 1251 (1387)
T KOG1517|consen 1177 SGHLLVTGDVRSIRIWDAHKEQVVADIPYGSSTL---VTALS--ADLVHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYR 1251 (1387)
T ss_pred CCeEEecCCeeEEEEEecccceeEeecccCCCcc---ceeec--ccccCCceEEEeecCCceEEeecccCCccccceeec
Confidence 3556667766789999998766666555554432 11111 122323444444 59999999876433 3465
Q ss_pred EeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCcc--CceEEE--cCcEEEEEECCC
Q 003800 170 RDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFV--GDVALV--SSDTLVTLDTTR 245 (794)
Q Consensus 170 ~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s--~~~~~v--g~~~lv~~d~~~ 245 (794)
.-...+.+.-..+. ...-+.++.++.+| .+.-+|+..-....-.++..++.-. -.++.| -..+++|...
T Consensus 1252 ~h~~~~~Iv~~slq-~~G~~elvSgs~~G----~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~hapiiAsGs~-- 1324 (1387)
T KOG1517|consen 1252 EHNDVEPIVHLSLQ-RQGLGELVSGSQDG----DIQLLDLRMSSKETFLTIVAHWEYGSALTALTVHEHAPIIASGSA-- 1324 (1387)
T ss_pred ccCCcccceeEEee-cCCCcceeeeccCC----eEEEEecccCcccccceeeeccccCccceeeeeccCCCeeeecCc--
Confidence 43332222111221 11223455555555 6777787653222222333333111 133333 4457777753
Q ss_pred CeEEEEEee
Q 003800 246 SILVTVSFK 254 (794)
Q Consensus 246 g~L~v~~l~ 254 (794)
+.+.+.++.
T Consensus 1325 q~ikIy~~~ 1333 (1387)
T KOG1517|consen 1325 QLIKIYSLS 1333 (1387)
T ss_pred ceEEEEecC
Confidence 445555543
No 194
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (EC 1.4.99.3) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when the bacteria are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (InterPro IPR002386). Reaction: RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=42.12 E-value=5.3e+02 Score=28.75 Aligned_cols=189 Identities=10% Similarity=0.106 Sum_probs=108.0
Q ss_pred CEEEEEeC-----CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-Ec------cC---CeEEEEeCCCCcE
Q 003800 54 KRVVVSTE-----ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SS------DG---STLRAWNLPDGQM 118 (794)
Q Consensus 54 ~~Vyv~t~-----~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~------~g---~~v~A~d~~tG~l 118 (794)
.||||.+- .+.++-+|+++|+.+=.....-..+ +.+..++..+++ ++ .| ..|..||+.|=.+
T Consensus 3 ~rvyV~D~~~~~~~~rv~viD~d~~k~lGmi~~g~~~~---~~~spdgk~~y~a~T~~sR~~rG~RtDvv~~~D~~TL~~ 79 (342)
T PF06433_consen 3 HRVYVQDPVFFHMTSRVYVIDADSGKLLGMIDTGFLGN---VALSPDGKTIYVAETFYSRGTRGERTDVVEIWDTQTLSP 79 (342)
T ss_dssp TEEEEEE-GGGGSSEEEEEEETTTTEEEEEEEEESSEE---EEE-TTSSEEEEEEEEEEETTEEEEEEEEEEEETTTTEE
T ss_pred cEEEEECCccccccceEEEEECCCCcEEEEeecccCCc---eeECCCCCEEEEEEEEEeccccccceeEEEEEecCcCcc
Confidence 46666664 3578888888887644433322211 111223333332 21 11 3589999999999
Q ss_pred eEEEeccCc-cccCCcccccccccc-ccCCeEEEEE---CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEE
Q 003800 119 VWESFLRGS-KHSKPLLLVPTNLKV-DKDSLILVSS---KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYV 193 (794)
Q Consensus 119 lWe~~l~~~-~~s~~~~~~~~~~~~-~~~~~V~V~~---~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv 193 (794)
.||..+... .. ...+... .... +.++.++|.. ...|..+|.+.++++=+.+.|.=.. +.+ ..+...+.
T Consensus 80 ~~EI~iP~k~R~-~~~~~~~-~~~ls~dgk~~~V~N~TPa~SVtVVDl~~~kvv~ei~~PGC~~----iyP-~~~~~F~~ 152 (342)
T PF06433_consen 80 TGEIEIPPKPRA-QVVPYKN-MFALSADGKFLYVQNFTPATSVTVVDLAAKKVVGEIDTPGCWL----IYP-SGNRGFSM 152 (342)
T ss_dssp EEEEEETTS-B---BS--GG-GEEE-TTSSEEEEEEESSSEEEEEEETTTTEEEEEEEGTSEEE----EEE-EETTEEEE
T ss_pred cceEecCCcchh-eeccccc-ceEEccCCcEEEEEccCCCCeEEEEECCCCceeeeecCCCEEE----EEe-cCCCceEE
Confidence 999999864 22 1111111 0111 2256777763 7789999999999988777765332 222 35677888
Q ss_pred EEecCCceeEEEEEEcCCCceeeeeeeecccCccC-----ceEEE-cCcEEEEEECCCCeEEEEEeeccee
Q 003800 194 VGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVG-----DVALV-SSDTLVTLDTTRSILVTVSFKNRKI 258 (794)
Q Consensus 194 ~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~-----~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~~ 258 (794)
+|.+|+ +..+.||. .|+.. +..-. ...... ...++ .++.+++.. ++|.++..++.....
T Consensus 153 lC~DGs--l~~v~Ld~-~Gk~~-~~~t~-~F~~~~dp~f~~~~~~~~~~~~~F~S-y~G~v~~~dlsg~~~ 217 (342)
T PF06433_consen 153 LCGDGS--LLTVTLDA-DGKEA-QKSTK-VFDPDDDPLFEHPAYSRDGGRLYFVS-YEGNVYSADLSGDSA 217 (342)
T ss_dssp EETTSC--EEEEEETS-TSSEE-EEEEE-ESSTTTS-B-S--EEETTTTEEEEEB-TTSEEEEEEETTSSE
T ss_pred EecCCc--eEEEEECC-CCCEe-Eeecc-ccCCCCcccccccceECCCCeEEEEe-cCCEEEEEeccCCcc
Confidence 888873 33344442 78887 33222 111111 22223 345677776 679999999987653
No 195
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=42.08 E-value=2.2e+02 Score=34.12 Aligned_cols=101 Identities=14% Similarity=0.141 Sum_probs=58.7
Q ss_pred EEEEEe-CCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCc
Q 003800 55 RVVVST-EENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPL 133 (794)
Q Consensus 55 ~Vyv~t-~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~ 133 (794)
.++|+. -++.|.-.|++|++.+=+.+ +..+++..+.+..++..+.-++.+++++.||..--+=+-.+.+..+..
T Consensus 184 t~ivsGgtek~lr~wDprt~~kimkLr-GHTdNVr~ll~~dDGt~~ls~sSDgtIrlWdLgqQrCl~T~~vH~e~V---- 258 (735)
T KOG0308|consen 184 TIIVSGGTEKDLRLWDPRTCKKIMKLR-GHTDNVRVLLVNDDGTRLLSASSDGTIRLWDLGQQRCLATYIVHKEGV---- 258 (735)
T ss_pred eEEEecCcccceEEeccccccceeeee-ccccceEEEEEcCCCCeEeecCCCceEEeeeccccceeeeEEeccCce----
Confidence 355544 47889999999999988877 444566655333333233323446799999986555555555544321
Q ss_pred cccccccccccCCeEEEE-ECCEEEEEECCC
Q 003800 134 LLVPTNLKVDKDSLILVS-SKGCLHAVSSID 163 (794)
Q Consensus 134 ~~~~~~~~~~~~~~V~V~-~~g~l~ald~~t 163 (794)
+.... ... -..+|.. .+|.+++-|..+
T Consensus 259 WaL~~--~~s-f~~vYsG~rd~~i~~Tdl~n 286 (735)
T KOG0308|consen 259 WALQS--SPS-FTHVYSGGRDGNIYRTDLRN 286 (735)
T ss_pred EEEee--CCC-cceEEecCCCCcEEecccCC
Confidence 22211 000 1334444 377788877765
No 196
>PRK04922 tolB translocation protein TolB; Provisional
Probab=42.02 E-value=5.7e+02 Score=29.09 Aligned_cols=149 Identities=13% Similarity=0.170 Sum_probs=71.9
Q ss_pred eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEE-EE-C--CEEEEEECCCCcEE
Q 003800 94 LGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-SS-K--GCLHAVSSIDGEIL 167 (794)
Q Consensus 94 ~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~-~--g~l~ald~~tG~~~ 167 (794)
.++.+++++.. ...++.||..+|+..--....+.. ..+.+ ..+ ++.+++ .+ + ..|+.+|..+|+..
T Consensus 214 Dg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~~~g~~--~~~~~-----SpD-G~~l~~~~s~~g~~~Iy~~d~~~g~~~ 285 (433)
T PRK04922 214 DGKKLAYVSFERGRSAIYVQDLATGQRELVASFRGIN--GAPSF-----SPD-GRRLALTLSRDGNPEIYVMDLGSRQLT 285 (433)
T ss_pred CCCEEEEEecCCCCcEEEEEECCCCCEEEeccCCCCc--cCceE-----CCC-CCEEEEEEeCCCCceEEEEECCCCCeE
Confidence 46667776532 357999999999865333322211 11111 123 334444 33 3 37999999988753
Q ss_pred EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCCC
Q 003800 168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTRS 246 (794)
Q Consensus 168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~g 246 (794)
=-........ .+..+.++..+++.+..++ ...++.+|+.+|+... +.........+.+ ..++.++......+
T Consensus 286 ~lt~~~~~~~---~~~~spDG~~l~f~sd~~g-~~~iy~~dl~~g~~~~---lt~~g~~~~~~~~SpDG~~Ia~~~~~~~ 358 (433)
T PRK04922 286 RLTNHFGIDT---EPTWAPDGKSIYFTSDRGG-RPQIYRVAASGGSAER---LTFQGNYNARASVSPDGKKIAMVHGSGG 358 (433)
T ss_pred ECccCCCCcc---ceEECCCCCEEEEEECCCC-CceEEEEECCCCCeEE---eecCCCCccCEEECCCCCEEEEEECCCC
Confidence 2111111111 1111234455555443322 2368888988887432 1111111111222 13344444433322
Q ss_pred --eEEEEEeecce
Q 003800 247 --ILVTVSFKNRK 257 (794)
Q Consensus 247 --~L~v~~l~sg~ 257 (794)
.+++.++.+|+
T Consensus 359 ~~~I~v~d~~~g~ 371 (433)
T PRK04922 359 QYRIAVMDLSTGS 371 (433)
T ss_pred ceeEEEEECCCCC
Confidence 57777877765
No 197
>PF15525 DUF4652: Domain of unknown function (DUF4652)
Probab=40.85 E-value=3.8e+02 Score=27.26 Aligned_cols=65 Identities=25% Similarity=0.412 Sum_probs=39.6
Q ss_pred ceEEEEECCCCcEEEEEecccCCCCCCCceeeEEeeecCcccCCCCCCeEEEEEE-ecCCCCCCcEEEEEEccCCcee
Q 003800 497 RKIFALHSGDGRVVWSLLLHKSEACDSPTELNLYQWQTPHHHAMDENPSVLVVGR-CGVSSKAPAILSFVDTYTGKEL 573 (794)
Q Consensus 497 Gkl~alds~~G~i~W~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~~-~~~~~~~~~~~~~~d~~tG~~~ 573 (794)
|+||--|..+|.. |++.+.+.+....| +.+-|- |+...++|++. .| .-..-|.+|.+|..||+..
T Consensus 88 GkIYIkn~~~~~~-~~L~i~~~~~k~sP---K~i~Wi-------DD~~L~vIIG~a~G-TvS~GGnLy~~nl~tg~~~ 153 (200)
T PF15525_consen 88 GKIYIKNLNNNNW-WSLQIDQNEEKYSP---KYIEWI-------DDNNLAVIIGYAHG-TVSKGGNLYKYNLNTGNLT 153 (200)
T ss_pred eeEEEEecCCCce-EEEEecCcccccCC---ceeEEe-------cCCcEEEEEccccc-eEccCCeEEEEEccCCcee
Confidence 7888888887776 88877653211111 134552 23444555553 11 0245588999999999865
No 198
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=40.38 E-value=5.1e+02 Score=28.04 Aligned_cols=175 Identities=13% Similarity=0.104 Sum_probs=88.4
Q ss_pred EEEEEECcCCccceEEEcCcccceeeeee-e--eC----CEEEEEEcc---------C-CeEEEEeCCCC-------cEe
Q 003800 64 VIASLDLRHGEIFWRHVLGINDVVDGIDI-A--LG----KYVITLSSD---------G-STLRAWNLPDG-------QMV 119 (794)
Q Consensus 64 ~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~--~g----~~~V~Vs~~---------g-~~v~A~d~~tG-------~ll 119 (794)
.|--+|+.+.+++=++.|+....+..+.. . .+ ...++||+. . |+++.++...+ +++
T Consensus 3 ~i~l~d~~~~~~~~~~~l~~~E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i 82 (321)
T PF03178_consen 3 SIRLVDPTTFEVLDSFELEPNEHVTSLCSVKLKGDSTGKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLI 82 (321)
T ss_dssp EEEEEETTTSSEEEEEEEETTEEEEEEEEEEETTS---SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEE
T ss_pred EEEEEeCCCCeEEEEEECCCCceEEEEEEEEEcCccccccCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEE
Confidence 46667888888887777776643332211 1 11 345555432 1 68999999885 333
Q ss_pred EEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCc-EEEEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800 120 WESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGE-ILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 120 We~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~-~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
.+....+++. .+ ... .+.+++..++.|+.++....+ ..=......+.. ...+ ...++.+++.....
T Consensus 83 ~~~~~~g~V~-----ai----~~~-~~~lv~~~g~~l~v~~l~~~~~l~~~~~~~~~~~-i~sl--~~~~~~I~vgD~~~ 149 (321)
T PF03178_consen 83 HSTEVKGPVT-----AI----CSF-NGRLVVAVGNKLYVYDLDNSKTLLKKAFYDSPFY-ITSL--SVFKNYILVGDAMK 149 (321)
T ss_dssp EEEEESS-EE-----EE----EEE-TTEEEEEETTEEEEEEEETTSSEEEEEEE-BSSS-EEEE--EEETTEEEEEESSS
T ss_pred EEEeecCcce-----Eh----hhh-CCEEEEeecCEEEEEEccCcccchhhheecceEE-EEEE--eccccEEEEEEccc
Confidence 3444433322 11 122 566777778888888877666 221111111111 1122 23466666554433
Q ss_pred CceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEee
Q 003800 199 SSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFK 254 (794)
Q Consensus 199 ~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~ 254 (794)
++.++.++...-+...-.+-..+..+....+++.++.+++.|. .|+++++...
T Consensus 150 --sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~-~gnl~~l~~~ 202 (321)
T PF03178_consen 150 --SVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDK-DGNLFVLRYN 202 (321)
T ss_dssp --SEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEET-TSEEEEEEE-
T ss_pred --CEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcC-CCeEEEEEEC
Confidence 2566666763332332222122333333333334457788885 5888777664
No 199
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=38.59 E-value=82 Score=34.10 Aligned_cols=63 Identities=17% Similarity=0.344 Sum_probs=44.8
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCC
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPD 115 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~t 115 (794)
+...||-++.+-.|+..|.+||+..-|+.....- +..+.+. .|-.+|.-++++++++.||...
T Consensus 101 d~s~i~S~gtDk~v~~wD~~tG~~~rk~k~h~~~-vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~ 164 (338)
T KOG0265|consen 101 DGSHILSCGTDKTVRGWDAETGKRIRKHKGHTSF-VNSLDPSRRGPQLVCSGSDDGTLKLWDIRK 164 (338)
T ss_pred CCCEEEEecCCceEEEEecccceeeehhccccce-eeecCccccCCeEEEecCCCceEEEEeecc
Confidence 3556888888999999999999999998887662 2222222 3444444344678999999863
No 200
>PF14339 DUF4394: Domain of unknown function (DUF4394)
Probab=38.50 E-value=5e+02 Score=27.39 Aligned_cols=165 Identities=8% Similarity=-0.003 Sum_probs=0.0
Q ss_pred ccCCCEEEEEeCCCEEEEEECcCCccceE--EEcCcccceeeeeee---eCCEEEEEEccCCeEEEEeCCCCc------E
Q 003800 50 KTGRKRVVVSTEENVIASLDLRHGEIFWR--HVLGINDVVDGIDIA---LGKYVITLSSDGSTLRAWNLPDGQ------M 118 (794)
Q Consensus 50 ~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR--~~l~~~~~i~~l~~~---~g~~~V~Vs~~g~~v~A~d~~tG~------l 118 (794)
.+..+.+|..+..|.||-||+.||.--.- -.+........+.+- ..+.+=+||..| +-.-+|+.+|. .
T Consensus 35 Rpa~G~LYgl~~~g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvvs~~G-qNlR~npdtGav~~~Dg~ 113 (236)
T PF14339_consen 35 RPANGQLYGLGSTGRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVVSNTG-QNLRLNPDTGAVTIVDGN 113 (236)
T ss_pred ecCCCCEEEEeCCCcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEEccCC-cEEEECCCCCCceeccCc
Q ss_pred eEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCcceeeeeEEE--------------
Q 003800 119 VWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQ-------------- 184 (794)
Q Consensus 119 lWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~-------------- 184 (794)
++-.......- ..+.+.. ........=.-...+||.+|...+...=+-.-..+.+.....+.
T Consensus 114 L~y~~gd~~~G-~~p~v~a---aAYTNs~~g~~t~TtLy~ID~~~~~Lv~Q~ppN~GtL~~vG~LGvd~~~~~gFDI~~~ 189 (236)
T PF14339_consen 114 LAYAAGDMNAG-TTPGVTA---AAYTNSFAGATTSTTLYDIDTTLDALVTQNPPNDGTLNTVGPLGVDAAGDAGFDIAGD 189 (236)
T ss_pred cccCCCccccC-CCCceEE---EEEecccCCCccceEEEEEecCCCeEEEecCCCCCcEEeeeccccccCcccceeeecC
Q ss_pred EecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeee
Q 003800 185 LDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAA 221 (794)
Q Consensus 185 s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~ 221 (794)
.......|.+...++ -.+|.+|+.||+...--.+.
T Consensus 190 ~~~~~~a~a~~~~~~--~~LY~vdL~TG~at~~g~i~ 224 (236)
T PF14339_consen 190 GNGGNAAYAVLGVGG--SGLYTVDLTTGAATLVGQIG 224 (236)
T ss_pred CCcceEEEEEecCCC--cEEEEEECCCcccEEeeecC
No 201
>PF09910 DUF2139: Uncharacterized protein conserved in archaea (DUF2139); InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=38.25 E-value=5.7e+02 Score=28.01 Aligned_cols=157 Identities=10% Similarity=0.116 Sum_probs=85.3
Q ss_pred cccccEeeEEeccCceeeeeeee---eccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEE
Q 003800 26 DQVGLMDWHQQYIGKVKHAVFHT---QKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITL 101 (794)
Q Consensus 26 dqvG~~dW~~~~vG~~~~~~f~~---~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~V 101 (794)
.+.++..++.+++|.+-...-++ ....++.||++. |-+.=. .+.. ..+...+-.
T Consensus 16 ~~d~~~iY~felvG~~P~SGGDTYNAV~~vDd~IyFGG----------------WVHAPa------~y~gk~~g~~~IdF 73 (339)
T PF09910_consen 16 RDDSEKIYRFELVGPPPTSGGDTYNAVEWVDDFIYFGG----------------WVHAPA------VYEGKGDGRATIDF 73 (339)
T ss_pred cCCceEEEEeeeccCCCCCCCccceeeeeecceEEEee----------------eecCCc------eeeeccCCceEEEE
Confidence 56677889999999764222221 111245555553 432110 1100 122333433
Q ss_pred EccCCeEEEEeCCCC--cEeEEEeccCccc-c---CCccccccccccccCCeEEEEECC----EEEEEECCCCcEEEEEe
Q 003800 102 SSDGSTLRAWNLPDG--QMVWESFLRGSKH-S---KPLLLVPTNLKVDKDSLILVSSKG----CLHAVSSIDGEILWTRD 171 (794)
Q Consensus 102 s~~g~~v~A~d~~tG--~llWe~~l~~~~~-s---~~~~~~~~~~~~~~~~~V~V~~~g----~l~ald~~tG~~~W~~~ 171 (794)
...=+.|..+|.++| +++|.-....+.. . .++ +. .+..+..++...|| .|+.+|..+|+..|-.+
T Consensus 74 ~NKYSHVH~yd~e~~~VrLLWkesih~~~~WaGEVSdI-lY----dP~~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~ 148 (339)
T PF09910_consen 74 RNKYSHVHEYDTENDSVRLLWKESIHDKTKWAGEVSDI-LY----DPYEDRLLLARADGHANLGVYSLDRRTGKAEKLSS 148 (339)
T ss_pred eeccceEEEEEcCCCeEEEEEecccCCccccccchhhe-ee----CCCcCEEEEEecCCcceeeeEEEcccCCceeeccC
Confidence 432368999999999 6899988765421 0 111 11 12223333334454 69999999999999887
Q ss_pred ccCcceeeeeEEEEecCCEEEEEEecC-CceeEEEEEEcCCCceee
Q 003800 172 FAAESVEVQQVIQLDESDQIYVVGYAG-SSQFHAYQINAMNGELLN 216 (794)
Q Consensus 172 ~~~~~~~~~~~v~s~~~~~vyv~~~~g-~~~~~v~ald~~tG~~~w 216 (794)
.|...- .+ ..+...|-+ ... ...-.+.|+|+.+|+.+-
T Consensus 149 ~ps~KG----~~--~~D~a~F~i-~~~~~g~~~i~~~Dli~~~~~~ 187 (339)
T PF09910_consen 149 NPSLKG----TL--VHDYACFGI-NNFHKGVSGIHCLDLISGKWVI 187 (339)
T ss_pred CCCcCc----eE--eeeeEEEec-cccccCCceEEEEEccCCeEEE
Confidence 765432 11 123333322 110 111268999999999854
No 202
>PRK02889 tolB translocation protein TolB; Provisional
Probab=37.84 E-value=6.6e+02 Score=28.58 Aligned_cols=149 Identities=13% Similarity=0.100 Sum_probs=69.7
Q ss_pred cCCCEEEEEeC---CCEEEEEECcCCccceEEEcC-cccceeeeeee-eCCEEEEEEcc-C-CeEEEEeCCCCcEeEEEe
Q 003800 51 TGRKRVVVSTE---ENVIASLDLRHGEIFWRHVLG-INDVVDGIDIA-LGKYVITLSSD-G-STLRAWNLPDGQMVWESF 123 (794)
Q Consensus 51 ~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~-~~~~i~~l~~~-~g~~~V~Vs~~-g-~~v~A~d~~tG~llWe~~ 123 (794)
+++++|++.+. ...|+..|..+|+.. .+. .++........ .|+.+++.... + ..++.+|..+|.+. +..
T Consensus 205 PDG~~la~~s~~~~~~~I~~~dl~~g~~~---~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~-~lt 280 (427)
T PRK02889 205 PDGTKLAYVSFESKKPVVYVHDLATGRRR---VVANFKGSNSAPAWSPDGRTLAVALSRDGNSQIYTVNADGSGLR-RLT 280 (427)
T ss_pred CCCCEEEEEEccCCCcEEEEEECCCCCEE---EeecCCCCccceEECCCCCEEEEEEccCCCceEEEEECCCCCcE-ECC
Confidence 34556666553 246999999999753 221 11111112112 34445554332 2 46888888766532 211
Q ss_pred ccCccccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEE-EEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800 124 LRGSKHSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEIL-WTRDFAAESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 124 l~~~~~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~-W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
-..... ..+... .+ +..+++.+ .-.++.++..+|+.. -++. .... .....+.++..+++.+..+
T Consensus 281 ~~~~~~-~~~~wS-----pD-G~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~--g~~~--~~~~~SpDG~~Ia~~s~~~ 349 (427)
T PRK02889 281 QSSGID-TEPFFS-----PD-GRSIYFTSDRGGAPQIYRMPASGGAAQRVTFT--GSYN--TSPRISPDGKLLAYISRVG 349 (427)
T ss_pred CCCCCC-cCeEEc-----CC-CCEEEEEecCCCCcEEEEEECCCCceEEEecC--CCCc--CceEECCCCCEEEEEEccC
Confidence 111111 111121 22 23344333 236777887766532 1111 1111 0111134455565555443
Q ss_pred CceeEEEEEEcCCCcee
Q 003800 199 SSQFHAYQINAMNGELL 215 (794)
Q Consensus 199 ~~~~~v~ald~~tG~~~ 215 (794)
+ ...++.+|+.+|+..
T Consensus 350 g-~~~I~v~d~~~g~~~ 365 (427)
T PRK02889 350 G-AFKLYVQDLATGQVT 365 (427)
T ss_pred C-cEEEEEEECCCCCeE
Confidence 2 246888899998765
No 203
>COG3045 CreA Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.55 E-value=95 Score=30.15 Aligned_cols=58 Identities=19% Similarity=0.231 Sum_probs=35.1
Q ss_pred ChHHHHHHHHHHHHhccccccceeecccccEeeEEeccCce--eeeeeeeeccCCCEEEEEeC
Q 003800 1 MAIRFIILTLLFLSSCTIPSLSLYEDQVGLMDWHQQYIGKV--KHAVFHTQKTGRKRVVVSTE 61 (794)
Q Consensus 1 ~~~~~~l~~l~~l~~~~~~~~Al~edqvG~~dW~~~~vG~~--~~~~f~~~~~~~~~Vyv~t~ 61 (794)
|++|.+|++.+++++++.++. .+++|+++=-...+|.- .-..|+.|...+=..|++..
T Consensus 3 ~~~~~~ll~~~~~~~l~~~a~---aE~iG~V~tvf~~~G~D~IvveafdDP~V~gVTCyvs~a 62 (165)
T COG3045 3 MKIRLLLLAGLLLLLLVGLAH---AEEIGSVSTVFDWLGNDHIVVEAFDDPDVKGVTCYVSRA 62 (165)
T ss_pred chHHHHHHHHHHHHHhccccc---hhhccccceeEEEecCCcEEEEecCCCCcCcEEEEEEEe
Confidence 678888888875555555444 45567654323334443 33568877765555777664
No 204
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=36.73 E-value=4.9e+02 Score=31.81 Aligned_cols=110 Identities=13% Similarity=0.168 Sum_probs=66.8
Q ss_pred CEEEEEEcc-CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcE--EEEEe
Q 003800 96 KYVITLSSD-GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEI--LWTRD 171 (794)
Q Consensus 96 ~~~V~Vs~~-g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~--~W~~~ 171 (794)
++-.|+||. ++++|.|+..+=++.-.+.+..-+ .++.+.| + ++..+|. .+|..+.++..+=+. .|...
T Consensus 421 DDryFiSGSLD~KvRiWsI~d~~Vv~W~Dl~~lI--TAvcy~P-----d-Gk~avIGt~~G~C~fY~t~~lk~~~~~~I~ 492 (712)
T KOG0283|consen 421 DDRYFISGSLDGKVRLWSISDKKVVDWNDLRDLI--TAVCYSP-----D-GKGAVIGTFNGYCRFYDTEGLKLVSDFHIR 492 (712)
T ss_pred CCCcEeecccccceEEeecCcCeeEeehhhhhhh--eeEEecc-----C-CceEEEEEeccEEEEEEccCCeEEEeeeEe
Confidence 456677664 789999999999988888877433 3444444 4 4566666 488888887554433 35554
Q ss_pred ccC------cceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800 172 FAA------ESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 172 ~~~------~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
... ..++-.|+.+ ...+.|.|.+.+. ++..+|..+=+++-..
T Consensus 493 ~~~~Kk~~~~rITG~Q~~p-~~~~~vLVTSnDS----rIRI~d~~~~~lv~Kf 540 (712)
T KOG0283|consen 493 LHNKKKKQGKRITGLQFFP-GDPDEVLVTSNDS----RIRIYDGRDKDLVHKF 540 (712)
T ss_pred eccCccccCceeeeeEecC-CCCCeEEEecCCC----ceEEEeccchhhhhhh
Confidence 432 2233444442 3455677666554 5666676555555444
No 205
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=36.65 E-value=6.2e+02 Score=29.58 Aligned_cols=155 Identities=12% Similarity=0.096 Sum_probs=86.3
Q ss_pred eCCEEEEEEccC-----Ce--EEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE--CC------EEEE
Q 003800 94 LGKYVITLSSDG-----ST--LRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KG------CLHA 158 (794)
Q Consensus 94 ~g~~~V~Vs~~g-----~~--v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g------~l~a 158 (794)
.++.+++++|.+ .. ++.+|..+ .+|......+.. +.+..+...... ++.++++. +. .|+.
T Consensus 69 ~~~~~~vfGG~~~~~~~~~~dl~~~d~~~--~~w~~~~~~g~~--p~~r~g~~~~~~-~~~l~lfGG~~~~~~~~~~l~~ 143 (482)
T KOG0379|consen 69 IGNKLYVFGGYGSGDRLTDLDLYVLDLES--QLWTKPAATGDE--PSPRYGHSLSAV-GDKLYLFGGTDKKYRNLNELHS 143 (482)
T ss_pred ECCEEEEECCCCCCCccccceeEEeecCC--cccccccccCCC--CCcccceeEEEE-CCeEEEEccccCCCCChhheEe
Confidence 466666666532 22 77787765 777776654432 222332211122 35555552 32 7899
Q ss_pred EECCCCcEEEEEeccCcceeee--eEEEEecCCEEEEEEecCC---ceeEEEEEEcCCCceeeeeeee---cccCccC-c
Q 003800 159 VSSIDGEILWTRDFAAESVEVQ--QVIQLDESDQIYVVGYAGS---SQFHAYQINAMNGELLNHETAA---FSGGFVG-D 229 (794)
Q Consensus 159 ld~~tG~~~W~~~~~~~~~~~~--~~v~s~~~~~vyv~~~~g~---~~~~v~ald~~tG~~~w~~~v~---~~~~~s~-~ 229 (794)
+|..|++ |+...+.+...+. .......++++|+.|..+. ..-.++++|+.+=+ |+.-.. .|+...+ .
T Consensus 144 ~d~~t~~--W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~--W~~~~~~g~~P~pR~gH~ 219 (482)
T KOG0379|consen 144 LDLSTRT--WSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETST--WSELDTQGEAPSPRYGHA 219 (482)
T ss_pred ccCCCCc--EEEecCcCCCCCCcccceEEEECCEEEEECCccCcccceeeeeeecccccc--ceecccCCCCCCCCCCce
Confidence 9988864 5554433221000 1111245688888775542 34578999998766 887432 2333333 4
Q ss_pred eEEEcCcEEEEEECC-----CCeEEEEEeecce
Q 003800 230 VALVSSDTLVTLDTT-----RSILVTVSFKNRK 257 (794)
Q Consensus 230 ~~~vg~~~lv~~d~~-----~g~L~v~~l~sg~ 257 (794)
++++++..+++.... .+.++.+||.+.+
T Consensus 220 ~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~~~ 252 (482)
T KOG0379|consen 220 MVVVGNKLLVFGGGDDGDVYLNDVHILDLSTWE 252 (482)
T ss_pred EEEECCeEEEEeccccCCceecceEeeecccce
Confidence 555676766655433 2458899988855
No 206
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=36.36 E-value=4.8e+02 Score=28.97 Aligned_cols=172 Identities=13% Similarity=0.121 Sum_probs=86.6
Q ss_pred ECcCCccceEEEcCcccceeeeeeeeC-CEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCe
Q 003800 69 DLRHGEIFWRHVLGINDVVDGIDIALG-KYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSL 147 (794)
Q Consensus 69 n~~tG~ivWR~~l~~~~~i~~l~~~~g-~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~ 147 (794)
.++++..-|-........+ ..+ +..|.|+-..+.++.||..+|+.+=++....+.. ....+.. -++.+.
T Consensus 17 S~~~~~~~~~Lk~~~q~~~-----~~~~e~~vav~lSngsv~lyd~~tg~~l~~fk~~~~~~-N~vrf~~----~ds~h~ 86 (376)
T KOG1188|consen 17 SVRVSNEDFCLKYDIQEQV-----KDGFETAVAVSLSNGSVRLYDKGTGQLLEEFKGPPATT-NGVRFIS----CDSPHG 86 (376)
T ss_pred ccccccccceeeccchhhh-----ccCcceeEEEEecCCeEEEEeccchhhhheecCCCCcc-cceEEec----CCCCCe
Confidence 3456666666555422211 112 2455555445789999999999998888766544 2333332 112345
Q ss_pred EEEE-ECCEEEEEECCCCc----EEEEEeccCcceeeeeEEEEecCCEEEEEEe-cCCceeEEEEEEcCCCce-eeeeee
Q 003800 148 ILVS-SKGCLHAVSSIDGE----ILWTRDFAAESVEVQQVIQLDESDQIYVVGY-AGSSQFHAYQINAMNGEL-LNHETA 220 (794)
Q Consensus 148 V~V~-~~g~l~ald~~tG~----~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~-~g~~~~~v~ald~~tG~~-~w~~~v 220 (794)
|+.. ++|+|...|..+-. ..|+...+. +..+.+.-..+.++..+. .-++...|+-+|...-+. +.+..-
T Consensus 87 v~s~ssDG~Vr~wD~Rs~~e~a~~~~~~~~~~----~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~qq~l~~~~e 162 (376)
T KOG1188|consen 87 VISCSSDGTVRLWDIRSQAESARISWTQQSGT----PFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQQLLRQLNE 162 (376)
T ss_pred eEEeccCCeEEEEEeecchhhhheeccCCCCC----cceEeeccCcCCeEEeccccccCceEEEEEEeccccchhhhhhh
Confidence 5555 59999999887543 456654433 223332222334444332 112334566667654333 222211
Q ss_pred ecccCccCceEEEcCc-EEEEEECCCCeEEEEEeec
Q 003800 221 AFSGGFVGDVALVSSD-TLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 221 ~~~~~~s~~~~~vg~~-~lv~~d~~~g~L~v~~l~s 255 (794)
+-.-+++.-++...++ +++... -.|-+.+.|++.
T Consensus 163 SH~DDVT~lrFHP~~pnlLlSGS-vDGLvnlfD~~~ 197 (376)
T KOG1188|consen 163 SHNDDVTQLRFHPSDPNLLLSGS-VDGLVNLFDTKK 197 (376)
T ss_pred hccCcceeEEecCCCCCeEEeec-ccceEEeeecCC
Confidence 1112333333333333 444443 346666666553
No 207
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=36.30 E-value=7.5e+02 Score=29.47 Aligned_cols=157 Identities=13% Similarity=0.163 Sum_probs=86.0
Q ss_pred eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc---------cCC----------------------cccccccccc
Q 003800 94 LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH---------SKP----------------------LLLVPTNLKV 142 (794)
Q Consensus 94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~---------s~~----------------------~~~~~~~~~~ 142 (794)
.|+.++..|....+|+.||.++=.+..+.-+..+.. +.. +|-.+-++..
T Consensus 62 DGqY~lAtG~YKP~ikvydlanLSLKFERhlDae~V~feiLsDD~SK~v~L~~DR~IefHak~G~hy~~RIP~~GRDm~y 141 (703)
T KOG2321|consen 62 DGQYLLATGTYKPQIKVYDLANLSLKFERHLDAEVVDFEILSDDYSKSVFLQNDRTIEFHAKYGRHYRTRIPKFGRDMKY 141 (703)
T ss_pred CCcEEEEecccCCceEEEEcccceeeeeecccccceeEEEeccchhhheEeecCceeeehhhcCeeeeeecCcCCccccc
Confidence 466666666677899999999999999988876653 000 0000000000
Q ss_pred cc-CCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800 143 DK-DSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETA 220 (794)
Q Consensus 143 ~~-~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v 220 (794)
+. ..++++. ++..||+|+..-|.-+=-++...+.+ .++....-..+++.|.. ...|-++|+.+-+.......
T Consensus 142 ~~~scDly~~gsg~evYRlNLEqGrfL~P~~~~~~~l---N~v~in~~hgLla~Gt~---~g~VEfwDpR~ksrv~~l~~ 215 (703)
T KOG2321|consen 142 HKPSCDLYLVGSGSEVYRLNLEQGRFLNPFETDSGEL---NVVSINEEHGLLACGTE---DGVVEFWDPRDKSRVGTLDA 215 (703)
T ss_pred cCCCccEEEeecCcceEEEEccccccccccccccccc---eeeeecCccceEEeccc---CceEEEecchhhhhheeeec
Confidence 00 1234444 57789999988887665555544433 22221223334444332 22899999988777765543
Q ss_pred ecc----cCccC-----ceEEEcCcE-EEEEECCCCeEEEEEeecce
Q 003800 221 AFS----GGFVG-----DVALVSSDT-LVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 221 ~~~----~~~s~-----~~~~vg~~~-lv~~d~~~g~L~v~~l~sg~ 257 (794)
+.. .+... ++-|-++++ ++|. ..+|+.++.||.+.+
T Consensus 216 ~~~v~s~pg~~~~~svTal~F~d~gL~~aVG-ts~G~v~iyDLRa~~ 261 (703)
T KOG2321|consen 216 ASSVNSHPGGDAAPSVTALKFRDDGLHVAVG-TSTGSVLIYDLRASK 261 (703)
T ss_pred ccccCCCccccccCcceEEEecCCceeEEee-ccCCcEEEEEcccCC
Confidence 322 11111 122223343 3344 356888888887755
No 208
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=35.62 E-value=83 Score=35.20 Aligned_cols=72 Identities=19% Similarity=0.309 Sum_probs=36.4
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-EccCCeEEEEeCCCCcEeEEEecc
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SSDGSTLRAWNLPDGQMVWESFLR 125 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~tG~llWe~~l~ 125 (794)
++..|+++..+..|-......=-.+=..-++...-+..+ ...++-..+ ++.+++||.||..+|+.+=...+.
T Consensus 162 D~~~IitaDRDEkIRvs~ypa~f~IesfclGH~eFVS~i--sl~~~~~LlS~sGD~tlr~Wd~~sgk~L~t~dl~ 234 (390)
T KOG3914|consen 162 DDQFIITADRDEKIRVSRYPATFVIESFCLGHKEFVSTI--SLTDNYLLLSGSGDKTLRLWDITSGKLLDTCDLS 234 (390)
T ss_pred CCCEEEEecCCceEEEEecCcccchhhhccccHhheeee--eeccCceeeecCCCCcEEEEecccCCcccccchh
Confidence 345566666666665554322111111112111112222 333332233 344579999999999999555544
No 209
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=35.28 E-value=1e+03 Score=30.03 Aligned_cols=107 Identities=11% Similarity=0.150 Sum_probs=56.3
Q ss_pred cceEEEcCcc--cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE
Q 003800 75 IFWRHVLGIN--DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS 152 (794)
Q Consensus 75 ivWR~~l~~~--~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~ 152 (794)
..|....=.+ .++.++-....++++.-.+.++.+|.||+..-+-+=.++-..+ ..-++..
T Consensus 239 KaWEvDtcrgH~nnVssvlfhp~q~lIlSnsEDksirVwDm~kRt~v~tfrrend------------------RFW~laa 300 (1202)
T KOG0292|consen 239 KAWEVDTCRGHYNNVSSVLFHPHQDLILSNSEDKSIRVWDMTKRTSVQTFRREND------------------RFWILAA 300 (1202)
T ss_pred cceeehhhhcccCCcceEEecCccceeEecCCCccEEEEecccccceeeeeccCC------------------eEEEEEe
Confidence 3565544322 2344331223345555355678999999975444433332221 2222222
Q ss_pred --CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCC
Q 003800 153 --KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMN 211 (794)
Q Consensus 153 --~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~t 211 (794)
...|+|---++|-.+...+...|.. + ..++.+|.+- . . .+..+|..|
T Consensus 301 hP~lNLfAAgHDsGm~VFkleRErpa~----~---v~~n~LfYvk-d-~---~i~~~d~~t 349 (1202)
T KOG0292|consen 301 HPELNLFAAGHDSGMIVFKLERERPAY----A---VNGNGLFYVK-D-R---FIRSYDLRT 349 (1202)
T ss_pred cCCcceeeeecCCceEEEEEcccCceE----E---EcCCEEEEEc-c-c---eEEeeeccc
Confidence 3555655556777888776554332 2 4667776654 2 2 577777776
No 210
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=34.54 E-value=7e+02 Score=27.94 Aligned_cols=146 Identities=20% Similarity=0.199 Sum_probs=75.7
Q ss_pred CEEEEEECcCC---ccceEEEcCcccceeeeeeeeCCEEEEEEcc---CCeEEEEeCCCCcE-eEEEeccCccccCCccc
Q 003800 63 NVIASLDLRHG---EIFWRHVLGINDVVDGIDIALGKYVITLSSD---GSTLRAWNLPDGQM-VWESFLRGSKHSKPLLL 135 (794)
Q Consensus 63 g~l~ALn~~tG---~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~---g~~v~A~d~~tG~l-lWe~~l~~~~~s~~~~~ 135 (794)
+.++.+|..++ ...|+.............-..++...+++.. .+.|.+.+..+... .|+..+..+.- ...+
T Consensus 252 s~v~~~d~~~~~~~~~~~~~l~~~~~~~~~~v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~~~--~~~l 329 (414)
T PF02897_consen 252 SEVYLLDLDDGGSPDAKPKLLSPREDGVEYYVDHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPEDE--DVSL 329 (414)
T ss_dssp EEEEEEECCCTTTSS-SEEEEEESSSS-EEEEEEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--SS--SEEE
T ss_pred CeEEEEeccccCCCcCCcEEEeCCCCceEEEEEccCCEEEEeeCCCCCCcEEEEecccccccccceeEEcCCCC--ceeE
Confidence 57999999886 7888887765433322212346666666643 36899999988875 56654433221 1111
Q ss_pred cccccccccCCeEEEE--EC--CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecC-CceeEEEEEEcC
Q 003800 136 VPTNLKVDKDSLILVS--SK--GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG-SSQFHAYQINAM 210 (794)
Q Consensus 136 ~~~~~~~~~~~~V~V~--~~--g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g-~~~~~v~ald~~ 210 (794)
... ... .+.+++. .+ .+|..++...|...-....+.... ...+......+.+++ .+.+ -.-..++.+|+.
T Consensus 330 ~~~--~~~-~~~Lvl~~~~~~~~~l~v~~~~~~~~~~~~~~p~~g~-v~~~~~~~~~~~~~~-~~ss~~~P~~~y~~d~~ 404 (414)
T PF02897_consen 330 EDV--SLF-KDYLVLSYRENGSSRLRVYDLDDGKESREIPLPEAGS-VSGVSGDFDSDELRF-SYSSFTTPPTVYRYDLA 404 (414)
T ss_dssp EEE--EEE-TTEEEEEEEETTEEEEEEEETT-TEEEEEEESSSSSE-EEEEES-TT-SEEEE-EEEETTEEEEEEEEETT
T ss_pred EEE--EEE-CCEEEEEEEECCccEEEEEECCCCcEEeeecCCcceE-EeccCCCCCCCEEEE-EEeCCCCCCEEEEEECC
Confidence 110 122 3444443 33 468888877566666666654332 111111123444443 3322 112378889999
Q ss_pred CCcee
Q 003800 211 NGELL 215 (794)
Q Consensus 211 tG~~~ 215 (794)
+|+..
T Consensus 405 t~~~~ 409 (414)
T PF02897_consen 405 TGELT 409 (414)
T ss_dssp TTCEE
T ss_pred CCCEE
Confidence 98864
No 211
>PF05567 Neisseria_PilC: Neisseria PilC beta-propeller domain; InterPro: IPR008707 This domain is found in several PilC protein sequences from Neisseria gonorrhoeae and Neisseria meningitidis. PilC is a phase-variable protein associated with pilus-mediated adherence of pathogenic Neisseria to target cells [].; PDB: 3HX6_A.
Probab=34.44 E-value=6.6e+02 Score=27.83 Aligned_cols=55 Identities=20% Similarity=0.250 Sum_probs=32.0
Q ss_pred eeEEEEEEcCC-Cceeeeeeeecc-cCccCceEEEc---C---cEEEEEECCCCeEEEEEeecce
Q 003800 201 QFHAYQINAMN-GELLNHETAAFS-GGFVGDVALVS---S---DTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 201 ~~~v~ald~~t-G~~~w~~~v~~~-~~~s~~~~~vg---~---~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
+..++.+|++| |..+|...+... .+++. +.++. + ..++..|. .|+++.+|+.+..
T Consensus 180 ~~~lyi~d~~t~G~l~~~i~~~~~~~gl~~-~~~~D~d~DG~~D~vYaGDl-~GnlwR~dl~~~~ 242 (335)
T PF05567_consen 180 GAALYILDADTTGALIKKIDVPGGSGGLSS-PAVVDSDGDGYVDRVYAGDL-GGNLWRFDLSSAN 242 (335)
T ss_dssp -EEEEEEETTT---EEEEEEE--STT-EEE-EEEE-TTSSSEE-EEEEEET-TSEEEEEE--TTS
T ss_pred CcEEEEEECCCCCceEEEEecCCCCccccc-cEEEeccCCCeEEEEEEEcC-CCcEEEEECCCCC
Confidence 47899999999 999998765443 23333 33331 1 26778886 5999999997643
No 212
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=33.85 E-value=1.3e+02 Score=33.26 Aligned_cols=98 Identities=16% Similarity=0.165 Sum_probs=54.6
Q ss_pred CEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEecc
Q 003800 96 KYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFA 173 (794)
Q Consensus 96 ~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~ 173 (794)
+.+|..|| +.+++.|+..||+-+-......-.. .. ... .+.++|. +|.++.-.|...|.-+=..+--
T Consensus 331 kyIVsASg-DRTikvW~~st~efvRtl~gHkRGI----AC-----lQY-r~rlvVSGSSDntIRlwdi~~G~cLRvLeGH 399 (499)
T KOG0281|consen 331 KYIVSASG-DRTIKVWSTSTCEFVRTLNGHKRGI----AC-----LQY-RDRLVVSGSSDNTIRLWDIECGACLRVLEGH 399 (499)
T ss_pred ceEEEecC-CceEEEEeccceeeehhhhcccccc----ee-----hhc-cCeEEEecCCCceEEEEeccccHHHHHHhch
Confidence 33444444 5799999999998886655443221 11 223 4555554 4888888888888654332211
Q ss_pred CcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800 174 AESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNG 212 (794)
Q Consensus 174 ~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG 212 (794)
+. ..+++. .++.++.-.+++| ++-..|..+|
T Consensus 400 Ee---LvRciR-Fd~krIVSGaYDG----kikvWdl~aa 430 (499)
T KOG0281|consen 400 EE---LVRCIR-FDNKRIVSGAYDG----KIKVWDLQAA 430 (499)
T ss_pred HH---hhhhee-ecCceeeeccccc----eEEEEecccc
Confidence 11 122321 3455555444455 6666666554
No 213
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=33.55 E-value=3.9e+02 Score=24.75 Aligned_cols=68 Identities=13% Similarity=0.139 Sum_probs=47.6
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS 127 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~ 127 (794)
.+-++++|++..|-.++ ..++++...-... +..+.....+...| +-.+|+|-.++. ...+|+..-...
T Consensus 15 ~~eLlvGs~D~~IRvf~--~~e~~~Ei~e~~~--v~~L~~~~~~~F~Y-~l~NGTVGvY~~--~~RlWRiKSK~~ 82 (111)
T PF14783_consen 15 ENELLVGSDDFEIRVFK--GDEIVAEITETDK--VTSLCSLGGGRFAY-ALANGTVGVYDR--SQRLWRIKSKNQ 82 (111)
T ss_pred cceEEEecCCcEEEEEe--CCcEEEEEecccc--eEEEEEcCCCEEEE-EecCCEEEEEeC--cceeeeeccCCC
Confidence 46699999999999997 4578888665544 44442223444445 444579999976 889999986554
No 214
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=32.25 E-value=5.6e+02 Score=28.99 Aligned_cols=19 Identities=16% Similarity=0.200 Sum_probs=9.0
Q ss_pred EEEEEECCCCeEEEEEeec
Q 003800 237 TLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 237 ~lv~~d~~~g~L~v~~l~s 255 (794)
.++++-...|++.+.+..+
T Consensus 294 kf~AlGT~dGsVai~~~~~ 312 (398)
T KOG0771|consen 294 KFLALGTMDGSVAIYDAKS 312 (398)
T ss_pred cEEEEeccCCcEEEEEece
Confidence 3334433455555555443
No 215
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=31.57 E-value=7.1e+02 Score=29.41 Aligned_cols=116 Identities=12% Similarity=0.157 Sum_probs=66.6
Q ss_pred CCEEEEEEc-cCCeEEEEeCCCCcEeEEEeccCccccCCcccccc-cc-ccccCCeEEEEECCEEEEEECC-CCc--EEE
Q 003800 95 GKYVITLSS-DGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPT-NL-KVDKDSLILVSSKGCLHAVSSI-DGE--ILW 168 (794)
Q Consensus 95 g~~~V~Vs~-~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~-~~-~~~~~~~V~V~~~g~l~ald~~-tG~--~~W 168 (794)
...+++.++ .-..|+-+|.+.|+++=+|.+..... -..+.+. .. .......++-+++..|+++|+. .|. ..|
T Consensus 344 dsnlil~~~~~~~~l~klDIE~GKIVeEWk~~~di~--mv~~t~d~K~~Ql~~e~TlvGLs~n~vfriDpRv~~~~kl~~ 421 (644)
T KOG2395|consen 344 DSNLILMDGGEQDKLYKLDIERGKIVEEWKFEDDIN--MVDITPDFKFAQLTSEQTLVGLSDNSVFRIDPRVQGKNKLAV 421 (644)
T ss_pred ccceEeeCCCCcCcceeeecccceeeeEeeccCCcc--eeeccCCcchhcccccccEEeecCCceEEecccccCcceeee
Confidence 445667654 34679999999999998887765411 0001000 00 0111233444689999999986 443 557
Q ss_pred EEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeee
Q 003800 169 TRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNH 217 (794)
Q Consensus 169 ~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~ 217 (794)
.....-..-.-.+|...+.+|.+.+.+..| .+.-+|. .|.+..+
T Consensus 422 ~q~kqy~~k~nFsc~aTT~sG~IvvgS~~G----dIRLYdr-i~~~AKT 465 (644)
T KOG2395|consen 422 VQSKQYSTKNNFSCFATTESGYIVVGSLKG----DIRLYDR-IGRRAKT 465 (644)
T ss_pred eeccccccccccceeeecCCceEEEeecCC----cEEeehh-hhhhhhh
Confidence 654432221234666556778888777776 4444554 5555433
No 216
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=30.94 E-value=2.2e+02 Score=32.99 Aligned_cols=75 Identities=12% Similarity=0.174 Sum_probs=57.3
Q ss_pred ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800 50 KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS 127 (794)
Q Consensus 50 ~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~ 127 (794)
+.++-+++++.-+|.|-+-|.++|+.+=++++... |-.+..--.++++.||-.++.+..+.. +|....+..+...
T Consensus 560 s~dGtklWTGGlDntvRcWDlregrqlqqhdF~SQ--IfSLg~cP~~dWlavGMens~vevlh~-skp~kyqlhlheS 634 (705)
T KOG0639|consen 560 SKDGTKLWTGGLDNTVRCWDLREGRQLQQHDFSSQ--IFSLGYCPTGDWLAVGMENSNVEVLHT-SKPEKYQLHLHES 634 (705)
T ss_pred cCCCceeecCCCccceeehhhhhhhhhhhhhhhhh--heecccCCCccceeeecccCcEEEEec-CCccceeeccccc
Confidence 33466799999999999999999999988888766 333321246789998877778988875 7888888776653
No 217
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=30.78 E-value=1.1e+02 Score=21.85 Aligned_cols=30 Identities=10% Similarity=0.249 Sum_probs=21.7
Q ss_pred CCEEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800 188 SDQIYVVGYAGSSQFHAYQINAMNGELLNHETA 220 (794)
Q Consensus 188 ~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v 220 (794)
+..+|+....++ .+..+|+.+++.+.+..+
T Consensus 3 ~~~lyv~~~~~~---~v~~id~~~~~~~~~i~v 32 (42)
T TIGR02276 3 GTKLYVTNSGSN---TVSVIDTATNKVIATIPV 32 (42)
T ss_pred CCEEEEEeCCCC---EEEEEECCCCeEEEEEEC
Confidence 456887654433 788899999988877654
No 218
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=30.12 E-value=8.6e+02 Score=27.64 Aligned_cols=182 Identities=13% Similarity=0.143 Sum_probs=84.3
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcc--c-----ceeeee---eeeCC-----EEEEEEccCCeEEEEeCC-CC
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGIN--D-----VVDGID---IALGK-----YVITLSSDGSTLRAWNLP-DG 116 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~--~-----~i~~l~---~~~g~-----~~V~Vs~~g~~v~A~d~~-tG 116 (794)
=+-|-++.++|.|.-+|.|--+++-+..+.+. . .+..+. ...++ -.++||++.|.+..|... .+
T Consensus 97 iGFvaigy~~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~ySSi~L~vGTn~G~v~~fkIlp~~ 176 (395)
T PF08596_consen 97 IGFVAIGYESGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYSSICLLVGTNSGNVLTFKILPSS 176 (395)
T ss_dssp TSEEEEEETTSEEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSEEEEEEEEETTSEEEEEEEEE-G
T ss_pred CcEEEEEecCCcEEEEECCCCeEEeeccccccccccccccCeeEEEEEEEecCCCcccceEEEEEeCCCCEEEEEEecCC
Confidence 34578888999999999999999999887661 0 111111 11222 245666666788888654 34
Q ss_pred cEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEe
Q 003800 117 QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGY 196 (794)
Q Consensus 117 ~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~ 196 (794)
.-.|+....+..... ++. -..+..+|.++|+..+........+..... .++.+.+++.
T Consensus 177 ~g~f~v~~~~~~~~~-------------~~~-----i~~I~~i~~~~G~~a~At~~~~~~l~~g~~----i~g~vVvvSe 234 (395)
T PF08596_consen 177 NGRFSVQFAGATTNH-------------DSP-----ILSIIPINADTGESALATISAMQGLSKGIS----IPGYVVVVSE 234 (395)
T ss_dssp GG-EEEEEEEEE--S-------------S---------EEEEEETTT--B-B-BHHHHHGGGGT--------EEEEEE-S
T ss_pred CCceEEEEeeccccC-------------CCc-----eEEEEEEECCCCCcccCchhHhhccccCCC----cCcEEEEEcc
Confidence 455776654321000 111 113556688888776553221111100000 1223333322
Q ss_pred cCCceeEEEEEEcCCCceeeeeeeecccCccCceEEE------cCcEEEEEECCCCeEEEEEeecceeeeEEEee
Q 003800 197 AGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALV------SSDTLVTLDTTRSILVTVSFKNRKIAFQETHL 265 (794)
Q Consensus 197 ~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v------g~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l 265 (794)
. .+..+.+.+++...... ..+ -+...+.++ ++..++|+.. +|.+.+..|-.=+ ++.++.+
T Consensus 235 ~-----~irv~~~~~~k~~~K~~-~~~-~~~~~~~vv~~~~~~~~~~Lv~l~~-~G~i~i~SLP~Lk-ei~~~~l 300 (395)
T PF08596_consen 235 S-----DIRVFKPPKSKGAHKSF-DDP-FLCSSASVVPTISRNGGYCLVCLFN-NGSIRIYSLPSLK-EIKSVSL 300 (395)
T ss_dssp S-----EEEEE-TT---EEEEE--SS--EEEEEEEEEEEE-EEEEEEEEEEET-TSEEEEEETTT---EEEEEE-
T ss_pred c-----ceEEEeCCCCcccceee-ccc-cccceEEEEeecccCCceEEEEEEC-CCcEEEEECCCch-HhhcccC
Confidence 2 35556666666543332 111 112222222 4457888874 6888888877633 2454544
No 219
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=30.08 E-value=1.1e+02 Score=33.61 Aligned_cols=72 Identities=14% Similarity=0.182 Sum_probs=42.7
Q ss_pred CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccC
Q 003800 52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRG 126 (794)
Q Consensus 52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~ 126 (794)
+++.|+.++.+-.+-.-|..||+-+=. .-+....|..+ .-.+..|+-|+.+.++|.||+..|..+--.+...
T Consensus 329 d~kyIVsASgDRTikvW~~st~efvRt-l~gHkRGIACl--QYr~rlvVSGSSDntIRlwdi~~G~cLRvLeGHE 400 (499)
T KOG0281|consen 329 DDKYIVSASGDRTIKVWSTSTCEFVRT-LNGHKRGIACL--QYRDRLVVSGSSDNTIRLWDIECGACLRVLEGHE 400 (499)
T ss_pred ccceEEEecCCceEEEEeccceeeehh-hhcccccceeh--hccCeEEEecCCCceEEEEeccccHHHHHHhchH
Confidence 355577777788888888777754321 11111123333 2233344324457899999999999885544443
No 220
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=29.90 E-value=5.4e+02 Score=31.75 Aligned_cols=76 Identities=16% Similarity=0.172 Sum_probs=49.7
Q ss_pred EEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEE-EE-ECCEEEEEECCCCcEEEEEeccCc
Q 003800 98 VITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLIL-VS-SKGCLHAVSSIDGEILWTRDFAAE 175 (794)
Q Consensus 98 ~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~-V~-~~g~l~ald~~tG~~~W~~~~~~~ 175 (794)
++.++...|++..||-..|..+=+..-..... +++..++ ..+...+++ +. ....+.-.+..||+..|++.....
T Consensus 81 liAsaD~~GrIil~d~~~~s~~~~l~~~~~~~-qdl~W~~---~rd~Srd~LlaIh~ss~lvLwntdtG~k~Wk~~ys~~ 156 (1062)
T KOG1912|consen 81 LIASADISGRIILVDFVLASVINWLSHSNDSV-QDLCWVP---ARDDSRDVLLAIHGSSTLVLWNTDTGEKFWKYDYSHE 156 (1062)
T ss_pred eEEeccccCcEEEEEehhhhhhhhhcCCCcch-hheeeee---ccCcchheeEEecCCcEEEEEEccCCceeeccccCCc
Confidence 44444446799999999986654444443333 5666665 233233444 34 477888999999999999987655
Q ss_pred ce
Q 003800 176 SV 177 (794)
Q Consensus 176 ~~ 177 (794)
.+
T Consensus 157 iL 158 (1062)
T KOG1912|consen 157 IL 158 (1062)
T ss_pred ce
Confidence 44
No 221
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=29.64 E-value=2.1e+02 Score=34.75 Aligned_cols=69 Identities=13% Similarity=0.260 Sum_probs=45.9
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEecc
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLR 125 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~ 125 (794)
+.+++.+-.+---|..+|..+=++. +...++..+.....+..+.-++.++.|.-||..+|+++=+....
T Consensus 550 ~aTGSsD~tVRlWDv~~G~~VRiF~-GH~~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~H 618 (707)
T KOG0263|consen 550 VATGSSDRTVRLWDVSTGNSVRIFT-GHKGPVTALAFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGH 618 (707)
T ss_pred cccCCCCceEEEEEcCCCcEEEEec-CCCCceEEEEEcCCCceEeecccCCcEEEEEcCCCcchhhhhcc
Confidence 5555567889999999998865542 23344555533333334443555789999999999998766655
No 222
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=29.58 E-value=8.7e+02 Score=27.50 Aligned_cols=64 Identities=20% Similarity=0.279 Sum_probs=40.5
Q ss_pred CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E-CCEEEEEECCCCcEE
Q 003800 95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEIL 167 (794)
Q Consensus 95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~ 167 (794)
+..++. |+.+.++|-||..|-.+.-.......=. . .+ +...++..++. + +|.+...|+++|++.
T Consensus 127 g~~l~t-GsGD~TvR~WD~~TeTp~~t~KgH~~WV-l---cv----awsPDgk~iASG~~dg~I~lwdpktg~~~ 192 (480)
T KOG0271|consen 127 GSRLVT-GSGDTTVRLWDLDTETPLFTCKGHKNWV-L---CV----AWSPDGKKIASGSKDGSIRLWDPKTGQQI 192 (480)
T ss_pred CceEEe-cCCCceEEeeccCCCCcceeecCCccEE-E---EE----EECCCcchhhccccCCeEEEecCCCCCcc
Confidence 334444 4446899999999988887777654311 0 11 22224555554 3 899999999888653
No 223
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=29.50 E-value=9.2e+02 Score=27.74 Aligned_cols=75 Identities=11% Similarity=0.033 Sum_probs=51.1
Q ss_pred cCCCEEEEEeCCCEEEEEECcCCccceEEEcCccc-ceeeeeeeeCCEEEEEE-ccCCeEEEEeCCCCcEeEEEeccC
Q 003800 51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND-VVDGIDIALGKYVITLS-SDGSTLRAWNLPDGQMVWESFLRG 126 (794)
Q Consensus 51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~-~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~tG~llWe~~l~~ 126 (794)
+.++-++-++.++..+=-|.++|..+=.+.-+..+ .+... ...-++.++.. ..++.|+-||..++...=.+....
T Consensus 313 ~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~-~fHpDgLifgtgt~d~~vkiwdlks~~~~a~Fpght 389 (506)
T KOG0289|consen 313 PTGEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSA-AFHPDGLIFGTGTPDGVVKIWDLKSQTNVAKFPGHT 389 (506)
T ss_pred cCCcEEEEecCCceEEEEEccCCcEEEEEeeccccceeEEe-eEcCCceEEeccCCCceEEEEEcCCccccccCCCCC
Confidence 34667888888999888899999988777665331 22222 12345566654 457899999999988665555443
No 224
>PRK01742 tolB translocation protein TolB; Provisional
Probab=28.87 E-value=9e+02 Score=27.45 Aligned_cols=183 Identities=15% Similarity=0.127 Sum_probs=83.4
Q ss_pred CCEE-EEEeCC-----CEEEEEECcCCccceEEEcCcc-cceeeeee-eeCCEEEEEEcc--CCeEEEEeCCCCcEeEEE
Q 003800 53 RKRV-VVSTEE-----NVIASLDLRHGEIFWRHVLGIN-DVVDGIDI-ALGKYVITLSSD--GSTLRAWNLPDGQMVWES 122 (794)
Q Consensus 53 ~~~V-yv~t~~-----g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~-~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~ 122 (794)
..+| |+.+.. ..|.-.|.. |.-. +.+... ..+..... ..|+.+++++.. +..++.||..+|+..--.
T Consensus 168 ~~ria~v~~~~~~~~~~~i~i~d~d-g~~~--~~lt~~~~~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~ 244 (429)
T PRK01742 168 RTRIAYVVQKNGGSQPYEVRVADYD-GFNQ--FIVNRSSQPLMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKVVA 244 (429)
T ss_pred CCEEEEEEEEcCCCceEEEEEECCC-CCCc--eEeccCCCccccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEEEe
Confidence 3444 766542 366666764 4332 233222 11222211 256667776643 357999999999754333
Q ss_pred eccCccccCCccccccccccccCCeEEEE-E-CC--EEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800 123 FLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KG--CLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG 198 (794)
Q Consensus 123 ~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g--~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g 198 (794)
...+.. ..+.+. ++ ++.+++. . +| .++.+|..+|+..=-...... ...+..+.++..+++.+..+
T Consensus 245 ~~~g~~--~~~~wS-----PD-G~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~---~~~~~wSpDG~~i~f~s~~~ 313 (429)
T PRK01742 245 SFRGHN--GAPAFS-----PD-GSRLAFASSKDGVLNIYVMGANGGTPSQLTSGAGN---NTEPSWSPDGQSILFTSDRS 313 (429)
T ss_pred cCCCcc--CceeEC-----CC-CCEEEEEEecCCcEEEEEEECCCCCeEeeccCCCC---cCCEEECCCCCEEEEEECCC
Confidence 332211 111121 22 2334443 2 44 477888877764211111111 11122133444555444322
Q ss_pred CceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800 199 SSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK 257 (794)
Q Consensus 199 ~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~ 257 (794)
....++.++..+|..... . ... ........+..++.... ..+...|+.+|+
T Consensus 314 -g~~~I~~~~~~~~~~~~l---~-~~~-~~~~~SpDG~~ia~~~~--~~i~~~Dl~~g~ 364 (429)
T PRK01742 314 -GSPQVYRMSASGGGASLV---G-GRG-YSAQISADGKTLVMING--DNVVKQDLTSGS 364 (429)
T ss_pred -CCceEEEEECCCCCeEEe---c-CCC-CCccCCCCCCEEEEEcC--CCEEEEECCCCC
Confidence 234778888877654321 1 111 11111113334444432 356668888776
No 225
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=28.17 E-value=8.1e+02 Score=26.81 Aligned_cols=75 Identities=21% Similarity=0.171 Sum_probs=45.0
Q ss_pred cCCCEEEEEeC-CCEEEEEECc--CCccceE----EEcCcccceeeeeeeeCCEEEEEEc-c-CCeEEEEeCCCCcEeEE
Q 003800 51 TGRKRVVVSTE-ENVIASLDLR--HGEIFWR----HVLGINDVVDGIDIALGKYVITLSS-D-GSTLRAWNLPDGQMVWE 121 (794)
Q Consensus 51 ~~~~~Vyv~t~-~g~l~ALn~~--tG~ivWR----~~l~~~~~i~~l~~~~g~~~V~Vs~-~-g~~v~A~d~~tG~llWe 121 (794)
++++.+|++.- .+.|.+++.. +|.+-=| ..-..++..+++. ...++.+.++. . |+.|..|++. |+++=+
T Consensus 172 pDg~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~-vDadG~lw~~a~~~g~~v~~~~pd-G~l~~~ 249 (307)
T COG3386 172 PDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMA-VDADGNLWVAAVWGGGRVVRFNPD-GKLLGE 249 (307)
T ss_pred CCCCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceE-EeCCCCEEEecccCCceEEEECCC-CcEEEE
Confidence 34556777665 4778777553 3444333 2112223344553 45556665433 2 3489999997 999999
Q ss_pred EeccCc
Q 003800 122 SFLRGS 127 (794)
Q Consensus 122 ~~l~~~ 127 (794)
..+...
T Consensus 250 i~lP~~ 255 (307)
T COG3386 250 IKLPVK 255 (307)
T ss_pred EECCCC
Confidence 998743
No 226
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appears to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold; however, this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=27.01 E-value=1.2e+02 Score=27.45 Aligned_cols=24 Identities=17% Similarity=0.188 Sum_probs=19.6
Q ss_pred cEEEEEEEEceeeeEEEEEEecCCC
Q 003800 731 AWLVVYLIDTITGRILHRMTHHGAQ 755 (794)
Q Consensus 731 ~~l~v~liD~VTG~il~s~~h~~~~ 755 (794)
..++|.+.| .+|+++|+.......
T Consensus 48 ~~vtI~I~d-~~G~vVy~~~~~~~~ 71 (106)
T PF11589_consen 48 GDVTITIKD-STGNVVYSETVSNSA 71 (106)
T ss_dssp SEEEEEEEE-TT--EEEEEEESCGG
T ss_pred CCEEEEEEe-CCCCEEEEEEccCCC
Confidence 689999999 999999999988853
No 227
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function; it possibly regulates import of fructose-1,6-bisphosphatase (FBPase) into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of FBPase [, ].
Probab=27.00 E-value=2.6e+02 Score=34.79 Aligned_cols=63 Identities=11% Similarity=0.171 Sum_probs=45.3
Q ss_pred CCEEEEEeCCCEEEEEECcCC--ccceEEEcCc--ccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCc
Q 003800 53 RKRVVVSTEENVIASLDLRHG--EIFWRHVLGI--NDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQ 117 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG--~ivWR~~l~~--~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~ 117 (794)
....|++-.+|.|..+|||-. +++|.+.-.. .-.+.++ ++.++|.++||+..|.||.||. .|+
T Consensus 542 ~e~tflGls~n~lfriDpR~~~~k~v~~~~k~Y~~~~~Fs~~-aTt~~G~iavgs~~G~IRLyd~-~g~ 608 (794)
T PF08553_consen 542 NEQTFLGLSDNSLFRIDPRLSGNKLVDSQSKQYSSKNNFSCF-ATTEDGYIAVGSNKGDIRLYDR-LGK 608 (794)
T ss_pred CCceEEEECCCceEEeccCCCCCceeeccccccccCCCceEE-EecCCceEEEEeCCCcEEeecc-cch
Confidence 445899999999999999974 3677654322 2224444 4678888888887789999995 563
No 228
>PRK02889 tolB translocation protein TolB; Provisional
Probab=26.73 E-value=9.8e+02 Score=27.16 Aligned_cols=149 Identities=13% Similarity=0.153 Sum_probs=69.2
Q ss_pred eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEE-EE-C--CEEEEEECCCCcEE
Q 003800 94 LGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-SS-K--GCLHAVSSIDGEIL 167 (794)
Q Consensus 94 ~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~-~--g~l~ald~~tG~~~ 167 (794)
.|+.+++++.. ...++.||..+|+..--....+.. ..+.+. .+ ++.+++ .. + ..++.+|..+|...
T Consensus 206 DG~~la~~s~~~~~~~I~~~dl~~g~~~~l~~~~g~~--~~~~~S-----PD-G~~la~~~~~~g~~~Iy~~d~~~~~~~ 277 (427)
T PRK02889 206 DGTKLAYVSFESKKPVVYVHDLATGRRRVVANFKGSN--SAPAWS-----PD-GRTLAVALSRDGNSQIYTVNADGSGLR 277 (427)
T ss_pred CCCEEEEEEccCCCcEEEEEECCCCCEEEeecCCCCc--cceEEC-----CC-CCEEEEEEccCCCceEEEEECCCCCcE
Confidence 45556666532 256999999999764222222211 111111 22 234443 32 3 36888888776532
Q ss_pred EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCCC
Q 003800 168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTRS 246 (794)
Q Consensus 168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~g 246 (794)
........ ...+..+.++..+++.+..++ ...++.+|..+|+..- +.........+.+ ..++.++......+
T Consensus 278 -~lt~~~~~--~~~~~wSpDG~~l~f~s~~~g-~~~Iy~~~~~~g~~~~---lt~~g~~~~~~~~SpDG~~Ia~~s~~~g 350 (427)
T PRK02889 278 -RLTQSSGI--DTEPFFSPDGRSIYFTSDRGG-APQIYRMPASGGAAQR---VTFTGSYNTSPRISPDGKLLAYISRVGG 350 (427)
T ss_pred -ECCCCCCC--CcCeEEcCCCCEEEEEecCCC-CcEEEEEECCCCceEE---EecCCCCcCceEECCCCCEEEEEEccCC
Confidence 11111111 011112344555655443322 3478888887775321 1111111111222 12334444432333
Q ss_pred --eEEEEEeecce
Q 003800 247 --ILVTVSFKNRK 257 (794)
Q Consensus 247 --~L~v~~l~sg~ 257 (794)
.+++.++.+++
T Consensus 351 ~~~I~v~d~~~g~ 363 (427)
T PRK02889 351 AFKLYVQDLATGQ 363 (427)
T ss_pred cEEEEEEECCCCC
Confidence 58888888776
No 229
>PF01456 Mucin: Mucin-like glycoprotein; InterPro: IPR000458 This family of trypanosomal proteins resembles vertebrate mucins. The protein consists of three regions. The N and C termini are conserved between all members of the family, whereas the central region is not well conserved and contains a large number of threonine residues which can be glycosylated []. Indirect evidence suggested that these genes might encode the core protein of parasite mucins, glycoproteins that were proposed to be involved in the interaction with, and invasion of, mammalian host cells.
Probab=26.14 E-value=49 Score=31.67 Aligned_cols=27 Identities=26% Similarity=0.422 Sum_probs=17.8
Q ss_pred ChHHHHHHHHHHHHhccccccceeecc
Q 003800 1 MAIRFIILTLLFLSSCTIPSLSLYEDQ 27 (794)
Q Consensus 1 ~~~~~~l~~l~~l~~~~~~~~Al~edq 27 (794)
|=-++|||+||+|++|.-++-..-+.+
T Consensus 1 MmtcRLLCalLvlaLcCCpsvc~t~~~ 27 (143)
T PF01456_consen 1 MMTCRLLCALLVLALCCCPSVCATASE 27 (143)
T ss_pred CchHHHHHHHHHHHHHcCcchhccccc
Confidence 335789999999999763333333433
No 230
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=25.83 E-value=1.1e+03 Score=27.48 Aligned_cols=195 Identities=14% Similarity=0.150 Sum_probs=92.0
Q ss_pred eeeCCEEEEEEcc-C-CeEEEEeCCCCcEeEE-EeccCccccCCccccccccccccCCeEEE-EECCEEEEEECCCCcEE
Q 003800 92 IALGKYVITLSSD-G-STLRAWNLPDGQMVWE-SFLRGSKHSKPLLLVPTNLKVDKDSLILV-SSKGCLHAVSSIDGEIL 167 (794)
Q Consensus 92 ~~~g~~~V~Vs~~-g-~~v~A~d~~tG~llWe-~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~~g~l~ald~~tG~~~ 167 (794)
+-+++.+.+++.. | |++++-|. +|+-+-+ +.+..... . ....++.-+| ...|.++.+|+++-++.
T Consensus 232 mIV~~RvYFlsD~eG~GnlYSvdl-dGkDlrrHTnFtdYY~-R---------~~nsDGkrIvFq~~GdIylydP~td~le 300 (668)
T COG4946 232 MIVGERVYFLSDHEGVGNLYSVDL-DGKDLRRHTNFTDYYP-R---------NANSDGKRIVFQNAGDIYLYDPETDSLE 300 (668)
T ss_pred eEEcceEEEEecccCccceEEecc-CCchhhhcCCchhccc-c---------ccCCCCcEEEEecCCcEEEeCCCcCcce
Confidence 4578888888863 3 68999997 5655443 34333221 1 1122444444 46899999999876542
Q ss_pred -EEEeccCcce-eeeeEE-E-------EecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcE
Q 003800 168 -WTRDFAAESV-EVQQVI-Q-------LDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDT 237 (794)
Q Consensus 168 -W~~~~~~~~~-~~~~~v-~-------s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~ 237 (794)
=...+|...- ...+.+ + +..+|..++...-| +....++-.|-.+ ++..+.++-=....+..+-
T Consensus 301 kldI~lpl~rk~k~~k~~~pskyledfa~~~Gd~ia~VSRG----kaFi~~~~~~~~i---qv~~~~~VrY~r~~~~~e~ 373 (668)
T COG4946 301 KLDIGLPLDRKKKQPKFVNPSKYLEDFAVVNGDYIALVSRG----KAFIMRPWDGYSI---QVGKKGGVRYRRIQVDPEG 373 (668)
T ss_pred eeecCCccccccccccccCHHHhhhhhccCCCcEEEEEecC----cEEEECCCCCeeE---EcCCCCceEEEEEccCCcc
Confidence 2222221100 000000 0 22344444433344 4555554433222 1111111111111112223
Q ss_pred EEEEECCCCeEEEEEeecceeeeEEEeecccCCCCCCceEEeecCCcceeEEEecCcEEEEEEe-cCCcEEEEEee
Q 003800 238 LVTLDTTRSILVTVSFKNRKIAFQETHLSNLGEDSSGMVEILPSSLTGMFTVKINNYKLFIRLT-SEDKLEVVHKV 312 (794)
Q Consensus 238 lv~~d~~~g~L~v~~l~sg~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~v~~~~ 312 (794)
++..+.+...|-+.+.++++ ++.+- ++-+.++......+|-+++-.+++..|.-++ ++|.+.+.+.-
T Consensus 374 ~vigt~dgD~l~iyd~~~~e--~kr~e------~~lg~I~av~vs~dGK~~vvaNdr~el~vididngnv~~idkS 441 (668)
T COG4946 374 DVIGTNDGDKLGIYDKDGGE--VKRIE------KDLGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVRLIDKS 441 (668)
T ss_pred eEEeccCCceEEEEecCCce--EEEee------CCccceEEEEEcCCCcEEEEEcCceEEEEEEecCCCeeEeccc
Confidence 34444444578888888877 33332 1113344444445566666666644444444 47776665533
No 231
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminus of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=25.35 E-value=9.8e+02 Score=26.73 Aligned_cols=65 Identities=9% Similarity=-0.044 Sum_probs=37.7
Q ss_pred CEEEEEECCCC---cEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCce-eeeeee
Q 003800 154 GCLHAVSSIDG---EILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGEL-LNHETA 220 (794)
Q Consensus 154 g~l~ald~~tG---~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~-~w~~~v 220 (794)
..++.++..++ ...|..-.+...-....+ ...++.+|+.+..+....+|+++++.+... -|+..+
T Consensus 252 s~v~~~d~~~~~~~~~~~~~l~~~~~~~~~~v--~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l 320 (414)
T PF02897_consen 252 SEVYLLDLDDGGSPDAKPKLLSPREDGVEYYV--DHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVL 320 (414)
T ss_dssp EEEEEEECCCTTTSS-SEEEEEESSSS-EEEE--EEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEE
T ss_pred CeEEEEeccccCCCcCCcEEEeCCCCceEEEE--EccCCEEEEeeCCCCCCcEEEEecccccccccceeEE
Confidence 57888888875 445544332111111112 235888998887766678999999998886 355433
No 232
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=25.35 E-value=2e+02 Score=34.02 Aligned_cols=31 Identities=23% Similarity=0.414 Sum_probs=25.6
Q ss_pred EEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800 98 VITLSSDGSTLRAWNLPDGQMVWESFLRGSK 128 (794)
Q Consensus 98 ~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~ 128 (794)
.+.-|+++|.||.|...||+-+|.+.+.+..
T Consensus 414 wlasGsdDGtvriWEi~TgRcvr~~~~d~~I 444 (733)
T KOG0650|consen 414 WLASGSDDGTVRIWEIATGRCVRTVQFDSEI 444 (733)
T ss_pred eeeecCCCCcEEEEEeecceEEEEEeeccee
Confidence 4443566789999999999999999998754
No 233
>COG4447 Uncharacterized protein related to plant photosystem II stability/assembly factor [General function prediction only]
Probab=24.48 E-value=6.6e+02 Score=27.33 Aligned_cols=174 Identities=14% Similarity=0.205 Sum_probs=86.1
Q ss_pred CEEEEEeCCCEEEEEECcCCccceEEEcCcccce---eeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcccc
Q 003800 54 KRVVVSTEENVIASLDLRHGEIFWRHVLGINDVV---DGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS 130 (794)
Q Consensus 54 ~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i---~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s 130 (794)
++=+..++.| +-+-.+||-.-|+...+..... .....+.+++.|.|+..|.-.+.|++ |+-.|.-.-..+.
T Consensus 139 q~g~m~gd~G--ail~T~DgGk~Wk~l~e~~v~~~~~n~ia~s~dng~vaVg~rGs~f~T~~a--Gqt~~~~~g~~s~-- 212 (339)
T COG4447 139 QRGEMLGDQG--AILKTTDGGKNWKALVEKAVGLAVPNEIARSADNGYVAVGARGSFFSTWGA--GQTVWLPHGRNSS-- 212 (339)
T ss_pred hhhhhhcccc--eEEEecCCcccHhHhcccccchhhhhhhhhhccCCeEEEecCcceEecCCC--CccEEeccCCCcc--
Confidence 3344455556 4567789999999877765321 11112456778888888877888886 8886655443322
Q ss_pred CCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCccee--eeeEEE--EecCCEEEEEEecCCceeEEEE
Q 003800 131 KPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAESVE--VQQVIQ--LDESDQIYVVGYAGSSQFHAYQ 206 (794)
Q Consensus 131 ~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~~~--~~~~v~--s~~~~~vyv~~~~g~~~~~v~a 206 (794)
.....++. ..++..-+++.......-.+...| --|+-........ +..+.+ -.+++.+|+.+..|+ |
T Consensus 213 ~~letmg~--adag~~g~la~g~qg~~f~~~~~g-D~wsd~~~~~~~g~~~~Gl~d~a~~a~~~v~v~G~gGn----v-- 283 (339)
T COG4447 213 RRLETMGL--ADAGSKGLLARGGQGDQFSWVCGG-DEWSDQGEPVNLGRRSWGLLDFAPRAPPEVWVSGIGGN----V-- 283 (339)
T ss_pred chhccccc--ccCCccceEEEccccceeecCCCc-ccccccccchhcccCCCccccccccCCCCeEEeccCcc----E--
Confidence 22233331 112122455543211222232333 3454322110000 001110 136788998777552 2
Q ss_pred EEcCCCceeeeeeeecccCccC--ceEEEcCc-EEEEEE
Q 003800 207 INAMNGELLNHETAAFSGGFVG--DVALVSSD-TLVTLD 242 (794)
Q Consensus 207 ld~~tG~~~w~~~v~~~~~~s~--~~~~vg~~-~lv~~d 242 (794)
+-...|-..|+.....+..+++ ++++.+.+ -++|.+
T Consensus 284 l~StdgG~t~skd~g~~er~s~l~~V~~ts~~~~~l~Gq 322 (339)
T COG4447 284 LASTDGGTTWSKDGGVEERVSNLYSVVFTSPKAGFLCGQ 322 (339)
T ss_pred EEecCCCeeEeccCChhhhhhhhheEEeccCCceEEEcC
Confidence 2235677788875544433332 34443332 445554
No 234
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=24.32 E-value=3.3e+02 Score=32.36 Aligned_cols=70 Identities=11% Similarity=0.181 Sum_probs=44.1
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH 129 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~ 129 (794)
.+.++++|++| |.-+|+.+|+++=+-..+....|..+.....+ -+.|+++. .++-.+++. |+..-.+..+
T Consensus 175 ~g~lWvgT~dG-L~~fd~~~gkalql~s~~~dk~I~al~~d~qg-~LWVGTdq-Gv~~~e~~G----~~~sn~~~~l 244 (671)
T COG3292 175 NGRLWVGTPDG-LSYFDAGRGKALQLASPPLDKAINALIADVQG-RLWVGTDQ-GVYLQEAEG----WRASNWGPML 244 (671)
T ss_pred cCcEEEecCCc-ceEEccccceEEEcCCCcchhhHHHHHHHhcC-cEEEEecc-ceEEEchhh----ccccccCCCC
Confidence 67899999999 78899999998765444433334433122333 34446653 377777654 7777655443
No 235
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=24.27 E-value=9.4e+02 Score=26.11 Aligned_cols=57 Identities=21% Similarity=0.326 Sum_probs=31.7
Q ss_pred ECcCCccceEEEcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800 69 DLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS 127 (794)
Q Consensus 69 n~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~ 127 (794)
|.+.|.++=|..-- ...+...... .|+..+. ++.++++|.||.++|+..=++..+..
T Consensus 49 d~~~G~~~r~~~GH-sH~v~dv~~s~dg~~alS-~swD~~lrlWDl~~g~~t~~f~GH~~ 106 (315)
T KOG0279|consen 49 DIKYGVPVRRLTGH-SHFVSDVVLSSDGNFALS-ASWDGTLRLWDLATGESTRRFVGHTK 106 (315)
T ss_pred ccccCceeeeeecc-ceEecceEEccCCceEEe-ccccceEEEEEecCCcEEEEEEecCC
Confidence 55566655554331 1112222111 2333333 34468999999999988877776653
No 236
>TIGR00548 lolB outer membrane lipoprotein LolB. This protein, LolB, is known so far only in the gamma and beta subdivisions of the Proteobacteria. It is a processed, lipid-modified outer membrane protein. It is required in E. coli for insertion of the major outer lipoprotein (Lpp) into the outer membrane. Lpp is transferred to LolB from the carrier protein LolA in the periplasm. Previously, this protein was thought to play a role in 5-aminolevulinic acid synthesis and was designated HemM.
Probab=22.40 E-value=1.4e+02 Score=30.45 Aligned_cols=58 Identities=14% Similarity=0.137 Sum_probs=32.0
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcE
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQM 118 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~l 118 (794)
.+++-+-+.+. .-+|...|++.-+....+... -..|...+-+.++++.+...+ .+|+.
T Consensus 51 ~Gria~~~~~~------~~sa~~~W~q~~~~~~~l~L~-~PlG~~~~~l~~~~~~v~l~~-~~g~~ 108 (202)
T TIGR00548 51 DGKVGYISPRD------SGSGRFFWQQRNQGYYDLRLS-GPLGRGALRLTGREGAVSLED-NGGGR 108 (202)
T ss_pred eeeEEEECCCc------eeEEEEEEEECCCCceEEEEE-ccCCCcEEEEEEcCCEEEEEE-CCCCE
Confidence 55666666553 234556799985444334322 135666666655555566655 45554
No 237
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=21.38 E-value=4.8e+02 Score=33.84 Aligned_cols=70 Identities=20% Similarity=0.163 Sum_probs=54.0
Q ss_pred EEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800 55 RVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 55 ~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l 124 (794)
.|.++|.-+.+...|.++-.-+||.+.+.. +.+..+.+.....++++|+..|.+..||..=+.++=++..
T Consensus 1165 ~lvy~T~~~~iv~~D~r~~~~~w~lk~~~~hG~vTSi~idp~~~WlviGts~G~l~lWDLRF~~~i~sw~~ 1235 (1431)
T KOG1240|consen 1165 VLVYATDLSRIVSWDTRMRHDAWRLKNQLRHGLVTSIVIDPWCNWLVIGTSRGQLVLWDLRFRVPILSWEH 1235 (1431)
T ss_pred eEEEEEeccceEEecchhhhhHHhhhcCccccceeEEEecCCceEEEEecCCceEEEEEeecCceeecccC
Confidence 688899999999999999999999888765 3343442334566888888788999999998877644443
No 238
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=21.23 E-value=1.6e+03 Score=27.62 Aligned_cols=72 Identities=17% Similarity=0.288 Sum_probs=43.5
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEE-EcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRH-VLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL 124 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~-~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l 124 (794)
++...+..-.+.|--+|.+||++.=+. .-+..+.+..+.+.-++..++....+..++-|+..+|+++-++..
T Consensus 30 nG~~L~t~~~d~Vi~idv~t~~~~l~s~~~ed~d~ita~~l~~d~~~L~~a~rs~llrv~~L~tgk~irswKa 102 (775)
T KOG0319|consen 30 NGQHLYTACGDRVIIIDVATGSIALPSGSNEDEDEITALALTPDEEVLVTASRSQLLRVWSLPTGKLIRSWKA 102 (775)
T ss_pred CCCEEEEecCceEEEEEccCCceecccCCccchhhhheeeecCCccEEEEeeccceEEEEEcccchHhHhHhh
Confidence 333444444667888999999987111 111112344443344444555445567899999999988766655
No 239
>PRK13861 type IV secretion system protein VirB9; Provisional
Probab=20.92 E-value=3.6e+02 Score=29.40 Aligned_cols=33 Identities=27% Similarity=0.250 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHhccccccceeecccccEeeEE
Q 003800 3 IRFIILTLLFLSSCTIPSLSLYEDQVGLMDWHQ 35 (794)
Q Consensus 3 ~~~~l~~l~~l~~~~~~~~Al~edqvG~~dW~~ 35 (794)
+|.|+++|++|++|+.++.|.-....+..|=|-
T Consensus 2 ~~~~~~~~~~~~~~~~~a~A~~~p~~~~~D~RI 34 (292)
T PRK13861 2 IKKLFLTLACLLFAAIGALAEDTPAAGKLDPRM 34 (292)
T ss_pred hhHHHHHHHHHHHhccchhHhhcCCCCCCCCce
Confidence 456777887877777777666555555444443
No 240
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=20.70 E-value=3.3e+02 Score=30.85 Aligned_cols=68 Identities=16% Similarity=0.169 Sum_probs=35.8
Q ss_pred CCEEEEEeC---CCEEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEe
Q 003800 53 RKRVVVSTE---ENVIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESF 123 (794)
Q Consensus 53 ~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~ 123 (794)
+++++++++ ...++.||.+||++. |..+.++ ...+.-...+..++++.. +..|+++|++|++..=-+.
T Consensus 47 G~kllF~s~~dg~~nly~lDL~t~~i~--QLTdg~g~~~~g~~~s~~~~~~~Yv~~-~~~l~~vdL~T~e~~~vy~ 119 (386)
T PF14583_consen 47 GRKLLFASDFDGNRNLYLLDLATGEIT--QLTDGPGDNTFGGFLSPDDRALYYVKN-GRSLRRVDLDTLEERVVYE 119 (386)
T ss_dssp S-EEEEEE-TTSS-EEEEEETTT-EEE--E---SS-B-TTT-EE-TTSSEEEEEET-TTEEEEEETTT--EEEEEE
T ss_pred CCEEEEEeccCCCcceEEEEcccCEEE--ECccCCCCCccceEEecCCCeEEEEEC-CCeEEEEECCcCcEEEEEE
Confidence 445665665 457999999999873 3333221 222221223555667765 3589999999998753333
No 241
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=20.50 E-value=8.2e+02 Score=29.73 Aligned_cols=94 Identities=22% Similarity=0.268 Sum_probs=49.4
Q ss_pred EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccc
Q 003800 56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLL 135 (794)
Q Consensus 56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~ 135 (794)
..-.+.+|.|---|. ||+.+=|..-.+.- +..+....+++.++-+|.++++|-|+.. ...=...+++.. .+.
T Consensus 193 flScsNDg~Ir~w~~-~ge~l~~~~ghtn~-vYsis~~~~~~~Ivs~gEDrtlriW~~~--e~~q~I~lPtts----iWs 264 (745)
T KOG0301|consen 193 FLSCSNDGSIRLWDL-DGEVLLEMHGHTNF-VYSISMALSDGLIVSTGEDRTLRIWKKD--ECVQVITLPTTS----IWS 264 (745)
T ss_pred eEeecCCceEEEEec-cCceeeeeeccceE-EEEEEecCCCCeEEEecCCceEEEeecC--ceEEEEecCccc----eEE
Confidence 444445666666665 66666554433221 1122223455555546778999999864 444344443321 121
Q ss_pred cccccccccCCeEEEE-ECCEEEEEEC
Q 003800 136 VPTNLKVDKDSLILVS-SKGCLHAVSS 161 (794)
Q Consensus 136 ~~~~~~~~~~~~V~V~-~~g~l~ald~ 161 (794)
+- ... .+++++. +||.|+.+..
T Consensus 265 a~---~L~-NgDIvvg~SDG~VrVfT~ 287 (745)
T KOG0301|consen 265 AK---VLL-NGDIVVGGSDGRVRVFTV 287 (745)
T ss_pred EE---Eee-CCCEEEeccCceEEEEEe
Confidence 11 111 5677776 6888776643
No 242
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=20.43 E-value=1.1e+03 Score=28.50 Aligned_cols=60 Identities=8% Similarity=0.034 Sum_probs=40.9
Q ss_pred eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE--CCEEEEEECC
Q 003800 94 LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSI 162 (794)
Q Consensus 94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~ 162 (794)
..+++++.+. ++.++.||+.+++.+-+....+... +++.. ..++.++..+ |..+.-+|+.
T Consensus 139 TaDgil~s~a-~g~v~i~D~stqk~~~el~~h~d~v-QSa~W-------seDG~llatscKdkqirifDPR 200 (1012)
T KOG1445|consen 139 TADGILASGA-HGSVYITDISTQKTAVELSGHTDKV-QSADW-------SEDGKLLATSCKDKQIRIFDPR 200 (1012)
T ss_pred CcCceEEecc-CceEEEEEcccCceeecccCCchhh-hcccc-------ccCCceEeeecCCcceEEeCCc
Confidence 3566777444 5799999999999999988777655 22222 2255555543 6677777765
No 243
>PF01453 B_lectin: D-mannose binding lectin; InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Alliaceae) contains a ~115-residue-long domain whose overall three-dimensional fold is very similar to that of: Dictyostelium discoideum comitin, an actin-binding protein; and Curculigo latifolia curculin, a sweet-tasting and taste-modifying protein. This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity. Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=20.40 E-value=6.6e+02 Score=22.93 Aligned_cols=60 Identities=17% Similarity=0.410 Sum_probs=38.2
Q ss_pred CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEE
Q 003800 53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWES 122 (794)
Q Consensus 53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~ 122 (794)
++..+..+.+|.|.-.|.. |+++|...-.... + ...-.+.+..+ |.+..+| .+|+.+|+.
T Consensus 19 ~~~~L~l~~dGnLvl~~~~-~~~iWss~~t~~~---~----~~~~~~~L~~~-GNlvl~d-~~~~~lW~S 78 (114)
T PF01453_consen 19 GNYTLILQSDGNLVLYDSN-GSVIWSSNNTSGR---G----NSGCYLVLQDD-GNLVLYD-SSGNVLWQS 78 (114)
T ss_dssp TTEEEEEETTSEEEEEETT-TEEEEE--S-TTS---S-----SSEEEEEETT-SEEEEEE-TTSEEEEES
T ss_pred ccccceECCCCeEEEEcCC-CCEEEEecccCCc---c----ccCeEEEEeCC-CCEEEEe-ecceEEEee
Confidence 4457778889999888865 8889997221110 0 01223444554 5788888 599999997
No 244
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=20.15 E-value=1.3e+03 Score=26.06 Aligned_cols=176 Identities=6% Similarity=0.086 Sum_probs=98.1
Q ss_pred CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcccccccccc
Q 003800 63 NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKV 142 (794)
Q Consensus 63 g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~ 142 (794)
..+--+|-+-+.++=+..++.+ +-.+ ......++|--. ..++-+|..+=+++=......+.. .++... ..
T Consensus 68 r~Lkv~~~Kk~~~ICe~~fpt~--IL~V--rmNr~RLvV~Le-e~IyIydI~~MklLhTI~t~~~n~-~gl~Al----S~ 137 (391)
T KOG2110|consen 68 RKLKVVHFKKKTTICEIFFPTS--ILAV--RMNRKRLVVCLE-ESIYIYDIKDMKLLHTIETTPPNP-KGLCAL----SP 137 (391)
T ss_pred ceEEEEEcccCceEEEEecCCc--eEEE--EEccceEEEEEc-ccEEEEecccceeehhhhccCCCc-cceEee----cc
Confidence 3577778888888888777665 4333 334444443333 259999999999987776653221 222222 11
Q ss_pred ccCCeEEEE----ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800 143 DKDSLILVS----SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE 218 (794)
Q Consensus 143 ~~~~~V~V~----~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~ 218 (794)
..++-.+++ +.|.|+-+|..+=++.=..+.-...+ .++.-..+|.+.+-+...| ..+..++..+|+.+.|.
T Consensus 138 n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~~l---Aalafs~~G~llATASeKG--TVIRVf~v~~G~kl~eF 212 (391)
T KOG2110|consen 138 NNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKGPL---AALAFSPDGTLLATASEKG--TVIRVFSVPEGQKLYEF 212 (391)
T ss_pred CCCCceEEecCCCCCceEEEEEcccceeeeEEEecCCce---eEEEECCCCCEEEEeccCc--eEEEEEEcCCccEeeee
Confidence 212222222 26788888877776666665444333 3332234566655444433 24556677899999998
Q ss_pred eeecc-cCccCceEEE-cCcEEEEEECCCCeEEEEEeec
Q 003800 219 TAAFS-GGFVGDVALV-SSDTLVTLDTTRSILVTVSFKN 255 (794)
Q Consensus 219 ~v~~~-~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~s 255 (794)
+-+.. ..+- +..|- ...++ |..++++.+|+.-|+.
T Consensus 213 RRG~~~~~Iy-SL~Fs~ds~~L-~~sS~TeTVHiFKL~~ 249 (391)
T KOG2110|consen 213 RRGTYPVSIY-SLSFSPDSQFL-AASSNTETVHIFKLEK 249 (391)
T ss_pred eCCceeeEEE-EEEECCCCCeE-EEecCCCeEEEEEecc
Confidence 74432 2221 12222 23344 4444677777777654
No 245
>PF08894 DUF1838: Protein of unknown function (DUF1838); InterPro: IPR014990 This group of proteins are functionally uncharacterised.
Probab=20.15 E-value=74 Score=33.29 Aligned_cols=67 Identities=24% Similarity=0.198 Sum_probs=41.5
Q ss_pred eEeeccCCceEEEEEEcCCCCCCcCCCCCCCcEEEEEEEEceeeeEEEEEEecCCCCCceEEEEecEEEE
Q 003800 700 VMYKYISKNLLFVATVAPKASGHIGSADPDEAWLVVYLIDTITGRILHRMTHHGAQGPVHAVLSENWVVY 769 (794)
Q Consensus 700 VLYKYLNPNl~~v~t~~~~~~~~~~~~~~~~~~l~v~liD~VTG~il~s~~h~~~~~pi~~v~~ENWvvY 769 (794)
.|+|-.-=|..-.+.....+.+. | -.--...|++|+ |.+||+||++-.-+-....|.+||..|=.|=
T Consensus 24 ~LF~ieGmnv~rcv~~~~g~~~~-~-~r~lSREl~~Y~-DP~TgeIL~~W~npwt~e~vpVvhVaNdpv~ 90 (238)
T PF08894_consen 24 LLFKIEGMNVARCVPDEDGEGGE-G-YRFLSRELTFYL-DPVTGEILETWENPWTGEVVPVVHVANDPVN 90 (238)
T ss_pred eeeeeeeeeeeEeeecCCCcchh-h-hhhhhheeeEEe-CCchhhHHHhhcCCCcCCccceEEeccCccc
Confidence 45555555666665554432110 0 000134677777 9999999999888877777888887654443
Done!