Query 004574
Match_columns 744
No_of_seqs 307 out of 3939
Neff 10.3
Searched_HMMs 46136
Date Fri Mar 29 01:38:34 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/004574.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/004574hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG1506 DAP2 Dipeptidyl aminop 100.0 1E-41 2.2E-46 378.7 51.7 566 32-735 14-619 (620)
2 PRK10115 protease 2; Provision 100.0 1.4E-37 2.9E-42 346.9 53.0 519 89-737 129-680 (686)
3 COG1770 PtrB Protease II [Amin 100.0 8.2E-26 1.8E-30 233.7 49.1 517 89-733 131-679 (682)
4 KOG2281 Dipeptidyl aminopeptid 100.0 3.1E-27 6.7E-32 239.9 36.6 315 392-731 527-866 (867)
5 KOG2100 Dipeptidyl aminopeptid 100.0 2.9E-26 6.2E-31 256.2 46.8 386 304-736 345-751 (755)
6 PRK01029 tolB translocation pr 99.9 1.7E-25 3.7E-30 237.9 25.4 251 4-364 164-426 (428)
7 PRK01029 tolB translocation pr 99.9 5.2E-25 1.1E-29 234.2 27.9 266 35-415 141-415 (428)
8 PRK03629 tolB translocation pr 99.9 1.9E-24 4.1E-29 231.2 27.4 236 3-342 177-414 (429)
9 PF00326 Peptidase_S9: Prolyl 99.9 8.2E-26 1.8E-30 219.5 14.2 188 546-735 5-212 (213)
10 PRK05137 tolB translocation pr 99.9 4.7E-24 1E-28 229.8 26.0 236 4-342 181-420 (435)
11 PRK03629 tolB translocation pr 99.9 1.3E-23 2.8E-28 224.8 28.6 250 43-412 164-415 (429)
12 PRK04043 tolB translocation pr 99.9 6.3E-24 1.4E-28 224.0 25.4 232 4-334 168-402 (419)
13 PRK05137 tolB translocation pr 99.9 2.1E-23 4.6E-28 224.8 28.7 254 42-411 165-420 (435)
14 PRK04792 tolB translocation pr 99.9 2.7E-23 5.9E-28 223.0 27.1 234 5-342 198-433 (448)
15 PRK04043 tolB translocation pr 99.9 1.3E-22 2.8E-27 214.1 27.8 248 43-413 155-410 (419)
16 PRK02889 tolB translocation pr 99.9 5.1E-23 1.1E-27 220.5 25.1 227 5-333 176-404 (427)
17 PRK02889 tolB translocation pr 99.9 1.7E-22 3.6E-27 216.6 28.3 246 43-411 164-411 (427)
18 PRK04922 tolB translocation pr 99.9 1.8E-22 3.9E-27 217.2 27.9 253 42-413 167-421 (433)
19 PRK04792 tolB translocation pr 99.9 2.3E-22 5E-27 215.9 28.6 250 43-412 183-434 (448)
20 PRK04922 tolB translocation pr 99.9 1.2E-22 2.5E-27 218.7 25.6 234 5-342 184-419 (433)
21 PRK00178 tolB translocation pr 99.9 2E-22 4.4E-27 217.7 26.2 226 6-333 180-407 (430)
22 COG4946 Uncharacterized protei 99.9 5.9E-22 1.3E-26 194.0 26.1 374 1-449 55-511 (668)
23 PRK00178 tolB translocation pr 99.9 5.8E-22 1.3E-26 214.2 28.4 253 41-412 161-415 (430)
24 COG1505 Serine proteases of th 99.9 3.9E-20 8.4E-25 189.3 34.2 282 433-734 346-648 (648)
25 KOG2237 Predicted serine prote 99.9 4.4E-19 9.5E-24 182.4 35.6 282 437-735 392-708 (712)
26 PRK01742 tolB translocation pr 99.9 1.8E-20 3.9E-25 201.3 26.9 222 42-372 168-391 (429)
27 PRK01742 tolB translocation pr 99.9 1.5E-20 3.3E-25 201.8 26.1 227 5-342 184-412 (429)
28 KOG1455 Lysophospholipase [Lip 99.9 3.7E-20 8.1E-25 175.2 19.3 228 480-732 25-312 (313)
29 PRK10566 esterase; Provisional 99.8 1.1E-19 2.4E-24 181.6 22.0 212 494-733 11-249 (249)
30 PLN02298 hydrolase, alpha/beta 99.8 2.5E-19 5.4E-24 186.8 25.2 237 476-737 26-322 (330)
31 TIGR02800 propeller_TolB tol-p 99.8 7.4E-19 1.6E-23 190.0 25.5 227 5-333 170-398 (417)
32 PRK05077 frsA fermentation/res 99.8 5.7E-19 1.2E-23 186.7 23.2 223 479-733 165-413 (414)
33 PLN02385 hydrolase; alpha/beta 99.8 9.2E-19 2E-23 183.5 23.0 227 482-734 61-347 (349)
34 TIGR02800 propeller_TolB tol-p 99.8 3.3E-18 7.1E-23 185.0 27.5 232 39-373 152-385 (417)
35 PRK13604 luxD acyl transferase 99.8 1.5E-18 3.2E-23 170.4 21.5 198 487-714 14-247 (307)
36 PF00930 DPPIV_N: Dipeptidyl p 99.8 3.4E-18 7.3E-23 178.7 24.7 302 39-451 1-352 (353)
37 PHA02857 monoglyceride lipase; 99.8 3E-18 6.5E-23 174.2 22.1 215 487-732 5-273 (276)
38 COG0823 TolB Periplasmic compo 99.8 2.5E-18 5.4E-23 179.7 21.6 228 6-333 174-403 (425)
39 PF01738 DLH: Dienelactone hyd 99.8 3.6E-19 7.8E-24 173.4 14.2 193 495-733 1-218 (218)
40 TIGR02821 fghA_ester_D S-formy 99.8 4.7E-18 1E-22 171.2 22.6 229 483-733 14-275 (275)
41 PLN02442 S-formylglutathione h 99.8 6.7E-18 1.5E-22 170.2 22.3 235 480-734 16-282 (283)
42 PRK10162 acetyl esterase; Prov 99.8 3.7E-17 8E-22 167.8 25.0 230 480-734 55-317 (318)
43 COG0823 TolB Periplasmic compo 99.8 3.4E-17 7.4E-22 171.2 25.0 217 59-371 170-388 (425)
44 PF00930 DPPIV_N: Dipeptidyl p 99.8 1.1E-17 2.4E-22 174.8 21.3 303 2-372 20-347 (353)
45 PRK10749 lysophospholipase L2; 99.8 2.3E-17 5.1E-22 171.2 22.4 222 482-732 30-329 (330)
46 COG0412 Dienelactone hydrolase 99.8 3.8E-17 8.3E-22 158.3 22.3 204 483-734 3-235 (236)
47 KOG1552 Predicted alpha/beta h 99.8 2.9E-17 6.3E-22 152.8 16.7 214 481-735 34-255 (258)
48 KOG4391 Predicted alpha/beta h 99.8 1.9E-17 4E-22 147.0 14.6 231 472-735 44-285 (300)
49 PF14583 Pectate_lyase22: Olig 99.7 1E-15 2.2E-20 152.6 27.9 344 17-460 18-383 (386)
50 PF05448 AXE1: Acetyl xylan es 99.7 2.7E-17 6E-22 166.2 16.3 225 478-732 52-320 (320)
51 PLN02652 hydrolase; alpha/beta 99.7 2.5E-16 5.3E-21 165.2 23.8 223 482-734 110-389 (395)
52 COG1647 Esterase/lipase [Gener 99.7 6.6E-17 1.4E-21 145.3 14.6 190 514-730 16-242 (243)
53 KOG1515 Arylacetamide deacetyl 99.7 2.4E-15 5.2E-20 150.7 24.4 234 481-732 62-335 (336)
54 PRK11460 putative hydrolase; P 99.7 1.8E-15 4E-20 147.7 20.7 124 578-734 87-210 (232)
55 COG2267 PldB Lysophospholipase 99.7 3E-15 6.6E-20 150.8 21.7 222 482-734 9-296 (298)
56 COG0657 Aes Esterase/lipase [L 99.7 9.2E-15 2E-19 150.7 22.9 223 490-731 59-309 (312)
57 COG3458 Acetyl esterase (deace 99.7 1.8E-15 4E-20 139.9 14.5 223 478-732 52-317 (321)
58 TIGR01840 esterase_phb esteras 99.6 5.2E-15 1.1E-19 143.1 16.9 180 497-696 1-197 (212)
59 PF12695 Abhydrolase_5: Alpha/ 99.6 3E-15 6.5E-20 135.8 13.8 145 515-711 1-145 (145)
60 PF06500 DUF1100: Alpha/beta h 99.6 3.6E-15 7.8E-20 151.0 15.6 220 479-733 162-410 (411)
61 PF12715 Abhydrolase_7: Abhydr 99.6 8.5E-16 1.8E-20 152.3 9.9 221 478-708 84-344 (390)
62 COG4099 Predicted peptidase [G 99.6 4.1E-15 8.8E-20 138.9 13.6 178 490-706 169-354 (387)
63 PF07859 Abhydrolase_3: alpha/ 99.6 1.8E-15 4E-20 146.8 12.0 181 516-713 1-210 (211)
64 PF02230 Abhydrolase_2: Phosph 99.6 5.3E-15 1.2E-19 143.4 14.6 125 579-733 91-216 (216)
65 PRK05371 x-prolyl-dipeptidyl a 99.6 9.9E-14 2.1E-18 156.5 26.8 187 546-736 270-523 (767)
66 TIGR01607 PST-A Plasmodium sub 99.6 2.7E-14 5.9E-19 147.6 20.3 229 487-730 2-331 (332)
67 PLN02511 hydrolase 99.6 5E-14 1.1E-18 148.7 20.1 229 481-735 70-368 (388)
68 TIGR03611 RutD pyrimidine util 99.6 2.3E-14 5E-19 143.9 16.6 190 513-730 13-256 (257)
69 TIGR00976 /NonD putative hydro 99.6 6.5E-14 1.4E-18 155.2 21.2 223 489-737 3-308 (550)
70 TIGR03343 biphenyl_bphD 2-hydr 99.6 4.2E-14 9E-19 144.3 18.1 194 513-730 30-281 (282)
71 PLN00021 chlorophyllase 99.6 1.5E-13 3.3E-18 139.0 20.5 208 493-737 37-288 (313)
72 PRK00870 haloalkane dehalogena 99.6 9.9E-14 2.1E-18 142.8 19.5 221 481-732 20-301 (302)
73 KOG4178 Soluble epoxide hydrol 99.6 3.4E-13 7.5E-18 130.7 21.7 194 513-732 44-320 (322)
74 PRK10985 putative hydrolase; P 99.6 4E-13 8.6E-18 139.1 22.3 220 487-734 36-322 (324)
75 TIGR02240 PHA_depoly_arom poly 99.6 2.4E-13 5.2E-18 138.0 19.9 209 490-734 9-268 (276)
76 TIGR03056 bchO_mg_che_rel puta 99.6 2.1E-13 4.6E-18 138.8 19.5 190 513-730 28-278 (278)
77 TIGR03100 hydr1_PEP hydrolase, 99.5 2E-13 4.3E-18 137.5 18.4 221 483-730 3-273 (274)
78 PF10503 Esterase_phd: Esteras 99.5 1.2E-13 2.6E-18 130.5 15.2 181 495-696 1-198 (220)
79 PLN03087 BODYGUARD 1 domain co 99.5 4.5E-13 9.8E-18 142.3 21.1 216 486-731 179-478 (481)
80 PLN02824 hydrolase, alpha/beta 99.5 3.6E-13 7.7E-18 138.1 19.9 191 514-732 30-294 (294)
81 COG4946 Uncharacterized protei 99.5 9.7E-13 2.1E-17 130.0 21.6 251 4-373 247-509 (668)
82 PRK11071 esterase YqiA; Provis 99.5 3.4E-13 7.4E-18 127.0 17.8 176 514-730 2-189 (190)
83 PRK10673 acyl-CoA esterase; Pr 99.5 3.3E-13 7.2E-18 135.4 18.7 192 512-731 15-254 (255)
84 KOG0271 Notchless-like WD40 re 99.5 8.3E-13 1.8E-17 127.1 20.0 107 292-442 361-467 (480)
85 PLN02965 Probable pheophorbida 99.5 4.6E-13 9.9E-18 134.1 19.2 191 515-731 5-252 (255)
86 TIGR02427 protocat_pcaD 3-oxoa 99.5 7.6E-14 1.7E-18 139.4 13.5 189 513-729 13-250 (251)
87 COG0400 Predicted esterase [Ge 99.5 5.2E-13 1.1E-17 124.7 17.7 125 577-733 82-206 (207)
88 TIGR01738 bioH putative pimelo 99.5 2.2E-13 4.7E-18 135.6 16.0 188 513-729 4-245 (245)
89 TIGR03866 PQQ_ABC_repeats PQQ- 99.5 1.5E-11 3.3E-16 126.6 30.1 267 5-412 11-288 (300)
90 PRK10349 carboxylesterase BioH 99.5 2E-13 4.4E-18 136.9 15.4 188 514-730 14-254 (256)
91 TIGR03866 PQQ_ABC_repeats PQQ- 99.5 3.4E-11 7.3E-16 124.0 32.4 272 60-464 9-283 (300)
92 PF14583 Pectate_lyase22: Olig 99.5 3.7E-12 8E-17 127.4 23.9 293 68-471 16-339 (386)
93 PF02129 Peptidase_S15: X-Pro 99.5 3.8E-13 8.2E-18 135.5 16.4 205 491-711 1-271 (272)
94 PF08840 BAAT_C: BAAT / Acyl-C 99.5 2E-13 4.4E-18 130.7 13.2 156 577-734 5-212 (213)
95 PRK06489 hypothetical protein; 99.5 1.2E-12 2.6E-17 137.8 19.4 167 553-732 103-357 (360)
96 PLN02679 hydrolase, alpha/beta 99.5 1.3E-12 2.8E-17 137.2 19.6 194 513-731 88-356 (360)
97 COG2945 Predicted hydrolase of 99.5 1.2E-12 2.5E-17 115.5 15.6 196 482-730 4-205 (210)
98 KOG0293 WD40 repeat-containing 99.5 2.7E-12 5.8E-17 124.8 18.7 197 32-333 226-426 (519)
99 TIGR01250 pro_imino_pep_2 prol 99.5 1.4E-12 3.1E-17 133.3 17.8 191 513-730 25-288 (288)
100 TIGR03695 menH_SHCHC 2-succiny 99.5 8.6E-13 1.9E-17 131.6 14.9 187 514-729 2-250 (251)
101 PRK07581 hypothetical protein; 99.5 1.2E-12 2.6E-17 137.0 16.5 67 660-733 270-337 (339)
102 PLN02894 hydrolase, alpha/beta 99.5 5E-12 1.1E-16 134.1 21.2 196 513-737 105-390 (402)
103 COG2706 3-carboxymuconate cycl 99.5 1.1E-10 2.4E-15 113.1 27.7 292 5-412 16-332 (346)
104 PRK03592 haloalkane dehalogena 99.4 3.8E-12 8.2E-17 130.6 18.5 194 513-733 27-290 (295)
105 PRK14875 acetoin dehydrogenase 99.4 3.6E-12 7.8E-17 135.7 18.3 188 513-731 131-370 (371)
106 PRK03204 haloalkane dehalogena 99.4 1.3E-11 2.9E-16 125.4 21.5 189 513-729 34-285 (286)
107 TIGR01249 pro_imino_pep_1 prol 99.4 5.1E-12 1.1E-16 130.0 18.5 167 551-732 49-305 (306)
108 PLN02578 hydrolase 99.4 6.4E-12 1.4E-16 131.8 18.8 165 552-730 109-353 (354)
109 TIGR01392 homoserO_Ac_trn homo 99.4 7.4E-12 1.6E-16 131.4 18.6 68 660-730 283-351 (351)
110 PLN02872 triacylglycerol lipas 99.4 4.9E-12 1.1E-16 132.1 16.6 236 479-735 41-392 (395)
111 PRK00175 metX homoserine O-ace 99.4 1.3E-11 2.8E-16 130.5 19.9 70 661-733 305-375 (379)
112 TIGR01836 PHA_synth_III_C poly 99.4 2E-11 4.4E-16 127.9 20.2 176 546-731 85-349 (350)
113 PLN03084 alpha/beta hydrolase 99.4 2.6E-11 5.6E-16 126.6 20.4 189 513-730 127-382 (383)
114 PF12697 Abhydrolase_6: Alpha/ 99.4 1.1E-12 2.5E-17 128.6 9.7 179 516-722 1-226 (228)
115 KOG1454 Predicted hydrolase/ac 99.4 1.3E-11 2.8E-16 125.8 17.3 195 512-732 57-324 (326)
116 PF12740 Chlorophyllase2: Chlo 99.4 2.7E-11 5.9E-16 115.7 18.2 179 496-711 5-205 (259)
117 KOG0315 G-protein beta subunit 99.4 1.8E-10 4E-15 105.0 21.9 273 2-414 17-299 (311)
118 PRK08775 homoserine O-acetyltr 99.4 1.1E-11 2.4E-16 129.7 16.2 172 551-732 95-339 (343)
119 TIGR03101 hydr2_PEP hydrolase, 99.4 1.3E-10 2.7E-15 114.4 22.0 199 485-708 3-243 (266)
120 KOG0271 Notchless-like WD40 re 99.4 3.5E-11 7.6E-16 116.0 17.3 124 207-371 344-470 (480)
121 PF10282 Lactonase: Lactonase, 99.4 1.6E-10 3.6E-15 120.4 23.9 292 5-411 13-332 (345)
122 PRK11126 2-succinyl-6-hydroxy- 99.3 2.8E-11 6E-16 120.4 17.2 184 513-731 2-241 (242)
123 KOG4409 Predicted hydrolase/ac 99.3 1.7E-11 3.6E-16 119.3 14.7 192 513-731 90-363 (365)
124 COG0429 Predicted hydrolase of 99.3 8E-11 1.7E-15 114.0 18.6 219 487-734 54-342 (345)
125 PLN02211 methyl indole-3-aceta 99.3 6.5E-11 1.4E-15 119.1 19.1 190 513-730 18-268 (273)
126 KOG4497 Uncharacterized conser 99.3 1.9E-11 4.1E-16 115.6 13.8 214 35-371 13-229 (447)
127 PRK10439 enterobactin/ferric e 99.3 1.8E-10 3.8E-15 121.2 22.7 215 481-729 179-406 (411)
128 KOG3101 Esterase D [General fu 99.3 9.5E-12 2.1E-16 110.6 10.8 206 492-713 25-263 (283)
129 cd00200 WD40 WD40 domain, foun 99.3 1.6E-09 3.5E-14 109.9 27.2 264 32-444 11-279 (289)
130 KOG1838 Alpha/beta hydrolase [ 99.3 2.2E-10 4.9E-15 115.4 19.1 232 477-732 88-388 (409)
131 PRK11028 6-phosphogluconolacto 99.3 3.7E-09 8E-14 110.3 29.0 265 4-371 11-292 (330)
132 KOG3043 Predicted hydrolase re 99.3 3.6E-11 7.7E-16 109.0 11.5 156 547-733 59-241 (242)
133 KOG0291 WD40-repeat-containing 99.3 1.8E-09 3.9E-14 113.3 25.5 236 21-372 297-540 (893)
134 COG2936 Predicted acyl esteras 99.3 1.6E-10 3.5E-15 121.6 17.8 232 480-734 17-319 (563)
135 KOG0279 G protein beta subunit 99.3 8.4E-10 1.8E-14 102.6 20.1 256 6-371 39-302 (315)
136 COG3509 LpqC Poly(3-hydroxybut 99.3 3.1E-10 6.8E-15 107.8 17.6 217 490-732 42-307 (312)
137 PF02897 Peptidase_S9_N: Proly 99.3 3.4E-09 7.3E-14 114.3 28.3 322 40-469 76-413 (414)
138 KOG2984 Predicted hydrolase [G 99.2 1.3E-11 2.9E-16 108.9 6.7 163 556-731 72-275 (277)
139 PF08662 eIF2A: Eukaryotic tra 99.2 3.3E-10 7.1E-15 106.8 16.2 124 153-317 35-162 (194)
140 KOG0318 WD40 repeat stress pro 99.2 1.9E-08 4.1E-13 101.6 29.1 136 32-193 192-339 (603)
141 KOG0272 U4/U6 small nuclear ri 99.2 9.2E-10 2E-14 107.9 18.9 185 32-317 177-364 (459)
142 PF00756 Esterase: Putative es 99.2 1.3E-11 2.7E-16 123.4 6.2 210 492-729 5-251 (251)
143 KOG2055 WD40 repeat protein [G 99.2 8.1E-10 1.8E-14 109.5 18.4 259 5-371 235-501 (514)
144 KOG1407 WD40 repeat protein [F 99.2 3.1E-09 6.6E-14 97.9 20.7 272 32-449 22-295 (313)
145 PRK13616 lipoprotein LpqB; Pro 99.2 1.1E-09 2.4E-14 120.0 20.6 177 32-292 351-532 (591)
146 KOG4627 Kynurenine formamidase 99.2 8.9E-11 1.9E-15 104.1 9.6 200 481-711 44-247 (270)
147 COG2706 3-carboxymuconate cycl 99.2 1.8E-07 3.9E-12 91.1 32.3 255 158-470 67-333 (346)
148 PLN02980 2-oxoglutarate decarb 99.2 1E-09 2.2E-14 135.0 21.3 198 513-735 1371-1642(1655)
149 PRK11028 6-phosphogluconolacto 99.2 4.6E-08 1E-12 102.0 30.2 225 175-470 80-316 (330)
150 PF03403 PAF-AH_p_II: Platelet 99.2 5.8E-10 1.3E-14 116.0 15.1 180 511-735 98-361 (379)
151 KOG0279 G protein beta subunit 99.1 1.7E-08 3.7E-13 94.0 22.4 275 32-447 17-304 (315)
152 KOG4667 Predicted esterase [Li 99.1 4.6E-10 1E-14 100.6 11.6 200 481-714 9-242 (269)
153 PRK06765 homoserine O-acetyltr 99.1 1.6E-09 3.4E-14 113.5 17.7 69 660-731 318-387 (389)
154 KOG1446 Histone H3 (Lys4) meth 99.1 1.3E-07 2.8E-12 90.1 27.9 253 31-414 15-273 (311)
155 KOG0266 WD40 repeat-containing 99.1 1.2E-08 2.6E-13 110.3 24.4 268 32-445 161-441 (456)
156 KOG0273 Beta-transducin family 99.1 2E-08 4.3E-13 100.1 22.9 268 28-446 233-513 (524)
157 PRK05855 short chain dehydroge 99.1 8.4E-10 1.8E-14 125.2 15.7 64 662-733 230-293 (582)
158 cd00200 WD40 WD40 domain, foun 99.1 3.2E-08 7E-13 100.3 25.2 244 5-371 31-280 (289)
159 KOG0305 Anaphase promoting com 99.1 9.2E-09 2E-13 107.0 19.9 251 3-371 195-450 (484)
160 PF07224 Chlorophyllase: Chlor 99.1 1.9E-09 4.1E-14 100.2 13.2 206 494-735 32-277 (307)
161 PF08538 DUF1749: Protein of u 99.1 8.5E-10 1.8E-14 107.5 11.2 184 546-730 54-303 (303)
162 KOG0318 WD40 repeat stress pro 99.1 6.6E-08 1.4E-12 97.8 24.7 202 38-333 26-266 (603)
163 PF02897 Peptidase_S9_N: Proly 99.1 3.9E-08 8.4E-13 106.1 25.3 272 34-411 127-412 (414)
164 KOG0973 Histone transcription 99.1 1E-08 2.3E-13 112.5 20.2 158 32-268 71-245 (942)
165 KOG0315 G-protein beta subunit 99.1 7.4E-08 1.6E-12 88.3 22.0 269 59-467 17-295 (311)
166 TIGR02658 TTQ_MADH_Hv methylam 99.0 4.6E-07 9.9E-12 92.1 30.1 289 61-466 26-336 (352)
167 KOG0293 WD40 repeat-containing 99.0 4.9E-09 1.1E-13 102.5 14.8 209 28-328 267-511 (519)
168 PRK07868 acyl-CoA synthetase; 99.0 1.1E-08 2.3E-13 121.8 21.2 74 661-740 293-369 (994)
169 KOG0291 WD40-repeat-containing 99.0 2.3E-07 5E-12 97.8 27.8 284 30-460 350-652 (893)
170 PF10282 Lactonase: Lactonase, 99.0 1.6E-07 3.4E-12 98.1 25.9 300 62-469 13-333 (345)
171 COG3571 Predicted hydrolase of 99.0 3.4E-08 7.4E-13 84.3 16.7 160 513-713 14-183 (213)
172 PF03583 LIP: Secretory lipase 99.0 1.5E-08 3.3E-13 101.9 17.5 67 665-737 219-286 (290)
173 PF00561 Abhydrolase_1: alpha/ 99.0 9.2E-10 2E-14 108.4 7.3 141 576-725 28-228 (230)
174 PF09752 DUF2048: Uncharacteri 99.0 2.6E-08 5.7E-13 98.7 16.9 211 494-730 76-347 (348)
175 KOG0272 U4/U6 small nuclear ri 99.0 6.9E-09 1.5E-13 101.9 12.4 193 30-327 261-457 (459)
176 KOG2055 WD40 repeat protein [G 99.0 5.4E-08 1.2E-12 96.8 18.5 276 32-445 215-501 (514)
177 TIGR02658 TTQ_MADH_Hv methylam 98.9 3.7E-07 7.9E-12 92.8 24.8 292 5-334 27-332 (352)
178 PF05728 UPF0227: Uncharacteri 98.9 6.8E-08 1.5E-12 89.5 17.5 136 577-729 44-186 (187)
179 PRK10115 protease 2; Provision 98.9 6.8E-07 1.5E-11 101.1 29.1 262 32-413 128-404 (686)
180 KOG2112 Lysophospholipase [Lip 98.9 1.5E-08 3.2E-13 92.0 12.5 128 577-731 73-203 (206)
181 COG0627 Predicted esterase [Ge 98.9 1.4E-08 3E-13 101.7 13.5 147 585-735 141-314 (316)
182 TIGR01838 PHA_synth_I poly(R)- 98.9 6.1E-08 1.3E-12 104.5 18.4 80 546-630 211-303 (532)
183 KOG1407 WD40 repeat protein [F 98.9 1.1E-07 2.3E-12 87.9 17.0 205 6-320 88-294 (313)
184 KOG3847 Phospholipase A2 (plat 98.9 2E-08 4.2E-13 95.5 12.3 183 508-735 113-374 (399)
185 KOG0263 Transcription initiati 98.9 7E-08 1.5E-12 102.5 17.5 206 167-460 443-649 (707)
186 PF08662 eIF2A: Eukaryotic tra 98.9 6.4E-08 1.4E-12 91.3 15.3 146 5-223 39-185 (194)
187 PTZ00421 coronin; Provisional 98.8 1.6E-06 3.4E-11 93.7 26.9 168 175-409 126-296 (493)
188 PF02239 Cytochrom_D1: Cytochr 98.8 5.9E-07 1.3E-11 93.6 22.4 292 34-468 40-356 (369)
189 COG4188 Predicted dienelactone 98.8 4E-08 8.7E-13 97.6 12.9 207 482-711 38-294 (365)
190 KOG2382 Predicted alpha/beta h 98.8 4.7E-08 1E-12 95.5 12.9 64 662-732 250-313 (315)
191 KOG0973 Histone transcription 98.8 3.4E-07 7.3E-12 100.9 20.8 181 175-389 70-256 (942)
192 KOG2096 WD40 repeat protein [G 98.8 8.4E-07 1.8E-11 84.4 20.1 177 209-443 209-390 (420)
193 COG2382 Fes Enterochelin ester 98.8 1.4E-07 3.1E-12 90.9 14.9 217 481-733 68-296 (299)
194 PRK13616 lipoprotein LpqB; Pro 98.8 1.6E-07 3.4E-12 103.2 17.2 168 89-334 352-529 (591)
195 KOG0273 Beta-transducin family 98.8 5.1E-07 1.1E-11 90.3 18.7 251 7-372 259-513 (524)
196 KOG2314 Translation initiation 98.8 5.8E-06 1.3E-10 84.6 25.6 314 22-447 242-558 (698)
197 KOG0772 Uncharacterized conser 98.8 7.5E-07 1.6E-11 90.0 19.1 244 158-464 190-449 (641)
198 PLN00181 protein SPA1-RELATED; 98.7 1.2E-05 2.6E-10 94.2 31.9 162 157-371 555-727 (793)
199 PF02239 Cytochrom_D1: Cytochr 98.7 4.1E-07 9E-12 94.8 17.6 289 6-410 59-354 (369)
200 KOG0286 G-protein beta subunit 98.7 4.5E-06 9.7E-11 78.8 22.2 234 88-459 57-302 (343)
201 KOG2315 Predicted translation 98.7 1.7E-06 3.7E-11 88.7 21.1 225 5-317 146-373 (566)
202 KOG0772 Uncharacterized conser 98.7 3.2E-06 6.9E-11 85.6 22.3 251 28-371 164-428 (641)
203 COG2819 Predicted hydrolase of 98.7 1.1E-06 2.3E-11 83.9 18.0 130 580-730 123-259 (264)
204 KOG2564 Predicted acetyltransf 98.7 1.4E-07 3.1E-12 88.6 11.8 117 481-626 49-179 (343)
205 KOG0266 WD40 repeat-containing 98.7 2.1E-06 4.5E-11 93.1 22.9 224 32-371 205-441 (456)
206 PF06821 Ser_hydrolase: Serine 98.7 6.5E-07 1.4E-11 82.1 16.2 142 541-713 13-155 (171)
207 KOG0296 Angio-associated migra 98.7 8.2E-06 1.8E-10 79.3 23.8 264 8-371 89-387 (399)
208 KOG2314 Translation initiation 98.7 1.9E-06 4.2E-11 88.0 20.5 269 5-371 282-556 (698)
209 PF06342 DUF1057: Alpha/beta h 98.7 2.1E-06 4.4E-11 82.1 18.6 96 512-628 34-136 (297)
210 PF10340 DUF2424: Protein of u 98.7 1.4E-06 3.1E-11 88.1 17.9 196 497-711 108-349 (374)
211 KOG2624 Triglyceride lipase-ch 98.6 1.3E-06 2.8E-11 90.1 17.5 232 480-733 46-399 (403)
212 PTZ00420 coronin; Provisional 98.6 1.9E-05 4.2E-10 86.0 27.0 117 175-333 126-249 (568)
213 KOG0286 G-protein beta subunit 98.6 1E-05 2.2E-10 76.4 21.1 268 28-371 53-334 (343)
214 PTZ00421 coronin; Provisional 98.6 1.4E-05 3.1E-10 86.4 24.9 202 32-333 77-291 (493)
215 cd00312 Esterase_lipase Estera 98.6 1.6E-07 3.5E-12 103.8 10.1 120 495-630 79-214 (493)
216 KOG2315 Predicted translation 98.6 0.00013 2.9E-09 75.2 29.7 326 33-446 37-374 (566)
217 TIGR01839 PHA_synth_II poly(R) 98.6 1.6E-06 3.5E-11 92.4 16.4 84 545-630 237-329 (560)
218 COG2272 PnbA Carboxylesterase 98.6 1.6E-07 3.4E-12 96.6 8.2 119 495-630 80-218 (491)
219 KOG0263 Transcription initiati 98.6 1.1E-06 2.3E-11 93.8 14.5 199 15-317 438-638 (707)
220 KOG2139 WD40 repeat protein [G 98.6 1.1E-06 2.4E-11 85.0 13.3 113 24-197 189-303 (445)
221 KOG0282 mRNA splicing factor [ 98.6 2.2E-06 4.8E-11 86.2 15.9 230 32-371 216-451 (503)
222 PF07433 DUF1513: Protein of u 98.5 7.3E-05 1.6E-09 73.5 25.7 225 33-334 7-249 (305)
223 PF02273 Acyl_transf_2: Acyl t 98.5 6.9E-06 1.5E-10 76.1 17.1 199 489-714 9-240 (294)
224 COG3208 GrsT Predicted thioest 98.5 6E-06 1.3E-10 77.4 17.0 150 574-730 52-234 (244)
225 KOG0265 U5 snRNP-specific prot 98.5 3.5E-05 7.6E-10 73.2 21.6 220 157-461 69-297 (338)
226 KOG2139 WD40 repeat protein [G 98.5 6.9E-06 1.5E-10 79.7 17.0 157 175-373 196-366 (445)
227 COG1073 Hydrolases of the alph 98.5 1.1E-06 2.3E-11 90.3 12.6 78 652-733 218-298 (299)
228 PTZ00420 coronin; Provisional 98.5 0.00013 2.8E-09 79.7 28.8 232 170-472 69-305 (568)
229 PRK04940 hypothetical protein; 98.5 2.7E-06 5.8E-11 76.9 13.1 116 594-730 60-178 (180)
230 KOG0265 U5 snRNP-specific prot 98.5 3.8E-05 8.2E-10 73.0 20.8 253 3-371 67-327 (338)
231 COG0596 MhpC Predicted hydrola 98.5 4.5E-06 9.8E-11 83.6 16.3 68 556-628 51-122 (282)
232 KOG0305 Anaphase promoting com 98.5 3.2E-05 6.9E-10 81.0 22.4 266 33-446 180-451 (484)
233 KOG0282 mRNA splicing factor [ 98.5 3.6E-06 7.8E-11 84.7 14.4 219 5-333 237-463 (503)
234 cd00707 Pancreat_lipase_like P 98.4 9.1E-07 2E-11 88.6 10.2 104 513-629 36-147 (275)
235 KOG1446 Histone H3 (Lys4) meth 98.4 0.0002 4.2E-09 68.9 24.9 230 158-470 37-272 (311)
236 KOG0275 Conserved WD40 repeat- 98.4 4.7E-06 1E-10 79.3 13.9 251 32-414 215-478 (508)
237 PF08450 SGL: SMP-30/Gluconola 98.4 6.4E-05 1.4E-09 74.6 22.6 143 157-334 22-166 (246)
238 PF06057 VirJ: Bacterial virul 98.4 3.4E-06 7.4E-11 76.3 11.9 166 544-730 18-190 (192)
239 KOG1273 WD40 repeat protein [G 98.4 6.8E-05 1.5E-09 71.7 20.1 255 33-415 26-292 (405)
240 KOG0645 WD40 repeat protein [G 98.4 5.9E-05 1.3E-09 70.6 19.2 203 32-333 16-230 (312)
241 COG5354 Uncharacterized protei 98.4 4.3E-05 9.4E-10 77.7 19.9 224 3-317 150-378 (561)
242 KOG4497 Uncharacterized conser 98.4 7.6E-05 1.6E-09 71.7 20.3 104 241-371 318-421 (447)
243 COG5354 Uncharacterized protei 98.4 6.3E-05 1.4E-09 76.6 20.8 263 6-370 109-377 (561)
244 KOG2096 WD40 repeat protein [G 98.4 2.7E-05 5.8E-10 74.4 16.9 168 157-371 209-392 (420)
245 KOG0296 Angio-associated migra 98.4 0.00053 1.1E-08 67.2 25.7 180 29-313 63-248 (399)
246 PF08450 SGL: SMP-30/Gluconola 98.3 6.3E-05 1.4E-09 74.7 20.6 196 35-333 4-214 (246)
247 COG1506 DAP2 Dipeptidyl aminop 98.3 0.00029 6.2E-09 79.5 27.7 281 28-402 57-343 (620)
248 PLN00181 protein SPA1-RELATED; 98.3 0.00021 4.6E-09 83.7 27.5 195 32-333 534-739 (793)
249 TIGR03230 lipo_lipase lipoprot 98.3 4.2E-06 9.2E-11 87.7 11.1 101 513-628 41-153 (442)
250 KOG1274 WD40 repeat protein [G 98.3 0.00019 4E-09 78.4 23.3 145 175-371 97-251 (933)
251 PF00135 COesterase: Carboxyle 98.3 2.9E-06 6.3E-11 95.2 10.0 121 495-628 109-244 (535)
252 KOG1274 WD40 repeat protein [G 98.2 6.2E-05 1.3E-09 82.0 18.7 179 33-317 99-289 (933)
253 KOG1524 WD40 repeat-containing 98.2 1.6E-05 3.6E-10 80.9 13.4 166 32-317 106-275 (737)
254 COG2021 MET2 Homoserine acetyl 98.2 4.2E-05 9.1E-10 76.3 15.8 66 661-731 302-367 (368)
255 TIGR01849 PHB_depoly_PhaZ poly 98.2 6.3E-05 1.4E-09 78.1 17.4 70 661-732 333-406 (406)
256 KOG0645 WD40 repeat protein [G 98.2 0.0004 8.6E-09 65.3 20.5 147 175-371 15-169 (312)
257 KOG0319 WD40-repeat-containing 98.2 0.00033 7.3E-09 74.6 22.3 193 35-333 24-223 (775)
258 KOG0284 Polyadenylation factor 98.2 7.4E-05 1.6E-09 73.9 16.3 223 32-371 98-325 (464)
259 PF10142 PhoPQ_related: PhoPQ- 98.2 0.00014 3E-09 74.4 19.1 142 588-739 166-327 (367)
260 KOG2551 Phospholipase/carboxyh 98.2 1.5E-05 3.3E-10 73.1 10.5 125 577-735 91-223 (230)
261 PF10647 Gmad1: Lipoprotein Lp 98.2 0.0003 6.6E-09 69.5 20.5 117 19-196 10-133 (253)
262 PF06433 Me-amine-dh_H: Methyl 98.1 0.0029 6.3E-08 63.2 25.9 140 301-467 186-327 (342)
263 COG4757 Predicted alpha/beta h 98.1 3.5E-05 7.6E-10 70.8 11.5 213 485-729 8-280 (281)
264 TIGR02171 Fb_sc_TIGR02171 Fibr 98.1 0.00014 3E-09 80.8 18.3 255 6-300 330-600 (912)
265 PF03959 FSH1: Serine hydrolas 98.1 2.3E-05 5E-10 75.4 10.9 107 577-713 85-203 (212)
266 KOG0275 Conserved WD40 repeat- 98.1 7.2E-05 1.6E-09 71.5 13.7 206 175-466 264-473 (508)
267 COG3386 Gluconolactonase [Carb 98.1 0.0015 3.3E-08 65.8 24.2 144 157-332 47-193 (307)
268 KOG0284 Polyadenylation factor 98.1 4.5E-05 9.8E-10 75.4 12.5 145 175-371 139-283 (464)
269 KOG2919 Guanine nucleotide-bin 98.1 0.0002 4.4E-09 68.8 16.4 136 158-333 134-282 (406)
270 KOG2394 WD40 protein DMR-N9 [G 98.1 0.0013 2.9E-08 67.6 23.1 66 294-371 286-351 (636)
271 PF06028 DUF915: Alpha/beta hy 98.1 5.2E-05 1.1E-09 73.9 12.9 147 577-729 88-252 (255)
272 PLN02919 haloacid dehalogenase 98.1 0.0027 5.9E-08 75.7 29.6 126 178-334 686-835 (1057)
273 KOG1445 Tumor-specific antigen 98.1 5.9E-05 1.3E-09 78.5 13.4 155 27-262 624-784 (1012)
274 KOG0639 Transducin-like enhanc 98.1 6.6E-05 1.4E-09 76.0 13.2 221 6-331 441-664 (705)
275 KOG0283 WD40 repeat-containing 98.0 0.00047 1E-08 74.7 20.1 157 167-371 401-565 (712)
276 KOG0640 mRNA cleavage stimulat 98.0 0.00079 1.7E-08 64.4 18.9 108 172-317 214-324 (430)
277 KOG3253 Predicted alpha/beta h 98.0 0.0001 2.2E-09 77.1 14.1 101 590-713 246-347 (784)
278 PF12048 DUF3530: Protein of u 98.0 0.00025 5.3E-09 72.1 17.0 206 482-732 62-309 (310)
279 KOG1539 WD repeat protein [Gen 98.0 0.0022 4.7E-08 69.6 23.8 65 358-458 582-646 (910)
280 COG1770 PtrB Protease II [Amin 98.0 0.0045 9.7E-08 66.5 25.8 267 32-413 130-406 (682)
281 KOG1009 Chromatin assembly com 98.0 0.00082 1.8E-08 66.8 18.8 130 6-196 39-187 (434)
282 KOG1524 WD40 repeat-containing 98.0 0.0005 1.1E-08 70.5 17.7 159 157-370 85-245 (737)
283 KOG0639 Transducin-like enhanc 98.0 0.00041 9E-09 70.5 16.9 234 20-371 409-652 (705)
284 PLN02919 haloacid dehalogenase 98.0 0.011 2.3E-07 70.7 31.5 229 177-464 626-892 (1057)
285 KOG2106 Uncharacterized conser 98.0 0.0042 9.2E-08 63.6 23.7 149 175-383 369-518 (626)
286 TIGR02171 Fb_sc_TIGR02171 Fibr 97.9 0.00095 2.1E-08 74.4 20.0 94 44-195 320-420 (912)
287 PF04762 IKI3: IKI3 family; I 97.9 0.0073 1.6E-07 70.7 28.5 83 298-390 209-293 (928)
288 PF11339 DUF3141: Protein of u 97.9 0.00032 7E-09 72.8 15.2 50 660-711 292-348 (581)
289 PF07433 DUF1513: Protein of u 97.9 0.0023 5E-08 63.1 20.2 206 157-411 28-255 (305)
290 KOG1553 Predicted alpha/beta h 97.9 0.00012 2.6E-09 71.1 10.6 173 481-680 213-398 (517)
291 PF10230 DUF2305: Uncharacteri 97.8 0.00019 4.1E-09 71.5 12.4 54 576-629 63-122 (266)
292 COG3386 Gluconolactonase [Carb 97.8 0.0058 1.2E-07 61.7 22.5 204 33-333 27-244 (307)
293 KOG0288 WD40 repeat protein Ti 97.8 0.00017 3.8E-09 71.5 11.1 124 209-371 322-450 (459)
294 PF10647 Gmad1: Lipoprotein Lp 97.8 0.0023 5E-08 63.3 19.3 175 63-317 3-184 (253)
295 COG3545 Predicted esterase of 97.8 0.00038 8.2E-09 61.7 11.5 133 578-730 43-177 (181)
296 KOG0307 Vesicle coat complex C 97.8 0.00032 7E-09 78.6 13.5 261 33-408 9-289 (1049)
297 KOG0643 Translation initiation 97.8 0.02 4.4E-07 54.1 23.0 163 158-370 75-249 (327)
298 PF06433 Me-amine-dh_H: Methyl 97.7 0.0035 7.6E-08 62.6 18.9 135 179-334 188-322 (342)
299 PF12146 Hydrolase_4: Putative 97.7 5.4E-05 1.2E-09 59.2 5.1 58 492-573 1-58 (79)
300 PF11144 DUF2920: Protein of u 97.7 0.0011 2.4E-08 67.6 15.7 135 569-706 158-332 (403)
301 PF07676 PD40: WD40-like Beta 97.7 7.2E-05 1.6E-09 49.6 4.7 38 22-66 2-39 (39)
302 KOG0643 Translation initiation 97.7 0.0042 9E-08 58.5 17.7 126 5-195 74-211 (327)
303 KOG1445 Tumor-specific antigen 97.7 0.0015 3.2E-08 68.5 15.8 135 157-333 652-799 (1012)
304 TIGR03502 lipase_Pla1_cef extr 97.7 0.00023 4.9E-09 79.9 10.7 85 513-615 449-576 (792)
305 PF05677 DUF818: Chlamydia CHL 97.7 0.00057 1.2E-08 67.5 12.1 173 481-675 111-300 (365)
306 COG3150 Predicted esterase [Ge 97.6 0.0004 8.7E-09 60.6 9.3 136 577-730 44-187 (191)
307 PF05705 DUF829: Eukaryotic pr 97.6 0.0013 2.9E-08 64.8 14.5 65 663-729 176-240 (240)
308 KOG0306 WD40-repeat-containing 97.6 0.012 2.7E-07 63.3 21.6 207 24-333 367-583 (888)
309 KOG4328 WD40 protein [Function 97.6 0.036 7.8E-07 56.4 23.7 226 157-458 257-493 (498)
310 PRK02888 nitrous-oxide reducta 97.6 0.018 4E-07 62.4 23.1 211 34-333 128-352 (635)
311 KOG1063 RNA polymerase II elon 97.6 0.016 3.5E-07 61.8 22.1 74 244-329 528-601 (764)
312 KOG1273 WD40 repeat protein [G 97.6 0.0086 1.9E-07 57.8 17.9 229 157-471 45-291 (405)
313 COG3243 PhaC Poly(3-hydroxyalk 97.6 0.00039 8.4E-09 70.5 9.4 83 546-630 130-218 (445)
314 KOG0295 WD40 repeat-containing 97.5 0.0031 6.8E-08 61.9 14.7 235 157-465 130-369 (406)
315 KOG0278 Serine/threonine kinas 97.5 0.01 2.2E-07 55.4 16.8 198 175-463 101-300 (334)
316 COG4814 Uncharacterized protei 97.5 0.0012 2.6E-08 61.9 10.7 144 577-730 121-285 (288)
317 PF03096 Ndr: Ndr family; Int 97.5 0.006 1.3E-07 59.7 15.9 211 493-731 9-278 (283)
318 PF07819 PGAP1: PGAP1-like pro 97.5 0.0019 4E-08 62.5 12.6 54 577-630 65-124 (225)
319 KOG0771 Prolactin regulatory e 97.4 0.00087 1.9E-08 67.0 10.0 198 33-331 147-355 (398)
320 KOG2919 Guanine nucleotide-bin 97.4 0.011 2.4E-07 57.3 16.7 121 300-463 209-331 (406)
321 KOG0306 WD40-repeat-containing 97.4 0.023 5E-07 61.3 20.3 144 175-371 509-653 (888)
322 KOG4840 Predicted hydrolases o 97.4 0.0069 1.5E-07 55.6 14.2 86 545-632 56-147 (299)
323 KOG0310 Conserved WD40 repeat- 97.4 0.028 6.1E-07 57.6 19.9 266 32-446 28-299 (487)
324 KOG2048 WD40 repeat protein [G 97.4 0.011 2.3E-07 63.0 17.4 150 27-260 379-536 (691)
325 COG4947 Uncharacterized protei 97.3 0.00021 4.5E-09 62.3 3.8 106 594-712 101-216 (227)
326 PF04762 IKI3: IKI3 family; I 97.3 0.068 1.5E-06 62.8 25.2 106 246-371 214-323 (928)
327 PF00975 Thioesterase: Thioest 97.3 0.0033 7.3E-08 61.5 12.5 35 594-628 66-103 (229)
328 KOG0264 Nucleosome remodeling 97.3 0.014 3E-07 59.2 16.5 236 2-330 144-404 (422)
329 KOG0771 Prolactin regulatory e 97.3 0.0019 4E-08 64.7 10.2 152 30-260 186-342 (398)
330 KOG0278 Serine/threonine kinas 97.3 0.0053 1.1E-07 57.2 12.3 198 27-333 97-300 (334)
331 KOG0319 WD40-repeat-containing 97.2 0.097 2.1E-06 56.6 22.7 296 3-439 301-602 (775)
332 KOG0283 WD40 repeat-containing 97.2 0.011 2.3E-07 64.6 16.1 200 32-333 371-577 (712)
333 KOG0288 WD40 repeat protein Ti 97.2 0.002 4.4E-08 64.2 9.5 124 5-193 322-450 (459)
334 PRK02888 nitrous-oxide reducta 97.2 0.044 9.6E-07 59.6 19.9 56 36-106 198-254 (635)
335 KOG2110 Uncharacterized conser 97.1 0.0065 1.4E-07 59.9 12.3 137 4-219 105-250 (391)
336 COG3490 Uncharacterized protei 97.1 0.1 2.2E-06 50.1 19.6 228 27-334 66-312 (366)
337 KOG1523 Actin-related protein 97.1 0.032 6.9E-07 54.2 16.2 62 33-106 13-75 (361)
338 PF07676 PD40: WD40-like Beta 97.1 0.0011 2.4E-08 43.9 4.6 37 292-328 2-39 (39)
339 KOG0289 mRNA splicing factor [ 97.1 0.039 8.4E-07 55.8 17.1 188 27-317 300-495 (506)
340 KOG1332 Vesicle coat complex C 97.1 0.066 1.4E-06 50.1 17.3 239 5-333 33-289 (299)
341 KOG0289 mRNA splicing factor [ 97.0 0.07 1.5E-06 54.0 18.5 162 158-370 284-450 (506)
342 PF00151 Lipase: Lipase; Inte 97.0 0.0012 2.7E-08 67.5 6.5 52 577-628 133-186 (331)
343 KOG0307 Vesicle coat complex C 97.0 0.0091 2E-07 67.4 13.3 219 22-334 56-286 (1049)
344 KOG0303 Actin-binding protein 97.0 0.068 1.5E-06 53.4 17.7 149 175-372 132-283 (472)
345 KOG1538 Uncharacterized conser 97.0 0.062 1.4E-06 57.2 18.3 57 32-104 14-71 (1081)
346 KOG0295 WD40 repeat-containing 97.0 0.0092 2E-07 58.7 11.5 226 6-333 131-365 (406)
347 KOG0302 Ribosome Assembly prot 97.0 0.024 5.2E-07 56.2 14.3 139 157-333 234-381 (440)
348 KOG2931 Differentiation-relate 96.9 0.069 1.5E-06 51.7 16.5 212 493-730 32-304 (326)
349 PF15492 Nbas_N: Neuroblastoma 96.9 0.47 1E-05 45.9 22.1 39 175-226 44-82 (282)
350 KOG0647 mRNA export protein (c 96.9 0.49 1.1E-05 45.9 24.5 123 22-219 20-147 (347)
351 KOG0316 Conserved WD40 repeat- 96.9 0.053 1.2E-06 50.3 14.7 200 175-466 18-219 (307)
352 KOG0277 Peroxisomal targeting 96.9 0.036 7.8E-07 52.1 13.7 206 34-333 12-222 (311)
353 KOG4378 Nuclear protein COP1 [ 96.8 0.041 9E-07 56.5 15.1 136 157-334 143-282 (673)
354 KOG2048 WD40 repeat protein [G 96.8 0.86 1.9E-05 49.1 24.8 113 175-334 70-186 (691)
355 COG4782 Uncharacterized protei 96.8 0.0058 1.2E-07 61.0 8.6 102 513-630 116-235 (377)
356 KOG1963 WD40 repeat protein [G 96.7 0.4 8.7E-06 53.1 22.7 115 302-461 209-323 (792)
357 KOG0264 Nucleosome remodeling 96.7 0.28 6.1E-06 50.1 20.0 154 175-372 228-393 (422)
358 KOG1516 Carboxylesterase and r 96.7 0.005 1.1E-07 69.0 8.9 121 495-628 97-231 (545)
359 PLN02733 phosphatidylcholine-s 96.7 0.0049 1.1E-07 65.5 8.2 79 546-630 112-202 (440)
360 PF01674 Lipase_2: Lipase (cla 96.7 0.003 6.6E-08 60.1 5.8 66 546-614 20-95 (219)
361 KOG1523 Actin-related protein 96.7 0.089 1.9E-06 51.3 15.4 102 175-317 11-119 (361)
362 KOG1920 IkappaB kinase complex 96.7 0.59 1.3E-05 54.0 24.1 167 175-390 69-278 (1265)
363 KOG0313 Microtubule binding pr 96.7 0.53 1.1E-05 47.1 20.8 248 22-371 97-364 (423)
364 KOG2110 Uncharacterized conser 96.6 0.14 3E-06 50.9 16.8 135 157-332 106-250 (391)
365 KOG1920 IkappaB kinase complex 96.6 0.32 6.9E-06 56.0 21.6 56 35-105 73-128 (1265)
366 KOG0277 Peroxisomal targeting 96.6 0.044 9.6E-07 51.5 12.5 265 5-330 38-307 (311)
367 KOG1332 Vesicle coat complex C 96.6 0.31 6.7E-06 45.8 17.8 70 294-371 203-275 (299)
368 KOG4283 Transcription-coupled 96.6 0.048 1E-06 52.3 12.8 147 2-219 121-278 (397)
369 KOG0650 WD40 repeat nucleolar 96.6 0.049 1.1E-06 57.3 14.0 158 171-371 518-679 (733)
370 KOG2394 WD40 protein DMR-N9 [G 96.6 0.0089 1.9E-07 61.8 8.4 40 175-227 333-372 (636)
371 PF07082 DUF1350: Protein of u 96.5 0.052 1.1E-06 51.8 12.7 153 546-711 38-204 (250)
372 KOG0268 Sof1-like rRNA process 96.5 0.046 9.9E-07 54.0 12.6 121 277-442 211-331 (433)
373 KOG1538 Uncharacterized conser 96.5 0.012 2.6E-07 62.3 9.2 37 157-193 33-72 (1081)
374 PF08386 Abhydrolase_4: TAP-li 96.5 0.0065 1.4E-07 50.5 5.8 60 665-731 34-93 (103)
375 PF05990 DUF900: Alpha/beta hy 96.5 0.014 3.1E-07 56.8 9.0 80 577-678 78-166 (233)
376 KOG0316 Conserved WD40 repeat- 96.5 0.35 7.6E-06 45.1 17.0 116 32-223 19-137 (307)
377 KOG1963 WD40 repeat protein [G 96.5 0.84 1.8E-05 50.7 22.9 103 34-196 209-314 (792)
378 KOG1063 RNA polymerase II elon 96.4 0.023 5E-07 60.7 10.8 69 28-106 523-592 (764)
379 KOG1007 WD repeat protein TSSC 96.4 0.39 8.4E-06 46.3 17.6 116 175-329 124-244 (370)
380 KOG0303 Actin-binding protein 96.4 0.51 1.1E-05 47.5 18.9 123 175-334 82-205 (472)
381 KOG0290 Conserved WD40 repeat- 96.3 0.56 1.2E-05 45.3 18.0 237 6-333 74-319 (364)
382 PF15492 Nbas_N: Neuroblastoma 96.3 0.51 1.1E-05 45.7 18.0 58 36-105 3-62 (282)
383 PTZ00472 serine carboxypeptida 96.3 0.034 7.4E-07 60.1 11.6 64 665-731 364-458 (462)
384 KOG0310 Conserved WD40 repeat- 96.3 0.36 7.7E-06 49.9 17.8 222 32-371 70-298 (487)
385 PF05577 Peptidase_S28: Serine 96.2 0.02 4.3E-07 62.1 9.5 58 574-631 92-150 (434)
386 COG3391 Uncharacterized conser 96.2 0.68 1.5E-05 49.0 20.8 205 33-333 76-284 (381)
387 COG3490 Uncharacterized protei 96.2 1.3 2.8E-05 42.9 21.5 135 158-314 92-241 (366)
388 KOG3975 Uncharacterized conser 96.2 0.33 7.2E-06 45.9 15.6 52 576-628 93-146 (301)
389 KOG0640 mRNA cleavage stimulat 96.1 0.13 2.8E-06 49.7 13.1 191 31-317 217-415 (430)
390 COG3391 Uncharacterized conser 96.1 1.7 3.7E-05 46.1 23.3 214 177-466 76-289 (381)
391 KOG0641 WD40 repeat protein [G 96.1 1.1 2.4E-05 41.4 19.9 138 157-334 163-305 (350)
392 KOG1009 Chromatin assembly com 96.1 0.094 2E-06 52.7 12.1 60 175-260 124-183 (434)
393 KOG2321 WD40 repeat protein [G 96.0 0.81 1.8E-05 48.4 19.0 40 300-345 230-269 (703)
394 PF06977 SdiA-regulated: SdiA- 96.0 0.36 7.7E-06 47.2 15.9 58 32-102 23-80 (248)
395 KOG0299 U3 snoRNP-associated p 95.9 0.52 1.1E-05 48.4 16.8 58 33-104 205-262 (479)
396 COG3204 Uncharacterized protei 95.9 1.1 2.4E-05 43.8 18.1 18 88-105 87-104 (316)
397 KOG1539 WD repeat protein [Gen 95.9 0.36 7.7E-06 53.3 16.5 91 209-330 556-646 (910)
398 KOG0268 Sof1-like rRNA process 95.8 0.017 3.7E-07 56.9 5.9 122 5-195 210-336 (433)
399 KOG4388 Hormone-sensitive lipa 95.8 0.02 4.4E-07 60.1 6.7 68 666-737 788-859 (880)
400 KOG4378 Nuclear protein COP1 [ 95.8 1.4 3.1E-05 45.8 19.2 135 278-461 145-281 (673)
401 KOG0647 mRNA export protein (c 95.7 2.2 4.7E-05 41.6 24.8 73 157-255 52-127 (347)
402 KOG0292 Vesicle coat complex C 95.7 4.1 8.8E-05 45.8 23.4 251 158-457 116-396 (1202)
403 PRK13614 lipoprotein LpqB; Pro 95.7 1.1 2.4E-05 49.4 19.5 55 32-103 344-398 (573)
404 KOG0269 WD40 repeat-containing 95.6 0.31 6.7E-06 53.3 14.7 165 157-371 110-282 (839)
405 KOG4328 WD40 protein [Function 95.6 3 6.6E-05 43.0 20.5 170 157-371 210-388 (498)
406 KOG1036 Mitotic spindle checkp 95.5 2.7 5.8E-05 41.2 24.4 120 20-218 4-125 (323)
407 KOG4389 Acetylcholinesterase/B 95.5 0.031 6.7E-07 57.8 6.5 119 495-630 121-256 (601)
408 KOG2100 Dipeptidyl aminopeptid 95.4 2.2 4.8E-05 49.3 22.0 78 247-334 345-424 (755)
409 KOG1551 Uncharacterized conser 95.4 0.18 3.9E-06 47.9 10.6 68 658-732 294-366 (371)
410 KOG4283 Transcription-coupled 95.4 1.9 4.2E-05 41.8 17.3 119 175-332 102-221 (397)
411 KOG1408 WD40 repeat protein [F 95.4 0.41 8.9E-06 51.8 14.3 79 278-371 620-702 (1080)
412 PRK13614 lipoprotein LpqB; Pro 95.3 0.84 1.8E-05 50.3 17.1 143 158-333 366-519 (573)
413 COG3946 VirJ Type IV secretory 95.3 0.061 1.3E-06 54.5 7.6 167 544-729 276-446 (456)
414 TIGR02604 Piru_Ver_Nterm putat 95.2 0.67 1.5E-05 48.9 15.8 69 177-261 126-201 (367)
415 PRK13613 lipoprotein LpqB; Pro 95.2 2.7 5.8E-05 47.0 20.7 103 32-195 364-475 (599)
416 KOG3967 Uncharacterized conser 95.1 0.19 4.1E-06 46.0 9.6 104 513-618 101-214 (297)
417 PRK13615 lipoprotein LpqB; Pro 94.9 1.6 3.4E-05 48.0 17.9 145 33-261 336-487 (557)
418 KOG2041 WD40 repeat protein [G 94.9 0.55 1.2E-05 50.9 13.7 58 33-105 118-175 (1189)
419 KOG2521 Uncharacterized conser 94.8 1 2.2E-05 45.9 15.1 68 665-734 225-292 (350)
420 PRK13615 lipoprotein LpqB; Pro 94.7 4.9 0.00011 44.3 20.9 142 158-334 356-504 (557)
421 PF11187 DUF2974: Protein of u 94.6 0.052 1.1E-06 52.2 5.1 50 579-628 69-122 (224)
422 PF06977 SdiA-regulated: SdiA- 94.6 4.9 0.00011 39.3 19.0 123 169-331 15-148 (248)
423 KOG0294 WD40 repeat-containing 94.6 5.1 0.00011 39.5 18.7 73 157-255 107-182 (362)
424 KOG1214 Nidogen and related ba 94.6 2 4.3E-05 47.6 17.0 163 158-371 1048-1214(1289)
425 KOG2565 Predicted hydrolases o 94.5 0.12 2.7E-06 51.6 7.4 117 490-628 131-263 (469)
426 PRK10252 entF enterobactin syn 94.5 0.38 8.2E-06 60.5 13.8 35 594-628 1133-1170(1296)
427 PLN02606 palmitoyl-protein thi 94.5 0.31 6.7E-06 48.3 10.0 55 574-629 76-132 (306)
428 TIGR02604 Piru_Ver_Nterm putat 94.3 0.83 1.8E-05 48.2 13.7 130 158-314 48-199 (367)
429 KOG1520 Predicted alkaloid syn 94.2 1.8 4E-05 44.1 14.9 117 36-195 120-239 (376)
430 PF05057 DUF676: Putative seri 94.1 0.044 9.5E-07 52.8 3.5 20 594-613 78-97 (217)
431 KOG0650 WD40 repeat nucleolar 94.1 1.4 3E-05 47.0 14.2 103 244-371 524-626 (733)
432 KOG0641 WD40 repeat protein [G 94.0 5.2 0.00011 37.1 22.5 74 23-105 25-108 (350)
433 KOG2183 Prolylcarboxypeptidase 93.9 0.052 1.1E-06 55.0 3.5 55 574-628 147-201 (492)
434 KOG2182 Hydrolytic enzymes of 93.9 0.46 1E-05 49.9 10.3 127 490-629 66-207 (514)
435 PF13360 PQQ_2: PQQ-like domai 93.8 7 0.00015 38.0 20.9 53 157-223 46-101 (238)
436 PF07995 GSDH: Glucose / Sorbo 93.8 2.8 6.1E-05 43.4 16.3 81 244-330 116-212 (331)
437 cd00741 Lipase Lipase. Lipase 93.8 0.14 3.1E-06 46.2 6.0 36 593-628 27-66 (153)
438 COG4287 PqaA PhoPQ-activated p 93.8 0.38 8.1E-06 48.1 9.0 146 577-732 216-387 (507)
439 PF07519 Tannase: Tannase and 93.8 0.58 1.2E-05 50.8 11.5 66 665-732 353-427 (474)
440 PLN02633 palmitoyl protein thi 93.7 0.59 1.3E-05 46.5 10.3 55 574-629 75-131 (314)
441 PF01764 Lipase_3: Lipase (cla 93.7 0.14 3.1E-06 45.4 5.8 50 577-628 49-105 (140)
442 KOG1034 Transcriptional repres 93.7 1.9 4.2E-05 42.5 13.4 67 28-104 133-199 (385)
443 PF02450 LCAT: Lecithin:choles 93.6 0.18 3.9E-06 53.4 7.3 81 546-630 69-161 (389)
444 TIGR03606 non_repeat_PQQ dehyd 93.6 3.4 7.3E-05 44.3 16.5 57 32-98 31-90 (454)
445 KOG2106 Uncharacterized conser 93.6 11 0.00024 39.6 26.2 57 33-101 203-261 (626)
446 KOG0294 WD40 repeat-containing 93.5 8.3 0.00018 38.0 18.6 130 157-331 63-198 (362)
447 KOG1007 WD repeat protein TSSC 93.5 2.3 5E-05 41.2 13.4 195 32-317 65-277 (370)
448 KOG1214 Nidogen and related ba 93.5 4.5 9.7E-05 45.0 17.1 178 146-371 988-1174(1289)
449 KOG0285 Pleiotropic regulator 93.4 7.9 0.00017 38.8 17.2 181 32-317 153-338 (460)
450 COG1075 LipA Predicted acetylt 93.4 0.22 4.7E-06 51.6 7.3 50 578-629 113-164 (336)
451 TIGR03712 acc_sec_asp2 accesso 93.4 5.9 0.00013 41.9 17.2 106 499-630 281-391 (511)
452 PF13449 Phytase-like: Esteras 93.3 11 0.00025 38.8 21.2 122 209-333 112-252 (326)
453 KOG0299 U3 snoRNP-associated p 93.3 4.2 9.1E-05 42.1 15.5 112 168-309 135-255 (479)
454 COG3319 Thioesterase domains o 93.2 0.32 7E-06 47.6 7.6 51 577-630 50-104 (257)
455 KOG0313 Microtubule binding pr 93.2 2.8 6.1E-05 42.2 13.9 74 157-255 281-359 (423)
456 KOG3621 WD40 repeat-containing 93.0 8.2 0.00018 42.4 18.1 60 33-105 36-95 (726)
457 PRK13613 lipoprotein LpqB; Pro 93.0 3.8 8.2E-05 45.8 16.4 145 157-334 385-541 (599)
458 KOG2321 WD40 repeat protein [G 92.8 16 0.00035 39.2 19.9 182 277-503 156-344 (703)
459 PF00450 Peptidase_S10: Serine 92.6 0.64 1.4E-05 50.1 9.9 63 665-730 330-414 (415)
460 KOG0646 WD40 repeat protein [G 92.4 5.2 0.00011 41.5 14.9 39 157-195 103-144 (476)
461 KOG0270 WD40 repeat-containing 92.3 12 0.00026 38.7 17.3 138 157-333 266-407 (463)
462 PF03088 Str_synth: Strictosid 92.3 0.72 1.6E-05 36.7 7.1 81 246-333 2-88 (89)
463 COG4257 Vgb Streptogramin lyas 92.3 12 0.00025 36.5 20.5 235 167-496 54-292 (353)
464 PF04053 Coatomer_WDAD: Coatom 92.2 1.1 2.5E-05 47.9 10.8 76 210-317 127-213 (443)
465 KOG0269 WD40 repeat-containing 92.0 2 4.3E-05 47.3 12.1 169 5-261 110-283 (839)
466 TIGR03300 assembly_YfgL outer 91.7 20 0.00043 37.9 30.5 81 396-492 290-371 (377)
467 PF03088 Str_synth: Strictosid 91.6 1.2 2.5E-05 35.6 7.5 41 155-195 35-77 (89)
468 KOG0646 WD40 repeat protein [G 91.5 8.4 0.00018 40.0 15.2 61 170-255 77-137 (476)
469 PF13360 PQQ_2: PQQ-like domai 91.2 15 0.00033 35.6 20.5 51 397-465 185-235 (238)
470 TIGR03606 non_repeat_PQQ dehyd 91.2 17 0.00037 39.0 18.1 37 432-469 347-386 (454)
471 KOG2041 WD40 repeat protein [G 91.2 23 0.00049 39.2 18.5 25 294-318 254-278 (1189)
472 PF11288 DUF3089: Protein of u 91.0 0.46 1E-05 44.6 5.6 40 575-615 77-116 (207)
473 PF13449 Phytase-like: Esteras 91.0 14 0.0003 38.2 17.0 46 62-107 112-167 (326)
474 KOG1408 WD40 repeat protein [F 90.9 1.4 3E-05 48.0 9.4 61 244-317 81-141 (1080)
475 KOG0321 WD40 repeat-containing 90.8 12 0.00027 40.5 16.2 268 20-405 89-393 (720)
476 PF07519 Tannase: Tannase and 90.6 0.35 7.7E-06 52.4 5.1 52 577-628 97-149 (474)
477 KOG0308 Conserved WD40 repeat- 90.5 13 0.00028 40.5 16.0 49 278-332 195-243 (735)
478 PLN02454 triacylglycerol lipas 90.5 0.49 1.1E-05 49.3 5.7 41 574-614 208-248 (414)
479 PF03283 PAE: Pectinacetyleste 90.4 0.38 8.3E-06 50.0 5.0 37 577-613 139-175 (361)
480 PF11768 DUF3312: Protein of u 90.4 20 0.00043 38.8 17.4 144 157-334 184-331 (545)
481 PF15525 DUF4652: Domain of un 90.0 5.4 0.00012 36.2 10.9 64 39-105 66-130 (200)
482 PF02089 Palm_thioest: Palmito 89.9 1.5 3.2E-05 43.3 8.2 53 576-629 63-116 (279)
483 KOG4547 WD40 repeat-containing 89.7 20 0.00044 38.5 16.6 137 157-337 80-227 (541)
484 KOG0302 Ribosome Assembly prot 89.7 2.9 6.3E-05 42.1 9.9 137 3-217 232-378 (440)
485 cd00519 Lipase_3 Lipase (class 89.6 0.69 1.5E-05 45.1 5.9 50 577-628 113-167 (229)
486 KOG3724 Negative regulator of 89.2 0.42 9.1E-06 52.8 4.2 52 577-628 158-219 (973)
487 COG3204 Uncharacterized protei 88.9 25 0.00055 34.7 17.1 59 32-103 87-145 (316)
488 PLN02408 phospholipase A1 88.9 0.71 1.5E-05 47.5 5.4 40 575-614 181-220 (365)
489 KOG2111 Uncharacterized conser 88.9 22 0.00047 35.3 14.9 72 244-331 184-257 (346)
490 KOG1036 Mitotic spindle checkp 88.6 27 0.00058 34.5 20.1 87 299-406 233-319 (323)
491 PLN02571 triacylglycerol lipas 88.5 0.73 1.6E-05 48.1 5.3 41 574-614 206-246 (413)
492 KOG0290 Conserved WD40 repeat- 88.4 27 0.00058 34.3 20.6 71 23-100 37-110 (364)
493 PF03022 MRJP: Major royal jel 88.3 22 0.00047 35.9 15.6 64 5-80 34-106 (287)
494 PF07995 GSDH: Glucose / Sorbo 88.3 11 0.00023 39.1 13.8 42 177-219 116-158 (331)
495 KOG1520 Predicted alkaloid syn 88.1 6.6 0.00014 40.2 11.4 130 175-333 115-250 (376)
496 PF05787 DUF839: Bacterial pro 88.0 7.8 0.00017 42.7 13.0 37 159-195 482-522 (524)
497 PLN02324 triacylglycerol lipas 87.9 0.82 1.8E-05 47.6 5.2 41 574-614 195-235 (415)
498 KOG1912 WD40 repeat protein [G 87.7 34 0.00074 38.4 17.0 51 32-98 17-67 (1062)
499 PF06259 Abhydrolase_8: Alpha/ 87.6 1.5 3.2E-05 40.3 6.1 50 577-627 93-142 (177)
500 KOG4532 WD40-like repeat conta 87.6 25 0.00054 34.1 14.1 134 299-471 159-293 (344)
No 1
>COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]
Probab=100.00 E-value=1e-41 Score=378.72 Aligned_cols=566 Identities=21% Similarity=0.218 Sum_probs=374.2
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.+..|..+|+|+.++|+...- +.........+|+.+... .+.++... ....+.|||||+.++|.... +.
T Consensus 14 ~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~d~~~--~~~~~~~~------~~~~~~~spdg~~~~~~~~~--~~ 82 (620)
T COG1506 14 RVSDPRVSPPGGRLAYILTGL-DFLKPLYKSSLWVSDGKT--VRLLTFGG------GVSELRWSPDGSVLAFVSTD--GG 82 (620)
T ss_pred cccCcccCCCCceeEEeeccc-cccccccccceEEEeccc--ccccccCC------cccccccCCCCCEEEEEecc--CC
Confidence 577899999999999998631 122235568899977554 44454444 25678999999999998521 11
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCC-ceeeeeccCCCCceEE
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTP-AVYTAVEPSPDQKYVL 190 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~-~~~~~~~~SpDG~~i~ 190 (744)
...++|+++.+| .+... ..+....|+|+|++++
T Consensus 83 -------------------------------------------~~~~l~l~~~~g---~~~~~~~~v~~~~~~~~g~~~~ 116 (620)
T COG1506 83 -------------------------------------------RVAQLYLVDVGG---LITKTAFGVSDARWSPDGDRIA 116 (620)
T ss_pred -------------------------------------------CcceEEEEecCC---ceeeeecccccceeCCCCCeEE
Confidence 016899998875 23223 5677889999999999
Q ss_pred EEEeeCCcccccc--------------cCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCce
Q 004574 191 ITSMHRPYSYKVP--------------CARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPST 256 (744)
Q Consensus 191 ~~~~~~~~~~~~~--------------~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~ 256 (744)
+............ .+.....+++++.++ ....+...+ ..+..+.+.++++.
T Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~l~~~d~~~-~~~~~~~~~-------------~~~~~~~~~~~~~~- 181 (620)
T COG1506 117 FLTAEGASKRDGGDHLFVDRLPVWFDGRGGERSDLYVVDIES-KLIKLGLGN-------------LDVVSFATDGDGRL- 181 (620)
T ss_pred EEecccccccCCceeeeecccceeecCCCCcccceEEEccCc-ccccccCCC-------------CceeeeeeCCCCce-
Confidence 9433221100000 000112344444443 111111111 11334455555554
Q ss_pred EEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeec----cceeEEEEcCC
Q 004574 257 LYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKT----SQTRTWLVCPG 332 (744)
Q Consensus 257 l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~----~~~~l~~~~~~ 332 (744)
++.+........ .....++... .++....++.....+..+.|.+||+.+++...... ....+++.+..
T Consensus 182 ~~~~~~~~~~~~-----~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~gk~~~~~~~~~~~~~~~~~~~~~~~~~ 253 (620)
T COG1506 182 VASIRLDDDADP-----WVTNLYVLIE---GNGELESLTPGEGSISKLAFDADGKSIALLGTESDRGLAEGDFILLLDGE 253 (620)
T ss_pred eEEeeeccccCC-----ceEeeEEEec---CCCceEEEcCCCceeeeeeeCCCCCeeEEeccCCccCccccceEEEEecc
Confidence 444433222111 1112233222 35667777777788999999999998888763332 22345555522
Q ss_pred CCCCccee-eeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEe
Q 004574 333 SKDVAPRV-LFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIW 411 (744)
Q Consensus 333 ~~~~~~~~-l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~ 411 (744)
.+ .... +...+ .. .+.....+.-++..++|..... .....++..+..++... +.
T Consensus 254 ~~--~~d~~~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~~-----------------~g~~~l~~~~~~~~~~~-~~ 308 (620)
T COG1506 254 LG--EVDGDLSSGD--DT---RGAWAVEGGLDGDGLLFIATDG-----------------GGSSPLFRVDDLGGGVE-GL 308 (620)
T ss_pred cc--ccceeeccCC--cc---cCcHHhccccCCCcEEEEEecC-----------------CCceEEEEEeccCCcee-ee
Confidence 21 1111 11111 00 0100011223455555554220 11222444443333322 22
Q ss_pred eccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecCCC-CCCCcCCCceEEEEEEcC
Q 004574 412 ESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNFPH-PYPTLASLQKEMIKYQRK 490 (744)
Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~~~-~~~~~~~~~~~~i~~~~~ 490 (744)
..+.. ....++.+++.+++..++...|+++|+++. ++..+++..+. ........+++.+++...
T Consensus 309 ~~~~~-------------~v~~f~~~~~~~~~~~s~~~~p~~i~~~~~--~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 373 (620)
T COG1506 309 SGDDG-------------GVPGFDVDGRKLALAYSSPTEPPEIYLYDR--GEEAKLTSSNNSGLKKVKLAEPEPVTYKSN 373 (620)
T ss_pred cCCCc-------------eEEEEeeCCCEEEEEecCCCCccceEEEcC--CCceEEeecccccccccccCCceEEEEEcC
Confidence 22211 112455589999999999999999999987 55555555443 455677789999999999
Q ss_pred CCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCC
Q 004574 491 DGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEG 570 (744)
Q Consensus 491 ~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g 570 (744)
+|.++++++++|+++++.+ ++|+||++|||+... +........+.|+++||+|+.+++++..|+|
T Consensus 374 dG~~i~~~l~~P~~~~~~k--~yP~i~~~hGGP~~~-------------~~~~~~~~~q~~~~~G~~V~~~n~RGS~GyG 438 (620)
T COG1506 374 DGETIHGWLYKPPGFDPRK--KYPLIVYIHGGPSAQ-------------VGYSFNPEIQVLASAGYAVLAPNYRGSTGYG 438 (620)
T ss_pred CCCEEEEEEecCCCCCCCC--CCCEEEEeCCCCccc-------------cccccchhhHHHhcCCeEEEEeCCCCCCccH
Confidence 9999999999999987655 399999999997321 1112335678899999999999998888876
Q ss_pred CC-----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCccc
Q 004574 571 DK-----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQ 639 (744)
Q Consensus 571 ~~-----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~ 639 (744)
.. ...+|+.++++++.+++.+|++||+|+|+|+||+|+++++.+.| .|+++++..+.+++........
T Consensus 439 ~~F~~~~~~~~g~~~~~D~~~~~~~l~~~~~~d~~ri~i~G~SyGGymtl~~~~~~~-~f~a~~~~~~~~~~~~~~~~~~ 517 (620)
T COG1506 439 REFADAIRGDWGGVDLEDLIAAVDALVKLPLVDPERIGITGGSYGGYMTLLAATKTP-RFKAAVAVAGGVDWLLYFGEST 517 (620)
T ss_pred HHHHHhhhhccCCccHHHHHHHHHHHHhCCCcChHHeEEeccChHHHHHHHHHhcCc-hhheEEeccCcchhhhhccccc
Confidence 65 23459999999999999999999999999999999999999996 8888888888766543322211
Q ss_pred cc--------ccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcc
Q 004574 640 TE--------FRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHV 711 (744)
Q Consensus 640 ~~--------~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~ 711 (744)
.. ...++.+.+.|.+.||+.+++++++|+|+|||++|..|| .+++++++++|+..|+++++++||+++|.
T Consensus 518 ~~~~~~~~~~~~~~~~~~~~~~~~sp~~~~~~i~~P~LliHG~~D~~v~--~~q~~~~~~aL~~~g~~~~~~~~p~e~H~ 595 (620)
T COG1506 518 EGLRFDPEENGGGPPEDREKYEDRSPIFYADNIKTPLLLIHGEEDDRVP--IEQAEQLVDALKRKGKPVELVVFPDEGHG 595 (620)
T ss_pred hhhcCCHHHhCCCcccChHHHHhcChhhhhcccCCCEEEEeecCCccCC--hHHHHHHHHHHHHcCceEEEEEeCCCCcC
Confidence 11 111111567899999999999999999999999999999 99999999999999999999999999999
Q ss_pred cCccccHHHHHHHHHHHHHHhccC
Q 004574 712 YAARENVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 712 ~~~~~~~~~~~~~~~~fl~~~l~~ 735 (744)
+...++..+.++.+++||.++++.
T Consensus 596 ~~~~~~~~~~~~~~~~~~~~~~~~ 619 (620)
T COG1506 596 FSRPENRVKVLKEILDWFKRHLKQ 619 (620)
T ss_pred CCCchhHHHHHHHHHHHHHHHhcC
Confidence 998788899999999999999864
No 2
>PRK10115 protease 2; Provisional
Probab=100.00 E-value=1.4e-37 Score=346.92 Aligned_cols=519 Identities=13% Similarity=0.058 Sum_probs=343.9
Q ss_pred ccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCC
Q 004574 89 FGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGT 167 (744)
Q Consensus 89 ~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~ 167 (744)
+..+.|||||++|+|.....+.+ ..+|+++++ +|+
T Consensus 129 l~~~~~Spdg~~la~~~d~~G~E--------------------------------------------~~~l~v~d~~tg~ 164 (686)
T PRK10115 129 LGGMAITPDNTIMALAEDFLSRR--------------------------------------------QYGIRFRNLETGN 164 (686)
T ss_pred EeEEEECCCCCEEEEEecCCCcE--------------------------------------------EEEEEEEECCCCC
Confidence 45789999999999986543322 168999999 775
Q ss_pred --eeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC--eeeeccCCCCCCCCCcccCCccCC
Q 004574 168 --AKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK--LVRELCDLPPAEDIPVCYNSVREG 243 (744)
Q Consensus 168 --~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~--~~~~l~~~~~~~~~~~~~~~~~~~ 243 (744)
...+.... ..++|++||+.|+|+..... .....++|++++.+. +.+.|.......
T Consensus 165 ~l~~~i~~~~--~~~~w~~D~~~~~y~~~~~~-------~~~~~~v~~h~lgt~~~~d~lv~~e~~~~------------ 223 (686)
T PRK10115 165 WYPELLDNVE--PSFVWANDSWTFYYVRKHPV-------TLLPYQVWRHTIGTPASQDELVYEEKDDT------------ 223 (686)
T ss_pred CCCccccCcc--eEEEEeeCCCEEEEEEecCC-------CCCCCEEEEEECCCChhHCeEEEeeCCCC------------
Confidence 33332222 45899999999999987531 012368999999877 444454422110
Q ss_pred CCccce-ecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 244 MRSISW-RADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 244 ~~~~~~-spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
..-..+ +.|++. ++ +...+... ..+++++.+ ...++...+......... .....+..+++.++....
T Consensus 224 ~~~~~~~s~d~~~-l~-i~~~~~~~--------~~~~l~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~ly~~tn~~~~ 291 (686)
T PRK10115 224 FYVSLHKTTSKHY-VV-IHLASATT--------SEVLLLDAE-LADAEPFVFLPRRKDHEY-SLDHYQHRFYLRSNRHGK 291 (686)
T ss_pred EEEEEEEcCCCCE-EE-EEEECCcc--------ccEEEEECc-CCCCCceEEEECCCCCEE-EEEeCCCEEEEEEcCCCC
Confidence 111134 448887 44 33222221 125555531 123444444433322211 122334566666655456
Q ss_pred ceeEEEEcCCCCCCcceeeeccccc-cccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEe
Q 004574 323 QTRTWLVCPGSKDVAPRVLFDRVFE-NVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFD 401 (744)
Q Consensus 323 ~~~l~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d 401 (744)
...|..+++... .+.+.+...... .+.. +.++ +++|++.... .....|+++|
T Consensus 292 ~~~l~~~~~~~~-~~~~~l~~~~~~~~i~~------~~~~--~~~l~~~~~~------------------~g~~~l~~~~ 344 (686)
T PRK10115 292 NFGLYRTRVRDE-QQWEELIPPRENIMLEG------FTLF--TDWLVVEERQ------------------RGLTSLRQIN 344 (686)
T ss_pred CceEEEecCCCc-ccCeEEECCCCCCEEEE------EEEE--CCEEEEEEEe------------------CCEEEEEEEc
Confidence 678999888742 333444433211 1111 2233 4456555432 2344588888
Q ss_pred cCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecCCCCCCCcCCCc
Q 004574 402 INTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNFPHPYPTLASLQ 481 (744)
Q Consensus 402 ~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~~~~~~~~~~~~ 481 (744)
..+++.+.|...... ...... .+.+++++.+++..++...|+++|.+|+.+++.+.|+..+.+........
T Consensus 345 ~~~~~~~~l~~~~~~-----~~~~~~----~~~~~~~~~~~~~~ss~~~P~~~y~~d~~~~~~~~l~~~~~~~~~~~~~~ 415 (686)
T PRK10115 345 RKTREVIGIAFDDPA-----YVTWIA----YNPEPETSRLRYGYSSMTTPDTLFELDMDTGERRVLKQTEVPGFDAANYR 415 (686)
T ss_pred CCCCceEEecCCCCc-----eEeeec----ccCCCCCceEEEEEecCCCCCEEEEEECCCCcEEEEEecCCCCcCccccE
Confidence 876666655422111 111110 12336778899999999999999999999998888887653322223568
Q ss_pred eEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEec
Q 004574 482 KEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAG 561 (744)
Q Consensus 482 ~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~ 561 (744)
++.+++.+.||.+++++++++++... .++.|+||++||++.... ...+....+.|+++||+|+.+
T Consensus 416 ~e~v~~~s~DG~~Ip~~l~~~~~~~~--~~~~P~ll~~hGg~~~~~-------------~p~f~~~~~~l~~rG~~v~~~ 480 (686)
T PRK10115 416 SEHLWITARDGVEVPVSLVYHRKHFR--KGHNPLLVYGYGSYGASI-------------DADFSFSRLSLLDRGFVYAIV 480 (686)
T ss_pred EEEEEEECCCCCEEEEEEEEECCCCC--CCCCCEEEEEECCCCCCC-------------CCCccHHHHHHHHCCcEEEEE
Confidence 99999999999999997776554322 235799999999863321 111224456889999999998
Q ss_pred CCCCCCCCCCC-----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCC
Q 004574 562 PSIPIIGEGDK-----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYN 630 (744)
Q Consensus 562 ~~~~~~g~g~~-----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~ 630 (744)
+.++..|+|.. ...+|+.+++++|.+++++|++|++++|.|+||+++.+++.++|++|+|+|+..|++|
T Consensus 481 n~RGs~g~G~~w~~~g~~~~k~~~~~D~~a~~~~Lv~~g~~d~~rl~i~G~S~GG~l~~~~~~~~Pdlf~A~v~~vp~~D 560 (686)
T PRK10115 481 HVRGGGELGQQWYEDGKFLKKKNTFNDYLDACDALLKLGYGSPSLCYGMGGSAGGMLMGVAINQRPELFHGVIAQVPFVD 560 (686)
T ss_pred EcCCCCccCHHHHHhhhhhcCCCcHHHHHHHHHHHHHcCCCChHHeEEEEECHHHHHHHHHHhcChhheeEEEecCCchh
Confidence 88888887764 3456999999999999999999999999999999999999999999999999999988
Q ss_pred CCCC------CCc--ccccccchhhc--HHHHHhcCcccccCCCCCC-EEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCc
Q 004574 631 KTLT------PFG--FQTEFRTLWEA--TNVYIEMSPITHANKIKKP-ILIIHGEVDDKVGLFPMQAERFFDALKGHGAL 699 (744)
Q Consensus 631 ~~~~------~~~--~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~P-~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~ 699 (744)
.... +.. ...+.+.+++. .+.+.++||+.++.+++.| +||+||.+|.+|| +.++.+++.+|++.+.+
T Consensus 561 ~~~~~~~~~~p~~~~~~~e~G~p~~~~~~~~l~~~SP~~~v~~~~~P~lLi~~g~~D~RV~--~~~~~k~~a~Lr~~~~~ 638 (686)
T PRK10115 561 VVTTMLDESIPLTTGEFEEWGNPQDPQYYEYMKSYSPYDNVTAQAYPHLLVTTGLHDSQVQ--YWEPAKWVAKLRELKTD 638 (686)
T ss_pred HhhhcccCCCCCChhHHHHhCCCCCHHHHHHHHHcCchhccCccCCCceeEEecCCCCCcC--chHHHHHHHHHHhcCCC
Confidence 5421 111 11223445432 3345679999999999999 6777999999999 99999999999999998
Q ss_pred EEEEEe---CCCCcccCc-cccHHHHHHHHHHHHHHhccCCC
Q 004574 700 SRLVLL---PFEHHVYAA-RENVMHVIWETDRWLQKYCLSNT 737 (744)
Q Consensus 700 ~~~~~~---~~~~H~~~~-~~~~~~~~~~~~~fl~~~l~~~~ 737 (744)
++++++ +++||+... .....+.....+.||-..+....
T Consensus 639 ~~~vl~~~~~~~GHg~~~~r~~~~~~~A~~~aFl~~~~~~~~ 680 (686)
T PRK10115 639 DHLLLLCTDMDSGHGGKSGRFKSYEGVAMEYAFLIALAQGTL 680 (686)
T ss_pred CceEEEEecCCCCCCCCcCHHHHHHHHHHHHHHHHHHhCCcC
Confidence 888888 999998432 12223344456888888776543
No 3
>COG1770 PtrB Protease II [Amino acid transport and metabolism]
Probab=99.97 E-value=8.2e-26 Score=233.73 Aligned_cols=517 Identities=15% Similarity=0.093 Sum_probs=330.4
Q ss_pred ccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCC
Q 004574 89 FGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGT 167 (744)
Q Consensus 89 ~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~ 167 (744)
.+.+.-|||++.|+++.+..+.+ .-.|.+.|+ +|+
T Consensus 131 Lg~~~~s~D~~~la~s~D~~G~e--------------------------------------------~y~lr~kdL~tg~ 166 (682)
T COG1770 131 LGAASISPDHNLLAYSVDVLGDE--------------------------------------------QYTLRFKDLATGE 166 (682)
T ss_pred eeeeeeCCCCceEEEEEeccccc--------------------------------------------EEEEEEEeccccc
Confidence 45778899999999986532222 145777788 774
Q ss_pred eeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCC--CeeeeccCCCCCCCCCcccCCccCCCC
Q 004574 168 AKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDG--KLVRELCDLPPAEDIPVCYNSVREGMR 245 (744)
Q Consensus 168 ~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g--~~~~~l~~~~~~~~~~~~~~~~~~~~~ 245 (744)
...-...+...+.+|.+|++.++|+..+.. .-+.++|.-.+.+ ..-+.+...+-... .-
T Consensus 167 ~~~d~i~~~~~~~~Wa~d~~~lfYt~~d~~--------~rp~kv~~h~~gt~~~~d~lvyeE~d~~f-----------~~ 227 (682)
T COG1770 167 ELPDEITNTSGSFAWAADGKTLFYTRLDEN--------HRPDKVWRHRLGTPGSSDELVYEEKDDRF-----------FL 227 (682)
T ss_pred ccchhhcccccceEEecCCCeEEEEEEcCC--------CCcceEEEEecCCCCCcceEEEEcCCCcE-----------EE
Confidence 332222233557899999999999987642 1345788777766 44444433321110 00
Q ss_pred ccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeec--cCCceEEEeeeeeccc
Q 004574 246 SISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWC--DDSLALVNETWYKTSQ 323 (744)
Q Consensus 246 ~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~S--pDg~~l~~~~~~~~~~ 323 (744)
.+.-|-+.++ | +++.. ......+++++.+ ..+++++.+..... .+..+ .-|..++..+|....+
T Consensus 228 ~v~~s~s~~y-i-~i~~~--------~~~tsE~~ll~a~-~p~~~p~vv~pr~~---g~eY~~eh~~d~f~i~sN~~gkn 293 (682)
T COG1770 228 SVGRSRSEAY-I-VISLG--------SHITSEVRLLDAD-DPEAEPKVVLPREN---GVEYSVEHGGDRFYILSNADGKN 293 (682)
T ss_pred EeeeccCCce-E-EEEcC--------CCcceeEEEEecC-CCCCceEEEEEcCC---CcEEeeeecCcEEEEEecCCCcc
Confidence 1122333333 2 22211 1123457888773 23445555543321 12222 3377888888777667
Q ss_pred eeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecC
Q 004574 324 TRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDIN 403 (744)
Q Consensus 324 ~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~ 403 (744)
..|+...+.......+.+..+.... ..-.++.-..+|+... + ....+.|++++..
T Consensus 294 f~l~~ap~~~~~~~w~~~I~h~~~~-------~l~~~~~f~~~lVl~e------------R------~~glp~v~v~~~~ 348 (682)
T COG1770 294 FKLVRAPVSADKSNWRELIPHREDV-------RLEGVDLFADHLVLLE------------R------QEGLPRVVVRDRK 348 (682)
T ss_pred eEEEEccCCCChhcCeeeeccCCCc-------eeeeeeeeccEEEEEe------------c------ccCCceEEEEecC
Confidence 8888888722112223333322211 0111233333444432 1 2345679999999
Q ss_pred CCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecCCCCC-CCcCCCce
Q 004574 404 TGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNFPHPY-PTLASLQK 482 (744)
Q Consensus 404 ~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~~~~~-~~~~~~~~ 482 (744)
+++.+.|.-.+.. + ..... .+..++...|.|+.++.+.|.+++-+|+.+++.+.|...+.+- -+......
T Consensus 349 ~~~~~~i~f~~~a-y----~~~l~----~~~e~~s~~lR~~ysS~ttP~~~~~~dm~t~er~~LkqqeV~~g~dp~~Y~s 419 (682)
T COG1770 349 TGEERGIAFDDEA-Y----SAGLS----GNPEFDSDRLRYSYSSMTTPATLFDYDMATGERTLLKQQEVPGGFDPEDYVS 419 (682)
T ss_pred CCceeeEEecchh-h----hcccc----CCCCCCCccEEEEeecccccceeEEeeccCCcEEEEEeccCCCCCChhHeEE
Confidence 9988877665543 1 11111 2445677899999999999999999999999999998865443 33456789
Q ss_pred EEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecC
Q 004574 483 EMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGP 562 (744)
Q Consensus 483 ~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~ 562 (744)
++++..+.+|.+++..+++.+++ ...++.|++++.+|.+..+.+.. +......|+.+||+....+
T Consensus 420 ~riwa~a~dgv~VPVSLvyrkd~--~~~g~~p~lLygYGaYG~s~~p~-------------Fs~~~lSLlDRGfiyAIAH 484 (682)
T COG1770 420 RRIWATADDGVQVPVSLVYRKDT--KLDGSAPLLLYGYGAYGISMDPS-------------FSIARLSLLDRGFVYAIAH 484 (682)
T ss_pred EEEEEEcCCCcEeeEEEEEeccc--CCCCCCcEEEEEeccccccCCcC-------------cccceeeeecCceEEEEEE
Confidence 99999999999999999998774 33456899999999864433222 2233557789998654323
Q ss_pred CCCCCCCCCC-----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCC
Q 004574 563 SIPIIGEGDK-----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNK 631 (744)
Q Consensus 563 ~~~~~g~g~~-----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~ 631 (744)
-++....|.. ....|+.++.++|.++++.++++|+++|.|+||+++..++.+.|++|+++|+..|++|.
T Consensus 485 VRGGgelG~~WYe~GK~l~K~NTf~DFIa~a~~Lv~~g~~~~~~i~a~GGSAGGmLmGav~N~~P~lf~~iiA~VPFVDv 564 (682)
T COG1770 485 VRGGGELGRAWYEDGKLLNKKNTFTDFIAAARHLVKEGYTSPDRIVAIGGSAGGMLMGAVANMAPDLFAGIIAQVPFVDV 564 (682)
T ss_pred eecccccChHHHHhhhhhhccccHHHHHHHHHHHHHcCcCCccceEEeccCchhHHHHHHHhhChhhhhheeecCCccch
Confidence 2333222332 34459999999999999999999999999999999999999999999999999999873
Q ss_pred CCCC----Cccc----ccccchh--hcHHHHHhcCcccccCCCC-CCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCC---
Q 004574 632 TLTP----FGFQ----TEFRTLW--EATNVYIEMSPITHANKIK-KPILIIHGEVDDKVGLFPMQAERFFDALKGHG--- 697 (744)
Q Consensus 632 ~~~~----~~~~----~~~~~~~--~~~~~~~~~~~~~~~~~~~-~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~--- 697 (744)
..+- ..+. .+.+.|- +..+.+..+||..++..-. .|+|++.|.+|+.|. +.+..++..+|+..+
T Consensus 565 ltTMlD~slPLT~~E~~EWGNP~d~e~y~yikSYSPYdNV~a~~YP~ilv~~Gl~D~rV~--YwEpAKWvAkLR~~~td~ 642 (682)
T COG1770 565 LTTMLDPSLPLTVTEWDEWGNPLDPEYYDYIKSYSPYDNVEAQPYPAILVTTGLNDPRVQ--YWEPAKWVAKLRELKTDG 642 (682)
T ss_pred hhhhcCCCCCCCccchhhhCCcCCHHHHHHHhhcCchhccccCCCCceEEEccccCCccc--cchHHHHHHHHhhcccCC
Confidence 2211 1111 1222222 1123456779999988765 669999999999999 999999999999865
Q ss_pred CcEEEEEeCCCCccc-CccccHHHHHHHHHHHHHHhc
Q 004574 698 ALSRLVLLPFEHHVY-AARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 698 ~~~~~~~~~~~~H~~-~~~~~~~~~~~~~~~fl~~~l 733 (744)
.++-+.+=-++||+- +......+-...-..|+.+.+
T Consensus 643 ~plLlkt~M~aGHgG~SgRf~~lee~A~eYaF~l~~~ 679 (682)
T COG1770 643 NPLLLKTNMDAGHGGASGRFQRLEEIAFEYAFLLKLA 679 (682)
T ss_pred CcEEEEecccccCCCCCCchHHHHHHHHHHHHHhhhc
Confidence 346666656789963 333333333334466665544
No 4
>KOG2281 consensus Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Posttranslational modification, protein turnover, chaperones]
Probab=99.97 E-value=3.1e-27 Score=239.91 Aligned_cols=315 Identities=18% Similarity=0.233 Sum_probs=234.7
Q ss_pred CCCceEEEEecC-CCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecC
Q 004574 392 GNIPFLDLFDIN-TGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNF 470 (744)
Q Consensus 392 ~~~~~l~~~d~~-~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~ 470 (744)
....+||+.... .|+..+++...-. . .+.++-+=+.++...++...|+.+.++++.+++...+-..
T Consensus 527 PlE~hLyvvsye~~g~~~rlt~~g~s----h---------~~~l~~~~d~fv~~~~sv~sP~cv~~y~ls~~~~~~l~~q 593 (867)
T KOG2281|consen 527 PLEHHLYVVSYENPGEIARLTEPGYS----H---------SCELDQQCDHFVSYYSSVGSPPCVSLYSLSWPENDPLPKQ 593 (867)
T ss_pred CceeeEEEEEEecCCceeeccCCCcc----c---------chhhhhhhhhHhhhhhcCCCCceEEEEeccCCccCcccch
Confidence 456789998887 8898888865421 0 0123333344666666777788777777655443322211
Q ss_pred CC--------CCCCcCCCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCC
Q 004574 471 PH--------PYPTLASLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSG 542 (744)
Q Consensus 471 ~~--------~~~~~~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~ 542 (744)
.. ..+......+|.+.+....|..+.|.+|+|.++++.+ |+|++++++||+... +. .+.|.+
T Consensus 594 ~~~~~~l~~~~~~~Pdy~p~eif~fqs~tg~~lYgmiyKPhn~~pgk--kYptvl~VYGGP~VQ------lV--nnsfkg 663 (867)
T KOG2281|consen 594 VSFWAILVSGAPPPPDYVPPEIFSFQSKTGLTLYGMIYKPHNFQPGK--KYPTVLNVYGGPGVQ------LV--NNSFKG 663 (867)
T ss_pred hhHHHHHHhcCCCCCccCChhheeeecCCCcEEEEEEEccccCCCCC--CCceEEEEcCCCceE------Ee--eccccc
Confidence 11 1222233456888888888999999999999998876 499999999997442 22 245655
Q ss_pred CCchhHHHHHhCCeEEEecCCCCCCCCCCCC-----------hHHHHHHHHHHHHHcC-CCCCCcEEEEEechHHHHHHH
Q 004574 543 MTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-----------PNDSAEAAVEEVVRRG-VADPSRIAVGGHSYGAFMTAH 610 (744)
Q Consensus 543 ~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-----------~~~d~~~~~~~l~~~~-~~d~~~i~l~G~S~GG~~a~~ 610 (744)
....-...|+++||+|+..+.++..-.|... -.+|-.+.+++|.++. .+|.+||+|.|+|+||+++++
T Consensus 664 i~ylR~~~LaslGy~Vv~IDnRGS~hRGlkFE~~ik~kmGqVE~eDQVeglq~Laeq~gfidmdrV~vhGWSYGGYLSlm 743 (867)
T KOG2281|consen 664 IQYLRFCRLASLGYVVVFIDNRGSAHRGLKFESHIKKKMGQVEVEDQVEGLQMLAEQTGFIDMDRVGVHGWSYGGYLSLM 743 (867)
T ss_pred eehhhhhhhhhcceEEEEEcCCCccccchhhHHHHhhccCeeeehhhHHHHHHHHHhcCcccchheeEeccccccHHHHH
Confidence 5555567889999999985444433223221 1238899999999984 899999999999999999999
Q ss_pred HHHhCCCceeEEEEccCCCCCCCCCCcccccc-cchhhcHHHHHhcCcccccCCCC---CCEEEEeeCCCCCCCCCHHHH
Q 004574 611 LLAHAPHLFCCGIARSGSYNKTLTPFGFQTEF-RTLWEATNVYIEMSPITHANKIK---KPILIIHGEVDDKVGLFPMQA 686 (744)
Q Consensus 611 ~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~---~P~l~i~G~~D~~v~~~~~~~ 686 (744)
+++++|+.|+++|+.+|+.+|....-++.... ..|..+...|..-|...++.++. ..+|++||--|+.|. +-+.
T Consensus 744 ~L~~~P~IfrvAIAGapVT~W~~YDTgYTERYMg~P~~nE~gY~agSV~~~VeklpdepnRLlLvHGliDENVH--F~Ht 821 (867)
T KOG2281|consen 744 GLAQYPNIFRVAIAGAPVTDWRLYDTGYTERYMGYPDNNEHGYGAGSVAGHVEKLPDEPNRLLLVHGLIDENVH--FAHT 821 (867)
T ss_pred HhhcCcceeeEEeccCcceeeeeecccchhhhcCCCccchhcccchhHHHHHhhCCCCCceEEEEecccccchh--hhhH
Confidence 99999999999999999999887666654443 33445566777778777777763 459999999999998 9999
Q ss_pred HHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 687 ERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 687 ~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
-.+..+|-++|++.++++||+.-|.+...+....+-..++.|+.+
T Consensus 822 s~Lvs~lvkagKpyeL~IfP~ERHsiR~~es~~~yE~rll~FlQ~ 866 (867)
T KOG2281|consen 822 SRLVSALVKAGKPYELQIFPNERHSIRNPESGIYYEARLLHFLQE 866 (867)
T ss_pred HHHHHHHHhCCCceEEEEccccccccCCCccchhHHHHHHHHHhh
Confidence 999999999999999999999999998777777777788888875
No 5
>KOG2100 consensus Dipeptidyl aminopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=99.97 E-value=2.9e-26 Score=256.19 Aligned_cols=386 Identities=23% Similarity=0.253 Sum_probs=260.7
Q ss_pred eeeccCCceEEEeeeeecc-ceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEE
Q 004574 304 VSWCDDSLALVNETWYKTS-QTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYIL 382 (744)
Q Consensus 304 ~~~SpDg~~l~~~~~~~~~-~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~ 382 (744)
+.++.|+.+..+......+ ..++.......+ ...+.++.+...-. ..+.++.+.+.++|.....
T Consensus 345 ~~~~~d~~~~~~~~~~~~~~~~hi~~~~~~~~-~~~~~lt~g~w~v~------~i~~~~~~~~~i~f~~~~~-------- 409 (755)
T KOG2100|consen 345 PVFSSDGSSYLKVDSVSDGGYNHIAYLKLSNG-SEPRMLTSGNWEVT------SILGYDKDSNRIYFDAYEE-------- 409 (755)
T ss_pred ceEeecCCceeEEEeeccCCEEEEEEEEcCCC-CccccccccceEEE------EeccccCCCceEEEEecCC--------
Confidence 5677887555444323323 567776666553 24444554433211 1123556777777776331
Q ss_pred EccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCce-EEEEECCC
Q 004574 383 LNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQ-YHILSWPL 461 (744)
Q Consensus 383 ~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~-i~~~~~~~ 461 (744)
....++|+.+++.++..+.++-.... +....+ ...+++..+.++.....+..|-. +-+.+...
T Consensus 410 --------~~~~~~ly~i~~~~~~~~~lt~~~~~---~~~~~~-----~~~~~~~~~~~v~~~~gP~~p~~~~~~~~~~~ 473 (755)
T KOG2100|consen 410 --------DPSERHLYSISLGSGTVESLTCSLIT---GPCTYL-----SVSFSKSAKYYVLSCSGPKVPDGQLTRHSSKN 473 (755)
T ss_pred --------CCCceEEEEEEccccccccccccCCC---CcceEE-----EEecCCcccEEEEEccCCCCCcceeecccccc
Confidence 23467899999988877765543321 111111 13566666777776665554421 22222111
Q ss_pred Cc-eeeeecCCCCCCC----cCCCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCC
Q 004574 462 KK-SSQITNFPHPYPT----LASLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGS 536 (744)
Q Consensus 462 g~-~~~lt~~~~~~~~----~~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~ 536 (744)
.+ ...| ..+..... ......+...+.. +|...++.+.+|+++.+++ ++|++|..|||+.+. .++.
T Consensus 474 ~~~~~~L-e~n~~~~~~~~~~~~p~~~~~~i~~-~~~~~~~~~~lP~~~~~~~--kyPllv~~yGGP~sq-----~v~~- 543 (755)
T KOG2100|consen 474 SKTIVVL-ETNEELKKTIENVALPIVEFGKIEI-DGITANAILILPPNFDPSK--KYPLLVVVYGGPGSQ-----SVTS- 543 (755)
T ss_pred ceEEEEe-ccChhhHHHhhcccCCcceeEEEEe-ccEEEEEEEecCCCCCCCC--CCCEEEEecCCCCcc-----eeee-
Confidence 11 1122 22222111 1112233333332 7889999999999987766 699999999997421 1111
Q ss_pred CCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC-----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHH
Q 004574 537 PNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGA 605 (744)
Q Consensus 537 ~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG 605 (744)
.+. .......+...|++|+..++++..++|... ..+|...++.++.+++.+|.+||+|+|+|+||
T Consensus 544 --~~~--~~~~~~~~s~~g~~v~~vd~RGs~~~G~~~~~~~~~~lG~~ev~D~~~~~~~~~~~~~iD~~ri~i~GwSyGG 619 (755)
T KOG2100|consen 544 --KFS--VDWNEVVVSSRGFAVLQVDGRGSGGYGWDFRSALPRNLGDVEVKDQIEAVKKVLKLPFIDRSRVAIWGWSYGG 619 (755)
T ss_pred --eEE--ecHHHHhhccCCeEEEEEcCCCcCCcchhHHHHhhhhcCCcchHHHHHHHHHHHhcccccHHHeEEeccChHH
Confidence 111 112233456899999997777777766652 23489999999999999999999999999999
Q ss_pred HHHHHHHHhCC-CceeEEEEccCCCCCCCCCCcccccc-cchhhcHHHHHhcCcccccCCCCCCE-EEEeeCCCCCCCCC
Q 004574 606 FMTAHLLAHAP-HLFCCGIARSGSYNKTLTPFGFQTEF-RTLWEATNVYIEMSPITHANKIKKPI-LIIHGEVDDKVGLF 682 (744)
Q Consensus 606 ~~a~~~~~~~p-~~~~~~v~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~P~-l~i~G~~D~~v~~~ 682 (744)
++++.++.++| +.|+|+++++|++++.+..-.+.... ..+.++...|.+.++..+++.++.|. |++||+.|..|+
T Consensus 620 y~t~~~l~~~~~~~fkcgvavaPVtd~~~yds~~terymg~p~~~~~~y~e~~~~~~~~~~~~~~~LliHGt~DdnVh-- 697 (755)
T KOG2100|consen 620 YLTLKLLESDPGDVFKCGVAVAPVTDWLYYDSTYTERYMGLPSENDKGYEESSVSSPANNIKTPKLLLIHGTEDDNVH-- 697 (755)
T ss_pred HHHHHHhhhCcCceEEEEEEecceeeeeeecccccHhhcCCCccccchhhhccccchhhhhccCCEEEEEcCCcCCcC--
Confidence 99999999997 78999999999999874433332222 34556666699999999999998886 999999999998
Q ss_pred HHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhccCC
Q 004574 683 PMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCLSN 736 (744)
Q Consensus 683 ~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~~~ 736 (744)
++++.+++++|+.+|.+.++++||+.+|++........++..+..||..++...
T Consensus 698 ~q~s~~~~~aL~~~gv~~~~~vypde~H~is~~~~~~~~~~~~~~~~~~~~~~~ 751 (755)
T KOG2100|consen 698 FQQSAILIKALQNAGVPFRLLVYPDENHGISYVEVISHLYEKLDRFLRDCFGSP 751 (755)
T ss_pred HHHHHHHHHHHHHCCCceEEEEeCCCCcccccccchHHHHHHHHHHHHHHcCcc
Confidence 999999999999999999999999999999977777899999999999776543
No 6
>PRK01029 tolB translocation protein TolB; Provisional
Probab=99.94 E-value=1.7e-25 Score=237.89 Aligned_cols=251 Identities=14% Similarity=0.074 Sum_probs=181.3
Q ss_pred ccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCe--EEEeeecccccccCCCceeEEEEECCCCceeccccCC
Q 004574 4 FTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKR--IAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESP 81 (744)
Q Consensus 4 ~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~--laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~ 81 (744)
...||+++.+| +..++||.. ......|+|||||+. +||++. .++..+||++++++|+.++|+..+
T Consensus 164 ~~~l~~~d~dG----~~~~~lt~~--~~~~~sP~wSPDG~~~~~~y~S~-------~~g~~~I~~~~l~~g~~~~lt~~~ 230 (428)
T PRK01029 164 QGELWSVDYDG----QNLRPLTQE--HSLSITPTWMHIGSGFPYLYVSY-------KLGVPKIFLGSLENPAGKKILALQ 230 (428)
T ss_pred cceEEEEcCCC----CCceEcccC--CCCcccceEccCCCceEEEEEEc-------cCCCceEEEEECCCCCceEeecCC
Confidence 45899999999 999999842 224689999999998 556654 256689999999999999998766
Q ss_pred CccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEE
Q 004574 82 DICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVL 161 (744)
Q Consensus 82 ~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 161 (744)
.. ...++|||||++|+|++..... .++|+
T Consensus 231 g~-----~~~p~wSPDG~~Laf~s~~~g~----------------------------------------------~di~~ 259 (428)
T PRK01029 231 GN-----QLMPTFSPRKKLLAFISDRYGN----------------------------------------------PDLFI 259 (428)
T ss_pred CC-----ccceEECCCCCEEEEEECCCCC----------------------------------------------cceeE
Confidence 52 3467999999999998642111 23444
Q ss_pred --EcC-C---CCeeecCCC--ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC--CCeeeeccCCCCCC
Q 004574 162 --GSL-D---GTAKDFGTP--AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD--GKLVRELCDLPPAE 231 (744)
Q Consensus 162 --~~~-~---G~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~--g~~~~~l~~~~~~~ 231 (744)
+++ . |+.++++.. +....++|||||++|+|.+.... ..+||+++++ ++..++++.....
T Consensus 260 ~~~~~~~g~~g~~~~lt~~~~~~~~~p~wSPDG~~Laf~s~~~g----------~~~ly~~~~~~~g~~~~~lt~~~~~- 328 (428)
T PRK01029 260 QSFSLETGAIGKPRRLLNEAFGTQGNPSFSPDGTRLVFVSNKDG----------RPRIYIMQIDPEGQSPRLLTKKYRN- 328 (428)
T ss_pred EEeecccCCCCcceEeecCCCCCcCCeEECCCCCEEEEEECCCC----------CceEEEEECcccccceEEeccCCCC-
Confidence 244 2 366778765 34568999999999999976532 2379999875 3446666654322
Q ss_pred CCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCc
Q 004574 232 DIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSL 311 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~ 311 (744)
...+.|||||+. |+|+...++ ...|+++++ ++++.+.++........+.|||||+
T Consensus 329 ------------~~~p~wSPDG~~-Laf~~~~~g---------~~~I~v~dl---~~g~~~~Lt~~~~~~~~p~wSpDG~ 383 (428)
T PRK01029 329 ------------SSCPAWSPDGKK-IAFCSVIKG---------VRQICVYDL---ATGRDYQLTTSPENKESPSWAIDSL 383 (428)
T ss_pred ------------ccceeECCCCCE-EEEEEcCCC---------CcEEEEEEC---CCCCeEEccCCCCCccceEECCCCC
Confidence 345689999999 888864432 235999998 7888888887666678899999999
Q ss_pred eEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCC
Q 004574 312 ALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTG 364 (744)
Q Consensus 312 ~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg 364 (744)
.|+|.+.. .+...||++++++ ++.+.++...... .. ++|||-.
T Consensus 384 ~L~f~~~~-~g~~~L~~vdl~~--g~~~~Lt~~~g~~-----~~--p~Ws~~~ 426 (428)
T PRK01029 384 HLVYSAGN-SNESELYLISLIT--KKTRKIVIGSGEK-----RF--PSWGAFP 426 (428)
T ss_pred EEEEEECC-CCCceEEEEECCC--CCEEEeecCCCcc-----cC--ceecCCC
Confidence 99998743 3567899999988 4556676533221 12 4588754
No 7
>PRK01029 tolB translocation protein TolB; Provisional
Probab=99.94 E-value=5.2e-25 Score=234.22 Aligned_cols=266 Identities=16% Similarity=0.166 Sum_probs=185.5
Q ss_pred ceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEE--EEEecCCCCCC
Q 004574 35 FVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLL--IFTIPSSRRDP 112 (744)
Q Consensus 35 ~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l--~~~~~~~~~~~ 112 (744)
.|.++ +++|||+........ .....+||++|.+|+.+++||..... ...|.|||||+.+ +|++... +
T Consensus 141 ~~g~~--~~~iayv~~~~~~~~-~~~~~~l~~~d~dG~~~~~lt~~~~~-----~~sP~wSPDG~~~~~~y~S~~~-g-- 209 (428)
T PRK01029 141 VPGIS--SGKIIFSLSTTNSDT-ELKQGELWSVDYDGQNLRPLTQEHSL-----SITPTWMHIGSGFPYLYVSYKL-G-- 209 (428)
T ss_pred CCccc--cCEEEEEEeeCCccc-ccccceEEEEcCCCCCceEcccCCCC-----cccceEccCCCceEEEEEEccC-C--
Confidence 46666 899999987331111 12367999999999999999987652 3567999999874 4454321 1
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEE
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVL 190 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~ 190 (744)
..+||+.++ .|+.++|+.. +....++|||||++|+
T Consensus 210 -------------------------------------------~~~I~~~~l~~g~~~~lt~~~g~~~~p~wSPDG~~La 246 (428)
T PRK01029 210 -------------------------------------------VPKIFLGSLENPAGKKILALQGNQLMPTFSPRKKLLA 246 (428)
T ss_pred -------------------------------------------CceEEEEECCCCCceEeecCCCCccceEECCCCCEEE
Confidence 168999999 5588888876 6677899999999999
Q ss_pred EEEeeCCcccccccCCCcceEEE--EeCCC---CeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecC
Q 004574 191 ITSMHRPYSYKVPCARFSQKVQV--WTTDG---KLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDR 265 (744)
Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~l~~--~~~~g---~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~ 265 (744)
|...... ..++|+ +++++ +..++++...... ...+.|||||++ |+|++..++
T Consensus 247 f~s~~~g----------~~di~~~~~~~~~g~~g~~~~lt~~~~~~------------~~~p~wSPDG~~-Laf~s~~~g 303 (428)
T PRK01029 247 FISDRYG----------NPDLFIQSFSLETGAIGKPRRLLNEAFGT------------QGNPSFSPDGTR-LVFVSNKDG 303 (428)
T ss_pred EEECCCC----------CcceeEEEeecccCCCCcceEeecCCCCC------------cCCeEECCCCCE-EEEEECCCC
Confidence 9986532 124565 45543 3456666442111 235699999999 999875432
Q ss_pred CCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccc
Q 004574 266 GDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRV 345 (744)
Q Consensus 266 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~ 345 (744)
. .+||+++++ ..++..+.++........++|||||++|+|... ..+..+|+++|+++ ++.+.++...
T Consensus 304 ~---------~~ly~~~~~-~~g~~~~~lt~~~~~~~~p~wSPDG~~Laf~~~-~~g~~~I~v~dl~~--g~~~~Lt~~~ 370 (428)
T PRK01029 304 R---------PRIYIMQID-PEGQSPRLLTKKYRNSSCPAWSPDGKKIAFCSV-IKGVRQICVYDLAT--GRDYQLTTSP 370 (428)
T ss_pred C---------ceEEEEECc-ccccceEEeccCCCCccceeECCCCCEEEEEEc-CCCCcEEEEEECCC--CCeEEccCCC
Confidence 2 248888762 123456667766666778999999999999873 33567899999988 4556776542
Q ss_pred cccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccc
Q 004574 346 FENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNR 415 (744)
Q Consensus 346 ~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~ 415 (744)
.. . .. +.|||||+.|+|.... .....|+++|+.+++.++++...+
T Consensus 371 ~~-~----~~--p~wSpDG~~L~f~~~~------------------~g~~~L~~vdl~~g~~~~Lt~~~g 415 (428)
T PRK01029 371 EN-K----ES--PSWAIDSLHLVYSAGN------------------SNESELYLISLITKKTRKIVIGSG 415 (428)
T ss_pred CC-c----cc--eEECCCCCEEEEEECC------------------CCCceEEEEECCCCCEEEeecCCC
Confidence 11 1 12 5599999999998632 123469999999999988876543
No 8
>PRK03629 tolB translocation protein TolB; Provisional
Probab=99.93 E-value=1.9e-24 Score=231.16 Aligned_cols=236 Identities=15% Similarity=0.116 Sum_probs=178.9
Q ss_pred CccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCC
Q 004574 3 FFTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPD 82 (744)
Q Consensus 3 ~~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~ 82 (744)
+...||+++.++ +..++++.. ......|+|||||++|||++.. .+..+||++++++|+.++++..+.
T Consensus 177 ~~~~l~~~d~dg----~~~~~lt~~--~~~~~~p~wSPDG~~la~~s~~-------~g~~~i~i~dl~~G~~~~l~~~~~ 243 (429)
T PRK03629 177 FPYELRVSDYDG----YNQFVVHRS--PQPLMSPAWSPDGSKLAYVTFE-------SGRSALVIQTLANGAVRQVASFPR 243 (429)
T ss_pred cceeEEEEcCCC----CCCEEeecC--CCceeeeEEcCCCCEEEEEEec-------CCCcEEEEEECCCCCeEEccCCCC
Confidence 356799999988 888888742 2257899999999999998641 456799999999999999986654
Q ss_pred ccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 83 ICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 83 ~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
....+.|||||+.|+|+.... +..+||++
T Consensus 244 -----~~~~~~~SPDG~~La~~~~~~----------------------------------------------g~~~I~~~ 272 (429)
T PRK03629 244 -----HNGAPAFSPDGSKLAFALSKT----------------------------------------------GSLNLYVM 272 (429)
T ss_pred -----CcCCeEECCCCCEEEEEEcCC----------------------------------------------CCcEEEEE
Confidence 245789999999999974321 11479999
Q ss_pred cC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCc
Q 004574 163 SL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSV 240 (744)
Q Consensus 163 ~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~ 240 (744)
|+ +|+.++++.. .....+.|||||++|+|.+.... ..+||++++++++.++++.....
T Consensus 273 d~~tg~~~~lt~~~~~~~~~~wSPDG~~I~f~s~~~g----------~~~Iy~~d~~~g~~~~lt~~~~~---------- 332 (429)
T PRK03629 273 DLASGQIRQVTDGRSNNTEPTWFPDSQNLAYTSDQAG----------RPQVYKVNINGGAPQRITWEGSQ---------- 332 (429)
T ss_pred ECCCCCEEEccCCCCCcCceEECCCCCEEEEEeCCCC----------CceEEEEECCCCCeEEeecCCCC----------
Confidence 99 6688999877 56778999999999999986531 24899999998888877643222
Q ss_pred cCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeee
Q 004574 241 REGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYK 320 (744)
Q Consensus 241 ~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~ 320 (744)
...+.|||||+. |+|+...++. ..|+++++ ++++.+.|+.. .....++|||||+.|+|.+. +
T Consensus 333 ---~~~~~~SpDG~~-Ia~~~~~~g~---------~~I~~~dl---~~g~~~~Lt~~-~~~~~p~~SpDG~~i~~~s~-~ 394 (429)
T PRK03629 333 ---NQDADVSSDGKF-MVMVSSNGGQ---------QHIAKQDL---ATGGVQVLTDT-FLDETPSIAPNGTMVIYSSS-Q 394 (429)
T ss_pred ---ccCEEECCCCCE-EEEEEccCCC---------ceEEEEEC---CCCCeEEeCCC-CCCCCceECCCCCEEEEEEc-C
Confidence 335689999998 8887643321 24899988 67777777743 34568999999999999884 3
Q ss_pred ccceeEEEEcCCCCCCcceeee
Q 004574 321 TSQTRTWLVCPGSKDVAPRVLF 342 (744)
Q Consensus 321 ~~~~~l~~~~~~~~~~~~~~l~ 342 (744)
.....|+++++++ ...+.+.
T Consensus 395 ~~~~~l~~~~~~G--~~~~~l~ 414 (429)
T PRK03629 395 GMGSVLNLVSTDG--RFKARLP 414 (429)
T ss_pred CCceEEEEEECCC--CCeEECc
Confidence 3556899999987 3444554
No 9
>PF00326 Peptidase_S9: Prolyl oligopeptidase family This family belongs to family S9 of the peptidase classification.; InterPro: IPR001375 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain covers the active site serine of the serine peptidases belonging to MEROPS peptidase family S9 (prolyl oligopeptidase family, clan SC). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Examples of protein families containing this domain are: Prolyl endopeptidase (3.4.21.26 from EC) (PE) (also called post-proline cleaving enzyme). PE is an enzyme that cleaves peptide bonds on the C-terminal side of prolyl residues. The sequence of PE has been obtained from a mammalian species (pig) and from bacteria (Flavobacterium meningosepticum and Aeromonas hydrophila); there is a high degree of sequence conservation between these sequences. Escherichia coli protease II (3.4.21.83 from EC) (oligopeptidase B) (gene prtB) which cleaves peptide bonds on the C-terminal side of lysyl and argininyl residues. Dipeptidyl peptidase IV (3.4.14.5 from EC) (DPP IV). DPP IV is an enzyme that removes N-terminal dipeptides sequentially from polypeptides having unsubstituted N-termini provided that the penultimate residue is proline. Saccharomyces cerevisiae (Baker's yeast) vacuolar dipeptidyl aminopeptidases A and B (DPAP A and DPAP B), encoded by the STE13 and DAP2 genes respectively. DPAP A is responsible for the proteolytic maturation of the alpha-factor precursor. Acylamino-acid-releasing enzyme (3.4.19.1 from EC) (acyl-peptide hydrolase). This enzyme catalyses the hydrolysis of the amino-terminal peptide bond of an N-acetylated protein to generate a N-acetylated amino acid and a protein with a free amino-terminus. These proteins belong to MEROPS peptidase families S9A, S9B and S9C.; GO: 0008236 serine-type peptidase activity, 0006508 proteolysis; PDB: 2AJ8_D 1ORV_D 2AJB_C 2BUC_D 1ORW_D 2AJC_D 2AJD_C 2BUA_A 2HU8_B 3O4J_B ....
Probab=99.93 E-value=8.2e-26 Score=219.49 Aligned_cols=188 Identities=29% Similarity=0.456 Sum_probs=156.5
Q ss_pred hhHHHHHhCCeEEEecCCCCCCCCCCC-----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHh
Q 004574 546 TSSLIFLARRFAVLAGPSIPIIGEGDK-----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAH 614 (744)
Q Consensus 546 ~~~~~~~~~G~~v~~~~~~~~~g~g~~-----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~ 614 (744)
..+..|+++||+|+.+++++..|+|.. ...+|+.+++++++++..+|++||+|+|+|+||++|+.++.+
T Consensus 5 ~~~~~la~~Gy~v~~~~~rGs~g~g~~~~~~~~~~~~~~~~~D~~~~i~~l~~~~~iD~~ri~i~G~S~GG~~a~~~~~~ 84 (213)
T PF00326_consen 5 WNAQLLASQGYAVLVPNYRGSGGYGKDFHEAGRGDWGQADVDDVVAAIEYLIKQYYIDPDRIGIMGHSYGGYLALLAATQ 84 (213)
T ss_dssp HHHHHHHTTT-EEEEEE-TTSSSSHHHHHHTTTTGTTHHHHHHHHHHHHHHHHTTSEEEEEEEEEEETHHHHHHHHHHHH
T ss_pred HHHHHHHhCCEEEEEEcCCCCCccchhHHHhhhccccccchhhHHHHHHHHhccccccceeEEEEcccccccccchhhcc
Confidence 457788899999999888777765443 234599999999999999999999999999999999999999
Q ss_pred CCCceeEEEEccCCCCCCCCCCc---ccc----cccchhhcHHHHHhcCcccccCC--CCCCEEEEeeCCCCCCCCCHHH
Q 004574 615 APHLFCCGIARSGSYNKTLTPFG---FQT----EFRTLWEATNVYIEMSPITHANK--IKKPILIIHGEVDDKVGLFPMQ 685 (744)
Q Consensus 615 ~p~~~~~~v~~~~~~~~~~~~~~---~~~----~~~~~~~~~~~~~~~~~~~~~~~--~~~P~l~i~G~~D~~v~~~~~~ 685 (744)
+|++|+++|+.+|+++....... +.. ....++...+.|...++...+.+ +++|+|++||++|.+|| +.+
T Consensus 85 ~~~~f~a~v~~~g~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~P~li~hG~~D~~Vp--~~~ 162 (213)
T PF00326_consen 85 HPDRFKAAVAGAGVSDLFSYYGTTDIYTKAEYLEYGDPWDNPEFYRELSPISPADNVQIKPPVLIIHGENDPRVP--PSQ 162 (213)
T ss_dssp TCCGSSEEEEESE-SSTTCSBHHTCCHHHGHHHHHSSTTTSHHHHHHHHHGGGGGGCGGGSEEEEEEETTBSSST--THH
T ss_pred cceeeeeeeccceecchhcccccccccccccccccCccchhhhhhhhhccccccccccCCCCEEEEccCCCCccC--HHH
Confidence 99999999999999885543221 111 12344556778888899998888 89999999999999999 999
Q ss_pred HHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhccC
Q 004574 686 AERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 686 ~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~~ 735 (744)
+.+++++|++.+++++++++|+++|.+....+...+.+.+.+||+++|+.
T Consensus 163 s~~~~~~L~~~g~~~~~~~~p~~gH~~~~~~~~~~~~~~~~~f~~~~l~~ 212 (213)
T PF00326_consen 163 SLRLYNALRKAGKPVELLIFPGEGHGFGNPENRRDWYERILDFFDKYLKK 212 (213)
T ss_dssp HHHHHHHHHHTTSSEEEEEETT-SSSTTSHHHHHHHHHHHHHHHHHHTT-
T ss_pred HHHHHHHHHhcCCCEEEEEcCcCCCCCCCchhHHHHHHHHHHHHHHHcCC
Confidence 99999999999999999999999999887777789999999999999875
No 10
>PRK05137 tolB translocation protein TolB; Provisional
Probab=99.93 E-value=4.7e-24 Score=229.80 Aligned_cols=236 Identities=14% Similarity=0.170 Sum_probs=177.5
Q ss_pred ccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCc
Q 004574 4 FTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDI 83 (744)
Q Consensus 4 ~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~ 83 (744)
..+||+++.++ +..++++.. ...+..|+|||||++|||++.. ++..+||++++++|+.++++..+.
T Consensus 181 ~~~l~~~d~dg----~~~~~lt~~--~~~v~~p~wSpDG~~lay~s~~-------~g~~~i~~~dl~~g~~~~l~~~~g- 246 (435)
T PRK05137 181 IKRLAIMDQDG----ANVRYLTDG--SSLVLTPRFSPNRQEITYMSYA-------NGRPRVYLLDLETGQRELVGNFPG- 246 (435)
T ss_pred ceEEEEECCCC----CCcEEEecC--CCCeEeeEECCCCCEEEEEEec-------CCCCEEEEEECCCCcEEEeecCCC-
Confidence 45899999988 888888832 3358899999999999998642 445899999999999999986654
Q ss_pred cccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEc
Q 004574 84 CLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGS 163 (744)
Q Consensus 84 ~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 163 (744)
.+..+.|||||+.|+|+.... +..+||+++
T Consensus 247 ----~~~~~~~SPDG~~la~~~~~~----------------------------------------------g~~~Iy~~d 276 (435)
T PRK05137 247 ----MTFAPRFSPDGRKVVMSLSQG----------------------------------------------GNTDIYTMD 276 (435)
T ss_pred ----cccCcEECCCCCEEEEEEecC----------------------------------------------CCceEEEEE
Confidence 245779999999999874321 115899999
Q ss_pred C-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCcc
Q 004574 164 L-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVR 241 (744)
Q Consensus 164 ~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~ 241 (744)
+ +|+.++|+.. +....+.|||||++|+|.+.... ..+||+++++++..++++.....
T Consensus 277 ~~~~~~~~Lt~~~~~~~~~~~spDG~~i~f~s~~~g----------~~~Iy~~d~~g~~~~~lt~~~~~----------- 335 (435)
T PRK05137 277 LRSGTTTRLTDSPAIDTSPSYSPDGSQIVFESDRSG----------SPQLYVMNADGSNPRRISFGGGR----------- 335 (435)
T ss_pred CCCCceEEccCCCCccCceeEcCCCCEEEEEECCCC----------CCeEEEEECCCCCeEEeecCCCc-----------
Confidence 9 5688899877 55668999999999999976532 24899999999988888754322
Q ss_pred CCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeec
Q 004574 242 EGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKT 321 (744)
Q Consensus 242 ~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~ 321 (744)
...+.|||||+. |+|++...+ ...|+++++ +++..+.++. ......++|||||+.|+|......
T Consensus 336 --~~~~~~SpdG~~-ia~~~~~~~---------~~~i~~~d~---~~~~~~~lt~-~~~~~~p~~spDG~~i~~~~~~~~ 399 (435)
T PRK05137 336 --YSTPVWSPRGDL-IAFTKQGGG---------QFSIGVMKP---DGSGERILTS-GFLVEGPTWAPNGRVIMFFRQTPG 399 (435)
T ss_pred --ccCeEECCCCCE-EEEEEcCCC---------ceEEEEEEC---CCCceEeccC-CCCCCCCeECCCCCEEEEEEccCC
Confidence 334689999998 888763221 125888887 5666565654 345778999999999999874332
Q ss_pred c--ceeEEEEcCCCCCCcceeee
Q 004574 322 S--QTRTWLVCPGSKDVAPRVLF 342 (744)
Q Consensus 322 ~--~~~l~~~~~~~~~~~~~~l~ 342 (744)
. ...||+++++++ ..+.+.
T Consensus 400 ~~~~~~L~~~dl~g~--~~~~l~ 420 (435)
T PRK05137 400 SGGAPKLYTVDLTGR--NEREVP 420 (435)
T ss_pred CCCcceEEEEECCCC--ceEEcc
Confidence 2 158999999874 444554
No 11
>PRK03629 tolB translocation protein TolB; Provisional
Probab=99.93 E-value=1.3e-23 Score=224.76 Aligned_cols=250 Identities=18% Similarity=0.202 Sum_probs=184.9
Q ss_pred CeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCC
Q 004574 43 KRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGP 122 (744)
Q Consensus 43 ~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~ 122 (744)
++|||+.... +.....+||++|.+|+..+++|.+.. ....+.|||||+.|+|++... +
T Consensus 164 ~riayv~~~~----~~~~~~~l~~~d~dg~~~~~lt~~~~-----~~~~p~wSPDG~~la~~s~~~-g------------ 221 (429)
T PRK03629 164 TRIAYVVQTN----GGQFPYELRVSDYDGYNQFVVHRSPQ-----PLMSPAWSPDGSKLAYVTFES-G------------ 221 (429)
T ss_pred CeEEEEEeeC----CCCcceeEEEEcCCCCCCEEeecCCC-----ceeeeEEcCCCCEEEEEEecC-C------------
Confidence 6899997621 12346799999999999999987764 356789999999999985321 1
Q ss_pred eeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCccc
Q 004574 123 KIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSY 200 (744)
Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~ 200 (744)
..+||++++ +|+.++++.. +....++|||||++|+|......
T Consensus 222 ---------------------------------~~~i~i~dl~~G~~~~l~~~~~~~~~~~~SPDG~~La~~~~~~g--- 265 (429)
T PRK03629 222 ---------------------------------RSALVIQTLANGAVRQVASFPRHNGAPAFSPDGSKLAFALSKTG--- 265 (429)
T ss_pred ---------------------------------CcEEEEEECCCCCeEEccCCCCCcCCeEECCCCCEEEEEEcCCC---
Confidence 158999999 6688888765 55668999999999999865421
Q ss_pred ccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEe
Q 004574 201 KVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYT 280 (744)
Q Consensus 201 ~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~ 280 (744)
..+||+++++++..++++..... ...+.|||||+. |+|++...+ ..+||+
T Consensus 266 -------~~~I~~~d~~tg~~~~lt~~~~~-------------~~~~~wSPDG~~-I~f~s~~~g---------~~~Iy~ 315 (429)
T PRK03629 266 -------SLNLYVMDLASGQIRQVTDGRSN-------------NTEPTWFPDSQN-LAYTSDQAG---------RPQVYK 315 (429)
T ss_pred -------CcEEEEEECCCCCEEEccCCCCC-------------cCceEECCCCCE-EEEEeCCCC---------CceEEE
Confidence 23799999999888888766433 346699999998 888875432 125999
Q ss_pred ccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceee
Q 004574 281 QPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTR 360 (744)
Q Consensus 281 ~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 360 (744)
+++ ++++.++++........++|||||++|+|... ..+..+|+++|+++ +..+.++.... . .. ++|
T Consensus 316 ~d~---~~g~~~~lt~~~~~~~~~~~SpDG~~Ia~~~~-~~g~~~I~~~dl~~--g~~~~Lt~~~~-~-----~~--p~~ 381 (429)
T PRK03629 316 VNI---NGGAPQRITWEGSQNQDADVSSDGKFMVMVSS-NGGQQHIAKQDLAT--GGVQVLTDTFL-D-----ET--PSI 381 (429)
T ss_pred EEC---CCCCeEEeecCCCCccCEEECCCCCEEEEEEc-cCCCceEEEEECCC--CCeEEeCCCCC-C-----CC--ceE
Confidence 998 67777788765555678999999999999873 33557899999987 45566664211 1 22 459
Q ss_pred CCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEee
Q 004574 361 TSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWE 412 (744)
Q Consensus 361 spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~ 412 (744)
||||+.|+|.... +....|+++++.++..+++..
T Consensus 382 SpDG~~i~~~s~~------------------~~~~~l~~~~~~G~~~~~l~~ 415 (429)
T PRK03629 382 APNGTMVIYSSSQ------------------GMGSVLNLVSTDGRFKARLPA 415 (429)
T ss_pred CCCCCEEEEEEcC------------------CCceEEEEEECCCCCeEECcc
Confidence 9999999998743 123458899997766666654
No 12
>PRK04043 tolB translocation protein TolB; Provisional
Probab=99.93 E-value=6.3e-24 Score=224.00 Aligned_cols=232 Identities=10% Similarity=0.036 Sum_probs=174.9
Q ss_pred ccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCe-EEEeeecccccccCCCceeEEEEECCCCceeccccCCC
Q 004574 4 FTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKR-IAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPD 82 (744)
Q Consensus 4 ~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~-laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~ 82 (744)
...||+.+.+| ...++++. .+ ....|+|||||++ ++|++.. .+..+||++++++|+.++|+..++
T Consensus 168 ~~~l~~~d~dg----~~~~~~~~--~~-~~~~p~wSpDG~~~i~y~s~~-------~~~~~Iyv~dl~tg~~~~lt~~~g 233 (419)
T PRK04043 168 KSNIVLADYTL----TYQKVIVK--GG-LNIFPKWANKEQTAFYYTSYG-------ERKPTLYKYNLYTGKKEKIASSQG 233 (419)
T ss_pred cceEEEECCCC----CceeEEcc--CC-CeEeEEECCCCCcEEEEEEcc-------CCCCEEEEEECCCCcEEEEecCCC
Confidence 35789999888 76777762 23 4678999999997 6665541 235799999999999999987554
Q ss_pred ccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 83 ICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 83 ~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
....+.|||||+.|+|+.... + ..+||++
T Consensus 234 -----~~~~~~~SPDG~~la~~~~~~-g---------------------------------------------~~~Iy~~ 262 (419)
T PRK04043 234 -----MLVVSDVSKDGSKLLLTMAPK-G---------------------------------------------QPDIYLY 262 (419)
T ss_pred -----cEEeeEECCCCCEEEEEEccC-C---------------------------------------------CcEEEEE
Confidence 234678999999999985421 1 1689999
Q ss_pred cC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCc
Q 004574 163 SL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSV 240 (744)
Q Consensus 163 ~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~ 240 (744)
++ +|+.++|+.. +....+.|||||++|+|+++..+ ..+||++++++++.++++....
T Consensus 263 dl~~g~~~~LT~~~~~d~~p~~SPDG~~I~F~Sdr~g----------~~~Iy~~dl~~g~~~rlt~~g~----------- 321 (419)
T PRK04043 263 DTNTKTLTQITNYPGIDVNGNFVEDDKRIVFVSDRLG----------YPNIFMKKLNSGSVEQVVFHGK----------- 321 (419)
T ss_pred ECCCCcEEEcccCCCccCccEECCCCCEEEEEECCCC----------CceEEEEECCCCCeEeCccCCC-----------
Confidence 99 5588999877 44668899999999999987632 2489999999998888774311
Q ss_pred cCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeee
Q 004574 241 REGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYK 320 (744)
Q Consensus 241 ~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~ 320 (744)
....|||||+. |+|+........ ......|+++++ ++++.+.|+.. .....|+|||||+.|+|....
T Consensus 322 ----~~~~~SPDG~~-Ia~~~~~~~~~~---~~~~~~I~v~d~---~~g~~~~LT~~-~~~~~p~~SPDG~~I~f~~~~- 388 (419)
T PRK04043 322 ----NNSSVSTYKNY-IVYSSRETNNEF---GKNTFNLYLIST---NSDYIRRLTAN-GVNQFPRFSSDGGSIMFIKYL- 388 (419)
T ss_pred ----cCceECCCCCE-EEEEEcCCCccc---CCCCcEEEEEEC---CCCCeEECCCC-CCcCCeEECCCCCEEEEEEcc-
Confidence 12489999999 888875432211 001246999998 77888888875 345579999999999999844
Q ss_pred ccceeEEEEcCCCC
Q 004574 321 TSQTRTWLVCPGSK 334 (744)
Q Consensus 321 ~~~~~l~~~~~~~~ 334 (744)
.+...|+++++++.
T Consensus 389 ~~~~~L~~~~l~g~ 402 (419)
T PRK04043 389 GNQSALGIIRLNYN 402 (419)
T ss_pred CCcEEEEEEecCCC
Confidence 56778999999883
No 13
>PRK05137 tolB translocation protein TolB; Provisional
Probab=99.92 E-value=2.1e-23 Score=224.77 Aligned_cols=254 Identities=15% Similarity=0.175 Sum_probs=186.8
Q ss_pred CCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCC
Q 004574 42 GKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLG 121 (744)
Q Consensus 42 G~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~ 121 (744)
..+|||++... .......+||++|.+|+..+++|.... .+..+.|||||+.|+|++... +
T Consensus 165 ~~~iafv~~~~---~~~~~~~~l~~~d~dg~~~~~lt~~~~-----~v~~p~wSpDG~~lay~s~~~-g----------- 224 (435)
T PRK05137 165 DTRIVYVAESG---PKNKRIKRLAIMDQDGANVRYLTDGSS-----LVLTPRFSPNRQEITYMSYAN-G----------- 224 (435)
T ss_pred CCeEEEEEeeC---CCCCcceEEEEECCCCCCcEEEecCCC-----CeEeeEECCCCCEEEEEEecC-C-----------
Confidence 45899987521 001125799999999999999987765 356789999999999985421 1
Q ss_pred CeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcc
Q 004574 122 PKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYS 199 (744)
Q Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~ 199 (744)
..+||++++ +|+.++++.. +....++|||||++|+|......
T Consensus 225 ----------------------------------~~~i~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~~g-- 268 (435)
T PRK05137 225 ----------------------------------RPRVYLLDLETGQRELVGNFPGMTFAPRFSPDGRKVVMSLSQGG-- 268 (435)
T ss_pred ----------------------------------CCEEEEEECCCCcEEEeecCCCcccCcEECCCCCEEEEEEecCC--
Confidence 158999999 6688888766 66778999999999999876432
Q ss_pred cccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEE
Q 004574 200 YKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIY 279 (744)
Q Consensus 200 ~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~ 279 (744)
..+||+++++++..++|+..... ...+.|||||++ |+|.+...+. .+||
T Consensus 269 --------~~~Iy~~d~~~~~~~~Lt~~~~~-------------~~~~~~spDG~~-i~f~s~~~g~---------~~Iy 317 (435)
T PRK05137 269 --------NTDIYTMDLRSGTTTRLTDSPAI-------------DTSPSYSPDGSQ-IVFESDRSGS---------PQLY 317 (435)
T ss_pred --------CceEEEEECCCCceEEccCCCCc-------------cCceeEcCCCCE-EEEEECCCCC---------CeEE
Confidence 24899999999988888876543 335699999999 8888754332 2599
Q ss_pred eccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCcee
Q 004574 280 TQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMT 359 (744)
Q Consensus 280 ~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 359 (744)
++++ ++++.+.++........+.|||||++|+|... .....+|++++++++ ..+.++... . ... ++
T Consensus 318 ~~d~---~g~~~~~lt~~~~~~~~~~~SpdG~~ia~~~~-~~~~~~i~~~d~~~~--~~~~lt~~~-~-----~~~--p~ 383 (435)
T PRK05137 318 VMNA---DGSNPRRISFGGGRYSTPVWSPRGDLIAFTKQ-GGGQFSIGVMKPDGS--GERILTSGF-L-----VEG--PT 383 (435)
T ss_pred EEEC---CCCCeEEeecCCCcccCeEECCCCCEEEEEEc-CCCceEEEEEECCCC--ceEeccCCC-C-----CCC--Ce
Confidence 9998 77888888876666778999999999999873 334578999998763 444554321 1 122 45
Q ss_pred eCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEe
Q 004574 360 RTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIW 411 (744)
Q Consensus 360 ~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~ 411 (744)
|||||+.|+|..... +. .....|+++|+.++..+++.
T Consensus 384 ~spDG~~i~~~~~~~------------~~---~~~~~L~~~dl~g~~~~~l~ 420 (435)
T PRK05137 384 WAPNGRVIMFFRQTP------------GS---GGAPKLYTVDLTGRNEREVP 420 (435)
T ss_pred ECCCCCEEEEEEccC------------CC---CCcceEEEEECCCCceEEcc
Confidence 999999999987431 10 01246999999888777664
No 14
>PRK04792 tolB translocation protein TolB; Provisional
Probab=99.92 E-value=2.7e-23 Score=223.00 Aligned_cols=234 Identities=16% Similarity=0.093 Sum_probs=173.4
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
..|++.+.++ ...++++..+ ..+..|+|||||++|||++.. ++..+||++++++|+.++++..+.
T Consensus 198 ~~l~i~d~dG----~~~~~l~~~~--~~~~~p~wSPDG~~La~~s~~-------~g~~~L~~~dl~tg~~~~lt~~~g-- 262 (448)
T PRK04792 198 YQLMIADYDG----YNEQMLLRSP--EPLMSPAWSPDGRKLAYVSFE-------NRKAEIFVQDIYTQVREKVTSFPG-- 262 (448)
T ss_pred eEEEEEeCCC----CCceEeecCC--CcccCceECCCCCEEEEEEec-------CCCcEEEEEECCCCCeEEecCCCC--
Confidence 4688888877 7777777433 257899999999999998642 456799999999999999986654
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
....+.|||||+.|+|+.... + ..+||++++
T Consensus 263 ---~~~~~~wSPDG~~La~~~~~~-g---------------------------------------------~~~Iy~~dl 293 (448)
T PRK04792 263 ---INGAPRFSPDGKKLALVLSKD-G---------------------------------------------QPEIYVVDI 293 (448)
T ss_pred ---CcCCeeECCCCCEEEEEEeCC-C---------------------------------------------CeEEEEEEC
Confidence 234679999999999874321 1 158999999
Q ss_pred -CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC
Q 004574 165 -DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 165 -~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
+|+.++++.. .....++|||||++|+|.+.... ..+||++++++++.++++.....
T Consensus 294 ~tg~~~~lt~~~~~~~~p~wSpDG~~I~f~s~~~g----------~~~Iy~~dl~~g~~~~Lt~~g~~------------ 351 (448)
T PRK04792 294 ATKALTRITRHRAIDTEPSWHPDGKSLIFTSERGG----------KPQIYRVNLASGKVSRLTFEGEQ------------ 351 (448)
T ss_pred CCCCeEECccCCCCccceEECCCCCEEEEEECCCC----------CceEEEEECCCCCEEEEecCCCC------------
Confidence 5688889877 55678999999999999975432 24899999998888877632111
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
...+.|||||+. |+|+....+ ...|+++++ ++++.+.++.. .....++|||||+.|+|.+. ..+
T Consensus 352 -~~~~~~SpDG~~-l~~~~~~~g---------~~~I~~~dl---~~g~~~~lt~~-~~d~~ps~spdG~~I~~~~~-~~g 415 (448)
T PRK04792 352 -NLGGSITPDGRS-MIMVNRTNG---------KFNIARQDL---ETGAMQVLTST-RLDESPSVAPNGTMVIYSTT-YQG 415 (448)
T ss_pred -CcCeeECCCCCE-EEEEEecCC---------ceEEEEEEC---CCCCeEEccCC-CCCCCceECCCCCEEEEEEe-cCC
Confidence 224589999998 888764322 124899988 67777777654 23457899999999999884 446
Q ss_pred ceeEEEEcCCCCCCcceeee
Q 004574 323 QTRTWLVCPGSKDVAPRVLF 342 (744)
Q Consensus 323 ~~~l~~~~~~~~~~~~~~l~ 342 (744)
...||++++++ ...+.++
T Consensus 416 ~~~l~~~~~~G--~~~~~l~ 433 (448)
T PRK04792 416 KQVLAAVSIDG--RFKARLP 433 (448)
T ss_pred ceEEEEEECCC--CceEECc
Confidence 67899999876 3334443
No 15
>PRK04043 tolB translocation protein TolB; Provisional
Probab=99.91 E-value=1.3e-22 Score=214.09 Aligned_cols=248 Identities=15% Similarity=0.116 Sum_probs=181.5
Q ss_pred CeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcE-EEEEecCCCCCCCCCCCCCCC
Q 004574 43 KRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTL-LIFTIPSSRRDPPKKTMVPLG 121 (744)
Q Consensus 43 ~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~-l~~~~~~~~~~~~~~~~~~~~ 121 (744)
.++||++.. ......+||++|.+|...++++... . ...+.|||||+. ++|++... +.
T Consensus 155 ~r~~~v~~~-----~~~~~~~l~~~d~dg~~~~~~~~~~-~-----~~~p~wSpDG~~~i~y~s~~~-~~---------- 212 (419)
T PRK04043 155 KRKVVFSKY-----TGPKKSNIVLADYTLTYQKVIVKGG-L-----NIFPKWANKEQTAFYYTSYGE-RK---------- 212 (419)
T ss_pred eeEEEEEEc-----cCCCcceEEEECCCCCceeEEccCC-C-----eEeEEECCCCCcEEEEEEccC-CC----------
Confidence 467777641 1123689999999999999888663 2 346799999996 66664321 11
Q ss_pred CeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcc
Q 004574 122 PKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYS 199 (744)
Q Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~ 199 (744)
.+||++|+ +|+.++|+.. +....+.|||||++|+|......
T Consensus 213 -----------------------------------~~Iyv~dl~tg~~~~lt~~~g~~~~~~~SPDG~~la~~~~~~g-- 255 (419)
T PRK04043 213 -----------------------------------PTLYKYNLYTGKKEKIASSQGMLVVSDVSKDGSKLLLTMAPKG-- 255 (419)
T ss_pred -----------------------------------CEEEEEECCCCcEEEEecCCCcEEeeEECCCCCEEEEEEccCC--
Confidence 58999999 7799999876 66667899999999999976532
Q ss_pred cccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEE
Q 004574 200 YKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIY 279 (744)
Q Consensus 200 ~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~ 279 (744)
..+||+++++++..++|+..+... ....|||||++ |+|++...+. .+||
T Consensus 256 --------~~~Iy~~dl~~g~~~~LT~~~~~d-------------~~p~~SPDG~~-I~F~Sdr~g~---------~~Iy 304 (419)
T PRK04043 256 --------QPDIYLYDTNTKTLTQITNYPGID-------------VNGNFVEDDKR-IVFVSDRLGY---------PNIF 304 (419)
T ss_pred --------CcEEEEEECCCCcEEEcccCCCcc-------------CccEECCCCCE-EEEEECCCCC---------ceEE
Confidence 358999999999899998776432 23489999999 9999865322 2599
Q ss_pred eccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeec-----cceeEEEEcCCCCCCcceeeeccccccccCCCC
Q 004574 280 TQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKT-----SQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPG 354 (744)
Q Consensus 280 ~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~-----~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 354 (744)
++++ ++++.++++.... ..+.|||||++|+|...... +..+||++|+++ +..+.|+..... .
T Consensus 305 ~~dl---~~g~~~rlt~~g~--~~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~--g~~~~LT~~~~~------~ 371 (419)
T PRK04043 305 MKKL---NSGSVEQVVFHGK--NNSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNS--DYIRRLTANGVN------Q 371 (419)
T ss_pred EEEC---CCCCeEeCccCCC--cCceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCC--CCeEECCCCCCc------C
Confidence 9999 7788877875422 24699999999999884321 236899999988 466778764211 2
Q ss_pred CCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeec
Q 004574 355 SPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWES 413 (744)
Q Consensus 355 ~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~ 413 (744)
. ++|||||+.|+|.... +....|+++++.+....++...
T Consensus 372 ~--p~~SPDG~~I~f~~~~------------------~~~~~L~~~~l~g~~~~~l~~~ 410 (419)
T PRK04043 372 F--PRFSSDGGSIMFIKYL------------------GNQSALGIIRLNYNKSFLFPLK 410 (419)
T ss_pred C--eEECCCCCEEEEEEcc------------------CCcEEEEEEecCCCeeEEeecC
Confidence 2 4599999999999732 2334599999977666666543
No 16
>PRK02889 tolB translocation protein TolB; Provisional
Probab=99.91 E-value=5.1e-23 Score=220.54 Aligned_cols=227 Identities=16% Similarity=0.140 Sum_probs=169.6
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
..||+++.++ ...++++.. ...+..|+|||||++|||++. .++..+||++++++|+.++++..+.
T Consensus 176 ~~L~~~D~dG----~~~~~l~~~--~~~v~~p~wSPDG~~la~~s~-------~~~~~~I~~~dl~~g~~~~l~~~~g-- 240 (427)
T PRK02889 176 YQLQISDADG----QNAQSALSS--PEPIISPAWSPDGTKLAYVSF-------ESKKPVVYVHDLATGRRRVVANFKG-- 240 (427)
T ss_pred cEEEEECCCC----CCceEeccC--CCCcccceEcCCCCEEEEEEc-------cCCCcEEEEEECCCCCEEEeecCCC--
Confidence 3689999877 666777633 335789999999999999764 1455789999999999999975554
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
....++|||||+.|+|+.... +..+||.+++
T Consensus 241 ---~~~~~~~SPDG~~la~~~~~~----------------------------------------------g~~~Iy~~d~ 271 (427)
T PRK02889 241 ---SNSAPAWSPDGRTLAVALSRD----------------------------------------------GNSQIYTVNA 271 (427)
T ss_pred ---CccceEECCCCCEEEEEEccC----------------------------------------------CCceEEEEEC
Confidence 245789999999999874321 1158999999
Q ss_pred C-CCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC
Q 004574 165 D-GTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 165 ~-G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
+ +..++++.. +....+.|||||++|+|.++... ..+||.++.+++..++++.....
T Consensus 272 ~~~~~~~lt~~~~~~~~~~wSpDG~~l~f~s~~~g----------~~~Iy~~~~~~g~~~~lt~~g~~------------ 329 (427)
T PRK02889 272 DGSGLRRLTQSSGIDTEPFFSPDGRSIYFTSDRGG----------APQIYRMPASGGAAQRVTFTGSY------------ 329 (427)
T ss_pred CCCCcEECCCCCCCCcCeEEcCCCCEEEEEecCCC----------CcEEEEEECCCCceEEEecCCCC------------
Confidence 4 488888876 55667899999999999865431 23799999888777776633211
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
...+.|||||+. |+|++...+. ..|+++++ .+++.+.++.. .....++|||||+.|+|.... .+
T Consensus 330 -~~~~~~SpDG~~-Ia~~s~~~g~---------~~I~v~d~---~~g~~~~lt~~-~~~~~p~~spdg~~l~~~~~~-~g 393 (427)
T PRK02889 330 -NTSPRISPDGKL-LAYISRVGGA---------FKLYVQDL---ATGQVTALTDT-TRDESPSFAPNGRYILYATQQ-GG 393 (427)
T ss_pred -cCceEECCCCCE-EEEEEccCCc---------EEEEEEEC---CCCCeEEccCC-CCccCceECCCCCEEEEEEec-CC
Confidence 224689999998 8887643221 24899998 67777777754 344689999999999999844 46
Q ss_pred ceeEEEEcCCC
Q 004574 323 QTRTWLVCPGS 333 (744)
Q Consensus 323 ~~~l~~~~~~~ 333 (744)
...|+++++++
T Consensus 394 ~~~l~~~~~~g 404 (427)
T PRK02889 394 RSVLAAVSSDG 404 (427)
T ss_pred CEEEEEEECCC
Confidence 67899999976
No 17
>PRK02889 tolB translocation protein TolB; Provisional
Probab=99.91 E-value=1.7e-22 Score=216.58 Aligned_cols=246 Identities=17% Similarity=0.211 Sum_probs=178.5
Q ss_pred CeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCC
Q 004574 43 KRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGP 122 (744)
Q Consensus 43 ~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~ 122 (744)
.+|||++. ..+..+||++|.+|...++++.... .+..++|||||+.|+|++... +
T Consensus 164 ~~iayv~~-------~~~~~~L~~~D~dG~~~~~l~~~~~-----~v~~p~wSPDG~~la~~s~~~-~------------ 218 (427)
T PRK02889 164 TRIAYVIK-------TGNRYQLQISDADGQNAQSALSSPE-----PIISPAWSPDGTKLAYVSFES-K------------ 218 (427)
T ss_pred cEEEEEEc-------cCCccEEEEECCCCCCceEeccCCC-----CcccceEcCCCCEEEEEEccC-C------------
Confidence 57999863 1345789999999988898886654 356789999999999986421 1
Q ss_pred eeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCccc
Q 004574 123 KIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSY 200 (744)
Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~ 200 (744)
..+||++|+ +|+.++++.. +....++|||||++|+|......
T Consensus 219 ---------------------------------~~~I~~~dl~~g~~~~l~~~~g~~~~~~~SPDG~~la~~~~~~g--- 262 (427)
T PRK02889 219 ---------------------------------KPVVYVHDLATGRRRVVANFKGSNSAPAWSPDGRTLAVALSRDG--- 262 (427)
T ss_pred ---------------------------------CcEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEEccCC---
Confidence 157999999 6688888755 66678999999999999865432
Q ss_pred ccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEe
Q 004574 201 KVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYT 280 (744)
Q Consensus 201 ~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~ 280 (744)
..+||+++++++..++++..... ...+.|||||++ |+|++...+. ..||.
T Consensus 263 -------~~~Iy~~d~~~~~~~~lt~~~~~-------------~~~~~wSpDG~~-l~f~s~~~g~---------~~Iy~ 312 (427)
T PRK02889 263 -------NSQIYTVNADGSGLRRLTQSSGI-------------DTEPFFSPDGRS-IYFTSDRGGA---------PQIYR 312 (427)
T ss_pred -------CceEEEEECCCCCcEECCCCCCC-------------CcCeEEcCCCCE-EEEEecCCCC---------cEEEE
Confidence 24899999998888888765432 235689999999 8888643322 25999
Q ss_pred ccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceee
Q 004574 281 QPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTR 360 (744)
Q Consensus 281 ~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 360 (744)
+++ .+++.+.++........++|||||++|+|.+. ..+..+|+++|+.+ ++.+.++..... .. ++|
T Consensus 313 ~~~---~~g~~~~lt~~g~~~~~~~~SpDG~~Ia~~s~-~~g~~~I~v~d~~~--g~~~~lt~~~~~------~~--p~~ 378 (427)
T PRK02889 313 MPA---SGGAAQRVTFTGSYNTSPRISPDGKLLAYISR-VGGAFKLYVQDLAT--GQVTALTDTTRD------ES--PSF 378 (427)
T ss_pred EEC---CCCceEEEecCCCCcCceEECCCCCEEEEEEc-cCCcEEEEEEECCC--CCeEEccCCCCc------cC--ceE
Confidence 987 66676777644444557999999999999873 33456899999987 455666643211 12 459
Q ss_pred CCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEe
Q 004574 361 TSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIW 411 (744)
Q Consensus 361 spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~ 411 (744)
+|||+.|+|.... .....|+.+++.+...+++.
T Consensus 379 spdg~~l~~~~~~------------------~g~~~l~~~~~~g~~~~~l~ 411 (427)
T PRK02889 379 APNGRYILYATQQ------------------GGRSVLAAVSSDGRIKQRLS 411 (427)
T ss_pred CCCCCEEEEEEec------------------CCCEEEEEEECCCCceEEee
Confidence 9999999999743 12345888888554444554
No 18
>PRK04922 tolB translocation protein TolB; Provisional
Probab=99.91 E-value=1.8e-22 Score=217.23 Aligned_cols=253 Identities=16% Similarity=0.173 Sum_probs=184.0
Q ss_pred CCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCC
Q 004574 42 GKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLG 121 (744)
Q Consensus 42 G~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~ 121 (744)
+.+|||+.... .......+||++|.+++..++||.+.. .+..++|||||+.|+|++... +
T Consensus 167 ~~~ia~v~~~~---~~~~~~~~l~i~D~~g~~~~~lt~~~~-----~v~~p~wSpDg~~la~~s~~~-~----------- 226 (433)
T PRK04922 167 WTRIAYVTVSG---AGGAMRYALQVADSDGYNPQTILRSAE-----PILSPAWSPDGKKLAYVSFER-G----------- 226 (433)
T ss_pred cceEEEEEEeC---CCCCceEEEEEECCCCCCceEeecCCC-----ccccccCCCCCCEEEEEecCC-C-----------
Confidence 45799987521 111335689999999999999987654 356789999999999985421 1
Q ss_pred CeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcc
Q 004574 122 PKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYS 199 (744)
Q Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~ 199 (744)
..+||++++ +|+.++++.. +....++|||||++|+|......
T Consensus 227 ----------------------------------~~~l~~~dl~~g~~~~l~~~~g~~~~~~~SpDG~~l~~~~s~~g-- 270 (433)
T PRK04922 227 ----------------------------------RSAIYVQDLATGQRELVASFRGINGAPSFSPDGRRLALTLSRDG-- 270 (433)
T ss_pred ----------------------------------CcEEEEEECCCCCEEEeccCCCCccCceECCCCCEEEEEEeCCC--
Confidence 158999999 6687888765 55568899999999999865432
Q ss_pred cccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEE
Q 004574 200 YKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIY 279 (744)
Q Consensus 200 ~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~ 279 (744)
..+||++++++++.++++..... ...+.|||||++ |+|.+...+. ..||
T Consensus 271 --------~~~Iy~~d~~~g~~~~lt~~~~~-------------~~~~~~spDG~~-l~f~sd~~g~---------~~iy 319 (433)
T PRK04922 271 --------NPEIYVMDLGSRQLTRLTNHFGI-------------DTEPTWAPDGKS-IYFTSDRGGR---------PQIY 319 (433)
T ss_pred --------CceEEEEECCCCCeEECccCCCC-------------ccceEECCCCCE-EEEEECCCCC---------ceEE
Confidence 24899999999888888765332 235699999998 8888754332 2599
Q ss_pred eccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCcee
Q 004574 280 TQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMT 359 (744)
Q Consensus 280 ~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 359 (744)
++++ ++++.+.++........++|||||++|+|... ..+..+|+++++.+ +..+.++..... .. +.
T Consensus 320 ~~dl---~~g~~~~lt~~g~~~~~~~~SpDG~~Ia~~~~-~~~~~~I~v~d~~~--g~~~~Lt~~~~~------~~--p~ 385 (433)
T PRK04922 320 RVAA---SGGSAERLTFQGNYNARASVSPDGKKIAMVHG-SGGQYRIAVMDLST--GSVRTLTPGSLD------ES--PS 385 (433)
T ss_pred EEEC---CCCCeEEeecCCCCccCEEECCCCCEEEEEEC-CCCceeEEEEECCC--CCeEECCCCCCC------CC--ce
Confidence 9988 66777777754445567999999999999863 33556899999987 455666654221 12 45
Q ss_pred eCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeec
Q 004574 360 RTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWES 413 (744)
Q Consensus 360 ~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~ 413 (744)
|||||+.|+|.... .....|+++++.++..+++...
T Consensus 386 ~spdG~~i~~~s~~------------------~g~~~L~~~~~~g~~~~~l~~~ 421 (433)
T PRK04922 386 FAPNGSMVLYATRE------------------GGRGVLAAVSTDGRVRQRLVSA 421 (433)
T ss_pred ECCCCCEEEEEEec------------------CCceEEEEEECCCCceEEcccC
Confidence 99999999998743 1234699999977766666543
No 19
>PRK04792 tolB translocation protein TolB; Provisional
Probab=99.91 E-value=2.3e-22 Score=215.89 Aligned_cols=250 Identities=18% Similarity=0.156 Sum_probs=180.9
Q ss_pred CeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCC
Q 004574 43 KRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGP 122 (744)
Q Consensus 43 ~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~ 122 (744)
.+|||+.... ......+||++|.+|...+++|.... .+..+.|||||++|+|++... +
T Consensus 183 ~riayv~~~~----~~~~~~~l~i~d~dG~~~~~l~~~~~-----~~~~p~wSPDG~~La~~s~~~-g------------ 240 (448)
T PRK04792 183 TRIAYVVVND----KDKYPYQLMIADYDGYNEQMLLRSPE-----PLMSPAWSPDGRKLAYVSFEN-R------------ 240 (448)
T ss_pred CEEEEEEeeC----CCCCceEEEEEeCCCCCceEeecCCC-----cccCceECCCCCEEEEEEecC-C------------
Confidence 4788886521 11234799999999999999987765 356789999999999985421 1
Q ss_pred eeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCccc
Q 004574 123 KIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSY 200 (744)
Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~ 200 (744)
..+||++|+ +|+.++++.. +....++|||||++|+|......
T Consensus 241 ---------------------------------~~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~~~~~~g--- 284 (448)
T PRK04792 241 ---------------------------------KAEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLALVLSKDG--- 284 (448)
T ss_pred ---------------------------------CcEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEEEEeCCC---
Confidence 158999999 6688888765 55568999999999999865532
Q ss_pred ccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEe
Q 004574 201 KVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYT 280 (744)
Q Consensus 201 ~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~ 280 (744)
..+||++++++++.++++..... ...+.|||||+. |+|.+...+. ..||+
T Consensus 285 -------~~~Iy~~dl~tg~~~~lt~~~~~-------------~~~p~wSpDG~~-I~f~s~~~g~---------~~Iy~ 334 (448)
T PRK04792 285 -------QPEIYVVDIATKALTRITRHRAI-------------DTEPSWHPDGKS-LIFTSERGGK---------PQIYR 334 (448)
T ss_pred -------CeEEEEEECCCCCeEECccCCCC-------------ccceEECCCCCE-EEEEECCCCC---------ceEEE
Confidence 24899999999988888765432 335689999998 8888644332 25999
Q ss_pred ccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceee
Q 004574 281 QPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTR 360 (744)
Q Consensus 281 ~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 360 (744)
+++ ++++.+.++........++|||||++|+|... ..+..+||++|+++ +..+.++..... .. ++|
T Consensus 335 ~dl---~~g~~~~Lt~~g~~~~~~~~SpDG~~l~~~~~-~~g~~~I~~~dl~~--g~~~~lt~~~~d------~~--ps~ 400 (448)
T PRK04792 335 VNL---ASGKVSRLTFEGEQNLGGSITPDGRSMIMVNR-TNGKFNIARQDLET--GAMQVLTSTRLD------ES--PSV 400 (448)
T ss_pred EEC---CCCCEEEEecCCCCCcCeeECCCCCEEEEEEe-cCCceEEEEEECCC--CCeEEccCCCCC------CC--ceE
Confidence 998 67777777644344456899999999999863 34567899999988 445566543211 22 349
Q ss_pred CCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEee
Q 004574 361 TSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWE 412 (744)
Q Consensus 361 spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~ 412 (744)
+|||+.|+|.... .....|++++.+++..+++..
T Consensus 401 spdG~~I~~~~~~------------------~g~~~l~~~~~~G~~~~~l~~ 434 (448)
T PRK04792 401 APNGTMVIYSTTY------------------QGKQVLAAVSIDGRFKARLPA 434 (448)
T ss_pred CCCCCEEEEEEec------------------CCceEEEEEECCCCceEECcC
Confidence 9999999998733 123358889986665555543
No 20
>PRK04922 tolB translocation protein TolB; Provisional
Probab=99.91 E-value=1.2e-22 Score=218.66 Aligned_cols=234 Identities=14% Similarity=0.102 Sum_probs=175.1
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
..||+++.++ +..++++.. +..+..|+|||||++|||++. ..+..+||++++++|+.++++..+.
T Consensus 184 ~~l~i~D~~g----~~~~~lt~~--~~~v~~p~wSpDg~~la~~s~-------~~~~~~l~~~dl~~g~~~~l~~~~g-- 248 (433)
T PRK04922 184 YALQVADSDG----YNPQTILRS--AEPILSPAWSPDGKKLAYVSF-------ERGRSAIYVQDLATGQRELVASFRG-- 248 (433)
T ss_pred EEEEEECCCC----CCceEeecC--CCccccccCCCCCCEEEEEec-------CCCCcEEEEEECCCCCEEEeccCCC--
Confidence 4689999877 778888743 335889999999999999764 1456789999999999999976554
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
....+.|||||+.|+|+.... +..+||++++
T Consensus 249 ---~~~~~~~SpDG~~l~~~~s~~----------------------------------------------g~~~Iy~~d~ 279 (433)
T PRK04922 249 ---INGAPSFSPDGRRLALTLSRD----------------------------------------------GNPEIYVMDL 279 (433)
T ss_pred ---CccCceECCCCCEEEEEEeCC----------------------------------------------CCceEEEEEC
Confidence 234679999999999874321 0158999999
Q ss_pred -CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC
Q 004574 165 -DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 165 -~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
+|+.++++.. .....++|||||++|+|.+.... ..+||++++++++.++++.....
T Consensus 280 ~~g~~~~lt~~~~~~~~~~~spDG~~l~f~sd~~g----------~~~iy~~dl~~g~~~~lt~~g~~------------ 337 (433)
T PRK04922 280 GSRQLTRLTNHFGIDTEPTWAPDGKSIYFTSDRGG----------RPQIYRVAASGGSAERLTFQGNY------------ 337 (433)
T ss_pred CCCCeEECccCCCCccceEECCCCCEEEEEECCCC----------CceEEEEECCCCCeEEeecCCCC------------
Confidence 6688888876 45568899999999999976531 24799999988877777643211
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
...++|||||+. |+|....++ ...|+++++ .+++.+.|+... ....++|||||+.|+|.+.. .+
T Consensus 338 -~~~~~~SpDG~~-Ia~~~~~~~---------~~~I~v~d~---~~g~~~~Lt~~~-~~~~p~~spdG~~i~~~s~~-~g 401 (433)
T PRK04922 338 -NARASVSPDGKK-IAMVHGSGG---------QYRIAVMDL---STGSVRTLTPGS-LDESPSFAPNGSMVLYATRE-GG 401 (433)
T ss_pred -ccCEEECCCCCE-EEEEECCCC---------ceeEEEEEC---CCCCeEECCCCC-CCCCceECCCCCEEEEEEec-CC
Confidence 235699999998 888753211 125899998 677777777543 45678999999999998844 46
Q ss_pred ceeEEEEcCCCCCCcceeee
Q 004574 323 QTRTWLVCPGSKDVAPRVLF 342 (744)
Q Consensus 323 ~~~l~~~~~~~~~~~~~~l~ 342 (744)
...||++++++ ...+.++
T Consensus 402 ~~~L~~~~~~g--~~~~~l~ 419 (433)
T PRK04922 402 RGVLAAVSTDG--RVRQRLV 419 (433)
T ss_pred ceEEEEEECCC--CceEEcc
Confidence 67899999977 3444554
No 21
>PRK00178 tolB translocation protein TolB; Provisional
Probab=99.91 E-value=2e-22 Score=217.74 Aligned_cols=226 Identities=17% Similarity=0.161 Sum_probs=170.9
Q ss_pred ceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccc
Q 004574 6 GIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICL 85 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~ 85 (744)
.|++.+.++ +..++++. .+..+..|+|||||++|||++.. ++..+||++++++|+.++++....
T Consensus 180 ~l~~~d~~g----~~~~~l~~--~~~~~~~p~wSpDG~~la~~s~~-------~~~~~l~~~~l~~g~~~~l~~~~g--- 243 (430)
T PRK00178 180 TLQRSDYDG----ARAVTLLQ--SREPILSPRWSPDGKRIAYVSFE-------QKRPRIFVQNLDTGRREQITNFEG--- 243 (430)
T ss_pred EEEEECCCC----CCceEEec--CCCceeeeeECCCCCEEEEEEcC-------CCCCEEEEEECCCCCEEEccCCCC---
Confidence 488888877 77788863 33357899999999999998641 346799999999999999986554
Q ss_pred cccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-
Q 004574 86 NAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL- 164 (744)
Q Consensus 86 ~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~- 164 (744)
....+.|||||+.|+|+.... +..+||++|+
T Consensus 244 --~~~~~~~SpDG~~la~~~~~~----------------------------------------------g~~~Iy~~d~~ 275 (430)
T PRK00178 244 --LNGAPAWSPDGSKLAFVLSKD----------------------------------------------GNPEIYVMDLA 275 (430)
T ss_pred --CcCCeEECCCCCEEEEEEccC----------------------------------------------CCceEEEEECC
Confidence 234689999999999975321 0158999999
Q ss_pred CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCC
Q 004574 165 DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREG 243 (744)
Q Consensus 165 ~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~ 243 (744)
+|+.++++.. .....+.|||||++|+|.+.... ..+||++++++++.++++.....
T Consensus 276 ~~~~~~lt~~~~~~~~~~~spDg~~i~f~s~~~g----------~~~iy~~d~~~g~~~~lt~~~~~------------- 332 (430)
T PRK00178 276 SRQLSRVTNHPAIDTEPFWGKDGRTLYFTSDRGG----------KPQIYKVNVNGGRAERVTFVGNY------------- 332 (430)
T ss_pred CCCeEEcccCCCCcCCeEECCCCCEEEEEECCCC----------CceEEEEECCCCCEEEeecCCCC-------------
Confidence 5688888876 55668899999999999976532 24799999988887777633211
Q ss_pred CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccc
Q 004574 244 MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQ 323 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~ 323 (744)
...+.|||||+. |+|+...++ ...|+++++ ++++.+.|+... ....++|||||+.|+|++. ..+.
T Consensus 333 ~~~~~~Spdg~~-i~~~~~~~~---------~~~l~~~dl---~tg~~~~lt~~~-~~~~p~~spdg~~i~~~~~-~~g~ 397 (430)
T PRK00178 333 NARPRLSADGKT-LVMVHRQDG---------NFHVAAQDL---QRGSVRILTDTS-LDESPSVAPNGTMLIYATR-QQGR 397 (430)
T ss_pred ccceEECCCCCE-EEEEEccCC---------ceEEEEEEC---CCCCEEEccCCC-CCCCceECCCCCEEEEEEe-cCCc
Confidence 224589999998 888864332 124899998 677777777543 4457899999999999884 4466
Q ss_pred eeEEEEcCCC
Q 004574 324 TRTWLVCPGS 333 (744)
Q Consensus 324 ~~l~~~~~~~ 333 (744)
..||++++++
T Consensus 398 ~~l~~~~~~g 407 (430)
T PRK00178 398 GVLMLVSING 407 (430)
T ss_pred eEEEEEECCC
Confidence 7899999976
No 22
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=99.90 E-value=5.9e-22 Score=193.96 Aligned_cols=374 Identities=14% Similarity=0.120 Sum_probs=225.1
Q ss_pred CCCccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccC
Q 004574 1 MPFFTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFES 80 (744)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~ 80 (744)
|.||.+||.+++.. |+.+++| +....+..|++||||++|||.+-- ...+-...+||+++.++|+++++|.-
T Consensus 55 Ft~~DdlWe~slk~----g~~~rit--S~lGVvnn~kf~pdGrkvaf~rv~---~~ss~~taDly~v~~e~Ge~kRiTyf 125 (668)
T COG4946 55 FTCCDDLWEYSLKD----GKPLRIT--SGLGVVNNPKFSPDGRKVAFSRVM---LGSSLQTADLYVVPSEDGEAKRITYF 125 (668)
T ss_pred EEechHHHHhhhcc----CCeeEEe--cccceeccccCCCCCcEEEEEEEE---ecCCCccccEEEEeCCCCcEEEEEEe
Confidence 45889999999988 9999998 333357899999999999994430 22234457899999999999999976
Q ss_pred CCccccccccceEEecCCcEEEEEecC-------------CCCCCCCCCCCCCCCeeeec--------------------
Q 004574 81 PDICLNAVFGSFVWVNNSTLLIFTIPS-------------SRRDPPKKTMVPLGPKIQSN-------------------- 127 (744)
Q Consensus 81 ~~~~~~~~~~~~~wspDg~~l~~~~~~-------------~~~~~~~~~~~~~~~~~~~~-------------------- 127 (744)
. ... ..-..|+|||+.|+.+-.. ..+. ....++.+|++-..
T Consensus 126 G-r~f---T~VaG~~~dg~iiV~TD~~tPF~q~~~lYkv~~dg~--~~e~LnlGpathiv~~dg~ivigRntydLP~WK~ 199 (668)
T COG4946 126 G-RRF---TRVAGWIPDGEIIVSTDFHTPFSQWTELYKVNVDGI--KTEPLNLGPATHIVIKDGIIVIGRNTYDLPHWKG 199 (668)
T ss_pred c-ccc---ceeeccCCCCCEEEEeccCCCcccceeeeEEccCCc--eeeeccCCceeeEEEeCCEEEEccCcccCccccc
Confidence 3 221 2244899999998754100 0000 11123333332221
Q ss_pred -CCC---cccccc----c---ccccCCCchhh------hcc--ceeeeeEEEEEcCCC-CeeecCCCceeeeeccCCCCc
Q 004574 128 -EQK---NIIISR----M---TDNLLKDEYDE------SLF--DYYTTAQLVLGSLDG-TAKDFGTPAVYTAVEPSPDQK 187 (744)
Q Consensus 128 -~~~---~~~~~~----~---~~~~~~~~~~~------~~~--~~~~~~~l~~~~~~G-~~~~l~~~~~~~~~~~SpDG~ 187 (744)
.++ ..+..+ . ..++-.+-+.. -+| ++.+.++||.+|++| ..++.|+...+..-..+-||+
T Consensus 200 YkGGtrGklWis~d~g~tFeK~vdl~~~vS~PmIV~~RvYFlsD~eG~GnlYSvdldGkDlrrHTnFtdYY~R~~nsDGk 279 (668)
T COG4946 200 YKGGTRGKLWISSDGGKTFEKFVDLDGNVSSPMIVGERVYFLSDHEGVGNLYSVDLDGKDLRRHTNFTDYYPRNANSDGK 279 (668)
T ss_pred ccCCccceEEEEecCCcceeeeeecCCCcCCceEEcceEEEEecccCccceEEeccCCchhhhcCCchhccccccCCCCc
Confidence 111 011100 0 00110110110 011 222567999999999 788888874444455688999
Q ss_pred eEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCC-CCC-CCCCcccCCccCCCCccceecCCCceEEEEEeecC
Q 004574 188 YVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDL-PPA-EDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDR 265 (744)
Q Consensus 188 ~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~-~~~-~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~ 265 (744)
+|+|.... +||+||++...++.|--. +.. ......+-........++.+ +|.. +++++....
T Consensus 280 rIvFq~~G--------------dIylydP~td~lekldI~lpl~rk~k~~k~~~pskyledfa~~-~Gd~-ia~VSRGka 343 (668)
T COG4946 280 RIVFQNAG--------------DIYLYDPETDSLEKLDIGLPLDRKKKQPKFVNPSKYLEDFAVV-NGDY-IALVSRGKA 343 (668)
T ss_pred EEEEecCC--------------cEEEeCCCcCcceeeecCCccccccccccccCHHHhhhhhccC-CCcE-EEEEecCcE
Confidence 99998543 799999998887777544 333 11110111111112223332 3444 555543210
Q ss_pred -------C------------------CCCc-c-CC-ccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEee
Q 004574 266 -------G------------------DANV-E-VS-PRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 266 -------~------------------~~~~-~-~~-~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
+ +... . .. .-+.|-+++. .+++.+++...-+.+..+..||||++++.+-
T Consensus 344 Fi~~~~~~~~iqv~~~~~VrY~r~~~~~e~~vigt~dgD~l~iyd~---~~~e~kr~e~~lg~I~av~vs~dGK~~vvaN 420 (668)
T COG4946 344 FIMRPWDGYSIQVGKKGGVRYRRIQVDPEGDVIGTNDGDKLGIYDK---DGGEVKRIEKDLGNIEAVKVSPDGKKVVVAN 420 (668)
T ss_pred EEECCCCCeeEEcCCCCceEEEEEccCCcceEEeccCCceEEEEec---CCceEEEeeCCccceEEEEEcCCCcEEEEEc
Confidence 0 0000 0 00 1113555555 5667677776777888899999999888654
Q ss_pred eeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceE
Q 004574 318 WYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFL 397 (744)
Q Consensus 318 ~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l 397 (744)
...+||++|++++ .++.+ +.+..... ..+.|+|++++|+|..-. + -...+|
T Consensus 421 ----dr~el~vididng--nv~~i-dkS~~~lI-----tdf~~~~nsr~iAYafP~----g-------------y~tq~I 471 (668)
T COG4946 421 ----DRFELWVIDIDNG--NVRLI-DKSEYGLI-----TDFDWHPNSRWIAYAFPE----G-------------YYTQSI 471 (668)
T ss_pred ----CceEEEEEEecCC--CeeEe-ccccccee-----EEEEEcCCceeEEEecCc----c-------------eeeeeE
Confidence 5678999999984 44333 33332221 127799999999998621 1 125679
Q ss_pred EEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCC
Q 004574 398 DLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKT 449 (744)
Q Consensus 398 ~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~ 449 (744)
.++|..+++...+++... .+..++|.||++.|.|-+....
T Consensus 472 klydm~~~Kiy~vTT~ta------------~DfsPaFD~d~ryLYfLs~RsL 511 (668)
T COG4946 472 KLYDMDGGKIYDVTTPTA------------YDFSPAFDPDGRYLYFLSARSL 511 (668)
T ss_pred EEEecCCCeEEEecCCcc------------cccCcccCCCCcEEEEEecccc
Confidence 999999999888876542 2233589999998888664433
No 23
>PRK00178 tolB translocation protein TolB; Provisional
Probab=99.90 E-value=5.8e-22 Score=214.19 Aligned_cols=253 Identities=15% Similarity=0.148 Sum_probs=183.6
Q ss_pred CCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCC
Q 004574 41 DGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPL 120 (744)
Q Consensus 41 DG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~ 120 (744)
-.++|||++... ...++..+||++|.+|+..++++.... .+..+.|||||+.|+|++... +
T Consensus 161 f~~~ia~v~~~~---~~~~~~~~l~~~d~~g~~~~~l~~~~~-----~~~~p~wSpDG~~la~~s~~~-~---------- 221 (430)
T PRK00178 161 FSTRILYVTAER---FSVNTRYTLQRSDYDGARAVTLLQSRE-----PILSPRWSPDGKRIAYVSFEQ-K---------- 221 (430)
T ss_pred ceeeEEEEEeeC---CCCCcceEEEEECCCCCCceEEecCCC-----ceeeeeECCCCCEEEEEEcCC-C----------
Confidence 456799987522 111345689999999999999986654 246779999999999985421 1
Q ss_pred CCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCc
Q 004574 121 GPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPY 198 (744)
Q Consensus 121 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~ 198 (744)
..+||++++ +|+.++++.. +....+.|||||++|+|......
T Consensus 222 -----------------------------------~~~l~~~~l~~g~~~~l~~~~g~~~~~~~SpDG~~la~~~~~~g- 265 (430)
T PRK00178 222 -----------------------------------RPRIFVQNLDTGRREQITNFEGLNGAPAWSPDGSKLAFVLSKDG- 265 (430)
T ss_pred -----------------------------------CCEEEEEECCCCCEEEccCCCCCcCCeEECCCCCEEEEEEccCC-
Confidence 158999999 6788888765 55667999999999999875532
Q ss_pred ccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceE
Q 004574 199 SYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDII 278 (744)
Q Consensus 199 ~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l 278 (744)
..+||+++++++..++++..... ...+.|||||+. |+|.+...+. ..|
T Consensus 266 ---------~~~Iy~~d~~~~~~~~lt~~~~~-------------~~~~~~spDg~~-i~f~s~~~g~---------~~i 313 (430)
T PRK00178 266 ---------NPEIYVMDLASRQLSRVTNHPAI-------------DTEPFWGKDGRT-LYFTSDRGGK---------PQI 313 (430)
T ss_pred ---------CceEEEEECCCCCeEEcccCCCC-------------cCCeEECCCCCE-EEEEECCCCC---------ceE
Confidence 24899999999988888765433 335689999998 8888644322 259
Q ss_pred EeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCce
Q 004574 279 YTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMM 358 (744)
Q Consensus 279 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 358 (744)
|++++ .+++.++++........++|||||++|+|.... .+..+|+++|+.+ ++.+.++..... .. +
T Consensus 314 y~~d~---~~g~~~~lt~~~~~~~~~~~Spdg~~i~~~~~~-~~~~~l~~~dl~t--g~~~~lt~~~~~------~~--p 379 (430)
T PRK00178 314 YKVNV---NGGRAERVTFVGNYNARPRLSADGKTLVMVHRQ-DGNFHVAAQDLQR--GSVRILTDTSLD------ES--P 379 (430)
T ss_pred EEEEC---CCCCEEEeecCCCCccceEECCCCCEEEEEEcc-CCceEEEEEECCC--CCEEEccCCCCC------CC--c
Confidence 99988 677777777544445678999999999998743 3456899999988 455666653211 12 3
Q ss_pred eeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEee
Q 004574 359 TRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWE 412 (744)
Q Consensus 359 ~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~ 412 (744)
.|||||+.|+|.... .....|+.+++.++..+++..
T Consensus 380 ~~spdg~~i~~~~~~------------------~g~~~l~~~~~~g~~~~~l~~ 415 (430)
T PRK00178 380 SVAPNGTMLIYATRQ------------------QGRGVLMLVSINGRVRLPLPT 415 (430)
T ss_pred eECCCCCEEEEEEec------------------CCceEEEEEECCCCceEECcC
Confidence 599999999998743 223458999987666555543
No 24
>COG1505 Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]
Probab=99.89 E-value=3.9e-20 Score=189.30 Aligned_cols=282 Identities=20% Similarity=0.196 Sum_probs=217.2
Q ss_pred ecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecCCCCCCCcCCCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCC
Q 004574 433 DINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNFPHPYPTLASLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGP 512 (744)
Q Consensus 433 ~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~~~~~~~~~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~ 512 (744)
+-+.+|..+++...+.+.|+++|++++.+++.+.+...+..+. -..+.+++.+-.+.||.+++++++. ++.+.. +
T Consensus 346 ~~~~~g~ev~l~~t~F~tP~~~~r~~~~~~eLe~ik~~p~~FD-a~~~~veQ~~atSkDGT~IPYFiv~-K~~~~d---~ 420 (648)
T COG1505 346 SADKDGDEVFLAFTSFTTPSTLYRLDLFGGELEVIREQPVQFD-ADNYEVEQFFATSKDGTRIPYFIVR-KGAKKD---E 420 (648)
T ss_pred cCCCCCcEEEEEeecccCCCceEEEecCCceehhhhhccCCcC-ccCceEEEEEEEcCCCccccEEEEe-cCCcCC---C
Confidence 3455778888888899999999999999999999988765443 3456889999999999999999998 664332 4
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC-----------ChHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK-----------LPNDSAEAA 581 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~-----------~~~~d~~~~ 581 (744)
.|++|+.+||-. +.-.|. | ......++++|.+.+..+-++...+|.. ...+|+.++
T Consensus 421 ~pTll~aYGGF~--------vsltP~-f----s~~~~~WLerGg~~v~ANIRGGGEfGp~WH~Aa~k~nrq~vfdDf~AV 487 (648)
T COG1505 421 NPTLLYAYGGFN--------ISLTPR-F----SGSRKLWLERGGVFVLANIRGGGEFGPEWHQAGMKENKQNVFDDFIAV 487 (648)
T ss_pred CceEEEeccccc--------cccCCc-c----chhhHHHHhcCCeEEEEecccCCccCHHHHHHHhhhcchhhhHHHHHH
Confidence 899999999732 221211 1 1233778889877666454454445544 345699999
Q ss_pred HHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC-----Ccccccccchh--hcHHHHHh
Q 004574 582 VEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP-----FGFQTEFRTLW--EATNVYIE 654 (744)
Q Consensus 582 ~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~-----~~~~~~~~~~~--~~~~~~~~ 654 (744)
++.|+++++..+++++|.|.|-||.++..++.+.|+.|.|+|+..|+.|+.... ..+..++..|. +....+.+
T Consensus 488 aedLi~rgitspe~lgi~GgSNGGLLvg~alTQrPelfgA~v~evPllDMlRYh~l~aG~sW~~EYG~Pd~P~d~~~l~~ 567 (648)
T COG1505 488 AEDLIKRGITSPEKLGIQGGSNGGLLVGAALTQRPELFGAAVCEVPLLDMLRYHLLTAGSSWIAEYGNPDDPEDRAFLLA 567 (648)
T ss_pred HHHHHHhCCCCHHHhhhccCCCCceEEEeeeccChhhhCceeeccchhhhhhhcccccchhhHhhcCCCCCHHHHHHHHh
Confidence 999999999999999999999999999999999999999999999998853211 11223333332 33456778
Q ss_pred cCcccccCC-CC-CCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccH-HHHHHHHHHHHHH
Q 004574 655 MSPITHANK-IK-KPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENV-MHVIWETDRWLQK 731 (744)
Q Consensus 655 ~~~~~~~~~-~~-~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~-~~~~~~~~~fl~~ 731 (744)
+||.++++. .+ .|+||..+.+|.+|. +.|++.++.+|++.+.++-+.+-.++||+-...... .+....+..||.+
T Consensus 568 YSPy~nl~~g~kYP~~LITTs~~DDRVH--PaHarKfaa~L~e~~~pv~~~e~t~gGH~g~~~~~~~A~~~a~~~afl~r 645 (648)
T COG1505 568 YSPYHNLKPGQKYPPTLITTSLHDDRVH--PAHARKFAAKLQEVGAPVLLREETKGGHGGAAPTAEIARELADLLAFLLR 645 (648)
T ss_pred cCchhcCCccccCCCeEEEccccccccc--chHHHHHHHHHHhcCCceEEEeecCCcccCCCChHHHHHHHHHHHHHHHH
Confidence 999999876 44 779999999999998 999999999999999999999999999986643332 4556668899988
Q ss_pred hcc
Q 004574 732 YCL 734 (744)
Q Consensus 732 ~l~ 734 (744)
.|.
T Consensus 646 ~L~ 648 (648)
T COG1505 646 TLG 648 (648)
T ss_pred hhC
Confidence 763
No 25
>KOG2237 consensus Predicted serine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.87 E-value=4.4e-19 Score=182.38 Aligned_cols=282 Identities=21% Similarity=0.221 Sum_probs=200.5
Q ss_pred CCCEEEEEEecCCCCceEEEEECCCCce--eeeecCCCCCCCcC--CCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCC
Q 004574 437 NQLKILTSKESKTEITQYHILSWPLKKS--SQITNFPHPYPTLA--SLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGP 512 (744)
Q Consensus 437 d~~~~~~~~~~~~~~~~i~~~~~~~g~~--~~lt~~~~~~~~~~--~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~ 512 (744)
+.+.+.|..++...|+.||.+|+..++. ..+.+.....|.+. ....+++++.+.||..++..+++.+.. +..++
T Consensus 392 ~~~~~~f~~sS~l~P~~iy~yDl~~~~~e~~vf~e~~~~lpg~~~s~y~~~r~~~~SkDGt~VPM~Iv~kk~~--k~dg~ 469 (712)
T KOG2237|consen 392 KSSTIRFQFSSFLTPGSIYDYDLANGKPEPSVFREITVVLPGFDASDYVVERIEVSSKDGTKVPMFIVYKKDI--KLDGS 469 (712)
T ss_pred CCceEEEEEeccCCCCeEEEeeccCCCCCCcceeeeccccCcccccceEEEEEEEecCCCCccceEEEEechh--hhcCC
Confidence 4578899999999999999999988743 33444444445544 348999999999999999999995553 33456
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC-----------ChHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK-----------LPNDSAEAA 581 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~-----------~~~~d~~~~ 581 (744)
.|++|+.||+...+.++. |. .....|+.+|++....+-++..++|.. ...+|++++
T Consensus 470 ~P~LLygYGay~isl~p~---------f~----~srl~lld~G~Vla~a~VRGGGe~G~~WHk~G~lakKqN~f~Dfia~ 536 (712)
T KOG2237|consen 470 KPLLLYGYGAYGISLDPS---------FR----ASRLSLLDRGWVLAYANVRGGGEYGEQWHKDGRLAKKQNSFDDFIAC 536 (712)
T ss_pred CceEEEEecccceeeccc---------cc----cceeEEEecceEEEEEeeccCcccccchhhccchhhhcccHHHHHHH
Confidence 899999999864433332 21 112345679997776555555555554 345699999
Q ss_pred HHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC----CCcccccccchhhc---HHHHHh
Q 004574 582 VEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT----PFGFQTEFRTLWEA---TNVYIE 654 (744)
Q Consensus 582 ~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~----~~~~~~~~~~~~~~---~~~~~~ 654 (744)
++||.++++..++|+++.|.|.||.++..++.++|++|.|+|+..|++|...+ ..........-|.+ .+.+..
T Consensus 537 AeyLve~gyt~~~kL~i~G~SaGGlLvga~iN~rPdLF~avia~VpfmDvL~t~~~tilplt~sd~ee~g~p~~~~~~~~ 616 (712)
T KOG2237|consen 537 AEYLVENGYTQPSKLAIEGGSAGGLLVGACINQRPDLFGAVIAKVPFMDVLNTHKDTILPLTTSDYEEWGNPEDFEDLIK 616 (712)
T ss_pred HHHHHHcCCCCccceeEecccCccchhHHHhccCchHhhhhhhcCcceehhhhhccCccccchhhhcccCChhhhhhhhe
Confidence 99999999999999999999999999999999999999999999999884321 11111111111222 233344
Q ss_pred c---CcccccCCCC-CC-EEEEeeCCCCCCCCCHHHHHHHHHHHHhCC-------CcEEEEEeCCCCcccCccc-cHHHH
Q 004574 655 M---SPITHANKIK-KP-ILIIHGEVDDKVGLFPMQAERFFDALKGHG-------ALSRLVLLPFEHHVYAARE-NVMHV 721 (744)
Q Consensus 655 ~---~~~~~~~~~~-~P-~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~-------~~~~~~~~~~~~H~~~~~~-~~~~~ 721 (744)
+ +|+.++.+-. -| +|+..+.+|.+|. +.++..+.++|+.+- .++-+.+..++||+..... ...+.
T Consensus 617 i~~y~pv~~i~~q~~YPS~lvtta~hD~RV~--~~~~~K~vAklre~~~~~~~q~~pvll~i~~~agH~~~~~~~k~~~E 694 (712)
T KOG2237|consen 617 ISPYSPVDNIKKQVQYPSMLVTTADHDDRVG--PLESLKWVAKLREATCDSLKQTNPVLLRIETKAGHGAEKPRFKQIEE 694 (712)
T ss_pred ecccCccCCCchhccCcceEEeeccCCCccc--ccchHHHHHHHHHHhhcchhcCCCEEEEEecCCccccCCchHHHHHH
Confidence 4 4444444333 45 8999999999998 899999999998642 4688999999999875322 22334
Q ss_pred HHHHHHHHHHhccC
Q 004574 722 IWETDRWLQKYCLS 735 (744)
Q Consensus 722 ~~~~~~fl~~~l~~ 735 (744)
....++||.+.+..
T Consensus 695 ~a~~yaFl~K~~~~ 708 (712)
T KOG2237|consen 695 AAFRYAFLAKMLNS 708 (712)
T ss_pred HHHHHHHHHHHhcC
Confidence 45578888887654
No 26
>PRK01742 tolB translocation protein TolB; Provisional
Probab=99.87 E-value=1.8e-20 Score=201.25 Aligned_cols=222 Identities=19% Similarity=0.164 Sum_probs=158.6
Q ss_pred CCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCC
Q 004574 42 GKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLG 121 (744)
Q Consensus 42 G~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~ 121 (744)
+++|||++... +.....+||++|.+|...++|+.+.. .+..+.|||||+.|+|++... +
T Consensus 168 ~~ria~v~~~~----~~~~~~~i~i~d~dg~~~~~lt~~~~-----~v~~p~wSPDG~~la~~s~~~-~----------- 226 (429)
T PRK01742 168 RTRIAYVVQKN----GGSQPYEVRVADYDGFNQFIVNRSSQ-----PLMSPAWSPDGSKLAYVSFEN-K----------- 226 (429)
T ss_pred CCEEEEEEEEc----CCCceEEEEEECCCCCCceEeccCCC-----ccccceEcCCCCEEEEEEecC-C-----------
Confidence 67899987632 11335899999999998888887654 356789999999999986421 1
Q ss_pred CeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcc
Q 004574 122 PKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYS 199 (744)
Q Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~ 199 (744)
..+||++|+ +|+.++++.. +....++|||||++|++......
T Consensus 227 ----------------------------------~~~i~i~dl~tg~~~~l~~~~g~~~~~~wSPDG~~La~~~~~~g-- 270 (429)
T PRK01742 227 ----------------------------------KSQLVVHDLRSGARKVVASFRGHNGAPAFSPDGSRLAFASSKDG-- 270 (429)
T ss_pred ----------------------------------CcEEEEEeCCCCceEEEecCCCccCceeECCCCCEEEEEEecCC--
Confidence 158999999 6677777654 55567999999999999865431
Q ss_pred cccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEE
Q 004574 200 YKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIY 279 (744)
Q Consensus 200 ~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~ 279 (744)
..+||++++++++.++++..... ...+.|||||+. |+|++..++. .+||
T Consensus 271 --------~~~Iy~~d~~~~~~~~lt~~~~~-------------~~~~~wSpDG~~-i~f~s~~~g~---------~~I~ 319 (429)
T PRK01742 271 --------VLNIYVMGANGGTPSQLTSGAGN-------------NTEPSWSPDGQS-ILFTSDRSGS---------PQVY 319 (429)
T ss_pred --------cEEEEEEECCCCCeEeeccCCCC-------------cCCEEECCCCCE-EEEEECCCCC---------ceEE
Confidence 13799999998888888765433 446799999998 8888644332 2589
Q ss_pred eccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCcee
Q 004574 280 TQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMT 359 (744)
Q Consensus 280 ~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 359 (744)
.++. .++..+.+. ... ..++|||||++|++... ..++++|+.++ +.+.++... .. .. ++
T Consensus 320 ~~~~---~~~~~~~l~-~~~--~~~~~SpDG~~ia~~~~-----~~i~~~Dl~~g--~~~~lt~~~-~~-----~~--~~ 378 (429)
T PRK01742 320 RMSA---SGGGASLVG-GRG--YSAQISADGKTLVMING-----DNVVKQDLTSG--STEVLSSTF-LD-----ES--PS 378 (429)
T ss_pred EEEC---CCCCeEEec-CCC--CCccCCCCCCEEEEEcC-----CCEEEEECCCC--CeEEecCCC-CC-----CC--ce
Confidence 9887 555554442 222 46889999999998763 35888898873 444554321 10 22 45
Q ss_pred eCCCCCeEEEEee
Q 004574 360 RTSTGTNVIAKIK 372 (744)
Q Consensus 360 ~spdg~~l~~~~~ 372 (744)
|||||+.|++...
T Consensus 379 ~sPdG~~i~~~s~ 391 (429)
T PRK01742 379 ISPNGIMIIYSST 391 (429)
T ss_pred ECCCCCEEEEEEc
Confidence 9999999999873
No 27
>PRK01742 tolB translocation protein TolB; Provisional
Probab=99.87 E-value=1.5e-20 Score=201.78 Aligned_cols=227 Identities=17% Similarity=0.143 Sum_probs=164.5
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
..||+++.++ +..+.++. ....+..|+|||||++|||++. .++..+||++++.+|+.++++....
T Consensus 184 ~~i~i~d~dg----~~~~~lt~--~~~~v~~p~wSPDG~~la~~s~-------~~~~~~i~i~dl~tg~~~~l~~~~g-- 248 (429)
T PRK01742 184 YEVRVADYDG----FNQFIVNR--SSQPLMSPAWSPDGSKLAYVSF-------ENKKSQLVVHDLRSGARKVVASFRG-- 248 (429)
T ss_pred EEEEEECCCC----CCceEecc--CCCccccceEcCCCCEEEEEEe-------cCCCcEEEEEeCCCCceEEEecCCC--
Confidence 5789999877 66667763 2335889999999999999864 1345789999999999888875543
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
....++|||||++|+++.... +..+||++++
T Consensus 249 ---~~~~~~wSPDG~~La~~~~~~----------------------------------------------g~~~Iy~~d~ 279 (429)
T PRK01742 249 ---HNGAPAFSPDGSRLAFASSKD----------------------------------------------GVLNIYVMGA 279 (429)
T ss_pred ---ccCceeECCCCCEEEEEEecC----------------------------------------------CcEEEEEEEC
Confidence 134679999999999975311 1157999999
Q ss_pred -CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC
Q 004574 165 -DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 165 -~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
+++.++++.. .....+.|||||++|+|.+.... ..+||.++.+++..+.+... .
T Consensus 280 ~~~~~~~lt~~~~~~~~~~wSpDG~~i~f~s~~~g----------~~~I~~~~~~~~~~~~l~~~-~------------- 335 (429)
T PRK01742 280 NGGTPSQLTSGAGNNTEPSWSPDGQSILFTSDRSG----------SPQVYRMSASGGGASLVGGR-G------------- 335 (429)
T ss_pred CCCCeEeeccCCCCcCCEEECCCCCEEEEEECCCC----------CceEEEEECCCCCeEEecCC-C-------------
Confidence 5588888877 66778999999999999876432 24899999887766555211 0
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
..+.|||||+. |++... ..++++++ .+++.+.++.. .....++|||||+.|+|.+. +..
T Consensus 336 --~~~~~SpDG~~-ia~~~~-------------~~i~~~Dl---~~g~~~~lt~~-~~~~~~~~sPdG~~i~~~s~-~g~ 394 (429)
T PRK01742 336 --YSAQISADGKT-LVMING-------------DNVVKQDL---TSGSTEVLSST-FLDESPSISPNGIMIIYSST-QGL 394 (429)
T ss_pred --CCccCCCCCCE-EEEEcC-------------CCEEEEEC---CCCCeEEecCC-CCCCCceECCCCCEEEEEEc-CCC
Confidence 13579999998 877742 23777887 66666666543 34567999999999999874 334
Q ss_pred ceeEEEEcCCCCCCcceeee
Q 004574 323 QTRTWLVCPGSKDVAPRVLF 342 (744)
Q Consensus 323 ~~~l~~~~~~~~~~~~~~l~ 342 (744)
...++++++++ ...+.+.
T Consensus 395 ~~~l~~~~~~G--~~~~~l~ 412 (429)
T PRK01742 395 GKVLQLVSADG--RFKARLP 412 (429)
T ss_pred ceEEEEEECCC--CceEEcc
Confidence 45677777776 3444453
No 28
>KOG1455 consensus Lysophospholipase [Lipid transport and metabolism]
Probab=99.85 E-value=3.7e-20 Score=175.21 Aligned_cols=228 Identities=18% Similarity=0.192 Sum_probs=163.2
Q ss_pred CceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEE
Q 004574 480 LQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVL 559 (744)
Q Consensus 480 ~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~ 559 (744)
+......+.+.+|..+....|.|..- .+++.+|+++||.|. .....+...+..|+..||.|+
T Consensus 25 ~~~~~~~~~n~rG~~lft~~W~p~~~----~~pr~lv~~~HG~g~--------------~~s~~~~~~a~~l~~~g~~v~ 86 (313)
T KOG1455|consen 25 VTYSESFFTNPRGAKLFTQSWLPLSG----TEPRGLVFLCHGYGE--------------HSSWRYQSTAKRLAKSGFAVY 86 (313)
T ss_pred cceeeeeEEcCCCCEeEEEecccCCC----CCCceEEEEEcCCcc--------------cchhhHHHHHHHHHhCCCeEE
Confidence 34555566778999999999999652 246889999999751 112223356789999999999
Q ss_pred ecCCCCCCCCCCCC-----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC
Q 004574 560 AGPSIPIIGEGDKL-----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS 628 (744)
Q Consensus 560 ~~~~~~~~g~g~~~-----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~ 628 (744)
+ .+..|+|.++ ..+|+...++.++.+..--....+++||||||.+++.++.++|+.+.++|+++|+
T Consensus 87 a---~D~~GhG~SdGl~~yi~~~d~~v~D~~~~~~~i~~~~e~~~lp~FL~GeSMGGAV~Ll~~~k~p~~w~G~ilvaPm 163 (313)
T KOG1455|consen 87 A---IDYEGHGRSDGLHAYVPSFDLVVDDVISFFDSIKEREENKGLPRFLFGESMGGAVALLIALKDPNFWDGAILVAPM 163 (313)
T ss_pred E---eeccCCCcCCCCcccCCcHHHHHHHHHHHHHHHhhccccCCCCeeeeecCcchHHHHHHHhhCCcccccceeeecc
Confidence 9 6667776653 2347777777777765444468999999999999999999999999999999997
Q ss_pred CCCCC-------------------CCCc-cccc------ccchhhcH-------------------HHHH-hcCcccccC
Q 004574 629 YNKTL-------------------TPFG-FQTE------FRTLWEAT-------------------NVYI-EMSPITHAN 662 (744)
Q Consensus 629 ~~~~~-------------------~~~~-~~~~------~~~~~~~~-------------------~~~~-~~~~~~~~~ 662 (744)
+.... ..+. .+.. .+.++... +.+. -......+.
T Consensus 164 c~i~~~~kp~p~v~~~l~~l~~liP~wk~vp~~d~~~~~~kdp~~r~~~~~npl~y~g~pRl~T~~ElLr~~~~le~~l~ 243 (313)
T KOG1455|consen 164 CKISEDTKPHPPVISILTLLSKLIPTWKIVPTKDIIDVAFKDPEKRKILRSDPLCYTGKPRLKTAYELLRVTADLEKNLN 243 (313)
T ss_pred cccCCccCCCcHHHHHHHHHHHhCCceeecCCccccccccCCHHHHHHhhcCCceecCCccHHHHHHHHHHHHHHHHhcc
Confidence 53110 0000 0000 01111100 0000 012234567
Q ss_pred CCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccC---ccccHHHHHHHHHHHHHHh
Q 004574 663 KIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYA---ARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 663 ~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~---~~~~~~~~~~~~~~fl~~~ 732 (744)
++.+|+|++||+.|.+.. +..++++|+..... ...+.+|||+.|.+. ..++.+.++..|++||+++
T Consensus 244 ~vtvPflilHG~dD~VTD--p~~Sk~Lye~A~S~--DKTlKlYpGm~H~Ll~gE~~en~e~Vf~DI~~Wl~~r 312 (313)
T KOG1455|consen 244 EVTVPFLILHGTDDKVTD--PKVSKELYEKASSS--DKTLKLYPGMWHSLLSGEPDENVEIVFGDIISWLDER 312 (313)
T ss_pred cccccEEEEecCCCcccC--cHHHHHHHHhccCC--CCceeccccHHHHhhcCCCchhHHHHHHHHHHHHHhc
Confidence 889999999999999977 99999999977654 459999999999976 3577789999999999875
No 29
>PRK10566 esterase; Provisional
Probab=99.85 E-value=1.1e-19 Score=181.64 Aligned_cols=212 Identities=18% Similarity=0.245 Sum_probs=140.9
Q ss_pred EEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC-
Q 004574 494 PLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK- 572 (744)
Q Consensus 494 ~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~- 572 (744)
.+....|.|.+ ..+++.|+||++||.+.. . ......+..|+++||+|+.++.+ |+|.+
T Consensus 11 ~~~~~~~~p~~---~~~~~~p~vv~~HG~~~~------------~---~~~~~~~~~l~~~G~~v~~~d~~---g~G~~~ 69 (249)
T PRK10566 11 GIEVLHAFPAG---QRDTPLPTVFFYHGFTSS------------K---LVYSYFAVALAQAGFRVIMPDAP---MHGARF 69 (249)
T ss_pred CcceEEEcCCC---CCCCCCCEEEEeCCCCcc------------c---chHHHHHHHHHhCCCEEEEecCC---cccccC
Confidence 34456677864 112357999999996411 0 11224577888999999995443 22211
Q ss_pred ----------------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEcc-CCCCC----
Q 004574 573 ----------------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARS-GSYNK---- 631 (744)
Q Consensus 573 ----------------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~-~~~~~---- 631 (744)
...+|+.++++++.++..+|.++|+++||||||.+|+.++.++|+...+++... +.+..
T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~i~v~G~S~Gg~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (249)
T PRK10566 70 SGDEARRLNHFWQILLQNMQEFPTLRAAIREEGWLLDDRLAVGGASMGGMTALGIMARHPWVKCVASLMGSGYFTSLART 149 (249)
T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHhcCCcCccceeEEeecccHHHHHHHHHhCCCeeEEEEeeCcHHHHHHHHH
Confidence 123477788899988888899999999999999999999999876444433332 22110
Q ss_pred CCCCCccccc--ccchhhcHHHHHhcCcccccCCC-CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCC--cEEEEEeC
Q 004574 632 TLTPFGFQTE--FRTLWEATNVYIEMSPITHANKI-KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGA--LSRLVLLP 706 (744)
Q Consensus 632 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~-~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~--~~~~~~~~ 706 (744)
.......... ..........+..+++...+.++ ++|+|++||++|..+| +.++++++++++.++. ++++++++
T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~P~Lii~G~~D~~v~--~~~~~~l~~~l~~~g~~~~~~~~~~~ 227 (249)
T PRK10566 150 LFPPLIPETAAQQAEFNNIVAPLAEWEVTHQLEQLADRPLLLWHGLADDVVP--AAESLRLQQALRERGLDKNLTCLWEP 227 (249)
T ss_pred hcccccccccccHHHHHHHHHHHhhcChhhhhhhcCCCCEEEEEcCCCCcCC--HHHHHHHHHHHHhcCCCcceEEEecC
Confidence 0000000000 00000111122334455556666 6999999999999999 9999999999999886 47999999
Q ss_pred CCCcccCccccHHHHHHHHHHHHHHhc
Q 004574 707 FEHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 707 ~~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
+++|.+. ...+..+.+||+++|
T Consensus 228 ~~~H~~~-----~~~~~~~~~fl~~~~ 249 (249)
T PRK10566 228 GVRHRIT-----PEALDAGVAFFRQHL 249 (249)
T ss_pred CCCCccC-----HHHHHHHHHHHHhhC
Confidence 9999875 256889999999875
No 30
>PLN02298 hydrolase, alpha/beta fold family protein
Probab=99.85 E-value=2.5e-19 Score=186.78 Aligned_cols=237 Identities=15% Similarity=0.205 Sum_probs=159.9
Q ss_pred CcCCCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCC
Q 004574 476 TLASLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARR 555 (744)
Q Consensus 476 ~~~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G 555 (744)
....++.+...+...+|.++++..+.|.+. ..+.++||++||.+.. ..+ .....+..|+++|
T Consensus 26 ~~~~~~~~~~~~~~~dg~~l~~~~~~~~~~----~~~~~~VvllHG~~~~------------~~~--~~~~~~~~L~~~G 87 (330)
T PLN02298 26 ALKGIKGSKSFFTSPRGLSLFTRSWLPSSS----SPPRALIFMVHGYGND------------ISW--TFQSTAIFLAQMG 87 (330)
T ss_pred hccCCccccceEEcCCCCEEEEEEEecCCC----CCCceEEEEEcCCCCC------------cce--ehhHHHHHHHhCC
Confidence 344456667778888999999999988642 1347899999996410 011 1112345678899
Q ss_pred eEEEecCCCCCCCCCCCC-----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEE
Q 004574 556 FAVLAGPSIPIIGEGDKL-----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIA 624 (744)
Q Consensus 556 ~~v~~~~~~~~~g~g~~~-----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~ 624 (744)
|.|++ ++.+|+|.+. ..+|+.++++++......+..+++|+||||||.+|+.++.++|++++++|+
T Consensus 88 y~V~~---~D~rGhG~S~~~~~~~~~~~~~~~D~~~~i~~l~~~~~~~~~~i~l~GhSmGG~ia~~~a~~~p~~v~~lvl 164 (330)
T PLN02298 88 FACFA---LDLEGHGRSEGLRAYVPNVDLVVEDCLSFFNSVKQREEFQGLPRFLYGESMGGAICLLIHLANPEGFDGAVL 164 (330)
T ss_pred CEEEE---ecCCCCCCCCCccccCCCHHHHHHHHHHHHHHHHhcccCCCCCEEEEEecchhHHHHHHHhcCcccceeEEE
Confidence 99999 5666666542 234888899988876444456899999999999999999999999999999
Q ss_pred ccCCCCCCCC---CCcc-----------cc----cccchh------hcHHHHHhcC----------------------cc
Q 004574 625 RSGSYNKTLT---PFGF-----------QT----EFRTLW------EATNVYIEMS----------------------PI 658 (744)
Q Consensus 625 ~~~~~~~~~~---~~~~-----------~~----~~~~~~------~~~~~~~~~~----------------------~~ 658 (744)
++|....... .+.. .. ...... .....+...+ ..
T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (330)
T PLN02298 165 VAPMCKISDKIRPPWPIPQILTFVARFLPTLAIVPTADLLEKSVKVPAKKIIAKRNPMRYNGKPRLGTVVELLRVTDYLG 244 (330)
T ss_pred ecccccCCcccCCchHHHHHHHHHHHHCCCCccccCCCcccccccCHHHHHHHHhCccccCCCccHHHHHHHHHHHHHHH
Confidence 9986532110 0000 00 000000 0000000001 12
Q ss_pred cccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcc-c--cHHHHHHHHHHHHHHhccC
Q 004574 659 THANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAR-E--NVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 659 ~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~-~--~~~~~~~~~~~fl~~~l~~ 735 (744)
..+.++++|+|++||++|.++| ++.+++++++++. ...+++++++++|.+... + ..+.+.+.+.+||++++..
T Consensus 245 ~~l~~i~~PvLii~G~~D~ivp--~~~~~~l~~~i~~--~~~~l~~~~~a~H~~~~e~pd~~~~~~~~~i~~fl~~~~~~ 320 (330)
T PLN02298 245 KKLKDVSIPFIVLHGSADVVTD--PDVSRALYEEAKS--EDKTIKIYDGMMHSLLFGEPDENIEIVRRDILSWLNERCTG 320 (330)
T ss_pred HhhhhcCCCEEEEecCCCCCCC--HHHHHHHHHHhcc--CCceEEEcCCcEeeeecCCCHHHHHHHHHHHHHHHHHhccC
Confidence 2356789999999999999999 9999999887753 345899999999997532 1 2356788899999999865
Q ss_pred CC
Q 004574 736 NT 737 (744)
Q Consensus 736 ~~ 737 (744)
..
T Consensus 321 ~~ 322 (330)
T PLN02298 321 KA 322 (330)
T ss_pred CC
Confidence 43
No 31
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=99.83 E-value=7.4e-19 Score=190.03 Aligned_cols=227 Identities=19% Similarity=0.203 Sum_probs=166.1
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
..||+++.++ +..++++... .....|+|||||++|||+... .+..+||++++++|+.++++....
T Consensus 170 ~~l~~~d~~g----~~~~~l~~~~--~~~~~p~~Spdg~~la~~~~~-------~~~~~i~v~d~~~g~~~~~~~~~~-- 234 (417)
T TIGR02800 170 YELQVADYDG----ANPQTITRSR--EPILSPAWSPDGQKLAYVSFE-------SGKPEIYVQDLATGQREKVASFPG-- 234 (417)
T ss_pred ceEEEEcCCC----CCCEEeecCC--CceecccCCCCCCEEEEEEcC-------CCCcEEEEEECCCCCEEEeecCCC--
Confidence 3589998877 7788887422 247899999999999997641 345789999999998888865443
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
....+.|||||+.|+|+.... ...+||++++
T Consensus 235 ---~~~~~~~spDg~~l~~~~~~~----------------------------------------------~~~~i~~~d~ 265 (417)
T TIGR02800 235 ---MNGAPAFSPDGSKLAVSLSKD----------------------------------------------GNPDIYVMDL 265 (417)
T ss_pred ---CccceEECCCCCEEEEEECCC----------------------------------------------CCccEEEEEC
Confidence 234679999999999874311 0157999999
Q ss_pred -CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC
Q 004574 165 -DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 165 -~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
++..++++.. .....+.|+|||++|+|.+.... ..+||++++++++.++++.....
T Consensus 266 ~~~~~~~l~~~~~~~~~~~~s~dg~~l~~~s~~~g----------~~~iy~~d~~~~~~~~l~~~~~~------------ 323 (417)
T TIGR02800 266 DGKQLTRLTNGPGIDTEPSWSPDGKSIAFTSDRGG----------SPQIYMMDADGGEVRRLTFRGGY------------ 323 (417)
T ss_pred CCCCEEECCCCCCCCCCEEECCCCCEEEEEECCCC----------CceEEEEECCCCCEEEeecCCCC------------
Confidence 5688888766 44557899999999999876532 23799999998877776643222
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
...+.|||||+. |++..... .+.+|+++++ .++..+.++. ......++|||||+.|+|... ..+
T Consensus 324 -~~~~~~spdg~~-i~~~~~~~---------~~~~i~~~d~---~~~~~~~l~~-~~~~~~p~~spdg~~l~~~~~-~~~ 387 (417)
T TIGR02800 324 -NASPSWSPDGDL-IAFVHREG---------GGFNIAVMDL---DGGGERVLTD-TGLDESPSFAPNGRMILYATT-RGG 387 (417)
T ss_pred -ccCeEECCCCCE-EEEEEccC---------CceEEEEEeC---CCCCeEEccC-CCCCCCceECCCCCEEEEEEe-CCC
Confidence 345689999998 88875322 1235899998 5566555553 334567899999999999884 345
Q ss_pred ceeEEEEcCCC
Q 004574 323 QTRTWLVCPGS 333 (744)
Q Consensus 323 ~~~l~~~~~~~ 333 (744)
...+++++.++
T Consensus 388 ~~~l~~~~~~g 398 (417)
T TIGR02800 388 RGVLGLVSTDG 398 (417)
T ss_pred cEEEEEEECCC
Confidence 57899888776
No 32
>PRK05077 frsA fermentation/respiration switch protein; Reviewed
Probab=99.83 E-value=5.7e-19 Score=186.69 Aligned_cols=223 Identities=18% Similarity=0.162 Sum_probs=151.5
Q ss_pred CCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEE
Q 004574 479 SLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAV 558 (744)
Q Consensus 479 ~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v 558 (744)
....+.++|...++..+++++++|.. .++.|+||++||.+. .........+..|+++||+|
T Consensus 165 ~~~~e~v~i~~~~g~~l~g~l~~P~~-----~~~~P~Vli~gG~~~--------------~~~~~~~~~~~~La~~Gy~v 225 (414)
T PRK05077 165 PGELKELEFPIPGGGPITGFLHLPKG-----DGPFPTVLVCGGLDS--------------LQTDYYRLFRDYLAPRGIAM 225 (414)
T ss_pred CCceEEEEEEcCCCcEEEEEEEECCC-----CCCccEEEEeCCccc--------------chhhhHHHHHHHHHhCCCEE
Confidence 34578889987777789999999974 346899887776420 00011123456788999999
Q ss_pred EecCCCCCCCCCCCC-------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCC
Q 004574 559 LAGPSIPIIGEGDKL-------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNK 631 (744)
Q Consensus 559 ~~~~~~~~~g~g~~~-------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~ 631 (744)
++ ++.+|+|.+. ......++++++..+..+|.+||+++||||||++|+.++..+|++++++|+++|+++.
T Consensus 226 l~---~D~pG~G~s~~~~~~~d~~~~~~avld~l~~~~~vd~~ri~l~G~S~GG~~Al~~A~~~p~ri~a~V~~~~~~~~ 302 (414)
T PRK05077 226 LT---IDMPSVGFSSKWKLTQDSSLLHQAVLNALPNVPWVDHTRVAAFGFRFGANVAVRLAYLEPPRLKAVACLGPVVHT 302 (414)
T ss_pred EE---ECCCCCCCCCCCCccccHHHHHHHHHHHHHhCcccCcccEEEEEEChHHHHHHHHHHhCCcCceEEEEECCccch
Confidence 99 4555555432 1113457889999998899999999999999999999999998999999999988642
Q ss_pred CCCCCcc----cc--------cccchhhcHHHH----HhcCc--ccc-cCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHH
Q 004574 632 TLTPFGF----QT--------EFRTLWEATNVY----IEMSP--ITH-ANKIKKPILIIHGEVDDKVGLFPMQAERFFDA 692 (744)
Q Consensus 632 ~~~~~~~----~~--------~~~~~~~~~~~~----~~~~~--~~~-~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~ 692 (744)
....... .. .........+.+ ..++. ... ..++++|+|++||++|.++| ..+++.+.+.
T Consensus 303 ~~~~~~~~~~~p~~~~~~la~~lg~~~~~~~~l~~~l~~~sl~~~~~l~~~i~~PvLiI~G~~D~ivP--~~~a~~l~~~ 380 (414)
T PRK05077 303 LLTDPKRQQQVPEMYLDVLASRLGMHDASDEALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSP--EEDSRLIASS 380 (414)
T ss_pred hhcchhhhhhchHHHHHHHHHHhCCCCCChHHHHHHhhhccchhhhhhccCCCCcEEEEecCCCCCCC--HHHHHHHHHh
Confidence 1111000 00 000000011111 11111 011 25689999999999999999 8888866544
Q ss_pred HHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhc
Q 004574 693 LKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 693 l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
. .+.+++++|++ |. .+....++..+.+||.++|
T Consensus 381 ~----~~~~l~~i~~~-~~---~e~~~~~~~~i~~wL~~~l 413 (414)
T PRK05077 381 S----ADGKLLEIPFK-PV---YRNFDKALQEISDWLEDRL 413 (414)
T ss_pred C----CCCeEEEccCC-Cc---cCCHHHHHHHHHHHHHHHh
Confidence 3 34588999987 33 3457899999999999876
No 33
>PLN02385 hydrolase; alpha/beta fold family protein
Probab=99.82 E-value=9.2e-19 Score=183.54 Aligned_cols=227 Identities=14% Similarity=0.142 Sum_probs=149.3
Q ss_pred eEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEec
Q 004574 482 KEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAG 561 (744)
Q Consensus 482 ~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~ 561 (744)
.+...+.+.+|.++.+..+.|++ ..++|+||++||.+.. ...+....+..|+++||.|++
T Consensus 61 ~~~~~~~~~~g~~l~~~~~~p~~-----~~~~~~iv~lHG~~~~--------------~~~~~~~~~~~l~~~g~~v~~- 120 (349)
T PLN02385 61 TEESYEVNSRGVEIFSKSWLPEN-----SRPKAAVCFCHGYGDT--------------CTFFFEGIARKIASSGYGVFA- 120 (349)
T ss_pred eeeeeEEcCCCCEEEEEEEecCC-----CCCCeEEEEECCCCCc--------------cchHHHHHHHHHHhCCCEEEE-
Confidence 44445556789999999999864 2357999999996411 111112345577889999999
Q ss_pred CCCCCCCCCCCC-----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCC
Q 004574 562 PSIPIIGEGDKL-----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYN 630 (744)
Q Consensus 562 ~~~~~~g~g~~~-----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~ 630 (744)
++.+|+|.+. ..+|+.+.++.+..+...+..+++|+||||||.+++.++.++|++++++|+++|...
T Consensus 121 --~D~~G~G~S~~~~~~~~~~~~~~~dv~~~l~~l~~~~~~~~~~~~LvGhSmGG~val~~a~~~p~~v~glVLi~p~~~ 198 (349)
T PLN02385 121 --MDYPGFGLSEGLHGYIPSFDDLVDDVIEHYSKIKGNPEFRGLPSFLFGQSMGGAVALKVHLKQPNAWDGAILVAPMCK 198 (349)
T ss_pred --ecCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhccccCCCCEEEEEeccchHHHHHHHHhCcchhhheeEeccccc
Confidence 5556665442 123556666666544334456899999999999999999999999999999998643
Q ss_pred CCC---CCCc----------------c-cc-c-ccchhhcH-----------------------HHHHh-cCcccccCCC
Q 004574 631 KTL---TPFG----------------F-QT-E-FRTLWEAT-----------------------NVYIE-MSPITHANKI 664 (744)
Q Consensus 631 ~~~---~~~~----------------~-~~-~-~~~~~~~~-----------------------~~~~~-~~~~~~~~~~ 664 (744)
... .... . .. . ....+... +.+.. ......+.++
T Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~l~~i 278 (349)
T PLN02385 199 IADDVVPPPLVLQILILLANLLPKAKLVPQKDLAELAFRDLKKRKMAEYNVIAYKDKPRLRTAVELLRTTQEIEMQLEEV 278 (349)
T ss_pred ccccccCchHHHHHHHHHHHHCCCceecCCCccccccccCHHHHHHhhcCcceeCCCcchHHHHHHHHHHHHHHHhcccC
Confidence 110 0000 0 00 0 00000000 00000 0112335678
Q ss_pred CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcccc---HHHHHHHHHHHHHHhcc
Q 004574 665 KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAREN---VMHVIWETDRWLQKYCL 734 (744)
Q Consensus 665 ~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~---~~~~~~~~~~fl~~~l~ 734 (744)
++|+|++||++|.+++ ...++++++.+.. .+++++++++++|.+...+. .+.++..+++||++++.
T Consensus 279 ~~P~Lii~G~~D~vv~--~~~~~~l~~~~~~--~~~~l~~i~~~gH~l~~e~p~~~~~~v~~~i~~wL~~~~~ 347 (349)
T PLN02385 279 SLPLLILHGEADKVTD--PSVSKFLYEKASS--SDKKLKLYEDAYHSILEGEPDEMIFQVLDDIISWLDSHST 347 (349)
T ss_pred CCCEEEEEeCCCCccC--hHHHHHHHHHcCC--CCceEEEeCCCeeecccCCChhhHHHHHHHHHHHHHHhcc
Confidence 9999999999999998 8999988877643 34589999999999763221 34588999999998864
No 34
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=99.82 E-value=3.3e-18 Score=185.01 Aligned_cols=232 Identities=18% Similarity=0.199 Sum_probs=168.3
Q ss_pred cCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCC
Q 004574 39 SPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMV 118 (744)
Q Consensus 39 SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~ 118 (744)
...|.+++|++..+ .++..+||++|.+++..++|+.... .+..+.|||||++|+|+.... +
T Consensus 152 ~~~~~~~~~~~~~~-----~~~~~~l~~~d~~g~~~~~l~~~~~-----~~~~p~~Spdg~~la~~~~~~-~-------- 212 (417)
T TIGR02800 152 GAFSTRIAYVSKSG-----KSRRYELQVADYDGANPQTITRSRE-----PILSPAWSPDGQKLAYVSFES-G-------- 212 (417)
T ss_pred CCcCCEEEEEEEeC-----CCCcceEEEEcCCCCCCEEeecCCC-----ceecccCCCCCCEEEEEEcCC-C--------
Confidence 34577899987522 2457889999999999999986553 245779999999999985421 1
Q ss_pred CCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeC
Q 004574 119 PLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHR 196 (744)
Q Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~ 196 (744)
..+|+++++ +|+.+.+... +....++|||||++|+|.....
T Consensus 213 -------------------------------------~~~i~v~d~~~g~~~~~~~~~~~~~~~~~spDg~~l~~~~~~~ 255 (417)
T TIGR02800 213 -------------------------------------KPEIYVQDLATGQREKVASFPGMNGAPAFSPDGSKLAVSLSKD 255 (417)
T ss_pred -------------------------------------CcEEEEEECCCCCEEEeecCCCCccceEECCCCCEEEEEECCC
Confidence 157999999 6667776655 5566789999999999986543
Q ss_pred CcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccc
Q 004574 197 PYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRD 276 (744)
Q Consensus 197 ~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~ 276 (744)
. ..+||+++++++..++++..... ...+.|+|||++ |+|.+...+. .
T Consensus 256 ~----------~~~i~~~d~~~~~~~~l~~~~~~-------------~~~~~~s~dg~~-l~~~s~~~g~---------~ 302 (417)
T TIGR02800 256 G----------NPDIYVMDLDGKQLTRLTNGPGI-------------DTEPSWSPDGKS-IAFTSDRGGS---------P 302 (417)
T ss_pred C----------CccEEEEECCCCCEEECCCCCCC-------------CCCEEECCCCCE-EEEEECCCCC---------c
Confidence 1 24799999998888887655322 234589999998 8888643322 2
Q ss_pred eEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCC
Q 004574 277 IIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSP 356 (744)
Q Consensus 277 ~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 356 (744)
.||++++ .+++.+.++........+.|||||++|++.... .+..+|+++++.+ +..+.++..... ..
T Consensus 303 ~iy~~d~---~~~~~~~l~~~~~~~~~~~~spdg~~i~~~~~~-~~~~~i~~~d~~~--~~~~~l~~~~~~------~~- 369 (417)
T TIGR02800 303 QIYMMDA---DGGEVRRLTFRGGYNASPSWSPDGDLIAFVHRE-GGGFNIAVMDLDG--GGERVLTDTGLD------ES- 369 (417)
T ss_pred eEEEEEC---CCCCEEEeecCCCCccCeEECCCCCEEEEEEcc-CCceEEEEEeCCC--CCeEEccCCCCC------CC-
Confidence 5999998 667777777666667789999999999998743 3567899999987 344445432111 12
Q ss_pred ceeeCCCCCeEEEEeee
Q 004574 357 MMTRTSTGTNVIAKIKK 373 (744)
Q Consensus 357 ~~~~spdg~~l~~~~~~ 373 (744)
+.|+|||+.|+|...+
T Consensus 370 -p~~spdg~~l~~~~~~ 385 (417)
T TIGR02800 370 -PSFAPNGRMILYATTR 385 (417)
T ss_pred -ceECCCCCEEEEEEeC
Confidence 3599999999998754
No 35
>PRK13604 luxD acyl transferase; Provisional
Probab=99.82 E-value=1.5e-18 Score=170.42 Aligned_cols=198 Identities=13% Similarity=0.064 Sum_probs=136.9
Q ss_pred EEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCC
Q 004574 487 YQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPI 566 (744)
Q Consensus 487 ~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~ 566 (744)
+...+|..+.+++..|.+.. ..+.++||++||.+-. . ..+...+..|+++||+|+. ++.
T Consensus 14 ~~~~dG~~L~Gwl~~P~~~~---~~~~~~vIi~HGf~~~------------~---~~~~~~A~~La~~G~~vLr---fD~ 72 (307)
T PRK13604 14 ICLENGQSIRVWETLPKENS---PKKNNTILIASGFARR------------M---DHFAGLAEYLSSNGFHVIR---YDS 72 (307)
T ss_pred EEcCCCCEEEEEEEcCcccC---CCCCCEEEEeCCCCCC------------h---HHHHHHHHHHHHCCCEEEE---ecC
Confidence 44568999999999997522 3457899999996410 1 1133567789999999999 443
Q ss_pred CCC-CCC----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCC--
Q 004574 567 IGE-GDK----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTL-- 633 (744)
Q Consensus 567 ~g~-g~~----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~-- 633 (744)
++. |.+ ....|+.++++|++++. .++|+|+||||||.+|+.+|... .++++|+.+|+.+...
T Consensus 73 rg~~GeS~G~~~~~t~s~g~~Dl~aaid~lk~~~---~~~I~LiG~SmGgava~~~A~~~--~v~~lI~~sp~~~l~d~l 147 (307)
T PRK13604 73 LHHVGLSSGTIDEFTMSIGKNSLLTVVDWLNTRG---INNLGLIAASLSARIAYEVINEI--DLSFLITAVGVVNLRDTL 147 (307)
T ss_pred CCCCCCCCCccccCcccccHHHHHHHHHHHHhcC---CCceEEEEECHHHHHHHHHhcCC--CCCEEEEcCCcccHHHHH
Confidence 322 221 22459999999999863 36899999999999987666643 3899999999877210
Q ss_pred ----CC----Ccccccc-------cchh--hcHHHH------HhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHH
Q 004574 634 ----TP----FGFQTEF-------RTLW--EATNVY------IEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFF 690 (744)
Q Consensus 634 ----~~----~~~~~~~-------~~~~--~~~~~~------~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~ 690 (744)
.. +...... .... ...+.. ...++...+.+++.|+|+|||++|..|| ++++++++
T Consensus 148 ~~~~~~~~~~~p~~~lp~~~d~~g~~l~~~~f~~~~~~~~~~~~~s~i~~~~~l~~PvLiIHG~~D~lVp--~~~s~~l~ 225 (307)
T PRK13604 148 ERALGYDYLSLPIDELPEDLDFEGHNLGSEVFVTDCFKHGWDTLDSTINKMKGLDIPFIAFTANNDSWVK--QSEVIDLL 225 (307)
T ss_pred HHhhhcccccCcccccccccccccccccHHHHHHHHHhcCccccccHHHHHhhcCCCEEEEEcCCCCccC--HHHHHHHH
Confidence 00 0000000 0000 000111 1234556677889999999999999999 99999999
Q ss_pred HHHHhCCCcEEEEEeCCCCcccCc
Q 004574 691 DALKGHGALSRLVLLPFEHHVYAA 714 (744)
Q Consensus 691 ~~l~~~~~~~~~~~~~~~~H~~~~ 714 (744)
++++. .+++++++|+++|.+..
T Consensus 226 e~~~s--~~kkl~~i~Ga~H~l~~ 247 (307)
T PRK13604 226 DSIRS--EQCKLYSLIGSSHDLGE 247 (307)
T ss_pred HHhcc--CCcEEEEeCCCccccCc
Confidence 98754 45799999999999864
No 36
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=99.81 E-value=3.4e-18 Score=178.66 Aligned_cols=302 Identities=16% Similarity=0.156 Sum_probs=198.5
Q ss_pred cCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCC
Q 004574 39 SPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMV 118 (744)
Q Consensus 39 SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~ 118 (744)
||||++++|..+..+..+. +...++|++|+++++.++|+.... ....+.|||||+.|+|+..
T Consensus 1 S~d~~~~l~~~~~~~~~r~-s~~~~y~i~d~~~~~~~~l~~~~~-----~~~~~~~sP~g~~~~~v~~------------ 62 (353)
T PF00930_consen 1 SPDGKFVLFATNYTKQWRH-SFKGDYYIYDIETGEITPLTPPPP-----KLQDAKWSPDGKYIAFVRD------------ 62 (353)
T ss_dssp -TTSSEEEEEEEEEEESSS-EEEEEEEEEETTTTEEEESS-EET-----TBSEEEE-SSSTEEEEEET------------
T ss_pred CCCCCeEEEEECcEEeeee-ccceeEEEEecCCCceEECcCCcc-----ccccceeecCCCeeEEEec------------
Confidence 8999999998774433332 556899999999999999976522 3567899999999999842
Q ss_pred CCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCCc-------------------eee
Q 004574 119 PLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTPA-------------------VYT 178 (744)
Q Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~~-------------------~~~ 178 (744)
.+||+.++ +++.++||..+ ...
T Consensus 63 --------------------------------------~nly~~~~~~~~~~~lT~dg~~~i~nG~~dwvyeEEv~~~~~ 104 (353)
T PF00930_consen 63 --------------------------------------NNLYLRDLATGQETQLTTDGEPGIYNGVPDWVYEEEVFDRRS 104 (353)
T ss_dssp --------------------------------------TEEEEESSTTSEEEESES--TTTEEESB--HHHHHHTSSSSB
T ss_pred --------------------------------------CceEEEECCCCCeEEeccccceeEEcCccceecccccccccc
Confidence 58999998 66888887642 123
Q ss_pred eeccCCCCceEEEEEeeCCccccc---------------c------cCCC--cceEEEEeCCCCeeeeccCCCCCCCCCc
Q 004574 179 AVEPSPDQKYVLITSMHRPYSYKV---------------P------CARF--SQKVQVWTTDGKLVRELCDLPPAEDIPV 235 (744)
Q Consensus 179 ~~~~SpDG~~i~~~~~~~~~~~~~---------------~------~~~~--~~~l~~~~~~g~~~~~l~~~~~~~~~~~ 235 (744)
.+-|||||++|+|...+....... . .|.. .-+++++++++++...+.....
T Consensus 105 ~~~WSpd~~~la~~~~d~~~v~~~~~~~~~~~~~~yp~~~~~~YPk~G~~np~v~l~v~~~~~~~~~~~~~~~~------ 178 (353)
T PF00930_consen 105 AVWWSPDSKYLAFLRFDEREVPEYPLPDYSPPDSQYPEVESIRYPKAGDPNPRVSLFVVDLASGKTTELDPPNS------ 178 (353)
T ss_dssp SEEE-TTSSEEEEEEEE-TTS-EEEEEEESSSTESS-EEEEEE--BTTS---EEEEEEEESSSTCCCEE---HH------
T ss_pred ceEECCCCCEEEEEEECCcCCceEEeeccCCccccCCcccccccCCCCCcCCceEEEEEECCCCcEEEeeeccc------
Confidence 567999999999998765110000 0 0000 1268888888776554332200
Q ss_pred ccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee--cc---ceeceeec-cC
Q 004574 236 CYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL--DL---RFRSVSWC-DD 309 (744)
Q Consensus 236 ~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~--~~---~~~~~~~S-pD 309 (744)
.......+..+.|++|++. |++.... ..+.+..+++++. .++..+.+... .+ ....+.|. ++
T Consensus 179 -~~~~~~yl~~v~W~~d~~~-l~~~~~n-------R~q~~~~l~~~d~---~tg~~~~~~~e~~~~Wv~~~~~~~~~~~~ 246 (353)
T PF00930_consen 179 -LNPQDYYLTRVGWSPDGKR-LWVQWLN-------RDQNRLDLVLCDA---STGETRVVLEETSDGWVDVYDPPHFLGPD 246 (353)
T ss_dssp -HHTSSEEEEEEEEEETTEE-EEEEEEE-------TTSTEEEEEEEEE---CTTTCEEEEEEESSSSSSSSSEEEE-TTT
T ss_pred -cCCCccCcccceecCCCcE-EEEEEcc-------cCCCEEEEEEEEC---CCCceeEEEEecCCcceeeecccccccCC
Confidence 0011122557899999986 4444322 1223345788887 55665554432 11 12345555 99
Q ss_pred CceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCC
Q 004574 310 SLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFT 389 (744)
Q Consensus 310 g~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~ 389 (744)
+..+++.+. +++..+||.++.++ +..+.||.+...- .-.+.|+++++.|+|.+...
T Consensus 247 ~~~~l~~s~-~~G~~hly~~~~~~--~~~~~lT~G~~~V------~~i~~~d~~~~~iyf~a~~~--------------- 302 (353)
T PF00930_consen 247 GNEFLWISE-RDGYRHLYLYDLDG--GKPRQLTSGDWEV------TSILGWDEDNNRIYFTANGD--------------- 302 (353)
T ss_dssp SSEEEEEEE-TTSSEEEEEEETTS--SEEEESS-SSS-E------EEEEEEECTSSEEEEEESSG---------------
T ss_pred CCEEEEEEE-cCCCcEEEEEcccc--cceeccccCceee------cccceEcCCCCEEEEEecCC---------------
Confidence 999999885 77899999999998 5567888776431 11256899999999998431
Q ss_pred CCCCCceEEEEecC-CCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCC
Q 004574 390 PEGNIPFLDLFDIN-TGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEI 451 (744)
Q Consensus 390 ~~~~~~~l~~~d~~-~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~ 451 (744)
...+++||++++. +++.+.|+...+. . ..+++|||++.++...++...|
T Consensus 303 -~p~~r~lY~v~~~~~~~~~~LT~~~~~----~--------~~~~~Spdg~y~v~~~s~~~~P 352 (353)
T PF00930_consen 303 -NPGERHLYRVSLDSGGEPKCLTCEDGD----H--------YSASFSPDGKYYVDTYSGPDTP 352 (353)
T ss_dssp -GTTSBEEEEEETTETTEEEESSTTSST----T--------EEEEE-TTSSEEEEEEESSSSC
T ss_pred -CCCceEEEEEEeCCCCCeEeccCCCCC----c--------eEEEECCCCCEEEEEEcCCCCC
Confidence 2457899999999 9999999877654 1 1258999999999998877665
No 37
>PHA02857 monoglyceride lipase; Provisional
Probab=99.81 E-value=3e-18 Score=174.16 Aligned_cols=215 Identities=18% Similarity=0.146 Sum_probs=146.2
Q ss_pred EEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCC
Q 004574 487 YQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPI 566 (744)
Q Consensus 487 ~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~ 566 (744)
+.+.+|..+++.+|.|.+ .+.|+|+++||.+. ........+..|+++||.|++ ++.
T Consensus 5 ~~~~~g~~l~~~~~~~~~------~~~~~v~llHG~~~---------------~~~~~~~~~~~l~~~g~~via---~D~ 60 (276)
T PHA02857 5 MFNLDNDYIYCKYWKPIT------YPKALVFISHGAGE---------------HSGRYEELAENISSLGILVFS---HDH 60 (276)
T ss_pred eecCCCCEEEEEeccCCC------CCCEEEEEeCCCcc---------------ccchHHHHHHHHHhCCCEEEE---ccC
Confidence 345689999999998852 24689999999642 112223556778889999999 666
Q ss_pred CCCCCCC-----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC
Q 004574 567 IGEGDKL-----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP 635 (744)
Q Consensus 567 ~g~g~~~-----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~ 635 (744)
+|+|.+. ..+|+...++++++.. ...+++|+||||||.+|+.++.++|++++++|+++|........
T Consensus 61 ~G~G~S~~~~~~~~~~~~~~~d~~~~l~~~~~~~--~~~~~~lvG~S~GG~ia~~~a~~~p~~i~~lil~~p~~~~~~~~ 138 (276)
T PHA02857 61 IGHGRSNGEKMMIDDFGVYVRDVVQHVVTIKSTY--PGVPVFLLGHSMGATISILAAYKNPNLFTAMILMSPLVNAEAVP 138 (276)
T ss_pred CCCCCCCCccCCcCCHHHHHHHHHHHHHHHHhhC--CCCCEEEEEcCchHHHHHHHHHhCccccceEEEecccccccccc
Confidence 6666542 1235556666655432 23579999999999999999999999999999999875421100
Q ss_pred C-----------cccccc----cchh--h---cHHHH-------------------Hhc--CcccccCCCCCCEEEEeeC
Q 004574 636 F-----------GFQTEF----RTLW--E---ATNVY-------------------IEM--SPITHANKIKKPILIIHGE 674 (744)
Q Consensus 636 ~-----------~~~~~~----~~~~--~---~~~~~-------------------~~~--~~~~~~~~~~~P~l~i~G~ 674 (744)
. .+.... ...+ . ....+ ... .....++++++|+|+++|+
T Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~Pvliv~G~ 218 (276)
T PHA02857 139 RLNLLAAKLMGIFYPNKIVGKLCPESVSRDMDEVYKYQYDPLVNHEKIKAGFASQVLKATNKVRKIIPKIKTPILILQGT 218 (276)
T ss_pred HHHHHHHHHHHHhCCCCccCCCCHhhccCCHHHHHHHhcCCCccCCCccHHHHHHHHHHHHHHHHhcccCCCCEEEEecC
Confidence 0 000000 0000 0 00000 000 1123467889999999999
Q ss_pred CCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcccc--HHHHHHHHHHHHHHh
Q 004574 675 VDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAREN--VMHVIWETDRWLQKY 732 (744)
Q Consensus 675 ~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~--~~~~~~~~~~fl~~~ 732 (744)
+|.++| +..++++.+.+.. ++++.++++++|.+..... .+++++.+++||+++
T Consensus 219 ~D~i~~--~~~~~~l~~~~~~---~~~~~~~~~~gH~~~~e~~~~~~~~~~~~~~~l~~~ 273 (276)
T PHA02857 219 NNEISD--VSGAYYFMQHANC---NREIKIYEGAKHHLHKETDEVKKSVMKEIETWIFNR 273 (276)
T ss_pred CCCcCC--hHHHHHHHHHccC---CceEEEeCCCcccccCCchhHHHHHHHHHHHHHHHh
Confidence 999998 8999888876633 4699999999999875433 678999999999986
No 38
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=99.80 E-value=2.5e-18 Score=179.71 Aligned_cols=228 Identities=17% Similarity=0.203 Sum_probs=165.6
Q ss_pred ceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccc
Q 004574 6 GIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICL 85 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~ 85 (744)
.|++.+-++ -..+.++ ........|+|||||+.++|+.-.. ....+||++++++|+..++.....
T Consensus 174 ~l~~~D~dg----~~~~~l~--~~~~~~~~p~ws~~~~~~~y~~f~~------~~~~~i~~~~l~~g~~~~i~~~~g--- 238 (425)
T COG0823 174 ELALGDYDG----YNQQKLT--DSGSLILTPAWSPDGKKLAYVSFEL------GGCPRIYYLDLNTGKRPVILNFNG--- 238 (425)
T ss_pred eEEEEccCC----cceeEec--ccCcceeccccCcCCCceEEEEEec------CCCceEEEEeccCCccceeeccCC---
Confidence 445555433 3444554 2233467899999999999986531 222789999999998877765443
Q ss_pred cccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCC
Q 004574 86 NAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLD 165 (744)
Q Consensus 86 ~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 165 (744)
....+.|||||++|+|..... + ..+||++|++
T Consensus 239 --~~~~P~fspDG~~l~f~~~rd-g---------------------------------------------~~~iy~~dl~ 270 (425)
T COG0823 239 --NNGAPAFSPDGSKLAFSSSRD-G---------------------------------------------SPDIYLMDLD 270 (425)
T ss_pred --ccCCccCCCCCCEEEEEECCC-C---------------------------------------------CccEEEEcCC
Confidence 235779999999999986543 1 1589999996
Q ss_pred C-CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCC
Q 004574 166 G-TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREG 243 (744)
Q Consensus 166 G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~ 243 (744)
+ ...+|+.. +....+.|||||++|+|.++... ..+||+++++|+..++++.......
T Consensus 271 ~~~~~~Lt~~~gi~~~Ps~spdG~~ivf~Sdr~G----------~p~I~~~~~~g~~~~riT~~~~~~~----------- 329 (425)
T COG0823 271 GKNLPRLTNGFGINTSPSWSPDGSKIVFTSDRGG----------RPQIYLYDLEGSQVTRLTFSGGGNS----------- 329 (425)
T ss_pred CCcceecccCCccccCccCCCCCCEEEEEeCCCC----------CcceEEECCCCCceeEeeccCCCCc-----------
Confidence 5 77778887 66779999999999999987753 2389999999999988887755532
Q ss_pred CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccc
Q 004574 244 MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQ 323 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~ 323 (744)
...|||||++ |+|.... .+. -.+...++ .++...++.........++|++||+.+.|.+... ..
T Consensus 330 --~p~~SpdG~~-i~~~~~~-~g~--------~~i~~~~~---~~~~~~~~lt~~~~~e~ps~~~ng~~i~~~s~~~-~~ 393 (425)
T COG0823 330 --NPVWSPDGDK-IVFESSS-GGQ--------WDIDKNDL---ASGGKIRILTSTYLNESPSWAPNGRMIMFSSGQG-GG 393 (425)
T ss_pred --CccCCCCCCE-EEEEecc-CCc--------eeeEEecc---CCCCcEEEccccccCCCCCcCCCCceEEEeccCC-CC
Confidence 4589999999 8888733 211 13666666 3333244444666777899999999999987333 56
Q ss_pred eeEEEEcCCC
Q 004574 324 TRTWLVCPGS 333 (744)
Q Consensus 324 ~~l~~~~~~~ 333 (744)
..|+.+..++
T Consensus 394 ~~l~~~s~~g 403 (425)
T COG0823 394 SVLSLVSLDG 403 (425)
T ss_pred ceEEEeeccc
Confidence 6788777776
No 39
>PF01738 DLH: Dienelactone hydrolase family; InterPro: IPR002925 Dienelactone hydrolases play a crucial role in chlorocatechol degradation via the modified ortho cleavage pathway. Enzymes induced in 4-fluorobenzoate-utilizing bacteria have been classified into three groups on the basis of their specificity towards cis- and trans-dienelactone []. Some proteins contain repeated small fragments of this domain (for example rat kan-1 protein).; GO: 0016787 hydrolase activity; PDB: 1GGV_A 1ZIY_A 1ZI6_A 1ZIC_A 1ZJ5_A 1ZI8_A 1ZJ4_A 1ZI9_A 1ZIX_A 3F67_A.
Probab=99.80 E-value=3.6e-19 Score=173.35 Aligned_cols=193 Identities=23% Similarity=0.271 Sum_probs=136.9
Q ss_pred EEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCC--CCCC
Q 004574 495 LTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIG--EGDK 572 (744)
Q Consensus 495 l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g--~g~~ 572 (744)
+.+++..|++ .++.|.||++|+.. ++.......+..|+++||.|++++.+...+ ....
T Consensus 1 ~~ay~~~P~~-----~~~~~~Vvv~~d~~---------------G~~~~~~~~ad~lA~~Gy~v~~pD~f~~~~~~~~~~ 60 (218)
T PF01738_consen 1 IDAYVARPEG-----GGPRPAVVVIHDIF---------------GLNPNIRDLADRLAEEGYVVLAPDLFGGRGAPPSDP 60 (218)
T ss_dssp EEEEEEEETT-----SSSEEEEEEE-BTT---------------BS-HHHHHHHHHHHHTT-EEEEE-CCCCTS--CCCH
T ss_pred CeEEEEeCCC-----CCCCCEEEEEcCCC---------------CCchHHHHHHHHHHhcCCCEEecccccCCCCCccch
Confidence 4678899987 24689999999742 111112245778899999999977766555 1111
Q ss_pred ----------------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCC
Q 004574 573 ----------------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPF 636 (744)
Q Consensus 573 ----------------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~ 636 (744)
....|+.+++++|++++.++.+||+++|+|+||.+|+.++.+. +.++++|...|... .
T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~aa~~~l~~~~~~~~~kig~vGfc~GG~~a~~~a~~~-~~~~a~v~~yg~~~----~- 134 (218)
T PF01738_consen 61 EEAFAAMRELFAPRPEQVAADLQAAVDYLRAQPEVDPGKIGVVGFCWGGKLALLLAARD-PRVDAAVSFYGGSP----P- 134 (218)
T ss_dssp HCHHHHHHHCHHHSHHHHHHHHHHHHHHHHCTTTCEEEEEEEEEETHHHHHHHHHHCCT-TTSSEEEEES-SSS----G-
T ss_pred hhHHHHHHHHHhhhHHHHHHHHHHHHHHHHhccccCCCcEEEEEEecchHHhhhhhhhc-cccceEEEEcCCCC----C-
Confidence 1123777889999999888889999999999999999998887 68999999888100 0
Q ss_pred cccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcc-
Q 004574 637 GFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAR- 715 (744)
Q Consensus 637 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~- 715 (744)
..+.....++++|+|+++|++|+.++ .+..+++.++|++.+.++++++||+++|+|...
T Consensus 135 ------------------~~~~~~~~~~~~P~l~~~g~~D~~~~--~~~~~~~~~~l~~~~~~~~~~~y~ga~HgF~~~~ 194 (218)
T PF01738_consen 135 ------------------PPPLEDAPKIKAPVLILFGENDPFFP--PEEVEALEEALKAAGVDVEVHVYPGAGHGFANPS 194 (218)
T ss_dssp ------------------GGHHHHGGG--S-EEEEEETT-TTS---HHHHHHHHHHHHCTTTTEEEEEETT--TTTTSTT
T ss_pred ------------------CcchhhhcccCCCEeecCccCCCCCC--hHHHHHHHHHHHhcCCcEEEEECCCCcccccCCC
Confidence 01112345678999999999999998 888999999999999999999999999999743
Q ss_pred ------ccHHHHHHHHHHHHHHhc
Q 004574 716 ------ENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 716 ------~~~~~~~~~~~~fl~~~l 733 (744)
...++.++.+++||+++|
T Consensus 195 ~~~~~~~aa~~a~~~~~~ff~~~L 218 (218)
T PF01738_consen 195 RPPYDPAAAEDAWQRTLAFFKRHL 218 (218)
T ss_dssp STT--HHHHHHHHHHHHHHHCC--
T ss_pred CcccCHHHHHHHHHHHHHHHHhcC
Confidence 234677888999998876
No 40
>TIGR02821 fghA_ester_D S-formylglutathione hydrolase. This model describes a protein family from bacteria, yeast, and human, with a conserved critical role in formaldehyde detoxification as S-formylglutathione hydrolase (EC 3.1.2.12). Members in eukaryotes such as the human protein are better known as esterase D (EC 3.1.1.1), an enzyme with broad specificity, although S-formylglutathione hydrolase has now been demonstrated as well.
Probab=99.80 E-value=4.7e-18 Score=171.20 Aligned_cols=229 Identities=19% Similarity=0.205 Sum_probs=147.3
Q ss_pred EEEEEEc-CCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHH-HhCCeEEEe
Q 004574 483 EMIKYQR-KDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIF-LARRFAVLA 560 (744)
Q Consensus 483 ~~i~~~~-~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~G~~v~~ 560 (744)
+.+.+.+ .-+..+.+.+|+|+++.. +++|+||++||.+... ..+.. ......+ .+.||.|++
T Consensus 14 ~~~~~~s~~~~~~~~~~v~~P~~~~~---~~~P~vvllHG~~~~~-----------~~~~~--~~~~~~la~~~g~~Vv~ 77 (275)
T TIGR02821 14 GFYRHKSETCGVPMTFGVFLPPQAAA---GPVPVLWYLSGLTCTH-----------ENFMI--KAGAQRFAAEHGLALVA 77 (275)
T ss_pred EEEEEeccccCCceEEEEEcCCCccC---CCCCEEEEccCCCCCc-----------cHHHh--hhHHHHHHhhcCcEEEE
Confidence 3344433 346678899999987533 2489999999975211 11110 0112344 457999999
Q ss_pred cCCCCCCCCC-------------CC--------------ChHHHHHHHHHH-HHHcCCCCCCcEEEEEechHHHHHHHHH
Q 004574 561 GPSIPIIGEG-------------DK--------------LPNDSAEAAVEE-VVRRGVADPSRIAVGGHSYGAFMTAHLL 612 (744)
Q Consensus 561 ~~~~~~~g~g-------------~~--------------~~~~d~~~~~~~-l~~~~~~d~~~i~l~G~S~GG~~a~~~~ 612 (744)
++.. ..|.+ .. .....+.+.+.. +.+...+|.++++++|+||||++|+.++
T Consensus 78 Pd~~-~~g~~~~~~~~~w~~g~~~~~~~d~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~G~S~GG~~a~~~a 156 (275)
T TIGR02821 78 PDTS-PRGTGIAGEDDAWDFGKGAGFYVDATEEPWSQHYRMYSYIVQELPALVAAQFPLDGERQGITGHSMGGHGALVIA 156 (275)
T ss_pred eCCC-CCcCCCCCCcccccccCCccccccCCcCcccccchHHHHHHHHHHHHHHhhCCCCCCceEEEEEChhHHHHHHHH
Confidence 6541 11111 00 112223333333 3344457889999999999999999999
Q ss_pred HhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCC--CCCCEEEEeeCCCCCCCCCH-HHHHHH
Q 004574 613 AHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANK--IKKPILIIHGEVDDKVGLFP-MQAERF 689 (744)
Q Consensus 613 ~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~P~l~i~G~~D~~v~~~~-~~~~~~ 689 (744)
.++|+.++++++++|+++.....+.............+.+...++...+.+ ...|+++.||+.|..++ . .++.++
T Consensus 157 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~plli~~G~~D~~v~--~~~~~~~~ 234 (275)
T TIGR02821 157 LKNPDRFKSVSAFAPIVAPSRCPWGQKAFSAYLGADEAAWRSYDASLLVADGGRHSTILIDQGTADQFLD--EQLRPDAF 234 (275)
T ss_pred HhCcccceEEEEECCccCcccCcchHHHHHHHhcccccchhhcchHHHHhhcccCCCeeEeecCCCcccC--ccccHHHH
Confidence 999999999999999876432211100000000011112223344433332 46899999999999988 7 578899
Q ss_pred HHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhc
Q 004574 690 FDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 690 ~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
.++|++++.++++.++|+++|+|.. ...++...++|+.+++
T Consensus 235 ~~~l~~~g~~v~~~~~~g~~H~f~~---~~~~~~~~~~~~~~~~ 275 (275)
T TIGR02821 235 EQACRAAGQALTLRRQAGYDHSYYF---IASFIADHLRHHAERL 275 (275)
T ss_pred HHHHHHcCCCeEEEEeCCCCccchh---HHHhHHHHHHHHHhhC
Confidence 9999999999999999999999764 5577788888887763
No 41
>PLN02442 S-formylglutathione hydrolase
Probab=99.80 E-value=6.7e-18 Score=170.19 Aligned_cols=235 Identities=22% Similarity=0.211 Sum_probs=148.8
Q ss_pred CceEEEEEEc-CCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEE
Q 004574 480 LQKEMIKYQR-KDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAV 558 (744)
Q Consensus 480 ~~~~~i~~~~-~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v 558 (744)
...+.+++.+ .-+..+++.+|+|+.. ...++|+|+++||.+.. ...+... ......+...||+|
T Consensus 16 ~~~~~~~~~s~~l~~~~~~~vy~P~~~---~~~~~Pvv~~lHG~~~~-----------~~~~~~~-~~~~~~~~~~g~~V 80 (283)
T PLN02442 16 GFNRRYKHFSSTLGCSMTFSVYFPPAS---DSGKVPVLYWLSGLTCT-----------DENFIQK-SGAQRAAAARGIAL 80 (283)
T ss_pred CEEEEEEEeccccCCceEEEEEcCCcc---cCCCCCEEEEecCCCcC-----------hHHHHHh-hhHHHHHhhcCeEE
Confidence 3455556654 3467899999999842 23469999999996421 1111110 11234456789999
Q ss_pred EecCCCCCC------------CCCCC-------------ChHH-HHHHHHHHHHHc-CCCCCCcEEEEEechHHHHHHHH
Q 004574 559 LAGPSIPII------------GEGDK-------------LPND-SAEAAVEEVVRR-GVADPSRIAVGGHSYGAFMTAHL 611 (744)
Q Consensus 559 ~~~~~~~~~------------g~g~~-------------~~~~-d~~~~~~~l~~~-~~~d~~~i~l~G~S~GG~~a~~~ 611 (744)
+.++..... +.+.+ ...+ .+.+..+++.+. ..+|.++++|+|+||||++|+.+
T Consensus 81 v~pd~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~~~~~i~G~S~GG~~a~~~ 160 (283)
T PLN02442 81 VAPDTSPRGLNVEGEADSWDFGVGAGFYLNATQEKWKNWRMYDYVVKELPKLLSDNFDQLDTSRASIFGHSMGGHGALTI 160 (283)
T ss_pred EecCCCCCCCCCCCCccccccCCCcceeeccccCCCcccchhhhHHHHHHHHHHHHHHhcCCCceEEEEEChhHHHHHHH
Confidence 996532100 11110 1111 122333334332 33688999999999999999999
Q ss_pred HHhCCCceeEEEEccCCCCCCCCCCcccccc---cchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHH-HHH
Q 004574 612 LAHAPHLFCCGIARSGSYNKTLTPFGFQTEF---RTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPM-QAE 687 (744)
Q Consensus 612 ~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~-~~~ 687 (744)
+.++|++|+++++++|+++.....+...... .......+.+...+++..+...++|+|++||++|.+++ .. +++
T Consensus 161 a~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~d~~~~~~~~~~~~~pvli~~G~~D~~v~--~~~~s~ 238 (283)
T PLN02442 161 YLKNPDKYKSVSAFAPIANPINCPWGQKAFTNYLGSDKADWEEYDATELVSKFNDVSATILIDQGEADKFLK--EQLLPE 238 (283)
T ss_pred HHhCchhEEEEEEECCccCcccCchhhHHHHHHcCCChhhHHHcChhhhhhhccccCCCEEEEECCCCcccc--ccccHH
Confidence 9999999999999999876432211110000 00001112223334444555678999999999999987 53 478
Q ss_pred HHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhcc
Q 004574 688 RFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCL 734 (744)
Q Consensus 688 ~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~ 734 (744)
.+++++++.+.+++++++|+.+|.+. ....+.+..+.|..++++
T Consensus 239 ~~~~~l~~~g~~~~~~~~pg~~H~~~---~~~~~i~~~~~~~~~~~~ 282 (283)
T PLN02442 239 NFEEACKEAGAPVTLRLQPGYDHSYF---FIATFIDDHINHHAQALK 282 (283)
T ss_pred HHHHHHHHcCCCeEEEEeCCCCccHH---HHHHHHHHHHHHHHHHhc
Confidence 89999999999999999999999865 344555556666666553
No 42
>PRK10162 acetyl esterase; Provisional
Probab=99.78 E-value=3.7e-17 Score=167.84 Aligned_cols=230 Identities=13% Similarity=0.120 Sum_probs=160.3
Q ss_pred CceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHh-CCeEE
Q 004574 480 LQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLA-RRFAV 558 (744)
Q Consensus 480 ~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~G~~v 558 (744)
+..+.+.+...+| .+.+.+|.|.. . +.|+||++|||||...+.. .+ ...+..|+. .|+.|
T Consensus 55 ~~~~~~~i~~~~g-~i~~~~y~P~~---~---~~p~vv~~HGGg~~~g~~~--------~~----~~~~~~la~~~g~~V 115 (318)
T PRK10162 55 MATRAYMVPTPYG-QVETRLYYPQP---D---SQATLFYLHGGGFILGNLD--------TH----DRIMRLLASYSGCTV 115 (318)
T ss_pred ceEEEEEEecCCC-ceEEEEECCCC---C---CCCEEEEEeCCcccCCCch--------hh----hHHHHHHHHHcCCEE
Confidence 3467777876666 59999999953 1 2699999999986533221 11 134556666 59999
Q ss_pred EecCCCCCCCCCCCChHHHHHHHHHHHHHc---CCCCCCcEEEEEechHHHHHHHHHHhC------CCceeEEEEccCCC
Q 004574 559 LAGPSIPIIGEGDKLPNDSAEAAVEEVVRR---GVADPSRIAVGGHSYGAFMTAHLLAHA------PHLFCCGIARSGSY 629 (744)
Q Consensus 559 ~~~~~~~~~g~g~~~~~~d~~~~~~~l~~~---~~~d~~~i~l~G~S~GG~~a~~~~~~~------p~~~~~~v~~~~~~ 629 (744)
++++++...........+|+.++++|+.++ ..+|++||+|+|+|+||++|+.++.+. +..++++|+++|++
T Consensus 116 v~vdYrlape~~~p~~~~D~~~a~~~l~~~~~~~~~d~~~i~l~G~SaGG~la~~~a~~~~~~~~~~~~~~~~vl~~p~~ 195 (318)
T PRK10162 116 IGIDYTLSPEARFPQAIEEIVAVCCYFHQHAEDYGINMSRIGFAGDSAGAMLALASALWLRDKQIDCGKVAGVLLWYGLY 195 (318)
T ss_pred EEecCCCCCCCCCCCcHHHHHHHHHHHHHhHHHhCCChhHEEEEEECHHHHHHHHHHHHHHhcCCCccChhheEEECCcc
Confidence 998877666654455677999999999874 357889999999999999999988642 35788999999987
Q ss_pred CCCCCCCcccccccchh-----hcHHHHH---------hcCccccc--CCC---CCCEEEEeeCCCCCCCCCHHHHHHHH
Q 004574 630 NKTLTPFGFQTEFRTLW-----EATNVYI---------EMSPITHA--NKI---KKPILIIHGEVDDKVGLFPMQAERFF 690 (744)
Q Consensus 630 ~~~~~~~~~~~~~~~~~-----~~~~~~~---------~~~~~~~~--~~~---~~P~l~i~G~~D~~v~~~~~~~~~~~ 690 (744)
+...... . ......+ ...+.+. ..+|.... ..+ -.|++|++|+.|.+ .++++.+.
T Consensus 196 ~~~~~~s-~-~~~~~~~~~l~~~~~~~~~~~y~~~~~~~~~p~~~p~~~~l~~~lPp~~i~~g~~D~L----~de~~~~~ 269 (318)
T PRK10162 196 GLRDSVS-R-RLLGGVWDGLTQQDLQMYEEAYLSNDADRESPYYCLFNNDLTRDVPPCFIAGAEFDPL----LDDSRLLY 269 (318)
T ss_pred CCCCChh-H-HHhCCCccccCHHHHHHHHHHhCCCccccCCcccCcchhhhhcCCCCeEEEecCCCcC----cChHHHHH
Confidence 6421110 0 0000000 0011111 11222111 122 36999999999998 46899999
Q ss_pred HHHHhCCCcEEEEEeCCCCcccCcc----ccHHHHHHHHHHHHHHhcc
Q 004574 691 DALKGHGALSRLVLLPFEHHVYAAR----ENVMHVIWETDRWLQKYCL 734 (744)
Q Consensus 691 ~~l~~~~~~~~~~~~~~~~H~~~~~----~~~~~~~~~~~~fl~~~l~ 734 (744)
++|+++|.+++++++++..|+|... +...+.++.+.+||.++++
T Consensus 270 ~~L~~aGv~v~~~~~~g~~H~f~~~~~~~~~a~~~~~~~~~~l~~~~~ 317 (318)
T PRK10162 270 QTLAAHQQPCEFKLYPGTLHAFLHYSRMMDTADDALRDGAQFFTAQLK 317 (318)
T ss_pred HHHHHcCCCEEEEEECCCceehhhccCchHHHHHHHHHHHHHHHHHhc
Confidence 9999999999999999999998633 3356778889999998864
No 43
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=99.78 E-value=3.4e-17 Score=171.22 Aligned_cols=217 Identities=16% Similarity=0.107 Sum_probs=163.5
Q ss_pred CCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccc
Q 004574 59 SCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMT 138 (744)
Q Consensus 59 ~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (744)
....+||+.|-+|-..+.++.... ....+.||||++.|+|.....+..
T Consensus 170 ~~~~~l~~~D~dg~~~~~l~~~~~-----~~~~p~ws~~~~~~~y~~f~~~~~--------------------------- 217 (425)
T COG0823 170 PLPYELALGDYDGYNQQKLTDSGS-----LILTPAWSPDGKKLAYVSFELGGC--------------------------- 217 (425)
T ss_pred CCCceEEEEccCCcceeEecccCc-----ceeccccCcCCCceEEEEEecCCC---------------------------
Confidence 566899999988666777776654 245679999999999874322111
Q ss_pred cccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeC
Q 004574 139 DNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTT 216 (744)
Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~ 216 (744)
.++|++++ .|...++.+. +....++|||||++|+|....+. ..+||++|+
T Consensus 218 ------------------~~i~~~~l~~g~~~~i~~~~g~~~~P~fspDG~~l~f~~~rdg----------~~~iy~~dl 269 (425)
T COG0823 218 ------------------PRIYYLDLNTGKRPVILNFNGNNGAPAFSPDGSKLAFSSSRDG----------SPDIYLMDL 269 (425)
T ss_pred ------------------ceEEEEeccCCccceeeccCCccCCccCCCCCCEEEEEECCCC----------CccEEEEcC
Confidence 47999999 7755555554 78889999999999999987753 358999999
Q ss_pred CCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeee
Q 004574 217 DGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHK 296 (744)
Q Consensus 217 ~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 296 (744)
.+++.++|+........ ..|||||++ |||++.+.+.- +||+++. +++..++++.
T Consensus 270 ~~~~~~~Lt~~~gi~~~-------------Ps~spdG~~-ivf~Sdr~G~p---------~I~~~~~---~g~~~~riT~ 323 (425)
T COG0823 270 DGKNLPRLTNGFGINTS-------------PSWSPDGSK-IVFTSDRGGRP---------QIYLYDL---EGSQVTRLTF 323 (425)
T ss_pred CCCcceecccCCccccC-------------ccCCCCCCE-EEEEeCCCCCc---------ceEEECC---CCCceeEeec
Confidence 99998888888766544 489999999 99997554433 4999999 7888888887
Q ss_pred eccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 297 LDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 297 ~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
..+....+.|||||++|+|.. ...+...+...++.++. . .++....... .. .+|+++|+.|.|..
T Consensus 324 ~~~~~~~p~~SpdG~~i~~~~-~~~g~~~i~~~~~~~~~-~-~~~lt~~~~~-----e~--ps~~~ng~~i~~~s 388 (425)
T COG0823 324 SGGGNSNPVWSPDGDKIVFES-SSGGQWDIDKNDLASGG-K-IRILTSTYLN-----ES--PSWAPNGRMIMFSS 388 (425)
T ss_pred cCCCCcCccCCCCCCEEEEEe-ccCCceeeEEeccCCCC-c-EEEccccccC-----CC--CCcCCCCceEEEec
Confidence 777666999999999999987 32344778888887752 2 3333332221 22 33999999999986
No 44
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=99.78 E-value=1.1e-17 Score=174.80 Aligned_cols=303 Identities=17% Similarity=0.192 Sum_probs=191.5
Q ss_pred CCccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCC
Q 004574 2 PFFTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESP 81 (744)
Q Consensus 2 ~~~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~ 81 (744)
++...+|++++.. ++.++++.. ......|.|||||++|||+.. ++||+.++.+++.+|||...
T Consensus 20 s~~~~y~i~d~~~----~~~~~l~~~--~~~~~~~~~sP~g~~~~~v~~-----------~nly~~~~~~~~~~~lT~dg 82 (353)
T PF00930_consen 20 SFKGDYYIYDIET----GEITPLTPP--PPKLQDAKWSPDGKYIAFVRD-----------NNLYLRDLATGQETQLTTDG 82 (353)
T ss_dssp EEEEEEEEEETTT----TEEEESS-E--ETTBSEEEE-SSSTEEEEEET-----------TEEEEESSTTSEEEESES--
T ss_pred ccceeEEEEecCC----CceEECcCC--ccccccceeecCCCeeEEEec-----------CceEEEECCCCCeEEecccc
Confidence 4567899999977 888888732 346899999999999999753 68999999999999999765
Q ss_pred Ccc-ccc------------cccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhh
Q 004574 82 DIC-LNA------------VFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDE 148 (744)
Q Consensus 82 ~~~-~~~------------~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 148 (744)
... .++ ....+-|||||++|+|...+++....-. ...-..... .|......+++.
T Consensus 83 ~~~i~nG~~dwvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~-------~~~~~~~~~-----~yp~~~~~~YPk 150 (353)
T PF00930_consen 83 EPGIYNGVPDWVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYP-------LPDYSPPDS-----QYPEVESIRYPK 150 (353)
T ss_dssp TTTEEESB--HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEE-------EEEESSSTE-----SS-EEEEEE--B
T ss_pred ceeEEcCccceeccccccccccceEECCCCCEEEEEEECCcCCceEE-------eeccCCccc-----cCCcccccccCC
Confidence 221 111 1236779999999999876554432100 000000000 111112222222
Q ss_pred hccceeeeeEEEEEcC-CCCeeecC------CC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe
Q 004574 149 SLFDYYTTAQLVLGSL-DGTAKDFG------TP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL 220 (744)
Q Consensus 149 ~~~~~~~~~~l~~~~~-~G~~~~l~------~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~ 220 (744)
.+-.+ ....|+++++ +|+...+. .. ..+..+.|++|+++|++....+. .....+.++|...+.
T Consensus 151 ~G~~n-p~v~l~v~~~~~~~~~~~~~~~~~~~~~~yl~~v~W~~d~~~l~~~~~nR~--------q~~~~l~~~d~~tg~ 221 (353)
T PF00930_consen 151 AGDPN-PRVSLFVVDLASGKTTELDPPNSLNPQDYYLTRVGWSPDGKRLWVQWLNRD--------QNRLDLVLCDASTGE 221 (353)
T ss_dssp TTS----EEEEEEEESSSTCCCEE---HHHHTSSEEEEEEEEEETTEEEEEEEEETT--------STEEEEEEEEECTTT
T ss_pred CCCcC-CceEEEEEECCCCcEEEeeeccccCCCccCcccceecCCCcEEEEEEcccC--------CCEEEEEEEECCCCc
Confidence 22222 2368999999 55554432 12 56678999999998888887764 223467788887665
Q ss_pred eeeccCCCCCCCCCcccCCccCCCCcccee-cCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeecc
Q 004574 221 VRELCDLPPAEDIPVCYNSVREGMRSISWR-ADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDL 299 (744)
Q Consensus 221 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~s-pDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~ 299 (744)
.+.+...... ..+ .....+.+. +++.. +++++..++. .+||+++. +++..+.||.++.
T Consensus 222 ~~~~~~e~~~----~Wv----~~~~~~~~~~~~~~~-~l~~s~~~G~---------~hly~~~~---~~~~~~~lT~G~~ 280 (353)
T PF00930_consen 222 TRVVLEETSD----GWV----DVYDPPHFLGPDGNE-FLWISERDGY---------RHLYLYDL---DGGKPRQLTSGDW 280 (353)
T ss_dssp CEEEEEEESS----SSS----SSSSEEEE-TTTSSE-EEEEEETTSS---------EEEEEEET---TSSEEEESS-SSS
T ss_pred eeEEEEecCC----cce----eeecccccccCCCCE-EEEEEEcCCC---------cEEEEEcc---cccceeccccCce
Confidence 5444321111 000 002245565 89988 7787764442 25999998 7788889999988
Q ss_pred cee-ceeeccCCceEEEeeeee-ccceeEEEEcCC-CCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEee
Q 004574 300 RFR-SVSWCDDSLALVNETWYK-TSQTRTWLVCPG-SKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIK 372 (744)
Q Consensus 300 ~~~-~~~~SpDg~~l~~~~~~~-~~~~~l~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~ 372 (744)
.+. -+.|+++++.|+|.+... +...+||+++++ + ++.+.||...... ..+++||||++++....
T Consensus 281 ~V~~i~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~~--~~~~~LT~~~~~~-------~~~~~Spdg~y~v~~~s 347 (353)
T PF00930_consen 281 EVTSILGWDEDNNRIYFTANGDNPGERHLYRVSLDSG--GEPKCLTCEDGDH-------YSASFSPDGKYYVDTYS 347 (353)
T ss_dssp -EEEEEEEECTSSEEEEEESSGGTTSBEEEEEETTET--TEEEESSTTSSTT-------EEEEE-TTSSEEEEEEE
T ss_pred eecccceEcCCCCEEEEEecCCCCCceEEEEEEeCCC--CCeEeccCCCCCc-------eEEEECCCCCEEEEEEc
Confidence 884 478999999999998543 467899999999 6 6778887655432 12669999999988764
No 45
>PRK10749 lysophospholipase L2; Provisional
Probab=99.78 E-value=2.3e-17 Score=171.20 Aligned_cols=222 Identities=14% Similarity=0.135 Sum_probs=147.8
Q ss_pred eEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEec
Q 004574 482 KEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAG 561 (744)
Q Consensus 482 ~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~ 561 (744)
.+...+...+|..+++..+.|+. +.++||++||.+ .....+...+..++++||.|+.
T Consensus 30 ~~~~~~~~~~g~~l~~~~~~~~~-------~~~~vll~HG~~---------------~~~~~y~~~~~~l~~~g~~v~~- 86 (330)
T PRK10749 30 REEAEFTGVDDIPIRFVRFRAPH-------HDRVVVICPGRI---------------ESYVKYAELAYDLFHLGYDVLI- 86 (330)
T ss_pred ccceEEEcCCCCEEEEEEccCCC-------CCcEEEEECCcc---------------chHHHHHHHHHHHHHCCCeEEE-
Confidence 45566667788889888887642 257899999963 1111222445567899999999
Q ss_pred CCCCCCCCCCCC------------hH----HHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEc
Q 004574 562 PSIPIIGEGDKL------------PN----DSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIAR 625 (744)
Q Consensus 562 ~~~~~~g~g~~~------------~~----~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~ 625 (744)
++.+|+|.+. .. +|+.++++.+.+. .+..++.++||||||.+++.++.++|+.++++|++
T Consensus 87 --~D~~G~G~S~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~--~~~~~~~l~GhSmGG~ia~~~a~~~p~~v~~lvl~ 162 (330)
T PRK10749 87 --IDHRGQGRSGRLLDDPHRGHVERFNDYVDDLAAFWQQEIQP--GPYRKRYALAHSMGGAILTLFLQRHPGVFDAIALC 162 (330)
T ss_pred --EcCCCCCCCCCCCCCCCcCccccHHHHHHHHHHHHHHHHhc--CCCCCeEEEEEcHHHHHHHHHHHhCCCCcceEEEE
Confidence 5555655442 11 2555555554433 23468999999999999999999999999999999
Q ss_pred cCCCCCCCCCCc-------------------c----cccccch---------hhc----HHHHHhc--------------
Q 004574 626 SGSYNKTLTPFG-------------------F----QTEFRTL---------WEA----TNVYIEM-------------- 655 (744)
Q Consensus 626 ~~~~~~~~~~~~-------------------~----~~~~~~~---------~~~----~~~~~~~-------------- 655 (744)
+|.......... + ......+ .+. .+.+...
T Consensus 163 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (330)
T PRK10749 163 APMFGIVLPLPSWMARRILNWAEGHPRIRDGYAIGTGRWRPLPFAINVLTHSRERYRRNLRFYADDPELRVGGPTYHWVR 242 (330)
T ss_pred CchhccCCCCCcHHHHHHHHHHHHhcCCCCcCCCCCCCCCCCCcCCCCCCCCHHHHHHHHHHHHhCCCcccCCCcHHHHH
Confidence 986431100000 0 0000000 000 0001000
Q ss_pred -------CcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCC---CcEEEEEeCCCCcccCcccc--HHHHHH
Q 004574 656 -------SPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHG---ALSRLVLLPFEHHVYAAREN--VMHVIW 723 (744)
Q Consensus 656 -------~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~---~~~~~~~~~~~~H~~~~~~~--~~~~~~ 723 (744)
.....++++++|+|++||++|.+++ ...++++++.++..+ ..++++++|+++|.+..... .+.+++
T Consensus 243 ~~~~~~~~~~~~~~~i~~P~Lii~G~~D~vv~--~~~~~~~~~~l~~~~~~~~~~~l~~~~gagH~~~~E~~~~r~~v~~ 320 (330)
T PRK10749 243 ESILAGEQVLAGAGDITTPLLLLQAEEERVVD--NRMHDRFCEARTAAGHPCEGGKPLVIKGAYHEILFEKDAMRSVALN 320 (330)
T ss_pred HHHHHHHHHHhhccCCCCCEEEEEeCCCeeeC--HHHHHHHHHHHhhcCCCCCCceEEEeCCCcchhhhCCcHHHHHHHH
Confidence 1123456789999999999999999 999999999988765 34589999999999874443 678899
Q ss_pred HHHHHHHHh
Q 004574 724 ETDRWLQKY 732 (744)
Q Consensus 724 ~~~~fl~~~ 732 (744)
.+++||+++
T Consensus 321 ~i~~fl~~~ 329 (330)
T PRK10749 321 AIVDFFNRH 329 (330)
T ss_pred HHHHHHhhc
Confidence 999999874
No 46
>COG0412 Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=99.78 E-value=3.8e-17 Score=158.27 Aligned_cols=204 Identities=20% Similarity=0.194 Sum_probs=157.5
Q ss_pred EEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecC
Q 004574 483 EMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGP 562 (744)
Q Consensus 483 ~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~ 562 (744)
+.+.+...+ ..+.+++.+|.+ .++.|+||++|+-. | ........+..|++.||+|++++
T Consensus 3 ~~v~~~~~~-~~~~~~~a~P~~-----~~~~P~VIv~hei~-----------G----l~~~i~~~a~rlA~~Gy~v~~Pd 61 (236)
T COG0412 3 TDVTIPAPD-GELPAYLARPAG-----AGGFPGVIVLHEIF-----------G----LNPHIRDVARRLAKAGYVVLAPD 61 (236)
T ss_pred cceEeeCCC-ceEeEEEecCCc-----CCCCCEEEEEeccc-----------C----CchHHHHHHHHHHhCCcEEEech
Confidence 345666555 789999999987 33459999999842 1 11122356789999999999966
Q ss_pred CCCCCCCCC-------------------CChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEE
Q 004574 563 SIPIIGEGD-------------------KLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGI 623 (744)
Q Consensus 563 ~~~~~g~g~-------------------~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v 623 (744)
-+...+... .+...|+.++++||.+++.++.+||+++|+||||.+++.++.+.| .+++++
T Consensus 62 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~L~~~~~~~~~~ig~~GfC~GG~~a~~~a~~~~-~v~a~v 140 (236)
T COG0412 62 LYGRQGDPTDIEDEPAELETGLVERVDPAEVLADIDAALDYLARQPQVDPKRIGVVGFCMGGGLALLAATRAP-EVKAAV 140 (236)
T ss_pred hhccCCCCCcccccHHHHhhhhhccCCHHHHHHHHHHHHHHHHhCCCCCCceEEEEEEcccHHHHHHhhcccC-CccEEE
Confidence 544332111 123349999999999998899999999999999999999999986 899999
Q ss_pred EccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEE
Q 004574 624 ARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLV 703 (744)
Q Consensus 624 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~ 703 (744)
+..|..-... .....++++|+|+.+|+.|..+| ......+.+++...+.++++.
T Consensus 141 ~fyg~~~~~~------------------------~~~~~~~~~pvl~~~~~~D~~~p--~~~~~~~~~~~~~~~~~~~~~ 194 (236)
T COG0412 141 AFYGGLIADD------------------------TADAPKIKVPVLLHLAGEDPYIP--AADVDALAAALEDAGVKVDLE 194 (236)
T ss_pred EecCCCCCCc------------------------ccccccccCcEEEEecccCCCCC--hhHHHHHHHHHHhcCCCeeEE
Confidence 9888532100 01145789999999999999998 888899999999988899999
Q ss_pred EeCCCCcccCccc----------cHHHHHHHHHHHHHHhcc
Q 004574 704 LLPFEHHVYAARE----------NVMHVIWETDRWLQKYCL 734 (744)
Q Consensus 704 ~~~~~~H~~~~~~----------~~~~~~~~~~~fl~~~l~ 734 (744)
+|+++.|.|.... ..++.++++++||.+++.
T Consensus 195 ~y~ga~H~F~~~~~~~~~~y~~~aa~~a~~~~~~ff~~~~~ 235 (236)
T COG0412 195 IYPGAGHGFANDRADYHPGYDAAAAEDAWQRVLAFFKRLLG 235 (236)
T ss_pred EeCCCccccccCCCcccccCCHHHHHHHHHHHHHHHHHhcc
Confidence 9999999988431 236778899999998874
No 47
>KOG1552 consensus Predicted alpha/beta hydrolase [General function prediction only]
Probab=99.75 E-value=2.9e-17 Score=152.76 Aligned_cols=214 Identities=20% Similarity=0.196 Sum_probs=154.3
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHh-CCeEEE
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLA-RRFAVL 559 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~G~~v~ 559 (744)
..+.+....+.|..+.+..+.|+.+ ..++++|+||.-.. . +-+......+.. .++.++
T Consensus 34 ~v~v~~~~t~rgn~~~~~y~~~~~~------~~~~lly~hGNa~D--------------l-gq~~~~~~~l~~~ln~nv~ 92 (258)
T KOG1552|consen 34 FVEVFKVKTSRGNEIVCMYVRPPEA------AHPTLLYSHGNAAD--------------L-GQMVELFKELSIFLNCNVV 92 (258)
T ss_pred ccceEEeecCCCCEEEEEEEcCccc------cceEEEEcCCcccc--------------h-HHHHHHHHHHhhcccceEE
Confidence 4556666677888899988888753 36899999995110 0 000011222222 489999
Q ss_pred ecCCCCCCCCCCC-------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC
Q 004574 560 AGPSIPIIGEGDK-------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT 632 (744)
Q Consensus 560 ~~~~~~~~g~g~~-------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~ 632 (744)
. +++.|+|.+ ..++|+.++.+||++... ..++|+|+|+|+|...++.+|++.| ..|+|+.+|+.+..
T Consensus 93 ~---~DYSGyG~S~G~psE~n~y~Di~avye~Lr~~~g-~~~~Iil~G~SiGt~~tv~Lasr~~--~~alVL~SPf~S~~ 166 (258)
T KOG1552|consen 93 S---YDYSGYGRSSGKPSERNLYADIKAVYEWLRNRYG-SPERIILYGQSIGTVPTVDLASRYP--LAAVVLHSPFTSGM 166 (258)
T ss_pred E---EecccccccCCCcccccchhhHHHHHHHHHhhcC-CCceEEEEEecCCchhhhhHhhcCC--cceEEEeccchhhh
Confidence 8 677776654 466799999999999876 6789999999999999999999997 89999999987632
Q ss_pred CCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCccc
Q 004574 633 LTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVY 712 (744)
Q Consensus 633 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~ 712 (744)
..-. .......| +..+.....++++++|+|++||++|++++ ..+..+++++.+. +++..+..+++|..
T Consensus 167 rv~~--~~~~~~~~-----~d~f~~i~kI~~i~~PVLiiHgtdDevv~--~sHg~~Lye~~k~---~~epl~v~g~gH~~ 234 (258)
T KOG1552|consen 167 RVAF--PDTKTTYC-----FDAFPNIEKISKITCPVLIIHGTDDEVVD--FSHGKALYERCKE---KVEPLWVKGAGHND 234 (258)
T ss_pred hhhc--cCcceEEe-----eccccccCcceeccCCEEEEecccCceec--ccccHHHHHhccc---cCCCcEEecCCCcc
Confidence 2111 11111111 11223366778889999999999999999 9999999998875 36888899999975
Q ss_pred CccccHHHHHHHHHHHHHHhccC
Q 004574 713 AARENVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 713 ~~~~~~~~~~~~~~~fl~~~l~~ 735 (744)
. +-..++...+..|+...+..
T Consensus 235 ~--~~~~~yi~~l~~f~~~~~~~ 255 (258)
T KOG1552|consen 235 I--ELYPEYIEHLRRFISSVLPS 255 (258)
T ss_pred c--ccCHHHHHHHHHHHHHhccc
Confidence 4 23347788888888765543
No 48
>KOG4391 consensus Predicted alpha/beta hydrolase BEM46 [General function prediction only]
Probab=99.75 E-value=1.9e-17 Score=147.02 Aligned_cols=231 Identities=19% Similarity=0.261 Sum_probs=163.7
Q ss_pred CCCCCcCCCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhH-HH
Q 004574 472 HPYPTLASLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSS-LI 550 (744)
Q Consensus 472 ~~~~~~~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 550 (744)
.+.|....++-+.+++...|..++++++.+-+. ..|+++++|+... .+ +.+...+ ..
T Consensus 44 vptP~~~n~pye~i~l~T~D~vtL~a~~~~~E~-------S~pTlLyfh~NAG--------------Nm-Ghr~~i~~~f 101 (300)
T KOG4391|consen 44 VPTPKEFNMPYERIELRTRDKVTLDAYLMLSES-------SRPTLLYFHANAG--------------NM-GHRLPIARVF 101 (300)
T ss_pred CCCccccCCCceEEEEEcCcceeEeeeeecccC-------CCceEEEEccCCC--------------cc-cchhhHHHHH
Confidence 345666678899999999999999999988322 3799999998521 11 1111222 24
Q ss_pred HHhCCeEEEecCCCCCCCCCCCC-------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEE
Q 004574 551 FLARRFAVLAGPSIPIIGEGDKL-------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGI 623 (744)
Q Consensus 551 ~~~~G~~v~~~~~~~~~g~g~~~-------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v 623 (744)
+.+.+.+|+. ..++|+|.++ ..-|.+++++||..++..|..|++++|.|.||..|+.+|++..+++.|+|
T Consensus 102 y~~l~mnv~i---vsYRGYG~S~GspsE~GL~lDs~avldyl~t~~~~dktkivlfGrSlGGAvai~lask~~~ri~~~i 178 (300)
T KOG4391|consen 102 YVNLKMNVLI---VSYRGYGKSEGSPSEEGLKLDSEAVLDYLMTRPDLDKTKIVLFGRSLGGAVAIHLASKNSDRISAII 178 (300)
T ss_pred HHHcCceEEE---EEeeccccCCCCccccceeccHHHHHHHHhcCccCCcceEEEEecccCCeeEEEeeccchhheeeee
Confidence 5677999988 3445555442 22389999999999999999999999999999999999999989999998
Q ss_pred EccCCCCCC--CCCCcccccccchhhcHHHHH-hcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcE
Q 004574 624 ARSGSYNKT--LTPFGFQTEFRTLWEATNVYI-EMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALS 700 (744)
Q Consensus 624 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~ 700 (744)
+-.-+.... ..+.-++-..+ +...-.+. .+.....+.+.++|.|++.|..|++|| +.+.+++++....+.+
T Consensus 179 vENTF~SIp~~~i~~v~p~~~k--~i~~lc~kn~~~S~~ki~~~~~P~LFiSGlkDelVP--P~~Mr~Ly~~c~S~~K-- 252 (300)
T KOG4391|consen 179 VENTFLSIPHMAIPLVFPFPMK--YIPLLCYKNKWLSYRKIGQCRMPFLFISGLKDELVP--PVMMRQLYELCPSRTK-- 252 (300)
T ss_pred eechhccchhhhhheeccchhh--HHHHHHHHhhhcchhhhccccCceEEeecCccccCC--cHHHHHHHHhCchhhh--
Confidence 776554321 11111110000 00001111 334444566778999999999999999 9999999988776544
Q ss_pred EEEEeCCCCcccCccccHHHHHHHHHHHHHHhccC
Q 004574 701 RLVLLPFEHHVYAARENVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 701 ~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~~ 735 (744)
++.+||++.|.-.- - ..-+++.+.+||.+.-..
T Consensus 253 rl~eFP~gtHNDT~-i-~dGYfq~i~dFlaE~~~~ 285 (300)
T KOG4391|consen 253 RLAEFPDGTHNDTW-I-CDGYFQAIEDFLAEVVKS 285 (300)
T ss_pred hheeCCCCccCceE-E-eccHHHHHHHHHHHhccC
Confidence 89999999997431 1 246788999999987653
No 49
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=99.75 E-value=1e-15 Score=152.60 Aligned_cols=344 Identities=11% Similarity=0.101 Sum_probs=174.4
Q ss_pred CCC-CceeeecCCCC---CcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccce
Q 004574 17 SLG-PEKEVHGYPDG---AKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSF 92 (744)
Q Consensus 17 ~~g-~~~~l~~~~~~---~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~ 92 (744)
.+| +.+|||..+.. ..-..+.|.+||++|.|.+ +.++..+||++|+++++.+|||.++... ....
T Consensus 18 ~TG~~VtrLT~~~~~~h~~YF~~~~ft~dG~kllF~s-------~~dg~~nly~lDL~t~~i~QLTdg~g~~----~~g~ 86 (386)
T PF14583_consen 18 DTGHRVTRLTPPDGHSHRLYFYQNCFTDDGRKLLFAS-------DFDGNRNLYLLDLATGEITQLTDGPGDN----TFGG 86 (386)
T ss_dssp TT--EEEE-S-TTS-EE---TTS--B-TTS-EEEEEE--------TTSS-EEEEEETTT-EEEE---SS-B-----TTT-
T ss_pred CCCceEEEecCCCCcccceeecCCCcCCCCCEEEEEe-------ccCCCcceEEEEcccCEEEECccCCCCC----ccce
Confidence 344 55677743321 1446899999999999965 3478899999999999999999987532 1245
Q ss_pred EEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeec
Q 004574 93 VWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDF 171 (744)
Q Consensus 93 ~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l 171 (744)
.+||+++.|+|.... .+|+.+|+ +++.+.|
T Consensus 87 ~~s~~~~~~~Yv~~~-------------------------------------------------~~l~~vdL~T~e~~~v 117 (386)
T PF14583_consen 87 FLSPDDRALYYVKNG-------------------------------------------------RSLRRVDLDTLEERVV 117 (386)
T ss_dssp EE-TTSSEEEEEETT-------------------------------------------------TEEEEEETTT--EEEE
T ss_pred EEecCCCeEEEEECC-------------------------------------------------CeEEEEECCcCcEEEE
Confidence 889999999876321 47999999 6666666
Q ss_pred CCC--ceeeeecc--CCCCceEEEEEeeCCcccccc---------cCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccC
Q 004574 172 GTP--AVYTAVEP--SPDQKYVLITSMHRPYSYKVP---------CARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYN 238 (744)
Q Consensus 172 ~~~--~~~~~~~~--SpDG~~i~~~~~~~~~~~~~~---------~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~ 238 (744)
... .......| ..|++.++.......+..... ...-...|+.+++.+++.+.+.....=
T Consensus 118 y~~p~~~~g~gt~v~n~d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v~~~~~w-------- 189 (386)
T PF14583_consen 118 YEVPDDWKGYGTWVANSDCTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVVFEDTDW-------- 189 (386)
T ss_dssp EE--TTEEEEEEEEE-TTSSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEEEEESS---------
T ss_pred EECCcccccccceeeCCCccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEEEecCcc--------
Confidence 332 33323344 567888877655432211111 011124799999999888877654321
Q ss_pred CccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee--ccceeceeeccCCceEEEe
Q 004574 239 SVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL--DLRFRSVSWCDDSLALVNE 316 (744)
Q Consensus 239 ~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~--~~~~~~~~~SpDg~~l~~~ 316 (744)
+..+.+||....+|.|+- .++...+ ..+||.++. +++..+.+... ...+.---|+|||..|+|.
T Consensus 190 -----lgH~~fsP~dp~li~fCH--EGpw~~V----d~RiW~i~~---dg~~~~~v~~~~~~e~~gHEfw~~DG~~i~y~ 255 (386)
T PF14583_consen 190 -----LGHVQFSPTDPTLIMFCH--EGPWDLV----DQRIWTINT---DGSNVKKVHRRMEGESVGHEFWVPDGSTIWYD 255 (386)
T ss_dssp -----EEEEEEETTEEEEEEEEE---S-TTTS----S-SEEEEET---TS---EESS---TTEEEEEEEE-TTSS-EEEE
T ss_pred -----ccCcccCCCCCCEEEEec--cCCccee----ceEEEEEEc---CCCcceeeecCCCCcccccccccCCCCEEEEE
Confidence 235788998888799984 3443321 126999998 67766666543 2234455699999999997
Q ss_pred eeeec-cceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCc
Q 004574 317 TWYKT-SQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIP 395 (744)
Q Consensus 317 ~~~~~-~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 395 (744)
..... ...-|+.+++.++ +.+.+.....-. -+..|+||+.++--..+ ....+-...|..- ...+
T Consensus 256 ~~~~~~~~~~i~~~d~~t~--~~~~~~~~p~~~--------H~~ss~Dg~L~vGDG~d----~p~~v~~~~~~~~-~~~p 320 (386)
T PF14583_consen 256 SYTPGGQDFWIAGYDPDTG--ERRRLMEMPWCS--------HFMSSPDGKLFVGDGGD----APVDVADAGGYKI-ENDP 320 (386)
T ss_dssp EEETTT--EEEEEE-TTT----EEEEEEE-SEE--------EEEE-TTSSEEEEEE------------------------
T ss_pred eecCCCCceEEEeeCCCCC--CceEEEeCCcee--------eeEEcCCCCEEEecCCC----CCcccccccccee-cCCc
Confidence 64332 3456888888874 334443322110 14468999877654311 1111111111111 2356
Q ss_pred eEEEEecCCCceeEEeeccch-hhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECC
Q 004574 396 FLDLFDINTGSKERIWESNRE-KYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWP 460 (744)
Q Consensus 396 ~l~~~d~~~g~~~~l~~~~~~-~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~ 460 (744)
.|+++++..+..+.|...+.. .......... --.++|||||++++|.+ +...++.||.+++.
T Consensus 321 ~i~~~~~~~~~~~~l~~h~~sw~v~~~~~q~~--hPhp~FSPDgk~VlF~S-d~~G~~~vY~v~i~ 383 (386)
T PF14583_consen 321 WIYLFDVEAGRFRKLARHDTSWKVLDGDRQVT--HPHPSFSPDGKWVLFRS-DMEGPPAVYLVEIP 383 (386)
T ss_dssp EEEEEETTTTEEEEEEE-------BTTBSSTT------EE-TTSSEEEEEE--TTSS-EEEEEE--
T ss_pred EEEEeccccCceeeeeeccCcceeecCCCccC--CCCCccCCCCCEEEEEC-CCCCCccEEEEeCc
Confidence 899999998887777654321 0000000000 00158999999999866 55777889998753
No 50
>PF05448 AXE1: Acetyl xylan esterase (AXE1); InterPro: IPR008391 This family consists of several bacterial acetyl xylan esterase proteins. Acetyl xylan esterases are enzymes that hydrolyse the ester linkages of the acetyl groups in position 2 and/or 3 of the xylose moieties of natural acetylated xylan from hardwood. These enzymes are one of the accessory enzymes which are part of the xylanolytic system, together with xylanases, beta-xylosidases, alpha-arabinofuranosidases and methylglucuronidases; these are all required for the complete hydrolysis of xylan [].; PDB: 1VLQ_H 3M81_E 3M82_D 3M83_C 3FCY_A 1ODS_F 1ODT_C 1L7A_A 3FYT_A 2XLB_F ....
Probab=99.74 E-value=2.7e-17 Score=166.17 Aligned_cols=225 Identities=21% Similarity=0.135 Sum_probs=147.3
Q ss_pred CCCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeE
Q 004574 478 ASLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFA 557 (744)
Q Consensus 478 ~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~ 557 (744)
+......++|.+.+|..+.|++++|.+. .+++|+||.+||.+... ........++.+||+
T Consensus 52 ~~~~vy~v~f~s~~g~~V~g~l~~P~~~----~~~~Pavv~~hGyg~~~----------------~~~~~~~~~a~~G~~ 111 (320)
T PF05448_consen 52 PGVEVYDVSFESFDGSRVYGWLYRPKNA----KGKLPAVVQFHGYGGRS----------------GDPFDLLPWAAAGYA 111 (320)
T ss_dssp SSEEEEEEEEEEGGGEEEEEEEEEES-S----SSSEEEEEEE--TT--G----------------GGHHHHHHHHHTT-E
T ss_pred CCEEEEEEEEEccCCCEEEEEEEecCCC----CCCcCEEEEecCCCCCC----------------CCcccccccccCCeE
Confidence 4567788999999999999999999842 45799999999964210 011223467899999
Q ss_pred EEecCCCCCCC------------------CCCCC---------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHH
Q 004574 558 VLAGPSIPIIG------------------EGDKL---------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAH 610 (744)
Q Consensus 558 v~~~~~~~~~g------------------~g~~~---------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~ 610 (744)
|+..+.++..+ .|... ++.|+.+++++|..++.+|.+||+++|.|+||.+++.
T Consensus 112 vl~~d~rGqg~~~~d~~~~~~~~~~g~~~~g~~~~~e~~yyr~~~~D~~ravd~l~slpevD~~rI~v~G~SqGG~lal~ 191 (320)
T PF05448_consen 112 VLAMDVRGQGGRSPDYRGSSGGTLKGHITRGIDDNPEDYYYRRVYLDAVRAVDFLRSLPEVDGKRIGVTGGSQGGGLALA 191 (320)
T ss_dssp EEEE--TTTSSSS-B-SSBSSS-SSSSTTTTTTS-TTT-HHHHHHHHHHHHHHHHHTSTTEEEEEEEEEEETHHHHHHHH
T ss_pred EEEecCCCCCCCCCCccccCCCCCccHHhcCccCchHHHHHHHHHHHHHHHHHHHHhCCCcCcceEEEEeecCchHHHHH
Confidence 99732222111 11111 2249999999999999999999999999999999999
Q ss_pred HHHhCCCceeEEEEccCCCCC-C----CC--CCcccccc-------cchh---hcHHHHHhcCcccccCCCCCCEEEEee
Q 004574 611 LLAHAPHLFCCGIARSGSYNK-T----LT--PFGFQTEF-------RTLW---EATNVYIEMSPITHANKIKKPILIIHG 673 (744)
Q Consensus 611 ~~~~~p~~~~~~v~~~~~~~~-~----~~--~~~~~~~~-------~~~~---~~~~~~~~~~~~~~~~~~~~P~l~i~G 673 (744)
+++.+ ++++++++..|.+.- . .. ...+.... .... +..+.+.-++....+++|++|+|+..|
T Consensus 192 ~aaLd-~rv~~~~~~vP~l~d~~~~~~~~~~~~~y~~~~~~~~~~d~~~~~~~~v~~~L~Y~D~~nfA~ri~~pvl~~~g 270 (320)
T PF05448_consen 192 AAALD-PRVKAAAADVPFLCDFRRALELRADEGPYPEIRRYFRWRDPHHEREPEVFETLSYFDAVNFARRIKCPVLFSVG 270 (320)
T ss_dssp HHHHS-ST-SEEEEESESSSSHHHHHHHT--STTTHHHHHHHHHHSCTHCHHHHHHHHHHTT-HHHHGGG--SEEEEEEE
T ss_pred HHHhC-ccccEEEecCCCccchhhhhhcCCccccHHHHHHHHhccCCCcccHHHHHHHHhhhhHHHHHHHcCCCEEEEEe
Confidence 99999 579999888886421 0 00 01110000 0000 112334456778888999999999999
Q ss_pred CCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 674 EVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 674 ~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
..|..|| +....+.|.++. .++++.+||..+|... .....++.++||.++
T Consensus 271 l~D~~cP--P~t~fA~yN~i~---~~K~l~vyp~~~He~~----~~~~~~~~~~~l~~~ 320 (320)
T PF05448_consen 271 LQDPVCP--PSTQFAAYNAIP---GPKELVVYPEYGHEYG----PEFQEDKQLNFLKEH 320 (320)
T ss_dssp TT-SSS---HHHHHHHHCC-----SSEEEEEETT--SSTT----HHHHHHHHHHHHHH-
T ss_pred cCCCCCC--chhHHHHHhccC---CCeeEEeccCcCCCch----hhHHHHHHHHHHhcC
Confidence 9999998 888888888885 3579999999999654 323367788999764
No 51
>PLN02652 hydrolase; alpha/beta fold family protein
Probab=99.74 E-value=2.5e-16 Score=165.18 Aligned_cols=223 Identities=15% Similarity=0.140 Sum_probs=149.4
Q ss_pred eEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEec
Q 004574 482 KEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAG 561 (744)
Q Consensus 482 ~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~ 561 (744)
...+.+...++..+++..|.|.. ..+.|+||++||.+. ........+..|+++||.|+.
T Consensus 110 ~~~~~~~~~~~~~l~~~~~~p~~-----~~~~~~Vl~lHG~~~---------------~~~~~~~~a~~L~~~Gy~V~~- 168 (395)
T PLN02652 110 WATSLFYGARRNALFCRSWAPAA-----GEMRGILIIIHGLNE---------------HSGRYLHFAKQLTSCGFGVYA- 168 (395)
T ss_pred EEEEEEECCCCCEEEEEEecCCC-----CCCceEEEEECCchH---------------HHHHHHHHHHHHHHCCCEEEE-
Confidence 34455566777889999998854 234789999999641 111122456678889999999
Q ss_pred CCCCCCCCCCCC-----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC---ceeEEEEccC
Q 004574 562 PSIPIIGEGDKL-----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH---LFCCGIARSG 627 (744)
Q Consensus 562 ~~~~~~g~g~~~-----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~---~~~~~v~~~~ 627 (744)
++.+|+|.+. ..+|+.++++++.... +..++.++||||||.+++.++. +|+ +++++|+.+|
T Consensus 169 --~D~rGhG~S~~~~~~~~~~~~~~~Dl~~~l~~l~~~~--~~~~i~lvGhSmGG~ial~~a~-~p~~~~~v~glVL~sP 243 (395)
T PLN02652 169 --MDWIGHGGSDGLHGYVPSLDYVVEDTEAFLEKIRSEN--PGVPCFLFGHSTGGAVVLKAAS-YPSIEDKLEGIVLTSP 243 (395)
T ss_pred --eCCCCCCCCCCCCCCCcCHHHHHHHHHHHHHHHHHhC--CCCCEEEEEECHHHHHHHHHHh-ccCcccccceEEEECc
Confidence 5566665432 1347778888877542 2247999999999999987664 554 7999999998
Q ss_pred CCCCCCCC----------------Cccccc--c-cchhhcHHH----H------------------Hhc--CcccccCCC
Q 004574 628 SYNKTLTP----------------FGFQTE--F-RTLWEATNV----Y------------------IEM--SPITHANKI 664 (744)
Q Consensus 628 ~~~~~~~~----------------~~~~~~--~-~~~~~~~~~----~------------------~~~--~~~~~~~~~ 664 (744)
........ +.+... . ......+.. + .+. .....+.++
T Consensus 244 ~l~~~~~~~~~~~~~~l~~~~~p~~~~~~~~~~~~~~s~~~~~~~~~~~dp~~~~g~i~~~~~~~~~~~~~~l~~~L~~I 323 (395)
T PLN02652 244 ALRVKPAHPIVGAVAPIFSLVAPRFQFKGANKRGIPVSRDPAALLAKYSDPLVYTGPIRVRTGHEILRISSYLTRNFKSV 323 (395)
T ss_pred ccccccchHHHHHHHHHHHHhCCCCcccCcccccCCcCCCHHHHHHHhcCCCcccCCchHHHHHHHHHHHHHHHhhcccC
Confidence 75321100 000000 0 000000000 0 000 012345678
Q ss_pred CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhcc
Q 004574 665 KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCL 734 (744)
Q Consensus 665 ~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~ 734 (744)
++|+|++||++|.++| ++.++++++.+.. ..++++++++++|.....+.++++++.+.+||..++.
T Consensus 324 ~vPvLIi~G~~D~vvp--~~~a~~l~~~~~~--~~k~l~~~~ga~H~l~~e~~~e~v~~~I~~FL~~~~~ 389 (395)
T PLN02652 324 TVPFMVLHGTADRVTD--PLASQDLYNEAAS--RHKDIKLYDGFLHDLLFEPEREEVGRDIIDWMEKRLD 389 (395)
T ss_pred CCCEEEEEeCCCCCCC--HHHHHHHHHhcCC--CCceEEEECCCeEEeccCCCHHHHHHHHHHHHHHHhh
Confidence 9999999999999999 9999999887643 3458889999999987666789999999999998874
No 52
>COG1647 Esterase/lipase [General function prediction only]
Probab=99.73 E-value=6.6e-17 Score=145.30 Aligned_cols=190 Identities=18% Similarity=0.175 Sum_probs=138.0
Q ss_pred eEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC----------ChHHHHHHHHH
Q 004574 514 PCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK----------LPNDSAEAAVE 583 (744)
Q Consensus 514 p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~----------~~~~d~~~~~~ 583 (744)
-.|+++||. +|++.... .....|..+||.|.+|. .+|+|.. .+.+|+.++++
T Consensus 16 ~AVLllHGF-----------TGt~~Dvr----~Lgr~L~e~GyTv~aP~---ypGHG~~~e~fl~t~~~DW~~~v~d~Y~ 77 (243)
T COG1647 16 RAVLLLHGF-----------TGTPRDVR----MLGRYLNENGYTVYAPR---YPGHGTLPEDFLKTTPRDWWEDVEDGYR 77 (243)
T ss_pred EEEEEEecc-----------CCCcHHHH----HHHHHHHHCCceEecCC---CCCCCCCHHHHhcCCHHHHHHHHHHHHH
Confidence 688899995 33333332 45567888999999944 4555543 45569999999
Q ss_pred HHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCccc------ccc-----c---------
Q 004574 584 EVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQ------TEF-----R--------- 643 (744)
Q Consensus 584 ~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~------~~~-----~--------- 643 (744)
+|.+.++ +.|+++|.||||-+|+.+|.+.| ++++|.+++.+.......-+. ... .
T Consensus 78 ~L~~~gy---~eI~v~GlSmGGv~alkla~~~p--~K~iv~m~a~~~~k~~~~iie~~l~y~~~~kk~e~k~~e~~~~e~ 152 (243)
T COG1647 78 DLKEAGY---DEIAVVGLSMGGVFALKLAYHYP--PKKIVPMCAPVNVKSWRIIIEGLLEYFRNAKKYEGKDQEQIDKEM 152 (243)
T ss_pred HHHHcCC---CeEEEEeecchhHHHHHHHhhCC--ccceeeecCCcccccchhhhHHHHHHHHHhhhccCCCHHHHHHHH
Confidence 9998765 68999999999999999999997 788888887765221110000 000 0
Q ss_pred -----chhhcHHHHHhc--CcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccc
Q 004574 644 -----TLWEATNVYIEM--SPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARE 716 (744)
Q Consensus 644 -----~~~~~~~~~~~~--~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~ 716 (744)
.++.....+.+. .....+++|..|+|+++|.+|..|| .+.+.-++..+.. .+.++.+|++++|.++...
T Consensus 153 ~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~pt~vvq~~~D~mv~--~~sA~~Iy~~v~s--~~KeL~~~e~SgHVIt~D~ 228 (243)
T COG1647 153 KSYKDTPMTTTAQLKKLIKDARRSLDKIYSPTLVVQGRQDEMVP--AESANFIYDHVES--DDKELKWLEGSGHVITLDK 228 (243)
T ss_pred HHhhcchHHHHHHHHHHHHHHHhhhhhcccchhheecccCCCCC--HHHHHHHHHhccC--CcceeEEEccCCceeecch
Confidence 011111111111 2345578899999999999999999 9999999988865 4569999999999999888
Q ss_pred cHHHHHHHHHHHHH
Q 004574 717 NVMHVIWETDRWLQ 730 (744)
Q Consensus 717 ~~~~~~~~~~~fl~ 730 (744)
..+++.+.++.||+
T Consensus 229 Erd~v~e~V~~FL~ 242 (243)
T COG1647 229 ERDQVEEDVITFLE 242 (243)
T ss_pred hHHHHHHHHHHHhh
Confidence 89999999999996
No 53
>KOG1515 consensus Arylacetamide deacetylase [Defense mechanisms]
Probab=99.71 E-value=2.4e-15 Score=150.70 Aligned_cols=234 Identities=15% Similarity=0.093 Sum_probs=165.0
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHH-HhCCeEEE
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIF-LARRFAVL 559 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~G~~v~ 559 (744)
....+.+.. ...+...+|.|....+. .++|+|||+|||||...+..- ..+......+ .+.+.+|+
T Consensus 62 ~~~dv~~~~--~~~l~vRly~P~~~~~~--~~~p~lvyfHGGGf~~~S~~~----------~~y~~~~~~~a~~~~~vvv 127 (336)
T KOG1515|consen 62 TSKDVTIDP--FTNLPVRLYRPTSSSSE--TKLPVLVYFHGGGFCLGSANS----------PAYDSFCTRLAAELNCVVV 127 (336)
T ss_pred eeeeeEecC--CCCeEEEEEcCCCCCcc--cCceEEEEEeCCccEeCCCCC----------chhHHHHHHHHHHcCeEEE
Confidence 345555543 44588899999876441 359999999999976543321 1122344455 46799999
Q ss_pred ecCCCCCCCCCCCChHHHHHHHHHHHHHc----CCCCCCcEEEEEechHHHHHHHHHHhC------CCceeEEEEccCCC
Q 004574 560 AGPSIPIIGEGDKLPNDSAEAAVEEVVRR----GVADPSRIAVGGHSYGAFMTAHLLAHA------PHLFCCGIARSGSY 629 (744)
Q Consensus 560 ~~~~~~~~g~g~~~~~~d~~~~~~~l~~~----~~~d~~~i~l~G~S~GG~~a~~~~~~~------p~~~~~~v~~~~~~ 629 (744)
+++|+..+.+.....++|...++.|+.++ ..+|++||+|+|-|.||.+|..++.+. +..+++.|++.|++
T Consensus 128 SVdYRLAPEh~~Pa~y~D~~~Al~w~~~~~~~~~~~D~~rv~l~GDSaGGNia~~va~r~~~~~~~~~ki~g~ili~P~~ 207 (336)
T KOG1515|consen 128 SVDYRLAPEHPFPAAYDDGWAALKWVLKNSWLKLGADPSRVFLAGDSAGGNIAHVVAQRAADEKLSKPKIKGQILIYPFF 207 (336)
T ss_pred ecCcccCCCCCCCccchHHHHHHHHHHHhHHHHhCCCcccEEEEccCccHHHHHHHHHHHhhccCCCcceEEEEEEeccc
Confidence 99999999988888999999999999986 468999999999999999999988653 35789999999987
Q ss_pred CCCCCCCc-cc----ccccchhhcHHHHHhc--------------Cccc-----ccCCCC-CCEEEEeeCCCCCCCCCHH
Q 004574 630 NKTLTPFG-FQ----TEFRTLWEATNVYIEM--------------SPIT-----HANKIK-KPILIIHGEVDDKVGLFPM 684 (744)
Q Consensus 630 ~~~~~~~~-~~----~~~~~~~~~~~~~~~~--------------~~~~-----~~~~~~-~P~l~i~G~~D~~v~~~~~ 684 (744)
........ +. ......+...+.+++. +|.. ...... .|+|++.++.|.+ .+
T Consensus 208 ~~~~~~~~e~~~~~~~~~~~~~~~~~~~w~~~lP~~~~~~~~p~~np~~~~~~~d~~~~~lp~tlv~~ag~D~L----~D 283 (336)
T KOG1515|consen 208 QGTDRTESEKQQNLNGSPELARPKIDKWWRLLLPNGKTDLDHPFINPVGNSLAKDLSGLGLPPTLVVVAGYDVL----RD 283 (336)
T ss_pred CCCCCCCHHHHHhhcCCcchhHHHHHHHHHHhCCCCCCCcCCccccccccccccCccccCCCceEEEEeCchhh----hh
Confidence 53221110 00 0001111111222211 1222 111223 5599999999988 78
Q ss_pred HHHHHHHHHHhCCCcEEEEEeCCCCcccCcc----ccHHHHHHHHHHHHHHh
Q 004574 685 QAERFFDALKGHGALSRLVLLPFEHHVYAAR----ENVMHVIWETDRWLQKY 732 (744)
Q Consensus 685 ~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~----~~~~~~~~~~~~fl~~~ 732 (744)
+...+.++|++.|.+++++.++++.|+++.. +...+.+..+.+|+++.
T Consensus 284 ~~~~Y~~~Lkk~Gv~v~~~~~e~~~H~~~~~~~~~~~a~~~~~~i~~fi~~~ 335 (336)
T KOG1515|consen 284 EGLAYAEKLKKAGVEVTLIHYEDGFHGFHILDPSSKEAHALMDAIVEFIKSN 335 (336)
T ss_pred hhHHHHHHHHHcCCeEEEEEECCCeeEEEecCCchhhHHHHHHHHHHHHhhc
Confidence 9999999999999999999999999997642 34567788888888764
No 54
>PRK11460 putative hydrolase; Provisional
Probab=99.69 E-value=1.8e-15 Score=147.66 Aligned_cols=124 Identities=22% Similarity=0.148 Sum_probs=98.7
Q ss_pred HHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCc
Q 004574 578 AEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSP 657 (744)
Q Consensus 578 ~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 657 (744)
+.+.++++.++..++.+||+++|+|+||.+++.++.+.|+.+.+++++++.+.. ...
T Consensus 87 l~~~i~~~~~~~~~~~~~i~l~GfS~Gg~~al~~a~~~~~~~~~vv~~sg~~~~------~~~----------------- 143 (232)
T PRK11460 87 FIETVRYWQQQSGVGASATALIGFSQGAIMALEAVKAEPGLAGRVIAFSGRYAS------LPE----------------- 143 (232)
T ss_pred HHHHHHHHHHhcCCChhhEEEEEECHHHHHHHHHHHhCCCcceEEEEecccccc------ccc-----------------
Confidence 445566666677788899999999999999999999998888888888774310 000
Q ss_pred ccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhcc
Q 004574 658 ITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCL 734 (744)
Q Consensus 658 ~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~ 734 (744)
....++|+|++||++|.+|| .+.++++++.|++.+.++++++|++++|.+.. +....+.+||.+.+.
T Consensus 144 ---~~~~~~pvli~hG~~D~vvp--~~~~~~~~~~L~~~g~~~~~~~~~~~gH~i~~-----~~~~~~~~~l~~~l~ 210 (232)
T PRK11460 144 ---TAPTATTIHLIHGGEDPVID--VAHAVAAQEALISLGGDVTLDIVEDLGHAIDP-----RLMQFALDRLRYTVP 210 (232)
T ss_pred ---cccCCCcEEEEecCCCCccC--HHHHHHHHHHHHHCCCCeEEEEECCCCCCCCH-----HHHHHHHHHHHHHcc
Confidence 11247899999999999999 99999999999999999999999999999863 445566677766653
No 55
>COG2267 PldB Lysophospholipase [Lipid metabolism]
Probab=99.69 E-value=3e-15 Score=150.77 Aligned_cols=222 Identities=21% Similarity=0.217 Sum_probs=151.0
Q ss_pred eEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEec
Q 004574 482 KEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAG 561 (744)
Q Consensus 482 ~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~ 561 (744)
.....+...++..+....+.++. .+..+||++||.+ ++...+...+..|..+||.|++
T Consensus 9 ~~~~~~~~~d~~~~~~~~~~~~~------~~~g~Vvl~HG~~---------------Eh~~ry~~la~~l~~~G~~V~~- 66 (298)
T COG2267 9 RTEGYFTGADGTRLRYRTWAAPE------PPKGVVVLVHGLG---------------EHSGRYEELADDLAARGFDVYA- 66 (298)
T ss_pred cccceeecCCCceEEEEeecCCC------CCCcEEEEecCch---------------HHHHHHHHHHHHHHhCCCEEEE-
Confidence 44455667889999888887764 1237999999973 3444455677889999999999
Q ss_pred CCCCCCCCCCCC------------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCC
Q 004574 562 PSIPIIGEGDKL------------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSY 629 (744)
Q Consensus 562 ~~~~~~g~g~~~------------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~ 629 (744)
++.+|+|.+. ..+|+...++.+.... -..+++|+||||||.+|+.++.+.+..+.++|+.+|++
T Consensus 67 --~D~RGhG~S~r~~rg~~~~f~~~~~dl~~~~~~~~~~~--~~~p~~l~gHSmGg~Ia~~~~~~~~~~i~~~vLssP~~ 142 (298)
T COG2267 67 --LDLRGHGRSPRGQRGHVDSFADYVDDLDAFVETIAEPD--PGLPVFLLGHSMGGLIALLYLARYPPRIDGLVLSSPAL 142 (298)
T ss_pred --ecCCCCCCCCCCCcCCchhHHHHHHHHHHHHHHHhccC--CCCCeEEEEeCcHHHHHHHHHHhCCccccEEEEECccc
Confidence 6777776663 2225555555555431 13689999999999999999999999999999999986
Q ss_pred CCCC-------------------CCCcccc----cccchh-----hcHHHHHh--------------------cC--ccc
Q 004574 630 NKTL-------------------TPFGFQT----EFRTLW-----EATNVYIE--------------------MS--PIT 659 (744)
Q Consensus 630 ~~~~-------------------~~~~~~~----~~~~~~-----~~~~~~~~--------------------~~--~~~ 659 (744)
.... ..+.+.. .....+ ...+.|.+ .. ...
T Consensus 143 ~l~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~sr~~~~~~~~~~dP~~~~~~~~~~w~~~~~~a~~~~~~~ 222 (298)
T COG2267 143 GLGGAILRLILARLALKLLGRIRPKLPVDSNLLEGVLTDDLSRDPAEVAAYEADPLIGVGGPVSRWVDLALLAGRVPALR 222 (298)
T ss_pred cCChhHHHHHHHHHhcccccccccccccCcccccCcCcchhhcCHHHHHHHhcCCccccCCccHHHHHHHHHhhcccchh
Confidence 6431 0001110 000000 00001100 01 122
Q ss_pred ccCCCCCCEEEEeeCCCCCCCCCH-HHHHHHHHHHHhCCCc-EEEEEeCCCCcccCccccH--HHHHHHHHHHHHHhcc
Q 004574 660 HANKIKKPILIIHGEVDDKVGLFP-MQAERFFDALKGHGAL-SRLVLLPFEHHVYAARENV--MHVIWETDRWLQKYCL 734 (744)
Q Consensus 660 ~~~~~~~P~l~i~G~~D~~v~~~~-~~~~~~~~~l~~~~~~-~~~~~~~~~~H~~~~~~~~--~~~~~~~~~fl~~~l~ 734 (744)
....+++|+|+++|++|.+++ . +...+++ +..+.+ +++++++++.|.+..+... +.+++.+.+||++++.
T Consensus 223 ~~~~~~~PvLll~g~~D~vv~--~~~~~~~~~---~~~~~~~~~~~~~~g~~He~~~E~~~~r~~~~~~~~~~l~~~~~ 296 (298)
T COG2267 223 DAPAIALPVLLLQGGDDRVVD--NVEGLARFF---ERAGSPDKELKVIPGAYHELLNEPDRAREEVLKDILAWLAEALP 296 (298)
T ss_pred ccccccCCEEEEecCCCcccc--CcHHHHHHH---HhcCCCCceEEecCCcchhhhcCcchHHHHHHHHHHHHHHhhcc
Confidence 245678999999999999986 4 4444444 444433 6999999999998877777 8999999999998764
No 56
>COG0657 Aes Esterase/lipase [Lipid metabolism]
Probab=99.67 E-value=9.2e-15 Score=150.71 Aligned_cols=223 Identities=18% Similarity=0.163 Sum_probs=155.1
Q ss_pred CCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCC
Q 004574 490 KDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGE 569 (744)
Q Consensus 490 ~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~ 569 (744)
..+..+...+|.|.. ....+.|+|||+|||||...+... + .......++..|+.|++++|+..+.+
T Consensus 59 ~~~~~~~~~~y~p~~---~~~~~~p~vly~HGGg~~~g~~~~--------~---~~~~~~~~~~~g~~vv~vdYrlaPe~ 124 (312)
T COG0657 59 PSGDGVPVRVYRPDR---KAAATAPVVLYLHGGGWVLGSLRT--------H---DALVARLAAAAGAVVVSVDYRLAPEH 124 (312)
T ss_pred CCCCceeEEEECCCC---CCCCCCcEEEEEeCCeeeecChhh--------h---HHHHHHHHHHcCCEEEecCCCCCCCC
Confidence 344557789999921 223458999999999976543321 1 11334455678999999888888887
Q ss_pred CCCChHHHHHHHHHHHHHcC---CCCCCcEEEEEechHHHHHHHHHHhCCC----ceeEEEEccCCCCCCCCCCcccccc
Q 004574 570 GDKLPNDSAEAAVEEVVRRG---VADPSRIAVGGHSYGAFMTAHLLAHAPH----LFCCGIARSGSYNKTLTPFGFQTEF 642 (744)
Q Consensus 570 g~~~~~~d~~~~~~~l~~~~---~~d~~~i~l~G~S~GG~~a~~~~~~~p~----~~~~~v~~~~~~~~~~~~~~~~~~~ 642 (744)
......+|+.+++.|+.++. .+|+++|+++|+|.||++++.++....+ ..++.++++|.++.......+....
T Consensus 125 ~~p~~~~d~~~a~~~l~~~~~~~g~dp~~i~v~GdSAGG~La~~~a~~~~~~~~~~p~~~~li~P~~d~~~~~~~~~~~~ 204 (312)
T COG0657 125 PFPAALEDAYAAYRWLRANAAELGIDPSRIAVAGDSAGGHLALALALAARDRGLPLPAAQVLISPLLDLTSSAASLPGYG 204 (312)
T ss_pred CCCchHHHHHHHHHHHHhhhHhhCCCccceEEEecCcccHHHHHHHHHHHhcCCCCceEEEEEecccCCcccccchhhcC
Confidence 77777889999999999873 5899999999999999999999876532 5688899999987654111111000
Q ss_pred cchhhc--------HHHHH---------hcCccccc--CCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEE
Q 004574 643 RTLWEA--------TNVYI---------EMSPITHA--NKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLV 703 (744)
Q Consensus 643 ~~~~~~--------~~~~~---------~~~~~~~~--~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~ 703 (744)
...+-. ...+. ..+|+... .. -.|+++++|+.|.+ ..+++.+.++|++.|.++++.
T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~spl~~~~~~~-lPP~~i~~a~~D~l----~~~~~~~a~~L~~agv~~~~~ 279 (312)
T COG0657 205 EADLLDAAAILAWFADLYLGAAPDREDPEASPLASDDLSG-LPPTLIQTAEFDPL----RDEGEAYAERLRAAGVPVELR 279 (312)
T ss_pred CccccCHHHHHHHHHHHhCcCccccCCCccCccccccccC-CCCEEEEecCCCcc----hhHHHHHHHHHHHcCCeEEEE
Confidence 000000 01111 12232222 22 47899999999998 568999999999999999999
Q ss_pred EeCCCCcccCcc--ccHHHHHHHHHHHHHH
Q 004574 704 LLPFEHHVYAAR--ENVMHVIWETDRWLQK 731 (744)
Q Consensus 704 ~~~~~~H~~~~~--~~~~~~~~~~~~fl~~ 731 (744)
.++++.|.|... +.....+..+.+||..
T Consensus 280 ~~~g~~H~f~~~~~~~a~~~~~~~~~~l~~ 309 (312)
T COG0657 280 VYPGMIHGFDLLTGPEARSALRQIAAFLRA 309 (312)
T ss_pred EeCCcceeccccCcHHHHHHHHHHHHHHHH
Confidence 999999988533 2344556677777763
No 57
>COG3458 Acetyl esterase (deacetylase) [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=99.66 E-value=1.8e-15 Score=139.86 Aligned_cols=223 Identities=17% Similarity=0.128 Sum_probs=153.5
Q ss_pred CCCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeE
Q 004574 478 ASLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFA 557 (744)
Q Consensus 478 ~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~ 557 (744)
+.++.-.++|.+.+|.+|.+++.+|... ++++|+||..||-+...+.. .-...|+..||+
T Consensus 52 ~~ve~ydvTf~g~~g~rI~gwlvlP~~~----~~~~P~vV~fhGY~g~~g~~----------------~~~l~wa~~Gya 111 (321)
T COG3458 52 PRVEVYDVTFTGYGGARIKGWLVLPRHE----KGKLPAVVQFHGYGGRGGEW----------------HDMLHWAVAGYA 111 (321)
T ss_pred CceEEEEEEEeccCCceEEEEEEeeccc----CCccceEEEEeeccCCCCCc----------------ccccccccccee
Confidence 3456778899999999999999999862 36799999999853211111 113456889999
Q ss_pred EEecCCCCCCCC--------CC-C-------------------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHH
Q 004574 558 VLAGPSIPIIGE--------GD-K-------------------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTA 609 (744)
Q Consensus 558 v~~~~~~~~~g~--------g~-~-------------------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~ 609 (744)
|+..+.++..+. +. + ..+.|+.++++-+.....+|.+||++.|.|.||.+++
T Consensus 112 vf~MdvRGQg~~~~dt~~~p~~~s~pG~mtrGilD~kd~yyyr~v~~D~~~ave~~~sl~~vde~Ri~v~G~SqGGglal 191 (321)
T COG3458 112 VFVMDVRGQGSSSQDTADPPGGPSDPGFMTRGILDRKDTYYYRGVFLDAVRAVEILASLDEVDEERIGVTGGSQGGGLAL 191 (321)
T ss_pred EEEEecccCCCccccCCCCCCCCcCCceeEeecccCCCceEEeeehHHHHHHHHHHhccCccchhheEEeccccCchhhh
Confidence 997222222111 00 0 1234999999999999999999999999999999999
Q ss_pred HHHHhCCCceeEEEEccCCCCCCCCCCcccccccchh---------------hcHHHHHhcCcccccCCCCCCEEEEeeC
Q 004574 610 HLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLW---------------EATNVYIEMSPITHANKIKKPILIIHGE 674 (744)
Q Consensus 610 ~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~P~l~i~G~ 674 (744)
.+++.. .+++++++..|...-.-....... ..++ +..+.+.-++....+.++|.|+|+..|.
T Consensus 192 aaaal~-~rik~~~~~~Pfl~df~r~i~~~~--~~~ydei~~y~k~h~~~e~~v~~TL~yfD~~n~A~RiK~pvL~svgL 268 (321)
T COG3458 192 AAAALD-PRIKAVVADYPFLSDFPRAIELAT--EGPYDEIQTYFKRHDPKEAEVFETLSYFDIVNLAARIKVPVLMSVGL 268 (321)
T ss_pred hhhhcC-hhhhcccccccccccchhheeecc--cCcHHHHHHHHHhcCchHHHHHHHHhhhhhhhHHHhhccceEEeecc
Confidence 999998 589999999997541111111110 1111 1122233345566678899999999999
Q ss_pred CCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 675 VDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 675 ~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
.|.+|| +.-.-++++++.. +.++.+|+.-+|.-. .....+.+..|+...
T Consensus 269 ~D~vcp--PstqFA~yN~l~~---~K~i~iy~~~aHe~~----p~~~~~~~~~~l~~l 317 (321)
T COG3458 269 MDPVCP--PSTQFAAYNALTT---SKTIEIYPYFAHEGG----PGFQSRQQVHFLKIL 317 (321)
T ss_pred cCCCCC--ChhhHHHhhcccC---CceEEEeeccccccC----cchhHHHHHHHHHhh
Confidence 999998 8888888887764 357888887789644 223344566777653
No 58
>TIGR01840 esterase_phb esterase, PHB depolymerase family. This model describes a subfamily among lipases of the ab-hydrolase family. This subfamily includes bacterial depolymerases for poly(3-hydroxybutyrate) (PHB) and related polyhydroxyalkanoates (PHA), as well as acetyl xylan esterases, feruloyl esterases, and others from fungi.
Probab=99.64 E-value=5.2e-15 Score=143.12 Aligned_cols=180 Identities=18% Similarity=0.156 Sum_probs=113.4
Q ss_pred EEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCC------
Q 004574 497 ATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEG------ 570 (744)
Q Consensus 497 ~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g------ 570 (744)
+++|+|++. .+++|+||++||++.... .+.. .......+.+.||+|++++..+..+.+
T Consensus 1 ~~ly~P~~~----~~~~P~vv~lHG~~~~~~-----------~~~~-~~~~~~~a~~~g~~Vv~Pd~~g~~~~~~~~~~~ 64 (212)
T TIGR01840 1 MYVYVPAGL----TGPRALVLALHGCGQTAS-----------AYVI-DWGWKAAADRYGFVLVAPEQTSYNSSNNCWDWF 64 (212)
T ss_pred CEEEcCCCC----CCCCCEEEEeCCCCCCHH-----------HHhh-hcChHHHHHhCCeEEEecCCcCccccCCCCCCC
Confidence 368889874 235899999999763211 1110 001233344679999996553321100
Q ss_pred -C------CChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCC-c-cccc
Q 004574 571 -D------KLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPF-G-FQTE 641 (744)
Q Consensus 571 -~------~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~-~-~~~~ 641 (744)
. .....++.++++++.++..+|++||+|+|+|+||++++.++.++|+.+++++++++..-...... . ....
T Consensus 65 ~~~~~~~~~~~~~~~~~~i~~~~~~~~id~~~i~l~G~S~Gg~~a~~~a~~~p~~~~~~~~~~g~~~~~~~~~~~~~~~~ 144 (212)
T TIGR01840 65 FTHHRARGTGEVESLHQLIDAVKANYSIDPNRVYVTGLSAGGGMTAVLGCTYPDVFAGGASNAGLPYGEASSSISATPQM 144 (212)
T ss_pred CccccCCCCccHHHHHHHHHHHHHhcCcChhheEEEEECHHHHHHHHHHHhCchhheEEEeecCCcccccccchhhHhhc
Confidence 0 11234788999999998889999999999999999999999999999999998887632111000 0 0000
Q ss_pred ccchhhcHHHHHhc-Cc-ccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhC
Q 004574 642 FRTLWEATNVYIEM-SP-ITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGH 696 (744)
Q Consensus 642 ~~~~~~~~~~~~~~-~~-~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~ 696 (744)
..-.....+.+. .. .....+...|++++||++|.+|| +..++++++++++.
T Consensus 145 --~~~~~~~~~~~~~~~~~~~~~~~~p~~~i~hG~~D~vVp--~~~~~~~~~~l~~~ 197 (212)
T TIGR01840 145 --CTAATAASVCRLVRGMQSEYNGPTPIMSVVHGDADYTVL--PGNADEIRDAMLKV 197 (212)
T ss_pred --CCCCCHHHHHHHHhccCCcccCCCCeEEEEEcCCCceeC--cchHHHHHHHHHHh
Confidence 000011111111 00 11122234557899999999999 99999999999875
No 59
>PF12695 Abhydrolase_5: Alpha/beta hydrolase family; PDB: 3D0K_B 2I3D_B 3DOH_B 3DOI_B 3PFB_A 3S2Z_B 3PFC_A 3QM1_A 3PF8_B 3PF9_A ....
Probab=99.64 E-value=3e-15 Score=135.84 Aligned_cols=145 Identities=26% Similarity=0.325 Sum_probs=110.3
Q ss_pred EEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCChHHHHHHHHHHHHHcCCCCCC
Q 004574 515 CLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLPNDSAEAAVEEVVRRGVADPS 594 (744)
Q Consensus 515 ~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~~~d~~~~~~~l~~~~~~d~~ 594 (744)
+||++||.+.. ... ....+..|+++||.|+. .+.++.+.....+++.++++++.+... |.+
T Consensus 1 ~vv~~HG~~~~-----------~~~----~~~~~~~l~~~G~~v~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 61 (145)
T PF12695_consen 1 VVVLLHGWGGS-----------RRD----YQPLAEALAEQGYAVVA---FDYPGHGDSDGADAVERVLADIRAGYP-DPD 61 (145)
T ss_dssp EEEEECTTTTT-----------THH----HHHHHHHHHHTTEEEEE---ESCTTSTTSHHSHHHHHHHHHHHHHHC-TCC
T ss_pred CEEEECCCCCC-----------HHH----HHHHHHHHHHCCCEEEE---EecCCCCccchhHHHHHHHHHHHhhcC-CCC
Confidence 58999997521 011 23567788999999999 455555555444577888888754322 778
Q ss_pred cEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeC
Q 004574 595 RIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGE 674 (744)
Q Consensus 595 ~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~ 674 (744)
+|+++|||+||.+++.++.+. ..++++|+++|..+ ...+.+.+.|+|+++|+
T Consensus 62 ~i~l~G~S~Gg~~a~~~~~~~-~~v~~~v~~~~~~~---------------------------~~~~~~~~~pv~~i~g~ 113 (145)
T PF12695_consen 62 RIILIGHSMGGAIAANLAARN-PRVKAVVLLSPYPD---------------------------SEDLAKIRIPVLFIHGE 113 (145)
T ss_dssp EEEEEEETHHHHHHHHHHHHS-TTESEEEEESESSG---------------------------CHHHTTTTSEEEEEEET
T ss_pred cEEEEEEccCcHHHHHHhhhc-cceeEEEEecCccc---------------------------hhhhhccCCcEEEEEEC
Confidence 999999999999999999998 78999999999310 01234567899999999
Q ss_pred CCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcc
Q 004574 675 VDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHV 711 (744)
Q Consensus 675 ~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~ 711 (744)
+|..++ .++.++++++++ .+.+++++++++|+
T Consensus 114 ~D~~~~--~~~~~~~~~~~~---~~~~~~~i~g~~H~ 145 (145)
T PF12695_consen 114 NDPLVP--PEQVRRLYEALP---GPKELYIIPGAGHF 145 (145)
T ss_dssp T-SSSH--HHHHHHHHHHHC---SSEEEEEETTS-TT
T ss_pred CCCcCC--HHHHHHHHHHcC---CCcEEEEeCCCcCc
Confidence 999998 999999988886 56799999999995
No 60
>PF06500 DUF1100: Alpha/beta hydrolase of unknown function (DUF1100); InterPro: IPR010520 Proteins in this entry display esterase activity toward pNP-butyrate []. This entry also includes 2,6-dihydropseudooxynicotine hydrolase which has a role in nicotine catabolism by cleaving a C-C bond in 2,6-dihydroxypseudooxyicotine [, ].; PDB: 3OUR_A 3MVE_B 2JBW_C.
Probab=99.64 E-value=3.6e-15 Score=151.00 Aligned_cols=220 Identities=18% Similarity=0.220 Sum_probs=132.2
Q ss_pred CCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhH-HHHHhCCeE
Q 004574 479 SLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSS-LIFLARRFA 557 (744)
Q Consensus 479 ~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~G~~ 557 (744)
..+.+.++|.- +|..+++++.+|++ +++.|+||++-|.. .+........ ..++.+|++
T Consensus 162 ~~~i~~v~iP~-eg~~I~g~LhlP~~-----~~p~P~VIv~gGlD---------------s~qeD~~~l~~~~l~~rGiA 220 (411)
T PF06500_consen 162 DYPIEEVEIPF-EGKTIPGYLHLPSG-----EKPYPTVIVCGGLD---------------SLQEDLYRLFRDYLAPRGIA 220 (411)
T ss_dssp SSEEEEEEEEE-TTCEEEEEEEESSS-----SS-EEEEEEE--TT---------------S-GGGGHHHHHCCCHHCT-E
T ss_pred CCCcEEEEEee-CCcEEEEEEEcCCC-----CCCCCEEEEeCCcc---------------hhHHHHHHHHHHHHHhCCCE
Confidence 44578888874 45899999999985 55799999986632 1211111112 246789999
Q ss_pred EEecCCCCCCCCCCC-------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCC
Q 004574 558 VLAGPSIPIIGEGDK-------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYN 630 (744)
Q Consensus 558 v~~~~~~~~~g~g~~-------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~ 630 (744)
++. .+.+|-|.+ +...-..++++||...++||.+||+++|.|+||+.|.++|..++++++|+|++.|++.
T Consensus 221 ~Lt---vDmPG~G~s~~~~l~~D~~~l~~aVLd~L~~~p~VD~~RV~~~G~SfGGy~AvRlA~le~~RlkavV~~Ga~vh 297 (411)
T PF06500_consen 221 MLT---VDMPGQGESPKWPLTQDSSRLHQAVLDYLASRPWVDHTRVGAWGFSFGGYYAVRLAALEDPRLKAVVALGAPVH 297 (411)
T ss_dssp EEE---E--TTSGGGTTT-S-S-CCHHHHHHHHHHHHSTTEEEEEEEEEEETHHHHHHHHHHHHTTTT-SEEEEES---S
T ss_pred EEE---EccCCCcccccCCCCcCHHHHHHHHHHHHhcCCccChhheEEEEeccchHHHHHHHHhcccceeeEeeeCchHh
Confidence 998 444443332 1112478899999999999999999999999999999999988889999999999765
Q ss_pred CCCCCCcccccc------------cchhhcH----HHHHhcCccc--cc--CCCCCCEEEEeeCCCCCCCCCHHHHHHHH
Q 004574 631 KTLTPFGFQTEF------------RTLWEAT----NVYIEMSPIT--HA--NKIKKPILIIHGEVDDKVGLFPMQAERFF 690 (744)
Q Consensus 631 ~~~~~~~~~~~~------------~~~~~~~----~~~~~~~~~~--~~--~~~~~P~l~i~G~~D~~v~~~~~~~~~~~ 690 (744)
..+......... +...... ..+..+|... .+ ++.++|+|.+.|++|.++| .+..+-+.
T Consensus 298 ~~ft~~~~~~~~P~my~d~LA~rlG~~~~~~~~l~~el~~~SLk~qGlL~~rr~~~plL~i~~~~D~v~P--~eD~~lia 375 (411)
T PF06500_consen 298 HFFTDPEWQQRVPDMYLDVLASRLGMAAVSDESLRGELNKFSLKTQGLLSGRRCPTPLLAINGEDDPVSP--IEDSRLIA 375 (411)
T ss_dssp CGGH-HHHHTTS-HHHHHHHHHHCT-SCE-HHHHHHHGGGGSTTTTTTTTSS-BSS-EEEEEETT-SSS---HHHHHHHH
T ss_pred hhhccHHHHhcCCHHHHHHHHHHhCCccCCHHHHHHHHHhcCcchhccccCCCCCcceEEeecCCCCCCC--HHHHHHHH
Confidence 332211100000 0000001 1123344422 23 5678999999999999988 77665443
Q ss_pred HHHHhCCCcEEEEEeCCCC-cccCccccHHHHHHHHHHHHHHhc
Q 004574 691 DALKGHGALSRLVLLPFEH-HVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 691 ~~l~~~~~~~~~~~~~~~~-H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
..+.+-+...++... |. .....+..+.+||++.|
T Consensus 376 ----~~s~~gk~~~~~~~~~~~-----gy~~al~~~~~Wl~~~l 410 (411)
T PF06500_consen 376 ----ESSTDGKALRIPSKPLHM-----GYPQALDEIYKWLEDKL 410 (411)
T ss_dssp ----HTBTT-EEEEE-SSSHHH-----HHHHHHHHHHHHHHHHH
T ss_pred ----hcCCCCceeecCCCcccc-----chHHHHHHHHHHHHHhc
Confidence 333334555555443 43 23477889999999865
No 61
>PF12715 Abhydrolase_7: Abhydrolase family; PDB: 3NUZ_C 3G8Y_A.
Probab=99.63 E-value=8.5e-16 Score=152.31 Aligned_cols=221 Identities=16% Similarity=0.180 Sum_probs=113.9
Q ss_pred CCCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcc---cCCCCccCCCCchhHHHHHhC
Q 004574 478 ASLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQV---RGSPNEFSGMTPTSSLIFLAR 554 (744)
Q Consensus 478 ~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 554 (744)
..+..|.+.|...++..++++++.|.+. ++|.|+||++||.|..-....+.. ......+..........|+++
T Consensus 84 dGY~~EKv~f~~~p~~~vpaylLvPd~~----~~p~PAVL~lHgHg~~Ke~~~g~~gv~~~~~~~~~~~~~~~g~~LAk~ 159 (390)
T PF12715_consen 84 DGYTREKVEFNTTPGSRVPAYLLVPDGA----KGPFPAVLCLHGHGGGKEKMAGEDGVSPDLKDDYDDPKQDYGDQLAKR 159 (390)
T ss_dssp TTEEEEEEEE--STTB-EEEEEEEETT------S-EEEEEEE--TT--HHHHCT---SSGCG--STTSTTT-HHHHHHTT
T ss_pred CCeEEEEEEEEccCCeeEEEEEEecCCC----CCCCCEEEEeCCCCCCcccccCCcccccccchhhccccccHHHHHHhC
Confidence 3467888999888999999999999874 467999999999653211111100 011111222233467799999
Q ss_pred CeEEEecCCCC--CCCC------CCC-ChH------------------HHHHHHHHHHHHcCCCCCCcEEEEEechHHHH
Q 004574 555 RFAVLAGPSIP--IIGE------GDK-LPN------------------DSAEAAVEEVVRRGVADPSRIAVGGHSYGAFM 607 (744)
Q Consensus 555 G~~v~~~~~~~--~~g~------g~~-~~~------------------~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~ 607 (744)
||+|++++..+ -++. +.. ... -|.+++++||..++.||++||+++|+||||+.
T Consensus 160 GYVvla~D~~g~GER~~~e~~~~~~~~~~~~la~~~l~lG~S~~G~~~~ddmr~lDfL~slpeVD~~RIG~~GfSmGg~~ 239 (390)
T PF12715_consen 160 GYVVLAPDALGFGERGDMEGAAQGSNYDCQALARNLLMLGRSLAGLMAWDDMRALDFLASLPEVDPDRIGCMGFSMGGYR 239 (390)
T ss_dssp TSEEEEE--TTSGGG-SSCCCTTTTS--HHHHHHHHHHTT--HHHHHHHHHHHHHHHHCT-TTEEEEEEEEEEEGGGHHH
T ss_pred CCEEEEEccccccccccccccccccchhHHHHHHHHHHcCcCHHHHHHHHHHHHHHHHhcCcccCccceEEEeecccHHH
Confidence 99999943221 1111 000 000 17888999999999999999999999999999
Q ss_pred HHHHHHhCCCceeEEEEccCCCCCCCC--CCcccccc------cchhhcHHHHHhcCcccccC-C-CCCCEEEEeeCCCC
Q 004574 608 TAHLLAHAPHLFCCGIARSGSYNKTLT--PFGFQTEF------RTLWEATNVYIEMSPITHAN-K-IKKPILIIHGEVDD 677 (744)
Q Consensus 608 a~~~~~~~p~~~~~~v~~~~~~~~~~~--~~~~~~~~------~~~~~~~~~~~~~~~~~~~~-~-~~~P~l~i~G~~D~ 677 (744)
++++++.. ++++++|+.+.+.-.... ........ ...+.....++++--..++. - ...|+|++.|..|.
T Consensus 240 a~~LaALD-dRIka~v~~~~l~~~~~~~~~mt~~~~~~~~~~~~~~~~~iPgl~r~~D~PdIasliAPRPll~~nG~~Dk 318 (390)
T PF12715_consen 240 AWWLAALD-DRIKATVANGYLCTTQERALLMTMPNNNGLRGFPNCICNYIPGLWRYFDFPDIASLIAPRPLLFENGGKDK 318 (390)
T ss_dssp HHHHHHH--TT--EEEEES-B--HHHHHHHB----TTS----SS-GGG--TTCCCC--HHHHHHTTTTS-EEESS-B-HH
T ss_pred HHHHHHcc-hhhHhHhhhhhhhccchhhHhhccccccccCcCcchhhhhCccHHhhCccHHHHHHhCCCcchhhcCCccc
Confidence 99999998 788888877665321000 00000000 00000000000000001111 1 25789999999999
Q ss_pred CCCCCHHHHHHHHHHHHhCCCcEEEEEeCCC
Q 004574 678 KVGLFPMQAERFFDALKGHGALSRLVLLPFE 708 (744)
Q Consensus 678 ~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 708 (744)
.+| . .+..|+. ..++.+++++.||+.
T Consensus 319 lf~--i--V~~AY~~-~~~p~n~~~~~~p~~ 344 (390)
T PF12715_consen 319 LFP--I--VRRAYAI-MGAPDNFQIHHYPKF 344 (390)
T ss_dssp HHH--H--HHHHHHH-TT-GGGEEE---GGG
T ss_pred ccH--H--HHHHHHh-cCCCcceEEeecccc
Confidence 865 3 4444433 234567899999864
No 62
>COG4099 Predicted peptidase [General function prediction only]
Probab=99.63 E-value=4.1e-15 Score=138.89 Aligned_cols=178 Identities=21% Similarity=0.290 Sum_probs=125.4
Q ss_pred CCCeEEEEEEEeCCCCCCCCCCCc-eEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHh--CCeEEEecCCCCC
Q 004574 490 KDGVPLTATLYLPPGYDQSKDGPL-PCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLA--RRFAVLAGPSIPI 566 (744)
Q Consensus 490 ~~g~~l~~~~~~P~~~~~~~~~~~-p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~G~~v~~~~~~~~ 566 (744)
..|.++.+.+|.|+++.+.++ + |+|||+||+|..+.+-.-.+.. +.+ +..++. -++-|++|.+-..
T Consensus 169 ~tgneLkYrly~Pkdy~pdkk--y~PLvlfLHgagq~g~dn~~~l~s------g~g---aiawa~pedqcfVlAPQy~~i 237 (387)
T COG4099 169 STGNELKYRLYTPKDYAPDKK--YYPLVLFLHGAGQGGSDNDKVLSS------GIG---AIAWAGPEDQCFVLAPQYNPI 237 (387)
T ss_pred ccCceeeEEEecccccCCCCc--cccEEEEEecCCCCCchhhhhhhc------Ccc---ceeeecccCceEEEccccccc
Confidence 457899999999999987764 5 9999999987543322221111 111 122222 2345556443332
Q ss_pred CCCCCC---ChHHHHHHHHH-HHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccc
Q 004574 567 IGEGDK---LPNDSAEAAVE-EVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEF 642 (744)
Q Consensus 567 ~g~g~~---~~~~d~~~~~~-~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~ 642 (744)
.....+ .+.....+.+. -+.+++.||.+||+++|.|+||+.++.++.+.|+.|+|++.++|--+.
T Consensus 238 f~d~e~~t~~~l~~~idli~~vlas~ynID~sRIYviGlSrG~~gt~al~~kfPdfFAaa~~iaG~~d~----------- 306 (387)
T COG4099 238 FADSEEKTLLYLIEKIDLILEVLASTYNIDRSRIYVIGLSRGGFGTWALAEKFPDFFAAAVPIAGGGDR----------- 306 (387)
T ss_pred ccccccccchhHHHHHHHHHHHHhhccCcccceEEEEeecCcchhhHHHHHhCchhhheeeeecCCCch-----------
Confidence 222222 22223444454 667789999999999999999999999999999999999999996432
Q ss_pred cchhhcHHHHHhcCcccccCCC-CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeC
Q 004574 643 RTLWEATNVYIEMSPITHANKI-KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLP 706 (744)
Q Consensus 643 ~~~~~~~~~~~~~~~~~~~~~~-~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~ 706 (744)
..+++.+ +.|+-++|+.+|..+| .+.++-+++.|+....++.+..|.
T Consensus 307 ---------------v~lv~~lk~~piWvfhs~dDkv~P--v~nSrv~y~~lk~~~~kv~Ytaf~ 354 (387)
T COG4099 307 ---------------VYLVRTLKKAPIWVFHSSDDKVIP--VSNSRVLYERLKALDRKVNYTAFL 354 (387)
T ss_pred ---------------hhhhhhhccCceEEEEecCCCccc--cCcceeehHHHHhhccccchhhhh
Confidence 1112222 5899999999999999 999999999999988888777666
No 63
>PF07859 Abhydrolase_3: alpha/beta hydrolase fold A web page of Esterases and alpha/beta hydrolases.; InterPro: IPR013094 The alpha/beta hydrolase fold [] is common to a number of hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is an alpha/beta-sheet (rather than a barrel), containing 8 strands connected by helices []. The enzymes are believed to have diverged from a common ancestor, preserving the arrangement of the catalytic residues. All have a catalytic triad, the elements of which are borne on loops, which are the best conserved structural features of the fold. Esterase (EST) from Pseudomonas putida is a member of the alpha/beta hydrolase fold superfamily of enzymes []. In most of the family members the beta-strands are parallels, but some have an inversion of the first strands, which gives it an antiparallel orientation. The catalytic triad residues are presented on loops. One of these is the nucleophile elbow and is the most conserved feature of the fold. Some other members lack one or all of the catalytic residues. Some members are therefore inactive but others are involved in surface recognition. The ESTHER database [] gathers and annotates all the published information related to gene and protein sequences of this superfamily []. This entry represents the catalytic domain fold-3 of alpha/beta hydrolase. ; GO: 0016787 hydrolase activity, 0008152 metabolic process; PDB: 3D7R_B 2C7B_B 3ZWQ_B 2YH2_B 3BXP_A 3D3N_A 1LZK_A 1LZL_A 2O7V_A 2O7R_A ....
Probab=99.63 E-value=1.8e-15 Score=146.82 Aligned_cols=181 Identities=20% Similarity=0.150 Sum_probs=123.6
Q ss_pred EEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHh-CCeEEEecCCCCCCCCCCCChHHHHHHHHHHHHHc---CCC
Q 004574 516 LFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLA-RRFAVLAGPSIPIIGEGDKLPNDSAEAAVEEVVRR---GVA 591 (744)
Q Consensus 516 vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~G~~v~~~~~~~~~g~g~~~~~~d~~~~~~~l~~~---~~~ 591 (744)
||++|||||....... .......+++ .|++|+..+|+..+........+|+.++++|+.++ ..+
T Consensus 1 v~~~HGGg~~~g~~~~------------~~~~~~~la~~~g~~v~~~~Yrl~p~~~~p~~~~D~~~a~~~l~~~~~~~~~ 68 (211)
T PF07859_consen 1 VVYIHGGGWVMGSKES------------HWPFAARLAAERGFVVVSIDYRLAPEAPFPAALEDVKAAYRWLLKNADKLGI 68 (211)
T ss_dssp EEEE--STTTSCGTTT------------HHHHHHHHHHHHTSEEEEEE---TTTSSTTHHHHHHHHHHHHHHHTHHHHTE
T ss_pred CEEECCcccccCChHH------------HHHHHHHHHhhccEEEEEeeccccccccccccccccccceeeeccccccccc
Confidence 7899999987543331 1234556664 89999998877776666667788999999999997 457
Q ss_pred CCCcEEEEEechHHHHHHHHHHhCCC----ceeEEEEccCCCCC-CCCCCccc----ccccchh--hc----HHHHH---
Q 004574 592 DPSRIAVGGHSYGAFMTAHLLAHAPH----LFCCGIARSGSYNK-TLTPFGFQ----TEFRTLW--EA----TNVYI--- 653 (744)
Q Consensus 592 d~~~i~l~G~S~GG~~a~~~~~~~p~----~~~~~v~~~~~~~~-~~~~~~~~----~~~~~~~--~~----~~~~~--- 653 (744)
|+++|+|+|+|.||.+|+.++....+ .++++++++|..+. ........ ......+ .. ...+.
T Consensus 69 d~~~i~l~G~SAGg~la~~~~~~~~~~~~~~~~~~~~~~p~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 148 (211)
T PF07859_consen 69 DPERIVLIGDSAGGHLALSLALRARDRGLPKPKGIILISPWTDLQDFDGPSYDDSNENKDDPFLPAPKIDWFWKLYLPGS 148 (211)
T ss_dssp EEEEEEEEEETHHHHHHHHHHHHHHHTTTCHESEEEEESCHSSTSTSSCHHHHHHHHHSTTSSSBHHHHHHHHHHHHSTG
T ss_pred cccceEEeecccccchhhhhhhhhhhhcccchhhhhcccccccchhcccccccccccccccccccccccccccccccccc
Confidence 88999999999999999999875422 48999999998765 11111110 0000000 00 01111
Q ss_pred -----hcCcccccCCC--CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccC
Q 004574 654 -----EMSPITHANKI--KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYA 713 (744)
Q Consensus 654 -----~~~~~~~~~~~--~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~ 713 (744)
..+|+.. ..+ -.|+++++|+.|.+ .+++..++++|++.|.+++++++++..|+|.
T Consensus 149 ~~~~~~~sp~~~-~~~~~~Pp~~i~~g~~D~l----~~~~~~~~~~L~~~gv~v~~~~~~g~~H~f~ 210 (211)
T PF07859_consen 149 DRDDPLASPLNA-SDLKGLPPTLIIHGEDDVL----VDDSLRFAEKLKKAGVDVELHVYPGMPHGFF 210 (211)
T ss_dssp GTTSTTTSGGGS-SCCTTCHEEEEEEETTSTT----HHHHHHHHHHHHHTT-EEEEEEETTEETTGG
T ss_pred cccccccccccc-cccccCCCeeeeccccccc----hHHHHHHHHHHHHCCCCEEEEEECCCeEEee
Confidence 2244443 122 35899999999986 6788999999999999999999999999874
No 64
>PF02230 Abhydrolase_2: Phospholipase/Carboxylesterase; InterPro: IPR003140 This entry represents the alpha/beta hydrolase domain found in phospholipases [], carboxylesterases [] and thioesterases.; GO: 0016787 hydrolase activity; PDB: 3U0V_A 1AUR_A 1AUO_B 1FJ2_B 3CN9_A 3CN7_A.
Probab=99.62 E-value=5.3e-15 Score=143.39 Aligned_cols=125 Identities=30% Similarity=0.339 Sum_probs=90.2
Q ss_pred HHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcc
Q 004574 579 EAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPI 658 (744)
Q Consensus 579 ~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 658 (744)
.+.++...+. .++++||+|+|+|+||.+|+.++.++|+.+.++|+++|....... ..
T Consensus 91 ~~li~~~~~~-~i~~~ri~l~GFSQGa~~al~~~l~~p~~~~gvv~lsG~~~~~~~---~~------------------- 147 (216)
T PF02230_consen 91 DELIDEEVAY-GIDPSRIFLGGFSQGAAMALYLALRYPEPLAGVVALSGYLPPESE---LE------------------- 147 (216)
T ss_dssp HHHHHHHHHT-T--GGGEEEEEETHHHHHHHHHHHCTSSTSSEEEEES---TTGCC---CH-------------------
T ss_pred HHHHHHHHHc-CCChhheehhhhhhHHHHHHHHHHHcCcCcCEEEEeecccccccc---cc-------------------
Confidence 3334433333 489999999999999999999999999999999999997432100 00
Q ss_pred cccCC-CCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhc
Q 004574 659 THANK-IKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 659 ~~~~~-~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
..... -++|++++||..|+++| ...+++..+.|++.+.+++++.|++.+|.+. .+.+..+.+||.+++
T Consensus 148 ~~~~~~~~~pi~~~hG~~D~vvp--~~~~~~~~~~L~~~~~~v~~~~~~g~gH~i~-----~~~~~~~~~~l~~~~ 216 (216)
T PF02230_consen 148 DRPEALAKTPILIIHGDEDPVVP--FEWAEKTAEFLKAAGANVEFHEYPGGGHEIS-----PEELRDLREFLEKHI 216 (216)
T ss_dssp CCHCCCCTS-EEEEEETT-SSST--HHHHHHHHHHHHCTT-GEEEEEETT-SSS-------HHHHHHHHHHHHHH-
T ss_pred ccccccCCCcEEEEecCCCCccc--HHHHHHHHHHHHhcCCCEEEEEcCCCCCCCC-----HHHHHHHHHHHhhhC
Confidence 00111 16899999999999999 9999999999999999999999999999875 366788899998863
No 65
>PRK05371 x-prolyl-dipeptidyl aminopeptidase; Provisional
Probab=99.62 E-value=9.9e-14 Score=156.54 Aligned_cols=187 Identities=23% Similarity=0.262 Sum_probs=133.1
Q ss_pred hhHHHHHhCCeEEEecCCCCCCC-CCC-----CChHHHHHHHHHHHHHc---------------CCCCCCcEEEEEechH
Q 004574 546 TSSLIFLARRFAVLAGPSIPIIG-EGD-----KLPNDSAEAAVEEVVRR---------------GVADPSRIAVGGHSYG 604 (744)
Q Consensus 546 ~~~~~~~~~G~~v~~~~~~~~~g-~g~-----~~~~~d~~~~~~~l~~~---------------~~~d~~~i~l~G~S~G 604 (744)
.....|+.+||+|+..+.++..+ .|. ....+|+.++|+||..+ .+. ..||+++|.|||
T Consensus 270 ~~~~~~~~rGYaVV~~D~RGtg~SeG~~~~~~~~E~~D~~~vIeWl~~~~~~~~d~~~~~~~kq~Ws-nGkVGm~G~SY~ 348 (767)
T PRK05371 270 SLNDYFLPRGFAVVYVSGIGTRGSDGCPTTGDYQEIESMKAVIDWLNGRATAYTDRTRGKEVKADWS-NGKVAMTGKSYL 348 (767)
T ss_pred hHHHHHHhCCeEEEEEcCCCCCCCCCcCccCCHHHHHHHHHHHHHHhhCCccccccccccccccCCC-CCeeEEEEEcHH
Confidence 45578999999999833333222 111 11234899999999953 333 479999999999
Q ss_pred HHHHHHHHHhCCCceeEEEEccCCCCCCCC--C-------Ccccccc---------------------cchhh-------
Q 004574 605 AFMTAHLLAHAPHLFCCGIARSGSYNKTLT--P-------FGFQTEF---------------------RTLWE------- 647 (744)
Q Consensus 605 G~~a~~~~~~~p~~~~~~v~~~~~~~~~~~--~-------~~~~~~~---------------------~~~~~------- 647 (744)
|++++.+|+..|+.++++|..+++.++... . .++..+. ...+.
T Consensus 349 G~~~~~aAa~~pp~LkAIVp~a~is~~yd~yr~~G~~~~~~g~~ged~d~l~~~~~~r~~~~~~~~~~~~~~~~~~~~~~ 428 (767)
T PRK05371 349 GTLPNAVATTGVEGLETIIPEAAISSWYDYYRENGLVRAPGGYQGEDLDVLAELTYSRNLLAGDYLRHNEACEKLLAELT 428 (767)
T ss_pred HHHHHHHHhhCCCcceEEEeeCCCCcHHHHhhcCCceeccCCcCCcchhhHHHHhhhcccCcchhhcchHHHHHHHhhhh
Confidence 999999999999999999999888653210 0 0110000 00000
Q ss_pred ---------cHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccH
Q 004574 648 ---------ATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENV 718 (744)
Q Consensus 648 ---------~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~ 718 (744)
..+.|.+.++..++.++++|+|++||..|..++ ..++.++++++++.+.+.++.+.++ +|........
T Consensus 429 ~~~~~~~~~y~~fW~~rn~~~~~~kIkvPvLlIhGw~D~~V~--~~~s~~ly~aL~~~g~pkkL~l~~g-~H~~~~~~~~ 505 (767)
T PRK05371 429 AAQDRKTGDYNDFWDDRNYLKDADKIKASVLVVHGLNDWNVK--PKQVYQWWDALPENGVPKKLFLHQG-GHVYPNNWQS 505 (767)
T ss_pred hhhhhcCCCccHHHHhCCHhhHhhCCCCCEEEEeeCCCCCCC--hHHHHHHHHHHHhcCCCeEEEEeCC-CccCCCchhH
Confidence 011233446667788999999999999999998 8999999999999888888877765 5865444445
Q ss_pred HHHHHHHHHHHHHhccCC
Q 004574 719 MHVIWETDRWLQKYCLSN 736 (744)
Q Consensus 719 ~~~~~~~~~fl~~~l~~~ 736 (744)
.++.+.+.+||+++|+..
T Consensus 506 ~d~~e~~~~Wfd~~LkG~ 523 (767)
T PRK05371 506 IDFRDTMNAWFTHKLLGI 523 (767)
T ss_pred HHHHHHHHHHHHhccccC
Confidence 678889999999998754
No 66
>TIGR01607 PST-A Plasmodium subtelomeric family (PST-A). These genes are preferentially located in the subtelomeric regions of the chromosomes of both P. falciparum and P. yoelii.
Probab=99.62 E-value=2.7e-14 Score=147.61 Aligned_cols=229 Identities=15% Similarity=0.137 Sum_probs=139.3
Q ss_pred EEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCc------------ccCCCCccCCCCchhHHHHHhC
Q 004574 487 YQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQ------------VRGSPNEFSGMTPTSSLIFLAR 554 (744)
Q Consensus 487 ~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~ 554 (744)
+.+.+|..+....|.|++ ++.+||++||.+......... +.+ +..|..+....+..|.++
T Consensus 2 ~~~~~g~~l~~~~~~~~~-------~kg~v~i~HG~~eh~~~~~~~~~~~~~~~~~~~~~~-~~ry~~y~~~~~~~l~~~ 73 (332)
T TIGR01607 2 FRNKDGLLLKTYSWIVKN-------AIGIIVLIHGLKSHLRLQFLKINAKIVNNDRAVLID-TDNYYIYKDSWIENFNKN 73 (332)
T ss_pred ccCCCCCeEEEeeeeccC-------CeEEEEEECCCchhhhhhhhhcCcccCCCCeeEEEc-CCcceEeeHHHHHHHHHC
Confidence 456788899999888752 378999999965322110000 000 001111112457788999
Q ss_pred CeEEEecCCCCCCCCCCCC--------------hHHHHHHHHHHHHHc------------------CCCCCCcEEEEEec
Q 004574 555 RFAVLAGPSIPIIGEGDKL--------------PNDSAEAAVEEVVRR------------------GVADPSRIAVGGHS 602 (744)
Q Consensus 555 G~~v~~~~~~~~~g~g~~~--------------~~~d~~~~~~~l~~~------------------~~~d~~~i~l~G~S 602 (744)
||.|++ .+.+|+|.+. ..+|+...++.+.+. ..-...++.|+|||
T Consensus 74 G~~V~~---~D~rGHG~S~~~~~~~g~~~~~~~~v~Dl~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~GhS 150 (332)
T TIGR01607 74 GYSVYG---LDLQGHGESDGLQNLRGHINCFDDLVYDVIQYMNRINDSIILENETKSDDESYDIVNTKENRLPMYIIGLS 150 (332)
T ss_pred CCcEEE---ecccccCCCccccccccchhhHHHHHHHHHHHHHHhhhhhccccccccccccccccccccCCCceeEeecc
Confidence 999999 5555555332 123666666665542 11113579999999
Q ss_pred hHHHHHHHHHHhCCC--------ceeEEEEccCCCCCCCC--------C----------------Ccccccccchh--hc
Q 004574 603 YGAFMTAHLLAHAPH--------LFCCGIARSGSYNKTLT--------P----------------FGFQTEFRTLW--EA 648 (744)
Q Consensus 603 ~GG~~a~~~~~~~p~--------~~~~~v~~~~~~~~~~~--------~----------------~~~~~~~~~~~--~~ 648 (744)
|||.+++.++.+.++ .++++|+.+|+...... . ..........+ ..
T Consensus 151 mGg~i~~~~~~~~~~~~~~~~~~~i~g~i~~s~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~p~~~~~~~~~~~~~~~~ 230 (332)
T TIGR01607 151 MGGNIALRLLELLGKSNENNDKLNIKGCISLSGMISIKSVGSDDSFKFKYFYLPVMNFMSRVFPTFRISKKIRYEKSPYV 230 (332)
T ss_pred CccHHHHHHHHHhccccccccccccceEEEeccceEEecccCCCcchhhhhHHHHHHHHHHHCCcccccCccccccChhh
Confidence 999999998865432 58889988876421100 0 00000000000 00
Q ss_pred HHH-------------------HHhc--CcccccCCC--CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEe
Q 004574 649 TNV-------------------YIEM--SPITHANKI--KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLL 705 (744)
Q Consensus 649 ~~~-------------------~~~~--~~~~~~~~~--~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~ 705 (744)
.+. +... .....+.++ ++|+|++||++|.+++ .+.++++++++.. ..++++++
T Consensus 231 ~~~~~~Dp~~~~~~~s~~~~~~l~~~~~~~~~~~~~i~~~~P~Lii~G~~D~vv~--~~~~~~~~~~~~~--~~~~l~~~ 306 (332)
T TIGR01607 231 NDIIKFDKFRYDGGITFNLASELIKATDTLDCDIDYIPKDIPILFIHSKGDCVCS--YEGTVSFYNKLSI--SNKELHTL 306 (332)
T ss_pred hhHHhcCccccCCcccHHHHHHHHHHHHHHHhhHhhCCCCCCEEEEEeCCCCccC--HHHHHHHHHhccC--CCcEEEEE
Confidence 000 0000 001123445 6899999999999998 8888888776543 34688999
Q ss_pred CCCCcccCccccHHHHHHHHHHHHH
Q 004574 706 PFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 706 ~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
++++|.+......+++++.+.+||+
T Consensus 307 ~g~~H~i~~E~~~~~v~~~i~~wL~ 331 (332)
T TIGR01607 307 EDMDHVITIEPGNEEVLKKIIEWIS 331 (332)
T ss_pred CCCCCCCccCCCHHHHHHHHHHHhh
Confidence 9999999866667889999999985
No 67
>PLN02511 hydrolase
Probab=99.60 E-value=5e-14 Score=148.73 Aligned_cols=229 Identities=13% Similarity=0.080 Sum_probs=140.3
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEe
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLA 560 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~ 560 (744)
..++..+...||..+....+.+.... .....|+||++||.+.. ....+....+..++++||.|++
T Consensus 70 ~~~re~l~~~DG~~~~ldw~~~~~~~--~~~~~p~vvllHG~~g~-------------s~~~y~~~~~~~~~~~g~~vv~ 134 (388)
T PLN02511 70 RYRRECLRTPDGGAVALDWVSGDDRA--LPADAPVLILLPGLTGG-------------SDDSYVRHMLLRARSKGWRVVV 134 (388)
T ss_pred ceeEEEEECCCCCEEEEEecCccccc--CCCCCCEEEEECCCCCC-------------CCCHHHHHHHHHHHHCCCEEEE
Confidence 34445566677777776555432111 11236899999996311 0111111234456789999999
Q ss_pred cCCCCCCCCCCC----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCc--eeEEEEccCC
Q 004574 561 GPSIPIIGEGDK----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHL--FCCGIARSGS 628 (744)
Q Consensus 561 ~~~~~~~g~g~~----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~--~~~~v~~~~~ 628 (744)
++.+|+|.+ ...+|+.++++++..+.. ..++.++||||||.+++.++.++|+. +++++++++.
T Consensus 135 ---~d~rG~G~s~~~~~~~~~~~~~~Dl~~~i~~l~~~~~--~~~~~lvG~SlGg~i~~~yl~~~~~~~~v~~~v~is~p 209 (388)
T PLN02511 135 ---FNSRGCADSPVTTPQFYSASFTGDLRQVVDHVAGRYP--SANLYAAGWSLGANILVNYLGEEGENCPLSGAVSLCNP 209 (388)
T ss_pred ---EecCCCCCCCCCCcCEEcCCchHHHHHHHHHHHHHCC--CCCEEEEEechhHHHHHHHHHhcCCCCCceEEEEECCC
Confidence 555565543 234589999999987632 35899999999999999999999886 7888877766
Q ss_pred CCCCC------CCC--c----ccc--------------------------cccch--h-----------hcH-HHHHhcC
Q 004574 629 YNKTL------TPF--G----FQT--------------------------EFRTL--W-----------EAT-NVYIEMS 656 (744)
Q Consensus 629 ~~~~~------~~~--~----~~~--------------------------~~~~~--~-----------~~~-~~~~~~~ 656 (744)
.+... ..+ . +.. ..... + ... +.|.+.+
T Consensus 210 ~~l~~~~~~~~~~~~~~y~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fd~~~t~~~~gf~~~~~yy~~~s 289 (388)
T PLN02511 210 FDLVIADEDFHKGFNNVYDKALAKALRKIFAKHALLFEGLGGEYNIPLVANAKTVRDFDDGLTRVSFGFKSVDAYYSNSS 289 (388)
T ss_pred cCHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhCCCccCHHHHHhCCCHHHHHHhhhhhcCCCCCHHHHHHHcC
Confidence 54210 000 0 000 00000 0 000 1123344
Q ss_pred cccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHH------HHHHHHHHHHH
Q 004574 657 PITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVM------HVIWETDRWLQ 730 (744)
Q Consensus 657 ~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~------~~~~~~~~fl~ 730 (744)
+...+.++++|+|+|+|++|+++| ...... .+......+++++++++||..... .+. -+.+.+.+||.
T Consensus 290 ~~~~L~~I~vPtLiI~g~dDpi~p--~~~~~~---~~~~~~p~~~l~~~~~gGH~~~~E-~p~~~~~~~w~~~~i~~Fl~ 363 (388)
T PLN02511 290 SSDSIKHVRVPLLCIQAANDPIAP--ARGIPR---EDIKANPNCLLIVTPSGGHLGWVA-GPEAPFGAPWTDPVVMEFLE 363 (388)
T ss_pred chhhhccCCCCeEEEEcCCCCcCC--cccCcH---hHHhcCCCEEEEECCCcceecccc-CCCCCCCCccHHHHHHHHHH
Confidence 556778899999999999999987 443311 122234568999999999976532 222 24677889998
Q ss_pred HhccC
Q 004574 731 KYCLS 735 (744)
Q Consensus 731 ~~l~~ 735 (744)
.....
T Consensus 364 ~~~~~ 368 (388)
T PLN02511 364 ALEEG 368 (388)
T ss_pred HHHHh
Confidence 76543
No 68
>TIGR03611 RutD pyrimidine utilization protein D. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the hydrolase, alpha/beta fold family defined by pfam00067.
Probab=99.59 E-value=2.3e-14 Score=143.93 Aligned_cols=190 Identities=17% Similarity=0.182 Sum_probs=120.2
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC-------hHHHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-------PNDSAEAAVEEV 585 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-------~~~d~~~~~~~l 585 (744)
.|+||++||.+.. .. .+......+.+||.|++ .+.+|+|.+. ..++..+.+..+
T Consensus 13 ~~~iv~lhG~~~~-----------~~-----~~~~~~~~l~~~~~vi~---~D~~G~G~S~~~~~~~~~~~~~~~~~~~~ 73 (257)
T TIGR03611 13 APVVVLSSGLGGS-----------GS-----YWAPQLDVLTQRFHVVT---YDHRGTGRSPGELPPGYSIAHMADDVLQL 73 (257)
T ss_pred CCEEEEEcCCCcc-----------hh-----HHHHHHHHHHhccEEEE---EcCCCCCCCCCCCcccCCHHHHHHHHHHH
Confidence 6889999996421 01 11122334467899999 5555655442 122333333333
Q ss_pred HHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCC----------------c---------cc-
Q 004574 586 VRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPF----------------G---------FQ- 639 (744)
Q Consensus 586 ~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~----------------~---------~~- 639 (744)
.+. ++..++.++||||||.+|+.++.++|+.++++|++++......... . +.
T Consensus 74 i~~--~~~~~~~l~G~S~Gg~~a~~~a~~~~~~v~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (257)
T TIGR03611 74 LDA--LNIERFHFVGHALGGLIGLQLALRYPERLLSLVLINAWSRPDPHTRRCFDVRIALLQHAGPEAYVHAQALFLYPA 151 (257)
T ss_pred HHH--hCCCcEEEEEechhHHHHHHHHHHChHHhHHheeecCCCCCChhHHHHHHHHHHHHhccCcchhhhhhhhhhccc
Confidence 332 2346899999999999999999999999999998887533210000 0 00
Q ss_pred ---cccc-----------chhhc-------HHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCC
Q 004574 640 ---TEFR-----------TLWEA-------TNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGA 698 (744)
Q Consensus 640 ---~~~~-----------~~~~~-------~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~ 698 (744)
.... ..+.. ...+...+....+.++++|+|+++|++|..+| .+.++++++.+.
T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~P~l~i~g~~D~~~~--~~~~~~~~~~~~---- 225 (257)
T TIGR03611 152 DWISENAARLAADEAHALAHFPGKANVLRRINALEAFDVSARLDRIQHPVLLIANRDDMLVP--YTQSLRLAAALP---- 225 (257)
T ss_pred cHhhccchhhhhhhhhcccccCccHHHHHHHHHHHcCCcHHHhcccCccEEEEecCcCcccC--HHHHHHHHHhcC----
Confidence 0000 00000 00111122334466789999999999999998 888888776653
Q ss_pred cEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 699 LSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 699 ~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
..+++++++++|.+. .+.++.+.+.+.+||+
T Consensus 226 ~~~~~~~~~~gH~~~-~~~~~~~~~~i~~fl~ 256 (257)
T TIGR03611 226 NAQLKLLPYGGHASN-VTDPETFNRALLDFLK 256 (257)
T ss_pred CceEEEECCCCCCcc-ccCHHHHHHHHHHHhc
Confidence 348888999999876 5678889999999985
No 69
>TIGR00976 /NonD putative hydrolase, CocE/NonD family. This model represents a protein subfamily that includes the cocaine esterase CocE, several glutaryl-7-ACA acylases, and the putative diester hydrolase NonD of Streptomyces griseus (all hydrolases). This family shows extensive, low-level similarity to a family of xaa-pro dipeptidyl-peptidases, and local similarity by PSI-BLAST to many other hydrolases.
Probab=99.59 E-value=6.5e-14 Score=155.18 Aligned_cols=223 Identities=19% Similarity=0.253 Sum_probs=142.2
Q ss_pred cCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCC
Q 004574 489 RKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIG 568 (744)
Q Consensus 489 ~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g 568 (744)
..+|.++.+.+|.|.+ .+++|+||++||.+.... .........+..|+++||+|+.. +.+|
T Consensus 3 ~~DG~~L~~~~~~P~~-----~~~~P~Il~~~gyg~~~~-----------~~~~~~~~~~~~l~~~Gy~vv~~---D~RG 63 (550)
T TIGR00976 3 MRDGTRLAIDVYRPAG-----GGPVPVILSRTPYGKDAG-----------LRWGLDKTEPAWFVAQGYAVVIQ---DTRG 63 (550)
T ss_pred CCCCCEEEEEEEecCC-----CCCCCEEEEecCCCCchh-----------hccccccccHHHHHhCCcEEEEE---eccc
Confidence 4689999999999975 336899999998642100 00011112356789999999993 3344
Q ss_pred CCCC---------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC---CC
Q 004574 569 EGDK---------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT---PF 636 (744)
Q Consensus 569 ~g~~---------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~---~~ 636 (744)
+|.+ ...+|+.++++|+.++++.+ .+|+++|+||||.+++.++..+|+.++++|..++..+.... ..
T Consensus 64 ~g~S~g~~~~~~~~~~~D~~~~i~~l~~q~~~~-~~v~~~G~S~GG~~a~~~a~~~~~~l~aiv~~~~~~d~~~~~~~~g 142 (550)
T TIGR00976 64 RGASEGEFDLLGSDEAADGYDLVDWIAKQPWCD-GNVGMLGVSYLAVTQLLAAVLQPPALRAIAPQEGVWDLYRDIAFPG 142 (550)
T ss_pred cccCCCceEecCcccchHHHHHHHHHHhCCCCC-CcEEEEEeChHHHHHHHHhccCCCceeEEeecCcccchhHhhccCC
Confidence 3332 23459999999999998776 69999999999999999999999999999998887653210 00
Q ss_pred ccccc------------ccc--------h-------hh---cHH-------------------------------HHHhc
Q 004574 637 GFQTE------------FRT--------L-------WE---ATN-------------------------------VYIEM 655 (744)
Q Consensus 637 ~~~~~------------~~~--------~-------~~---~~~-------------------------------~~~~~ 655 (744)
.+... ... . .. ... .+...
T Consensus 143 ~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~ 222 (550)
T TIGR00976 143 ALRLDVLLGWWALLATDSMRPRADDRPPRYAAAARLAQSYDDCQTALSHTPRSSVLALDRFIGWWIQVVDDDYDESWVSI 222 (550)
T ss_pred eeccchhHHHHHhhccccccccccccccchHHHHHHhhhhhhHHHHHhcCCccccccccccchhhhhccCCCCChhhccC
Confidence 00000 000 0 00 000 00011
Q ss_pred CcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCc--------cccHHHHHH--HH
Q 004574 656 SPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAA--------RENVMHVIW--ET 725 (744)
Q Consensus 656 ~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~--------~~~~~~~~~--~~ 725 (744)
+....+.++++|+|++.|-.|.. ...+.+.++++...+ +.++++-|. .|.... ......+.. ..
T Consensus 223 ~~~~~~~~i~vP~l~~~gw~D~~----~~g~~~~~~~~~~~~-~~~lilGpw-~H~~~~~~~~~~~~g~~~~~~~~~~~~ 296 (550)
T TIGR00976 223 SLWRDLGGSDVPTLVTGGWYDNH----SRGSIRLFLAVHRGG-AQRLVVGPW-THSGLGGRVGDGNYGMAALSWVDEAEQ 296 (550)
T ss_pred chhhHhcCCCCCEEEeCcccCCC----CchHHHHHHHHhhcC-CceEEEccC-CCCCcccccCCCccCccccccchhhhh
Confidence 22234567899999999999943 466777788877654 567776665 375210 000011122 46
Q ss_pred HHHHHHhccCCC
Q 004574 726 DRWLQKYCLSNT 737 (744)
Q Consensus 726 ~~fl~~~l~~~~ 737 (744)
++||+++|+...
T Consensus 297 ~~wfD~~Lkg~~ 308 (550)
T TIGR00976 297 LAFFDRHLKGGT 308 (550)
T ss_pred HHHHHHHhCCCC
Confidence 899999998643
No 70
>TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself.
Probab=99.59 E-value=4.2e-14 Score=144.34 Aligned_cols=194 Identities=13% Similarity=0.066 Sum_probs=121.3
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCChH--H-----HHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLPN--D-----SAEAAVEEV 585 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~~--~-----d~~~~~~~l 585 (744)
.|.||++||.+... ..+.. ....+..+++.||.|++ .+.+|+|.+... + ...+.+..+
T Consensus 30 ~~~ivllHG~~~~~-----------~~~~~-~~~~~~~l~~~~~~vi~---~D~~G~G~S~~~~~~~~~~~~~~~~l~~~ 94 (282)
T TIGR03343 30 GEAVIMLHGGGPGA-----------GGWSN-YYRNIGPFVDAGYRVIL---KDSPGFNKSDAVVMDEQRGLVNARAVKGL 94 (282)
T ss_pred CCeEEEECCCCCch-----------hhHHH-HHHHHHHHHhCCCEEEE---ECCCCCCCCCCCcCcccccchhHHHHHHH
Confidence 36799999964210 11110 01223456778999999 556666665321 0 112223233
Q ss_pred HHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC--CCCCc--------------cc----------
Q 004574 586 VRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT--LTPFG--------------FQ---------- 639 (744)
Q Consensus 586 ~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~--~~~~~--------------~~---------- 639 (744)
.+. .+.++++++||||||.+++.++.++|++++++|++++..... ..... ..
T Consensus 95 l~~--l~~~~~~lvG~S~Gg~ia~~~a~~~p~~v~~lvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (282)
T TIGR03343 95 MDA--LDIEKAHLVGNSMGGATALNFALEYPDRIGKLILMGPGGLGPSLFAPMPMEGIKLLFKLYAEPSYETLKQMLNVF 172 (282)
T ss_pred HHH--cCCCCeeEEEECchHHHHHHHHHhChHhhceEEEECCCCCCccccccCchHHHHHHHHHhcCCCHHHHHHHHhhC
Confidence 332 234689999999999999999999999999999988742110 00000 00
Q ss_pred --ccc-------cchhh----cHHH---HHh---------cCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHH
Q 004574 640 --TEF-------RTLWE----ATNV---YIE---------MSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALK 694 (744)
Q Consensus 640 --~~~-------~~~~~----~~~~---~~~---------~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~ 694 (744)
... ...|. .+.. +.. .+....++++++|+|+++|++|..++ .+.++++.+.++
T Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~Pvlli~G~~D~~v~--~~~~~~~~~~~~ 250 (282)
T TIGR03343 173 LFDQSLITEELLQGRWENIQRQPEHLKNFLISSQKAPLSTWDVTARLGEIKAKTLVTWGRDDRFVP--LDHGLKLLWNMP 250 (282)
T ss_pred ccCcccCcHHHHHhHHHHhhcCHHHHHHHHHhccccccccchHHHHHhhCCCCEEEEEccCCCcCC--chhHHHHHHhCC
Confidence 000 00000 0000 000 01112356789999999999999998 888887776653
Q ss_pred hCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 695 GHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 695 ~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
++++++++++||... .+.+..+.+.+.+||.
T Consensus 251 ----~~~~~~i~~agH~~~-~e~p~~~~~~i~~fl~ 281 (282)
T TIGR03343 251 ----DAQLHVFSRCGHWAQ-WEHADAFNRLVIDFLR 281 (282)
T ss_pred ----CCEEEEeCCCCcCCc-ccCHHHHHHHHHHHhh
Confidence 568999999999976 6778899999999985
No 71
>PLN00021 chlorophyllase
Probab=99.58 E-value=1.5e-13 Score=139.02 Aligned_cols=208 Identities=17% Similarity=0.115 Sum_probs=134.3
Q ss_pred eEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC
Q 004574 493 VPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK 572 (744)
Q Consensus 493 ~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~ 572 (744)
..+++.+|.|.. .+++|+||++||.+.. .......+..|+++||.|++++.++..+....
T Consensus 37 ~~~p~~v~~P~~-----~g~~PvVv~lHG~~~~---------------~~~y~~l~~~Las~G~~VvapD~~g~~~~~~~ 96 (313)
T PLN00021 37 PPKPLLVATPSE-----AGTYPVLLFLHGYLLY---------------NSFYSQLLQHIASHGFIVVAPQLYTLAGPDGT 96 (313)
T ss_pred CCceEEEEeCCC-----CCCCCEEEEECCCCCC---------------cccHHHHHHHHHhCCCEEEEecCCCcCCCCch
Confidence 468889999975 3458999999997521 11122456778899999999665443322222
Q ss_pred ChHHHHHHHHHHHHHc--------CCCCCCcEEEEEechHHHHHHHHHHhCCC-----ceeEEEEccCCCCCCCCCCccc
Q 004574 573 LPNDSAEAAVEEVVRR--------GVADPSRIAVGGHSYGAFMTAHLLAHAPH-----LFCCGIARSGSYNKTLTPFGFQ 639 (744)
Q Consensus 573 ~~~~d~~~~~~~l~~~--------~~~d~~~i~l~G~S~GG~~a~~~~~~~p~-----~~~~~v~~~~~~~~~~~~~~~~ 639 (744)
...+++.++++|+.+. ..+|.++++|+||||||.+|+.++..+++ .++++|++.|+....... .
T Consensus 97 ~~i~d~~~~~~~l~~~l~~~l~~~~~~d~~~v~l~GHS~GG~iA~~lA~~~~~~~~~~~v~ali~ldPv~g~~~~~---~ 173 (313)
T PLN00021 97 DEIKDAAAVINWLSSGLAAVLPEGVRPDLSKLALAGHSRGGKTAFALALGKAAVSLPLKFSALIGLDPVDGTSKGK---Q 173 (313)
T ss_pred hhHHHHHHHHHHHHhhhhhhcccccccChhheEEEEECcchHHHHHHHhhccccccccceeeEEeecccccccccc---C
Confidence 3345678888888753 23667899999999999999999998864 589999998874422110 0
Q ss_pred ccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCC---------CCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCc
Q 004574 640 TEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDD---------KVGLFPMQAERFFDALKGHGALSRLVLLPFEHH 710 (744)
Q Consensus 640 ~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~---------~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H 710 (744)
. .++. + .+ ....-++.+|+|++++..|. ..| +..+..+++++.+ .+..+.+.++++|
T Consensus 174 ~---~p~i----l-~~--~~~s~~~~~P~liig~g~~~~~~~~~~p~~ap-~~~~~~~f~~~~~---~~~~~~~~~~~gH 239 (313)
T PLN00021 174 T---PPPV----L-TY--APHSFNLDIPVLVIGTGLGGEPRNPLFPPCAP-DGVNHAEFFNECK---APAVHFVAKDYGH 239 (313)
T ss_pred C---CCcc----c-cc--CcccccCCCCeEEEecCCCcccccccccccCC-CCCCHHHHHHhcC---CCeeeeeecCCCc
Confidence 0 0000 0 00 11122368999999999763 122 1233466776554 4678889999999
Q ss_pred ccCcc----------------------ccHHHHHHHHHHHHHHhccCCC
Q 004574 711 VYAAR----------------------ENVMHVIWETDRWLQKYCLSNT 737 (744)
Q Consensus 711 ~~~~~----------------------~~~~~~~~~~~~fl~~~l~~~~ 737 (744)
+-... ...+.+...+..||..+|....
T Consensus 240 ~~~~~~~~~~~~~~~~~~~c~~g~~~~~~r~~~~g~~~aFl~~~l~~~~ 288 (313)
T PLN00021 240 MDMLDDDTSGIRGKITGCMCKNGKPRKPMRRFVGGAVVAFLKAYLEGDT 288 (313)
T ss_pred ceeecCCCccccccccccccCCCCchHHHHHHHHHHHHHHHHHHhcCch
Confidence 63311 1123445568999999886543
No 72
>PRK00870 haloalkane dehalogenase; Provisional
Probab=99.58 E-value=9.9e-14 Score=142.75 Aligned_cols=221 Identities=15% Similarity=0.135 Sum_probs=130.3
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEe
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLA 560 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~ 560 (744)
....+.+...+|..+... |...+ ....|.||++||.+.. ......++..|.++||.|++
T Consensus 20 ~~~~~~~~~~~~~~~~i~-y~~~G-----~~~~~~lvliHG~~~~---------------~~~w~~~~~~L~~~gy~vi~ 78 (302)
T PRK00870 20 APHYVDVDDGDGGPLRMH-YVDEG-----PADGPPVLLLHGEPSW---------------SYLYRKMIPILAAAGHRVIA 78 (302)
T ss_pred CceeEeecCCCCceEEEE-EEecC-----CCCCCEEEEECCCCCc---------------hhhHHHHHHHHHhCCCEEEE
Confidence 444566655455544322 22222 1124689999996411 11112344566678999999
Q ss_pred cCCCCCCCCCCCC--------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC
Q 004574 561 GPSIPIIGEGDKL--------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT 632 (744)
Q Consensus 561 ~~~~~~~g~g~~~--------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~ 632 (744)
.+.+|+|.+. ..+++.+.+..+.+. ++.+++.|+||||||.+|+.++.++|++++++|++++.....
T Consensus 79 ---~Dl~G~G~S~~~~~~~~~~~~~~a~~l~~~l~~--l~~~~v~lvGhS~Gg~ia~~~a~~~p~~v~~lvl~~~~~~~~ 153 (302)
T PRK00870 79 ---PDLIGFGRSDKPTRREDYTYARHVEWMRSWFEQ--LDLTDVTLVCQDWGGLIGLRLAAEHPDRFARLVVANTGLPTG 153 (302)
T ss_pred ---ECCCCCCCCCCCCCcccCCHHHHHHHHHHHHHH--cCCCCEEEEEEChHHHHHHHHHHhChhheeEEEEeCCCCCCc
Confidence 5666666542 122333334333333 233689999999999999999999999999999987632100
Q ss_pred CC--C-----C-ccccc------------ccchh---hcHHHHH----------------hc---C-----------ccc
Q 004574 633 LT--P-----F-GFQTE------------FRTLW---EATNVYI----------------EM---S-----------PIT 659 (744)
Q Consensus 633 ~~--~-----~-~~~~~------------~~~~~---~~~~~~~----------------~~---~-----------~~~ 659 (744)
.. . + .+... ....+ +....+. .. . ...
T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (302)
T PRK00870 154 DGPMPDAFWAWRAFSQYSPVLPVGRLVNGGTVRDLSDAVRAAYDAPFPDESYKAGARAFPLLVPTSPDDPAVAANRAAWA 233 (302)
T ss_pred cccchHHHhhhhcccccCchhhHHHHhhccccccCCHHHHHHhhcccCChhhhcchhhhhhcCCCCCCCcchHHHHHHHH
Confidence 00 0 0 00000 00000 0000000 00 0 012
Q ss_pred ccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 660 HANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 660 ~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
.+.++++|+|+++|++|..++ ... +++.+.+... ..+.+.++++++|... .+.++.+.+.+.+||.++
T Consensus 234 ~l~~i~~P~lii~G~~D~~~~--~~~-~~~~~~~~~~-~~~~~~~i~~~gH~~~-~e~p~~~~~~l~~fl~~~ 301 (302)
T PRK00870 234 VLERWDKPFLTAFSDSDPITG--GGD-AILQKRIPGA-AGQPHPTIKGAGHFLQ-EDSGEELAEAVLEFIRAT 301 (302)
T ss_pred hhhcCCCceEEEecCCCCccc--Cch-HHHHhhcccc-cccceeeecCCCccch-hhChHHHHHHHHHHHhcC
Confidence 346789999999999999988 644 6666555432 1235789999999986 677889999999999764
No 73
>KOG4178 consensus Soluble epoxide hydrolase [Lipid transport and metabolism]
Probab=99.58 E-value=3.4e-13 Score=130.70 Aligned_cols=194 Identities=16% Similarity=0.132 Sum_probs=125.8
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCChH--------H-HHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLPN--------D-SAEAAVE 583 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~~--------~-d~~~~~~ 583 (744)
-|+|+++||.+ ..+..|. .+...|+++||.|++ .+.+|+|.++.- . -+.+.+.
T Consensus 44 gP~illlHGfP--------------e~wyswr-~q~~~la~~~~rviA---~DlrGyG~Sd~P~~~~~Yt~~~l~~di~~ 105 (322)
T KOG4178|consen 44 GPIVLLLHGFP--------------ESWYSWR-HQIPGLASRGYRVIA---PDLRGYGFSDAPPHISEYTIDELVGDIVA 105 (322)
T ss_pred CCEEEEEccCC--------------ccchhhh-hhhhhhhhcceEEEe---cCCCCCCCCCCCCCcceeeHHHHHHHHHH
Confidence 79999999986 3333333 455678999999999 777888776421 1 2222223
Q ss_pred HHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCC--------CCC-------CCCcccc----c---
Q 004574 584 EVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYN--------KTL-------TPFGFQT----E--- 641 (744)
Q Consensus 584 ~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~--------~~~-------~~~~~~~----~--- 641 (744)
.|...+ -+|+.++||+||+.+|+.++..+|++++++|+++.++. ... ....++. +
T Consensus 106 lld~Lg---~~k~~lvgHDwGaivaw~la~~~Perv~~lv~~nv~~~~p~~~~~~~~~~~f~~~~y~~~fQ~~~~~E~~~ 182 (322)
T KOG4178|consen 106 LLDHLG---LKKAFLVGHDWGAIVAWRLALFYPERVDGLVTLNVPFPNPKLKPLDSSKAIFGKSYYICLFQEPGKPETEL 182 (322)
T ss_pred HHHHhc---cceeEEEeccchhHHHHHHHHhChhhcceEEEecCCCCCcccchhhhhccccCccceeEeccccCcchhhh
Confidence 333333 37999999999999999999999999999999875432 000 0000000 0
Q ss_pred ----------------------------ccchhh---cH-------------------HHHHhcC--cccccCCCCCCEE
Q 004574 642 ----------------------------FRTLWE---AT-------------------NVYIEMS--PITHANKIKKPIL 669 (744)
Q Consensus 642 ----------------------------~~~~~~---~~-------------------~~~~~~~--~~~~~~~~~~P~l 669 (744)
....|. .. +.+.... ......++++|++
T Consensus 183 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~t~edi~~~~~~f~~~g~~gplNyyrn~~r~w~a~~~~~~~i~iPv~ 262 (322)
T KOG4178|consen 183 SKDDTEMLVKTFRTRKTPGPLIVPKQPNENPLWLTEEDIAFYVSKFQIDGFTGPLNYYRNFRRNWEAAPWALAKITIPVL 262 (322)
T ss_pred ccchhHHhHHhhhccccCCccccCCCCCCccchhhHHHHHHHHhccccccccccchhhHHHhhCchhccccccccccceE
Confidence 000011 11 1111111 1344567899999
Q ss_pred EEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 670 IIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 670 ~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
++.|+.|...+ ...-.+.++..-. .-.+.+++++++|... .++++.+++++++||++.
T Consensus 263 fi~G~~D~v~~--~p~~~~~~rk~vp--~l~~~vv~~~~gH~vq-qe~p~~v~~~i~~f~~~~ 320 (322)
T KOG4178|consen 263 FIWGDLDPVLP--YPIFGELYRKDVP--RLTERVVIEGIGHFVQ-QEKPQEVNQAILGFINSF 320 (322)
T ss_pred EEEecCccccc--chhHHHHHHHhhc--cccceEEecCCccccc-ccCHHHHHHHHHHHHHhh
Confidence 99999999987 5533334333321 1126888999999887 788999999999999875
No 74
>PRK10985 putative hydrolase; Provisional
Probab=99.56 E-value=4e-13 Score=139.09 Aligned_cols=220 Identities=19% Similarity=0.116 Sum_probs=131.8
Q ss_pred EEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCC
Q 004574 487 YQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPI 566 (744)
Q Consensus 487 ~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~ 566 (744)
+...||..+.......+. ...+.|+||++||.+.. ....+....+..|+++||.|+..+ .
T Consensus 36 ~~~~dg~~~~l~w~~~~~----~~~~~p~vll~HG~~g~-------------~~~~~~~~~~~~l~~~G~~v~~~d---~ 95 (324)
T PRK10985 36 LELPDGDFVDLAWSEDPA----QARHKPRLVLFHGLEGS-------------FNSPYAHGLLEAAQKRGWLGVVMH---F 95 (324)
T ss_pred EECCCCCEEEEecCCCCc----cCCCCCEEEEeCCCCCC-------------CcCHHHHHHHHHHHHCCCEEEEEe---C
Confidence 445577655443321111 12247899999996311 011112234567889999999944 3
Q ss_pred CCCCCC----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCc--eeEEEEccCCCCCCCC
Q 004574 567 IGEGDK----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHL--FCCGIARSGSYNKTLT 634 (744)
Q Consensus 567 ~g~g~~----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~--~~~~v~~~~~~~~~~~ 634 (744)
+|+|.+ ...+|+..++++++++.. ..+++++||||||.+++.+++++++. ++++|+++++++....
T Consensus 96 rG~g~~~~~~~~~~~~~~~~D~~~~i~~l~~~~~--~~~~~~vG~S~GG~i~~~~~~~~~~~~~~~~~v~i~~p~~~~~~ 173 (324)
T PRK10985 96 RGCSGEPNRLHRIYHSGETEDARFFLRWLQREFG--HVPTAAVGYSLGGNMLACLLAKEGDDLPLDAAVIVSAPLMLEAC 173 (324)
T ss_pred CCCCCCccCCcceECCCchHHHHHHHHHHHHhCC--CCCEEEEEecchHHHHHHHHHhhCCCCCccEEEEEcCCCCHHHH
Confidence 444322 134689999999998643 35799999999999888888776543 7888888887542100
Q ss_pred CC----c----cc-------------------c------cc----c----------ch---h-hcHHHHHhcCcccccCC
Q 004574 635 PF----G----FQ-------------------T------EF----R----------TL---W-EATNVYIEMSPITHANK 663 (744)
Q Consensus 635 ~~----~----~~-------------------~------~~----~----------~~---~-~~~~~~~~~~~~~~~~~ 663 (744)
.. . +. . +. . .+ + ...+.|...+....+.+
T Consensus 174 ~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fd~~~~~~~~g~~~~~~~y~~~~~~~~l~~ 253 (324)
T PRK10985 174 SYRMEQGFSRVYQRYLLNLLKANAARKLAAYPGTLPINLAQLKSVRRLREFDDLITARIHGFADAIDYYRQCSALPLLNQ 253 (324)
T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhccccccCCHHHHhcCCcHHHHhhhheeccCCCCCHHHHHHHCChHHHHhC
Confidence 00 0 00 0 00 0 00 0 00122233344556788
Q ss_pred CCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccc---cH-HHHHHHHHHHHHHhcc
Q 004574 664 IKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARE---NV-MHVIWETDRWLQKYCL 734 (744)
Q Consensus 664 ~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~---~~-~~~~~~~~~fl~~~l~ 734 (744)
+++|+|+++|++|++++ ......+ .+...++++.+++++||...... .. .-.-+.+.+||...+.
T Consensus 254 i~~P~lii~g~~D~~~~--~~~~~~~----~~~~~~~~~~~~~~~GH~~~~~g~~~~~~~w~~~~~~~~~~~~~~ 322 (324)
T PRK10985 254 IRKPTLIIHAKDDPFMT--HEVIPKP----ESLPPNVEYQLTEHGGHVGFVGGTLLKPQMWLEQRIPDWLTTYLE 322 (324)
T ss_pred CCCCEEEEecCCCCCCC--hhhChHH----HHhCCCeEEEECCCCCceeeCCCCCCCCCccHHHHHHHHHHHhhc
Confidence 99999999999999987 5555443 22335678999999999755322 12 2234458888876653
No 75
>TIGR02240 PHA_depoly_arom poly(3-hydroxyalkanoate) depolymerase. This family consists of the polyhydroxyalkanoic acid (PHA) depolymerase of Pseudomonas oleovorans, Pseudomonas putida BM01, and related species. This enzyme is part of polyester storage and mobilization system as in many bacteria. However, species containing this enzyme are unusual in their capacity to produce aromatic polyesters when grown on carbon sources such as benzoic acid or phenylacetic acid.
Probab=99.56 E-value=2.4e-13 Score=137.97 Aligned_cols=209 Identities=13% Similarity=0.113 Sum_probs=128.3
Q ss_pred CCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCC
Q 004574 490 KDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGE 569 (744)
Q Consensus 490 ~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~ 569 (744)
.+|.+++..... .+ . ..+.||++||.+.. ... ...++.. +..+|.|++ .+.+|+
T Consensus 9 ~~~~~~~~~~~~-~~-----~-~~~plvllHG~~~~--------------~~~-w~~~~~~-L~~~~~vi~---~Dl~G~ 62 (276)
T TIGR02240 9 LDGQSIRTAVRP-GK-----E-GLTPLLIFNGIGAN--------------LEL-VFPFIEA-LDPDLEVIA---FDVPGV 62 (276)
T ss_pred cCCcEEEEEEec-CC-----C-CCCcEEEEeCCCcc--------------hHH-HHHHHHH-hccCceEEE---ECCCCC
Confidence 456666664431 11 1 13578999996421 111 1122333 456799999 666777
Q ss_pred CCCC------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCC----------
Q 004574 570 GDKL------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTL---------- 633 (744)
Q Consensus 570 g~~~------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~---------- 633 (744)
|.+. ..+++.+.+..+.+.. +.+++.|+||||||.+++.+|.++|++++++|++++......
T Consensus 63 G~S~~~~~~~~~~~~~~~~~~~i~~l--~~~~~~LvG~S~GG~va~~~a~~~p~~v~~lvl~~~~~~~~~~~~~~~~~~~ 140 (276)
T TIGR02240 63 GGSSTPRHPYRFPGLAKLAARMLDYL--DYGQVNAIGVSWGGALAQQFAHDYPERCKKLILAATAAGAVMVPGKPKVLMM 140 (276)
T ss_pred CCCCCCCCcCcHHHHHHHHHHHHHHh--CcCceEEEEECHHHHHHHHHHHHCHHHhhheEEeccCCccccCCCchhHHHH
Confidence 7653 1234444444444432 236899999999999999999999999999999987542100
Q ss_pred -CC-Cccccc----------ccchh-hcHH----------------HHH------hcCcccccCCCCCCEEEEeeCCCCC
Q 004574 634 -TP-FGFQTE----------FRTLW-EATN----------------VYI------EMSPITHANKIKKPILIIHGEVDDK 678 (744)
Q Consensus 634 -~~-~~~~~~----------~~~~~-~~~~----------------~~~------~~~~~~~~~~~~~P~l~i~G~~D~~ 678 (744)
.. ..+... ..... ..++ .+. .......+.++++|+|+++|++|..
T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~P~lii~G~~D~~ 220 (276)
T TIGR02240 141 MASPRRYIQPSHGIHIAPDIYGGAFRRDPELAMAHASKVRSGGKLGYYWQLFAGLGWTSIHWLHKIQQPTLVLAGDDDPI 220 (276)
T ss_pred hcCchhhhccccccchhhhhccceeeccchhhhhhhhhcccCCCchHHHHHHHHcCCchhhHhhcCCCCEEEEEeCCCCc
Confidence 00 000000 00000 0000 000 0111233578899999999999999
Q ss_pred CCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhcc
Q 004574 679 VGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCL 734 (744)
Q Consensus 679 v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~ 734 (744)
++ ...++++.+.+. ..+++++++ +|... .+.+..+.+.+.+||++.-.
T Consensus 221 v~--~~~~~~l~~~~~----~~~~~~i~~-gH~~~-~e~p~~~~~~i~~fl~~~~~ 268 (276)
T TIGR02240 221 IP--LINMRLLAWRIP----NAELHIIDD-GHLFL-ITRAEAVAPIIMKFLAEERQ 268 (276)
T ss_pred CC--HHHHHHHHHhCC----CCEEEEEcC-CCchh-hccHHHHHHHHHHHHHHhhh
Confidence 98 888888876654 347778875 89876 56788999999999987643
No 76
>TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (PFAM model pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity.
Probab=99.55 E-value=2.1e-13 Score=138.83 Aligned_cols=190 Identities=20% Similarity=0.175 Sum_probs=118.6
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC-------hHHHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-------PNDSAEAAVEEV 585 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-------~~~d~~~~~~~l 585 (744)
.|+||++||.+... ..| ...... ++++|.|+. .+.+|+|.+. ..+++.+.+..+
T Consensus 28 ~~~vv~~hG~~~~~-----------~~~----~~~~~~-l~~~~~vi~---~D~~G~G~S~~~~~~~~~~~~~~~~l~~~ 88 (278)
T TIGR03056 28 GPLLLLLHGTGAST-----------HSW----RDLMPP-LARSFRVVA---PDLPGHGFTRAPFRFRFTLPSMAEDLSAL 88 (278)
T ss_pred CCeEEEEcCCCCCH-----------HHH----HHHHHH-HhhCcEEEe---ecCCCCCCCCCccccCCCHHHHHHHHHHH
Confidence 47899999964211 111 122333 456799999 5555655442 233444445444
Q ss_pred HHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC------CC--------ccc---------ccc
Q 004574 586 VRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT------PF--------GFQ---------TEF 642 (744)
Q Consensus 586 ~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~------~~--------~~~---------~~~ 642 (744)
.+... .+++.|+||||||.+++.++.+.|++++++|++++.+..... +. ... ...
T Consensus 89 i~~~~--~~~~~lvG~S~Gg~~a~~~a~~~p~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (278)
T TIGR03056 89 CAAEG--LSPDGVIGHSAGAAIALRLALDGPVTPRMVVGINAALMPFEGMAGTLFPYMARVLACNPFTPPMMSRGAADQQ 166 (278)
T ss_pred HHHcC--CCCceEEEECccHHHHHHHHHhCCcccceEEEEcCcccccccccccccchhhHhhhhcccchHHHHhhcccCc
Confidence 44422 357899999999999999999999999999888765321000 00 000 000
Q ss_pred ---------cchhh--cHHHHH-----------------hcC---cccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHH
Q 004574 643 ---------RTLWE--ATNVYI-----------------EMS---PITHANKIKKPILIIHGEVDDKVGLFPMQAERFFD 691 (744)
Q Consensus 643 ---------~~~~~--~~~~~~-----------------~~~---~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~ 691 (744)
...+. ....+. .+. ....++++++|+|+++|++|..+| ...++++.+
T Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~P~lii~g~~D~~vp--~~~~~~~~~ 244 (278)
T TIGR03056 167 RVERLIRDTGSLLDKAGMTYYGRLIRSPAHVDGALSMMAQWDLAPLNRDLPRITIPLHLIAGEEDKAVP--PDESKRAAT 244 (278)
T ss_pred chhHHhhccccccccchhhHHHHhhcCchhhhHHHHHhhcccccchhhhcccCCCCEEEEEeCCCcccC--HHHHHHHHH
Confidence 00000 000000 000 112356789999999999999998 787777765
Q ss_pred HHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 692 ALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 692 ~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
.+. ..++.++++++|.+. .+.+..+.+.+.+||+
T Consensus 245 ~~~----~~~~~~~~~~gH~~~-~e~p~~~~~~i~~f~~ 278 (278)
T TIGR03056 245 RVP----TATLHVVPGGGHLVH-EEQADGVVGLILQAAE 278 (278)
T ss_pred hcc----CCeEEEECCCCCccc-ccCHHHHHHHHHHHhC
Confidence 543 358899999999877 5668899999999984
No 77
>TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily.
Probab=99.55 E-value=2e-13 Score=137.51 Aligned_cols=221 Identities=14% Similarity=0.089 Sum_probs=134.5
Q ss_pred EEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecC
Q 004574 483 EMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGP 562 (744)
Q Consensus 483 ~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~ 562 (744)
+.+.+.. ++..+.++++.|.+. . .+.||++||++.. +.+. + ......+..|+++||.|++
T Consensus 3 ~~~~~~~-~~~~l~g~~~~p~~~-----~-~~~vv~i~gg~~~--------~~g~--~-~~~~~la~~l~~~G~~v~~-- 62 (274)
T TIGR03100 3 RALTFSC-EGETLVGVLHIPGAS-----H-TTGVLIVVGGPQY--------RVGS--H-RQFVLLARRLAEAGFPVLR-- 62 (274)
T ss_pred eeEEEEc-CCcEEEEEEEcCCCC-----C-CCeEEEEeCCccc--------cCCc--h-hHHHHHHHHHHHCCCEEEE--
Confidence 4566764 567899999998641 1 2455556664310 0000 0 0012346678889999999
Q ss_pred CCCCCCCCCC--------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC
Q 004574 563 SIPIIGEGDK--------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT 634 (744)
Q Consensus 563 ~~~~~g~g~~--------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~ 634 (744)
++.+|+|.+ ...+|+.+++++++++.. ..++|.++||||||.+++.++... .+++++|+++|.+.....
T Consensus 63 -~Dl~G~G~S~~~~~~~~~~~~d~~~~~~~l~~~~~-g~~~i~l~G~S~Gg~~a~~~a~~~-~~v~~lil~~p~~~~~~~ 139 (274)
T TIGR03100 63 -FDYRGMGDSEGENLGFEGIDADIAAAIDAFREAAP-HLRRIVAWGLCDAASAALLYAPAD-LRVAGLVLLNPWVRTEAA 139 (274)
T ss_pred -eCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHhhCC-CCCcEEEEEECHHHHHHHHHhhhC-CCccEEEEECCccCCccc
Confidence 455555543 123589999999987521 125799999999999999887664 689999999987542110
Q ss_pred CCc--c----cc--cccchhhc-----------HHHHHh----c---C-----------cccccCCCCCCEEEEeeCCCC
Q 004574 635 PFG--F----QT--EFRTLWEA-----------TNVYIE----M---S-----------PITHANKIKKPILIIHGEVDD 677 (744)
Q Consensus 635 ~~~--~----~~--~~~~~~~~-----------~~~~~~----~---~-----------~~~~~~~~~~P~l~i~G~~D~ 677 (744)
... . .. .....|.. ...+.. . . ....+.++++|+|+++|..|.
T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~P~ll~~g~~D~ 219 (274)
T TIGR03100 140 QAASRIRHYYLGQLLSADFWRKLLSGEVNLGSSLRGLGDALLKARQKGDEVAHGGLAERMKAGLERFQGPVLFILSGNDL 219 (274)
T ss_pred chHHHHHHHHHHHHhChHHHHHhcCCCccHHHHHHHHHHHHHhhhhcCCCcccchHHHHHHHHHHhcCCcEEEEEcCcch
Confidence 000 0 00 00011110 000110 1 0 012234678999999999998
Q ss_pred CCCCCHHHH-----HHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 678 KVGLFPMQA-----ERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 678 ~v~~~~~~~-----~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
..+ .... .++.+.+. ...+++.++++++|.++.....+.+.+.+.+||+
T Consensus 220 ~~~--~~~~~~~~~~~~~~~l~--~~~v~~~~~~~~~H~l~~e~~~~~v~~~i~~wL~ 273 (274)
T TIGR03100 220 TAQ--EFADSVLGEPAWRGALE--DPGIERVEIDGADHTFSDRVWREWVAARTTEWLR 273 (274)
T ss_pred hHH--HHHHHhccChhhHHHhh--cCCeEEEecCCCCcccccHHHHHHHHHHHHHHHh
Confidence 642 1110 22222222 1457999999999988767777899999999995
No 78
>PF10503 Esterase_phd: Esterase PHB depolymerase
Probab=99.54 E-value=1.2e-13 Score=130.50 Aligned_cols=181 Identities=23% Similarity=0.252 Sum_probs=112.5
Q ss_pred EEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHH-hCCeEEEecCCCCC---CC-C
Q 004574 495 LTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFL-ARRFAVLAGPSIPI---IG-E 569 (744)
Q Consensus 495 l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~G~~v~~~~~~~~---~g-~ 569 (744)
|.+.+|+|++.. .++.|+||++||.+... ..+... .....++ ++||+|+.+..... .+ +
T Consensus 1 l~Y~lYvP~~~~---~~~~PLVv~LHG~~~~a-----------~~~~~~--s~~~~lAd~~GfivvyP~~~~~~~~~~cw 64 (220)
T PF10503_consen 1 LSYRLYVPPGAP---RGPVPLVVVLHGCGQSA-----------EDFAAG--SGWNALADREGFIVVYPEQSRRANPQGCW 64 (220)
T ss_pred CcEEEecCCCCC---CCCCCEEEEeCCCCCCH-----------HHHHhh--cCHHHHhhcCCeEEEcccccccCCCCCcc
Confidence 356899999753 23589999999976322 111111 1123445 57999998543211 01 0
Q ss_pred ---------CCCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC-Cc-c
Q 004574 570 ---------GDKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP-FG-F 638 (744)
Q Consensus 570 ---------g~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~-~~-~ 638 (744)
+..+. ..+...++++.++..||++||++.|+|.||+|+..+++.+|++|+++..+++..-..... .. .
T Consensus 65 ~w~~~~~~~g~~d~-~~i~~lv~~v~~~~~iD~~RVyv~G~S~Gg~ma~~la~~~pd~faa~a~~sG~~~~~a~~~~~a~ 143 (220)
T PF10503_consen 65 NWFSDDQQRGGGDV-AFIAALVDYVAARYNIDPSRVYVTGLSNGGMMANVLACAYPDLFAAVAVVSGVPYGCAASGASAL 143 (220)
T ss_pred cccccccccCccch-hhHHHHHHhHhhhcccCCCceeeEEECHHHHHHHHHHHhCCccceEEEeecccccccccCcccHH
Confidence 11111 257788999999999999999999999999999999999999999999888763211110 00 0
Q ss_pred cccccchhhcHHHHHhcCccccc-CCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhC
Q 004574 639 QTEFRTLWEATNVYIEMSPITHA-NKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGH 696 (744)
Q Consensus 639 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~ 696 (744)
.......-..+..... ..... ..-..|++++||+.|..|. +.++.++.+.+...
T Consensus 144 ~~m~~g~~~~p~~~~~--a~~~~g~~~~~P~~v~hG~~D~tV~--~~n~~~~~~q~~~~ 198 (220)
T PF10503_consen 144 SAMRSGPRPAPAAAWG--ARSDAGAYPGYPRIVFHGTADTTVN--PQNADQLVAQWLNV 198 (220)
T ss_pred HHhhCCCCCChHHHHH--hhhhccCCCCCCEEEEecCCCCccC--cchHHHHHHHHHHc
Confidence 0000000001111100 00001 1113699999999999998 89999998887753
No 79
>PLN03087 BODYGUARD 1 domain containing hydrolase; Provisional
Probab=99.54 E-value=4.5e-13 Score=142.25 Aligned_cols=216 Identities=15% Similarity=0.147 Sum_probs=132.4
Q ss_pred EEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHH---hCCeEEEecC
Q 004574 486 KYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFL---ARRFAVLAGP 562 (744)
Q Consensus 486 ~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~G~~v~~~~ 562 (744)
.+....+.+++.....|++. ...|.||++||.+.. ...|.......|+ +++|.|++
T Consensus 179 ~~~~~~~~~l~~~~~gp~~~-----~~k~~VVLlHG~~~s--------------~~~W~~~~~~~L~~~~~~~yrVia-- 237 (481)
T PLN03087 179 SWLSSSNESLFVHVQQPKDN-----KAKEDVLFIHGFISS--------------SAFWTETLFPNFSDAAKSTYRLFA-- 237 (481)
T ss_pred eeEeeCCeEEEEEEecCCCC-----CCCCeEEEECCCCcc--------------HHHHHHHHHHHHHHHhhCCCEEEE--
Confidence 44444567888888777651 124789999997421 1111111122333 47999999
Q ss_pred CCCCCCCCCCC-------hHHHHHHHH-HHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC
Q 004574 563 SIPIIGEGDKL-------PNDSAEAAV-EEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT 634 (744)
Q Consensus 563 ~~~~~g~g~~~-------~~~d~~~~~-~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~ 634 (744)
.+.+|+|.+. ..++..+.+ ..+.+... .+++.++||||||.+++.++.++|++++++|++++.......
T Consensus 238 -~Dl~G~G~S~~p~~~~ytl~~~a~~l~~~ll~~lg--~~k~~LVGhSmGG~iAl~~A~~~Pe~V~~LVLi~~~~~~~~~ 314 (481)
T PLN03087 238 -VDLLGFGRSPKPADSLYTLREHLEMIERSVLERYK--VKSFHIVAHSLGCILALALAVKHPGAVKSLTLLAPPYYPVPK 314 (481)
T ss_pred -ECCCCCCCCcCCCCCcCCHHHHHHHHHHHHHHHcC--CCCEEEEEECHHHHHHHHHHHhChHhccEEEEECCCcccccc
Confidence 5556666442 122344444 23444322 368999999999999999999999999999999865321000
Q ss_pred -------------------CCcccc------c--cc----------chhhcH----------HHHH----hc--Ccc---
Q 004574 635 -------------------PFGFQT------E--FR----------TLWEAT----------NVYI----EM--SPI--- 658 (744)
Q Consensus 635 -------------------~~~~~~------~--~~----------~~~~~~----------~~~~----~~--~~~--- 658 (744)
...+.. + .. ..|+.. ..+. .. .+.
T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~l~~~~~~~~~~~~~~~ 394 (481)
T PLN03087 315 GVQATQYVMRKVAPRRVWPPIAFGASVACWYEHISRTICLVICKNHRLWEFLTRLLTRNRMRTFLIEGFFCHTHNAAWHT 394 (481)
T ss_pred chhHHHHHHHHhcccccCCccccchhHHHHHHHHHhhhhcccccchHHHHHHHHHhhhhhhhHHHHHHHHhccchhhHHH
Confidence 000000 0 00 001100 0000 00 000
Q ss_pred -----------------cccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHH
Q 004574 659 -----------------THANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHV 721 (744)
Q Consensus 659 -----------------~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~ 721 (744)
...+++++|+|+++|++|..+| ...++.+.+.+. ..++++++++||.....+.+..+
T Consensus 395 l~~~i~~~~~~l~~~l~~l~~~I~vPtLII~Ge~D~ivP--~~~~~~la~~iP----~a~l~vI~~aGH~~~v~e~p~~f 468 (481)
T PLN03087 395 LHNIICGSGSKLDGYLDHVRDQLKCDVAIFHGGDDELIP--VECSYAVKAKVP----RARVKVIDDKDHITIVVGRQKEF 468 (481)
T ss_pred HHHHHhchhhhhhhHHHHHHHhCCCCEEEEEECCCCCCC--HHHHHHHHHhCC----CCEEEEeCCCCCcchhhcCHHHH
Confidence 0012589999999999999998 888887766654 35999999999986644667899
Q ss_pred HHHHHHHHHH
Q 004574 722 IWETDRWLQK 731 (744)
Q Consensus 722 ~~~~~~fl~~ 731 (744)
++.+.+|...
T Consensus 469 a~~L~~F~~~ 478 (481)
T PLN03087 469 ARELEEIWRR 478 (481)
T ss_pred HHHHHHHhhc
Confidence 9999999854
No 80
>PLN02824 hydrolase, alpha/beta fold family protein
Probab=99.54 E-value=3.6e-13 Score=138.14 Aligned_cols=191 Identities=18% Similarity=0.197 Sum_probs=119.3
Q ss_pred eEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC-------------hHHHHHH
Q 004574 514 PCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-------------PNDSAEA 580 (744)
Q Consensus 514 p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-------------~~~d~~~ 580 (744)
|.||++||.+.. ...| ..++..|+.+ |.|++ .+.+|+|.+. ..+++.+
T Consensus 30 ~~vlllHG~~~~--------------~~~w-~~~~~~L~~~-~~vi~---~DlpG~G~S~~~~~~~~~~~~~~~~~~~a~ 90 (294)
T PLN02824 30 PALVLVHGFGGN--------------ADHW-RKNTPVLAKS-HRVYA---IDLLGYGYSDKPNPRSAPPNSFYTFETWGE 90 (294)
T ss_pred CeEEEECCCCCC--------------hhHH-HHHHHHHHhC-CeEEE---EcCCCCCCCCCCccccccccccCCHHHHHH
Confidence 679999997521 1111 1334455555 68888 5566666542 1234444
Q ss_pred HHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC---CCC-Ccc------cc---c------
Q 004574 581 AVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT---LTP-FGF------QT---E------ 641 (744)
Q Consensus 581 ~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~---~~~-~~~------~~---~------ 641 (744)
.+..+.+...+ +++.|+||||||.+++.++.++|++++++|++++..... ... ... .. .
T Consensus 91 ~l~~~l~~l~~--~~~~lvGhS~Gg~va~~~a~~~p~~v~~lili~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (294)
T PLN02824 91 QLNDFCSDVVG--DPAFVICNSVGGVVGLQAAVDAPELVRGVMLINISLRGLHIKKQPWLGRPFIKAFQNLLRETAVGKA 168 (294)
T ss_pred HHHHHHHHhcC--CCeEEEEeCHHHHHHHHHHHhChhheeEEEEECCCcccccccccchhhhHHHHHHHHHHhchhHHHH
Confidence 44444443223 689999999999999999999999999999998743110 000 000 00 0
Q ss_pred ----c------cch----hh-----c---HH-------------HHHhc---C----cccccCCCCCCEEEEeeCCCCCC
Q 004574 642 ----F------RTL----WE-----A---TN-------------VYIEM---S----PITHANKIKKPILIIHGEVDDKV 679 (744)
Q Consensus 642 ----~------~~~----~~-----~---~~-------------~~~~~---~----~~~~~~~~~~P~l~i~G~~D~~v 679 (744)
. ... +. . .+ .+.++ . ....++++++|+|+++|++|..+
T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~P~lvi~G~~D~~~ 248 (294)
T PLN02824 169 FFKSVATPETVKNILCQCYHDDSAVTDELVEAILRPGLEPGAVDVFLDFISYSGGPLPEELLPAVKCPVLIAWGEKDPWE 248 (294)
T ss_pred HHHhhcCHHHHHHHHHHhccChhhccHHHHHHHHhccCCchHHHHHHHHhccccccchHHHHhhcCCCeEEEEecCCCCC
Confidence 0 000 00 0 00 00000 0 11235678999999999999999
Q ss_pred CCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 680 GLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 680 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
+ ...++.+ .+.....+++++++++|... .+.++.+.+.+.+||+++
T Consensus 249 ~--~~~~~~~----~~~~~~~~~~~i~~~gH~~~-~e~p~~~~~~i~~fl~~~ 294 (294)
T PLN02824 249 P--VELGRAY----ANFDAVEDFIVLPGVGHCPQ-DEAPELVNPLIESFVARH 294 (294)
T ss_pred C--hHHHHHH----HhcCCccceEEeCCCCCChh-hhCHHHHHHHHHHHHhcC
Confidence 8 7766554 33334468999999999877 677889999999999753
No 81
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=99.54 E-value=9.7e-13 Score=130.00 Aligned_cols=251 Identities=13% Similarity=0.061 Sum_probs=174.2
Q ss_pred ccceeEeecCCCCCCCCceeeecCCCCCcccce-eecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccC-C
Q 004574 4 FTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFV-SWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFES-P 81 (744)
Q Consensus 4 ~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p-~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~-~ 81 (744)
+.+||..+|+| .-.++.|.+. ...| ..+.||++|+| ...++||+.|.++...+.|--+ +
T Consensus 247 ~GnlYSvdldG----kDlrrHTnFt----dYY~R~~nsDGkrIvF-----------q~~GdIylydP~td~lekldI~lp 307 (668)
T COG4946 247 VGNLYSVDLDG----KDLRRHTNFT----DYYPRNANSDGKRIVF-----------QNAGDIYLYDPETDSLEKLDIGLP 307 (668)
T ss_pred ccceEEeccCC----chhhhcCCch----hccccccCCCCcEEEE-----------ecCCcEEEeCCCcCcceeeecCCc
Confidence 45899999988 7777877443 3333 56899999999 5568999999998877776422 1
Q ss_pred Cc-c-ccccccc----e-EEec-CCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccce
Q 004574 82 DI-C-LNAVFGS----F-VWVN-NSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDY 153 (744)
Q Consensus 82 ~~-~-~~~~~~~----~-~wsp-Dg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (744)
-. . -.+.+.. + .+|+ +|.+|++++.
T Consensus 308 l~rk~k~~k~~~pskyledfa~~~Gd~ia~VSR----------------------------------------------- 340 (668)
T COG4946 308 LDRKKKQPKFVNPSKYLEDFAVVNGDYIALVSR----------------------------------------------- 340 (668)
T ss_pred cccccccccccCHHHhhhhhccCCCcEEEEEec-----------------------------------------------
Confidence 10 0 0000110 1 1443 5666666532
Q ss_pred eeeeEEEEEcC-CCCeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCC
Q 004574 154 YTTAQLVLGSL-DGTAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAED 232 (744)
Q Consensus 154 ~~~~~l~~~~~-~G~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~ 232 (744)
++.++++. .|-.-++...+.+..-..+-|++.++....+. ..|-+++.+|++.+++...-+.
T Consensus 341 ---GkaFi~~~~~~~~iqv~~~~~VrY~r~~~~~e~~vigt~dg------------D~l~iyd~~~~e~kr~e~~lg~-- 403 (668)
T COG4946 341 ---GKAFIMRPWDGYSIQVGKKGGVRYRRIQVDPEGDVIGTNDG------------DKLGIYDKDGGEVKRIEKDLGN-- 403 (668)
T ss_pred ---CcEEEECCCCCeeEEcCCCCceEEEEEccCCcceEEeccCC------------ceEEEEecCCceEEEeeCCccc--
Confidence 68899999 77777787776677777888888776665443 3789999999998887655333
Q ss_pred CCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee-ccceeceeeccCCc
Q 004574 233 IPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL-DLRFRSVSWCDDSL 311 (744)
Q Consensus 233 ~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~-~~~~~~~~~SpDg~ 311 (744)
+..+..||||++ ++... .+-+||++++ +++..+.+-.. .+-+..+.|+|+++
T Consensus 404 -----------I~av~vs~dGK~-~vvaN------------dr~el~vidi---dngnv~~idkS~~~lItdf~~~~nsr 456 (668)
T COG4946 404 -----------IEAVKVSPDGKK-VVVAN------------DRFELWVIDI---DNGNVRLIDKSEYGLITDFDWHPNSR 456 (668)
T ss_pred -----------eEEEEEcCCCcE-EEEEc------------CceEEEEEEe---cCCCeeEecccccceeEEEEEcCCce
Confidence 567889999997 55442 3446999999 78887776554 34578899999999
Q ss_pred eEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeee
Q 004574 312 ALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKK 373 (744)
Q Consensus 312 ~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~ 373 (744)
+|+|+....=-..+|.++|+++ +..-.+|.....+ ..|. +.|||++|+|.+.+
T Consensus 457 ~iAYafP~gy~tq~Iklydm~~--~Kiy~vTT~ta~D-----fsPa--FD~d~ryLYfLs~R 509 (668)
T COG4946 457 WIAYAFPEGYYTQSIKLYDMDG--GKIYDVTTPTAYD-----FSPA--FDPDGRYLYFLSAR 509 (668)
T ss_pred eEEEecCcceeeeeEEEEecCC--CeEEEecCCcccc-----cCcc--cCCCCcEEEEEecc
Confidence 9999874322446788999888 4444555433322 3444 89999999999855
No 82
>PRK11071 esterase YqiA; Provisional
Probab=99.54 E-value=3.4e-13 Score=126.99 Aligned_cols=176 Identities=11% Similarity=0.018 Sum_probs=109.1
Q ss_pred eEEEEECCCCCcccccCCcccCCCCccCCCCch-hHHHHHh--CCeEEEecCCCCCCCCCCCChHHHHHHHHHHHHHcCC
Q 004574 514 PCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPT-SSLIFLA--RRFAVLAGPSIPIIGEGDKLPNDSAEAAVEEVVRRGV 590 (744)
Q Consensus 514 p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~G~~v~~~~~~~~~g~g~~~~~~d~~~~~~~l~~~~~ 590 (744)
|.||++||.+.+ ...+... ....+.+ .+|.|++ .+.++++ +++.+.+..+.++..
T Consensus 2 p~illlHGf~ss--------------~~~~~~~~~~~~l~~~~~~~~v~~---~dl~g~~-----~~~~~~l~~l~~~~~ 59 (190)
T PRK11071 2 STLLYLHGFNSS--------------PRSAKATLLKNWLAQHHPDIEMIV---PQLPPYP-----ADAAELLESLVLEHG 59 (190)
T ss_pred CeEEEECCCCCC--------------cchHHHHHHHHHHHHhCCCCeEEe---CCCCCCH-----HHHHHHHHHHHHHcC
Confidence 679999996421 1111111 1233444 3799988 3444543 356666666666533
Q ss_pred CCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC------C-Ccccccccchh--hcHHHHHhcCccccc
Q 004574 591 ADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT------P-FGFQTEFRTLW--EATNVYIEMSPITHA 661 (744)
Q Consensus 591 ~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~------~-~~~~~~~~~~~--~~~~~~~~~~~~~~~ 661 (744)
. +++.++|+||||++++.++.++|. + +|+++|..+.... . ..........+ ...+......+.. +
T Consensus 60 ~--~~~~lvG~S~Gg~~a~~~a~~~~~--~-~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~-i 133 (190)
T PRK11071 60 G--DPLGLVGSSLGGYYATWLSQCFML--P-AVVVNPAVRPFELLTDYLGENENPYTGQQYVLESRHIYDLKVMQIDP-L 133 (190)
T ss_pred C--CCeEEEEECHHHHHHHHHHHHcCC--C-EEEECCCCCHHHHHHHhcCCcccccCCCcEEEcHHHHHHHHhcCCcc-C
Confidence 3 589999999999999999999973 3 4667776551000 0 00000000111 1111122222222 3
Q ss_pred CCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 662 NKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 662 ~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
. ..+|++++||++|+.|| ++.+.++++.. ..+++++++|.+.. .+++.+.+.+||.
T Consensus 134 ~-~~~~v~iihg~~De~V~--~~~a~~~~~~~-------~~~~~~ggdH~f~~---~~~~~~~i~~fl~ 189 (190)
T PRK11071 134 E-SPDLIWLLQQTGDEVLD--YRQAVAYYAAC-------RQTVEEGGNHAFVG---FERYFNQIVDFLG 189 (190)
T ss_pred C-ChhhEEEEEeCCCCcCC--HHHHHHHHHhc-------ceEEECCCCcchhh---HHHhHHHHHHHhc
Confidence 3 67889999999999999 99999998842 56678999999854 3788889999974
No 83
>PRK10673 acyl-CoA esterase; Provisional
Probab=99.53 E-value=3.3e-13 Score=135.42 Aligned_cols=192 Identities=13% Similarity=0.093 Sum_probs=114.6
Q ss_pred CceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC-----hHHHHHHHHHHHH
Q 004574 512 PLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-----PNDSAEAAVEEVV 586 (744)
Q Consensus 512 ~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-----~~~d~~~~~~~l~ 586 (744)
..|.||++||.+.. ... ....+..| ..+|.|+. .+.+|+|.+. ..++..+-+..+.
T Consensus 15 ~~~~iv~lhG~~~~--------------~~~-~~~~~~~l-~~~~~vi~---~D~~G~G~s~~~~~~~~~~~~~d~~~~l 75 (255)
T PRK10673 15 NNSPIVLVHGLFGS--------------LDN-LGVLARDL-VNDHDIIQ---VDMRNHGLSPRDPVMNYPAMAQDLLDTL 75 (255)
T ss_pred CCCCEEEECCCCCc--------------hhH-HHHHHHHH-hhCCeEEE---ECCCCCCCCCCCCCCCHHHHHHHHHHHH
Confidence 36889999996411 111 11223333 56799998 5556665442 1122222222222
Q ss_pred HcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC-CCCCCCC-------------Cccccc--cc-------
Q 004574 587 RRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS-YNKTLTP-------------FGFQTE--FR------- 643 (744)
Q Consensus 587 ~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~-~~~~~~~-------------~~~~~~--~~------- 643 (744)
+. +..+++.|+||||||.+++.++.++|++++++|++++. ....... ...... ..
T Consensus 76 ~~--l~~~~~~lvGhS~Gg~va~~~a~~~~~~v~~lvli~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (255)
T PRK10673 76 DA--LQIEKATFIGHSMGGKAVMALTALAPDRIDKLVAIDIAPVDYHVRRHDEIFAAINAVSEAGATTRQQAAAIMRQHL 153 (255)
T ss_pred HH--cCCCceEEEEECHHHHHHHHHHHhCHhhcceEEEEecCCCCccchhhHHHHHHHHHhhhcccccHHHHHHHHHHhc
Confidence 22 12357999999999999999999999999999987532 1100000 000000 00
Q ss_pred ---------------chhh-cH----HHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEE
Q 004574 644 ---------------TLWE-AT----NVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLV 703 (744)
Q Consensus 644 ---------------~~~~-~~----~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~ 703 (744)
..|. .. +.+........++++++|+|+++|++|..++ ....+.+.+.+ .+.+++
T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~~~--~~~~~~~~~~~----~~~~~~ 227 (255)
T PRK10673 154 NEEGVIQFLLKSFVDGEWRFNVPVLWDQYPHIVGWEKIPAWPHPALFIRGGNSPYVT--EAYRDDLLAQF----PQARAH 227 (255)
T ss_pred CCHHHHHHHHhcCCcceeEeeHHHHHHhHHHHhCCcccCCCCCCeEEEECCCCCCCC--HHHHHHHHHhC----CCcEEE
Confidence 0000 00 0111111122345678999999999999987 66665554443 456899
Q ss_pred EeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 704 LLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 704 ~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
++++++|... .+.+..+.+.+.+||.+
T Consensus 228 ~~~~~gH~~~-~~~p~~~~~~l~~fl~~ 254 (255)
T PRK10673 228 VIAGAGHWVH-AEKPDAVLRAIRRYLND 254 (255)
T ss_pred EeCCCCCeee-ccCHHHHHHHHHHHHhc
Confidence 9999999776 66688899999999975
No 84
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.53 E-value=8.3e-13 Score=127.06 Aligned_cols=107 Identities=13% Similarity=0.128 Sum_probs=69.1
Q ss_pred eEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 292 EILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 292 ~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.+++.....++.+.||||+++|+.++.+. .+.+++..+ ++....+.++...++. ++||.|.+.|+..+
T Consensus 361 ~rmtgHq~lVn~V~fSPd~r~IASaSFDk----SVkLW~g~t--Gk~lasfRGHv~~VYq------vawsaDsRLlVS~S 428 (480)
T KOG0271|consen 361 TRMTGHQALVNHVSFSPDGRYIASASFDK----SVKLWDGRT--GKFLASFRGHVAAVYQ------VAWSADSRLLVSGS 428 (480)
T ss_pred hhhhchhhheeeEEECCCccEEEEeeccc----ceeeeeCCC--cchhhhhhhccceeEE------EEeccCccEEEEcC
Confidence 45666677899999999999999887433 244455445 3333334455554444 78999999888776
Q ss_pred eecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEE
Q 004574 372 KKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKIL 442 (744)
Q Consensus 372 ~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~ 442 (744)
++ ..|.+|+..+.+...-..... +.|.. ..|||||+.++
T Consensus 429 kD---------------------sTLKvw~V~tkKl~~DLpGh~----DEVf~-------vDwspDG~rV~ 467 (480)
T KOG0271|consen 429 KD---------------------STLKVWDVRTKKLKQDLPGHA----DEVFA-------VDWSPDGQRVA 467 (480)
T ss_pred CC---------------------ceEEEEEeeeeeecccCCCCC----ceEEE-------EEecCCCceee
Confidence 33 248999988776643221111 12222 48999998765
No 85
>PLN02965 Probable pheophorbidase
Probab=99.53 E-value=4.6e-13 Score=134.10 Aligned_cols=191 Identities=11% Similarity=0.082 Sum_probs=118.8
Q ss_pred EEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC-------hHHHHHHHHHHHHH
Q 004574 515 CLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-------PNDSAEAAVEEVVR 587 (744)
Q Consensus 515 ~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-------~~~d~~~~~~~l~~ 587 (744)
+||++||.+.. ...| ..++..|+++||.|++ .+.+|+|.+. ..+++.+.+..+.+
T Consensus 5 ~vvllHG~~~~--------------~~~w-~~~~~~L~~~~~~via---~Dl~G~G~S~~~~~~~~~~~~~a~dl~~~l~ 66 (255)
T PLN02965 5 HFVFVHGASHG--------------AWCW-YKLATLLDAAGFKSTC---VDLTGAGISLTDSNTVSSSDQYNRPLFALLS 66 (255)
T ss_pred EEEEECCCCCC--------------cCcH-HHHHHHHhhCCceEEE---ecCCcCCCCCCCccccCCHHHHHHHHHHHHH
Confidence 58999997521 1111 1344566688999999 5666666442 12333333333333
Q ss_pred cCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCC---CCC-C---------C--CCcc--ccc--cc-----
Q 004574 588 RGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSY---NKT-L---------T--PFGF--QTE--FR----- 643 (744)
Q Consensus 588 ~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~---~~~-~---------~--~~~~--~~~--~~----- 643 (744)
...+ .+++.++||||||.+++.++.++|++++++|++++.. ... . . .+.. ... ..
T Consensus 67 ~l~~-~~~~~lvGhSmGG~ia~~~a~~~p~~v~~lvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 145 (255)
T PLN02965 67 DLPP-DHKVILVGHSIGGGSVTEALCKFTDKISMAIYVAAAMVKPGSIISPRLKNVMEGTEKIWDYTFGEGPDKPPTGIM 145 (255)
T ss_pred hcCC-CCCEEEEecCcchHHHHHHHHhCchheeEEEEEccccCCCCCCccHHHHhhhhccccceeeeeccCCCCCcchhh
Confidence 2112 1489999999999999999999999999999887641 100 0 0 0000 000 00
Q ss_pred ---chh-----h--cHH--HH--Hhc--Cc----------ccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCC
Q 004574 644 ---TLW-----E--ATN--VY--IEM--SP----------ITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHG 697 (744)
Q Consensus 644 ---~~~-----~--~~~--~~--~~~--~~----------~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~ 697 (744)
... . ..+ .+ ... .+ ...+.++++|+|+++|++|..+| ...++.+.+.+.
T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~vP~lvi~g~~D~~~~--~~~~~~~~~~~~--- 220 (255)
T PLN02965 146 MKPEFVRHYYYNQSPLEDYTLSSKLLRPAPVRAFQDLDKLPPNPEAEKVPRVYIKTAKDNLFD--PVRQDVMVENWP--- 220 (255)
T ss_pred cCHHHHHHHHhcCCCHHHHHHHHHhcCCCCCcchhhhhhccchhhcCCCCEEEEEcCCCCCCC--HHHHHHHHHhCC---
Confidence 000 0 000 00 000 00 01223689999999999999998 877777765553
Q ss_pred CcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 698 ALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 698 ~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
..++++++++||.++ .+.++.+.+.+.+|+++
T Consensus 221 -~a~~~~i~~~GH~~~-~e~p~~v~~~l~~~~~~ 252 (255)
T PLN02965 221 -PAQTYVLEDSDHSAF-FSVPTTLFQYLLQAVSS 252 (255)
T ss_pred -cceEEEecCCCCchh-hcCHHHHHHHHHHHHHH
Confidence 358999999999987 67788999999999875
No 86
>TIGR02427 protocat_pcaD 3-oxoadipate enol-lactonase. Members of this family are 3-oxoadipate enol-lactonase. Note that the substrate is known as 3-oxoadipate enol-lactone, 2-oxo-2,3-dihydrofuran-5-acetate, 4,5-Dihydro-5-oxofuran-2-acetate, and 5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the catalyzes the fourth step in the protocatechuate degradation to beta-ketoadipate and then to succinyl-CoA and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate, and vanillate all can be converted in one step to protocatechuate. This enzyme also acts in catechol degradation. In genomes that catabolize both catechol and protocatechuate, two forms of this enzyme may be found. All members of the seed alignment for this model were chosen from within protocatechuate degradation operons of at least three genes of the pathway, from genomes with the complete pathway through beta-ketoadipate.
Probab=99.53 E-value=7.6e-14 Score=139.43 Aligned_cols=189 Identities=17% Similarity=0.165 Sum_probs=116.9
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC------hHHHHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL------PNDSAEAAVEEVV 586 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~------~~~d~~~~~~~l~ 586 (744)
.|+||++||.+.. ...+ ...+ ..+.+||.|+. .+.+|+|.+. ..++..+.+..+.
T Consensus 13 ~~~li~~hg~~~~--------------~~~~-~~~~-~~l~~~~~v~~---~d~~G~G~s~~~~~~~~~~~~~~~~~~~i 73 (251)
T TIGR02427 13 APVLVFINSLGTD--------------LRMW-DPVL-PALTPDFRVLR---YDKRGHGLSDAPEGPYSIEDLADDVLALL 73 (251)
T ss_pred CCeEEEEcCcccc--------------hhhH-HHHH-HHhhcccEEEE---ecCCCCCCCCCCCCCCCHHHHHHHHHHHH
Confidence 6899999996421 1111 1223 33467999999 4555555431 2223344444444
Q ss_pred HcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC-C----------ccc------------cccc
Q 004574 587 RRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP-F----------GFQ------------TEFR 643 (744)
Q Consensus 587 ~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~-~----------~~~------------~~~~ 643 (744)
+. ++.+++.++|||+||.+++.++.++|+.++++|++++........ + ... ....
T Consensus 74 ~~--~~~~~v~liG~S~Gg~~a~~~a~~~p~~v~~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (251)
T TIGR02427 74 DH--LGIERAVFCGLSLGGLIAQGLAARRPDRVRALVLSNTAAKIGTPESWNARIAAVRAEGLAALADAVLERWFTPGFR 151 (251)
T ss_pred HH--hCCCceEEEEeCchHHHHHHHHHHCHHHhHHHhhccCccccCchhhHHHHHhhhhhccHHHHHHHHHHHHcccccc
Confidence 33 234689999999999999999999999999999887643211000 0 000 0000
Q ss_pred --c-----hhh------cH-------HHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEE
Q 004574 644 --T-----LWE------AT-------NVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLV 703 (744)
Q Consensus 644 --~-----~~~------~~-------~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~ 703 (744)
. .+. .. ..+...+....+.++++|+|+++|++|..++ .+..+++.+.+. ..+++
T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Pvlii~g~~D~~~~--~~~~~~~~~~~~----~~~~~ 225 (251)
T TIGR02427 152 EAHPARLDLYRNMLVRQPPDGYAGCCAAIRDADFRDRLGAIAVPTLCIAGDQDGSTP--PELVREIADLVP----GARFA 225 (251)
T ss_pred cCChHHHHHHHHHHHhcCHHHHHHHHHHHhcccHHHHhhhcCCCeEEEEeccCCcCC--hHHHHHHHHhCC----CceEE
Confidence 0 000 00 0011112233456789999999999999998 777776665543 35889
Q ss_pred EeCCCCcccCccccHHHHHHHHHHHH
Q 004574 704 LLPFEHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 704 ~~~~~~H~~~~~~~~~~~~~~~~~fl 729 (744)
++++++|... .+.++.+.+.+.+||
T Consensus 226 ~~~~~gH~~~-~~~p~~~~~~i~~fl 250 (251)
T TIGR02427 226 EIRGAGHIPC-VEQPEAFNAALRDFL 250 (251)
T ss_pred EECCCCCccc-ccChHHHHHHHHHHh
Confidence 9999999877 456778888888887
No 87
>COG0400 Predicted esterase [General function prediction only]
Probab=99.53 E-value=5.2e-13 Score=124.71 Aligned_cols=125 Identities=26% Similarity=0.237 Sum_probs=102.6
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcC
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMS 656 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 656 (744)
.+.++++.+.++..+|.+|+.++|+|.||++++.+..++|..++++|+++|+.-.....
T Consensus 82 ~~~~~l~~~~~~~gi~~~~ii~~GfSqGA~ial~~~l~~~~~~~~ail~~g~~~~~~~~--------------------- 140 (207)
T COG0400 82 KLAEFLEELAEEYGIDSSRIILIGFSQGANIALSLGLTLPGLFAGAILFSGMLPLEPEL--------------------- 140 (207)
T ss_pred HHHHHHHHHHHHhCCChhheEEEecChHHHHHHHHHHhCchhhccchhcCCcCCCCCcc---------------------
Confidence 45566666666788999999999999999999999999999999999999975422110
Q ss_pred cccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhc
Q 004574 657 PITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 657 ~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
....-.+|+|++||+.|++|| ...+.++.+.|+..|.+++..+++ +||.+.. +..+.+.+|+...+
T Consensus 141 ---~~~~~~~pill~hG~~Dpvvp--~~~~~~l~~~l~~~g~~v~~~~~~-~GH~i~~-----e~~~~~~~wl~~~~ 206 (207)
T COG0400 141 ---LPDLAGTPILLSHGTEDPVVP--LALAEALAEYLTASGADVEVRWHE-GGHEIPP-----EELEAARSWLANTL 206 (207)
T ss_pred ---ccccCCCeEEEeccCcCCccC--HHHHHHHHHHHHHcCCCEEEEEec-CCCcCCH-----HHHHHHHHHHHhcc
Confidence 011236899999999999999 999999999999999999999999 7898763 55667778887653
No 88
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=99.52 E-value=2.2e-13 Score=135.61 Aligned_cols=188 Identities=14% Similarity=0.107 Sum_probs=119.5
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCCh--HHHHHHHHHHHHHcCC
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLP--NDSAEAAVEEVVRRGV 590 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~--~~d~~~~~~~l~~~~~ 590 (744)
.|.||++||.+.. ...+ ...+.. +..+|.|++ .+.+|+|.+.. ..++.+.++.+.+..
T Consensus 4 ~~~iv~~HG~~~~--------------~~~~-~~~~~~-l~~~~~vi~---~d~~G~G~s~~~~~~~~~~~~~~~~~~~- 63 (245)
T TIGR01738 4 NVHLVLIHGWGMN--------------AEVF-RCLDEE-LSAHFTLHL---VDLPGHGRSRGFGPLSLADAAEAIAAQA- 63 (245)
T ss_pred CceEEEEcCCCCc--------------hhhH-HHHHHh-hccCeEEEE---ecCCcCccCCCCCCcCHHHHHHHHHHhC-
Confidence 3789999996421 1111 122333 346799999 55566665432 124666666666542
Q ss_pred CCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC-CC--ccc------------cccc---chh------
Q 004574 591 ADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT-PF--GFQ------------TEFR---TLW------ 646 (744)
Q Consensus 591 ~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~-~~--~~~------------~~~~---~~~------ 646 (744)
.+++.++||||||.+++.++.++|++++++|++++....... .+ ... .... ..+
T Consensus 64 --~~~~~lvG~S~Gg~~a~~~a~~~p~~v~~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 141 (245)
T TIGR01738 64 --PDPAIWLGWSLGGLVALHIAATHPDRVRALVTVASSPCFSAREDWPEGIKPDVLTGFQQQLSDDYQRTIERFLALQTL 141 (245)
T ss_pred --CCCeEEEEEcHHHHHHHHHHHHCHHhhheeeEecCCcccccCCcccccCCHHHHHHHHHHhhhhHHHHHHHHHHHHHh
Confidence 268999999999999999999999999999988764321000 00 000 0000 000
Q ss_pred ------hcHH----------------------HHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCC
Q 004574 647 ------EATN----------------------VYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGA 698 (744)
Q Consensus 647 ------~~~~----------------------~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~ 698 (744)
.... .+...+....+.++++|+|+++|++|..++ .+..+.+.+.+ .
T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~Pvlii~g~~D~~~~--~~~~~~~~~~~----~ 215 (245)
T TIGR01738 142 GTPTARQDARALKQTLLARPTPNVQVLQAGLEILATVDLRQPLQNISVPFLRLYGYLDGLVP--AKVVPYLDKLA----P 215 (245)
T ss_pred cCCccchHHHHHHHHhhccCCCCHHHHHHHHHHhhcccHHHHHhcCCCCEEEEeecCCcccC--HHHHHHHHHhC----C
Confidence 0000 000111123356889999999999999998 77777666544 3
Q ss_pred cEEEEEeCCCCcccCccccHHHHHHHHHHHH
Q 004574 699 LSRLVLLPFEHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 699 ~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl 729 (744)
+++++++++++|... .+.+..+.+.+.+|+
T Consensus 216 ~~~~~~~~~~gH~~~-~e~p~~~~~~i~~fi 245 (245)
T TIGR01738 216 HSELYIFAKAAHAPF-LSHAEAFCALLVAFK 245 (245)
T ss_pred CCeEEEeCCCCCCcc-ccCHHHHHHHHHhhC
Confidence 568999999999877 567889999999885
No 89
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.52 E-value=1.5e-11 Score=126.61 Aligned_cols=267 Identities=13% Similarity=0.037 Sum_probs=155.6
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
..|+++++.. ++..+.. ..+.......|+|||+.++.+. .....|+++++.+++..+......
T Consensus 11 ~~v~~~d~~t----~~~~~~~--~~~~~~~~l~~~~dg~~l~~~~---------~~~~~v~~~d~~~~~~~~~~~~~~-- 73 (300)
T TIGR03866 11 NTISVIDTAT----LEVTRTF--PVGQRPRGITLSKDGKLLYVCA---------SDSDTIQVIDLATGEVIGTLPSGP-- 73 (300)
T ss_pred CEEEEEECCC----CceEEEE--ECCCCCCceEECCCCCEEEEEE---------CCCCeEEEEECCCCcEEEeccCCC--
Confidence 3578888865 5543332 2233467799999999876543 234678999999887654322111
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
....+.|+|||+.|+.+... .+.|+++|+
T Consensus 74 ---~~~~~~~~~~g~~l~~~~~~------------------------------------------------~~~l~~~d~ 102 (300)
T TIGR03866 74 ---DPELFALHPNGKILYIANED------------------------------------------------DNLVTVIDI 102 (300)
T ss_pred ---CccEEEECCCCCEEEEEcCC------------------------------------------------CCeEEEEEC
Confidence 13467899999988765210 046788888
Q ss_pred -CCC-eeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC
Q 004574 165 -DGT-AKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 165 -~G~-~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
+++ ...+........++|+|||+.+++..... ..+..|+..+.......... .
T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~dg~~l~~~~~~~------------~~~~~~d~~~~~~~~~~~~~-------------~ 157 (300)
T TIGR03866 103 ETRKVLAEIPVGVEPEGMAVSPDGKIVVNTSETT------------NMAHFIDTKTYEIVDNVLVD-------------Q 157 (300)
T ss_pred CCCeEEeEeeCCCCcceEEECCCCCEEEEEecCC------------CeEEEEeCCCCeEEEEEEcC-------------C
Confidence 443 33332223356789999999988775432 23556777655432211111 1
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCc-eEeeee-------ccceeceeeccCCceEE
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKP-EILHKL-------DLRFRSVSWCDDSLALV 314 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~l~~~-------~~~~~~~~~SpDg~~l~ 314 (744)
.+..+.|+|||+. |++... ..+.|+++++ .+++. ..+... ......++|+|||+.++
T Consensus 158 ~~~~~~~s~dg~~-l~~~~~-----------~~~~v~i~d~---~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~ 222 (300)
T TIGR03866 158 RPRFAEFTADGKE-LWVSSE-----------IGGTVSVIDV---ATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAF 222 (300)
T ss_pred CccEEEECCCCCE-EEEEcC-----------CCCEEEEEEc---CcceeeeeeeecccccccccCCccceEECCCCCEEE
Confidence 1446789999986 554421 1224777887 44432 222211 11223578999999866
Q ss_pred EeeeeeccceeEEEEcCCCCCCccee-eeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCC
Q 004574 315 NETWYKTSQTRTWLVCPGSKDVAPRV-LFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGN 393 (744)
Q Consensus 315 ~~~~~~~~~~~l~~~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~ 393 (744)
.... ....+.++|+.+. +... +.... ....++|+|||++|+.... .
T Consensus 223 ~~~~---~~~~i~v~d~~~~--~~~~~~~~~~--------~~~~~~~~~~g~~l~~~~~--------------------~ 269 (300)
T TIGR03866 223 VALG---PANRVAVVDAKTY--EVLDYLLVGQ--------RVWQLAFTPDEKYLLTTNG--------------------V 269 (300)
T ss_pred EEcC---CCCeEEEEECCCC--cEEEEEEeCC--------CcceEEECCCCCEEEEEcC--------------------C
Confidence 5431 1235888888763 3222 21111 1112679999999876531 1
Q ss_pred CceEEEEecCCCceeEEee
Q 004574 394 IPFLDLFDINTGSKERIWE 412 (744)
Q Consensus 394 ~~~l~~~d~~~g~~~~l~~ 412 (744)
...|.+||+.+++....+.
T Consensus 270 ~~~i~v~d~~~~~~~~~~~ 288 (300)
T TIGR03866 270 SNDVSVIDVAALKVIKSIK 288 (300)
T ss_pred CCeEEEEECCCCcEEEEEE
Confidence 2349999999888644433
No 90
>PRK10349 carboxylesterase BioH; Provisional
Probab=99.52 E-value=2e-13 Score=136.92 Aligned_cols=188 Identities=12% Similarity=0.063 Sum_probs=117.9
Q ss_pred eEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCChH--HHHHHHHHHHHHcCCC
Q 004574 514 PCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLPN--DSAEAAVEEVVRRGVA 591 (744)
Q Consensus 514 p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~~--~d~~~~~~~l~~~~~~ 591 (744)
|.||++||.+... ..| ..++..| +..|.|++ .+.+|+|.+... ..+.+.++.+.+.
T Consensus 14 ~~ivllHG~~~~~-----------~~w----~~~~~~L-~~~~~vi~---~Dl~G~G~S~~~~~~~~~~~~~~l~~~--- 71 (256)
T PRK10349 14 VHLVLLHGWGLNA-----------EVW----RCIDEEL-SSHFTLHL---VDLPGFGRSRGFGALSLADMAEAVLQQ--- 71 (256)
T ss_pred CeEEEECCCCCCh-----------hHH----HHHHHHH-hcCCEEEE---ecCCCCCCCCCCCCCCHHHHHHHHHhc---
Confidence 5689999965211 111 1233344 45699999 666777655311 1344455555543
Q ss_pred CCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCC----CCCc------c----cc---cccchh--------
Q 004574 592 DPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTL----TPFG------F----QT---EFRTLW-------- 646 (744)
Q Consensus 592 d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~----~~~~------~----~~---~~~~~~-------- 646 (744)
..+++.++||||||.+|+.++.++|++++++|++++...... .... + .. .....+
T Consensus 72 ~~~~~~lvGhS~Gg~ia~~~a~~~p~~v~~lili~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (256)
T PRK10349 72 APDKAIWLGWSLGGLVASQIALTHPERVQALVTVASSPCFSARDEWPGIKPDVLAGFQQQLSDDFQRTVERFLALQTMGT 151 (256)
T ss_pred CCCCeEEEEECHHHHHHHHHHHhChHhhheEEEecCccceecCCCCCcccHHHHHHHHHHHHhchHHHHHHHHHHHHccC
Confidence 236899999999999999999999999999998876321100 0000 0 00 000000
Q ss_pred h--c-----------------H-------HHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcE
Q 004574 647 E--A-----------------T-------NVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALS 700 (744)
Q Consensus 647 ~--~-----------------~-------~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~ 700 (744)
. . . ..+...+....+.++++|+|+++|++|..+| .+.++.+.+.+ .+.
T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~P~lii~G~~D~~~~--~~~~~~~~~~i----~~~ 225 (256)
T PRK10349 152 ETARQDARALKKTVLALPMPEVDVLNGGLEILKTVDLRQPLQNVSMPFLRLYGYLDGLVP--RKVVPMLDKLW----PHS 225 (256)
T ss_pred chHHHHHHHHHHHhhccCCCcHHHHHHHHHHHHhCccHHHHhhcCCCeEEEecCCCccCC--HHHHHHHHHhC----CCC
Confidence 0 0 0 0001112233456789999999999999988 76665555444 345
Q ss_pred EEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 701 RLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 701 ~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
+++++++++|... .+.++.+.+.+.+|-.
T Consensus 226 ~~~~i~~~gH~~~-~e~p~~f~~~l~~~~~ 254 (256)
T PRK10349 226 ESYIFAKAAHAPF-ISHPAEFCHLLVALKQ 254 (256)
T ss_pred eEEEeCCCCCCcc-ccCHHHHHHHHHHHhc
Confidence 9999999999887 6788889888888753
No 91
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.52 E-value=3.4e-11 Score=124.02 Aligned_cols=272 Identities=11% Similarity=0.029 Sum_probs=156.8
Q ss_pred CceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCccccccccc
Q 004574 60 CKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTD 139 (744)
Q Consensus 60 ~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (744)
..+.|+++++++++..+...... ....+.|+|||+.|+++...
T Consensus 9 ~d~~v~~~d~~t~~~~~~~~~~~-----~~~~l~~~~dg~~l~~~~~~-------------------------------- 51 (300)
T TIGR03866 9 KDNTISVIDTATLEVTRTFPVGQ-----RPRGITLSKDGKLLYVCASD-------------------------------- 51 (300)
T ss_pred CCCEEEEEECCCCceEEEEECCC-----CCCceEECCCCCEEEEEECC--------------------------------
Confidence 34588888998887655443322 23467999999987665211
Q ss_pred ccCCCchhhhccceeeeeEEEEEcC-CCCeee-cCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC
Q 004574 140 NLLKDEYDESLFDYYTTAQLVLGSL-DGTAKD-FGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD 217 (744)
Q Consensus 140 ~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~-l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 217 (744)
...|+++++ +|+... +........+.|+|||+.+++..... ..+++||+.
T Consensus 52 ----------------~~~v~~~d~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~~~~~------------~~l~~~d~~ 103 (300)
T TIGR03866 52 ----------------SDTIQVIDLATGEVIGTLPSGPDPELFALHPNGKILYIANEDD------------NLVTVIDIE 103 (300)
T ss_pred ----------------CCeEEEEECCCCcEEEeccCCCCccEEEECCCCCEEEEEcCCC------------CeEEEEECC
Confidence 046888888 664433 43333445788999999887664321 379999987
Q ss_pred CCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee
Q 004574 218 GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL 297 (744)
Q Consensus 218 g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 297 (744)
..+......... .+..++|+|||+. ++.... + ...++.++. .+++.......
T Consensus 104 ~~~~~~~~~~~~-------------~~~~~~~~~dg~~-l~~~~~-~----------~~~~~~~d~---~~~~~~~~~~~ 155 (300)
T TIGR03866 104 TRKVLAEIPVGV-------------EPEGMAVSPDGKI-VVNTSE-T----------TNMAHFIDT---KTYEIVDNVLV 155 (300)
T ss_pred CCeEEeEeeCCC-------------CcceEEECCCCCE-EEEEec-C----------CCeEEEEeC---CCCeEEEEEEc
Confidence 654322111111 1456899999986 554431 1 112445565 33333222222
Q ss_pred ccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCC-ceeeCCCCCeEEEEeeecCC
Q 004574 298 DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSP-MMTRTSTGTNVIAKIKKEND 376 (744)
Q Consensus 298 ~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~~~spdg~~l~~~~~~~~~ 376 (744)
......++|+|||+.|++... ....++++|+.++ .....+.... .........+ .+.++|||+.++....
T Consensus 156 ~~~~~~~~~s~dg~~l~~~~~---~~~~v~i~d~~~~-~~~~~~~~~~-~~~~~~~~~~~~i~~s~dg~~~~~~~~---- 226 (300)
T TIGR03866 156 DQRPRFAEFTADGKELWVSSE---IGGTVSVIDVATR-KVIKKITFEI-PGVHPEAVQPVGIKLTKDGKTAFVALG---- 226 (300)
T ss_pred CCCccEEEECCCCCEEEEEcC---CCCEEEEEEcCcc-eeeeeeeecc-cccccccCCccceEECCCCCEEEEEcC----
Confidence 334566899999998877542 2235888888774 1212222111 0000000111 2568999998765431
Q ss_pred cceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEE
Q 004574 377 EQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHI 456 (744)
Q Consensus 377 ~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~ 456 (744)
....|.+||..+++.......... ...++|+|+|+.|+.... .-+.|.+
T Consensus 227 ----------------~~~~i~v~d~~~~~~~~~~~~~~~------------~~~~~~~~~g~~l~~~~~---~~~~i~v 275 (300)
T TIGR03866 227 ----------------PANRVAVVDAKTYEVLDYLLVGQR------------VWQLAFTPDEKYLLTTNG---VSNDVSV 275 (300)
T ss_pred ----------------CCCeEEEEECCCCcEEEEEEeCCC------------cceEEECCCCCEEEEEcC---CCCeEEE
Confidence 122488999988876543321111 122589999987664322 2346999
Q ss_pred EECCCCce
Q 004574 457 LSWPLKKS 464 (744)
Q Consensus 457 ~~~~~g~~ 464 (744)
+|+.+++.
T Consensus 276 ~d~~~~~~ 283 (300)
T TIGR03866 276 IDVAALKV 283 (300)
T ss_pred EECCCCcE
Confidence 99988775
No 92
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=99.52 E-value=3.7e-12 Score=127.40 Aligned_cols=293 Identities=13% Similarity=0.107 Sum_probs=156.8
Q ss_pred ECCC-CceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCch
Q 004574 68 DAET-GEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEY 146 (744)
Q Consensus 68 ~~~g-g~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (744)
|.+| -+.+|||..+..+..-.+....|.+||++|+|.+... +
T Consensus 16 D~~TG~~VtrLT~~~~~~h~~YF~~~~ft~dG~kllF~s~~d-g------------------------------------ 58 (386)
T PF14583_consen 16 DPDTGHRVTRLTPPDGHSHRLYFYQNCFTDDGRKLLFASDFD-G------------------------------------ 58 (386)
T ss_dssp -TTT--EEEE-S-TTS-EE---TTS--B-TTS-EEEEEE-TT-S------------------------------------
T ss_pred CCCCCceEEEecCCCCcccceeecCCCcCCCCCEEEEEeccC-C------------------------------------
Confidence 4454 4689999887755443466789999999999986531 1
Q ss_pred hhhccceeeeeEEEEEcC-CCCeeecCCC--ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeee
Q 004574 147 DESLFDYYTTAQLVLGSL-DGTAKDFGTP--AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRE 223 (744)
Q Consensus 147 ~~~~~~~~~~~~l~~~~~-~G~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~ 223 (744)
..++|++|+ +|+++|||+. ....+..+||+++.|+|..+.. +|+.+|+++.+.+.
T Consensus 59 ---------~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~~-------------~l~~vdL~T~e~~~ 116 (386)
T PF14583_consen 59 ---------NRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKNGR-------------SLRRVDLDTLEERV 116 (386)
T ss_dssp ---------S-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEETTT-------------EEEEEETTT--EEE
T ss_pred ---------CcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEECCC-------------eEEEEECCcCcEEE
Confidence 169999999 8899999997 3344678999999999986442 79999999998888
Q ss_pred ccCCCCCCCCCcccCCccCCCCccce--ecCCCceEEEEEeecCCC----------CCccCCccceEEeccCCCCCCCCc
Q 004574 224 LCDLPPAEDIPVCYNSVREGMRSISW--RADKPSTLYWVEAQDRGD----------ANVEVSPRDIIYTQPAEPAEGEKP 291 (744)
Q Consensus 224 l~~~~~~~~~~~~~~~~~~~~~~~~~--spDg~~~l~~~~~~~~~~----------~~~~~~~~~~l~~~~~~~~~~~~~ 291 (744)
|...+..- .....| ..|++. ++.++...... +.+......+|+.+++ ++|+.
T Consensus 117 vy~~p~~~------------~g~gt~v~n~d~t~-~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl---~tG~~ 180 (386)
T PF14583_consen 117 VYEVPDDW------------KGYGTWVANSDCTK-LVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDL---KTGER 180 (386)
T ss_dssp EEE--TTE------------EEEEEEEE-TTSSE-EEEEEEEGGG-----SHHHHHHHHHC---EEEEEEET---TT--E
T ss_pred EEECCccc------------ccccceeeCCCccE-EEEEEEeehhccCccccHHHHHHHhhCCCceEEEEEC---CCCce
Confidence 87665331 001255 456666 66665432211 1112234457999999 78999
Q ss_pred eEeeeeccceeceeeccC-CceEEEeeeeec--cceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEE
Q 004574 292 EILHKLDLRFRSVSWCDD-SLALVNETWYKT--SQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVI 368 (744)
Q Consensus 292 ~~l~~~~~~~~~~~~SpD-g~~l~~~~~~~~--~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~ 368 (744)
+.+.....-++-+.+||. ...|.|.....- -..+||.++.++. ..+.+......... +.. + |+|||+.|.
T Consensus 181 ~~v~~~~~wlgH~~fsP~dp~li~fCHEGpw~~Vd~RiW~i~~dg~--~~~~v~~~~~~e~~---gHE-f-w~~DG~~i~ 253 (386)
T PF14583_consen 181 KVVFEDTDWLGHVQFSPTDPTLIMFCHEGPWDLVDQRIWTINTDGS--NVKKVHRRMEGESV---GHE-F-WVPDGSTIW 253 (386)
T ss_dssp EEEEEESS-EEEEEEETTEEEEEEEEE-S-TTTSS-SEEEEETTS-----EESS---TTEEE---EEE-E-E-TTSS-EE
T ss_pred eEEEecCccccCcccCCCCCCEEEEeccCCcceeceEEEEEEcCCC--cceeeecCCCCccc---ccc-c-ccCCCCEEE
Confidence 999888888888999987 556777663221 2348999999884 33444333222111 111 2 999999999
Q ss_pred EEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecC
Q 004574 369 AKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESK 448 (744)
Q Consensus 369 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~ 448 (744)
|..... .+....|+.+|+.+++.+.+..-+ ....+ ..|+||+.++=--...
T Consensus 254 y~~~~~----------------~~~~~~i~~~d~~t~~~~~~~~~p------~~~H~-------~ss~Dg~L~vGDG~d~ 304 (386)
T PF14583_consen 254 YDSYTP----------------GGQDFWIAGYDPDTGERRRLMEMP------WCSHF-------MSSPDGKLFVGDGGDA 304 (386)
T ss_dssp EEEEET----------------TT--EEEEEE-TTT--EEEEEEE-------SEEEE-------EE-TTSSEEEEEE---
T ss_pred EEeecC----------------CCCceEEEeeCCCCCCceEEEeCC------ceeee-------EEcCCCCEEEecCCCC
Confidence 986432 133456889999998877654332 12233 4567776544221110
Q ss_pred ------------CCCceEEEEECCCCceeeeecCC
Q 004574 449 ------------TEITQYHILSWPLKKSSQITNFP 471 (744)
Q Consensus 449 ------------~~~~~i~~~~~~~g~~~~lt~~~ 471 (744)
..-+-||++++..++...|....
T Consensus 305 p~~v~~~~~~~~~~~p~i~~~~~~~~~~~~l~~h~ 339 (386)
T PF14583_consen 305 PVDVADAGGYKIENDPWIYLFDVEAGRFRKLARHD 339 (386)
T ss_dssp ----------------EEEEEETTTTEEEEEEE--
T ss_pred CccccccccceecCCcEEEEeccccCceeeeeecc
Confidence 11236888888888877776653
No 93
>PF02129 Peptidase_S15: X-Pro dipeptidyl-peptidase (S15 family); InterPro: IPR000383 This entry represents a domain found peptidases Xaa-Pro dipeptidyl-peptidase and glutaryl-7-aminocephalosporanic-acid acylase, which belong to MEROPS peptidase families S15 and S45 respectively []. It is also found in hydrolases from the CocE/NonD family. Cocaine esterase (CocE) hydrolyzes cocaine endowing the bacteria with the ability to utilise cocaine as a sole source of carbon and energy []. ; GO: 0004177 aminopeptidase activity, 0006508 proteolysis; PDB: 1LNS_A 3PUI_A 3PUH_B 1JU3_A 3I2I_A 3I2G_A 1JU4_A 3I2K_A 3IDA_A 3I2H_A ....
Probab=99.51 E-value=3.8e-13 Score=135.46 Aligned_cols=205 Identities=25% Similarity=0.365 Sum_probs=127.1
Q ss_pred CCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCC
Q 004574 491 DGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEG 570 (744)
Q Consensus 491 ~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g 570 (744)
||.+|.+.+|+| +. ...+++|+||..|+-+...... .... ............|+++||+|+. .+.+|.|
T Consensus 1 DGv~L~adv~~P-~~--~~~~~~P~il~~tpY~~~~~~~-~~~~----~~~~~~~~~~~~~~~~GY~vV~---~D~RG~g 69 (272)
T PF02129_consen 1 DGVRLAADVYRP-GA--DGGGPFPVILTRTPYGKGDQTA-SDLA----GANPGPPSARRPFAERGYAVVV---QDVRGTG 69 (272)
T ss_dssp TS-EEEEEEEEE-----TTSSSEEEEEEEESSTCTC-HH-HHHH----TTCHHSHGGGHHHHHTT-EEEE---EE-TTST
T ss_pred CCCEEEEEEEec-CC--CCCCcccEEEEccCcCCCCCcc-cchh----hhhcccchhHHHHHhCCCEEEE---ECCcccc
Confidence 688999999999 21 1256799999999854110000 0000 0000000112238999999998 3444433
Q ss_pred CC---------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC-C--Ccc
Q 004574 571 DK---------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT-P--FGF 638 (744)
Q Consensus 571 ~~---------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~-~--~~~ 638 (744)
.+ ...+|..++|+|+.++++-+ .||+++|.|++|+.++.+|+..|..++|++...+..|.... . .+.
T Consensus 70 ~S~G~~~~~~~~e~~D~~d~I~W~~~Qpws~-G~VGm~G~SY~G~~q~~~A~~~~p~LkAi~p~~~~~d~~~~~~~~gG~ 148 (272)
T PF02129_consen 70 GSEGEFDPMSPNEAQDGYDTIEWIAAQPWSN-GKVGMYGISYGGFTQWAAAARRPPHLKAIVPQSGWSDLYRDSIYPGGA 148 (272)
T ss_dssp TS-S-B-TTSHHHHHHHHHHHHHHHHCTTEE-EEEEEEEETHHHHHHHHHHTTT-TTEEEEEEESE-SBTCCTSSEETTE
T ss_pred cCCCccccCChhHHHHHHHHHHHHHhCCCCC-CeEEeeccCHHHHHHHHHHhcCCCCceEEEecccCCcccccchhcCCc
Confidence 32 12239999999999997765 69999999999999999999888999999999887664431 0 000
Q ss_pred ccc-ccchh---------------------h-------------------------------cHHHHHhcCcccccCCCC
Q 004574 639 QTE-FRTLW---------------------E-------------------------------ATNVYIEMSPITHANKIK 665 (744)
Q Consensus 639 ~~~-~~~~~---------------------~-------------------------------~~~~~~~~~~~~~~~~~~ 665 (744)
... ....| . ..+.+.+.++...+.+++
T Consensus 149 ~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~i~ 228 (272)
T PF02129_consen 149 FRLGFFAGWEDLQSQQEDPQSRPAPDRDYLRERARYEALGDSPLGRLPRDPPYWDEWLDHPPYDPFWQERSPSERLDKID 228 (272)
T ss_dssp EBCCHHHHHHHHHHHHHHHTCCCCSSSHHHHHHHHHHCHHHHHHHHCHGGTHHHHHHHHT-SSSHHHHTTBHHHHHGG--
T ss_pred ccccchhHHHHHHHHhhcccCCCchhhhhhhhhhhhhhhhhHHHhhhccccHHHHHHHhCCCcCHHHHhCChHHHHhhCC
Confidence 000 00000 0 001122223444568899
Q ss_pred CCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCC-CcEEEEEeCCCCcc
Q 004574 666 KPILIIHGEVDDKVGLFPMQAERFFDALKGHG-ALSRLVLLPFEHHV 711 (744)
Q Consensus 666 ~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~~H~ 711 (744)
+|+|++.|..|... ...+.+.+++|+..+ .+.++++-|.. |+
T Consensus 229 vP~l~v~Gw~D~~~---~~~~~~~~~~l~~~~~~~~~Liigpw~-H~ 271 (272)
T PF02129_consen 229 VPVLIVGGWYDTLF---LRGALRAYEALRAPGSKPQRLIIGPWT-HG 271 (272)
T ss_dssp SEEEEEEETTCSST---SHHHHHHHHHHCTTSTC-EEEEEESES-TT
T ss_pred CCEEEecccCCccc---chHHHHHHHHhhcCCCCCCEEEEeCCC-CC
Confidence 99999999999665 478899999999888 67788888764 74
No 94
>PF08840 BAAT_C: BAAT / Acyl-CoA thioester hydrolase C terminal; InterPro: IPR014940 Acyl-CoA thioesterases are a group of enzymes that catalyse the hydrolysis of acyl-CoAs to the free fatty acid and coenzyme A (CoASH), providing the potential to regulate intracellular levels of acyl-CoAs, free fatty acids and CoASH. Bile acid-CoA:amino acid N-acetyltransferase (BAAT) is involved in bile acid metabolism and may also act as an acyl-CoA thioesterase that regulates intracellular levels of free fatty acids []. This entry represents a catalytic domain is found at the C terminus of acyl-CoA thioester hydrolases and bile acid-CoA:amino acid N-acetyltransferases. ; PDB: 3K2I_B 3HLK_B.
Probab=99.50 E-value=2e-13 Score=130.74 Aligned_cols=156 Identities=22% Similarity=0.183 Sum_probs=95.7
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC---------CCcccccc--cch
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT---------PFGFQTEF--RTL 645 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~---------~~~~~~~~--~~~ 645 (744)
-++++++||++++.++.++|+|+|.|.||-+|+.+|+..| .++++|+++|..-.... +..+.... ...
T Consensus 5 yfe~Ai~~L~~~p~v~~~~Igi~G~SkGaelALllAs~~~-~i~avVa~~ps~~~~~~~~~~~~~~~~lp~~~~~~~~~~ 83 (213)
T PF08840_consen 5 YFEEAIDWLKSHPEVDPDKIGIIGISKGAELALLLASRFP-QISAVVAISPSSVVFQGIGFYRDSSKPLPYLPFDISKFS 83 (213)
T ss_dssp HHHHHHHHHHCSTTB--SSEEEEEETHHHHHHHHHHHHSS-SEEEEEEES--SB--SSEEEETTE--EE----B-GGG-E
T ss_pred HHHHHHHHHHhCCCCCCCCEEEEEECHHHHHHHHHHhcCC-CccEEEEeCCceeEecchhcccCCCccCCcCCcChhhce
Confidence 4789999999999999999999999999999999999996 89999999885321110 00000000 000
Q ss_pred hhc---------HHHH---HhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCc--EEEEEeCCCCcc
Q 004574 646 WEA---------TNVY---IEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGAL--SRLVLLPFEHHV 711 (744)
Q Consensus 646 ~~~---------~~~~---~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~H~ 711 (744)
+.. .... ......-.+.++++|+|+++|++|...|. ...++.+.++|++.+.+ ++++.||++||.
T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~a~IpvE~i~~piLli~g~dD~~WpS-~~~a~~i~~rL~~~~~~~~~~~l~Y~~aGH~ 162 (213)
T PF08840_consen 84 WNEPGLLRSRYAFELADDKAVEEARIPVEKIKGPILLISGEDDQIWPS-SEMAEQIEERLKAAGFPHNVEHLSYPGAGHL 162 (213)
T ss_dssp E-TTS-EE-TT-B--TTTGGGCCCB--GGG--SEEEEEEETT-SSS-H-HHHHHHHHHHHHCTT-----EEEEETTB-S-
T ss_pred ecCCcceehhhhhhcccccccccccccHHHcCCCEEEEEeCCCCccch-HHHHHHHHHHHHHhCCCCcceEEEcCCCCce
Confidence 000 0000 00111223667899999999999999982 45577788889988865 799999999997
Q ss_pred cCcc---------------------------ccHHHHHHHHHHHHHHhcc
Q 004574 712 YAAR---------------------------ENVMHVIWETDRWLQKYCL 734 (744)
Q Consensus 712 ~~~~---------------------------~~~~~~~~~~~~fl~~~l~ 734 (744)
+..+ ...++.++.+++||.++|+
T Consensus 163 i~~Py~P~~~~~~~~~~~~~~~~GG~~~~~a~A~~dsW~~~l~Fl~~~L~ 212 (213)
T PF08840_consen 163 IEPPYFPHCRASYHKFIGTPLAWGGEPEAHAKAQEDSWKKILEFLRKHLG 212 (213)
T ss_dssp --STT-----EEEETTTTEEEE--B-HHHHHHHHHHHHHHHHHHHHHH--
T ss_pred ecCCCCCCcccccccccCCcccCCCChHHHHHHHHHHHHHHHHHHHHHhC
Confidence 6421 1235678889999999985
No 95
>PRK06489 hypothetical protein; Provisional
Probab=99.49 E-value=1.2e-12 Score=137.82 Aligned_cols=167 Identities=19% Similarity=0.234 Sum_probs=104.9
Q ss_pred hCCeEEEecCCCCCCCCCCCC-------------hHHHHHHH-HHHHHHcCCCCCCcEE-EEEechHHHHHHHHHHhCCC
Q 004574 553 ARRFAVLAGPSIPIIGEGDKL-------------PNDSAEAA-VEEVVRRGVADPSRIA-VGGHSYGAFMTAHLLAHAPH 617 (744)
Q Consensus 553 ~~G~~v~~~~~~~~~g~g~~~-------------~~~d~~~~-~~~l~~~~~~d~~~i~-l~G~S~GG~~a~~~~~~~p~ 617 (744)
..+|.|++ .+.+|+|.+. ..+++.+. +..+.+...+ +++. |+||||||++|+.++.++|+
T Consensus 103 ~~~~~Via---~Dl~GhG~S~~p~~~~~~~~~~~~~~~~a~~~~~~l~~~lgi--~~~~~lvG~SmGG~vAl~~A~~~P~ 177 (360)
T PRK06489 103 ASKYFIIL---PDGIGHGKSSKPSDGLRAAFPRYDYDDMVEAQYRLVTEGLGV--KHLRLILGTSMGGMHAWMWGEKYPD 177 (360)
T ss_pred ccCCEEEE---eCCCCCCCCCCCCcCCCCCCCcccHHHHHHHHHHHHHHhcCC--CceeEEEEECHHHHHHHHHHHhCch
Confidence 67899999 5566666542 12233332 3334343333 4675 89999999999999999999
Q ss_pred ceeEEEEccCCCCCC------C-----------CCC---cccc-c-c---------------------cchh-hc-----
Q 004574 618 LFCCGIARSGSYNKT------L-----------TPF---GFQT-E-F---------------------RTLW-EA----- 648 (744)
Q Consensus 618 ~~~~~v~~~~~~~~~------~-----------~~~---~~~~-~-~---------------------~~~~-~~----- 648 (744)
+++++|++++..... . ..+ .+.. . . .... ..
T Consensus 178 ~V~~LVLi~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 257 (360)
T PRK06489 178 FMDALMPMASQPTEMSGRNWMWRRMLIESIRNDPAWNNGNYTTQPPSLKRANPMFAIATSGGTLAYQAQAPTRAAADKLV 257 (360)
T ss_pred hhheeeeeccCcccccHHHHHHHHHHHHHHHhCCCCCCCCCCCCHHHHHHHHHHHHHHHhCCHHHHHHhcCChHHHHHHH
Confidence 999999887642100 0 000 0000 0 0 0000 00
Q ss_pred -----------HHHHH-------hcCcccccCCCCCCEEEEeeCCCCCCCCCHHHH--HHHHHHHHhCCCcEEEEEeCCC
Q 004574 649 -----------TNVYI-------EMSPITHANKIKKPILIIHGEVDDKVGLFPMQA--ERFFDALKGHGALSRLVLLPFE 708 (744)
Q Consensus 649 -----------~~~~~-------~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~ 708 (744)
...+. ..+....+.++++|+|+++|++|..+| ...+ +++.+.+. +.+++++|++
T Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~L~~I~~PvLvI~G~~D~~~p--~~~~~~~~la~~ip----~a~l~~i~~a 331 (360)
T PRK06489 258 DERLAAPVTADANDFLYQWDSSRDYNPSPDLEKIKAPVLAINSADDERNP--PETGVMEAALKRVK----HGRLVLIPAS 331 (360)
T ss_pred HHHHHhhhhcCHHHHHHHHHHhhccChHHHHHhCCCCEEEEecCCCcccC--hhhHHHHHHHHhCc----CCeEEEECCC
Confidence 00000 011223356789999999999999988 6654 55555543 3589999996
Q ss_pred ----CcccCccccHHHHHHHHHHHHHHh
Q 004574 709 ----HHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 709 ----~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
||... +.++.+.+.+.+||+++
T Consensus 332 ~~~~GH~~~--e~P~~~~~~i~~FL~~~ 357 (360)
T PRK06489 332 PETRGHGTT--GSAKFWKAYLAEFLAQV 357 (360)
T ss_pred CCCCCcccc--cCHHHHHHHHHHHHHhc
Confidence 99875 47889999999999865
No 96
>PLN02679 hydrolase, alpha/beta fold family protein
Probab=99.49 E-value=1.3e-12 Score=137.19 Aligned_cols=194 Identities=15% Similarity=0.148 Sum_probs=117.5
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCCh-------HHHHHHHH-HH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLP-------NDSAEAAV-EE 584 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~-------~~d~~~~~-~~ 584 (744)
.|.||++||.+.. ...| ..++..| ..+|.|++ .+.+|+|.+.. .++..+.+ ++
T Consensus 88 gp~lvllHG~~~~--------------~~~w-~~~~~~L-~~~~~via---~Dl~G~G~S~~~~~~~~~~~~~a~~l~~~ 148 (360)
T PLN02679 88 GPPVLLVHGFGAS--------------IPHW-RRNIGVL-AKNYTVYA---IDLLGFGASDKPPGFSYTMETWAELILDF 148 (360)
T ss_pred CCeEEEECCCCCC--------------HHHH-HHHHHHH-hcCCEEEE---ECCCCCCCCCCCCCccccHHHHHHHHHHH
Confidence 3689999997521 1111 1223334 45899999 55666665421 12233333 33
Q ss_pred HHHcCCCCCCcEEEEEechHHHHHHHHHH-hCCCceeEEEEccCCCCCCCC----CCc-------------------cc-
Q 004574 585 VVRRGVADPSRIAVGGHSYGAFMTAHLLA-HAPHLFCCGIARSGSYNKTLT----PFG-------------------FQ- 639 (744)
Q Consensus 585 l~~~~~~d~~~i~l~G~S~GG~~a~~~~~-~~p~~~~~~v~~~~~~~~~~~----~~~-------------------~~- 639 (744)
+.+. ..+++.|+||||||.+++.++. .+|++++++|++++....... .+. ..
T Consensus 149 l~~l---~~~~~~lvGhS~Gg~ia~~~a~~~~P~rV~~LVLi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (360)
T PLN02679 149 LEEV---VQKPTVLIGNSVGSLACVIAASESTRDLVRGLVLLNCAGGMNNKAVVDDWRIKLLLPLLWLIDFLLKQRGIAS 225 (360)
T ss_pred HHHh---cCCCeEEEEECHHHHHHHHHHHhcChhhcCEEEEECCccccccccccchHHHhhhcchHHHHHHHhhchhhHH
Confidence 3322 2368999999999999998887 468999999998864311000 000 00
Q ss_pred ---cc---ccch-----------h----hc-------------HHHHHhc-------CcccccCCCCCCEEEEeeCCCCC
Q 004574 640 ---TE---FRTL-----------W----EA-------------TNVYIEM-------SPITHANKIKKPILIIHGEVDDK 678 (744)
Q Consensus 640 ---~~---~~~~-----------~----~~-------------~~~~~~~-------~~~~~~~~~~~P~l~i~G~~D~~ 678 (744)
.. .... . +. ...+... +....+.++++|+|+++|++|..
T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~PtLii~G~~D~~ 305 (360)
T PLN02679 226 ALFNRVKQRDNLKNILLSVYGNKEAVDDELVEIIRGPADDEGALDAFVSIVTGPPGPNPIKLIPRISLPILVLWGDQDPF 305 (360)
T ss_pred HHHHHhcCHHHHHHHHHHhccCcccCCHHHHHHHHhhccCCChHHHHHHHHhcCCCCCHHHHhhhcCCCEEEEEeCCCCC
Confidence 00 0000 0 00 0000000 11123567899999999999999
Q ss_pred CCCCHHHH-HHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 679 VGLFPMQA-ERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 679 v~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
+| .... .++++.+.+.-.+.+++++++++|..+ .+.++.+++.+.+||++
T Consensus 306 ~p--~~~~~~~~~~~l~~~ip~~~l~~i~~aGH~~~-~E~Pe~~~~~I~~FL~~ 356 (360)
T PLN02679 306 TP--LDGPVGKYFSSLPSQLPNVTLYVLEGVGHCPH-DDRPDLVHEKLLPWLAQ 356 (360)
T ss_pred cC--chhhHHHHHHhhhccCCceEEEEcCCCCCCcc-ccCHHHHHHHHHHHHHh
Confidence 88 6532 234455555445689999999999877 67789999999999975
No 97
>COG2945 Predicted hydrolase of the alpha/beta superfamily [General function prediction only]
Probab=99.49 E-value=1.2e-12 Score=115.51 Aligned_cols=196 Identities=17% Similarity=0.217 Sum_probs=128.1
Q ss_pred eEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEec
Q 004574 482 KEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAG 561 (744)
Q Consensus 482 ~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~ 561 (744)
+..+.+....| .+.+ .|.|++ ....|+.|.+|..+...+.. ........+..|.++||+++..
T Consensus 4 ~~~v~i~Gp~G-~le~-~~~~~~-----~~~~~iAli~HPHPl~gGtm----------~nkvv~~la~~l~~~G~atlRf 66 (210)
T COG2945 4 MPTVIINGPAG-RLEG-RYEPAK-----TPAAPIALICHPHPLFGGTM----------NNKVVQTLARALVKRGFATLRF 66 (210)
T ss_pred CCcEEecCCcc-ccee-ccCCCC-----CCCCceEEecCCCccccCcc----------CCHHHHHHHHHHHhCCceEEee
Confidence 44566666555 3555 555554 33479999999864221111 1111123456778999999994
Q ss_pred CCCC------CCCCCCCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC
Q 004574 562 PSIP------IIGEGDKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP 635 (744)
Q Consensus 562 ~~~~------~~g~g~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~ 635 (744)
+.++ ..+.|..+. +|+.++++|++++..- ..-..++|+|+|+++++.+|.+.|+ ....+.+.|..+.+ .
T Consensus 67 NfRgVG~S~G~fD~GiGE~-~Da~aaldW~~~~hp~-s~~~~l~GfSFGa~Ia~~la~r~~e-~~~~is~~p~~~~~--d 141 (210)
T COG2945 67 NFRGVGRSQGEFDNGIGEL-EDAAAALDWLQARHPD-SASCWLAGFSFGAYIAMQLAMRRPE-ILVFISILPPINAY--D 141 (210)
T ss_pred cccccccccCcccCCcchH-HHHHHHHHHHHhhCCC-chhhhhcccchHHHHHHHHHHhccc-ccceeeccCCCCch--h
Confidence 4433 122333334 4899999999997532 2235889999999999999999865 34455566654411 0
Q ss_pred CcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcc
Q 004574 636 FGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAR 715 (744)
Q Consensus 636 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~ 715 (744)
...+....+|.++++|+.|.++. .....++++ +.+..++++++++|.|+.
T Consensus 142 ----------------------fs~l~P~P~~~lvi~g~~Ddvv~--l~~~l~~~~-----~~~~~~i~i~~a~HFF~g- 191 (210)
T COG2945 142 ----------------------FSFLAPCPSPGLVIQGDADDVVD--LVAVLKWQE-----SIKITVITIPGADHFFHG- 191 (210)
T ss_pred ----------------------hhhccCCCCCceeEecChhhhhc--HHHHHHhhc-----CCCCceEEecCCCceecc-
Confidence 01233456899999999999887 555554443 266789999999998863
Q ss_pred ccHHHHHHHHHHHHH
Q 004574 716 ENVMHVIWETDRWLQ 730 (744)
Q Consensus 716 ~~~~~~~~~~~~fl~ 730 (744)
+.....+.+.+|+.
T Consensus 192 -Kl~~l~~~i~~~l~ 205 (210)
T COG2945 192 -KLIELRDTIADFLE 205 (210)
T ss_pred -cHHHHHHHHHHHhh
Confidence 45677788888884
No 98
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.48 E-value=2.7e-12 Score=124.78 Aligned_cols=197 Identities=15% Similarity=0.149 Sum_probs=131.2
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.+....||++|+|||-++. +...-||.+..++. .+..-...+ +...|..+.||||.++|+....+
T Consensus 226 EVWfl~FS~nGkyLAsaSk--------D~Taiiw~v~~d~~-~kl~~tlvg--h~~~V~yi~wSPDdryLlaCg~~---- 290 (519)
T KOG0293|consen 226 EVWFLQFSHNGKYLASASK--------DSTAIIWIVVYDVH-FKLKKTLVG--HSQPVSYIMWSPDDRYLLACGFD---- 290 (519)
T ss_pred cEEEEEEcCCCeeEeeccC--------CceEEEEEEecCcc-eeeeeeeec--ccCceEEEEECCCCCeEEecCch----
Confidence 5889999999999999764 45567888876654 221111111 12247788999999999876321
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC---ceeeeeccCCCCc
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP---AVYTAVEPSPDQK 187 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~---~~~~~~~~SpDG~ 187 (744)
..++++|+ +|+.+.+-.. ....+.+|-|||.
T Consensus 291 ---------------------------------------------e~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~pDg~ 325 (519)
T KOG0293|consen 291 ---------------------------------------------EVLSLWDVDTGDLRHLYPSGLGFSVSSCAWCPDGF 325 (519)
T ss_pred ---------------------------------------------HheeeccCCcchhhhhcccCcCCCcceeEEccCCc
Confidence 34788899 7855544332 4667889999999
Q ss_pred eEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCC
Q 004574 188 YVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGD 267 (744)
Q Consensus 188 ~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~ 267 (744)
+++..+.+ ..+..|+++|.....--... .|. +.+++.++||++ ++.+.
T Consensus 326 ~~V~Gs~d-------------r~i~~wdlDgn~~~~W~gvr----~~~--------v~dlait~Dgk~-vl~v~------ 373 (519)
T KOG0293|consen 326 RFVTGSPD-------------RTIIMWDLDGNILGNWEGVR----DPK--------VHDLAITYDGKY-VLLVT------ 373 (519)
T ss_pred eeEecCCC-------------CcEEEecCCcchhhcccccc----cce--------eEEEEEcCCCcE-EEEEe------
Confidence 98766544 37999999998532211110 121 567899999998 55542
Q ss_pred CCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 268 ANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 268 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
...+|.+++. ++..-+.+......+.+++.|.||++++..- . ..++..+|++.
T Consensus 374 ------~d~~i~l~~~---e~~~dr~lise~~~its~~iS~d~k~~LvnL--~--~qei~LWDl~e 426 (519)
T KOG0293|consen 374 ------VDKKIRLYNR---EARVDRGLISEEQPITSFSISKDGKLALVNL--Q--DQEIHLWDLEE 426 (519)
T ss_pred ------cccceeeech---hhhhhhccccccCceeEEEEcCCCcEEEEEc--c--cCeeEEeecch
Confidence 1124777776 3333344666778889999999999877644 2 23566667765
No 99
>TIGR01250 pro_imino_pep_2 proline-specific peptidases, Bacillus coagulans-type subfamily. This model describes a subfamily of the alpha/beta fold family of hydrolases. Characterized members include prolinases (Pro-Xaa dipeptidase, EC 3.4.13.8), prolyl aminopeptidases (EC 3.4.11.5), and a leucyl aminopeptidase
Probab=99.47 E-value=1.4e-12 Score=133.33 Aligned_cols=191 Identities=18% Similarity=0.140 Sum_probs=114.1
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC---------hHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL---------PNDSAEAAVE 583 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~---------~~~d~~~~~~ 583 (744)
.|.||++||++... ..+ .......+.+.||.|+. .+.+|+|.+. ..+++.+.+.
T Consensus 25 ~~~vl~~hG~~g~~-----------~~~---~~~~~~~l~~~g~~vi~---~d~~G~G~s~~~~~~~~~~~~~~~~~~~~ 87 (288)
T TIGR01250 25 KIKLLLLHGGPGMS-----------HEY---LENLRELLKEEGREVIM---YDQLGCGYSDQPDDSDELWTIDYFVDELE 87 (288)
T ss_pred CCeEEEEcCCCCcc-----------HHH---HHHHHHHHHhcCCEEEE---EcCCCCCCCCCCCcccccccHHHHHHHHH
Confidence 47789999964110 011 11222333445999999 4445554432 1234444444
Q ss_pred HHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC-------Ccccc---------cccchhh
Q 004574 584 EVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP-------FGFQT---------EFRTLWE 647 (744)
Q Consensus 584 ~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~-------~~~~~---------~~~~~~~ 647 (744)
.+.+.. +.+++.++||||||.+++.++..+|++++++|+.+++....... ..+.. .....+.
T Consensus 88 ~~~~~~--~~~~~~liG~S~Gg~ia~~~a~~~p~~v~~lvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (288)
T TIGR01250 88 EVREKL--GLDKFYLLGHSWGGMLAQEYALKYGQHLKGLIISSMLDSAPEYVKELNRLRKELPPEVRAAIKRCEASGDYD 165 (288)
T ss_pred HHHHHc--CCCcEEEEEeehHHHHHHHHHHhCccccceeeEecccccchHHHHHHHHHHhhcChhHHHHHHHHHhccCcc
Confidence 444442 23579999999999999999999999999999887753210000 00000 0000000
Q ss_pred c------H---------------H------------HHH---------------hcCcccccCCCCCCEEEEeeCCCCCC
Q 004574 648 A------T---------------N------------VYI---------------EMSPITHANKIKKPILIIHGEVDDKV 679 (744)
Q Consensus 648 ~------~---------------~------------~~~---------------~~~~~~~~~~~~~P~l~i~G~~D~~v 679 (744)
. . . .+. .......+.++++|+|+++|++|..
T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~P~lii~G~~D~~- 244 (288)
T TIGR01250 166 NPEYQEAVEVFYHHLLCRTRKWPEALKHLKSGMNTNVYNIMQGPNEFTITGNLKDWDITDKLSEIKVPTLLTVGEFDTM- 244 (288)
T ss_pred hHHHHHHHHHHHHHhhcccccchHHHHHHhhccCHHHHhcccCCccccccccccccCHHHHhhccCCCEEEEecCCCcc-
Confidence 0 0 0 000 0011123457899999999999984
Q ss_pred CCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 680 GLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 680 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
+ ...++.+.+.+. .++++++++++|... .+.+.++.+.+.+||+
T Consensus 245 ~--~~~~~~~~~~~~----~~~~~~~~~~gH~~~-~e~p~~~~~~i~~fl~ 288 (288)
T TIGR01250 245 T--PEAAREMQELIA----GSRLVVFPDGSHMTM-IEDPEVYFKLLSDFIR 288 (288)
T ss_pred C--HHHHHHHHHhcc----CCeEEEeCCCCCCcc-cCCHHHHHHHHHHHhC
Confidence 4 566666655443 457899999999877 5678899999999973
No 100
>TIGR03695 menH_SHCHC 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase. This protein catalyzes the formation of SHCHC, or (1 R,6 R)-2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate, by elmination of pyruvate from 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate (SEPHCHC). Note that SHCHC synthase activity previously was attributed to MenD, which in fact is SEPHCHC synthase.
Probab=99.46 E-value=8.6e-13 Score=131.64 Aligned_cols=187 Identities=18% Similarity=0.123 Sum_probs=113.2
Q ss_pred eEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCCh-----HHHHHHHHHH----
Q 004574 514 PCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLP-----NDSAEAAVEE---- 584 (744)
Q Consensus 514 p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~-----~~d~~~~~~~---- 584 (744)
|+||++||.+... ..| ......|+ +||.|+. .+.+|+|.+.. ..++.+.++.
T Consensus 2 ~~vv~~hG~~~~~-----------~~~----~~~~~~L~-~~~~v~~---~d~~g~G~s~~~~~~~~~~~~~~~~~~~~~ 62 (251)
T TIGR03695 2 PVLVFLHGFLGSG-----------ADW----QALIELLG-PHFRCLA---IDLPGHGSSQSPDEIERYDFEEAAQDILAT 62 (251)
T ss_pred CEEEEEcCCCCch-----------hhH----HHHHHHhc-ccCeEEE---EcCCCCCCCCCCCccChhhHHHHHHHHHHH
Confidence 6799999964211 111 23344555 8999999 45555554421 1234444433
Q ss_pred HHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC-----Cc----------------------
Q 004574 585 VVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP-----FG---------------------- 637 (744)
Q Consensus 585 l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~-----~~---------------------- 637 (744)
+.+. .+.+++.++|||+||.+|+.++.++|+.+++++++++........ ..
T Consensus 63 ~~~~--~~~~~~~l~G~S~Gg~ia~~~a~~~~~~v~~lil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 140 (251)
T TIGR03695 63 LLDQ--LGIEPFFLVGYSMGGRIALYYALQYPERVQGLILESGSPGLATEEERAARRQNDEQLAQRFEQEGLEAFLDDWY 140 (251)
T ss_pred HHHH--cCCCeEEEEEeccHHHHHHHHHHhCchheeeeEEecCCCCcCchHhhhhhhhcchhhhhHHHhcCccHHHHHHh
Confidence 3333 345789999999999999999999999999999888753211000 00
Q ss_pred ----ccccccchhhcH----------------HHHHh------cCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHH
Q 004574 638 ----FQTEFRTLWEAT----------------NVYIE------MSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFD 691 (744)
Q Consensus 638 ----~~~~~~~~~~~~----------------~~~~~------~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~ 691 (744)
+........... ..+.. .+....+.++++|+|+++|++|..++ ...+
T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~g~~D~~~~---~~~~---- 213 (251)
T TIGR03695 141 QQPLFASQKNLPPEQRQALRAKRLANNPEGLAKMLRATGLGKQPSLWPKLQALTIPVLYLCGEKDEKFV---QIAK---- 213 (251)
T ss_pred cCceeeecccCChHHhHHHHHhcccccchHHHHHHHHhhhhcccchHHHhhCCCCceEEEeeCcchHHH---HHHH----
Confidence 000000000000 00000 11122356789999999999998642 3333
Q ss_pred HHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHH
Q 004574 692 ALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 692 ~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl 729 (744)
.+.+...+++++++|+++|... .+.++.+...+.+||
T Consensus 214 ~~~~~~~~~~~~~~~~~gH~~~-~e~~~~~~~~i~~~l 250 (251)
T TIGR03695 214 EMQKLLPNLTLVIIANAGHNIH-LENPEAFAKILLAFL 250 (251)
T ss_pred HHHhcCCCCcEEEEcCCCCCcC-ccChHHHHHHHHHHh
Confidence 3444445679999999999876 556778888888887
No 101
>PRK07581 hypothetical protein; Validated
Probab=99.46 E-value=1.2e-12 Score=136.98 Aligned_cols=67 Identities=13% Similarity=-0.033 Sum_probs=55.2
Q ss_pred ccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCC-CCcccCccccHHHHHHHHHHHHHHhc
Q 004574 660 HANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPF-EHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 660 ~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
.+.++++|+|+++|++|..+| ...++.+.+.+. ..+++++++ +||... .+...++.+.+.+||.+.+
T Consensus 270 ~L~~I~~PtLvI~G~~D~~~p--~~~~~~l~~~ip----~a~l~~i~~~~GH~~~-~~~~~~~~~~~~~~~~~~~ 337 (339)
T PRK07581 270 ALGSITAKTFVMPISTDLYFP--PEDCEAEAALIP----NAELRPIESIWGHLAG-FGQNPADIAFIDAALKELL 337 (339)
T ss_pred HHhcCCCCEEEEEeCCCCCCC--HHHHHHHHHhCC----CCeEEEeCCCCCcccc-ccCcHHHHHHHHHHHHHHH
Confidence 345689999999999999998 888877766553 358899998 899876 5667789999999999876
No 102
>PLN02894 hydrolase, alpha/beta fold family protein
Probab=99.46 E-value=5e-12 Score=134.14 Aligned_cols=196 Identities=17% Similarity=0.108 Sum_probs=119.3
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCChH-------HH----HHH-
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLPN-------DS----AEA- 580 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~~-------~d----~~~- 580 (744)
.|+||++||.+... ..| ...+..|+ ++|.|++ .+.+|+|.+... ++ +.+
T Consensus 105 ~p~vvllHG~~~~~-----------~~~----~~~~~~L~-~~~~vi~---~D~rG~G~S~~~~~~~~~~~~~~~~~~~~ 165 (402)
T PLN02894 105 APTLVMVHGYGASQ-----------GFF----FRNFDALA-SRFRVIA---IDQLGWGGSSRPDFTCKSTEETEAWFIDS 165 (402)
T ss_pred CCEEEEECCCCcch-----------hHH----HHHHHHHH-hCCEEEE---ECCCCCCCCCCCCcccccHHHHHHHHHHH
Confidence 58999999975211 011 12334444 4699999 555665544211 11 122
Q ss_pred HHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC------------C-----------c
Q 004574 581 AVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP------------F-----------G 637 (744)
Q Consensus 581 ~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~------------~-----------~ 637 (744)
+.+++... +.+++.|+||||||++++.++.++|++++++|+++|..-..... + .
T Consensus 166 i~~~~~~l---~~~~~~lvGhS~GG~la~~~a~~~p~~v~~lvl~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (402)
T PLN02894 166 FEEWRKAK---NLSNFILLGHSFGGYVAAKYALKHPEHVQHLILVGPAGFSSESDDKSEWLTKFRATWKGAVLNHLWESN 242 (402)
T ss_pred HHHHHHHc---CCCCeEEEEECHHHHHHHHHHHhCchhhcEEEEECCccccCCcchhHHHHhhcchhHHHHHHHHHhhcC
Confidence 22333322 34689999999999999999999999999999887642110000 0 0
Q ss_pred c-c---------------ccc--c----------chhhcHHHH---------------------------HhcCcccccC
Q 004574 638 F-Q---------------TEF--R----------TLWEATNVY---------------------------IEMSPITHAN 662 (744)
Q Consensus 638 ~-~---------------~~~--~----------~~~~~~~~~---------------------------~~~~~~~~~~ 662 (744)
+ + ... . ...+....+ ...+....+.
T Consensus 243 ~~p~~~~~~~gp~~~~l~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 322 (402)
T PLN02894 243 FTPQKIIRGLGPWGPNLVRRYTTARFGAHSTGDILSEEESKLLTDYVYHTLAAKASGELCLKYIFSFGAFARKPLLESAS 322 (402)
T ss_pred CCHHHHHHhccchhHHHHHHHHHHHhhhcccccccCcchhhHHHHHHHHhhcCCCchHHHHHHhccCchhhcchHhhhcc
Confidence 0 0 000 0 000000000 0001112356
Q ss_pred CCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhccCCC
Q 004574 663 KIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCLSNT 737 (744)
Q Consensus 663 ~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~~~~ 737 (744)
++++|+|+++|++|..++ ..+.++.+.+ +.+++++++++++|... .+++..+++.+.+|++..+....
T Consensus 323 ~I~vP~liI~G~~D~i~~---~~~~~~~~~~---~~~~~~~~i~~aGH~~~-~E~P~~f~~~l~~~~~~~~~~~~ 390 (402)
T PLN02894 323 EWKVPTTFIYGRHDWMNY---EGAVEARKRM---KVPCEIIRVPQGGHFVF-LDNPSGFHSAVLYACRKYLSPDR 390 (402)
T ss_pred cCCCCEEEEEeCCCCCCc---HHHHHHHHHc---CCCCcEEEeCCCCCeee-ccCHHHHHHHHHHHHHHhccCCc
Confidence 789999999999998654 4454444333 44578999999999876 67788999999999999887644
No 103
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=99.45 E-value=1.1e-10 Score=113.12 Aligned_cols=292 Identities=13% Similarity=0.089 Sum_probs=172.7
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
.+|++++++. .+|+...+..+.+-...+...|+|++++|..+... +..++---|.+|-+.|+...+-......
T Consensus 16 ~gI~v~~ld~--~~g~l~~~~~v~~~~nptyl~~~~~~~~LY~v~~~-----~~~ggvaay~iD~~~G~Lt~ln~~~~~g 88 (346)
T COG2706 16 QGIYVFNLDT--KTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEP-----GEEGGVAAYRIDPDDGRLTFLNRQTLPG 88 (346)
T ss_pred CceEEEEEeC--cccccchhhhccccCCCceEEECCCCCEEEEEEec-----CCcCcEEEEEEcCCCCeEEEeeccccCC
Confidence 5799999975 56776666644454468889999999887665432 1245556677776668877664333211
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
.. -..++.++||+.|+.+.... +.|-+..+
T Consensus 89 ~~--p~yvsvd~~g~~vf~AnY~~------------------------------------------------g~v~v~p~ 118 (346)
T COG2706 89 SP--PCYVSVDEDGRFVFVANYHS------------------------------------------------GSVSVYPL 118 (346)
T ss_pred CC--CeEEEECCCCCEEEEEEccC------------------------------------------------ceEEEEEc
Confidence 11 13568899999888775431 22333333
Q ss_pred --CCCeeec---CCC-c----------eeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCC
Q 004574 165 --DGTAKDF---GTP-A----------VYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLP 228 (744)
Q Consensus 165 --~G~~~~l---~~~-~----------~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~ 228 (744)
+|....+ ..+ + -.+...++|||++|+...... .++++|+++.+.........
T Consensus 119 ~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~------------Dri~~y~~~dg~L~~~~~~~ 186 (346)
T COG2706 119 QADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGT------------DRIFLYDLDDGKLTPADPAE 186 (346)
T ss_pred ccCCccccceeeeecCCCCCCccccCCccceeeeCCCCCEEEEeecCC------------ceEEEEEcccCccccccccc
Confidence 3422211 111 1 145667999999998875554 37888888755444333222
Q ss_pred CCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeee---------ecc
Q 004574 229 PAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHK---------LDL 299 (744)
Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~---------~~~ 299 (744)
. ....|+|.++|.|+|+. .|.+...++ .-.+|-++- ..++...|.. +..
T Consensus 187 v---------~~G~GPRHi~FHpn~k~-aY~v~EL~s---------tV~v~~y~~---~~g~~~~lQ~i~tlP~dF~g~~ 244 (346)
T COG2706 187 V---------KPGAGPRHIVFHPNGKY-AYLVNELNS---------TVDVLEYNP---AVGKFEELQTIDTLPEDFTGTN 244 (346)
T ss_pred c---------CCCCCcceEEEcCCCcE-EEEEeccCC---------EEEEEEEcC---CCceEEEeeeeccCccccCCCC
Confidence 1 12246899999999995 555543222 112444443 2355444432 133
Q ss_pred ceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcce
Q 004574 300 RFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQI 379 (744)
Q Consensus 300 ~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~ 379 (744)
....+..||||+.|+. +++-.....++.++..++ ....+.....+... |-. |.++++|+.|+...++
T Consensus 245 ~~aaIhis~dGrFLYa-sNRg~dsI~~f~V~~~~g--~L~~~~~~~teg~~--PR~--F~i~~~g~~Liaa~q~------ 311 (346)
T COG2706 245 WAAAIHISPDGRFLYA-SNRGHDSIAVFSVDPDGG--KLELVGITPTEGQF--PRD--FNINPSGRFLIAANQK------ 311 (346)
T ss_pred ceeEEEECCCCCEEEE-ecCCCCeEEEEEEcCCCC--EEEEEEEeccCCcC--Ccc--ceeCCCCCEEEEEccC------
Confidence 4556788999996665 434335567777887763 33233222222111 111 4579999999887643
Q ss_pred EEEEccCCCCCCCCCceEEEEecCCCceeEEee
Q 004574 380 YILLNGRGFTPEGNIPFLDLFDINTGSKERIWE 412 (744)
Q Consensus 380 ~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~ 412 (744)
.+.-.++..|..+|+.+.+..
T Consensus 312 ------------sd~i~vf~~d~~TG~L~~~~~ 332 (346)
T COG2706 312 ------------SDNITVFERDKETGRLTLLGR 332 (346)
T ss_pred ------------CCcEEEEEEcCCCceEEeccc
Confidence 344568889999998876644
No 104
>PRK03592 haloalkane dehalogenase; Provisional
Probab=99.44 E-value=3.8e-12 Score=130.63 Aligned_cols=194 Identities=11% Similarity=0.060 Sum_probs=118.1
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC------hHHHHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL------PNDSAEAAVEEVV 586 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~------~~~d~~~~~~~l~ 586 (744)
.|.||++||.+.. ...| ...+..|++++ .|++ .+.+|+|.+. ..++..+.+..+.
T Consensus 27 g~~vvllHG~~~~--------------~~~w-~~~~~~L~~~~-~via---~D~~G~G~S~~~~~~~~~~~~a~dl~~ll 87 (295)
T PRK03592 27 GDPIVFLHGNPTS--------------SYLW-RNIIPHLAGLG-RCLA---PDLIGMGASDKPDIDYTFADHARYLDAWF 87 (295)
T ss_pred CCEEEEECCCCCC--------------HHHH-HHHHHHHhhCC-EEEE---EcCCCCCCCCCCCCCCCHHHHHHHHHHHH
Confidence 3679999997421 1111 13445666665 8888 5666666553 2223333333333
Q ss_pred HcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCC---CCCc-------cc--c--c---------cc
Q 004574 587 RRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTL---TPFG-------FQ--T--E---------FR 643 (744)
Q Consensus 587 ~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~---~~~~-------~~--~--~---------~~ 643 (744)
+... .+++.++||||||.+|+.++.++|++++++|++++...... .... +. . . ..
T Consensus 88 ~~l~--~~~~~lvGhS~Gg~ia~~~a~~~p~~v~~lil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (295)
T PRK03592 88 DALG--LDDVVLVGHDWGSALGFDWAARHPDRVRGIAFMEAIVRPMTWDDFPPAVRELFQALRSPGEGEEMVLEENVFIE 165 (295)
T ss_pred HHhC--CCCeEEEEECHHHHHHHHHHHhChhheeEEEEECCCCCCcchhhcchhHHHHHHHHhCcccccccccchhhHHh
Confidence 3322 36899999999999999999999999999999887421100 0000 00 0 0 00
Q ss_pred chhh-------cHH---HHH-----------------hcC--------------cccccCCCCCCEEEEeeCCCCCCCCC
Q 004574 644 TLWE-------ATN---VYI-----------------EMS--------------PITHANKIKKPILIIHGEVDDKVGLF 682 (744)
Q Consensus 644 ~~~~-------~~~---~~~-----------------~~~--------------~~~~~~~~~~P~l~i~G~~D~~v~~~ 682 (744)
..+. ..+ .+. ... ....+.++++|+|+++|++|..++
T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~P~lii~G~~D~~~~-- 243 (295)
T PRK03592 166 RVLPGSILRPLSDEEMAVYRRPFPTPESRRPTLSWPRELPIDGEPADVVALVEEYAQWLATSDVPKLLINAEPGAILT-- 243 (295)
T ss_pred hcccCcccccCCHHHHHHHHhhcCCchhhhhhhhhhhhcCCCCcchhhHhhhhHhHHHhccCCCCeEEEeccCCcccC--
Confidence 0000 000 000 000 011245689999999999999985
Q ss_pred HHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhc
Q 004574 683 PMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 683 ~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
.....++...+. .+.++.++++++|... .+.++.+.+.+.+||.+..
T Consensus 244 ~~~~~~~~~~~~---~~~~~~~i~~~gH~~~-~e~p~~v~~~i~~fl~~~~ 290 (295)
T PRK03592 244 TGAIRDWCRSWP---NQLEITVFGAGLHFAQ-EDSPEEIGAAIAAWLRRLR 290 (295)
T ss_pred cHHHHHHHHHhh---hhcceeeccCcchhhh-hcCHHHHHHHHHHHHHHhc
Confidence 455545543322 2458999999999987 5678899999999998653
No 105
>PRK14875 acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional
Probab=99.43 E-value=3.6e-12 Score=135.69 Aligned_cols=188 Identities=18% Similarity=0.111 Sum_probs=116.2
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC------ChHHHHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK------LPNDSAEAAVEEVV 586 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~------~~~~d~~~~~~~l~ 586 (744)
.|.||++||.+... ..| ......+.++|.|+. ++.+|+|.+ ...+++.+.+..+.
T Consensus 131 ~~~vl~~HG~~~~~-----------~~~-----~~~~~~l~~~~~v~~---~d~~g~G~s~~~~~~~~~~~~~~~~~~~~ 191 (371)
T PRK14875 131 GTPVVLIHGFGGDL-----------NNW-----LFNHAALAAGRPVIA---LDLPGHGASSKAVGAGSLDELAAAVLAFL 191 (371)
T ss_pred CCeEEEECCCCCcc-----------chH-----HHHHHHHhcCCEEEE---EcCCCCCCCCCCCCCCCHHHHHHHHHHHH
Confidence 57899999864110 111 112223456699999 455555544 12334555555544
Q ss_pred HcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcc------c---------------ccccch
Q 004574 587 RRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGF------Q---------------TEFRTL 645 (744)
Q Consensus 587 ~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~------~---------------~~~~~~ 645 (744)
+. ++..++.|+|||+||.+++.++.++|++++++|+++|..........+ . ......
T Consensus 192 ~~--~~~~~~~lvG~S~Gg~~a~~~a~~~~~~v~~lv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 269 (371)
T PRK14875 192 DA--LGIERAHLVGHSMGGAVALRLAARAPQRVASLTLIAPAGLGPEINGDYIDGFVAAESRRELKPVLELLFADPALVT 269 (371)
T ss_pred Hh--cCCccEEEEeechHHHHHHHHHHhCchheeEEEEECcCCcCcccchhHHHHhhcccchhHHHHHHHHHhcChhhCC
Confidence 44 345689999999999999999999999999999988753211000000 0 000000
Q ss_pred hhc-------------HHH---HH---------hcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcE
Q 004574 646 WEA-------------TNV---YI---------EMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALS 700 (744)
Q Consensus 646 ~~~-------------~~~---~~---------~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~ 700 (744)
+.. ... +. ..+....+.++++|+|+++|++|..+| ..+++.+. ..+
T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~Pvlii~g~~D~~vp--~~~~~~l~-------~~~ 340 (371)
T PRK14875 270 RQMVEDLLKYKRLDGVDDALRALADALFAGGRQRVDLRDRLASLAIPVLVIWGEQDRIIP--AAHAQGLP-------DGV 340 (371)
T ss_pred HHHHHHHHHHhccccHHHHHHHHHHHhccCcccchhHHHHHhcCCCCEEEEEECCCCccC--HHHHhhcc-------CCC
Confidence 000 000 00 011122456789999999999999998 77665432 246
Q ss_pred EEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 701 RLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 701 ~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
++.++++++|... .+.++.+.+.+.+||++
T Consensus 341 ~~~~~~~~gH~~~-~e~p~~~~~~i~~fl~~ 370 (371)
T PRK14875 341 AVHVLPGAGHMPQ-MEAAADVNRLLAEFLGK 370 (371)
T ss_pred eEEEeCCCCCChh-hhCHHHHHHHHHHHhcc
Confidence 8999999999876 56678888888899864
No 106
>PRK03204 haloalkane dehalogenase; Provisional
Probab=99.43 E-value=1.3e-11 Score=125.41 Aligned_cols=189 Identities=15% Similarity=0.082 Sum_probs=114.7
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC-------hHHHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-------PNDSAEAAVEEV 585 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-------~~~d~~~~~~~l 585 (744)
.|.||++||.+.. .. .+......+.++|.|++ .+.+|+|.+. ..++..+.+..+
T Consensus 34 ~~~iv~lHG~~~~--------------~~--~~~~~~~~l~~~~~vi~---~D~~G~G~S~~~~~~~~~~~~~~~~~~~~ 94 (286)
T PRK03204 34 GPPILLCHGNPTW--------------SF--LYRDIIVALRDRFRCVA---PDYLGFGLSERPSGFGYQIDEHARVIGEF 94 (286)
T ss_pred CCEEEEECCCCcc--------------HH--HHHHHHHHHhCCcEEEE---ECCCCCCCCCCCCccccCHHHHHHHHHHH
Confidence 4679999996411 00 11122233456799999 4555555432 234667777776
Q ss_pred HHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC-----------CCcccc---------cc---
Q 004574 586 VRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT-----------PFGFQT---------EF--- 642 (744)
Q Consensus 586 ~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~-----------~~~~~~---------~~--- 642 (744)
.+.. +.+++.++||||||.+++.++..+|++++++|++++..-.... ...... ..
T Consensus 95 ~~~~--~~~~~~lvG~S~Gg~va~~~a~~~p~~v~~lvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (286)
T PRK03204 95 VDHL--GLDRYLSMGQDWGGPISMAVAVERADRVRGVVLGNTWFWPADTLAMKAFSRVMSSPPVQYAILRRNFFVERLIP 172 (286)
T ss_pred HHHh--CCCCEEEEEECccHHHHHHHHHhChhheeEEEEECccccCCCchhHHHHHHHhccccchhhhhhhhHHHHHhcc
Confidence 6653 3368999999999999999999999999999987664310000 000000 00
Q ss_pred ---cchh--hcHHHHH-----------------hcCc----ccc----cC--CCCCCEEEEeeCCCCCCCCCHH-HHHHH
Q 004574 643 ---RTLW--EATNVYI-----------------EMSP----ITH----AN--KIKKPILIIHGEVDDKVGLFPM-QAERF 689 (744)
Q Consensus 643 ---~~~~--~~~~~~~-----------------~~~~----~~~----~~--~~~~P~l~i~G~~D~~v~~~~~-~~~~~ 689 (744)
.... .....+. .... ... +. .+++|+|+++|++|..++ .. .++++
T Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~PtliI~G~~D~~~~--~~~~~~~~ 250 (286)
T PRK03204 173 AGTEHRPSSAVMAHYRAVQPNAAARRGVAEMPKQILAARPLLARLAREVPATLGTKPTLLVWGMKDVAFR--PKTILPRL 250 (286)
T ss_pred ccccCCCCHHHHHHhcCCCCCHHHHHHHHHHHHhcchhhHHHHHhhhhhhhhcCCCCeEEEecCCCcccC--cHHHHHHH
Confidence 0000 0000000 0000 000 11 127999999999999875 44 34545
Q ss_pred HHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHH
Q 004574 690 FDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 690 ~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl 729 (744)
.+.+. ..++++++++||... .+.++.+.+.+.+||
T Consensus 251 ~~~ip----~~~~~~i~~aGH~~~-~e~Pe~~~~~i~~~~ 285 (286)
T PRK03204 251 RATFP----DHVLVELPNAKHFIQ-EDAPDRIAAAIIERF 285 (286)
T ss_pred HHhcC----CCeEEEcCCCccccc-ccCHHHHHHHHHHhc
Confidence 44443 459999999999987 678889999999997
No 107
>TIGR01249 pro_imino_pep_1 proline iminopeptidase, Neisseria-type subfamily. This model represents one of two related families of proline iminopeptidase in the alpha/beta fold hydrolase family. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members.
Probab=99.43 E-value=5.1e-12 Score=130.00 Aligned_cols=167 Identities=17% Similarity=0.170 Sum_probs=103.5
Q ss_pred HHhCCeEEEecCCCCCCCCCCCC--------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEE
Q 004574 551 FLARRFAVLAGPSIPIIGEGDKL--------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCG 622 (744)
Q Consensus 551 ~~~~G~~v~~~~~~~~~g~g~~~--------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~ 622 (744)
+...+|.|++ .+.+|+|.+. ..+++.+.+..+.+... .+++.++||||||.+++.++.++|++++++
T Consensus 49 ~~~~~~~vi~---~D~~G~G~S~~~~~~~~~~~~~~~~dl~~l~~~l~--~~~~~lvG~S~GG~ia~~~a~~~p~~v~~l 123 (306)
T TIGR01249 49 FDPETYRIVL---FDQRGCGKSTPHACLEENTTWDLVADIEKLREKLG--IKNWLVFGGSWGSTLALAYAQTHPEVVTGL 123 (306)
T ss_pred cCccCCEEEE---ECCCCCCCCCCCCCcccCCHHHHHHHHHHHHHHcC--CCCEEEEEECHHHHHHHHHHHHChHhhhhh
Confidence 4457899999 4555555432 12356666666665433 357999999999999999999999999999
Q ss_pred EEccCCCCCC-------------CCCC-------ccccccc------------------------ch---hhc-------
Q 004574 623 IARSGSYNKT-------------LTPF-------GFQTEFR------------------------TL---WEA------- 648 (744)
Q Consensus 623 v~~~~~~~~~-------------~~~~-------~~~~~~~------------------------~~---~~~------- 648 (744)
|+.++..... ..+. ....... .. |..
T Consensus 124 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (306)
T TIGR01249 124 VLRGIFLLREKEWSWFYEGGASMIYPDAWQRFMDSIPENERNEQLVNAYHDRLQSGDEETKLAAAKAWVDWESTTLLRPI 203 (306)
T ss_pred eeeccccCCHHHHHHHHhcchhhhCHHHHHHHhhhCChhhhhccHHHHHHHHccCCCHHHHHHHHHHHHHHhChhhcCCC
Confidence 8876542110 0000 0000000 00 000
Q ss_pred ---------H---HHHHh-----------cC----cccccCCC-CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcE
Q 004574 649 ---------T---NVYIE-----------MS----PITHANKI-KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALS 700 (744)
Q Consensus 649 ---------~---~~~~~-----------~~----~~~~~~~~-~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~ 700 (744)
+ ..+.. .+ ....+.++ ++|+|++||++|.++| ...++++++.+. ..
T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~P~lii~g~~D~~~p--~~~~~~~~~~~~----~~ 277 (306)
T TIGR01249 204 NEIVSTAEDFKFSLAFARLENHYFVNKGFLDVENFILDNISKIRNIPTYIVHGRYDLCCP--LQSAWALHKAFP----EA 277 (306)
T ss_pred CCccccccchHHHHHHHHHHHhHHHHhchhcCchHHHHhhhhccCCCeEEEecCCCCCCC--HHHHHHHHHhCC----CC
Confidence 0 00000 00 01223456 5899999999999998 888888877653 35
Q ss_pred EEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 701 RLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 701 ~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
+++++++++|..... ...+.+++|+...
T Consensus 278 ~~~~~~~~gH~~~~~----~~~~~i~~~~~~~ 305 (306)
T TIGR01249 278 ELKVTNNAGHSAFDP----NNLAALVHALETY 305 (306)
T ss_pred EEEEECCCCCCCCCh----HHHHHHHHHHHHh
Confidence 899999999987633 3456667777654
No 108
>PLN02578 hydrolase
Probab=99.42 E-value=6.4e-12 Score=131.83 Aligned_cols=165 Identities=20% Similarity=0.234 Sum_probs=103.4
Q ss_pred HhCCeEEEecCCCCCCCCCCCCh------HHH-HHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEE
Q 004574 552 LARRFAVLAGPSIPIIGEGDKLP------NDS-AEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIA 624 (744)
Q Consensus 552 ~~~G~~v~~~~~~~~~g~g~~~~------~~d-~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~ 624 (744)
+..+|.|++ .+.+|+|.+.. .++ ..++++++.+.. .+++.++|||+||++++.+|.++|++++++|+
T Consensus 109 l~~~~~v~~---~D~~G~G~S~~~~~~~~~~~~a~~l~~~i~~~~---~~~~~lvG~S~Gg~ia~~~A~~~p~~v~~lvL 182 (354)
T PLN02578 109 LAKKYKVYA---LDLLGFGWSDKALIEYDAMVWRDQVADFVKEVV---KEPAVLVGNSLGGFTALSTAVGYPELVAGVAL 182 (354)
T ss_pred HhcCCEEEE---ECCCCCCCCCCcccccCHHHHHHHHHHHHHHhc---cCCeEEEEECHHHHHHHHHHHhChHhcceEEE
Confidence 356799999 55556655421 111 223333443332 36899999999999999999999999999998
Q ss_pred ccCCCCCCCCC------------C--c-ccccc----------------cc----------hhhc---------------
Q 004574 625 RSGSYNKTLTP------------F--G-FQTEF----------------RT----------LWEA--------------- 648 (744)
Q Consensus 625 ~~~~~~~~~~~------------~--~-~~~~~----------------~~----------~~~~--------------- 648 (744)
+++........ . . ..... .. .+..
T Consensus 183 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (354)
T PLN02578 183 LNSAGQFGSESREKEEAIVVEETVLTRFVVKPLKEWFQRVVLGFLFWQAKQPSRIESVLKSVYKDKSNVDDYLVESITEP 262 (354)
T ss_pred ECCCccccccccccccccccccchhhHHHhHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHhcCCcccCCHHHHHHHHhc
Confidence 87542110000 0 0 00000 00 0000
Q ss_pred ------HHHHH-----------hcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcc
Q 004574 649 ------TNVYI-----------EMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHV 711 (744)
Q Consensus 649 ------~~~~~-----------~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~ 711 (744)
.+.+. .......++++++|+|+++|++|.+++ ...++++.+.+. +.+++++ +++|+
T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~PvLiI~G~~D~~v~--~~~~~~l~~~~p----~a~l~~i-~~GH~ 335 (354)
T PLN02578 263 AADPNAGEVYYRLMSRFLFNQSRYTLDSLLSKLSCPLLLLWGDLDPWVG--PAKAEKIKAFYP----DTTLVNL-QAGHC 335 (354)
T ss_pred ccCCchHHHHHHHHHHHhcCCCCCCHHHHhhcCCCCEEEEEeCCCCCCC--HHHHHHHHHhCC----CCEEEEe-CCCCC
Confidence 00000 001122356789999999999999988 888777766553 3477777 58999
Q ss_pred cCccccHHHHHHHHHHHHH
Q 004574 712 YAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 712 ~~~~~~~~~~~~~~~~fl~ 730 (744)
++ .+.++++.+.+.+||+
T Consensus 336 ~~-~e~p~~~~~~I~~fl~ 353 (354)
T PLN02578 336 PH-DEVPEQVNKALLEWLS 353 (354)
T ss_pred cc-ccCHHHHHHHHHHHHh
Confidence 87 6778899999999985
No 109
>TIGR01392 homoserO_Ac_trn homoserine O-acetyltransferase. This family describes homoserine-O-acetyltransferase, an enzyme of methionine biosynthesis. This model has been rebuilt to identify sequences more broadly, including a number of sequences suggested to be homoserine O-acetyltransferase based on proximity to other Met biosynthesis genes.
Probab=99.41 E-value=7.4e-12 Score=131.40 Aligned_cols=68 Identities=19% Similarity=0.129 Sum_probs=54.8
Q ss_pred ccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEE-EeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 660 HANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLV-LLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 660 ~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
.++++++|+|+|+|++|..+| ...++++.+.+......++++ ++++++|... .+.++.+.+.+.+||.
T Consensus 283 ~l~~I~~P~Lvi~G~~D~~~p--~~~~~~~a~~i~~~~~~v~~~~i~~~~GH~~~-le~p~~~~~~l~~FL~ 351 (351)
T TIGR01392 283 ALSRIKAPFLVVSITSDWLFP--PAESRELAKALPAAGLRVTYVEIESPYGHDAF-LVETDQVEELIRGFLR 351 (351)
T ss_pred HHhhCCCCEEEEEeCCccccC--HHHHHHHHHHHhhcCCceEEEEeCCCCCcchh-hcCHHHHHHHHHHHhC
Confidence 456789999999999999998 999999999888654444444 4568999877 5678899999999973
No 110
>PLN02872 triacylglycerol lipase
Probab=99.41 E-value=4.9e-12 Score=132.14 Aligned_cols=236 Identities=15% Similarity=0.093 Sum_probs=144.0
Q ss_pred CCceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCC--CCchhHHHHHhCCe
Q 004574 479 SLQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSG--MTPTSSLIFLARRF 556 (744)
Q Consensus 479 ~~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~G~ 556 (744)
.++.|.-.+...||..+...-+ |.........+.|+|+++||.+... ..|.. .....+..|+++||
T Consensus 41 gy~~e~h~v~T~DGy~L~l~ri-~~~~~~~~~~~~~~Vll~HGl~~ss-----------~~w~~~~~~~sla~~La~~Gy 108 (395)
T PLN02872 41 GYSCTEHTIQTKDGYLLALQRV-SSRNPRLGSQRGPPVLLQHGLFMAG-----------DAWFLNSPEQSLGFILADHGF 108 (395)
T ss_pred CCCceEEEEECCCCcEEEEEEc-CCCCCCCCCCCCCeEEEeCcccccc-----------cceeecCcccchHHHHHhCCC
Confidence 4567777787789987766554 4221111122368899999964221 11111 11234456789999
Q ss_pred EEEecCCCCCCC-CC-----C--------C--Ch-HHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC--
Q 004574 557 AVLAGPSIPIIG-EG-----D--------K--LP-NDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH-- 617 (744)
Q Consensus 557 ~v~~~~~~~~~g-~g-----~--------~--~~-~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~-- 617 (744)
.|+.++.++... +| . + +. ..|+.++++++.+.. .+++.++||||||.+++.++ ..|+
T Consensus 109 dV~l~n~RG~~~s~gh~~~~~~~~~fw~~s~~e~a~~Dl~a~id~i~~~~---~~~v~~VGhS~Gg~~~~~~~-~~p~~~ 184 (395)
T PLN02872 109 DVWVGNVRGTRWSYGHVTLSEKDKEFWDWSWQELALYDLAEMIHYVYSIT---NSKIFIVGHSQGTIMSLAAL-TQPNVV 184 (395)
T ss_pred CcccccccccccccCCCCCCccchhccCCcHHHHHHHHHHHHHHHHHhcc---CCceEEEEECHHHHHHHHHh-hChHHH
Confidence 999855443210 11 0 0 11 248999999998653 36899999999999998555 5565
Q ss_pred -ceeEEEEccCCCCCC--------------------------CCCCc--c------ccc------c--------------
Q 004574 618 -LFCCGIARSGSYNKT--------------------------LTPFG--F------QTE------F-------------- 642 (744)
Q Consensus 618 -~~~~~v~~~~~~~~~--------------------------~~~~~--~------~~~------~-------------- 642 (744)
.++++++++|..-.. ..+.. . .+. .
T Consensus 185 ~~v~~~~~l~P~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~C~~~~~c~~~~~~~~g~~~~~n~ 264 (395)
T PLN02872 185 EMVEAAALLCPISYLDHVTAPLVLRMVFMHLDQMVVAMGIHQLNFRSDVLVKLLDSICEGHMDCNDLLTSITGTNCCFNA 264 (395)
T ss_pred HHHHHHHHhcchhhhccCCCHHHHHHHHHhHHHHHHHhcCceecCCcHHHHHHHHHHccCchhHHHHHHHHhCCCcccch
Confidence 566667776642100 00000 0 000 0
Q ss_pred ------------c-----------------------chhhcHHHHHhc-CcccccCCC--CCCEEEEeeCCCCCCCCCHH
Q 004574 643 ------------R-----------------------TLWEATNVYIEM-SPITHANKI--KKPILIIHGEVDDKVGLFPM 684 (744)
Q Consensus 643 ------------~-----------------------~~~~~~~~~~~~-~~~~~~~~~--~~P~l~i~G~~D~~v~~~~~ 684 (744)
. ....+...|... -|...+.++ ++|+++++|++|..++ ..
T Consensus 265 ~~~~~~~~~~pagtS~k~~~H~~Q~~~s~~f~~yDyg~~~n~~~Yg~~~pP~Y~l~~i~~~~Pv~i~~G~~D~lv~--~~ 342 (395)
T PLN02872 265 SRIDYYLEYEPHPSSVKNLRHLFQMIRKGTFAHYDYGIFKNLKLYGQVNPPAFDLSLIPKSLPLWMGYGGTDGLAD--VT 342 (395)
T ss_pred hhhhHHHhcCCCcchHHHHHHHHHHHhcCCcccCCCCchhhHHHhCCCCCCCcCcccCCCCccEEEEEcCCCCCCC--HH
Confidence 0 000011111111 234456777 5899999999999998 88
Q ss_pred HHHHHHHHHHhCCCcEEEEEeCCCCcc--cCccccHHHHHHHHHHHHHHhccC
Q 004574 685 QAERFFDALKGHGALSRLVLLPFEHHV--YAARENVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 685 ~~~~~~~~l~~~~~~~~~~~~~~~~H~--~~~~~~~~~~~~~~~~fl~~~l~~ 735 (744)
.++++.+.+.. ..+++.+++.+|. +...+.++.+++.+++||+++...
T Consensus 343 dv~~l~~~Lp~---~~~l~~l~~~gH~dfi~~~eape~V~~~Il~fL~~~~~~ 392 (395)
T PLN02872 343 DVEHTLAELPS---KPELLYLENYGHIDFLLSTSAKEDVYNHMIQFFRSLGKS 392 (395)
T ss_pred HHHHHHHHCCC---ccEEEEcCCCCCHHHHhCcchHHHHHHHHHHHHHHhhhc
Confidence 88888776643 2478889999997 444677888999999999976543
No 111
>PRK00175 metX homoserine O-acetyltransferase; Provisional
Probab=99.41 E-value=1.3e-11 Score=130.53 Aligned_cols=70 Identities=17% Similarity=0.067 Sum_probs=60.6
Q ss_pred cCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeC-CCCcccCccccHHHHHHHHHHHHHHhc
Q 004574 661 ANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLP-FEHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 661 ~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
+.+|++|+|+|+|++|..+| ++.++++.+.+...+..+++++++ ++||... .+.++++.+.+.+||.+.-
T Consensus 305 l~~I~~PtLvI~G~~D~~~p--~~~~~~la~~i~~a~~~~~l~~i~~~~GH~~~-le~p~~~~~~L~~FL~~~~ 375 (379)
T PRK00175 305 LARIKARFLVVSFTSDWLFP--PARSREIVDALLAAGADVSYAEIDSPYGHDAF-LLDDPRYGRLVRAFLERAA 375 (379)
T ss_pred HhcCCCCEEEEEECCccccC--HHHHHHHHHHHHhcCCCeEEEEeCCCCCchhH-hcCHHHHHHHHHHHHHhhh
Confidence 46789999999999999998 999999999998877777888775 8999876 5667899999999998753
No 112
>TIGR01836 PHA_synth_III_C poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit. This model represents the PhaC subunit of a heterodimeric form of polyhydroxyalkanoic acid (PHA) synthase. Excepting the PhaC of Bacillus megaterium (which needs PhaR), all members require PhaE (TIGR01834) for activity and are designated class III. This enzyme builds ester polymers for carbon and energy storage that accumulate in inclusions, and both this enzyme and the depolymerase associate with the inclusions. Class III enzymes polymerize short-chain-length hydroxyalkanoates.
Probab=99.39 E-value=2e-11 Score=127.92 Aligned_cols=176 Identities=17% Similarity=0.107 Sum_probs=116.4
Q ss_pred hhHHHHHhCCeEEEecCCCCCCCCCCC-------Ch-HHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC
Q 004574 546 TSSLIFLARRFAVLAGPSIPIIGEGDK-------LP-NDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH 617 (744)
Q Consensus 546 ~~~~~~~~~G~~v~~~~~~~~~g~g~~-------~~-~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~ 617 (744)
..+..|+++||.|+..+ ..+.|.+ .. .+++.+++++++++... +++.++||||||.+++.+++.+|+
T Consensus 85 ~~~~~L~~~G~~V~~~D---~~g~g~s~~~~~~~d~~~~~~~~~v~~l~~~~~~--~~i~lvGhS~GG~i~~~~~~~~~~ 159 (350)
T TIGR01836 85 SLVRGLLERGQDVYLID---WGYPDRADRYLTLDDYINGYIDKCVDYICRTSKL--DQISLLGICQGGTFSLCYAALYPD 159 (350)
T ss_pred hHHHHHHHCCCeEEEEe---CCCCCHHHhcCCHHHHHHHHHHHHHHHHHHHhCC--CcccEEEECHHHHHHHHHHHhCch
Confidence 56778899999999943 3333322 11 12577889999887433 689999999999999999999999
Q ss_pred ceeEEEEccCCCCCCCCCCc-------------------ccc-----------cccchh----------hcHH---HHH-
Q 004574 618 LFCCGIARSGSYNKTLTPFG-------------------FQT-----------EFRTLW----------EATN---VYI- 653 (744)
Q Consensus 618 ~~~~~v~~~~~~~~~~~~~~-------------------~~~-----------~~~~~~----------~~~~---~~~- 653 (744)
.++++|+++++++....... .+. .....| .+.+ .+.
T Consensus 160 ~v~~lv~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~f~~l~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (350)
T TIGR01836 160 KIKNLVTMVTPVDFETPGNMLSNWARHVDIDLAVDTMGNIPGELLNLTFLMLKPFSLGYQKYVNLVDILEDERKVENFLR 239 (350)
T ss_pred heeeEEEeccccccCCCCchhhhhccccCHHHHHHhcCCCCHHHHHHHHHhcCcchhhhHHHHHHHHhcCChHHHHHHHH
Confidence 99999999987653211000 000 000000 0000 000
Q ss_pred --h--cC--c-----------------------------ccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCC
Q 004574 654 --E--MS--P-----------------------------ITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGA 698 (744)
Q Consensus 654 --~--~~--~-----------------------------~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~ 698 (744)
. .+ + ...+.++++|+|+++|++|.+++ +..++++++.+.. .
T Consensus 240 ~~~w~~d~~~~~~~~~~~~~~~~~~~n~l~~g~~~~~~~~~~l~~i~~Pvliv~G~~D~i~~--~~~~~~~~~~~~~--~ 315 (350)
T TIGR01836 240 MEKWIFDSPDQAGEAFRQFVKDFYQQNGLINGEVEIGGRKVDLKNIKMPILNIYAERDHLVP--PDASKALNDLVSS--E 315 (350)
T ss_pred HHHHhcCCcCccHHHHHHHHHHHHhcCcccCCeeEECCEEccHHhCCCCeEEEecCCCCcCC--HHHHHHHHHHcCC--C
Confidence 0 00 0 11245679999999999999998 8888888877653 4
Q ss_pred cEEEEEeCCCCcccCc--cccHHHHHHHHHHHHHH
Q 004574 699 LSRLVLLPFEHHVYAA--RENVMHVIWETDRWLQK 731 (744)
Q Consensus 699 ~~~~~~~~~~~H~~~~--~~~~~~~~~~~~~fl~~ 731 (744)
.+++++++ ++|.... ......++..+.+||.+
T Consensus 316 ~~~~~~~~-~gH~~~~~~~~~~~~v~~~i~~wl~~ 349 (350)
T TIGR01836 316 DYTELSFP-GGHIGIYVSGKAQKEVPPAIGKWLQA 349 (350)
T ss_pred CeEEEEcC-CCCEEEEECchhHhhhhHHHHHHHHh
Confidence 56888888 4676432 23357889999999975
No 113
>PLN03084 alpha/beta hydrolase fold protein; Provisional
Probab=99.39 E-value=2.6e-11 Score=126.57 Aligned_cols=189 Identities=13% Similarity=0.100 Sum_probs=117.1
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC----------hHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL----------PNDSAEAAV 582 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~----------~~~d~~~~~ 582 (744)
.|.||++||.+.. ...|. .++..| ..+|.|++ .+.+|+|.+. ..+++.+.+
T Consensus 127 ~~~ivllHG~~~~--------------~~~w~-~~~~~L-~~~~~Via---~DlpG~G~S~~p~~~~~~~ys~~~~a~~l 187 (383)
T PLN03084 127 NPPVLLIHGFPSQ--------------AYSYR-KVLPVL-SKNYHAIA---FDWLGFGFSDKPQPGYGFNYTLDEYVSSL 187 (383)
T ss_pred CCeEEEECCCCCC--------------HHHHH-HHHHHH-hcCCEEEE---ECCCCCCCCCCCcccccccCCHHHHHHHH
Confidence 4789999997521 11111 223344 56899999 5556665432 223344444
Q ss_pred HHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCC--CCCc-----------------cc----
Q 004574 583 EEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTL--TPFG-----------------FQ---- 639 (744)
Q Consensus 583 ~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~--~~~~-----------------~~---- 639 (744)
..+.+.-. .+++.|+|||+||.+++.++.++|++++++|++++...... .+.. ..
T Consensus 188 ~~~i~~l~--~~~~~LvG~s~GG~ia~~~a~~~P~~v~~lILi~~~~~~~~~~~p~~l~~~~~~l~~~~~~~~~~~~~~~ 265 (383)
T PLN03084 188 ESLIDELK--SDKVSLVVQGYFSPPVVKYASAHPDKIKKLILLNPPLTKEHAKLPSTLSEFSNFLLGEIFSQDPLRASDK 265 (383)
T ss_pred HHHHHHhC--CCCceEEEECHHHHHHHHHHHhChHhhcEEEEECCCCccccccchHHHHHHHHHHhhhhhhcchHHHHhh
Confidence 44443322 35899999999999999999999999999999998643110 0000 00
Q ss_pred ---c--------cc----cchhhc--------HHHHHhcC-cc-------c---ccCCCCCCEEEEeeCCCCCCCCCHHH
Q 004574 640 ---T--------EF----RTLWEA--------TNVYIEMS-PI-------T---HANKIKKPILIIHGEVDDKVGLFPMQ 685 (744)
Q Consensus 640 ---~--------~~----~~~~~~--------~~~~~~~~-~~-------~---~~~~~~~P~l~i~G~~D~~v~~~~~~ 685 (744)
. +. ..++.. ...+..+. .. . ...++++|+|+++|+.|..++ .+.
T Consensus 266 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~l~~~~r~~~~~l~~~~~~l~~~l~~~~i~vPvLiI~G~~D~~v~--~~~ 343 (383)
T PLN03084 266 ALTSCGPYAMKEDDAMVYRRPYLTSGSSGFALNAISRSMKKELKKYIEEMRSILTDKNWKTPITVCWGLRDRWLN--YDG 343 (383)
T ss_pred hhcccCccCCCHHHHHHHhccccCCcchHHHHHHHHHHhhcccchhhHHHHhhhccccCCCCEEEEeeCCCCCcC--HHH
Confidence 0 00 000000 00000000 00 0 013578999999999999988 776
Q ss_pred HHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 686 AERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 686 ~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
++++.+.. +.++.++++++|... .+.++.+.+.+.+||.
T Consensus 344 ~~~~a~~~-----~a~l~vIp~aGH~~~-~E~Pe~v~~~I~~Fl~ 382 (383)
T PLN03084 344 VEDFCKSS-----QHKLIELPMAGHHVQ-EDCGEELGGIISGILS 382 (383)
T ss_pred HHHHHHhc-----CCeEEEECCCCCCcc-hhCHHHHHHHHHHHhh
Confidence 66665532 458999999999887 6788999999999985
No 114
>PF12697 Abhydrolase_6: Alpha/beta hydrolase family; PDB: 3LLC_A 3A2N_E 3A2M_A 3A2L_A 3AFI_F 3C5V_A 3C5W_P 3E0X_A 2ZJF_A 3QYJ_A ....
Probab=99.39 E-value=1.1e-12 Score=128.62 Aligned_cols=179 Identities=23% Similarity=0.245 Sum_probs=109.3
Q ss_pred EEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCCh--------HHHHHHHHHHHHH
Q 004574 516 LFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLP--------NDSAEAAVEEVVR 587 (744)
Q Consensus 516 vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~--------~~d~~~~~~~l~~ 587 (744)
||++||.+... ......+..| ++||.|++ .+.+|+|.+.. .++..+.+..+.+
T Consensus 1 vv~~hG~~~~~---------------~~~~~~~~~l-~~~~~v~~---~d~~G~G~s~~~~~~~~~~~~~~~~~l~~~l~ 61 (228)
T PF12697_consen 1 VVFLHGFGGSS---------------ESWDPLAEAL-ARGYRVIA---FDLPGHGRSDPPPDYSPYSIEDYAEDLAELLD 61 (228)
T ss_dssp EEEE-STTTTG---------------GGGHHHHHHH-HTTSEEEE---EECTTSTTSSSHSSGSGGSHHHHHHHHHHHHH
T ss_pred eEEECCCCCCH---------------HHHHHHHHHH-hCCCEEEE---EecCCccccccccccCCcchhhhhhhhhhccc
Confidence 78999975221 1112344555 58999999 55566555422 1233333333333
Q ss_pred cCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC--C---c-ccc--------------cccchhh
Q 004574 588 RGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP--F---G-FQT--------------EFRTLWE 647 (744)
Q Consensus 588 ~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~--~---~-~~~--------------~~~~~~~ 647 (744)
... .+++.++|||+||.+++.++.++|++++++|+++|........ . . ... .....+.
T Consensus 62 ~~~--~~~~~lvG~S~Gg~~a~~~a~~~p~~v~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (228)
T PF12697_consen 62 ALG--IKKVILVGHSMGGMIALRLAARYPDRVKGLVLLSPPPPLPDSPSRSFGPSFIRRLLAWRSRSLRRLASRFFYRWF 139 (228)
T ss_dssp HTT--TSSEEEEEETHHHHHHHHHHHHSGGGEEEEEEESESSSHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccc--cccccccccccccccccccccccccccccceeecccccccccccccccchhhhhhhhcccccccccccccccccc
Confidence 322 2689999999999999999999999999999999986421100 0 0 000 0000000
Q ss_pred cH---HH----------------HHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCC
Q 004574 648 AT---NV----------------YIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFE 708 (744)
Q Consensus 648 ~~---~~----------------~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 708 (744)
.. .. .........++++++|+++++|++|..++ ....+++.+.+ .++++++++++
T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvl~i~g~~D~~~~--~~~~~~~~~~~----~~~~~~~~~~~ 213 (228)
T PF12697_consen 140 DGDEPEDLIRSSRRALAEYLRSNLWQADLSEALPRIKVPVLVIHGEDDPIVP--PESAEELADKL----PNAELVVIPGA 213 (228)
T ss_dssp THHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHGSSSEEEEEEETTSSSSH--HHHHHHHHHHS----TTEEEEEETTS
T ss_pred ccccccccccccccccccccccccccccccccccccCCCeEEeecCCCCCCC--HHHHHHHHHHC----CCCEEEEECCC
Confidence 00 00 01122334567789999999999999987 66666665443 45799999999
Q ss_pred CcccCccccHHHHH
Q 004574 709 HHVYAARENVMHVI 722 (744)
Q Consensus 709 ~H~~~~~~~~~~~~ 722 (744)
+|... .++++++.
T Consensus 214 gH~~~-~~~p~~~~ 226 (228)
T PF12697_consen 214 GHFLF-LEQPDEVA 226 (228)
T ss_dssp SSTHH-HHSHHHHH
T ss_pred CCccH-HHCHHHHh
Confidence 99876 44554443
No 115
>KOG1454 consensus Predicted hydrolase/acyltransferase (alpha/beta hydrolase superfamily) [General function prediction only]
Probab=99.38 E-value=1.3e-11 Score=125.76 Aligned_cols=195 Identities=19% Similarity=0.147 Sum_probs=117.0
Q ss_pred CceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhC-CeEEEecCCCCCCCCCCCC------hHHHHHHHHHH
Q 004574 512 PLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLAR-RFAVLAGPSIPIIGEGDKL------PNDSAEAAVEE 584 (744)
Q Consensus 512 ~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-G~~v~~~~~~~~~g~g~~~------~~~d~~~~~~~ 584 (744)
..|.||++||.+... ..|. ..+..+.+. |+.|++ .+..|+|... .+ ++...++.
T Consensus 57 ~~~pvlllHGF~~~~-----------~~w~----~~~~~L~~~~~~~v~a---iDl~G~g~~s~~~~~~~y-~~~~~v~~ 117 (326)
T KOG1454|consen 57 DKPPVLLLHGFGASS-----------FSWR----RVVPLLSKAKGLRVLA---IDLPGHGYSSPLPRGPLY-TLRELVEL 117 (326)
T ss_pred CCCcEEEeccccCCc-----------ccHh----hhccccccccceEEEE---EecCCCCcCCCCCCCCce-ehhHHHHH
Confidence 478899999975211 1111 122223332 688888 5555555221 11 23333333
Q ss_pred HHHc-CCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEE---EccCCCCCCCCC--------------------Cc--c
Q 004574 585 VVRR-GVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGI---ARSGSYNKTLTP--------------------FG--F 638 (744)
Q Consensus 585 l~~~-~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v---~~~~~~~~~~~~--------------------~~--~ 638 (744)
+.+. ...-.+++.++|||+||.+|+.+|+.+|+.++.+| .+.+.+...... .. .
T Consensus 118 i~~~~~~~~~~~~~lvghS~Gg~va~~~Aa~~P~~V~~lv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 197 (326)
T KOG1454|consen 118 IRRFVKEVFVEPVSLVGHSLGGIVALKAAAYYPETVDSLVLLDLLGPPVYSTPKGIKGLRRLLDKFLSALELLIPLSLTE 197 (326)
T ss_pred HHHHHHhhcCcceEEEEeCcHHHHHHHHHHhCcccccceeeecccccccccCCcchhHHHHhhhhhccHhhhcCcccccc
Confidence 3332 01112469999999999999999999999999999 554432100000 00 0
Q ss_pred c-----c-----------cccchhhcH--------------HHHH---------hcCcccccCCCC-CCEEEEeeCCCCC
Q 004574 639 Q-----T-----------EFRTLWEAT--------------NVYI---------EMSPITHANKIK-KPILIIHGEVDDK 678 (744)
Q Consensus 639 ~-----~-----------~~~~~~~~~--------------~~~~---------~~~~~~~~~~~~-~P~l~i~G~~D~~ 678 (744)
. . .....++.. ..+. +......++++. +|+|+++|+.|++
T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~pvlii~G~~D~~ 277 (326)
T KOG1454|consen 198 PVRLVSEGLLRCLKVVYTDPSRLLEKLLHLLSRPVKEHFHRDARLSLFLELLGFDENLLSLIKKIWKCPVLIIWGDKDQI 277 (326)
T ss_pred chhheeHhhhcceeeeccccccchhhhhhheecccccchhhhheeeEEEeccCccchHHHhhccccCCceEEEEcCcCCc
Confidence 0 0 000000000 0000 011223445665 9999999999999
Q ss_pred CCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 679 VGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 679 v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
+| .+.+..+.+.+ .++++.+++++||..+ .+.++.+++.+..|+.++
T Consensus 278 ~p--~~~~~~~~~~~----pn~~~~~I~~~gH~~h-~e~Pe~~~~~i~~Fi~~~ 324 (326)
T KOG1454|consen 278 VP--LELAEELKKKL----PNAELVEIPGAGHLPH-LERPEEVAALLRSFIARL 324 (326)
T ss_pred cC--HHHHHHHHhhC----CCceEEEeCCCCcccc-cCCHHHHHHHHHHHHHHh
Confidence 99 77666555443 5679999999999988 488999999999999875
No 116
>PF12740 Chlorophyllase2: Chlorophyllase enzyme; InterPro: IPR010821 This family consists of several chlorophyllase proteins (3.1.1.14 from EC). Chlorophyllase (Chlase) is the first enzyme involved in chlorophyll degradation and catalyses the hydrolysis of the ester bond to yield chlorophyllide and phytol [, , ].; GO: 0047746 chlorophyllase activity, 0015996 chlorophyll catabolic process
Probab=99.38 E-value=2.7e-11 Score=115.71 Aligned_cols=179 Identities=18% Similarity=0.164 Sum_probs=118.6
Q ss_pred EEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCChH
Q 004574 496 TATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLPN 575 (744)
Q Consensus 496 ~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~~ 575 (744)
+..++.|.. .+.+|+|||+||.. ....++...+..++++||+|+.+..+...+.......
T Consensus 5 ~l~v~~P~~-----~g~yPVv~f~~G~~---------------~~~s~Ys~ll~hvAShGyIVV~~d~~~~~~~~~~~~~ 64 (259)
T PF12740_consen 5 PLLVYYPSS-----AGTYPVVLFLHGFL---------------LINSWYSQLLEHVASHGYIVVAPDLYSIGGPDDTDEV 64 (259)
T ss_pred CeEEEecCC-----CCCcCEEEEeCCcC---------------CCHHHHHHHHHHHHhCceEEEEecccccCCCCcchhH
Confidence 346788876 56799999999963 1111234667789999999999665454444444455
Q ss_pred HHHHHHHHHHHHc--C------CCCCCcEEEEEechHHHHHHHHHHhC-----CCceeEEEEccCCCCCCCCCCcccccc
Q 004574 576 DSAEAAVEEVVRR--G------VADPSRIAVGGHSYGAFMTAHLLAHA-----PHLFCCGIARSGSYNKTLTPFGFQTEF 642 (744)
Q Consensus 576 ~d~~~~~~~l~~~--~------~~d~~~i~l~G~S~GG~~a~~~~~~~-----p~~~~~~v~~~~~~~~~~~~~~~~~~~ 642 (744)
+++.+.++|+.+. . .+|-.||+|+|||.||-+|..++..+ +.+|+++|++.|+-...... +..
T Consensus 65 ~~~~~vi~Wl~~~L~~~l~~~v~~D~s~l~l~GHSrGGk~Af~~al~~~~~~~~~~~~ali~lDPVdG~~~~~---~~~- 140 (259)
T PF12740_consen 65 ASAAEVIDWLAKGLESKLPLGVKPDFSKLALAGHSRGGKVAFAMALGNASSSLDLRFSALILLDPVDGMSKGS---QTE- 140 (259)
T ss_pred HHHHHHHHHHHhcchhhccccccccccceEEeeeCCCCHHHHHHHhhhcccccccceeEEEEecccccccccc---CCC-
Confidence 6889999998773 1 25788999999999999999998886 45899999999974211100 000
Q ss_pred cchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCC---------CCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcc
Q 004574 643 RTLWEATNVYIEMSPITHANKIKKPILIIHGEVDD---------KVGLFPMQAERFFDALKGHGALSRLVLLPFEHHV 711 (744)
Q Consensus 643 ~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~---------~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~ 711 (744)
+.. ..+. ...-+..+|+|++-..-.. .+| .-.+-.++|.++ ..+.-+++.++.||+
T Consensus 141 ------P~v-~~~~--p~s~~~~~P~lviGtGLg~~~~~~~~~~CaP-~g~n~~~Ff~~~---~~p~~~~v~~~~GH~ 205 (259)
T PF12740_consen 141 ------PPV-LTYT--PQSFDFSMPALVIGTGLGGEPRNPLFPPCAP-AGVNYREFFDEC---KPPSWHFVAKDYGHM 205 (259)
T ss_pred ------Ccc-ccCc--ccccCCCCCeEEEecccCcccccccCCCCCC-CCCCHHHHHHhc---CCCEEEEEeCCCCch
Confidence 000 0000 0111245999988666553 222 123466777766 356778888999996
No 117
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.37 E-value=1.8e-10 Score=105.03 Aligned_cols=273 Identities=13% Similarity=0.126 Sum_probs=176.4
Q ss_pred CCccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCC
Q 004574 2 PFFTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESP 81 (744)
Q Consensus 2 ~~~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~ 81 (744)
+|-..|..+.+.. |.=.+--..+++ .+....++||++.||. .+..+|.++|+.++.+.++..-+
T Consensus 17 ~YDhTIRfWqa~t----G~C~rTiqh~ds-qVNrLeiTpdk~~LAa-----------a~~qhvRlyD~~S~np~Pv~t~e 80 (311)
T KOG0315|consen 17 GYDHTIRFWQALT----GICSRTIQHPDS-QVNRLEITPDKKDLAA-----------AGNQHVRLYDLNSNNPNPVATFE 80 (311)
T ss_pred cCcceeeeeehhc----CeEEEEEecCcc-ceeeEEEcCCcchhhh-----------ccCCeeEEEEccCCCCCceeEEe
Confidence 3556677777755 766655556666 5889999999999999 45588999999999887764333
Q ss_pred CccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEE
Q 004574 82 DICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVL 161 (744)
Q Consensus 82 ~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 161 (744)
... ..+..+.|--||++++-.+.+ +.+-+
T Consensus 81 ~h~--kNVtaVgF~~dgrWMyTgseD-------------------------------------------------gt~kI 109 (311)
T KOG0315|consen 81 GHT--KNVTAVGFQCDGRWMYTGSED-------------------------------------------------GTVKI 109 (311)
T ss_pred ccC--CceEEEEEeecCeEEEecCCC-------------------------------------------------ceEEE
Confidence 211 136678899999999865322 23444
Q ss_pred EcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCee-eeccCCCCCCCCCcccC
Q 004574 162 GSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV-RELCDLPPAEDIPVCYN 238 (744)
Q Consensus 162 ~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~-~~l~~~~~~~~~~~~~~ 238 (744)
+|+ +-..+++... ..+..+...|....|+..... ..|++||+..... .++.....
T Consensus 110 WdlR~~~~qR~~~~~spVn~vvlhpnQteLis~dqs-------------g~irvWDl~~~~c~~~liPe~~--------- 167 (311)
T KOG0315|consen 110 WDLRSLSCQRNYQHNSPVNTVVLHPNQTELISGDQS-------------GNIRVWDLGENSCTHELIPEDD--------- 167 (311)
T ss_pred EeccCcccchhccCCCCcceEEecCCcceEEeecCC-------------CcEEEEEccCCccccccCCCCC---------
Confidence 455 3344555444 566677888888887655332 4899999876632 22222221
Q ss_pred CccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC----ceEee---eeccceeceeeccCCc
Q 004574 239 SVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK----PEILH---KLDLRFRSVSWCDDSL 311 (744)
Q Consensus 239 ~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~----~~~l~---~~~~~~~~~~~SpDg~ 311 (744)
..++.++..|||+. |+-+. .++.+|++++ .+.. .+.++ -..+.+-...+|||++
T Consensus 168 ---~~i~sl~v~~dgsm-l~a~n------------nkG~cyvW~l---~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k 228 (311)
T KOG0315|consen 168 ---TSIQSLTVMPDGSM-LAAAN------------NKGNCYVWRL---LNHQTASELEPVHKFQAHNGHILRCLLSPDVK 228 (311)
T ss_pred ---cceeeEEEcCCCcE-EEEec------------CCccEEEEEc---cCCCccccceEhhheecccceEEEEEECCCCc
Confidence 11678899999997 54442 3456888877 2222 12222 2355677788999999
Q ss_pred eEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCC
Q 004574 312 ALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPE 391 (744)
Q Consensus 312 ~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~ 391 (744)
+|+..+.+ ..+++++.++- ......+++...++.+ -+||.||++|+..+.+
T Consensus 229 ~lat~ssd----ktv~iwn~~~~-~kle~~l~gh~rWvWd------c~FS~dg~YlvTassd------------------ 279 (311)
T KOG0315|consen 229 YLATCSSD----KTVKIWNTDDF-FKLELVLTGHQRWVWD------CAFSADGEYLVTASSD------------------ 279 (311)
T ss_pred EEEeecCC----ceEEEEecCCc-eeeEEEeecCCceEEe------eeeccCccEEEecCCC------------------
Confidence 99987733 34666666552 2333444555444433 4489999999887521
Q ss_pred CCCceEEEEecCCCceeEEeecc
Q 004574 392 GNIPFLDLFDINTGSKERIWESN 414 (744)
Q Consensus 392 ~~~~~l~~~d~~~g~~~~l~~~~ 414 (744)
....+||+..++..+.+...
T Consensus 280 ---~~~rlW~~~~~k~v~qy~gh 299 (311)
T KOG0315|consen 280 ---HTARLWDLSAGKEVRQYQGH 299 (311)
T ss_pred ---CceeecccccCceeeecCCc
Confidence 23667888888887776654
No 118
>PRK08775 homoserine O-acetyltransferase; Provisional
Probab=99.37 E-value=1.1e-11 Score=129.69 Aligned_cols=172 Identities=18% Similarity=0.170 Sum_probs=107.1
Q ss_pred HHhCCeEEEecCCCCCCCCCCCC----hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEcc
Q 004574 551 FLARRFAVLAGPSIPIIGEGDKL----PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARS 626 (744)
Q Consensus 551 ~~~~G~~v~~~~~~~~~g~g~~~----~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~ 626 (744)
|...+|.|++ .+.+|+|.+. ..++..+.+..+.+...++ +.+.++||||||++|+.++.++|++++++|+++
T Consensus 95 L~~~~~~Vi~---~Dl~G~g~s~~~~~~~~~~a~dl~~ll~~l~l~-~~~~lvG~SmGG~vA~~~A~~~P~~V~~LvLi~ 170 (343)
T PRK08775 95 LDPARFRLLA---FDFIGADGSLDVPIDTADQADAIALLLDALGIA-RLHAFVGYSYGALVGLQFASRHPARVRTLVVVS 170 (343)
T ss_pred cCccccEEEE---EeCCCCCCCCCCCCCHHHHHHHHHHHHHHcCCC-cceEEEEECHHHHHHHHHHHHChHhhheEEEEC
Confidence 4356899999 4444554331 1223344443444432332 235799999999999999999999999999998
Q ss_pred CCCCCCCCC--Cc------------------------------ccc-c-ccchhh------------cHH---------H
Q 004574 627 GSYNKTLTP--FG------------------------------FQT-E-FRTLWE------------ATN---------V 651 (744)
Q Consensus 627 ~~~~~~~~~--~~------------------------------~~~-~-~~~~~~------------~~~---------~ 651 (744)
+........ +. +.. . ....+. ... .
T Consensus 171 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 250 (343)
T PRK08775 171 GAHRAHPYAAAWRALQRRAVALGQLQCAEKHGLALARQLAMLSYRTPEEFEERFDAPPEVINGRVRVAAEDYLDAAGAQY 250 (343)
T ss_pred ccccCCHHHHHHHHHHHHHHHcCCCCCCchhHHHHHHHHHHHHcCCHHHHHHHhCCCccccCCCccchHHHHHHHHHHHH
Confidence 753210000 00 000 0 000000 000 0
Q ss_pred HHhcC-------------cccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCC-CCcccCcccc
Q 004574 652 YIEMS-------------PITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPF-EHHVYAAREN 717 (744)
Q Consensus 652 ~~~~~-------------~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~~H~~~~~~~ 717 (744)
....+ ....+.++++|+|+++|++|..+| ...++++.+.+. ...++.++++ +||... .+.
T Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~l~~I~~PtLvi~G~~D~~~p--~~~~~~~~~~i~---p~a~l~~i~~~aGH~~~-lE~ 324 (343)
T PRK08775 251 VARTPVNAYLRLSESIDLHRVDPEAIRVPTVVVAVEGDRLVP--LADLVELAEGLG---PRGSLRVLRSPYGHDAF-LKE 324 (343)
T ss_pred HHhcChhHHHHHHHHHhhcCCChhcCCCCeEEEEeCCCEeeC--HHHHHHHHHHcC---CCCeEEEEeCCccHHHH-hcC
Confidence 00000 011246789999999999999998 888877776553 2458999985 999877 677
Q ss_pred HHHHHHHHHHHHHHh
Q 004574 718 VMHVIWETDRWLQKY 732 (744)
Q Consensus 718 ~~~~~~~~~~fl~~~ 732 (744)
++.+.+.+.+||.+.
T Consensus 325 Pe~~~~~l~~FL~~~ 339 (343)
T PRK08775 325 TDRIDAILTTALRST 339 (343)
T ss_pred HHHHHHHHHHHHHhc
Confidence 889999999999764
No 119
>TIGR03101 hydr2_PEP hydrolase, ortholog 2, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily.
Probab=99.36 E-value=1.3e-10 Score=114.43 Aligned_cols=199 Identities=15% Similarity=0.092 Sum_probs=122.1
Q ss_pred EEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCC
Q 004574 485 IKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSI 564 (744)
Q Consensus 485 i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~ 564 (744)
+.+....| .+.+.++.|.+ ..++|+||++||.+... +.+......++..|+++||.|+. +
T Consensus 3 ~~l~~~~g-~~~~~~~~p~~-----~~~~~~VlllHG~g~~~-----------~~~~~~~~~la~~La~~Gy~Vl~---~ 62 (266)
T TIGR03101 3 FFLDAPHG-FRFCLYHPPVA-----VGPRGVVIYLPPFAEEM-----------NKSRRMVALQARAFAAGGFGVLQ---I 62 (266)
T ss_pred EEecCCCC-cEEEEEecCCC-----CCCceEEEEECCCcccc-----------cchhHHHHHHHHHHHHCCCEEEE---E
Confidence 45555555 47777777765 23479999999964210 11111111345678889999999 4
Q ss_pred CCCCCCCC----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC
Q 004574 565 PIIGEGDK----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT 634 (744)
Q Consensus 565 ~~~g~g~~----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~ 634 (744)
+.+|+|.+ ...+|+..+++++.+++ ..+|+|+||||||.+++.++.++|+.++++|+++|+......
T Consensus 63 Dl~G~G~S~g~~~~~~~~~~~~Dv~~ai~~L~~~~---~~~v~LvG~SmGG~vAl~~A~~~p~~v~~lVL~~P~~~g~~~ 139 (266)
T TIGR03101 63 DLYGCGDSAGDFAAARWDVWKEDVAAAYRWLIEQG---HPPVTLWGLRLGALLALDAANPLAAKCNRLVLWQPVVSGKQQ 139 (266)
T ss_pred CCCCCCCCCCccccCCHHHHHHHHHHHHHHHHhcC---CCCEEEEEECHHHHHHHHHHHhCccccceEEEeccccchHHH
Confidence 55555443 12358888999998763 368999999999999999999999999999999997541110
Q ss_pred CCc-------------cccc----c------c---------chhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCC
Q 004574 635 PFG-------------FQTE----F------R---------TLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLF 682 (744)
Q Consensus 635 ~~~-------------~~~~----~------~---------~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~ 682 (744)
... .... . . ..-.....+.+.+....... ..++|++.-+.+.--+ .
T Consensus 140 l~~~lrl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~l~~~~l~~~~~~-~~~~~~~~~~~~~~~~-~ 217 (266)
T TIGR03101 140 LQQFLRLRLVARRLGGESAEASNSLRERLLAGEDVEIAGYELAPALASDLDQRQLAPAVPK-NCPVHWFEVRPEEGAT-L 217 (266)
T ss_pred HHHHHHHHHHHHhccccccccchhHHhhccCCCeEEEeceecCHHHHHHHHhcccCCCCCC-CCceEEEEeccccCCC-C
Confidence 000 0000 0 0 00000111111211111111 5568888764332211 1
Q ss_pred HHHHHHHHHHHHhCCCcEEEEEeCCC
Q 004574 683 PMQAERFFDALKGHGALSRLVLLPFE 708 (744)
Q Consensus 683 ~~~~~~~~~~l~~~~~~~~~~~~~~~ 708 (744)
.....++.+++++.|..++...+++-
T Consensus 218 ~~~~~~l~~~~~~~g~~v~~~~~~~~ 243 (266)
T TIGR03101 218 SPVFSRLGEQWVQSGVEVTVDLVPGP 243 (266)
T ss_pred CHHHHHHHHHHHHcCCeEeeeecCCc
Confidence 45677899999999999999999987
No 120
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.36 E-value=3.5e-11 Score=116.05 Aligned_cols=124 Identities=11% Similarity=0.062 Sum_probs=71.9
Q ss_pred CcceEEEEeCCCC--eeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCC
Q 004574 207 FSQKVQVWTTDGK--LVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAE 284 (744)
Q Consensus 207 ~~~~l~~~~~~g~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~ 284 (744)
....+++|+.... .+.+++.+... +..+.|||||++ |+..++.+. +-+++.
T Consensus 344 Dd~tlflW~p~~~kkpi~rmtgHq~l-------------Vn~V~fSPd~r~-IASaSFDkS------------VkLW~g- 396 (480)
T KOG0271|consen 344 DDFTLFLWNPFKSKKPITRMTGHQAL-------------VNHVSFSPDGRY-IASASFDKS------------VKLWDG- 396 (480)
T ss_pred CCceEEEecccccccchhhhhchhhh-------------eeeEEECCCccE-EEEeecccc------------eeeeeC-
Confidence 3358899986532 23333333211 667899999998 777765321 444444
Q ss_pred CCCCCCceEeee-eccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCC
Q 004574 285 PAEGEKPEILHK-LDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTST 363 (744)
Q Consensus 285 ~~~~~~~~~l~~-~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spd 363 (744)
.+|+....+. .-..+..++||.|.+.|+..+ ....|.++++.+. +...=..+....+ +.+.||||
T Consensus 397 --~tGk~lasfRGHv~~VYqvawsaDsRLlVS~S----kDsTLKvw~V~tk--Kl~~DLpGh~DEV------f~vDwspD 462 (480)
T KOG0271|consen 397 --RTGKFLASFRGHVAAVYQVAWSADSRLLVSGS----KDSTLKVWDVRTK--KLKQDLPGHADEV------FAVDWSPD 462 (480)
T ss_pred --CCcchhhhhhhccceeEEEEeccCccEEEEcC----CCceEEEEEeeee--eecccCCCCCceE------EEEEecCC
Confidence 3344333332 345678999999999888766 2234666666552 1111122222222 23679999
Q ss_pred CCeEEEEe
Q 004574 364 GTNVIAKI 371 (744)
Q Consensus 364 g~~l~~~~ 371 (744)
|+.++...
T Consensus 463 G~rV~sgg 470 (480)
T KOG0271|consen 463 GQRVASGG 470 (480)
T ss_pred CceeecCC
Confidence 99987653
No 121
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=99.35 E-value=1.6e-10 Score=120.44 Aligned_cols=292 Identities=15% Similarity=0.155 Sum_probs=157.7
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
.+|++++++. .+|+..++.....+...+...+|||+++|..+... ....+.-..|.++.++|+.+.+.......
T Consensus 13 ~gI~~~~~d~--~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~----~~~~g~v~~~~i~~~~g~L~~~~~~~~~g 86 (345)
T PF10282_consen 13 GGIYVFRFDE--ETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEG----SGDSGGVSSYRIDPDTGTLTLLNSVPSGG 86 (345)
T ss_dssp TEEEEEEEET--TTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETT----SSTTTEEEEEEEETTTTEEEEEEEEEESS
T ss_pred CcEEEEEEcC--CCCCceEeeeecCCCCCceEEEEeCCCEEEEEEcc----ccCCCCEEEEEECCCcceeEEeeeeccCC
Confidence 5899999944 55777777655555577888999999877665421 11244455566665557766664332111
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
. .-..++.+|||++|+.+... .+.+.++++
T Consensus 87 ~--~p~~i~~~~~g~~l~vany~------------------------------------------------~g~v~v~~l 116 (345)
T PF10282_consen 87 S--SPCHIAVDPDGRFLYVANYG------------------------------------------------GGSVSVFPL 116 (345)
T ss_dssp S--CEEEEEECTTSSEEEEEETT------------------------------------------------TTEEEEEEE
T ss_pred C--CcEEEEEecCCCEEEEEEcc------------------------------------------------CCeEEEEEc
Confidence 0 12356889999999887432 134444444
Q ss_pred --CCCeeec---C-----------CC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe--eeecc
Q 004574 165 --DGTAKDF---G-----------TP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL--VRELC 225 (744)
Q Consensus 165 --~G~~~~l---~-----------~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~--~~~l~ 225 (744)
+|..... . .. ...+...++|||++|+...... ..|++|+.+... .....
T Consensus 117 ~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dlG~------------D~v~~~~~~~~~~~l~~~~ 184 (345)
T PF10282_consen 117 DDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDLGA------------DRVYVYDIDDDTGKLTPVD 184 (345)
T ss_dssp CTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEETTT------------TEEEEEEE-TTS-TEEEEE
T ss_pred cCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEecCC------------CEEEEEEEeCCCceEEEee
Confidence 4532222 1 11 3456789999999998764443 368888876543 32211
Q ss_pred CCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee--------
Q 004574 226 DLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL-------- 297 (744)
Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~-------- 297 (744)
..... ...|++.++|+|||+. +|.+... ...|.+++... ..+..+.+...
T Consensus 185 ~~~~~---------~G~GPRh~~f~pdg~~-~Yv~~e~-----------s~~v~v~~~~~-~~g~~~~~~~~~~~~~~~~ 242 (345)
T PF10282_consen 185 SIKVP---------PGSGPRHLAFSPDGKY-AYVVNEL-----------SNTVSVFDYDP-SDGSLTEIQTISTLPEGFT 242 (345)
T ss_dssp EEECS---------TTSSEEEEEE-TTSSE-EEEEETT-----------TTEEEEEEEET-TTTEEEEEEEEESCETTSC
T ss_pred ccccc---------cCCCCcEEEEcCCcCE-EEEecCC-----------CCcEEEEeecc-cCCceeEEEEeeecccccc
Confidence 11111 1134889999999996 5555321 22344444410 23332222111
Q ss_pred -ccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCC
Q 004574 298 -DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKEND 376 (744)
Q Consensus 298 -~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~ 376 (744)
......+.+||||+.|+.+. .......+|.+|..+ +..+.+..-..... .|- .+.++|||++|+.....
T Consensus 243 ~~~~~~~i~ispdg~~lyvsn-r~~~sI~vf~~d~~~--g~l~~~~~~~~~G~--~Pr--~~~~s~~g~~l~Va~~~--- 312 (345)
T PF10282_consen 243 GENAPAEIAISPDGRFLYVSN-RGSNSISVFDLDPAT--GTLTLVQTVPTGGK--FPR--HFAFSPDGRYLYVANQD--- 312 (345)
T ss_dssp SSSSEEEEEE-TTSSEEEEEE-CTTTEEEEEEECTTT--TTEEEEEEEEESSS--SEE--EEEE-TTSSEEEEEETT---
T ss_pred ccCCceeEEEecCCCEEEEEe-ccCCEEEEEEEecCC--CceEEEEEEeCCCC--Ccc--EEEEeCCCCEEEEEecC---
Confidence 11355688999999777654 433444555554444 34333322111100 011 15689999999887632
Q ss_pred cceEEEEccCCCCCCCCCceEEEEecCCCceeEEe
Q 004574 377 EQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIW 411 (744)
Q Consensus 377 ~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~ 411 (744)
...-.++.+|..+|..+.+.
T Consensus 313 ---------------s~~v~vf~~d~~tG~l~~~~ 332 (345)
T PF10282_consen 313 ---------------SNTVSVFDIDPDTGKLTPVG 332 (345)
T ss_dssp ---------------TTEEEEEEEETTTTEEEEEE
T ss_pred ---------------CCeEEEEEEeCCCCcEEEec
Confidence 23344667787888876553
No 122
>PRK11126 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase; Provisional
Probab=99.35 E-value=2.8e-11 Score=120.38 Aligned_cols=184 Identities=15% Similarity=0.105 Sum_probs=108.5
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC-----hHHHHHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL-----PNDSAEAAVEEVVR 587 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~-----~~~d~~~~~~~l~~ 587 (744)
.|.||++||.+... ..| ..+...| .+|.|++ .+.+|+|.+. ..++..+.+..+.+
T Consensus 2 ~p~vvllHG~~~~~--------------~~w-~~~~~~l--~~~~vi~---~D~~G~G~S~~~~~~~~~~~~~~l~~~l~ 61 (242)
T PRK11126 2 LPWLVFLHGLLGSG--------------QDW-QPVGEAL--PDYPRLY---IDLPGHGGSAAISVDGFADVSRLLSQTLQ 61 (242)
T ss_pred CCEEEEECCCCCCh--------------HHH-HHHHHHc--CCCCEEE---ecCCCCCCCCCccccCHHHHHHHHHHHHH
Confidence 36899999975221 111 1233333 4799999 5666666542 12222333323333
Q ss_pred cCCCCCCcEEEEEechHHHHHHHHHHhCCCc-eeEEEEccCCCCCCCCC-----------Cc--ccccc-----------
Q 004574 588 RGVADPSRIAVGGHSYGAFMTAHLLAHAPHL-FCCGIARSGSYNKTLTP-----------FG--FQTEF----------- 642 (744)
Q Consensus 588 ~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~-~~~~v~~~~~~~~~~~~-----------~~--~~~~~----------- 642 (744)
.. +.+++.++||||||.+|+.++.++|+. ++++++.++........ +. +....
T Consensus 62 ~~--~~~~~~lvG~S~Gg~va~~~a~~~~~~~v~~lvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (242)
T PRK11126 62 SY--NILPYWLVGYSLGGRIAMYYACQGLAGGLCGLIVEGGNPGLQNAEERQARWQNDRQWAQRFRQEPLEQVLADWYQQ 139 (242)
T ss_pred Hc--CCCCeEEEEECHHHHHHHHHHHhCCcccccEEEEeCCCCCCCCHHHHHHHHhhhHHHHHHhccCcHHHHHHHHHhc
Confidence 22 236899999999999999999998654 99999877543210000 00 00000
Q ss_pred ----cchhhcHHH---------------H-Hhc------CcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhC
Q 004574 643 ----RTLWEATNV---------------Y-IEM------SPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGH 696 (744)
Q Consensus 643 ----~~~~~~~~~---------------~-~~~------~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~ 696 (744)
......... + ... +....+.++++|+|+++|++|..+. . +.+.
T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~P~lii~G~~D~~~~---~----~~~~---- 208 (242)
T PRK11126 140 PVFASLNAEQRQQLVAKRSNNNGAAVAAMLEATSLAKQPDLRPALQALTFPFYYLCGERDSKFQ---A----LAQQ---- 208 (242)
T ss_pred chhhccCccHHHHHHHhcccCCHHHHHHHHHhcCcccCCcHHHHhhccCCCeEEEEeCCcchHH---H----HHHH----
Confidence 000000000 0 000 1122456789999999999998532 1 2111
Q ss_pred CCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 697 GALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 697 ~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
...+++++++++|.+. .+.++.+.+.+.+||.+
T Consensus 209 -~~~~~~~i~~~gH~~~-~e~p~~~~~~i~~fl~~ 241 (242)
T PRK11126 209 -LALPLHVIPNAGHNAH-RENPAAFAASLAQILRL 241 (242)
T ss_pred -hcCeEEEeCCCCCchh-hhChHHHHHHHHHHHhh
Confidence 1468999999999877 67788999999999964
No 123
>KOG4409 consensus Predicted hydrolase/acyltransferase (alpha/beta hydrolase superfamily) [General function prediction only]
Probab=99.35 E-value=1.7e-11 Score=119.26 Aligned_cols=192 Identities=15% Similarity=0.045 Sum_probs=118.1
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC----------hHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL----------PNDSAEAAV 582 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~----------~~~d~~~~~ 582 (744)
+..+|++||.|... +........|++ ...|++ ++..|+|.+. ....+.+.+
T Consensus 90 ~~plVliHGyGAg~---------------g~f~~Nf~~La~-~~~vya---iDllG~G~SSRP~F~~d~~~~e~~fvesi 150 (365)
T KOG4409|consen 90 KTPLVLIHGYGAGL---------------GLFFRNFDDLAK-IRNVYA---IDLLGFGRSSRPKFSIDPTTAEKEFVESI 150 (365)
T ss_pred CCcEEEEeccchhH---------------HHHHHhhhhhhh-cCceEE---ecccCCCCCCCCCCCCCcccchHHHHHHH
Confidence 55678899965221 001122334444 788888 6677776652 112444555
Q ss_pred HHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC--CCc-------cc--------ccc---
Q 004574 583 EEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT--PFG-------FQ--------TEF--- 642 (744)
Q Consensus 583 ~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~--~~~-------~~--------~~~--- 642 (744)
+.-+..-.+ +|..|+|||+||++|..+|.++|++++-+|+.+|.--.... ... +. ...
T Consensus 151 E~WR~~~~L--~KmilvGHSfGGYLaa~YAlKyPerV~kLiLvsP~Gf~~~~~~~~~~~~~~~~w~~~~~~~~~~~nPl~ 228 (365)
T KOG4409|consen 151 EQWRKKMGL--EKMILVGHSFGGYLAAKYALKYPERVEKLILVSPWGFPEKPDSEPEFTKPPPEWYKALFLVATNFNPLA 228 (365)
T ss_pred HHHHHHcCC--cceeEeeccchHHHHHHHHHhChHhhceEEEecccccccCCCcchhhcCCChHHHhhhhhhhhcCCHHH
Confidence 444443334 58999999999999999999999999999999985211100 000 00 000
Q ss_pred ----cchh---------------------hcH-H--HHHh----------------------cCcccccCCCC--CCEEE
Q 004574 643 ----RTLW---------------------EAT-N--VYIE----------------------MSPITHANKIK--KPILI 670 (744)
Q Consensus 643 ----~~~~---------------------~~~-~--~~~~----------------------~~~~~~~~~~~--~P~l~ 670 (744)
..+| ++. . .|.. ...++.+..++ +|+++
T Consensus 229 ~LR~~Gp~Gp~Lv~~~~~d~~~k~~~~~~ed~l~~YiY~~n~~~psgE~~fk~l~~~~g~Ar~Pm~~r~~~l~~~~pv~f 308 (365)
T KOG4409|consen 229 LLRLMGPLGPKLVSRLRPDRFRKFPSLIEEDFLHEYIYHCNAQNPSGETAFKNLFEPGGWARRPMIQRLRELKKDVPVTF 308 (365)
T ss_pred HHHhccccchHHHhhhhHHHHHhccccchhHHHHHHHHHhcCCCCcHHHHHHHHHhccchhhhhHHHHHHhhccCCCEEE
Confidence 0000 000 0 0000 00122233344 99999
Q ss_pred EeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 671 IHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 671 i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
|+|++|..- ...+.++...+ ....++.+++|++||... .++++.+++.++.++++
T Consensus 309 iyG~~dWmD---~~~g~~~~~~~--~~~~~~~~~v~~aGHhvy-lDnp~~Fn~~v~~~~~~ 363 (365)
T KOG4409|consen 309 IYGDRDWMD---KNAGLEVTKSL--MKEYVEIIIVPGAGHHVY-LDNPEFFNQIVLEECDK 363 (365)
T ss_pred EecCccccc---chhHHHHHHHh--hcccceEEEecCCCceee-cCCHHHHHHHHHHHHhc
Confidence 999999863 46666666655 335689999999999876 78899999999998875
No 124
>COG0429 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=99.34 E-value=8e-11 Score=113.98 Aligned_cols=219 Identities=18% Similarity=0.170 Sum_probs=130.8
Q ss_pred EEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCC
Q 004574 487 YQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPI 566 (744)
Q Consensus 487 ~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~ 566 (744)
+.-.+|..+...+..++. ..+.|+||.+||- .|+- .+.+....+..+..+||.|+. +..
T Consensus 54 v~~pdg~~~~ldw~~~p~-----~~~~P~vVl~HGL-----------~G~s--~s~y~r~L~~~~~~rg~~~Vv---~~~ 112 (345)
T COG0429 54 LETPDGGFIDLDWSEDPR-----AAKKPLVVLFHGL-----------EGSS--NSPYARGLMRALSRRGWLVVV---FHF 112 (345)
T ss_pred EEcCCCCEEEEeeccCcc-----ccCCceEEEEecc-----------CCCC--cCHHHHHHHHHHHhcCCeEEE---Eec
Confidence 333455445444544332 2236999999993 2211 111222345577789999988 455
Q ss_pred CCCCCCC----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHH-HHHHHHHHhCCC-ceeEEEEccCCCCCCCC
Q 004574 567 IGEGDKL----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGA-FMTAHLLAHAPH-LFCCGIARSGSYNKTLT 634 (744)
Q Consensus 567 ~g~g~~~----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG-~~a~~~~~~~p~-~~~~~v~~~~~~~~~~~ 634 (744)
+|++.+. ..+|+..++++++++.. +.++.++|.|+|| ++|.+++.+--+ .+.|+++++.++|....
T Consensus 113 Rgcs~~~n~~p~~yh~G~t~D~~~~l~~l~~~~~--~r~~~avG~SLGgnmLa~ylgeeg~d~~~~aa~~vs~P~Dl~~~ 190 (345)
T COG0429 113 RGCSGEANTSPRLYHSGETEDIRFFLDWLKARFP--PRPLYAVGFSLGGNMLANYLGEEGDDLPLDAAVAVSAPFDLEAC 190 (345)
T ss_pred ccccCCcccCcceecccchhHHHHHHHHHHHhCC--CCceEEEEecccHHHHHHHHHhhccCcccceeeeeeCHHHHHHH
Confidence 5543331 22599999999998643 4689999999999 555555544311 34555555543331000
Q ss_pred ------CCc--------------------------cccc-------ccchh--------------hcHHHHHhcCccccc
Q 004574 635 ------PFG--------------------------FQTE-------FRTLW--------------EATNVYIEMSPITHA 661 (744)
Q Consensus 635 ------~~~--------------------------~~~~-------~~~~~--------------~~~~~~~~~~~~~~~ 661 (744)
.+. .+.. .+..| +..+.|.+.|.+..+
T Consensus 191 ~~~l~~~~s~~ly~r~l~~~L~~~~~~kl~~l~~~~p~~~~~~ik~~~ti~eFD~~~Tap~~Gf~da~dYYr~aSs~~~L 270 (345)
T COG0429 191 AYRLDSGFSLRLYSRYLLRNLKRNAARKLKELEPSLPGTVLAAIKRCRTIREFDDLLTAPLHGFADAEDYYRQASSLPLL 270 (345)
T ss_pred HHHhcCchhhhhhHHHHHHHHHHHHHHHHHhcCcccCcHHHHHHHhhchHHhccceeeecccCCCcHHHHHHhccccccc
Confidence 000 0000 01111 123566777889999
Q ss_pred CCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHh-CCCcEEEEEeCCCCcccCcc--c-cHH-HHHHHHHHHHHHhcc
Q 004574 662 NKIKKPILIIHGEVDDKVGLFPMQAERFFDALKG-HGALSRLVLLPFEHHVYAAR--E-NVM-HVIWETDRWLQKYCL 734 (744)
Q Consensus 662 ~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~H~~~~~--~-~~~-~~~~~~~~fl~~~l~ 734 (744)
++|.+|+||||..+|++++ ....- .... ....+.+.+-+.+||.-... . ... -.-+++.+||+..+.
T Consensus 271 ~~Ir~PtLii~A~DDP~~~--~~~iP----~~~~~~np~v~l~~t~~GGHvGfl~~~~~~~~~W~~~ri~~~l~~~~~ 342 (345)
T COG0429 271 PKIRKPTLIINAKDDPFMP--PEVIP----KLQEMLNPNVLLQLTEHGGHVGFLGGKLLHPQMWLEQRILDWLDPFLE 342 (345)
T ss_pred cccccceEEEecCCCCCCC--hhhCC----cchhcCCCceEEEeecCCceEEeccCccccchhhHHHHHHHHHHHHHh
Confidence 9999999999999999987 33222 2222 46778999999999963311 1 121 334668999987754
No 125
>PLN02211 methyl indole-3-acetate methyltransferase
Probab=99.34 E-value=6.5e-11 Score=119.15 Aligned_cols=190 Identities=15% Similarity=0.118 Sum_probs=113.3
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC-------ChHH-HHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK-------LPND-SAEAAVEE 584 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~-------~~~~-d~~~~~~~ 584 (744)
.|.||++||.+... ..| ..+...|.++||.|+. .+.+|+|.+ ...+ .+...+++
T Consensus 18 ~p~vvliHG~~~~~-----------~~w----~~~~~~L~~~g~~vi~---~dl~g~G~s~~~~~~~~~~~~~~~~l~~~ 79 (273)
T PLN02211 18 PPHFVLIHGISGGS-----------WCW----YKIRCLMENSGYKVTC---IDLKSAGIDQSDADSVTTFDEYNKPLIDF 79 (273)
T ss_pred CCeEEEECCCCCCc-----------CcH----HHHHHHHHhCCCEEEE---ecccCCCCCCCCcccCCCHHHHHHHHHHH
Confidence 68999999964210 111 2345566678999999 445555532 1222 33344444
Q ss_pred HHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCC--C------C--------------Ccccc--
Q 004574 585 VVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTL--T------P--------------FGFQT-- 640 (744)
Q Consensus 585 l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~--~------~--------------~~~~~-- 640 (744)
+.+... .+++.|+||||||.+++.++..+|++++++|++++...... . . +....
T Consensus 80 i~~l~~--~~~v~lvGhS~GG~v~~~~a~~~p~~v~~lv~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (273)
T PLN02211 80 LSSLPE--NEKVILVGHSAGGLSVTQAIHRFPKKICLAVYVAATMLKLGFQTDEDMKDGVPDLSEFGDVYELGFGLGPDQ 157 (273)
T ss_pred HHhcCC--CCCEEEEEECchHHHHHHHHHhChhheeEEEEeccccCCCCCCHHHHHhccccchhhhccceeeeeccCCCC
Confidence 444322 36899999999999999999999999999999876421000 0 0 00000
Q ss_pred cccchh---h---------cHH---H-HHh------cCccc------ccCCC-CCCEEEEeeCCCCCCCCCHHHHHHHHH
Q 004574 641 EFRTLW---E---------ATN---V-YIE------MSPIT------HANKI-KKPILIIHGEVDDKVGLFPMQAERFFD 691 (744)
Q Consensus 641 ~~~~~~---~---------~~~---~-~~~------~~~~~------~~~~~-~~P~l~i~G~~D~~v~~~~~~~~~~~~ 691 (744)
...... + .++ . +.. ...+. ...++ .+|+++|.|++|..+| ++..+.+.+
T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP~l~I~g~~D~~ip--~~~~~~m~~ 235 (273)
T PLN02211 158 PPTSAIIKKEFRRKILYQMSPQEDSTLAAMLLRPGPILALRSARFEEETGDIDKVPRVYIKTLHDHVVK--PEQQEAMIK 235 (273)
T ss_pred CCceeeeCHHHHHHHHhcCCCHHHHHHHHHhcCCcCccccccccccccccccCccceEEEEeCCCCCCC--HHHHHHHHH
Confidence 000000 0 000 0 000 01111 12244 6899999999999998 887777776
Q ss_pred HHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 692 ALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 692 ~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
.+.. .+++.++ ++|... .+.++.+...+.++..
T Consensus 236 ~~~~----~~~~~l~-~gH~p~-ls~P~~~~~~i~~~a~ 268 (273)
T PLN02211 236 RWPP----SQVYELE-SDHSPF-FSTPFLLFGLLIKAAA 268 (273)
T ss_pred hCCc----cEEEEEC-CCCCcc-ccCHHHHHHHHHHHHH
Confidence 5532 3777786 789866 5667777777766644
No 126
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=99.34 E-value=1.9e-11 Score=115.61 Aligned_cols=214 Identities=19% Similarity=0.194 Sum_probs=132.1
Q ss_pred ceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCC
Q 004574 35 FVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPK 114 (744)
Q Consensus 35 ~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~ 114 (744)
...+||+|+|||..++ .+|.+.|..+=+..+|...-. .+..+.|+.|...++-....
T Consensus 13 ~c~fSp~g~yiAs~~~-----------yrlviRd~~tlq~~qlf~cld-----ki~yieW~ads~~ilC~~yk------- 69 (447)
T KOG4497|consen 13 FCSFSPCGNYIASLSR-----------YRLVIRDSETLQLHQLFLCLD-----KIVYIEWKADSCHILCVAYK------- 69 (447)
T ss_pred ceeECCCCCeeeeeee-----------eEEEEeccchhhHHHHHHHHH-----Hhhheeeeccceeeeeeeec-------
Confidence 5689999999999754 588999988888877753322 25577999999988765331
Q ss_pred CCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC--CeeecCCC-ceeeeeccCCCCceEEE
Q 004574 115 KTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG--TAKDFGTP-AVYTAVEPSPDQKYVLI 191 (744)
Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G--~~~~l~~~-~~~~~~~~SpDG~~i~~ 191 (744)
...+.++++.- =.-.+..+ .+.....|||||+.|+.
T Consensus 70 -----------------------------------------~~~vqvwsl~Qpew~ckIdeg~agls~~~WSPdgrhiL~ 108 (447)
T KOG4497|consen 70 -----------------------------------------DPKVQVWSLVQPEWYCKIDEGQAGLSSISWSPDGRHILL 108 (447)
T ss_pred -----------------------------------------cceEEEEEeecceeEEEeccCCCcceeeeECCCcceEee
Confidence 13455556522 23345555 77889999999999998
Q ss_pred EEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCcc
Q 004574 192 TSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVE 271 (744)
Q Consensus 192 ~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~ 271 (744)
++.-. -+|-+|.+.++....+...... ...++|.|||+. .+..+.++=-+..
T Consensus 109 tseF~------------lriTVWSL~t~~~~~~~~pK~~-------------~kg~~f~~dg~f-~ai~sRrDCkdyv-- 160 (447)
T KOG4497|consen 109 TSEFD------------LRITVWSLNTQKGYLLPHPKTN-------------VKGYAFHPDGQF-CAILSRRDCKDYV-- 160 (447)
T ss_pred eecce------------eEEEEEEeccceeEEecccccC-------------ceeEEECCCCce-eeeeecccHHHHH--
Confidence 86442 4788899888776665443322 557899999996 6655433221110
Q ss_pred CCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccC
Q 004574 272 VSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYS 351 (744)
Q Consensus 272 ~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~ 351 (744)
.|..... -.-.+..--.......+.|||||.+|+. ++..-.+.+|.+.-..|
T Consensus 161 -----~i~~c~~----W~ll~~f~~dT~DltgieWsPdg~~laV--wd~~Leykv~aYe~~lG----------------- 212 (447)
T KOG4497|consen 161 -----QISSCKA----WILLKEFKLDTIDLTGIEWSPDGNWLAV--WDNVLEYKVYAYERGLG----------------- 212 (447)
T ss_pred -----HHHhhHH----HHHHHhcCCCcccccCceECCCCcEEEE--ecchhhheeeeeeeccc-----------------
Confidence 0111000 0000000001223456899999999987 44445566666654321
Q ss_pred CCCCCceeeCCCCCeEEEEe
Q 004574 352 DPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 352 ~~~~~~~~~spdg~~l~~~~ 371 (744)
.-..+|||-++.|+..+
T Consensus 213 ---~k~v~wsP~~qflavGs 229 (447)
T KOG4497|consen 213 ---LKFVEWSPCNQFLAVGS 229 (447)
T ss_pred ---eeEEEeccccceEEeec
Confidence 11144777777777665
No 127
>PRK10439 enterobactin/ferric enterobactin esterase; Provisional
Probab=99.33 E-value=1.8e-10 Score=121.22 Aligned_cols=215 Identities=13% Similarity=-0.000 Sum_probs=130.6
Q ss_pred ceEEEEEEcC-CCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCe---
Q 004574 481 QKEMIKYQRK-DGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRF--- 556 (744)
Q Consensus 481 ~~~~i~~~~~-~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~--- 556 (744)
..+.+++.+. -|.....++|.|+++. .+++|+|+++||..|.. .......+..+.+.|.
T Consensus 179 ~~~~~~~~S~~Lg~~r~v~VY~P~~y~---~~~~PvlyllDG~~w~~--------------~~~~~~~ld~li~~g~i~P 241 (411)
T PRK10439 179 PAKEIIWKSERLGNSRRVWIYTTGDAA---PEERPLAILLDGQFWAE--------------SMPVWPALDSLTHRGQLPP 241 (411)
T ss_pred ceEEEEEEccccCCceEEEEEECCCCC---CCCCCEEEEEECHHhhh--------------cCCHHHHHHHHHHcCCCCc
Confidence 3455666553 3667888999999875 23599999999964321 0011123445566673
Q ss_pred -EEEecCCCCCCC----CCCCC-hHH-HHHHHHHHHHHcCC--CCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccC
Q 004574 557 -AVLAGPSIPIIG----EGDKL-PND-SAEAAVEEVVRRGV--ADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSG 627 (744)
Q Consensus 557 -~v~~~~~~~~~g----~g~~~-~~~-d~~~~~~~l~~~~~--~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~ 627 (744)
+++.++..+... .+... ..+ -+.+.+.++.++.. .|+++.+|+|+||||..|+.++.++|+.|.++++++|
T Consensus 242 ~ivV~id~~~~~~R~~el~~~~~f~~~l~~eLlP~I~~~y~~~~d~~~~~IaG~S~GGl~AL~~al~~Pd~Fg~v~s~Sg 321 (411)
T PRK10439 242 AVYLLIDAIDTTHRSQELPCNADFWLAVQQELLPQVRAIAPFSDDADRTVVAGQSFGGLAALYAGLHWPERFGCVLSQSG 321 (411)
T ss_pred eEEEEECCCCcccccccCCchHHHHHHHHHHHHHHHHHhCCCCCCccceEEEEEChHHHHHHHHHHhCcccccEEEEecc
Confidence 333433221111 11111 111 23455567776643 4778999999999999999999999999999999999
Q ss_pred CCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCC
Q 004574 628 SYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPF 707 (744)
Q Consensus 628 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 707 (744)
.+-+.... .....| ..+.+.+. .....+..++|-+|+.|... ....+++.+.|+++|.++.+.++++
T Consensus 322 s~ww~~~~-----~~~~~~-l~~~l~~~----~~~~~~lr~~i~~G~~E~~~---~~~~~~l~~~L~~~G~~~~~~~~~G 388 (411)
T PRK10439 322 SFWWPHRG-----GQQEGV-LLEQLKAG----EVSARGLRIVLEAGRREPMI---MRANQALYAQLHPAGHSVFWRQVDG 388 (411)
T ss_pred ceecCCcc-----CCchhH-HHHHHHhc----ccCCCCceEEEeCCCCCchH---HHHHHHHHHHHHHCCCcEEEEECCC
Confidence 64211000 000000 01111110 01112345888899998654 5788999999999999999999998
Q ss_pred CCcccCccccHHHHHHHHHHHH
Q 004574 708 EHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 708 ~~H~~~~~~~~~~~~~~~~~fl 729 (744)
+|.... +...+...+.||
T Consensus 389 -GHd~~~---Wr~~L~~~L~~l 406 (411)
T PRK10439 389 -GHDALC---WRGGLIQGLIDL 406 (411)
T ss_pred -CcCHHH---HHHHHHHHHHHH
Confidence 596432 333344444444
No 128
>KOG3101 consensus Esterase D [General function prediction only]
Probab=99.33 E-value=9.5e-12 Score=110.57 Aligned_cols=206 Identities=20% Similarity=0.192 Sum_probs=131.5
Q ss_pred CeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCC----
Q 004574 492 GVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPII---- 567 (744)
Q Consensus 492 g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~---- 567 (744)
+..+..-+|+|+++..++ +.|++.|+-|- +.+...+..-.. .-+....+|++|+.++..+..
T Consensus 25 ~c~Mtf~vylPp~a~~~k--~~P~lf~LSGL-----------TCT~~Nfi~Ksg-~qq~As~hgl~vV~PDTSPRG~~v~ 90 (283)
T KOG3101|consen 25 KCSMTFGVYLPPDAPRGK--RCPVLFYLSGL-----------TCTHENFIEKSG-FQQQASKHGLAVVAPDTSPRGVEVA 90 (283)
T ss_pred ccceEEEEecCCCcccCC--cCceEEEecCC-----------cccchhhHhhhh-HHHhHhhcCeEEECCCCCCCccccC
Confidence 456777899998876544 59999999883 333333432111 122335689999985543221
Q ss_pred C------CCCC----------------ChHHHH-HHHHHHHH-HcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEE
Q 004574 568 G------EGDK----------------LPNDSA-EAAVEEVV-RRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGI 623 (744)
Q Consensus 568 g------~g~~----------------~~~~d~-~~~~~~l~-~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v 623 (744)
| +|.. .+++-+ ++..+-+. ....+|+.|++|.||||||+-|+..+.+.|.+++.+.
T Consensus 91 g~~eswDFG~GAGFYvnAt~epw~~~yrMYdYv~kELp~~l~~~~~pld~~k~~IfGHSMGGhGAl~~~Lkn~~kykSvS 170 (283)
T KOG3101|consen 91 GDDESWDFGQGAGFYVNATQEPWAKHYRMYDYVVKELPQLLNSANVPLDPLKVGIFGHSMGGHGALTIYLKNPSKYKSVS 170 (283)
T ss_pred CCcccccccCCceeEEecccchHhhhhhHHHHHHHHHHHHhccccccccchhcceeccccCCCceEEEEEcCccccccee
Confidence 1 1111 122211 12222222 1235889999999999999999999999999999999
Q ss_pred EccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCC---CCCCEEEEeeCCCCCCCCCHHH-HHHHHHHHHhCC-C
Q 004574 624 ARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANK---IKKPILIIHGEVDDKVGLFPMQ-AERFFDALKGHG-A 698 (744)
Q Consensus 624 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~P~l~i~G~~D~~v~~~~~~-~~~~~~~l~~~~-~ 698 (744)
+.+|+++....+|+........-++...|..+++...+++ ...-+||-+|..|.+.+ -+- -+.+.++.+... .
T Consensus 171 AFAPI~NP~~cpWGqKAf~gYLG~~ka~W~~yDat~lik~y~~~~~~ilIdqG~~D~Fl~--~qLlPe~l~~a~~~~~~~ 248 (283)
T KOG3101|consen 171 AFAPICNPINCPWGQKAFTGYLGDNKAQWEAYDATHLIKNYRGVGDDILIDQGAADNFLA--EQLLPENLLEACKATWQA 248 (283)
T ss_pred ccccccCcccCcchHHHhhcccCCChHHHhhcchHHHHHhcCCCCccEEEecCccchhhh--hhcChHHHHHHhhccccc
Confidence 9999999877777654444444455666666766554444 45569999999998853 111 233444444332 5
Q ss_pred cEEEEEeCCCCcccC
Q 004574 699 LSRLVLLPFEHHVYA 713 (744)
Q Consensus 699 ~~~~~~~~~~~H~~~ 713 (744)
++.+..-++-.|...
T Consensus 249 ~v~~r~~~gyDHSYy 263 (283)
T KOG3101|consen 249 PVVFRLQEGYDHSYY 263 (283)
T ss_pred cEEEEeecCCCccee
Confidence 778888888888654
No 129
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.30 E-value=1.6e-09 Score=109.90 Aligned_cols=264 Identities=15% Similarity=0.124 Sum_probs=157.8
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.+....|+|||++|++... .+.|+++++.+++.......... .+..+.|+|+++.|++...+
T Consensus 11 ~i~~~~~~~~~~~l~~~~~----------~g~i~i~~~~~~~~~~~~~~~~~----~i~~~~~~~~~~~l~~~~~~---- 72 (289)
T cd00200 11 GVTCVAFSPDGKLLATGSG----------DGTIKVWDLETGELLRTLKGHTG----PVRDVAASADGTYLASGSSD---- 72 (289)
T ss_pred CEEEEEEcCCCCEEEEeec----------CcEEEEEEeeCCCcEEEEecCCc----ceeEEEECCCCCEEEEEcCC----
Confidence 5888999999999998642 35677778777653332222111 23477999999888876321
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ceeeeeccCCCCce
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AVYTAVEPSPDQKY 188 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~~~~~~~SpDG~~ 188 (744)
+.|+++++ ++ ....+... ..+..+.|+|+++.
T Consensus 73 ---------------------------------------------~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 107 (289)
T cd00200 73 ---------------------------------------------KTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRI 107 (289)
T ss_pred ---------------------------------------------CeEEEEEcCcccceEEEeccCCcEEEEEEcCCCCE
Confidence 57888888 44 44555444 57888999999775
Q ss_pred EEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCC
Q 004574 189 VLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGD 267 (744)
Q Consensus 189 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~ 267 (744)
|+..... ..+.+|++...+ ...+.... ..+..+.|+|+++. ++...
T Consensus 108 ~~~~~~~-------------~~i~~~~~~~~~~~~~~~~~~-------------~~i~~~~~~~~~~~-l~~~~------ 154 (289)
T cd00200 108 LSSSSRD-------------KTIKVWDVETGKCLTTLRGHT-------------DWVNSVAFSPDGTF-VASSS------ 154 (289)
T ss_pred EEEecCC-------------CeEEEEECCCcEEEEEeccCC-------------CcEEEEEEcCcCCE-EEEEc------
Confidence 5544312 378999987443 33333121 11567899999763 33331
Q ss_pred CCccCCccceEEeccCCCCCCCCc-eEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeecccc
Q 004574 268 ANVEVSPRDIIYTQPAEPAEGEKP-EILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVF 346 (744)
Q Consensus 268 ~~~~~~~~~~l~~~~~~~~~~~~~-~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 346 (744)
..+.|.+++. ..++. ..+......+..+.|+|+++.+++... ...+.++|+..+ +.........
T Consensus 155 ------~~~~i~i~d~---~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~----~~~i~i~d~~~~--~~~~~~~~~~ 219 (289)
T cd00200 155 ------QDGTIKLWDL---RTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSS----DGTIKLWDLSTG--KCLGTLRGHE 219 (289)
T ss_pred ------CCCcEEEEEc---cccccceeEecCccccceEEECCCcCEEEEecC----CCcEEEEECCCC--ceecchhhcC
Confidence 1224777777 33333 333333457888999999998888763 345777887652 2222221111
Q ss_pred ccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeee
Q 004574 347 ENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALV 426 (744)
Q Consensus 347 ~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~ 426 (744)
.. ...+.|+|++..++... ....|++||..+++.......... .+
T Consensus 220 ~~------i~~~~~~~~~~~~~~~~---------------------~~~~i~i~~~~~~~~~~~~~~~~~----~i---- 264 (289)
T cd00200 220 NG------VNSVAFSPDGYLLASGS---------------------EDGTIRVWDLRTGECVQTLSGHTN----SV---- 264 (289)
T ss_pred Cc------eEEEEEcCCCcEEEEEc---------------------CCCcEEEEEcCCceeEEEccccCC----cE----
Confidence 11 11256899976665543 112488888877665444332211 11
Q ss_pred cCCcceecccCCCEEEEE
Q 004574 427 FGQGEEDINLNQLKILTS 444 (744)
Q Consensus 427 ~~~~~~~~s~d~~~~~~~ 444 (744)
..++|+|+++.++..
T Consensus 265 ---~~~~~~~~~~~l~~~ 279 (289)
T cd00200 265 ---TSLAWSPDGKRLASG 279 (289)
T ss_pred ---EEEEECCCCCEEEEe
Confidence 125788888766543
No 130
>KOG1838 consensus Alpha/beta hydrolase [General function prediction only]
Probab=99.28 E-value=2.2e-10 Score=115.43 Aligned_cols=232 Identities=16% Similarity=0.124 Sum_probs=145.5
Q ss_pred cCCCceEEEEEEcCCCeEEEEEEEeCCCCCC-CCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCC
Q 004574 477 LASLQKEMIKYQRKDGVPLTATLYLPPGYDQ-SKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARR 555 (744)
Q Consensus 477 ~~~~~~~~i~~~~~~g~~l~~~~~~P~~~~~-~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G 555 (744)
.+.+.-++..+.-.||..+...++.+..... .+.+..|+||++||-. + .+...+-...+....++|
T Consensus 88 ~p~~~y~Reii~~~DGG~~~lDW~~~~~~~~~~~~~~~P~vvilpGlt-----------g--~S~~~YVr~lv~~a~~~G 154 (409)
T KOG1838|consen 88 KPPVEYTREIIKTSDGGTVTLDWVENPDSRCRTDDGTDPIVVILPGLT-----------G--GSHESYVRHLVHEAQRKG 154 (409)
T ss_pred CCCCcceeEEEEeCCCCEEEEeeccCcccccCCCCCCCcEEEEecCCC-----------C--CChhHHHHHHHHHHHhCC
Confidence 3344555555556788889888887765321 1134579999999942 1 112222223455666889
Q ss_pred eEEEecCCCCCCCCCCCC----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC---ceeEE
Q 004574 556 FAVLAGPSIPIIGEGDKL----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH---LFCCG 622 (744)
Q Consensus 556 ~~v~~~~~~~~~g~g~~~----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~---~~~~~ 622 (744)
|.|+. +..+|.+.+. ..+|+.+++++++++.. ..++..+|.||||.+....+++..+ ..+|+
T Consensus 155 ~r~VV---fN~RG~~g~~LtTpr~f~ag~t~Dl~~~v~~i~~~~P--~a~l~avG~S~Gg~iL~nYLGE~g~~~~l~~a~ 229 (409)
T KOG1838|consen 155 YRVVV---FNHRGLGGSKLTTPRLFTAGWTEDLREVVNHIKKRYP--QAPLFAVGFSMGGNILTNYLGEEGDNTPLIAAV 229 (409)
T ss_pred cEEEE---ECCCCCCCCccCCCceeecCCHHHHHHHHHHHHHhCC--CCceEEEEecchHHHHHHHhhhccCCCCceeEE
Confidence 99988 5555544432 23499999999999753 2589999999999999999887533 34444
Q ss_pred EEccCCCCCCCC-------------------------------------CCcccccc---cc-----------hhhcHHH
Q 004574 623 IARSGSYNKTLT-------------------------------------PFGFQTEF---RT-----------LWEATNV 651 (744)
Q Consensus 623 v~~~~~~~~~~~-------------------------------------~~~~~~~~---~~-----------~~~~~~~ 651 (744)
++.+| +|.... .+....+. +. +-...+.
T Consensus 230 ~v~~P-wd~~~~~~~~~~~~~~~~y~~~l~~~l~~~~~~~r~~~~~~~vd~d~~~~~~SvreFD~~~t~~~~gf~~~deY 308 (409)
T KOG1838|consen 230 AVCNP-WDLLAASRSIETPLYRRFYNRALTLNLKRIVLRHRHTLFEDPVDFDVILKSRSVREFDEALTRPMFGFKSVDEY 308 (409)
T ss_pred EEecc-chhhhhhhHHhcccchHHHHHHHHHhHHHHHhhhhhhhhhccchhhhhhhcCcHHHHHhhhhhhhcCCCcHHHH
Confidence 44444 331000 00000000 00 0012356
Q ss_pred HHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcc---ccHHHHHHH-HHH
Q 004574 652 YIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAR---ENVMHVIWE-TDR 727 (744)
Q Consensus 652 ~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~---~~~~~~~~~-~~~ 727 (744)
|.+.|+...+++|++|+|+|+..+|+++| . ++.-.- ..+ .++.+-+.+-..+||.-... ++...+++. +.+
T Consensus 309 Y~~aSs~~~v~~I~VP~L~ina~DDPv~p--~-~~ip~~-~~~-~np~v~l~~T~~GGHlgfleg~~p~~~~w~~~~l~e 383 (409)
T KOG1838|consen 309 YKKASSSNYVDKIKVPLLCINAADDPVVP--E-EAIPID-DIK-SNPNVLLVITSHGGHLGFLEGLWPSARTWMDKLLVE 383 (409)
T ss_pred HhhcchhhhcccccccEEEEecCCCCCCC--c-ccCCHH-HHh-cCCcEEEEEeCCCceeeeeccCCCccchhHHHHHHH
Confidence 77789999999999999999999999997 3 233222 222 34578888888899963322 245566666 777
Q ss_pred HHHHh
Q 004574 728 WLQKY 732 (744)
Q Consensus 728 fl~~~ 732 (744)
|+...
T Consensus 384 f~~~~ 388 (409)
T KOG1838|consen 384 FLGNA 388 (409)
T ss_pred HHHHH
Confidence 77654
No 131
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.28 E-value=3.7e-09 Score=110.25 Aligned_cols=265 Identities=13% Similarity=0.096 Sum_probs=136.7
Q ss_pred ccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECC-CCceeccccCCC
Q 004574 4 FTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAE-TGEAKPLFESPD 82 (744)
Q Consensus 4 ~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~-gg~~~~lt~~~~ 82 (744)
-..|.++++.. .|+.+.+..++.+.......+||||++|+.... ....|.+++++ .|+.+.+.....
T Consensus 11 ~~~I~~~~~~~---~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~---------~~~~i~~~~~~~~g~l~~~~~~~~ 78 (330)
T PRK11028 11 SQQIHVWNLNH---EGALTLLQVVDVPGQVQPMVISPDKRHLYVGVR---------PEFRVLSYRIADDGALTFAAESPL 78 (330)
T ss_pred CCCEEEEEECC---CCceeeeeEEecCCCCccEEECCCCCEEEEEEC---------CCCcEEEEEECCCCceEEeeeecC
Confidence 35688888853 256555554554445677899999998866432 22455555554 444333321111
Q ss_pred ccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 83 ICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 83 ~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
......++++|||+.|+.+... .+.|.++
T Consensus 79 ---~~~p~~i~~~~~g~~l~v~~~~------------------------------------------------~~~v~v~ 107 (330)
T PRK11028 79 ---PGSPTHISTDHQGRFLFSASYN------------------------------------------------ANCVSVS 107 (330)
T ss_pred ---CCCceEEEECCCCCEEEEEEcC------------------------------------------------CCeEEEE
Confidence 0123567899999998876421 0345555
Q ss_pred cC--CCCe-eec--CCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCC-CeeeeccCCCCCCCCCc
Q 004574 163 SL--DGTA-KDF--GTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDG-KLVRELCDLPPAEDIPV 235 (744)
Q Consensus 163 ~~--~G~~-~~l--~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g-~~~~~l~~~~~~~~~~~ 235 (744)
++ +|.. +.+ ... ......+++|||++|++..... ..|++|+++. +........... .+
T Consensus 108 ~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~------------~~v~v~d~~~~g~l~~~~~~~~~--~~- 172 (330)
T PRK11028 108 PLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKE------------DRIRLFTLSDDGHLVAQEPAEVT--TV- 172 (330)
T ss_pred EECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCC------------CEEEEEEECCCCcccccCCCcee--cC-
Confidence 55 3422 111 112 3345677999999987765443 3789998864 222110000000 00
Q ss_pred ccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee---------ccceeceee
Q 004574 236 CYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL---------DLRFRSVSW 306 (744)
Q Consensus 236 ~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~---------~~~~~~~~~ 306 (744)
...+++.++|+|||+. +|.+. . ..+.|.++++++ .+++.+.+... ......+.+
T Consensus 173 ----~g~~p~~~~~~pdg~~-lyv~~-~----------~~~~v~v~~~~~-~~~~~~~~~~~~~~p~~~~~~~~~~~i~~ 235 (330)
T PRK11028 173 ----EGAGPRHMVFHPNQQY-AYCVN-E----------LNSSVDVWQLKD-PHGEIECVQTLDMMPADFSDTRWAADIHI 235 (330)
T ss_pred ----CCCCCceEEECCCCCE-EEEEe-c----------CCCEEEEEEEeC-CCCCEEEEEEEecCCCcCCCCccceeEEE
Confidence 0123678899999996 44442 2 123466666521 12332221111 011124678
Q ss_pred ccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 307 CDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 307 SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
+|||++++.+. ...+ .|.+++++......+.+-..... ..|.. +.++|||++|+...
T Consensus 236 ~pdg~~lyv~~-~~~~--~I~v~~i~~~~~~~~~~~~~~~~---~~p~~--~~~~~dg~~l~va~ 292 (330)
T PRK11028 236 TPDGRHLYACD-RTAS--LISVFSVSEDGSVLSFEGHQPTE---TQPRG--FNIDHSGKYLIAAG 292 (330)
T ss_pred CCCCCEEEEec-CCCC--eEEEEEEeCCCCeEEEeEEEecc---ccCCc--eEECCCCCEEEEEE
Confidence 99999877653 3223 34444443321221111111111 11111 56899999998875
No 132
>KOG3043 consensus Predicted hydrolase related to dienelactone hydrolase [General function prediction only]
Probab=99.27 E-value=3.6e-11 Score=108.99 Aligned_cols=156 Identities=21% Similarity=0.262 Sum_probs=117.7
Q ss_pred hHHHHHhCCeEEEecCCCCCCCCCCC----------------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHH
Q 004574 547 SSLIFLARRFAVLAGPSIPIIGEGDK----------------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAH 610 (744)
Q Consensus 547 ~~~~~~~~G~~v~~~~~~~~~g~g~~----------------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~ 610 (744)
.+..++..||.|+.|+.+...-+..+ ....++..+++||+.++ +..+|+++|++|||.++..
T Consensus 59 ~Adk~A~~Gy~v~vPD~~~Gdp~~~~~~~~~~~~w~~~~~~~~~~~~i~~v~k~lk~~g--~~kkIGv~GfCwGak~vv~ 136 (242)
T KOG3043|consen 59 GADKVALNGYTVLVPDFFRGDPWSPSLQKSERPEWMKGHSPPKIWKDITAVVKWLKNHG--DSKKIGVVGFCWGAKVVVT 136 (242)
T ss_pred HHHHHhcCCcEEEcchhhcCCCCCCCCChhhhHHHHhcCCcccchhHHHHHHHHHHHcC--CcceeeEEEEeecceEEEE
Confidence 45577888999999765443111111 12238999999999664 3579999999999999999
Q ss_pred HHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHH
Q 004574 611 LLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFF 690 (744)
Q Consensus 611 ~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~ 690 (744)
+....| .|.++++..|.+.- ...+.++++|+|++.++.|..+| +....++-
T Consensus 137 ~~~~~~-~f~a~v~~hps~~d--------------------------~~D~~~vk~Pilfl~ae~D~~~p--~~~v~~~e 187 (242)
T KOG3043|consen 137 LSAKDP-EFDAGVSFHPSFVD--------------------------SADIANVKAPILFLFAELDEDVP--PKDVKAWE 187 (242)
T ss_pred eeccch-hheeeeEecCCcCC--------------------------hhHHhcCCCCEEEEeecccccCC--HHHHHHHH
Confidence 998886 77888877775320 12356789999999999999999 89888888
Q ss_pred HHHHhCCC-cEEEEEeCCCCcccCc----------cccHHHHHHHHHHHHHHhc
Q 004574 691 DALKGHGA-LSRLVLLPFEHHVYAA----------RENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 691 ~~l~~~~~-~~~~~~~~~~~H~~~~----------~~~~~~~~~~~~~fl~~~l 733 (744)
+.++.... ..++.+|++.+|+|.. ....+..++..+.||++++
T Consensus 188 e~lk~~~~~~~~v~~f~g~~HGf~~~r~~~~~Ped~~~~eea~~~~~~Wf~~y~ 241 (242)
T KOG3043|consen 188 EKLKENPAVGSQVKTFSGVGHGFVARRANISSPEDKKAAEEAYQRFISWFKHYL 241 (242)
T ss_pred HHHhcCcccceeEEEcCCccchhhhhccCCCChhHHHHHHHHHHHHHHHHHHhh
Confidence 88887542 3589999999999873 1223667888999999886
No 133
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.27 E-value=1.8e-09 Score=113.27 Aligned_cols=236 Identities=16% Similarity=0.139 Sum_probs=151.9
Q ss_pred ceeeecCCCC-CcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCc
Q 004574 21 EKEVHGYPDG-AKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNST 99 (744)
Q Consensus 21 ~~~l~~~~~~-~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~ 99 (744)
...++.++.+ ..+...+|+--|.||||.+ ..-+||-|++..+..-..-.++.. .....++.||||+
T Consensus 297 f~lih~LSis~~~I~t~~~N~tGDWiA~g~---------~klgQLlVweWqsEsYVlKQQgH~----~~i~~l~YSpDgq 363 (893)
T KOG0291|consen 297 FNLIHSLSISDQKILTVSFNSTGDWIAFGC---------SKLGQLLVWEWQSESYVLKQQGHS----DRITSLAYSPDGQ 363 (893)
T ss_pred ceEEEEeecccceeeEEEecccCCEEEEcC---------CccceEEEEEeeccceeeeccccc----cceeeEEECCCCc
Confidence 3445555555 3677899999999999965 334677777766544322211111 1245779999999
Q ss_pred EEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ce
Q 004574 100 LLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AV 176 (744)
Q Consensus 100 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~ 176 (744)
.|+-...+ +.+-++|. +| =....+++ ..
T Consensus 364 ~iaTG~eD-------------------------------------------------gKVKvWn~~SgfC~vTFteHts~ 394 (893)
T KOG0291|consen 364 LIATGAED-------------------------------------------------GKVKVWNTQSGFCFVTFTEHTSG 394 (893)
T ss_pred EEEeccCC-------------------------------------------------CcEEEEeccCceEEEEeccCCCc
Confidence 99865321 45677787 66 45566677 78
Q ss_pred eeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC-CCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCc
Q 004574 177 YTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD-GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPS 255 (744)
Q Consensus 177 ~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~-g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~ 255 (744)
+..+.|+.+|+.|+-.+.+. .+..||+. ..+-+..+...... ...++..|.|.
T Consensus 395 Vt~v~f~~~g~~llssSLDG-------------tVRAwDlkRYrNfRTft~P~p~Q------------fscvavD~sGe- 448 (893)
T KOG0291|consen 395 VTAVQFTARGNVLLSSSLDG-------------TVRAWDLKRYRNFRTFTSPEPIQ------------FSCVAVDPSGE- 448 (893)
T ss_pred eEEEEEEecCCEEEEeecCC-------------eEEeeeecccceeeeecCCCcee------------eeEEEEcCCCC-
Confidence 88999999999887665553 68888876 44455544332111 33456666676
Q ss_pred eEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceE-eeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCC
Q 004574 256 TLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEI-LHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 256 ~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
|+++...+. =.|+++++ ++|+... |..+.+.+..++|+|+|..|+..+|+.+ +.++|+-+.
T Consensus 449 -lV~AG~~d~----------F~IfvWS~---qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWDkT----VRiW~if~s 510 (893)
T KOG0291|consen 449 -LVCAGAQDS----------FEIFVWSV---QTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWDKT----VRIWDIFSS 510 (893)
T ss_pred -EEEeeccce----------EEEEEEEe---ecCeeeehhcCCCCcceeeEEccccCeEEeccccce----EEEEEeecc
Confidence 676643322 13899998 6666554 4455888999999999999998887652 444554442
Q ss_pred CCcc--eeeeccccccccCCCCCCceeeCCCCCeEEEEee
Q 004574 335 DVAP--RVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIK 372 (744)
Q Consensus 335 ~~~~--~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~ 372 (744)
.++. ..+.... ..++++|||+.|+....
T Consensus 511 ~~~vEtl~i~sdv----------l~vsfrPdG~elaVaTl 540 (893)
T KOG0291|consen 511 SGTVETLEIRSDV----------LAVSFRPDGKELAVATL 540 (893)
T ss_pred CceeeeEeeccce----------eEEEEcCCCCeEEEEEe
Confidence 1222 2222211 12679999999999873
No 134
>COG2936 Predicted acyl esterases [General function prediction only]
Probab=99.27 E-value=1.6e-10 Score=121.58 Aligned_cols=232 Identities=22% Similarity=0.296 Sum_probs=146.3
Q ss_pred CceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEE
Q 004574 480 LQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVL 559 (744)
Q Consensus 480 ~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~ 559 (744)
+....+.+.-.||.+|.+.+|+|++ .+++|+++..+-.++.-.+ + ..+ .... .......++.+||+|+
T Consensus 17 ~~~~~v~V~MRDGvrL~~dIy~Pa~-----~g~~Pvll~~~~~Py~k~~--~-~~~--~~~~--~~p~~~~~aa~GYavV 84 (563)
T COG2936 17 YIERDVMVPMRDGVRLAADIYRPAG-----AGPLPVLLSRTRLPYRKRN--G-TFG--PQLS--ALPQPAWFAAQGYAVV 84 (563)
T ss_pred eeeeeeeEEecCCeEEEEEEEccCC-----CCCCceeEEeecccccccc--c-cCc--chhh--cccccceeecCceEEE
Confidence 5666777888899999999999987 4679999998822221110 0 000 0000 0011126789999999
Q ss_pred ecCCCCCCCCCCC---------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCC
Q 004574 560 AGPSIPIIGEGDK---------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYN 630 (744)
Q Consensus 560 ~~~~~~~~g~g~~---------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~ 630 (744)
. .+.+|.+.+ .-.+|.-+.|+||.+|++-+ .+|+++|.|++|+..+++|+..|..+|+++...+..|
T Consensus 85 ~---qDvRG~~~SeG~~~~~~~~E~~Dg~D~I~Wia~QpWsN-G~Vgm~G~SY~g~tq~~~Aa~~pPaLkai~p~~~~~D 160 (563)
T COG2936 85 N---QDVRGRGGSEGVFDPESSREAEDGYDTIEWLAKQPWSN-GNVGMLGLSYLGFTQLAAAALQPPALKAIAPTEGLVD 160 (563)
T ss_pred E---ecccccccCCcccceeccccccchhHHHHHHHhCCccC-CeeeeecccHHHHHHHHHHhcCCchheeecccccccc
Confidence 8 333443322 12348999999999999887 7999999999999999999999999999999988765
Q ss_pred CCCCC--------------Cc------------cc----------c------------cccchhhc---------HHHHH
Q 004574 631 KTLTP--------------FG------------FQ----------T------------EFRTLWEA---------TNVYI 653 (744)
Q Consensus 631 ~~~~~--------------~~------------~~----------~------------~~~~~~~~---------~~~~~ 653 (744)
..... +. .. . ....+|.. ...+.
T Consensus 161 ~y~d~~~~~G~~~~~~~~~W~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~hp~~ddfW~ 240 (563)
T COG2936 161 RYRDDAFYGGGAELNFNLGWALTMLAPQPLTRIRPARLDRLAPLRVGAERWRDAPTELLEGEPYFLELWLEHPLRDDFWR 240 (563)
T ss_pred ccccccccCcchhhhhhHHHHhhhcccCcccccccccccccchhhhhhccccccccchhccCcccchhhhcCCCccchhh
Confidence 21100 00 00 0 00011110 01122
Q ss_pred hcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcccc-----HHHHHHHHHHH
Q 004574 654 EMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAREN-----VMHVIWETDRW 728 (744)
Q Consensus 654 ~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~-----~~~~~~~~~~f 728 (744)
+.+......++++|+|++.|-.|.. ...+.+++..++.. +..+++-|-. |.-..... .....+...+|
T Consensus 241 ~~~~~~d~~~i~vP~L~i~gW~D~~----l~~~~~~~~~~~~r--~~~lvvgPw~-H~~~~~~~~~~~y~~~al~~~~~~ 313 (563)
T COG2936 241 RGDRVADLSKIKVPALVIGGWSDGY----LHTAIKLFAFLRSR--PVKLVVGPWT-HGGPEWEGPGKDYGATALSWQDDF 313 (563)
T ss_pred ccCcccccccCCCcEEEEccccccc----ccchHHHhhhcccC--CceeEEcccc-cCCCcccccccchhhhhhhhhHhh
Confidence 3345556778999999999999975 45677788777764 4567776653 75432222 23334444555
Q ss_pred HHHhcc
Q 004574 729 LQKYCL 734 (744)
Q Consensus 729 l~~~l~ 734 (744)
|..++.
T Consensus 314 l~~~~~ 319 (563)
T COG2936 314 LDAYLD 319 (563)
T ss_pred hhHhhh
Confidence 554443
No 135
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.26 E-value=8.4e-10 Score=102.57 Aligned_cols=256 Identities=16% Similarity=0.083 Sum_probs=154.1
Q ss_pred ceeEeecCCC-CCCC-CceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCc
Q 004574 6 GIGIHRLLPD-DSLG-PEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDI 83 (744)
Q Consensus 6 ~~~~~~~~~~-~~~g-~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~ 83 (744)
.|.+++|..| ...| ..++++++.+ .++....|+||++..-.+. + .-|.+.|+++|+.++..-+...
T Consensus 39 ~ii~W~L~~dd~~~G~~~r~~~GHsH--~v~dv~~s~dg~~alS~sw--------D--~~lrlWDl~~g~~t~~f~GH~~ 106 (315)
T KOG0279|consen 39 TIIVWKLTSDDIKYGVPVRRLTGHSH--FVSDVVLSSDGNFALSASW--------D--GTLRLWDLATGESTRRFVGHTK 106 (315)
T ss_pred EEEEEEeccCccccCceeeeeeccce--EecceEEccCCceEEeccc--------c--ceEEEEEecCCcEEEEEEecCC
Confidence 3566666443 2444 4567775443 5899999999987766543 3 4566669999887776544432
Q ss_pred cccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEc
Q 004574 84 CLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGS 163 (744)
Q Consensus 84 ~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 163 (744)
.+..+++|||.++|+-.+.+ ..|-+++
T Consensus 107 ----dVlsva~s~dn~qivSGSrD-------------------------------------------------kTiklwn 133 (315)
T KOG0279|consen 107 ----DVLSVAFSTDNRQIVSGSRD-------------------------------------------------KTIKLWN 133 (315)
T ss_pred ----ceEEEEecCCCceeecCCCc-------------------------------------------------ceeeeee
Confidence 47788999999999865321 3455556
Q ss_pred CCC--CeeecCC--CceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCC
Q 004574 164 LDG--TAKDFGT--PAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNS 239 (744)
Q Consensus 164 ~~G--~~~~l~~--~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~ 239 (744)
.-| +.+.... .+.+.-+.|||.....++.+... ...+-+||+++-+.++-...
T Consensus 134 t~g~ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~-----------DktvKvWnl~~~~l~~~~~g------------ 190 (315)
T KOG0279|consen 134 TLGVCKYTIHEDSHREWVSCVRFSPNESNPIIVSASW-----------DKTVKVWNLRNCQLRTTFIG------------ 190 (315)
T ss_pred ecccEEEEEecCCCcCcEEEEEEcCCCCCcEEEEccC-----------CceEEEEccCCcchhhcccc------------
Confidence 633 2222222 26788899999974333333222 24788999998765553322
Q ss_pred ccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeee
Q 004574 240 VREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWY 319 (744)
Q Consensus 240 ~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~ 319 (744)
....+..+++||||.. ++. ++ ..+.++++++ ..++--.-.+....+++++|||..-+|+... .
T Consensus 191 h~~~v~t~~vSpDGsl-cas-----Gg-------kdg~~~LwdL---~~~k~lysl~a~~~v~sl~fspnrywL~~at-~ 253 (315)
T KOG0279|consen 191 HSGYVNTVTVSPDGSL-CAS-----GG-------KDGEAMLWDL---NEGKNLYSLEAFDIVNSLCFSPNRYWLCAAT-A 253 (315)
T ss_pred ccccEEEEEECCCCCE-Eec-----CC-------CCceEEEEEc---cCCceeEeccCCCeEeeEEecCCceeEeecc-C
Confidence 1123567899999985 332 11 2235888888 4444333334466789999999998887665 1
Q ss_pred eccceeEEEEcCCCCCCcceeee-ccccc-cccCCCCCCceeeCCCCCeEEEEe
Q 004574 320 KTSQTRTWLVCPGSKDVAPRVLF-DRVFE-NVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 320 ~~~~~~l~~~~~~~~~~~~~~l~-~~~~~-~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
. .|.++|++++. ..-.+. +.... .....|--..++||+||+.|+...
T Consensus 254 ~----sIkIwdl~~~~-~v~~l~~d~~g~s~~~~~~~clslaws~dG~tLf~g~ 302 (315)
T KOG0279|consen 254 T----SIKIWDLESKA-VVEELKLDGIGPSSKAGDPICLSLAWSADGQTLFAGY 302 (315)
T ss_pred C----ceEEEeccchh-hhhhccccccccccccCCcEEEEEEEcCCCcEEEeee
Confidence 1 27777877741 111111 11111 001112223477999999987765
No 136
>COG3509 LpqC Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=99.26 E-value=3.1e-10 Score=107.84 Aligned_cols=217 Identities=19% Similarity=0.186 Sum_probs=131.1
Q ss_pred CCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHH-hCCeEEEecCCCCCCC
Q 004574 490 KDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFL-ARRFAVLAGPSIPIIG 568 (744)
Q Consensus 490 ~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~G~~v~~~~~~~~~g 568 (744)
.++...++.+|.|++... +.|+||++||++.+ ..++.+.+ . ...++ ..||.|+.++.++..-
T Consensus 42 ~~g~~r~y~l~vP~g~~~----~apLvv~LHG~~~s---gag~~~~s--g--------~d~lAd~~gFlV~yPdg~~~~w 104 (312)
T COG3509 42 VNGLKRSYRLYVPPGLPS----GAPLVVVLHGSGGS---GAGQLHGT--G--------WDALADREGFLVAYPDGYDRAW 104 (312)
T ss_pred cCCCccceEEEcCCCCCC----CCCEEEEEecCCCC---hHHhhccc--c--------hhhhhcccCcEEECcCcccccc
Confidence 467789999999998633 24999999997532 22222221 1 12334 5799999864433211
Q ss_pred ----C----CCC------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC
Q 004574 569 ----E----GDK------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT 634 (744)
Q Consensus 569 ----~----g~~------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~ 634 (744)
. +.+ .-...+.+.++.|..+..||+.||++.|.|.||.|+.++++.+|+.|.++..+++...
T Consensus 105 n~~~~~~~~~p~~~~~g~ddVgflr~lva~l~~~~gidp~RVyvtGlS~GG~Ma~~lac~~p~~faa~A~VAg~~~---- 180 (312)
T COG3509 105 NANGCGNWFGPADRRRGVDDVGFLRALVAKLVNEYGIDPARVYVTGLSNGGRMANRLACEYPDIFAAIAPVAGLLA---- 180 (312)
T ss_pred CCCcccccCCcccccCCccHHHHHHHHHHHHHHhcCcCcceEEEEeeCcHHHHHHHHHhcCcccccceeeeecccC----
Confidence 1 111 1122677888888888899999999999999999999999999999999988888641
Q ss_pred CCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHh-------------------
Q 004574 635 PFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKG------------------- 695 (744)
Q Consensus 635 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~------------------- 695 (744)
..-.+....+......--..+|..-...=..| |-+|..|..|+ .....+..+++..
T Consensus 181 -~~~a~~~~rp~~~m~~~G~~Dp~~p~~gG~~~--~g~g~~~~~v~--~~~~~~~Waa~ng~~~~p~~~~~~~~~~~~~~ 255 (312)
T COG3509 181 -LGVACTPPRPVSVMAFHGTADPLNPYHGGGVP--IGRGQRDGVVS--AADLAARWAAVNGCQAGPDTAELPDVGDGTDY 255 (312)
T ss_pred -CCcccCCCCchhHHHhcCCCCCCCCCCCCCcc--ccccccccccc--HHHHHHHHHHhcCCCCCCcccccCCCccccee
Confidence 00001111111100000011221111111233 67777777765 5555555555422
Q ss_pred ----CCCcEEEEEeCCCCcccCccc-----------cHHHHHHHHHHHHHHh
Q 004574 696 ----HGALSRLVLLPFEHHVYAARE-----------NVMHVIWETDRWLQKY 732 (744)
Q Consensus 696 ----~~~~~~~~~~~~~~H~~~~~~-----------~~~~~~~~~~~fl~~~ 732 (744)
.+..+++..+.+.||.+..-. ...+..+.+-+||..+
T Consensus 256 ~~~~~~~~V~~y~i~g~GH~wp~~~~~~~~~~g~~t~~~dat~~iw~Ff~~~ 307 (312)
T COG3509 256 DTCDGNARVELYTIDGGGHTWPGGTQYGPAALGMSTRGFDATERIWRFFRQH 307 (312)
T ss_pred eccCCCcceEEEEEeCCcccCcCCCCCCcccccccccCcchHHHHHHHHHhc
Confidence 124579999999999875211 1123566788888765
No 137
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=99.26 E-value=3.4e-09 Score=114.28 Aligned_cols=322 Identities=12% Similarity=0.060 Sum_probs=177.4
Q ss_pred CCCCeEEEeeecccccccCCCceeEEEEECC--CCce-eccccCCCc--cc-cccccceEEecCCcEEEEEecCCCCCCC
Q 004574 40 PDGKRIAFSVRVDEEDNVSSCKLRVWIADAE--TGEA-KPLFESPDI--CL-NAVFGSFVWVNNSTLLIFTIPSSRRDPP 113 (744)
Q Consensus 40 pDG~~laf~~~~~~~~~~~~~~~~l~~~~~~--gg~~-~~lt~~~~~--~~-~~~~~~~~wspDg~~l~~~~~~~~~~~~ 113 (744)
..|.+..|.... ......-+|+.... +|+. +.|.+.... .. ......+.+||||++|+|+....+.+.
T Consensus 76 ~~g~~~y~~~~~-----~~~~~~~~~r~~~~~~~~~~~evllD~n~l~~~~~~~~~~~~~~Spdg~~la~~~s~~G~e~- 149 (414)
T PF02897_consen 76 RRGGYYYYSRNQ-----GGKNYPVLYRRKTDEEDGPEEEVLLDPNELAKDGGYVSLGGFSVSPDGKRLAYSLSDGGSEW- 149 (414)
T ss_dssp EETTEEEEEEE------SS-SS-EEEEEETTS-TS-C-EEEEEGGGGSTTSS-EEEEEEEETTTSSEEEEEEEETTSSE-
T ss_pred EECCeEEEEEEc-----CCCceEEEEEEecccCCCCceEEEEcchHhhccCceEEeeeeeECCCCCEEEEEecCCCCce-
Confidence 366677776542 11334557777766 3443 333322110 00 112236789999999999866543331
Q ss_pred CCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecC-CCceeeeeccCCCCceEEE
Q 004574 114 KKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFG-TPAVYTAVEPSPDQKYVLI 191 (744)
Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~-~~~~~~~~~~SpDG~~i~~ 191 (744)
..|+++|+ +|+...-. .......+.|++||+.++|
T Consensus 150 -------------------------------------------~~l~v~Dl~tg~~l~d~i~~~~~~~~~W~~d~~~~~y 186 (414)
T PF02897_consen 150 -------------------------------------------YTLRVFDLETGKFLPDGIENPKFSSVSWSDDGKGFFY 186 (414)
T ss_dssp -------------------------------------------EEEEEEETTTTEEEEEEEEEEESEEEEECTTSSEEEE
T ss_pred -------------------------------------------EEEEEEECCCCcCcCCcccccccceEEEeCCCCEEEE
Confidence 68999999 77332211 1111223999999999999
Q ss_pred EEeeCCcccccccCCCcceEEEEeCCCCeee--eccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCC
Q 004574 192 TSMHRPYSYKVPCARFSQKVQVWTTDGKLVR--ELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDAN 269 (744)
Q Consensus 192 ~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~ 269 (744)
+......... ...++.++|++.+.+.... .|...+..... .-.+..|+||+. |+..+.....
T Consensus 187 ~~~~~~~~~~--~~~~~~~v~~~~~gt~~~~d~lvfe~~~~~~~----------~~~~~~s~d~~~-l~i~~~~~~~--- 250 (414)
T PF02897_consen 187 TRFDEDQRTS--DSGYPRQVYRHKLGTPQSEDELVFEEPDEPFW----------FVSVSRSKDGRY-LFISSSSGTS--- 250 (414)
T ss_dssp EECSTTTSS---CCGCCEEEEEEETTS-GGG-EEEEC-TTCTTS----------EEEEEE-TTSSE-EEEEEESSSS---
T ss_pred EEeCcccccc--cCCCCcEEEEEECCCChHhCeeEEeecCCCcE----------EEEEEecCcccE-EEEEEEcccc---
Confidence 9876531100 1223578999999876544 44433322110 115678999997 4444322221
Q ss_pred ccCCccceEEeccCCCC--CCCCceEeeeec-cceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCC-cce-eeecc
Q 004574 270 VEVSPRDIIYTQPAEPA--EGEKPEILHKLD-LRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDV-APR-VLFDR 344 (744)
Q Consensus 270 ~~~~~~~~l~~~~~~~~--~~~~~~~l~~~~-~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~-~~~-~l~~~ 344 (744)
...+|++++... .....+.+.... .....+... |..+++.++.......|+.++++.... ... .+...
T Consensus 251 -----~s~v~~~d~~~~~~~~~~~~~l~~~~~~~~~~v~~~--~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~ 323 (414)
T PF02897_consen 251 -----ESEVYLLDLDDGGSPDAKPKLLSPREDGVEYYVDHH--GDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPE 323 (414)
T ss_dssp -----EEEEEEEECCCTTTSS-SEEEEEESSSS-EEEEEEE--TTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--
T ss_pred -----CCeEEEEeccccCCCcCCcEEEeCCCCceEEEEEcc--CCEEEEeeCCCCCCcEEEEecccccccccceeEEcCC
Confidence 135899998310 012555565432 233333333 677777776655778999999987521 112 23222
Q ss_pred ccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecC-CCceeEEeeccchhhhhhee
Q 004574 345 VFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDIN-TGSKERIWESNREKYFETAV 423 (744)
Q Consensus 345 ~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~-~g~~~~l~~~~~~~~~~~~~ 423 (744)
... ...-.++..+.+|+..... .....|.++++. +.....+...... .+.
T Consensus 324 ~~~-------~~l~~~~~~~~~Lvl~~~~------------------~~~~~l~v~~~~~~~~~~~~~~p~~g----~v~ 374 (414)
T PF02897_consen 324 DED-------VSLEDVSLFKDYLVLSYRE------------------NGSSRLRVYDLDDGKESREIPLPEAG----SVS 374 (414)
T ss_dssp SSS-------EEEEEEEEETTEEEEEEEE------------------TTEEEEEEEETT-TEEEEEEESSSSS----EEE
T ss_pred CCc-------eeEEEEEEECCEEEEEEEE------------------CCccEEEEEECCCCcEEeeecCCcce----EEe
Confidence 111 0112245566666665533 234569999998 4544444332211 111
Q ss_pred eeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeec
Q 004574 424 ALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITN 469 (744)
Q Consensus 424 ~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~ 469 (744)
. .+..++++.+.|..++...|+.+|.+|+.+++.+.++.
T Consensus 375 ~-------~~~~~~~~~~~~~~ss~~~P~~~y~~d~~t~~~~~~k~ 413 (414)
T PF02897_consen 375 G-------VSGDFDSDELRFSYSSFTTPPTVYRYDLATGELTLLKQ 413 (414)
T ss_dssp E-------EES-TT-SEEEEEEEETTEEEEEEEEETTTTCEEEEEE
T ss_pred c-------cCCCCCCCEEEEEEeCCCCCCEEEEEECCCCCEEEEEe
Confidence 1 24567889999999999999999999999999988864
No 138
>KOG2984 consensus Predicted hydrolase [General function prediction only]
Probab=99.24 E-value=1.3e-11 Score=108.93 Aligned_cols=163 Identities=17% Similarity=0.099 Sum_probs=107.9
Q ss_pred eEEEecCCCCCCCCCCC----------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEc
Q 004574 556 FAVLAGPSIPIIGEGDK----------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIAR 625 (744)
Q Consensus 556 ~~v~~~~~~~~~g~g~~----------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~ 625 (744)
+.+++ .+.+|+|.+ ....|.+.+++-+.... .+++.++|+|-||..|+.+|+++++.+..+|.+
T Consensus 72 ~Tiva---wDPpGYG~SrPP~Rkf~~~ff~~Da~~avdLM~aLk---~~~fsvlGWSdGgiTalivAak~~e~v~rmiiw 145 (277)
T KOG2984|consen 72 VTIVA---WDPPGYGTSRPPERKFEVQFFMKDAEYAVDLMEALK---LEPFSVLGWSDGGITALIVAAKGKEKVNRMIIW 145 (277)
T ss_pred eEEEE---ECCCCCCCCCCCcccchHHHHHHhHHHHHHHHHHhC---CCCeeEeeecCCCeEEEEeeccChhhhhhheee
Confidence 66666 444555544 23448888888888753 478999999999999999999999988888877
Q ss_pred cCCCCCCC-CCC---------cccccccchhhcH-------HHHHhc-------------C-cccccCCCCCCEEEEeeC
Q 004574 626 SGSYNKTL-TPF---------GFQTEFRTLWEAT-------NVYIEM-------------S-PITHANKIKKPILIIHGE 674 (744)
Q Consensus 626 ~~~~~~~~-~~~---------~~~~~~~~~~~~~-------~~~~~~-------------~-~~~~~~~~~~P~l~i~G~ 674 (744)
.+..-... ... .+....+.|++.. ..+.++ + ....+++++||+||+||+
T Consensus 146 ga~ayvn~~~~ma~kgiRdv~kWs~r~R~P~e~~Yg~e~f~~~wa~wvD~v~qf~~~~dG~fCr~~lp~vkcPtli~hG~ 225 (277)
T KOG2984|consen 146 GAAAYVNHLGAMAFKGIRDVNKWSARGRQPYEDHYGPETFRTQWAAWVDVVDQFHSFCDGRFCRLVLPQVKCPTLIMHGG 225 (277)
T ss_pred cccceecchhHHHHhchHHHhhhhhhhcchHHHhcCHHHHHHHHHHHHHHHHHHhhcCCCchHhhhcccccCCeeEeeCC
Confidence 65421000 000 0011112222211 111111 0 123468899999999999
Q ss_pred CCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 675 VDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 675 ~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
.|+.++ ..++--+-..+ .-.++.++|.++|.++ ......++..+.+||++
T Consensus 226 kDp~~~--~~hv~fi~~~~----~~a~~~~~peGkHn~h-Lrya~eFnklv~dFl~~ 275 (277)
T KOG2984|consen 226 KDPFCG--DPHVCFIPVLK----SLAKVEIHPEGKHNFH-LRYAKEFNKLVLDFLKS 275 (277)
T ss_pred cCCCCC--CCCccchhhhc----ccceEEEccCCCccee-eechHHHHHHHHHHHhc
Confidence 999988 56554443333 3348999999999998 55577899999999975
No 139
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=99.23 E-value=3.3e-10 Score=106.84 Aligned_cols=124 Identities=18% Similarity=0.190 Sum_probs=83.8
Q ss_pred eeeeeEEEEEcC-CCCeeecC--CCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCC
Q 004574 153 YYTTAQLVLGSL-DGTAKDFG--TPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPP 229 (744)
Q Consensus 153 ~~~~~~l~~~~~-~G~~~~l~--~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~ 229 (744)
+++...||.++. +.....+. ..+.+..++|||+|+++++..... +..+.+|++++..+..+...+
T Consensus 35 ~~~~~~l~~~~~~~~~~~~i~l~~~~~I~~~~WsP~g~~favi~g~~-----------~~~v~lyd~~~~~i~~~~~~~- 102 (194)
T PF08662_consen 35 YYGEFELFYLNEKNIPVESIELKKEGPIHDVAWSPNGNEFAVIYGSM-----------PAKVTLYDVKGKKIFSFGTQP- 102 (194)
T ss_pred EEeeEEEEEEecCCCccceeeccCCCceEEEEECcCCCEEEEEEccC-----------CcccEEEcCcccEeEeecCCC-
Confidence 345578999988 44555443 336799999999999998875432 247888998866665554322
Q ss_pred CCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeee-eccceeceeecc
Q 004574 230 AEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHK-LDLRFRSVSWCD 308 (744)
Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~-~~~~~~~~~~Sp 308 (744)
...+.|||+|+. |+.....+. .+.|.++|. . +...+.. .......++|||
T Consensus 103 --------------~n~i~wsP~G~~-l~~~g~~n~---------~G~l~~wd~---~--~~~~i~~~~~~~~t~~~WsP 153 (194)
T PF08662_consen 103 --------------RNTISWSPDGRF-LVLAGFGNL---------NGDLEFWDV---R--KKKKISTFEHSDATDVEWSP 153 (194)
T ss_pred --------------ceEEEECCCCCE-EEEEEccCC---------CcEEEEEEC---C--CCEEeeccccCcEEEEEEcC
Confidence 336899999997 666643222 234888887 3 2233322 233567899999
Q ss_pred CCceEEEee
Q 004574 309 DSLALVNET 317 (744)
Q Consensus 309 Dg~~l~~~~ 317 (744)
||++|+.++
T Consensus 154 dGr~~~ta~ 162 (194)
T PF08662_consen 154 DGRYLATAT 162 (194)
T ss_pred CCCEEEEEE
Confidence 999999876
No 140
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.23 E-value=1.9e-08 Score=101.55 Aligned_cols=136 Identities=24% Similarity=0.264 Sum_probs=77.0
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
-+...++||||+++|-+. ....++++|-.+|+..-.... ...+.+.+-.++||||+++++-.+.+.
T Consensus 192 FV~~VRysPDG~~Fat~g----------sDgki~iyDGktge~vg~l~~-~~aHkGsIfalsWsPDs~~~~T~SaDk--- 257 (603)
T KOG0318|consen 192 FVNCVRYSPDGSRFATAG----------SDGKIYIYDGKTGEKVGELED-SDAHKGSIFALSWSPDSTQFLTVSADK--- 257 (603)
T ss_pred ceeeEEECCCCCeEEEec----------CCccEEEEcCCCccEEEEecC-CCCccccEEEEEECCCCceEEEecCCc---
Confidence 467899999999999874 346789999877775443322 223445667899999999998775432
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeec----------CCC--ceeee
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDF----------GTP--AVYTA 179 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l----------~~~--~~~~~ 179 (744)
...++........++-.-+.. ..|+ .=+..|. ..+|..+.++|...-+ ..+ ..+..
T Consensus 258 t~KIWdVs~~slv~t~~~~~~-----v~dq----qvG~lWq---kd~lItVSl~G~in~ln~~d~~~~~~i~GHnK~ITa 325 (603)
T KOG0318|consen 258 TIKIWDVSTNSLVSTWPMGST-----VEDQ----QVGCLWQ---KDHLITVSLSGTINYLNPSDPSVLKVISGHNKSITA 325 (603)
T ss_pred eEEEEEeeccceEEEeecCCc-----hhce----EEEEEEe---CCeEEEEEcCcEEEEecccCCChhheecccccceeE
Confidence 222333333322222110000 0000 0011111 2456666666633322 222 56789
Q ss_pred eccCCCCceEEEEE
Q 004574 180 VEPSPDQKYVLITS 193 (744)
Q Consensus 180 ~~~SpDG~~i~~~~ 193 (744)
++.+|||++|+-.+
T Consensus 326 Ltv~~d~~~i~Sgs 339 (603)
T KOG0318|consen 326 LTVSPDGKTIYSGS 339 (603)
T ss_pred EEEcCCCCEEEeec
Confidence 99999998886543
No 141
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.22 E-value=9.2e-10 Score=107.93 Aligned_cols=185 Identities=19% Similarity=0.174 Sum_probs=117.8
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecC--CcEEEEEecCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNN--STLLIFTIPSSR 109 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspD--g~~l~~~~~~~~ 109 (744)
.+....+|+||+.||-.+. +|...||-++ .....+...+... .++.+.|+|. +..|+-.+.+
T Consensus 177 Pis~~~fS~ds~~laT~sw--------sG~~kvW~~~--~~~~~~~l~gH~~----~v~~~~fhP~~~~~~lat~s~D-- 240 (459)
T KOG0272|consen 177 PISGCSFSRDSKHLATGSW--------SGLVKVWSVP--QCNLLQTLRGHTS----RVGAAVFHPVDSDLNLATASAD-- 240 (459)
T ss_pred cceeeEeecCCCeEEEeec--------CCceeEeecC--CcceeEEEecccc----ceeeEEEccCCCccceeeeccC--
Confidence 4667899999999999776 7778888665 4454444333221 4678899997 4456654322
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCC-ceeeeeccCCCCce
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTP-AVYTAVEPSPDQKY 188 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~-~~~~~~~~SpDG~~ 188 (744)
+...||.++-+-...+|+.+ ..+...+|.|+|++
T Consensus 241 ---------------------------------------------gtvklw~~~~e~~l~~l~gH~~RVs~VafHPsG~~ 275 (459)
T KOG0272|consen 241 ---------------------------------------------GTVKLWKLSQETPLQDLEGHLARVSRVAFHPSGKF 275 (459)
T ss_pred ---------------------------------------------CceeeeccCCCcchhhhhcchhhheeeeecCCCce
Confidence 11345554444355666656 77889999999999
Q ss_pred EEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCC
Q 004574 189 VLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDA 268 (744)
Q Consensus 189 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~ 268 (744)
|.-.+.+ ..-.+||+.++..-.+-.+. ..++.+++|.|||. |+.+... +
T Consensus 276 L~TasfD-------------~tWRlWD~~tk~ElL~QEGH------------s~~v~~iaf~~DGS--L~~tGGl---D- 324 (459)
T KOG0272|consen 276 LGTASFD-------------STWRLWDLETKSELLLQEGH------------SKGVFSIAFQPDGS--LAATGGL---D- 324 (459)
T ss_pred eeecccc-------------cchhhcccccchhhHhhccc------------ccccceeEecCCCc--eeeccCc---c-
Confidence 9866544 24566888877544333332 23488999999998 5554211 1
Q ss_pred CccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEee
Q 004574 269 NVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 269 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
.-++|| |++ ++..+-.|-.....+.+++|||+|..|+..+
T Consensus 325 -----~~~RvW--DlR--tgr~im~L~gH~k~I~~V~fsPNGy~lATgs 364 (459)
T KOG0272|consen 325 -----SLGRVW--DLR--TGRCIMFLAGHIKEILSVAFSPNGYHLATGS 364 (459)
T ss_pred -----chhhee--ecc--cCcEEEEecccccceeeEeECCCceEEeecC
Confidence 222344 442 2333223333455788899999999999876
No 142
>PF00756 Esterase: Putative esterase; InterPro: IPR000801 This family contains several seemingly unrelated proteins, including human esterase D; mycobacterial antigen 85, which is responsible for the high affinity of mycobacteria to fibronectin; Corynebacterium glutamicum major secreted protein PS1; and hypothetical proteins from Escherichia coli, yeast, mycobacteria and Haemophilus influenzae.; PDB: 3LS2_A 1VA5_B 1DQZ_B 3HRH_A 1DQY_A 2GZR_A 2GZS_A 3GFF_A 1R88_A 3E4D_D ....
Probab=99.22 E-value=1.3e-11 Score=123.43 Aligned_cols=210 Identities=23% Similarity=0.311 Sum_probs=120.7
Q ss_pred CeEEEEEEEeCCCCCCCCCCCceEEEEECCC-CCcccccCCcccCCCCccCCCCchhHHHHHhCC----eEEEecCCCCC
Q 004574 492 GVPLTATLYLPPGYDQSKDGPLPCLFWAYPE-DYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARR----FAVLAGPSIPI 566 (744)
Q Consensus 492 g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G----~~v~~~~~~~~ 566 (744)
|......+|+|+++.+ .+++|||+++||. .+.. .......+..++..| .+++..+..+.
T Consensus 5 g~~~~~~VylP~~y~~--~~~~PvlylldG~~~~~~--------------~~~~~~~~~~~~~~~~~~~~iiV~i~~~~~ 68 (251)
T PF00756_consen 5 GRDRRVWVYLPPGYDP--SKPYPVLYLLDGQSGWFR--------------NGNAQEALDRLIAEGKIPPMIIVVIPNGDN 68 (251)
T ss_dssp TEEEEEEEEECTTGGT--TTTEEEEEEESHTTHHHH--------------HHHHHHHHHHHHHHHTSEEEEEEEEESSST
T ss_pred CCeEEEEEEECCCCCC--CCCCEEEEEccCCccccc--------------cchHHHHHHHHHHhCCCCceEEEEEecccc
Confidence 5678899999999833 4579999999993 2110 000011222334443 23333111111
Q ss_pred C----CC-------------CCCChHH--HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccC
Q 004574 567 I----GE-------------GDKLPND--SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSG 627 (744)
Q Consensus 567 ~----g~-------------g~~~~~~--d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~ 627 (744)
. .+ +....+. -..+++.++.++..+++++.+|+|+||||+.|+.++.++|+.|.++++++|
T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~el~p~i~~~~~~~~~~~~i~G~S~GG~~Al~~~l~~Pd~F~~~~~~S~ 148 (251)
T PF00756_consen 69 SRFYTSWYLPAGSSRRADDSGGGDAYETFLTEELIPYIEANYRTDPDRRAIAGHSMGGYGALYLALRHPDLFGAVIAFSG 148 (251)
T ss_dssp SSTTSBTTSSBCTTCBCTSTTTHHHHHHHHHTHHHHHHHHHSSEEECCEEEEEETHHHHHHHHHHHHSTTTESEEEEESE
T ss_pred cccccccccccccccccccCCCCcccceehhccchhHHHHhcccccceeEEeccCCCcHHHHHHHHhCccccccccccCc
Confidence 1 00 1111111 345777888888777766799999999999999999999999999999999
Q ss_pred CCCCCCCCCcccccccchhhcHHHHHhcCcccc-----cCCCCCCEEEEeeCCCCCCCC--------CHHHHHHHHHHHH
Q 004574 628 SYNKTLTPFGFQTEFRTLWEATNVYIEMSPITH-----ANKIKKPILIIHGEVDDKVGL--------FPMQAERFFDALK 694 (744)
Q Consensus 628 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~P~l~i~G~~D~~v~~--------~~~~~~~~~~~l~ 694 (744)
.++.....+.. ... ..+...++... ...-..++++.+|+.|....- ......++.+.|+
T Consensus 149 ~~~~~~~~w~~--~~~------~~~~~~~~~~~~~~~~~~~~~~~i~l~~G~~d~~~~~~~~~~~~~~~~~~~~~~~~l~ 220 (251)
T PF00756_consen 149 ALDPSPSLWGP--SDD------EAWKENDPFDLIKALSQKKKPLRIYLDVGTKDEFGGWEDSAQILQFLANNRELAQLLK 220 (251)
T ss_dssp ESETTHCHHHH--STC------GHHGGCHHHHHHHHHHHTTSEEEEEEEEETTSTTHHCSHHHHHHHHHHHHHHHHHHCC
T ss_pred cccccccccCc--CCc------HHhhhccHHHHhhhhhcccCCCeEEEEeCCCCcccccccCHHHHHHHHHhHhhHHHHH
Confidence 86543111110 001 11111111111 123356799999999983210 0123334444455
Q ss_pred hCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHH
Q 004574 695 GHGALSRLVLLPFEHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 695 ~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl 729 (744)
..+.+..+.++++ +|.+. .+...+...+.||
T Consensus 221 ~~g~~~~~~~~~G-~H~~~---~W~~~l~~~L~~~ 251 (251)
T PF00756_consen 221 AKGIPHTYHVFPG-GHDWA---YWRRRLPDALPWM 251 (251)
T ss_dssp CEECTTESEEEHS-ESSHH---HHHHHHHHHHHHH
T ss_pred HcCCCceEEEecC-ccchh---hHHHHHHHHHhhC
Confidence 5667788888884 57543 3555565666654
No 143
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=99.21 E-value=8.1e-10 Score=109.51 Aligned_cols=259 Identities=14% Similarity=0.105 Sum_probs=158.5
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCC-cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGA-KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDI 83 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~-~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~ 83 (744)
..+-++.|+| ..-.+|+.+--.. -+....|+|+|+..+|++ ....-+|.+|+.+.+..++-..-+.
T Consensus 235 ~~lrifqvDG----k~N~~lqS~~l~~fPi~~a~f~p~G~~~i~~s---------~rrky~ysyDle~ak~~k~~~~~g~ 301 (514)
T KOG2055|consen 235 GTLRIFQVDG----KVNPKLQSIHLEKFPIQKAEFAPNGHSVIFTS---------GRRKYLYSYDLETAKVTKLKPPYGV 301 (514)
T ss_pred CcEEEEEecC----ccChhheeeeeccCccceeeecCCCceEEEec---------ccceEEEEeeccccccccccCCCCc
Confidence 3566777766 4333554322211 366789999999676754 3346789999999988777543332
Q ss_pred cccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEc
Q 004574 84 CLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGS 163 (744)
Q Consensus 84 ~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 163 (744)
. ...+..|..|||++.|++... .+.|+++.
T Consensus 302 e-~~~~e~FeVShd~~fia~~G~-------------------------------------------------~G~I~lLh 331 (514)
T KOG2055|consen 302 E-EKSMERFEVSHDSNFIAIAGN-------------------------------------------------NGHIHLLH 331 (514)
T ss_pred c-cchhheeEecCCCCeEEEccc-------------------------------------------------CceEEeeh
Confidence 2 224668899999999988622 16888888
Q ss_pred C-CC-CeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCcc
Q 004574 164 L-DG-TAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVR 241 (744)
Q Consensus 164 ~-~G-~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~ 241 (744)
. ++ -+..+..++.+..++||.||+.|+..... .++|+||+............+..
T Consensus 332 akT~eli~s~KieG~v~~~~fsSdsk~l~~~~~~-------------GeV~v~nl~~~~~~~rf~D~G~v---------- 388 (514)
T KOG2055|consen 332 AKTKELITSFKIEGVVSDFTFSSDSKELLASGGT-------------GEVYVWNLRQNSCLHRFVDDGSV---------- 388 (514)
T ss_pred hhhhhhhheeeeccEEeeEEEecCCcEEEEEcCC-------------ceEEEEecCCcceEEEEeecCcc----------
Confidence 7 66 34445455889999999999988766433 38999999877554443332221
Q ss_pred CCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCC-CCCCCCceEeeee---ccceeceeeccCCceEEEee
Q 004574 242 EGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAE-PAEGEKPEILHKL---DLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 242 ~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~---~~~~~~~~~SpDg~~l~~~~ 317 (744)
.-..+..|++|++ ++.-+ ..+.+-+++.. -+.+..++.+... -..+.+++|+||++.|+..+
T Consensus 389 -~gts~~~S~ng~y-lA~GS------------~~GiVNIYd~~s~~~s~~PkPik~~dNLtt~Itsl~Fn~d~qiLAiaS 454 (514)
T KOG2055|consen 389 -HGTSLCISLNGSY-LATGS------------DSGIVNIYDGNSCFASTNPKPIKTVDNLTTAITSLQFNHDAQILAIAS 454 (514)
T ss_pred -ceeeeeecCCCce-EEecc------------CcceEEEeccchhhccCCCCchhhhhhhheeeeeeeeCcchhhhhhhh
Confidence 1225677899985 44332 22334444431 1233444444322 34678999999999999887
Q ss_pred eeeccceeEEEEcCCCCCCcceeeeccccccccCCCCC-CceeeCCCCCeEEEEe
Q 004574 318 WYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGS-PMMTRTSTGTNVIAKI 371 (744)
Q Consensus 318 ~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~spdg~~l~~~~ 371 (744)
+... ..+.++++... +......... ..-+. -.++|||.|-++++..
T Consensus 455 ~~~k--nalrLVHvPS~----TVFsNfP~~n--~~vg~vtc~aFSP~sG~lAvGN 501 (514)
T KOG2055|consen 455 RVKK--NALRLVHVPSC----TVFSNFPTSN--TKVGHVTCMAFSPNSGYLAVGN 501 (514)
T ss_pred hccc--cceEEEeccce----eeeccCCCCC--CcccceEEEEecCCCceEEeec
Confidence 4433 35666666552 1111111100 00011 1256999999999875
No 144
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.21 E-value=3.1e-09 Score=97.91 Aligned_cols=272 Identities=14% Similarity=0.194 Sum_probs=154.2
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.+...+|+-||.++|-.+. +....+|-++.. +.+...........+..+.|.|-..-++++... +
T Consensus 22 ~v~Sv~wn~~g~~lasgs~--------dktv~v~n~e~~----r~~~~~~~~gh~~svdql~w~~~~~d~~atas~---d 86 (313)
T KOG1407|consen 22 KVHSVAWNCDGTKLASGSF--------DKTVSVWNLERD----RFRKELVYRGHTDSVDQLCWDPKHPDLFATASG---D 86 (313)
T ss_pred cceEEEEcccCceeeeccc--------CCceEEEEecch----hhhhhhcccCCCcchhhheeCCCCCcceEEecC---C
Confidence 5788999999999998653 445666665533 333333333333356788999877766665321 0
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceE
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYV 189 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i 189 (744)
..|.++|. .|+...-++. ++.....|||||+++
T Consensus 87 ---------------------------------------------k~ir~wd~r~~k~~~~i~~~~eni~i~wsp~g~~~ 121 (313)
T KOG1407|consen 87 ---------------------------------------------KTIRIWDIRSGKCTARIETKGENINITWSPDGEYI 121 (313)
T ss_pred ---------------------------------------------ceEEEEEeccCcEEEEeeccCcceEEEEcCCCCEE
Confidence 35777787 7755544444 777789999999999
Q ss_pred EEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCC
Q 004574 190 LITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDAN 269 (744)
Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~ 269 (744)
++...++ .|-.+|....++..--..+.. ..++.|+-++. ++|+....|--
T Consensus 122 ~~~~kdD-------------~it~id~r~~~~~~~~~~~~e-------------~ne~~w~~~nd--~Fflt~GlG~v-- 171 (313)
T KOG1407|consen 122 AVGNKDD-------------RITFIDARTYKIVNEEQFKFE-------------VNEISWNNSND--LFFLTNGLGCV-- 171 (313)
T ss_pred EEecCcc-------------cEEEEEecccceeehhcccce-------------eeeeeecCCCC--EEEEecCCceE--
Confidence 9986654 344455443322221111111 44678886655 77774322211
Q ss_pred ccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccc
Q 004574 270 VEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENV 349 (744)
Q Consensus 270 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~ 349 (744)
.|.-++. + .....+..+....--+.|+|||++++..+.+. -+-++|++.- .=.+.++....
T Consensus 172 -------~ILsyps--L--kpv~si~AH~snCicI~f~p~GryfA~GsADA----lvSLWD~~EL-iC~R~isRldw--- 232 (313)
T KOG1407|consen 172 -------EILSYPS--L--KPVQSIKAHPSNCICIEFDPDGRYFATGSADA----LVSLWDVDEL-ICERCISRLDW--- 232 (313)
T ss_pred -------EEEeccc--c--ccccccccCCcceEEEEECCCCceEeeccccc----eeeccChhHh-hhheeeccccC---
Confidence 1333221 0 11122333344555688999999999877333 2334454441 01122222221
Q ss_pred cCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCC
Q 004574 350 YSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQ 429 (744)
Q Consensus 350 ~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~ 429 (744)
|. -.++||-||++|+..+++ ..|-+-+..||.. +|...-+ .+.++
T Consensus 233 ---pV-RTlSFS~dg~~lASaSED---------------------h~IDIA~vetGd~--~~eI~~~-----~~t~t--- 277 (313)
T KOG1407|consen 233 ---PV-RTLSFSHDGRMLASASED---------------------HFIDIAEVETGDR--VWEIPCE-----GPTFT--- 277 (313)
T ss_pred ---ce-EEEEeccCcceeeccCcc---------------------ceEEeEecccCCe--EEEeecc-----CCcee---
Confidence 11 116699999999887632 2344555556643 4443221 12222
Q ss_pred cceecccCCCEEEEEEecCC
Q 004574 430 GEEDINLNQLKILTSKESKT 449 (744)
Q Consensus 430 ~~~~~s~d~~~~~~~~~~~~ 449 (744)
.+|.|....|+|......
T Consensus 278 --VAWHPk~~LLAyA~ddk~ 295 (313)
T KOG1407|consen 278 --VAWHPKRPLLAYACDDKD 295 (313)
T ss_pred --EEecCCCceeeEEecCCC
Confidence 489999999998876544
No 145
>PRK13616 lipoprotein LpqB; Provisional
Probab=99.20 E-value=1.1e-09 Score=119.98 Aligned_cols=177 Identities=15% Similarity=0.199 Sum_probs=110.8
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.+..|.+||||+++||+..... .+.++..+||+.+. +++.+++|.+.. ...+.|||||+.|+|......-
T Consensus 351 ~vsspaiSpdG~~vA~v~~~~~--~~~d~~s~Lwv~~~-gg~~~~lt~g~~------~t~PsWspDG~~lw~v~dg~~~- 420 (591)
T PRK13616 351 NITSAALSRSGRQVAAVVTLGR--GAPDPASSLWVGPL-GGVAVQVLEGHS------LTRPSWSLDADAVWVVVDGNTV- 420 (591)
T ss_pred CcccceECCCCCEEEEEEeecC--CCCCcceEEEEEeC-CCcceeeecCCC------CCCceECCCCCceEEEecCcce-
Confidence 5689999999999999986321 22246689999996 566788886653 4678999999999998532110
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC-CeeecCCCceeeeeccCCCCceEE
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG-TAKDFGTPAVYTAVEPSPDQKYVL 190 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G-~~~~l~~~~~~~~~~~SpDG~~i~ 190 (744)
..... ....+++|++++++ +.++ ...+.+..+.|||||++|+
T Consensus 421 ------------~~v~~------------------------~~~~gql~~~~vd~ge~~~-~~~g~Issl~wSpDG~RiA 463 (591)
T PRK13616 421 ------------VRVIR------------------------DPATGQLARTPVDASAVAS-RVPGPISELQLSRDGVRAA 463 (591)
T ss_pred ------------EEEec------------------------cCCCceEEEEeccCchhhh-ccCCCcCeEEECCCCCEEE
Confidence 00000 00126899999955 6655 2236799999999999999
Q ss_pred EEEeeCCcccccccCCCcceEEE---EeCCCCeeeeccCCCCCCCCCcccCCccCC-CCccceecCCCceEEEEEeecCC
Q 004574 191 ITSMHRPYSYKVPCARFSQKVQV---WTTDGKLVRELCDLPPAEDIPVCYNSVREG-MRSISWRADKPSTLYWVEAQDRG 266 (744)
Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~l~~---~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~spDg~~~l~~~~~~~~~ 266 (744)
|.... +|++ ...+++. +.|+.... +...... ..++.|..|++ |+... . ++
T Consensus 464 ~i~~g--------------~v~Va~Vvr~~~G~-~~l~~~~~-------l~~~l~~~~~~l~W~~~~~--L~V~~-~-~~ 517 (591)
T PRK13616 464 MIIGG--------------KVYLAVVEQTEDGQ-YALTNPRE-------VGPGLGDTAVSLDWRTGDS--LVVGR-S-DP 517 (591)
T ss_pred EEECC--------------EEEEEEEEeCCCCc-eeecccEE-------eecccCCccccceEecCCE--EEEEe-c-CC
Confidence 98631 5666 4445554 44432210 0000011 34678999997 55332 2 21
Q ss_pred CCCccCCccceEEeccCCCCCCCCce
Q 004574 267 DANVEVSPRDIIYTQPAEPAEGEKPE 292 (744)
Q Consensus 267 ~~~~~~~~~~~l~~~~~~~~~~~~~~ 292 (744)
+ ..+|.+++ ++....
T Consensus 518 ~--------~~v~~v~v---DG~~~~ 532 (591)
T PRK13616 518 E--------HPVWYVNL---DGSNSD 532 (591)
T ss_pred C--------CceEEEec---CCcccc
Confidence 1 13788887 544433
No 146
>KOG4627 consensus Kynurenine formamidase [Amino acid transport and metabolism]
Probab=99.19 E-value=8.9e-11 Score=104.12 Aligned_cols=200 Identities=14% Similarity=0.101 Sum_probs=129.6
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEe
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLA 560 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~ 560 (744)
..+.+.|-. +| .....+|.|.. .-|+.||+|||+|.-++...++ ..+.....+||+|.+
T Consensus 44 r~e~l~Yg~-~g-~q~VDIwg~~~-------~~klfIfIHGGYW~~g~rk~cl------------siv~~a~~~gY~vas 102 (270)
T KOG4627|consen 44 RVEHLRYGE-GG-RQLVDIWGSTN-------QAKLFIFIHGGYWQEGDRKMCL------------SIVGPAVRRGYRVAS 102 (270)
T ss_pred chhccccCC-CC-ceEEEEecCCC-------CccEEEEEecchhhcCchhccc------------chhhhhhhcCeEEEE
Confidence 455555542 33 34456777754 1589999999988654444222 234566789999988
Q ss_pred cCC-CCCCCCCCCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC-CCceeEEEEccCCCCCCC-CCCc
Q 004574 561 GPS-IPIIGEGDKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA-PHLFCCGIARSGSYNKTL-TPFG 637 (744)
Q Consensus 561 ~~~-~~~~g~g~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~-p~~~~~~v~~~~~~~~~~-~~~~ 637 (744)
..+ +-..++-.++...++...++|+.+... +.+++.+.|||+|+.+|+.+.++. ..++.++++.+|+++... +...
T Consensus 103 vgY~l~~q~htL~qt~~~~~~gv~filk~~~-n~k~l~~gGHSaGAHLa~qav~R~r~prI~gl~l~~GvY~l~EL~~te 181 (270)
T KOG4627|consen 103 VGYNLCPQVHTLEQTMTQFTHGVNFILKYTE-NTKVLTFGGHSAGAHLAAQAVMRQRSPRIWGLILLCGVYDLRELSNTE 181 (270)
T ss_pred eccCcCcccccHHHHHHHHHHHHHHHHHhcc-cceeEEEcccchHHHHHHHHHHHhcCchHHHHHHHhhHhhHHHHhCCc
Confidence 332 112223334566689999999998632 346799999999999999998764 348999999999887321 1111
Q ss_pred ccccccchhhcHHHHHhcCc-ccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcc
Q 004574 638 FQTEFRTLWEATNVYIEMSP-ITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHV 711 (744)
Q Consensus 638 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~ 711 (744)
....... ..+.....|+ +.....++.|+|++.+++|.--- +++.+.+...++++ .+..|++.+|.
T Consensus 182 ~g~dlgL---t~~~ae~~Scdl~~~~~v~~~ilVv~~~~espkl--ieQnrdf~~q~~~a----~~~~f~n~~hy 247 (270)
T KOG4627|consen 182 SGNDLGL---TERNAESVSCDLWEYTDVTVWILVVAAEHESPKL--IEQNRDFADQLRKA----SFTLFKNYDHY 247 (270)
T ss_pred cccccCc---ccchhhhcCccHHHhcCceeeeeEeeecccCcHH--HHhhhhHHHHhhhc----ceeecCCcchh
Confidence 1111000 0011111222 22356778999999999997654 78888888877654 88899999895
No 147
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=99.18 E-value=1.8e-07 Score=91.13 Aligned_cols=255 Identities=13% Similarity=0.074 Sum_probs=137.5
Q ss_pred EEEEEcC-CCCeeecCCC----ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeC--CCCeeeeccC-CCC
Q 004574 158 QLVLGSL-DGTAKDFGTP----AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTT--DGKLVRELCD-LPP 229 (744)
Q Consensus 158 ~l~~~~~-~G~~~~l~~~----~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~--~g~~~~~l~~-~~~ 229 (744)
--|.+|- +|..+.+... .....++.++||++|+...-.. ..+-++.+ +|.. ..+.+ ...
T Consensus 67 aay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~~g~~vf~AnY~~------------g~v~v~p~~~dG~l-~~~v~~~~h 133 (346)
T COG2706 67 AAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHS------------GSVSVYPLQADGSL-QPVVQVVKH 133 (346)
T ss_pred EEEEEcCCCCeEEEeeccccCCCCCeEEEECCCCCEEEEEEccC------------ceEEEEEcccCCcc-ccceeeeec
Confidence 4566666 5766655322 3336788999999887664442 24555544 3332 22111 111
Q ss_pred CCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEe----eeeccceecee
Q 004574 230 AEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEIL----HKLDLRFRSVS 305 (744)
Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l----~~~~~~~~~~~ 305 (744)
....|..-... -..-...+.|||+. |+.+. - ..++|+++++ +.|..... ......-+.+.
T Consensus 134 ~g~~p~~rQ~~-~h~H~a~~tP~~~~-l~v~D---L--------G~Dri~~y~~---~dg~L~~~~~~~v~~G~GPRHi~ 197 (346)
T COG2706 134 TGSGPHERQES-PHVHSANFTPDGRY-LVVPD---L--------GTDRIFLYDL---DDGKLTPADPAEVKPGAGPRHIV 197 (346)
T ss_pred CCCCCCccccC-CccceeeeCCCCCE-EEEee---c--------CCceEEEEEc---ccCccccccccccCCCCCcceEE
Confidence 11111100000 01334678999996 54442 1 2345777777 33332221 12344567789
Q ss_pred eccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEcc
Q 004574 306 WCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNG 385 (744)
Q Consensus 306 ~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~ 385 (744)
|.|+|+..+... .-++.-.+|.++...++.+..+....-..+.....+...+..|+||++|+... +
T Consensus 198 FHpn~k~aY~v~-EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasN-R------------ 263 (346)
T COG2706 198 FHPNGKYAYLVN-ELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASN-R------------ 263 (346)
T ss_pred EcCCCcEEEEEe-ccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEec-C------------
Confidence 999998655544 55566777888766432222222111112222223334466799999876653 2
Q ss_pred CCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCcee
Q 004574 386 RGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSS 465 (744)
Q Consensus 386 ~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~ 465 (744)
| .+.-.++.+|..+|+.+.+-....+ . .-.+...++++|+.|+...+. ...-.+|.+|.++|++.
T Consensus 264 -g----~dsI~~f~V~~~~g~L~~~~~~~te-------g--~~PR~F~i~~~g~~Liaa~q~-sd~i~vf~~d~~TG~L~ 328 (346)
T COG2706 264 -G----HDSIAVFSVDPDGGKLELVGITPTE-------G--QFPRDFNINPSGRFLIAANQK-SDNITVFERDKETGRLT 328 (346)
T ss_pred -C----CCeEEEEEEcCCCCEEEEEEEeccC-------C--cCCccceeCCCCCEEEEEccC-CCcEEEEEEcCCCceEE
Confidence 1 1123467778888876544332211 0 012335888999877766554 44478999999999888
Q ss_pred eeecC
Q 004574 466 QITNF 470 (744)
Q Consensus 466 ~lt~~ 470 (744)
.+...
T Consensus 329 ~~~~~ 333 (346)
T COG2706 329 LLGRY 333 (346)
T ss_pred ecccc
Confidence 77653
No 148
>PLN02980 2-oxoglutarate decarboxylase/ hydro-lyase/ magnesium ion binding / thiamin pyrophosphate binding
Probab=99.17 E-value=1e-09 Score=135.04 Aligned_cols=198 Identities=16% Similarity=0.209 Sum_probs=121.0
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC--------------hHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL--------------PNDSA 578 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~--------------~~~d~ 578 (744)
.|+||++||.+... ..| ..++..| ..+|.|+. .+.+|+|.+. ..+++
T Consensus 1371 ~~~vVllHG~~~s~-----------~~w----~~~~~~L-~~~~rVi~---~Dl~G~G~S~~~~~~~~~~~~~~~si~~~ 1431 (1655)
T PLN02980 1371 GSVVLFLHGFLGTG-----------EDW----IPIMKAI-SGSARCIS---IDLPGHGGSKIQNHAKETQTEPTLSVELV 1431 (1655)
T ss_pred CCeEEEECCCCCCH-----------HHH----HHHHHHH-hCCCEEEE---EcCCCCCCCCCccccccccccccCCHHHH
Confidence 57899999975211 111 1223333 45799998 4455555432 12334
Q ss_pred HHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC------C------------ccc-
Q 004574 579 EAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP------F------------GFQ- 639 (744)
Q Consensus 579 ~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~------~------------~~~- 639 (744)
.+.+..+.++ ...+++.|+||||||.+++.++.++|++++++|++++........ . ...
T Consensus 1432 a~~l~~ll~~--l~~~~v~LvGhSmGG~iAl~~A~~~P~~V~~lVlis~~p~~~~~~~~~~~~~~~~~~~~~l~~~g~~~ 1509 (1655)
T PLN02980 1432 ADLLYKLIEH--ITPGKVTLVGYSMGARIALYMALRFSDKIEGAVIISGSPGLKDEVARKIRSAKDDSRARMLIDHGLEI 1509 (1655)
T ss_pred HHHHHHHHHH--hCCCCEEEEEECHHHHHHHHHHHhChHhhCEEEEECCCCccCchHHHHHHhhhhhHHHHHHHhhhHHH
Confidence 4444434433 234689999999999999999999999999999887642110000 0 000
Q ss_pred --cc--ccchhhc-------------------HHH----HHhc------CcccccCCCCCCEEEEeeCCCCCCCCCHHHH
Q 004574 640 --TE--FRTLWEA-------------------TNV----YIEM------SPITHANKIKKPILIIHGEVDDKVGLFPMQA 686 (744)
Q Consensus 640 --~~--~~~~~~~-------------------~~~----~~~~------~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~ 686 (744)
.. ....|.. ... +... +....+.++++|+|+++|++|..++ ..+
T Consensus 1510 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~dl~~~L~~I~~PtLlI~Ge~D~~~~---~~a 1586 (1655)
T PLN02980 1510 FLENWYSGELWKSLRNHPHFNKIVASRLLHKDVPSLAKLLSDLSIGRQPSLWEDLKQCDTPLLLVVGEKDVKFK---QIA 1586 (1655)
T ss_pred HHHHhccHHHhhhhccCHHHHHHHHHHHhcCCHHHHHHHHHHhhhcccchHHHHHhhCCCCEEEEEECCCCccH---HHH
Confidence 00 0000000 000 0000 0112366789999999999998753 566
Q ss_pred HHHHHHHHhCC--------CcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhccC
Q 004574 687 ERFFDALKGHG--------ALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 687 ~~~~~~l~~~~--------~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~~ 735 (744)
.++.+.+.... ..++++++|++||..+ .+.++.+.+.+.+||.+.-.+
T Consensus 1587 ~~~~~~i~~a~~~~~~~~~~~a~lvvI~~aGH~~~-lE~Pe~f~~~I~~FL~~~~~~ 1642 (1655)
T PLN02980 1587 QKMYREIGKSKESGNDKGKEIIEIVEIPNCGHAVH-LENPLPVIRALRKFLTRLHNS 1642 (1655)
T ss_pred HHHHHHccccccccccccccceEEEEECCCCCchH-HHCHHHHHHHHHHHHHhcccc
Confidence 67776665421 1268999999999887 677889999999999975433
No 149
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.16 E-value=4.6e-08 Score=102.03 Aligned_cols=225 Identities=10% Similarity=0.026 Sum_probs=110.6
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC--CCeeeeccCCCCCCCCCcccCCccCCCCccceecC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD--GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRAD 252 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~--g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spD 252 (744)
+....++++|||++|+...... ..+.+|+++ +...+.+...+.. .++..++++||
T Consensus 80 ~~p~~i~~~~~g~~l~v~~~~~------------~~v~v~~~~~~g~~~~~~~~~~~~-----------~~~~~~~~~p~ 136 (330)
T PRK11028 80 GSPTHISTDHQGRFLFSASYNA------------NCVSVSPLDKDGIPVAPIQIIEGL-----------EGCHSANIDPD 136 (330)
T ss_pred CCceEEEECCCCCEEEEEEcCC------------CeEEEEEECCCCCCCCceeeccCC-----------CcccEeEeCCC
Confidence 4556789999999888764332 367777765 2211111111110 12456789999
Q ss_pred CCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceE-----ee-eeccceeceeeccCCceEEEeeeeeccceeE
Q 004574 253 KPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEI-----LH-KLDLRFRSVSWCDDSLALVNETWYKTSQTRT 326 (744)
Q Consensus 253 g~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-----l~-~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l 326 (744)
|+. ++... . ..+.|.+++++ +.+.... .. ........++|+|||++++... ...+ .|
T Consensus 137 g~~-l~v~~-~----------~~~~v~v~d~~--~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~-~~~~--~v 199 (330)
T PRK11028 137 NRT-LWVPC-L----------KEDRIRLFTLS--DDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVN-ELNS--SV 199 (330)
T ss_pred CCE-EEEee-C----------CCCEEEEEEEC--CCCcccccCCCceecCCCCCCceEEECCCCCEEEEEe-cCCC--EE
Confidence 986 54442 1 12357777763 2222110 11 1123345689999999877654 3223 45
Q ss_pred EEEcCCCCCCcceeeeccc-cccccCCCCCC-ceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecC-
Q 004574 327 WLVCPGSKDVAPRVLFDRV-FENVYSDPGSP-MMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDIN- 403 (744)
Q Consensus 327 ~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~-~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~- 403 (744)
.+++++...++.+.+.... .......+..+ .+.++|||++|+..... ...|.+|+..
T Consensus 200 ~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~--------------------~~~I~v~~i~~ 259 (330)
T PRK11028 200 DVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRT--------------------ASLISVFSVSE 259 (330)
T ss_pred EEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCC--------------------CCeEEEEEEeC
Confidence 5555542112222211100 00000001111 24578999988776311 1235555553
Q ss_pred -CCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecC
Q 004574 404 -TGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNF 470 (744)
Q Consensus 404 -~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~ 470 (744)
++..+ +...- .. ......+.++|||+.|+.+... ...-.+|.++..+|..+.+..+
T Consensus 260 ~~~~~~-~~~~~--------~~-~~~p~~~~~~~dg~~l~va~~~-~~~v~v~~~~~~~g~l~~~~~~ 316 (330)
T PRK11028 260 DGSVLS-FEGHQ--------PT-ETQPRGFNIDHSGKYLIAAGQK-SHHISVYEIDGETGLLTELGRY 316 (330)
T ss_pred CCCeEE-EeEEE--------ec-cccCCceEECCCCCEEEEEEcc-CCcEEEEEEcCCCCcEEEcccc
Confidence 22222 21110 00 0112236899999887765532 3333466666566766655443
No 150
>PF03403 PAF-AH_p_II: Platelet-activating factor acetylhydrolase, isoform II; PDB: 3F98_B 3F97_B 3D59_A 3F96_A 3D5E_B 3F9C_A.
Probab=99.15 E-value=5.8e-10 Score=115.97 Aligned_cols=180 Identities=17% Similarity=0.136 Sum_probs=99.4
Q ss_pred CCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCC-------------C-------
Q 004574 511 GPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGE-------------G------- 570 (744)
Q Consensus 511 ~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~-------------g------- 570 (744)
+++|+|||.||.+ ++...+...+..||++||+|++.+.++..+. .
T Consensus 98 ~~~PvvIFSHGlg---------------g~R~~yS~~~~eLAS~GyVV~aieHrDgSa~~t~~~~~~~~~~~~~~~~~~~ 162 (379)
T PF03403_consen 98 GKFPVVIFSHGLG---------------GSRTSYSAICGELASHGYVVAAIEHRDGSAPATYFMRDGSGAEVEPYVVEYL 162 (379)
T ss_dssp S-EEEEEEE--TT-----------------TTTTHHHHHHHHHTT-EEEEE---SS-SSEEEE-SSHHHHHHT-------
T ss_pred CCCCEEEEeCCCC---------------cchhhHHHHHHHHHhCCeEEEEeccCCCceeEEEeccCCCcccccccccccc
Confidence 4599999999964 2223344667899999999998433322110 0
Q ss_pred C-----------CC-----------hHHHHHHHHHHHHHc--------------------CCCCCCcEEEEEechHHHHH
Q 004574 571 D-----------KL-----------PNDSAEAAVEEVVRR--------------------GVADPSRIAVGGHSYGAFMT 608 (744)
Q Consensus 571 ~-----------~~-----------~~~d~~~~~~~l~~~--------------------~~~d~~~i~l~G~S~GG~~a 608 (744)
. .+ ...++..+++.|.+. +.+|.++|+++|||+||..+
T Consensus 163 ~~~~~~~~~~~~~~~~~~R~~QL~~R~~Ei~~~l~~L~~i~~G~~~~~~l~~~~~l~~~~grlD~~~i~~~GHSFGGATa 242 (379)
T PF03403_consen 163 EEEWIPLRDFDPEEEFELRNAQLRQRVAEIQFVLDALEEINSGDPVENVLPSSFDLSQFKGRLDLSRIGLAGHSFGGATA 242 (379)
T ss_dssp --EEEE-----GGGHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-----SS--SS-GGGGTT-EEEEEEEEEEETHHHHHH
T ss_pred ccceeccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccCCccCHHHHhhhcchhheeeeecCchHHHH
Confidence 0 00 011777788777641 23567899999999999999
Q ss_pred HHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHH
Q 004574 609 AHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAER 688 (744)
Q Consensus 609 ~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~ 688 (744)
+.++.+. .+|+++|++.|..- +. ..+ ....++.|+|+|+.+. ... ..+...
T Consensus 243 ~~~l~~d-~r~~~~I~LD~W~~----Pl--~~~------------------~~~~i~~P~L~InSe~-f~~---~~~~~~ 293 (379)
T PF03403_consen 243 LQALRQD-TRFKAGILLDPWMF----PL--GDE------------------IYSKIPQPLLFINSES-FQW---WENIFR 293 (379)
T ss_dssp HHHHHH--TT--EEEEES---T----TS---GG------------------GGGG--S-EEEEEETT-T-----HHHHHH
T ss_pred HHHHhhc-cCcceEEEeCCccc----CC--Ccc------------------cccCCCCCEEEEECcc-cCC---hhhHHH
Confidence 9999888 78999999998531 11 000 0134678999998875 321 333333
Q ss_pred HHHHHHhCCCcEEEEEeCCCCcccCc-----cc-------------c----HHHHHHHHHHHHHHhccC
Q 004574 689 FFDALKGHGALSRLVLLPFEHHVYAA-----RE-------------N----VMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 689 ~~~~l~~~~~~~~~~~~~~~~H~~~~-----~~-------------~----~~~~~~~~~~fl~~~l~~ 735 (744)
+.+ +........++.+.++.|.-.. .+ . .+...+.+++||+++|+.
T Consensus 294 ~~~-~~~~~~~~~~~ti~gt~H~s~sD~~ll~P~~l~~~~~~~g~~dp~~a~~i~~~~~l~FL~~~L~~ 361 (379)
T PF03403_consen 294 MKK-VISNNKESRMLTIKGTAHLSFSDFPLLSPWLLGKFLGLKGSIDPERALRINNRASLAFLRRHLGL 361 (379)
T ss_dssp HHT-T--TTS-EEEEEETT--GGGGSGGGGTS-HHHHHHTTSS-SS-HHHHHHHHHHHHHHHHHHHHT-
T ss_pred HHH-HhccCCCcEEEEECCCcCCCcchhhhhhHHHHHHHhccccCcCHHHHHHHHHHHHHHHHHHhcCC
Confidence 333 2234456789999999995221 00 1 233456789999999874
No 151
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.14 E-value=1.7e-08 Score=94.01 Aligned_cols=275 Identities=11% Similarity=0.060 Sum_probs=166.9
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECC---CCc-eeccccCCCccccccccceEEecCCcEEEEEecC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAE---TGE-AKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPS 107 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~---gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~ 107 (744)
.+...+..+.+..+.+...+ +...-+|-+.-+ .|. .|+++.+.+ .+....-|+||.+.+..+.+
T Consensus 17 ~Vt~la~~~~~~~~l~sasr-------Dk~ii~W~L~~dd~~~G~~~r~~~GHsH-----~v~dv~~s~dg~~alS~swD 84 (315)
T KOG0279|consen 17 WVTALAIKIKNSDILVSASR-------DKTIIVWKLTSDDIKYGVPVRRLTGHSH-----FVSDVVLSSDGNFALSASWD 84 (315)
T ss_pred eEEEEEeecCCCceEEEccc-------ceEEEEEEeccCccccCceeeeeeccce-----EecceEEccCCceEEecccc
Confidence 56677777777777774432 222333433322 233 355554343 56788999999977655322
Q ss_pred CCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCee-ecCCC-ceeeeeccCC
Q 004574 108 SRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAK-DFGTP-AVYTAVEPSP 184 (744)
Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~-~l~~~-~~~~~~~~Sp 184 (744)
..+.++|+ +|+.+ ++--+ ..+...++||
T Consensus 85 -------------------------------------------------~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~ 115 (315)
T KOG0279|consen 85 -------------------------------------------------GTLRLWDLATGESTRRFVGHTKDVLSVAFST 115 (315)
T ss_pred -------------------------------------------------ceEEEEEecCCcEEEEEEecCCceEEEEecC
Confidence 56888899 77544 44444 7788999999
Q ss_pred CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCC-CCCCCCcccCCccCCCCccceecCCCceEEEEEee
Q 004574 185 DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLP-PAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQ 263 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~ 263 (744)
|.++|+-.+.+ ..|-+|+.-|.....+.... .+- +..+.|+|+-.. .+.++..
T Consensus 116 dn~qivSGSrD-------------kTiklwnt~g~ck~t~~~~~~~~W------------VscvrfsP~~~~-p~Ivs~s 169 (315)
T KOG0279|consen 116 DNRQIVSGSRD-------------KTIKLWNTLGVCKYTIHEDSHREW------------VSCVRFSPNESN-PIIVSAS 169 (315)
T ss_pred CCceeecCCCc-------------ceeeeeeecccEEEEEecCCCcCc------------EEEEEEcCCCCC-cEEEEcc
Confidence 99999866544 37899999888777766553 111 456899999655 4444321
Q ss_pred cCCCCCccCCccceEEeccCCCCCCCCceE-eeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeee
Q 004574 264 DRGDANVEVSPRDIIYTQPAEPAEGEKPEI-LHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLF 342 (744)
Q Consensus 264 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~ 342 (744)
....+-++++ .+-+.+. +....+.++.+++||||..+++.. ...+++.+|++.+ +. .-+
T Consensus 170 ----------~DktvKvWnl---~~~~l~~~~~gh~~~v~t~~vSpDGslcasGg----kdg~~~LwdL~~~--k~-lys 229 (315)
T KOG0279|consen 170 ----------WDKTVKVWNL---RNCQLRTTFIGHSGYVNTVTVSPDGSLCASGG----KDGEAMLWDLNEG--KN-LYS 229 (315)
T ss_pred ----------CCceEEEEcc---CCcchhhccccccccEEEEEECCCCCEEecCC----CCceEEEEEccCC--ce-eEe
Confidence 2224777777 3434333 334567888999999999888743 3357888888874 22 222
Q ss_pred ccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhhe
Q 004574 343 DRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETA 422 (744)
Q Consensus 343 ~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~ 422 (744)
..+...+.+ ++|||+-=+|+... ...|.+||+.++....-...++.
T Consensus 230 l~a~~~v~s------l~fspnrywL~~at----------------------~~sIkIwdl~~~~~v~~l~~d~~------ 275 (315)
T KOG0279|consen 230 LEAFDIVNS------LCFSPNRYWLCAAT----------------------ATSIKIWDLESKAVVEELKLDGI------ 275 (315)
T ss_pred ccCCCeEee------EEecCCceeEeecc----------------------CCceEEEeccchhhhhhcccccc------
Confidence 222222222 66888876665553 23488999887754321111110
Q ss_pred eeeecC----CcceecccCCCEEEEEEec
Q 004574 423 VALVFG----QGEEDINLNQLKILTSKES 447 (744)
Q Consensus 423 ~~~~~~----~~~~~~s~d~~~~~~~~~~ 447 (744)
...... -..++||+||.+|+....+
T Consensus 276 g~s~~~~~~~clslaws~dG~tLf~g~td 304 (315)
T KOG0279|consen 276 GPSSKAGDPICLSLAWSADGQTLFAGYTD 304 (315)
T ss_pred ccccccCCcEEEEEEEcCCCcEEEeeecC
Confidence 000000 0127999999998876643
No 152
>KOG4667 consensus Predicted esterase [Lipid transport and metabolism]
Probab=99.14 E-value=4.6e-10 Score=100.63 Aligned_cols=200 Identities=14% Similarity=0.113 Sum_probs=122.6
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEe
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLA 560 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~ 560 (744)
....+.|.+..+..+-+.+..- +..-++|++||+- - +.-...+...+..++..||.++.
T Consensus 9 ~~~~ivi~n~~ne~lvg~lh~t--------gs~e~vvlcHGfr---------S----~Kn~~~~~~vA~~~e~~gis~fR 67 (269)
T KOG4667|consen 9 IAQKIVIPNSRNEKLVGLLHET--------GSTEIVVLCHGFR---------S----HKNAIIMKNVAKALEKEGISAFR 67 (269)
T ss_pred eeeEEEeccCCCchhhcceecc--------CCceEEEEeeccc---------c----ccchHHHHHHHHHHHhcCceEEE
Confidence 3445555555555555533221 1245899999951 0 01111223456677889999998
Q ss_pred cCCCCCCCCCCCC----------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCC
Q 004574 561 GPSIPIIGEGDKL----------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYN 630 (744)
Q Consensus 561 ~~~~~~~g~g~~~----------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~ 630 (744)
++..|.|.++ ..+|+..+++++..... .--+|+|||-||.+++..+.+.++ +.-+|.++|-++
T Consensus 68 ---fDF~GnGeS~gsf~~Gn~~~eadDL~sV~q~~s~~nr---~v~vi~gHSkGg~Vvl~ya~K~~d-~~~viNcsGRyd 140 (269)
T KOG4667|consen 68 ---FDFSGNGESEGSFYYGNYNTEADDLHSVIQYFSNSNR---VVPVILGHSKGGDVVLLYASKYHD-IRNVINCSGRYD 140 (269)
T ss_pred ---EEecCCCCcCCccccCcccchHHHHHHHHHHhccCce---EEEEEEeecCccHHHHHHHHhhcC-chheEEcccccc
Confidence 4444444331 12588888888886432 124789999999999999999965 777888888766
Q ss_pred CCCCC------Ccccc-cccchhhc------------HHHH-Hhc--CcccccCCC--CCCEEEEeeCCCCCCCCCHHHH
Q 004574 631 KTLTP------FGFQT-EFRTLWEA------------TNVY-IEM--SPITHANKI--KKPILIIHGEVDDKVGLFPMQA 686 (744)
Q Consensus 631 ~~~~~------~~~~~-~~~~~~~~------------~~~~-~~~--~~~~~~~~~--~~P~l~i~G~~D~~v~~~~~~~ 686 (744)
..... ..... ....+|.. ++.+ ... +......+| .||+|-+||..|.+|| .+.|
T Consensus 141 l~~~I~eRlg~~~l~~ike~Gfid~~~rkG~y~~rvt~eSlmdrLntd~h~aclkId~~C~VLTvhGs~D~IVP--ve~A 218 (269)
T KOG4667|consen 141 LKNGINERLGEDYLERIKEQGFIDVGPRKGKYGYRVTEESLMDRLNTDIHEACLKIDKQCRVLTVHGSEDEIVP--VEDA 218 (269)
T ss_pred hhcchhhhhcccHHHHHHhCCceecCcccCCcCceecHHHHHHHHhchhhhhhcCcCccCceEEEeccCCceee--chhH
Confidence 32111 00000 00111111 1111 111 111112233 6999999999999999 9999
Q ss_pred HHHHHHHHhCCCcEEEEEeCCCCcccCc
Q 004574 687 ERFFDALKGHGALSRLVLLPFEHHVYAA 714 (744)
Q Consensus 687 ~~~~~~l~~~~~~~~~~~~~~~~H~~~~ 714 (744)
.++.+.+.. .++.+++++.|++..
T Consensus 219 kefAk~i~n----H~L~iIEgADHnyt~ 242 (269)
T KOG4667|consen 219 KEFAKIIPN----HKLEIIEGADHNYTG 242 (269)
T ss_pred HHHHHhccC----CceEEecCCCcCccc
Confidence 999887764 589999999999874
No 153
>PRK06765 homoserine O-acetyltransferase; Provisional
Probab=99.14 E-value=1.6e-09 Score=113.54 Aligned_cols=69 Identities=14% Similarity=0.167 Sum_probs=59.1
Q ss_pred ccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCC-CCcccCccccHHHHHHHHHHHHHH
Q 004574 660 HANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPF-EHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 660 ~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
.+.++++|+|+|+|+.|.++| ...++++.+.++..+.+++++++++ .+|... .+.+..+.+.+.+||++
T Consensus 318 ~L~~I~~PtLvI~G~~D~l~p--~~~~~~la~~lp~~~~~a~l~~I~s~~GH~~~-le~p~~~~~~I~~FL~~ 387 (389)
T PRK06765 318 ALSNIEANVLMIPCKQDLLQP--PRYNYKMVDILQKQGKYAEVYEIESINGHMAG-VFDIHLFEKKIYEFLNR 387 (389)
T ss_pred HHhcCCCCEEEEEeCCCCCCC--HHHHHHHHHHhhhcCCCeEEEEECCCCCcchh-hcCHHHHHHHHHHHHcc
Confidence 345789999999999999999 9999999988887666789999985 899876 56778899999999965
No 154
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.13 E-value=1.3e-07 Score=90.11 Aligned_cols=253 Identities=15% Similarity=0.172 Sum_probs=157.3
Q ss_pred CcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCC
Q 004574 31 AKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRR 110 (744)
Q Consensus 31 ~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~ 110 (744)
..+....+|+||..++-.+ ....|.+++...|+..+....... ++.-+.|......+...+.. .
T Consensus 15 ~~i~sl~fs~~G~~litss----------~dDsl~LYd~~~g~~~~ti~skky----G~~~~~Fth~~~~~i~sStk--~ 78 (311)
T KOG1446|consen 15 GKINSLDFSDDGLLLITSS----------EDDSLRLYDSLSGKQVKTINSKKY----GVDLACFTHHSNTVIHSSTK--E 78 (311)
T ss_pred CceeEEEecCCCCEEEEec----------CCCeEEEEEcCCCceeeEeecccc----cccEEEEecCCceEEEccCC--C
Confidence 3688899999999988732 224567778887776655544432 34556777666666654321 0
Q ss_pred CCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC--ceeeeeccCCCCc
Q 004574 111 DPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP--AVYTAVEPSPDQK 187 (744)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~--~~~~~~~~SpDG~ 187 (744)
. ..|..+++ +-+--+-..+ ..+..+.-||-+.
T Consensus 79 d---------------------------------------------~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~d 113 (311)
T KOG1446|consen 79 D---------------------------------------------DTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKDD 113 (311)
T ss_pred C---------------------------------------------CceEEEEeecCceEEEcCCCCceEEEEEecCCCC
Confidence 0 34666666 4443333333 6678899999886
Q ss_pred eEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCC
Q 004574 188 YVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGD 267 (744)
Q Consensus 188 ~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~ 267 (744)
..+-.+.+ ..|.+||+...+..-+...... .-+++.|.| |+|+....+
T Consensus 114 ~FlS~S~D-------------~tvrLWDlR~~~cqg~l~~~~~--------------pi~AfDp~G---LifA~~~~~-- 161 (311)
T KOG1446|consen 114 TFLSSSLD-------------KTVRLWDLRVKKCQGLLNLSGR--------------PIAAFDPEG---LIFALANGS-- 161 (311)
T ss_pred eEEecccC-------------CeEEeeEecCCCCceEEecCCC--------------cceeECCCC---cEEEEecCC--
Confidence 54433333 3799999886654433322111 134899999 777754322
Q ss_pred CCccCCccceEEeccCCCCCCCCceEeeee---ccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeecc
Q 004574 268 ANVEVSPRDIIYTQPAEPAEGEKPEILHKL---DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDR 344 (744)
Q Consensus 268 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~---~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~ 344 (744)
+.|.+.|++.++.|..+.+.-. ..++..+.|||||++|+.++ ....++++|.-.| ..+.-...
T Consensus 162 --------~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dGK~iLlsT----~~s~~~~lDAf~G--~~~~tfs~ 227 (311)
T KOG1446|consen 162 --------ELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDGKSILLST----NASFIYLLDAFDG--TVKSTFSG 227 (311)
T ss_pred --------CeEEEEEecccCCCCceeEccCCCCccceeeeEEcCCCCEEEEEe----CCCcEEEEEccCC--cEeeeEee
Confidence 2477888766666665554333 45788899999999999887 3446889997774 32222221
Q ss_pred ccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeecc
Q 004574 345 VFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESN 414 (744)
Q Consensus 345 ~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~ 414 (744)
..... ..+. ...++|||+.|+..+. ...|++|++.+|.....+...
T Consensus 228 ~~~~~-~~~~--~a~ftPds~Fvl~gs~---------------------dg~i~vw~~~tg~~v~~~~~~ 273 (311)
T KOG1446|consen 228 YPNAG-NLPL--SATFTPDSKFVLSGSD---------------------DGTIHVWNLETGKKVAVLRGP 273 (311)
T ss_pred ccCCC-Ccce--eEEECCCCcEEEEecC---------------------CCcEEEEEcCCCcEeeEecCC
Confidence 11110 0112 2458999999887752 124899999999887776653
No 155
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.12 E-value=1.2e-08 Score=110.30 Aligned_cols=268 Identities=16% Similarity=0.150 Sum_probs=156.4
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc---eeccccCCCccccccccceEEecCCcEEEEEecCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE---AKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSS 108 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~---~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~ 108 (744)
.+....+||||++++.... +....+| .+.+++ .+.+. .....+..+.|||||+.|+-.+.+
T Consensus 161 sv~~~~fs~~g~~l~~~~~--------~~~i~~~--~~~~~~~~~~~~l~-----~h~~~v~~~~fs~d~~~l~s~s~D- 224 (456)
T KOG0266|consen 161 SVTCVDFSPDGRALAAASS--------DGLIRIW--KLEGIKSNLLRELS-----GHTRGVSDVAFSPDGSYLLSGSDD- 224 (456)
T ss_pred ceEEEEEcCCCCeEEEccC--------CCcEEEe--ecccccchhhcccc-----ccccceeeeEECCCCcEEEEecCC-
Confidence 4566899999999988643 3334444 444544 22221 122257789999999977765322
Q ss_pred CCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC--CeeecCCC-ceeeeeccCC
Q 004574 109 RRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG--TAKDFGTP-AVYTAVEPSP 184 (744)
Q Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G--~~~~l~~~-~~~~~~~~Sp 184 (744)
.+|+++++ +. ..+.+..+ ..+...+|+|
T Consensus 225 ------------------------------------------------~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p 256 (456)
T KOG0266|consen 225 ------------------------------------------------KTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSP 256 (456)
T ss_pred ------------------------------------------------ceEEEeeccCCCeEEEEecCCCCceEEEEecC
Confidence 46777777 44 44555544 7788999999
Q ss_pred CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeee-ccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEee
Q 004574 185 DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRE-LCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQ 263 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~ 263 (744)
+|+.|+..+.+ ..+.+|++.+++... +..+.. ++..+.+++||+. |+..+
T Consensus 257 ~g~~i~Sgs~D-------------~tvriWd~~~~~~~~~l~~hs~-------------~is~~~f~~d~~~-l~s~s-- 307 (456)
T KOG0266|consen 257 DGNLLVSGSDD-------------GTVRIWDVRTGECVRKLKGHSD-------------GISGLAFSPDGNL-LVSAS-- 307 (456)
T ss_pred CCCEEEEecCC-------------CcEEEEeccCCeEEEeeeccCC-------------ceEEEEECCCCCE-EEEcC--
Confidence 99655544333 379999998754443 333321 2667899999986 43332
Q ss_pred cCCCCCccCCccceEEeccCCCCCCCCc---eEeeeecc--ceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcc
Q 004574 264 DRGDANVEVSPRDIIYTQPAEPAEGEKP---EILHKLDL--RFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAP 338 (744)
Q Consensus 264 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~---~~l~~~~~--~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~ 338 (744)
..+.|.++|+ .++.. ..+..... .+..+.|||+|++|+.... ...+.++++..+ ...
T Consensus 308 ----------~d~~i~vwd~---~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~----d~~~~~w~l~~~-~~~ 369 (456)
T KOG0266|consen 308 ----------YDGTIRVWDL---ETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASL----DRTLKLWDLRSG-KSV 369 (456)
T ss_pred ----------CCccEEEEEC---CCCceeeeecccCCCCCCceeEEEECCCCcEEEEecC----CCeEEEEEccCC-cce
Confidence 3345888888 55552 23333322 4788999999999998762 234666666653 222
Q ss_pred eeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhh
Q 004574 339 RVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKY 418 (744)
Q Consensus 339 ~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~ 418 (744)
......... .... .. ...+++|++++..... ..++.||+.++..........
T Consensus 370 ~~~~~~~~~-~~~~-~~--~~~~~~~~~i~sg~~d---------------------~~v~~~~~~s~~~~~~l~~h~--- 421 (456)
T KOG0266|consen 370 GTYTGHSNL-VRCI-FS--PTLSTGGKLIYSGSED---------------------GSVYVWDSSSGGILQRLEGHS--- 421 (456)
T ss_pred eeecccCCc-ceeE-ec--ccccCCCCeEEEEeCC---------------------ceEEEEeCCccchhhhhcCCC---
Confidence 222222221 1000 01 1247788877776522 248888988765433322210
Q ss_pred hhheeeeecCCcceecccCCCEEEEEE
Q 004574 419 FETAVALVFGQGEEDINLNQLKILTSK 445 (744)
Q Consensus 419 ~~~~~~~~~~~~~~~~s~d~~~~~~~~ 445 (744)
......++++|..+.++...
T Consensus 422 -------~~~~~~~~~~~~~~~~~s~s 441 (456)
T KOG0266|consen 422 -------KAAVSDLSSHPTENLIASSS 441 (456)
T ss_pred -------CCceeccccCCCcCeeeecC
Confidence 11112246677776666543
No 156
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.12 E-value=2e-08 Score=100.11 Aligned_cols=268 Identities=14% Similarity=0.133 Sum_probs=165.2
Q ss_pred CCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecC
Q 004574 28 PDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPS 107 (744)
Q Consensus 28 ~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~ 107 (744)
+....++...|+-||..||+.. ..+.+.+++.+|+...-|-++.. .+..+.|+-+|.+|+....+
T Consensus 233 ~~nkdVT~L~Wn~~G~~LatG~----------~~G~~riw~~~G~l~~tl~~Hkg-----PI~slKWnk~G~yilS~~vD 297 (524)
T KOG0273|consen 233 PSNKDVTSLDWNNDGTLLATGS----------EDGEARIWNKDGNLISTLGQHKG-----PIFSLKWNKKGTYILSGGVD 297 (524)
T ss_pred CccCCcceEEecCCCCeEEEee----------cCcEEEEEecCchhhhhhhccCC-----ceEEEEEcCCCCEEEeccCC
Confidence 3444688999999999999964 34566666888888888866665 46788999999999875321
Q ss_pred CCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC--ceeeeeccCC
Q 004574 108 SRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP--AVYTAVEPSP 184 (744)
Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~--~~~~~~~~Sp 184 (744)
+.+.++|. +|+..+.... .....+.|--
T Consensus 298 -------------------------------------------------~ttilwd~~~g~~~q~f~~~s~~~lDVdW~~ 328 (524)
T KOG0273|consen 298 -------------------------------------------------GTTILWDAHTGTVKQQFEFHSAPALDVDWQS 328 (524)
T ss_pred -------------------------------------------------ccEEEEeccCceEEEeeeeccCCccceEEec
Confidence 45677788 8865555433 3334566754
Q ss_pred CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCee-eeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEee
Q 004574 185 DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV-RELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQ 263 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~ 263 (744)
+.+. +-...+ ..|+++.++++.. +.+..+... +..+.|.|.|+- |+-++ .
T Consensus 329 ~~~F-~ts~td-------------~~i~V~kv~~~~P~~t~~GH~g~-------------V~alk~n~tg~L-LaS~S-d 379 (524)
T KOG0273|consen 329 NDEF-ATSSTD-------------GCIHVCKVGEDRPVKTFIGHHGE-------------VNALKWNPTGSL-LASCS-D 379 (524)
T ss_pred CceE-eecCCC-------------ceEEEEEecCCCcceeeecccCc-------------eEEEEECCCCce-EEEec-C
Confidence 4432 222111 3588888876543 333333322 668899999974 44443 1
Q ss_pred cCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCce---------EEEeeeeeccceeEEEEcCCCC
Q 004574 264 DRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLA---------LVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 264 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~---------l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
++ .+-+++.. .+.....|......+..+.|||+|.- |+..+ ....+..+|+..
T Consensus 380 D~-----------TlkiWs~~--~~~~~~~l~~Hskei~t~~wsp~g~v~~n~~~~~~l~sas----~dstV~lwdv~~- 441 (524)
T KOG0273|consen 380 DG-----------TLKIWSMG--QSNSVHDLQAHSKEIYTIKWSPTGPVTSNPNMNLMLASAS----FDSTVKLWDVES- 441 (524)
T ss_pred CC-----------eeEeeecC--CCcchhhhhhhccceeeEeecCCCCccCCCcCCceEEEee----cCCeEEEEEccC-
Confidence 11 23333321 22333445555666777889988642 23222 334566777776
Q ss_pred CCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeecc
Q 004574 335 DVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESN 414 (744)
Q Consensus 335 ~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~ 414 (744)
+.++..+..+...+++ +++||||++|++..-+ ..+.+|+..+++..+-...+
T Consensus 442 -gv~i~~f~kH~~pVys------vafS~~g~ylAsGs~d---------------------g~V~iws~~~~~l~~s~~~~ 493 (524)
T KOG0273|consen 442 -GVPIHTLMKHQEPVYS------VAFSPNGRYLASGSLD---------------------GCVHIWSTKTGKLVKSYQGT 493 (524)
T ss_pred -CceeEeeccCCCceEE------EEecCCCcEEEecCCC---------------------CeeEeccccchheeEeecCC
Confidence 4555555445544444 7799999999987521 23788888888765444333
Q ss_pred chhhhhheeeeecCCcceecccCCCEEEEEEe
Q 004574 415 REKYFETAVALVFGQGEEDINLNQLKILTSKE 446 (744)
Q Consensus 415 ~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~ 446 (744)
+. + ..+.|+.+|+++....+
T Consensus 494 ~~-----I-------fel~Wn~~G~kl~~~~s 513 (524)
T KOG0273|consen 494 GG-----I-------FELCWNAAGDKLGACAS 513 (524)
T ss_pred Ce-----E-------EEEEEcCCCCEEEEEec
Confidence 22 1 12588888987776554
No 157
>PRK05855 short chain dehydrogenase; Validated
Probab=99.12 E-value=8.4e-10 Score=125.25 Aligned_cols=64 Identities=9% Similarity=-0.006 Sum_probs=49.1
Q ss_pred CCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhc
Q 004574 662 NKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 662 ~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
..+++|+|+++|++|.+++ ....+.+.+.+ ...++++++ ++|..+ .+.+..+...+.+||++.-
T Consensus 230 ~~~~~P~lii~G~~D~~v~--~~~~~~~~~~~----~~~~~~~~~-~gH~~~-~e~p~~~~~~i~~fl~~~~ 293 (582)
T PRK05855 230 RYTDVPVQLIVPTGDPYVR--PALYDDLSRWV----PRLWRREIK-AGHWLP-MSHPQVLAAAVAEFVDAVE 293 (582)
T ss_pred CCccCceEEEEeCCCcccC--HHHhccccccC----CcceEEEcc-CCCcch-hhChhHHHHHHHHHHHhcc
Confidence 3478999999999999998 77766654333 235777776 589877 6677889999999998743
No 158
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.10 E-value=3.2e-08 Score=100.28 Aligned_cols=244 Identities=14% Similarity=0.119 Sum_probs=145.7
Q ss_pred cceeEeecCCCCCCCCc-eeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCce-eccccCCC
Q 004574 5 TGIGIHRLLPDDSLGPE-KEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA-KPLFESPD 82 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~-~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~-~~lt~~~~ 82 (744)
..|.++++.. ++. ..+. .....+....|+|++++|+... ..+.|+++++.+++. +.+.....
T Consensus 31 g~i~i~~~~~----~~~~~~~~--~~~~~i~~~~~~~~~~~l~~~~----------~~~~i~i~~~~~~~~~~~~~~~~~ 94 (289)
T cd00200 31 GTIKVWDLET----GELLRTLK--GHTGPVRDVAASADGTYLASGS----------SDKTIRLWDLETGECVRTLTGHTS 94 (289)
T ss_pred cEEEEEEeeC----CCcEEEEe--cCCcceeEEEECCCCCEEEEEc----------CCCeEEEEEcCcccceEEEeccCC
Confidence 3577777755 432 3333 2222456899999998888853 246788888887543 33322221
Q ss_pred ccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 83 ICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 83 ~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
.+..+.|+|+++.|+..... +.+.++
T Consensus 95 -----~i~~~~~~~~~~~~~~~~~~-------------------------------------------------~~i~~~ 120 (289)
T cd00200 95 -----YVSSVAFSPDGRILSSSSRD-------------------------------------------------KTIKVW 120 (289)
T ss_pred -----cEEEEEEcCCCCEEEEecCC-------------------------------------------------CeEEEE
Confidence 35678999998766654211 567788
Q ss_pred cC-CCC-eeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCC
Q 004574 163 SL-DGT-AKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNS 239 (744)
Q Consensus 163 ~~-~G~-~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~ 239 (744)
++ +++ ...+... ..+..+.|+|+++.|+..... ..+.+|++...+.........
T Consensus 121 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~-------------~~i~i~d~~~~~~~~~~~~~~---------- 177 (289)
T cd00200 121 DVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQD-------------GTIKLWDLRTGKCVATLTGHT---------- 177 (289)
T ss_pred ECCCcEEEEEeccCCCcEEEEEEcCcCCEEEEEcCC-------------CcEEEEEccccccceeEecCc----------
Confidence 88 463 3344433 668899999998877655322 368899987443322221111
Q ss_pred ccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEee-eeccceeceeeccCCceEEEeee
Q 004574 240 VREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILH-KLDLRFRSVSWCDDSLALVNETW 318 (744)
Q Consensus 240 ~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~-~~~~~~~~~~~SpDg~~l~~~~~ 318 (744)
..+..+.|+|+++. ++.... .+.+.+++. ..++..... .....+..+.|+|++..++...
T Consensus 178 --~~i~~~~~~~~~~~-l~~~~~------------~~~i~i~d~---~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~- 238 (289)
T cd00200 178 --GEVNSVAFSPDGEK-LLSSSS------------DGTIKLWDL---STGKCLGTLRGHENGVNSVAFSPDGYLLASGS- 238 (289)
T ss_pred --cccceEEECCCcCE-EEEecC------------CCcEEEEEC---CCCceecchhhcCCceEEEEEcCCCcEEEEEc-
Confidence 12667899999986 655532 234777777 333333333 4455788899999976665543
Q ss_pred eeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 319 YKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 319 ~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
....|+++++..+ ............ ...+.|+++++.|+...
T Consensus 239 ---~~~~i~i~~~~~~--~~~~~~~~~~~~------i~~~~~~~~~~~l~~~~ 280 (289)
T cd00200 239 ---EDGTIRVWDLRTG--ECVQTLSGHTNS------VTSLAWSPDGKRLASGS 280 (289)
T ss_pred ---CCCcEEEEEcCCc--eeEEEccccCCc------EEEEEECCCCCEEEEec
Confidence 2345777787653 222222211111 11266999998887764
No 159
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.08 E-value=9.2e-09 Score=106.96 Aligned_cols=251 Identities=14% Similarity=0.165 Sum_probs=157.0
Q ss_pred CccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCC
Q 004574 3 FFTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPD 82 (744)
Q Consensus 3 ~~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~ 82 (744)
-.+.+|+++... |+.++|..+. ...++...|+++|++||.... + +.|.++|....+..+-....+
T Consensus 195 lg~~vylW~~~s----~~v~~l~~~~-~~~vtSv~ws~~G~~LavG~~--------~--g~v~iwD~~~~k~~~~~~~~h 259 (484)
T KOG0305|consen 195 LGQSVYLWSASS----GSVTELCSFG-EELVTSVKWSPDGSHLAVGTS--------D--GTVQIWDVKEQKKTRTLRGSH 259 (484)
T ss_pred ecceEEEEecCC----CceEEeEecC-CCceEEEEECCCCCEEEEeec--------C--CeEEEEehhhccccccccCCc
Confidence 356899999877 8988888665 346899999999999999643 3 455666766543322222211
Q ss_pred ccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 83 ICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 83 ~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
..-++.++|. +..+..... .+.|...
T Consensus 260 ---~~rvg~laW~--~~~lssGsr-------------------------------------------------~~~I~~~ 285 (484)
T KOG0305|consen 260 ---ASRVGSLAWN--SSVLSSGSR-------------------------------------------------DGKILNH 285 (484)
T ss_pred ---CceeEEEecc--CceEEEecC-------------------------------------------------CCcEEEE
Confidence 1236788998 333322110 1334444
Q ss_pred cC-CC--CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-eeeeccCCCCCCCCCccc
Q 004574 163 SL-DG--TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-LVRELCDLPPAEDIPVCY 237 (744)
Q Consensus 163 ~~-~G--~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-~~~~l~~~~~~~~~~~~~ 237 (744)
|+ .. ..+.+..+ ..+.++.|++||++++-..++ +.+.+||.... ....++.+..+
T Consensus 286 dvR~~~~~~~~~~~H~qeVCgLkws~d~~~lASGgnD-------------N~~~Iwd~~~~~p~~~~~~H~aA------- 345 (484)
T KOG0305|consen 286 DVRISQHVVSTLQGHRQEVCGLKWSPDGNQLASGGND-------------NVVFIWDGLSPEPKFTFTEHTAA------- 345 (484)
T ss_pred EEecchhhhhhhhcccceeeeeEECCCCCeeccCCCc-------------cceEeccCCCccccEEEecccee-------
Confidence 44 32 22223334 788999999999998866444 37889998443 34445555444
Q ss_pred CCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEee
Q 004574 238 NSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 238 ~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
+..++|+|-.+. |+-+. ++. ....|..++. .++...........+.++.||+..+-|+.+.
T Consensus 346 ------VKA~awcP~q~~-lLAsG---GGs------~D~~i~fwn~---~~g~~i~~vdtgsQVcsL~Wsk~~kEi~sth 406 (484)
T KOG0305|consen 346 ------VKALAWCPWQSG-LLATG---GGS------ADRCIKFWNT---NTGARIDSVDTGSQVCSLIWSKKYKELLSTH 406 (484)
T ss_pred ------eeEeeeCCCccC-ceEEc---CCC------cccEEEEEEc---CCCcEecccccCCceeeEEEcCCCCEEEEec
Confidence 667899999988 44331 111 1225777777 5566555556677899999999999999887
Q ss_pred eeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 318 WYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 318 ~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
-.....-.||.++... ......++... ...++|||||..|+..+
T Consensus 407 G~s~n~i~lw~~ps~~----~~~~l~gH~~R------Vl~la~SPdg~~i~t~a 450 (484)
T KOG0305|consen 407 GYSENQITLWKYPSMK----LVAELLGHTSR------VLYLALSPDGETIVTGA 450 (484)
T ss_pred CCCCCcEEEEeccccc----eeeeecCCcce------eEEEEECCCCCEEEEec
Confidence 5544556677775321 11111222211 12267999999998886
No 160
>PF07224 Chlorophyllase: Chlorophyllase; InterPro: IPR010821 This family consists of several chlorophyllase proteins (3.1.1.14 from EC). Chlorophyllase (Chlase) is the first enzyme involved in chlorophyll degradation and catalyses the hydrolysis of the ester bond to yield chlorophyllide and phytol [, , ].; GO: 0047746 chlorophyllase activity, 0015996 chlorophyll catabolic process
Probab=99.08 E-value=1.9e-09 Score=100.22 Aligned_cols=206 Identities=17% Similarity=0.130 Sum_probs=127.9
Q ss_pred EEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCCC
Q 004574 494 PLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKL 573 (744)
Q Consensus 494 ~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~ 573 (744)
..+..++.|.. ++.+|+|+++||. ..+..++......++++||+|+++.-+........+
T Consensus 32 PkpLlI~tP~~-----~G~yPVilF~HG~---------------~l~ns~Ys~lL~HIASHGfIVVAPQl~~~~~p~~~~ 91 (307)
T PF07224_consen 32 PKPLLIVTPSE-----AGTYPVILFLHGF---------------NLYNSFYSQLLAHIASHGFIVVAPQLYTLFPPDGQD 91 (307)
T ss_pred CCCeEEecCCc-----CCCccEEEEeech---------------hhhhHHHHHHHHHHhhcCeEEEechhhcccCCCchH
Confidence 34556777866 5679999999995 223333446677889999999996655544433333
Q ss_pred hHHHHHHHHHHHHHc------C--CCCCCcEEEEEechHHHHHHHHHHhCC-C-ceeEEEEccCCCCCCCCCCccccccc
Q 004574 574 PNDSAEAAVEEVVRR------G--VADPSRIAVGGHSYGAFMTAHLLAHAP-H-LFCCGIARSGSYNKTLTPFGFQTEFR 643 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~------~--~~d~~~i~l~G~S~GG~~a~~~~~~~p-~-~~~~~v~~~~~~~~~~~~~~~~~~~~ 643 (744)
..++..++++||.+. . ..+..+++++|||.||..|..+|..+. + .|.++|.+.|+-...... +....
T Consensus 92 Ei~~aa~V~~WL~~gL~~~Lp~~V~~nl~klal~GHSrGGktAFAlALg~a~~lkfsaLIGiDPV~G~~k~~---~t~P~ 168 (307)
T PF07224_consen 92 EIKSAASVINWLPEGLQHVLPENVEANLSKLALSGHSRGGKTAFALALGYATSLKFSALIGIDPVAGTSKGK---QTPPP 168 (307)
T ss_pred HHHHHHHHHHHHHhhhhhhCCCCcccccceEEEeecCCccHHHHHHHhcccccCchhheecccccCCCCCCC---CCCCC
Confidence 445788999999863 1 245679999999999999999987662 2 588999998874321100 00000
Q ss_pred chhhcHHHHHhcCcccccCCCCCCEEEEeeCCC-----CCCCCCHH--HHHHHHHHHHhCCCcEEEEEeCCCCcccCc--
Q 004574 644 TLWEATNVYIEMSPITHANKIKKPILIIHGEVD-----DKVGLFPM--QAERFFDALKGHGALSRLVLLPFEHHVYAA-- 714 (744)
Q Consensus 644 ~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D-----~~v~~~~~--~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~-- 714 (744)
...+.|. .-.+.+|+++|-..-- -..+|.+. +-++++...+ .++.+.+..+-||+-..
T Consensus 169 --------iLty~p~--SF~l~iPv~VIGtGLg~~~~~~~~~CaP~gvnH~eFf~eCk---~p~~hfV~~dYGHmDmLDD 235 (307)
T PF07224_consen 169 --------ILTYVPQ--SFDLDIPVLVIGTGLGPKRNPLFPPCAPDGVNHEEFFNECK---PPCAHFVAKDYGHMDMLDD 235 (307)
T ss_pred --------eeecCCc--ccccCCceEEEecCcCccccCCCCCCCCCCcCHHHHHHhhc---ccceeeeeccccccccccc
Confidence 0000010 1135699998854332 12223232 3466777665 55677777888895321
Q ss_pred ---------------------cccHHHHHHHHHHHHHHhccC
Q 004574 715 ---------------------RENVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 715 ---------------------~~~~~~~~~~~~~fl~~~l~~ 735 (744)
...+..+-..+.+||+.+|..
T Consensus 236 ~~~g~~G~~~~clCkng~~pr~pMRr~vgGivVAFL~a~l~~ 277 (307)
T PF07224_consen 236 DTPGIIGKLSYCLCKNGKSPRDPMRRFVGGIVVAFLKAYLEG 277 (307)
T ss_pred CccccccceeeEeecCCCCcchHHHHhhhhhHHHHHHHHHcC
Confidence 111233345588999988864
No 161
>PF08538 DUF1749: Protein of unknown function (DUF1749); InterPro: IPR013744 This is a plant and fungal family of unknown function. This family contains many hypothetical proteins. ; PDB: 2Q0X_B.
Probab=99.07 E-value=8.5e-10 Score=107.54 Aligned_cols=184 Identities=13% Similarity=0.107 Sum_probs=71.3
Q ss_pred hhHHHHHhCCeEEEecCC-CCCCCCCCCChH---HHHHHHHHHHHHcC--CCCCCcEEEEEechHHHHHHHHHHhC----
Q 004574 546 TSSLIFLARRFAVLAGPS-IPIIGEGDKLPN---DSAEAAVEEVVRRG--VADPSRIAVGGHSYGAFMTAHLLAHA---- 615 (744)
Q Consensus 546 ~~~~~~~~~G~~v~~~~~-~~~~g~g~~~~~---~d~~~~~~~l~~~~--~~d~~~i~l~G~S~GG~~a~~~~~~~---- 615 (744)
..+..|...||.|+...- ..+.|+|..... +|+.++|+||+... ....+||+|||||-|..-++.++...
T Consensus 54 ~La~aL~~~~wsl~q~~LsSSy~G~G~~SL~~D~~eI~~~v~ylr~~~~g~~~~~kIVLmGHSTGcQdvl~Yl~~~~~~~ 133 (303)
T PF08538_consen 54 DLAEALEETGWSLFQVQLSSSYSGWGTSSLDRDVEEIAQLVEYLRSEKGGHFGREKIVLMGHSTGCQDVLHYLSSPNPSP 133 (303)
T ss_dssp HHHHHHT-TT-EEEEE--GGGBTTS-S--HHHHHHHHHHHHHHHHHHS------S-EEEEEECCHHHHHHHHHHH-TT--
T ss_pred HHHHHhccCCeEEEEEEecCccCCcCcchhhhHHHHHHHHHHHHHHhhccccCCccEEEEecCCCcHHHHHHHhccCccc
Confidence 344555567999987332 235667766433 38899999999873 23568999999999999999998765
Q ss_pred -CCceeEEEEccCCCCCCCCCCcccc-cc--------------c-----ch-------h-hcH---HHHHh---------
Q 004574 616 -PHLFCCGIARSGSYNKTLTPFGFQT-EF--------------R-----TL-------W-EAT---NVYIE--------- 654 (744)
Q Consensus 616 -p~~~~~~v~~~~~~~~~~~~~~~~~-~~--------------~-----~~-------~-~~~---~~~~~--------- 654 (744)
...+.++|+.+|+.|++........ +. . .+ + ..+ ..+..
T Consensus 134 ~~~~VdG~ILQApVSDREa~~~~~~~~~~~~~~v~~A~~~i~~g~~~~~lp~~~~~~~~~~~PiTA~Rf~SL~s~~gdDD 213 (303)
T PF08538_consen 134 SRPPVDGAILQAPVSDREAILNFLGEREAYEELVALAKELIAEGKGDEILPREFTPLVFYDTPITAYRFLSLASPGGDDD 213 (303)
T ss_dssp -CCCEEEEEEEEE---TTSTTTSHHH---HHHHHHHHHHHHHCT-TT-GG----GGTTT-SS---HHHHHT-S-SSHHHH
T ss_pred cccceEEEEEeCCCCChhHhhhcccchHHHHHHHHHHHHHHHcCCCCceeeccccccccCCCcccHHHHHhccCCCCccc
Confidence 2579999999999997765443221 00 0 00 0 000 00000
Q ss_pred --------cCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCc----EEEEEeCCCCcccCcccc---HH
Q 004574 655 --------MSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGAL----SRLVLLPFEHHVYAAREN---VM 719 (744)
Q Consensus 655 --------~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~----~~~~~~~~~~H~~~~~~~---~~ 719 (744)
-.......++.+|+|++.+++|+.||- ...-+++.++++.+-.+ ....++||++|....... .+
T Consensus 214 ~FSSDL~de~l~~tfG~v~~plLvl~Sg~DEyvP~-~vdk~~Ll~rw~~a~~~~~~s~~S~iI~GA~H~~~~~~~~~~~~ 292 (303)
T PF08538_consen 214 YFSSDLSDERLKKTFGKVSKPLLVLYSGKDEYVPP-WVDKEALLERWKAATNPKIWSPLSGIIPGASHNVSGPSQAEARE 292 (303)
T ss_dssp THHHHHTT-HHHHTGGG--S-EEEEEE--TT-------------------------------------------------
T ss_pred ccCCCCCHHHHHHHhccCCCceEEEecCCCceecc-cccccccccccccccccccccccccccccccccccccccccccc
Confidence 001123566889999999999999982 23445666776654332 235689999999874333 23
Q ss_pred HHHHHHHHHHH
Q 004574 720 HVIWETDRWLQ 730 (744)
Q Consensus 720 ~~~~~~~~fl~ 730 (744)
...+++..||+
T Consensus 293 ~l~~rV~~fl~ 303 (303)
T PF08538_consen 293 WLVERVVKFLK 303 (303)
T ss_dssp -----------
T ss_pred cccccccccCC
Confidence 56677777763
No 162
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.07 E-value=6.6e-08 Score=97.76 Aligned_cols=202 Identities=11% Similarity=0.084 Sum_probs=120.0
Q ss_pred ecCCCCeEEEeeecccccccCCCceeEEEEECCCCceecc-ccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCC
Q 004574 38 WSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPL-FESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKT 116 (744)
Q Consensus 38 ~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~l-t~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~ 116 (744)
=.|-|+.|+| .++..+++++++..+...+ |.+.. .+.-..+||.|.+|+.. +..
T Consensus 26 ~dpkgd~ilY-----------~nGksv~ir~i~~~~~~~iYtEH~~-----~vtVAkySPsG~yiASG--D~s------- 80 (603)
T KOG0318|consen 26 GDPKGDNILY-----------TNGKSVIIRNIDNPASVDIYTEHAH-----QVTVAKYSPSGFYIASG--DVS------- 80 (603)
T ss_pred cCCCCCeEEE-----------eCCCEEEEEECCCccceeeeccccc-----eeEEEEeCCCceEEeec--CCc-------
Confidence 3788999999 4557899999997665433 33322 23455899999977643 111
Q ss_pred CCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCC-----ceeeeeccCCCCceEEE
Q 004574 117 MVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTP-----AVYTAVEPSPDQKYVLI 191 (744)
Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~-----~~~~~~~~SpDG~~i~~ 191 (744)
+.+.+++.+.+..-|... +.+..++|++||++|+.
T Consensus 81 ----------------------------------------G~vRIWdtt~~~hiLKnef~v~aG~I~Di~Wd~ds~RI~a 120 (603)
T KOG0318|consen 81 ----------------------------------------GKVRIWDTTQKEHILKNEFQVLAGPIKDISWDFDSKRIAA 120 (603)
T ss_pred ----------------------------------------CcEEEEeccCcceeeeeeeeecccccccceeCCCCcEEEE
Confidence 455566664422222221 77889999999999998
Q ss_pred EEeeCCcccccccCCCcceEEEEe-------------------CCCCeeeeccCC------CCCCCCCcccCCccCC---
Q 004574 192 TSMHRPYSYKVPCARFSQKVQVWT-------------------TDGKLVRELCDL------PPAEDIPVCYNSVREG--- 243 (744)
Q Consensus 192 ~~~~~~~~~~~~~~~~~~~l~~~~-------------------~~g~~~~~l~~~------~~~~~~~~~~~~~~~~--- 243 (744)
....++ ++ ..+++|| +.-.+..++... ...++-|..|......
T Consensus 121 vGEGre--------rf-g~~F~~DSG~SvGei~GhSr~ins~~~KpsRPfRi~T~sdDn~v~ffeGPPFKFk~s~r~Hsk 191 (603)
T KOG0318|consen 121 VGEGRE--------RF-GHVFLWDSGNSVGEITGHSRRINSVDFKPSRPFRIATGSDDNTVAFFEGPPFKFKSSFREHSK 191 (603)
T ss_pred EecCcc--------ce-eEEEEecCCCccceeeccceeEeeeeccCCCceEEEeccCCCeEEEeeCCCeeeeeccccccc
Confidence 876542 11 1334443 332222222111 0112223333333221
Q ss_pred -CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeee----eccceeceeeccCCceEEEeee
Q 004574 244 -MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHK----LDLRFRSVSWCDDSLALVNETW 318 (744)
Q Consensus 244 -~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~----~~~~~~~~~~SpDg~~l~~~~~ 318 (744)
++.+.+||||+. ++-+. ..+.++++|- ++++..-... ..+.+..++||||+++++..+.
T Consensus 192 FV~~VRysPDG~~-Fat~g------------sDgki~iyDG---ktge~vg~l~~~~aHkGsIfalsWsPDs~~~~T~Sa 255 (603)
T KOG0318|consen 192 FVNCVRYSPDGSR-FATAG------------SDGKIYIYDG---KTGEKVGELEDSDAHKGSIFALSWSPDSTQFLTVSA 255 (603)
T ss_pred ceeeEEECCCCCe-EEEec------------CCccEEEEcC---CCccEEEEecCCCCccccEEEEEECCCCceEEEecC
Confidence 667899999986 33331 2335899887 5665443332 3667889999999999998773
Q ss_pred eeccceeEEEEcCCC
Q 004574 319 YKTSQTRTWLVCPGS 333 (744)
Q Consensus 319 ~~~~~~~l~~~~~~~ 333 (744)
+ ..+.++|+.+
T Consensus 256 D----kt~KIWdVs~ 266 (603)
T KOG0318|consen 256 D----KTIKIWDVST 266 (603)
T ss_pred C----ceEEEEEeec
Confidence 3 2355556655
No 163
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=99.07 E-value=3.9e-08 Score=106.06 Aligned_cols=272 Identities=18% Similarity=0.138 Sum_probs=152.3
Q ss_pred cceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCC
Q 004574 34 NFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPP 113 (744)
Q Consensus 34 ~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~ 113 (744)
....+||||++|||... ..++....|+++|+++|+...- .-. ......+.|++||+.|+|+.........
T Consensus 127 ~~~~~Spdg~~la~~~s-----~~G~e~~~l~v~Dl~tg~~l~d--~i~---~~~~~~~~W~~d~~~~~y~~~~~~~~~~ 196 (414)
T PF02897_consen 127 GGFSVSPDGKRLAYSLS-----DGGSEWYTLRVFDLETGKFLPD--GIE---NPKFSSVSWSDDGKGFFYTRFDEDQRTS 196 (414)
T ss_dssp EEEEETTTSSEEEEEEE-----ETTSSEEEEEEEETTTTEEEEE--EEE---EEESEEEEECTTSSEEEEEECSTTTSS-
T ss_pred eeeeECCCCCEEEEEec-----CCCCceEEEEEEECCCCcCcCC--ccc---ccccceEEEeCCCCEEEEEEeCcccccc
Confidence 36789999999999875 2336668899999999854321 101 0012248999999999998654322100
Q ss_pred CCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCe--eecCCC---ce-eeeeccCCCC
Q 004574 114 KKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTA--KDFGTP---AV-YTAVEPSPDQ 186 (744)
Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~--~~l~~~---~~-~~~~~~SpDG 186 (744)
......+||+..+ ++.. ..|... .. ...+..|+||
T Consensus 197 --------------------------------------~~~~~~~v~~~~~gt~~~~d~lvfe~~~~~~~~~~~~~s~d~ 238 (414)
T PF02897_consen 197 --------------------------------------DSGYPRQVYRHKLGTPQSEDELVFEEPDEPFWFVSVSRSKDG 238 (414)
T ss_dssp --------------------------------------CCGCCEEEEEEETTS-GGG-EEEEC-TTCTTSEEEEEE-TTS
T ss_pred --------------------------------------cCCCCcEEEEEECCCChHhCeeEEeecCCCcEEEEEEecCcc
Confidence 0001168999998 6622 344333 23 5578899999
Q ss_pred ceEEEEEeeCCcccccccCCCcceEEEEeCCCC-----eeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEE
Q 004574 187 KYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-----LVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVE 261 (744)
Q Consensus 187 ~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~ 261 (744)
++|++.+.... . ..++|+.++... ..+.+...... ... .....|.. +++..
T Consensus 239 ~~l~i~~~~~~--------~-~s~v~~~d~~~~~~~~~~~~~l~~~~~~-------------~~~-~v~~~~~~-~yi~T 294 (414)
T PF02897_consen 239 RYLFISSSSGT--------S-ESEVYLLDLDDGGSPDAKPKLLSPREDG-------------VEY-YVDHHGDR-LYILT 294 (414)
T ss_dssp SEEEEEEESSS--------S-EEEEEEEECCCTTTSS-SEEEEEESSSS--------------EE-EEEEETTE-EEEEE
T ss_pred cEEEEEEEccc--------c-CCeEEEEeccccCCCcCCcEEEeCCCCc-------------eEE-EEEccCCE-EEEee
Confidence 99998876642 1 258999998764 34444332111 111 12222555 66665
Q ss_pred eecCCCCCccCCccceEEeccCCCCCCCCce-EeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCC-CCCCcce
Q 004574 262 AQDRGDANVEVSPRDIIYTQPAEPAEGEKPE-ILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPG-SKDVAPR 339 (744)
Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~-~~~~~~~ 339 (744)
+.+. .+..|+.+++.....+... .+........--.++..+.+|++.. ..+...+|.++++. + ....
T Consensus 295 n~~a--------~~~~l~~~~l~~~~~~~~~~~l~~~~~~~~l~~~~~~~~~Lvl~~-~~~~~~~l~v~~~~~~--~~~~ 363 (414)
T PF02897_consen 295 NDDA--------PNGRLVAVDLADPSPAEWWTVLIPEDEDVSLEDVSLFKDYLVLSY-RENGSSRLRVYDLDDG--KESR 363 (414)
T ss_dssp -TT---------TT-EEEEEETTSTSGGGEEEEEE--SSSEEEEEEEEETTEEEEEE-EETTEEEEEEEETT-T--EEEE
T ss_pred CCCC--------CCcEEEEecccccccccceeEEcCCCCceeEEEEEEECCEEEEEE-EECCccEEEEEECCCC--cEEe
Confidence 4322 3446888888321111123 4444444333344556677787776 44577899999998 4 2222
Q ss_pred eeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEe
Q 004574 340 VLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIW 411 (744)
Q Consensus 340 ~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~ 411 (744)
.+.......+ ..+...+++..+.|... ++ ...+.+|.+|+.+++.+.+.
T Consensus 364 ~~~~p~~g~v------~~~~~~~~~~~~~~~~s--------------s~---~~P~~~y~~d~~t~~~~~~k 412 (414)
T PF02897_consen 364 EIPLPEAGSV------SGVSGDFDSDELRFSYS--------------SF---TTPPTVYRYDLATGELTLLK 412 (414)
T ss_dssp EEESSSSSEE------EEEES-TT-SEEEEEEE--------------ET---TEEEEEEEEETTTTCEEEEE
T ss_pred eecCCcceEE------eccCCCCCCCEEEEEEe--------------CC---CCCCEEEEEECCCCCEEEEE
Confidence 2221111111 11234566666666653 22 34567999999999987653
No 164
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.06 E-value=1e-08 Score=112.46 Aligned_cols=158 Identities=20% Similarity=0.220 Sum_probs=95.9
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEEC------C--CCceeccccCCC----ccccccccceEEecCCc
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADA------E--TGEAKPLFESPD----ICLNAVFGSFVWVNNST 99 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~------~--gg~~~~lt~~~~----~~~~~~~~~~~wspDg~ 99 (744)
.+.-.+|||||++||+.+. +.-..||-+.- - +|..+-+..... .++...+.++.||||++
T Consensus 71 sv~CVR~S~dG~~lAsGSD--------D~~v~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~Wsp~~~ 142 (942)
T KOG0973|consen 71 SVNCVRFSPDGSYLASGSD--------DRLVMIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVNWSPDDS 142 (942)
T ss_pred ceeEEEECCCCCeEeeccC--------cceEEEeeecccCCcccccccccccccceeeEEEEEecCCCccceeccCCCcc
Confidence 5667789999999999543 34455666552 1 111111111100 11222577899999999
Q ss_pred EEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ce
Q 004574 100 LLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AV 176 (744)
Q Consensus 100 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~ 176 (744)
+|+..+.+ +.|.+++. +. ..+.+..+ .-
T Consensus 143 ~lvS~s~D-------------------------------------------------nsViiwn~~tF~~~~vl~~H~s~ 173 (942)
T KOG0973|consen 143 LLVSVSLD-------------------------------------------------NSVIIWNAKTFELLKVLRGHQSL 173 (942)
T ss_pred EEEEeccc-------------------------------------------------ceEEEEccccceeeeeeeccccc
Confidence 99876432 46777777 55 34444444 67
Q ss_pred eeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC--CCCccceecCCC
Q 004574 177 YTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE--GMRSISWRADKP 254 (744)
Q Consensus 177 ~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~--~~~~~~~spDg~ 254 (744)
+-+.+|.|-|++++-.++++ .|-+|+...-.+......+.+ ..+. ..+.+.|||||+
T Consensus 174 VKGvs~DP~Gky~ASqsdDr-------------tikvwrt~dw~i~k~It~pf~--------~~~~~T~f~RlSWSPDG~ 232 (942)
T KOG0973|consen 174 VKGVSWDPIGKYFASQSDDR-------------TLKVWRTSDWGIEKSITKPFE--------ESPLTTFFLRLSWSPDGH 232 (942)
T ss_pred ccceEECCccCeeeeecCCc-------------eEEEEEcccceeeEeeccchh--------hCCCcceeeecccCCCcC
Confidence 77999999999998876654 677777544333333223222 1111 155789999999
Q ss_pred ceEEEEEeecCCCC
Q 004574 255 STLYWVEAQDRGDA 268 (744)
Q Consensus 255 ~~l~~~~~~~~~~~ 268 (744)
+ |+...+-+++..
T Consensus 233 ~-las~nA~n~~~~ 245 (942)
T KOG0973|consen 233 H-LASPNAVNGGKS 245 (942)
T ss_pred e-ecchhhccCCcc
Confidence 8 766655554443
No 165
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.05 E-value=7.4e-08 Score=88.33 Aligned_cols=269 Identities=11% Similarity=0.078 Sum_probs=161.9
Q ss_pred CCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccc
Q 004574 59 SCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMT 138 (744)
Q Consensus 59 ~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (744)
.+..-|.+..+.+|.=.+-.+..+. -+..+..+||++.|+....
T Consensus 17 ~YDhTIRfWqa~tG~C~rTiqh~ds----qVNrLeiTpdk~~LAaa~~-------------------------------- 60 (311)
T KOG0315|consen 17 GYDHTIRFWQALTGICSRTIQHPDS----QVNRLEITPDKKDLAAAGN-------------------------------- 60 (311)
T ss_pred cCcceeeeeehhcCeEEEEEecCcc----ceeeEEEcCCcchhhhccC--------------------------------
Confidence 4445566677788875444444332 3557789999999987521
Q ss_pred cccCCCchhhhccceeeeeEEEEEcC-CCCeeecCC---C-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEE
Q 004574 139 DNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGT---P-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQV 213 (744)
Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~---~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~ 213 (744)
.+|.++|+ +++..++.. + .++..+.|.-||++.+-. .++ ..+.+
T Consensus 61 ------------------qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTg-seD------------gt~kI 109 (311)
T KOG0315|consen 61 ------------------QHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTG-SED------------GTVKI 109 (311)
T ss_pred ------------------CeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEec-CCC------------ceEEE
Confidence 46777788 665544422 2 677888999999988644 332 36778
Q ss_pred EeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceE
Q 004574 214 WTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEI 293 (744)
Q Consensus 214 ~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 293 (744)
||+..-...++...+.. +..+...|+... |.... ..+.|+++|+. +..-..+
T Consensus 110 WdlR~~~~qR~~~~~sp-------------Vn~vvlhpnQte-Lis~d------------qsg~irvWDl~--~~~c~~~ 161 (311)
T KOG0315|consen 110 WDLRSLSCQRNYQHNSP-------------VNTVVLHPNQTE-LISGD------------QSGNIRVWDLG--ENSCTHE 161 (311)
T ss_pred EeccCcccchhccCCCC-------------cceEEecCCcce-EEeec------------CCCcEEEEEcc--CCccccc
Confidence 88776555554444322 557788999887 44331 23458888883 2222334
Q ss_pred eee-eccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCc--ceeeeccccccccCCCCC-CceeeCCCCCeEEE
Q 004574 294 LHK-LDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVA--PRVLFDRVFENVYSDPGS-PMMTRTSTGTNVIA 369 (744)
Q Consensus 294 l~~-~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~--~~~l~~~~~~~~~~~~~~-~~~~~spdg~~l~~ 369 (744)
|.+ .+..+.+++..|||+.++... .....|++++.+.... ...+++ +.. +.+. ..-.+|||+++|+.
T Consensus 162 liPe~~~~i~sl~v~~dgsml~a~n----nkG~cyvW~l~~~~~~s~l~P~~k--~~a---h~~~il~C~lSPd~k~lat 232 (311)
T KOG0315|consen 162 LIPEDDTSIQSLTVMPDGSMLAAAN----NKGNCYVWRLLNHQTASELEPVHK--FQA---HNGHILRCLLSPDVKYLAT 232 (311)
T ss_pred cCCCCCcceeeEEEcCCCcEEEEec----CCccEEEEEccCCCccccceEhhh--eec---ccceEEEEEECCCCcEEEe
Confidence 443 356788999999999988654 3456888887764211 112222 111 1111 11238999999999
Q ss_pred EeeecCCcceEEEEccCCCCCCCCCceEEEEecCCC-ceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecC
Q 004574 370 KIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTG-SKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESK 448 (744)
Q Consensus 370 ~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~ 448 (744)
.+.+ ..+++|+.++- +.+...+.... +.| .++||.||+.|+... +.
T Consensus 233 ~ssd---------------------ktv~iwn~~~~~kle~~l~gh~r------WvW-----dc~FS~dg~YlvTas-sd 279 (311)
T KOG0315|consen 233 CSSD---------------------KTVKIWNTDDFFKLELVLTGHQR------WVW-----DCAFSADGEYLVTAS-SD 279 (311)
T ss_pred ecCC---------------------ceEEEEecCCceeeEEEeecCCc------eEE-----eeeeccCccEEEecC-CC
Confidence 8632 23667776555 33333333322 223 269999998766433 32
Q ss_pred CCCceEEEEECCCCceeee
Q 004574 449 TEITQYHILSWPLKKSSQI 467 (744)
Q Consensus 449 ~~~~~i~~~~~~~g~~~~l 467 (744)
....+|++..++..+-
T Consensus 280 ---~~~rlW~~~~~k~v~q 295 (311)
T KOG0315|consen 280 ---HTARLWDLSAGKEVRQ 295 (311)
T ss_pred ---CceeecccccCceeee
Confidence 3456677777765543
No 166
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=99.04 E-value=4.6e-07 Score=92.14 Aligned_cols=289 Identities=13% Similarity=0.064 Sum_probs=148.3
Q ss_pred ceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEec-CCCCCCCCCCCCCCCCeeeecCCCccccccccc
Q 004574 61 KLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIP-SSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTD 139 (744)
Q Consensus 61 ~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (744)
.++|+++|.++++......... . .....||||+.||.+.. -.++..
T Consensus 26 ~~~v~ViD~~~~~v~g~i~~G~---~---P~~~~spDg~~lyva~~~~~R~~~--------------------------- 72 (352)
T TIGR02658 26 TTQVYTIDGEAGRVLGMTDGGF---L---PNPVVASDGSFFAHASTVYSRIAR--------------------------- 72 (352)
T ss_pred CceEEEEECCCCEEEEEEEccC---C---CceeECCCCCEEEEEecccccccc---------------------------
Confidence 3899999999887644322211 1 12258999999988733 000000
Q ss_pred ccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC--------ceeeeeccCCCCceEEEEEeeCCcccccccCCCcc
Q 004574 140 NLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP--------AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQ 209 (744)
Q Consensus 140 ~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~--------~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~ 209 (744)
......|-++|+ ++ ...+|..+ .....+++||||++|++..... ..
T Consensus 73 -------------G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p-----------~~ 128 (352)
T TIGR02658 73 -------------GKRTDYVEVIDPQTHLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSP-----------SP 128 (352)
T ss_pred -------------CCCCCEEEEEECccCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCCC-----------CC
Confidence 001146778888 66 44445432 1223678999999998664331 14
Q ss_pred eEEEEeCCCCeeee-ccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCC
Q 004574 210 KVQVWTTDGKLVRE-LCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEG 288 (744)
Q Consensus 210 ~l~~~~~~g~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 288 (744)
.+-++|+..++... +.........+ ......+..+.||+. +......++. ....+
T Consensus 129 ~V~VvD~~~~kvv~ei~vp~~~~vy~------t~e~~~~~~~~Dg~~-~~v~~d~~g~-----------~~~~~------ 184 (352)
T TIGR02658 129 AVGVVDLEGKAFVRMMDVPDCYHIFP------TANDTFFMHCRDGSL-AKVGYGTKGN-----------PKIKP------ 184 (352)
T ss_pred EEEEEECCCCcEEEEEeCCCCcEEEE------ecCCccEEEeecCce-EEEEecCCCc-----------eEEee------
Confidence 78889988665433 33211110000 000112344566654 2222111110 00000
Q ss_pred CCceEeeee--ccceeceeecc-CCceEEEeeeeeccceeEEEEcCCCCCCccee----eecccc-ccccCCCC--CCce
Q 004574 289 EKPEILHKL--DLRFRSVSWCD-DSLALVNETWYKTSQTRTWLVCPGSKDVAPRV----LFDRVF-ENVYSDPG--SPMM 358 (744)
Q Consensus 289 ~~~~~l~~~--~~~~~~~~~Sp-Dg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~----l~~~~~-~~~~~~~~--~~~~ 358 (744)
..++.. ..-...|.+++ ||++++.+. . ..++.+|+.+....... ++.... ..+ .|+ .+ +
T Consensus 185 ---~~vf~~~~~~v~~rP~~~~~dg~~~~vs~-e----G~V~~id~~~~~~~~~~~~~~~~~~~~~~~w--rP~g~q~-i 253 (352)
T TIGR02658 185 ---TEVFHPEDEYLINHPAYSNKSGRLVWPTY-T----GKIFQIDLSSGDAKFLPAIEAFTEAEKADGW--RPGGWQQ-V 253 (352)
T ss_pred ---eeeecCCccccccCCceEcCCCcEEEEec-C----CeEEEEecCCCcceecceeeecccccccccc--CCCccee-E
Confidence 011111 11123345667 776665544 2 57999997663111111 111111 011 112 23 5
Q ss_pred eeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCC
Q 004574 359 TRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQ 438 (744)
Q Consensus 359 ~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~ 438 (744)
++++||++++...++. ...++.....+++++|..+++...-+....+ .. .+++|||+
T Consensus 254 a~~~dg~~lyV~~~~~-----------~~~thk~~~~~V~ViD~~t~kvi~~i~vG~~-----~~-------~iavS~Dg 310 (352)
T TIGR02658 254 AYHRARDRIYLLADQR-----------AKWTHKTASRFLFVVDAKTGKRLRKIELGHE-----ID-------SINVSQDA 310 (352)
T ss_pred EEcCCCCEEEEEecCC-----------ccccccCCCCEEEEEECCCCeEEEEEeCCCc-----ee-------eEEECCCC
Confidence 6788888887744221 1123334456899999999988665543322 22 25899999
Q ss_pred CEEEEEEecCCCCceEEEEECCCCceee
Q 004574 439 LKILTSKESKTEITQYHILSWPLKKSSQ 466 (744)
Q Consensus 439 ~~~~~~~~~~~~~~~i~~~~~~~g~~~~ 466 (744)
+-++|..+.. ...+..+|..+++..+
T Consensus 311 kp~lyvtn~~--s~~VsViD~~t~k~i~ 336 (352)
T TIGR02658 311 KPLLYALSTG--DKTLYIFDAETGKELS 336 (352)
T ss_pred CeEEEEeCCC--CCcEEEEECcCCeEEe
Confidence 8566655432 3458899988876543
No 167
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.04 E-value=4.9e-09 Score=102.52 Aligned_cols=209 Identities=18% Similarity=0.289 Sum_probs=130.2
Q ss_pred CCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecC
Q 004574 28 PDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPS 107 (744)
Q Consensus 28 ~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~ 107 (744)
.+...+....||||.++|+--. ....+++.|+++|+.+.+..... .-.+...+|-|||..++..+++
T Consensus 267 gh~~~V~yi~wSPDdryLlaCg----------~~e~~~lwDv~tgd~~~~y~~~~---~~S~~sc~W~pDg~~~V~Gs~d 333 (519)
T KOG0293|consen 267 GHSQPVSYIMWSPDDRYLLACG----------FDEVLSLWDVDTGDLRHLYPSGL---GFSVSSCAWCPDGFRFVTGSPD 333 (519)
T ss_pred cccCceEEEEECCCCCeEEecC----------chHheeeccCCcchhhhhcccCc---CCCcceeEEccCCceeEecCCC
Confidence 3344688999999999987632 12348888999999887754431 1135688999999998765432
Q ss_pred CCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCC---ceeeeeccCC
Q 004574 108 SRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTP---AVYTAVEPSP 184 (744)
Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~---~~~~~~~~Sp 184 (744)
.+++.++++|+...--.. ..+..++.++
T Consensus 334 -------------------------------------------------r~i~~wdlDgn~~~~W~gvr~~~v~dlait~ 364 (519)
T KOG0293|consen 334 -------------------------------------------------RTIIMWDLDGNILGNWEGVRDPKVHDLAITY 364 (519)
T ss_pred -------------------------------------------------CcEEEecCCcchhhcccccccceeEEEEEcC
Confidence 578888998844211111 4577899999
Q ss_pred CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEee-
Q 004574 185 DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQ- 263 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~- 263 (744)
||++++....+. .|.+|+.+....+.+.... .++..+..|.||+..|+-....
T Consensus 365 Dgk~vl~v~~d~-------------~i~l~~~e~~~dr~lise~-------------~~its~~iS~d~k~~LvnL~~qe 418 (519)
T KOG0293|consen 365 DGKYVLLVTVDK-------------KIRLYNREARVDRGLISEE-------------QPITSFSISKDGKLALVNLQDQE 418 (519)
T ss_pred CCcEEEEEeccc-------------ceeeechhhhhhhcccccc-------------CceeEEEEcCCCcEEEEEcccCe
Confidence 999998876442 6777776655444222111 1145677888887433333221
Q ss_pred -----------------------------cCCCCCc--cCCccceEEeccCCCCCCCCce-EeeeeccceeceeeccCCc
Q 004574 264 -----------------------------DRGDANV--EVSPRDIIYTQPAEPAEGEKPE-ILHKLDLRFRSVSWCDDSL 311 (744)
Q Consensus 264 -----------------------------~~~~~~~--~~~~~~~l~~~~~~~~~~~~~~-~l~~~~~~~~~~~~SpDg~ 311 (744)
.+..... .-+..++||+++. .++.+. .|+.....++-++|+|...
T Consensus 419 i~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg~~~~fiaSGSED~kvyIWhr---~sgkll~~LsGHs~~vNcVswNP~~p 495 (519)
T KOG0293|consen 419 IHLWDLEENKLVRKYFGHKQGHFIIRSCFGGGNDKFIASGSEDSKVYIWHR---ISGKLLAVLSGHSKTVNCVSWNPADP 495 (519)
T ss_pred eEEeecchhhHHHHhhcccccceEEEeccCCCCcceEEecCCCceEEEEEc---cCCceeEeecCCcceeeEEecCCCCH
Confidence 1111111 1133456888887 455544 4555567788889999888
Q ss_pred eEEEeeeeeccceeEEE
Q 004574 312 ALVNETWYKTSQTRTWL 328 (744)
Q Consensus 312 ~l~~~~~~~~~~~~l~~ 328 (744)
.++.++ .+++..+||-
T Consensus 496 ~m~ASa-sDDgtIRIWg 511 (519)
T KOG0293|consen 496 EMFASA-SDDGTIRIWG 511 (519)
T ss_pred HHhhcc-CCCCeEEEec
Confidence 766655 3335555554
No 168
>PRK07868 acyl-CoA synthetase; Validated
Probab=99.04 E-value=1.1e-08 Score=121.82 Aligned_cols=74 Identities=18% Similarity=0.143 Sum_probs=57.3
Q ss_pred cCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEE-EEeCCCCcccC--ccccHHHHHHHHHHHHHHhccCCC
Q 004574 661 ANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRL-VLLPFEHHVYA--ARENVMHVIWETDRWLQKYCLSNT 737 (744)
Q Consensus 661 ~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~H~~~--~~~~~~~~~~~~~~fl~~~l~~~~ 737 (744)
+.++++|+|+++|++|.++| +..++.+.+.+.. .++ .+++++||... .......++..+.+||.++-....
T Consensus 293 L~~i~~P~L~i~G~~D~ivp--~~~~~~l~~~i~~----a~~~~~~~~~GH~g~~~g~~a~~~~wp~i~~wl~~~~~~~~ 366 (994)
T PRK07868 293 LADITCPVLAFVGEVDDIGQ--PASVRGIRRAAPN----AEVYESLIRAGHFGLVVGSRAAQQTWPTVADWVKWLEGDGD 366 (994)
T ss_pred hhhCCCCEEEEEeCCCCCCC--HHHHHHHHHhCCC----CeEEEEeCCCCCEeeeechhhhhhhChHHHHHHHHhccCCC
Confidence 56789999999999999999 8888888765532 355 67789999743 345567889999999999876655
Q ss_pred CCC
Q 004574 738 SDG 740 (744)
Q Consensus 738 ~~~ 740 (744)
+++
T Consensus 367 ~~~ 369 (994)
T PRK07868 367 KPE 369 (994)
T ss_pred CCc
Confidence 443
No 169
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.03 E-value=2.3e-07 Score=97.85 Aligned_cols=284 Identities=13% Similarity=0.165 Sum_probs=164.5
Q ss_pred CCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCC
Q 004574 30 GAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSR 109 (744)
Q Consensus 30 ~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~ 109 (744)
.........||||++||-.+. +++..|| +...|-= -+|+.++ ..++..+.|+.+|+.|+.++-+
T Consensus 350 ~~~i~~l~YSpDgq~iaTG~e--------DgKVKvW--n~~SgfC-~vTFteH---ts~Vt~v~f~~~g~~llssSLD-- 413 (893)
T KOG0291|consen 350 SDRITSLAYSPDGQLIATGAE--------DGKVKVW--NTQSGFC-FVTFTEH---TSGVTAVQFTARGNVLLSSSLD-- 413 (893)
T ss_pred ccceeeEEECCCCcEEEeccC--------CCcEEEE--eccCceE-EEEeccC---CCceEEEEEEecCCEEEEeecC--
Confidence 336778899999999998543 5555555 6444431 1233332 1157789999999988876432
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC--ceeeeeccCCC
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP--AVYTAVEPSPD 185 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~--~~~~~~~~SpD 185 (744)
+.+..+|+ .+ ..+.++.+ .....++..|.
T Consensus 414 -----------------------------------------------GtVRAwDlkRYrNfRTft~P~p~QfscvavD~s 446 (893)
T KOG0291|consen 414 -----------------------------------------------GTVRAWDLKRYRNFRTFTSPEPIQFSCVAVDPS 446 (893)
T ss_pred -----------------------------------------------CeEEeeeecccceeeeecCCCceeeeEEEEcCC
Confidence 45666677 55 66767666 44556667777
Q ss_pred CceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecC
Q 004574 186 QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDR 265 (744)
Q Consensus 186 G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~ 265 (744)
|.-|.....+ ..+|++|+..+++...+..++-. .+..++|+|+|+. |+-.+
T Consensus 447 GelV~AG~~d------------~F~IfvWS~qTGqllDiLsGHEg------------PVs~l~f~~~~~~-LaS~S---- 497 (893)
T KOG0291|consen 447 GELVCAGAQD------------SFEIFVWSVQTGQLLDILSGHEG------------PVSGLSFSPDGSL-LASGS---- 497 (893)
T ss_pred CCEEEeeccc------------eEEEEEEEeecCeeeehhcCCCC------------cceeeEEccccCe-EEecc----
Confidence 8754433322 25899999997766555444211 1667899999985 55442
Q ss_pred CCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC------------
Q 004574 266 GDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS------------ 333 (744)
Q Consensus 266 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~------------ 333 (744)
+...|-+++.- ......+-......+..++|+|||+.|+.++-+ + +|-++|...
T Consensus 498 --------WDkTVRiW~if--~s~~~vEtl~i~sdvl~vsfrPdG~elaVaTld--g--qItf~d~~~~~q~~~IdgrkD 563 (893)
T KOG0291|consen 498 --------WDKTVRIWDIF--SSSGTVETLEIRSDVLAVSFRPDGKELAVATLD--G--QITFFDIKEAVQVGSIDGRKD 563 (893)
T ss_pred --------ccceEEEEEee--ccCceeeeEeeccceeEEEEcCCCCeEEEEEec--c--eEEEEEhhhceeeccccchhh
Confidence 11234555541 222223333345667789999999999988722 1 233333332
Q ss_pred ---CCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEE
Q 004574 334 ---KDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERI 410 (744)
Q Consensus 334 ---~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l 410 (744)
++...-.++..+... .-....++.|+||+.|+.. +..+.+.+++..++-.-+.
T Consensus 564 ~~~gR~~~D~~ta~~sa~---~K~Ftti~ySaDG~~IlAg---------------------G~sn~iCiY~v~~~vllkk 619 (893)
T KOG0291|consen 564 LSGGRKETDRITAENSAK---GKTFTTICYSADGKCILAG---------------------GESNSICIYDVPEGVLLKK 619 (893)
T ss_pred ccccccccceeehhhccc---CCceEEEEEcCCCCEEEec---------------------CCcccEEEEECchhheeee
Confidence 111111222221110 0011236689999988665 2345689999988866544
Q ss_pred eeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECC
Q 004574 411 WESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWP 460 (744)
Q Consensus 411 ~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~ 460 (744)
+.-.. +-|.||-.-++.+...+.-+.+-++|-+
T Consensus 620 fqiS~-----------------N~sLdg~~efln~rkmtEfG~~~LiD~e 652 (893)
T KOG0291|consen 620 FQISD-----------------NRSLDGVLEFLNRRKMTEFGNMDLIDTE 652 (893)
T ss_pred EEecc-----------------ccchhHHHHHhccccccccCCccccccc
Confidence 43221 2344453444555555666667777653
No 170
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=99.01 E-value=1.6e-07 Score=98.09 Aligned_cols=300 Identities=10% Similarity=0.040 Sum_probs=148.3
Q ss_pred eeEEEEEC--CCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCccccccccc
Q 004574 62 LRVWIADA--ETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTD 139 (744)
Q Consensus 62 ~~l~~~~~--~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (744)
..|+++.+ ++|+...+..... .. .-..++++|++++||.+.......
T Consensus 13 ~gI~~~~~d~~~g~l~~~~~~~~-~~--~Ps~l~~~~~~~~LY~~~e~~~~~---------------------------- 61 (345)
T PF10282_consen 13 GGIYVFRFDEETGTLTLVQTVAE-GE--NPSWLAVSPDGRRLYVVNEGSGDS---------------------------- 61 (345)
T ss_dssp TEEEEEEEETTTTEEEEEEEEEE-SS--SECCEEE-TTSSEEEEEETTSSTT----------------------------
T ss_pred CcEEEEEEcCCCCCceEeeeecC-CC--CCceEEEEeCCCEEEEEEccccCC----------------------------
Confidence 34555544 6777665532111 11 235778999999998875431000
Q ss_pred ccCCCchhhhccceeeeeEE--EEEcCC-CCeeecC----CCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEE
Q 004574 140 NLLKDEYDESLFDYYTTAQL--VLGSLD-GTAKDFG----TPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQ 212 (744)
Q Consensus 140 ~~~~~~~~~~~~~~~~~~~l--~~~~~~-G~~~~l~----~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~ 212 (744)
+.| |.++-+ |+.+.+. .......++.+|||++|+...-.. ..+.
T Consensus 62 -----------------g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i~~~~~g~~l~vany~~------------g~v~ 112 (345)
T PF10282_consen 62 -----------------GGVSSYRIDPDTGTLTLLNSVPSGGSSPCHIAVDPDGRFLYVANYGG------------GSVS 112 (345)
T ss_dssp -----------------TEEEEEEEETTTTEEEEEEEEEESSSCEEEEEECTTSSEEEEEETTT------------TEEE
T ss_pred -----------------CCEEEEEECCCcceeEEeeeeccCCCCcEEEEEecCCCEEEEEEccC------------CeEE
Confidence 234 444444 6554442 224556788999999998774332 3566
Q ss_pred EEeCCC-Ceeeec---cCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCC
Q 004574 213 VWTTDG-KLVREL---CDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEG 288 (744)
Q Consensus 213 ~~~~~g-~~~~~l---~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 288 (744)
+++++. +..... ....+. .|.........+..+.++|||+. ++.. .. ..+.|++++.+. .+
T Consensus 113 v~~l~~~g~l~~~~~~~~~~g~--g~~~~rq~~~h~H~v~~~pdg~~-v~v~-dl----------G~D~v~~~~~~~-~~ 177 (345)
T PF10282_consen 113 VFPLDDDGSLGEVVQTVRHEGS--GPNPDRQEGPHPHQVVFSPDGRF-VYVP-DL----------GADRVYVYDIDD-DT 177 (345)
T ss_dssp EEEECTTSEEEEEEEEEESEEE--ESSTTTTSSTCEEEEEE-TTSSE-EEEE-ET----------TTTEEEEEEE-T-TS
T ss_pred EEEccCCcccceeeeecccCCC--CCcccccccccceeEEECCCCCE-EEEE-ec----------CCCEEEEEEEeC-CC
Confidence 666653 222222 111100 00000000111346889999996 5444 22 223477777621 22
Q ss_pred CCceEee----eeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeecccc--ccccCCCCCCceeeCC
Q 004574 289 EKPEILH----KLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVF--ENVYSDPGSPMMTRTS 362 (744)
Q Consensus 289 ~~~~~l~----~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~sp 362 (744)
++..... .....-+.++|+|||++++... ..++.-.++.++... +..+.+..... ...........+..||
T Consensus 178 ~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~-e~s~~v~v~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~i~isp 254 (345)
T PF10282_consen 178 GKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVN-ELSNTVSVFDYDPSD--GSLTEIQTISTLPEGFTGENAPAEIAISP 254 (345)
T ss_dssp -TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEE-TTTTEEEEEEEETTT--TEEEEEEEEESCETTSCSSSSEEEEEE-T
T ss_pred ceEEEeeccccccCCCCcEEEEcCCcCEEEEec-CCCCcEEEEeecccC--CceeEEEEeeeccccccccCCceeEEEec
Confidence 2222211 1133456789999999877654 333433444444223 33322211111 0100000112256899
Q ss_pred CCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEec--CCCceeEEeeccchhhhhheeeeecCCcceecccCCCE
Q 004574 363 TGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDI--NTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLK 440 (744)
Q Consensus 363 dg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~--~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~ 440 (744)
||++|+..... ...|.++++ .+|+.+.+-..... -.....++++|||+.
T Consensus 255 dg~~lyvsnr~--------------------~~sI~vf~~d~~~g~l~~~~~~~~~---------G~~Pr~~~~s~~g~~ 305 (345)
T PF10282_consen 255 DGRFLYVSNRG--------------------SNSISVFDLDPATGTLTLVQTVPTG---------GKFPRHFAFSPDGRY 305 (345)
T ss_dssp TSSEEEEEECT--------------------TTEEEEEEECTTTTTEEEEEEEEES---------SSSEEEEEE-TTSSE
T ss_pred CCCEEEEEecc--------------------CCEEEEEEEecCCCceEEEEEEeCC---------CCCccEEEEeCCCCE
Confidence 99988776411 223555554 66777655332110 001233689999987
Q ss_pred EEEEEecCCCCceEEEEECCCCceeeeec
Q 004574 441 ILTSKESKTEITQYHILSWPLKKSSQITN 469 (744)
Q Consensus 441 ~~~~~~~~~~~~~i~~~~~~~g~~~~lt~ 469 (744)
|+... .....-.+|.+|.++|+++.+..
T Consensus 306 l~Va~-~~s~~v~vf~~d~~tG~l~~~~~ 333 (345)
T PF10282_consen 306 LYVAN-QDSNTVSVFDIDPDTGKLTPVGS 333 (345)
T ss_dssp EEEEE-TTTTEEEEEEEETTTTEEEEEEE
T ss_pred EEEEe-cCCCeEEEEEEeCCCCcEEEecc
Confidence 76644 44444668888888998877653
No 171
>COG3571 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=99.00 E-value=3.4e-08 Score=84.25 Aligned_cols=160 Identities=13% Similarity=0.100 Sum_probs=102.9
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccC-CCCchhHHHHHhCCeEEEe--cCCCCCCCCC-------CCChHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFS-GMTPTSSLIFLARRFAVLA--GPSIPIIGEG-------DKLPNDSAEAAV 582 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~G~~v~~--~~~~~~~g~g-------~~~~~~d~~~~~ 582 (744)
-.+||+.||.|-. +. ..+...+..|+.+|+.|.. -+|+..+..+ ..........++
T Consensus 14 ~~tilLaHGAGas--------------mdSt~m~~~a~~la~~G~~vaRfefpYma~Rrtg~rkPp~~~~t~~~~~~~~~ 79 (213)
T COG3571 14 PVTILLAHGAGAS--------------MDSTSMTAVAAALARRGWLVARFEFPYMAARRTGRRKPPPGSGTLNPEYIVAI 79 (213)
T ss_pred CEEEEEecCCCCC--------------CCCHHHHHHHHHHHhCceeEEEeecchhhhccccCCCCcCccccCCHHHHHHH
Confidence 3578899997521 11 1223456688899999886 2222222222 222333566666
Q ss_pred HHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccC
Q 004574 583 EEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHAN 662 (744)
Q Consensus 583 ~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 662 (744)
.+|++.. +..++++-|+||||-++.+++..---.+.+++|++-++..... +.. --..|+.
T Consensus 80 aql~~~l--~~gpLi~GGkSmGGR~aSmvade~~A~i~~L~clgYPfhppGK----Pe~--------------~Rt~HL~ 139 (213)
T COG3571 80 AQLRAGL--AEGPLIIGGKSMGGRVASMVADELQAPIDGLVCLGYPFHPPGK----PEQ--------------LRTEHLT 139 (213)
T ss_pred HHHHhcc--cCCceeeccccccchHHHHHHHhhcCCcceEEEecCccCCCCC----ccc--------------chhhhcc
Confidence 6777653 4468999999999999999986643347788877654321111 110 1124677
Q ss_pred CCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccC
Q 004574 663 KIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYA 713 (744)
Q Consensus 663 ~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~ 713 (744)
.+++|+||.||+.|++-. .++...+ ....+++++++.++.|..-
T Consensus 140 gl~tPtli~qGtrD~fGt--r~~Va~y-----~ls~~iev~wl~~adHDLk 183 (213)
T COG3571 140 GLKTPTLITQGTRDEFGT--RDEVAGY-----ALSDPIEVVWLEDADHDLK 183 (213)
T ss_pred CCCCCeEEeecccccccC--HHHHHhh-----hcCCceEEEEeccCccccc
Confidence 899999999999999865 5544211 2346789999999999764
No 172
>PF03583 LIP: Secretory lipase ; InterPro: IPR005152 This entry represents a family of secreted lipases. Family members include the LIP lipases from Candida albicans, which are expressed and secreted during the infection cycle of these pathogens [].; GO: 0004806 triglyceride lipase activity, 0016042 lipid catabolic process
Probab=99.00 E-value=1.5e-08 Score=101.88 Aligned_cols=67 Identities=24% Similarity=0.320 Sum_probs=57.0
Q ss_pred CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCC-CcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhccCCC
Q 004574 665 KKPILIIHGEVDDKVGLFPMQAERFFDALKGHG-ALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCLSNT 737 (744)
Q Consensus 665 ~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~~~~ 737 (744)
+.|++|.||..|.+|| +..+.++++.+.++| .+++++.++..+|.... .......++||.+.+...+
T Consensus 219 ~~Pv~i~~g~~D~vvP--~~~~~~l~~~~c~~G~a~V~~~~~~~~~H~~~~----~~~~~~a~~Wl~~rf~G~~ 286 (290)
T PF03583_consen 219 TVPVLIYQGTADEVVP--PADTDALVAKWCAAGGADVEYVRYPGGGHLGAA----FASAPDALAWLDDRFAGKP 286 (290)
T ss_pred CCCEEEEecCCCCCCC--hHHHHHHHHHHHHcCCCCEEEEecCCCChhhhh----hcCcHHHHHHHHHHHCCCC
Confidence 6999999999999999 999999999999999 89999999999996432 2234567899999987644
No 173
>PF00561 Abhydrolase_1: alpha/beta hydrolase fold A web page of Esterases and alpha/beta hydrolases.; InterPro: IPR000073 The alpha/beta hydrolase fold [] is common to a number of hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is an alpha/beta-sheet (rather than a barrel), containing 8 strands connected by helices []. The enzymes are believed to have diverged from a common ancestor, preserving the arrangement of the catalytic residues. All have a catalytic triad, the elements of which are borne on loops, which are the best conserved structural features of the fold. Esterase (EST) from Pseudomonas putida is a member of the alpha/beta hydrolase fold superfamily of enzymes []. In most of the family members the beta-strands are parallels, but some have an inversion of the first strands, which gives it an antiparallel orientation. The catalytic triad residues are presented on loops. One of these is the nucleophile elbow and is the most conserved feature of the fold. Some other members lack one or all of the catalytic residues. Some members are therefore inactive but others are involved in surface recognition. The ESTHER database [] gathers and annotates all the published information related to gene and protein sequences of this superfamily []. This entry represents fold-1 of alpha/beta hydrolase.; PDB: 2VAT_E 2VAX_C 2VAV_H 2PSJ_A 2PSH_B 2PSE_A 2PSF_A 2PSD_A 2EDA_A 1CIJ_A ....
Probab=98.97 E-value=9.2e-10 Score=108.36 Aligned_cols=141 Identities=23% Similarity=0.227 Sum_probs=92.3
Q ss_pred HHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCC--CC----CCCCC-ccc--------c
Q 004574 576 DSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSY--NK----TLTPF-GFQ--------T 640 (744)
Q Consensus 576 ~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~--~~----~~~~~-~~~--------~ 640 (744)
+|+.+.++.+++...+ +++.++||||||.+++.++.++|++++++|+++++. .. ..... ... .
T Consensus 28 ~~~~~~~~~~~~~l~~--~~~~~vG~S~Gg~~~~~~a~~~p~~v~~lvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 105 (230)
T PF00561_consen 28 DDLAADLEALREALGI--KKINLVGHSMGGMLALEYAAQYPERVKKLVLISPPPDLPDGLWNRIWPRGNLQGQLLDNFFN 105 (230)
T ss_dssp HHHHHHHHHHHHHHTT--SSEEEEEETHHHHHHHHHHHHSGGGEEEEEEESESSHHHHHHHHHCHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCC--CCeEEEEECCChHHHHHHHHHCchhhcCcEEEeeeccchhhhhHHHHhhhhhhhhHHHhhhc
Confidence 4889999999886555 459999999999999999999999999999999851 00 00000 000 0
Q ss_pred ----------------------c-ccch--------hhc--------------HHHHHhcCcccccCCCCCCEEEEeeCC
Q 004574 641 ----------------------E-FRTL--------WEA--------------TNVYIEMSPITHANKIKKPILIIHGEV 675 (744)
Q Consensus 641 ----------------------~-~~~~--------~~~--------------~~~~~~~~~~~~~~~~~~P~l~i~G~~ 675 (744)
. .... +.. ...+........+.++++|+|+++|++
T Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~p~l~i~~~~ 185 (230)
T PF00561_consen 106 FLSDPIKPLLGRWPKQFFAYDREFVEDFLKQFQSQQYARFAETDAFDNMFWNALGYFSVWDPSPALSNIKVPTLIIWGED 185 (230)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHHTCHHHHHHHHHHHHHHHHHHHHHHHHHTTTTSEEEEEEETT
T ss_pred cccccchhhhhhhhhheeeccCccccchhhccchhhhhHHHHHHHHhhhccccccccccccccccccccCCCeEEEEeCC
Confidence 0 0000 000 000111122334567999999999999
Q ss_pred CCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHH
Q 004574 676 DDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWET 725 (744)
Q Consensus 676 D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~ 725 (744)
|.++| +.....+.+.+ ...++++++++||... ......+.+.+
T Consensus 186 D~~~p--~~~~~~~~~~~----~~~~~~~~~~~GH~~~-~~~~~~~~~~i 228 (230)
T PF00561_consen 186 DPLVP--PESSEQLAKLI----PNSQLVLIEGSGHFAF-LEGPDEFNEII 228 (230)
T ss_dssp CSSSH--HHHHHHHHHHS----TTEEEEEETTCCSTHH-HHSHHHHHHHH
T ss_pred CCCCC--HHHHHHHHHhc----CCCEEEECCCCChHHH-hcCHHhhhhhh
Confidence 99998 77777654443 3479999999999865 44454544443
No 174
>PF09752 DUF2048: Uncharacterized conserved protein (DUF2048); InterPro: IPR019149 This family of proteins has no known function.
Probab=98.96 E-value=2.6e-08 Score=98.67 Aligned_cols=211 Identities=16% Similarity=0.064 Sum_probs=125.3
Q ss_pred EEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEe--cCCCCCCCCC-
Q 004574 494 PLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLA--GPSIPIIGEG- 570 (744)
Q Consensus 494 ~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~--~~~~~~~g~g- 570 (744)
..+..++.|...++ ..+|++|.+.|.| .+.|..+....+..|++.|++.+. .++++.+-.-
T Consensus 76 ~a~~~~~~P~~~~~---~~rp~~IhLagTG-------------Dh~f~rR~~l~a~pLl~~gi~s~~le~Pyyg~RkP~~ 139 (348)
T PF09752_consen 76 TARFQLLLPKRWDS---PYRPVCIHLAGTG-------------DHGFWRRRRLMARPLLKEGIASLILENPYYGQRKPKD 139 (348)
T ss_pred heEEEEEECCcccc---CCCceEEEecCCC-------------ccchhhhhhhhhhHHHHcCcceEEEecccccccChhH
Confidence 34455666876422 2389999999965 134444444447788888997765 4554443311
Q ss_pred ---CC------------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC-C-
Q 004574 571 ---DK------------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT-L- 633 (744)
Q Consensus 571 ---~~------------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~-~- 633 (744)
.. ....++...+.|+.++++ .++++.|.||||.+|..+++.+|..+..+-++++..... +
T Consensus 140 Q~~s~l~~VsDl~~~g~~~i~E~~~Ll~Wl~~~G~---~~~g~~G~SmGG~~A~laa~~~p~pv~~vp~ls~~sAs~vFt 216 (348)
T PF09752_consen 140 QRRSSLRNVSDLFVMGRATILESRALLHWLEREGY---GPLGLTGISMGGHMAALAASNWPRPVALVPCLSWSSASVVFT 216 (348)
T ss_pred hhcccccchhHHHHHHhHHHHHHHHHHHHHHhcCC---CceEEEEechhHhhHHhhhhcCCCceeEEEeecccCCCcchh
Confidence 10 122378889999999864 589999999999999999999998777666666532210 0
Q ss_pred -------CCCcccccc-------------------------cchhhcHHH----HHhcCcccccCCCC-----CCEEEEe
Q 004574 634 -------TPFGFQTEF-------------------------RTLWEATNV----YIEMSPITHANKIK-----KPILIIH 672 (744)
Q Consensus 634 -------~~~~~~~~~-------------------------~~~~~~~~~----~~~~~~~~~~~~~~-----~P~l~i~ 672 (744)
..|...... ...+...+. ...+.-..++.+.. ..+.++.
T Consensus 217 ~Gvls~~i~W~~L~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ea~~~m~~~md~~T~l~nf~~P~dp~~ii~V~ 296 (348)
T PF09752_consen 217 EGVLSNSINWDALEKQFEDTVYEEEISDIPAQNKSLPLDSMEERRRDREALRFMRGVMDSFTHLTNFPVPVDPSAIIFVA 296 (348)
T ss_pred hhhhhcCCCHHHHHHHhcccchhhhhcccccCcccccchhhccccchHHHHHHHHHHHHhhccccccCCCCCCCcEEEEE
Confidence 000000000 000000111 11122333444443 3388999
Q ss_pred eCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 673 GEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 673 G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
+++|..|| ......|.+.. ..+++.++++ ||.-...-....+.+.|.+=|+
T Consensus 297 A~~DaYVP--r~~v~~Lq~~W----PGsEvR~l~g-GHVsA~L~~q~~fR~AI~Daf~ 347 (348)
T PF09752_consen 297 AKNDAYVP--RHGVLSLQEIW----PGSEVRYLPG-GHVSAYLLHQEAFRQAIYDAFE 347 (348)
T ss_pred ecCceEec--hhhcchHHHhC----CCCeEEEecC-CcEEEeeechHHHHHHHHHHhh
Confidence 99999998 77666665544 3457777887 8975444444556666666543
No 175
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=98.96 E-value=6.9e-09 Score=101.94 Aligned_cols=193 Identities=16% Similarity=0.154 Sum_probs=125.7
Q ss_pred CCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCC
Q 004574 30 GAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSR 109 (744)
Q Consensus 30 ~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~ 109 (744)
..++...+|.|+|++|+-++. +..-.|| |+.++++-.+-.+... ++.+++|.|||..++-...+
T Consensus 261 ~~RVs~VafHPsG~~L~Tasf--------D~tWRlW--D~~tk~ElL~QEGHs~----~v~~iaf~~DGSL~~tGGlD-- 324 (459)
T KOG0272|consen 261 LARVSRVAFHPSGKFLGTASF--------DSTWRLW--DLETKSELLLQEGHSK----GVFSIAFQPDGSLAATGGLD-- 324 (459)
T ss_pred hhhheeeeecCCCceeeeccc--------ccchhhc--ccccchhhHhhccccc----ccceeEecCCCceeeccCcc--
Confidence 447999999999999999765 3345555 7788877666433322 57889999999977643211
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ceeeeeccCCCC
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AVYTAVEPSPDQ 186 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~~~~~~~SpDG 186 (744)
..+.|| |+ +| .+--|..+ ..+.++.|||+|
T Consensus 325 ---------------------------------------------~~~RvW--DlRtgr~im~L~gH~k~I~~V~fsPNG 357 (459)
T KOG0272|consen 325 ---------------------------------------------SLGRVW--DLRTGRCIMFLAGHIKEILSVAFSPNG 357 (459)
T ss_pred ---------------------------------------------chhhee--ecccCcEEEEecccccceeeEeECCCc
Confidence 113444 77 77 33334334 788999999999
Q ss_pred ceEEEEEeeCCcccccccCCCcceEEEEeCCCCee-eeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecC
Q 004574 187 KYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV-RELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDR 265 (744)
Q Consensus 187 ~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~ 265 (744)
..|+-.+.++ ..-+||+..... ..+..+. .-+..+.|+|++.++|+-++..
T Consensus 358 y~lATgs~Dn-------------t~kVWDLR~r~~ly~ipAH~-------------nlVS~Vk~~p~~g~fL~TasyD-- 409 (459)
T KOG0272|consen 358 YHLATGSSDN-------------TCKVWDLRMRSELYTIPAHS-------------NLVSQVKYSPQEGYFLVTASYD-- 409 (459)
T ss_pred eEEeecCCCC-------------cEEEeeecccccceeccccc-------------chhhheEecccCCeEEEEcccC--
Confidence 9998665553 678888876543 2222111 1266789999666535544321
Q ss_pred CCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEE
Q 004574 266 GDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTW 327 (744)
Q Consensus 266 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~ 327 (744)
....||.-. .....+.|......+.++..|+||+.|+..+.++ ...||
T Consensus 410 --------~t~kiWs~~----~~~~~ksLaGHe~kV~s~Dis~d~~~i~t~s~DR--T~KLW 457 (459)
T KOG0272|consen 410 --------NTVKIWSTR----TWSPLKSLAGHEGKVISLDISPDSQAIATSSFDR--TIKLW 457 (459)
T ss_pred --------cceeeecCC----CcccchhhcCCccceEEEEeccCCceEEEeccCc--eeeec
Confidence 112244422 3344456777788899999999999999887544 33455
No 176
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=98.95 E-value=5.4e-08 Score=96.84 Aligned_cols=276 Identities=11% Similarity=0.042 Sum_probs=158.1
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.++...|.|--. |+.++. -++.-+||-+| |..-..|+.---.. .+ .....++|+|+..+|++.+
T Consensus 215 ~I~sv~FHp~~p-lllvaG-------~d~~lrifqvD--Gk~N~~lqS~~l~~-fP-i~~a~f~p~G~~~i~~s~r---- 278 (514)
T KOG2055|consen 215 GITSVQFHPTAP-LLLVAG-------LDGTLRIFQVD--GKVNPKLQSIHLEK-FP-IQKAEFAPNGHSVIFTSGR---- 278 (514)
T ss_pred CceEEEecCCCc-eEEEec-------CCCcEEEEEec--CccChhheeeeecc-Cc-cceeeecCCCceEEEeccc----
Confidence 578889999655 444331 14455566554 65555554221110 11 2356899999966655321
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-----ceeeeeccCCC
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-----AVYTAVEPSPD 185 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-----~~~~~~~~SpD 185 (744)
..-+|.+|+ +++..++... .....+..|||
T Consensus 279 --------------------------------------------rky~ysyDle~ak~~k~~~~~g~e~~~~e~FeVShd 314 (514)
T KOG2055|consen 279 --------------------------------------------RKYLYSYDLETAKVTKLKPPYGVEEKSMERFEVSHD 314 (514)
T ss_pred --------------------------------------------ceEEEEeeccccccccccCCCCcccchhheeEecCC
Confidence 135888999 7787777554 34568889999
Q ss_pred CceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecC
Q 004574 186 QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDR 265 (744)
Q Consensus 186 G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~ 265 (744)
+++|++..... .|++....+++ .|+....+ ..+.++.|+.||+. |+.+.
T Consensus 315 ~~fia~~G~~G-------------~I~lLhakT~e--li~s~Kie-----------G~v~~~~fsSdsk~-l~~~~---- 363 (514)
T KOG2055|consen 315 SNFIAIAGNNG-------------HIHLLHAKTKE--LITSFKIE-----------GVVSDFTFSSDSKE-LLASG---- 363 (514)
T ss_pred CCeEEEcccCc-------------eEEeehhhhhh--hhheeeec-----------cEEeeEEEecCCcE-EEEEc----
Confidence 99998875543 56666555543 22222222 11668899999986 44442
Q ss_pred CCCCccCCccceEEeccCCCCCCCCceEeeeecc--ceeceeeccCCceEEEeeeeeccceeEEEEcCCCC--CCcceee
Q 004574 266 GDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDL--RFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSK--DVAPRVL 341 (744)
Q Consensus 266 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~--~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~--~~~~~~l 341 (744)
..+.||++++. .......+..++ ...+++-|++|.+|+..+ . .+--.| +|.++. ...++.+
T Consensus 364 --------~~GeV~v~nl~---~~~~~~rf~D~G~v~gts~~~S~ng~ylA~GS-~-~GiVNI--Yd~~s~~~s~~PkPi 428 (514)
T KOG2055|consen 364 --------GTGEVYVWNLR---QNSCLHRFVDDGSVHGTSLCISLNGSYLATGS-D-SGIVNI--YDGNSCFASTNPKPI 428 (514)
T ss_pred --------CCceEEEEecC---CcceEEEEeecCccceeeeeecCCCceEEecc-C-cceEEE--eccchhhccCCCCch
Confidence 23359999983 222222233333 445678889999988876 2 233334 443332 1233333
Q ss_pred ecc-ccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhh
Q 004574 342 FDR-VFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFE 420 (744)
Q Consensus 342 ~~~-~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~ 420 (744)
..- +... ....++|++|++.|+..+.. ...+|.++.+.+-.+-.-|..... ...
T Consensus 429 k~~dNLtt-----~Itsl~Fn~d~qiLAiaS~~-------------------~knalrLVHvPS~TVFsNfP~~n~-~vg 483 (514)
T KOG2055|consen 429 KTVDNLTT-----AITSLQFNHDAQILAIASRV-------------------KKNALRLVHVPSCTVFSNFPTSNT-KVG 483 (514)
T ss_pred hhhhhhhe-----eeeeeeeCcchhhhhhhhhc-------------------cccceEEEeccceeeeccCCCCCC-ccc
Confidence 221 1111 11226799999999888632 234567776665555444443322 222
Q ss_pred heeeeecCCcceecccCCCEEEEEE
Q 004574 421 TAVALVFGQGEEDINLNQLKILTSK 445 (744)
Q Consensus 421 ~~~~~~~~~~~~~~s~d~~~~~~~~ 445 (744)
.+. -++|||.|..+++..
T Consensus 484 ~vt-------c~aFSP~sG~lAvGN 501 (514)
T KOG2055|consen 484 HVT-------CMAFSPNSGYLAVGN 501 (514)
T ss_pred ceE-------EEEecCCCceEEeec
Confidence 222 359999999888744
No 177
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=98.94 E-value=3.7e-07 Score=92.84 Aligned_cols=292 Identities=11% Similarity=-0.003 Sum_probs=141.5
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceec-cccCCCc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKP-LFESPDI 83 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~-lt~~~~~ 83 (744)
..|.+++... ++.. ..++.|..... .+||||+.|+.+...-...........|-++|+++.+... +.-.+..
T Consensus 27 ~~v~ViD~~~----~~v~--g~i~~G~~P~~-~~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~p 99 (352)
T TIGR02658 27 TQVYTIDGEA----GRVL--GMTDGGFLPNP-VVASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIELPEGP 99 (352)
T ss_pred ceEEEEECCC----CEEE--EEEEccCCCce-eECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEccCCCc
Confidence 4577777644 4333 22456654555 5999999887765411101112344678889999887643 3221110
Q ss_pred cc--cccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhh-hccceeeeeEEE
Q 004574 84 CL--NAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDE-SLFDYYTTAQLV 160 (744)
Q Consensus 84 ~~--~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~ 160 (744)
+. ...-..++.||||++|++......+. ........+....+..-+. +..-|+..+ .....-..+.+.
T Consensus 100 ~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~-V~VvD~~~~kvv~ei~vp~--------~~~vy~t~e~~~~~~~~Dg~~~ 170 (352)
T TIGR02658 100 RFLVGTYPWMTSLTPDNKTLLFYQFSPSPA-VGVVDLEGKAFVRMMDVPD--------CYHIFPTANDTFFMHCRDGSLA 170 (352)
T ss_pred hhhccCccceEEECCCCCEEEEecCCCCCE-EEEEECCCCcEEEEEeCCC--------CcEEEEecCCccEEEeecCceE
Confidence 10 11123679999999998753221111 0111111111111100000 000011100 000000112222
Q ss_pred EEcC--CCCee----ecCCC---ceeeeeccCC-CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCC
Q 004574 161 LGSL--DGTAK----DFGTP---AVYTAVEPSP-DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPA 230 (744)
Q Consensus 161 ~~~~--~G~~~----~l~~~---~~~~~~~~Sp-DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~ 230 (744)
.+.+ +|+.. ++... .-...+.+++ ||++++.+ .+ ..+++.|+.+.....+......
T Consensus 171 ~v~~d~~g~~~~~~~~vf~~~~~~v~~rP~~~~~dg~~~~vs-~e-------------G~V~~id~~~~~~~~~~~~~~~ 236 (352)
T TIGR02658 171 KVGYGTKGNPKIKPTEVFHPEDEYLINHPAYSNKSGRLVWPT-YT-------------GKIFQIDLSSGDAKFLPAIEAF 236 (352)
T ss_pred EEEecCCCceEEeeeeeecCCccccccCCceEcCCCcEEEEe-cC-------------CeEEEEecCCCcceecceeeec
Confidence 2222 34311 11000 1113445677 88766555 33 2789998766554443222111
Q ss_pred CCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCC
Q 004574 231 EDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDS 310 (744)
Q Consensus 231 ~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg 310 (744)
........+.+.+...++++|||++ +|........+- -....+.|+++|. .+++...-.........+++||||
T Consensus 237 ~~~~~~~~wrP~g~q~ia~~~dg~~-lyV~~~~~~~~t--hk~~~~~V~ViD~---~t~kvi~~i~vG~~~~~iavS~Dg 310 (352)
T TIGR02658 237 TEAEKADGWRPGGWQQVAYHRARDR-IYLLADQRAKWT--HKTASRFLFVVDA---KTGKRLRKIELGHEIDSINVSQDA 310 (352)
T ss_pred cccccccccCCCcceeEEEcCCCCE-EEEEecCCcccc--ccCCCCEEEEEEC---CCCeEEEEEeCCCceeeEEECCCC
Confidence 1011111223344556999999997 555332111111 1123357999998 566655544456678889999999
Q ss_pred ceEEEeeeeeccceeEEEEcCCCC
Q 004574 311 LALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 311 ~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
+.++|..+.. ...|.++|+.+.
T Consensus 311 kp~lyvtn~~--s~~VsViD~~t~ 332 (352)
T TIGR02658 311 KPLLYALSTG--DKTLYIFDAETG 332 (352)
T ss_pred CeEEEEeCCC--CCcEEEEECcCC
Confidence 9677766433 345889998774
No 178
>PF05728 UPF0227: Uncharacterised protein family (UPF0227); InterPro: IPR008886 Despite being classed as uncharacterised proteins, the members of this family are almost certainly enzymes in that they contain a domain distantly related to IPR000073 from INTERPRO. One of the members of this family YqiA has been shown to be a esterase []. Other members, which include the Escherichia coli (strain K12) YcfP protein are uncharacterised.
Probab=98.93 E-value=6.8e-08 Score=89.47 Aligned_cols=136 Identities=11% Similarity=0.060 Sum_probs=80.7
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC------CCcccccccchhhcHH
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT------PFGFQTEFRTLWEATN 650 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 650 (744)
++.+.++.+.+... ++.+.|+|.|+||+.|.+++.+++ +++ |++.|.+..... ...........+-...
T Consensus 44 ~a~~~l~~~i~~~~--~~~~~liGSSlGG~~A~~La~~~~--~~a-vLiNPav~p~~~l~~~iG~~~~~~~~e~~~~~~~ 118 (187)
T PF05728_consen 44 EAIAQLEQLIEELK--PENVVLIGSSLGGFYATYLAERYG--LPA-VLINPAVRPYELLQDYIGEQTNPYTGESYELTEE 118 (187)
T ss_pred HHHHHHHHHHHhCC--CCCeEEEEEChHHHHHHHHHHHhC--CCE-EEEcCCCCHHHHHHHhhCccccCCCCccceechH
Confidence 44555555555432 234999999999999999999883 445 777887542110 0000000011111112
Q ss_pred HHHhcCccccc-CCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHH
Q 004574 651 VYIEMSPITHA-NKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 651 ~~~~~~~~~~~-~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl 729 (744)
.+.+...+... .....++++++++.|++++ +..+.+.++. +...+.+|++|.+.. .++....|++|+
T Consensus 119 ~~~~l~~l~~~~~~~~~~~lvll~~~DEvLd--~~~a~~~~~~-------~~~~i~~ggdH~f~~---f~~~l~~i~~f~ 186 (187)
T PF05728_consen 119 HIEELKALEVPYPTNPERYLVLLQTGDEVLD--YREAVAKYRG-------CAQIIEEGGDHSFQD---FEEYLPQIIAFL 186 (187)
T ss_pred hhhhcceEeccccCCCccEEEEEecCCcccC--HHHHHHHhcC-------ceEEEEeCCCCCCcc---HHHHHHHHHHhh
Confidence 22222222211 2235789999999999988 7666544422 245567888998764 668888888887
No 179
>PRK10115 protease 2; Provisional
Probab=98.93 E-value=6.8e-07 Score=101.11 Aligned_cols=262 Identities=13% Similarity=0.114 Sum_probs=150.0
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc--eeccccCCCccccccccceEEecCCcEEEEEecCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE--AKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSR 109 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~--~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~ 109 (744)
......|||||++|||.. +...+...+|+++++++|+ ...+.... ..+.|++|++.|+|+.....
T Consensus 128 ~l~~~~~Spdg~~la~~~-----d~~G~E~~~l~v~d~~tg~~l~~~i~~~~--------~~~~w~~D~~~~~y~~~~~~ 194 (686)
T PRK10115 128 TLGGMAITPDNTIMALAE-----DFLSRRQYGIRFRNLETGNWYPELLDNVE--------PSFVWANDSWTFYYVRKHPV 194 (686)
T ss_pred EEeEEEECCCCCEEEEEe-----cCCCcEEEEEEEEECCCCCCCCccccCcc--------eEEEEeeCCCEEEEEEecCC
Confidence 456789999999999986 3445777889999999887 44442211 24799999999999865311
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC--CeeecCCC--c-eeeeeccC
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG--TAKDFGTP--A-VYTAVEPS 183 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G--~~~~l~~~--~-~~~~~~~S 183 (744)
.. ...++|+.++ ++ +.+.|... . .......+
T Consensus 195 ~~-------------------------------------------~~~~v~~h~lgt~~~~d~lv~~e~~~~~~~~~~~s 231 (686)
T PRK10115 195 TL-------------------------------------------LPYQVWRHTIGTPASQDELVYEEKDDTFYVSLHKT 231 (686)
T ss_pred CC-------------------------------------------CCCEEEEEECCCChhHCeEEEeeCCCCEEEEEEEc
Confidence 00 1158999999 77 44555443 2 22233345
Q ss_pred CCCceEEEEEeeCCcccccccCCCcceEEEEeCC--CCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEE
Q 004574 184 PDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD--GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVE 261 (744)
Q Consensus 184 pDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~--g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~ 261 (744)
.|++++++...... ..++++++.+ ....+.+...+... .+.....+.. +++..
T Consensus 232 ~d~~~l~i~~~~~~----------~~~~~l~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~-ly~~t 286 (686)
T PRK10115 232 TSKHYVVIHLASAT----------TSEVLLLDAELADAEPFVFLPRRKDH--------------EYSLDHYQHR-FYLRS 286 (686)
T ss_pred CCCCEEEEEEECCc----------cccEEEEECcCCCCCceEEEECCCCC--------------EEEEEeCCCE-EEEEE
Confidence 59999986655432 2467777742 22222222221110 0112222343 66665
Q ss_pred eecCCCCCccCCccceEEeccCCCCCCCCceEeeee--ccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcce
Q 004574 262 AQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL--DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPR 339 (744)
Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~--~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~ 339 (744)
+.+. .+..|+.+++. +.++.+.+... +..+..+.++ +.+|++.. ...+...|+++++.++ ...
T Consensus 287 n~~~--------~~~~l~~~~~~--~~~~~~~l~~~~~~~~i~~~~~~--~~~l~~~~-~~~g~~~l~~~~~~~~--~~~ 351 (686)
T PRK10115 287 NRHG--------KNFGLYRTRVR--DEQQWEELIPPRENIMLEGFTLF--TDWLVVEE-RQRGLTSLRQINRKTR--EVI 351 (686)
T ss_pred cCCC--------CCceEEEecCC--CcccCeEEECCCCCCEEEEEEEE--CCEEEEEE-EeCCEEEEEEEcCCCC--ceE
Confidence 4322 22247777762 13555666655 2356777777 55777766 4446778999998652 333
Q ss_pred eee-ccccccccCCCCCCceeeC--CCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeec
Q 004574 340 VLF-DRVFENVYSDPGSPMMTRT--STGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWES 413 (744)
Q Consensus 340 ~l~-~~~~~~~~~~~~~~~~~~s--pdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~ 413 (744)
.+. ...... ..+..+ +++..+++... + ......+|.+|+.+++.+.+...
T Consensus 352 ~l~~~~~~~~-------~~~~~~~~~~~~~~~~~~s--------------s---~~~P~~~y~~d~~~~~~~~l~~~ 404 (686)
T PRK10115 352 GIAFDDPAYV-------TWIAYNPEPETSRLRYGYS--------------S---MTTPDTLFELDMDTGERRVLKQT 404 (686)
T ss_pred EecCCCCceE-------eeecccCCCCCceEEEEEe--------------c---CCCCCEEEEEECCCCcEEEEEec
Confidence 333 211110 001122 45555555542 2 23456799999998887766543
No 180
>KOG2112 consensus Lysophospholipase [Lipid transport and metabolism]
Probab=98.93 E-value=1.5e-08 Score=91.96 Aligned_cols=128 Identities=21% Similarity=0.281 Sum_probs=96.1
Q ss_pred HHHHHHHHHHHc---CCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHH
Q 004574 577 SAEAAVEEVVRR---GVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYI 653 (744)
Q Consensus 577 d~~~~~~~l~~~---~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 653 (744)
...+.+..|.++ ..++.+||++-|+||||.+++.++..+|..+.+.+..++......... .. |
T Consensus 73 ~aa~~i~~Li~~e~~~Gi~~~rI~igGfs~G~a~aL~~~~~~~~~l~G~~~~s~~~p~~~~~~--~~-----~------- 138 (206)
T KOG2112|consen 73 RAADNIANLIDNEPANGIPSNRIGIGGFSQGGALALYSALTYPKALGGIFALSGFLPRASIGL--PG-----W------- 138 (206)
T ss_pred HHHHHHHHHHHHHHHcCCCccceeEcccCchHHHHHHHHhccccccceeeccccccccchhhc--cC-----C-------
Confidence 344455555543 457889999999999999999999999877777777777644211110 00 0
Q ss_pred hcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 654 EMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 654 ~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
-+ ..+ .+|+++.||+.|+.|| ..-.++..+.|+..+.+++++.|++.+|.... +-+..+..|+.+
T Consensus 139 --~~---~~~-~~~i~~~Hg~~d~~vp--~~~g~~s~~~l~~~~~~~~f~~y~g~~h~~~~-----~e~~~~~~~~~~ 203 (206)
T KOG2112|consen 139 --LP---GVN-YTPILLCHGTADPLVP--FRFGEKSAQFLKSLGVRVTFKPYPGLGHSTSP-----QELDDLKSWIKT 203 (206)
T ss_pred --cc---ccC-cchhheecccCCceee--hHHHHHHHHHHHHcCCceeeeecCCccccccH-----HHHHHHHHHHHH
Confidence 00 001 6899999999999999 89999999999999988999999999997653 446677888876
No 181
>COG0627 Predicted esterase [General function prediction only]
Probab=98.92 E-value=1.4e-08 Score=101.74 Aligned_cols=147 Identities=20% Similarity=0.192 Sum_probs=102.1
Q ss_pred HHHcCCCCC--CcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC---------CCCCcccccccchhhc--HHH
Q 004574 585 VVRRGVADP--SRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT---------LTPFGFQTEFRTLWEA--TNV 651 (744)
Q Consensus 585 l~~~~~~d~--~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~---------~~~~~~~~~~~~~~~~--~~~ 651 (744)
+.+....+. ++.+|+||||||+-|+.+|.++|++|+.+..++|+++.. ...++..... ..|.. ...
T Consensus 141 ~~~~f~~~~~~~~~aI~G~SMGG~GAl~lA~~~pd~f~~~sS~Sg~~~~s~~~~~~~~~~~~~g~~~~~-~~~G~~~~~~ 219 (316)
T COG0627 141 WEAAFPADGTGDGRAIAGHSMGGYGALKLALKHPDRFKSASSFSGILSPSSPWGPTLAMGDPWGGKAFN-AMLGPDSDPA 219 (316)
T ss_pred HHHhcCcccccCCceeEEEeccchhhhhhhhhCcchhceeccccccccccccccccccccccccCccHH-HhcCCCcccc
Confidence 333344454 389999999999999999999999999999999998754 2211111100 01111 113
Q ss_pred HHhcCcccccCC--------------CCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcccc
Q 004574 652 YIEMSPITHANK--------------IKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAREN 717 (744)
Q Consensus 652 ~~~~~~~~~~~~--------------~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~ 717 (744)
+.++++...+.+ ...++++-+|..|.+........+.+.+++++.|.+..+...++..|.+ ..
T Consensus 220 w~~~D~~~~~~~l~~~~~~~~~~~~~~~~~~~~d~g~ad~~~~~~~~~~~~~~~a~~~~g~~~~~~~~~~G~Hsw---~~ 296 (316)
T COG0627 220 WQENDPLSLIEKLVANANTRIWVYGGSPPELLIDNGPADFFLAANNLSTRAFAEALRAAGIPNGVRDQPGGDHSW---YF 296 (316)
T ss_pred ccccCchhHHHHhhhcccccceecccCCCccccccccchhhhhhcccCHHHHHHHHHhcCCCceeeeCCCCCcCH---HH
Confidence 344455444432 4477888899999874201334788999999999999999888888874 44
Q ss_pred HHHHHHHHHHHHHHhccC
Q 004574 718 VMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 718 ~~~~~~~~~~fl~~~l~~ 735 (744)
+..++...+.|+.+.|..
T Consensus 297 w~~~l~~~~~~~a~~l~~ 314 (316)
T COG0627 297 WASQLADHLPWLAGALGL 314 (316)
T ss_pred HHHHHHHHHHHHHHHhcc
Confidence 778888899999887754
No 182
>TIGR01838 PHA_synth_I poly(R)-hydroxyalkanoic acid synthase, class I. This model represents the class I subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs with three to five carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=98.90 E-value=6.1e-08 Score=104.52 Aligned_cols=80 Identities=13% Similarity=0.022 Sum_probs=56.0
Q ss_pred hhHHHHHhCCeEEEecCCCCCCCCCCC-------Ch-HHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHH----HHHH
Q 004574 546 TSSLIFLARRFAVLAGPSIPIIGEGDK-------LP-NDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTA----HLLA 613 (744)
Q Consensus 546 ~~~~~~~~~G~~v~~~~~~~~~g~g~~-------~~-~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~----~~~~ 613 (744)
..+..|+++||.|+..+ ..++|.+ +. .+++.++++++++.. +.+++.++||||||.++. .+++
T Consensus 211 Slv~~L~~qGf~V~~iD---wrgpg~s~~~~~~ddY~~~~i~~al~~v~~~~--g~~kv~lvG~cmGGtl~a~ala~~aa 285 (532)
T TIGR01838 211 SLVRWLVEQGHTVFVIS---WRNPDASQADKTFDDYIRDGVIAALEVVEAIT--GEKQVNCVGYCIGGTLLSTALAYLAA 285 (532)
T ss_pred HHHHHHHHCCcEEEEEE---CCCCCcccccCChhhhHHHHHHHHHHHHHHhc--CCCCeEEEEECcCcHHHHHHHHHHHH
Confidence 46778899999998833 2332222 12 235788888888753 346899999999999863 2344
Q ss_pred hC-CCceeEEEEccCCCC
Q 004574 614 HA-PHLFCCGIARSGSYN 630 (744)
Q Consensus 614 ~~-p~~~~~~v~~~~~~~ 630 (744)
.. +++++++++++...|
T Consensus 286 ~~~~~rv~slvll~t~~D 303 (532)
T TIGR01838 286 RGDDKRIKSATFFTTLLD 303 (532)
T ss_pred hCCCCccceEEEEecCcC
Confidence 54 678999998887655
No 183
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=98.89 E-value=1.1e-07 Score=87.92 Aligned_cols=205 Identities=19% Similarity=0.168 Sum_probs=122.1
Q ss_pred ceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccc
Q 004574 6 GIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICL 85 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~ 85 (744)
+|.+++... ++.+..+... .......|||||+++++.. ....|-.+|..+-+...-.+..
T Consensus 88 ~ir~wd~r~----~k~~~~i~~~--~eni~i~wsp~g~~~~~~~----------kdD~it~id~r~~~~~~~~~~~---- 147 (313)
T KOG1407|consen 88 TIRIWDIRS----GKCTARIETK--GENINITWSPDGEYIAVGN----------KDDRITFIDARTYKIVNEEQFK---- 147 (313)
T ss_pred eEEEEEecc----CcEEEEeecc--CcceEEEEcCCCCEEEEec----------CcccEEEEEecccceeehhccc----
Confidence 455666644 5554444222 2367899999999999963 3356777776543332211111
Q ss_pred cccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCC
Q 004574 86 NAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLD 165 (744)
Q Consensus 86 ~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 165 (744)
..+....|+.++. ++|.... .+. .+|..++-=
T Consensus 148 -~e~ne~~w~~~nd-~Fflt~G--lG~--------------------------------------------v~ILsypsL 179 (313)
T KOG1407|consen 148 -FEVNEISWNNSND-LFFLTNG--LGC--------------------------------------------VEILSYPSL 179 (313)
T ss_pred -ceeeeeeecCCCC-EEEEecC--Cce--------------------------------------------EEEEecccc
Confidence 1344678995555 5555321 111 233333311
Q ss_pred CCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCCCCCCCcccCCccCC
Q 004574 166 GTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPPAEDIPVCYNSVREG 243 (744)
Q Consensus 166 G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~~~~~~~~~~~~~~~ 243 (744)
...+.|..+ .+..-+.|+|||++++..+.+. .+-+||++.-- .+-++...+.
T Consensus 180 kpv~si~AH~snCicI~f~p~GryfA~GsADA-------------lvSLWD~~ELiC~R~isRldwp------------- 233 (313)
T KOG1407|consen 180 KPVQSIKAHPSNCICIEFDPDGRYFATGSADA-------------LVSLWDVDELICERCISRLDWP------------- 233 (313)
T ss_pred ccccccccCCcceEEEEECCCCceEeeccccc-------------eeeccChhHhhhheeeccccCc-------------
Confidence 112223333 5566789999999999886553 56778876543 2333434332
Q ss_pred CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeee
Q 004574 244 MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYK 320 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~ 320 (744)
++.++||.||+. |+..+. .--|-+..+ ++|....-.+..+....++|.|....|+|+..+.
T Consensus 234 VRTlSFS~dg~~-lASaSE------------Dh~IDIA~v---etGd~~~eI~~~~~t~tVAWHPk~~LLAyA~ddk 294 (313)
T KOG1407|consen 234 VRTLSFSHDGRM-LASASE------------DHFIDIAEV---ETGDRVWEIPCEGPTFTVAWHPKRPLLAYACDDK 294 (313)
T ss_pred eEEEEeccCcce-eeccCc------------cceEEeEec---ccCCeEEEeeccCCceeEEecCCCceeeEEecCC
Confidence 888999999997 665531 112566666 5565444444677888999999999999998444
No 184
>KOG3847 consensus Phospholipase A2 (platelet-activating factor acetylhydrolase in humans) [Lipid transport and metabolism]
Probab=98.89 E-value=2e-08 Score=95.54 Aligned_cols=183 Identities=17% Similarity=0.128 Sum_probs=110.3
Q ss_pred CCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCC-------------CCCC--
Q 004574 508 SKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIG-------------EGDK-- 572 (744)
Q Consensus 508 ~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g-------------~g~~-- 572 (744)
.+.+++|+|||.||-|. +...+......+|++||+|.+...++... ....
T Consensus 113 tk~~k~PvvvFSHGLgg---------------sRt~YSa~c~~LAShG~VVaavEHRD~SA~~Ty~~~~~~~n~~lveq~ 177 (399)
T KOG3847|consen 113 TKNDKYPVVVFSHGLGG---------------SRTLYSAYCTSLASHGFVVAAVEHRDRSACWTYVLKEKHENEPLVEQW 177 (399)
T ss_pred CCCCCccEEEEeccccc---------------chhhHHHHhhhHhhCceEEEEeecccCcceeEEEecccccCCcccccc
Confidence 34667999999999531 22222245668899999999854443322 0000
Q ss_pred ---------C------------hHHHHHHHHHHHHHc---------------------CCCCCCcEEEEEechHHHHHHH
Q 004574 573 ---------L------------PNDSAEAAVEEVVRR---------------------GVADPSRIAVGGHSYGAFMTAH 610 (744)
Q Consensus 573 ---------~------------~~~d~~~~~~~l~~~---------------------~~~d~~~i~l~G~S~GG~~a~~ 610 (744)
+ ...++..|+.-|.+. +.+|..+++++|||.||+.++.
T Consensus 178 ~~ir~v~~~ekef~irNeqv~~R~~Ec~~aL~il~~i~~g~~~~~~L~g~~~~~~~~K~nl~~s~~aViGHSFGgAT~i~ 257 (399)
T KOG3847|consen 178 IKIRLVEANEKEFHIRNEQVGQRAQECQKALKILEQINDGGTPDNVLPGNNSDLEQLKGNLDTSQAAVIGHSFGGATSIA 257 (399)
T ss_pred eEeeeeccCceeEEeeCHHHHHHHHHHHHHHHHHHHhhcCCCchhcccCccccHHHHhcchhhhhhhheeccccchhhhh
Confidence 0 111555666555442 1356779999999999999998
Q ss_pred HHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHH
Q 004574 611 LLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFF 690 (744)
Q Consensus 611 ~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~ 690 (744)
..+.+ ..|+|+|++.+.+ .+.. . ....+++.|+|+|.- .|... .+....-
T Consensus 258 ~ss~~-t~FrcaI~lD~WM----~Pl~----~----------------~~~~~arqP~~finv-~~fQ~----~en~~vm 307 (399)
T KOG3847|consen 258 SSSSH-TDFRCAIALDAWM----FPLD----Q----------------LQYSQARQPTLFINV-EDFQW----NENLLVM 307 (399)
T ss_pred hhccc-cceeeeeeeeeee----cccc----h----------------hhhhhccCCeEEEEc-ccccc----hhHHHHH
Confidence 88887 6899999887742 1110 0 012356789999983 33332 2333333
Q ss_pred HHHHhCCCcEEEEEeCCCCcccCc----------------------cccHHHHHHHHHHHHHHhccC
Q 004574 691 DALKGHGALSRLVLLPFEHHVYAA----------------------RENVMHVIWETDRWLQKYCLS 735 (744)
Q Consensus 691 ~~l~~~~~~~~~~~~~~~~H~~~~----------------------~~~~~~~~~~~~~fl~~~l~~ 735 (744)
+........-..+++.|+-|--.. ....+...+..++||.+++..
T Consensus 308 Kki~~~n~g~~~it~~GsVHqnfsDfpfv~p~~i~k~f~~kg~~dpy~~~~~~~r~slaFLq~h~d~ 374 (399)
T KOG3847|consen 308 KKIESQNEGNHVITLDGSVHQNFSDFPFVTPNWIGKVFKVKGETDPYEAMQIAIRASLAFLQKHLDL 374 (399)
T ss_pred HhhhCCCccceEEEEccceecccccCccccHHHHHHHhccCCCCChHHHHHHHHHHHHHHHHhhhhh
Confidence 333333344477888888883210 112244567789999998765
No 185
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=98.88 E-value=7e-08 Score=102.52 Aligned_cols=206 Identities=12% Similarity=0.111 Sum_probs=132.7
Q ss_pred CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCC
Q 004574 167 TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMR 245 (744)
Q Consensus 167 ~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~ 245 (744)
..+.+..+ +.+.+.+||||-++|+-.+.+ ..+.+|.+++.....+-.++.. | ++
T Consensus 443 ~~~~L~GH~GPVyg~sFsPd~rfLlScSED-------------~svRLWsl~t~s~~V~y~GH~~---P---------Vw 497 (707)
T KOG0263|consen 443 TSRTLYGHSGPVYGCSFSPDRRFLLSCSED-------------SSVRLWSLDTWSCLVIYKGHLA---P---------VW 497 (707)
T ss_pred eeEEeecCCCceeeeeecccccceeeccCC-------------cceeeeecccceeEEEecCCCc---c---------ee
Confidence 55556666 889999999999988755433 3677888887654444333222 2 77
Q ss_pred ccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecccee
Q 004574 246 SISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTR 325 (744)
Q Consensus 246 ~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~ 325 (744)
.+.|+|-| +||++...... -+||..+- ....+.+..+...+.-+.|.|++.+++..+.+ . .
T Consensus 498 dV~F~P~G---yYFatas~D~t--------ArLWs~d~----~~PlRifaghlsDV~cv~FHPNs~Y~aTGSsD--~--t 558 (707)
T KOG0263|consen 498 DVQFAPRG---YYFATASHDQT--------ARLWSTDH----NKPLRIFAGHLSDVDCVSFHPNSNYVATGSSD--R--T 558 (707)
T ss_pred eEEecCCc---eEEEecCCCce--------eeeeeccc----CCchhhhcccccccceEEECCcccccccCCCC--c--e
Confidence 88999998 88887532222 25777764 22223344445667778999999999987532 2 3
Q ss_pred EEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCC
Q 004574 326 TWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTG 405 (744)
Q Consensus 326 l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g 405 (744)
+.++|+.. +...+++.++...+.. +++||+|++|+.... ...|.+||+.+|
T Consensus 559 VRlWDv~~--G~~VRiF~GH~~~V~a------l~~Sp~Gr~LaSg~e---------------------d~~I~iWDl~~~ 609 (707)
T KOG0263|consen 559 VRLWDVST--GNSVRIFTGHKGPVTA------LAFSPCGRYLASGDE---------------------DGLIKIWDLANG 609 (707)
T ss_pred EEEEEcCC--CcEEEEecCCCCceEE------EEEcCCCceEeeccc---------------------CCcEEEEEcCCC
Confidence 55556666 3446777665544333 679999999988752 224899999988
Q ss_pred ceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECC
Q 004574 406 SKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWP 460 (744)
Q Consensus 406 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~ 460 (744)
+........ ++....++||+||+.|+-... --.+.+||+.
T Consensus 610 ~~v~~l~~H-----------t~ti~SlsFS~dg~vLasgg~----DnsV~lWD~~ 649 (707)
T KOG0263|consen 610 SLVKQLKGH-----------TGTIYSLSFSRDGNVLASGGA----DNSVRLWDLT 649 (707)
T ss_pred cchhhhhcc-----------cCceeEEEEecCCCEEEecCC----CCeEEEEEch
Confidence 764332222 122233689999988774332 3357777863
No 186
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=98.87 E-value=6.4e-08 Score=91.34 Aligned_cols=146 Identities=17% Similarity=0.228 Sum_probs=89.5
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
.+|+.++..+ .+...+. +.....+...+|||+|+.+|.+.. .....|.++++++.....+- ..
T Consensus 39 ~~l~~~~~~~----~~~~~i~-l~~~~~I~~~~WsP~g~~favi~g--------~~~~~v~lyd~~~~~i~~~~--~~-- 101 (194)
T PF08662_consen 39 FELFYLNEKN----IPVESIE-LKKEGPIHDVAWSPNGNEFAVIYG--------SMPAKVTLYDVKGKKIFSFG--TQ-- 101 (194)
T ss_pred EEEEEEecCC----Cccceee-ccCCCceEEEEECcCCCEEEEEEc--------cCCcccEEEcCcccEeEeec--CC--
Confidence 4566665544 5555555 333324899999999999998753 22336777787644444442 11
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
....+.|||+|++|+.....+ ..++|.++|+
T Consensus 102 ---~~n~i~wsP~G~~l~~~g~~n----------------------------------------------~~G~l~~wd~ 132 (194)
T PF08662_consen 102 ---PRNTISWSPDGRFLVLAGFGN----------------------------------------------LNGDLEFWDV 132 (194)
T ss_pred ---CceEEEECCCCCEEEEEEccC----------------------------------------------CCcEEEEEEC
Confidence 123579999999999874211 0146777888
Q ss_pred C-CCeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeee
Q 004574 165 D-GTAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRE 223 (744)
Q Consensus 165 ~-G~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~ 223 (744)
. .+.-..........+.|||||++|+...... .-+....+.+|+..|+.+.+
T Consensus 133 ~~~~~i~~~~~~~~t~~~WsPdGr~~~ta~t~~-------r~~~dng~~Iw~~~G~~l~~ 185 (194)
T PF08662_consen 133 RKKKKISTFEHSDATDVEWSPDGRYLATATTSP-------RLRVDNGFKIWSFQGRLLYK 185 (194)
T ss_pred CCCEEeeccccCcEEEEEEcCCCCEEEEEEecc-------ceeccccEEEEEecCeEeEe
Confidence 4 4333223335677899999999999776431 11122456667777775443
No 187
>PTZ00421 coronin; Provisional
Probab=98.84 E-value=1.6e-06 Score=93.74 Aligned_cols=168 Identities=11% Similarity=0.047 Sum_probs=95.2
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeee-eccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVR-ELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
..+..+.|+|++..++++...+ ..|.+||+..+... .+.... ..+..+.|+|||
T Consensus 126 ~~V~~l~f~P~~~~iLaSgs~D------------gtVrIWDl~tg~~~~~l~~h~-------------~~V~sla~spdG 180 (493)
T PTZ00421 126 KKVGIVSFHPSAMNVLASAGAD------------MVVNVWDVERGKAVEVIKCHS-------------DQITSLEWNLDG 180 (493)
T ss_pred CcEEEEEeCcCCCCEEEEEeCC------------CEEEEEECCCCeEEEEEcCCC-------------CceEEEEEECCC
Confidence 5677899999986555554432 37999998865433 222221 126688999999
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCce-Eeeeecc-ceeceeeccCCceEEEeeeeeccceeEEEEcC
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPE-ILHKLDL-RFRSVSWCDDSLALVNETWYKTSQTRTWLVCP 331 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~l~~~~~-~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~ 331 (744)
+. |+..+ ..+.|.++|. ..++.. .+..... ......|++++..|+...........|.++|+
T Consensus 181 ~l-Latgs------------~Dg~IrIwD~---rsg~~v~tl~~H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWDl 244 (493)
T PTZ00421 181 SL-LCTTS------------KDKKLNIIDP---RDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDT 244 (493)
T ss_pred CE-EEEec------------CCCEEEEEEC---CCCcEEEEEecCCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEeC
Confidence 85 44432 1235778887 333322 2222222 23467899998877765433223456777787
Q ss_pred CCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeE
Q 004574 332 GSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKER 409 (744)
Q Consensus 332 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~ 409 (744)
... ..+......+.. .......|++|+..|+....+ ...|+.||+.+++...
T Consensus 245 r~~-~~p~~~~~~d~~-----~~~~~~~~d~d~~~L~lggkg--------------------Dg~Iriwdl~~~~~~~ 296 (493)
T PTZ00421 245 RKM-ASPYSTVDLDQS-----SALFIPFFDEDTNLLYIGSKG--------------------EGNIRCFELMNERLTF 296 (493)
T ss_pred CCC-CCceeEeccCCC-----CceEEEEEcCCCCEEEEEEeC--------------------CCeEEEEEeeCCceEE
Confidence 653 122222211111 122223489999887766422 1248888887776543
No 188
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.83 E-value=5.9e-07 Score=93.64 Aligned_cols=292 Identities=10% Similarity=0.004 Sum_probs=136.6
Q ss_pred cceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCcee-ccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 34 NFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAK-PLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 34 ~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~-~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
....+||||++++... ..+.|-++|+.+++.. ++--+. ....+++|+||++|+.+..
T Consensus 40 ~~~~~s~Dgr~~yv~~----------rdg~vsviD~~~~~~v~~i~~G~------~~~~i~~s~DG~~~~v~n~------ 97 (369)
T PF02239_consen 40 AGLKFSPDGRYLYVAN----------RDGTVSVIDLATGKVVATIKVGG------NPRGIAVSPDGKYVYVANY------ 97 (369)
T ss_dssp EEEE-TT-SSEEEEEE----------TTSEEEEEETTSSSEEEEEE-SS------EEEEEEE--TTTEEEEEEE------
T ss_pred eEEEecCCCCEEEEEc----------CCCeEEEEECCcccEEEEEecCC------CcceEEEcCCCCEEEEEec------
Confidence 4567899999865542 1257999999988743 332222 2357899999999987632
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCe-eecCCC--------ceeeeecc
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTA-KDFGTP--------AVYTAVEP 182 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~-~~l~~~--------~~~~~~~~ 182 (744)
..+++.++|. +.+. +.+... ..+..+.-
T Consensus 98 ------------------------------------------~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~ 135 (369)
T PF02239_consen 98 ------------------------------------------EPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVA 135 (369)
T ss_dssp ------------------------------------------ETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE
T ss_pred ------------------------------------------CCCceeEeccccccceeecccccccccccCCCceeEEe
Confidence 1157777777 4433 333221 23446667
Q ss_pred CCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEe
Q 004574 183 SPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEA 262 (744)
Q Consensus 183 SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~ 262 (744)
||+....++...+. .++|++|........+...... ..+.+..|+|||++ ++...+
T Consensus 136 s~~~~~fVv~lkd~------------~~I~vVdy~d~~~~~~~~i~~g-----------~~~~D~~~dpdgry-~~va~~ 191 (369)
T PF02239_consen 136 SPGRPEFVVNLKDT------------GEIWVVDYSDPKNLKVTTIKVG-----------RFPHDGGFDPDGRY-FLVAAN 191 (369)
T ss_dssp -SSSSEEEEEETTT------------TEEEEEETTTSSCEEEEEEE-------------TTEEEEEE-TTSSE-EEEEEG
T ss_pred cCCCCEEEEEEccC------------CeEEEEEeccccccceeeeccc-----------ccccccccCcccce-eeeccc
Confidence 88888766665553 3889988554321111111111 11446789999997 333322
Q ss_pred ecCCCCCccCCccceEEeccCCCCCCCCceEeeeec-----cc-------eeceeeccCCceEEEeeeeeccceeEEEEc
Q 004574 263 QDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLD-----LR-------FRSVSWCDDSLALVNETWYKTSQTRTWLVC 330 (744)
Q Consensus 263 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~-----~~-------~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~ 330 (744)
. .+.|-+++. +.+....+.... +. ...+.|+..+......+..... .+.+++
T Consensus 192 ~-----------sn~i~viD~---~~~k~v~~i~~g~~p~~~~~~~~php~~g~vw~~~~~~~~~~~~ig~~--~v~v~d 255 (369)
T PF02239_consen 192 G-----------SNKIAVIDT---KTGKLVALIDTGKKPHPGPGANFPHPGFGPVWATSGLGYFAIPLIGTD--PVSVHD 255 (369)
T ss_dssp G-----------GTEEEEEET---TTTEEEEEEE-SSSBEETTEEEEEETTTEEEEEEEBSSSSEEEEEE----TTT-ST
T ss_pred c-----------cceeEEEee---ccceEEEEeeccccccccccccccCCCcceEEeeccccceecccccCC--ccccch
Confidence 1 224666666 344333332211 11 1123454333221111100000 011222
Q ss_pred CCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEE
Q 004574 331 PGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERI 410 (744)
Q Consensus 331 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l 410 (744)
...- ...+.+..... + ..+..+||+++|+...--. .....|.++|..+-+...-
T Consensus 256 ~~~w-kvv~~I~~~G~-------g-lFi~thP~s~~vwvd~~~~-----------------~~~~~v~viD~~tl~~~~~ 309 (369)
T PF02239_consen 256 DYAW-KVVKTIPTQGG-------G-LFIKTHPDSRYVWVDTFLN-----------------PDADTVQVIDKKTLKVVKT 309 (369)
T ss_dssp TTBT-SEEEEEE-SSS-------S---EE--TT-SEEEEE-TT------------------SSHT-EEEEECCGTEEEE-
T ss_pred hhcC-eEEEEEECCCC-------c-ceeecCCCCccEEeeccCC-----------------CCCceEEEEECcCcceeEE
Confidence 2221 11111211111 1 2245799999988862110 1145699999987754322
Q ss_pred ee-ccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCce-eeee
Q 004574 411 WE-SNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKS-SQIT 468 (744)
Q Consensus 411 ~~-~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~-~~lt 468 (744)
+. ..+. .+.. +.|++||+++.++...... .|..+|..+.+. ++++
T Consensus 310 i~~~~~~----~~~h-------~ef~~dG~~v~vS~~~~~~--~i~v~D~~Tl~~~~~i~ 356 (369)
T PF02239_consen 310 ITPGPGK----RVVH-------MEFNPDGKEVWVSVWDGNG--AIVVYDAKTLKEKKRIP 356 (369)
T ss_dssp HHHHHT------EEE-------EEE-TTSSEEEEEEE--TT--EEEEEETTTTEEEEEEE
T ss_pred EeccCCC----cEec-------cEECCCCCEEEEEEecCCC--EEEEEECCCcEEEEEEE
Confidence 21 1110 1222 4899999998887765553 799999877554 4443
No 189
>COG4188 Predicted dienelactone hydrolase [General function prediction only]
Probab=98.83 E-value=4e-08 Score=97.63 Aligned_cols=207 Identities=22% Similarity=0.235 Sum_probs=118.9
Q ss_pred eEEEEEEcC-CCeEEEEEEEeCCCCCC-CCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEE
Q 004574 482 KEMIKYQRK-DGVPLTATLYLPPGYDQ-SKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVL 559 (744)
Q Consensus 482 ~~~i~~~~~-~g~~l~~~~~~P~~~~~-~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~ 559 (744)
...+++... ++.++...++.|..... -..+++|+||+.||.|- +...+.+.+..+++.||+|.
T Consensus 38 ~~~i~~~~~~r~~~~~v~~~~p~~~~~~~~~~~~PlvvlshG~Gs---------------~~~~f~~~A~~lAs~Gf~Va 102 (365)
T COG4188 38 FVTITLNDPQRDRERPVDLRLPQGGTGTVALYLLPLVVLSHGSGS---------------YVTGFAWLAEHLASYGFVVA 102 (365)
T ss_pred EEEEeccCcccCCccccceeccCCCccccccCcCCeEEecCCCCC---------------CccchhhhHHHHhhCceEEE
Confidence 555666543 36678888998876421 01246999999999752 12223466889999999998
Q ss_pred ecCCCC--CCC-----CCC---C-----ChHHHHHHHHHHHHHc---C----CCCCCcEEEEEechHHHHHHHHHHhCCC
Q 004574 560 AGPSIP--IIG-----EGD---K-----LPNDSAEAAVEEVVRR---G----VADPSRIAVGGHSYGAFMTAHLLAHAPH 617 (744)
Q Consensus 560 ~~~~~~--~~g-----~g~---~-----~~~~d~~~~~~~l~~~---~----~~d~~~i~l~G~S~GG~~a~~~~~~~p~ 617 (744)
.....+ ..+ .+. . +.-.|+...+++|.+. + .+|..+|+++|||+||+.++.++.-..+
T Consensus 103 ~~~hpgs~~~~~~~~~~~~~~~~p~~~~erp~dis~lLd~L~~~~~sP~l~~~ld~~~Vgv~GhS~GG~T~m~laGA~~~ 182 (365)
T COG4188 103 APDHPGSNAGGAPAAYAGPGSYAPAEWWERPLDISALLDALLQLTASPALAGRLDPQRVGVLGHSFGGYTAMELAGAELD 182 (365)
T ss_pred eccCCCcccccCChhhcCCcccchhhhhcccccHHHHHHHHHHhhcCcccccccCccceEEEecccccHHHHHhcccccc
Confidence 722111 000 111 1 1122899999999887 4 4788999999999999999998866533
Q ss_pred ce----eEE----EEccCC-CCCCCCCCcccccccchhhcHHHHHh----------cCc-------ccccCCCCCCEEEE
Q 004574 618 LF----CCG----IARSGS-YNKTLTPFGFQTEFRTLWEATNVYIE----------MSP-------ITHANKIKKPILII 671 (744)
Q Consensus 618 ~~----~~~----v~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~-------~~~~~~~~~P~l~i 671 (744)
.- .|. ++..+. .+.... ......|.....+.. ..| ..-+.+++.|++++
T Consensus 183 ~~~~~~~C~~~~~~~~~~~~~~~~~l-----~q~~av~~~~~~~~~rDpriravvA~~p~~~~~Fg~tgl~~v~~P~~~~ 257 (365)
T COG4188 183 AEALLQHCESASRICLDPPGLNGRLL-----NQCAAVWLPRQAYDLRDPRIRAVVAINPALGMIFGTTGLVKVTDPVLLA 257 (365)
T ss_pred HHHHHHHhhhhhhcccCCCCcChhhh-----ccccccccchhhhccccccceeeeeccCCcccccccccceeeecceeee
Confidence 11 111 111111 000000 000000111111111 111 12246789999999
Q ss_pred eeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcc
Q 004574 672 HGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHV 711 (744)
Q Consensus 672 ~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~ 711 (744)
.|..|.+.| +..+....+..|. +....+.+.|++.|.
T Consensus 258 a~s~D~~aP-~~~~~~~~f~~l~--g~~k~~~~vp~a~h~ 294 (365)
T COG4188 258 AGSADGFAP-PVTEQIRPFGYLP--GALKYLRLVPGATHF 294 (365)
T ss_pred cccccccCC-cccccccccccCC--cchhheeecCCCccc
Confidence 999999866 1233333343333 333578888999996
No 190
>KOG2382 consensus Predicted alpha/beta hydrolase [General function prediction only]
Probab=98.82 E-value=4.7e-08 Score=95.54 Aligned_cols=64 Identities=17% Similarity=0.105 Sum_probs=52.5
Q ss_pred CCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 662 NKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 662 ~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
.....|+|+++|.++..++ .++-.++.+.+ ..+++++++++||+.+ .++++.+++.+.+|+.++
T Consensus 250 ~~~~~pvlfi~g~~S~fv~--~~~~~~~~~~f----p~~e~~~ld~aGHwVh-~E~P~~~~~~i~~Fl~~~ 313 (315)
T KOG2382|consen 250 GPYTGPVLFIKGLQSKFVP--DEHYPRMEKIF----PNVEVHELDEAGHWVH-LEKPEEFIESISEFLEEP 313 (315)
T ss_pred cccccceeEEecCCCCCcC--hhHHHHHHHhc----cchheeecccCCceee-cCCHHHHHHHHHHHhccc
Confidence 4457999999999999998 55555554444 4479999999999998 788999999999998765
No 191
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=98.82 E-value=3.4e-07 Score=100.91 Aligned_cols=181 Identities=13% Similarity=0.149 Sum_probs=100.7
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccC---CC-CCCC--CCcccCCccCCCCccc
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCD---LP-PAED--IPVCYNSVREGMRSIS 248 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~---~~-~~~~--~~~~~~~~~~~~~~~~ 248 (744)
+.+.-+.|||||+++|..+++. .+.+|+..+.-...+.. .. ..+. ....+-.....+.++.
T Consensus 70 ~sv~CVR~S~dG~~lAsGSDD~-------------~v~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~ 136 (942)
T KOG0973|consen 70 GSVNCVRFSPDGSYLASGSDDR-------------LVMIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVN 136 (942)
T ss_pred CceeEEEECCCCCeEeeccCcc-------------eEEEeeecccCCcccccccccccccceeeEEEEEecCCCccceec
Confidence 6777888999999999887664 33444333100000000 00 0000 0000011112267899
Q ss_pred eecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEE
Q 004574 249 WRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWL 328 (744)
Q Consensus 249 ~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~ 328 (744)
||||+.. |+..+. -..|.+++...| ...+.+-.....+-.++|.|-|++|+..+.+ ..-.+|+
T Consensus 137 Wsp~~~~-lvS~s~------------DnsViiwn~~tF--~~~~vl~~H~s~VKGvs~DP~Gky~ASqsdD--rtikvwr 199 (942)
T KOG0973|consen 137 WSPDDSL-LVSVSL------------DNSVIIWNAKTF--ELLKVLRGHQSLVKGVSWDPIGKYFASQSDD--RTLKVWR 199 (942)
T ss_pred cCCCccE-EEEecc------------cceEEEEccccc--eeeeeeecccccccceEECCccCeeeeecCC--ceEEEEE
Confidence 9999986 665542 123777777322 2333344456678889999999999987633 4456666
Q ss_pred EcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCC
Q 004574 329 VCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFT 389 (744)
Q Consensus 329 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~ 389 (744)
..- ........+.++........-.++|||||++|+....-.+.....-++.+++|.
T Consensus 200 t~d----w~i~k~It~pf~~~~~~T~f~RlSWSPDG~~las~nA~n~~~~~~~IieR~tWk 256 (942)
T KOG0973|consen 200 TSD----WGIEKSITKPFEESPLTTFFLRLSWSPDGHHLASPNAVNGGKSTIAIIERGTWK 256 (942)
T ss_pred ccc----ceeeEeeccchhhCCCcceeeecccCCCcCeecchhhccCCcceeEEEecCCce
Confidence 332 122233333443322221222255999999999887666666666667776663
No 192
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=98.80 E-value=8.4e-07 Score=84.35 Aligned_cols=177 Identities=14% Similarity=0.152 Sum_probs=96.8
Q ss_pred ceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCC
Q 004574 209 QKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEG 288 (744)
Q Consensus 209 ~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 288 (744)
..|.+|++.|.....+...... ....+.||||+- |+.+.+. ++... ...+|.-+- +=
T Consensus 209 t~i~lw~lkGq~L~~idtnq~~-------------n~~aavSP~GRF-ia~~gFT--pDVkV----wE~~f~kdG---~f 265 (420)
T KOG2096|consen 209 TKICLWDLKGQLLQSIDTNQSS-------------NYDAAVSPDGRF-IAVSGFT--PDVKV----WEPIFTKDG---TF 265 (420)
T ss_pred CcEEEEecCCceeeeecccccc-------------ccceeeCCCCcE-EEEecCC--CCceE----EEEEeccCc---ch
Confidence 4799999998876665433322 235689999986 6555432 11100 011222111 11
Q ss_pred CCceEee---eeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCC-CcceeeeccccccccCCCCCC-ceeeCCC
Q 004574 289 EKPEILH---KLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKD-VAPRVLFDRVFENVYSDPGSP-MMTRTST 363 (744)
Q Consensus 289 ~~~~~l~---~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~-~~~~~l~~~~~~~~~~~~~~~-~~~~spd 363 (744)
.+..++. .....+..++||++.+.++..+ .++..+||-.|+.-.. ..++.|-.++.. .-...+.| .++.||.
T Consensus 266 qev~rvf~LkGH~saV~~~aFsn~S~r~vtvS--kDG~wriwdtdVrY~~~qDpk~Lk~g~~p-l~aag~~p~RL~lsP~ 342 (420)
T KOG2096|consen 266 QEVKRVFSLKGHQSAVLAAAFSNSSTRAVTVS--KDGKWRIWDTDVRYEAGQDPKILKEGSAP-LHAAGSEPVRLELSPS 342 (420)
T ss_pred hhhhhhheeccchhheeeeeeCCCcceeEEEe--cCCcEEEeeccceEecCCCchHhhcCCcc-hhhcCCCceEEEeCCC
Confidence 1222222 2345678899999999999876 3356677777764321 223333222211 00111222 2568999
Q ss_pred CCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEE
Q 004574 364 GTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILT 443 (744)
Q Consensus 364 g~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~ 443 (744)
|+.|+..... .|..+...+|+...-...-. ...-..++|++||+.++-
T Consensus 343 g~~lA~s~gs----------------------~l~~~~se~g~~~~~~e~~h----------~~~Is~is~~~~g~~~at 390 (420)
T KOG2096|consen 343 GDSLAVSFGS----------------------DLKVFASEDGKDYPELEDIH----------STTISSISYSSDGKYIAT 390 (420)
T ss_pred CcEEEeecCC----------------------ceEEEEcccCccchhHHHhh----------cCceeeEEecCCCcEEee
Confidence 9999887521 27788877776543322110 011123699999988774
No 193
>COG2382 Fes Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism]
Probab=98.79 E-value=1.4e-07 Score=90.88 Aligned_cols=217 Identities=19% Similarity=0.115 Sum_probs=125.8
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCC----e
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARR----F 556 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G----~ 556 (744)
..+.+.+.+.-..+....+|+|+++.+. .++|++++.||-.|... +.-......+...| -
T Consensus 68 ~~~~~~~~~~l~~~~~~vv~lppgy~~~--~k~pvl~~~DG~~~~~~--------------g~i~~~~dsli~~g~i~pa 131 (299)
T COG2382 68 PVEEILYSSELLSERRRVVYLPPGYNPL--EKYPVLYLQDGQDWFRS--------------GRIPRILDSLIAAGEIPPA 131 (299)
T ss_pred chhhhhhhhhhccceeEEEEeCCCCCcc--ccccEEEEeccHHHHhc--------------CChHHHHHHHHHcCCCCCc
Confidence 3455555554445677789999998654 46999999999532211 11112344555554 3
Q ss_pred EEEecCCCCCCC----CCCCC--hHHHHHHHHHHHHHcCC--CCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC
Q 004574 557 AVLAGPSIPIIG----EGDKL--PNDSAEAAVEEVVRRGV--ADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS 628 (744)
Q Consensus 557 ~v~~~~~~~~~g----~g~~~--~~~d~~~~~~~l~~~~~--~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~ 628 (744)
+++..++.+... ++... ...-+.+.+-++.+... -+.++-+|+|.|+||.++++++.++|+.|-.++..+|.
T Consensus 132 i~vgid~~d~~~R~~~~~~n~~~~~~L~~eLlP~v~~~yp~~~~a~~r~L~G~SlGG~vsL~agl~~Pe~FG~V~s~Sps 211 (299)
T COG2382 132 ILVGIDYIDVKKRREELHCNEAYWRFLAQELLPYVEERYPTSADADGRVLAGDSLGGLVSLYAGLRHPERFGHVLSQSGS 211 (299)
T ss_pred eEEecCCCCHHHHHHHhcccHHHHHHHHHHhhhhhhccCcccccCCCcEEeccccccHHHHHHHhcCchhhceeeccCCc
Confidence 344433322110 11111 11134555566666432 34578999999999999999999999999999999998
Q ss_pred CCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCC
Q 004574 629 YNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFE 708 (744)
Q Consensus 629 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 708 (744)
+.+...... ..... ....+....+.....=++...++.+.+ ....+++.+.|+..+.+..+..|+|
T Consensus 212 ~~~~~~~~~---~~~~~------~~~l~~~~a~~~~~~~~l~~g~~~~~~----~~pNr~L~~~L~~~g~~~~yre~~G- 277 (299)
T COG2382 212 FWWTPLDTQ---PQGEV------AESLKILHAIGTDERIVLTTGGEEGDF----LRPNRALAAQLEKKGIPYYYREYPG- 277 (299)
T ss_pred cccCccccc---cccch------hhhhhhhhccCccceEEeecCCccccc----cchhHHHHHHHHhcCCcceeeecCC-
Confidence 765422111 00000 111111111222223233333444444 4556789999999999999999998
Q ss_pred CcccCccccHHHHHHHHHHHHHHhc
Q 004574 709 HHVYAARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 709 ~H~~~~~~~~~~~~~~~~~fl~~~l 733 (744)
||.+.. +...+.++|...+
T Consensus 278 gHdw~~------Wr~~l~~~L~~l~ 296 (299)
T COG2382 278 GHDWAW------WRPALAEGLQLLL 296 (299)
T ss_pred CCchhH------hHHHHHHHHHHhh
Confidence 797653 2334455554433
No 194
>PRK13616 lipoprotein LpqB; Provisional
Probab=98.79 E-value=1.6e-07 Score=103.23 Aligned_cols=168 Identities=14% Similarity=0.095 Sum_probs=110.0
Q ss_pred ccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCe
Q 004574 89 FGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTA 168 (744)
Q Consensus 89 ~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~ 168 (744)
+.+++.||||+.++|+........ ...++||+++..|..
T Consensus 352 vsspaiSpdG~~vA~v~~~~~~~~-----------------------------------------d~~s~Lwv~~~gg~~ 390 (591)
T PRK13616 352 ITSAALSRSGRQVAAVVTLGRGAP-----------------------------------------DPASSLWVGPLGGVA 390 (591)
T ss_pred cccceECCCCCEEEEEEeecCCCC-----------------------------------------CcceEEEEEeCCCcc
Confidence 568899999999999853211100 012689999987777
Q ss_pred eecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccc
Q 004574 169 KDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSIS 248 (744)
Q Consensus 169 ~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 248 (744)
++++.......|+|||||+.|+|..+.+...+.. ......++|+.++++++.++ ... .++..+.
T Consensus 391 ~~lt~g~~~t~PsWspDG~~lw~v~dg~~~~~v~-~~~~~gql~~~~vd~ge~~~--~~~-------------g~Issl~ 454 (591)
T PRK13616 391 VQVLEGHSLTRPSWSLDADAVWVVVDGNTVVRVI-RDPATGQLARTPVDASAVAS--RVP-------------GPISELQ 454 (591)
T ss_pred eeeecCCCCCCceECCCCCceEEEecCcceEEEe-ccCCCceEEEEeccCchhhh--ccC-------------CCcCeEE
Confidence 8887775588999999999999997543211111 11123478887877776554 211 1267899
Q ss_pred eecCCCceEEEEEeecCCCCCccCCccceEEe---ccCCCCCCCCceE------eeeeccc-eeceeeccCCceEEEeee
Q 004574 249 WRADKPSTLYWVEAQDRGDANVEVSPRDIIYT---QPAEPAEGEKPEI------LHKLDLR-FRSVSWCDDSLALVNETW 318 (744)
Q Consensus 249 ~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~---~~~~~~~~~~~~~------l~~~~~~-~~~~~~SpDg~~l~~~~~ 318 (744)
|||||++ |+|+.. ++|++ ... .+|+ .. |...... ..++.|.+|+.. +....
T Consensus 455 wSpDG~R-iA~i~~-------------g~v~Va~Vvr~---~~G~-~~l~~~~~l~~~l~~~~~~l~W~~~~~L-~V~~~ 515 (591)
T PRK13616 455 LSRDGVR-AAMIIG-------------GKVYLAVVEQT---EDGQ-YALTNPREVGPGLGDTAVSLDWRTGDSL-VVGRS 515 (591)
T ss_pred ECCCCCE-EEEEEC-------------CEEEEEEEEeC---CCCc-eeecccEEeecccCCccccceEecCCEE-EEEec
Confidence 9999999 888851 24666 443 4454 33 3333333 577999999994 43332
Q ss_pred eeccceeEEEEcCCCC
Q 004574 319 YKTSQTRTWLVCPGSK 334 (744)
Q Consensus 319 ~~~~~~~l~~~~~~~~ 334 (744)
.....+|++++++.
T Consensus 516 --~~~~~v~~v~vDG~ 529 (591)
T PRK13616 516 --DPEHPVWYVNLDGS 529 (591)
T ss_pred --CCCCceEEEecCCc
Confidence 34567999999984
No 195
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=98.78 E-value=5.1e-07 Score=90.31 Aligned_cols=251 Identities=14% Similarity=0.099 Sum_probs=144.3
Q ss_pred eeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcccc
Q 004574 7 IGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLN 86 (744)
Q Consensus 7 ~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~ 86 (744)
+.+++..| +....+.... + .+....|+-+|.||+-... ++ .+-++|..+|+.++.......
T Consensus 259 ~riw~~~G----~l~~tl~~Hk-g-PI~slKWnk~G~yilS~~v--------D~--ttilwd~~~g~~~q~f~~~s~--- 319 (524)
T KOG0273|consen 259 ARIWNKDG----NLISTLGQHK-G-PIFSLKWNKKGTYILSGGV--------DG--TTILWDAHTGTVKQQFEFHSA--- 319 (524)
T ss_pred EEEEecCc----hhhhhhhccC-C-ceEEEEEcCCCCEEEeccC--------Cc--cEEEEeccCceEEEeeeeccC---
Confidence 44555555 4444444222 2 4788999999999998532 33 345558888887665433221
Q ss_pred ccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC--
Q 004574 87 AVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-- 164 (744)
Q Consensus 87 ~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-- 164 (744)
...++.|-.+.+...-. . + +.|+++-+
T Consensus 320 -~~lDVdW~~~~~F~ts~-t--d-----------------------------------------------~~i~V~kv~~ 348 (524)
T KOG0273|consen 320 -PALDVDWQSNDEFATSS-T--D-----------------------------------------------GCIHVCKVGE 348 (524)
T ss_pred -CccceEEecCceEeecC-C--C-----------------------------------------------ceEEEEEecC
Confidence 12457888777643221 1 0 23444444
Q ss_pred CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCC
Q 004574 165 DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREG 243 (744)
Q Consensus 165 ~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~ 243 (744)
++-...+.-+ +.+..+.|.|.|+-|+-.+.+ ..+-+|+........- +.+....
T Consensus 349 ~~P~~t~~GH~g~V~alk~n~tg~LLaS~SdD-------------~TlkiWs~~~~~~~~~------------l~~Hske 403 (524)
T KOG0273|consen 349 DRPVKTFIGHHGEVNALKWNPTGSLLASCSDD-------------GTLKIWSMGQSNSVHD------------LQAHSKE 403 (524)
T ss_pred CCcceeeecccCceEEEEECCCCceEEEecCC-------------CeeEeeecCCCcchhh------------hhhhccc
Confidence 4444444445 899999999999977655443 2566676443321111 1111222
Q ss_pred CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 244 MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
+..+.|||+|.. ......+...........+.++++ ..+. .-.++.+...+..++|||||+++++.+ .
T Consensus 404 i~t~~wsp~g~v----~~n~~~~~~l~sas~dstV~lwdv---~~gv~i~~f~kH~~pVysvafS~~g~ylAsGs----~ 472 (524)
T KOG0273|consen 404 IYTIKWSPTGPV----TSNPNMNLMLASASFDSTVKLWDV---ESGVPIHTLMKHQEPVYSVAFSPNGRYLASGS----L 472 (524)
T ss_pred eeeEeecCCCCc----cCCCcCCceEEEeecCCeEEEEEc---cCCceeEeeccCCCceEEEEecCCCcEEEecC----C
Confidence 567788998863 111112222112223345677777 3343 344667788999999999999999876 2
Q ss_pred ceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEee
Q 004574 323 QTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIK 372 (744)
Q Consensus 323 ~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~ 372 (744)
...+.+++...+ +..+-.. + ..+.+.++|+.+|.+|.....
T Consensus 473 dg~V~iws~~~~--~l~~s~~-~------~~~Ifel~Wn~~G~kl~~~~s 513 (524)
T KOG0273|consen 473 DGCVHIWSTKTG--KLVKSYQ-G------TGGIFELCWNAAGDKLGACAS 513 (524)
T ss_pred CCeeEeccccch--heeEeec-C------CCeEEEEEEcCCCCEEEEEec
Confidence 234666666653 2111111 1 112344889999999887763
No 196
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=98.76 E-value=5.8e-06 Score=84.56 Aligned_cols=314 Identities=12% Similarity=0.091 Sum_probs=171.3
Q ss_pred eeeecCCCCCcccceeecCCCCeEEEeeeccc-ccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcE
Q 004574 22 KEVHGYPDGAKINFVSWSPDGKRIAFSVRVDE-EDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTL 100 (744)
Q Consensus 22 ~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~-~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~ 100 (744)
.++..+.+. .+....+||.-+||+--+.... ...+.+.+.+|.+.|+.+|..++-....... ...+.-+.||.|+|+
T Consensus 242 ~r~~RF~Hp-~Vq~idfSP~EkYLVT~s~~p~~~~~~d~e~~~l~IWDI~tG~lkrsF~~~~~~-~~~WP~frWS~DdKy 319 (698)
T KOG2314|consen 242 DRIQRFYHP-GVQFIDFSPNEKYLVTYSPEPIIVEEDDNEGQQLIIWDIATGLLKRSFPVIKSP-YLKWPIFRWSHDDKY 319 (698)
T ss_pred HHHHhccCC-CceeeecCCccceEEEecCCccccCcccCCCceEEEEEccccchhcceeccCCC-ccccceEEeccCCce
Confidence 345544443 5888999999999887665221 1122356788999999999987765442111 112456799999999
Q ss_pred EEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCCceeeee
Q 004574 101 LIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTPAVYTAV 180 (744)
Q Consensus 101 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~~~~~~~ 180 (744)
++-.... + +.+. ....++++ +++.-.+ .++..+
T Consensus 320 ~Arm~~~--s-------------isIy---------------------------Etpsf~ll--d~Kslki---~gIr~F 352 (698)
T KOG2314|consen 320 FARMTGN--S-------------ISIY---------------------------ETPSFMLL--DKKSLKI---SGIRDF 352 (698)
T ss_pred eEEeccc--e-------------EEEE---------------------------ecCceeee--cccccCC---ccccCc
Confidence 8865210 0 0000 00112222 2211111 567889
Q ss_pred ccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCC-CCCCCCCcccCCccCCCCccceecCCCceEEE
Q 004574 181 EPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDL-PPAEDIPVCYNSVREGMRSISWRADKPSTLYW 259 (744)
Q Consensus 181 ~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~ 259 (744)
.|||-+.-|+|-..+.. .-+..+-+..+-.++..+.... ..+ .-.+.|-.+|.+ |++
T Consensus 353 swsP~~~llAYwtpe~~--------~~parvtL~evPs~~~iRt~nlfnVs-------------DckLhWQk~gdy-Lcv 410 (698)
T KOG2314|consen 353 SWSPTSNLLAYWTPETN--------NIPARVTLMEVPSKREIRTKNLFNVS-------------DCKLHWQKSGDY-LCV 410 (698)
T ss_pred ccCCCcceEEEEccccc--------CCcceEEEEecCccceeeeccceeee-------------ccEEEeccCCcE-EEE
Confidence 99999999999976642 1234566666544332221111 111 113679889998 555
Q ss_pred EEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEee-eeeccceeEEEEcCCCCCCcc
Q 004574 260 VEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNET-WYKTSQTRTWLVCPGSKDVAP 338 (744)
Q Consensus 260 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~-~~~~~~~~l~~~~~~~~~~~~ 338 (744)
-..+-.-.......++-.|+.++. + .-+.........+-.++|-|.|..++..+ +....+-..|.+.... ..+
T Consensus 411 kvdR~tK~~~~g~f~n~eIfrire---K-dIpve~velke~vi~FaWEP~gdkF~vi~g~~~k~tvsfY~~e~~~--~~~ 484 (698)
T KOG2314|consen 411 KVDRHTKSKVKGQFSNLEIFRIRE---K-DIPVEVVELKESVIAFAWEPHGDKFAVISGNTVKNTVSFYAVETNI--KKP 484 (698)
T ss_pred EEEeeccccccceEeeEEEEEeec---c-CCCceeeecchheeeeeeccCCCeEEEEEccccccceeEEEeecCC--Cch
Confidence 443322111111122334666654 2 23344455566777899999999998876 3333455667666544 333
Q ss_pred eeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhh
Q 004574 339 RVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKY 418 (744)
Q Consensus 339 ~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~ 418 (744)
..+-...-.. .+ .+.|||.|++++.....+ .+..|.-+|..-...+.+-..+
T Consensus 485 ~lVk~~dk~~-----~N-~vfwsPkG~fvvva~l~s------------------~~g~l~F~D~~~a~~k~~~~~e---- 536 (698)
T KOG2314|consen 485 SLVKELDKKF-----AN-TVFWSPKGRFVVVAALVS------------------RRGDLEFYDTDYADLKDTASPE---- 536 (698)
T ss_pred hhhhhhcccc-----cc-eEEEcCCCcEEEEEEecc------------------cccceEEEecchhhhhhccCcc----
Confidence 3332211111 11 155999999999887442 1223666666432222221111
Q ss_pred hhheeeeecCCcceecccCCCEEEEEEec
Q 004574 419 FETAVALVFGQGEEDINLNQLKILTSKES 447 (744)
Q Consensus 419 ~~~~~~~~~~~~~~~~s~d~~~~~~~~~~ 447 (744)
+... +..-|.|.|+.++.+.+.
T Consensus 537 ---h~~a----t~veWDPtGRYvvT~ss~ 558 (698)
T KOG2314|consen 537 ---HFAA----TEVEWDPTGRYVVTSSSS 558 (698)
T ss_pred ---cccc----ccceECCCCCEEEEeeeh
Confidence 1111 235788899877765543
No 197
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=98.76 E-value=7.5e-07 Score=90.04 Aligned_cols=244 Identities=10% Similarity=0.089 Sum_probs=128.1
Q ss_pred EEEEEcCCC------CeeecCCC--ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCC
Q 004574 158 QLVLGSLDG------TAKDFGTP--AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPP 229 (744)
Q Consensus 158 ~l~~~~~~G------~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~ 229 (744)
.+..+|..| ..++|... ..+..+.|||.|..|++.+... +.-++|.+|.+......+..
T Consensus 190 ~v~~wDf~gMdas~~~fr~l~P~E~h~i~sl~ys~Tg~~iLvvsg~a-------------qakl~DRdG~~~~e~~KGDQ 256 (641)
T KOG0772|consen 190 TVKFWDFQGMDASMRSFRQLQPCETHQINSLQYSVTGDQILVVSGSA-------------QAKLLDRDGFEIVEFSKGDQ 256 (641)
T ss_pred eEEEEecccccccchhhhccCcccccccceeeecCCCCeEEEEecCc-------------ceeEEccCCceeeeeeccch
Confidence 455666655 22344322 5677999999999999997664 56677777776555433321
Q ss_pred CCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee------ccceec
Q 004574 230 AEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL------DLRFRS 303 (744)
Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~------~~~~~~ 303 (744)
.-.....--....++..-.|.|+.+. .+.+...++ .+-++++. +..+..++... .-.+..
T Consensus 257 YI~Dm~nTKGHia~lt~g~whP~~k~-~FlT~s~Dg-----------tlRiWdv~--~~k~q~qVik~k~~~g~Rv~~ts 322 (641)
T KOG0772|consen 257 YIRDMYNTKGHIAELTCGCWHPDNKE-EFLTCSYDG-----------TLRIWDVN--NTKSQLQVIKTKPAGGKRVPVTS 322 (641)
T ss_pred hhhhhhccCCceeeeeccccccCccc-ceEEecCCC-----------cEEEEecC--CchhheeEEeeccCCCcccCcee
Confidence 10000000001112445689999988 444432222 24444442 22333333322 224677
Q ss_pred eeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEE
Q 004574 304 VSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILL 383 (744)
Q Consensus 304 ~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~ 383 (744)
.+|+|||+.|+... . ++..++|-..-.+ ..+......+-.. ..+...++||+||++|+.+..+
T Consensus 323 C~~nrdg~~iAagc-~-DGSIQ~W~~~~~~--v~p~~~vk~AH~~---g~~Itsi~FS~dg~~LlSRg~D---------- 385 (641)
T KOG0772|consen 323 CAWNRDGKLIAAGC-L-DGSIQIWDKGSRT--VRPVMKVKDAHLP---GQDITSISFSYDGNYLLSRGFD---------- 385 (641)
T ss_pred eecCCCcchhhhcc-c-CCceeeeecCCcc--cccceEeeeccCC---CCceeEEEeccccchhhhccCC----------
Confidence 89999999988776 2 2444555432111 2222222211110 0022336799999999887522
Q ss_pred ccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecC--CCCceEEEEECCC
Q 004574 384 NGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESK--TEITQYHILSWPL 461 (744)
Q Consensus 384 ~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~--~~~~~i~~~~~~~ 461 (744)
..|.+||+.--+ +.|.... ..... -..+.+.||||.+.|+...+.. ..++.|+.+|..+
T Consensus 386 -----------~tLKvWDLrq~k-kpL~~~t------gL~t~-~~~tdc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~t 446 (641)
T KOG0772|consen 386 -----------DTLKVWDLRQFK-KPLNVRT------GLPTP-FPGTDCCFSPDDKLILTGTSAPNGMTAGTLFFFDRMT 446 (641)
T ss_pred -----------Cceeeeeccccc-cchhhhc------CCCcc-CCCCccccCCCceEEEecccccCCCCCceEEEEeccc
Confidence 248899985221 1121111 01111 1245689999997655443322 2456788887655
Q ss_pred Cce
Q 004574 462 KKS 464 (744)
Q Consensus 462 g~~ 464 (744)
-+.
T Consensus 447 ~d~ 449 (641)
T KOG0772|consen 447 LDT 449 (641)
T ss_pred eee
Confidence 433
No 198
>PLN00181 protein SPA1-RELATED; Provisional
Probab=98.74 E-value=1.2e-05 Score=94.16 Aligned_cols=162 Identities=10% Similarity=0.046 Sum_probs=89.3
Q ss_pred eEEEEEcC-CC-CeeecCCC-ceeeeeccCC-CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCee-eeccCCCCCC
Q 004574 157 AQLVLGSL-DG-TAKDFGTP-AVYTAVEPSP-DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV-RELCDLPPAE 231 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l~~~-~~~~~~~~Sp-DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~-~~l~~~~~~~ 231 (744)
+.|.++|+ ++ ....+..+ ..+..++|+| ||..|+-.+.+ ..+.+||+..+.. ..+. ....
T Consensus 555 g~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~D-------------g~v~iWd~~~~~~~~~~~-~~~~- 619 (793)
T PLN00181 555 GVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDD-------------GSVKLWSINQGVSIGTIK-TKAN- 619 (793)
T ss_pred CeEEEEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCC-------------CEEEEEECCCCcEEEEEe-cCCC-
Confidence 56778888 56 44455445 7788999997 67766555333 3789999875432 2222 1111
Q ss_pred CCCcccCCccCCCCcccee-cCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEeeeeccceeceeeccC
Q 004574 232 DIPVCYNSVREGMRSISWR-ADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEILHKLDLRFRSVSWCDD 309 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~s-pDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~~~~~~~~~~~~SpD 309 (744)
+..+.|+ ++|.. |+..+ ..+.|+++++. .... ...+......+..+.|+ |
T Consensus 620 ------------v~~v~~~~~~g~~-latgs------------~dg~I~iwD~~--~~~~~~~~~~~h~~~V~~v~f~-~ 671 (793)
T PLN00181 620 ------------ICCVQFPSESGRS-LAFGS------------ADHKVYYYDLR--NPKLPLCTMIGHSKTVSYVRFV-D 671 (793)
T ss_pred ------------eEEEEEeCCCCCE-EEEEe------------CCCeEEEEECC--CCCccceEecCCCCCEEEEEEe-C
Confidence 3456774 45665 55442 12358888873 2221 22334345567888897 7
Q ss_pred CceEEEeeeeeccceeEEEEcCCCCC----CcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 310 SLALVNETWYKTSQTRTWLVCPGSKD----VAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 310 g~~l~~~~~~~~~~~~l~~~~~~~~~----~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
+..|+..+. ++ .|.++|+.... ........+... ....++|+|+|.+|+...
T Consensus 672 ~~~lvs~s~--D~--~ikiWd~~~~~~~~~~~~l~~~~gh~~------~i~~v~~s~~~~~lasgs 727 (793)
T PLN00181 672 SSTLVSSST--DN--TLKLWDLSMSISGINETPLHSFMGHTN------VKNFVGLSVSDGYIATGS 727 (793)
T ss_pred CCEEEEEEC--CC--EEEEEeCCCCccccCCcceEEEcCCCC------CeeEEEEcCCCCEEEEEe
Confidence 777776552 22 35555654310 111111111111 112256999999888775
No 199
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.74 E-value=4.1e-07 Score=94.81 Aligned_cols=289 Identities=13% Similarity=0.055 Sum_probs=137.9
Q ss_pred ceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCc--
Q 004574 6 GIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDI-- 83 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~-- 83 (744)
.|-++++.. ++ .+..+..|......++||||++|+.... ..+++-++|.++.+..+.......
T Consensus 59 ~vsviD~~~----~~--~v~~i~~G~~~~~i~~s~DG~~~~v~n~---------~~~~v~v~D~~tle~v~~I~~~~~~~ 123 (369)
T PF02239_consen 59 TVSVIDLAT----GK--VVATIKVGGNPRGIAVSPDGKYVYVANY---------EPGTVSVIDAETLEPVKTIPTGGMPV 123 (369)
T ss_dssp EEEEEETTS----SS--EEEEEE-SSEEEEEEE--TTTEEEEEEE---------ETTEEEEEETTT--EEEEEE--EE-T
T ss_pred eEEEEECCc----cc--EEEEEecCCCcceEEEcCCCCEEEEEec---------CCCceeEeccccccceeecccccccc
Confidence 455566633 33 3333456667788999999999977553 336899999888776543211110
Q ss_pred -cccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 84 -CLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 84 -~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
....-+..+.-||+...+++...+ .++||++
T Consensus 124 ~~~~~Rv~aIv~s~~~~~fVv~lkd------------------------------------------------~~~I~vV 155 (369)
T PF02239_consen 124 DGPESRVAAIVASPGRPEFVVNLKD------------------------------------------------TGEIWVV 155 (369)
T ss_dssp TTS---EEEEEE-SSSSEEEEEETT------------------------------------------------TTEEEEE
T ss_pred cccCCCceeEEecCCCCEEEEEEcc------------------------------------------------CCeEEEE
Confidence 000012234456777765554321 1689999
Q ss_pred cC-CCCe---eecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccC
Q 004574 163 SL-DGTA---KDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYN 238 (744)
Q Consensus 163 ~~-~G~~---~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~ 238 (744)
|. +.+. +.+.......+..|+|||++++...... ..+-++|...+....+...... ..|....
T Consensus 156 dy~d~~~~~~~~i~~g~~~~D~~~dpdgry~~va~~~s------------n~i~viD~~~~k~v~~i~~g~~-p~~~~~~ 222 (369)
T PF02239_consen 156 DYSDPKNLKVTTIKVGRFPHDGGFDPDGRYFLVAANGS------------NKIAVIDTKTGKLVALIDTGKK-PHPGPGA 222 (369)
T ss_dssp ETTTSSCEEEEEEE--TTEEEEEE-TTSSEEEEEEGGG------------TEEEEEETTTTEEEEEEE-SSS-BEETTEE
T ss_pred EeccccccceeeecccccccccccCcccceeeeccccc------------ceeEEEeeccceEEEEeecccc-ccccccc
Confidence 98 4422 2332334566789999999987765543 3677888776655443322100 0110011
Q ss_pred CccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeee
Q 004574 239 SVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETW 318 (744)
Q Consensus 239 ~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~ 318 (744)
.-.+.-.++.|.-.+.... .+..... +. +.+++. ..-+..+-....+.--.+..+||+++++....
T Consensus 223 ~~php~~g~vw~~~~~~~~-~~~~ig~-~~---------v~v~d~---~~wkvv~~I~~~G~glFi~thP~s~~vwvd~~ 288 (369)
T PF02239_consen 223 NFPHPGFGPVWATSGLGYF-AIPLIGT-DP---------VSVHDD---YAWKVVKTIPTQGGGLFIKTHPDSRYVWVDTF 288 (369)
T ss_dssp EEEETTTEEEEEEEBSSSS-EEEEEE---T---------TT-STT---TBTSEEEEEE-SSSS--EE--TT-SEEEEE-T
T ss_pred cccCCCcceEEeeccccce-ecccccC-Cc---------cccchh---hcCeEEEEEECCCCcceeecCCCCccEEeecc
Confidence 1111122344544433211 1111111 11 122232 12222222222333356677999999998732
Q ss_pred eeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEE
Q 004574 319 YKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLD 398 (744)
Q Consensus 319 ~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~ 398 (744)
..+....|.++|..+- ...+.+....... ...+.|++||+.+.++.... ...|.
T Consensus 289 ~~~~~~~v~viD~~tl-~~~~~i~~~~~~~------~~h~ef~~dG~~v~vS~~~~-------------------~~~i~ 342 (369)
T PF02239_consen 289 LNPDADTVQVIDKKTL-KVVKTITPGPGKR------VVHMEFNPDGKEVWVSVWDG-------------------NGAIV 342 (369)
T ss_dssp T-SSHT-EEEEECCGT-EEEE-HHHHHT--------EEEEEE-TTSSEEEEEEE---------------------TTEEE
T ss_pred CCCCCceEEEEECcCc-ceeEEEeccCCCc------EeccEECCCCCEEEEEEecC-------------------CCEEE
Confidence 2234568999998873 1112332222100 11156999999988876321 11699
Q ss_pred EEecCCCceeEE
Q 004574 399 LFDINTGSKERI 410 (744)
Q Consensus 399 ~~d~~~g~~~~l 410 (744)
++|..|.+....
T Consensus 343 v~D~~Tl~~~~~ 354 (369)
T PF02239_consen 343 VYDAKTLKEKKR 354 (369)
T ss_dssp EEETTTTEEEEE
T ss_pred EEECCCcEEEEE
Confidence 999988876544
No 200
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=98.73 E-value=4.5e-06 Score=78.76 Aligned_cols=234 Identities=12% Similarity=0.121 Sum_probs=146.6
Q ss_pred cccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC
Q 004574 88 VFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG 166 (744)
Q Consensus 88 ~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G 166 (744)
-+..+.|++|+++|+..+.+ +.|.++|. ++
T Consensus 57 Ki~~~~ws~Dsr~ivSaSqD-------------------------------------------------GklIvWDs~Tt 87 (343)
T KOG0286|consen 57 KIYAMDWSTDSRRIVSASQD-------------------------------------------------GKLIVWDSFTT 87 (343)
T ss_pred ceeeeEecCCcCeEEeeccC-------------------------------------------------CeEEEEEcccc
Confidence 35678999999999987542 57888888 76
Q ss_pred -CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC--e-----eeeccCCCCCCCCCccc
Q 004574 167 -TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK--L-----VRELCDLPPAEDIPVCY 237 (744)
Q Consensus 167 -~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~--~-----~~~l~~~~~~~~~~~~~ 237 (744)
+...++-+ ..+-..+|||.|+.|+-...++ .--+|++... + .+.|..+.
T Consensus 88 nK~haipl~s~WVMtCA~sPSg~~VAcGGLdN-------------~Csiy~ls~~d~~g~~~v~r~l~gHt--------- 145 (343)
T KOG0286|consen 88 NKVHAIPLPSSWVMTCAYSPSGNFVACGGLDN-------------KCSIYPLSTRDAEGNVRVSRELAGHT--------- 145 (343)
T ss_pred cceeEEecCceeEEEEEECCCCCeEEecCcCc-------------eeEEEecccccccccceeeeeecCcc---------
Confidence 66666666 7888899999999999876554 3334444322 1 11122221
Q ss_pred CCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCc-eEeeeeccceeceeecc-CCceEEE
Q 004574 238 NSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKP-EILHKLDLRFRSVSWCD-DSLALVN 315 (744)
Q Consensus 238 ~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~l~~~~~~~~~~~~Sp-Dg~~l~~ 315 (744)
..+....|.+|+. |+-.+ ++ ...-++|+ +.++. ..+..+.+.+..++++| |++.++.
T Consensus 146 ----gylScC~f~dD~~--ilT~S----GD--------~TCalWDi---e~g~~~~~f~GH~gDV~slsl~p~~~ntFvS 204 (343)
T KOG0286|consen 146 ----GYLSCCRFLDDNH--ILTGS----GD--------MTCALWDI---ETGQQTQVFHGHTGDVMSLSLSPSDGNTFVS 204 (343)
T ss_pred ----ceeEEEEEcCCCc--eEecC----CC--------ceEEEEEc---ccceEEEEecCCcccEEEEecCCCCCCeEEe
Confidence 2255677777775 33221 12 24677777 44443 44455577899999999 8998887
Q ss_pred eeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCc
Q 004574 316 ETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIP 395 (744)
Q Consensus 316 ~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 395 (744)
.+-+. ..+++|+-. +..++.+.++..++.. +.+-|+|..++..+. ..
T Consensus 205 g~cD~----~aklWD~R~--~~c~qtF~ghesDINs------v~ffP~G~afatGSD---------------------D~ 251 (343)
T KOG0286|consen 205 GGCDK----SAKLWDVRS--GQCVQTFEGHESDINS------VRFFPSGDAFATGSD---------------------DA 251 (343)
T ss_pred ccccc----ceeeeeccC--cceeEeecccccccce------EEEccCCCeeeecCC---------------------Cc
Confidence 65222 345666666 4556766666555433 678899987776641 22
Q ss_pred eEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEEC
Q 004574 396 FLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSW 459 (744)
Q Consensus 396 ~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~ 459 (744)
....||+..+..-.++..+.. ..+...++||..|+.|+..+.+ ....+||.
T Consensus 252 tcRlyDlRaD~~~a~ys~~~~---------~~gitSv~FS~SGRlLfagy~d----~~c~vWDt 302 (343)
T KOG0286|consen 252 TCRLYDLRADQELAVYSHDSI---------ICGITSVAFSKSGRLLFAGYDD----FTCNVWDT 302 (343)
T ss_pred eeEEEeecCCcEEeeeccCcc---------cCCceeEEEcccccEEEeeecC----CceeEeec
Confidence 367888887776666654322 1122336888889765544432 34666663
No 201
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=98.73 E-value=1.7e-06 Score=88.71 Aligned_cols=225 Identities=16% Similarity=0.187 Sum_probs=142.9
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
..++.+++.. ..+...+|. ..+++...+||-+..-..+...+ +....-...+||-++ ..+....+....-
T Consensus 146 nev~f~~~~~--f~~~~~kl~----~~~i~~f~lSpgp~~~~vAvyvP-e~kGaPa~vri~~~~-~~~~~~~~a~ksF-- 215 (566)
T KOG2315|consen 146 NEVQFYDLGS--FKTIQHKLS----VSGITMLSLSPGPEPPFVAVYVP-EKKGAPASVRIYKYP-EEGQHQPVANKSF-- 215 (566)
T ss_pred ceEEEEecCC--ccceeeeee----ccceeeEEecCCCCCceEEEEcc-CCCCCCcEEEEeccc-cccccchhhhccc--
Confidence 3566677654 345555664 23578888898755433333322 122224456677766 3444444432221
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
+...-..+.|.+-|..|+.+... ..|...-.||+...||.++.
T Consensus 216 Fkadkvqm~WN~~gt~LLvLast-------------------------------------dVDktn~SYYGEq~Lyll~t 258 (566)
T KOG2315|consen 216 FKADKVQMKWNKLGTALLVLAST-------------------------------------DVDKTNASYYGEQTLYLLAT 258 (566)
T ss_pred cccceeEEEeccCCceEEEEEEE-------------------------------------eecCCCccccccceEEEEEe
Confidence 00012367899999987765321 12233344667788999999
Q ss_pred CCCe--eecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC
Q 004574 165 DGTA--KDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 165 ~G~~--~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
+|+. .+|...+.++++.|+|+|+..++. +|-.+..+-++|+++..+..+..++.+
T Consensus 259 ~g~s~~V~L~k~GPVhdv~W~~s~~EF~Vv-----------yGfMPAkvtifnlr~~~v~df~egpRN------------ 315 (566)
T KOG2315|consen 259 QGESVSVPLLKEGPVHDVTWSPSGREFAVV-----------YGFMPAKVTIFNLRGKPVFDFPEGPRN------------ 315 (566)
T ss_pred cCceEEEecCCCCCceEEEECCCCCEEEEE-----------EecccceEEEEcCCCCEeEeCCCCCcc------------
Confidence 8844 344555999999999999988777 334467899999999988877766554
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee-ccceeceeeccCCceEEEee
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL-DLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~-~~~~~~~~~SpDg~~l~~~~ 317 (744)
.+-|+|.|+. |+++.+.+-.+ .+-++|+ .+ .+.+... .....-+.|||||++|+.++
T Consensus 316 ---~~~fnp~g~i-i~lAGFGNL~G---------~mEvwDv---~n--~K~i~~~~a~~tt~~eW~PdGe~flTAT 373 (566)
T KOG2315|consen 316 ---TAFFNPHGNI-ILLAGFGNLPG---------DMEVWDV---PN--RKLIAKFKAANTTVFEWSPDGEYFLTAT 373 (566)
T ss_pred ---ceEECCCCCE-EEEeecCCCCC---------ceEEEec---cc--hhhccccccCCceEEEEcCCCcEEEEEe
Confidence 4679999997 77776654333 3777777 32 2334333 33456688999999999876
No 202
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=98.72 E-value=3.2e-06 Score=85.64 Aligned_cols=251 Identities=15% Similarity=0.134 Sum_probs=133.4
Q ss_pred CCCC-cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCce-----eccccCCCccccccccceEEecCCcEE
Q 004574 28 PDGA-KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA-----KPLFESPDICLNAVFGSFVWVNNSTLL 101 (744)
Q Consensus 28 ~~~~-~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~-----~~lt~~~~~~~~~~~~~~~wspDg~~l 101 (744)
+++. .++..++-|.|-+++-- .....+..+|..|-.. |+|-..+.+ .+..+.|||.|..|
T Consensus 164 ~hgtk~Vsal~~Dp~GaR~~sG----------s~Dy~v~~wDf~gMdas~~~fr~l~P~E~h----~i~sl~ys~Tg~~i 229 (641)
T KOG0772|consen 164 KHGTKIVSALAVDPSGARFVSG----------SLDYTVKFWDFQGMDASMRSFRQLQPCETH----QINSLQYSVTGDQI 229 (641)
T ss_pred cCCceEEEEeeecCCCceeeec----------cccceEEEEecccccccchhhhccCccccc----ccceeeecCCCCeE
Confidence 3554 57788999999988774 3445667778887654 334322221 24578999999998
Q ss_pred EEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC---CCCeeecCCCceee
Q 004574 102 IFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL---DGTAKDFGTPAVYT 178 (744)
Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~---~G~~~~l~~~~~~~ 178 (744)
+.++-...... .. -++..+.....+.-|+.|+ .|.+. .+.
T Consensus 230 Lvvsg~aqakl----~D---------------------------RdG~~~~e~~KGDQYI~Dm~nTKGHia------~lt 272 (641)
T KOG0772|consen 230 LVVSGSAQAKL----LD---------------------------RDGFEIVEFSKGDQYIRDMYNTKGHIA------ELT 272 (641)
T ss_pred EEEecCcceeE----Ec---------------------------cCCceeeeeeccchhhhhhhccCCcee------eee
Confidence 87743211110 00 0000011111134455554 22333 344
Q ss_pred eeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCC-CeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceE
Q 004574 179 AVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDG-KLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTL 257 (744)
Q Consensus 179 ~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l 257 (744)
.-.|.|+.+..+.+.... ..+.+|+.+. +..+++.....+.+ .+-.+...+|+|||+. |
T Consensus 273 ~g~whP~~k~~FlT~s~D------------gtlRiWdv~~~k~q~qVik~k~~~g-------~Rv~~tsC~~nrdg~~-i 332 (641)
T KOG0772|consen 273 CGCWHPDNKEEFLTCSYD------------GTLRIWDVNNTKSQLQVIKTKPAGG-------KRVPVTSCAWNRDGKL-I 332 (641)
T ss_pred ccccccCcccceEEecCC------------CcEEEEecCCchhheeEEeeccCCC-------cccCceeeecCCCcch-h
Confidence 568999999999887664 3677787653 32333322211111 1112667899999986 3
Q ss_pred EEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeee----eccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 258 YWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHK----LDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 258 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~----~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
+-. -.+ +.|-+++... -+..+..... ....+.+++||+||++|+....+ ..|.++|+..
T Consensus 333 Aag-c~D-----------GSIQ~W~~~~-~~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D----~tLKvWDLrq 395 (641)
T KOG0772|consen 333 AAG-CLD-----------GSIQIWDKGS-RTVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFD----DTLKVWDLRQ 395 (641)
T ss_pred hhc-ccC-----------CceeeeecCC-cccccceEeeeccCCCCceeEEEeccccchhhhccCC----Cceeeeeccc
Confidence 321 111 1244444210 1112222222 12367889999999999976533 2466667665
Q ss_pred CCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 334 KDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 334 ~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
- .++..+..+-... ++++- .+||||.+.|+...
T Consensus 396 ~-kkpL~~~tgL~t~---~~~td-c~FSPd~kli~TGt 428 (641)
T KOG0772|consen 396 F-KKPLNVRTGLPTP---FPGTD-CCFSPDDKLILTGT 428 (641)
T ss_pred c-ccchhhhcCCCcc---CCCCc-cccCCCceEEEecc
Confidence 3 2222222221111 11221 45899999877654
No 203
>COG2819 Predicted hydrolase of the alpha/beta superfamily [General function prediction only]
Probab=98.72 E-value=1.1e-06 Score=83.89 Aligned_cols=130 Identities=23% Similarity=0.240 Sum_probs=82.2
Q ss_pred HHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCccc
Q 004574 580 AAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPIT 659 (744)
Q Consensus 580 ~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 659 (744)
...-|+.++..+|.++.+|+|||+||.+++.++..+|+.|...++++|..= |.+...........
T Consensus 123 ~lkP~Ie~~y~~~~~~~~i~GhSlGGLfvl~aLL~~p~~F~~y~~~SPSlW---------------w~n~~~l~~~~~~~ 187 (264)
T COG2819 123 QLKPFIEARYRTNSERTAIIGHSLGGLFVLFALLTYPDCFGRYGLISPSLW---------------WHNEAILREIESLK 187 (264)
T ss_pred hhHHHHhcccccCcccceeeeecchhHHHHHHHhcCcchhceeeeecchhh---------------hCCHHHhccccccc
Confidence 334455556778899999999999999999999999999999999999531 22222233322222
Q ss_pred ccCCCCCCEEEEee--CCCCCCCC----CHHHHHHHHHHHHh-CCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 660 HANKIKKPILIIHG--EVDDKVGL----FPMQAERFFDALKG-HGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 660 ~~~~~~~P~l~i~G--~~D~~v~~----~~~~~~~~~~~l~~-~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
.. + ..++++..| +.|..... ...++.+..+.++. .+....+..+|+.+|+- .....+..++.|+.
T Consensus 188 ~~-~-~~~i~l~iG~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~f~~~~~~~H~~----~~~~~~~~al~~l~ 259 (264)
T COG2819 188 LL-K-TKRICLYIGSGELDSSRSIRMAENKQEAAELSSLLEKRTGARLVFQEEPLEHHGS----VIHASLPSALRFLD 259 (264)
T ss_pred cC-C-CcceEEEecccccCcchhhhhhhHHHHHHHHHHHHhhccCCceEecccccccccc----hHHHHHHHHHHhhh
Confidence 22 2 444555544 33432110 03445555566666 77888999999887853 23344555566664
No 204
>KOG2564 consensus Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]
Probab=98.72 E-value=1.4e-07 Score=88.59 Aligned_cols=117 Identities=21% Similarity=0.252 Sum_probs=72.9
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHh-CCeEEE
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLA-RRFAVL 559 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~G~~v~ 559 (744)
+.+.+.+...+. ++..++..|+. ..-|++++.||+|.+....+ ..+..+.. .-..|+
T Consensus 49 ekedv~i~~~~~-t~n~Y~t~~~~------t~gpil~l~HG~G~S~LSfA---------------~~a~el~s~~~~r~~ 106 (343)
T KOG2564|consen 49 EKEDVSIDGSDL-TFNVYLTLPSA------TEGPILLLLHGGGSSALSFA---------------IFASELKSKIRCRCL 106 (343)
T ss_pred cccccccCCCcc-eEEEEEecCCC------CCccEEEEeecCcccchhHH---------------HHHHHHHhhcceeEE
Confidence 466677765554 57677777752 13689999999874422111 22334443 356677
Q ss_pred ecCCCCCCCCCCCChH-----------HHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC--CCceeEEEEcc
Q 004574 560 AGPSIPIIGEGDKLPN-----------DSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA--PHLFCCGIARS 626 (744)
Q Consensus 560 ~~~~~~~~g~g~~~~~-----------~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~--p~~~~~~v~~~ 626 (744)
+ .+.+|+|.+... .|+.+.++++-. -.+.+|.|+||||||.+|.+.+... |. +.+++.+.
T Consensus 107 a---~DlRgHGeTk~~~e~dlS~eT~~KD~~~~i~~~fg---e~~~~iilVGHSmGGaIav~~a~~k~lps-l~Gl~viD 179 (343)
T KOG2564|consen 107 A---LDLRGHGETKVENEDDLSLETMSKDFGAVIKELFG---ELPPQIILVGHSMGGAIAVHTAASKTLPS-LAGLVVID 179 (343)
T ss_pred E---eeccccCccccCChhhcCHHHHHHHHHHHHHHHhc---cCCCceEEEeccccchhhhhhhhhhhchh-hhceEEEE
Confidence 7 788888876321 155544444432 2346899999999999998887654 54 55555543
No 205
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=98.72 E-value=2.1e-06 Score=93.08 Aligned_cols=224 Identities=13% Similarity=0.115 Sum_probs=134.4
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEEC-CCCce-eccccCCCccccccccceEEecCCcEEEEEecCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADA-ETGEA-KPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSR 109 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~-~gg~~-~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~ 109 (744)
.+....|||||++|+-.+ ....|++.++ +.+.. +.+-.+.. .+....|+|+|+.|+..+.+
T Consensus 205 ~v~~~~fs~d~~~l~s~s----------~D~tiriwd~~~~~~~~~~l~gH~~-----~v~~~~f~p~g~~i~Sgs~D-- 267 (456)
T KOG0266|consen 205 GVSDVAFSPDGSYLLSGS----------DDKTLRIWDLKDDGRNLKTLKGHST-----YVTSVAFSPDGNLLVSGSDD-- 267 (456)
T ss_pred ceeeeEECCCCcEEEEec----------CCceEEEeeccCCCeEEEEecCCCC-----ceEEEEecCCCCEEEEecCC--
Confidence 688999999999877753 3456777777 44343 33422222 46788999999777765332
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCee-ecCCC-ceeeeeccCCCC
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAK-DFGTP-AVYTAVEPSPDQ 186 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~-~l~~~-~~~~~~~~SpDG 186 (744)
..+.++++ +|+.. .|..+ ..+...+|++||
T Consensus 268 -----------------------------------------------~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d~ 300 (456)
T KOG0266|consen 268 -----------------------------------------------GTVRIWDVRTGECVRKLKGHSDGISGLAFSPDG 300 (456)
T ss_pred -----------------------------------------------CcEEEEeccCCeEEEeeeccCCceEEEEECCCC
Confidence 45777788 67544 44444 788899999999
Q ss_pred ceEEEEEeeCCcccccccCCCcceEEEEeCCCCeee---eccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEee
Q 004574 187 KYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVR---ELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQ 263 (744)
Q Consensus 187 ~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~---~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~ 263 (744)
+.|+-.+.+ ..+.+||..++... .+...... . + +..+.|+|+|++ |+...
T Consensus 301 ~~l~s~s~d-------------~~i~vwd~~~~~~~~~~~~~~~~~~-~-~---------~~~~~fsp~~~~-ll~~~-- 353 (456)
T KOG0266|consen 301 NLLVSASYD-------------GTIRVWDLETGSKLCLKLLSGAENS-A-P---------VTSVQFSPNGKY-LLSAS-- 353 (456)
T ss_pred CEEEEcCCC-------------ccEEEEECCCCceeeeecccCCCCC-C-c---------eeEEEECCCCcE-EEEec--
Confidence 999877432 47999999988732 22222111 1 2 567899999986 33331
Q ss_pred cCCCCCccCCccceEEeccCCCCCCCC-ceEeeeec---cceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcce
Q 004574 264 DRGDANVEVSPRDIIYTQPAEPAEGEK-PEILHKLD---LRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPR 339 (744)
Q Consensus 264 ~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~~~~---~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~ 339 (744)
....+-++++ ..+. ........ .....+..+++|++++..+. ...++++++.++ ...
T Consensus 354 ----------~d~~~~~w~l---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~sg~~----d~~v~~~~~~s~--~~~ 414 (456)
T KOG0266|consen 354 ----------LDRTLKLWDL---RSGKSVGTYTGHSNLVRCIFSPTLSTGGKLIYSGSE----DGSVYVWDSSSG--GIL 414 (456)
T ss_pred ----------CCCeEEEEEc---cCCcceeeecccCCcceeEecccccCCCCeEEEEeC----CceEEEEeCCcc--chh
Confidence 1123444444 2222 22222211 23345556889998887762 335888888763 222
Q ss_pred eeeccc-cccccCCCCCCceeeCCCCCeEEEEe
Q 004574 340 VLFDRV-FENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 340 ~l~~~~-~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
....+. ...+ ...+|+|....++...
T Consensus 415 ~~l~~h~~~~~------~~~~~~~~~~~~~s~s 441 (456)
T KOG0266|consen 415 QRLEGHSKAAV------SDLSSHPTENLIASSS 441 (456)
T ss_pred hhhcCCCCCce------eccccCCCcCeeeecC
Confidence 322222 1111 1145788888777764
No 206
>PF06821 Ser_hydrolase: Serine hydrolase; InterPro: IPR010662 This family contains a number of hypothetical bacterial proteins of unknown function, which may be cytosolic. The Crystal Structure Of The Yden Gene Product Swiss:P96671 from B. Subtilis has been solved. The structure shows an alpha-beta hydrolase fold suggesting an enzymatic function for these proteins [].; GO: 0016787 hydrolase activity; PDB: 3BDV_B 2QS9_A 1UXO_A.
Probab=98.72 E-value=6.5e-07 Score=82.11 Aligned_cols=142 Identities=17% Similarity=0.138 Sum_probs=88.8
Q ss_pred CCCCchhHHHHHhCCeEEEecCCCCCCCCCCCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHH-HhCCCce
Q 004574 541 SGMTPTSSLIFLARRFAVLAGPSIPIIGEGDKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLL-AHAPHLF 619 (744)
Q Consensus 541 ~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~-~~~p~~~ 619 (744)
..|.......+... +.|-.++- . .-++.+++..|.+.-....+.+.|+|||.|+.++++++ .+....+
T Consensus 13 ~HW~~wl~~~l~~~-~~V~~~~~-~---------~P~~~~W~~~l~~~i~~~~~~~ilVaHSLGc~~~l~~l~~~~~~~v 81 (171)
T PF06821_consen 13 DHWQPWLERQLENS-VRVEQPDW-D---------NPDLDEWVQALDQAIDAIDEPTILVAHSLGCLTALRWLAEQSQKKV 81 (171)
T ss_dssp TSTHHHHHHHHTTS-EEEEEC---T---------S--HHHHHHHHHHCCHC-TTTEEEEEETHHHHHHHHHHHHTCCSSE
T ss_pred cHHHHHHHHhCCCC-eEEecccc-C---------CCCHHHHHHHHHHHHhhcCCCeEEEEeCHHHHHHHHHHhhcccccc
Confidence 33444555566555 66665221 1 11578888888876322235799999999999999999 6677899
Q ss_pred eEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCc
Q 004574 620 CCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGAL 699 (744)
Q Consensus 620 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~ 699 (744)
+++++++|+.... ........ ..+.+. ....+..|.+++.+++|+.+| +..++++.+.+.
T Consensus 82 ~g~lLVAp~~~~~--~~~~~~~~----------~~f~~~-p~~~l~~~~~viaS~nDp~vp--~~~a~~~A~~l~----- 141 (171)
T PF06821_consen 82 AGALLVAPFDPDD--PEPFPPEL----------DGFTPL-PRDPLPFPSIVIASDNDPYVP--FERAQRLAQRLG----- 141 (171)
T ss_dssp EEEEEES--SCGC--HHCCTCGG----------CCCTTS-HCCHHHCCEEEEEETTBSSS---HHHHHHHHHHHT-----
T ss_pred cEEEEEcCCCccc--ccchhhhc----------cccccC-cccccCCCeEEEEcCCCCccC--HHHHHHHHHHcC-----
Confidence 9999999984310 00000000 000110 112235677999999999999 999999998884
Q ss_pred EEEEEeCCCCcccC
Q 004574 700 SRLVLLPFEHHVYA 713 (744)
Q Consensus 700 ~~~~~~~~~~H~~~ 713 (744)
.++++++++||...
T Consensus 142 a~~~~~~~~GHf~~ 155 (171)
T PF06821_consen 142 AELIILGGGGHFNA 155 (171)
T ss_dssp -EEEEETS-TTSSG
T ss_pred CCeEECCCCCCccc
Confidence 38999999999654
No 207
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=98.71 E-value=8.2e-06 Score=79.33 Aligned_cols=264 Identities=11% Similarity=0.084 Sum_probs=146.2
Q ss_pred eEeecCCCCCCCC-ceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcccc
Q 004574 8 GIHRLLPDDSLGP-EKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLN 86 (744)
Q Consensus 8 ~~~~~~~~~~~g~-~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~ 86 (744)
+++++.. |+ +-+++ .+..++....||-||.+||-. +-.+.|.+....+|..+.....+..
T Consensus 89 flW~~~~----ge~~~elt--gHKDSVt~~~FshdgtlLATG----------dmsG~v~v~~~stg~~~~~~~~e~~--- 149 (399)
T KOG0296|consen 89 FLWDIST----GEFAGELT--GHKDSVTCCSFSHDGTLLATG----------DMSGKVLVFKVSTGGEQWKLDQEVE--- 149 (399)
T ss_pred EEEEccC----CcceeEec--CCCCceEEEEEccCceEEEec----------CCCccEEEEEcccCceEEEeecccC---
Confidence 4555544 54 33454 334478999999999999984 3334555556667666554432221
Q ss_pred ccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-C
Q 004574 87 AVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-D 165 (744)
Q Consensus 87 ~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~ 165 (744)
....+.|.|-+..|++...+ +.+|.+.+ +
T Consensus 150 -dieWl~WHp~a~illAG~~D-------------------------------------------------GsvWmw~ip~ 179 (399)
T KOG0296|consen 150 -DIEWLKWHPRAHILLAGSTD-------------------------------------------------GSVWMWQIPS 179 (399)
T ss_pred -ceEEEEecccccEEEeecCC-------------------------------------------------CcEEEEECCC
Confidence 35688999988888775322 67888888 5
Q ss_pred CCeeecCCC--ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-eeeeccCCCCCCCCCcccCCccC
Q 004574 166 GTAKDFGTP--AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-LVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 166 G~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
+...++..+ .....-.|.||||+|+-.... ..|.+|++.+. ....++....- ...+......
T Consensus 180 ~~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~d-------------gti~~Wn~ktg~p~~~~~~~e~~--~~~~~~~~~~ 244 (399)
T KOG0296|consen 180 QALCKVMSGHNSPCTCGEFIPDGKRILTGYDD-------------GTIIVWNPKTGQPLHKITQAEGL--ELPCISLNLA 244 (399)
T ss_pred cceeeEecCCCCCcccccccCCCceEEEEecC-------------ceEEEEecCCCceeEEecccccC--cCCccccccc
Confidence 444444333 445566799999999877543 37889998754 34444422211 0001111101
Q ss_pred CCCccceecCCC--------ceEEEEEeecCC---------CCCc-------------cCCccceEEeccCCCCCCCCce
Q 004574 243 GMRSISWRADKP--------STLYWVEAQDRG---------DANV-------------EVSPRDIIYTQPAEPAEGEKPE 292 (744)
Q Consensus 243 ~~~~~~~spDg~--------~~l~~~~~~~~~---------~~~~-------------~~~~~~~l~~~~~~~~~~~~~~ 292 (744)
+-..+.=+.+|. .-++++-+.+.+ .+.. -...-+.|.++|. .....+
T Consensus 245 ~~~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~~~~e~~~esve~~~~ss~lpL~A~G~vdG~i~iyD~---a~~~~R 321 (399)
T KOG0296|consen 245 GSTLTKGNSEGVACGVNNGSGKVVNCNNGTVPELKPSQEELDESVESIPSSSKLPLAACGSVDGTIAIYDL---AASTLR 321 (399)
T ss_pred cceeEeccCCccEEEEccccceEEEecCCCCccccccchhhhhhhhhcccccccchhhcccccceEEEEec---ccchhh
Confidence 100111122221 112222110000 0000 0011235666666 555666
Q ss_pred EeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 293 ILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 293 ~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.+......+..+.|-+ -.+|+... .+..++.+|.-++ +.+....++...+.. ++.+||++.|+..+
T Consensus 322 ~~c~he~~V~~l~w~~-t~~l~t~c----~~g~v~~wDaRtG--~l~~~y~GH~~~Il~------f~ls~~~~~vvT~s 387 (399)
T KOG0296|consen 322 HICEHEDGVTKLKWLN-TDYLLTAC----ANGKVRQWDARTG--QLKFTYTGHQMGILD------FALSPQKRLVVTVS 387 (399)
T ss_pred eeccCCCceEEEEEcC-cchheeec----cCceEEeeecccc--ceEEEEecCchheeE------EEEcCCCcEEEEec
Confidence 6666667788888988 55555554 3456888888773 444444444433222 67899999888776
No 208
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=98.71 E-value=1.9e-06 Score=87.96 Aligned_cols=269 Identities=12% Similarity=0.123 Sum_probs=144.6
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCccc--ceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCC
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKIN--FVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPD 82 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~--~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~ 82 (744)
+.|.++++.. |..++--..+.+.... .-+||-|+|++|-+.. ..|-+++... -- |.....
T Consensus 282 ~~l~IWDI~t----G~lkrsF~~~~~~~~~WP~frWS~DdKy~Arm~~-----------~sisIyEtps--f~-lld~Ks 343 (698)
T KOG2314|consen 282 QQLIIWDIAT----GLLKRSFPVIKSPYLKWPIFRWSHDDKYFARMTG-----------NSISIYETPS--FM-LLDKKS 343 (698)
T ss_pred ceEEEEEccc----cchhcceeccCCCccccceEEeccCCceeEEecc-----------ceEEEEecCc--ee-eecccc
Confidence 4578888866 8766554333344344 4499999999999742 3455555332 11 111111
Q ss_pred ccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 83 ICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 83 ~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
..+. ++.++.|||-+..|+|-++...... ..+-++
T Consensus 344 lki~-gIr~FswsP~~~llAYwtpe~~~~p--------------------------------------------arvtL~ 378 (698)
T KOG2314|consen 344 LKIS-GIRDFSWSPTSNLLAYWTPETNNIP--------------------------------------------ARVTLM 378 (698)
T ss_pred cCCc-cccCcccCCCcceEEEEcccccCCc--------------------------------------------ceEEEE
Confidence 1222 4778999999999998765432221 233333
Q ss_pred cC-CCCeeecCCCc--eeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCC
Q 004574 163 SL-DGTAKDFGTPA--VYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNS 239 (744)
Q Consensus 163 ~~-~G~~~~l~~~~--~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~ 239 (744)
.+ ++..-+..+-- ...-+-|-..|.+|.|..++-.-. ...+.+ .++.++.+..++ +......-..
T Consensus 379 evPs~~~iRt~nlfnVsDckLhWQk~gdyLcvkvdR~tK~--~~~g~f-~n~eIfrireKd---Ipve~velke------ 446 (698)
T KOG2314|consen 379 EVPSKREIRTKNLFNVSDCKLHWQKSGDYLCVKVDRHTKS--KVKGQF-SNLEIFRIREKD---IPVEVVELKE------ 446 (698)
T ss_pred ecCccceeeeccceeeeccEEEeccCCcEEEEEEEeeccc--cccceE-eeEEEEEeeccC---CCceeeecch------
Confidence 34 33111111111 233567999999999987764211 111111 233334443332 1111111111
Q ss_pred ccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeee-eccceeceeeccCCceEEEeee
Q 004574 240 VREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHK-LDLRFRSVSWCDDSLALVNETW 318 (744)
Q Consensus 240 ~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~-~~~~~~~~~~SpDg~~l~~~~~ 318 (744)
.+..++|-|.|.+ .+.++.... ...-..|-+.. ..+.+..+-. .....+.+.|||.|+.++.++
T Consensus 447 ---~vi~FaWEP~gdk-F~vi~g~~~-------k~tvsfY~~e~---~~~~~~lVk~~dk~~~N~vfwsPkG~fvvva~- 511 (698)
T KOG2314|consen 447 ---SVIAFAWEPHGDK-FAVISGNTV-------KNTVSFYAVET---NIKKPSLVKELDKKFANTVFWSPKGRFVVVAA- 511 (698)
T ss_pred ---heeeeeeccCCCe-EEEEEcccc-------ccceeEEEeec---CCCchhhhhhhcccccceEEEcCCCcEEEEEE-
Confidence 1557899999999 555542211 11223555543 3333332221 135677899999999998876
Q ss_pred eeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 319 YKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 319 ~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
..+..+.|+-+|++-. ..+.+ .... ++...-+.|.|.|++++..+
T Consensus 512 l~s~~g~l~F~D~~~a--~~k~~-~~~e-----h~~at~veWDPtGRYvvT~s 556 (698)
T KOG2314|consen 512 LVSRRGDLEFYDTDYA--DLKDT-ASPE-----HFAATEVEWDPTGRYVVTSS 556 (698)
T ss_pred ecccccceEEEecchh--hhhhc-cCcc-----ccccccceECCCCCEEEEee
Confidence 3335677888887641 11111 1111 11222256999999998876
No 209
>PF06342 DUF1057: Alpha/beta hydrolase of unknown function (DUF1057); InterPro: IPR010463 This entry consists of proteins of unknown function which have an alpha/beta hydrolase fold.
Probab=98.68 E-value=2.1e-06 Score=82.08 Aligned_cols=96 Identities=15% Similarity=0.075 Sum_probs=70.4
Q ss_pred CceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC-------ChHHHHHHHHHH
Q 004574 512 PLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK-------LPNDSAEAAVEE 584 (744)
Q Consensus 512 ~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~-------~~~~d~~~~~~~ 584 (744)
++.+||-+||.|.+ .+.|. .....|.+.|++++..+ .+|+|.. ....+-...++.
T Consensus 34 ~~gTVv~~hGsPGS-----------H~DFk----Yi~~~l~~~~iR~I~iN---~PGf~~t~~~~~~~~~n~er~~~~~~ 95 (297)
T PF06342_consen 34 PLGTVVAFHGSPGS-----------HNDFK----YIRPPLDEAGIRFIGIN---YPGFGFTPGYPDQQYTNEERQNFVNA 95 (297)
T ss_pred CceeEEEecCCCCC-----------ccchh----hhhhHHHHcCeEEEEeC---CCCCCCCCCCcccccChHHHHHHHHH
Confidence 47799999998632 23332 45667889999999944 3443332 233466777777
Q ss_pred HHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC
Q 004574 585 VVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS 628 (744)
Q Consensus 585 l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~ 628 (744)
+.+.-.++ +++.++|||.|+-.|+.++...| ..+++++.|+
T Consensus 96 ll~~l~i~-~~~i~~gHSrGcenal~la~~~~--~~g~~lin~~ 136 (297)
T PF06342_consen 96 LLDELGIK-GKLIFLGHSRGCENALQLAVTHP--LHGLVLINPP 136 (297)
T ss_pred HHHHcCCC-CceEEEEeccchHHHHHHHhcCc--cceEEEecCC
Confidence 77776676 79999999999999999999985 4588888875
No 210
>PF10340 DUF2424: Protein of unknown function (DUF2424); InterPro: IPR019436 Sterol homeostasis in eukaryotic cells relies on the reciprocal interconversion of free sterols and steryl esters. In Saccharomyces cerevisiae (Baker's yeast) sterol acetylation requires the acetyltransferase Atf2, whereas deacetylation requires Say1, a membrane-anchored deacetylase with a putative active site in the ER lumen. Lack of Say1 results in the secretion of acetylated sterols into the culture medium, indicating that the substrate specificity of Say1 determines whether acetylated sterols are secreted from the cells or whether they are deacetylated and retained. In S. cerevisiae cells lacking Say1 or Atf2 are sensitive against the plant-derived allylbenzene eugenol and both Say1 and Atf2 affect pregnenolone toxicity, indicating that lipid acetylation acts as a detoxification pathway []. Homologues of Say1 are present in the mammalian genome and can functionally substitute for Say1 in yeast demonstrating that part of this pathway has been evolutionarily conserved [].
Probab=98.66 E-value=1.4e-06 Score=88.14 Aligned_cols=196 Identities=15% Similarity=0.068 Sum_probs=108.7
Q ss_pred EEEEe-CCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCC----CCCC
Q 004574 497 ATLYL-PPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPII----GEGD 571 (744)
Q Consensus 497 ~~~~~-P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~----g~g~ 571 (744)
.+++. |....++. -|+|||+||||+.......++.. ......+.. ..+++..+|.... |+..
T Consensus 108 ~Wlvk~P~~~~pk~---DpVlIYlHGGGY~l~~~p~qi~~---------L~~i~~~l~-~~SILvLDYsLt~~~~~~~~y 174 (374)
T PF10340_consen 108 YWLVKAPNRFKPKS---DPVLIYLHGGGYFLGTTPSQIEF---------LLNIYKLLP-EVSILVLDYSLTSSDEHGHKY 174 (374)
T ss_pred EEEEeCCcccCCCC---CcEEEEEcCCeeEecCCHHHHHH---------HHHHHHHcC-CCeEEEEeccccccccCCCcC
Confidence 46665 65543332 59999999998653222111100 000111122 3355443332222 3323
Q ss_pred CChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC-----CCceeEEEEccCCCCCCCCC---Cc-cc-cc
Q 004574 572 KLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA-----PHLFCCGIARSGSYNKTLTP---FG-FQ-TE 641 (744)
Q Consensus 572 ~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~-----p~~~~~~v~~~~~~~~~~~~---~~-~~-~~ 641 (744)
.....++.+.+++|.+... .++|.|||-|+||.+++.++..- ...-+.+|+++|.++..... .. +. ..
T Consensus 175 PtQL~qlv~~Y~~Lv~~~G--~~nI~LmGDSAGGnL~Ls~LqyL~~~~~~~~Pk~~iLISPWv~l~~~~~~~~~~~~~n~ 252 (374)
T PF10340_consen 175 PTQLRQLVATYDYLVESEG--NKNIILMGDSAGGNLALSFLQYLKKPNKLPYPKSAILISPWVNLVPQDSQEGSSYHDNE 252 (374)
T ss_pred chHHHHHHHHHHHHHhccC--CCeEEEEecCccHHHHHHHHHHHhhcCCCCCCceeEEECCCcCCcCCCCCCCccccccc
Confidence 3334478899999995432 36899999999999999887532 12358999999987654111 00 00 00
Q ss_pred c--cchhhcHH----HHHh---------cCcccccCC-----------CCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHh
Q 004574 642 F--RTLWEATN----VYIE---------MSPITHANK-----------IKKPILIIHGEVDDKVGLFPMQAERFFDALKG 695 (744)
Q Consensus 642 ~--~~~~~~~~----~~~~---------~~~~~~~~~-----------~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~ 695 (744)
. ...+.... .|.. ..+....+. -++-++|+.|+++.+ .++.+++.+.+..
T Consensus 253 ~~D~l~~~~~~~~~~~y~~~~~~~~~~~~~~~~n~~~n~d~~~W~~I~~~~~vfVi~Ge~Evf----rddI~~~~~~~~~ 328 (374)
T PF10340_consen 253 KRDMLSYKGLSMFGDAYIGNNDPENDLNSLPFVNIEYNFDAEDWKDILKKYSVFVIYGEDEVF----RDDILEWAKKLND 328 (374)
T ss_pred cccccchhhHHHHHHhhccccccccccccCCccCcccCCChhHHHHhccCCcEEEEECCcccc----HHHHHHHHHHHhh
Confidence 0 00000000 1100 111111110 135799999999876 7899999999987
Q ss_pred CCCc-----EEEEEeCCCCcc
Q 004574 696 HGAL-----SRLVLLPFEHHV 711 (744)
Q Consensus 696 ~~~~-----~~~~~~~~~~H~ 711 (744)
.+.. .+..+-+++.|.
T Consensus 329 ~~~~~~~~~~nv~~~~~G~Hi 349 (374)
T PF10340_consen 329 VKPNKFSNSNNVYIDEGGIHI 349 (374)
T ss_pred cCccccCCcceEEEecCCccc
Confidence 6533 577888888885
No 211
>KOG2624 consensus Triglyceride lipase-cholesterol esterase [Lipid transport and metabolism]
Probab=98.64 E-value=1.3e-06 Score=90.14 Aligned_cols=232 Identities=17% Similarity=0.154 Sum_probs=147.3
Q ss_pred CceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCC--CCchhHHHHHhCCeE
Q 004574 480 LQKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSG--MTPTSSLIFLARRFA 557 (744)
Q Consensus 480 ~~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~G~~ 557 (744)
.+.|...+...||..+ ..--+|.. ++++|+|++.||--- +...|.. .....+..|+.+||-
T Consensus 46 y~~E~h~V~T~DgYiL-~lhRIp~~-----~~~rp~Vll~HGLl~-----------sS~~Wv~n~p~~sLaf~LadaGYD 108 (403)
T KOG2624|consen 46 YPVEEHEVTTEDGYIL-TLHRIPRG-----KKKRPVVLLQHGLLA-----------SSSSWVLNGPEQSLAFLLADAGYD 108 (403)
T ss_pred CceEEEEEEccCCeEE-EEeeecCC-----CCCCCcEEEeecccc-----------ccccceecCccccHHHHHHHcCCc
Confidence 5677777777788733 33444654 156999999999421 1122221 223456678899999
Q ss_pred EEecCCCCCCC------CCC---C--------C-hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC--
Q 004574 558 VLAGPSIPIIG------EGD---K--------L-PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH-- 617 (744)
Q Consensus 558 v~~~~~~~~~g------~g~---~--------~-~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~-- 617 (744)
|..++.++..= ... . + ...|+.+.|+++.+.- ..+++..+|||.|+.....++...|+
T Consensus 109 VWLgN~RGn~ySr~h~~l~~~~~~~FW~FS~~Em~~yDLPA~IdyIL~~T--~~~kl~yvGHSQGtt~~fv~lS~~p~~~ 186 (403)
T KOG2624|consen 109 VWLGNNRGNTYSRKHKKLSPSSDKEFWDFSWHEMGTYDLPAMIDYILEKT--GQEKLHYVGHSQGTTTFFVMLSERPEYN 186 (403)
T ss_pred eeeecCcCcccchhhcccCCcCCcceeecchhhhhhcCHHHHHHHHHHhc--cccceEEEEEEccchhheehhcccchhh
Confidence 99855543210 011 0 1 1229999999999864 34799999999999999999988876
Q ss_pred -ceeEEEEccCCCCCCCC---------C-----------Cc---c-c------------cc---c---------------
Q 004574 618 -LFCCGIARSGSYNKTLT---------P-----------FG---F-Q------------TE---F--------------- 642 (744)
Q Consensus 618 -~~~~~v~~~~~~~~~~~---------~-----------~~---~-~------------~~---~--------------- 642 (744)
+++..++++|+.-.... . ++ + + +. .
T Consensus 187 ~kI~~~~aLAP~~~~k~~~~~~~~~~~~~~~~~~~~~~~fg~~~f~p~~~~~~~~~~~~C~~~~~~~~lC~~~~~~~~G~ 266 (403)
T KOG2624|consen 187 KKIKSFIALAPAAFPKHIKSLLNKFLDPFLGAFSLLPLLFGRKEFLPSNLFIKKFARKICSGSKIFADLCSNFLFLLVGW 266 (403)
T ss_pred hhhheeeeecchhhhcccccHHHHhhhhhhhhhhHHHHhcCCccccchhhHHHHHHHHHhcchhHHHHHHHHHHHHHcCc
Confidence 68888999886421100 0 00 0 0 00 0
Q ss_pred -cchh-----------------------------------------hcHHHHHh-cCcccccCCCCCCEEEEeeCCCCCC
Q 004574 643 -RTLW-----------------------------------------EATNVYIE-MSPITHANKIKKPILIIHGEVDDKV 679 (744)
Q Consensus 643 -~~~~-----------------------------------------~~~~~~~~-~~~~~~~~~~~~P~l~i~G~~D~~v 679 (744)
...| .+...|.. ..|...+.++++|+.+.+|.+|.++
T Consensus 267 ~~~~~n~~~~~~~~~h~pagtSvk~~~H~~Q~~~s~~f~~yD~G~~~N~~~Y~q~~pP~Y~l~~i~~P~~l~~g~~D~l~ 346 (403)
T KOG2624|consen 267 NSNNWNTTLLPVYLAHLPAGTSVKNIVHWAQIVRSGKFRKYDYGSKRNLKHYGQSTPPEYDLTNIKVPTALYYGDNDWLA 346 (403)
T ss_pred chHhhhhcccchhhccCCCCccHHHHHHHHHHhcCCCccccCCCccccHhhcCCCCCCCCCccccccCEEEEecCCcccC
Confidence 0000 00111111 1345567788999999999999998
Q ss_pred CCCHHHHHHHHHHHHhCCCcEEEEEeCCCCccc--CccccHHHHHHHHHHHHHHhc
Q 004574 680 GLFPMQAERFFDALKGHGALSRLVLLPFEHHVY--AARENVMHVIWETDRWLQKYC 733 (744)
Q Consensus 680 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~--~~~~~~~~~~~~~~~fl~~~l 733 (744)
. .+....+...+..... ....-+++-+|.- ......+.++..+++.+.+..
T Consensus 347 ~--~~DV~~~~~~~~~~~~-~~~~~~~~ynHlDFi~g~da~~~vy~~vi~~~~~~~ 399 (403)
T KOG2624|consen 347 D--PEDVLILLLVLPNSVI-KYIVPIPEYNHLDFIWGLDAKEEVYDPVIERLRLFE 399 (403)
T ss_pred C--HHHHHHHHHhcccccc-cccccCCCccceeeeeccCcHHHHHHHHHHHHHhhh
Confidence 8 8888888777765433 2333367888863 335667889999999998764
No 212
>PTZ00420 coronin; Provisional
Probab=98.63 E-value=1.9e-05 Score=85.99 Aligned_cols=117 Identities=12% Similarity=0.123 Sum_probs=70.7
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeee-eccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVR-ELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
..+..++|+|++..|+.+...+ ..|.+||+..++.. .+. .+. .+..++|+|||
T Consensus 126 ~~V~sVaf~P~g~~iLaSgS~D------------gtIrIWDl~tg~~~~~i~-~~~-------------~V~SlswspdG 179 (568)
T PTZ00420 126 KKISIIDWNPMNYYIMCSSGFD------------SFVNIWDIENEKRAFQIN-MPK-------------KLSSLKWNIKG 179 (568)
T ss_pred CcEEEEEECCCCCeEEEEEeCC------------CeEEEEECCCCcEEEEEe-cCC-------------cEEEEEECCCC
Confidence 6678999999999887665443 37899998765432 221 111 16678999999
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCce-Eeeeecccee-----ceeeccCCceEEEeeeeeccceeEE
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPE-ILHKLDLRFR-----SVSWCDDSLALVNETWYKTSQTRTW 327 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~l~~~~~~~~-----~~~~SpDg~~l~~~~~~~~~~~~l~ 327 (744)
+. |+... ..+.|.++|. .+++.. .+....+... ...|++|+.+|+.+..+......+.
T Consensus 180 ~l-Lat~s------------~D~~IrIwD~---Rsg~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~Vk 243 (568)
T PTZ00420 180 NL-LSGTC------------VGKHMHIIDP---RKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMK 243 (568)
T ss_pred CE-EEEEe------------cCCEEEEEEC---CCCcEEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEE
Confidence 85 44332 1224788887 334332 2322222211 1235699998887764443334688
Q ss_pred EEcCCC
Q 004574 328 LVCPGS 333 (744)
Q Consensus 328 ~~~~~~ 333 (744)
++|+..
T Consensus 244 LWDlr~ 249 (568)
T PTZ00420 244 LWDLKN 249 (568)
T ss_pred EEECCC
Confidence 888775
No 213
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=98.63 E-value=1e-05 Score=76.44 Aligned_cols=268 Identities=11% Similarity=0.091 Sum_probs=147.1
Q ss_pred CCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCC-ceeccccCCCccccccccceEEecCCcEEEEEec
Q 004574 28 PDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETG-EAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIP 106 (744)
Q Consensus 28 ~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg-~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~ 106 (744)
.+..++....|++|.++|+-.+. ++ .|-|.|.-+. +...++-... -+...++||.|+.++-..-
T Consensus 53 GH~~Ki~~~~ws~Dsr~ivSaSq--------DG--klIvWDs~TtnK~haipl~s~-----WVMtCA~sPSg~~VAcGGL 117 (343)
T KOG0286|consen 53 GHLNKIYAMDWSTDSRRIVSASQ--------DG--KLIVWDSFTTNKVHAIPLPSS-----WVMTCAYSPSGNFVACGGL 117 (343)
T ss_pred ccccceeeeEecCCcCeEEeecc--------CC--eEEEEEcccccceeEEecCce-----eEEEEEECCCCCeEEecCc
Confidence 34447899999999999998653 43 4555576544 4444432222 3557799999999975422
Q ss_pred CCCCCCCCCCCCCCCCeeeecCCCcccccccccccC------CCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-cee
Q 004574 107 SSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLL------KDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AVY 177 (744)
Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~~ 177 (744)
++.=...... .....+.....|.+..-. .+-.+...+...+.....++|+ +| ..+.+..+ +.+
T Consensus 118 dN~Csiy~ls--------~~d~~g~~~v~r~l~gHtgylScC~f~dD~~ilT~SGD~TCalWDie~g~~~~~f~GH~gDV 189 (343)
T KOG0286|consen 118 DNKCSIYPLS--------TRDAEGNVRVSRELAGHTGYLSCCRFLDDNHILTGSGDMTCALWDIETGQQTQVFHGHTGDV 189 (343)
T ss_pred CceeEEEecc--------cccccccceeeeeecCccceeEEEEEcCCCceEecCCCceEEEEEcccceEEEEecCCcccE
Confidence 1110000000 000001111111111100 1111233333334456778899 78 44444545 889
Q ss_pred eeeccCC-CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCce
Q 004574 178 TAVEPSP-DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPST 256 (744)
Q Consensus 178 ~~~~~Sp-DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~ 256 (744)
..+..+| |++..+-...+ ...++||+..+..++-...+-.+ +..+.|-|+|..
T Consensus 190 ~slsl~p~~~ntFvSg~cD-------------~~aklWD~R~~~c~qtF~ghesD------------INsv~ffP~G~a- 243 (343)
T KOG0286|consen 190 MSLSLSPSDGNTFVSGGCD-------------KSAKLWDVRSGQCVQTFEGHESD------------INSVRFFPSGDA- 243 (343)
T ss_pred EEEecCCCCCCeEEecccc-------------cceeeeeccCcceeEeecccccc------------cceEEEccCCCe-
Confidence 9999999 88875544333 25788898877666655443332 667899999954
Q ss_pred EEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee---ccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 257 LYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL---DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 257 l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~---~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
|+.-.+. ..-++|=+.. .....++.. ...+.+++||.-||.|+... ......++|.-.
T Consensus 244 --fatGSDD--------~tcRlyDlRa-----D~~~a~ys~~~~~~gitSv~FS~SGRlLfagy----~d~~c~vWDtlk 304 (343)
T KOG0286|consen 244 --FATGSDD--------ATCRLYDLRA-----DQELAVYSHDSIICGITSVAFSKSGRLLFAGY----DDFTCNVWDTLK 304 (343)
T ss_pred --eeecCCC--------ceeEEEeecC-----CcEEeeeccCcccCCceeEEEcccccEEEeee----cCCceeEeeccc
Confidence 4432211 1123444433 222333333 45788999999999877654 233466677554
Q ss_pred CCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 334 KDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 334 ~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
++..-+..++.+.+-. +..+|||.-|+..+
T Consensus 305 --~e~vg~L~GHeNRvSc------l~~s~DG~av~TgS 334 (343)
T KOG0286|consen 305 --GERVGVLAGHENRVSC------LGVSPDGMAVATGS 334 (343)
T ss_pred --cceEEEeeccCCeeEE------EEECCCCcEEEecc
Confidence 2333333333332211 55899998877665
No 214
>PTZ00421 coronin; Provisional
Probab=98.60 E-value=1.4e-05 Score=86.41 Aligned_cols=202 Identities=11% Similarity=0.049 Sum_probs=115.1
Q ss_pred cccceeecC-CCCeEEEeeecccccccCCCceeEEEEECCCCce-----eccccCCCccccccccceEEecCCc-EEEEE
Q 004574 32 KINFVSWSP-DGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA-----KPLFESPDICLNAVFGSFVWVNNST-LLIFT 104 (744)
Q Consensus 32 ~~~~p~~Sp-DG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~-----~~lt~~~~~~~~~~~~~~~wspDg~-~l~~~ 104 (744)
.+....||| |++.||..+. + ..|.++++.++.. .++..... +...+..+.|+|++. .|+..
T Consensus 77 ~V~~v~fsP~d~~~LaSgS~--------D--gtIkIWdi~~~~~~~~~~~~l~~L~g--H~~~V~~l~f~P~~~~iLaSg 144 (493)
T PTZ00421 77 PIIDVAFNPFDPQKLFTASE--------D--GTIMGWGIPEEGLTQNISDPIVHLQG--HTKKVGIVSFHPSAMNVLASA 144 (493)
T ss_pred CEEEEEEcCCCCCEEEEEeC--------C--CEEEEEecCCCccccccCcceEEecC--CCCcEEEEEeCcCCCCEEEEE
Confidence 578899999 8988887542 3 3455556654321 11111111 111366789999875 45443
Q ss_pred ecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCC-eeecCCC-ceeeeec
Q 004574 105 IPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGT-AKDFGTP-AVYTAVE 181 (744)
Q Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~-~~~l~~~-~~~~~~~ 181 (744)
+.+ +.|.++|+ +++ ...+..+ ..+..++
T Consensus 145 s~D-------------------------------------------------gtVrIWDl~tg~~~~~l~~h~~~V~sla 175 (493)
T PTZ00421 145 GAD-------------------------------------------------MVVNVWDVERGKAVEVIKCHSDQITSLE 175 (493)
T ss_pred eCC-------------------------------------------------CEEEEEECCCCeEEEEEcCCCCceEEEE
Confidence 211 46788888 664 3444444 6788999
Q ss_pred cCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEE
Q 004574 182 PSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWV 260 (744)
Q Consensus 182 ~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~ 260 (744)
|+|||+.|+..+.+ ..|.+||+..++ ...+....... ...+.|.+++.. ++.+
T Consensus 176 ~spdG~lLatgs~D-------------g~IrIwD~rsg~~v~tl~~H~~~~------------~~~~~w~~~~~~-ivt~ 229 (493)
T PTZ00421 176 WNLDGSLLCTTSKD-------------KKLNIIDPRDGTIVSSVEAHASAK------------SQRCLWAKRKDL-IITL 229 (493)
T ss_pred EECCCCEEEEecCC-------------CEEEEEECCCCcEEEEEecCCCCc------------ceEEEEcCCCCe-EEEE
Confidence 99999988766443 378999987543 33332221110 224578898875 4444
Q ss_pred EeecCCCCCccCCccceEEeccCCCCCCCCceEeeee--ccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 261 EAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL--DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 261 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~--~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
..... ....|.++|++ +...+...... ......+.|++|++.|+.... +...|.++++..
T Consensus 230 G~s~s--------~Dr~VklWDlr--~~~~p~~~~~~d~~~~~~~~~~d~d~~~L~lggk---gDg~Iriwdl~~ 291 (493)
T PTZ00421 230 GCSKS--------QQRQIMLWDTR--KMASPYSTVDLDQSSALFIPFFDEDTNLLYIGSK---GEGNIRCFELMN 291 (493)
T ss_pred ecCCC--------CCCeEEEEeCC--CCCCceeEeccCCCCceEEEEEcCCCCEEEEEEe---CCCeEEEEEeeC
Confidence 32111 11247777773 22223222221 223455779999998776542 233577777765
No 215
>cd00312 Esterase_lipase Esterases and lipases (includes fungal lipases, cholinesterases, etc.) These enzymes act on carboxylic esters (EC: 3.1.1.-). The catalytic apparatus involves three residues (catalytic triad): a serine, a glutamate or aspartate and a histidine.These catalytic residues are responsible for the nucleophilic attack on the carbonyl carbon atom of the ester bond. In contrast with other alpha/beta hydrolase fold family members, p-nitrobenzyl esterase and acetylcholine esterase have a Glu instead of Asp at the active site carboxylate.
Probab=98.59 E-value=1.6e-07 Score=103.78 Aligned_cols=120 Identities=19% Similarity=0.213 Sum_probs=83.5
Q ss_pred EEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhC-C-eEEEecCCC-CCCCCCC
Q 004574 495 LTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLAR-R-FAVLAGPSI-PIIGEGD 571 (744)
Q Consensus 495 l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-G-~~v~~~~~~-~~~g~g~ 571 (744)
+...+|.|....+ .+++||||++|||+|..+... .+ ....++.. + ++|+..+++ +..|+..
T Consensus 79 l~l~i~~p~~~~~--~~~~pv~v~ihGG~~~~g~~~--------~~------~~~~~~~~~~~~~vv~~~yRlg~~g~~~ 142 (493)
T cd00312 79 LYLNVYTPKNTKP--GNSLPVMVWIHGGGFMFGSGS--------LY------PGDGLAREGDNVIVVSINYRLGVLGFLS 142 (493)
T ss_pred CeEEEEeCCCCCC--CCCCCEEEEEcCCccccCCCC--------CC------ChHHHHhcCCCEEEEEeccccccccccc
Confidence 6668888875422 235899999999987543221 11 12334443 3 888886665 4444422
Q ss_pred C--------ChHHHHHHHHHHHHHc---CCCCCCcEEEEEechHHHHHHHHHHh--CCCceeEEEEccCCCC
Q 004574 572 K--------LPNDSAEAAVEEVVRR---GVADPSRIAVGGHSYGAFMTAHLLAH--APHLFCCGIARSGSYN 630 (744)
Q Consensus 572 ~--------~~~~d~~~~~~~l~~~---~~~d~~~i~l~G~S~GG~~a~~~~~~--~p~~~~~~v~~~~~~~ 630 (744)
. ....|+..+++|++++ -..|+++|.|+|+|+||++++.++.. .+.+|+++|+++|...
T Consensus 143 ~~~~~~~~n~g~~D~~~al~wv~~~i~~fggd~~~v~~~G~SaG~~~~~~~~~~~~~~~lf~~~i~~sg~~~ 214 (493)
T cd00312 143 TGDIELPGNYGLKDQRLALKWVQDNIAAFGGDPDSVTIFGESAGGASVSLLLLSPDSKGLFHRAISQSGSAL 214 (493)
T ss_pred CCCCCCCcchhHHHHHHHHHHHHHHHHHhCCCcceEEEEeecHHHHHhhhHhhCcchhHHHHHHhhhcCCcc
Confidence 1 1244999999999986 24799999999999999999888876 2457999999988643
No 216
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=98.58 E-value=0.00013 Score=75.23 Aligned_cols=326 Identities=12% Similarity=0.088 Sum_probs=164.7
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
......|++|+.+||. ...+++|.+...+. ..+|.... .+..+.+||-|.+|.-=.......
T Consensus 37 ~~v~~~S~~G~lfA~~-----------~~~~v~i~~~~~~~-~~lt~~~~-----~~~~L~fSP~g~yL~T~e~~~i~~- 98 (566)
T KOG2315|consen 37 CNVFAYSNNGRLFAYS-----------DNQVVKVFEIATLK-VVLCVELK-----KTYDLLFSPKGNYLLTWEPWAIYG- 98 (566)
T ss_pred ceeEEEcCCCcEEEEE-----------cCCeEEEEEccCCc-EEEEeccc-----eeeeeeeccccccccccccccccc-
Confidence 4567899999999994 44678888877775 33433322 135678999998874211110000
Q ss_pred CCCCCCCCCCeeeecCCCccc-ccccccc--cCCCchhhhccceeeeeEEEEEcCCC---CeeecCCCceeeeeccCCCC
Q 004574 113 PKKTMVPLGPKIQSNEQKNII-ISRMTDN--LLKDEYDESLFDYYTTAQLVLGSLDG---TAKDFGTPAVYTAVEPSPDQ 186 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~~~~~~~l~~~~~~G---~~~~l~~~~~~~~~~~SpDG 186 (744)
+.....+.-..+.......+. .....+. ..+++.++....+.-.+.++.+++.+ ...+|. ...+..+.+||-+
T Consensus 99 ~~~~~~pn~~v~~vet~~~~s~~q~k~Q~~W~~qfs~dEsl~arlv~nev~f~~~~~f~~~~~kl~-~~~i~~f~lSpgp 177 (566)
T KOG2315|consen 99 PKNASNPNVLVYNVETGVQRSQIQKKMQNGWVPQFSIDESLAARLVSNEVQFYDLGSFKTIQHKLS-VSGITMLSLSPGP 177 (566)
T ss_pred CCCCCCCceeeeeeccceehhheehhhhcCcccccccchhhhhhhhcceEEEEecCCccceeeeee-ccceeeEEecCCC
Confidence 000011111111111100000 0001111 33444555555555456777777733 222332 1345567777764
Q ss_pred c--eEEEEEeeCCcccccccCCCcceEEEEeCC-CCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEee
Q 004574 187 K--YVLITSMHRPYSYKVPCARFSQKVQVWTTD-GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQ 263 (744)
Q Consensus 187 ~--~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~-g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~ 263 (744)
. .|++...+. +..+..+.+|... +.....+.....-. . + --.+.|.+-|+.+|+.++..
T Consensus 178 ~~~~vAvyvPe~--------kGaPa~vri~~~~~~~~~~~~a~ksFFk--a---d-----kvqm~WN~~gt~LLvLastd 239 (566)
T KOG2315|consen 178 EPPFVAVYVPEK--------KGAPASVRIYKYPEEGQHQPVANKSFFK--A---D-----KVQMKWNKLGTALLVLASTD 239 (566)
T ss_pred CCceEEEEccCC--------CCCCcEEEEeccccccccchhhhccccc--c---c-----eeEEEeccCCceEEEEEEEe
Confidence 3 455444332 1233445554433 22222222111100 0 0 11478999999966665432
Q ss_pred --cCCCCCccCCccceEEeccCCCCCCCCceEe-eeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCccee
Q 004574 264 --DRGDANVEVSPRDIIYTQPAEPAEGEKPEIL-HKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRV 340 (744)
Q Consensus 264 --~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l-~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~ 340 (744)
.++...+ ....||.++. + |+...+ ....+.+..+.|||+|+-++...-..+ ..+-+.|+.+. .
T Consensus 240 VDktn~SYY---GEq~Lyll~t---~-g~s~~V~L~k~GPVhdv~W~~s~~EF~VvyGfMP--Akvtifnlr~~-----~ 305 (566)
T KOG2315|consen 240 VDKTNASYY---GEQTLYLLAT---Q-GESVSVPLLKEGPVHDVTWSPSGREFAVVYGFMP--AKVTIFNLRGK-----P 305 (566)
T ss_pred ecCCCcccc---ccceEEEEEe---c-CceEEEecCCCCCceEEEECCCCCEEEEEEeccc--ceEEEEcCCCC-----E
Confidence 1111111 1235888887 4 333322 334788999999999998877653332 34556677662 3
Q ss_pred eeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhh
Q 004574 341 LFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFE 420 (744)
Q Consensus 341 l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~ 420 (744)
+.+...+. -+- +.|+|-|.+|++..-+ ..+..+-+||..+.+. |...+
T Consensus 306 v~df~egp-----RN~-~~fnp~g~ii~lAGFG------------------NL~G~mEvwDv~n~K~--i~~~~------ 353 (566)
T KOG2315|consen 306 VFDFPEGP-----RNT-AFFNPHGNIILLAGFG------------------NLPGDMEVWDVPNRKL--IAKFK------ 353 (566)
T ss_pred eEeCCCCC-----ccc-eEECCCCCEEEEeecC------------------CCCCceEEEeccchhh--ccccc------
Confidence 33322221 111 4499999999888632 1233477888765322 22111
Q ss_pred heeeeecCCcceecccCCCEEEEEEe
Q 004574 421 TAVALVFGQGEEDINLNQLKILTSKE 446 (744)
Q Consensus 421 ~~~~~~~~~~~~~~s~d~~~~~~~~~ 446 (744)
..+.+.+.|+|||+.++....
T Consensus 354 -----a~~tt~~eW~PdGe~flTATT 374 (566)
T KOG2315|consen 354 -----AANTTVFEWSPDGEYFLTATT 374 (566)
T ss_pred -----cCCceEEEEcCCCcEEEEEec
Confidence 112334799999988775543
No 217
>TIGR01839 PHA_synth_II poly(R)-hydroxyalkanoic acid synthase, class II. This model represents the class II subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs, typically with six to fourteen carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=98.57 E-value=1.6e-06 Score=92.37 Aligned_cols=84 Identities=14% Similarity=-0.029 Sum_probs=60.4
Q ss_pred chhHHHHHhCCeEEEecCC----CCCCCCCCCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHH----HHHhCC
Q 004574 545 PTSSLIFLARRFAVLAGPS----IPIIGEGDKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAH----LLAHAP 616 (744)
Q Consensus 545 ~~~~~~~~~~G~~v~~~~~----~~~~g~g~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~----~~~~~p 616 (744)
...+..|+++||.|+..+- ......+..+..+.+.++++.+++... .++|.++|+|+||.+++. ++++++
T Consensus 237 ~SlVr~lv~qG~~VflIsW~nP~~~~r~~~ldDYv~~i~~Ald~V~~~tG--~~~vnl~GyC~GGtl~a~~~a~~aA~~~ 314 (560)
T TIGR01839 237 KSFVQYCLKNQLQVFIISWRNPDKAHREWGLSTYVDALKEAVDAVRAITG--SRDLNLLGACAGGLTCAALVGHLQALGQ 314 (560)
T ss_pred chHHHHHHHcCCeEEEEeCCCCChhhcCCCHHHHHHHHHHHHHHHHHhcC--CCCeeEEEECcchHHHHHHHHHHHhcCC
Confidence 3567889999999998321 111223333444567777787777633 468999999999999997 677777
Q ss_pred C-ceeEEEEccCCCC
Q 004574 617 H-LFCCGIARSGSYN 630 (744)
Q Consensus 617 ~-~~~~~v~~~~~~~ 630 (744)
+ +++.++++..+.|
T Consensus 315 ~~~V~sltllatplD 329 (560)
T TIGR01839 315 LRKVNSLTYLVSLLD 329 (560)
T ss_pred CCceeeEEeeecccc
Confidence 5 7999998887765
No 218
>COG2272 PnbA Carboxylesterase type B [Lipid metabolism]
Probab=98.56 E-value=1.6e-07 Score=96.65 Aligned_cols=119 Identities=24% Similarity=0.219 Sum_probs=82.7
Q ss_pred EEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCC-eEEEecCCC-CCCCC---
Q 004574 495 LTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARR-FAVLAGPSI-PIIGE--- 569 (744)
Q Consensus 495 l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G-~~v~~~~~~-~~~g~--- 569 (744)
+..-+|.|. .. .+++||+||+|||+|..+..... ......|+++| ++|++.+++ +..|+
T Consensus 80 L~LNIwaP~-~~---a~~~PVmV~IHGG~y~~Gs~s~~------------~ydgs~La~~g~vVvVSvNYRLG~lGfL~~ 143 (491)
T COG2272 80 LYLNIWAPE-VP---AEKLPVMVYIHGGGYIMGSGSEP------------LYDGSALAARGDVVVVSVNYRLGALGFLDL 143 (491)
T ss_pred eeEEeeccC-CC---CCCCcEEEEEeccccccCCCccc------------ccChHHHHhcCCEEEEEeCcccccceeeeh
Confidence 555778887 11 34589999999998765433211 12245678888 888885553 33342
Q ss_pred ---CCC------ChHHHHHHHHHHHHHc---CCCCCCcEEEEEechHHHHHHHHHHhCCC---ceeEEEEccCCCC
Q 004574 570 ---GDK------LPNDSAEAAVEEVVRR---GVADPSRIAVGGHSYGAFMTAHLLAHAPH---LFCCGIARSGSYN 630 (744)
Q Consensus 570 ---g~~------~~~~d~~~~~~~l~~~---~~~d~~~i~l~G~S~GG~~a~~~~~~~p~---~~~~~v~~~~~~~ 630 (744)
... .-..|+..+++|++++ -.-|+++|.|+|+|+||+.++.+++- |. .|..+|+.+|...
T Consensus 144 ~~~~~~~~~~~n~Gl~DqilALkWV~~NIe~FGGDp~NVTl~GeSAGa~si~~Lla~-P~AkGLF~rAi~~Sg~~~ 218 (491)
T COG2272 144 SSLDTEDAFASNLGLLDQILALKWVRDNIEAFGGDPQNVTLFGESAGAASILTLLAV-PSAKGLFHRAIALSGAAS 218 (491)
T ss_pred hhccccccccccccHHHHHHHHHHHHHHHHHhCCCccceEEeeccchHHHHHHhhcC-ccchHHHHHHHHhCCCCC
Confidence 111 1234999999999986 24689999999999999999888765 33 6888888888653
No 219
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=98.56 E-value=1.1e-06 Score=93.79 Aligned_cols=199 Identities=16% Similarity=0.174 Sum_probs=127.3
Q ss_pred CCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEE
Q 004574 15 DDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVW 94 (744)
Q Consensus 15 ~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~w 94 (744)
+.+++..+.+.+ +..-+...+||||-++|.--+. +....||-++ +....-+.. .+ ..-+..+.|
T Consensus 438 ~~~~~~~~~L~G--H~GPVyg~sFsPd~rfLlScSE--------D~svRLWsl~--t~s~~V~y~-GH---~~PVwdV~F 501 (707)
T KOG0263|consen 438 DDSSGTSRTLYG--HSGPVYGCSFSPDRRFLLSCSE--------DSSVRLWSLD--TWSCLVIYK-GH---LAPVWDVQF 501 (707)
T ss_pred ccCCceeEEeec--CCCceeeeeecccccceeeccC--------Ccceeeeecc--cceeEEEec-CC---CcceeeEEe
Confidence 345566666764 4335889999999999877432 5567788665 333222222 21 113567789
Q ss_pred ecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCC
Q 004574 95 VNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTP 174 (744)
Q Consensus 95 spDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~ 174 (744)
+|-|-+++-.+.+ +..+||..|-.--.+-+..+
T Consensus 502 ~P~GyYFatas~D-----------------------------------------------~tArLWs~d~~~PlRifagh 534 (707)
T KOG0263|consen 502 APRGYYFATASHD-----------------------------------------------QTARLWSTDHNKPLRIFAGH 534 (707)
T ss_pred cCCceEEEecCCC-----------------------------------------------ceeeeeecccCCchhhhccc
Confidence 9998665544221 12467766653333333333
Q ss_pred -ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 -AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 -~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
..+.-..|.|++.+++-.+.+ ..+.+||.-.+...++..++... +..+++||+|
T Consensus 535 lsDV~cv~FHPNs~Y~aTGSsD-------------~tVRlWDv~~G~~VRiF~GH~~~------------V~al~~Sp~G 589 (707)
T KOG0263|consen 535 LSDVDCVSFHPNSNYVATGSSD-------------RTVRLWDVSTGNSVRIFTGHKGP------------VTALAFSPCG 589 (707)
T ss_pred ccccceEEECCcccccccCCCC-------------ceEEEEEcCCCcEEEEecCCCCc------------eEEEEEcCCC
Confidence 555568899999988755333 36889998877666666553221 6688999999
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCc-eEeeeeccceeceeeccCCceEEEee
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKP-EILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
++ |+..+ ..+.|.++|+ .+++. ..+......+.+++||.||..|+...
T Consensus 590 r~-LaSg~------------ed~~I~iWDl---~~~~~v~~l~~Ht~ti~SlsFS~dg~vLasgg 638 (707)
T KOG0263|consen 590 RY-LASGD------------EDGLIKIWDL---ANGSLVKQLKGHTGTIYSLSFSRDGNVLASGG 638 (707)
T ss_pred ce-Eeecc------------cCCcEEEEEc---CCCcchhhhhcccCceeEEEEecCCCEEEecC
Confidence 87 55442 2345888888 44443 44555677889999999999998765
No 220
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.56 E-value=1.1e-06 Score=85.05 Aligned_cols=113 Identities=17% Similarity=0.221 Sum_probs=78.5
Q ss_pred eecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccc-cCCCccccccccceEEecCCcEEE
Q 004574 24 VHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLF-ESPDICLNAVFGSFVWVNNSTLLI 102 (744)
Q Consensus 24 l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt-~~~~~~~~~~~~~~~wspDg~~l~ 102 (744)
+...|...-+++..|.+||..++-++ -+...|-+.++++|+-.+|- .+.+ +++.+.|||||.+|+
T Consensus 189 vl~~pgh~pVtsmqwn~dgt~l~tAS---------~gsssi~iWdpdtg~~~pL~~~glg-----g~slLkwSPdgd~lf 254 (445)
T KOG2139|consen 189 VLQDPGHNPVTSMQWNEDGTILVTAS---------FGSSSIMIWDPDTGQKIPLIPKGLG-----GFSLLKWSPDGDVLF 254 (445)
T ss_pred heeCCCCceeeEEEEcCCCCEEeecc---------cCcceEEEEcCCCCCcccccccCCC-----ceeeEEEcCCCCEEE
Confidence 33345555788999999999988865 34467888899999988885 2222 577889999999998
Q ss_pred EEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC-CeeecCCCceeeeec
Q 004574 103 FTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG-TAKDFGTPAVYTAVE 181 (744)
Q Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G-~~~~l~~~~~~~~~~ 181 (744)
.++.+. ...+|-.+-.- +.+.+...+.+....
T Consensus 255 aAt~da-----------------------------------------------vfrlw~e~q~wt~erw~lgsgrvqtac 287 (445)
T KOG2139|consen 255 AATCDA-----------------------------------------------VFRLWQENQSWTKERWILGSGRVQTAC 287 (445)
T ss_pred Eecccc-----------------------------------------------eeeeehhcccceecceeccCCceeeee
Confidence 774320 02344222222 233333346888999
Q ss_pred cCCCCceEEEEEeeCC
Q 004574 182 PSPDQKYVLITSMHRP 197 (744)
Q Consensus 182 ~SpDG~~i~~~~~~~~ 197 (744)
|||+|++|+|.....+
T Consensus 288 WspcGsfLLf~~sgsp 303 (445)
T KOG2139|consen 288 WSPCGSFLLFACSGSP 303 (445)
T ss_pred ecCCCCEEEEEEcCCc
Confidence 9999999999987753
No 221
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=98.56 E-value=2.2e-06 Score=86.19 Aligned_cols=230 Identities=11% Similarity=0.094 Sum_probs=133.3
Q ss_pred cccceeecC-CCCeEEEeeecccccccCCCceeEEEEECCCCceeccc-cCCCccccccccceEEecCCcEEEEEecCCC
Q 004574 32 KINFVSWSP-DGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLF-ESPDICLNAVFGSFVWVNNSTLLIFTIPSSR 109 (744)
Q Consensus 32 ~~~~p~~Sp-DG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt-~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~ 109 (744)
.++...|.| -|+.|+-. .....|+++++-+ .-+.|. +..+ +-.+..+.||++|..++.++.+
T Consensus 216 gvsai~~fp~~~hLlLS~----------gmD~~vklW~vy~-~~~~lrtf~gH---~k~Vrd~~~s~~g~~fLS~sfD-- 279 (503)
T KOG0282|consen 216 GVSAIQWFPKKGHLLLSG----------GMDGLVKLWNVYD-DRRCLRTFKGH---RKPVRDASFNNCGTSFLSASFD-- 279 (503)
T ss_pred ccchhhhccceeeEEEec----------CCCceEEEEEEec-Ccceehhhhcc---hhhhhhhhccccCCeeeeeecc--
Confidence 589999999 77666653 2334566666554 222222 2222 1147788999999998876432
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCCceeeeeccCCCCc
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTPAVYTAVEPSPDQK 187 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~~~~~~~~~SpDG~ 187 (744)
.-|-++|+ +| ...++.......-..|.||+.
T Consensus 280 -----------------------------------------------~~lKlwDtETG~~~~~f~~~~~~~cvkf~pd~~ 312 (503)
T KOG0282|consen 280 -----------------------------------------------RFLKLWDTETGQVLSRFHLDKVPTCVKFHPDNQ 312 (503)
T ss_pred -----------------------------------------------eeeeeeccccceEEEEEecCCCceeeecCCCCC
Confidence 45667788 88 555555555556778999998
Q ss_pred eEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCC
Q 004574 188 YVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGD 267 (744)
Q Consensus 188 ~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~ 267 (744)
.+++....+ ..|..||+..+++.+--+.... .+..+.|-++|++ |++..+..
T Consensus 313 n~fl~G~sd------------~ki~~wDiRs~kvvqeYd~hLg------------~i~~i~F~~~g~r---FissSDdk- 364 (503)
T KOG0282|consen 313 NIFLVGGSD------------KKIRQWDIRSGKVVQEYDRHLG------------AILDITFVDEGRR---FISSSDDK- 364 (503)
T ss_pred cEEEEecCC------------CcEEEEeccchHHHHHHHhhhh------------heeeeEEccCCce---EeeeccCc-
Confidence 887775543 4799999987765543333222 1567899999987 33322211
Q ss_pred CCccCCccceEEeccCCCCCCCCceEeee-eccceeceeeccCCceEEEeeeeeccceeEEEEcCCCC-CCcceeeeccc
Q 004574 268 ANVEVSPRDIIYTQPAEPAEGEKPEILHK-LDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSK-DVAPRVLFDRV 345 (744)
Q Consensus 268 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~-~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~-~~~~~~l~~~~ 345 (744)
.-+||-... +-..+.+.. .......+...|.+++++..+ ..+.|++..+... ....+....+.
T Consensus 365 -------s~riWe~~~----~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQs----~dN~i~ifs~~~~~r~nkkK~feGh 429 (503)
T KOG0282|consen 365 -------SVRIWENRI----PVPIKNIADPEMHTMPCLTLHPNGKWFAAQS----MDNYIAIFSTVPPFRLNKKKRFEGH 429 (503)
T ss_pred -------cEEEEEcCC----CccchhhcchhhccCcceecCCCCCeehhhc----cCceEEEEecccccccCHhhhhcce
Confidence 223454443 112222222 233455677889999988766 2234555543321 01111222222
Q ss_pred cccccCCCCCCceeeCCCCCeEEEEe
Q 004574 346 FENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 346 ~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
... + ..-.+.+||||++|++..
T Consensus 430 ~va--G--ys~~v~fSpDG~~l~SGd 451 (503)
T KOG0282|consen 430 SVA--G--YSCQVDFSPDGRTLCSGD 451 (503)
T ss_pred ecc--C--ceeeEEEcCCCCeEEeec
Confidence 111 1 122356999999998875
No 222
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=98.55 E-value=7.3e-05 Score=73.47 Aligned_cols=225 Identities=15% Similarity=0.128 Sum_probs=128.5
Q ss_pred ccceeecC-CCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 33 INFVSWSP-DGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 33 ~~~p~~Sp-DG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
...+..+| ++..+||.++ -+.-++++|..+|+..+....+..++ ..+.-.|||||++||.+..+-..
T Consensus 7 gH~~a~~p~~~~avafaRR---------PG~~~~v~D~~~g~~~~~~~a~~gRH--FyGHg~fs~dG~~LytTEnd~~~- 74 (305)
T PF07433_consen 7 GHGVAAHPTRPEAVAFARR---------PGTFALVFDCRTGQLLQRLWAPPGRH--FYGHGVFSPDGRLLYTTENDYET- 74 (305)
T ss_pred ccceeeCCCCCeEEEEEeC---------CCcEEEEEEcCCCceeeEEcCCCCCE--EecCEEEcCCCCEEEEeccccCC-
Confidence 55678899 6777777664 23578899999998775544443222 24577999999999877432111
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCC-C--CeeecCCC-ceeeeeccCCCCc
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLD-G--TAKDFGTP-AVYTAVEPSPDQK 187 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-G--~~~~l~~~-~~~~~~~~SpDG~ 187 (744)
+.+.|-++|+. + ....+... -+.+.+.|.|||+
T Consensus 75 -------------------------------------------g~G~IgVyd~~~~~~ri~E~~s~GIGPHel~l~pDG~ 111 (305)
T PF07433_consen 75 -------------------------------------------GRGVIGVYDAARGYRRIGEFPSHGIGPHELLLMPDGE 111 (305)
T ss_pred -------------------------------------------CcEEEEEEECcCCcEEEeEecCCCcChhhEEEcCCCC
Confidence 22678888885 4 33344444 4566888999999
Q ss_pred eEEEEEeeC---Ccc--cccccCCCcceEEEEeCC-CCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEE
Q 004574 188 YVLITSMHR---PYS--YKVPCARFSQKVQVWTTD-GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVE 261 (744)
Q Consensus 188 ~i~~~~~~~---~~~--~~~~~~~~~~~l~~~~~~-g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~ 261 (744)
.|++....- ++. ...........|-..|.. |..+.+..-.+ . .....++.+++.+||. +++..
T Consensus 112 tLvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q~~Lp~-~--------~~~lSiRHLa~~~~G~--V~~a~ 180 (305)
T PF07433_consen 112 TLVVANGGIETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQVELPP-D--------LHQLSIRHLAVDGDGT--VAFAM 180 (305)
T ss_pred EEEEEcCCCccCcccCceecChhhcCCceEEEecCCCceeeeeecCc-c--------ccccceeeEEecCCCc--EEEEE
Confidence 988764321 110 011111222355556544 44333321110 0 0112288999999996 44443
Q ss_pred eecCCCCCccCCccceEEeccCCCCCCCCceEee-------eeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCC
Q 004574 262 AQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILH-------KLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~-------~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
.. .+.... ..-.|.+.+. ++..+.+. .....+.++++++||..++.++. + -..+.++|..++
T Consensus 181 Q~-qg~~~~---~~PLva~~~~----g~~~~~~~~p~~~~~~l~~Y~gSIa~~~~g~~ia~tsP-r--Gg~~~~~d~~tg 249 (305)
T PF07433_consen 181 QY-QGDPGD---APPLVALHRR----GGALRLLPAPEEQWRRLNGYIGSIAADRDGRLIAVTSP-R--GGRVAVWDAATG 249 (305)
T ss_pred ec-CCCCCc---cCCeEEEEcC----CCcceeccCChHHHHhhCCceEEEEEeCCCCEEEEECC-C--CCEEEEEECCCC
Confidence 32 222211 1123555443 22222221 12467899999999999998873 2 235666677663
No 223
>PF02273 Acyl_transf_2: Acyl transferase; InterPro: IPR003157 LuxD proteins are bacterial acyl transferases. Together with an acyl-protein synthetase (LuxE) and reductase (LuxC), they form a multienzyme complex. This complex channels activated fatty acids into the aldehyde substrate for the luciferase-catalyzed bacterial bioluminescence reaction [, ]. ; GO: 0016746 transferase activity, transferring acyl groups, 0006631 fatty acid metabolic process; PDB: 1THT_B.
Probab=98.53 E-value=6.9e-06 Score=76.06 Aligned_cols=199 Identities=17% Similarity=0.190 Sum_probs=105.9
Q ss_pred cCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCC
Q 004574 489 RKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIG 568 (744)
Q Consensus 489 ~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g 568 (744)
-.+|..+..+-..|+.-.+.+ .+.||+.-|.+.. +.. +...+.+|+.+||.|+..+...-.|
T Consensus 9 ~~~~~~I~vwet~P~~~~~~~---~~tiliA~Gf~rr--------------mdh-~agLA~YL~~NGFhViRyDsl~HvG 70 (294)
T PF02273_consen 9 LEDGRQIRVWETRPKNNEPKR---NNTILIAPGFARR--------------MDH-FAGLAEYLSANGFHVIRYDSLNHVG 70 (294)
T ss_dssp ETTTEEEEEEEE---TTS------S-EEEEE-TT-GG--------------GGG-GHHHHHHHHTTT--EEEE---B---
T ss_pred cCCCCEEEEeccCCCCCCccc---CCeEEEecchhHH--------------HHH-HHHHHHHHhhCCeEEEecccccccc
Confidence 457889999888898744433 6899999886421 111 1245678899999999843333333
Q ss_pred CCCC--------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCC-----
Q 004574 569 EGDK--------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTP----- 635 (744)
Q Consensus 569 ~g~~--------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~----- 635 (744)
.... ...+++..+++||++.+. .+++|+.-|.-|-+|...+++- ...-+|...|+++...+-
T Consensus 71 lSsG~I~eftms~g~~sL~~V~dwl~~~g~---~~~GLIAaSLSaRIAy~Va~~i--~lsfLitaVGVVnlr~TLe~al~ 145 (294)
T PF02273_consen 71 LSSGDINEFTMSIGKASLLTVIDWLATRGI---RRIGLIAASLSARIAYEVAADI--NLSFLITAVGVVNLRDTLEKALG 145 (294)
T ss_dssp ----------HHHHHHHHHHHHHHHHHTT------EEEEEETTHHHHHHHHTTTS----SEEEEES--S-HHHHHHHHHS
T ss_pred CCCCChhhcchHHhHHHHHHHHHHHHhcCC---CcchhhhhhhhHHHHHHHhhcc--CcceEEEEeeeeeHHHHHHHHhc
Confidence 2111 123489999999997763 5799999999999999999864 366677777887632110
Q ss_pred C---c-----ccccc--cchhhcHHHH----H--hc----CcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHh
Q 004574 636 F---G-----FQTEF--RTLWEATNVY----I--EM----SPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKG 695 (744)
Q Consensus 636 ~---~-----~~~~~--~~~~~~~~~~----~--~~----~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~ 695 (744)
+ . .+... ...--..+.+ . .+ +....++.+.+|++.+++++|..|. ..+..++...+..
T Consensus 146 ~Dyl~~~i~~lp~dldfeGh~l~~~vFv~dc~e~~w~~l~ST~~~~k~l~iP~iaF~A~~D~WV~--q~eV~~~~~~~~s 223 (294)
T PF02273_consen 146 YDYLQLPIEQLPEDLDFEGHNLGAEVFVTDCFEHGWDDLDSTINDMKRLSIPFIAFTANDDDWVK--QSEVEELLDNINS 223 (294)
T ss_dssp S-GGGS-GGG--SEEEETTEEEEHHHHHHHHHHTT-SSHHHHHHHHTT--S-EEEEEETT-TTS---HHHHHHHHTT-TT
T ss_pred cchhhcchhhCCCcccccccccchHHHHHHHHHcCCccchhHHHHHhhCCCCEEEEEeCCCcccc--HHHHHHHHHhcCC
Confidence 0 0 00000 0000001111 1 11 3455678889999999999999987 7777777665543
Q ss_pred CCCcEEEEEeCCCCcccCc
Q 004574 696 HGALSRLVLLPFEHHVYAA 714 (744)
Q Consensus 696 ~~~~~~~~~~~~~~H~~~~ 714 (744)
..+++..++|+.|....
T Consensus 224 --~~~klysl~Gs~HdL~e 240 (294)
T PF02273_consen 224 --NKCKLYSLPGSSHDLGE 240 (294)
T ss_dssp ----EEEEEETT-SS-TTS
T ss_pred --CceeEEEecCccchhhh
Confidence 46799999999998753
No 224
>COG3208 GrsT Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=98.53 E-value=6e-06 Score=77.37 Aligned_cols=150 Identities=15% Similarity=0.104 Sum_probs=85.2
Q ss_pred hHHHHHHHHHHHHHcC--CCCCCcEEEEEechHHHHHHHHHHhCC---CceeEEEEcc---CCCCCCCCCCccc-----c
Q 004574 574 PNDSAEAAVEEVVRRG--VADPSRIAVGGHSYGAFMTAHLLAHAP---HLFCCGIARS---GSYNKTLTPFGFQ-----T 640 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~--~~d~~~i~l~G~S~GG~~a~~~~~~~p---~~~~~~v~~~---~~~~~~~~~~~~~-----~ 640 (744)
...|+...++.+...- ..-...++++||||||.+|.-+|.+.- -...++.+.+ |.++......... .
T Consensus 52 ~~~di~~Lad~la~el~~~~~d~P~alfGHSmGa~lAfEvArrl~~~g~~p~~lfisg~~aP~~~~~~~i~~~~D~~~l~ 131 (244)
T COG3208 52 LLTDIESLADELANELLPPLLDAPFALFGHSMGAMLAFEVARRLERAGLPPRALFISGCRAPHYDRGKQIHHLDDADFLA 131 (244)
T ss_pred ccccHHHHHHHHHHHhccccCCCCeeecccchhHHHHHHHHHHHHHcCCCcceEEEecCCCCCCcccCCccCCCHHHHHH
Confidence 4447888888877642 222357999999999999998886541 1233443333 2122110000000 0
Q ss_pred ----cc---cchhhcHHHHHhcCc-----------ccc--cCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcE
Q 004574 641 ----EF---RTLWEATNVYIEMSP-----------ITH--ANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALS 700 (744)
Q Consensus 641 ----~~---~~~~~~~~~~~~~~~-----------~~~--~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~ 700 (744)
.. ...+++++.+.-.-| ..+ -..+.||+.++.|++|..|. .++...+.+.. +...
T Consensus 132 ~l~~lgG~p~e~led~El~~l~LPilRAD~~~~e~Y~~~~~~pl~~pi~~~~G~~D~~vs--~~~~~~W~~~t---~~~f 206 (244)
T COG3208 132 DLVDLGGTPPELLEDPELMALFLPILRADFRALESYRYPPPAPLACPIHAFGGEKDHEVS--RDELGAWREHT---KGDF 206 (244)
T ss_pred HHHHhCCCChHHhcCHHHHHHHHHHHHHHHHHhcccccCCCCCcCcceEEeccCcchhcc--HHHHHHHHHhh---cCCc
Confidence 00 122333333221111 111 24578999999999999987 77776666554 3567
Q ss_pred EEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 701 RLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 701 ~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
++++|+| +|++. .....++...+.+.+.
T Consensus 207 ~l~~fdG-gHFfl-~~~~~~v~~~i~~~l~ 234 (244)
T COG3208 207 TLRVFDG-GHFFL-NQQREEVLARLEQHLA 234 (244)
T ss_pred eEEEecC-cceeh-hhhHHHHHHHHHHHhh
Confidence 9999997 49876 4444444444444443
No 225
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=98.51 E-value=3.5e-05 Score=73.16 Aligned_cols=220 Identities=10% Similarity=0.080 Sum_probs=124.6
Q ss_pred eEEEEEcCCC---CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCCCC
Q 004574 157 AQLVLGSLDG---TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPPAE 231 (744)
Q Consensus 157 ~~l~~~~~~G---~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~~~ 231 (744)
.+|+++++.| +...++-+ +-+-.+.|++||+.|+-.+.+ ..++.||..+++ .++......-
T Consensus 69 r~I~LWnv~gdceN~~~lkgHsgAVM~l~~~~d~s~i~S~gtD-------------k~v~~wD~~tG~~~rk~k~h~~~- 134 (338)
T KOG0265|consen 69 RAIVLWNVYGDCENFWVLKGHSGAVMELHGMRDGSHILSCGTD-------------KTVRGWDAETGKRIRKHKGHTSF- 134 (338)
T ss_pred ceEEEEeccccccceeeeccccceeEeeeeccCCCEEEEecCC-------------ceEEEEecccceeeehhccccce-
Confidence 5899999866 34444444 778899999999988766544 379999988554 3433222111
Q ss_pred CCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCc
Q 004574 232 DIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSL 311 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~ 311 (744)
+..+.-+--|.. |+... . ..+.+-++|. -..+..++.........+.|..++.
T Consensus 135 ------------vNs~~p~rrg~~-lv~Sg-s----------dD~t~kl~D~---R~k~~~~t~~~kyqltAv~f~d~s~ 187 (338)
T KOG0265|consen 135 ------------VNSLDPSRRGPQ-LVCSG-S----------DDGTLKLWDI---RKKEAIKTFENKYQLTAVGFKDTSD 187 (338)
T ss_pred ------------eeecCccccCCe-EEEec-C----------CCceEEEEee---cccchhhccccceeEEEEEeccccc
Confidence 112221222222 22221 1 1223666666 2334444555567778899999988
Q ss_pred eEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCC
Q 004574 312 ALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPE 391 (744)
Q Consensus 312 ~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~ 391 (744)
.+.... ..++|.++|+-.+ ....++.++...+.. +..|++|.++...+++
T Consensus 188 qv~sgg----Idn~ikvWd~r~~--d~~~~lsGh~DtIt~------lsls~~gs~llsnsMd------------------ 237 (338)
T KOG0265|consen 188 QVISGG----IDNDIKVWDLRKN--DGLYTLSGHADTITG------LSLSRYGSFLLSNSMD------------------ 237 (338)
T ss_pred ceeecc----ccCceeeeccccC--cceEEeecccCceee------EEeccCCCcccccccc------------------
Confidence 877654 3445777777442 234455444332222 5579999988777643
Q ss_pred CCCceEEEEecC----CCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCC
Q 004574 392 GNIPFLDLFDIN----TGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPL 461 (744)
Q Consensus 392 ~~~~~l~~~d~~----~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~ 461 (744)
..+.+||.. ..+...++.+... .|+.-- -.++|||++..+-+...+ ..+|.||..+
T Consensus 238 ---~tvrvwd~rp~~p~~R~v~if~g~~h-nfeknl------L~cswsp~~~~i~ags~d----r~vyvwd~~~ 297 (338)
T KOG0265|consen 238 ---NTVRVWDVRPFAPSQRCVKIFQGHIH-NFEKNL------LKCSWSPNGTKITAGSAD----RFVYVWDTTS 297 (338)
T ss_pred ---ceEEEEEecccCCCCceEEEeecchh-hhhhhc------ceeeccCCCCcccccccc----ceEEEeeccc
Confidence 246677753 2222334433222 233211 237999999877765543 3478887433
No 226
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.50 E-value=6.9e-06 Score=79.75 Aligned_cols=157 Identities=13% Similarity=0.145 Sum_probs=90.3
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
..+..+.|.+||..++-.+-.. ..|.+||++.+...+|..... .|..-+.|||||.
T Consensus 196 ~pVtsmqwn~dgt~l~tAS~gs------------ssi~iWdpdtg~~~pL~~~gl------------gg~slLkwSPdgd 251 (445)
T KOG2139|consen 196 NPVTSMQWNEDGTILVTASFGS------------SSIMIWDPDTGQKIPLIPKGL------------GGFSLLKWSPDGD 251 (445)
T ss_pred ceeeEEEEcCCCCEEeecccCc------------ceEEEEcCCCCCcccccccCC------------CceeeEEEcCCCC
Confidence 4577899999999887554432 479999999887666652211 2245679999999
Q ss_pred ceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCC
Q 004574 255 STLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 255 ~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
+ .|+..-+... +||-..-. -+.+.- +. +.+++....|||+|+.|+|+.. +...||.+...+.
T Consensus 252 ~--lfaAt~davf---------rlw~e~q~--wt~erw-~l-gsgrvqtacWspcGsfLLf~~s---gsp~lysl~f~~~ 313 (445)
T KOG2139|consen 252 V--LFAATCDAVF---------RLWQENQS--WTKERW-IL-GSGRVQTACWSPCGSFLLFACS---GSPRLYSLTFDGE 313 (445)
T ss_pred E--EEEeccccee---------eeehhccc--ceecce-ec-cCCceeeeeecCCCCEEEEEEc---CCceEEEEeecCC
Confidence 6 3443222211 13311110 111211 22 3558999999999999999873 3446777776653
Q ss_pred CCcce------e-eecccccccc-CCC-----C-CCceeeCCCCCeEEEEeee
Q 004574 335 DVAPR------V-LFDRVFENVY-SDP-----G-SPMMTRTSTGTNVIAKIKK 373 (744)
Q Consensus 335 ~~~~~------~-l~~~~~~~~~-~~~-----~-~~~~~~spdg~~l~~~~~~ 373 (744)
..... . +...+...+. ..+ + .-.++|.|-|.+|+...++
T Consensus 314 ~~~~~~~~~~k~~lliaDL~e~ti~ag~~l~cgeaq~lawDpsGeyLav~fKg 366 (445)
T KOG2139|consen 314 DSVFLRPQSIKRVLLIADLQEVTICAGQRLCCGEAQCLAWDPSGEYLAVIFKG 366 (445)
T ss_pred CccccCcccceeeeeeccchhhhhhcCcccccCccceeeECCCCCEEEEEEcC
Confidence 21111 1 1111111000 000 1 1126799999999998855
No 227
>COG1073 Hydrolases of the alpha/beta superfamily [General function prediction only]
Probab=98.50 E-value=1.1e-06 Score=90.33 Aligned_cols=78 Identities=23% Similarity=0.430 Sum_probs=62.6
Q ss_pred HHhcCcccccCCCC-CCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccH--HHHHHHHHHH
Q 004574 652 YIEMSPITHANKIK-KPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENV--MHVIWETDRW 728 (744)
Q Consensus 652 ~~~~~~~~~~~~~~-~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~--~~~~~~~~~f 728 (744)
+...++...+.++. +|+|++||.+|..|| ...+..++++.+.. +.+..++++++|........ .+....+.+|
T Consensus 218 ~~~~d~~~~~~~i~~~P~l~~~G~~D~~vp--~~~~~~~~~~~~~~--~~~~~~~~~~~H~~~~~~~~~~~~~~~~~~~f 293 (299)
T COG1073 218 LLLLDPFDDAEKISPRPVLLVHGERDEVVP--LRDAEDLYEAARER--PKKLLFVPGGGHIDLYDNPPAVEQALDKLAEF 293 (299)
T ss_pred hccCcchhhHhhcCCcceEEEecCCCcccc--hhhhHHHHhhhccC--CceEEEecCCccccccCccHHHHHHHHHHHHH
Confidence 44556666677776 799999999999999 99999999888765 66888889999987643333 4789999999
Q ss_pred HHHhc
Q 004574 729 LQKYC 733 (744)
Q Consensus 729 l~~~l 733 (744)
|.+++
T Consensus 294 ~~~~l 298 (299)
T COG1073 294 LERHL 298 (299)
T ss_pred HHHhc
Confidence 99876
No 228
>PTZ00420 coronin; Provisional
Probab=98.49 E-value=0.00013 Score=79.71 Aligned_cols=232 Identities=10% Similarity=0.129 Sum_probs=117.7
Q ss_pred ecCCC-ceeeeeccCCC-CceEEEEEeeCCcccccccCCCcceEEEEeCCCCe--eeeccCCCCCCCCCcccCCccCCCC
Q 004574 170 DFGTP-AVYTAVEPSPD-QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL--VRELCDLPPAEDIPVCYNSVREGMR 245 (744)
Q Consensus 170 ~l~~~-~~~~~~~~SpD-G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~--~~~l~~~~~~~~~~~~~~~~~~~~~ 245 (744)
.+..+ +.+..++|+|+ ++.|+ +...+ ..|.+|++.... ...+.. +.. .+......+.
T Consensus 69 ~L~gH~~~V~~lafsP~~~~lLA-SgS~D------------gtIrIWDi~t~~~~~~~i~~-p~~-----~L~gH~~~V~ 129 (568)
T PTZ00420 69 KLKGHTSSILDLQFNPCFSEILA-SGSED------------LTIRVWEIPHNDESVKEIKD-PQC-----ILKGHKKKIS 129 (568)
T ss_pred EEcCCCCCEEEEEEcCCCCCEEE-EEeCC------------CeEEEEECCCCCcccccccc-ceE-----EeecCCCcEE
Confidence 34444 67889999998 55554 43332 378889976321 111110 000 0011112366
Q ss_pred ccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecccee
Q 004574 246 SISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTR 325 (744)
Q Consensus 246 ~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~ 325 (744)
.+.|+|++.. ++.+... .+.|.++|+ ..++..........+.+++|+|||+.|+.++ ....
T Consensus 130 sVaf~P~g~~-iLaSgS~-----------DgtIrIWDl---~tg~~~~~i~~~~~V~SlswspdG~lLat~s----~D~~ 190 (568)
T PTZ00420 130 IIDWNPMNYY-IMCSSGF-----------DSFVNIWDI---ENEKRAFQINMPKKLSSLKWNIKGNLLSGTC----VGKH 190 (568)
T ss_pred EEEECCCCCe-EEEEEeC-----------CCeEEEEEC---CCCcEEEEEecCCcEEEEEECCCCCEEEEEe----cCCE
Confidence 8899999986 4333211 124778887 4444332223445688999999999887665 2235
Q ss_pred EEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCC-
Q 004574 326 TWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINT- 404 (744)
Q Consensus 326 l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~- 404 (744)
|.++|+.++ +.......+.+..... ......+++|+.+|+...... ...+.|.+||+..
T Consensus 191 IrIwD~Rsg--~~i~tl~gH~g~~~s~-~v~~~~fs~d~~~IlTtG~d~-----------------~~~R~VkLWDlr~~ 250 (568)
T PTZ00420 191 MHIIDPRKQ--EIASSFHIHDGGKNTK-NIWIDGLGGDDNYILSTGFSK-----------------NNMREMKLWDLKNT 250 (568)
T ss_pred EEEEECCCC--cEEEEEecccCCceeE-EEEeeeEcCCCCEEEEEEcCC-----------------CCccEEEEEECCCC
Confidence 788888763 2222222221110000 000012458888877765210 1234688999873
Q ss_pred CceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecCCC
Q 004574 405 GSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNFPH 472 (744)
Q Consensus 405 g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~~~ 472 (744)
++.......+.. ...+. +.+.++.+.++.+.. .-..|+++++..+....|..+..
T Consensus 251 ~~pl~~~~ld~~-----~~~L~-----p~~D~~tg~l~lsGk---GD~tIr~~e~~~~~~~~l~~~~s 305 (568)
T PTZ00420 251 TSALVTMSIDNA-----SAPLI-----PHYDESTGLIYLIGK---GDGNCRYYQHSLGSIRKVNEYKS 305 (568)
T ss_pred CCceEEEEecCC-----ccceE-----EeeeCCCCCEEEEEE---CCCeEEEEEccCCcEEeeccccc
Confidence 333222221110 11110 133344334333322 23468888877777777766543
No 229
>PRK04940 hypothetical protein; Provisional
Probab=98.49 E-value=2.7e-06 Score=76.93 Aligned_cols=116 Identities=10% Similarity=0.059 Sum_probs=73.0
Q ss_pred CcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCC---CCcccccccchhhcHHHHHhcCcccccCCCCCCEEE
Q 004574 594 SRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLT---PFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILI 670 (744)
Q Consensus 594 ~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~ 670 (744)
++++|+|.|+||+.|.+++.++ .++| |++.|.+..... ..+.+.+ ...-..+...+.. .+-.-..++
T Consensus 60 ~~~~liGSSLGGyyA~~La~~~--g~~a-VLiNPAv~P~~~L~~~ig~~~~--y~~~~~~h~~eL~-----~~~p~r~~v 129 (180)
T PRK04940 60 ERPLICGVGLGGYWAERIGFLC--GIRQ-VIFNPNLFPEENMEGKIDRPEE--YADIATKCVTNFR-----EKNRDRCLV 129 (180)
T ss_pred CCcEEEEeChHHHHHHHHHHHH--CCCE-EEECCCCChHHHHHHHhCCCcc--hhhhhHHHHHHhh-----hcCcccEEE
Confidence 4699999999999999999998 2454 456776542110 0011111 0011111111111 111233699
Q ss_pred EeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 671 IHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 671 i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
+..+.|++.. +.++.+.+... .+..+.+|++|.|.. .++....|++|+.
T Consensus 130 llq~gDEvLD--yr~a~~~y~~~------y~~~v~~GGdH~f~~---fe~~l~~I~~F~~ 178 (180)
T PRK04940 130 ILSRNDEVLD--SQRTAEELHPY------YEIVWDEEQTHKFKN---ISPHLQRIKAFKT 178 (180)
T ss_pred EEeCCCcccC--HHHHHHHhccC------ceEEEECCCCCCCCC---HHHHHHHHHHHHh
Confidence 9999999987 77777666332 157889999998765 6678899999985
No 230
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=98.48 E-value=3.8e-05 Score=72.96 Aligned_cols=253 Identities=13% Similarity=0.035 Sum_probs=152.2
Q ss_pred CccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCC
Q 004574 3 FFTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPD 82 (744)
Q Consensus 3 ~~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~ 82 (744)
|-+-|+++++.+| -.....++.... .+....|++||+.|+-.. ....|+..|+++|+..+--....
T Consensus 67 ~Dr~I~LWnv~gd--ceN~~~lkgHsg--AVM~l~~~~d~s~i~S~g----------tDk~v~~wD~~tG~~~rk~k~h~ 132 (338)
T KOG0265|consen 67 SDRAIVLWNVYGD--CENFWVLKGHSG--AVMELHGMRDGSHILSCG----------TDKTVRGWDAETGKRIRKHKGHT 132 (338)
T ss_pred CcceEEEEecccc--ccceeeeccccc--eeEeeeeccCCCEEEEec----------CCceEEEEecccceeeehhcccc
Confidence 3456899998772 123345553332 488999999999998863 23578888999888655433332
Q ss_pred ccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 83 ICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 83 ~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
.. +..+.=+.=|-.|+-+..+ .+++-++
T Consensus 133 ~~----vNs~~p~rrg~~lv~Sgsd------------------------------------------------D~t~kl~ 160 (338)
T KOG0265|consen 133 SF----VNSLDPSRRGPQLVCSGSD------------------------------------------------DGTLKLW 160 (338)
T ss_pred ce----eeecCccccCCeEEEecCC------------------------------------------------CceEEEE
Confidence 21 2222212224444443211 0467777
Q ss_pred cC-CC-CeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCc
Q 004574 163 SL-DG-TAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSV 240 (744)
Q Consensus 163 ~~-~G-~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~ 240 (744)
|+ .. .++.+.+......+.|..++..+.-..-+ ++|-+||+...+..-+..+....
T Consensus 161 D~R~k~~~~t~~~kyqltAv~f~d~s~qv~sggId-------------n~ikvWd~r~~d~~~~lsGh~Dt--------- 218 (338)
T KOG0265|consen 161 DIRKKEAIKTFENKYQLTAVGFKDTSDQVISGGID-------------NDIKVWDLRKNDGLYTLSGHADT--------- 218 (338)
T ss_pred eecccchhhccccceeEEEEEecccccceeecccc-------------CceeeeccccCcceEEeecccCc---------
Confidence 77 44 34444444788899999999888654323 36888998655544444443221
Q ss_pred cCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCc-eEeeee-----ccceeceeeccCCceEE
Q 004574 241 REGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKP-EILHKL-----DLRFRSVSWCDDSLALV 314 (744)
Q Consensus 241 ~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~l~~~-----~~~~~~~~~SpDg~~l~ 314 (744)
+.++..||+|.. +.--+ +...+.+++.+++..++. ..+..+ +...-..+|||+++.+.
T Consensus 219 ---It~lsls~~gs~-llsns------------Md~tvrvwd~rp~~p~~R~v~if~g~~hnfeknlL~cswsp~~~~i~ 282 (338)
T KOG0265|consen 219 ---ITGLSLSRYGSF-LLSNS------------MDNTVRVWDVRPFAPSQRCVKIFQGHIHNFEKNLLKCSWSPNGTKIT 282 (338)
T ss_pred ---eeeEEeccCCCc-ccccc------------ccceEEEEEecccCCCCceEEEeecchhhhhhhcceeeccCCCCccc
Confidence 557888999986 32221 334577788877776665 445443 22344678999999998
Q ss_pred EeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 315 NETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 315 ~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
+.+.+ ..+|++|..+ ........+..+.+.. +.|.|....|...+
T Consensus 283 ags~d----r~vyvwd~~~--r~~lyklpGh~gsvn~------~~Fhp~e~iils~~ 327 (338)
T KOG0265|consen 283 AGSAD----RFVYVWDTTS--RRILYKLPGHYGSVNE------VDFHPTEPIILSCS 327 (338)
T ss_pred ccccc----ceEEEeeccc--ccEEEEcCCcceeEEE------eeecCCCcEEEEec
Confidence 87733 3688888766 2223334444444333 44677777666554
No 231
>COG0596 MhpC Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
Probab=98.47 E-value=4.5e-06 Score=83.57 Aligned_cols=68 Identities=15% Similarity=0.167 Sum_probs=48.4
Q ss_pred eEEEecCCCCCCCCCCCC----hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC
Q 004574 556 FAVLAGPSIPIIGEGDKL----PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS 628 (744)
Q Consensus 556 ~~v~~~~~~~~~g~g~~~----~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~ 628 (744)
|.++. .+.+|+|.+. ......+.+..+.+.... .++.++||||||.+++.++.++|+.++++|++++.
T Consensus 51 ~~~~~---~d~~g~g~s~~~~~~~~~~~~~~~~~~~~~~~--~~~~l~G~S~Gg~~~~~~~~~~p~~~~~~v~~~~~ 122 (282)
T COG0596 51 YRVIA---PDLRGHGRSDPAGYSLSAYADDLAALLDALGL--EKVVLVGHSMGGAVALALALRHPDRVRGLVLIGPA 122 (282)
T ss_pred eEEEE---ecccCCCCCCcccccHHHHHHHHHHHHHHhCC--CceEEEEecccHHHHHHHHHhcchhhheeeEecCC
Confidence 88888 4555666653 222233444444443333 34999999999999999999999999999998864
No 232
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=98.47 E-value=3.2e-05 Score=81.04 Aligned_cols=266 Identities=11% Similarity=0.127 Sum_probs=156.6
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
.....||. ...||.. -...||+.+..+|+..+|..... ..+..+.|+++|+.|+.....
T Consensus 180 ~nlldWss-~n~laVa-----------lg~~vylW~~~s~~v~~l~~~~~----~~vtSv~ws~~G~~LavG~~~----- 238 (484)
T KOG0305|consen 180 LNLLDWSS-ANVLAVA-----------LGQSVYLWSASSGSVTELCSFGE----ELVTSVKWSPDGSHLAVGTSD----- 238 (484)
T ss_pred hhHhhccc-CCeEEEE-----------ecceEEEEecCCCceEEeEecCC----CceEEEEECCCCCEEEEeecC-----
Confidence 34568884 3466663 33689999999999988875531 147789999999999986321
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCC-C-ceeeeeccCCCCce
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGT-P-AVYTAVEPSPDQKY 188 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~-~-~~~~~~~~SpDG~~ 188 (744)
+.+.++|. +- .++.+.. + ..+..++|. +.
T Consensus 239 --------------------------------------------g~v~iwD~~~~k~~~~~~~~h~~rvg~laW~--~~- 271 (484)
T KOG0305|consen 239 --------------------------------------------GTVQIWDVKEQKKTRTLRGSHASRVGSLAWN--SS- 271 (484)
T ss_pred --------------------------------------------CeEEEEehhhccccccccCCcCceeEEEecc--Cc-
Confidence 56777777 33 5566655 4 677789998 22
Q ss_pred EEEEEeeCCcccccccCCCcceEEEEeCCCCee-eeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCC
Q 004574 189 VLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV-RELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGD 267 (744)
Q Consensus 189 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~ 267 (744)
++-..... ..|..+|+...+. ...... ....+-.+.|++|+++ ++-- +.+
T Consensus 272 ~lssGsr~------------~~I~~~dvR~~~~~~~~~~~------------H~qeVCgLkws~d~~~-lASG----gnD 322 (484)
T KOG0305|consen 272 VLSSGSRD------------GKILNHDVRISQHVVSTLQG------------HRQEVCGLKWSPDGNQ-LASG----GND 322 (484)
T ss_pred eEEEecCC------------CcEEEEEEecchhhhhhhhc------------ccceeeeeEECCCCCe-eccC----CCc
Confidence 33232221 2455555443221 111111 1112557899999986 3221 111
Q ss_pred CCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCccee-eecccc
Q 004574 268 ANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRV-LFDRVF 346 (744)
Q Consensus 268 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~-l~~~~~ 346 (744)
..+++++.. .......++.....+--++|+|=-..|+.+. .......|..+|..++ .... +..++.
T Consensus 323 --------N~~~Iwd~~--~~~p~~~~~~H~aAVKA~awcP~q~~lLAsG-GGs~D~~i~fwn~~~g--~~i~~vdtgsQ 389 (484)
T KOG0305|consen 323 --------NVVFIWDGL--SPEPKFTFTEHTAAVKALAWCPWQSGLLATG-GGSADRCIKFWNTNTG--ARIDSVDTGSQ 389 (484)
T ss_pred --------cceEeccCC--CccccEEEeccceeeeEeeeCCCccCceEEc-CCCcccEEEEEEcCCC--cEecccccCCc
Confidence 247888762 3344455777788889999999988777665 2334457777787764 2222 222221
Q ss_pred ccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeee
Q 004574 347 ENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALV 426 (744)
Q Consensus 347 ~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~ 426 (744)
- -.+.||+..+.|+... |.+ +..|-+|...+-+.......... .|-
T Consensus 390 V--------csL~Wsk~~kEi~sth---------------G~s----~n~i~lw~~ps~~~~~~l~gH~~----RVl--- 435 (484)
T KOG0305|consen 390 V--------CSLIWSKKYKELLSTH---------------GYS----ENQITLWKYPSMKLVAELLGHTS----RVL--- 435 (484)
T ss_pred e--------eeEEEcCCCCEEEEec---------------CCC----CCcEEEEeccccceeeeecCCcc----eeE---
Confidence 1 1166999999998874 221 22355555555433333322211 222
Q ss_pred cCCcceecccCCCEEEEEEe
Q 004574 427 FGQGEEDINLNQLKILTSKE 446 (744)
Q Consensus 427 ~~~~~~~~s~d~~~~~~~~~ 446 (744)
-+++||||..++....
T Consensus 436 ----~la~SPdg~~i~t~a~ 451 (484)
T KOG0305|consen 436 ----YLALSPDGETIVTGAA 451 (484)
T ss_pred ----EEEECCCCCEEEEecc
Confidence 2589999988775443
No 233
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=98.46 E-value=3.6e-06 Score=84.72 Aligned_cols=219 Identities=10% Similarity=0.043 Sum_probs=129.7
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
+.|+|+++..+ .+-.+... .+...+...+||++|..+.-++. + .-|-+.|+++|+...-.....
T Consensus 237 ~~vklW~vy~~--~~~lrtf~--gH~k~Vrd~~~s~~g~~fLS~sf--------D--~~lKlwDtETG~~~~~f~~~~-- 300 (503)
T KOG0282|consen 237 GLVKLWNVYDD--RRCLRTFK--GHRKPVRDASFNNCGTSFLSASF--------D--RFLKLWDTETGQVLSRFHLDK-- 300 (503)
T ss_pred ceEEEEEEecC--cceehhhh--cchhhhhhhhccccCCeeeeeec--------c--eeeeeeccccceEEEEEecCC--
Confidence 35788888551 12323332 33335899999999999888653 3 445555999988655432222
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
....+.+.||+..++++... ...|..+|+
T Consensus 301 ---~~~cvkf~pd~~n~fl~G~s------------------------------------------------d~ki~~wDi 329 (503)
T KOG0282|consen 301 ---VPTCVKFHPDNQNIFLVGGS------------------------------------------------DKKIRQWDI 329 (503)
T ss_pred ---CceeeecCCCCCcEEEEecC------------------------------------------------CCcEEEEec
Confidence 12355788999777765211 157888898
Q ss_pred -CCCeeec-CCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-eeeeccCCCCCCCCCcccCCc
Q 004574 165 -DGTAKDF-GTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-LVRELCDLPPAEDIPVCYNSV 240 (744)
Q Consensus 165 -~G~~~~l-~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-~~~~l~~~~~~~~~~~~~~~~ 240 (744)
+|++.+- ..+ +.+..+.|-|+|++.+-++.+. .+.+|+.... .++.+......
T Consensus 330 Rs~kvvqeYd~hLg~i~~i~F~~~g~rFissSDdk-------------s~riWe~~~~v~ik~i~~~~~h---------- 386 (503)
T KOG0282|consen 330 RSGKVVQEYDRHLGAILDITFVDEGRRFISSSDDK-------------SVRIWENRIPVPIKNIADPEMH---------- 386 (503)
T ss_pred cchHHHHHHHhhhhheeeeEEccCCceEeeeccCc-------------cEEEEEcCCCccchhhcchhhc----------
Confidence 8864433 333 7888999999999988776553 4555654432 23333322211
Q ss_pred cCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCC-CCCCCCceEeeee---ccceeceeeccCCceEEEe
Q 004574 241 REGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAE-PAEGEKPEILHKL---DLRFRSVSWCDDSLALVNE 316 (744)
Q Consensus 241 ~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~l~~~---~~~~~~~~~SpDg~~l~~~ 316 (744)
....++.+|+|+. ++..+ .-..|++..+. ++. ...++...+ .+..-.+.|||||+.|++.
T Consensus 387 --smP~~~~~P~~~~-~~aQs------------~dN~i~ifs~~~~~r-~nkkK~feGh~vaGys~~v~fSpDG~~l~SG 450 (503)
T KOG0282|consen 387 --TMPCLTLHPNGKW-FAAQS------------MDNYIAIFSTVPPFR-LNKKKRFEGHSVAGYSCQVDFSPDGRTLCSG 450 (503)
T ss_pred --cCcceecCCCCCe-ehhhc------------cCceEEEEecccccc-cCHhhhhcceeccCceeeEEEcCCCCeEEee
Confidence 1446788899886 22221 11124444331 111 111112221 4455678899999999976
Q ss_pred eeeeccceeEEEEcCCC
Q 004574 317 TWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 317 ~~~~~~~~~l~~~~~~~ 333 (744)
. ....++.+|..+
T Consensus 451 d----sdG~v~~wdwkt 463 (503)
T KOG0282|consen 451 D----SDGKVNFWDWKT 463 (503)
T ss_pred c----CCccEEEeechh
Confidence 4 445788888766
No 234
>cd00707 Pancreat_lipase_like Pancreatic lipase-like enzymes. Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation," the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site.
Probab=98.45 E-value=9.1e-07 Score=88.62 Aligned_cols=104 Identities=10% Similarity=-0.052 Sum_probs=68.6
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHh-CCeEEEecCCCCCCCCCCC-------ChHHHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLA-RRFAVLAGPSIPIIGEGDK-------LPNDSAEAAVEE 584 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~G~~v~~~~~~~~~g~g~~-------~~~~d~~~~~~~ 584 (744)
.|++|++||.+.. ....+.......+++ .+|.|+..+..+....... ...+++.+++++
T Consensus 36 ~p~vilIHG~~~~-------------~~~~~~~~l~~~ll~~~~~nVi~vD~~~~~~~~y~~a~~~~~~v~~~la~~l~~ 102 (275)
T cd00707 36 RPTRFIIHGWTSS-------------GEESWISDLRKAYLSRGDYNVIVVDWGRGANPNYPQAVNNTRVVGAELAKFLDF 102 (275)
T ss_pred CCcEEEEcCCCCC-------------CCCcHHHHHHHHHHhcCCCEEEEEECccccccChHHHHHhHHHHHHHHHHHHHH
Confidence 6899999995311 101111222333444 5899998443222110000 011367778888
Q ss_pred HHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCC
Q 004574 585 VVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSY 629 (744)
Q Consensus 585 l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~ 629 (744)
|.+...++.++|.|+||||||.+|..++.+.|++++.++++.|..
T Consensus 103 L~~~~g~~~~~i~lIGhSlGa~vAg~~a~~~~~~v~~iv~LDPa~ 147 (275)
T cd00707 103 LVDNTGLSLENVHLIGHSLGAHVAGFAGKRLNGKLGRITGLDPAG 147 (275)
T ss_pred HHHhcCCChHHEEEEEecHHHHHHHHHHHHhcCccceeEEecCCc
Confidence 877655667899999999999999999999999999999998863
No 235
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=98.44 E-value=0.0002 Score=68.86 Aligned_cols=230 Identities=11% Similarity=0.035 Sum_probs=129.2
Q ss_pred EEEEEcC-CC-CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCC
Q 004574 158 QLVLGSL-DG-TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIP 234 (744)
Q Consensus 158 ~l~~~~~-~G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~ 234 (744)
.|.++|. +| ..+.+... .++.-..|......+++.+...+ ..|...++...+..+...+...
T Consensus 37 sl~LYd~~~g~~~~ti~skkyG~~~~~Fth~~~~~i~sStk~d-----------~tIryLsl~dNkylRYF~GH~~---- 101 (311)
T KOG1446|consen 37 SLRLYDSLSGKQVKTINSKKYGVDLACFTHHSNTVIHSSTKED-----------DTIRYLSLHDNKYLRYFPGHKK---- 101 (311)
T ss_pred eEEEEEcCCCceeeEeecccccccEEEEecCCceEEEccCCCC-----------CceEEEEeecCceEEEcCCCCc----
Confidence 5677788 78 55555555 67777888888888888876432 2455566655554444333222
Q ss_pred cccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEE
Q 004574 235 VCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALV 314 (744)
Q Consensus 235 ~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~ 314 (744)
-+..+..+|=+.. |++. +....|.++|+ ...+-.-+.. .....-+++.|.|-.++
T Consensus 102 --------~V~sL~~sP~~d~---FlS~----------S~D~tvrLWDl---R~~~cqg~l~-~~~~pi~AfDp~GLifA 156 (311)
T KOG1446|consen 102 --------RVNSLSVSPKDDT---FLSS----------SLDKTVRLWDL---RVKKCQGLLN-LSGRPIAAFDPEGLIFA 156 (311)
T ss_pred --------eEEEEEecCCCCe---EEec----------ccCCeEEeeEe---cCCCCceEEe-cCCCcceeECCCCcEEE
Confidence 1445667777754 3321 11224677776 2222222221 22234478999998877
Q ss_pred EeeeeeccceeEEEEcCCCCCCccee---eeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCC
Q 004574 315 NETWYKTSQTRTWLVCPGSKDVAPRV---LFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPE 391 (744)
Q Consensus 315 ~~~~~~~~~~~l~~~~~~~~~~~~~~---l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~ 391 (744)
... +...|.++|+-.-+..+-+ +.......+.. +.+||||+.|+.....
T Consensus 157 ~~~----~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~------l~FS~dGK~iLlsT~~------------------ 208 (311)
T KOG1446|consen 157 LAN----GSELIKLYDLRSFDKGPFTTFSITDNDEAEWTD------LEFSPDGKSILLSTNA------------------ 208 (311)
T ss_pred Eec----CCCeEEEEEecccCCCCceeEccCCCCccceee------eEEcCCCCEEEEEeCC------------------
Confidence 765 2225666665442222222 22211211111 6699999999888522
Q ss_pred CCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecC
Q 004574 392 GNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNF 470 (744)
Q Consensus 392 ~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~ 470 (744)
..++++|.-+|....-+..... . ......+.++||++.++.... -+.|..|++++|+......-
T Consensus 209 ---s~~~~lDAf~G~~~~tfs~~~~------~--~~~~~~a~ftPds~Fvl~gs~----dg~i~vw~~~tg~~v~~~~~ 272 (311)
T KOG1446|consen 209 ---SFIYLLDAFDGTVKSTFSGYPN------A--GNLPLSATFTPDSKFVLSGSD----DGTIHVWNLETGKKVAVLRG 272 (311)
T ss_pred ---CcEEEEEccCCcEeeeEeeccC------C--CCcceeEEECCCCcEEEEecC----CCcEEEEEcCCCcEeeEecC
Confidence 2488899888875443332211 0 000123689999986664332 35688999988876554443
No 236
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=98.44 E-value=4.7e-06 Score=79.33 Aligned_cols=251 Identities=13% Similarity=0.143 Sum_probs=141.2
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceec-cccCCCcc---ccccccceEEecCCcEEEEEecC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKP-LFESPDIC---LNAVFGSFVWVNNSTLLIFTIPS 107 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~-lt~~~~~~---~~~~~~~~~wspDg~~l~~~~~~ 107 (744)
.+.-.+|||||++++-.+. ++--.+| +-.+|+.+. |-+..... ....+..+.||.|.++|+..+.+
T Consensus 215 h~EcA~FSPDgqyLvsgSv--------DGFiEVW--ny~~GKlrKDLkYQAqd~fMMmd~aVlci~FSRDsEMlAsGsqD 284 (508)
T KOG0275|consen 215 HVECARFSPDGQYLVSGSV--------DGFIEVW--NYTTGKLRKDLKYQAQDNFMMMDDAVLCISFSRDSEMLASGSQD 284 (508)
T ss_pred chhheeeCCCCceEeeccc--------cceeeee--hhccchhhhhhhhhhhcceeecccceEEEeecccHHHhhccCcC
Confidence 3556899999999988543 4444555 666777654 32222111 11124567889999888754221
Q ss_pred CCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC--ceeeeeccC
Q 004574 108 SRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP--AVYTAVEPS 183 (744)
Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~--~~~~~~~~S 183 (744)
++|-++.+ +| -.+++... .++..+.||
T Consensus 285 -------------------------------------------------GkIKvWri~tG~ClRrFdrAHtkGvt~l~FS 315 (508)
T KOG0275|consen 285 -------------------------------------------------GKIKVWRIETGQCLRRFDRAHTKGVTCLSFS 315 (508)
T ss_pred -------------------------------------------------CcEEEEEEecchHHHHhhhhhccCeeEEEEc
Confidence 45666666 77 44555423 678899999
Q ss_pred CCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-eeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEe
Q 004574 184 PDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-LVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEA 262 (744)
Q Consensus 184 pDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~ 262 (744)
.|+.+|+-.+.+. .+.+--+..+ -.+....+.. .+.+..|++||.. +..++
T Consensus 316 rD~SqiLS~sfD~-------------tvRiHGlKSGK~LKEfrGHsS-------------yvn~a~ft~dG~~-iisaS- 367 (508)
T KOG0275|consen 316 RDNSQILSASFDQ-------------TVRIHGLKSGKCLKEFRGHSS-------------YVNEATFTDDGHH-IISAS- 367 (508)
T ss_pred cCcchhhcccccc-------------eEEEeccccchhHHHhcCccc-------------cccceEEcCCCCe-EEEec-
Confidence 9999997664432 3444444433 2333222211 1557789999987 55443
Q ss_pred ecCCCCCccCCccceEEeccCCCCCCCCceE---eeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcce
Q 004574 263 QDRGDANVEVSPRDIIYTQPAEPAEGEKPEI---LHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPR 339 (744)
Q Consensus 263 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~---l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~ 339 (744)
..+.+-+++. ++.+-.. ....+..++++-.-|...--....| ..+.+|++++++. -.+
T Consensus 368 -----------sDgtvkvW~~---KtteC~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCN---rsntv~imn~qGQ--vVr 428 (508)
T KOG0275|consen 368 -----------SDGTVKVWHG---KTTECLSTFKPLGTDYPVNSVILLPKNPEHFIVCN---RSNTVYIMNMQGQ--VVR 428 (508)
T ss_pred -----------CCccEEEecC---cchhhhhhccCCCCcccceeEEEcCCCCceEEEEc---CCCeEEEEeccce--EEe
Confidence 1223555554 3222111 1224556667666666543222232 3346999999872 223
Q ss_pred eeeccccccccCCCCCCc-eeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeecc
Q 004574 340 VLFDRVFENVYSDPGSPM-MTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESN 414 (744)
Q Consensus 340 ~l~~~~~~~~~~~~~~~~-~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~ 414 (744)
....+..+. |.+. ...||.|.+++...++ ..||-+...+|+.++.....
T Consensus 429 sfsSGkREg-----GdFi~~~lSpkGewiYcigED---------------------~vlYCF~~~sG~LE~tl~Vh 478 (508)
T KOG0275|consen 429 SFSSGKREG-----GDFINAILSPKGEWIYCIGED---------------------GVLYCFSVLSGKLERTLPVH 478 (508)
T ss_pred eeccCCccC-----CceEEEEecCCCcEEEEEccC---------------------cEEEEEEeecCceeeeeecc
Confidence 333322221 2111 3479999998877522 23777787788887655443
No 237
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=98.42 E-value=6.4e-05 Score=74.60 Aligned_cols=143 Identities=15% Similarity=0.170 Sum_probs=88.0
Q ss_pred eEEEEEcCCC-CeeecCCCceeeeeccC-CCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCC
Q 004574 157 AQLVLGSLDG-TAKDFGTPAVYTAVEPS-PDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIP 234 (744)
Q Consensus 157 ~~l~~~~~~G-~~~~l~~~~~~~~~~~S-pDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~ 234 (744)
++|++++.++ +.+.+.... ..++.+. +|| .|++.... .+.++|+.+++.+.+........
T Consensus 22 ~~i~~~~~~~~~~~~~~~~~-~~G~~~~~~~g-~l~v~~~~--------------~~~~~d~~~g~~~~~~~~~~~~~-- 83 (246)
T PF08450_consen 22 GRIYRVDPDTGEVEVIDLPG-PNGMAFDRPDG-RLYVADSG--------------GIAVVDPDTGKVTVLADLPDGGV-- 83 (246)
T ss_dssp TEEEEEETTTTEEEEEESSS-EEEEEEECTTS-EEEEEETT--------------CEEEEETTTTEEEEEEEEETTCS--
T ss_pred CEEEEEECCCCeEEEEecCC-CceEEEEccCC-EEEEEEcC--------------ceEEEecCCCcEEEEeeccCCCc--
Confidence 6899999955 444333333 5556666 664 45555332 35666888877777665521100
Q ss_pred cccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEE
Q 004574 235 VCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALV 314 (744)
Q Consensus 235 ~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~ 314 (744)
....+.++++.|||+ ||+........... ..+.||.++. + ++...+...-...+.++|||||+.|+
T Consensus 84 -----~~~~~ND~~vd~~G~--ly~t~~~~~~~~~~---~~g~v~~~~~---~-~~~~~~~~~~~~pNGi~~s~dg~~ly 149 (246)
T PF08450_consen 84 -----PFNRPNDVAVDPDGN--LYVTDSGGGGASGI---DPGSVYRIDP---D-GKVTVVADGLGFPNGIAFSPDGKTLY 149 (246)
T ss_dssp -----CTEEEEEEEE-TTS---EEEEEECCBCTTCG---GSEEEEEEET---T-SEEEEEEEEESSEEEEEEETTSSEEE
T ss_pred -----ccCCCceEEEcCCCC--EEEEecCCCccccc---cccceEEECC---C-CeEEEEecCcccccceEECCcchhee
Confidence 001256789999998 77776443322211 1167999987 4 56566665666678899999999887
Q ss_pred EeeeeeccceeEEEEcCCCC
Q 004574 315 NETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 315 ~~~~~~~~~~~l~~~~~~~~ 334 (744)
+.. +....||+++++..
T Consensus 150 v~d---s~~~~i~~~~~~~~ 166 (246)
T PF08450_consen 150 VAD---SFNGRIWRFDLDAD 166 (246)
T ss_dssp EEE---TTTTEEEEEEEETT
T ss_pred ecc---cccceeEEEecccc
Confidence 743 35567999998753
No 238
>PF06057 VirJ: Bacterial virulence protein (VirJ); InterPro: IPR010333 This entry contains several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localised to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium [].
Probab=98.42 E-value=3.4e-06 Score=76.35 Aligned_cols=166 Identities=14% Similarity=0.091 Sum_probs=99.8
Q ss_pred CchhHHHHHhCCeEEEecCCCCCC--CCCCCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC----
Q 004574 544 TPTSSLIFLARRFAVLAGPSIPII--GEGDKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH---- 617 (744)
Q Consensus 544 ~~~~~~~~~~~G~~v~~~~~~~~~--g~g~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~---- 617 (744)
-...+..|+++|+.|+..+...+. .....+.-.|+.+++++..++... +++.|+|.|+|+-+...+..+-|.
T Consensus 18 d~~~a~~l~~~G~~VvGvdsl~Yfw~~rtP~~~a~Dl~~~i~~y~~~w~~--~~vvLiGYSFGADvlP~~~nrLp~~~r~ 95 (192)
T PF06057_consen 18 DKQIAEALAKQGVPVVGVDSLRYFWSERTPEQTAADLARIIRHYRARWGR--KRVVLIGYSFGADVLPFIYNRLPAALRA 95 (192)
T ss_pred hHHHHHHHHHCCCeEEEechHHHHhhhCCHHHHHHHHHHHHHHHHHHhCC--ceEEEEeecCCchhHHHHHhhCCHHHHh
Confidence 346678999999999984432211 111223444899999888887543 699999999999988888877664
Q ss_pred ceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCC-CCEEEEeeCCCCCCCCCHHHHHHHHHHHHhC
Q 004574 618 LFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIK-KPILIIHGEVDDKVGLFPMQAERFFDALKGH 696 (744)
Q Consensus 618 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~ 696 (744)
+++.+++++|..... +.+.... .+.....-..+.+...++++. .|+++|+|+++.-..|+ .++.
T Consensus 96 ~v~~v~Ll~p~~~~d---Feihv~~--wlg~~~~~~~~~~~pei~~l~~~~v~CiyG~~E~d~~cp---------~l~~- 160 (192)
T PF06057_consen 96 RVAQVVLLSPSTTAD---FEIHVSG--WLGMGGDDAAYPVIPEIAKLPPAPVQCIYGEDEDDSLCP---------SLRQ- 160 (192)
T ss_pred heeEEEEeccCCcce---EEEEhhh--hcCCCCCcccCCchHHHHhCCCCeEEEEEcCCCCCCcCc---------cccC-
Confidence 688888888753211 1110000 000000000123334455664 69999999877654321 2333
Q ss_pred CCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 697 GALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 697 ~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
..++.+.+||+ |.|. .+.....+.|++-|.
T Consensus 161 -~~~~~i~lpGg-HHfd--~dy~~La~~Il~~l~ 190 (192)
T PF06057_consen 161 -PGVEVIALPGG-HHFD--GDYDALAKRILDALK 190 (192)
T ss_pred -CCcEEEEcCCC-cCCC--CCHHHHHHHHHHHHh
Confidence 45688999986 6553 334556666665554
No 239
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=98.38 E-value=6.8e-05 Score=71.66 Aligned_cols=255 Identities=13% Similarity=0.038 Sum_probs=138.8
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
..-..||+-|.+||.-+. ++.|.++|..|-. ++-++.+-. -+..+.||+||+.|+-++.+
T Consensus 26 a~~~~Fs~~G~~lAvGc~----------nG~vvI~D~~T~~iar~lsaH~~-----pi~sl~WS~dgr~LltsS~D---- 86 (405)
T KOG1273|consen 26 AECCQFSRWGDYLAVGCA----------NGRVVIYDFDTFRIARMLSAHVR-----PITSLCWSRDGRKLLTSSRD---- 86 (405)
T ss_pred cceEEeccCcceeeeecc----------CCcEEEEEccccchhhhhhcccc-----ceeEEEecCCCCEeeeecCC----
Confidence 567899999999999653 3567777877654 344433222 36788999999999876432
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCCceeeeeccCCCCceE
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTPAVYTAVEPSPDQKYV 189 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~~~~~~~~~SpDG~~i 189 (744)
..+-.+|+ +| -..++.-...+....|.|-.+..
T Consensus 87 ---------------------------------------------~si~lwDl~~gs~l~rirf~spv~~~q~hp~k~n~ 121 (405)
T KOG1273|consen 87 ---------------------------------------------WSIKLWDLLKGSPLKRIRFDSPVWGAQWHPRKRNK 121 (405)
T ss_pred ---------------------------------------------ceeEEEeccCCCceeEEEccCccceeeeccccCCe
Confidence 46777888 88 34444433456667777765544
Q ss_pred EEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCC-CCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCC
Q 004574 190 LITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDL-PPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDA 268 (744)
Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~ 268 (744)
++...-.. .-++.+......+-|... ++..... .+...|.+.|++ |+.-
T Consensus 122 ~va~~~~~------------sp~vi~~s~~~h~~Lp~d~d~dln~s---------as~~~fdr~g~y-IitG-------- 171 (405)
T KOG1273|consen 122 CVATIMEE------------SPVVIDFSDPKHSVLPKDDDGDLNSS---------ASHGVFDRRGKY-IITG-------- 171 (405)
T ss_pred EEEEEecC------------CcEEEEecCCceeeccCCCccccccc---------cccccccCCCCE-EEEe--------
Confidence 44332221 123333332333333322 1211111 222357777876 3322
Q ss_pred CccCCccceEEeccCCCCCCCCceEeee--eccceeceeeccCCceEEEeeeeeccceeEEEEcCC-----CCCCcceee
Q 004574 269 NVEVSPRDIIYTQPAEPAEGEKPEILHK--LDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPG-----SKDVAPRVL 341 (744)
Q Consensus 269 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~--~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~-----~~~~~~~~l 341 (744)
...+.|.+++. ++-+...-.. .-..+.++.+|..|+.|++.+.++ .|..++.. +.++++...
T Consensus 172 ----tsKGkllv~~a---~t~e~vas~rits~~~IK~I~~s~~g~~liiNtsDR----vIR~ye~~di~~~~r~~e~e~~ 240 (405)
T KOG1273|consen 172 ----TSKGKLLVYDA---ETLECVASFRITSVQAIKQIIVSRKGRFLIINTSDR----VIRTYEISDIDDEGRDGEVEPE 240 (405)
T ss_pred ----cCcceEEEEec---chheeeeeeeechheeeeEEEEeccCcEEEEecCCc----eEEEEehhhhcccCccCCcChh
Confidence 13345777776 3222221111 123456788999999999866433 23333322 111222110
Q ss_pred eccccccccC-CCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccc
Q 004574 342 FDRVFENVYS-DPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNR 415 (744)
Q Consensus 342 ~~~~~~~~~~-~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~ 415 (744)
...+++.+ .+.. -..+|-||.+|+..+ .....||+|....|...++.++..
T Consensus 241 --~K~qDvVNk~~Wk-~ccfs~dgeYv~a~s--------------------~~aHaLYIWE~~~GsLVKILhG~k 292 (405)
T KOG1273|consen 241 --HKLQDVVNKLQWK-KCCFSGDGEYVCAGS--------------------ARAHALYIWEKSIGSLVKILHGTK 292 (405)
T ss_pred --HHHHHHHhhhhhh-heeecCCccEEEecc--------------------ccceeEEEEecCCcceeeeecCCc
Confidence 11111110 0001 145788999887765 234579999999999888877543
No 240
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=98.38 E-value=5.9e-05 Score=70.64 Aligned_cols=203 Identities=16% Similarity=0.183 Sum_probs=113.6
Q ss_pred cccceeecCC-CCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCCcEEEEEecCCC
Q 004574 32 KINFVSWSPD-GKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSR 109 (744)
Q Consensus 32 ~~~~p~~SpD-G~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~ 109 (744)
++...+|+|- |.-||- + .....|.+.++.++. ....|-..+ .+.-.+..++|||.|++|+..+.+..
T Consensus 16 r~W~~awhp~~g~ilAs-c---------g~Dk~vriw~~~~~~s~~ck~vld~-~hkrsVRsvAwsp~g~~La~aSFD~t 84 (312)
T KOG0645|consen 16 RVWSVAWHPGKGVILAS-C---------GTDKAVRIWSTSSGDSWTCKTVLDD-GHKRSVRSVAWSPHGRYLASASFDAT 84 (312)
T ss_pred cEEEEEeccCCceEEEe-e---------cCCceEEEEecCCCCcEEEEEeccc-cchheeeeeeecCCCcEEEEeeccce
Confidence 6889999998 764444 2 223456666666432 222222221 11225789999999999987754311
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCee--ecCCC--ceeeeeccCCC
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAK--DFGTP--AVYTAVEPSPD 185 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~--~l~~~--~~~~~~~~SpD 185 (744)
.-||.-. +|+.+ ...++ ..+...+||++
T Consensus 85 -----------------------------------------------~~Iw~k~-~~efecv~~lEGHEnEVK~Vaws~s 116 (312)
T KOG0645|consen 85 -----------------------------------------------VVIWKKE-DGEFECVATLEGHENEVKCVAWSAS 116 (312)
T ss_pred -----------------------------------------------EEEeecC-CCceeEEeeeeccccceeEEEEcCC
Confidence 1222211 33222 22222 56779999999
Q ss_pred CceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeee-ccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeec
Q 004574 186 QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRE-LCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQD 264 (744)
Q Consensus 186 G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~ 264 (744)
|++|+--+... .+|+|.+++.+... +....... ..+..+.|.|.-. |+|...
T Consensus 117 G~~LATCSRDK-------------SVWiWe~deddEfec~aVL~~Ht----------qDVK~V~WHPt~d--lL~S~S-- 169 (312)
T KOG0645|consen 117 GNYLATCSRDK-------------SVWIWEIDEDDEFECIAVLQEHT----------QDVKHVIWHPTED--LLFSCS-- 169 (312)
T ss_pred CCEEEEeeCCC-------------eEEEEEecCCCcEEEEeeecccc----------ccccEEEEcCCcc--eeEEec--
Confidence 99998665543 79999988554332 22221110 1155678999655 444432
Q ss_pred CCCCCccCCccceEEeccCCCCCCCCc---eEeeeeccceeceeeccCCceEEEeeeeeccceeEEE--EcCCC
Q 004574 265 RGDANVEVSPRDIIYTQPAEPAEGEKP---EILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWL--VCPGS 333 (744)
Q Consensus 265 ~~~~~~~~~~~~~l~~~~~~~~~~~~~---~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~--~~~~~ 333 (744)
+.+.|-++..+ ++... ..|......+-.++|.+.|..|+..+ ++.+..||+ .++..
T Consensus 170 ---------YDnTIk~~~~~--~dddW~c~~tl~g~~~TVW~~~F~~~G~rl~s~s--dD~tv~Iw~~~~~~~~ 230 (312)
T KOG0645|consen 170 ---------YDNTIKVYRDE--DDDDWECVQTLDGHENTVWSLAFDNIGSRLVSCS--DDGTVSIWRLYTDLSG 230 (312)
T ss_pred ---------cCCeEEEEeec--CCCCeeEEEEecCccceEEEEEecCCCceEEEec--CCcceEeeeeccCcch
Confidence 22234444431 12222 22333334677889999999988765 335667887 44443
No 241
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=98.38 E-value=4.3e-05 Score=77.74 Aligned_cols=224 Identities=14% Similarity=0.109 Sum_probs=132.4
Q ss_pred CccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCC--eEEEeeecccccccCCCceeEEEEECCCCceeccccC
Q 004574 3 FFTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGK--RIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFES 80 (744)
Q Consensus 3 ~~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~--~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~ 80 (744)
+-+.|+++++.++...++...|. +.| +..-.|||-|. -|||... +.....+...||.++ .+...+..+.-
T Consensus 150 v~~sl~i~e~t~n~~~~p~~~lr--~~g--i~dFsisP~~n~~~la~~tP---Ek~~kpa~~~i~sIp-~~s~l~tk~lf 221 (561)
T COG5354 150 VGSSLYIHEITDNIEEHPFKNLR--PVG--ILDFSISPEGNHDELAYWTP---EKLNKPAMVRILSIP-KNSVLVTKNLF 221 (561)
T ss_pred ccCeEEEEecCCccccCchhhcc--ccc--eeeEEecCCCCCceEEEEcc---ccCCCCcEEEEEEcc-CCCeeeeeeeE
Confidence 34678888886644444444443 333 77889999754 4677543 133334555666666 22222222211
Q ss_pred CCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEE
Q 004574 81 PDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLV 160 (744)
Q Consensus 81 ~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 160 (744)
.. .-..+.|.+.|++|++...... . ..-.+++.+.||
T Consensus 222 k~-----~~~qLkW~~~g~~ll~l~~t~~-k-------------------------------------snKsyfgesnLy 258 (561)
T COG5354 222 KV-----SGVQLKWQVLGKYLLVLVMTHT-K-------------------------------------SNKSYFGESNLY 258 (561)
T ss_pred ee-----cccEEEEecCCceEEEEEEEee-e-------------------------------------cccceeccceEE
Confidence 11 1135689999999998754211 0 011233447899
Q ss_pred EEcCCC-CeeecCC-CceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccC
Q 004574 161 LGSLDG-TAKDFGT-PAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYN 238 (744)
Q Consensus 161 ~~~~~G-~~~~l~~-~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~ 238 (744)
++++++ ...-... .+.+++++|+|++++.+..+.-. +..+-.+++.|.-..-+...+.+
T Consensus 259 l~~~~e~~i~V~~~~~~pVhdf~W~p~S~~F~vi~g~~-----------pa~~s~~~lr~Nl~~~~Pe~~rN-------- 319 (561)
T COG5354 259 LLRITERSIPVEKDLKDPVHDFTWEPLSSRFAVISGYM-----------PASVSVFDLRGNLRFYFPEQKRN-------- 319 (561)
T ss_pred EEeecccccceeccccccceeeeecccCCceeEEeccc-----------ccceeecccccceEEecCCcccc--------
Confidence 999977 3333223 38899999999999999886433 24567778887744433333222
Q ss_pred CccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeec-cceeceeeccCCceEEEee
Q 004574 239 SVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLD-LRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 239 ~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~-~~~~~~~~SpDg~~l~~~~ 317 (744)
.+.|||.+++ +++..+.+... .+.+++. . +......... .......|||||+.+....
T Consensus 320 -------T~~fsp~~r~-il~agF~nl~g---------ni~i~~~---~-~rf~~~~~~~~~n~s~~~wspd~qF~~~~~ 378 (561)
T COG5354 320 -------TIFFSPHERY-ILFAGFDNLQG---------NIEIFDP---A-GRFKVAGAFNGLNTSYCDWSPDGQFYDTDT 378 (561)
T ss_pred -------cccccCcccE-EEEecCCcccc---------ceEEecc---C-CceEEEEEeecCCceEeeccCCceEEEecC
Confidence 4579999998 77765443322 2556664 2 3444443333 3455567999999776554
No 242
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=98.37 E-value=7.6e-05 Score=71.73 Aligned_cols=104 Identities=12% Similarity=0.136 Sum_probs=64.0
Q ss_pred cCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeee
Q 004574 241 REGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYK 320 (744)
Q Consensus 241 ~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~ 320 (744)
..|+.-++||+|..+ ++ + ++...+ ..+|++|+ ..-+.-.+..-...+..+.|.|.-..|+...
T Consensus 318 k~g~g~lafs~Ds~y-~a-T--rnd~~P-------nalW~Wdl---q~l~l~avLiQk~piraf~WdP~~prL~vct--- 380 (447)
T KOG4497|consen 318 KCGAGKLAFSCDSTY-AA-T--RNDKYP-------NALWLWDL---QNLKLHAVLIQKHPIRAFEWDPGRPRLVVCT--- 380 (447)
T ss_pred ccccceeeecCCceE-Ee-e--ecCCCC-------ceEEEEec---hhhhhhhhhhhccceeEEEeCCCCceEEEEc---
Confidence 345667999999864 22 2 222222 34999998 3333333333456778899999988888766
Q ss_pred ccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 321 TSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 321 ~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
+..+||.+.+.+. ....+....+. .-.+.|.-+|..++...
T Consensus 381 -g~srLY~W~psg~--~~V~vP~~GF~-------i~~l~W~~~g~~i~l~~ 421 (447)
T KOG4497|consen 381 -GKSRLYFWAPSGP--RVVGVPKKGFN-------IQKLQWLQPGEFIVLCG 421 (447)
T ss_pred -CCceEEEEcCCCc--eEEecCCCCce-------eeeEEecCCCcEEEEEc
Confidence 5567999988762 11222222221 11167999999887775
No 243
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=98.37 E-value=6.3e-05 Score=76.60 Aligned_cols=263 Identities=10% Similarity=0.052 Sum_probs=141.0
Q ss_pred ceeEeecCCCCCCCCceeeecCCCCCccc--ceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCc
Q 004574 6 GIGIHRLLPDDSLGPEKEVHGYPDGAKIN--FVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDI 83 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~~~l~~~~~~~~~~--~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~ 83 (744)
++.+++... |....--......... ...||-|.+++|.+.. ..|+++++ |+..-+. +..
T Consensus 109 ~~~vwd~~s----g~iv~sf~~~~q~~~~Wp~~k~s~~D~y~ARvv~-----------~sl~i~e~-t~n~~~~---p~~ 169 (561)
T COG5354 109 NVFVWDIAS----GMIVFSFNGISQPYLGWPVLKFSIDDKYVARVVG-----------SSLYIHEI-TDNIEEH---PFK 169 (561)
T ss_pred ceeEEeccC----ceeEeeccccCCcccccceeeeeecchhhhhhcc-----------CeEEEEec-CCccccC---chh
Confidence 577788755 5433221111111122 4689999999999754 67899987 4433332 211
Q ss_pred ccc-ccccceEEecC--CcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEE
Q 004574 84 CLN-AVFGSFVWVNN--STLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLV 160 (744)
Q Consensus 84 ~~~-~~~~~~~wspD--g~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 160 (744)
.+. .++..+.|||- +..|+|-.+...+.. ..-.|+
T Consensus 170 ~lr~~gi~dFsisP~~n~~~la~~tPEk~~kp------------------------------------------a~~~i~ 207 (561)
T COG5354 170 NLRPVGILDFSISPEGNHDELAYWTPEKLNKP------------------------------------------AMVRIL 207 (561)
T ss_pred hccccceeeEEecCCCCCceEEEEccccCCCC------------------------------------------cEEEEE
Confidence 111 14678999996 445666544322221 002344
Q ss_pred EEcCCCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCC
Q 004574 161 LGSLDGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNS 239 (744)
Q Consensus 161 ~~~~~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~ 239 (744)
.+..+......+-. ..-..+.|.+.|++|.+.....- ...+..-...+||++++.+..++-..... .|
T Consensus 208 sIp~~s~l~tk~lfk~~~~qLkW~~~g~~ll~l~~t~~--ksnKsyfgesnLyl~~~~e~~i~V~~~~~----~p----- 276 (561)
T COG5354 208 SIPKNSVLVTKNLFKVSGVQLKWQVLGKYLLVLVMTHT--KSNKSYFGESNLYLLRITERSIPVEKDLK----DP----- 276 (561)
T ss_pred EccCCCeeeeeeeEeecccEEEEecCCceEEEEEEEee--ecccceeccceEEEEeecccccceecccc----cc-----
Confidence 44422222222111 33446889999999999876531 11111111358999998866544332211 11
Q ss_pred ccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeee
Q 004574 240 VREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWY 319 (744)
Q Consensus 240 ~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~ 319 (744)
+.+++|+|+++. .+.++. ..+ ..+-+.++ . +..+ ..-....-+.+.|||.+++++++. .
T Consensus 277 ----Vhdf~W~p~S~~-F~vi~g---~~p-------a~~s~~~l---r-~Nl~-~~~Pe~~rNT~~fsp~~r~il~ag-F 335 (561)
T COG5354 277 ----VHDFTWEPLSSR-FAVISG---YMP-------ASVSVFDL---R-GNLR-FYFPEQKRNTIFFSPHERYILFAG-F 335 (561)
T ss_pred ----ceeeeecccCCc-eeEEec---ccc-------cceeeccc---c-cceE-EecCCcccccccccCcccEEEEec-C
Confidence 668999999999 555531 111 12444444 2 2222 111234445678999999999987 3
Q ss_pred eccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEE
Q 004574 320 KTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAK 370 (744)
Q Consensus 320 ~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~ 370 (744)
...+..+-++|..+. .+.+.. +.. ...-...|||||+++...
T Consensus 336 ~nl~gni~i~~~~~r---f~~~~~--~~~----~n~s~~~wspd~qF~~~~ 377 (561)
T COG5354 336 DNLQGNIEIFDPAGR---FKVAGA--FNG----LNTSYCDWSPDGQFYDTD 377 (561)
T ss_pred CccccceEEeccCCc---eEEEEE--eec----CCceEeeccCCceEEEec
Confidence 335566778887762 222211 111 011113499999976544
No 244
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=98.36 E-value=2.7e-05 Score=74.40 Aligned_cols=168 Identities=15% Similarity=0.193 Sum_probs=89.7
Q ss_pred eEEEEEcCCC-CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeC----CCC--eeeeccCCC
Q 004574 157 AQLVLGSLDG-TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTT----DGK--LVRELCDLP 228 (744)
Q Consensus 157 ~~l~~~~~~G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~----~g~--~~~~l~~~~ 228 (744)
..|.++++.| ....+... ......+.||||++|+..... .++.+|++ +|. +++++....
T Consensus 209 t~i~lw~lkGq~L~~idtnq~~n~~aavSP~GRFia~~gFT-------------pDVkVwE~~f~kdG~fqev~rvf~Lk 275 (420)
T KOG2096|consen 209 TKICLWDLKGQLLQSIDTNQSSNYDAAVSPDGRFIAVSGFT-------------PDVKVWEPIFTKDGTFQEVKRVFSLK 275 (420)
T ss_pred CcEEEEecCCceeeeeccccccccceeeCCCCcEEEEecCC-------------CCceEEEEEeccCcchhhhhhhheec
Confidence 4688888888 44444333 556677899999999877443 35666654 332 233333332
Q ss_pred CCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCC-CCCCceEeeee-------ccc
Q 004574 229 PAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPA-EGEKPEILHKL-------DLR 300 (744)
Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~l~~~-------~~~ 300 (744)
+.. -.+..++|||+.++ ++-++ .++. .+||-.|+ -+ .+..+..|-.+ ...
T Consensus 276 GH~----------saV~~~aFsn~S~r-~vtvS-kDG~---------wriwdtdV-rY~~~qDpk~Lk~g~~pl~aag~~ 333 (420)
T KOG2096|consen 276 GHQ----------SAVLAAAFSNSSTR-AVTVS-KDGK---------WRIWDTDV-RYEAGQDPKILKEGSAPLHAAGSE 333 (420)
T ss_pred cch----------hheeeeeeCCCcce-eEEEe-cCCc---------EEEeeccc-eEecCCCchHhhcCCcchhhcCCC
Confidence 221 11556788999887 65554 2221 13444443 11 11222222111 122
Q ss_pred eeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 301 FRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 301 ~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.-.++.||.|+.|+.+.- ..|..+....++.. ..+-+.....+ ..++|++||++++...
T Consensus 334 p~RL~lsP~g~~lA~s~g-----s~l~~~~se~g~~~-~~~e~~h~~~I------s~is~~~~g~~~atcG 392 (420)
T KOG2096|consen 334 PVRLELSPSGDSLAVSFG-----SDLKVFASEDGKDY-PELEDIHSTTI------SSISYSSDGKYIATCG 392 (420)
T ss_pred ceEEEeCCCCcEEEeecC-----CceEEEEcccCccc-hhHHHhhcCce------eeEEecCCCcEEeeec
Confidence 235789999999998761 23445554443111 11111111111 1277999999999885
No 245
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=98.35 E-value=0.00053 Score=67.15 Aligned_cols=180 Identities=16% Similarity=0.161 Sum_probs=107.6
Q ss_pred CCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCCcEEEEEecC
Q 004574 29 DGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPS 107 (744)
Q Consensus 29 ~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~ 107 (744)
+...+...+.+|+-+++|--. .++..-|| +..+|+ +-.+|.+.+ .+....||.||.+|+-.
T Consensus 63 H~~svFavsl~P~~~l~aTGG--------gDD~AflW--~~~~ge~~~eltgHKD-----SVt~~~FshdgtlLATG--- 124 (399)
T KOG0296|consen 63 HTDSVFAVSLHPNNNLVATGG--------GDDLAFLW--DISTGEFAGELTGHKD-----SVTCCSFSHDGTLLATG--- 124 (399)
T ss_pred cCCceEEEEeCCCCceEEecC--------CCceEEEE--EccCCcceeEecCCCC-----ceEEEEEccCceEEEec---
Confidence 344688899999766555521 13334444 666665 455665555 57788999999988753
Q ss_pred CCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ceeeeeccCC
Q 004574 108 SRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AVYTAVEPSP 184 (744)
Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~~~~~~~Sp 184 (744)
+ ..++|.++.. +| ....+... ..+..+.|.|
T Consensus 125 ---d-------------------------------------------msG~v~v~~~stg~~~~~~~~e~~dieWl~WHp 158 (399)
T KOG0296|consen 125 ---D-------------------------------------------MSGKVLVFKVSTGGEQWKLDQEVEDIEWLKWHP 158 (399)
T ss_pred ---C-------------------------------------------CCccEEEEEcccCceEEEeecccCceEEEEecc
Confidence 1 1156777777 66 55555544 8888999999
Q ss_pred CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeec
Q 004574 185 DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQD 264 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~ 264 (744)
-+..|++.+.. ..+|.|.+..+..-++..++... ...=.|.|||++ ++-..
T Consensus 159 ~a~illAG~~D-------------GsvWmw~ip~~~~~kv~~Gh~~~------------ct~G~f~pdGKr-~~tgy--- 209 (399)
T KOG0296|consen 159 RAHILLAGSTD-------------GSVWMWQIPSQALCKVMSGHNSP------------CTCGEFIPDGKR-ILTGY--- 209 (399)
T ss_pred cccEEEeecCC-------------CcEEEEECCCcceeeEecCCCCC------------cccccccCCCce-EEEEe---
Confidence 88766666444 37999999874444444443221 112268999998 54442
Q ss_pred CCCCCccCCccceEEeccCCCCCCCCceE-eeeec-cceeceeeccCCceE
Q 004574 265 RGDANVEVSPRDIIYTQPAEPAEGEKPEI-LHKLD-LRFRSVSWCDDSLAL 313 (744)
Q Consensus 265 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-l~~~~-~~~~~~~~SpDg~~l 313 (744)
..+.|.+++. +++++.. +.... .....+.++.+|..+
T Consensus 210 ---------~dgti~~Wn~---ktg~p~~~~~~~e~~~~~~~~~~~~~~~~ 248 (399)
T KOG0296|consen 210 ---------DDGTIIVWNP---KTGQPLHKITQAEGLELPCISLNLAGSTL 248 (399)
T ss_pred ---------cCceEEEEec---CCCceeEEecccccCcCCcccccccccee
Confidence 1234777777 5555433 33222 223334444555433
No 246
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=98.35 E-value=6.3e-05 Score=74.69 Aligned_cols=196 Identities=15% Similarity=0.164 Sum_probs=113.2
Q ss_pred ceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEe-cCCcEEEEEecCCCCCCC
Q 004574 35 FVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWV-NNSTLLIFTIPSSRRDPP 113 (744)
Q Consensus 35 ~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~ws-pDg~~l~~~~~~~~~~~~ 113 (744)
.|.|.++...|.|+- -...+|+.++.++++.+.+.... ...+... +||+. +++..
T Consensus 4 gp~~d~~~g~l~~~D---------~~~~~i~~~~~~~~~~~~~~~~~-------~~G~~~~~~~g~l-~v~~~------- 59 (246)
T PF08450_consen 4 GPVWDPRDGRLYWVD---------IPGGRIYRVDPDTGEVEVIDLPG-------PNGMAFDRPDGRL-YVADS------- 59 (246)
T ss_dssp EEEEETTTTEEEEEE---------TTTTEEEEEETTTTEEEEEESSS-------EEEEEEECTTSEE-EEEET-------
T ss_pred ceEEECCCCEEEEEE---------cCCCEEEEEECCCCeEEEEecCC-------CceEEEEccCCEE-EEEEc-------
Confidence 489999666787763 34578999999998876653222 1344555 67654 44321
Q ss_pred CCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCC------C-ceeeeeccCCC
Q 004574 114 KKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGT------P-AVYTAVEPSPD 185 (744)
Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~------~-~~~~~~~~SpD 185 (744)
..+.++|. +|+.+.+.. . ......++.||
T Consensus 60 -------------------------------------------~~~~~~d~~~g~~~~~~~~~~~~~~~~~~ND~~vd~~ 96 (246)
T PF08450_consen 60 -------------------------------------------GGIAVVDPDTGKVTVLADLPDGGVPFNRPNDVAVDPD 96 (246)
T ss_dssp -------------------------------------------TCEEEEETTTTEEEEEEEEETTCSCTEEEEEEEE-TT
T ss_pred -------------------------------------------CceEEEecCCCcEEEEeeccCCCcccCCCceEEEcCC
Confidence 23444476 665554422 2 44568899999
Q ss_pred CceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecC
Q 004574 186 QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDR 265 (744)
Q Consensus 186 G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~ 265 (744)
|+ |+++.......... ....+|+++.+ ++.+.+...-. .+..++|+|||+. ||++..
T Consensus 97 G~-ly~t~~~~~~~~~~----~~g~v~~~~~~-~~~~~~~~~~~-------------~pNGi~~s~dg~~-lyv~ds--- 153 (246)
T PF08450_consen 97 GN-LYVTDSGGGGASGI----DPGSVYRIDPD-GKVTVVADGLG-------------FPNGIAFSPDGKT-LYVADS--- 153 (246)
T ss_dssp S--EEEEEECCBCTTCG----GSEEEEEEETT-SEEEEEEEEES-------------SEEEEEEETTSSE-EEEEET---
T ss_pred CC-EEEEecCCCccccc----cccceEEECCC-CeEEEEecCcc-------------cccceEECCcchh-eeeccc---
Confidence 99 87776654211110 01579999999 44444432211 1557899999997 766631
Q ss_pred CCCCccCCccceEEeccCCCCCCCC---ceEeeee---ccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 266 GDANVEVSPRDIIYTQPAEPAEGEK---PEILHKL---DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 266 ~~~~~~~~~~~~l~~~~~~~~~~~~---~~~l~~~---~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
...+|+.++.+. .+++ .+.+... .+....+++..+|+.++. .+ ....|++++.++
T Consensus 154 --------~~~~i~~~~~~~-~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va-~~---~~~~I~~~~p~G 214 (246)
T PF08450_consen 154 --------FNGRIWRFDLDA-DGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVA-DW---GGGRIVVFDPDG 214 (246)
T ss_dssp --------TTTEEEEEEEET-TTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEE-EE---TTTEEEEEETTS
T ss_pred --------ccceeEEEeccc-cccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEE-Ec---CCCEEEEECCCc
Confidence 334588888721 1221 1222222 123667889999974443 22 345799999875
No 247
>COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]
Probab=98.33 E-value=0.00029 Score=79.53 Aligned_cols=281 Identities=13% Similarity=0.081 Sum_probs=144.2
Q ss_pred CCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecC
Q 004574 28 PDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPS 107 (744)
Q Consensus 28 ~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~ 107 (744)
..+..+..++|||||+.++|..+ +.....++|+++.. | .++.... .+....|+|+|+.+++....
T Consensus 57 ~~~~~~~~~~~spdg~~~~~~~~------~~~~~~~l~l~~~~-g---~~~~~~~-----~v~~~~~~~~g~~~~~~~~~ 121 (620)
T COG1506 57 TFGGGVSELRWSPDGSVLAFVST------DGGRVAQLYLVDVG-G---LITKTAF-----GVSDARWSPDGDRIAFLTAE 121 (620)
T ss_pred ccCCcccccccCCCCCEEEEEec------cCCCcceEEEEecC-C---ceeeeec-----ccccceeCCCCCeEEEEecc
Confidence 45557999999999999999863 11336899999988 4 2221121 35677999999999994322
Q ss_pred CCCCCCCC--CCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCC-ceeeeeccCC
Q 004574 108 SRRDPPKK--TMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTP-AVYTAVEPSP 184 (744)
Q Consensus 108 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~-~~~~~~~~Sp 184 (744)
........ ...+..+.+....+ ....++|+++..+....+... ..+..+.+.+
T Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~g------------------------~~~~~l~~~d~~~~~~~~~~~~~~~~~~~~~~ 177 (620)
T COG1506 122 GASKRDGGDHLFVDRLPVWFDGRG------------------------GERSDLYVVDIESKLIKLGLGNLDVVSFATDG 177 (620)
T ss_pred cccccCCceeeeecccceeecCCC------------------------CcccceEEEccCcccccccCCCCceeeeeeCC
Confidence 11110000 00000001100000 123689999986544444444 6677788888
Q ss_pred CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeec
Q 004574 185 DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQD 264 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~ 264 (744)
|++.++......+ ........+++....+....++..... ...+.|.+||+. +++.....
T Consensus 178 ~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~gk~-~~~~~~~~ 237 (620)
T COG1506 178 DGRLVASIRLDDD------ADPWVTNLYVLIEGNGELESLTPGEGS-------------ISKLAFDADGKS-IALLGTES 237 (620)
T ss_pred CCceeEEeeeccc------cCCceEeeEEEecCCCceEEEcCCCce-------------eeeeeeCCCCCe-eEEeccCC
Confidence 8887777655432 011112333333344555555555443 445689999997 66665443
Q ss_pred CCCCCccCCccceEEeccCCCCCCCCceEee-eec--cceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceee
Q 004574 265 RGDANVEVSPRDIIYTQPAEPAEGEKPEILH-KLD--LRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVL 341 (744)
Q Consensus 265 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~-~~~--~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l 341 (744)
.... .....+++.+. +.++..... ..+ .......+.-++..++|.+....+...++.+...+. . ..+
T Consensus 238 ~~~~----~~~~~~~~~~~---~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~-~--~~~ 307 (620)
T COG1506 238 DRGL----AEGDFILLLDG---ELGEVDGDLSSGDDTRGAWAVEGGLDGDGLLFIATDGGGSSPLFRVDDLGG-G--VEG 307 (620)
T ss_pred ccCc----cccceEEEEec---cccccceeeccCCcccCcHHhccccCCCcEEEEEecCCCceEEEEEeccCC-c--eee
Confidence 3211 12233555442 222222211 111 111122223456666666644344445555553332 1 122
Q ss_pred eccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEec
Q 004574 342 FDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDI 402 (744)
Q Consensus 342 ~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~ 402 (744)
...+...+ ..++.+|+.+++.... ....+.+++++.
T Consensus 308 ~~~~~~~v--------~~f~~~~~~~~~~~s~-----------------~~~p~~i~~~~~ 343 (620)
T COG1506 308 LSGDDGGV--------PGFDVDGRKLALAYSS-----------------PTEPPEIYLYDR 343 (620)
T ss_pred ecCCCceE--------EEEeeCCCEEEEEecC-----------------CCCccceEEEcC
Confidence 22221111 2356688888887633 234566888876
No 248
>PLN00181 protein SPA1-RELATED; Provisional
Probab=98.32 E-value=0.00021 Score=83.75 Aligned_cols=195 Identities=12% Similarity=0.054 Sum_probs=109.6
Q ss_pred cccceeecCC-CCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEec-CCcEEEEEecCCC
Q 004574 32 KINFVSWSPD-GKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVN-NSTLLIFTIPSSR 109 (744)
Q Consensus 32 ~~~~p~~SpD-G~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wsp-Dg~~l~~~~~~~~ 109 (744)
.+....|+|. +++||.. ...+.|.++|+.+++........ ...+..+.|+| |+..|+..+.+
T Consensus 534 ~v~~l~~~~~~~~~las~----------~~Dg~v~lWd~~~~~~~~~~~~H----~~~V~~l~~~p~~~~~L~Sgs~D-- 597 (793)
T PLN00181 534 KLSGICWNSYIKSQVASS----------NFEGVVQVWDVARSQLVTEMKEH----EKRVWSIDYSSADPTLLASGSDD-- 597 (793)
T ss_pred ceeeEEeccCCCCEEEEE----------eCCCeEEEEECCCCeEEEEecCC----CCCEEEEEEcCCCCCEEEEEcCC--
Confidence 4667899875 6666654 23356777788877643332221 11466889997 77777665321
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCCceeeeecc-CCCC
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTPAVYTAVEP-SPDQ 186 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~~~~~~~~~-SpDG 186 (744)
+.|.++|+ ++ ....+.....+....| +++|
T Consensus 598 -----------------------------------------------g~v~iWd~~~~~~~~~~~~~~~v~~v~~~~~~g 630 (793)
T PLN00181 598 -----------------------------------------------GSVKLWSINQGVSIGTIKTKANICCVQFPSESG 630 (793)
T ss_pred -----------------------------------------------CEEEEEECCCCcEEEEEecCCCeEEEEEeCCCC
Confidence 46777787 56 3333333345667778 5678
Q ss_pred ceEEEEEeeCCcccccccCCCcceEEEEeCCCCe--eeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeec
Q 004574 187 KYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL--VRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQD 264 (744)
Q Consensus 187 ~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~--~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~ 264 (744)
+.|+..+.. ..|++||+.... ...+..+. ..+..+.|+ |+.. |+..+
T Consensus 631 ~~latgs~d-------------g~I~iwD~~~~~~~~~~~~~h~-------------~~V~~v~f~-~~~~-lvs~s--- 679 (793)
T PLN00181 631 RSLAFGSAD-------------HKVYYYDLRNPKLPLCTMIGHS-------------KTVSYVRFV-DSST-LVSSS--- 679 (793)
T ss_pred CEEEEEeCC-------------CeEEEEECCCCCccceEecCCC-------------CCEEEEEEe-CCCE-EEEEE---
Confidence 888766443 379999986442 12222111 114567786 6654 44332
Q ss_pred CCCCCccCCccceEEeccCCCCC----CCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 265 RGDANVEVSPRDIIYTQPAEPAE----GEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 265 ~~~~~~~~~~~~~l~~~~~~~~~----~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
..+.|.++++.... ......+......+..++|+|+|++|+..+. ...+++++...
T Consensus 680 ---------~D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs~----D~~v~iw~~~~ 739 (793)
T PLN00181 680 ---------TDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGSE----TNEVFVYHKAF 739 (793)
T ss_pred ---------CCCEEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEeC----CCEEEEEECCC
Confidence 11235666652100 1112233333445677899999998887762 23466666543
No 249
>TIGR03230 lipo_lipase lipoprotein lipase. Members of this protein family are lipoprotein lipase (EC 3.1.1.34), a eukaryotic triacylglycerol lipase active in plasma and similar to pancreatic and hepatic triacylglycerol lipases (EC 3.1.1.3). It is also called clearing factor. It cleaves chylomicron and VLDL triacylglycerols; it also has phospholipase A-1 activity.
Probab=98.30 E-value=4.2e-06 Score=87.70 Aligned_cols=101 Identities=9% Similarity=-0.047 Sum_probs=66.6
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHh--CCeEEEecCCCCCCCCCCCC----------hHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLA--RRFAVLAGPSIPIIGEGDKL----------PNDSAEA 580 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~G~~v~~~~~~~~~g~g~~~----------~~~d~~~ 580 (744)
.|++|++||.+.. ..+..+.......|.. ..|.|++. +..+.+.+. .-+++.+
T Consensus 41 ~ptvIlIHG~~~s------------~~~~~w~~~l~~al~~~~~d~nVI~V---Dw~g~g~s~y~~a~~~t~~vg~~la~ 105 (442)
T TIGR03230 41 TKTFIVIHGWTVT------------GMFESWVPKLVAALYEREPSANVIVV---DWLSRAQQHYPTSAAYTKLVGKDVAK 105 (442)
T ss_pred CCeEEEECCCCcC------------CcchhhHHHHHHHHHhccCCCEEEEE---ECCCcCCCCCccccccHHHHHHHHHH
Confidence 6889999995310 0011111112333332 36999983 333333221 1125677
Q ss_pred HHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC
Q 004574 581 AVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS 628 (744)
Q Consensus 581 ~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~ 628 (744)
.+++|.+...++.+++.|+||||||.+|..++...|.++..+++++|.
T Consensus 106 lI~~L~~~~gl~l~~VhLIGHSLGAhIAg~ag~~~p~rV~rItgLDPA 153 (442)
T TIGR03230 106 FVNWMQEEFNYPWDNVHLLGYSLGAHVAGIAGSLTKHKVNRITGLDPA 153 (442)
T ss_pred HHHHHHHhhCCCCCcEEEEEECHHHHHHHHHHHhCCcceeEEEEEcCC
Confidence 778877654466789999999999999999999999999999999885
No 250
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=98.29 E-value=0.00019 Score=78.39 Aligned_cols=145 Identities=9% Similarity=0.000 Sum_probs=84.5
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC-CCeeeeccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD-GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~-g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
..+...+++-+|+.|++.+.+. .|-+.+.+ ++..+.+-.+... +..+.+.|.|
T Consensus 97 lp~r~~~v~g~g~~iaagsdD~-------------~vK~~~~~D~s~~~~lrgh~ap-------------Vl~l~~~p~~ 150 (933)
T KOG1274|consen 97 LPIRDLAVSGSGKMIAAGSDDT-------------AVKLLNLDDSSQEKVLRGHDAP-------------VLQLSYDPKG 150 (933)
T ss_pred ccceEEEEecCCcEEEeecCce-------------eEEEEeccccchheeecccCCc-------------eeeeeEcCCC
Confidence 4466788999999999887663 45666654 3333333333222 5578899999
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceE-eeee--------ccceeceeeccCCceEEEeeeeeccce
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEI-LHKL--------DLRFRSVSWCDDSLALVNETWYKTSQT 324 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-l~~~--------~~~~~~~~~SpDg~~l~~~~~~~~~~~ 324 (744)
.- |+.++ -.+.|.++++ ..+.... ++.. ..-...++|+|+|..+++..- ..
T Consensus 151 ~f-LAvss------------~dG~v~iw~~---~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~----d~ 210 (933)
T KOG1274|consen 151 NF-LAVSS------------CDGKVQIWDL---QDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPV----DN 210 (933)
T ss_pred CE-EEEEe------------cCceEEEEEc---ccchhhhhcccCCccccccccceeeeeeecCCCCeEEeecc----CC
Confidence 74 55443 2345778877 3333222 2211 223467999999777776541 22
Q ss_pred eEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 325 RTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 325 ~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.+.+++..+-....+...+..... ...++|||.|+|||...
T Consensus 211 ~Vkvy~r~~we~~f~Lr~~~~ss~------~~~~~wsPnG~YiAAs~ 251 (933)
T KOG1274|consen 211 TVKVYSRKGWELQFKLRDKLSSSK------FSDLQWSPNGKYIAAST 251 (933)
T ss_pred eEEEEccCCceeheeecccccccc------eEEEEEcCCCcEEeeec
Confidence 466666665322222222222211 12267999999999886
No 251
>PF00135 COesterase: Carboxylesterase family The prints entry is specific to acetylcholinesterase; InterPro: IPR002018 Higher eukaryotes have many distinct esterases. Among the different types are those which act on carboxylic esters (3.1.1 from EC). Carboxyl-esterases have been classified into three categories (A, B and C) on the basis of differential patterns of inhibition by organophosphates. The sequence of a number of type-B carboxylesterases indicates [, , ] that the majority are evolutionary related. As is the case for lipases and serine proteases, the catalytic apparatus of esterases involves three residues (catalytic triad): a serine, a glutamate or aspartate and a histidine.; PDB: 3B3Q_A 1CLE_B 1GQS_A 2VJD_A 1HBJ_A 2C5G_A 1U65_A 2WG1_A 1FSS_A 3M3D_A ....
Probab=98.27 E-value=2.9e-06 Score=95.24 Aligned_cols=121 Identities=21% Similarity=0.222 Sum_probs=79.2
Q ss_pred EEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCC-CCCCC---C
Q 004574 495 LTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSI-PIIGE---G 570 (744)
Q Consensus 495 l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~-~~~g~---g 570 (744)
+..-+|.|.....+. ++||+||+|||+|..+.... .. ......++.++++|+..+|+ +..|+ +
T Consensus 109 L~LnI~~P~~~~~~~--~lPV~v~ihGG~f~~G~~~~------~~-----~~~~~~~~~~~vivVt~nYRlg~~Gfl~~~ 175 (535)
T PF00135_consen 109 LYLNIYTPSNASSNS--KLPVMVWIHGGGFMFGSGSF------PP-----YDGASLAASKDVIVVTINYRLGAFGFLSLG 175 (535)
T ss_dssp -EEEEEEETSSSSTT--SEEEEEEE--STTTSSCTTS------GG-----GHTHHHHHHHTSEEEEE----HHHHH-BSS
T ss_pred HHHhhhhcccccccc--ccceEEEeecccccCCCccc------cc-----ccccccccCCCEEEEEeccccccccccccc
Confidence 666888998854332 59999999999876543310 01 12244567889999986653 22221 1
Q ss_pred ------CCChHHHHHHHHHHHHHc---CCCCCCcEEEEEechHHHHHHHHHHhC--CCceeEEEEccCC
Q 004574 571 ------DKLPNDSAEAAVEEVVRR---GVADPSRIAVGGHSYGAFMTAHLLAHA--PHLFCCGIARSGS 628 (744)
Q Consensus 571 ------~~~~~~d~~~~~~~l~~~---~~~d~~~i~l~G~S~GG~~a~~~~~~~--p~~~~~~v~~~~~ 628 (744)
...-+.|...|++|++++ =.-|+++|-|+|+|.||..+..++... ..+|..+|+.+|.
T Consensus 176 ~~~~~~gN~Gl~Dq~~AL~WV~~nI~~FGGDp~~VTl~G~SAGa~sv~~~l~sp~~~~LF~raI~~SGs 244 (535)
T PF00135_consen 176 DLDAPSGNYGLLDQRLALKWVQDNIAAFGGDPDNVTLFGQSAGAASVSLLLLSPSSKGLFHRAILQSGS 244 (535)
T ss_dssp STTSHBSTHHHHHHHHHHHHHHHHGGGGTEEEEEEEEEEETHHHHHHHHHHHGGGGTTSBSEEEEES--
T ss_pred ccccCchhhhhhhhHHHHHHHHhhhhhcccCCcceeeeeecccccccceeeeccccccccccccccccc
Confidence 111233999999999996 236899999999999999998877652 3589999999984
No 252
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=98.25 E-value=6.2e-05 Score=81.99 Aligned_cols=179 Identities=15% Similarity=0.196 Sum_probs=116.3
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
.+..+++-+|+++|+.+ ....|-+++++.+.....+.+.. +.+..+.+.|.++.|+..+-+
T Consensus 99 ~r~~~v~g~g~~iaags----------dD~~vK~~~~~D~s~~~~lrgh~----apVl~l~~~p~~~fLAvss~d----- 159 (933)
T KOG1274|consen 99 IRDLAVSGSGKMIAAGS----------DDTAVKLLNLDDSSQEKVLRGHD----APVLQLSYDPKGNFLAVSSCD----- 159 (933)
T ss_pred ceEEEEecCCcEEEeec----------CceeEEEEeccccchheeecccC----CceeeeeEcCCCCEEEEEecC-----
Confidence 67789999999999963 34667778877665555443332 146788999999999987432
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCC-eeecCC---------Cceeeeec
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGT-AKDFGT---------PAVYTAVE 181 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~-~~~l~~---------~~~~~~~~ 181 (744)
+++.++++ +|. ...++. ...+..++
T Consensus 160 --------------------------------------------G~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~a 195 (933)
T KOG1274|consen 160 --------------------------------------------GKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLA 195 (933)
T ss_pred --------------------------------------------ceEEEEEcccchhhhhcccCCccccccccceeeeee
Confidence 67888888 662 222211 13456899
Q ss_pred cCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCee-eeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEE
Q 004574 182 PSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV-RELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWV 260 (744)
Q Consensus 182 ~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~ 260 (744)
|+|||..+++...++ .+-+|+.++.+. ..|...... .....++|||.|++ |+-.
T Consensus 196 W~Pk~g~la~~~~d~-------------~Vkvy~r~~we~~f~Lr~~~~s-----------s~~~~~~wsPnG~Y-iAAs 250 (933)
T KOG1274|consen 196 WHPKGGTLAVPPVDN-------------TVKVYSRKGWELQFKLRDKLSS-----------SKFSDLQWSPNGKY-IAAS 250 (933)
T ss_pred ecCCCCeEEeeccCC-------------eEEEEccCCceeheeecccccc-----------cceEEEEEcCCCcE-Eeee
Confidence 999988888876654 577888776642 333222211 11446899999997 5544
Q ss_pred EeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEee
Q 004574 261 EAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 261 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
. ..++|.+++. ++-+ + -.....+...+|-|+...|-+..
T Consensus 251 ~------------~~g~I~vWnv---~t~~-~--~~~~~~Vc~~aw~p~~n~it~~~ 289 (933)
T KOG1274|consen 251 T------------LDGQILVWNV---DTHE-R--HEFKRAVCCEAWKPNANAITLIT 289 (933)
T ss_pred c------------cCCcEEEEec---ccch-h--ccccceeEEEecCCCCCeeEEEe
Confidence 2 4456888888 3311 1 11234566778888887776665
No 253
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=98.24 E-value=1.6e-05 Score=80.94 Aligned_cols=166 Identities=13% Similarity=0.256 Sum_probs=104.1
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.+...+|||||.-|.-+.. ++.-.|| .-.|--...+.+... .+..++|.||+..++|...
T Consensus 106 A~~~gRW~~dGtgLlt~GE--------DG~iKiW--SrsGMLRStl~Q~~~-----~v~c~~W~p~S~~vl~c~g----- 165 (737)
T KOG1524|consen 106 AISSGRWSPDGAGLLTAGE--------DGVIKIW--SRSGMLRSTVVQNEE-----SIRCARWAPNSNSIVFCQG----- 165 (737)
T ss_pred hhhhcccCCCCceeeeecC--------CceEEEE--eccchHHHHHhhcCc-----eeEEEEECCCCCceEEecC-----
Confidence 3567899999998887532 4444555 434443334444443 3567899999999999721
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCC--CCeeecCCC-ceeeeeccCCCCce
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLD--GTAKDFGTP-AVYTAVEPSPDQKY 188 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~--G~~~~l~~~-~~~~~~~~SpDG~~ 188 (744)
.++++-++. .++-+...+ +.+....|+|...-
T Consensus 166 ---------------------------------------------~h~~IKpL~~n~k~i~WkAHDGiiL~~~W~~~s~l 200 (737)
T KOG1524|consen 166 ---------------------------------------------GHISIKPLAANSKIIRWRAHDGLVLSLSWSTQSNI 200 (737)
T ss_pred ---------------------------------------------CeEEEeecccccceeEEeccCcEEEEeecCccccc
Confidence 578888883 355566566 78889999998886
Q ss_pred EEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCC
Q 004574 189 VLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDA 268 (744)
Q Consensus 189 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~ 268 (744)
|+ +..++ ...-+||.-|.....-.... +.+.+++|.|| + +|.+
T Consensus 201 I~-sgGED------------~kfKvWD~~G~~Lf~S~~~e-------------y~ITSva~npd-~--~~~v-------- 243 (737)
T KOG1524|consen 201 IA-SGGED------------FRFKIWDAQGANLFTSAAEE-------------YAITSVAFNPE-K--DYLL-------- 243 (737)
T ss_pred ee-ecCCc------------eeEEeecccCcccccCChhc-------------cceeeeeeccc-c--ceee--------
Confidence 65 33332 24556787776433322111 22678999999 3 3222
Q ss_pred CccCCccceEEeccCCCCCCCCceEee-eeccceeceeeccCCceEEEee
Q 004574 269 NVEVSPRDIIYTQPAEPAEGEKPEILH-KLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 269 ~~~~~~~~~l~~~~~~~~~~~~~~~l~-~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
+-.+. .++. +.-+.+..++||+||.+++...
T Consensus 244 ----------~S~nt--------~R~~~p~~GSifnlsWS~DGTQ~a~gt 275 (737)
T KOG1524|consen 244 ----------WSYNT--------ARFSSPRVGSIFNLSWSADGTQATCGT 275 (737)
T ss_pred ----------eeeee--------eeecCCCccceEEEEEcCCCceeeccc
Confidence 11111 1111 1356678899999999998876
No 254
>COG2021 MET2 Homoserine acetyltransferase [Amino acid transport and metabolism]
Probab=98.23 E-value=4.2e-05 Score=76.32 Aligned_cols=66 Identities=20% Similarity=0.068 Sum_probs=49.4
Q ss_pred cCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 661 ANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 661 ~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
+++++.|+|++.-+.|.+.| +.+.+++.++|+..+. -.++-...||.-. .-....+...+..||+.
T Consensus 302 l~~i~~~~lv~gi~sD~lfp--~~~~~~~~~~L~~~~~--~~~i~S~~GHDaF-L~e~~~~~~~i~~fL~~ 367 (368)
T COG2021 302 LARIKAPVLVVGITSDWLFP--PELQRALAEALPAAGA--LREIDSPYGHDAF-LVESEAVGPLIRKFLAL 367 (368)
T ss_pred HhcCccCEEEEEecccccCC--HHHHHHHHHhccccCc--eEEecCCCCchhh-hcchhhhhHHHHHHhhc
Confidence 56789999999999999999 9999999999987765 2234445688744 22234566778888763
No 255
>TIGR01849 PHB_depoly_PhaZ polyhydroxyalkanoate depolymerase, intracellular. This model represents an intracellular depolymerase for polyhydroxyalkanoate (PHA), a carbon and energy storing polyester that accumulates in granules in many bacterial species when carbon sources are abundant but other nutrients are limiting. This family is named for PHAs generally, rather than polyhydroxybutyrate (PHB) specificially as in Ralstonia eutropha H16, to avoid overcalling chemical specificity in other species. Note that this family lacks the classic GXSXG lipase motif and instead shows weak similarity to some
Probab=98.21 E-value=6.3e-05 Score=78.12 Aligned_cols=70 Identities=19% Similarity=0.217 Sum_probs=54.6
Q ss_pred cCCCC-CCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCC-CcEEEEEeCCCCcc--cCccccHHHHHHHHHHHHHHh
Q 004574 661 ANKIK-KPILIIHGEVDDKVGLFPMQAERFFDALKGHG-ALSRLVLLPFEHHV--YAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 661 ~~~~~-~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~~H~--~~~~~~~~~~~~~~~~fl~~~ 732 (744)
+.+|+ +|+|.+.|++|.+++ +.+++.+.+.....+ ...+.++.+++||. |.......+.+..+.+||.++
T Consensus 333 l~~I~~~pll~V~ge~D~I~p--~~qt~aa~~l~~~~~s~~k~~~~~~~~GH~Gvf~G~r~~~~i~P~i~~wl~~~ 406 (406)
T TIGR01849 333 PGAITRVALLTVEGENDDISG--LGQTKAALRLCTGIPEDMKRHHLQPGVGHYGVFSGSRFREEIYPLVREFIRRN 406 (406)
T ss_pred HHHCcccceEEEeccCCCcCC--HHHhHHHHHHhhcCChhhceEeecCCCCeEEEeeChhhhhhhchHHHHHHHhC
Confidence 45688 999999999999999 999998888764444 34567777788996 344555678888999999763
No 256
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=98.20 E-value=0.0004 Score=65.29 Aligned_cols=147 Identities=14% Similarity=0.180 Sum_probs=87.7
Q ss_pred ceeeeeccCCC-CceEEEEEeeCCcccccccCCCcceEEEEeCCCCe---eeeccCCCCCCCCCcccCCccCCCCcccee
Q 004574 175 AVYTAVEPSPD-QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL---VRELCDLPPAEDIPVCYNSVREGMRSISWR 250 (744)
Q Consensus 175 ~~~~~~~~SpD-G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~~~s 250 (744)
+.+-.++|+|- |. |+++.... ..|.+|+..+.+ .+.+.+. . ....++.++||
T Consensus 15 ~r~W~~awhp~~g~-ilAscg~D------------k~vriw~~~~~~s~~ck~vld~--~---------hkrsVRsvAws 70 (312)
T KOG0645|consen 15 DRVWSVAWHPGKGV-ILASCGTD------------KAVRIWSTSSGDSWTCKTVLDD--G---------HKRSVRSVAWS 70 (312)
T ss_pred CcEEEEEeccCCce-EEEeecCC------------ceEEEEecCCCCcEEEEEeccc--c---------chheeeeeeec
Confidence 45678899997 55 55554432 367778877422 2222222 1 11228899999
Q ss_pred cCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEee---eeccceeceeeccCCceEEEeeeeeccceeEE
Q 004574 251 ADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILH---KLDLRFRSVSWCDDSLALVNETWYKTSQTRTW 327 (744)
Q Consensus 251 pDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~---~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~ 327 (744)
|.|+. |+..++... +.++.- ..++...+. ..+-.+-.++||++|.+|+..+.++ .+|
T Consensus 71 p~g~~-La~aSFD~t------------~~Iw~k---~~~efecv~~lEGHEnEVK~Vaws~sG~~LATCSRDK----SVW 130 (312)
T KOG0645|consen 71 PHGRY-LASASFDAT------------VVIWKK---EDGEFECVATLEGHENEVKCVAWSASGNYLATCSRDK----SVW 130 (312)
T ss_pred CCCcE-EEEeeccce------------EEEeec---CCCceeEEeeeeccccceeEEEEcCCCCEEEEeeCCC----eEE
Confidence 99997 777764311 233322 234444433 3355678899999999999987433 477
Q ss_pred EEcCCCC-CCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 328 LVCPGSK-DVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 328 ~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
++.++++ +.+-..+...+.+++ -.+.|.|....|+..+
T Consensus 131 iWe~deddEfec~aVL~~HtqDV------K~V~WHPt~dlL~S~S 169 (312)
T KOG0645|consen 131 IWEIDEDDEFECIAVLQEHTQDV------KHVIWHPTEDLLFSCS 169 (312)
T ss_pred EEEecCCCcEEEEeeeccccccc------cEEEEcCCcceeEEec
Confidence 7777654 223334455555542 2266888877766654
No 257
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=98.19 E-value=0.00033 Score=74.61 Aligned_cols=193 Identities=16% Similarity=0.134 Sum_probs=120.4
Q ss_pred ceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCC
Q 004574 35 FVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPK 114 (744)
Q Consensus 35 ~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~ 114 (744)
..+||++|+.|+.+. ++.|-++|+++++.. +......+.. ....+..+||++.|+.+...
T Consensus 24 ~~~~s~nG~~L~t~~-----------~d~Vi~idv~t~~~~-l~s~~~ed~d-~ita~~l~~d~~~L~~a~rs------- 83 (775)
T KOG0319|consen 24 PVAWSSNGQHLYTAC-----------GDRVIIIDVATGSIA-LPSGSNEDED-EITALALTPDEEVLVTASRS------- 83 (775)
T ss_pred ceeECCCCCEEEEec-----------CceEEEEEccCCcee-cccCCccchh-hhheeeecCCccEEEEeecc-------
Confidence 389999999998854 366888899999875 4333322211 35678999999988876321
Q ss_pred CCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeec-CC-C-ceeeeeccCCCCceEE
Q 004574 115 KTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDF-GT-P-AVYTAVEPSPDQKYVL 190 (744)
Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l-~~-~-~~~~~~~~SpDG~~i~ 190 (744)
..+-++.+ +|+..+. .. + ..+..++++|-|.-++
T Consensus 84 ------------------------------------------~llrv~~L~tgk~irswKa~He~Pvi~ma~~~~g~LlA 121 (775)
T KOG0319|consen 84 ------------------------------------------QLLRVWSLPTGKLIRSWKAIHEAPVITMAFDPTGTLLA 121 (775)
T ss_pred ------------------------------------------ceEEEEEcccchHhHhHhhccCCCeEEEEEcCCCceEE
Confidence 45666677 6743322 22 3 6777889999994433
Q ss_pred EEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCC-CCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCC
Q 004574 191 ITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDL-PPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDAN 269 (744)
Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~ 269 (744)
+... ...+-+||..++....-..+ ++- +..+.|.|+-.+.+.+...
T Consensus 122 -tgga------------D~~v~VWdi~~~~~th~fkG~gGv-------------Vssl~F~~~~~~~lL~sg~------- 168 (775)
T KOG0319|consen 122 -TGGA------------DGRVKVWDIKNGYCTHSFKGHGGV-------------VSSLLFHPHWNRWLLASGA------- 168 (775)
T ss_pred -eccc------------cceEEEEEeeCCEEEEEecCCCce-------------EEEEEeCCccchhheeecC-------
Confidence 3222 24788999988765543332 222 5567777776552333321
Q ss_pred ccCCccceEEeccCCCCCCCCc--eEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 270 VEVSPRDIIYTQPAEPAEGEKP--EILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 270 ~~~~~~~~l~~~~~~~~~~~~~--~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
....+.++++ ..+.. ..+..+.-.+.++++++|+.-+++...+ .-++++|+..
T Consensus 169 ----~D~~v~vwnl---~~~~tcl~~~~~H~S~vtsL~~~~d~~~~ls~~RD----kvi~vwd~~~ 223 (775)
T KOG0319|consen 169 ----TDGTVRVWNL---NDKRTCLHTMILHKSAVTSLAFSEDSLELLSVGRD----KVIIVWDLVQ 223 (775)
T ss_pred ----CCceEEEEEc---ccCchHHHHHHhhhhheeeeeeccCCceEEEeccC----cEEEEeehhh
Confidence 1224677776 32222 1122334567889999999999988733 3577777754
No 258
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=98.19 E-value=7.4e-05 Score=73.90 Aligned_cols=223 Identities=16% Similarity=0.205 Sum_probs=123.8
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCCcEEEEEecCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRR 110 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~ 110 (744)
.+...+|.|+|++|...+. ++...|| +..+=. +..+..++. .+..+.||++|.+++.... +
T Consensus 98 ~V~~v~WtPeGRRLltgs~--------SGEFtLW--Ng~~fnFEtilQaHDs-----~Vr~m~ws~~g~wmiSgD~---g 159 (464)
T KOG0284|consen 98 PVNVVRWTPEGRRLLTGSQ--------SGEFTLW--NGTSFNFETILQAHDS-----PVRTMKWSHNGTWMISGDK---G 159 (464)
T ss_pred ceeeEEEcCCCceeEeecc--------cccEEEe--cCceeeHHHHhhhhcc-----cceeEEEccCCCEEEEcCC---C
Confidence 4678899999999998764 5556677 321101 111111111 3678899999999875411 1
Q ss_pred CCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC-CeeecCCC--ceeeeeccCCCCc
Q 004574 111 DPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG-TAKDFGTP--AVYTAVEPSPDQK 187 (744)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G-~~~~l~~~--~~~~~~~~SpDG~ 187 (744)
+-|-.++.+= .++.+..+ ..+..++|||...
T Consensus 160 ----------------------------------------------G~iKyWqpnmnnVk~~~ahh~eaIRdlafSpnDs 193 (464)
T KOG0284|consen 160 ----------------------------------------------GMIKYWQPNMNNVKIIQAHHAEAIRDLAFSPNDS 193 (464)
T ss_pred ----------------------------------------------ceEEecccchhhhHHhhHhhhhhhheeccCCCCc
Confidence 2222333321 11111111 5677899999555
Q ss_pred eEEEEEeeCCcccccccCCCcceEEEEeCC-CCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCC
Q 004574 188 YVLITSMHRPYSYKVPCARFSQKVQVWTTD-GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRG 266 (744)
Q Consensus 188 ~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~-g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~ 266 (744)
..+ +..++ ..|.+||.. .++.+.|...... +..+.|.|.-. |+++...+.
T Consensus 194 kF~-t~SdD------------g~ikiWdf~~~kee~vL~GHgwd-------------VksvdWHP~kg--LiasgskDn- 244 (464)
T KOG0284|consen 194 KFL-TCSDD------------GTIKIWDFRMPKEERVLRGHGWD-------------VKSVDWHPTKG--LIASGSKDN- 244 (464)
T ss_pred eeE-EecCC------------CeEEEEeccCCchhheeccCCCC-------------cceeccCCccc--eeEEccCCc-
Confidence 433 33332 378888854 5556666433333 67789999865 444432211
Q ss_pred CCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeecccc
Q 004574 267 DANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVF 346 (744)
Q Consensus 267 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 346 (744)
.+-++|.+ .+.....|......+..+.|+|+|.+|+..+.+. .+.++|+-.- +..+...+..
T Consensus 245 ----------lVKlWDpr--Sg~cl~tlh~HKntVl~~~f~~n~N~Llt~skD~----~~kv~DiR~m--kEl~~~r~Hk 306 (464)
T KOG0284|consen 245 ----------LVKLWDPR--SGSCLATLHGHKNTVLAVKFNPNGNWLLTGSKDQ----SCKVFDIRTM--KELFTYRGHK 306 (464)
T ss_pred ----------eeEeecCC--CcchhhhhhhccceEEEEEEcCCCCeeEEccCCc----eEEEEehhHh--HHHHHhhcch
Confidence 36666652 3333334455566788899999999999877332 4556665421 1112222222
Q ss_pred ccccCCCCCCceeeCCCCCeEEEEe
Q 004574 347 ENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 347 ~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.++.. +.|+|=-..|+...
T Consensus 307 kdv~~------~~WhP~~~~lftsg 325 (464)
T KOG0284|consen 307 KDVTS------LTWHPLNESLFTSG 325 (464)
T ss_pred hhhee------eccccccccceeec
Confidence 22222 45777777666553
No 259
>PF10142 PhoPQ_related: PhoPQ-activated pathogenicity-related protein; InterPro: IPR009199 Proteins in this entry are believed to play a role in virulence/pathogenicity in Salmonella. Salmonella typhi PqaA has been shown to be activated by PhoP/Q two-component regulatory system, which regulates many virulence genes []. It has been also shown to confer resistance to antimicrobial peptides (melittin) []. Members of this family are predicted to belong to the alpha/beta hydrolase domain superfamily.
Probab=98.19 E-value=0.00014 Score=74.39 Aligned_cols=142 Identities=15% Similarity=0.122 Sum_probs=101.1
Q ss_pred cCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccC-CCCCC---------CC-CCc-----ccccccchh----h
Q 004574 588 RGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSG-SYNKT---------LT-PFG-----FQTEFRTLW----E 647 (744)
Q Consensus 588 ~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~-~~~~~---------~~-~~~-----~~~~~~~~~----~ 647 (744)
...++.++..+.|.|-=|..++.+|+.+ +|++|++.+.- +.+.. +. .+. +..+.-..+ .
T Consensus 166 ~~~~~i~~FvV~GaSKRGWTtWltaa~D-~RV~aivP~Vid~LN~~~~l~h~y~~yG~~ws~a~~dY~~~gi~~~l~tp~ 244 (367)
T PF10142_consen 166 KFGVNIEKFVVTGASKRGWTTWLTAAVD-PRVKAIVPIVIDVLNMKANLEHQYRSYGGNWSFAFQDYYNEGITQQLDTPE 244 (367)
T ss_pred hcCCCccEEEEeCCchHhHHHHHhhccC-cceeEEeeEEEccCCcHHHHHHHHHHhCCCCccchhhhhHhCchhhcCCHH
Confidence 3456778999999999999999888865 78888776542 22210 00 111 111100000 1
Q ss_pred cHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHH
Q 004574 648 ATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDR 727 (744)
Q Consensus 648 ~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~ 727 (744)
..+...-.+|+.+.+++.+|-|++.|..|++.. ++.+.-++..|+. +..+.++|+++|.... .++...+..
T Consensus 245 f~~L~~ivDP~~Y~~rL~~PK~ii~atgDeFf~--pD~~~~y~d~L~G---~K~lr~vPN~~H~~~~----~~~~~~l~~ 315 (367)
T PF10142_consen 245 FDKLMQIVDPYSYRDRLTMPKYIINATGDEFFV--PDSSNFYYDKLPG---EKYLRYVPNAGHSLIG----SDVVQSLRA 315 (367)
T ss_pred HHHHHHhcCHHHHHHhcCccEEEEecCCCceec--cCchHHHHhhCCC---CeeEEeCCCCCcccch----HHHHHHHHH
Confidence 123445668999999999999999999999988 8999999988874 5689999999998653 677888999
Q ss_pred HHHHhccCCCCC
Q 004574 728 WLQKYCLSNTSD 739 (744)
Q Consensus 728 fl~~~l~~~~~~ 739 (744)
|+...+.....+
T Consensus 316 f~~~~~~~~~lP 327 (367)
T PF10142_consen 316 FYNRIQNGRPLP 327 (367)
T ss_pred HHHHHHcCCCCC
Confidence 999877654433
No 260
>KOG2551 consensus Phospholipase/carboxyhydrolase [Amino acid transport and metabolism]
Probab=98.17 E-value=1.5e-05 Score=73.09 Aligned_cols=125 Identities=21% Similarity=0.227 Sum_probs=89.7
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC--C------CceeEEEEccCCCCCCCCCCcccccccchhhc
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA--P------HLFCCGIARSGSYNKTLTPFGFQTEFRTLWEA 648 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~--p------~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 648 (744)
-+..+.+|+++++..| +|+|+|.|+.++..+++.. . ..|+-+|+++|+.-...
T Consensus 91 sl~yl~~~i~enGPFD----GllGFSQGA~laa~l~~~~~~~~~~~~~P~~kF~v~~SGf~~~~~--------------- 151 (230)
T KOG2551|consen 91 SLEYLEDYIKENGPFD----GLLGFSQGAALAALLAGLGQKGLPYVKQPPFKFAVFISGFKFPSK--------------- 151 (230)
T ss_pred HHHHHHHHHHHhCCCc----cccccchhHHHHHHhhcccccCCcccCCCCeEEEEEEecCCCCcc---------------
Confidence 4566667788888888 8999999999999998721 1 14677888888643210
Q ss_pred HHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHH
Q 004574 649 TNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRW 728 (744)
Q Consensus 649 ~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~f 728 (744)
.+.-..+...+++|+|-+.|+.|.++| ...+..|++....+ .++.-| +||.+. +...+.+.+.+|
T Consensus 152 -----~~~~~~~~~~i~~PSLHi~G~~D~iv~--~~~s~~L~~~~~~a----~vl~Hp-ggH~VP---~~~~~~~~i~~f 216 (230)
T KOG2551|consen 152 -----KLDESAYKRPLSTPSLHIFGETDTIVP--SERSEQLAESFKDA----TVLEHP-GGHIVP---NKAKYKEKIADF 216 (230)
T ss_pred -----hhhhhhhccCCCCCeeEEecccceeec--chHHHHHHHhcCCC----eEEecC-CCccCC---CchHHHHHHHHH
Confidence 001112345689999999999999999 89999999988765 444445 469754 344778888999
Q ss_pred HHHhccC
Q 004574 729 LQKYCLS 735 (744)
Q Consensus 729 l~~~l~~ 735 (744)
|...+.+
T Consensus 217 i~~~~~~ 223 (230)
T KOG2551|consen 217 IQSFLQE 223 (230)
T ss_pred HHHHHHh
Confidence 9876643
No 261
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=98.16 E-value=0.0003 Score=69.50 Aligned_cols=117 Identities=20% Similarity=0.225 Sum_probs=72.3
Q ss_pred CCceeeecC-CCCC-cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEec
Q 004574 19 GPEKEVHGY-PDGA-KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVN 96 (744)
Q Consensus 19 g~~~~l~~~-~~~~-~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wsp 96 (744)
+..+.+... +.+. ....|++||||+.+||+.. .++...||+....+.....+ ... ....+.|++
T Consensus 10 ~~~~pv~g~~~~~~~~~~s~AvS~dg~~~A~v~~-------~~~~~~L~~~~~~~~~~~~~-~g~------~l~~PS~d~ 75 (253)
T PF10647_consen 10 GGVTPVPGALGEGGYDVTSPAVSPDGSRVAAVSE-------GDGGRSLYVGPAGGPVRPVL-TGG------SLTRPSWDP 75 (253)
T ss_pred CceeECCCCcCcCCccccceEECCCCCeEEEEEE-------cCCCCEEEEEcCCCcceeec-cCC------ccccccccC
Confidence 445555432 2222 5789999999999999862 25668899998554444444 222 256789999
Q ss_pred CCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC--CeeecCCC
Q 004574 97 NSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG--TAKDFGTP 174 (744)
Q Consensus 97 Dg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G--~~~~l~~~ 174 (744)
+|...++.. . ... ..++....+| ....+...
T Consensus 76 ~g~~W~v~~-~-~~~---------------------------------------------~~~~~~~~~g~~~~~~v~~~ 108 (253)
T PF10647_consen 76 DGWVWTVDD-G-SGG---------------------------------------------VRVVRDSASGTGEPVEVDWP 108 (253)
T ss_pred CCCEEEEEc-C-CCc---------------------------------------------eEEEEecCCCcceeEEeccc
Confidence 977555532 1 000 1222222244 33333322
Q ss_pred --c-eeeeeccCCCCceEEEEEeeC
Q 004574 175 --A-VYTAVEPSPDQKYVLITSMHR 196 (744)
Q Consensus 175 --~-~~~~~~~SpDG~~i~~~~~~~ 196 (744)
. .+..+.+||||.+|++.....
T Consensus 109 ~~~~~I~~l~vSpDG~RvA~v~~~~ 133 (253)
T PF10647_consen 109 GLRGRITALRVSPDGTRVAVVVEDG 133 (253)
T ss_pred ccCCceEEEEECCCCcEEEEEEecC
Confidence 2 688999999999999998654
No 262
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=98.12 E-value=0.0029 Score=63.19 Aligned_cols=140 Identities=9% Similarity=0.038 Sum_probs=71.5
Q ss_pred eeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccc-cccCCCCCCceeeCCCCCeEEEEeeecCCcce
Q 004574 301 FRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFE-NVYSDPGSPMMTRTSTGTNVIAKIKKENDEQI 379 (744)
Q Consensus 301 ~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~ 379 (744)
...++++.++..++|.+ -...+|.+++.+. +++.....+.. +.... -.|-|-|-.++..... .+.-
T Consensus 186 f~~~~~~~~~~~~~F~S----y~G~v~~~dlsg~--~~~~~~~~~~~t~~e~~-----~~WrPGG~Q~~A~~~~--~~rl 252 (342)
T PF06433_consen 186 FEHPAYSRDGGRLYFVS----YEGNVYSADLSGD--SAKFGKPWSLLTDAEKA-----DGWRPGGWQLIAYHAA--SGRL 252 (342)
T ss_dssp -S--EEETTTTEEEEEB----TTSEEEEEEETTS--SEEEEEEEESS-HHHHH-----TTEEE-SSS-EEEETT--TTEE
T ss_pred ccccceECCCCeEEEEe----cCCEEEEEeccCC--cccccCcccccCccccc-----cCcCCcceeeeeeccc--cCeE
Confidence 34567777777788766 4567999998874 33322111100 00000 0155555443222211 1122
Q ss_pred EEEEc-cCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEE
Q 004574 380 YILLN-GRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILS 458 (744)
Q Consensus 380 ~~~~~-~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~ 458 (744)
|++.. ....+++.--.+||.+|+.+++...-+..+.. .. .+++|-|.+=++|..... -..|+.+|
T Consensus 253 yvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l~~~-----~~-------Si~Vsqd~~P~L~~~~~~--~~~l~v~D 318 (342)
T PF06433_consen 253 YVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPLEHP-----ID-------SIAVSQDDKPLLYALSAG--DGTLDVYD 318 (342)
T ss_dssp EEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEEEEEE-----ES-------EEEEESSSS-EEEEEETT--TTEEEEEE
T ss_pred EEEecCCCCCCccCCceEEEEEECCCCeEEEEEeCCCc-----cc-------eEEEccCCCcEEEEEcCC--CCeEEEEe
Confidence 22222 23345555678999999999977555443322 11 147788888888876542 25799999
Q ss_pred CCCCceeee
Q 004574 459 WPLKKSSQI 467 (744)
Q Consensus 459 ~~~g~~~~l 467 (744)
..+|+..+-
T Consensus 319 ~~tGk~~~~ 327 (342)
T PF06433_consen 319 AATGKLVRS 327 (342)
T ss_dssp TTT--EEEE
T ss_pred CcCCcEEee
Confidence 999876553
No 263
>COG4757 Predicted alpha/beta hydrolase [General function prediction only]
Probab=98.11 E-value=3.5e-05 Score=70.82 Aligned_cols=213 Identities=17% Similarity=0.154 Sum_probs=117.6
Q ss_pred EEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCC
Q 004574 485 IKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSI 564 (744)
Q Consensus 485 i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~ 564 (744)
+.+.+.||..+.+..| |.+- +.+--|++-|+ ++.+..+ +...+...+.+||.|+. +
T Consensus 8 ~~l~~~DG~~l~~~~~-pA~~------~~~g~~~va~a-----------~Gv~~~f---YRrfA~~a~~~Gf~Vlt---~ 63 (281)
T COG4757 8 AHLPAPDGYSLPGQRF-PADG------KASGRLVVAGA-----------TGVGQYF---YRRFAAAAAKAGFEVLT---F 63 (281)
T ss_pred cccccCCCccCccccc-cCCC------CCCCcEEeccc-----------CCcchhH---hHHHHHHhhccCceEEE---E
Confidence 5567789999998877 4331 12211222222 1111112 22456677789999999 4
Q ss_pred CCCCCCCCC--------------hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC--
Q 004574 565 PIIGEGDKL--------------PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS-- 628 (744)
Q Consensus 565 ~~~g~g~~~--------------~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~-- 628 (744)
+++|.|.+. ...|+.++++++++... ......+|||+||.+. -++.+++..-.+.|..++.
T Consensus 64 dyRG~g~S~p~~~~~~~~~~~DwA~~D~~aal~~~~~~~~--~~P~y~vgHS~GGqa~-gL~~~~~k~~a~~vfG~gagw 140 (281)
T COG4757 64 DYRGIGQSRPASLSGSQWRYLDWARLDFPAALAALKKALP--GHPLYFVGHSFGGQAL-GLLGQHPKYAAFAVFGSGAGW 140 (281)
T ss_pred ecccccCCCccccccCccchhhhhhcchHHHHHHHHhhCC--CCceEEeeccccceee-cccccCcccceeeEecccccc
Confidence 444444331 22399999999998532 2468999999999765 4555554221222222221
Q ss_pred ---CCC------------CCCCCcccccccchh-----------hcHHH---HHhc------Cc-----ccccCCCCCCE
Q 004574 629 ---YNK------------TLTPFGFQTEFRTLW-----------EATNV---YIEM------SP-----ITHANKIKKPI 668 (744)
Q Consensus 629 ---~~~------------~~~~~~~~~~~~~~~-----------~~~~~---~~~~------~~-----~~~~~~~~~P~ 668 (744)
... ......+-. ...+| ..... +.++ +| .+-.+++.+|+
T Consensus 141 sg~m~~~~~l~~~~l~~lv~p~lt~w~-g~~p~~l~G~G~d~p~~v~RdW~RwcR~p~y~fddp~~~~~~q~yaaVrtPi 219 (281)
T COG4757 141 SGWMGLRERLGAVLLWNLVGPPLTFWK-GYMPKDLLGLGSDLPGTVMRDWARWCRHPRYYFDDPAMRNYRQVYAAVRTPI 219 (281)
T ss_pred ccchhhhhcccceeeccccccchhhcc-ccCcHhhcCCCccCcchHHHHHHHHhcCccccccChhHhHHHHHHHHhcCce
Confidence 000 000000000 00000 00011 1111 11 22245678999
Q ss_pred EEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCC----CcccCccccHHHHHHHHHHHH
Q 004574 669 LIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFE----HHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 669 l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~----~H~~~~~~~~~~~~~~~~~fl 729 (744)
+.+...+|+.+| ....+.+.+..+++ +.+...++.. ||+-...+..+...++.++||
T Consensus 220 ~~~~~~DD~w~P--~As~d~f~~~y~nA--pl~~~~~~~~~~~lGH~gyfR~~~Ealwk~~L~w~ 280 (281)
T COG4757 220 TFSRALDDPWAP--PASRDAFASFYRNA--PLEMRDLPRAEGPLGHMGYFREPFEALWKEMLGWF 280 (281)
T ss_pred eeeccCCCCcCC--HHHHHHHHHhhhcC--cccceecCcccCcccchhhhccchHHHHHHHHHhh
Confidence 999999999998 88888888777654 4566666654 787543444467777777776
No 264
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=98.11 E-value=0.00014 Score=80.81 Aligned_cols=255 Identities=14% Similarity=0.124 Sum_probs=123.3
Q ss_pred ceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEE-eeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 6 GIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAF-SVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf-~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
+|.+.|.+| +..+.+. +.....+..|+|||||++||| ++.. .-.+...||+.++.+.....+--.-+
T Consensus 330 ~L~~~D~dG----~n~~~ve-~~~~~~i~sP~~SPDG~~vAY~ts~e-----~~~g~s~vYv~~L~t~~~~~vkl~ve-- 397 (912)
T TIGR02171 330 NLAYIDYTK----GASRAVE-IEDTISVYHPDISPDGKKVAFCTGIE-----GLPGKSSVYVRNLNASGSGLVKLPVE-- 397 (912)
T ss_pred eEEEEecCC----CCceEEE-ecCCCceecCcCCCCCCEEEEEEeec-----CCCCCceEEEEehhccCCCceEeecc--
Confidence 788999888 7777772 223445889999999999999 6542 11357889999999766554421111
Q ss_pred ccccccceEE--ecCCcE-EEEEecCCCCCCCCCCCCCCCCeeeec-CCCc-ccccccccccCCCchh-hhccceeeeeE
Q 004574 85 LNAVFGSFVW--VNNSTL-LIFTIPSSRRDPPKKTMVPLGPKIQSN-EQKN-IIISRMTDNLLKDEYD-ESLFDYYTTAQ 158 (744)
Q Consensus 85 ~~~~~~~~~w--spDg~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 158 (744)
...-|.| ..+|.. |+|++....+.... ..-...-|+.. ..++ --+..++.+-+.-... ...+...+ ..
T Consensus 398 ---~aaiprwrv~e~gdt~ivyv~~a~nn~d~~--~~~~~stw~v~f~~gkfg~p~kl~dga~hggvs~~~~lavtg-a~ 471 (912)
T TIGR02171 398 ---NAAIPRWRVLENGDTVIVYVSDASNNKDDA--TFAAYSTWQVPFANGKFGTPKKLFDGAYHGGVSEDLNLAVSG-AR 471 (912)
T ss_pred ---cccccceEecCCCCeEEEEEcCCCCCcchh--hhhhcceEEEEecCCCCCCchhhhccccccccccCCceeeeh-hh
Confidence 1234466 466665 56665544433111 00011111111 0010 0011111110000000 00000001 11
Q ss_pred EEEEcC------CCCeeecCCCceeeeeccCCCC-ceEEEEEeeC--CcccccccCCCcceEEEEeCCCCeeeeccCCCC
Q 004574 159 LVLGSL------DGTAKDFGTPAVYTAVEPSPDQ-KYVLITSMHR--PYSYKVPCARFSQKVQVWTTDGKLVRELCDLPP 229 (744)
Q Consensus 159 l~~~~~------~G~~~~l~~~~~~~~~~~SpDG-~~i~~~~~~~--~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~ 229 (744)
|-+..+ +|+-.-.-+.....+.+.+.|| ++-+|..... ...|.+.....-.+|.+.|..|+.++.+....+
T Consensus 472 llr~~~~~~~~~~~~~~vwyn~eqacn~sl~~d~~~rt~fldfgg~tg~~fvg~~y~~he~~lvads~gklv~~v~ap~g 551 (912)
T TIGR02171 472 LLRAHVANEDVDNGKDDVWYNGEQACNASLAKDGSKRTLFLDFGGSTGQAFVGQKYGVHERLLVADSKGKLVRAVAAPAG 551 (912)
T ss_pred HhhhhhcccccccCccceeecchhccchhhhccCCcceEEEecCCccchhhccccccceeEEEEecCCCchhhhccCCCC
Confidence 112222 1211111122345577777787 4566664332 223333333333478888888988887766544
Q ss_pred CCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccc
Q 004574 230 AEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLR 300 (744)
Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~ 300 (744)
... -+-.|-.+.+. ++.+.-.+....+ ..|.++++ ..++...|..++.-
T Consensus 552 ytf------------dh~ew~~~~~~-~~vatl~n~~g~h------~ki~lv~~---~~~~i~~l~eg~el 600 (912)
T TIGR02171 552 YTF------------DHTEWVTGRSN-LAVATLTNVNGAH------KKIALINL---SDSKVTELVEGDEL 600 (912)
T ss_pred ccc------------cchhhhcCCCc-eEEEEeecCCCcc------ceEEEEEc---CCCceEEeeccccc
Confidence 321 12246544344 4444333332221 25778887 56777777765443
No 265
>PF03959 FSH1: Serine hydrolase (FSH1); InterPro: IPR005645 This entry represents proteins belonging to the AB hydrolase family. It consists of serine hydrolases of unknown specificity [, ] and includes uncharacterised proteins.; PDB: 1YCD_A.
Probab=98.10 E-value=2.3e-05 Score=75.39 Aligned_cols=107 Identities=23% Similarity=0.224 Sum_probs=62.6
Q ss_pred HHHHHHHHHHH----cCCCCCCcEEEEEechHHHHHHHHHHhC--------CCceeEEEEccCCCCCCCCCCcccccccc
Q 004574 577 SAEAAVEEVVR----RGVADPSRIAVGGHSYGAFMTAHLLAHA--------PHLFCCGIARSGSYNKTLTPFGFQTEFRT 644 (744)
Q Consensus 577 d~~~~~~~l~~----~~~~d~~~i~l~G~S~GG~~a~~~~~~~--------p~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 644 (744)
++.++++++.+ ++.+ .+|+|+|.||.+|..++... ...++.+|++++...... .
T Consensus 85 ~~~~sl~~l~~~i~~~GPf----dGvlGFSQGA~lAa~ll~~~~~~~~~~~~~~~kf~V~~sg~~p~~~-------~--- 150 (212)
T PF03959_consen 85 GLDESLDYLRDYIEENGPF----DGVLGFSQGAALAALLLALQQRGRPDGAHPPFKFAVFISGFPPPDP-------D--- 150 (212)
T ss_dssp --HHHHHHHHHHHHHH-------SEEEEETHHHHHHHHHHHHHHHHST--T----SEEEEES----EEE-----------
T ss_pred CHHHHHHHHHHHHHhcCCe----EEEEeecHHHHHHHHHHHHHHhhcccccCCCceEEEEEcccCCCch-------h---
Confidence 45666666554 3434 59999999999999888542 235789999988642100 0
Q ss_pred hhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccC
Q 004574 645 LWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYA 713 (744)
Q Consensus 645 ~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~ 713 (744)
........++++|+|-++|++|.+++ .+.++++++..... .+++..++ ||.+-
T Consensus 151 ----------~~~~~~~~~i~iPtlHv~G~~D~~~~--~~~s~~L~~~~~~~---~~v~~h~g-GH~vP 203 (212)
T PF03959_consen 151 ----------YQELYDEPKISIPTLHVIGENDPVVP--PERSEALAEMFDPD---ARVIEHDG-GHHVP 203 (212)
T ss_dssp ----------GTTTT--TT---EEEEEEETT-SSS---HHHHHHHHHHHHHH---EEEEEESS-SSS--
T ss_pred ----------hhhhhccccCCCCeEEEEeCCCCCcc--hHHHHHHHHhccCC---cEEEEECC-CCcCc
Confidence 00011245679999999999999998 88999999988764 57777775 58764
No 266
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=98.10 E-value=7.2e-05 Score=71.46 Aligned_cols=206 Identities=11% Similarity=0.084 Sum_probs=108.4
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
..+..+.||.|..-++-.+.+ .+|-+|.+.+++..+-.+. +...|+..+.||.|+.
T Consensus 264 ~aVlci~FSRDsEMlAsGsqD-------------GkIKvWri~tG~ClRrFdr-----------AHtkGvt~l~FSrD~S 319 (508)
T KOG0275|consen 264 DAVLCISFSRDSEMLASGSQD-------------GKIKVWRIETGQCLRRFDR-----------AHTKGVTCLSFSRDNS 319 (508)
T ss_pred cceEEEeecccHHHhhccCcC-------------CcEEEEEEecchHHHHhhh-----------hhccCeeEEEEccCcc
Confidence 345577889998877655433 3788888877664332221 1224466789999998
Q ss_pred ceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCce-EeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 255 STLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPE-ILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 255 ~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
. +...++. -.+.+..+ ++|+.- .+-.....++.+.|++||.+|+..+.+ + .+.+++..+
T Consensus 320 q-iLS~sfD------------~tvRiHGl---KSGK~LKEfrGHsSyvn~a~ft~dG~~iisaSsD--g--tvkvW~~Kt 379 (508)
T KOG0275|consen 320 Q-ILSASFD------------QTVRIHGL---KSGKCLKEFRGHSSYVNEATFTDDGHHIISASSD--G--TVKVWHGKT 379 (508)
T ss_pred h-hhccccc------------ceEEEecc---ccchhHHHhcCccccccceEEcCCCCeEEEecCC--c--cEEEecCcc
Confidence 7 5444321 01333333 445432 233334467788999999999977622 3 344555444
Q ss_pred CCC--cceee-eccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEE
Q 004574 334 KDV--APRVL-FDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERI 410 (744)
Q Consensus 334 ~~~--~~~~l-~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l 410 (744)
.+- ..+.+ ++-....+...|- + ....|+.. ..+.+|++++.+.-++..
T Consensus 380 teC~~Tfk~~~~d~~vnsv~~~PK------n-peh~iVCN----------------------rsntv~imn~qGQvVrsf 430 (508)
T KOG0275|consen 380 TECLSTFKPLGTDYPVNSVILLPK------N-PEHFIVCN----------------------RSNTVYIMNMQGQVVRSF 430 (508)
T ss_pred hhhhhhccCCCCcccceeEEEcCC------C-CceEEEEc----------------------CCCeEEEEeccceEEeee
Confidence 210 00111 1111111100000 1 11122222 123488999854444444
Q ss_pred eeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceee
Q 004574 411 WESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQ 466 (744)
Q Consensus 411 ~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~ 466 (744)
...+.+ -..+. .+.+||.|.++.+... ...+|..+.-+|+.++
T Consensus 431 sSGkRE-----gGdFi----~~~lSpkGewiYcigE----D~vlYCF~~~sG~LE~ 473 (508)
T KOG0275|consen 431 SSGKRE-----GGDFI----NAILSPKGEWIYCIGE----DGVLYCFSVLSGKLER 473 (508)
T ss_pred ccCCcc-----CCceE----EEEecCCCcEEEEEcc----CcEEEEEEeecCceee
Confidence 333322 11122 2688999987766544 3568888877777665
No 267
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=98.10 E-value=0.0015 Score=65.81 Aligned_cols=144 Identities=11% Similarity=0.017 Sum_probs=81.6
Q ss_pred eEEEEEcC-CCCeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCee-eeccCCCCCCCCC
Q 004574 157 AQLVLGSL-DGTAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV-RELCDLPPAEDIP 234 (744)
Q Consensus 157 ~~l~~~~~-~G~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~-~~l~~~~~~~~~~ 234 (744)
+.|+.++. +|+.+.+..+..+.....-.++..|+.. .. .+++++.+.+.. +.+....-. ...
T Consensus 47 ~~i~r~~~~~g~~~~~~~p~~~~~~~~~d~~g~Lv~~-~~--------------g~~~~~~~~~~~~t~~~~~~~~-~~~ 110 (307)
T COG3386 47 GRIHRLDPETGKKRVFPSPGGFSSGALIDAGGRLIAC-EH--------------GVRLLDPDTGGKITLLAEPEDG-LPL 110 (307)
T ss_pred CeEEEecCCcCceEEEECCCCcccceeecCCCeEEEE-cc--------------ccEEEeccCCceeEEeccccCC-CCc
Confidence 68999999 6777777666444444433344444433 22 245555543333 333322111 000
Q ss_pred cccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee-ccceeceeeccCCceE
Q 004574 235 VCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL-DLRFRSVSWCDDSLAL 313 (744)
Q Consensus 235 ~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~-~~~~~~~~~SpDg~~l 313 (744)
....+....|||. ++|................+.||+++. . +..+++... -...+.++|||||+.+
T Consensus 111 -------~r~ND~~v~pdG~--~wfgt~~~~~~~~~~~~~~G~lyr~~p---~-g~~~~l~~~~~~~~NGla~SpDg~tl 177 (307)
T COG3386 111 -------NRPNDGVVDPDGR--IWFGDMGYFDLGKSEERPTGSLYRVDP---D-GGVVRLLDDDLTIPNGLAFSPDGKTL 177 (307)
T ss_pred -------CCCCceeEcCCCC--EEEeCCCccccCccccCCcceEEEEcC---C-CCEEEeecCcEEecCceEECCCCCEE
Confidence 1145678899997 777654410011111224457899885 3 444555544 5567889999999988
Q ss_pred EEeeeeeccceeEEEEcCC
Q 004574 314 VNETWYKTSQTRTWLVCPG 332 (744)
Q Consensus 314 ~~~~~~~~~~~~l~~~~~~ 332 (744)
+++- +....|++++.+
T Consensus 178 y~aD---T~~~~i~r~~~d 193 (307)
T COG3386 178 YVAD---TPANRIHRYDLD 193 (307)
T ss_pred EEEe---CCCCeEEEEecC
Confidence 8753 345678888876
No 268
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=98.09 E-value=4.5e-05 Score=75.36 Aligned_cols=145 Identities=10% Similarity=0.113 Sum_probs=87.1
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
..+..+.||++|.+++-. +.. .-|-.|++.=..++.+.....+ .+++++|||...
T Consensus 139 s~Vr~m~ws~~g~wmiSg-D~g------------G~iKyWqpnmnnVk~~~ahh~e------------aIRdlafSpnDs 193 (464)
T KOG0284|consen 139 SPVRTMKWSHNGTWMISG-DKG------------GMIKYWQPNMNNVKIIQAHHAE------------AIRDLAFSPNDS 193 (464)
T ss_pred ccceeEEEccCCCEEEEc-CCC------------ceEEecccchhhhHHhhHhhhh------------hhheeccCCCCc
Confidence 567789999999988644 221 2566677665554444333212 278899999887
Q ss_pred ceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCC
Q 004574 255 STLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 255 ~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
+ .+-++ ..+.|-+++.. ...+.+.|......+.++.|.|.-..|+..+ .+ +-+..+|..++
T Consensus 194 k-F~t~S------------dDg~ikiWdf~--~~kee~vL~GHgwdVksvdWHP~kgLiasgs-kD---nlVKlWDprSg 254 (464)
T KOG0284|consen 194 K-FLTCS------------DDGTIKIWDFR--MPKEERVLRGHGWDVKSVDWHPTKGLIASGS-KD---NLVKLWDPRSG 254 (464)
T ss_pred e-eEEec------------CCCeEEEEecc--CCchhheeccCCCCcceeccCCccceeEEcc-CC---ceeEeecCCCc
Confidence 7 33332 22346666652 3445566766678899999999865555444 22 25777787764
Q ss_pred CCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 335 DVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 335 ~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
+-........+.+. .+.|+|+|.+|+..+
T Consensus 255 --~cl~tlh~HKntVl------~~~f~~n~N~Llt~s 283 (464)
T KOG0284|consen 255 --SCLATLHGHKNTVL------AVKFNPNGNWLLTGS 283 (464)
T ss_pred --chhhhhhhccceEE------EEEEcCCCCeeEEcc
Confidence 21111111111111 166999999888876
No 269
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=98.08 E-value=0.0002 Score=68.80 Aligned_cols=136 Identities=15% Similarity=0.179 Sum_probs=76.7
Q ss_pred EEEEEcC-CCCeeec---CCC----ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC--CCeeeeccCC
Q 004574 158 QLVLGSL-DGTAKDF---GTP----AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD--GKLVRELCDL 227 (744)
Q Consensus 158 ~l~~~~~-~G~~~~l---~~~----~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~--g~~~~~l~~~ 227 (744)
-|.++|. +|+.+.- -++ -....+.|||||.+|+-.-. ..|.++|.. |......+..
T Consensus 134 PIh~wdaftG~lraSy~~ydh~de~taAhsL~Fs~DGeqlfaGyk--------------rcirvFdt~RpGr~c~vy~t~ 199 (406)
T KOG2919|consen 134 PIHLWDAFTGKLRASYRAYDHQDEYTAAHSLQFSPDGEQLFAGYK--------------RCIRVFDTSRPGRDCPVYTTV 199 (406)
T ss_pred ceeeeeccccccccchhhhhhHHhhhhheeEEecCCCCeEeeccc--------------ceEEEeeccCCCCCCcchhhh
Confidence 4667788 8854422 111 23458899999999975532 357777752 3332222221
Q ss_pred CC-CCCCCcccCCccCC-CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEe-eeeccceece
Q 004574 228 PP-AEDIPVCYNSVREG-MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEIL-HKLDLRFRSV 304 (744)
Q Consensus 228 ~~-~~~~~~~~~~~~~~-~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l-~~~~~~~~~~ 304 (744)
.. .. ...+ +..+++||-..+.+++-+-. .+--||.. .++.+-.+ ....+.+..+
T Consensus 200 ~~~k~--------gq~giisc~a~sP~~~~~~a~gsY~----------q~~giy~~-----~~~~pl~llggh~gGvThL 256 (406)
T KOG2919|consen 200 TKGKF--------GQKGIISCFAFSPMDSKTLAVGSYG----------QRVGIYND-----DGRRPLQLLGGHGGGVTHL 256 (406)
T ss_pred hcccc--------cccceeeeeeccCCCCcceeeeccc----------ceeeeEec-----CCCCceeeecccCCCeeeE
Confidence 11 10 0112 44688999887645554311 11124443 33454444 4446789999
Q ss_pred eeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 305 SWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 305 ~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
.|.+||.+|+..+... ..|..+|+-.
T Consensus 257 ~~~edGn~lfsGaRk~---dkIl~WDiR~ 282 (406)
T KOG2919|consen 257 QWCEDGNKLFSGARKD---DKILCWDIRY 282 (406)
T ss_pred EeccCcCeecccccCC---CeEEEEeehh
Confidence 9999999988766322 2466667654
No 270
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.08 E-value=0.0013 Score=67.65 Aligned_cols=66 Identities=9% Similarity=0.032 Sum_probs=41.6
Q ss_pred eeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 294 LHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 294 l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
..-....++.++|||||++|+..+. + + -|.+.|.+. .+..-+.+.=++ |-.-++||||||+|+...
T Consensus 286 w~~~~g~in~f~FS~DG~~LA~VSq-D-G--fLRvF~fdt--~eLlg~mkSYFG------GLLCvcWSPDGKyIvtGG 351 (636)
T KOG2394|consen 286 WHIGEGSINEFAFSPDGKYLATVSQ-D-G--FLRIFDFDT--QELLGVMKSYFG------GLLCVCWSPDGKYIVTGG 351 (636)
T ss_pred eEeccccccceeEcCCCceEEEEec-C-c--eEEEeeccH--HHHHHHHHhhcc------ceEEEEEcCCccEEEecC
Confidence 3445668889999999999999872 2 3 355555554 111122221122 333377999999998875
No 271
>PF06028 DUF915: Alpha/beta hydrolase of unknown function (DUF915); InterPro: IPR010315 This family consists of bacterial proteins of unknown function, which are hydrolase-like.; PDB: 3LP5_A 3FLE_A 3DS8_A.
Probab=98.08 E-value=5.2e-05 Score=73.95 Aligned_cols=147 Identities=14% Similarity=0.043 Sum_probs=93.1
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC-----ceeEEEEccCCCCCCCCCCcccc----cccchhh
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH-----LFCCGIARSGSYNKTLTPFGFQT----EFRTLWE 647 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~-----~~~~~v~~~~~~~~~~~~~~~~~----~~~~~~~ 647 (744)
-+..++.+|.++..+ +++-++||||||..++.++..+.. .+..+|+++++++.......... ....|-.
T Consensus 88 wl~~vl~~L~~~Y~~--~~~N~VGHSmGg~~~~~yl~~~~~~~~~P~l~K~V~Ia~pfng~~~~~~~~~~~~~~~~gp~~ 165 (255)
T PF06028_consen 88 WLKKVLKYLKKKYHF--KKFNLVGHSMGGLSWTYYLENYGNDKNLPKLNKLVTIAGPFNGILGMNDDQNQNDLNKNGPKS 165 (255)
T ss_dssp HHHHHHHHHHHCC----SEEEEEEETHHHHHHHHHHHHCTTGTTS-EEEEEEEES--TTTTTCCSC-TTTT-CSTT-BSS
T ss_pred HHHHHHHHHHHhcCC--CEEeEEEECccHHHHHHHHHHhccCCCCcccceEEEeccccCccccccccchhhhhcccCCcc
Confidence 678888889888777 589999999999999999887522 57888889988775432110000 0000222
Q ss_pred cHHHHHhcCcc-cccCCCCCCEEEEeeC------CCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCC--CCcccCccccH
Q 004574 648 ATNVYIEMSPI-THANKIKKPILIIHGE------VDDKVGLFPMQAERFFDALKGHGALSRLVLLPF--EHHVYAARENV 718 (744)
Q Consensus 648 ~~~~~~~~~~~-~~~~~~~~P~l~i~G~------~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~--~~H~~~~~~~~ 718 (744)
..+.|..+-.. ..--.-.+.+|-|.|. .|-.|| ...+..+..-++......+-.++.| +.|.-. ...
T Consensus 166 ~~~~y~~l~~~~~~~~p~~i~VLnI~G~~~~g~~sDG~V~--~~Ss~sl~~L~~~~~~~Y~e~~v~G~~a~HS~L--heN 241 (255)
T PF06028_consen 166 MTPMYQDLLKNRRKNFPKNIQVLNIYGDLEDGSNSDGIVP--NASSLSLRYLLKNRAKSYQEKTVTGKDAQHSQL--HEN 241 (255)
T ss_dssp --HHHHHHHHTHGGGSTTT-EEEEEEEESBTTCSBTSSSB--HHHHCTHHHHCTTTSSEEEEEEEESGGGSCCGG--GCC
T ss_pred cCHHHHHHHHHHHhhCCCCeEEEEEecccCCCCCCCeEEe--HHHHHHHHHHhhcccCceEEEEEECCCCccccC--CCC
Confidence 22333332221 1111124669999998 899999 8888888877777667777777765 468643 234
Q ss_pred HHHHHHHHHHH
Q 004574 719 MHVIWETDRWL 729 (744)
Q Consensus 719 ~~~~~~~~~fl 729 (744)
..+.+.|.+||
T Consensus 242 ~~V~~~I~~FL 252 (255)
T PF06028_consen 242 PQVDKLIIQFL 252 (255)
T ss_dssp HHHHHHHHHHH
T ss_pred HHHHHHHHHHh
Confidence 47888888887
No 272
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.08 E-value=0.0027 Score=75.68 Aligned_cols=126 Identities=14% Similarity=0.080 Sum_probs=68.9
Q ss_pred eeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCC--CcccCCccCCCCccceecCCCc
Q 004574 178 TAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDI--PVCYNSVREGMRSISWRADKPS 255 (744)
Q Consensus 178 ~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~spDg~~ 255 (744)
..++++|++..|++....+ .+|++|+..++....+......... .......-..+.+++++|||+.
T Consensus 686 ~gVa~dp~~g~LyVad~~~------------~~I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~ 753 (1057)
T PLN02919 686 WDVCFEPVNEKVYIAMAGQ------------HQIWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKE 753 (1057)
T ss_pred eEEEEecCCCeEEEEECCC------------CeEEEEECCCCeEEEEecCCccccCCCCccccccccCccEEEEeCCCCE
Confidence 3678999887787764432 4799999877665544321000000 0000111123667999999987
Q ss_pred eEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeec----------------------cceeceeeccCCceE
Q 004574 256 TLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLD----------------------LRFRSVSWCDDSLAL 313 (744)
Q Consensus 256 ~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~----------------------~~~~~~~~SpDg~~l 313 (744)
||.+.. ....|.+++. +++....+..+. .....+++++||+.+
T Consensus 754 -LYVADs-----------~n~~Irv~D~---~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LY 818 (1057)
T PLN02919 754 -LYIADS-----------ESSSIRALDL---KTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIY 818 (1057)
T ss_pred -EEEEEC-----------CCCeEEEEEC---CCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEE
Confidence 666532 2234777776 333322221100 122467889999733
Q ss_pred EEeeeeeccceeEEEEcCCCC
Q 004574 314 VNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 314 ~~~~~~~~~~~~l~~~~~~~~ 334 (744)
+ +. ..+..|.++|.+++
T Consensus 819 V--AD--s~N~rIrviD~~tg 835 (1057)
T PLN02919 819 V--AD--SYNHKIKKLDPATK 835 (1057)
T ss_pred E--EE--CCCCEEEEEECCCC
Confidence 3 21 24557888998763
No 273
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=98.07 E-value=5.9e-05 Score=78.49 Aligned_cols=155 Identities=13% Similarity=0.184 Sum_probs=92.9
Q ss_pred CCCCCcccceeecC-CCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcccc-ccccceEEecCCc-EEEE
Q 004574 27 YPDGAKINFVSWSP-DGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLN-AVFGSFVWVNNST-LLIF 103 (744)
Q Consensus 27 ~~~~~~~~~p~~Sp-DG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~-~~~~~~~wspDg~-~l~~ 103 (744)
+-.|.-++...|.| |-++||..+. ++.-+||.+...|......|........ .-+..+.|.|=-. .|+.
T Consensus 624 l~Ngt~vtDl~WdPFD~~rLAVa~d--------dg~i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~ 695 (1012)
T KOG1445|consen 624 LFNGTLVTDLHWDPFDDERLAVATD--------DGQINLWRLTANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAV 695 (1012)
T ss_pred cccCceeeecccCCCChHHeeeccc--------CceEEEEEeccCCCCcccCCcceeeecccceEEEEEecchhhhHhhh
Confidence 33455677888988 7788998654 7778999998777665444422210000 0122334444221 1221
Q ss_pred EecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ceeeee
Q 004574 104 TIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AVYTAV 180 (744)
Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~~~~~ 180 (744)
.++ ...|-++|+ ++ ...+|..+ +.+.++
T Consensus 696 asy-------------------------------------------------d~Ti~lWDl~~~~~~~~l~gHtdqIf~~ 726 (1012)
T KOG1445|consen 696 ASY-------------------------------------------------DSTIELWDLANAKLYSRLVGHTDQIFGI 726 (1012)
T ss_pred hhc-------------------------------------------------cceeeeeehhhhhhhheeccCcCceeEE
Confidence 111 146777888 66 55566666 889999
Q ss_pred ccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEE
Q 004574 181 EPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWV 260 (744)
Q Consensus 181 ~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~ 260 (744)
+|||||++|+-...+. .|.+|++..++. .+-..++..+. . -..+.|.-||+. ++.+
T Consensus 727 AWSpdGr~~AtVcKDg-------------~~rVy~Prs~e~-pv~Eg~gpvgt--------R-gARi~wacdgr~-viv~ 782 (1012)
T KOG1445|consen 727 AWSPDGRRIATVCKDG-------------TLRVYEPRSREQ-PVYEGKGPVGT--------R-GARILWACDGRI-VIVV 782 (1012)
T ss_pred EECCCCcceeeeecCc-------------eEEEeCCCCCCC-ccccCCCCccC--------c-ceeEEEEecCcE-EEEe
Confidence 9999999999876553 789998876532 22222221100 0 224789999997 5555
Q ss_pred Ee
Q 004574 261 EA 262 (744)
Q Consensus 261 ~~ 262 (744)
.+
T Consensus 783 Gf 784 (1012)
T KOG1445|consen 783 GF 784 (1012)
T ss_pred cc
Confidence 43
No 274
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=98.05 E-value=6.6e-05 Score=76.02 Aligned_cols=221 Identities=10% Similarity=0.073 Sum_probs=137.4
Q ss_pred ceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccc
Q 004574 6 GIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICL 85 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~ 85 (744)
.|.+|++.+....-+..+|.=+.-+..++...++|||+.|..- ..-..|-|+|++.-..+--.......
T Consensus 441 cVKVWdis~pg~k~PvsqLdcl~rdnyiRSckL~pdgrtLivG----------GeastlsiWDLAapTprikaeltssa- 509 (705)
T KOG0639|consen 441 CVKVWDISQPGNKSPVSQLDCLNRDNYIRSCKLLPDGRTLIVG----------GEASTLSIWDLAAPTPRIKAELTSSA- 509 (705)
T ss_pred eEEEeeccCCCCCCccccccccCcccceeeeEecCCCceEEec----------cccceeeeeeccCCCcchhhhcCCcc-
Confidence 4778888774333344445434445578899999999999873 22345666677765544322222100
Q ss_pred cccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-
Q 004574 86 NAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL- 164 (744)
Q Consensus 86 ~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~- 164 (744)
+.--.++.|||.+..+..-. .+.|.++|+
T Consensus 510 -paCyALa~spDakvcFsccs-------------------------------------------------dGnI~vwDLh 539 (705)
T KOG0639|consen 510 -PACYALAISPDAKVCFSCCS-------------------------------------------------DGNIAVWDLH 539 (705)
T ss_pred -hhhhhhhcCCccceeeeecc-------------------------------------------------CCcEEEEEcc
Confidence 00125678999985443311 167888899
Q ss_pred CC-CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccC
Q 004574 165 DG-TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVRE 242 (744)
Q Consensus 165 ~G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~ 242 (744)
+- ..+++.-. .+...+..|+||.+|+-...+ +.+.-||+..+. ++.......
T Consensus 540 nq~~VrqfqGhtDGascIdis~dGtklWTGGlD-------------ntvRcWDlregr--qlqqhdF~S----------- 593 (705)
T KOG0639|consen 540 NQTLVRQFQGHTDGASCIDISKDGTKLWTGGLD-------------NTVRCWDLREGR--QLQQHDFSS----------- 593 (705)
T ss_pred cceeeecccCCCCCceeEEecCCCceeecCCCc-------------cceeehhhhhhh--hhhhhhhhh-----------
Confidence 55 67777655 778889999999999865433 367788876442 333332221
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
.+..+..+|.|.+ |+.-= .++.++++.. .+.+.-+|...+--+-++.|++-|+|++.+. .++
T Consensus 594 QIfSLg~cP~~dW-lavGM------------ens~vevlh~---skp~kyqlhlheScVLSlKFa~cGkwfvStG--kDn 655 (705)
T KOG0639|consen 594 QIFSLGYCPTGDW-LAVGM------------ENSNVEVLHT---SKPEKYQLHLHESCVLSLKFAYCGKWFVSTG--KDN 655 (705)
T ss_pred hheecccCCCccc-eeeec------------ccCcEEEEec---CCccceeecccccEEEEEEecccCceeeecC--chh
Confidence 1556788999998 44321 2345777776 4445556776777888999999999998765 223
Q ss_pred ceeEEEEcC
Q 004574 323 QTRTWLVCP 331 (744)
Q Consensus 323 ~~~l~~~~~ 331 (744)
-...|+..-
T Consensus 656 lLnawrtPy 664 (705)
T KOG0639|consen 656 LLNAWRTPY 664 (705)
T ss_pred hhhhccCcc
Confidence 334565553
No 275
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.04 E-value=0.00047 Score=74.73 Aligned_cols=157 Identities=13% Similarity=0.120 Sum_probs=84.1
Q ss_pred CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCC
Q 004574 167 TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMR 245 (744)
Q Consensus 167 ~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~ 245 (744)
+.-.+..+ ..+..++|.|-..+-+....- ..++.+|++...++..-++...- +.
T Consensus 401 ~CL~~F~HndfVTcVaFnPvDDryFiSGSL------------D~KvRiWsI~d~~Vv~W~Dl~~l-------------IT 455 (712)
T KOG0283|consen 401 ECLKVFSHNDFVTCVAFNPVDDRYFISGSL------------DGKVRLWSISDKKVVDWNDLRDL-------------IT 455 (712)
T ss_pred ceeeEEecCCeeEEEEecccCCCcEeeccc------------ccceEEeecCcCeeEeehhhhhh-------------he
Confidence 33333444 677788999977665544333 35889999888876655554322 66
Q ss_pred ccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCC--CCCceEeeee----ccceeceeeccCCc-eEEEeee
Q 004574 246 SISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAE--GEKPEILHKL----DLRFRSVSWCDDSL-ALVNETW 318 (744)
Q Consensus 246 ~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~--~~~~~~l~~~----~~~~~~~~~SpDg~-~l~~~~~ 318 (744)
.+.++|||++.|+=+ ..+.+.+++..+.+ ......+... ...+..+.+.|-.. .++.+++
T Consensus 456 Avcy~PdGk~avIGt-------------~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vLVTSn 522 (712)
T KOG0283|consen 456 AVCYSPDGKGAVIGT-------------FNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVLVTSN 522 (712)
T ss_pred eEEeccCCceEEEEE-------------eccEEEEEEccCCeEEEeeeEeeccCccccCceeeeeEecCCCCCeEEEecC
Confidence 789999999633322 22334444431110 0000001100 22577777775433 4665553
Q ss_pred eeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 319 YKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 319 ~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
..+|.++|... .......++.... ......+++.||++|+..+
T Consensus 523 ----DSrIRI~d~~~--~~lv~KfKG~~n~----~SQ~~Asfs~Dgk~IVs~s 565 (712)
T KOG0283|consen 523 ----DSRIRIYDGRD--KDLVHKFKGFRNT----SSQISASFSSDGKHIVSAS 565 (712)
T ss_pred ----CCceEEEeccc--hhhhhhhcccccC----CcceeeeEccCCCEEEEee
Confidence 23688888644 2222223322111 0112244788999999987
No 276
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=98.02 E-value=0.00079 Score=64.38 Aligned_cols=108 Identities=7% Similarity=0.132 Sum_probs=67.6
Q ss_pred CCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceec
Q 004574 172 GTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRA 251 (744)
Q Consensus 172 ~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~sp 251 (744)
.+...+..++|.|.|.+|+....+. .+.+||+++-+...-...... ....+..+.+|+
T Consensus 214 qd~~~vrsiSfHPsGefllvgTdHp-------------~~rlYdv~T~QcfvsanPd~q---------ht~ai~~V~Ys~ 271 (430)
T KOG0640|consen 214 QDTEPVRSISFHPSGEFLLVGTDHP-------------TLRLYDVNTYQCFVSANPDDQ---------HTGAITQVRYSS 271 (430)
T ss_pred hccceeeeEeecCCCceEEEecCCC-------------ceeEEeccceeEeeecCcccc---------cccceeEEEecC
Confidence 3337788999999999999887664 578888887765543332111 112366788999
Q ss_pred CCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEee--eeccceeceeeccCCceEEEee
Q 004574 252 DKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEILH--KLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 252 Dg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~--~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
.|+ ||.+...+ +.|-++|- -++. .+.+- -+...+.+..|+.+|++|+.+.
T Consensus 272 t~~--lYvTaSkD-----------G~IklwDG---VS~rCv~t~~~AH~gsevcSa~Ftkn~kyiLsSG 324 (430)
T KOG0640|consen 272 TGS--LYVTASKD-----------GAIKLWDG---VSNRCVRTIGNAHGGSEVCSAVFTKNGKYILSSG 324 (430)
T ss_pred Ccc--EEEEeccC-----------CcEEeecc---ccHHHHHHHHhhcCCceeeeEEEccCCeEEeecC
Confidence 997 67665332 23555553 1111 11111 1234577889999999998765
No 277
>KOG3253 consensus Predicted alpha/beta hydrolase [General function prediction only]
Probab=98.02 E-value=0.0001 Score=77.05 Aligned_cols=101 Identities=19% Similarity=0.245 Sum_probs=73.8
Q ss_pred CCCCCcEEEEEechHHHHHHHHHHhCCC-ceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCE
Q 004574 590 VADPSRIAVGGHSYGAFMTAHLLAHAPH-LFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPI 668 (744)
Q Consensus 590 ~~d~~~i~l~G~S~GG~~a~~~~~~~p~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~ 668 (744)
.....+|.|+|+|||+.++.+....+.+ .+.++||+.-..+....+-+...+ .+-.++.|+
T Consensus 246 efpha~IiLvGrsmGAlVachVSpsnsdv~V~~vVCigypl~~vdgprgirDE------------------~Lldmk~PV 307 (784)
T KOG3253|consen 246 EFPHAPIILVGRSMGALVACHVSPSNSDVEVDAVVCIGYPLDTVDGPRGIRDE------------------ALLDMKQPV 307 (784)
T ss_pred cCCCCceEEEecccCceeeEEeccccCCceEEEEEEecccccCCCcccCCcch------------------hhHhcCCce
Confidence 3445789999999998888877765533 478888887665432221111111 133468999
Q ss_pred EEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccC
Q 004574 669 LIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYA 713 (744)
Q Consensus 669 l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~ 713 (744)
|++.|.+|..++ ....+++.++++. .++++++.+++|.+.
T Consensus 308 LFV~Gsnd~mcs--pn~ME~vreKMqA---~~elhVI~~adhsma 347 (784)
T KOG3253|consen 308 LFVIGSNDHMCS--PNSMEEVREKMQA---EVELHVIGGADHSMA 347 (784)
T ss_pred EEEecCCcccCC--HHHHHHHHHHhhc---cceEEEecCCCcccc
Confidence 999999999988 8889999888874 468999999999875
No 278
>PF12048 DUF3530: Protein of unknown function (DUF3530); InterPro: IPR022529 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 272 to 336 amino acids in length. These proteins are distantly related to alpa/beta hydrolases so they may act as enzymes.
Probab=98.02 E-value=0.00025 Score=72.09 Aligned_cols=206 Identities=17% Similarity=0.144 Sum_probs=122.0
Q ss_pred eEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEe-
Q 004574 482 KEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLA- 560 (744)
Q Consensus 482 ~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~- 560 (744)
.+..++.. ++.+.. .+|+|... ..++.+||++||.|.. -.+.+........|..+|+.+++
T Consensus 62 ~e~~~L~~-~~~~fl-aL~~~~~~----~~~~G~vIilp~~g~~------------~d~p~~i~~LR~~L~~~GW~Tlsi 123 (310)
T PF12048_consen 62 DEVQWLQA-GEERFL-ALWRPANS----AKPQGAVIILPDWGEH------------PDWPGLIAPLRRELPDHGWATLSI 123 (310)
T ss_pred hhcEEeec-CCEEEE-EEEecccC----CCCceEEEEecCCCCC------------CCcHhHHHHHHHHhhhcCceEEEe
Confidence 34444443 444444 58888652 3357899999996421 12222233455677889999997
Q ss_pred -cCCCC-----CC--------CCCCC-------------------------ChHHHHHHHHHHHHHcCCCCCCcEEEEEe
Q 004574 561 -GPSIP-----II--------GEGDK-------------------------LPNDSAEAAVEEVVRRGVADPSRIAVGGH 601 (744)
Q Consensus 561 -~~~~~-----~~--------g~g~~-------------------------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~ 601 (744)
.+... .. ..+.. ....-+.+++.++.+++. .+|+|+||
T Consensus 124 t~P~~~~~~~p~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ari~Aa~~~~~~~~~---~~ivlIg~ 200 (310)
T PF12048_consen 124 TLPDPAPPASPNRATEAEEVPSAGDQQLSQPSDEPSPASAQEAEAREAYEERLFARIEAAIAFAQQQGG---KNIVLIGH 200 (310)
T ss_pred cCCCcccccCCccCCCCCCCCCCCCCCcCCCCCCCccccccHhHHhHHHHHHHHHHHHHHHHHHHhcCC---ceEEEEEe
Confidence 11100 00 00000 112256677777777653 45999999
Q ss_pred chHHHHHHHHHHhCCC-ceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCC
Q 004574 602 SYGAFMTAHLLAHAPH-LFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVG 680 (744)
Q Consensus 602 S~GG~~a~~~~~~~p~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~ 680 (744)
+.|+++++.++...+. .+.++|.+++....... .......+.+++.|+|=|++.....+
T Consensus 201 G~gA~~~~~~la~~~~~~~daLV~I~a~~p~~~~-------------------n~~l~~~la~l~iPvLDi~~~~~~~~- 260 (310)
T PF12048_consen 201 GTGAGWAARYLAEKPPPMPDALVLINAYWPQPDR-------------------NPALAEQLAQLKIPVLDIYSADNPAS- 260 (310)
T ss_pred ChhHHHHHHHHhcCCCcccCeEEEEeCCCCcchh-------------------hhhHHHHhhccCCCEEEEecCCChHH-
Confidence 9999999999988754 57899999986321110 01122335678999999998883322
Q ss_pred CCHHHHHHHHHHHHhCC-CcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 681 LFPMQAERFFDALKGHG-ALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 681 ~~~~~~~~~~~~l~~~~-~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
...+.+-....++.. ...+-.-+++..|.... ....+.++|..||.++
T Consensus 261 --~~~a~~R~~~a~r~~~~~YrQ~~L~~~~~~~~~--~~~~l~~rIrGWL~~~ 309 (310)
T PF12048_consen 261 --QQTAKQRKQAAKRNKKPDYRQIQLPGLPDNPSG--WQEQLLRRIRGWLKRH 309 (310)
T ss_pred --HHHHHHHHHHHHhccCCCceeEecCCCCCChhh--HHHHHHHHHHHHHHhh
Confidence 333333333333333 34566666666664432 1223889999999875
No 279
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=97.99 E-value=0.0022 Score=69.62 Aligned_cols=65 Identities=15% Similarity=0.054 Sum_probs=46.0
Q ss_pred eeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccC
Q 004574 358 MTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLN 437 (744)
Q Consensus 358 ~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d 437 (744)
+++||||+||+..++++ .|..||+.++..---..-+ ..-..+++||+
T Consensus 582 ~~FS~DgrWlisasmD~---------------------tIr~wDlpt~~lID~~~vd------------~~~~sls~SPn 628 (910)
T KOG1539|consen 582 MTFSPDGRWLISASMDS---------------------TIRTWDLPTGTLIDGLLVD------------SPCTSLSFSPN 628 (910)
T ss_pred eEeCCCCcEEEEeecCC---------------------cEEEEeccCcceeeeEecC------------CcceeeEECCC
Confidence 67999999999987542 3999999998764222211 11123689999
Q ss_pred CCEEEEEEecCCCCceEEEEE
Q 004574 438 QLKILTSKESKTEITQYHILS 458 (744)
Q Consensus 438 ~~~~~~~~~~~~~~~~i~~~~ 458 (744)
|..|+.+..+.+ .||+|.
T Consensus 629 gD~LAT~Hvd~~---gIylWs 646 (910)
T KOG1539|consen 629 GDFLATVHVDQN---GIYLWS 646 (910)
T ss_pred CCEEEEEEecCc---eEEEEE
Confidence 999998886544 377776
No 280
>COG1770 PtrB Protease II [Amino acid transport and metabolism]
Probab=97.98 E-value=0.0045 Score=66.47 Aligned_cols=267 Identities=16% Similarity=0.165 Sum_probs=142.6
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.....+.|||++.+||... ..++.+-.|.+.++++|+...-.-.. ....++|.+|++.|+|+..++...
T Consensus 130 ~Lg~~~~s~D~~~la~s~D-----~~G~e~y~lr~kdL~tg~~~~d~i~~------~~~~~~Wa~d~~~lfYt~~d~~~r 198 (682)
T COG1770 130 SLGAASISPDHNLLAYSVD-----VLGDEQYTLRFKDLATGEELPDEITN------TSGSFAWAADGKTLFYTRLDENHR 198 (682)
T ss_pred eeeeeeeCCCCceEEEEEe-----cccccEEEEEEEecccccccchhhcc------cccceEEecCCCeEEEEEEcCCCC
Confidence 4567899999999999864 33466788999999998865542111 245789999999999997653332
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC---CeeecCCC---ceeeeeccCCC
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG---TAKDFGTP---AVYTAVEPSPD 185 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G---~~~~l~~~---~~~~~~~~SpD 185 (744)
. .+||.-.+.+ ..+.|-+. .-...+.-|.+
T Consensus 199 p--------------------------------------------~kv~~h~~gt~~~~d~lvyeE~d~~f~~~v~~s~s 234 (682)
T COG1770 199 P--------------------------------------------DKVWRHRLGTPGSSDELVYEEKDDRFFLSVGRSRS 234 (682)
T ss_pred c--------------------------------------------ceEEEEecCCCCCcceEEEEcCCCcEEEEeeeccC
Confidence 1 5777777722 33333322 44446666777
Q ss_pred CceEEEEEeeCCcccccccCCCcceEEEEeCCCC--eeeeccCCCCCCCCCcccCCccCCCCccceecC--CCceEEEEE
Q 004574 186 QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK--LVRELCDLPPAEDIPVCYNSVREGMRSISWRAD--KPSTLYWVE 261 (744)
Q Consensus 186 G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spD--g~~~l~~~~ 261 (744)
.++|+...... ...++++.+.+.. +.+.+...+ . ++.++.+ |.. .+..+
T Consensus 235 ~~yi~i~~~~~----------~tsE~~ll~a~~p~~~p~vv~pr~--~--------------g~eY~~eh~~d~-f~i~s 287 (682)
T COG1770 235 EAYIVISLGSH----------ITSEVRLLDADDPEAEPKVVLPRE--N--------------GVEYSVEHGGDR-FYILS 287 (682)
T ss_pred CceEEEEcCCC----------cceeEEEEecCCCCCceEEEEEcC--C--------------CcEEeeeecCcE-EEEEe
Confidence 78887775332 2357777776533 223332221 0 1122222 333 33333
Q ss_pred eecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceee
Q 004574 262 AQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVL 341 (744)
Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l 341 (744)
+.++... .|+..++ .......+.+........--.++-=..+|+... ...+..+|++.+..+++.....+
T Consensus 288 N~~gknf--------~l~~ap~-~~~~~~w~~~I~h~~~~~l~~~~~f~~~lVl~e-R~~glp~v~v~~~~~~~~~~i~f 357 (682)
T COG1770 288 NADGKNF--------KLVRAPV-SADKSNWRELIPHREDVRLEGVDLFADHLVLLE-RQEGLPRVVVRDRKTGEERGIAF 357 (682)
T ss_pred cCCCcce--------EEEEccC-CCChhcCeeeeccCCCceeeeeeeeccEEEEEe-cccCCceEEEEecCCCceeeEEe
Confidence 3322111 3555544 112223344444433322223334456666655 44466788888876642221222
Q ss_pred eccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeec
Q 004574 342 FDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWES 413 (744)
Q Consensus 342 ~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~ 413 (744)
.+..+..... ++ -.++...|.+.. ... -....++-+|+.+++.+.+-+.
T Consensus 358 ~~~ay~~~l~--~~----~e~~s~~lR~~y--------------sS~---ttP~~~~~~dm~t~er~~Lkqq 406 (682)
T COG1770 358 DDEAYSAGLS--GN----PEFDSDRLRYSY--------------SSM---TTPATLFDYDMATGERTLLKQQ 406 (682)
T ss_pred cchhhhcccc--CC----CCCCCccEEEEe--------------ecc---cccceeEEeeccCCcEEEEEec
Confidence 2222211000 11 123444454443 222 2344699999999988877553
No 281
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=97.98 E-value=0.00082 Score=66.79 Aligned_cols=130 Identities=18% Similarity=0.210 Sum_probs=78.7
Q ss_pred ceeEeecCCCCCCC-CceeeecCCCCC-cccceeecCCCCeEEEeeecccccccCCCceeEEEEE------CCC-----C
Q 004574 6 GIGIHRLLPDDSLG-PEKEVHGYPDGA-KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIAD------AET-----G 72 (744)
Q Consensus 6 ~~~~~~~~~~~~~g-~~~~l~~~~~~~-~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~------~~g-----g 72 (744)
.||.++...++.++ ...-+..|.... .+...+|||+|+.+|-... ++..-||... .++ .
T Consensus 39 riW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN~vRf~p~gelLASg~D--------~g~v~lWk~~~~~~~~~d~e~~~~k 110 (434)
T KOG1009|consen 39 RIWKVNRSEPGGGDMKVEYLSSLSRHTRAVNVVRFSPDGELLASGGD--------GGEVFLWKQGDVRIFDADTEADLNK 110 (434)
T ss_pred eeeeeeecCCCCCceeEEEeecccCCcceeEEEEEcCCcCeeeecCC--------CceEEEEEecCcCCccccchhhhCc
Confidence 45555554422211 223333344433 6788999999999998532 4455566554 222 1
Q ss_pred ce---eccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhh
Q 004574 73 EA---KPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDES 149 (744)
Q Consensus 73 ~~---~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (744)
+. ++.-.+ ....+-.++|+||+..+.+.+.+
T Consensus 111 e~w~v~k~lr~----h~~diydL~Ws~d~~~l~s~s~d------------------------------------------ 144 (434)
T KOG1009|consen 111 EKWVVKKVLRG----HRDDIYDLAWSPDSNFLVSGSVD------------------------------------------ 144 (434)
T ss_pred cceEEEEEecc----cccchhhhhccCCCceeeeeecc------------------------------------------
Confidence 10 001000 01134578999999999887432
Q ss_pred ccceeeeeEEEEEcC-CCCeee-cCCC-ceeeeeccCCCCceEEEEEeeC
Q 004574 150 LFDYYTTAQLVLGSL-DGTAKD-FGTP-AVYTAVEPSPDQKYVLITSMHR 196 (744)
Q Consensus 150 ~~~~~~~~~l~~~~~-~G~~~~-l~~~-~~~~~~~~SpDG~~i~~~~~~~ 196 (744)
+.++.+|+ .|.... +.++ ..++..+|.|=+++|+-.+..+
T Consensus 145 -------ns~~l~Dv~~G~l~~~~~dh~~yvqgvawDpl~qyv~s~s~dr 187 (434)
T KOG1009|consen 145 -------NSVRLWDVHAGQLLAILDDHEHYVQGVAWDPLNQYVASKSSDR 187 (434)
T ss_pred -------ceEEEEEeccceeEeeccccccccceeecchhhhhhhhhccCc
Confidence 46778888 885444 4444 7788999999999998776654
No 282
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=97.97 E-value=0.0005 Score=70.53 Aligned_cols=159 Identities=15% Similarity=0.176 Sum_probs=98.8
Q ss_pred eEEEEEcCCCCeeecCCC--ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCC
Q 004574 157 AQLVLGSLDGTAKDFGTP--AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIP 234 (744)
Q Consensus 157 ~~l~~~~~~G~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~ 234 (744)
+.+.+++-.|..++-.+. +.+..-.|+|||.-|+-. .++ ..|-+|...|.....+.+..-.
T Consensus 85 Gkf~il~k~~rVE~sv~AH~~A~~~gRW~~dGtgLlt~-GED------------G~iKiWSrsGMLRStl~Q~~~~---- 147 (737)
T KOG1524|consen 85 GRFVILNKSARVERSISAHAAAISSGRWSPDGAGLLTA-GED------------GVIKIWSRSGMLRSTVVQNEES---- 147 (737)
T ss_pred ceEEEecccchhhhhhhhhhhhhhhcccCCCCceeeee-cCC------------ceEEEEeccchHHHHHhhcCce----
Confidence 345555555544443333 667788999999988754 332 3788899888866655544322
Q ss_pred cccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEE
Q 004574 235 VCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALV 314 (744)
Q Consensus 235 ~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~ 314 (744)
++.++|.|+... ++|+. .+++++-++. ....+.+.-.+++-+-.+.|++....|+
T Consensus 148 ---------v~c~~W~p~S~~-vl~c~-------------g~h~~IKpL~--~n~k~i~WkAHDGiiL~~~W~~~s~lI~ 202 (737)
T KOG1524|consen 148 ---------IRCARWAPNSNS-IVFCQ-------------GGHISIKPLA--ANSKIIRWRAHDGLVLSLSWSTQSNIIA 202 (737)
T ss_pred ---------eEEEEECCCCCc-eEEec-------------CCeEEEeecc--cccceeEEeccCcEEEEeecCcccccee
Confidence 678899999999 88883 3357776663 2233334444677788899999998888
Q ss_pred EeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEE
Q 004574 315 NETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAK 370 (744)
Q Consensus 315 ~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~ 370 (744)
... ...+..++|..+ ..|.....++ +| ...++|.||..+++..
T Consensus 203 sgG----ED~kfKvWD~~G-----~~Lf~S~~~e---y~-ITSva~npd~~~~v~S 245 (737)
T KOG1524|consen 203 SGG----EDFRFKIWDAQG-----ANLFTSAAEE---YA-ITSVAFNPEKDYLLWS 245 (737)
T ss_pred ecC----CceeEEeecccC-----cccccCChhc---cc-eeeeeeccccceeeee
Confidence 654 223455667555 3444433332 11 1226799995444443
No 283
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=97.97 E-value=0.00041 Score=70.47 Aligned_cols=234 Identities=15% Similarity=0.179 Sum_probs=137.9
Q ss_pred CceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCcccc--ccccceEEec
Q 004574 20 PEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLN--AVFGSFVWVN 96 (744)
Q Consensus 20 ~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~--~~~~~~~wsp 96 (744)
.++++..|.+|.-+-...+|..-+++.- .+++-|-|+|+.+-. ..++.+.+ +++ ........+|
T Consensus 409 harq~~tL~HGEvVcAvtIS~~trhVyT-----------gGkgcVKVWdis~pg~k~PvsqLd--cl~rdnyiRSckL~p 475 (705)
T KOG0639|consen 409 HARQINTLAHGEVVCAVTISNPTRHVYT-----------GGKGCVKVWDISQPGNKSPVSQLD--CLNRDNYIRSCKLLP 475 (705)
T ss_pred hHHhhhhhccCcEEEEEEecCCcceeEe-----------cCCCeEEEeeccCCCCCCcccccc--ccCcccceeeeEecC
Confidence 3455555777776677888888777644 444555666766432 22332222 111 1244667899
Q ss_pred CCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCe----eecC
Q 004574 97 NSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTA----KDFG 172 (744)
Q Consensus 97 Dg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~----~~l~ 172 (744)
||+.|+..... +.|-++|+.-.. .+|+
T Consensus 476 dgrtLivGGea-------------------------------------------------stlsiWDLAapTprikaelt 506 (705)
T KOG0639|consen 476 DGRTLIVGGEA-------------------------------------------------STLSIWDLAAPTPRIKAELT 506 (705)
T ss_pred CCceEEecccc-------------------------------------------------ceeeeeeccCCCcchhhhcC
Confidence 99999875211 467777884422 2344
Q ss_pred CC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceec
Q 004574 173 TP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRA 251 (744)
Q Consensus 173 ~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~sp 251 (744)
.. .....++.|||.+ |.|..... ..|.+||+......+-.. ....|...|..|+
T Consensus 507 ssapaCyALa~spDak-vcFsccsd------------GnI~vwDLhnq~~Vrqfq------------GhtDGascIdis~ 561 (705)
T KOG0639|consen 507 SSAPACYALAISPDAK-VCFSCCSD------------GNIAVWDLHNQTLVRQFQ------------GHTDGASCIDISK 561 (705)
T ss_pred CcchhhhhhhcCCccc-eeeeeccC------------CcEEEEEcccceeeeccc------------CCCCCceeEEecC
Confidence 44 4455788999998 66665543 379999987654333222 2223466788999
Q ss_pred CCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeecc--ceeceeeccCCceEEEeeeeeccceeEEEE
Q 004574 252 DKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDL--RFRSVSWCDDSLALVNETWYKTSQTRTWLV 329 (744)
Q Consensus 252 Dg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~--~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~ 329 (744)
||.+ | |+. +.+. .+..+|++ +.+++...+. .+.++..+|.|.|++..- .+.+++++
T Consensus 562 dGtk-l-WTG---GlDn--------tvRcWDlr-----egrqlqqhdF~SQIfSLg~cP~~dWlavGM----ens~vevl 619 (705)
T KOG0639|consen 562 DGTK-L-WTG---GLDN--------TVRCWDLR-----EGRQLQQHDFSSQIFSLGYCPTGDWLAVGM----ENSNVEVL 619 (705)
T ss_pred CCce-e-ecC---CCcc--------ceeehhhh-----hhhhhhhhhhhhhheecccCCCccceeeec----ccCcEEEE
Confidence 9986 3 442 1111 35666662 3344444433 466777889999999876 33467788
Q ss_pred cCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 330 CPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 330 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
...+. +.-+|..+.. ....+.|++=|++++...
T Consensus 620 h~skp--~kyqlhlheS-------cVLSlKFa~cGkwfvStG 652 (705)
T KOG0639|consen 620 HTSKP--EKYQLHLHES-------CVLSLKFAYCGKWFVSTG 652 (705)
T ss_pred ecCCc--cceeeccccc-------EEEEEEecccCceeeecC
Confidence 76653 3334432211 112255888999887765
No 284
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.95 E-value=0.011 Score=70.72 Aligned_cols=229 Identities=9% Similarity=0.031 Sum_probs=116.2
Q ss_pred eeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCC-Ccc---cCCccCCCCccceecC
Q 004574 177 YTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDI-PVC---YNSVREGMRSISWRAD 252 (744)
Q Consensus 177 ~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~-~~~---~~~~~~~~~~~~~spD 252 (744)
..+++++++|..|++....+ ..|.+++..++.++.+......... ... ....-..+.+++++|+
T Consensus 626 P~GIavd~~gn~LYVaDt~n------------~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~ 693 (1057)
T PLN02919 626 PQGLAYNAKKNLLYVADTEN------------HALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPV 693 (1057)
T ss_pred CcEEEEeCCCCEEEEEeCCC------------ceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecC
Confidence 45788999988777664332 3688888887777666432100000 000 0000112557899996
Q ss_pred CCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee---------------ccceeceeeccCCceEEEee
Q 004574 253 KPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL---------------DLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 253 g~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~---------------~~~~~~~~~SpDg~~l~~~~ 317 (744)
+.. ||.+.. ...+|++++. .++....+... -.....++++|||+.|+++.
T Consensus 694 ~g~-LyVad~-----------~~~~I~v~d~---~~g~v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVAD 758 (1057)
T PLN02919 694 NEK-VYIAMA-----------GQHQIWEYNI---SDGVTRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIAD 758 (1057)
T ss_pred CCe-EEEEEC-----------CCCeEEEEEC---CCCeEEEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEE
Confidence 654 555521 2234666665 33322211100 11234588999999877654
Q ss_pred eeeccceeEEEEcCCCCCCcceeeeccc---------ccc--------ccCCCCCCceeeCCCCCeEEEEeeecCCcceE
Q 004574 318 WYKTSQTRTWLVCPGSKDVAPRVLFDRV---------FEN--------VYSDPGSPMMTRTSTGTNVIAKIKKENDEQIY 380 (744)
Q Consensus 318 ~~~~~~~~l~~~~~~~~~~~~~~l~~~~---------~~~--------~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~ 380 (744)
.....|.++|++++ ..+.+.... ++. ....|. .+++++||. |++..
T Consensus 759 ---s~n~~Irv~D~~tg--~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~--Gvavd~dG~-LYVAD--------- 821 (1057)
T PLN02919 759 ---SESSSIRALDLKTG--GSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPL--GVLCAKDGQ-IYVAD--------- 821 (1057)
T ss_pred ---CCCCeEEEEECCCC--cEEEEEecccccCcccccccCCCCchhhhhccCCc--eeeEeCCCc-EEEEE---------
Confidence 23457999998764 222221110 000 000111 134667775 32221
Q ss_pred EEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhh--eeeeecCCcceecccCCCEEEEEEecCCCCceEEEEE
Q 004574 381 ILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFET--AVALVFGQGEEDINLNQLKILTSKESKTEITQYHILS 458 (744)
Q Consensus 381 ~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~ 458 (744)
.....|.+||..++....+......-+.+. ..........+++++||+. |...+.+ ..|.++|
T Consensus 822 -----------s~N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~l--yVaDt~N--n~Irvid 886 (1057)
T PLN02919 822 -----------SYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRL--FVADTNN--SLIRYLD 886 (1057)
T ss_pred -----------CCCCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCCCE--EEEECCC--CEEEEEE
Confidence 223459999998887776653221000000 0000112334789999973 3333323 3688999
Q ss_pred CCCCce
Q 004574 459 WPLKKS 464 (744)
Q Consensus 459 ~~~g~~ 464 (744)
+.+++.
T Consensus 887 ~~~~~~ 892 (1057)
T PLN02919 887 LNKGEA 892 (1057)
T ss_pred CCCCcc
Confidence 988765
No 285
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=97.95 E-value=0.0042 Score=63.56 Aligned_cols=149 Identities=10% Similarity=0.119 Sum_probs=84.8
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
.+...++-.|+...++-.... ..+.+|+ +-+. -.++...+ .....|.|.|
T Consensus 369 delwgla~hps~~q~~T~gqd-------------k~v~lW~-~~k~~wt~~~~d~---------------~~~~~fhpsg 419 (626)
T KOG2106|consen 369 DELWGLATHPSKNQLLTCGQD-------------KHVRLWN-DHKLEWTKIIEDP---------------AECADFHPSG 419 (626)
T ss_pred cceeeEEcCCChhheeeccCc-------------ceEEEcc-CCceeEEEEecCc---------------eeEeeccCcc
Confidence 556678888887766544333 3677787 2111 11122221 3356788888
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
. |+.-. ..++.++++. ++.....+-.....++-+.+||||..|+..+- ++..-||+++.++
T Consensus 420 -~-va~Gt------------~~G~w~V~d~---e~~~lv~~~~d~~~ls~v~ysp~G~~lAvgs~--d~~iyiy~Vs~~g 480 (626)
T KOG2106|consen 420 -V-VAVGT------------ATGRWFVLDT---ETQDLVTIHTDNEQLSVVRYSPDGAFLAVGSH--DNHIYIYRVSANG 480 (626)
T ss_pred -e-EEEee------------ccceEEEEec---ccceeEEEEecCCceEEEEEcCCCCEEEEecC--CCeEEEEEECCCC
Confidence 3 44432 2345788887 44444444434667788899999999998772 1333455555444
Q ss_pred CCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEE
Q 004574 334 KDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILL 383 (744)
Q Consensus 334 ~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~ 383 (744)
..-..+-...... -..+.||+|+++|...+ .+...+++
T Consensus 481 --~~y~r~~k~~gs~------ithLDwS~Ds~~~~~~S----~d~eiLyW 518 (626)
T KOG2106|consen 481 --RKYSRVGKCSGSP------ITHLDWSSDSQFLVSNS----GDYEILYW 518 (626)
T ss_pred --cEEEEeeeecCce------eEEeeecCCCceEEecc----CceEEEEE
Confidence 3333333322211 01156999999998876 44555554
No 286
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=97.91 E-value=0.00095 Score=74.45 Aligned_cols=94 Identities=12% Similarity=0.057 Sum_probs=63.2
Q ss_pred eEEEeeecccccccCCCceeEEEEECCCCceecc-ccCCCccccccccceEEecCCcEEEE-EecCCCCCCCCCCCCCCC
Q 004574 44 RIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPL-FESPDICLNAVFGSFVWVNNSTLLIF-TIPSSRRDPPKKTMVPLG 121 (744)
Q Consensus 44 ~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~l-t~~~~~~~~~~~~~~~wspDg~~l~~-~~~~~~~~~~~~~~~~~~ 121 (744)
+|||+.. ...+|.++|.+|..++.+ +.... .+..|.|||||+.|+| ++...-.
T Consensus 320 kiAfv~~---------~~~~L~~~D~dG~n~~~ve~~~~~-----~i~sP~~SPDG~~vAY~ts~e~~~----------- 374 (912)
T TIGR02171 320 KLAFRND---------VTGNLAYIDYTKGASRAVEIEDTI-----SVYHPDISPDGKKVAFCTGIEGLP----------- 374 (912)
T ss_pred eEEEEEc---------CCCeEEEEecCCCCceEEEecCCC-----ceecCcCCCCCCEEEEEEeecCCC-----------
Confidence 6899753 123999999999999988 65554 3567799999999999 5332100
Q ss_pred CeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC--CC-CeeecCCC-ceeeeeccCCCCc-eEEEEEee
Q 004574 122 PKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL--DG-TAKDFGTP-AVYTAVEPSPDQK-YVLITSMH 195 (744)
Q Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~G-~~~~l~~~-~~~~~~~~SpDG~-~i~~~~~~ 195 (744)
+...||+.++ +| ...+|.-+ .-+.+....+.|. -|+|.++.
T Consensus 375 ---------------------------------g~s~vYv~~L~t~~~~~vkl~ve~aaiprwrv~e~gdt~ivyv~~a 420 (912)
T TIGR02171 375 ---------------------------------GKSSVYVRNLNASGSGLVKLPVENAAIPRWRVLENGDTVIVYVSDA 420 (912)
T ss_pred ---------------------------------CCceEEEEehhccCCCceEeecccccccceEecCCCCeEEEEEcCC
Confidence 1157999999 44 55555434 5565666778775 45555543
No 287
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=97.91 E-value=0.0073 Score=70.69 Aligned_cols=83 Identities=8% Similarity=0.072 Sum_probs=51.7
Q ss_pred ccceeceeeccCCceEEEeeeeec-c-ceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecC
Q 004574 298 DLRFRSVSWCDDSLALVNETWYKT-S-QTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKEN 375 (744)
Q Consensus 298 ~~~~~~~~~SpDg~~l~~~~~~~~-~-~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~ 375 (744)
+.....++|-.||++++.++-... + ...|.+++-+| . .+-+...... -...++|.|.|..|+.... -.
T Consensus 209 dd~~~~ISWRGDG~yFAVss~~~~~~~~R~iRVy~ReG---~-L~stSE~v~g-----Le~~l~WrPsG~lIA~~q~-~~ 278 (928)
T PF04762_consen 209 DDGRVRISWRGDGEYFAVSSVEPETGSRRVIRVYSREG---E-LQSTSEPVDG-----LEGALSWRPSGNLIASSQR-LP 278 (928)
T ss_pred CCCceEEEECCCCcEEEEEEEEcCCCceeEEEEECCCc---e-EEeccccCCC-----ccCCccCCCCCCEEEEEEE-cC
Confidence 334556899999999999874222 3 46677777654 1 2222222221 1223679999999888875 33
Q ss_pred CcceEEEEccCCCCC
Q 004574 376 DEQIYILLNGRGFTP 390 (744)
Q Consensus 376 ~~~~~~~~~~~g~~~ 390 (744)
+...+.++.++|..+
T Consensus 279 ~~~~VvFfErNGLrh 293 (928)
T PF04762_consen 279 DRHDVVFFERNGLRH 293 (928)
T ss_pred CCcEEEEEecCCcEe
Confidence 456677777777533
No 288
>PF11339 DUF3141: Protein of unknown function (DUF3141); InterPro: IPR024501 This family of proteins appears to be predominantly expressed in Proteobacteria. Their function is unknown.
Probab=97.90 E-value=0.00032 Score=72.75 Aligned_cols=50 Identities=22% Similarity=0.320 Sum_probs=41.3
Q ss_pred ccCCCCCCEEEEeeCCCCCCCCCHHHHHHH-------HHHHHhCCCcEEEEEeCCCCcc
Q 004574 660 HANKIKKPILIIHGEVDDKVGLFPMQAERF-------FDALKGHGALSRLVLLPFEHHV 711 (744)
Q Consensus 660 ~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~-------~~~l~~~~~~~~~~~~~~~~H~ 711 (744)
.+++|.+|++++.|..|.+.| ++++..+ .+.++..|..+.+.+-+..||-
T Consensus 292 DLr~Ir~Piivfas~gDnITP--P~QaL~WI~dlY~~~~ei~a~gQ~IVY~~h~~vGHL 348 (581)
T PF11339_consen 292 DLRNIRSPIIVFASYGDNITP--PQQALNWIPDLYPDTEEIKAAGQTIVYLLHESVGHL 348 (581)
T ss_pred ehhhCCCCEEEEeccCCCCCC--hhHhccchHhhcCCHHHHHhCCCEEEEEecCCCCce
Confidence 367899999999999999998 9988444 3456777888899999999994
No 289
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=97.89 E-value=0.0023 Score=63.14 Aligned_cols=206 Identities=15% Similarity=0.098 Sum_probs=111.9
Q ss_pred eEEEEEcC-CC-CeeecCCC-c--eeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCC
Q 004574 157 AQLVLGSL-DG-TAKDFGTP-A--VYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAE 231 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l~~~-~--~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~ 231 (744)
.-++++|. +| ....+..+ + -+---.|||||++|+.+.++.+ .....|-+||... ..+++...+...
T Consensus 28 ~~~~v~D~~~g~~~~~~~a~~gRHFyGHg~fs~dG~~LytTEnd~~--------~g~G~IgVyd~~~-~~~ri~E~~s~G 98 (305)
T PF07433_consen 28 TFALVFDCRTGQLLQRLWAPPGRHFYGHGVFSPDGRLLYTTENDYE--------TGRGVIGVYDAAR-GYRRIGEFPSHG 98 (305)
T ss_pred cEEEEEEcCCCceeeEEcCCCCCEEecCEEEcCCCCEEEEeccccC--------CCcEEEEEEECcC-CcEEEeEecCCC
Confidence 35778888 77 44455444 2 2335679999998877755432 2245799999883 344443333221
Q ss_pred CCCcccCCccCCCCccceecCCCceEEEEEee-----cCCCCCcc-CCccceEEeccCCCCCCCCce-E--eee--eccc
Q 004574 232 DIPVCYNSVREGMRSISWRADKPSTLYWVEAQ-----DRGDANVE-VSPRDIIYTQPAEPAEGEKPE-I--LHK--LDLR 300 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~-----~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~-~--l~~--~~~~ 300 (744)
.++-++.|.|||+. |+...-. +.+..+.+ ..+...|..+|. .+|+.. + |.. ....
T Consensus 99 ----------IGPHel~l~pDG~t-LvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~---~sG~ll~q~~Lp~~~~~lS 164 (305)
T PF07433_consen 99 ----------IGPHELLLMPDGET-LVVANGGIETHPDSGRAKLNLDTMQPSLVYLDA---RSGALLEQVELPPDLHQLS 164 (305)
T ss_pred ----------cChhhEEEcCCCCE-EEEEcCCCccCcccCceecChhhcCCceEEEec---CCCceeeeeecCccccccc
Confidence 33668899999987 5554211 11122222 244445666665 445432 2 322 2446
Q ss_pred eeceeeccCCceEEEeeeeec---cceeEEEEcCCCCCCcceeeecccc---ccccCCCCCCceeeCCCCCeEEEEeeec
Q 004574 301 FRSVSWCDDSLALVNETWYKT---SQTRTWLVCPGSKDVAPRVLFDRVF---ENVYSDPGSPMMTRTSTGTNVIAKIKKE 374 (744)
Q Consensus 301 ~~~~~~SpDg~~l~~~~~~~~---~~~~l~~~~~~~~~~~~~~l~~~~~---~~~~~~~~~~~~~~spdg~~l~~~~~~~ 374 (744)
+.-+++++||..++-..+.-+ ..--|.+....+ . ..+..... .....+-+ .++++.+|..++.++-+
T Consensus 165 iRHLa~~~~G~V~~a~Q~qg~~~~~~PLva~~~~g~---~-~~~~~~p~~~~~~l~~Y~g--SIa~~~~g~~ia~tsPr- 237 (305)
T PF07433_consen 165 IRHLAVDGDGTVAFAMQYQGDPGDAPPLVALHRRGG---A-LRLLPAPEEQWRRLNGYIG--SIAADRDGRLIAVTSPR- 237 (305)
T ss_pred eeeEEecCCCcEEEEEecCCCCCccCCeEEEEcCCC---c-ceeccCChHHHHhhCCceE--EEEEeCCCCEEEEECCC-
Confidence 888999999975443332111 112233333222 1 22222111 11112222 36688999988888632
Q ss_pred CCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEe
Q 004574 375 NDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIW 411 (744)
Q Consensus 375 ~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~ 411 (744)
...+.+||..+|+.....
T Consensus 238 -------------------Gg~~~~~d~~tg~~~~~~ 255 (305)
T PF07433_consen 238 -------------------GGRVAVWDAATGRLLGSV 255 (305)
T ss_pred -------------------CCEEEEEECCCCCEeecc
Confidence 224778899999876543
No 290
>KOG1553 consensus Predicted alpha/beta hydrolase BAT5 [General function prediction only]
Probab=97.86 E-value=0.00012 Score=71.12 Aligned_cols=173 Identities=14% Similarity=0.116 Sum_probs=102.4
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCC-CCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEE
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPP-GYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVL 559 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~-~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~ 559 (744)
...+.++.+.||.++.+.+..-. +..+++ .-+||++-|.. +|..-+ .+..-++.||.|+
T Consensus 213 NG~R~kiks~dgneiDtmF~d~r~n~~~ng---q~LvIC~EGNA---------------GFYEvG--~m~tP~~lgYsvL 272 (517)
T KOG1553|consen 213 NGQRLKIKSSDGNEIDTMFLDGRPNQSGNG---QDLVICFEGNA---------------GFYEVG--VMNTPAQLGYSVL 272 (517)
T ss_pred CCeEEEEeecCCcchhheeecCCCCCCCCC---ceEEEEecCCc---------------cceEee--eecChHHhCceee
Confidence 46677888888888888666432 222222 46888888741 222111 1223378899999
Q ss_pred ecCCCCCCCCCCC-------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC
Q 004574 560 AGPSIPIIGEGDK-------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT 632 (744)
Q Consensus 560 ~~~~~~~~g~g~~-------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~ 632 (744)
. ...+|++.+ .....+.++++|.+..-....+.|.|.|+|.||+.++++|..+|+ ++++|+-+.+-|..
T Consensus 273 G---wNhPGFagSTG~P~p~n~~nA~DaVvQfAI~~Lgf~~edIilygWSIGGF~~~waAs~YPd-VkavvLDAtFDDll 348 (517)
T KOG1553|consen 273 G---WNHPGFAGSTGLPYPVNTLNAADAVVQFAIQVLGFRQEDIILYGWSIGGFPVAWAASNYPD-VKAVVLDATFDDLL 348 (517)
T ss_pred c---cCCCCccccCCCCCcccchHHHHHHHHHHHHHcCCCccceEEEEeecCCchHHHHhhcCCC-ceEEEeecchhhhh
Confidence 8 333443322 223356677888888766667889999999999999999999975 68888766543311
Q ss_pred CCCCcccccccchhhc-----HHHHHhcCcccccCCCCCCEEEEeeCCCCCCC
Q 004574 633 LTPFGFQTEFRTLWEA-----TNVYIEMSPITHANKIKKPILIIHGEVDDKVG 680 (744)
Q Consensus 633 ~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~ 680 (744)
+..... ....|.. ...+...+....+.+.+.|+++|--++|+++.
T Consensus 349 --pLAl~r-MP~~~~giV~~aiRnh~NLnnaell~ry~GPi~lIRRt~dEIit 398 (517)
T KOG1553|consen 349 --PLALFR-MPTFFSGIVEHAIRNHMNLNNAELLARYKGPIRLIRRTQDEIIT 398 (517)
T ss_pred --hHHhhh-chHHHHHHHHHHHHHhcccchHHHHHhhcCchhHhhhhhHhhhh
Confidence 111000 0111111 11112222333345567888888777777654
No 291
>PF10230 DUF2305: Uncharacterised conserved protein (DUF2305); InterPro: IPR019363 This entry contains proteins that have no known function.
Probab=97.85 E-value=0.00019 Score=71.51 Aligned_cols=54 Identities=19% Similarity=0.184 Sum_probs=40.8
Q ss_pred HHHHHHHHHHHHcCC---CCCCcEEEEEechHHHHHHHHHHhCC---CceeEEEEccCCC
Q 004574 576 DSAEAAVEEVVRRGV---ADPSRIAVGGHSYGAFMTAHLLAHAP---HLFCCGIARSGSY 629 (744)
Q Consensus 576 ~d~~~~~~~l~~~~~---~d~~~i~l~G~S~GG~~a~~~~~~~p---~~~~~~v~~~~~~ 629 (744)
+.+...++++.+.-. ....++.|+|||.|+++++.++.+.+ ..++.++++.|..
T Consensus 63 ~QI~hk~~~i~~~~~~~~~~~~~liLiGHSIGayi~levl~r~~~~~~~V~~~~lLfPTi 122 (266)
T PF10230_consen 63 DQIEHKIDFIKELIPQKNKPNVKLILIGHSIGAYIALEVLKRLPDLKFRVKKVILLFPTI 122 (266)
T ss_pred HHHHHHHHHHHHHhhhhcCCCCcEEEEeCcHHHHHHHHHHHhccccCCceeEEEEeCCcc
Confidence 355555555555311 13478999999999999999999998 6888899888863
No 292
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=97.82 E-value=0.0058 Score=61.71 Aligned_cols=204 Identities=19% Similarity=0.182 Sum_probs=110.5
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
-..|.|-|+++.|.|+- -.++.|+.++.++|+.+.+..... +.....-.++..|+...
T Consensus 27 gEgP~w~~~~~~L~w~D---------I~~~~i~r~~~~~g~~~~~~~p~~------~~~~~~~d~~g~Lv~~~------- 84 (307)
T COG3386 27 GEGPVWDPDRGALLWVD---------ILGGRIHRLDPETGKKRVFPSPGG------FSSGALIDAGGRLIACE------- 84 (307)
T ss_pred ccCccCcCCCCEEEEEe---------CCCCeEEEecCCcCceEEEECCCC------cccceeecCCCeEEEEc-------
Confidence 45799999999998853 345789999988777666643332 23333333444444331
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCe-eecCCC------ceeeeeccCC
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTA-KDFGTP------AVYTAVEPSP 184 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~-~~l~~~------~~~~~~~~Sp 184 (744)
..+++++. +|.. +.+... .........|
T Consensus 85 --------------------------------------------~g~~~~~~~~~~~~t~~~~~~~~~~~~r~ND~~v~p 120 (307)
T COG3386 85 --------------------------------------------HGVRLLDPDTGGKITLLAEPEDGLPLNRPNDGVVDP 120 (307)
T ss_pred --------------------------------------------cccEEEeccCCceeEEeccccCCCCcCCCCceeEcC
Confidence 12344443 3433 333322 1223566788
Q ss_pred CCceEEEEEeeCCcccccc-cCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEee
Q 004574 185 DQKYVLITSMHRPYSYKVP-CARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQ 263 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~-~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~ 263 (744)
||. |+|..+.. .. ... ..+....||+++++++..+.+...-.. ..+++|||||+. +|++..
T Consensus 121 dG~-~wfgt~~~-~~-~~~~~~~~~G~lyr~~p~g~~~~l~~~~~~~-------------~NGla~SpDg~t-ly~aDT- 182 (307)
T COG3386 121 DGR-IWFGDMGY-FD-LGKSEERPTGSLYRVDPDGGVVRLLDDDLTI-------------PNGLAFSPDGKT-LYVADT- 182 (307)
T ss_pred CCC-EEEeCCCc-cc-cCccccCCcceEEEEcCCCCEEEeecCcEEe-------------cCceEECCCCCE-EEEEeC-
Confidence 886 67776662 11 111 112234799999988877776653211 557899999997 877742
Q ss_pred cCCCCCccCCccceEEeccCCCCCC--CCce-Eee--eeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 264 DRGDANVEVSPRDIIYTQPAEPAEG--EKPE-ILH--KLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 264 ~~~~~~~~~~~~~~l~~~~~~~~~~--~~~~-~l~--~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
...+|+.++.+...+ +..+ .+. ...+....++...||...+...+ +-..|.+.+.++
T Consensus 183 ----------~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG~lw~~a~~---~g~~v~~~~pdG 244 (307)
T COG3386 183 ----------PANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDADGNLWVAAVW---GGGRVVRFNPDG 244 (307)
T ss_pred ----------CCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCCCCEEEeccc---CCceEEEECCCC
Confidence 334577777632111 1111 111 12344445555666664432221 113566777664
No 293
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=97.81 E-value=0.00017 Score=71.48 Aligned_cols=124 Identities=9% Similarity=0.086 Sum_probs=73.4
Q ss_pred ceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCC
Q 004574 209 QKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEG 288 (744)
Q Consensus 209 ~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 288 (744)
..|..||+.+.....-....+. +.++..++||..+|.+. ..+.+-++|+ -+
T Consensus 322 kkvRfwD~Rs~~~~~sv~~gg~-------------vtSl~ls~~g~~lLsss-------------RDdtl~viDl---Rt 372 (459)
T KOG0288|consen 322 KKVRFWDIRSADKTRSVPLGGR-------------VTSLDLSMDGLELLSSS-------------RDDTLKVIDL---RT 372 (459)
T ss_pred cceEEEeccCCceeeEeecCcc-------------eeeEeeccCCeEEeeec-------------CCCceeeeec---cc
Confidence 4688899766544433222221 55678899998833332 1123666666 34
Q ss_pred CCceEeeee-----ccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCC
Q 004574 289 EKPEILHKL-----DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTST 363 (744)
Q Consensus 289 ~~~~~l~~~-----~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spd 363 (744)
.+++..... ...++.+.|||||.+++..+ ....+|++++.++.-+.+.-...+. ++...++|+|-
T Consensus 373 ~eI~~~~sA~g~k~asDwtrvvfSpd~~YvaAGS----~dgsv~iW~v~tgKlE~~l~~s~s~------~aI~s~~W~~s 442 (459)
T KOG0288|consen 373 KEIRQTFSAEGFKCASDWTRVVFSPDGSYVAAGS----ADGSVYIWSVFTGKLEKVLSLSTSN------AAITSLSWNPS 442 (459)
T ss_pred ccEEEEeeccccccccccceeEECCCCceeeecc----CCCcEEEEEccCceEEEEeccCCCC------cceEEEEEcCC
Confidence 444444332 22467789999999999876 3446889998885222222111111 13334779999
Q ss_pred CCeEEEEe
Q 004574 364 GTNVIAKI 371 (744)
Q Consensus 364 g~~l~~~~ 371 (744)
|+.++...
T Consensus 443 G~~Llsad 450 (459)
T KOG0288|consen 443 GSGLLSAD 450 (459)
T ss_pred Cchhhccc
Confidence 99887764
No 294
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=97.81 E-value=0.0023 Score=63.26 Aligned_cols=175 Identities=16% Similarity=0.102 Sum_probs=96.7
Q ss_pred eEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccC
Q 004574 63 RVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLL 142 (744)
Q Consensus 63 ~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (744)
+|+. +.++..+++....... ......+++|+||+.+++.... .+
T Consensus 3 ~l~~--~~~~~~~pv~g~~~~~-~~~~~s~AvS~dg~~~A~v~~~-~~-------------------------------- 46 (253)
T PF10647_consen 3 QLVR--VSGGGVTPVPGALGEG-GYDVTSPAVSPDGSRVAAVSEG-DG-------------------------------- 46 (253)
T ss_pred cEEE--ecCCceeECCCCcCcC-CccccceEECCCCCeEEEEEEc-CC--------------------------------
Confidence 4444 3455566664332211 1135688999999999998611 11
Q ss_pred CCchhhhccceeeeeEEEEEcCCCCeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeee
Q 004574 143 KDEYDESLFDYYTTAQLVLGSLDGTAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVR 222 (744)
Q Consensus 143 ~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~ 222 (744)
..+|++....+....+........|+|++||. +++...... ...+.....++....
T Consensus 47 -------------~~~L~~~~~~~~~~~~~~g~~l~~PS~d~~g~-~W~v~~~~~----------~~~~~~~~~~g~~~~ 102 (253)
T PF10647_consen 47 -------------GRSLYVGPAGGPVRPVLTGGSLTRPSWDPDGW-VWTVDDGSG----------GVRVVRDSASGTGEP 102 (253)
T ss_pred -------------CCEEEEEcCCCcceeeccCCccccccccCCCC-EEEEEcCCC----------ceEEEEecCCCccee
Confidence 15789888877666665556788999999965 444433321 011222122332221
Q ss_pred eccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeee------
Q 004574 223 ELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHK------ 296 (744)
Q Consensus 223 ~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~------ 296 (744)
...+.+.... .+..+.+||||.+ ++++....+.. +|++..+.-...+.+..+..
T Consensus 103 ~~v~~~~~~~----------~I~~l~vSpDG~R-vA~v~~~~~~~---------~v~va~V~r~~~g~~~~l~~~~~~~~ 162 (253)
T PF10647_consen 103 VEVDWPGLRG----------RITALRVSPDGTR-VAVVVEDGGGG---------RVYVAGVVRDGDGVPRRLTGPRRVAP 162 (253)
T ss_pred EEecccccCC----------ceEEEEECCCCcE-EEEEEecCCCC---------eEEEEEEEeCCCCCcceeccceEecc
Confidence 1111111100 2668999999999 88876443332 35555431112332233221
Q ss_pred -eccceeceeeccCCceEEEee
Q 004574 297 -LDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 297 -~~~~~~~~~~SpDg~~l~~~~ 317 (744)
....+..++|+++++.++...
T Consensus 163 ~~~~~v~~v~W~~~~~L~V~~~ 184 (253)
T PF10647_consen 163 PLLSDVTDVAWSDDSTLVVLGR 184 (253)
T ss_pred cccCcceeeeecCCCEEEEEeC
Confidence 134567899999998776654
No 295
>COG3545 Predicted esterase of the alpha/beta hydrolase fold [General function prediction only]
Probab=97.77 E-value=0.00038 Score=61.74 Aligned_cols=133 Identities=15% Similarity=0.090 Sum_probs=83.0
Q ss_pred HHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCc
Q 004574 578 AEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSP 657 (744)
Q Consensus 578 ~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 657 (744)
..++++.|.+.-..-++.++|++||.|+.+++..+.+....++++++++|+--..- . ..+.....+++
T Consensus 43 ~~dWi~~l~~~v~a~~~~~vlVAHSLGc~~v~h~~~~~~~~V~GalLVAppd~~~~--~----------~~~~~~~tf~~ 110 (181)
T COG3545 43 LDDWIARLEKEVNAAEGPVVLVAHSLGCATVAHWAEHIQRQVAGALLVAPPDVSRP--E----------IRPKHLMTFDP 110 (181)
T ss_pred HHHHHHHHHHHHhccCCCeEEEEecccHHHHHHHHHhhhhccceEEEecCCCcccc--c----------cchhhccccCC
Confidence 34444444433222235599999999999999999887668999999998732110 0 00111111222
Q ss_pred ccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCc--cccHHHHHHHHHHHHH
Q 004574 658 ITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAA--RENVMHVIWETDRWLQ 730 (744)
Q Consensus 658 ~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~--~~~~~~~~~~~~~fl~ 730 (744)
.. ..++.-|.++++..+|+.++ +++++.+.+++-. .++....+||.... ...+.+....+.+|+.
T Consensus 111 ~p-~~~lpfps~vvaSrnDp~~~--~~~a~~~a~~wgs-----~lv~~g~~GHiN~~sG~g~wpeg~~~l~~~~s 177 (181)
T COG3545 111 IP-REPLPFPSVVVASRNDPYVS--YEHAEDLANAWGS-----ALVDVGEGGHINAESGFGPWPEGYALLAQLLS 177 (181)
T ss_pred Cc-cccCCCceeEEEecCCCCCC--HHHHHHHHHhccH-----hheecccccccchhhcCCCcHHHHHHHHHHhh
Confidence 11 23345789999999999999 9999999887754 57777788884321 2223444444455543
No 296
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.76 E-value=0.00032 Score=78.56 Aligned_cols=261 Identities=14% Similarity=0.172 Sum_probs=138.8
Q ss_pred ccceeecCCCC-eEEEeeecccccc--cCCCceeEEEEECCCCce--eccccCCCccccccccceEEecCCcE---EEEE
Q 004574 33 INFVSWSPDGK-RIAFSVRVDEEDN--VSSCKLRVWIADAETGEA--KPLFESPDICLNAVFGSFVWVNNSTL---LIFT 104 (744)
Q Consensus 33 ~~~p~~SpDG~-~laf~~~~~~~~~--~~~~~~~l~~~~~~gg~~--~~lt~~~~~~~~~~~~~~~wspDg~~---l~~~ 104 (744)
-..-+|||++. ++|-.......+. ..+...+||-++....+. +++..... ..-+..++|++-|.. |+..
T Consensus 9 ta~~awSp~~~~~laagt~aq~~D~sfst~~slEifeld~~~~~~dlk~~~s~~s---~~rF~kL~W~~~g~~~~GlIaG 85 (1049)
T KOG0307|consen 9 TATFAWSPASPPLLAAGTAAQQFDASFSTSASLEIFELDFSDESSDLKPVGSLQS---SNRFNKLAWGSYGSHSHGLIAG 85 (1049)
T ss_pred cceEEecCCCchhhHHHhhhhccccccccccccceeeecccCccccccccccccc---cccceeeeecccCCCccceeec
Confidence 45679999996 3433222222222 225567888888776653 44422221 112557899988776 2322
Q ss_pred ecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC----CC-Ceeec---CCC-c
Q 004574 105 IPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL----DG-TAKDF---GTP-A 175 (744)
Q Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~----~G-~~~~l---~~~-~ 175 (744)
.- ++ ++|-+++. .+ +..-| ..+ +
T Consensus 86 G~-ed-----------------------------------------------G~I~ly~p~~~~~~~~~~~la~~~~h~G 117 (1049)
T KOG0307|consen 86 GL-ED-----------------------------------------------GNIVLYDPASIIANASEEVLATKSKHTG 117 (1049)
T ss_pred cc-cC-----------------------------------------------CceEEecchhhccCcchHHHhhhcccCC
Confidence 10 00 34555554 13 22222 333 7
Q ss_pred eeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCc
Q 004574 176 VYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPS 255 (744)
Q Consensus 176 ~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~ 255 (744)
.+.++.|+|.+.-++-.... ..+|++||+.-- ....+..... -...+..++|.-.-..
T Consensus 118 ~V~gLDfN~~q~nlLASGa~------------~geI~iWDlnn~-~tP~~~~~~~---------~~~eI~~lsWNrkvqh 175 (1049)
T KOG0307|consen 118 PVLGLDFNPFQGNLLASGAD------------DGEILIWDLNKP-ETPFTPGSQA---------PPSEIKCLSWNRKVSH 175 (1049)
T ss_pred ceeeeeccccCCceeeccCC------------CCcEEEeccCCc-CCCCCCCCCC---------CcccceEeccchhhhH
Confidence 88899999998844433322 248999998742 1222111000 0011445666544333
Q ss_pred eEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee--ccceeceeeccCCce-EEEeeeeeccceeEEEEcCC
Q 004574 256 TLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL--DLRFRSVSWCDDSLA-LVNETWYKTSQTRTWLVCPG 332 (744)
Q Consensus 256 ~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~--~~~~~~~~~SpDg~~-l~~~~~~~~~~~~l~~~~~~ 332 (744)
| +++ ...+++..++|++ .......+... +..++.++|.||+.. |+.++ +++....|-.+|+-
T Consensus 176 -I-LAS----------~s~sg~~~iWDlr--~~~pii~ls~~~~~~~~S~l~WhP~~aTql~~As-~dd~~PviqlWDlR 240 (1049)
T KOG0307|consen 176 -I-LAS----------GSPSGRAVIWDLR--KKKPIIKLSDTPGRMHCSVLAWHPDHATQLLVAS-GDDSAPVIQLWDLR 240 (1049)
T ss_pred -H-hhc----------cCCCCCceecccc--CCCcccccccCCCccceeeeeeCCCCceeeeeec-CCCCCceeEeeccc
Confidence 1 221 1234457777774 33334444433 456789999999765 55444 44455566666654
Q ss_pred CCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCcee
Q 004574 333 SKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKE 408 (744)
Q Consensus 333 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~ 408 (744)
-. .++.++...+.. |...+.|++.+..++..+.+ ..++..|+.++|++-
T Consensus 241 ~a-ssP~k~~~~H~~------GilslsWc~~D~~lllSsgk--------------------D~~ii~wN~~tgEvl 289 (1049)
T KOG0307|consen 241 FA-SSPLKILEGHQR------GILSLSWCPQDPRLLLSSGK--------------------DNRIICWNPNTGEVL 289 (1049)
T ss_pred cc-CCchhhhccccc------ceeeeccCCCCchhhhcccC--------------------CCCeeEecCCCceEe
Confidence 43 223333333322 23336799988666666522 334888898888764
No 297
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=97.76 E-value=0.02 Score=54.06 Aligned_cols=163 Identities=12% Similarity=0.058 Sum_probs=93.1
Q ss_pred EEEEEcC-CC-CeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-------e-eeeccCC
Q 004574 158 QLVLGSL-DG-TAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-------L-VRELCDL 227 (744)
Q Consensus 158 ~l~~~~~-~G-~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-------~-~~~l~~~ 227 (744)
.+-++|+ +| ...++..+-.+....|+++|..+++...+.- .+...+.++++... + ...|...
T Consensus 75 t~kLWDv~tGk~la~~k~~~~Vk~~~F~~~gn~~l~~tD~~m--------g~~~~v~~fdi~~~~~~~~s~ep~~kI~t~ 146 (327)
T KOG0643|consen 75 TAKLWDVETGKQLATWKTNSPVKRVDFSFGGNLILASTDKQM--------GYTCFVSVFDIRDDSSDIDSEEPYLKIPTP 146 (327)
T ss_pred eeEEEEcCCCcEEEEeecCCeeEEEeeccCCcEEEEEehhhc--------CcceEEEEEEccCChhhhcccCceEEecCC
Confidence 4566688 88 4455555567778999999999999876631 23345666765422 2 2222211
Q ss_pred CCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEe-eeeccceecee
Q 004574 228 PPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEIL-HKLDLRFRSVS 305 (744)
Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l-~~~~~~~~~~~ 305 (744)
... +....|+|-++. |+... ..+.|-++++ .++. .... ......++++.
T Consensus 147 ~sk-------------it~a~Wg~l~~~-ii~Gh------------e~G~is~~da---~~g~~~v~s~~~h~~~Ind~q 197 (327)
T KOG0643|consen 147 DSK-------------ITSALWGPLGET-IIAGH------------EDGSISIYDA---RTGKELVDSDEEHSSKINDLQ 197 (327)
T ss_pred ccc-------------eeeeeecccCCE-EEEec------------CCCcEEEEEc---ccCceeeechhhhcccccccc
Confidence 111 445689999987 44431 2345777777 3332 2211 12244788999
Q ss_pred eccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEE
Q 004574 306 WCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAK 370 (744)
Q Consensus 306 ~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~ 370 (744)
+|+|..+++..+.+ .+ -.++|+.+-.-..+-.++...+. .+.+|--..++..
T Consensus 198 ~s~d~T~FiT~s~D--tt--akl~D~~tl~v~Kty~te~PvN~---------aaisP~~d~Vilg 249 (327)
T KOG0643|consen 198 FSRDRTYFITGSKD--TT--AKLVDVRTLEVLKTYTTERPVNT---------AAISPLLDHVILG 249 (327)
T ss_pred ccCCcceEEecccC--cc--ceeeeccceeeEEEeeecccccc---------eecccccceEEec
Confidence 99999999987622 22 33556555322223345544433 2356666655554
No 298
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=97.73 E-value=0.0035 Score=62.60 Aligned_cols=135 Identities=13% Similarity=0.050 Sum_probs=76.5
Q ss_pred eeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEE
Q 004574 179 AVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLY 258 (744)
Q Consensus 179 ~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~ 258 (744)
.+.++.++.+++|.+-+ .++|..++.+...+....-...........+.+.|..-+++.+..+. ||
T Consensus 188 ~~~~~~~~~~~~F~Sy~-------------G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~r-ly 253 (342)
T PF06433_consen 188 HPAYSRDGGRLYFVSYE-------------GNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGR-LY 253 (342)
T ss_dssp --EEETTTTEEEEEBTT-------------SEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTE-EE
T ss_pred ccceECCCCeEEEEecC-------------CEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCe-EE
Confidence 55667777777777433 47888888877654443222111111123444566667788777665 55
Q ss_pred EEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCC
Q 004574 259 WVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 259 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
..-. .+... .-..+...||++|+ ++++...-.+....+.+++.|.|.+-++|..+. ....|+++|..++
T Consensus 254 vLMh-~g~~g-sHKdpgteVWv~D~---~t~krv~Ri~l~~~~~Si~Vsqd~~P~L~~~~~--~~~~l~v~D~~tG 322 (342)
T PF06433_consen 254 VLMH-QGGEG-SHKDPGTEVWVYDL---KTHKRVARIPLEHPIDSIAVSQDDKPLLYALSA--GDGTLDVYDAATG 322 (342)
T ss_dssp EEEE-E--TT--TTS-EEEEEEEET---TTTEEEEEEEEEEEESEEEEESSSS-EEEEEET--TTTEEEEEETTT-
T ss_pred EEec-CCCCC-CccCCceEEEEEEC---CCCeEEEEEeCCCccceEEEccCCCcEEEEEcC--CCCeEEEEeCcCC
Confidence 4432 22221 11234457999998 555544434445567789999999988886532 3457999999884
No 299
>PF12146 Hydrolase_4: Putative lysophospholipase; InterPro: IPR022742 This domain is found in bacteria and eukaryotes and is approximately 110 amino acids in length. Many members are annotated as being lysophospholipases, and others as alpha-beta hydrolase fold-containing proteins.
Probab=97.73 E-value=5.4e-05 Score=59.20 Aligned_cols=58 Identities=21% Similarity=0.217 Sum_probs=45.4
Q ss_pred CeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCC
Q 004574 492 GVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGD 571 (744)
Q Consensus 492 g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~ 571 (744)
|.+|.+..|.|++ . ++++|+++||.+ +...++...+..|+++||.|+. ++.+|+|.
T Consensus 1 G~~L~~~~w~p~~-----~-~k~~v~i~HG~~---------------eh~~ry~~~a~~L~~~G~~V~~---~D~rGhG~ 56 (79)
T PF12146_consen 1 GTKLFYRRWKPEN-----P-PKAVVVIVHGFG---------------EHSGRYAHLAEFLAEQGYAVFA---YDHRGHGR 56 (79)
T ss_pred CcEEEEEEecCCC-----C-CCEEEEEeCCcH---------------HHHHHHHHHHHHHHhCCCEEEE---ECCCcCCC
Confidence 5678899999976 2 589999999963 3444556778899999999999 77777777
Q ss_pred CC
Q 004574 572 KL 573 (744)
Q Consensus 572 ~~ 573 (744)
+.
T Consensus 57 S~ 58 (79)
T PF12146_consen 57 SE 58 (79)
T ss_pred CC
Confidence 63
No 300
>PF11144 DUF2920: Protein of unknown function (DUF2920); InterPro: IPR022605 This bacterial family of proteins has no known function.
Probab=97.73 E-value=0.0011 Score=67.63 Aligned_cols=135 Identities=19% Similarity=0.127 Sum_probs=87.6
Q ss_pred CCCCChHHHHHHHHHHHHHcCCCC--CCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC--CC---CCc----
Q 004574 569 EGDKLPNDSAEAAVEEVVRRGVAD--PSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT--LT---PFG---- 637 (744)
Q Consensus 569 ~g~~~~~~d~~~~~~~l~~~~~~d--~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~--~~---~~~---- 637 (744)
+|-..+. |+..|+.++.++-.-. .-++.++|+|+||++|..++--.|-.+.+++-.++..... .. ...
T Consensus 158 ~GIMqAi-D~INAl~~l~k~~~~~~~~lp~I~~G~s~G~yla~l~~k~aP~~~~~~iDns~~~~p~l~~I~Gre~~~~~y 236 (403)
T PF11144_consen 158 FGIMQAI-DIINALLDLKKIFPKNGGGLPKIYIGSSHGGYLAHLCAKIAPWLFDGVIDNSSYALPPLRYIFGREIDFMKY 236 (403)
T ss_pred hHHHHHH-HHHHHHHHHHHhhhcccCCCcEEEEecCcHHHHHHHHHhhCccceeEEEecCccccchhheeeeeecCcccc
Confidence 4444444 7888888888863222 2489999999999999999999999999999888754311 00 000
Q ss_pred -------------ccccccchhhcH---------HHH--HhcCccccc---CCC-C-CCEEEEeeCCCCCCCCCHHHHHH
Q 004574 638 -------------FQTEFRTLWEAT---------NVY--IEMSPITHA---NKI-K-KPILIIHGEVDDKVGLFPMQAER 688 (744)
Q Consensus 638 -------------~~~~~~~~~~~~---------~~~--~~~~~~~~~---~~~-~-~P~l~i~G~~D~~v~~~~~~~~~ 688 (744)
.......+|... +.+ ...-...|+ .+. + +=....|+..|..+| .++-++
T Consensus 237 ~~~~~~~~~~~~~i~~~~Kt~Wt~n~~S~~~Fs~~~~~IR~iLn~~HL~iqs~~n~~~~yvsYHs~~D~~~p--~~~K~~ 314 (403)
T PF11144_consen 237 ICSGEFFNFKNIRIYCFDKTFWTRNKNSPYYFSKARYIIRSILNPDHLKIQSNYNKKIIYVSYHSIKDDLAP--AEDKEE 314 (403)
T ss_pred cccccccccCCEEEEEEeccccccCCCCccccChHHHHHHHhcChHHHHHHHhcccceEEEEEeccCCCCCC--HHHHHH
Confidence 000112334221 011 111011111 112 2 336677999999999 999999
Q ss_pred HHHHHHhCCCcEEEEEeC
Q 004574 689 FFDALKGHGALSRLVLLP 706 (744)
Q Consensus 689 ~~~~l~~~~~~~~~~~~~ 706 (744)
+++.++..|-+++++.+.
T Consensus 315 l~~~l~~lgfda~l~lIk 332 (403)
T PF11144_consen 315 LYEILKNLGFDATLHLIK 332 (403)
T ss_pred HHHHHHHcCCCeEEEEec
Confidence 999999999999999984
No 301
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=97.71 E-value=7.2e-05 Score=49.60 Aligned_cols=38 Identities=24% Similarity=0.448 Sum_probs=25.6
Q ss_pred eeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEE
Q 004574 22 KEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWI 66 (744)
Q Consensus 22 ~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~ 66 (744)
++++.. ......|.|||||++|+|++.+. + .+..+||+
T Consensus 2 ~~~t~~--~~~~~~p~~SpDGk~i~f~s~~~----~-~g~~diy~ 39 (39)
T PF07676_consen 2 KQLTNS--PGDDGSPAWSPDGKYIYFTSNRN----D-RGSFDIYV 39 (39)
T ss_dssp EEES-S--SSSEEEEEE-TTSSEEEEEEECT-------SSEEEEE
T ss_pred cCcccC--CccccCEEEecCCCEEEEEecCC----C-CCCcCEEC
Confidence 456532 23588999999999999988631 1 47788885
No 302
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=97.71 E-value=0.0042 Score=58.51 Aligned_cols=126 Identities=14% Similarity=0.124 Sum_probs=78.1
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECC-------CCc-eec
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAE-------TGE-AKP 76 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~-------gg~-~~~ 76 (744)
..+.+|++.. |+..-. ...+..+....||++|..+++... ..-....-|.+.++. +.+ ...
T Consensus 74 ~t~kLWDv~t----Gk~la~--~k~~~~Vk~~~F~~~gn~~l~~tD-----~~mg~~~~v~~fdi~~~~~~~~s~ep~~k 142 (327)
T KOG0643|consen 74 QTAKLWDVET----GKQLAT--WKTNSPVKRVDFSFGGNLILASTD-----KQMGYTCFVSVFDIRDDSSDIDSEEPYLK 142 (327)
T ss_pred ceeEEEEcCC----CcEEEE--eecCCeeEEEeeccCCcEEEEEeh-----hhcCcceEEEEEEccCChhhhcccCceEE
Confidence 3467788855 665433 344556888999999999999764 222444566666766 333 222
Q ss_pred cccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeee
Q 004574 77 LFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTT 156 (744)
Q Consensus 77 lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (744)
+...+. ......|+|-++.|+..-. .
T Consensus 143 I~t~~s-----kit~a~Wg~l~~~ii~Ghe-------------------------------------------------~ 168 (327)
T KOG0643|consen 143 IPTPDS-----KITSALWGPLGETIIAGHE-------------------------------------------------D 168 (327)
T ss_pred ecCCcc-----ceeeeeecccCCEEEEecC-------------------------------------------------C
Confidence 222221 1345689999999987511 1
Q ss_pred eEEEEEcC-CC-Ceeec-CCC-ceeeeeccCCCCceEEEEEee
Q 004574 157 AQLVLGSL-DG-TAKDF-GTP-AVYTAVEPSPDQKYVLITSMH 195 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l-~~~-~~~~~~~~SpDG~~i~~~~~~ 195 (744)
+.|-.+|+ +| +...- ..+ ..+.++.+|||..+.+-.+.+
T Consensus 169 G~is~~da~~g~~~v~s~~~h~~~Ind~q~s~d~T~FiT~s~D 211 (327)
T KOG0643|consen 169 GSISIYDARTGKELVDSDEEHSSKINDLQFSRDRTYFITGSKD 211 (327)
T ss_pred CcEEEEEcccCceeeechhhhccccccccccCCcceEEecccC
Confidence 46777788 77 33322 222 567899999999877655433
No 303
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=97.68 E-value=0.0015 Score=68.51 Aligned_cols=135 Identities=13% Similarity=0.175 Sum_probs=79.4
Q ss_pred eEEEEEcCCCCeeec-------CCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeee-eccCC
Q 004574 157 AQLVLGSLDGTAKDF-------GTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVR-ELCDL 227 (744)
Q Consensus 157 ~~l~~~~~~G~~~~l-------~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~-~l~~~ 227 (744)
-+||.+...|-.+.. +-+ ..+..+.|.|=..-++.+ ..+...|.+||+.....+ .+..+
T Consensus 652 i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~------------asyd~Ti~lWDl~~~~~~~~l~gH 719 (1012)
T KOG1445|consen 652 INLWRLTANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAV------------ASYDSTIELWDLANAKLYSRLVGH 719 (1012)
T ss_pred EEEEEeccCCCCcccCCcceeeecccceEEEEEecchhhhHhhh------------hhccceeeeeehhhhhhhheeccC
Confidence 467777766632222 222 445566666633322222 234457899998755433 22222
Q ss_pred CCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeec----cceec
Q 004574 228 PPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLD----LRFRS 303 (744)
Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~----~~~~~ 303 (744)
. .++.+++|||||+. ++-+- ..+.|.++.-+ +..+.+..+. .+...
T Consensus 720 t-------------dqIf~~AWSpdGr~-~AtVc------------KDg~~rVy~Pr----s~e~pv~Eg~gpvgtRgAR 769 (1012)
T KOG1445|consen 720 T-------------DQIFGIAWSPDGRR-IATVC------------KDGTLRVYEPR----SREQPVYEGKGPVGTRGAR 769 (1012)
T ss_pred c-------------CceeEEEECCCCcc-eeeee------------cCceEEEeCCC----CCCCccccCCCCccCccee
Confidence 2 23789999999998 66552 22346666542 2223344332 34456
Q ss_pred eeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 304 VSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 304 ~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
+.|.=||++++....+.....+|-.++.+.
T Consensus 770 i~wacdgr~viv~Gfdk~SeRQv~~Y~Aq~ 799 (1012)
T KOG1445|consen 770 ILWACDGRIVIVVGFDKSSERQVQMYDAQT 799 (1012)
T ss_pred EEEEecCcEEEEecccccchhhhhhhhhhh
Confidence 889999999998876666666777777765
No 304
>TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae
Probab=97.67 E-value=0.00023 Score=79.89 Aligned_cols=85 Identities=16% Similarity=0.072 Sum_probs=54.3
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCCCCC--------------------
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGEGDK-------------------- 572 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~g~~-------------------- 572 (744)
.|+||++||.+. ........+..|+++||.|++.+ .+|+|.+
T Consensus 449 ~P~VVllHG~~g---------------~~~~~~~lA~~La~~Gy~VIaiD---lpGHG~S~~~~~~~~~~a~~~~~~~y~ 510 (792)
T TIGR03502 449 WPVVIYQHGITG---------------AKENALAFAGTLAAAGVATIAID---HPLHGARSFDANASGVNATNANVLAYM 510 (792)
T ss_pred CcEEEEeCCCCC---------------CHHHHHHHHHHHHhCCcEEEEeC---CCCCCccccccccccccccccCcccee
Confidence 689999999631 11112245667788999999833 3333222
Q ss_pred -------------ChHHHHHHHHHHHH------Hc----CCCCCCcEEEEEechHHHHHHHHHHhC
Q 004574 573 -------------LPNDSAEAAVEEVV------RR----GVADPSRIAVGGHSYGAFMTAHLLAHA 615 (744)
Q Consensus 573 -------------~~~~d~~~~~~~l~------~~----~~~d~~~i~l~G~S~GG~~a~~~~~~~ 615 (744)
....|+......+. +. ...+..+|.++||||||.++..++...
T Consensus 511 Nl~~l~~aRDn~rQ~v~Dll~L~~~l~~~~~~~~~~~~~~~~~~~~V~~lGHSLGgiig~~~~~~a 576 (792)
T TIGR03502 511 NLASLLVARDNLRQSILDLLGLRLSLNGSALAGAPLSGINVIDGSKVSFLGHSLGGIVGTSFIAYA 576 (792)
T ss_pred ccccccccccCHHHHHHHHHHHHHHHhcccccccccccccCCCCCcEEEEecCHHHHHHHHHHHhc
Confidence 01125555555555 11 125567999999999999999998763
No 305
>PF05677 DUF818: Chlamydia CHLPS protein (DUF818); InterPro: IPR008536 This family of unknown function includes several Chlamydia CHLPS proteins and Legionella SidB proteins.
Probab=97.67 E-value=0.00057 Score=67.45 Aligned_cols=173 Identities=14% Similarity=0.079 Sum_probs=96.0
Q ss_pred ceEEEEEEcCCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHH-hCCeEEE
Q 004574 481 QKEMIKYQRKDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFL-ARRFAVL 559 (744)
Q Consensus 481 ~~~~i~~~~~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~G~~v~ 559 (744)
..+++.+.. |+..+.+....-+++. +-..||++-|.+...... .+..........++ +.|.+|+
T Consensus 111 ~~kRv~Iq~-D~~~IDt~~I~~~~a~-----~~RWiL~s~GNg~~~E~~---------~~~~~~~~~~~~~ak~~~aNvl 175 (365)
T PF05677_consen 111 SVKRVPIQY-DGVKIDTMAIHQPEAK-----PQRWILVSNGNGECYENR---------AMLDYKDDWIQRFAKELGANVL 175 (365)
T ss_pred ceeeEEEee-CCEEEEEEEeeCCCCC-----CCcEEEEEcCChHHhhhh---------hhhccccHHHHHHHHHcCCcEE
Confidence 577888875 8888888776533322 234788887865211000 00001123344444 5689999
Q ss_pred ecCCCCCCCCCCC-------ChHHHHHHHHHHHHHcC-CCCCCcEEEEEechHHHHHHHHHHhCC----CceeEEEEccC
Q 004574 560 AGPSIPIIGEGDK-------LPNDSAEAAVEEVVRRG-VADPSRIAVGGHSYGAFMTAHLLAHAP----HLFCCGIARSG 627 (744)
Q Consensus 560 ~~~~~~~~g~g~~-------~~~~d~~~~~~~l~~~~-~~d~~~i~l~G~S~GG~~a~~~~~~~p----~~~~~~v~~~~ 627 (744)
. +.++|.|.+ ++..|.++.++||+++. .+.+++|.+.|||.||.++..++.++. +.++=.+
T Consensus 176 ~---fNYpGVg~S~G~~s~~dLv~~~~a~v~yL~d~~~G~ka~~Ii~yG~SLGG~Vqa~AL~~~~~~~~dgi~~~~---- 248 (365)
T PF05677_consen 176 V---FNYPGVGSSTGPPSRKDLVKDYQACVRYLRDEEQGPKAKNIILYGHSLGGGVQAEALKKEVLKGSDGIRWFL---- 248 (365)
T ss_pred E---ECCCccccCCCCCCHHHHHHHHHHHHHHHHhcccCCChheEEEeeccccHHHHHHHHHhcccccCCCeeEEE----
Confidence 8 444443333 23448889999998753 577899999999999999887665542 1122111
Q ss_pred CCCCCCCCCcccc---cc-cchhhcHHHHHhcCcccccCCCCCCEEEEeeCC
Q 004574 628 SYNKTLTPFGFQT---EF-RTLWEATNVYIEMSPITHANKIKKPILIIHGEV 675 (744)
Q Consensus 628 ~~~~~~~~~~~~~---~~-~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~ 675 (744)
+.|+......-.. .. -..|-..-.-|.++......++.||=+++|+.+
T Consensus 249 ikDRsfssl~~vas~~~~~~~~~l~~l~gWnidS~K~s~~l~cpeIii~~~d 300 (365)
T PF05677_consen 249 IKDRSFSSLAAVASQFFGPIGKLLIKLLGWNIDSAKNSEKLQCPEIIIYGVD 300 (365)
T ss_pred EecCCcchHHHHHHHHHHHHHHHHHHHhccCCCchhhhccCCCCeEEEeccc
Confidence 1111111110000 00 000111111234566666778899999999864
No 306
>COG3150 Predicted esterase [General function prediction only]
Probab=97.63 E-value=0.0004 Score=60.57 Aligned_cols=136 Identities=11% Similarity=0.108 Sum_probs=73.2
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhc------HH
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEA------TN 650 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~------~~ 650 (744)
++++-++-+..+.. | ..++|+|-|.||+.|.+++.+. .+++++ +.|.+-....-.++......++.. ..
T Consensus 44 ~a~~ele~~i~~~~-~-~~p~ivGssLGGY~At~l~~~~--Girav~-~NPav~P~e~l~gylg~~en~ytg~~y~le~~ 118 (191)
T COG3150 44 QALKELEKAVQELG-D-ESPLIVGSSLGGYYATWLGFLC--GIRAVV-FNPAVRPYELLTGYLGRPENPYTGQEYVLESR 118 (191)
T ss_pred HHHHHHHHHHHHcC-C-CCceEEeecchHHHHHHHHHHh--CChhhh-cCCCcCchhhhhhhcCCCCCCCCcceEEeehh
Confidence 34444444444422 2 3499999999999999999987 455555 444322110000000000011100 00
Q ss_pred HHHhcCcccccCCCCCC-EEEEee-CCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHH
Q 004574 651 VYIEMSPITHANKIKKP-ILIIHG-EVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRW 728 (744)
Q Consensus 651 ~~~~~~~~~~~~~~~~P-~l~i~G-~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~f 728 (744)
..... ....++.++.| -|++.. +.|++.. ..++.+.+... ...++++++|.|.. ....++.|+.|
T Consensus 119 hI~~l-~~~~~~~l~~p~~~~lL~qtgDEvLD--yr~a~a~y~~~-------~~~V~dgg~H~F~~---f~~~l~~i~aF 185 (191)
T COG3150 119 HIATL-CVLQFRELNRPRCLVLLSQTGDEVLD--YRQAVAYYHPC-------YEIVWDGGDHKFKG---FSRHLQRIKAF 185 (191)
T ss_pred hHHHH-HHhhccccCCCcEEEeecccccHHHH--HHHHHHHhhhh-------hheeecCCCccccc---hHHhHHHHHHH
Confidence 01111 11123444444 444444 4498865 77776666544 45678889998764 56778888888
Q ss_pred HH
Q 004574 729 LQ 730 (744)
Q Consensus 729 l~ 730 (744)
..
T Consensus 186 ~g 187 (191)
T COG3150 186 KG 187 (191)
T ss_pred hc
Confidence 74
No 307
>PF05705 DUF829: Eukaryotic protein of unknown function (DUF829); InterPro: IPR008547 This signature identifies Transmembrane protein 53, that have no known function but are predicted to be integral membrane proteins.
Probab=97.62 E-value=0.0013 Score=64.84 Aligned_cols=65 Identities=15% Similarity=0.087 Sum_probs=59.8
Q ss_pred CCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHH
Q 004574 663 KIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 663 ~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl 729 (744)
...+|-|.+.++.|.+++ .+..++..+..++.|.+++...++++.|.-+....++++.+.+.+|+
T Consensus 176 ~~~~p~lylYS~~D~l~~--~~~ve~~~~~~~~~G~~V~~~~f~~S~HV~H~r~~p~~Y~~~v~~fw 240 (240)
T PF05705_consen 176 PSRCPRLYLYSKADPLIP--WRDVEEHAEEARRKGWDVRAEKFEDSPHVAHLRKHPDRYWRAVDEFW 240 (240)
T ss_pred CCCCCeEEecCCCCcCcC--HHHHHHHHHHHHHcCCeEEEecCCCCchhhhcccCHHHHHHHHHhhC
Confidence 346899999999999999 99999999999999999999999999999988888999999998874
No 308
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.61 E-value=0.012 Score=63.30 Aligned_cols=207 Identities=14% Similarity=0.085 Sum_probs=118.2
Q ss_pred eecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCCcEEE
Q 004574 24 VHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNSTLLI 102 (744)
Q Consensus 24 l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~ 102 (744)
++...+...++...+|.|..-++- .++..|-+++.++.+ .|.++.+- .....|-|.+++|+
T Consensus 367 i~~~GHR~dVRsl~vS~d~~~~~S-----------ga~~SikiWn~~t~kciRTi~~~y-------~l~~~Fvpgd~~Iv 428 (888)
T KOG0306|consen 367 IEIGGHRSDVRSLCVSSDSILLAS-----------GAGESIKIWNRDTLKCIRTITCGY-------ILASKFVPGDRYIV 428 (888)
T ss_pred eeeccchhheeEEEeecCceeeee-----------cCCCcEEEEEccCcceeEEecccc-------EEEEEecCCCceEE
Confidence 343333446788899998766655 333445555666544 55564331 23446789888887
Q ss_pred EEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ceeee
Q 004574 103 FTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AVYTA 179 (744)
Q Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~~~~ 179 (744)
.... .+.|-++|+ ++ -.+.+..+ +.+..
T Consensus 429 ~G~k-------------------------------------------------~Gel~vfdlaS~~l~Eti~AHdgaIWs 459 (888)
T KOG0306|consen 429 LGTK-------------------------------------------------NGELQVFDLASASLVETIRAHDGAIWS 459 (888)
T ss_pred Eecc-------------------------------------------------CCceEEEEeehhhhhhhhhccccceee
Confidence 6532 156667777 55 34444444 77788
Q ss_pred eccCCCCceEEEEEeeCCcccccccCCCcceEEEEe------CCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 180 VEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWT------TDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 180 ~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~------~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
++-+||++..+-.+.+. .+-.|+ ..|.+.+.|.-....... + ...+-.+.+||||
T Consensus 460 i~~~pD~~g~vT~saDk-------------tVkfWdf~l~~~~~gt~~k~lsl~~~rtLe---l---~ddvL~v~~Spdg 520 (888)
T KOG0306|consen 460 ISLSPDNKGFVTGSADK-------------TVKFWDFKLVVSVPGTQKKVLSLKHTRTLE---L---EDDVLCVSVSPDG 520 (888)
T ss_pred eeecCCCCceEEecCCc-------------EEEEEeEEEEeccCcccceeeeeccceEEe---c---cccEEEEEEcCCC
Confidence 99999999987665443 233332 223333322211100000 0 0114568999999
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
+. |+.. -. +....+|.+|.- .=...|..+...+.++..|||++.++..+.+ .+-.+|-+|...
T Consensus 521 k~-LaVs-LL---------dnTVkVyflDtl----KFflsLYGHkLPV~smDIS~DSklivTgSAD--KnVKiWGLdFGD 583 (888)
T KOG0306|consen 521 KL-LAVS-LL---------DNTVKVYFLDTL----KFFLSLYGHKLPVLSMDISPDSKLIVTGSAD--KNVKIWGLDFGD 583 (888)
T ss_pred cE-EEEE-ec---------cCeEEEEEecce----eeeeeecccccceeEEeccCCcCeEEeccCC--CceEEeccccch
Confidence 95 4433 22 233347776641 1224466667788889999999999887632 344566665443
No 309
>KOG4328 consensus WD40 protein [Function unknown]
Probab=97.60 E-value=0.036 Score=56.37 Aligned_cols=226 Identities=12% Similarity=0.054 Sum_probs=114.0
Q ss_pred eEEEEEcCCC-Ceeec-CC--C-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCC
Q 004574 157 AQLVLGSLDG-TAKDF-GT--P-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAE 231 (744)
Q Consensus 157 ~~l~~~~~~G-~~~~l-~~--~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~ 231 (744)
+.|-..|+++ ..+.+ .. . .......++-+...++|..+-. ...++-...++++...+.-...-
T Consensus 257 GtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G-----------~f~~iD~R~~~s~~~~~~lh~kK- 324 (498)
T KOG4328|consen 257 GTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVG-----------NFNVIDLRTDGSEYENLRLHKKK- 324 (498)
T ss_pred ceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEEeeccc-----------ceEEEEeecCCccchhhhhhhcc-
Confidence 4566677744 22222 11 2 5667888888888777774432 12333334455544444333221
Q ss_pred CCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-c-eEeeeeccceeceeeccC
Q 004574 232 DIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-P-EILHKLDLRFRSVSWCDD 309 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~-~~l~~~~~~~~~~~~SpD 309 (744)
+..+++.|--.++|+-.+. ...+-++|++.+.+.. + ...+.....++++.|||+
T Consensus 325 ------------I~sv~~NP~~p~~laT~s~------------D~T~kIWD~R~l~~K~sp~lst~~HrrsV~sAyFSPs 380 (498)
T KOG4328|consen 325 ------------ITSVALNPVCPWFLATASL------------DQTAKIWDLRQLRGKASPFLSTLPHRRSVNSAYFSPS 380 (498)
T ss_pred ------------cceeecCCCCchheeeccc------------CcceeeeehhhhcCCCCcceecccccceeeeeEEcCC
Confidence 5677888888873433321 1124455554333222 1 223445678999999999
Q ss_pred CceEEEeeeeeccceeEEEEcCC---CCCCcceeeeccc-cccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEcc
Q 004574 310 SLALVNETWYKTSQTRTWLVCPG---SKDVAPRVLFDRV-FENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNG 385 (744)
Q Consensus 310 g~~l~~~~~~~~~~~~l~~~~~~---~~~~~~~~l~~~~-~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~ 385 (744)
|-.|+....+ . .|.++|.. .. ..+.....++ .......+.. ..|.||...|+....
T Consensus 381 ~gtl~TT~~D--~--~IRv~dss~~sa~-~~p~~~I~Hn~~t~RwlT~fK--A~W~P~~~li~vg~~------------- 440 (498)
T KOG4328|consen 381 GGTLLTTCQD--N--EIRVFDSSCISAK-DEPLGTIPHNNRTGRWLTPFK--AAWDPDYNLIVVGRY------------- 440 (498)
T ss_pred CCceEeeccC--C--ceEEeeccccccc-CCccceeeccCcccccccchh--heeCCCccEEEEecc-------------
Confidence 8888876522 3 35555542 11 2222222222 1111111222 349998887777642
Q ss_pred CCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEE
Q 004574 386 RGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILS 458 (744)
Q Consensus 386 ~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~ 458 (744)
...|-++|..+|++.--.... +...+... ..+.|=+..++. ..+..+.||++.
T Consensus 441 --------~r~IDv~~~~~~q~v~el~~P---~~~tI~~v------n~~HP~~~~~~a---G~~s~Gki~vft 493 (498)
T KOG4328|consen 441 --------PRPIDVFDGNGGQMVCELHDP---ESSTIPSV------NEFHPMRDTLAA---GGNSSGKIYVFT 493 (498)
T ss_pred --------CcceeEEcCCCCEEeeeccCc---cccccccc------eeecccccceec---cCCccceEEEEe
Confidence 233777887777632222211 11122222 355665654442 334456677654
No 310
>PRK02888 nitrous-oxide reductase; Validated
Probab=97.59 E-value=0.018 Score=62.39 Aligned_cols=211 Identities=8% Similarity=-0.026 Sum_probs=109.5
Q ss_pred cceeec-----CCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEE--ecCCcEEEEEec
Q 004574 34 NFVSWS-----PDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVW--VNNSTLLIFTIP 106 (744)
Q Consensus 34 ~~p~~S-----pDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~w--spDg~~l~~~~~ 106 (744)
..|.+| -||+||.. . .+.+.+|-+++++.-+..+++..+... +.-...+ .|+-.+++....
T Consensus 128 HHp~~s~t~g~ydGr~~fi-n--------dk~n~Rvari~l~~~~~~~i~~iPn~~---~~Hg~~~~~~p~t~yv~~~~e 195 (635)
T PRK02888 128 HHPHMSFTDGTYDGRYLFI-N--------DKANTRVARIRLDVMKCDKITELPNVQ---GIHGLRPQKIPRTGYVFCNGE 195 (635)
T ss_pred CCCcccccCCccceeEEEE-e--------cCCCcceEEEECccEeeceeEeCCCcc---CccccCccccCCccEEEeCcc
Confidence 455554 47877754 2 156788999999988888887666421 1112222 366666654321
Q ss_pred CCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC--CeeecCCCceeeeeccCC
Q 004574 107 SSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG--TAKDFGTPAVYTAVEPSP 184 (744)
Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G--~~~~l~~~~~~~~~~~Sp 184 (744)
- ..+..+++...... .. ..+-+-++|.+. -..++.-.+......++|
T Consensus 196 ~------~~PlpnDGk~l~~~------------------------~e-y~~~vSvID~etmeV~~qV~Vdgnpd~v~~sp 244 (635)
T PRK02888 196 F------RIPLPNDGKDLDDP------------------------KK-YRSLFTAVDAETMEVAWQVMVDGNLDNVDTDY 244 (635)
T ss_pred c------ccccCCCCCEeecc------------------------cc-eeEEEEEEECccceEEEEEEeCCCcccceECC
Confidence 0 00000111100000 00 114555667633 233443334556788999
Q ss_pred CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeec
Q 004574 185 DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQD 264 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~ 264 (744)
||+++++++.+.+. ...+...+.............. --.+.+||+. .+ +.
T Consensus 245 dGk~afvTsyNsE~---------G~tl~em~a~e~d~~vvfni~~----------------iea~vkdGK~-~~-V~--- 294 (635)
T PRK02888 245 DGKYAFSTCYNSEE---------GVTLAEMMAAERDWVVVFNIAR----------------IEEAVKAGKF-KT-IG--- 294 (635)
T ss_pred CCCEEEEeccCccc---------CcceeeeccccCceEEEEchHH----------------HHHhhhCCCE-EE-EC---
Confidence 99999988643211 1233334332222111111110 0156778885 33 31
Q ss_pred CCCCCccCCccceEEeccCCCCCC-----CCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 265 RGDANVEVSPRDIIYTQPAEPAEG-----EKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 265 ~~~~~~~~~~~~~l~~~~~~~~~~-----~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
...+-++|. .. ...........+...+.+||||++++.+.. ....+-++|+..
T Consensus 295 ----------gn~V~VID~---~t~~~~~~~v~~yIPVGKsPHGV~vSPDGkylyVank---lS~tVSVIDv~k 352 (635)
T PRK02888 295 ----------GSKVPVVDG---RKAANAGSALTRYVPVPKNPHGVNTSPDGKYFIANGK---LSPTVTVIDVRK 352 (635)
T ss_pred ----------CCEEEEEEC---CccccCCcceEEEEECCCCccceEECCCCCEEEEeCC---CCCcEEEEEChh
Confidence 124777776 32 233444445667778999999999987652 233566777765
No 311
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=97.59 E-value=0.016 Score=61.82 Aligned_cols=74 Identities=9% Similarity=0.055 Sum_probs=46.1
Q ss_pred CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccc
Q 004574 244 MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQ 323 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~ 323 (744)
+..++.||+|+ |+..+.+.... ....|++++.. .-.+...|..+...+..++|||||++|+..+.++ .
T Consensus 528 v~~l~~s~~gn--liASaCKS~~~------ehAvI~lw~t~--~W~~~~~L~~HsLTVT~l~FSpdg~~LLsvsRDR--t 595 (764)
T KOG1063|consen 528 VYALAISPTGN--LIASACKSSLK------EHAVIRLWNTA--NWLQVQELEGHSLTVTRLAFSPDGRYLLSVSRDR--T 595 (764)
T ss_pred EEEEEecCCCC--EEeehhhhCCc------cceEEEEEecc--chhhhheecccceEEEEEEECCCCcEEEEeecCc--e
Confidence 44567777776 44443332222 22357888762 2233345777788899999999999999887443 3
Q ss_pred eeEEEE
Q 004574 324 TRTWLV 329 (744)
Q Consensus 324 ~~l~~~ 329 (744)
..||-.
T Consensus 596 ~sl~~~ 601 (764)
T KOG1063|consen 596 VSLYEV 601 (764)
T ss_pred EEeeee
Confidence 345544
No 312
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=97.56 E-value=0.0086 Score=57.76 Aligned_cols=229 Identities=12% Similarity=0.087 Sum_probs=117.3
Q ss_pred eEEEEEcC-CCC-eeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC-CCeeeeccCCCCCCC
Q 004574 157 AQLVLGSL-DGT-AKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD-GKLVRELCDLPPAED 232 (744)
Q Consensus 157 ~~l~~~~~-~G~-~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~-g~~~~~l~~~~~~~~ 232 (744)
+.+.++|. +-. .+.++.+ ..+..++||+||+.|+-.+... .+-+||+- |...+++--.
T Consensus 45 G~vvI~D~~T~~iar~lsaH~~pi~sl~WS~dgr~LltsS~D~-------------si~lwDl~~gs~l~rirf~----- 106 (405)
T KOG1273|consen 45 GRVVIYDFDTFRIARMLSAHVRPITSLCWSRDGRKLLTSSRDW-------------SIKLWDLLKGSPLKRIRFD----- 106 (405)
T ss_pred CcEEEEEccccchhhhhhccccceeEEEecCCCCEeeeecCCc-------------eeEEEeccCCCceeEEEcc-----
Confidence 56777777 333 3444545 8899999999999987665443 68889964 5544443211
Q ss_pred CCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee-ccc----eeceeec
Q 004574 233 IPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL-DLR----FRSVSWC 307 (744)
Q Consensus 233 ~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~-~~~----~~~~~~S 307 (744)
.| ++...|.|-.+. .+.+.-. +..-++++. .....+.|... ++. .....|.
T Consensus 107 sp---------v~~~q~hp~k~n-~~va~~~-----------~~sp~vi~~---s~~~h~~Lp~d~d~dln~sas~~~fd 162 (405)
T KOG1273|consen 107 SP---------VWGAQWHPRKRN-KCVATIM-----------EESPVVIDF---SDPKHSVLPKDDDGDLNSSASHGVFD 162 (405)
T ss_pred Cc---------cceeeeccccCC-eEEEEEe-----------cCCcEEEEe---cCCceeeccCCCcccccccccccccc
Confidence 11 556678877655 3333211 111223333 12222333221 111 1112477
Q ss_pred cCCceEEEeeeeeccceeEEEEcCCCCC--CcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEcc
Q 004574 308 DDSLALVNETWYKTSQTRTWLVCPGSKD--VAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNG 385 (744)
Q Consensus 308 pDg~~l~~~~~~~~~~~~l~~~~~~~~~--~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~ 385 (744)
+-|++|+... ....|.+++..+-+ .+.+..+..+.. .+.++-.|+.|++...
T Consensus 163 r~g~yIitGt----sKGkllv~~a~t~e~vas~rits~~~IK---------~I~~s~~g~~liiNts------------- 216 (405)
T KOG1273|consen 163 RRGKYIITGT----SKGKLLVYDAETLECVASFRITSVQAIK---------QIIVSRKGRFLIINTS------------- 216 (405)
T ss_pred CCCCEEEEec----CcceEEEEecchheeeeeeeechheeee---------EEEEeccCcEEEEecC-------------
Confidence 8899888765 33456677766521 011111111111 1446777777777642
Q ss_pred CCCCCCCCCceEEEEecC-------CCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEE
Q 004574 386 RGFTPEGNIPFLDLFDIN-------TGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILS 458 (744)
Q Consensus 386 ~g~~~~~~~~~l~~~d~~-------~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~ 458 (744)
.+-|..+++. +++++..-. +.+- .+- ....-+.+|-||..++... ...-.||.|.
T Consensus 217 --------DRvIR~ye~~di~~~~r~~e~e~~~K-----~qDv-VNk-~~Wk~ccfs~dgeYv~a~s---~~aHaLYIWE 278 (405)
T KOG1273|consen 217 --------DRVIRTYEISDIDDEGRDGEVEPEHK-----LQDV-VNK-LQWKKCCFSGDGEYVCAGS---ARAHALYIWE 278 (405)
T ss_pred --------CceEEEEehhhhcccCccCCcChhHH-----HHHH-Hhh-hhhhheeecCCccEEEecc---ccceeEEEEe
Confidence 1235555543 222221100 1111 110 0122378888887655433 2233699999
Q ss_pred CCCCceeeeecCC
Q 004574 459 WPLKKSSQITNFP 471 (744)
Q Consensus 459 ~~~g~~~~lt~~~ 471 (744)
...|.+.++.+-+
T Consensus 279 ~~~GsLVKILhG~ 291 (405)
T KOG1273|consen 279 KSIGSLVKILHGT 291 (405)
T ss_pred cCCcceeeeecCC
Confidence 8789888887643
No 313
>COG3243 PhaC Poly(3-hydroxyalkanoate) synthetase [Lipid metabolism]
Probab=97.55 E-value=0.00039 Score=70.49 Aligned_cols=83 Identities=13% Similarity=0.004 Sum_probs=57.9
Q ss_pred hhHHHHHhCCeEEEecCC-CCCC---CCCCCChH-HHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCc-e
Q 004574 546 TSSLIFLARRFAVLAGPS-IPII---GEGDKLPN-DSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHL-F 619 (744)
Q Consensus 546 ~~~~~~~~~G~~v~~~~~-~~~~---g~g~~~~~-~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~-~ 619 (744)
..+..++++|..|+..+- .+.. ..+.++.. +.+..+++.+++.... ++|-++|+|.||.++..+++..+.+ +
T Consensus 130 s~V~~l~~~g~~vfvIsw~nPd~~~~~~~~edYi~e~l~~aid~v~~itg~--~~InliGyCvGGtl~~~ala~~~~k~I 207 (445)
T COG3243 130 SLVRWLLEQGLDVFVISWRNPDASLAAKNLEDYILEGLSEAIDTVKDITGQ--KDINLIGYCVGGTLLAAALALMAAKRI 207 (445)
T ss_pred cHHHHHHHcCCceEEEeccCchHhhhhccHHHHHHHHHHHHHHHHHHHhCc--cccceeeEecchHHHHHHHHhhhhccc
Confidence 456788899998876221 1111 12222222 4677778888776443 5899999999999999999888777 8
Q ss_pred eEEEEccCCCC
Q 004574 620 CCGIARSGSYN 630 (744)
Q Consensus 620 ~~~v~~~~~~~ 630 (744)
+.+..+....|
T Consensus 208 ~S~T~lts~~D 218 (445)
T COG3243 208 KSLTLLTSPVD 218 (445)
T ss_pred ccceeeecchh
Confidence 88888877654
No 314
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=97.51 E-value=0.0031 Score=61.88 Aligned_cols=235 Identities=11% Similarity=0.050 Sum_probs=123.3
Q ss_pred eEEEEEcC-CCCeee-cCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCC
Q 004574 157 AQLVLGSL-DGTAKD-FGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDI 233 (744)
Q Consensus 157 ~~l~~~~~-~G~~~~-l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~ 233 (744)
..|+++|. +|+..+ |..+ ..+..++|+.-|+.|+-.+..- .+.+|+.++. .+.+....+.
T Consensus 130 ~tikv~D~~tg~~e~~LrGHt~sv~di~~~a~Gk~l~tcSsDl-------------~~~LWd~~~~-~~c~ks~~gh--- 192 (406)
T KOG0295|consen 130 ATIKVFDTETGELERSLRGHTDSVFDISFDASGKYLATCSSDL-------------SAKLWDFDTF-FRCIKSLIGH--- 192 (406)
T ss_pred ceEEEEEccchhhhhhhhccccceeEEEEecCccEEEecCCcc-------------chhheeHHHH-HHHHHHhcCc---
Confidence 57888898 885533 3223 5577899999998887654442 3566776653 2222222111
Q ss_pred CcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCC-CceEeeeeccceeceeeccCCce
Q 004574 234 PVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGE-KPEILHKLDLRFRSVSWCDDSLA 312 (744)
Q Consensus 234 ~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~l~~~~~~~~~~~~SpDg~~ 312 (744)
..++..+.+.|-|.+ |+.++.. ..|..++. +++ ....++....-+..+....||+.
T Consensus 193 -------~h~vS~V~f~P~gd~-ilS~srD------------~tik~We~---~tg~cv~t~~~h~ewvr~v~v~~DGti 249 (406)
T KOG0295|consen 193 -------EHGVSSVFFLPLGDH-ILSCSRD------------NTIKAWEC---DTGYCVKTFPGHSEWVRMVRVNQDGTI 249 (406)
T ss_pred -------ccceeeEEEEecCCe-eeecccc------------cceeEEec---ccceeEEeccCchHhEEEEEecCCeeE
Confidence 133667899999976 5554311 12566666 333 33445555566778888999998
Q ss_pred EEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecC-CcceEEEEccCCCCCC
Q 004574 313 LVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKEN-DEQIYILLNGRGFTPE 391 (744)
Q Consensus 313 l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~-~~~~~~~~~~~g~~~~ 391 (744)
++..+++ ..-++|.+ .+ .+-+.+.......+.- ++|.|...+--......+ .++.++...
T Consensus 250 ~As~s~d--qtl~vW~~--~t--~~~k~~lR~hEh~vEc------i~wap~~~~~~i~~at~~~~~~~~l~s~------- 310 (406)
T KOG0295|consen 250 IASCSND--QTLRVWVV--AT--KQCKAELREHEHPVEC------IAWAPESSYPSISEATGSTNGGQVLGSG------- 310 (406)
T ss_pred EEecCCC--ceEEEEEe--cc--chhhhhhhccccceEE------EEecccccCcchhhccCCCCCccEEEee-------
Confidence 8866532 33445544 44 2222222211111111 234444432111111111 112222211
Q ss_pred CCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCcee
Q 004574 392 GNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSS 465 (744)
Q Consensus 392 ~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~ 465 (744)
.....|..||..+|.. |++-.+ ..+++. .++|+|.|+.|+ +.++ ...|..||+..++-.
T Consensus 311 SrDktIk~wdv~tg~c--L~tL~g--hdnwVr-------~~af~p~Gkyi~-ScaD---Dktlrvwdl~~~~cm 369 (406)
T KOG0295|consen 311 SRDKTIKIWDVSTGMC--LFTLVG--HDNWVR-------GVAFSPGGKYIL-SCAD---DKTLRVWDLKNLQCM 369 (406)
T ss_pred cccceEEEEeccCCeE--EEEEec--ccceee-------eeEEcCCCeEEE-EEec---CCcEEEEEeccceee
Confidence 2234599999999965 332211 112333 358999997655 4433 345888888776543
No 315
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=97.49 E-value=0.01 Score=55.40 Aligned_cols=198 Identities=13% Similarity=0.075 Sum_probs=105.7
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
..+...+||.|.++|+-..++. -+.+++++.-+ -.|..+.....+++.+.|.-..+
T Consensus 101 hivk~~af~~ds~~lltgg~ek-------------llrvfdln~p~-----------App~E~~ghtg~Ir~v~wc~eD~ 156 (334)
T KOG0278|consen 101 HIVKAVAFSQDSNYLLTGGQEK-------------LLRVFDLNRPK-----------APPKEISGHTGGIRTVLWCHEDK 156 (334)
T ss_pred heeeeEEecccchhhhccchHH-------------HhhhhhccCCC-----------CCchhhcCCCCcceeEEEeccCc
Confidence 5566888999998887655442 23344433221 11222333334577888876665
Q ss_pred ceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCC
Q 004574 255 STLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 255 ~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
. |+..+ ....+.++|. .++..++-...+..+.++..|+||++|..+. + ..+.-+|...
T Consensus 157 ~-iLSSa------------dd~tVRLWD~---rTgt~v~sL~~~s~VtSlEvs~dG~ilTia~----g-ssV~Fwdaks- 214 (334)
T KOG0278|consen 157 C-ILSSA------------DDKTVRLWDH---RTGTEVQSLEFNSPVTSLEVSQDGRILTIAY----G-SSVKFWDAKS- 214 (334)
T ss_pred e-EEeec------------cCCceEEEEe---ccCcEEEEEecCCCCcceeeccCCCEEEEec----C-ceeEEecccc-
Confidence 4 33221 1123556666 4566666666788999999999999887654 1 1233444443
Q ss_pred CCcceeeec--cccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEee
Q 004574 335 DVAPRVLFD--RVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWE 412 (744)
Q Consensus 335 ~~~~~~l~~--~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~ 412 (744)
.......+ .+... .+.+|+...++... ..-.++++|..||+......
T Consensus 215 -f~~lKs~k~P~nV~S---------ASL~P~k~~fVaGg---------------------ed~~~~kfDy~TgeEi~~~n 263 (334)
T KOG0278|consen 215 -FGLLKSYKMPCNVES---------ASLHPKKEFFVAGG---------------------EDFKVYKFDYNTGEEIGSYN 263 (334)
T ss_pred -ccceeeccCcccccc---------ccccCCCceEEecC---------------------cceEEEEEeccCCceeeecc
Confidence 11111111 11111 23577775443331 12238899999998765542
Q ss_pred ccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCc
Q 004574 413 SNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKK 463 (744)
Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~ 463 (744)
.. .+..+-. ..|||||...+ +...-+.|.+|-...++
T Consensus 264 kg---h~gpVhc-------VrFSPdGE~yA----sGSEDGTirlWQt~~~~ 300 (334)
T KOG0278|consen 264 KG---HFGPVHC-------VRFSPDGELYA----SGSEDGTIRLWQTTPGK 300 (334)
T ss_pred cC---CCCceEE-------EEECCCCceee----ccCCCceEEEEEecCCC
Confidence 21 1122332 38999997544 22334556555433333
No 316
>COG4814 Uncharacterized protein with an alpha/beta hydrolase fold [General function prediction only]
Probab=97.46 E-value=0.0012 Score=61.90 Aligned_cols=144 Identities=16% Similarity=0.104 Sum_probs=87.4
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC-----ceeEEEEccCCCCCCCC-CCcccc--cccch--h
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH-----LFCCGIARSGSYNKTLT-PFGFQT--EFRTL--W 646 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~-----~~~~~v~~~~~~~~~~~-~~~~~~--~~~~~--~ 646 (744)
-++.++.+|.++..++ ++-++||||||......+..+.. .+...|.+++.++.... +-.-.. ....+ .
T Consensus 121 wlk~~msyL~~~Y~i~--k~n~VGhSmGg~~~~~Y~~~yg~dks~P~lnK~V~l~gpfN~~~l~~de~v~~v~~~~~~~~ 198 (288)
T COG4814 121 WLKKAMSYLQKHYNIP--KFNAVGHSMGGLGLTYYMIDYGDDKSLPPLNKLVSLAGPFNVGNLVPDETVTDVLKDGPGLI 198 (288)
T ss_pred HHHHHHHHHHHhcCCc--eeeeeeeccccHHHHHHHHHhcCCCCCcchhheEEecccccccccCCCcchheeeccCcccc
Confidence 6788899999987774 89999999999998888877521 35667777777662111 100000 00000 0
Q ss_pred hcH-HHHHhcCcccccCCC--CCCEEEEeeCCC------CCCCCCHHHHHHHHHHHHhCCCcEEEEEeCC--CCcccCcc
Q 004574 647 EAT-NVYIEMSPITHANKI--KKPILIIHGEVD------DKVGLFPMQAERFFDALKGHGALSRLVLLPF--EHHVYAAR 715 (744)
Q Consensus 647 ~~~-~~~~~~~~~~~~~~~--~~P~l~i~G~~D------~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~--~~H~~~~~ 715 (744)
..+ ..|+... -..+ .+.+|+|.|+-| -.|| ...+.-.+..+...++.+...++++ +.|.- .
T Consensus 199 ~t~y~~y~~~n----~k~v~~~~evl~IaGDl~dg~~tDG~Vp--~assls~~~lf~~~~ksy~e~~~~Gk~a~Hs~--l 270 (288)
T COG4814 199 KTPYYDYIAKN----YKKVSPNTEVLLIAGDLDDGKQTDGAVP--WASSLSIYHLFKKNGKSYIESLYKGKDARHSK--L 270 (288)
T ss_pred CcHHHHHHHhc----ceeCCCCcEEEEEecccccCCcCCCcee--chHhHHHHHHhccCcceeEEEeeeCCcchhhc--c
Confidence 000 1111110 1111 466999999754 5576 7778778877777777666666665 56743 2
Q ss_pred ccHHHHHHHHHHHHH
Q 004574 716 ENVMHVIWETDRWLQ 730 (744)
Q Consensus 716 ~~~~~~~~~~~~fl~ 730 (744)
+....+...+..||-
T Consensus 271 hen~~v~~yv~~FLw 285 (288)
T COG4814 271 HENPTVAKYVKNFLW 285 (288)
T ss_pred CCChhHHHHHHHHhh
Confidence 334477788888874
No 317
>PF03096 Ndr: Ndr family; InterPro: IPR004142 This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted []. The precise molecular and cellular function of members of this family is still unknown, yet they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (Common sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases (IPR000073 from INTERPRO), suggesting that this family may have an enzymatic function.; PDB: 2QMQ_A 2XMR_B 2XMQ_B 2XMS_A.
Probab=97.46 E-value=0.006 Score=59.67 Aligned_cols=211 Identities=16% Similarity=0.145 Sum_probs=111.4
Q ss_pred eEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCCCC--C
Q 004574 493 VPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPIIGE--G 570 (744)
Q Consensus 493 ~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~g~--g 570 (744)
..++.+++-..+ +++|++|-.|--|-... ..|...............|.++-. +.+|. |
T Consensus 9 G~v~V~v~G~~~------~~kp~ilT~HDvGlNh~----------scF~~ff~~~~m~~i~~~f~i~Hi---~aPGqe~g 69 (283)
T PF03096_consen 9 GSVHVTVQGDPK------GNKPAILTYHDVGLNHK----------SCFQGFFNFEDMQEILQNFCIYHI---DAPGQEEG 69 (283)
T ss_dssp EEEEEEEESS--------TTS-EEEEE--TT--HH----------HHCHHHHCSHHHHHHHTTSEEEEE---E-TTTSTT
T ss_pred eEEEEEEEecCC------CCCceEEEeccccccch----------HHHHHHhcchhHHHHhhceEEEEE---eCCCCCCC
Confidence 367777765332 24899999996542111 112222222333444566777752 22221 1
Q ss_pred CC--------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCC----------
Q 004574 571 DK--------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKT---------- 632 (744)
Q Consensus 571 ~~--------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~---------- 632 (744)
.. ..++++.+.+..+.+.-.+ +.+..+|-.+|+++-.++|..+|+++.++|+++|.....
T Consensus 70 a~~~p~~y~yPsmd~LAe~l~~Vl~~f~l--k~vIg~GvGAGAnIL~rfAl~~p~~V~GLiLvn~~~~~~gw~Ew~~~K~ 147 (283)
T PF03096_consen 70 AATLPEGYQYPSMDQLAEMLPEVLDHFGL--KSVIGFGVGAGANILARFALKHPERVLGLILVNPTCTAAGWMEWFYQKL 147 (283)
T ss_dssp -----TT-----HHHHHCTHHHHHHHHT-----EEEEEETHHHHHHHHHHHHSGGGEEEEEEES---S---HHHHHHHHH
T ss_pred cccccccccccCHHHHHHHHHHHHHhCCc--cEEEEEeeccchhhhhhccccCccceeEEEEEecCCCCccHHHHHHHHH
Confidence 11 2344566666666554334 469999999999999999999999999999999864311
Q ss_pred ----CCCCcccc--cccchh----------------------------hcH----HHHHhc-CcccccCCCCCCEEEEee
Q 004574 633 ----LTPFGFQT--EFRTLW----------------------------EAT----NVYIEM-SPITHANKIKKPILIIHG 673 (744)
Q Consensus 633 ----~~~~~~~~--~~~~~~----------------------------~~~----~~~~~~-~~~~~~~~~~~P~l~i~G 673 (744)
+...++.. .....| .+. ..|... +.....+...||+|++.|
T Consensus 148 ~~~~L~~~gmt~~~~d~Ll~h~Fg~~~~~~n~Dlv~~yr~~l~~~~Np~Nl~~f~~sy~~R~DL~~~~~~~~c~vLlvvG 227 (283)
T PF03096_consen 148 SSWLLYSYGMTSSVKDYLLWHYFGKEEEENNSDLVQTYRQHLDERINPKNLALFLNSYNSRTDLSIERPSLGCPVLLVVG 227 (283)
T ss_dssp H-------CTTS-HHHHHHHHHS-HHHHHCT-HHHHHHHHHHHT-TTHHHHHHHHHHHHT-----SECTTCCS-EEEEEE
T ss_pred hcccccccccccchHHhhhhcccccccccccHHHHHHHHHHHhcCCCHHHHHHHHHHHhccccchhhcCCCCCCeEEEEe
Confidence 00000000 000000 000 001111 112223456799999999
Q ss_pred CCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 674 EVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 674 ~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
...+. .+.+.++..+|- ..+..++.++++|-... .+.+....+.+.=||..
T Consensus 228 ~~Sp~----~~~vv~~ns~Ld--p~~ttllkv~dcGglV~-eEqP~klaea~~lFlQG 278 (283)
T PF03096_consen 228 DNSPH----VDDVVEMNSKLD--PTKTTLLKVADCGGLVL-EEQPGKLAEAFKLFLQG 278 (283)
T ss_dssp TTSTT----HHHHHHHHHHS---CCCEEEEEETT-TT-HH-HH-HHHHHHHHHHHHHH
T ss_pred cCCcc----hhhHHHHHhhcC--cccceEEEecccCCccc-ccCcHHHHHHHHHHHcc
Confidence 99987 688888988874 45689999999866554 56677777777666653
No 318
>PF07819 PGAP1: PGAP1-like protein; InterPro: IPR012908 The sequences found in this family are similar to PGAP1 (Q765A7 from SWISSPROT). This is an endoplasmic reticulum membrane protein with a catalytic serine-containing motif that is conserved in a number of lipases. PGAP1 functions as a GPI inositol-deacylase; this deacylation is important for the efficient transport of GPI-anchored proteins from the endoplasmic reticulum to the Golgi body [].; GO: 0016788 hydrolase activity, acting on ester bonds, 0006505 GPI anchor metabolic process, 0006886 intracellular protein transport, 0031227 intrinsic to endoplasmic reticulum membrane
Probab=97.46 E-value=0.0019 Score=62.52 Aligned_cols=54 Identities=15% Similarity=0.138 Sum_probs=39.3
Q ss_pred HHHHHHHHHHHcC---CCCCCcEEEEEechHHHHHHHHHHhCC---CceeEEEEccCCCC
Q 004574 577 SAEAAVEEVVRRG---VADPSRIAVGGHSYGAFMTAHLLAHAP---HLFCCGIARSGSYN 630 (744)
Q Consensus 577 d~~~~~~~l~~~~---~~d~~~i~l~G~S~GG~~a~~~~~~~p---~~~~~~v~~~~~~~ 630 (744)
-+.++++.+.+.. ...+++|.|+||||||.+|-.++...+ +.++.+|.++.+..
T Consensus 65 ~~~~~i~~i~~~~~~~~~~~~~vilVgHSmGGlvar~~l~~~~~~~~~v~~iitl~tPh~ 124 (225)
T PF07819_consen 65 FLAEAIKYILELYKSNRPPPRSVILVGHSMGGLVARSALSLPNYDPDSVKTIITLGTPHR 124 (225)
T ss_pred HHHHHHHHHHHhhhhccCCCCceEEEEEchhhHHHHHHHhccccccccEEEEEEEcCCCC
Confidence 4555666665543 345689999999999999887776542 46889988887654
No 319
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.44 E-value=0.00087 Score=67.02 Aligned_cols=198 Identities=14% Similarity=0.061 Sum_probs=103.8
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
..-.++++||..||-... ++.-+||-++-..- .+ .+..+...+.++.|||||+.|++...
T Consensus 147 ~k~vaf~~~gs~latgg~--------dg~lRv~~~Ps~~t---~l---~e~~~~~eV~DL~FS~dgk~lasig~------ 206 (398)
T KOG0771|consen 147 QKVVAFNGDGSKLATGGT--------DGTLRVWEWPSMLT---IL---EEIAHHAEVKDLDFSPDGKFLASIGA------ 206 (398)
T ss_pred ceEEEEcCCCCEeeeccc--------cceEEEEecCcchh---hh---hhHhhcCccccceeCCCCcEEEEecC------
Confidence 356789999999998533 44555554442211 11 11112225778999999999998732
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC---ceeeeeccCCCC-
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP---AVYTAVEPSPDQ- 186 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~---~~~~~~~~SpDG- 186 (744)
....++++ +| ...+++.. ..+....|+-|+
T Consensus 207 --------------------------------------------d~~~VW~~~~g~~~a~~t~~~k~~~~~~cRF~~d~~ 242 (398)
T KOG0771|consen 207 --------------------------------------------DSARVWSVNTGAALARKTPFSKDEMFSSCRFSVDNA 242 (398)
T ss_pred --------------------------------------------CceEEEEeccCchhhhcCCcccchhhhhceecccCC
Confidence 23445566 66 55555533 455677888887
Q ss_pred --ceEEEEEeeCCcccccccCCCcceEEEEeCC-CCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEee
Q 004574 187 --KYVLITSMHRPYSYKVPCARFSQKVQVWTTD-GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQ 263 (744)
Q Consensus 187 --~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~-g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~ 263 (744)
...++....... ... ..++.+|+-+ -.+.++.... ..++..++.|+||+. ++.-..
T Consensus 243 ~~~l~laa~~~~~~-~v~-----~~~~~~w~~~~~l~~~~~~~~-------------~~siSsl~VS~dGkf-~AlGT~- 301 (398)
T KOG0771|consen 243 QETLRLAASQFPGG-GVR-----LCDISLWSGSNFLRLRKKIKR-------------FKSISSLAVSDDGKF-LALGTM- 301 (398)
T ss_pred CceEEEEEecCCCC-cee-----EEEeeeeccccccchhhhhhc-------------cCcceeEEEcCCCcE-EEEecc-
Confidence 222222222110 000 0123333322 0011111111 112668899999985 554431
Q ss_pred cCCCCCccCCccceEEeccCCCCCCCCceEeeee--ccceeceeeccCCceEEEeeeeeccceeEEEEcC
Q 004574 264 DRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL--DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCP 331 (744)
Q Consensus 264 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~--~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~ 331 (744)
.+.+-++++ ..=+...+.+. .+.+..+.||||.++++-.+.+ ....+..+.+
T Consensus 302 -----------dGsVai~~~---~~lq~~~~vk~aH~~~VT~ltF~Pdsr~~~svSs~--~~~~v~~l~v 355 (398)
T KOG0771|consen 302 -----------DGSVAIYDA---KSLQRLQYVKEAHLGFVTGLTFSPDSRYLASVSSD--NEAAVTKLAV 355 (398)
T ss_pred -----------CCcEEEEEe---ceeeeeEeehhhheeeeeeEEEcCCcCcccccccC--CceeEEEEee
Confidence 223555554 22222333332 4467889999999998875522 3334544444
No 320
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=97.41 E-value=0.011 Score=57.29 Aligned_cols=121 Identities=8% Similarity=0.020 Sum_probs=61.7
Q ss_pred ceeceeeccCCc-eEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcc
Q 004574 300 RFRSVSWCDDSL-ALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQ 378 (744)
Q Consensus 300 ~~~~~~~SpDg~-~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~ 378 (744)
-+..+++||-.. .++..+-. ....||.-+ + ..+..+.-+..+ |...++|.+||.+|+..+.
T Consensus 209 iisc~a~sP~~~~~~a~gsY~--q~~giy~~~--~--~~pl~llggh~g------GvThL~~~edGn~lfsGaR------ 270 (406)
T KOG2919|consen 209 IISCFAFSPMDSKTLAVGSYG--QRVGIYNDD--G--RRPLQLLGGHGG------GVTHLQWCEDGNKLFSGAR------ 270 (406)
T ss_pred eeeeeeccCCCCcceeeeccc--ceeeeEecC--C--CCceeeecccCC------CeeeEEeccCcCeeccccc------
Confidence 456678888755 45544311 223355443 3 344444433333 3444789999998876652
Q ss_pred eEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEE
Q 004574 379 IYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILS 458 (744)
Q Consensus 379 ~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~ 458 (744)
....|..||+.-. ...++.-... +. -+..---..+.|+|+.|+ +..+.+.+.+||
T Consensus 271 --------------k~dkIl~WDiR~~-~~pv~~L~rh-----v~-~TNQRI~FDld~~~~~La----sG~tdG~V~vwd 325 (406)
T KOG2919|consen 271 --------------KDDKILCWDIRYS-RDPVYALERH-----VG-DTNQRILFDLDPKGEILA----SGDTDGSVRVWD 325 (406)
T ss_pred --------------CCCeEEEEeehhc-cchhhhhhhh-----cc-CccceEEEecCCCCceee----ccCCCccEEEEe
Confidence 2234778887422 1122221111 00 000000135567777766 344456788888
Q ss_pred CCC-Cc
Q 004574 459 WPL-KK 463 (744)
Q Consensus 459 ~~~-g~ 463 (744)
+++ |.
T Consensus 326 lk~~gn 331 (406)
T KOG2919|consen 326 LKDLGN 331 (406)
T ss_pred cCCCCC
Confidence 876 55
No 321
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.39 E-value=0.023 Score=61.30 Aligned_cols=144 Identities=17% Similarity=0.130 Sum_probs=89.4
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeee-ccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRE-LCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
..+...++||||++|++.--++ .+-+|-+++-.... |-.+. .| +..+..|||+
T Consensus 509 ddvL~v~~Spdgk~LaVsLLdn-------------TVkVyflDtlKFflsLYGHk----LP---------V~smDIS~DS 562 (888)
T KOG0306|consen 509 DDVLCVSVSPDGKLLAVSLLDN-------------TVKVYFLDTLKFFLSLYGHK----LP---------VLSMDISPDS 562 (888)
T ss_pred ccEEEEEEcCCCcEEEEEeccC-------------eEEEEEecceeeeeeecccc----cc---------eeEEeccCCc
Confidence 6677899999999999886554 44444444432221 11111 12 6678899999
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
+- |+-.++ +.+..+|-++. |.=.+.+..++..+.++.|-|+. ++.|++ .. ...+..+|.+.
T Consensus 563 kl-ivTgSA----------DKnVKiWGLdF----GDCHKS~fAHdDSvm~V~F~P~~-~~FFt~-gK--D~kvKqWDg~k 623 (888)
T KOG0306|consen 563 KL-IVTGSA----------DKNVKIWGLDF----GDCHKSFFAHDDSVMSVQFLPKT-HLFFTC-GK--DGKVKQWDGEK 623 (888)
T ss_pred Ce-EEeccC----------CCceEEecccc----chhhhhhhcccCceeEEEEcccc-eeEEEe-cC--cceEEeechhh
Confidence 85 554432 23334666554 23345577778888999999954 556655 22 23566676443
Q ss_pred CCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 334 KDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 334 ~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.+..+..+++...++. ++.+|+|.+++..+
T Consensus 624 --Fe~iq~L~~H~~ev~c------Lav~~~G~~vvs~s 653 (888)
T KOG0306|consen 624 --FEEIQKLDGHHSEVWC------LAVSPNGSFVVSSS 653 (888)
T ss_pred --hhhheeeccchheeee------eEEcCCCCeEEecc
Confidence 4555666666555544 56799999888776
No 322
>KOG4840 consensus Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
Probab=97.38 E-value=0.0069 Score=55.57 Aligned_cols=86 Identities=12% Similarity=0.123 Sum_probs=61.0
Q ss_pred chhHHHHHhCCeEEEecCC-CCCCCCCCCC---hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC--CCc
Q 004574 545 PTSSLIFLARRFAVLAGPS-IPIIGEGDKL---PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA--PHL 618 (744)
Q Consensus 545 ~~~~~~~~~~G~~v~~~~~-~~~~g~g~~~---~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~--p~~ 618 (744)
......+.+.+|..+.+-. ..+.|+|... -.+|+..+++++.-.+. ..+|+++|||.|..-.++.+.+. |..
T Consensus 56 ~~L~~~lde~~wslVq~q~~Ssy~G~Gt~slk~D~edl~~l~~Hi~~~~f--St~vVL~GhSTGcQdi~yYlTnt~~~r~ 133 (299)
T KOG4840|consen 56 TMLNRYLDENSWSLVQPQLRSSYNGYGTFSLKDDVEDLKCLLEHIQLCGF--STDVVLVGHSTGCQDIMYYLTNTTKDRK 133 (299)
T ss_pred HHHHHHHhhccceeeeeeccccccccccccccccHHHHHHHHHHhhccCc--ccceEEEecCccchHHHHHHHhccchHH
Confidence 3556677889998886322 2345566553 33477777776654433 24899999999999888887443 567
Q ss_pred eeEEEEccCCCCCC
Q 004574 619 FCCGIARSGSYNKT 632 (744)
Q Consensus 619 ~~~~v~~~~~~~~~ 632 (744)
+.++|+.+|+.|++
T Consensus 134 iraaIlqApVSDrE 147 (299)
T KOG4840|consen 134 IRAAILQAPVSDRE 147 (299)
T ss_pred HHHHHHhCccchhh
Confidence 89999999998865
No 323
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=97.37 E-value=0.028 Score=57.59 Aligned_cols=266 Identities=10% Similarity=0.082 Sum_probs=150.5
Q ss_pred cccceeecCCCCe-EEEeeecccccccCCCceeEEEEECCCCceec-cccCCCccccccccceEEecCCcEEEEEecCCC
Q 004574 32 KINFVSWSPDGKR-IAFSVRVDEEDNVSSCKLRVWIADAETGEAKP-LFESPDICLNAVFGSFVWVNNSTLLIFTIPSSR 109 (744)
Q Consensus 32 ~~~~p~~SpDG~~-laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~-lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~ 109 (744)
.+++.+|||---+ +|.++ +-..+|| +..+-+.++ +..-.. .+.+..|-.||+.|+.. ++.
T Consensus 28 ~vssl~fsp~~P~d~aVt~---------S~rvqly--~~~~~~~~k~~srFk~-----~v~s~~fR~DG~LlaaG--D~s 89 (487)
T KOG0310|consen 28 SVSSLCFSPKHPYDFAVTS---------SVRVQLY--SSVTRSVRKTFSRFKD-----VVYSVDFRSDGRLLAAG--DES 89 (487)
T ss_pred cceeEecCCCCCCceEEec---------ccEEEEE--ecchhhhhhhHHhhcc-----ceeEEEeecCCeEEEcc--CCc
Confidence 6788999993322 45543 3345565 544545444 322222 46677888999988764 111
Q ss_pred CCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC--CeeecCCC-ceeeeeccCCCC
Q 004574 110 RDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG--TAKDFGTP-AVYTAVEPSPDQ 186 (744)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G--~~~~l~~~-~~~~~~~~SpDG 186 (744)
+.+-++|.+. -.+++..+ ..+....|+|++
T Consensus 90 -----------------------------------------------G~V~vfD~k~r~iLR~~~ah~apv~~~~f~~~d 122 (487)
T KOG0310|consen 90 -----------------------------------------------GHVKVFDMKSRVILRQLYAHQAPVHVTKFSPQD 122 (487)
T ss_pred -----------------------------------------------CcEEEeccccHHHHHHHhhccCceeEEEecccC
Confidence 4566666533 33444444 667788899999
Q ss_pred ceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCC
Q 004574 187 KYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRG 266 (744)
Q Consensus 187 ~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~ 266 (744)
+.++....++ ..+-.|++++..+ ++ ...... ..++..+|+|-... ++++..
T Consensus 123 ~t~l~s~sDd------------~v~k~~d~s~a~v-~~-~l~~ht----------DYVR~g~~~~~~~h-ivvtGs---- 173 (487)
T KOG0310|consen 123 NTMLVSGSDD------------KVVKYWDLSTAYV-QA-ELSGHT----------DYVRCGDISPANDH-IVVTGS---- 173 (487)
T ss_pred CeEEEecCCC------------ceEEEEEcCCcEE-EE-EecCCc----------ceeEeeccccCCCe-EEEecC----
Confidence 9998776554 3566678887764 21 111111 22667789988776 666631
Q ss_pred CCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccc-
Q 004574 267 DANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRV- 345 (744)
Q Consensus 267 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~- 345 (744)
+.+.|-++|.+. .+ +...-......+.++.+-|.|..|+.+. -..+.++|+.++ ++ .++...
T Consensus 174 -------YDg~vrl~DtR~-~~-~~v~elnhg~pVe~vl~lpsgs~iasAg-----Gn~vkVWDl~~G-~q--ll~~~~~ 236 (487)
T KOG0310|consen 174 -------YDGKVRLWDTRS-LT-SRVVELNHGCPVESVLALPSGSLIASAG-----GNSVKVWDLTTG-GQ--LLTSMFN 236 (487)
T ss_pred -------CCceEEEEEecc-CC-ceeEEecCCCceeeEEEcCCCCEEEEcC-----CCeEEEEEecCC-ce--ehhhhhc
Confidence 223455666521 22 2222233466788889999999888765 234778888864 22 222111
Q ss_pred cccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeee
Q 004574 346 FENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVAL 425 (744)
Q Consensus 346 ~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~ 425 (744)
... -...+....|++.|+..+ ...++.+||..+-++.-=|.-.
T Consensus 237 H~K-----tVTcL~l~s~~~rLlS~s---------------------LD~~VKVfd~t~~Kvv~s~~~~----------- 279 (487)
T KOG0310|consen 237 HNK-----TVTCLRLASDSTRLLSGS---------------------LDRHVKVFDTTNYKVVHSWKYP----------- 279 (487)
T ss_pred ccc-----eEEEEEeecCCceEeecc---------------------cccceEEEEccceEEEEeeecc-----------
Confidence 110 001144556777776653 2346888886544443222211
Q ss_pred ecCCcceecccCCCEEEEEEe
Q 004574 426 VFGQGEEDINLNQLKILTSKE 446 (744)
Q Consensus 426 ~~~~~~~~~s~d~~~~~~~~~ 446 (744)
++.-.+++|||++.++..-+
T Consensus 280 -~pvLsiavs~dd~t~viGms 299 (487)
T KOG0310|consen 280 -GPVLSIAVSPDDQTVVIGMS 299 (487)
T ss_pred -cceeeEEecCCCceEEEecc
Confidence 11123689999998886443
No 324
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=97.37 E-value=0.011 Score=63.04 Aligned_cols=150 Identities=13% Similarity=0.110 Sum_probs=92.7
Q ss_pred CCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCce-eccccCCCccccccccceEEecCCcEEEEEe
Q 004574 27 YPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA-KPLFESPDICLNAVFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 27 ~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~-~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~ 105 (744)
+++...++..+.||||++||+.+- ....||.+..+.... +.+-..+.. .-.+....++-|+..+++.+
T Consensus 379 ~k~~~nIs~~aiSPdg~~Ia~st~---------~~~~iy~L~~~~~vk~~~v~~~~~~--~~~a~~i~ftid~~k~~~~s 447 (691)
T KOG2048|consen 379 TKEKENISCAAISPDGNLIAISTV---------SRTKIYRLQPDPNVKVINVDDVPLA--LLDASAISFTIDKNKLFLVS 447 (691)
T ss_pred cCCccceeeeccCCCCCEEEEeec---------cceEEEEeccCcceeEEEeccchhh--hccceeeEEEecCceEEEEe
Confidence 344446788899999999999642 235677776665221 222111111 11245667888888877764
Q ss_pred cCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC-Ce---eecCC---Cceee
Q 004574 106 PSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG-TA---KDFGT---PAVYT 178 (744)
Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G-~~---~~l~~---~~~~~ 178 (744)
.. ..++..+.+++ .. ..+.. ...+.
T Consensus 448 ~~------------------------------------------------~~~le~~el~~ps~kel~~~~~~~~~~~I~ 479 (691)
T KOG2048|consen 448 KN------------------------------------------------IFSLEEFELETPSFKELKSIQSQAKCPSIS 479 (691)
T ss_pred cc------------------------------------------------cceeEEEEecCcchhhhhccccccCCCcce
Confidence 11 04566666632 22 22222 16788
Q ss_pred eeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEE
Q 004574 179 AVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLY 258 (744)
Q Consensus 179 ~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~ 258 (744)
.+.-||||++|+..... ..|++|++++.+.+.+....... +...+++|.-...|+
T Consensus 480 ~l~~SsdG~yiaa~~t~-------------g~I~v~nl~~~~~~~l~~rln~~------------vTa~~~~~~~~~~lv 534 (691)
T KOG2048|consen 480 RLVVSSDGNYIAAISTR-------------GQIFVYNLETLESHLLKVRLNID------------VTAAAFSPFVRNRLV 534 (691)
T ss_pred eEEEcCCCCEEEEEecc-------------ceEEEEEcccceeecchhccCcc------------eeeeeccccccCcEE
Confidence 99999999999998755 37999999999888876332221 445577765544354
Q ss_pred EE
Q 004574 259 WV 260 (744)
Q Consensus 259 ~~ 260 (744)
..
T Consensus 535 va 536 (691)
T KOG2048|consen 535 VA 536 (691)
T ss_pred EE
Confidence 44
No 325
>COG4947 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.33 E-value=0.00021 Score=62.25 Aligned_cols=106 Identities=21% Similarity=0.237 Sum_probs=73.4
Q ss_pred CcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCCCCCcccccccchhhcHHHHHhcCcccccCCC---------
Q 004574 594 SRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTLTPFGFQTEFRTLWEATNVYIEMSPITHANKI--------- 664 (744)
Q Consensus 594 ~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 664 (744)
.+..+-|.||||+.|+.+..++|+.|.++|+++|+++....-.++..... .+ .+|..+++.+
T Consensus 101 gs~~~sgcsmGayhA~nfvfrhP~lftkvialSGvYdardffg~yyddDv-~y--------nsP~dylpg~~dp~~l~rl 171 (227)
T COG4947 101 GSTIVSGCSMGAYHAANFVFRHPHLFTKVIALSGVYDARDFFGGYYDDDV-YY--------NSPSDYLPGLADPFRLERL 171 (227)
T ss_pred CCccccccchhhhhhhhhheeChhHhhhheeecceeeHHHhccccccCce-ee--------cChhhhccCCcChHHHHHH
Confidence 56788999999999999999999999999999999885432222222111 11 2333333332
Q ss_pred -CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCccc
Q 004574 665 -KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVY 712 (744)
Q Consensus 665 -~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~ 712 (744)
...+.+..|..|+.. .+.+++-+.|..+.++..+.++.+..|.+
T Consensus 172 r~~~~vfc~G~e~~~L----~~~~~L~~~l~dKqipaw~~~WggvaHdw 216 (227)
T COG4947 172 RRIDMVFCIGDEDPFL----DNNQHLSRLLSDKQIPAWMHVWGGVAHDW 216 (227)
T ss_pred hhccEEEEecCccccc----cchHHHHHHhccccccHHHHHhccccccc
Confidence 345788889888874 46677878887777777777776666643
No 326
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=97.30 E-value=0.068 Score=62.80 Aligned_cols=106 Identities=12% Similarity=0.080 Sum_probs=57.5
Q ss_pred ccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEee-eeccceeceeeccCCceEEEeeeeeccce
Q 004574 246 SISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILH-KLDLRFRSVSWCDDSLALVNETWYKTSQT 324 (744)
Q Consensus 246 ~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~-~~~~~~~~~~~SpDg~~l~~~~~~~~~~~ 324 (744)
.++|..||.+ ++..+...... .+..+-+++. + |...... +.++-...++|.|.|..|+.+.. .+...
T Consensus 214 ~ISWRGDG~y-FAVss~~~~~~------~~R~iRVy~R---e-G~L~stSE~v~gLe~~l~WrPsG~lIA~~q~-~~~~~ 281 (928)
T PF04762_consen 214 RISWRGDGEY-FAVSSVEPETG------SRRVIRVYSR---E-GELQSTSEPVDGLEGALSWRPSGNLIASSQR-LPDRH 281 (928)
T ss_pred EEEECCCCcE-EEEEEEEcCCC------ceeEEEEECC---C-ceEEeccccCCCccCCccCCCCCCEEEEEEE-cCCCc
Confidence 6899999986 44443211111 1234777776 3 3222111 12444567899999999988763 33444
Q ss_pred eEEEEcCCCCCCcceeee-c--cccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 325 RTWLVCPGSKDVAPRVLF-D--RVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 325 ~l~~~~~~~~~~~~~~l~-~--~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.|.-.--+|- .....+ . ..... .-.+.|++|+..|+...
T Consensus 282 ~VvFfErNGL--rhgeF~l~~~~~~~~------v~~l~Wn~ds~iLAv~~ 323 (928)
T PF04762_consen 282 DVVFFERNGL--RHGEFTLRFDPEEEK------VIELAWNSDSEILAVWL 323 (928)
T ss_pred EEEEEecCCc--EeeeEecCCCCCCce------eeEEEECCCCCEEEEEe
Confidence 4544444441 111111 0 11111 12277999999999987
No 327
>PF00975 Thioesterase: Thioesterase domain; InterPro: IPR001031 Thioesterase domains often occur integrated in or associated with peptide synthetases which are involved in the non-ribosomal synthesis of peptide antibiotics []. Thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates.; GO: 0016788 hydrolase activity, acting on ester bonds, 0009058 biosynthetic process; PDB: 2RON_A 2K2Q_B 3LCR_B 2HFJ_B 1MNQ_A 1MN6_B 1MNA_B 2HFK_B 2H7Y_B 2H7X_A ....
Probab=97.29 E-value=0.0033 Score=61.51 Aligned_cols=35 Identities=11% Similarity=0.156 Sum_probs=27.1
Q ss_pred CcEEEEEechHHHHHHHHHHhC---CCceeEEEEccCC
Q 004574 594 SRIAVGGHSYGAFMTAHLLAHA---PHLFCCGIARSGS 628 (744)
Q Consensus 594 ~~i~l~G~S~GG~~a~~~~~~~---p~~~~~~v~~~~~ 628 (744)
.++.|+|||+||.+|.-+|.+- -..+..++++++.
T Consensus 66 gp~~L~G~S~Gg~lA~E~A~~Le~~G~~v~~l~liD~~ 103 (229)
T PF00975_consen 66 GPYVLAGWSFGGILAFEMARQLEEAGEEVSRLILIDSP 103 (229)
T ss_dssp SSEEEEEETHHHHHHHHHHHHHHHTT-SESEEEEESCS
T ss_pred CCeeehccCccHHHHHHHHHHHHHhhhccCceEEecCC
Confidence 3899999999999999888542 2347778888754
No 328
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=97.28 E-value=0.014 Score=59.16 Aligned_cols=236 Identities=13% Similarity=0.156 Sum_probs=130.4
Q ss_pred CCccceeEeecCCCCCCCCce-------eeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCce
Q 004574 2 PFFTGIGIHRLLPDDSLGPEK-------EVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA 74 (744)
Q Consensus 2 ~~~~~~~~~~~~~~~~~g~~~-------~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~ 74 (744)
...+++++++.....+..+.. +|.+... .....+|++.-+--.... ..+....+|-+...+.+.
T Consensus 144 t~~~dv~Vfd~tk~~s~~~~~~~~~Pdl~L~gH~~--eg~glsWn~~~~g~Lls~-------~~d~~i~lwdi~~~~~~~ 214 (422)
T KOG0264|consen 144 TSSGDVYVFDYTKHPSKPKASGECRPDLRLKGHEK--EGYGLSWNRQQEGTLLSG-------SDDHTICLWDINAESKED 214 (422)
T ss_pred CCCCCEEEEEeccCCCcccccccCCCceEEEeecc--cccccccccccceeEeec-------cCCCcEEEEeccccccCC
Confidence 456788888886643333322 5554333 144578988766544422 124445555444333322
Q ss_pred eccccCC-CccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccce
Q 004574 75 KPLFESP-DICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDY 153 (744)
Q Consensus 75 ~~lt~~~-~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (744)
+.+.... .......+..++|.|-.+.|+-...+
T Consensus 215 ~~~~p~~~~~~h~~~VeDV~~h~~h~~lF~sv~d---------------------------------------------- 248 (422)
T KOG0264|consen 215 KVVDPKTIFSGHEDVVEDVAWHPLHEDLFGSVGD---------------------------------------------- 248 (422)
T ss_pred ccccceEEeecCCcceehhhccccchhhheeecC----------------------------------------------
Confidence 2221000 01112246688999877666543211
Q ss_pred eeeeEEEEEcC-CC--CeeecCCC--ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCC
Q 004574 154 YTTAQLVLGSL-DG--TAKDFGTP--AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLP 228 (744)
Q Consensus 154 ~~~~~l~~~~~-~G--~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~ 228 (744)
.+.|.++|+ ++ +++..... +.+..++|+|=+..|+-+...+ ..+.+||+..-..+..+ .
T Consensus 249 --d~~L~iwD~R~~~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S~D------------~tV~LwDlRnL~~~lh~-~- 312 (422)
T KOG0264|consen 249 --DGKLMIWDTRSNTSKPSHSVKAHSAEVNCVAFNPFNEFILATGSAD------------KTVALWDLRNLNKPLHT-F- 312 (422)
T ss_pred --CCeEEEEEcCCCCCCCcccccccCCceeEEEeCCCCCceEEeccCC------------CcEEEeechhcccCcee-c-
Confidence 046777777 53 33333333 7788999999999888776543 47899987643322111 1
Q ss_pred CCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCC---------CCCCceEeeeecc
Q 004574 229 PAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPA---------EGEKPEILHKLDL 299 (744)
Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~---------~~~~~~~l~~~~~ 299 (744)
......+..+.|||+-.. ++-++.. +. ++.++|+... +.+.+..|+...+
T Consensus 313 ---------e~H~dev~~V~WSPh~et-vLASSg~---D~--------rl~vWDls~ig~eq~~eda~dgppEllF~HgG 371 (422)
T KOG0264|consen 313 ---------EGHEDEVFQVEWSPHNET-VLASSGT---DR--------RLNVWDLSRIGEEQSPEDAEDGPPELLFIHGG 371 (422)
T ss_pred ---------cCCCcceEEEEeCCCCCc-eeEeccc---CC--------cEEEEeccccccccChhhhccCCcceeEEecC
Confidence 111122667899999987 5444322 21 3555554211 2233444554443
Q ss_pred ---ceeceeeccCCceEEEeeeeeccceeEEEEc
Q 004574 300 ---RFRSVSWCDDSLALVNETWYKTSQTRTWLVC 330 (744)
Q Consensus 300 ---~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~ 330 (744)
.+..++|.|..-+++.++. .++.-+||.+.
T Consensus 372 H~~kV~DfsWnp~ePW~I~Sva-eDN~LqIW~~s 404 (422)
T KOG0264|consen 372 HTAKVSDFSWNPNEPWTIASVA-EDNILQIWQMA 404 (422)
T ss_pred cccccccccCCCCCCeEEEEec-CCceEEEeecc
Confidence 5778999999999888773 33556677654
No 329
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.27 E-value=0.0019 Score=64.74 Aligned_cols=152 Identities=18% Similarity=0.172 Sum_probs=85.6
Q ss_pred CCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCC---cEEEEEe
Q 004574 30 GAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNS---TLLIFTI 105 (744)
Q Consensus 30 ~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg---~~l~~~~ 105 (744)
...+....|||||+.||++.. + ...|| ++++|- ...+|..... .-+....|+-|+ ...+++.
T Consensus 186 ~~eV~DL~FS~dgk~lasig~--------d-~~~VW--~~~~g~~~a~~t~~~k~---~~~~~cRF~~d~~~~~l~laa~ 251 (398)
T KOG0771|consen 186 HAEVKDLDFSPDGKFLASIGA--------D-SARVW--SVNTGAALARKTPFSKD---EMFSSCRFSVDNAQETLRLAAS 251 (398)
T ss_pred cCccccceeCCCCcEEEEecC--------C-ceEEE--EeccCchhhhcCCcccc---hhhhhceecccCCCceEEEEEe
Confidence 336788999999999999863 2 44555 666663 3334422211 124566888776 3334432
Q ss_pred cCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCC-ceeeeeccCC
Q 004574 106 PSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTP-AVYTAVEPSP 184 (744)
Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~-~~~~~~~~Sp 184 (744)
....+.. + .....+|..+.-+..++.... ..++.++.|+
T Consensus 252 ~~~~~~v-----------------------~-----------------~~~~~~w~~~~~l~~~~~~~~~~siSsl~VS~ 291 (398)
T KOG0771|consen 252 QFPGGGV-----------------------R-----------------LCDISLWSGSNFLRLRKKIKRFKSISSLAVSD 291 (398)
T ss_pred cCCCCce-----------------------e-----------------EEEeeeeccccccchhhhhhccCcceeEEEcC
Confidence 2111110 0 000123322211233444444 6889999999
Q ss_pred CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEE
Q 004574 185 DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWV 260 (744)
Q Consensus 185 DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~ 260 (744)
||+.+++..+.. .+-+++...-+..++...... .-+.+++|+||.++ ++-+
T Consensus 292 dGkf~AlGT~dG-------------sVai~~~~~lq~~~~vk~aH~-----------~~VT~ltF~Pdsr~-~~sv 342 (398)
T KOG0771|consen 292 DGKFLALGTMDG-------------SVAIYDAKSLQRLQYVKEAHL-----------GFVTGLTFSPDSRY-LASV 342 (398)
T ss_pred CCcEEEEeccCC-------------cEEEEEeceeeeeEeehhhhe-----------eeeeeEEEcCCcCc-cccc
Confidence 999999998764 566777665554444322111 11567899999987 5444
No 330
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=97.27 E-value=0.0053 Score=57.19 Aligned_cols=198 Identities=14% Similarity=0.186 Sum_probs=116.0
Q ss_pred CCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc--eeccccCCCccccccccceEEecCCcEEEEE
Q 004574 27 YPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE--AKPLFESPDICLNAVFGSFVWVNNSTLLIFT 104 (744)
Q Consensus 27 ~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~--~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~ 104 (744)
+.+...+...+||.|.++|+--.. + .-|.+.++..-+ ++.+...+. ++..+.|-...+.|+..
T Consensus 97 f~hkhivk~~af~~ds~~lltgg~--------e--kllrvfdln~p~App~E~~ghtg-----~Ir~v~wc~eD~~iLSS 161 (334)
T KOG0278|consen 97 FEHKHIVKAVAFSQDSNYLLTGGQ--------E--KLLRVFDLNRPKAPPKEISGHTG-----GIRTVLWCHEDKCILSS 161 (334)
T ss_pred hhhhheeeeEEecccchhhhccch--------H--HHhhhhhccCCCCCchhhcCCCC-----cceeEEEeccCceEEee
Confidence 344445778888998888866221 1 223334444333 333433332 46677888877777654
Q ss_pred ecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCCceeeeecc
Q 004574 105 IPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTPAVYTAVEP 182 (744)
Q Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~~~~~~~~~ 182 (744)
.++ ..+.++|. +| +.+.|..+..+.++..
T Consensus 162 add-------------------------------------------------~tVRLWD~rTgt~v~sL~~~s~VtSlEv 192 (334)
T KOG0278|consen 162 ADD-------------------------------------------------KTVRLWDHRTGTEVQSLEFNSPVTSLEV 192 (334)
T ss_pred ccC-------------------------------------------------CceEEEEeccCcEEEEEecCCCCcceee
Confidence 221 35666688 88 6666655577888999
Q ss_pred CCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEe
Q 004574 183 SPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEA 262 (744)
Q Consensus 183 SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~ 262 (744)
|+||+.|... .. ..+-.||++.=...+-...|-. +.+.+.+|+-. ++.+.
T Consensus 193 s~dG~ilTia-~g-------------ssV~Fwdaksf~~lKs~k~P~n-------------V~SASL~P~k~--~fVaG- 242 (334)
T KOG0278|consen 193 SQDGRILTIA-YG-------------SSVKFWDAKSFGLLKSYKMPCN-------------VESASLHPKKE--FFVAG- 242 (334)
T ss_pred ccCCCEEEEe-cC-------------ceeEEeccccccceeeccCccc-------------cccccccCCCc--eEEec-
Confidence 9999977544 22 2566677654322222222222 44567888874 44331
Q ss_pred ecCCCCCccCCccceEEeccCCCCCCCCceEee--eeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 263 QDRGDANVEVSPRDIIYTQPAEPAEGEKPEILH--KLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 263 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~--~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
+.+. .+|.+|- .+++..... ...+.+..+.|||||...+..+ .++..+||.+....
T Consensus 243 --ged~--------~~~kfDy---~TgeEi~~~nkgh~gpVhcVrFSPdGE~yAsGS--EDGTirlWQt~~~~ 300 (334)
T KOG0278|consen 243 --GEDF--------KVYKFDY---NTGEEIGSYNKGHFGPVHCVRFSPDGELYASGS--EDGTIRLWQTTPGK 300 (334)
T ss_pred --Ccce--------EEEEEec---cCCceeeecccCCCCceEEEEECCCCceeeccC--CCceEEEEEecCCC
Confidence 1111 2666665 555544443 2356788899999998666544 44777888877543
No 331
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.22 E-value=0.097 Score=56.63 Aligned_cols=296 Identities=12% Similarity=0.080 Sum_probs=155.2
Q ss_pred CccceeEeecCCCCCCCCc-eeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCC
Q 004574 3 FFTGIGIHRLLPDDSLGPE-KEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESP 81 (744)
Q Consensus 3 ~~~~~~~~~~~~~~~~g~~-~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~ 81 (744)
.+++|.+++.+. .+. +++.+..+. ....-.|-|+-+++|.+++ ...|.++++.|....-+..+.
T Consensus 301 aeQnl~l~d~~~----l~i~k~ivG~ndE-I~Dm~~lG~e~~~laVATN----------s~~lr~y~~~~~~c~ii~GH~ 365 (775)
T KOG0319|consen 301 AEQNLFLYDEDE----LTIVKQIVGYNDE-ILDMKFLGPEESHLAVATN----------SPELRLYTLPTSYCQIIPGHT 365 (775)
T ss_pred ccceEEEEEccc----cEEehhhcCCchh-heeeeecCCccceEEEEeC----------CCceEEEecCCCceEEEeCch
Confidence 468899997755 432 344422222 3446678899999999854 346666788888877554333
Q ss_pred CccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEE
Q 004574 82 DICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVL 161 (744)
Q Consensus 82 ~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 161 (744)
+ .+.++.-..+|-.|+-.+.+ ....+|+
T Consensus 366 e-----~vlSL~~~~~g~llat~sKD-----------------------------------------------~svilWr 393 (775)
T KOG0319|consen 366 E-----AVLSLDVWSSGDLLATGSKD-----------------------------------------------KSVILWR 393 (775)
T ss_pred h-----heeeeeecccCcEEEEecCC-----------------------------------------------ceEEEEE
Confidence 3 24444422355444433211 1256777
Q ss_pred EcC-CCCe---eecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcc
Q 004574 162 GSL-DGTA---KDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVC 236 (744)
Q Consensus 162 ~~~-~G~~---~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~ 236 (744)
++- .++. .+.+.+ ..+..++.+.-|-.++.+...+ ..+-+|++..++...- +......+.
T Consensus 394 ~~~~~~~~~~~a~~~gH~~svgava~~~~~asffvsvS~D------------~tlK~W~l~~s~~~~~---~~~~~~~~t 458 (775)
T KOG0319|consen 394 LNNNCSKSLCVAQANGHTNSVGAVAGSKLGASFFVSVSQD------------CTLKLWDLPKSKETAF---PIVLTCRYT 458 (775)
T ss_pred ecCCcchhhhhhhhcccccccceeeecccCccEEEEecCC------------ceEEEecCCCcccccc---cceehhhHH
Confidence 744 2222 122222 5555666777776666554443 2566777655221111 000000000
Q ss_pred cCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEe
Q 004574 237 YNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNE 316 (744)
Q Consensus 237 ~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~ 316 (744)
.-+....+..++.+|+.+- ++-.+ .+ ..-.||.++ ...-...|..+..++..+.|||..+.++..
T Consensus 459 ~~aHdKdIN~Vaia~ndkL-iAT~S-qD---------ktaKiW~le----~~~l~~vLsGH~RGvw~V~Fs~~dq~laT~ 523 (775)
T KOG0319|consen 459 ERAHDKDINCVAIAPNDKL-IATGS-QD---------KTAKIWDLE----QLRLLGVLSGHTRGVWCVSFSKNDQLLATC 523 (775)
T ss_pred HHhhcccccceEecCCCce-EEecc-cc---------cceeeeccc----CceEEEEeeCCccceEEEEeccccceeEec
Confidence 1112233778899999873 33332 11 122345443 122223344456778899999999988887
Q ss_pred eeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCce
Q 004574 317 TWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPF 396 (744)
Q Consensus 317 ~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 396 (744)
+.+ ..-.||.++ + .+-..-..++...+ .-.+|-.+|+.|+.... +| -
T Consensus 524 SgD--~TvKIW~is--~--fSClkT~eGH~~aV------lra~F~~~~~qliS~~a-------------dG--------l 570 (775)
T KOG0319|consen 524 SGD--KTVKIWSIS--T--FSCLKTFEGHTSAV------LRASFIRNGKQLISAGA-------------DG--------L 570 (775)
T ss_pred cCC--ceEEEEEec--c--ceeeeeecCcccee------EeeeeeeCCcEEEeccC-------------CC--------c
Confidence 622 334555554 4 22222223222211 11235667777766542 11 3
Q ss_pred EEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCC
Q 004574 397 LDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQL 439 (744)
Q Consensus 397 l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~ 439 (744)
|.+|+.++++...-.+... +.++.+ +.++++.
T Consensus 571 iKlWnikt~eC~~tlD~H~----DrvWaL-------~~~~~~~ 602 (775)
T KOG0319|consen 571 IKLWNIKTNECEMTLDAHN----DRVWAL-------SVSPLLD 602 (775)
T ss_pred EEEEeccchhhhhhhhhcc----ceeEEE-------eecCccc
Confidence 8899999888765554433 346655 4456665
No 332
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=97.22 E-value=0.011 Score=64.62 Aligned_cols=200 Identities=17% Similarity=0.182 Sum_probs=106.1
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEec-CCcEEEEEecCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVN-NSTLLIFTIPSSRR 110 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wsp-Dg~~l~~~~~~~~~ 110 (744)
.+=..+||.++ +|.-.+. +....|| .+...+--.+..+.+ .|..++|.| |.++++-.+-+
T Consensus 371 DILDlSWSKn~-fLLSSSM--------DKTVRLW--h~~~~~CL~~F~Hnd-----fVTcVaFnPvDDryFiSGSLD--- 431 (712)
T KOG0283|consen 371 DILDLSWSKNN-FLLSSSM--------DKTVRLW--HPGRKECLKVFSHND-----FVTCVAFNPVDDRYFISGSLD--- 431 (712)
T ss_pred hheecccccCC-eeEeccc--------cccEEee--cCCCcceeeEEecCC-----eeEEEEecccCCCcEeecccc---
Confidence 35678999987 5555443 4456677 444545444444443 577889999 55544432211
Q ss_pred CCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCce
Q 004574 111 DPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKY 188 (744)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~ 188 (744)
+.+.++++ +-++.-.++- ..+..++|+|||+.
T Consensus 432 ----------------------------------------------~KvRiWsI~d~~Vv~W~Dl~~lITAvcy~PdGk~ 465 (712)
T KOG0283|consen 432 ----------------------------------------------GKVRLWSISDKKVVDWNDLRDLITAVCYSPDGKG 465 (712)
T ss_pred ----------------------------------------------cceEEeecCcCeeEeehhhhhhheeEEeccCCce
Confidence 34555566 4455545544 67889999999999
Q ss_pred EEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCC-CCccceecCCCceEEEEEeecCCC
Q 004574 189 VLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREG-MRSISWRADKPSTLYWVEAQDRGD 267 (744)
Q Consensus 189 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~spDg~~~l~~~~~~~~~~ 267 (744)
.++.+... ....|+..+-+...-.......+. ..... +.++.+.|-...-|+.+++
T Consensus 466 avIGt~~G-------------~C~fY~t~~lk~~~~~~I~~~~~K-----k~~~~rITG~Q~~p~~~~~vLVTSn----- 522 (712)
T KOG0283|consen 466 AVIGTFNG-------------YCRFYDTEGLKLVSDFHIRLHNKK-----KKQGKRITGLQFFPGDPDEVLVTSN----- 522 (712)
T ss_pred EEEEEecc-------------EEEEEEccCCeEEEeeeEeeccCc-----cccCceeeeeEecCCCCCeEEEecC-----
Confidence 98886653 344555444432221111100000 00011 3345554433221444431
Q ss_pred CCccCCccceEEeccCCCCCCCCceEeeee---ccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 268 ANVEVSPRDIIYTQPAEPAEGEKPEILHKL---DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 268 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~---~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
-.+|-+++.. .......+.+ ...-...+|+.||++|+.++ ++ . .+|+++.+.
T Consensus 523 -------DSrIRI~d~~---~~~lv~KfKG~~n~~SQ~~Asfs~Dgk~IVs~s-eD-s--~VYiW~~~~ 577 (712)
T KOG0283|consen 523 -------DSRIRIYDGR---DKDLVHKFKGFRNTSSQISASFSSDGKHIVSAS-ED-S--WVYIWKNDS 577 (712)
T ss_pred -------CCceEEEecc---chhhhhhhcccccCCcceeeeEccCCCEEEEee-cC-c--eEEEEeCCC
Confidence 1246666651 1222222222 22335678999999999987 22 2 466666654
No 333
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=97.20 E-value=0.002 Score=64.21 Aligned_cols=124 Identities=15% Similarity=0.184 Sum_probs=85.4
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
.+|..++.-. +..+ ...|.+..+++...|+||..|.--++ ...|=++|+.+.+.++....+...
T Consensus 322 kkvRfwD~Rs----~~~~--~sv~~gg~vtSl~ls~~g~~lLsssR----------Ddtl~viDlRt~eI~~~~sA~g~k 385 (459)
T KOG0288|consen 322 KKVRFWDIRS----ADKT--RSVPLGGRVTSLDLSMDGLELLSSSR----------DDTLKVIDLRTKEIRQTFSAEGFK 385 (459)
T ss_pred cceEEEeccC----Ccee--eEeecCcceeeEeeccCCeEEeeecC----------CCceeeeecccccEEEEeeccccc
Confidence 3455566522 3323 33466668999999999999877432 245777899998888776555433
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL 164 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 164 (744)
..-.+....||||+++++..+.+ +.||++++
T Consensus 386 ~asDwtrvvfSpd~~YvaAGS~d-------------------------------------------------gsv~iW~v 416 (459)
T KOG0288|consen 386 CASDWTRVVFSPDGSYVAAGSAD-------------------------------------------------GSVYIWSV 416 (459)
T ss_pred cccccceeEECCCCceeeeccCC-------------------------------------------------CcEEEEEc
Confidence 22236678999999999875321 68999999
Q ss_pred -CCCeeecCCC----ceeeeeccCCCCceEEEEE
Q 004574 165 -DGTAKDFGTP----AVYTAVEPSPDQKYVLITS 193 (744)
Q Consensus 165 -~G~~~~l~~~----~~~~~~~~SpDG~~i~~~~ 193 (744)
+|+.+.+... ..+...+|+|-|+.++-..
T Consensus 417 ~tgKlE~~l~~s~s~~aI~s~~W~~sG~~Llsad 450 (459)
T KOG0288|consen 417 FTGKLEKVLSLSTSNAAITSLSWNPSGSGLLSAD 450 (459)
T ss_pred cCceEEEEeccCCCCcceEEEEEcCCCchhhccc
Confidence 7766554332 3588999999999887553
No 334
>PRK02888 nitrous-oxide reductase; Validated
Probab=97.16 E-value=0.044 Score=59.55 Aligned_cols=56 Identities=16% Similarity=0.048 Sum_probs=34.4
Q ss_pred eeecCCCCeEEEeeecccccccCCCceeEEEEECCCCce-eccccCCCccccccccceEEecCCcEEEEEec
Q 004574 36 VSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA-KPLFESPDICLNAVFGSFVWVNNSTLLIFTIP 106 (744)
Q Consensus 36 p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~-~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~ 106 (744)
.=++|||+.+.-. .+..+-+-++|.++.+. .++.= .. ......++|||++++++..
T Consensus 198 ~PlpnDGk~l~~~---------~ey~~~vSvID~etmeV~~qV~V-dg-----npd~v~~spdGk~afvTsy 254 (635)
T PRK02888 198 IPLPNDGKDLDDP---------KKYRSLFTAVDAETMEVAWQVMV-DG-----NLDNVDTDYDGKYAFSTCY 254 (635)
T ss_pred cccCCCCCEeecc---------cceeEEEEEEECccceEEEEEEe-CC-----CcccceECCCCCEEEEecc
Confidence 3457788766322 15557778888886543 22211 11 1235689999999998864
No 335
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=97.15 E-value=0.0065 Score=59.92 Aligned_cols=137 Identities=15% Similarity=0.151 Sum_probs=86.1
Q ss_pred ccceeEeecCCCCCCCCceeeec---C-CCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceecccc
Q 004574 4 FTGIGIHRLLPDDSLGPEKEVHG---Y-PDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFE 79 (744)
Q Consensus 4 ~~~~~~~~~~~~~~~g~~~~l~~---~-~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~ 79 (744)
+..||++++.. .+.|.. . |...+....+.|+++.++||=.. ...++|.+.|+.+=++.-...
T Consensus 105 ee~IyIydI~~------MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s--------~t~GdV~l~d~~nl~~v~~I~ 170 (391)
T KOG2110|consen 105 EESIYIYDIKD------MKLLHTIETTPPNPKGLCALSPNNANCYLAYPGS--------TTSGDVVLFDTINLQPVNTIN 170 (391)
T ss_pred cccEEEEeccc------ceeehhhhccCCCccceEeeccCCCCceEEecCC--------CCCceEEEEEcccceeeeEEE
Confidence 34599999955 233322 2 22324556667777789999422 346788888977655443322
Q ss_pred CCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEE
Q 004574 80 SPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQL 159 (744)
Q Consensus 80 ~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 159 (744)
.. .+.+..+++||||.+|+-++.. | .-|
T Consensus 171 aH----~~~lAalafs~~G~llATASeK--G----------------------------------------------TVI 198 (391)
T KOG2110|consen 171 AH----KGPLAALAFSPDGTLLATASEK--G----------------------------------------------TVI 198 (391)
T ss_pred ec----CCceeEEEECCCCCEEEEeccC--c----------------------------------------------eEE
Confidence 11 2246688999999999876431 1 234
Q ss_pred EEEcC-CC-CeeecCCC---ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC
Q 004574 160 VLGSL-DG-TAKDFGTP---AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK 219 (744)
Q Consensus 160 ~~~~~-~G-~~~~l~~~---~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~ 219 (744)
.++.+ +| +..++--+ -.+.+++||||++.|..+++.. .|.++.++..
T Consensus 199 RVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~Te-------------TVHiFKL~~~ 250 (391)
T KOG2110|consen 199 RVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTE-------------TVHIFKLEKV 250 (391)
T ss_pred EEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEecCCC-------------eEEEEEeccc
Confidence 55566 66 44444333 5677999999999988876553 5666666543
No 336
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.13 E-value=0.1 Score=50.14 Aligned_cols=228 Identities=10% Similarity=0.049 Sum_probs=113.4
Q ss_pred CCCCCcccceeecCCCC-eEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEe
Q 004574 27 YPDGAKINFVSWSPDGK-RIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 27 ~~~~~~~~~p~~SpDG~-~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~ 105 (744)
+|.. .....++|--. -+||+++ -+.--++.|..+.+.-++-...+.++ .++.=.|||||++||.+.
T Consensus 66 lpaR--~Hgi~~~p~~~ravafARr---------PGtf~~vfD~~~~~~pv~~~s~~~RH--fyGHGvfs~dG~~LYATE 132 (366)
T COG3490 66 LPAR--GHGIAFHPALPRAVAFARR---------PGTFAMVFDPNGAQEPVTLVSQEGRH--FYGHGVFSPDGRLLYATE 132 (366)
T ss_pred cccc--cCCeecCCCCcceEEEEec---------CCceEEEECCCCCcCcEEEecccCce--eecccccCCCCcEEEeec
Confidence 4543 44556666544 4666553 23456777888776544332222222 234558999999999874
Q ss_pred cCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeec---CCC-ceeeeec
Q 004574 106 PSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDF---GTP-AVYTAVE 181 (744)
Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l---~~~-~~~~~~~ 181 (744)
.+-+.. .+-|-++|..-..+++ ... -+...+.
T Consensus 133 ndfd~~--------------------------------------------rGViGvYd~r~~fqrvgE~~t~GiGpHev~ 168 (366)
T COG3490 133 NDFDPN--------------------------------------------RGVIGVYDAREGFQRVGEFSTHGIGPHEVT 168 (366)
T ss_pred CCCCCC--------------------------------------------CceEEEEecccccceecccccCCcCcceeE
Confidence 432111 1455566663323333 223 4566889
Q ss_pred cCCCCceEEEEEeeC----Cccc-ccccCCCcceEEEEe-CCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCc
Q 004574 182 PSPDQKYVLITSMHR----PYSY-KVPCARFSQKVQVWT-TDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPS 255 (744)
Q Consensus 182 ~SpDG~~i~~~~~~~----~~~~-~~~~~~~~~~l~~~~-~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~ 255 (744)
|.+||+.|+.....- ++.. ..........+-+.+ ..|..+.+.+-.+... ...++.+..-+||+
T Consensus 169 lm~DGrtlvvanGGIethpdfgR~~lNldsMePSlvlld~atG~liekh~Lp~~l~---------~lSiRHld~g~dgt- 238 (366)
T COG3490 169 LMADGRTLVVANGGIETHPDFGRTELNLDSMEPSLVLLDAATGNLIEKHTLPASLR---------QLSIRHLDIGRDGT- 238 (366)
T ss_pred EecCCcEEEEeCCceecccccCccccchhhcCccEEEEeccccchhhhccCchhhh---------hcceeeeeeCCCCc-
Confidence 999999998774411 1100 011112223556666 4444433332211110 11266788889997
Q ss_pred eEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee--------ccceeceeeccCCceEEEeeeeeccceeEE
Q 004574 256 TLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL--------DLRFRSVSWCDDSLALVNETWYKTSQTRTW 327 (744)
Q Consensus 256 ~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~--------~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~ 327 (744)
+.|.-...+.... .--|.-.-. . +++.++.+. ...+.+++...+-.+++.++. .-+...
T Consensus 239 -vwfgcQy~G~~~d-----~ppLvg~~~---~-g~~l~~~~~pee~~~~~anYigsiA~n~~~glV~lTSP---~GN~~v 305 (366)
T COG3490 239 -VWFGCQYRGPRND-----LPPLVGHFR---K-GEPLEFLDLPEEQTAAFANYIGSIAANRRDGLVALTSP---RGNRAV 305 (366)
T ss_pred -EEEEEEeeCCCcc-----CCcceeecc---C-CCcCcccCCCHHHHHHHHhhhhheeecccCCeEEEecC---CCCeEE
Confidence 5554322221110 001222222 2 333333221 234566676666666666552 234577
Q ss_pred EEcCCCC
Q 004574 328 LVCPGSK 334 (744)
Q Consensus 328 ~~~~~~~ 334 (744)
++|.+++
T Consensus 306 i~da~tG 312 (366)
T COG3490 306 IWDAATG 312 (366)
T ss_pred EEEcCCC
Confidence 8888885
No 337
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=97.11 E-value=0.032 Score=54.20 Aligned_cols=62 Identities=18% Similarity=0.274 Sum_probs=41.4
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCCcEEEEEec
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNSTLLIFTIP 106 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~ 106 (744)
++-=+|++|++.||... .+.+|.|+...+.+ .+++-....+ ...+..+.|+|.+..|+-.+.
T Consensus 13 itchAwn~drt~iAv~~----------~~~evhiy~~~~~~~w~~~htls~H--d~~vtgvdWap~snrIvtcs~ 75 (361)
T KOG1523|consen 13 ITCHAWNSDRTQIAVSP----------NNHEVHIYSMLGADLWEPAHTLSEH--DKIVTGVDWAPKSNRIVTCSH 75 (361)
T ss_pred eeeeeecCCCceEEecc----------CCceEEEEEecCCCCceeceehhhh--CcceeEEeecCCCCceeEccC
Confidence 56678999999999953 34577777777877 3333222221 223567899999988886543
No 338
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=97.09 E-value=0.0011 Score=43.86 Aligned_cols=37 Identities=11% Similarity=0.050 Sum_probs=26.4
Q ss_pred eEeeeeccceeceeeccCCceEEEeeeee-ccceeEEE
Q 004574 292 EILHKLDLRFRSVSWCDDSLALVNETWYK-TSQTRTWL 328 (744)
Q Consensus 292 ~~l~~~~~~~~~~~~SpDg~~l~~~~~~~-~~~~~l~~ 328 (744)
++++........++|||||++|+|.+... .+..+||+
T Consensus 2 ~~~t~~~~~~~~p~~SpDGk~i~f~s~~~~~g~~diy~ 39 (39)
T PF07676_consen 2 KQLTNSPGDDGSPAWSPDGKYIYFTSNRNDRGSFDIYV 39 (39)
T ss_dssp EEES-SSSSEEEEEE-TTSSEEEEEEECT--SSEEEEE
T ss_pred cCcccCCccccCEEEecCCCEEEEEecCCCCCCcCEEC
Confidence 45666777888999999999999998433 26677774
No 339
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=97.08 E-value=0.039 Score=55.76 Aligned_cols=188 Identities=11% Similarity=0.105 Sum_probs=106.9
Q ss_pred CCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCCcEEEEEe
Q 004574 27 YPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 27 ~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~ 105 (744)
.++...++.....|.|+|+..+++ ...+-.-+..+|. ...++.. ..+. .....++.|||-.+....
T Consensus 300 ~~h~~~V~~ls~h~tgeYllsAs~----------d~~w~Fsd~~~g~~lt~vs~~-~s~v--~~ts~~fHpDgLifgtgt 366 (506)
T KOG0289|consen 300 RPHEEPVTGLSLHPTGEYLLSASN----------DGTWAFSDISSGSQLTVVSDE-TSDV--EYTSAAFHPDGLIFGTGT 366 (506)
T ss_pred ccccccceeeeeccCCcEEEEecC----------CceEEEEEccCCcEEEEEeec-cccc--eeEEeeEcCCceEEeccC
Confidence 345556888999999999988653 1223333444443 2333221 1111 245679999995443221
Q ss_pred cCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ceeeeecc
Q 004574 106 PSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AVYTAVEP 182 (744)
Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~~~~~~~ 182 (744)
. .++|-++|+ ++ +..++.-+ +.+..++|
T Consensus 367 ~-------------------------------------------------d~~vkiwdlks~~~~a~Fpght~~vk~i~F 397 (506)
T KOG0289|consen 367 P-------------------------------------------------DGVVKIWDLKSQTNVAKFPGHTGPVKAISF 397 (506)
T ss_pred C-------------------------------------------------CceEEEEEcCCccccccCCCCCCceeEEEe
Confidence 1 167888888 55 56666556 88999999
Q ss_pred CCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-eeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEE
Q 004574 183 SPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-LVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVE 261 (744)
Q Consensus 183 SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~ 261 (744)
|.+|-+++..++.. .+.+||+.-- ..+.+.... . .++..+.|.+.|++ |....
T Consensus 398 sENGY~Lat~add~-------------~V~lwDLRKl~n~kt~~l~~---~---------~~v~s~~fD~SGt~-L~~~g 451 (506)
T KOG0289|consen 398 SENGYWLATAADDG-------------SVKLWDLRKLKNFKTIQLDE---K---------KEVNSLSFDQSGTY-LGIAG 451 (506)
T ss_pred ccCceEEEEEecCC-------------eEEEEEehhhcccceeeccc---c---------ccceeEEEcCCCCe-EEeec
Confidence 99999998876553 5888997532 233322221 1 12556778888886 33331
Q ss_pred eecCCCCCccCCccceEEeccCCCCCCCCceEeeee---ccceeceeeccCCceEEEee
Q 004574 262 AQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL---DLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~---~~~~~~~~~SpDg~~l~~~~ 317 (744)
..-+||++.. .+.+.+.+... .+-...+.|-.+-++++..+
T Consensus 452 ------------~~l~Vy~~~k---~~k~W~~~~~~~~~sg~st~v~Fg~~aq~l~s~s 495 (506)
T KOG0289|consen 452 ------------SDLQVYICKK---KTKSWTEIKELADHSGLSTGVRFGEHAQYLASTS 495 (506)
T ss_pred ------------ceeEEEEEec---ccccceeeehhhhcccccceeeecccceEEeecc
Confidence 1124677664 44444443321 22333444555555555544
No 340
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.07 E-value=0.066 Score=50.08 Aligned_cols=239 Identities=18% Similarity=0.221 Sum_probs=117.0
Q ss_pred cceeEeecCCCCCCCCceeeecCCC-CCcccceeecC--CCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCC
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPD-GAKINFVSWSP--DGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESP 81 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~-~~~~~~p~~Sp--DG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~ 81 (744)
+.|.+..+..+ |+...+..|.. ..-+....|.- -|..||-.+. + .++-+..-++|+-.++....
T Consensus 33 ~tVkIf~v~~n---~~s~ll~~L~Gh~GPVwqv~wahPk~G~iLAScsY--------D--gkVIiWke~~g~w~k~~e~~ 99 (299)
T KOG1332|consen 33 GTVKIFEVRNN---GQSKLLAELTGHSGPVWKVAWAHPKFGTILASCSY--------D--GKVIIWKEENGRWTKAYEHA 99 (299)
T ss_pred ccEEEEEEcCC---CCceeeeEecCCCCCeeEEeecccccCcEeeEeec--------C--ceEEEEecCCCchhhhhhhh
Confidence 45666666552 33233332222 11355556655 7888887643 3 44444465677655553322
Q ss_pred CccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEE
Q 004574 82 DICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVL 161 (744)
Q Consensus 82 ~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 161 (744)
.+...+..++|.|.+.-|.......+|. -.|..
T Consensus 100 --~h~~SVNsV~wapheygl~LacasSDG~---------------------------------------------vsvl~ 132 (299)
T KOG1332|consen 100 --AHSASVNSVAWAPHEYGLLLACASSDGK---------------------------------------------VSVLT 132 (299)
T ss_pred --hhcccceeecccccccceEEEEeeCCCc---------------------------------------------EEEEE
Confidence 2333567889999765433222121221 34555
Q ss_pred EcCCC--CeeecCCC--ceeeeeccCCC---CceEEEEEeeCCcccccccCCCcceEEEEeCCCCe---eeeccCCCCCC
Q 004574 162 GSLDG--TAKDFGTP--AVYTAVEPSPD---QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL---VRELCDLPPAE 231 (744)
Q Consensus 162 ~~~~G--~~~~l~~~--~~~~~~~~SpD---G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~---~~~l~~~~~~~ 231 (744)
++-+| ...++... -++.+.+|+|- |.-+ -.....+ .+....+.-.+.+-+|+.+..+ .+.|..+
T Consensus 133 ~~~~g~w~t~ki~~aH~~GvnsVswapa~~~g~~~-~~~~~~~-~krlvSgGcDn~VkiW~~~~~~w~~e~~l~~H---- 206 (299)
T KOG1332|consen 133 YDSSGGWTTSKIVFAHEIGVNSVSWAPASAPGSLV-DQGPAAK-VKRLVSGGCDNLVKIWKFDSDSWKLERTLEGH---- 206 (299)
T ss_pred EcCCCCccchhhhhccccccceeeecCcCCCcccc-ccCcccc-cceeeccCCccceeeeecCCcchhhhhhhhhc----
Confidence 55554 45555443 66778889987 5211 0000000 0111111112345555544321 1112111
Q ss_pred CCCcccCCccCCCCccceecCCC---ceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC--ceEeeeeccceeceee
Q 004574 232 DIPVCYNSVREGMRSISWRADKP---STLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK--PEILHKLDLRFRSVSW 306 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~spDg~---~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~~~l~~~~~~~~~~~~ 306 (744)
..=+|+++|.|.-. ..|+.++ .+ .+..||..+. +.++ .+.|.+....+..++|
T Consensus 207 ---------~dwVRDVAwaP~~gl~~s~iAS~S----qD------g~viIwt~~~---e~e~wk~tll~~f~~~~w~vSW 264 (299)
T KOG1332|consen 207 ---------KDWVRDVAWAPSVGLPKSTIASCS----QD------GTVIIWTKDE---EYEPWKKTLLEEFPDVVWRVSW 264 (299)
T ss_pred ---------chhhhhhhhccccCCCceeeEEec----CC------CcEEEEEecC---ccCcccccccccCCcceEEEEE
Confidence 11177889998742 1122221 11 2224566554 3222 2223334556778999
Q ss_pred ccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 307 CDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 307 SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
|.-|..|+.+. .++...||.-++++
T Consensus 265 S~sGn~LaVs~--GdNkvtlwke~~~G 289 (299)
T KOG1332|consen 265 SLSGNILAVSG--GDNKVTLWKENVDG 289 (299)
T ss_pred eccccEEEEec--CCcEEEEEEeCCCC
Confidence 99999988765 22445677766664
No 341
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=97.04 E-value=0.07 Score=53.98 Aligned_cols=162 Identities=10% Similarity=0.082 Sum_probs=89.6
Q ss_pred EEEEEcC--CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC-CCeeeeccCCCCCCCC
Q 004574 158 QLVLGSL--DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD-GKLVRELCDLPPAEDI 233 (744)
Q Consensus 158 ~l~~~~~--~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~-g~~~~~l~~~~~~~~~ 233 (744)
.|+++.+ ..+...+... ..+......|.|.+++-.+++.-+.| .++. |......... ..+
T Consensus 284 ~i~vws~~~~s~~~~~~~h~~~V~~ls~h~tgeYllsAs~d~~w~F-------------sd~~~g~~lt~vs~~-~s~-- 347 (506)
T KOG0289|consen 284 IIRVWSVPLSSEPTSSRPHEEPVTGLSLHPTGEYLLSASNDGTWAF-------------SDISSGSQLTVVSDE-TSD-- 347 (506)
T ss_pred eEEeeccccccCccccccccccceeeeeccCCcEEEEecCCceEEE-------------EEccCCcEEEEEeec-ccc--
Confidence 4444444 3344333333 66778888999999887766642221 2332 3333333322 110
Q ss_pred CcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEeeeeccceeceeeccCCce
Q 004574 234 PVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEILHKLDLRFRSVSWCDDSLA 312 (744)
Q Consensus 234 ~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~~~~~~~~~~~~SpDg~~ 312 (744)
-.....+|.||| |.|... ...+.|-++++ +++. ..++..+.+.+..++||.+|-|
T Consensus 348 --------v~~ts~~fHpDg---Lifgtg----------t~d~~vkiwdl---ks~~~~a~Fpght~~vk~i~FsENGY~ 403 (506)
T KOG0289|consen 348 --------VEYTSAAFHPDG---LIFGTG----------TPDGVVKIWDL---KSQTNVAKFPGHTGPVKAISFSENGYW 403 (506)
T ss_pred --------ceeEEeeEcCCc---eEEecc----------CCCceEEEEEc---CCccccccCCCCCCceeEEEeccCceE
Confidence 114467899999 556521 13446788887 4333 2334445677889999999999
Q ss_pred EEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEE
Q 004574 313 LVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAK 370 (744)
Q Consensus 313 l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~ 370 (744)
|+..+++. .+.++|+--. ...+.+.-.... +...+.+...|++|+..
T Consensus 404 Lat~add~----~V~lwDLRKl-~n~kt~~l~~~~------~v~s~~fD~SGt~L~~~ 450 (506)
T KOG0289|consen 404 LATAADDG----SVKLWDLRKL-KNFKTIQLDEKK------EVNSLSFDQSGTYLGIA 450 (506)
T ss_pred EEEEecCC----eEEEEEehhh-cccceeeccccc------cceeEEEcCCCCeEEee
Confidence 99887322 3667776542 111222111111 11225577888887765
No 342
>PF00151 Lipase: Lipase; InterPro: IPR013818 Triglyceride lipases (3.1.1.3 from EC) are lipolytic enzymes that hydrolyse ester linkages of triglycerides []. Lipases are widely distributed in animals, plants and prokaryotes. At least three tissue-specific isozymes exist in higher vertebrates, pancreatic, hepatic and gastric/lingual. These lipases are closely related to each other and to lipoprotein lipase (3.1.1.34 from EC), which hydrolyses triglycerides of chylomicrons and very low density lipoproteins (VLDL) []. The most conserved region in all these proteins is centred around a serine residue which has been shown [] to participate, with an histidine and an aspartic acid residue, in a charge relay system. Such a region is also present in lipases of prokaryotic origin and in lecithin-cholesterol acyltransferase (2.3.1.43 from EC) (LCAT) [], which catalyzes fatty acid transfer between phosphatidylcholine and cholesterol.; PDB: 1LPB_B 1LPA_B 1N8S_A 1GPL_A 1W52_X 2PVS_B 2OXE_B 1BU8_A 2PPL_A 1ETH_A ....
Probab=97.02 E-value=0.0012 Score=67.52 Aligned_cols=52 Identities=13% Similarity=0.017 Sum_probs=39.0
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC--ceeEEEEccCC
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH--LFCCGIARSGS 628 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~--~~~~~v~~~~~ 628 (744)
.+...+.+|.....++.++|.|+|||+||++|..++..... ++..+..+.|.
T Consensus 133 ~la~~l~~L~~~~g~~~~~ihlIGhSLGAHvaG~aG~~~~~~~ki~rItgLDPA 186 (331)
T PF00151_consen 133 QLAKFLSFLINNFGVPPENIHLIGHSLGAHVAGFAGKYLKGGGKIGRITGLDPA 186 (331)
T ss_dssp HHHHHHHHHHHHH---GGGEEEEEETCHHHHHHHHHHHTTT---SSEEEEES-B
T ss_pred HHHHHHHHHHhhcCCChhHEEEEeeccchhhhhhhhhhccCcceeeEEEecCcc
Confidence 56666777776556888999999999999999998887755 68888888886
No 343
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.00 E-value=0.0091 Score=67.43 Aligned_cols=219 Identities=15% Similarity=0.222 Sum_probs=129.3
Q ss_pred eeeecCCCCCcccceeecCCCCe----EEEeeecccccccCCCceeEEEEEC--CCCceeccccCCCccccccccceEEe
Q 004574 22 KEVHGYPDGAKINFVSWSPDGKR----IAFSVRVDEEDNVSSCKLRVWIADA--ETGEAKPLFESPDICLNAVFGSFVWV 95 (744)
Q Consensus 22 ~~l~~~~~~~~~~~p~~SpDG~~----laf~~~~~~~~~~~~~~~~l~~~~~--~gg~~~~lt~~~~~~~~~~~~~~~ws 95 (744)
+.+-.+....+-...+|++-|.. ||=.. .+|+-.||-.+. ++++.--|..... +.+-|..+.|+
T Consensus 56 k~~~s~~s~~rF~kL~W~~~g~~~~GlIaGG~--------edG~I~ly~p~~~~~~~~~~~la~~~~--h~G~V~gLDfN 125 (1049)
T KOG0307|consen 56 KPVGSLQSSNRFNKLAWGSYGSHSHGLIAGGL--------EDGNIVLYDPASIIANASEEVLATKSK--HTGPVLGLDFN 125 (1049)
T ss_pred cccccccccccceeeeecccCCCccceeeccc--------cCCceEEecchhhccCcchHHHhhhcc--cCCceeeeecc
Confidence 44444555557788999999998 44321 144444443332 1333333322211 23346778999
Q ss_pred cCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCC--
Q 004574 96 NNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGT-- 173 (744)
Q Consensus 96 pDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~-- 173 (744)
+.+..++.... ..++|+++|++.-.++.+.
T Consensus 126 ~~q~nlLASGa------------------------------------------------~~geI~iWDlnn~~tP~~~~~ 157 (1049)
T KOG0307|consen 126 PFQGNLLASGA------------------------------------------------DDGEILIWDLNKPETPFTPGS 157 (1049)
T ss_pred ccCCceeeccC------------------------------------------------CCCcEEEeccCCcCCCCCCCC
Confidence 98875544311 1178999999552222222
Q ss_pred --C-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-eeeeccCCCCCCCCCcccCCccCCCCccce
Q 004574 174 --P-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-LVRELCDLPPAEDIPVCYNSVREGMRSISW 249 (744)
Q Consensus 174 --~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (744)
. ..+..++|...-.+|+-..... ...-+||+..+ .+..+++.... .....++|
T Consensus 158 ~~~~~eI~~lsWNrkvqhILAS~s~s------------g~~~iWDlr~~~pii~ls~~~~~-----------~~~S~l~W 214 (1049)
T KOG0307|consen 158 QAPPSEIKCLSWNRKVSHILASGSPS------------GRAVIWDLRKKKPIIKLSDTPGR-----------MHCSVLAW 214 (1049)
T ss_pred CCCcccceEeccchhhhHHhhccCCC------------CCceeccccCCCcccccccCCCc-----------cceeeeee
Confidence 1 6677888887777776554332 35777888755 34444444332 11447899
Q ss_pred ecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEE
Q 004574 250 RADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLV 329 (744)
Q Consensus 250 spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~ 329 (744)
.||....|+.++ .+...+ .|-++|++ +.+...+.++.+...+-+++|++.+..++.++ . ...+++.+
T Consensus 215 hP~~aTql~~As-~dd~~P--------viqlWDlR-~assP~k~~~~H~~GilslsWc~~D~~lllSs-g--kD~~ii~w 281 (1049)
T KOG0307|consen 215 HPDHATQLLVAS-GDDSAP--------VIQLWDLR-FASSPLKILEGHQRGILSLSWCPQDPRLLLSS-G--KDNRIICW 281 (1049)
T ss_pred CCCCceeeeeec-CCCCCc--------eeEeeccc-ccCCchhhhcccccceeeeccCCCCchhhhcc-c--CCCCeeEe
Confidence 999987455544 222222 36777763 23334455566788899999999987776665 2 33468888
Q ss_pred cCCCC
Q 004574 330 CPGSK 334 (744)
Q Consensus 330 ~~~~~ 334 (744)
+.+++
T Consensus 282 N~~tg 286 (1049)
T KOG0307|consen 282 NPNTG 286 (1049)
T ss_pred cCCCc
Confidence 88774
No 344
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=96.99 E-value=0.068 Score=53.44 Aligned_cols=149 Identities=14% Similarity=0.067 Sum_probs=79.7
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
..+.-+.|.|-..-|++++.-+ +.+.+|+.++++ ...|. ++.- +.++.|+-||
T Consensus 132 rrVg~V~wHPtA~NVLlsag~D------------n~v~iWnv~tgeali~l~-hpd~-------------i~S~sfn~dG 185 (472)
T KOG0303|consen 132 RRVGLVQWHPTAPNVLLSAGSD------------NTVSIWNVGTGEALITLD-HPDM-------------VYSMSFNRDG 185 (472)
T ss_pred eeEEEEeecccchhhHhhccCC------------ceEEEEeccCCceeeecC-CCCe-------------EEEEEeccCC
Confidence 3455678999888887775443 479999998765 33333 3221 6678999999
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEee-ee-ccceeceeeccCCceEEEeeeeeccceeEEEEcC
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILH-KL-DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCP 331 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~-~~-~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~ 331 (744)
.. |+-+. ....|-+++. -.++...-- .+ ......+.|-.+|. |+.+...+-...++-++|.
T Consensus 186 s~-l~Ttc------------kDKkvRv~dp---r~~~~v~e~~~heG~k~~Raifl~~g~-i~tTGfsr~seRq~aLwdp 248 (472)
T KOG0303|consen 186 SL-LCTTC------------KDKKVRVIDP---RRGTVVSEGVAHEGAKPARAIFLASGK-IFTTGFSRMSERQIALWDP 248 (472)
T ss_pred ce-eeeec------------ccceeEEEcC---CCCcEeeecccccCCCcceeEEeccCc-eeeeccccccccceeccCc
Confidence 86 44442 1124666665 223322211 11 22345577888998 4443322223345555565
Q ss_pred CCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEee
Q 004574 332 GSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIK 372 (744)
Q Consensus 332 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~ 372 (744)
+.- .++..+.+.+... |....=|.+|-+.|+...+
T Consensus 249 ~nl-~eP~~~~elDtSn-----Gvl~PFyD~dt~ivYl~GK 283 (472)
T KOG0303|consen 249 NNL-EEPIALQELDTSN-----GVLLPFYDPDTSIVYLCGK 283 (472)
T ss_pred ccc-cCcceeEEeccCC-----ceEEeeecCCCCEEEEEec
Confidence 543 2333333222211 2211226777776655554
No 345
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=96.98 E-value=0.062 Score=57.19 Aligned_cols=57 Identities=19% Similarity=0.357 Sum_probs=42.9
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc-eeccccCCCccccccccceEEecCCcEEEEE
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE-AKPLFESPDICLNAVFGSFVWVNNSTLLIFT 104 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~-~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~ 104 (744)
.+...++-|||..+..+ .++.|+++|...|. ...|-.+.+ .+--++||.||+.++..
T Consensus 14 ci~d~afkPDGsqL~lA-----------Ag~rlliyD~ndG~llqtLKgHKD-----tVycVAys~dGkrFASG 71 (1081)
T KOG1538|consen 14 CINDIAFKPDGTQLILA-----------AGSRLLVYDTSDGTLLQPLKGHKD-----TVYCVAYAKDGKRFASG 71 (1081)
T ss_pred chheeEECCCCceEEEe-----------cCCEEEEEeCCCcccccccccccc-----eEEEEEEccCCceeccC
Confidence 57889999999999984 34789999987554 455543333 46678999999988643
No 346
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=96.97 E-value=0.0092 Score=58.73 Aligned_cols=226 Identities=14% Similarity=0.117 Sum_probs=124.9
Q ss_pred ceeEeecCCCCCCCCc-eeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc--eeccccCCC
Q 004574 6 GIGIHRLLPDDSLGPE-KEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE--AKPLFESPD 82 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~-~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~--~~~lt~~~~ 82 (744)
.|++++... |+. +-|.+. ...+...+++.-|++||-.+. +-.-+|| +.++-. .+.+...++
T Consensus 131 tikv~D~~t----g~~e~~LrGH--t~sv~di~~~a~Gk~l~tcSs--------Dl~~~LW--d~~~~~~c~ks~~gh~h 194 (406)
T KOG0295|consen 131 TIKVFDTET----GELERSLRGH--TDSVFDISFDASGKYLATCSS--------DLSAKLW--DFDTFFRCIKSLIGHEH 194 (406)
T ss_pred eEEEEEccc----hhhhhhhhcc--ccceeEEEEecCccEEEecCC--------ccchhhe--eHHHHHHHHHHhcCccc
Confidence 566777655 544 233322 225888999999999988643 2224555 544421 222222222
Q ss_pred ccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEE
Q 004574 83 ICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLG 162 (744)
Q Consensus 83 ~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 162 (744)
.++.+.+-|-|.+|+..+.+ ..|..+
T Consensus 195 -----~vS~V~f~P~gd~ilS~srD-------------------------------------------------~tik~W 220 (406)
T KOG0295|consen 195 -----GVSSVFFLPLGDHILSCSRD-------------------------------------------------NTIKAW 220 (406)
T ss_pred -----ceeeEEEEecCCeeeecccc-------------------------------------------------cceeEE
Confidence 57788999999888765332 345666
Q ss_pred cC-CC-CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCC
Q 004574 163 SL-DG-TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNS 239 (744)
Q Consensus 163 ~~-~G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~ 239 (744)
+. +| ...+++.+ ..+..+..+.||.-++-.++. ..+.+|-.+.++.+.+....--
T Consensus 221 e~~tg~cv~t~~~h~ewvr~v~v~~DGti~As~s~d-------------qtl~vW~~~t~~~k~~lR~hEh--------- 278 (406)
T KOG0295|consen 221 ECDTGYCVKTFPGHSEWVRMVRVNQDGTIIASCSND-------------QTLRVWVVATKQCKAELREHEH--------- 278 (406)
T ss_pred ecccceeEEeccCchHhEEEEEecCCeeEEEecCCC-------------ceEEEEEeccchhhhhhhcccc---------
Confidence 77 77 66777766 778888899999866544333 3688888777755544322110
Q ss_pred ccCCCCccceecCCCceEEEEEeec-CCCCCccCCcc-ceEEeccCCCCCCCCc-eEeeeeccceeceeeccCCceEEEe
Q 004574 240 VREGMRSISWRADKPSTLYWVEAQD-RGDANVEVSPR-DIIYTQPAEPAEGEKP-EILHKLDLRFRSVSWCDDSLALVNE 316 (744)
Q Consensus 240 ~~~~~~~~~~spDg~~~l~~~~~~~-~~~~~~~~~~~-~~l~~~~~~~~~~~~~-~~l~~~~~~~~~~~~SpDg~~l~~~ 316 (744)
.+.-++|.|...+.=.+-+..+ ++.+......+ ..|-++++ ..+.- -.|...+..+..++|+|-|++|+..
T Consensus 279 ---~vEci~wap~~~~~~i~~at~~~~~~~~l~s~SrDktIk~wdv---~tg~cL~tL~ghdnwVr~~af~p~Gkyi~Sc 352 (406)
T KOG0295|consen 279 ---PVECIAWAPESSYPSISEATGSTNGGQVLGSGSRDKTIKIWDV---STGMCLFTLVGHDNWVRGVAFSPGGKYILSC 352 (406)
T ss_pred ---ceEEEEecccccCcchhhccCCCCCccEEEeecccceEEEEec---cCCeEEEEEecccceeeeeEEcCCCeEEEEE
Confidence 0223445444321000000000 00111111122 24666666 33421 1233345678899999999999987
Q ss_pred eeeeccceeEEEEcCCC
Q 004574 317 TWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 317 ~~~~~~~~~l~~~~~~~ 333 (744)
+.+ ..|.++|+..
T Consensus 353 aDD----ktlrvwdl~~ 365 (406)
T KOG0295|consen 353 ADD----KTLRVWDLKN 365 (406)
T ss_pred ecC----CcEEEEEecc
Confidence 722 2366666665
No 347
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=96.97 E-value=0.024 Score=56.18 Aligned_cols=139 Identities=13% Similarity=0.221 Sum_probs=86.1
Q ss_pred eEEEEEcC-CC----CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeee-ccC-CC
Q 004574 157 AQLVLGSL-DG----TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRE-LCD-LP 228 (744)
Q Consensus 157 ~~l~~~~~-~G----~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~-l~~-~~ 228 (744)
.+|+++.. +| ..++++.+ ..+..+.|||..+.++++..-. ..|.+||+..+..+. +.. ..
T Consensus 234 ~~I~lw~~~~g~W~vd~~Pf~gH~~SVEDLqWSptE~~vfaScS~D------------gsIrIWDiRs~~~~~~~~~kAh 301 (440)
T KOG0302|consen 234 KGIHLWEPSTGSWKVDQRPFTGHTKSVEDLQWSPTEDGVFASCSCD------------GSIRIWDIRSGPKKAAVSTKAH 301 (440)
T ss_pred cceEeeeeccCceeecCccccccccchhhhccCCccCceEEeeecC------------ceEEEEEecCCCccceeEeecc
Confidence 45666655 55 23444445 6777999999999999887664 368889987653222 211 11
Q ss_pred CCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEee-eeccceeceeec
Q 004574 229 PAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILH-KLDLRFRSVSWC 307 (744)
Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~-~~~~~~~~~~~S 307 (744)
..+ +.-+.|+-+-. +|++- + ..+.+-+++++.++.+++...+ .....+.++.|+
T Consensus 302 ~sD------------VNVISWnr~~~-lLasG-----~-------DdGt~~iwDLR~~~~~~pVA~fk~Hk~pItsieW~ 356 (440)
T KOG0302|consen 302 NSD------------VNVISWNRREP-LLASG-----G-------DDGTLSIWDLRQFKSGQPVATFKYHKAPITSIEWH 356 (440)
T ss_pred CCc------------eeeEEccCCcc-eeeec-----C-------CCceEEEEEhhhccCCCcceeEEeccCCeeEEEec
Confidence 111 45678876554 23332 1 1234777788777778776654 457789999999
Q ss_pred cCCceEEEeeeeeccceeEEEEcCCC
Q 004574 308 DDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 308 pDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
|....++.++ ..+.+..||-+.++.
T Consensus 357 p~e~s~iaas-g~D~QitiWDlsvE~ 381 (440)
T KOG0302|consen 357 PHEDSVIAAS-GEDNQITIWDLSVEA 381 (440)
T ss_pred cccCceEEec-cCCCcEEEEEeeccC
Confidence 9977766655 333555555555443
No 348
>KOG2931 consensus Differentiation-related gene 1 protein (NDR1 protein), related proteins [Function unknown]
Probab=96.91 E-value=0.069 Score=51.75 Aligned_cols=212 Identities=15% Similarity=0.099 Sum_probs=118.6
Q ss_pred eEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCC-chhHHHHHhCCeEEEecCCC----CCC
Q 004574 493 VPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMT-PTSSLIFLARRFAVLAGPSI----PII 567 (744)
Q Consensus 493 ~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~G~~v~~~~~~----~~~ 567 (744)
..++..++--++ +++|++|-.|.-|-... .+|.... ...+..++.+ |.++-.+.. +..
T Consensus 32 G~v~V~V~Gd~~------~~kpaiiTyhDlglN~~----------scFq~ff~~p~m~ei~~~-fcv~HV~~PGqe~gAp 94 (326)
T KOG2931|consen 32 GVVHVTVYGDPK------GNKPAIITYHDLGLNHK----------SCFQGFFNFPDMAEILEH-FCVYHVDAPGQEDGAP 94 (326)
T ss_pred ccEEEEEecCCC------CCCceEEEecccccchH----------hHhHHhhcCHhHHHHHhh-eEEEecCCCccccCCc
Confidence 357777775322 24788999997542211 1122222 2345566777 777652221 111
Q ss_pred CC--C-CCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCCCC-CCCc------
Q 004574 568 GE--G-DKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNKTL-TPFG------ 637 (744)
Q Consensus 568 g~--g-~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~~~-~~~~------ 637 (744)
.. + .-..++++.+.+..+...-.+ +.|.-+|--+|+++-.+.|..+|+++.++|++.+...... ..|.
T Consensus 95 ~~p~~y~yPsmd~LAd~l~~VL~~f~l--k~vIg~GvGAGAyIL~rFAl~hp~rV~GLvLIn~~~~a~gwiew~~~K~~s 172 (326)
T KOG2931|consen 95 SFPEGYPYPSMDDLADMLPEVLDHFGL--KSVIGMGVGAGAYILARFALNHPERVLGLVLINCDPCAKGWIEWAYNKVSS 172 (326)
T ss_pred cCCCCCCCCCHHHHHHHHHHHHHhcCc--ceEEEecccccHHHHHHHHhcChhheeEEEEEecCCCCchHHHHHHHHHHH
Confidence 11 1 113455777777777765333 5688899999999999999999999999999987533110 0000
Q ss_pred --------------------cccc--ccch---------h------hcHHHH-HhcCcccc----c----CCCCCCEEEE
Q 004574 638 --------------------FQTE--FRTL---------W------EATNVY-IEMSPITH----A----NKIKKPILII 671 (744)
Q Consensus 638 --------------------~~~~--~~~~---------~------~~~~~~-~~~~~~~~----~----~~~~~P~l~i 671 (744)
|..+ .... . .+...| ..+.-... . ..++||+|++
T Consensus 173 ~~l~~~Gmt~~~~d~ll~H~Fg~e~~~~~~diVq~Yr~~l~~~~N~~Nl~~fl~ayn~R~DL~~~r~~~~~tlkc~vllv 252 (326)
T KOG2931|consen 173 NLLYYYGMTQGVKDYLLAHHFGKEELGNNSDIVQEYRQHLGERLNPKNLALFLNAYNGRRDLSIERPKLGTTLKCPVLLV 252 (326)
T ss_pred HHHHhhchhhhHHHHHHHHHhccccccccHHHHHHHHHHHHhcCChhHHHHHHHHhcCCCCccccCCCcCccccccEEEE
Confidence 0000 0000 0 000001 11111111 1 1457999999
Q ss_pred eeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHH
Q 004574 672 HGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQ 730 (744)
Q Consensus 672 ~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~ 730 (744)
.|...+. .+.+.++..+|- .....++.+.+++-... .+.+....+.+.=|+.
T Consensus 253 vGd~Sp~----~~~vv~~n~~Ld--p~~ttllk~~d~g~l~~-e~qP~kl~ea~~~Flq 304 (326)
T KOG2931|consen 253 VGDNSPH----VSAVVECNSKLD--PTYTTLLKMADCGGLVQ-EEQPGKLAEAFKYFLQ 304 (326)
T ss_pred ecCCCch----hhhhhhhhcccC--cccceEEEEcccCCccc-ccCchHHHHHHHHHHc
Confidence 9999887 456666666553 34568888888866544 3455555555555554
No 349
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=96.88 E-value=0.47 Score=45.88 Aligned_cols=39 Identities=13% Similarity=0.141 Sum_probs=31.0
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCD 226 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~ 226 (744)
..-..++||||+..|+|.... ..|.++|+-|.+...|..
T Consensus 44 PQWRkl~WSpD~tlLa~a~S~-------------G~i~vfdl~g~~lf~I~p 82 (282)
T PF15492_consen 44 PQWRKLAWSPDCTLLAYAEST-------------GTIRVFDLMGSELFVIPP 82 (282)
T ss_pred chheEEEECCCCcEEEEEcCC-------------CeEEEEecccceeEEcCc
Confidence 345578999999999998555 379999999988777654
No 350
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=96.88 E-value=0.49 Score=45.95 Aligned_cols=123 Identities=18% Similarity=0.313 Sum_probs=75.5
Q ss_pred eeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCce--eccccCCCccccccccceEEecCCc
Q 004574 22 KEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA--KPLFESPDICLNAVFGSFVWVNNST 99 (744)
Q Consensus 22 ~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~--~~lt~~~~~~~~~~~~~~~wspDg~ 99 (744)
.++..-|+. .++...|||.-+.++.++. .++..++|-+.-. |.. +..-... +.+....||.||.
T Consensus 20 ~ev~~pP~D-sIS~l~FSP~~~~~~~A~S-------WD~tVR~wevq~~-g~~~~ka~~~~~-----~PvL~v~Wsddgs 85 (347)
T KOG0647|consen 20 YEVPNPPED-SISALAFSPQADNLLAAGS-------WDGTVRIWEVQNS-GQLVPKAQQSHD-----GPVLDVCWSDDGS 85 (347)
T ss_pred eecCCCccc-chheeEeccccCceEEecc-------cCCceEEEEEecC-CcccchhhhccC-----CCeEEEEEccCCc
Confidence 455544444 6899999995544443221 2666778877644 332 2221122 2466789999997
Q ss_pred EEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-cee
Q 004574 100 LLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVY 177 (744)
Q Consensus 100 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~ 177 (744)
.++...-+ .++-++|+ +|+..++..+ +.+
T Consensus 86 kVf~g~~D-------------------------------------------------k~~k~wDL~S~Q~~~v~~Hd~pv 116 (347)
T KOG0647|consen 86 KVFSGGCD-------------------------------------------------KQAKLWDLASGQVSQVAAHDAPV 116 (347)
T ss_pred eEEeeccC-------------------------------------------------CceEEEEccCCCeeeeeecccce
Confidence 77654221 46777899 8888888877 777
Q ss_pred eeeccCCCCce-EEEEEeeCCcccccccCCCcceEEEEeCCCC
Q 004574 178 TAVEPSPDQKY-VLITSMHRPYSYKVPCARFSQKVQVWTTDGK 219 (744)
Q Consensus 178 ~~~~~SpDG~~-i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~ 219 (744)
....|-+-..+ ++.+ |++...|-.||....
T Consensus 117 kt~~wv~~~~~~cl~T------------GSWDKTlKfWD~R~~ 147 (347)
T KOG0647|consen 117 KTCHWVPGMNYQCLVT------------GSWDKTLKFWDTRSS 147 (347)
T ss_pred eEEEEecCCCcceeEe------------cccccceeecccCCC
Confidence 78888765551 2222 233456777886644
No 351
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.86 E-value=0.053 Score=50.35 Aligned_cols=200 Identities=17% Similarity=0.211 Sum_probs=117.8
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-eeeeccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-LVRELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
+.+....|.-||++.+--..+ ..+.+|++..+ .++....+ +.+ +.+.+.+.|.
T Consensus 18 gaV~avryN~dGnY~ltcGsd-------------rtvrLWNp~rg~liktYsgh-G~E------------VlD~~~s~Dn 71 (307)
T KOG0316|consen 18 GAVRAVRYNVDGNYCLTCGSD-------------RTVRLWNPLRGALIKTYSGH-GHE------------VLDAALSSDN 71 (307)
T ss_pred cceEEEEEccCCCEEEEcCCC-------------ceEEeecccccceeeeecCC-Cce------------eeeccccccc
Confidence 667788899999987655333 37889997644 44443322 221 5667777787
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCc-eEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCC
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKP-EILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPG 332 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~ 332 (744)
.+ ++.. +++. .++++|+ .+|+. +++-...+.++.+.|..+...++..+.+ ...++|-+--.
T Consensus 72 sk-f~s~----GgDk--------~v~vwDV---~TGkv~Rr~rgH~aqVNtV~fNeesSVv~SgsfD--~s~r~wDCRS~ 133 (307)
T KOG0316|consen 72 SK-FASC----GGDK--------AVQVWDV---NTGKVDRRFRGHLAQVNTVRFNEESSVVASGSFD--SSVRLWDCRSR 133 (307)
T ss_pred cc-cccC----CCCc--------eEEEEEc---ccCeeeeecccccceeeEEEecCcceEEEecccc--ceeEEEEcccC
Confidence 76 3322 2222 4888888 55554 4455567788999998887766655422 23345544433
Q ss_pred CCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEee
Q 004574 333 SKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWE 412 (744)
Q Consensus 333 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~ 412 (744)
. .++.++.+.....+.+ -.-.+..|+..+. + ..+..+|+..|... .
T Consensus 134 s--~ePiQildea~D~V~S--------i~v~~heIvaGS~-------------D--------GtvRtydiR~G~l~---s 179 (307)
T KOG0316|consen 134 S--FEPIQILDEAKDGVSS--------IDVAEHEIVAGSV-------------D--------GTVRTYDIRKGTLS---S 179 (307)
T ss_pred C--CCccchhhhhcCceeE--------EEecccEEEeecc-------------C--------CcEEEEEeecceee---h
Confidence 3 6777776655433211 1223333433321 1 13778888766542 1
Q ss_pred ccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceee
Q 004574 413 SNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQ 466 (744)
Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~ 466 (744)
. |+... ..-.++|+|++-.+....+ ..|.++|-++|+..+
T Consensus 180 D----y~g~p------it~vs~s~d~nc~La~~l~----stlrLlDk~tGklL~ 219 (307)
T KOG0316|consen 180 D----YFGHP------ITSVSFSKDGNCSLASSLD----STLRLLDKETGKLLK 219 (307)
T ss_pred h----hcCCc------ceeEEecCCCCEEEEeecc----ceeeecccchhHHHH
Confidence 1 22221 1225899999987766543 569999988887654
No 352
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.85 E-value=0.036 Score=52.05 Aligned_cols=206 Identities=11% Similarity=0.078 Sum_probs=116.5
Q ss_pred cceeecCC-CCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 34 NFVSWSPD-GKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 34 ~~p~~SpD-G~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
....|||= -.+||.+... --.=-+++.|+++++++++..+.... -+.+.+.-+++||+.-+.++++... +
T Consensus 12 ysvqfSPf~~nrLavAt~q---~yGl~G~G~L~ile~~~~~gi~e~~s--~d~~D~LfdV~Wse~~e~~~~~a~G---D- 82 (311)
T KOG0277|consen 12 YSVQFSPFVENRLAVATAQ---HYGLAGNGRLFILEVTDPKGIQECQS--YDTEDGLFDVAWSENHENQVIAASG---D- 82 (311)
T ss_pred ceeEecccccchhheeehh---hcccccCceEEEEecCCCCCeEEEEe--eecccceeEeeecCCCcceEEEEec---C-
Confidence 45566662 1245554320 01114578999999974433332211 1112245688999987776665321 1
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCC---CCeeecCCC-ceeeeeccCCCCce
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLD---GTAKDFGTP-AVYTAVEPSPDQKY 188 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~---G~~~~l~~~-~~~~~~~~SpDG~~ 188 (744)
+.|-++|++ +-+..+.++ .++.+..|++--++
T Consensus 83 --------------------------------------------GSLrl~d~~~~s~Pi~~~kEH~~EV~Svdwn~~~r~ 118 (311)
T KOG0277|consen 83 --------------------------------------------GSLRLFDLTMPSKPIHKFKEHKREVYSVDWNTVRRR 118 (311)
T ss_pred --------------------------------------------ceEEEeccCCCCcchhHHHhhhhheEEeccccccce
Confidence 344444542 233333455 78999999999999
Q ss_pred EEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCC
Q 004574 189 VLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDA 268 (744)
Q Consensus 189 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~ 268 (744)
+++++.=+ ..|-+|+.+-.+..+-..+... -+....|||.-..+++.++
T Consensus 119 ~~ltsSWD------------~TiKLW~~~r~~Sv~Tf~gh~~------------~Iy~a~~sp~~~nlfas~S------- 167 (311)
T KOG0277|consen 119 IFLTSSWD------------GTIKLWDPNRPNSVQTFNGHNS------------CIYQAAFSPHIPNLFASAS------- 167 (311)
T ss_pred eEEeeccC------------CceEeecCCCCcceEeecCCcc------------EEEEEecCCCCCCeEEEcc-------
Confidence 98886322 3677777664443222222111 1456789998877344332
Q ss_pred CccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 269 NVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 269 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
..+.+-++|++ ..|+...+.-.+..+-...|+.-...++++.. ...-|+.+|+..
T Consensus 168 -----gd~~l~lwdvr--~~gk~~~i~ah~~Eil~cdw~ky~~~vl~Tg~---vd~~vr~wDir~ 222 (311)
T KOG0277|consen 168 -----GDGTLRLWDVR--SPGKFMSIEAHNSEILCCDWSKYNHNVLATGG---VDNLVRGWDIRN 222 (311)
T ss_pred -----CCceEEEEEec--CCCceeEEEeccceeEeecccccCCcEEEecC---CCceEEEEehhh
Confidence 12234455553 23555545555667778889998888887762 223577777665
No 353
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=96.83 E-value=0.041 Score=56.46 Aligned_cols=136 Identities=13% Similarity=0.146 Sum_probs=90.2
Q ss_pred eEEEEEcC-CC-CeeecCCC-c-eeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCC
Q 004574 157 AQLVLGSL-DG-TAKDFGTP-A-VYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAED 232 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l~~~-~-~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~ 232 (744)
+.|.+..+ ++ +.+.++.+ + .+.-+.|||-.+.++.+...+ ..+.+||..|...+--... .
T Consensus 143 Gdiiih~~~t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~------------G~VtlwDv~g~sp~~~~~~--~-- 206 (673)
T KOG4378|consen 143 GDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDK------------GAVTLWDVQGMSPIFHASE--A-- 206 (673)
T ss_pred CcEEEEecccCccccceecCCCCeEEEeecccccceeeEeeccC------------CeEEEEeccCCCcccchhh--h--
Confidence 57777777 66 66677666 3 344778999999888876654 4788999988743321111 0
Q ss_pred CCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCce
Q 004574 233 IPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLA 312 (744)
Q Consensus 233 ~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~ 312 (744)
.....+++.|||....+|+-+ .+..+|+++|.. .-.....|+ .......++|+++|.+
T Consensus 207 -------HsAP~~gicfspsne~l~vsV------------G~Dkki~~yD~~--s~~s~~~l~-y~~Plstvaf~~~G~~ 264 (673)
T KOG4378|consen 207 -------HSAPCRGICFSPSNEALLVSV------------GYDKKINIYDIR--SQASTDRLT-YSHPLSTVAFSECGTY 264 (673)
T ss_pred -------ccCCcCcceecCCccceEEEe------------cccceEEEeecc--cccccceee-ecCCcceeeecCCceE
Confidence 111266889999988644333 133468998872 111222333 4667788999999999
Q ss_pred EEEeeeeeccceeEEEEcCCCC
Q 004574 313 LVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 313 l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
|+... ...+|+.+|+-+.
T Consensus 265 L~aG~----s~G~~i~YD~R~~ 282 (673)
T KOG4378|consen 265 LCAGN----SKGELIAYDMRST 282 (673)
T ss_pred EEeec----CCceEEEEecccC
Confidence 98765 4568999998875
No 354
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=96.78 E-value=0.86 Score=49.15 Aligned_cols=113 Identities=11% Similarity=0.087 Sum_probs=66.0
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC-eeeeccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK-LVRELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
+.+.+++|+ +|.+|+-. ..+ ..|-.||+... +.+.+....+. ++.++.+|.+
T Consensus 70 rsIE~L~W~-e~~RLFS~-g~s------------g~i~EwDl~~lk~~~~~d~~gg~-------------IWsiai~p~~ 122 (691)
T KOG2048|consen 70 RSIESLAWA-EGGRLFSS-GLS------------GSITEWDLHTLKQKYNIDSNGGA-------------IWSIAINPEN 122 (691)
T ss_pred CceeeEEEc-cCCeEEee-cCC------------ceEEEEecccCceeEEecCCCcc-------------eeEEEeCCcc
Confidence 678899999 55566543 322 37888887644 33333322222 6777778877
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCc--eEe-eeeccceeceeeccCCceEEEeeeeeccceeEEEEc
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKP--EIL-HKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVC 330 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~--~~l-~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~ 330 (744)
+. +. +.-. .+.++.+.. ..+.. ..+ .....++-+++|.|+|.+|+..+ ..+.|.++|
T Consensus 123 ~~-l~-Igcd-----------dGvl~~~s~---~p~~I~~~r~l~rq~sRvLslsw~~~~~~i~~Gs----~Dg~Iriwd 182 (691)
T KOG2048|consen 123 TI-LA-IGCD-----------DGVLYDFSI---GPDKITYKRSLMRQKSRVLSLSWNPTGTKIAGGS----IDGVIRIWD 182 (691)
T ss_pred ce-EE-eecC-----------CceEEEEec---CCceEEEEeecccccceEEEEEecCCccEEEecc----cCceEEEEE
Confidence 65 22 2111 123555554 22221 112 22246788999999999999876 334577778
Q ss_pred CCCC
Q 004574 331 PGSK 334 (744)
Q Consensus 331 ~~~~ 334 (744)
+..+
T Consensus 183 ~~~~ 186 (691)
T KOG2048|consen 183 VKSG 186 (691)
T ss_pred cCCC
Confidence 7764
No 355
>COG4782 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=96.78 E-value=0.0058 Score=61.01 Aligned_cols=102 Identities=16% Similarity=0.153 Sum_probs=63.9
Q ss_pred ceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCe----EEEe-cCCCCCCCCCCC-----ChHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRF----AVLA-GPSIPIIGEGDK-----LPNDSAEAAV 582 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~----~v~~-~~~~~~~g~g~~-----~~~~d~~~~~ 582 (744)
.-++||+||.. +.|.......++.....|+ ++++ ++.-...+|... ...++++..+
T Consensus 116 k~vlvFvHGfN--------------ntf~dav~R~aqI~~d~g~~~~pVvFSWPS~g~l~~Yn~DreS~~~Sr~aLe~~l 181 (377)
T COG4782 116 KTVLVFVHGFN--------------NTFEDAVYRTAQIVHDSGNDGVPVVFSWPSRGSLLGYNYDRESTNYSRPALERLL 181 (377)
T ss_pred CeEEEEEcccC--------------CchhHHHHHHHHHHhhcCCCcceEEEEcCCCCeeeecccchhhhhhhHHHHHHHH
Confidence 57999999975 3333323334444444554 3344 222222233221 2334888999
Q ss_pred HHHHHcCCCCCCcEEEEEechHHHHHHHHHHh----C----CCceeEEEEccCCCC
Q 004574 583 EEVVRRGVADPSRIAVGGHSYGAFMTAHLLAH----A----PHLFCCGIARSGSYN 630 (744)
Q Consensus 583 ~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~----~----p~~~~~~v~~~~~~~ 630 (744)
.+|.+...+ ++|.|++||||.++++.++-+ . +..++-+|+.+|-.|
T Consensus 182 r~La~~~~~--~~I~ilAHSMGtwl~~e~LrQLai~~~~~l~~ki~nViLAaPDiD 235 (377)
T COG4782 182 RYLATDKPV--KRIYLLAHSMGTWLLMEALRQLAIRADRPLPAKIKNVILAAPDID 235 (377)
T ss_pred HHHHhCCCC--ceEEEEEecchHHHHHHHHHHHhccCCcchhhhhhheEeeCCCCC
Confidence 999987654 689999999999999877644 1 335778888888654
No 356
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=96.73 E-value=0.4 Score=53.15 Aligned_cols=115 Identities=16% Similarity=0.114 Sum_probs=61.2
Q ss_pred eceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEE
Q 004574 302 RSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYI 381 (744)
Q Consensus 302 ~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~ 381 (744)
.-.++||.+++++... .++.-.+|+---..++...-++...+...+.. ++||+||.+|+...
T Consensus 209 t~~~~spn~~~~Aa~d--~dGrI~vw~d~~~~~~~~t~t~lHWH~~~V~~------L~fS~~G~~LlSGG---------- 270 (792)
T KOG1963|consen 209 TCVALSPNERYLAAGD--SDGRILVWRDFGSSDDSETCTLLHWHHDEVNS------LSFSSDGAYLLSGG---------- 270 (792)
T ss_pred eeEEeccccceEEEec--cCCcEEEEeccccccccccceEEEecccccce------eEEecCCceEeecc----------
Confidence 4578999999988754 23444444422101111222333322222222 67999999886653
Q ss_pred EEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCC
Q 004574 382 LLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPL 461 (744)
Q Consensus 382 ~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~ 461 (744)
...-|.+|-+.+++.+-|-.-+ ..-..+.+|||+.......++ .+|.++...+
T Consensus 271 -----------~E~VLv~Wq~~T~~kqfLPRLg------------s~I~~i~vS~ds~~~sl~~~D----NqI~li~~~d 323 (792)
T KOG1963|consen 271 -----------REGVLVLWQLETGKKQFLPRLG------------SPILHIVVSPDSDLYSLVLED----NQIHLIKASD 323 (792)
T ss_pred -----------cceEEEEEeecCCCcccccccC------------CeeEEEEEcCCCCeEEEEecC----ceEEEEeccc
Confidence 2234788888888743231111 111226888999776665542 3455555433
No 357
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=96.73 E-value=0.28 Score=50.07 Aligned_cols=154 Identities=10% Similarity=0.049 Sum_probs=88.1
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC--eeeeccCCCCCCCCCcccCCccCCCCccceecC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK--LVRELCDLPPAEDIPVCYNSVREGMRSISWRAD 252 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spD 252 (744)
..+...+|+|--+.++-....+ ..+.+||.... +....... ....+..++|+|-
T Consensus 228 ~~VeDV~~h~~h~~lF~sv~dd------------~~L~iwD~R~~~~~~~~~~~a------------h~~~vn~~~fnp~ 283 (422)
T KOG0264|consen 228 DVVEDVAWHPLHEDLFGSVGDD------------GKLMIWDTRSNTSKPSHSVKA------------HSAEVNCVAFNPF 283 (422)
T ss_pred cceehhhccccchhhheeecCC------------CeEEEEEcCCCCCCCcccccc------------cCCceeEEEeCCC
Confidence 4566888999888776655443 47888987742 11111111 1122667899998
Q ss_pred CCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCC
Q 004574 253 KPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPG 332 (744)
Q Consensus 253 g~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~ 332 (744)
+..+|+-.+ ..+.|.++|++.+. .....+......+..+.|||.-..++.++. .++ ++.++|+.
T Consensus 284 ~~~ilAT~S------------~D~tV~LwDlRnL~-~~lh~~e~H~dev~~V~WSPh~etvLASSg-~D~--rl~vWDls 347 (422)
T KOG0264|consen 284 NEFILATGS------------ADKTVALWDLRNLN-KPLHTFEGHEDEVFQVEWSPHNETVLASSG-TDR--RLNVWDLS 347 (422)
T ss_pred CCceEEecc------------CCCcEEEeechhcc-cCceeccCCCcceEEEEeCCCCCceeEecc-cCC--cEEEEecc
Confidence 876343332 22347777775432 233344455778899999999988877662 223 45555554
Q ss_pred CC--C--------CcceeeeccccccccCCCCCCceeeCCCCCeEEEEee
Q 004574 333 SK--D--------VAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIK 372 (744)
Q Consensus 333 ~~--~--------~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~ 372 (744)
.- + +.+..++.+.... .....++|.|..-+++....
T Consensus 348 ~ig~eq~~eda~dgppEllF~HgGH~----~kV~DfsWnp~ePW~I~Sva 393 (422)
T KOG0264|consen 348 RIGEEQSPEDAEDGPPELLFIHGGHT----AKVSDFSWNPNEPWTIASVA 393 (422)
T ss_pred ccccccChhhhccCCcceeEEecCcc----cccccccCCCCCCeEEEEec
Confidence 32 1 1112222222111 01233779999998888763
No 358
>KOG1516 consensus Carboxylesterase and related proteins [General function prediction only]
Probab=96.72 E-value=0.005 Score=69.05 Aligned_cols=121 Identities=25% Similarity=0.291 Sum_probs=75.5
Q ss_pred EEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEecCCC-CCCCC---C
Q 004574 495 LTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSI-PIIGE---G 570 (744)
Q Consensus 495 l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~-~~~g~---g 570 (744)
+..-+|.|.....+ + +||+|++|||++...... .+.. ......+..+..+|+...++ +..|+ |
T Consensus 97 LylNV~tp~~~~~~--~-~pV~V~iHGG~~~~gs~~--------~~~~--~~~~~~~~~~~VVvVt~~YRLG~lGF~st~ 163 (545)
T KOG1516|consen 97 LYLNVYTPQGCSES--K-LPVMVYIHGGGFQFGSAS--------SFEI--ISPAYVLLLKDVVVVTINYRLGPLGFLSTG 163 (545)
T ss_pred ceEEEeccCCCccC--C-CCEEEEEeCCceeecccc--------chhh--cCchhccccCCEEEEEecccceeceeeecC
Confidence 56677878763221 2 899999999986543321 1100 01122334556777764443 32331 1
Q ss_pred CC-----ChHHHHHHHHHHHHHc---CCCCCCcEEEEEechHHHHHHHHHHh--CCCceeEEEEccCC
Q 004574 571 DK-----LPNDSAEAAVEEVVRR---GVADPSRIAVGGHSYGAFMTAHLLAH--APHLFCCGIARSGS 628 (744)
Q Consensus 571 ~~-----~~~~d~~~~~~~l~~~---~~~d~~~i~l~G~S~GG~~a~~~~~~--~p~~~~~~v~~~~~ 628 (744)
.. ..+-|...+++|++++ =.-|+++|.|+|||+||..+..+... ...+|..+|.+++.
T Consensus 164 d~~~~gN~gl~Dq~~AL~wv~~~I~~FGGdp~~vTl~G~saGa~~v~~l~~Sp~s~~LF~~aI~~SG~ 231 (545)
T KOG1516|consen 164 DSAAPGNLGLFDQLLALRWVKDNIPSFGGDPKNVTLFGHSAGAASVSLLTLSPHSRGLFHKAISMSGN 231 (545)
T ss_pred CCCCCCcccHHHHHHHHHHHHHHHHhcCCCCCeEEEEeechhHHHHHHHhcCHhhHHHHHHHHhhccc
Confidence 11 1233899999999985 23689999999999999999766643 12468888888775
No 359
>PLN02733 phosphatidylcholine-sterol O-acyltransferase
Probab=96.71 E-value=0.0049 Score=65.46 Aligned_cols=79 Identities=13% Similarity=0.175 Sum_probs=54.2
Q ss_pred hhHHHHHhCCeEEEecCCCCCCCCCC--------CChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC
Q 004574 546 TSSLIFLARRFAVLAGPSIPIIGEGD--------KLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH 617 (744)
Q Consensus 546 ~~~~~~~~~G~~v~~~~~~~~~g~g~--------~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~ 617 (744)
..+..|.+.||.+-. +..|++. ....+++.+.++.+.+... .++|.|+||||||.++..++..+|+
T Consensus 112 ~li~~L~~~GY~~~~----dL~g~gYDwR~~~~~~~~~~~Lk~lIe~~~~~~g--~~kV~LVGHSMGGlva~~fl~~~p~ 185 (440)
T PLN02733 112 DMIEQLIKWGYKEGK----TLFGFGYDFRQSNRLPETMDGLKKKLETVYKASG--GKKVNIISHSMGGLLVKCFMSLHSD 185 (440)
T ss_pred HHHHHHHHcCCccCC----CcccCCCCccccccHHHHHHHHHHHHHHHHHHcC--CCCEEEEEECHhHHHHHHHHHHCCH
Confidence 456788899997521 2222211 1223467777777665432 3689999999999999999988876
Q ss_pred c----eeEEEEccCCCC
Q 004574 618 L----FCCGIARSGSYN 630 (744)
Q Consensus 618 ~----~~~~v~~~~~~~ 630 (744)
. ++.+|++++++.
T Consensus 186 ~~~k~I~~~I~la~P~~ 202 (440)
T PLN02733 186 VFEKYVNSWIAIAAPFQ 202 (440)
T ss_pred hHHhHhccEEEECCCCC
Confidence 3 688888887755
No 360
>PF01674 Lipase_2: Lipase (class 2); InterPro: IPR002918 Lipases or triacylglycerol acylhydrolases hydrolyse ester bonds in triacylglycerol giving diacylglycerol, monoacylglycerol, glycerol and free fatty acids []. This group of lipases has been called class 2 as they are not clearly related to other lipase families, and includes LipA and LipB from Bacillus subtilis [] and uncharacterised proteins from Caenorhabditis.; PDB: 2VTV_B 2X76_A 2X5X_A 2QXU_A 3QMM_A 1I6W_A 3D2C_J 2QXT_B 1R50_A 1T2N_A ....
Probab=96.69 E-value=0.003 Score=60.12 Aligned_cols=66 Identities=17% Similarity=0.145 Sum_probs=37.1
Q ss_pred hhHHHHHhCCeE---EEecCCCCCCCCCCC-------ChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHh
Q 004574 546 TSSLIFLARRFA---VLAGPSIPIIGEGDK-------LPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAH 614 (744)
Q Consensus 546 ~~~~~~~~~G~~---v~~~~~~~~~g~g~~-------~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~ 614 (744)
..+..|+++||. |++..+-........ +...++.++|+.+++.- .. ||-|+||||||.++-+++.-
T Consensus 20 ~~~~~l~~~GY~~~~vya~tyg~~~~~~~~~~~~~~~~~~~~l~~fI~~Vl~~T--Ga-kVDIVgHS~G~~iaR~yi~~ 95 (219)
T PF01674_consen 20 TLAPYLKAAGYCDSEVYALTYGSGNGSPSVQNAHMSCESAKQLRAFIDAVLAYT--GA-KVDIVGHSMGGTIARYYIKG 95 (219)
T ss_dssp HHHHHHHHTT--CCCEEEE--S-CCHHTHHHHHHB-HHHHHHHHHHHHHHHHHH--T---EEEEEETCHHHHHHHHHHH
T ss_pred HHHHHHHHcCCCcceeEeccCCCCCCCCcccccccchhhHHHHHHHHHHHHHhh--CC-EEEEEEcCCcCHHHHHHHHH
Confidence 567789999998 566332111110000 11125666777666542 34 89999999999999877753
No 361
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=96.68 E-value=0.089 Score=51.25 Aligned_cols=102 Identities=14% Similarity=0.286 Sum_probs=66.4
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeee----ccCCCCCCCCCcccCCccCCCCcccee
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRE----LCDLPPAEDIPVCYNSVREGMRSISWR 250 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~----l~~~~~~~~~~~~~~~~~~~~~~~~~s 250 (744)
..+...+|++|+..|++..+.. ++.+|...+.+... +..+... +..+-|+
T Consensus 11 ~pitchAwn~drt~iAv~~~~~-------------evhiy~~~~~~~w~~~htls~Hd~~-------------vtgvdWa 64 (361)
T KOG1523|consen 11 EPITCHAWNSDRTQIAVSPNNH-------------EVHIYSMLGADLWEPAHTLSEHDKI-------------VTGVDWA 64 (361)
T ss_pred CceeeeeecCCCceEEeccCCc-------------eEEEEEecCCCCceeceehhhhCcc-------------eeEEeec
Confidence 4566789999999999986654 67777777665222 2222222 4568999
Q ss_pred cCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC---ceEeeeeccceeceeeccCCceEEEee
Q 004574 251 ADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK---PEILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 251 pDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~---~~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
|-.++ |+-++...+ -.+|... ++++ .-.|...++....+.|||.+.+|+..+
T Consensus 65 p~snr-Ivtcs~drn----------ayVw~~~----~~~~WkptlvLlRiNrAAt~V~WsP~enkFAVgS 119 (361)
T KOG1523|consen 65 PKSNR-IVTCSHDRN----------AYVWTQP----SGGTWKPTLVLLRINRAATCVKWSPKENKFAVGS 119 (361)
T ss_pred CCCCc-eeEccCCCC----------ccccccC----CCCeeccceeEEEeccceeeEeecCcCceEEecc
Confidence 99988 777643211 1133322 2232 233566678888999999999998866
No 362
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=96.68 E-value=0.59 Score=53.99 Aligned_cols=167 Identities=13% Similarity=0.151 Sum_probs=95.2
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
+.+.++.|--|+..|.+..... +|-++|+.......+.... .|+..+.||||+.
T Consensus 69 ~~i~s~~fl~d~~~i~v~~~~G-------------~iilvd~et~~~eivg~vd-------------~GI~aaswS~Dee 122 (1265)
T KOG1920|consen 69 DEIVSVQFLADTNSICVITALG-------------DIILVDPETLELEIVGNVD-------------NGISAASWSPDEE 122 (1265)
T ss_pred cceEEEEEecccceEEEEecCC-------------cEEEEcccccceeeeeecc-------------CceEEEeecCCCc
Confidence 4677888888888888876553 5677777666544443322 3366789999998
Q ss_pred ceEEEEEeecCCCCCccCCccceEEeccC------------CCCC---------CCCceEeee-----------------
Q 004574 255 STLYWVEAQDRGDANVEVSPRDIIYTQPA------------EPAE---------GEKPEILHK----------------- 296 (744)
Q Consensus 255 ~~l~~~~~~~~~~~~~~~~~~~~l~~~~~------------~~~~---------~~~~~~l~~----------------- 296 (744)
. ++++..... |++..- ++.. |.+.+++-.
T Consensus 123 ~-l~liT~~~t------------ll~mT~~f~~i~E~~L~~d~~~~sk~v~VGwGrkeTqfrgs~gr~~~~~~~~~ek~~ 189 (1265)
T KOG1920|consen 123 L-LALITGRQT------------LLFMTKDFEPIAEKPLDADDERKSKFVNVGWGRKETQFRGSEGRQAARQKIEKEKAL 189 (1265)
T ss_pred E-EEEEeCCcE------------EEEEeccccchhccccccccccccccceecccccceeeecchhhhcccccccccccc
Confidence 7 776643211 111110 0000 001111110
Q ss_pred ----eccceeceeeccCCceEEEeeee-eccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 297 ----LDLRFRSVSWCDDSLALVNETWY-KTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 297 ----~~~~~~~~~~SpDg~~l~~~~~~-~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
-+.+-.+++|-.||++++.+.-. .++...|.++|-++. ...+...... -...++|-|.|..++...
T Consensus 190 ~~~~~~~~~~~IsWRgDg~~fAVs~~~~~~~~RkirV~drEg~----Lns~se~~~~-----l~~~LsWkPsgs~iA~iq 260 (1265)
T KOG1920|consen 190 EQIEQDDHKTSISWRGDGEYFAVSFVESETGTRKIRVYDREGA----LNSTSEPVEG-----LQHSLSWKPSGSLIAAIQ 260 (1265)
T ss_pred cchhhccCCceEEEccCCcEEEEEEEeccCCceeEEEecccch----hhcccCcccc-----cccceeecCCCCeEeeee
Confidence 01123458999999999986522 223366777776531 1222221111 233478999999998887
Q ss_pred eecCCcceEEEEccCCCCC
Q 004574 372 KKENDEQIYILLNGRGFTP 390 (744)
Q Consensus 372 ~~~~~~~~~~~~~~~g~~~ 390 (744)
...+.+ ++.++.++|..+
T Consensus 261 ~~~sd~-~IvffErNGL~h 278 (1265)
T KOG1920|consen 261 CKTSDS-DIVFFERNGLRH 278 (1265)
T ss_pred ecCCCC-cEEEEecCCccc
Confidence 665444 677778888644
No 363
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=96.66 E-value=0.53 Score=47.14 Aligned_cols=248 Identities=16% Similarity=0.130 Sum_probs=134.2
Q ss_pred eeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEe-cCCcE
Q 004574 22 KEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWV-NNSTL 100 (744)
Q Consensus 22 ~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~ws-pDg~~ 100 (744)
..+..+.+..+++....+ +++|.-.++ .+.+.+++.+|...+++..+.. -+...+|- +|...
T Consensus 97 ~pl~~~~hdDWVSsv~~~--~~~IltgsY----------Dg~~riWd~~Gk~~~~~~Ght~-----~ik~v~~v~~n~~~ 159 (423)
T KOG0313|consen 97 KPLQCFLHDDWVSSVKGA--SKWILTGSY----------DGTSRIWDLKGKSIKTIVGHTG-----PIKSVAWVIKNSSS 159 (423)
T ss_pred Cccccccchhhhhhhccc--CceEEEeec----------CCeeEEEecCCceEEEEecCCc-----ceeeeEEEecCCcc
Confidence 344434445466666666 678877543 3455556888988888876655 24455664 44332
Q ss_pred EEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC--CeeecCCC--c
Q 004574 101 LIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG--TAKDFGTP--A 175 (744)
Q Consensus 101 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G--~~~~l~~~--~ 175 (744)
=.|++...+ ..-.||.++. +- +..+.-.+ +
T Consensus 160 ~~fvsas~D---------------------------------------------qtl~Lw~~~~~~~~~~~~~~~~GHk~ 194 (423)
T KOG0313|consen 160 CLFVSASMD---------------------------------------------QTLRLWKWNVGENKVKALKVCRGHKR 194 (423)
T ss_pred ceEEEecCC---------------------------------------------ceEEEEEecCchhhhhHHhHhccccc
Confidence 122211100 1146788877 33 22222112 6
Q ss_pred eeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeec---cC-------CC--CCCCCCc-ccCCccC
Q 004574 176 VYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVREL---CD-------LP--PAEDIPV-CYNSVRE 242 (744)
Q Consensus 176 ~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l---~~-------~~--~~~~~~~-~~~~~~~ 242 (744)
.+..++-.+||.++.-.+-+. .|-+|+........+ .. .. ...+.|. .+.....
T Consensus 195 ~V~sVsv~~sgtr~~SgS~D~-------------~lkiWs~~~~~~~~~E~~s~~rrk~~~~~~~~~~r~P~vtl~GHt~ 261 (423)
T KOG0313|consen 195 SVDSVSVDSSGTRFCSGSWDT-------------MLKIWSVETDEEDELESSSNRRRKKQKREKEGGTRTPLVTLEGHTE 261 (423)
T ss_pred ceeEEEecCCCCeEEeecccc-------------eeeecccCCCccccccccchhhhhhhhhhhcccccCceEEeccccc
Confidence 778888899999876554332 455555211111111 00 00 0001110 1122222
Q ss_pred CCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeecc
Q 004574 243 GMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 243 ~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
.+..+.|++.+ .+|.... + -.|-++|++ .++....++ +......++.+|..+.|+..+.+
T Consensus 262 ~Vs~V~w~d~~---v~yS~Sw---D--------HTIk~WDle--tg~~~~~~~-~~ksl~~i~~~~~~~Ll~~gssd--- 321 (423)
T KOG0313|consen 262 PVSSVVWSDAT---VIYSVSW---D--------HTIKVWDLE--TGGLKSTLT-TNKSLNCISYSPLSKLLASGSSD--- 321 (423)
T ss_pred ceeeEEEcCCC---ceEeecc---c--------ceEEEEEee--cccceeeee-cCcceeEeecccccceeeecCCC---
Confidence 36678888744 3444211 1 137888883 344444444 57778889999998888877633
Q ss_pred ceeEEEEcCCCCCCcceee-eccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 323 QTRTWLVCPGSKDVAPRVL-FDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 323 ~~~l~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.+|.++|...+.++.... .-++.+.+.. +.|+|-..+++...
T Consensus 322 -r~irl~DPR~~~gs~v~~s~~gH~nwVss------vkwsp~~~~~~~S~ 364 (423)
T KOG0313|consen 322 -RHIRLWDPRTGDGSVVSQSLIGHKNWVSS------VKWSPTNEFQLVSG 364 (423)
T ss_pred -CceeecCCCCCCCceeEEeeecchhhhhh------eecCCCCceEEEEE
Confidence 367888887765554333 3333333322 66999999887775
No 364
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=96.65 E-value=0.14 Score=50.93 Aligned_cols=135 Identities=13% Similarity=0.135 Sum_probs=81.5
Q ss_pred eEEEEEcCCC-Ce----eec-CCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCC
Q 004574 157 AQLVLGSLDG-TA----KDF-GTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPP 229 (744)
Q Consensus 157 ~~l~~~~~~G-~~----~~l-~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~ 229 (744)
.+||++|+.- +. ... .++.+...++.++.+.+|+|-.... ..++.+||...-+ ...+.-+.+
T Consensus 106 e~IyIydI~~MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t-----------~GdV~l~d~~nl~~v~~I~aH~~ 174 (391)
T KOG2110|consen 106 ESIYIYDIKDMKLLHTIETTPPNPKGLCALSPNNANCYLAYPGSTT-----------SGDVVLFDTINLQPVNTINAHKG 174 (391)
T ss_pred ccEEEEecccceeehhhhccCCCccceEeeccCCCCceEEecCCCC-----------CceEEEEEcccceeeeEEEecCC
Confidence 5799999944 22 111 1225677888888888999985543 3589999876432 222222222
Q ss_pred CCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEeeee--ccceeceee
Q 004574 230 AEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEILHKL--DLRFRSVSW 306 (744)
Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~~~--~~~~~~~~~ 306 (744)
. +..++|+|||.. |+-.+.. + ..|.+..+ ..|+ ..++-.+ .-.+.+++|
T Consensus 175 ~-------------lAalafs~~G~l-lATASeK--G---------TVIRVf~v---~~G~kl~eFRRG~~~~~IySL~F 226 (391)
T KOG2110|consen 175 P-------------LAALAFSPDGTL-LATASEK--G---------TVIRVFSV---PEGQKLYEFRRGTYPVSIYSLSF 226 (391)
T ss_pred c-------------eeEEEECCCCCE-EEEeccC--c---------eEEEEEEc---CCccEeeeeeCCceeeEEEEEEE
Confidence 2 667899999986 6665421 1 12455554 2232 2222111 346788999
Q ss_pred ccCCceEEEeeeeeccceeEEEEcCC
Q 004574 307 CDDSLALVNETWYKTSQTRTWLVCPG 332 (744)
Q Consensus 307 SpDg~~l~~~~~~~~~~~~l~~~~~~ 332 (744)
|||++.|..+++ +..-|++.++..
T Consensus 227 s~ds~~L~~sS~--TeTVHiFKL~~~ 250 (391)
T KOG2110|consen 227 SPDSQFLAASSN--TETVHIFKLEKV 250 (391)
T ss_pred CCCCCeEEEecC--CCeEEEEEeccc
Confidence 999998887663 345577776643
No 365
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=96.62 E-value=0.32 Score=56.04 Aligned_cols=56 Identities=18% Similarity=0.200 Sum_probs=36.0
Q ss_pred ceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEe
Q 004574 35 FVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 35 ~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~ 105 (744)
+..+--|+..|.++ ...++|-+++.++.....+..-.. ++....||||++.+++.+
T Consensus 73 s~~fl~d~~~i~v~----------~~~G~iilvd~et~~~eivg~vd~-----GI~aaswS~Dee~l~liT 128 (1265)
T KOG1920|consen 73 SVQFLADTNSICVI----------TALGDIILVDPETLELEIVGNVDN-----GISAASWSPDEELLALIT 128 (1265)
T ss_pred EEEEecccceEEEE----------ecCCcEEEEcccccceeeeeeccC-----ceEEEeecCCCcEEEEEe
Confidence 34444455555554 334566677766666555544333 678899999999999874
No 366
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.61 E-value=0.044 Score=51.49 Aligned_cols=265 Identities=14% Similarity=0.096 Sum_probs=134.6
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC 84 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~ 84 (744)
..|++.++.+ + +-.+++..+.........+||+.-..+++++. +++.-+|| | .+..+.+|....++
T Consensus 38 G~L~ile~~~--~-~gi~e~~s~d~~D~LfdV~Wse~~e~~~~~a~-------GDGSLrl~--d-~~~~s~Pi~~~kEH- 103 (311)
T KOG0277|consen 38 GRLFILEVTD--P-KGIQECQSYDTEDGLFDVAWSENHENQVIAAS-------GDGSLRLF--D-LTMPSKPIHKFKEH- 103 (311)
T ss_pred ceEEEEecCC--C-CCeEEEEeeecccceeEeeecCCCcceEEEEe-------cCceEEEe--c-cCCCCcchhHHHhh-
Confidence 3577888753 1 22234433333335889999999999999763 24445555 5 34444555322221
Q ss_pred ccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhcccee-eee--EEEE
Q 004574 85 LNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYY-TTA--QLVL 161 (744)
Q Consensus 85 ~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~l~~ 161 (744)
..++-++.|++--+..+.++. -......+.-.-....++.. +. ....|+..+++. .+..|... +.+ .||-
T Consensus 104 -~~EV~Svdwn~~~r~~~ltsS--WD~TiKLW~~~r~~Sv~Tf~-gh--~~~Iy~a~~sp~-~~nlfas~Sgd~~l~lwd 176 (311)
T KOG0277|consen 104 -KREVYSVDWNTVRRRIFLTSS--WDGTIKLWDPNRPNSVQTFN-GH--NSCIYQAAFSPH-IPNLFASASGDGTLRLWD 176 (311)
T ss_pred -hhheEEeccccccceeEEeec--cCCceEeecCCCCcceEeec-CC--ccEEEEEecCCC-CCCeEEEccCCceEEEEE
Confidence 114667889986666555541 22222222211111122211 11 123344443332 23333322 223 4554
Q ss_pred EcCCCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCc
Q 004574 162 GSLDGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSV 240 (744)
Q Consensus 162 ~~~~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~ 240 (744)
++..|+..-+..+ .++....|+.=...|+++...+ ..|+.||+..-+ ..+....+ .
T Consensus 177 vr~~gk~~~i~ah~~Eil~cdw~ky~~~vl~Tg~vd------------~~vr~wDir~~r-~pl~eL~g----------h 233 (311)
T KOG0277|consen 177 VRSPGKFMSIEAHNSEILCCDWSKYNHNVLATGGVD------------NLVRGWDIRNLR-TPLFELNG----------H 233 (311)
T ss_pred ecCCCceeEEEeccceeEeecccccCCcEEEecCCC------------ceEEEEehhhcc-ccceeecC----------C
Confidence 4447766556555 5788889999999999997664 368888876432 12222211 1
Q ss_pred cCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccC-CceEEEeeee
Q 004574 241 REGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDD-SLALVNETWY 319 (744)
Q Consensus 241 ~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpD-g~~l~~~~~~ 319 (744)
..-++.+.|||....+|+.++. ++..+||-... ..+-.+..-....-+..+.||+- +.+++-..|+
T Consensus 234 ~~AVRkvk~Sph~~~lLaSasY----------DmT~riw~~~~---~ds~~e~~~~HtEFv~g~Dws~~~~~~vAs~gWD 300 (311)
T KOG0277|consen 234 GLAVRKVKFSPHHASLLASASY----------DMTVRIWDPER---QDSAIETVDHHTEFVCGLDWSLFDPGQVASTGWD 300 (311)
T ss_pred ceEEEEEecCcchhhHhhhccc----------cceEEeccccc---chhhhhhhhccceEEeccccccccCceeeecccc
Confidence 1127788999998774443321 12334444332 11111111112233455677754 5566665554
Q ss_pred eccceeEEEEc
Q 004574 320 KTSQTRTWLVC 330 (744)
Q Consensus 320 ~~~~~~l~~~~ 330 (744)
. .+|+++
T Consensus 301 e----~l~Vw~ 307 (311)
T KOG0277|consen 301 E----LLYVWN 307 (311)
T ss_pred c----ceeeec
Confidence 4 366665
No 367
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.61 E-value=0.31 Score=45.82 Aligned_cols=70 Identities=11% Similarity=0.117 Sum_probs=39.3
Q ss_pred eeeeccceeceeeccCC---ceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEE
Q 004574 294 LHKLDLRFRSVSWCDDS---LALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAK 370 (744)
Q Consensus 294 l~~~~~~~~~~~~SpDg---~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~ 370 (744)
|-....-++.++|+|.- +..+.+. ..++.--||..+.+...-+.+.+.+ +.+ ..-.++||+-|..|+..
T Consensus 203 l~~H~dwVRDVAwaP~~gl~~s~iAS~-SqDg~viIwt~~~e~e~wk~tll~~--f~~-----~~w~vSWS~sGn~LaVs 274 (299)
T KOG1332|consen 203 LEGHKDWVRDVAWAPSVGLPKSTIASC-SQDGTVIIWTKDEEYEPWKKTLLEE--FPD-----VVWRVSWSLSGNILAVS 274 (299)
T ss_pred hhhcchhhhhhhhccccCCCceeeEEe-cCCCcEEEEEecCccCccccccccc--CCc-----ceEEEEEeccccEEEEe
Confidence 44445567889999983 2222222 3446667777775543223333322 111 11126799999998887
Q ss_pred e
Q 004574 371 I 371 (744)
Q Consensus 371 ~ 371 (744)
.
T Consensus 275 ~ 275 (299)
T KOG1332|consen 275 G 275 (299)
T ss_pred c
Confidence 4
No 368
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=96.59 E-value=0.048 Score=52.27 Aligned_cols=147 Identities=14% Similarity=0.179 Sum_probs=88.3
Q ss_pred CCccceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCC---eEEEeeecccccccCCCceeEEEEECCCCcee-cc
Q 004574 2 PFFTGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGK---RIAFSVRVDEEDNVSSCKLRVWIADAETGEAK-PL 77 (744)
Q Consensus 2 ~~~~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~---~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~-~l 77 (744)
+|-..|.+++... ....-.+.-...+..-+|||=.. .||.. ....+|.+.|+++|.-. -|
T Consensus 121 SFDhtlKVWDtnT------lQ~a~~F~me~~VYshamSp~a~sHcLiA~g----------tr~~~VrLCDi~SGs~sH~L 184 (397)
T KOG4283|consen 121 SFDHTLKVWDTNT------LQEAVDFKMEGKVYSHAMSPMAMSHCLIAAG----------TRDVQVRLCDIASGSFSHTL 184 (397)
T ss_pred cccceEEEeeccc------ceeeEEeecCceeehhhcChhhhcceEEEEe----------cCCCcEEEEeccCCcceeee
Confidence 4556788888744 22222223333588889999765 23332 34478888999988754 44
Q ss_pred ccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceee-e
Q 004574 78 FESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYT-T 156 (744)
Q Consensus 78 t~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 156 (744)
..+.. ++..+.|||-.++++++....+.- ..|+-.+ .
T Consensus 185 sGHr~-----~vlaV~Wsp~~e~vLatgsaDg~i-------------------------------------rlWDiRras 222 (397)
T KOG4283|consen 185 SGHRD-----GVLAVEWSPSSEWVLATGSADGAI-------------------------------------RLWDIRRAS 222 (397)
T ss_pred ccccC-----ceEEEEeccCceeEEEecCCCceE-------------------------------------EEEEeeccc
Confidence 33332 578899999999999874321111 1111111 1
Q ss_pred eEEEEEcC-CCCeeec----CCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCC
Q 004574 157 AQLVLGSL-DGTAKDF----GTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGK 219 (744)
Q Consensus 157 ~~l~~~~~-~G~~~~l----~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~ 219 (744)
+-+.++|. +++.-++ +.. +.+.+++|+.||.+++.....+ .+.+|+...+
T Consensus 223 gcf~~lD~hn~k~~p~~~~n~ah~gkvngla~tSd~~~l~~~gtd~-------------r~r~wn~~~G 278 (397)
T KOG4283|consen 223 GCFRVLDQHNTKRPPILKTNTAHYGKVNGLAWTSDARYLASCGTDD-------------RIRVWNMESG 278 (397)
T ss_pred ceeEEeecccCccCccccccccccceeeeeeecccchhhhhccCcc-------------ceEEeecccC
Confidence 44556666 5522222 122 7788999999999998765443 5777776544
No 369
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=96.59 E-value=0.049 Score=57.32 Aligned_cols=158 Identities=11% Similarity=0.109 Sum_probs=89.7
Q ss_pred cCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCcccee
Q 004574 171 FGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWR 250 (744)
Q Consensus 171 l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~s 250 (744)
|.....+..+.|..+|.+|+.+..... ...+.+..+.-...+ .+.. ...+.+..+.|.
T Consensus 518 I~~~k~i~~vtWHrkGDYlatV~~~~~----------~~~VliHQLSK~~sQ----~PF~--------kskG~vq~v~FH 575 (733)
T KOG0650|consen 518 IKHPKSIRQVTWHRKGDYLATVMPDSG----------NKSVLIHQLSKRKSQ----SPFR--------KSKGLVQRVKFH 575 (733)
T ss_pred EecCCccceeeeecCCceEEEeccCCC----------cceEEEEeccccccc----Cchh--------hcCCceeEEEec
Confidence 334467888999999999998865432 245666665433211 1110 111115567888
Q ss_pred cCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEE
Q 004574 251 ADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLV 329 (744)
Q Consensus 251 pDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~ 329 (744)
|-..+ |+.+. ...|.++++ ..++ .+.|..+...+++++.+|.|..|+..+.+ ..+.-+
T Consensus 576 Ps~p~-lfVaT-------------q~~vRiYdL---~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d----~k~~Wf 634 (733)
T KOG0650|consen 576 PSKPY-LFVAT-------------QRSVRIYDL---SKQELVKKLLTGSKWISSMSIHPNGDNLILGSYD----KKMCWF 634 (733)
T ss_pred CCCce-EEEEe-------------ccceEEEeh---hHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCC----CeeEEE
Confidence 88876 33332 112666666 2222 23455567788999999999999887732 356666
Q ss_pred cCCCCCCcceee--eccccccccCCCCCCcee-eCCCCCeEEEEe
Q 004574 330 CPGSKDVAPRVL--FDRVFENVYSDPGSPMMT-RTSTGTNVIAKI 371 (744)
Q Consensus 330 ~~~~~~~~~~~l--~~~~~~~~~~~~~~~~~~-~spdg~~l~~~~ 371 (744)
|++-+...-+.+ -......+.-++-.|.|+ -|+||..++|-.
T Consensus 635 DldlsskPyk~lr~H~~avr~Va~H~ryPLfas~sdDgtv~Vfhg 679 (733)
T KOG0650|consen 635 DLDLSSKPYKTLRLHEKAVRSVAFHKRYPLFASGSDDGTVIVFHG 679 (733)
T ss_pred EcccCcchhHHhhhhhhhhhhhhhccccceeeeecCCCcEEEEee
Confidence 766541222222 222233333333444444 677888888765
No 370
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=96.56 E-value=0.0089 Score=61.82 Aligned_cols=40 Identities=25% Similarity=0.271 Sum_probs=27.3
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDL 227 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~ 227 (744)
++..-.+||||||+|+....++ -+-+|....+++..-.+.
T Consensus 333 GGLLCvcWSPDGKyIvtGGEDD-------------LVtVwSf~erRVVARGqG 372 (636)
T KOG2394|consen 333 GGLLCVCWSPDGKYIVTGGEDD-------------LVTVWSFEERRVVARGQG 372 (636)
T ss_pred cceEEEEEcCCccEEEecCCcc-------------eEEEEEeccceEEEeccc
Confidence 6777899999999998664332 466777766655444333
No 371
>PF07082 DUF1350: Protein of unknown function (DUF1350); InterPro: IPR010765 This family consists of several hypothetical proteins from both cyanobacteria and plants. Members of this family are typically around 250 residues in length. The function of this family is unknown but the species distribution indicates that the family may be involved in photosynthesis.
Probab=96.53 E-value=0.052 Score=51.81 Aligned_cols=153 Identities=18% Similarity=0.092 Sum_probs=83.5
Q ss_pred hhHHHHHhCCeEEEecCCCCCCCCCCC--ChHHHHHHHHHHHHHcCCCCC--CcEEEEEechHHHHHHHHHHhCCCceeE
Q 004574 546 TSSLIFLARRFAVLAGPSIPIIGEGDK--LPNDSAEAAVEEVVRRGVADP--SRIAVGGHSYGAFMTAHLLAHAPHLFCC 621 (744)
Q Consensus 546 ~~~~~~~~~G~~v~~~~~~~~~g~g~~--~~~~d~~~~~~~l~~~~~~d~--~~i~l~G~S~GG~~a~~~~~~~p~~~~~ 621 (744)
.....|+++||+|++-++.....+... +.......+++.|.++...+. -.++=+|||+|+-+-+.+....+..-++
T Consensus 38 ~lLe~La~~Gy~ViAtPy~~tfDH~~~A~~~~~~f~~~~~~L~~~~~~~~~~lP~~~vGHSlGcklhlLi~s~~~~~r~g 117 (250)
T PF07082_consen 38 YLLERLADRGYAVIATPYVVTFDHQAIAREVWERFERCLRALQKRGGLDPAYLPVYGVGHSLGCKLHLLIGSLFDVERAG 117 (250)
T ss_pred HHHHHHHhCCcEEEEEecCCCCcHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCeeeeecccchHHHHHHhhhccCcccc
Confidence 456788899999999554332222111 122356666777776654443 2677899999999998888766443355
Q ss_pred EEEccCCCC--CCCCCCc------c-cccccchhhcHHHHHhcCcccccCCCCCCEEEEeeCCCCCCCCCHHHHHHHHHH
Q 004574 622 GIARSGSYN--KTLTPFG------F-QTEFRTLWEATNVYIEMSPITHANKIKKPILIIHGEVDDKVGLFPMQAERFFDA 692 (744)
Q Consensus 622 ~v~~~~~~~--~~~~~~~------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~ 692 (744)
-|+++---. ....++. . ......|-+......+.. .-...|+|-=.+|.+ +++..+.+.
T Consensus 118 niliSFNN~~a~~aIP~~~~l~~~l~~EF~PsP~ET~~li~~~Y-------~~~rnLLIkF~~D~i-----Dqt~~L~~~ 185 (250)
T PF07082_consen 118 NILISFNNFPADEAIPLLEQLAPALRLEFTPSPEETRRLIRESY-------QVRRNLLIKFNDDDI-----DQTDELEQI 185 (250)
T ss_pred eEEEecCChHHHhhCchHhhhccccccCccCCHHHHHHHHHHhc-------CCccceEEEecCCCc-----cchHHHHHH
Confidence 565553100 0000000 0 000111112111111111 123467787788864 677788888
Q ss_pred HHhCC-CcEEEEEeCCCCcc
Q 004574 693 LKGHG-ALSRLVLLPFEHHV 711 (744)
Q Consensus 693 l~~~~-~~~~~~~~~~~~H~ 711 (744)
|+.+. .-++....+| +|-
T Consensus 186 L~~r~~~~~~~~~L~G-~HL 204 (250)
T PF07082_consen 186 LQQRFPDMVSIQTLPG-NHL 204 (250)
T ss_pred HhhhccccceEEeCCC-CCC
Confidence 88653 3356777775 584
No 372
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=96.53 E-value=0.046 Score=54.00 Aligned_cols=121 Identities=13% Similarity=0.175 Sum_probs=72.3
Q ss_pred eEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCC
Q 004574 277 IIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSP 356 (744)
Q Consensus 277 ~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 356 (744)
.|+++|++ .+.+..-.....+.+.++|+|. .+.|.+.. ....||..|+-.- ..+..+..+....+.+
T Consensus 211 sIvLyD~R---~~~Pl~KVi~~mRTN~IswnPe--afnF~~a~--ED~nlY~~DmR~l-~~p~~v~~dhvsAV~d----- 277 (433)
T KOG0268|consen 211 SIVLYDLR---QASPLKKVILTMRTNTICWNPE--AFNFVAAN--EDHNLYTYDMRNL-SRPLNVHKDHVSAVMD----- 277 (433)
T ss_pred ceEEEecc---cCCccceeeeeccccceecCcc--ccceeecc--ccccceehhhhhh-cccchhhcccceeEEE-----
Confidence 37888873 3344333335778899999993 34444312 2246888887653 2333444433333222
Q ss_pred ceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceeccc
Q 004574 357 MMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINL 436 (744)
Q Consensus 357 ~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~ 436 (744)
+.+||-|+.++..+ ....|.++....+..+.+...... ..|.. ..||-
T Consensus 278 -VdfsptG~Efvsgs---------------------yDksIRIf~~~~~~SRdiYhtkRM---q~V~~-------Vk~S~ 325 (433)
T KOG0268|consen 278 -VDFSPTGQEFVSGS---------------------YDKSIRIFPVNHGHSRDIYHTKRM---QHVFC-------VKYSM 325 (433)
T ss_pred -eccCCCcchhcccc---------------------ccceEEEeecCCCcchhhhhHhhh---heeeE-------EEEec
Confidence 56899999987765 233478888877777767655432 22322 37888
Q ss_pred CCCEEE
Q 004574 437 NQLKIL 442 (744)
Q Consensus 437 d~~~~~ 442 (744)
|++.++
T Consensus 326 Dskyi~ 331 (433)
T KOG0268|consen 326 DSKYII 331 (433)
T ss_pred cccEEE
Confidence 886554
No 373
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=96.52 E-value=0.012 Score=62.34 Aligned_cols=37 Identities=19% Similarity=0.137 Sum_probs=30.0
Q ss_pred eEEEEEcC-CC-CeeecCCC-ceeeeeccCCCCceEEEEE
Q 004574 157 AQLVLGSL-DG-TAKDFGTP-AVYTAVEPSPDQKYVLITS 193 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~ 193 (744)
+.++++|. +| ..+.|..+ ..+.-.+||.||++.+-.+
T Consensus 33 ~rlliyD~ndG~llqtLKgHKDtVycVAys~dGkrFASG~ 72 (1081)
T KOG1538|consen 33 SRLLVYDTSDGTLLQPLKGHKDTVYCVAYAKDGKRFASGS 72 (1081)
T ss_pred CEEEEEeCCCcccccccccccceEEEEEEccCCceeccCC
Confidence 58999999 88 67777666 7788999999999876443
No 374
>PF08386 Abhydrolase_4: TAP-like protein; InterPro: IPR013595 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents a C-terminal domain associated with putative hydrolases and bacterial peptidases that belong to MEROPS peptidase family S33 (clan SC). They are related to a tripeptidyl aminopeptidase from Streptomyces lividans (Q54410 from SWISSPROT). A member of this family (Q6E3K7 from SWISSPROT) is thought to be involved in the C-terminal processing of propionicin F, a bacteriocidin characterised from Propionibacterium freudenreichii []. ; GO: 0008233 peptidase activity
Probab=96.48 E-value=0.0065 Score=50.50 Aligned_cols=60 Identities=28% Similarity=0.227 Sum_probs=48.9
Q ss_pred CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHH
Q 004574 665 KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQK 731 (744)
Q Consensus 665 ~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~ 731 (744)
..|+|++.++.|+..| +..++++.+.|.. .+++..++.+|+... ....-+.+.+.+||..
T Consensus 34 ~~piL~l~~~~Dp~TP--~~~a~~~~~~l~~----s~lvt~~g~gHg~~~-~~s~C~~~~v~~yl~~ 93 (103)
T PF08386_consen 34 APPILVLGGTHDPVTP--YEGARAMAARLPG----SRLVTVDGAGHGVYA-GGSPCVDKAVDDYLLD 93 (103)
T ss_pred CCCEEEEecCcCCCCc--HHHHHHHHHHCCC----ceEEEEeccCcceec-CCChHHHHHHHHHHHc
Confidence 5899999999999998 9999999888764 489999999999763 3344667777788863
No 375
>PF05990 DUF900: Alpha/beta hydrolase of unknown function (DUF900); InterPro: IPR010297 This domain is associated with proteins of unknown function, which are hydrolase-like.
Probab=96.46 E-value=0.014 Score=56.75 Aligned_cols=80 Identities=15% Similarity=0.150 Sum_probs=51.3
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC----C-----CceeEEEEccCCCCCCCCCCcccccccchhh
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA----P-----HLFCCGIARSGSYNKTLTPFGFQTEFRTLWE 647 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~----p-----~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 647 (744)
.+.+.+..|.+.. ...+|.|++||||+.+.+.++..- + ..|..+|+.+|-++... +.
T Consensus 78 ~l~~~L~~L~~~~--~~~~I~ilaHSMG~rv~~~aL~~l~~~~~~~~~~~~~~~viL~ApDid~d~----f~-------- 143 (233)
T PF05990_consen 78 ALARFLRDLARAP--GIKRIHILAHSMGNRVLLEALRQLASEGERPDVKARFDNVILAAPDIDNDV----FR-------- 143 (233)
T ss_pred HHHHHHHHHHhcc--CCceEEEEEeCchHHHHHHHHHHHHhcccchhhHhhhheEEEECCCCCHHH----HH--------
Confidence 5555566665552 247999999999999998876541 1 25677888888654211 00
Q ss_pred cHHHHHhcCcccccCCCCCCEEEEeeCCCCC
Q 004574 648 ATNVYIEMSPITHANKIKKPILIIHGEVDDK 678 (744)
Q Consensus 648 ~~~~~~~~~~~~~~~~~~~P~l~i~G~~D~~ 678 (744)
..+. .+.+...++.+.+..+|..
T Consensus 144 --~~~~------~~~~~~~~itvy~s~~D~A 166 (233)
T PF05990_consen 144 --SQLP------DLGSSARRITVYYSRNDRA 166 (233)
T ss_pred --HHHH------HHhhcCCCEEEEEcCCchH
Confidence 0000 2334457899999999976
No 376
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.46 E-value=0.35 Score=45.14 Aligned_cols=116 Identities=12% Similarity=0.207 Sum_probs=71.3
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRD 111 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~ 111 (744)
.+...+++-||+|..-... +..-.|| +...|...+-..+.+. ++...+-+-|...++.. +++
T Consensus 19 aV~avryN~dGnY~ltcGs--------drtvrLW--Np~rg~liktYsghG~----EVlD~~~s~Dnskf~s~----GgD 80 (307)
T KOG0316|consen 19 AVRAVRYNVDGNYCLTCGS--------DRTVRLW--NPLRGALIKTYSGHGH----EVLDAALSSDNSKFASC----GGD 80 (307)
T ss_pred ceEEEEEccCCCEEEEcCC--------CceEEee--cccccceeeeecCCCc----eeeeccccccccccccC----CCC
Confidence 4788999999998766321 3334444 5555543332222222 35566777787777643 111
Q ss_pred CCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC-ceeeeeccCCCCce
Q 004574 112 PPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP-AVYTAVEPSPDQKY 188 (744)
Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~-~~~~~~~~SpDG~~ 188 (744)
.+++++|+ +| ..+++..+ +.+..+.|..+..
T Consensus 81 ---------------------------------------------k~v~vwDV~TGkv~Rr~rgH~aqVNtV~fNeesS- 114 (307)
T KOG0316|consen 81 ---------------------------------------------KAVQVWDVNTGKVDRRFRGHLAQVNTVRFNEESS- 114 (307)
T ss_pred ---------------------------------------------ceEEEEEcccCeeeeecccccceeeEEEecCcce-
Confidence 57888899 88 66777666 7888899986655
Q ss_pred EEEEEeeCCcccccccCCCcceEEEEeCCCCeeee
Q 004574 189 VLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRE 223 (744)
Q Consensus 189 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~ 223 (744)
|++...- ...+.+||......++
T Consensus 115 Vv~Sgsf------------D~s~r~wDCRS~s~eP 137 (307)
T KOG0316|consen 115 VVASGSF------------DSSVRLWDCRSRSFEP 137 (307)
T ss_pred EEEeccc------------cceeEEEEcccCCCCc
Confidence 4444322 2478888876554333
No 377
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=96.45 E-value=0.84 Score=50.75 Aligned_cols=103 Identities=17% Similarity=0.114 Sum_probs=61.2
Q ss_pred cceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCCC
Q 004574 34 NFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPP 113 (744)
Q Consensus 34 ~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~ 113 (744)
.--++||.|+++|-.- . + ++|.+..--+.+-...|..--+-+...+..++||+||.+|+....
T Consensus 209 t~~~~spn~~~~Aa~d------~--d--GrI~vw~d~~~~~~~~t~t~lHWH~~~V~~L~fS~~G~~LlSGG~------- 271 (792)
T KOG1963|consen 209 TCVALSPNERYLAAGD------S--D--GRILVWRDFGSSDDSETCTLLHWHHDEVNSLSFSSDGAYLLSGGR------- 271 (792)
T ss_pred eeEEeccccceEEEec------c--C--CcEEEEeccccccccccceEEEecccccceeEEecCCceEeeccc-------
Confidence 3479999999999852 1 3 445555433312222221111111225778999999999985411
Q ss_pred CCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC--ceeeeeccCCCCceEE
Q 004574 114 KKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP--AVYTAVEPSPDQKYVL 190 (744)
Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~--~~~~~~~~SpDG~~i~ 190 (744)
.+-+.++.+ +++ +++... ..+..+.+||||....
T Consensus 272 ------------------------------------------E~VLv~Wq~~T~~-kqfLPRLgs~I~~i~vS~ds~~~s 308 (792)
T KOG1963|consen 272 ------------------------------------------EGVLVLWQLETGK-KQFLPRLGSPILHIVVSPDSDLYS 308 (792)
T ss_pred ------------------------------------------ceEEEEEeecCCC-cccccccCCeeEEEEEcCCCCeEE
Confidence 145666777 556 444444 5677888888888766
Q ss_pred EEEeeC
Q 004574 191 ITSMHR 196 (744)
Q Consensus 191 ~~~~~~ 196 (744)
....++
T Consensus 309 l~~~DN 314 (792)
T KOG1963|consen 309 LVLEDN 314 (792)
T ss_pred EEecCc
Confidence 665543
No 378
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=96.45 E-value=0.023 Score=60.73 Aligned_cols=69 Identities=12% Similarity=0.082 Sum_probs=48.4
Q ss_pred CCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceec-cccCCCccccccccceEEecCCcEEEEEec
Q 004574 28 PDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKP-LFESPDICLNAVFGSFVWVNNSTLLIFTIP 106 (744)
Q Consensus 28 ~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~-lt~~~~~~~~~~~~~~~wspDg~~l~~~~~ 106 (744)
.+|..+.....||+|+.||-.++ +......-|++.+..+=...+ |-.+.- ++..++|||||++|+.++.
T Consensus 523 GHGyEv~~l~~s~~gnliASaCK-----S~~~ehAvI~lw~t~~W~~~~~L~~HsL-----TVT~l~FSpdg~~LLsvsR 592 (764)
T KOG1063|consen 523 GHGYEVYALAISPTGNLIASACK-----SSLKEHAVIRLWNTANWLQVQELEGHSL-----TVTRLAFSPDGRYLLSVSR 592 (764)
T ss_pred cCceeEEEEEecCCCCEEeehhh-----hCCccceEEEEEeccchhhhheecccce-----EEEEEEECCCCcEEEEeec
Confidence 45556788999999999999875 223555778888855433333 322221 5778999999999988754
No 379
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=96.43 E-value=0.39 Score=46.26 Aligned_cols=116 Identities=15% Similarity=0.209 Sum_probs=64.9
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCee--eeccCCCCCCCCCcccCCccCCCCccceec-
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV--RELCDLPPAEDIPVCYNSVREGMRSISWRA- 251 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~--~~l~~~~~~~~~~~~~~~~~~~~~~~~~sp- 251 (744)
+.+.-+.|-|++..|+-.... +|-+|+++.+.. ..+......+ ......+-+|||
T Consensus 124 g~i~cvew~Pns~klasm~dn--------------~i~l~~l~ess~~vaev~ss~s~e--------~~~~ftsg~WspH 181 (370)
T KOG1007|consen 124 GKINCVEWEPNSDKLASMDDN--------------NIVLWSLDESSKIVAEVLSSESAE--------MRHSFTSGAWSPH 181 (370)
T ss_pred CceeeEEEcCCCCeeEEeccC--------------ceEEEEcccCcchheeeccccccc--------ccceecccccCCC
Confidence 677788999999999876422 688888875432 2221111111 112244568998
Q ss_pred -CCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeee-eccceeceeeccCCceEEEeeeeeccceeEEEE
Q 004574 252 -DKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHK-LDLRFRSVSWCDDSLALVNETWYKTSQTRTWLV 329 (744)
Q Consensus 252 -Dg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~-~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~ 329 (744)
||.. ++.+. ...++.+|.+ .-.+...+-. ....+.++.|.|+-+.++.+. .+++.-+||-.
T Consensus 182 Hdgnq-v~tt~-------------d~tl~~~D~R--T~~~~~sI~dAHgq~vrdlDfNpnkq~~lvt~-gDdgyvriWD~ 244 (370)
T KOG1007|consen 182 HDGNQ-VATTS-------------DSTLQFWDLR--TMKKNNSIEDAHGQRVRDLDFNPNKQHILVTC-GDDGYVRIWDT 244 (370)
T ss_pred Cccce-EEEeC-------------CCcEEEEEcc--chhhhcchhhhhcceeeeccCCCCceEEEEEc-CCCccEEEEec
Confidence 6665 44332 1136666652 1111111211 133477888999999888877 44455555543
No 380
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=96.40 E-value=0.51 Score=47.49 Aligned_cols=123 Identities=15% Similarity=0.194 Sum_probs=74.3
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeC-CCCeeeeccCCCCCCCCCcccCCccCCCCccceecCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTT-DGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADK 253 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~-~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg 253 (744)
+.+....|.|=...++-+..++ ..+.+|.+ ++...+.++.. .. .+.....-+.-+.|.|-.
T Consensus 82 ~~vLDi~w~PfnD~vIASgSeD------------~~v~vW~IPe~~l~~~ltep-vv-----~L~gH~rrVg~V~wHPtA 143 (472)
T KOG0303|consen 82 APVLDIDWCPFNDCVIASGSED------------TKVMVWQIPENGLTRDLTEP-VV-----ELYGHQRRVGLVQWHPTA 143 (472)
T ss_pred ccccccccCccCCceeecCCCC------------ceEEEEECCCcccccCcccc-eE-----EEeecceeEEEEeecccc
Confidence 6677899999887776665443 36778875 34433433311 00 001111114457888887
Q ss_pred CceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 254 PSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 254 ~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
.. +.+.+ + ....+.++++ .+|+...-.....-+.+.+|+-||..|+.++. ...|.++|.-.
T Consensus 144 ~N-VLlsa---g--------~Dn~v~iWnv---~tgeali~l~hpd~i~S~sfn~dGs~l~Ttck----DKkvRv~dpr~ 204 (472)
T KOG0303|consen 144 PN-VLLSA---G--------SDNTVSIWNV---GTGEALITLDHPDMVYSMSFNRDGSLLCTTCK----DKKVRVIDPRR 204 (472)
T ss_pred hh-hHhhc---c--------CCceEEEEec---cCCceeeecCCCCeEEEEEeccCCceeeeecc----cceeEEEcCCC
Confidence 66 44442 1 1224788888 44443222235667889999999999998873 34688888766
Q ss_pred C
Q 004574 334 K 334 (744)
Q Consensus 334 ~ 334 (744)
+
T Consensus 205 ~ 205 (472)
T KOG0303|consen 205 G 205 (472)
T ss_pred C
Confidence 3
No 381
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=96.32 E-value=0.56 Score=45.29 Aligned_cols=237 Identities=14% Similarity=0.140 Sum_probs=126.6
Q ss_pred ceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceecc---ccCCC
Q 004574 6 GIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPL---FESPD 82 (744)
Q Consensus 6 ~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~l---t~~~~ 82 (744)
.|-+..++. .+|+...-..+....-.+...|.||-+-+. ++.....+..-.||.+..+.++...- .....
T Consensus 74 kvqiv~ld~--~s~e~~~~a~fd~~YP~tK~~wiPd~~g~~-----pdlLATs~D~LRlWri~~ee~~~~~~~~L~~~kn 146 (364)
T KOG0290|consen 74 KVQIVQLDE--DSGELVEDANFDHPYPVTKLMWIPDSKGVY-----PDLLATSSDFLRLWRIGDEESRVELQSVLNNNKN 146 (364)
T ss_pred eeEEEEEcc--CCCceeccCCCCCCCCccceEecCCccccC-----cchhhcccCeEEEEeccCcCCceehhhhhccCcc
Confidence 345555543 345544443334444577889999975211 01122225567888887655444332 22223
Q ss_pred ccccccccceEEec-CCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEE
Q 004574 83 ICLNAVFGSFVWVN-NSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVL 161 (744)
Q Consensus 83 ~~~~~~~~~~~wsp-Dg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 161 (744)
..+.+-..+|.|.- |-++|...+-+ .+..||-
T Consensus 147 s~~~aPlTSFDWne~dp~~igtSSiD-----------------------------------------------TTCTiWd 179 (364)
T KOG0290|consen 147 SEFCAPLTSFDWNEVDPNLIGTSSID-----------------------------------------------TTCTIWD 179 (364)
T ss_pred cccCCcccccccccCCcceeEeeccc-----------------------------------------------CeEEEEE
Confidence 33333355778885 45555443221 1145554
Q ss_pred EcC--CC-CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCccc
Q 004574 162 GSL--DG-TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCY 237 (744)
Q Consensus 162 ~~~--~G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~ 237 (744)
+.. .| -.+||..+ ..+..++|+.+|..++.+...+ +.+.++|+...+...|...+.....|
T Consensus 180 ie~~~~~~vkTQLIAHDKEV~DIaf~~~s~~~FASvgaD------------GSvRmFDLR~leHSTIIYE~p~~~~p--- 244 (364)
T KOG0290|consen 180 IETGVSGTVKTQLIAHDKEVYDIAFLKGSRDVFASVGAD------------GSVRMFDLRSLEHSTIIYEDPSPSTP--- 244 (364)
T ss_pred EeeccccceeeEEEecCcceeEEEeccCccceEEEecCC------------CcEEEEEecccccceEEecCCCCCCc---
Confidence 433 34 34566666 8899999999888887665543 36777787665544443322111111
Q ss_pred CCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEeeeeccceeceeeccCCceEEEe
Q 004574 238 NSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEILHKLDLRFRSVSWCDDSLALVNE 316 (744)
Q Consensus 238 ~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~~~~~~~~~~~~SpDg~~l~~~ 316 (744)
.-.++|++.....++-... ....+.++|++ .... ..+|-.+.+.++.++|.|-...-+.+
T Consensus 245 ------LlRLswnkqDpnymATf~~-----------dS~~V~iLDiR--~P~tpva~L~~H~a~VNgIaWaPhS~~hict 305 (364)
T KOG0290|consen 245 ------LLRLSWNKQDPNYMATFAM-----------DSNKVVILDIR--VPCTPVARLRNHQASVNGIAWAPHSSSHICT 305 (364)
T ss_pred ------ceeeccCcCCchHHhhhhc-----------CCceEEEEEec--CCCcceehhhcCcccccceEecCCCCceeee
Confidence 3345676655442322210 11236666663 2223 33456668889999999998776665
Q ss_pred eeeeccceeEEEEcCCC
Q 004574 317 TWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 317 ~~~~~~~~~l~~~~~~~ 333 (744)
+ .++. +..++|++.
T Consensus 306 a-GDD~--qaliWDl~q 319 (364)
T KOG0290|consen 306 A-GDDC--QALIWDLQQ 319 (364)
T ss_pred c-CCcc--eEEEEeccc
Confidence 5 2223 344556654
No 382
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=96.31 E-value=0.51 Score=45.65 Aligned_cols=58 Identities=22% Similarity=0.290 Sum_probs=34.5
Q ss_pred eeecCCCCeEEEeeecccccccCCCceeEEEEECCCC--ceeccccCCCccccccccceEEecCCcEEEEEe
Q 004574 36 VSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETG--EAKPLFESPDICLNAVFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 36 p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg--~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~ 105 (744)
.+.++||+.||.+.. ..|-+...... ....-++-+ .+-++...-++||||+..|+++.
T Consensus 3 ~~~~~~Gk~lAi~qd-----------~~iEiRsa~Ddf~si~~kcqVp-kD~~PQWRkl~WSpD~tlLa~a~ 62 (282)
T PF15492_consen 3 LALSSDGKLLAILQD-----------QCIEIRSAKDDFSSIIGKCQVP-KDPNPQWRKLAWSPDCTLLAYAE 62 (282)
T ss_pred eeecCCCcEEEEEec-----------cEEEEEeccCCchheeEEEecC-CCCCchheEEEECCCCcEEEEEc
Confidence 567899999999864 33444333321 111111112 12244567889999999999973
No 383
>PTZ00472 serine carboxypeptidase (CBP1); Provisional
Probab=96.29 E-value=0.034 Score=60.13 Aligned_cols=64 Identities=17% Similarity=0.149 Sum_probs=48.7
Q ss_pred CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhC-----------------C---------C-----cEEEEEeCCCCcccC
Q 004574 665 KKPILIIHGEVDDKVGLFPMQAERFFDALKGH-----------------G---------A-----LSRLVLLPFEHHVYA 713 (744)
Q Consensus 665 ~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~-----------------~---------~-----~~~~~~~~~~~H~~~ 713 (744)
..++||.+|..|.+|+ ....+++.++|+-. + . +..++.+.++||+..
T Consensus 364 gikVLiYnGd~D~icn--~~Gt~~wi~~L~w~g~~~f~~a~~~~w~~~~~~v~G~vk~~~~~~~~~l~~~~V~~AGH~vp 441 (462)
T PTZ00472 364 GVRVMIYAGDMDFICN--WIGNKAWTLALQWPGNAEFNAAPDVPFSAVDGRWAGLVRSAASNTSSGFSFVQVYNAGHMVP 441 (462)
T ss_pred CceEEEEECCcCeecC--cHhHHHHHHhCCCCCccchhhcCccccEecCCEeceEEEEEecccCCCeEEEEECCCCccCh
Confidence 4799999999999998 88888888877511 1 1 355677789999876
Q ss_pred ccccHHHHHHHHHHHHHH
Q 004574 714 ARENVMHVIWETDRWLQK 731 (744)
Q Consensus 714 ~~~~~~~~~~~~~~fl~~ 731 (744)
.+.++.....+..|+..
T Consensus 442 -~d~P~~~~~~i~~fl~~ 458 (462)
T PTZ00472 442 -MDQPAVALTMINRFLRN 458 (462)
T ss_pred -hhHHHHHHHHHHHHHcC
Confidence 56677888888888753
No 384
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.29 E-value=0.36 Score=49.85 Aligned_cols=222 Identities=12% Similarity=0.147 Sum_probs=126.2
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCce-eccccCCCccccccccceEEecCCcEEEEEecCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA-KPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRR 110 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~-~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~ 110 (744)
.+.+..|-.||+.+|.- +..+.+-+.|..+... +++-.+.. .+....|+|++..++....+ +
T Consensus 70 ~v~s~~fR~DG~LlaaG----------D~sG~V~vfD~k~r~iLR~~~ah~a-----pv~~~~f~~~d~t~l~s~sD--d 132 (487)
T KOG0310|consen 70 VVYSVDFRSDGRLLAAG----------DESGHVKVFDMKSRVILRQLYAHQA-----PVHVTKFSPQDNTMLVSGSD--D 132 (487)
T ss_pred ceeEEEeecCCeEEEcc----------CCcCcEEEeccccHHHHHHHhhccC-----ceeEEEecccCCeEEEecCC--C
Confidence 47788999999888772 3445556667444222 33322221 24456899987776655321 0
Q ss_pred CCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCC-Ce-eecCCC-ceeeeeccCCCCc
Q 004574 111 DPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDG-TA-KDFGTP-AVYTAVEPSPDQK 187 (744)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G-~~-~~l~~~-~~~~~~~~SpDG~ 187 (744)
.-+-.+|+++ .. ..|..+ ..+...+|+|-..
T Consensus 133 ----------------------------------------------~v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~ 166 (487)
T KOG0310|consen 133 ----------------------------------------------KVVKYWDLSTAYVQAELSGHTDYVRCGDISPAND 166 (487)
T ss_pred ----------------------------------------------ceEEEEEcCCcEEEEEecCCcceeEeeccccCCC
Confidence 1223335533 22 234444 7778899999999
Q ss_pred eEEEEEeeCCcccccccCCCcceEEEEeCCCCe--eeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecC
Q 004574 188 YVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL--VRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDR 265 (744)
Q Consensus 188 ~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~--~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~ 265 (744)
.|+++. .|...|.+||..... +..+..+ .| +..+-+.|.|.. |+.+.
T Consensus 167 hivvtG------------sYDg~vrl~DtR~~~~~v~elnhg-----~p---------Ve~vl~lpsgs~-iasAg---- 215 (487)
T KOG0310|consen 167 HIVVTG------------SYDGKVRLWDTRSLTSRVVELNHG-----CP---------VESVLALPSGSL-IASAG---- 215 (487)
T ss_pred eEEEec------------CCCceEEEEEeccCCceeEEecCC-----Cc---------eeeEEEcCCCCE-EEEcC----
Confidence 998883 334578889876543 2222221 22 557788888875 54441
Q ss_pred CCCCccCCccceEEeccCCCCCCCCce-EeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeecc
Q 004574 266 GDANVEVSPRDIIYTQPAEPAEGEKPE-ILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDR 344 (744)
Q Consensus 266 ~~~~~~~~~~~~l~~~~~~~~~~~~~~-~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~ 344 (744)
+ ..+-+||+- .|++.. ......-.+..+.+..|++.|+..+- ..++.+.+... .+.+...
T Consensus 216 G---------n~vkVWDl~--~G~qll~~~~~H~KtVTcL~l~s~~~rLlS~sL----D~~VKVfd~t~----~Kvv~s~ 276 (487)
T KOG0310|consen 216 G---------NSVKVWDLT--TGGQLLTSMFNHNKTVTCLRLASDSTRLLSGSL----DRHVKVFDTTN----YKVVHSW 276 (487)
T ss_pred C---------CeEEEEEec--CCceehhhhhcccceEEEEEeecCCceEeeccc----ccceEEEEccc----eEEEEee
Confidence 1 137777771 233322 22335667888999999988887662 23455666433 1233221
Q ss_pred ccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 345 VFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 345 ~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.+.. ....++.|||++.++..-
T Consensus 277 ~~~~-----pvLsiavs~dd~t~viGm 298 (487)
T KOG0310|consen 277 KYPG-----PVLSIAVSPDDQTVVIGM 298 (487)
T ss_pred eccc-----ceeeEEecCCCceEEEec
Confidence 1111 112255799998887764
No 385
>PF05577 Peptidase_S28: Serine carboxypeptidase S28; InterPro: IPR008758 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine peptidases belong to MEROPS peptidase family S28 (clan SC). The predicted active site residues for members of this family and family S10 occur in the same order in the sequence: S, D, H. These serine proteases include several eukaryotic enzymes such as lysosomal Pro-X carboxypeptidase, dipeptidyl-peptidase II, and thymus-specific serine peptidase [, , , ].; GO: 0008236 serine-type peptidase activity, 0006508 proteolysis; PDB: 3N2Z_B 3JYH_A 3N0T_C.
Probab=96.23 E-value=0.02 Score=62.07 Aligned_cols=58 Identities=26% Similarity=0.220 Sum_probs=44.0
Q ss_pred hHHHHHHHHHHHHHcC-CCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCCC
Q 004574 574 PNDSAEAAVEEVVRRG-VADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYNK 631 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~-~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~~ 631 (744)
.+.|+...+++++.+. ..+..+++++|.|+||.+|.++-.++|+.|.|+++.++++..
T Consensus 92 ALaD~a~F~~~~~~~~~~~~~~pwI~~GgSY~G~Laaw~r~kyP~~~~ga~ASSapv~a 150 (434)
T PF05577_consen 92 ALADLAYFIRYVKKKYNTAPNSPWIVFGGSYGGALAAWFRLKYPHLFDGAWASSAPVQA 150 (434)
T ss_dssp HHHHHHHHHHHHHHHTTTGCC--EEEEEETHHHHHHHHHHHH-TTT-SEEEEET--CCH
T ss_pred HHHHHHHHHHHHHHhhcCCCCCCEEEECCcchhHHHHHHHhhCCCeeEEEEeccceeee
Confidence 3458999999998653 334569999999999999999999999999999999988653
No 386
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=96.22 E-value=0.68 Score=49.05 Aligned_cols=205 Identities=13% Similarity=0.045 Sum_probs=117.7
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
......+++|++++... .....|.+++.++.....-..... .-..++++||++.++.+-...
T Consensus 76 p~~i~v~~~~~~vyv~~---------~~~~~v~vid~~~~~~~~~~~vG~-----~P~~~~~~~~~~~vYV~n~~~---- 137 (381)
T COG3391 76 PAGVAVNPAGNKVYVTT---------GDSNTVSVIDTATNTVLGSIPVGL-----GPVGLAVDPDGKYVYVANAGN---- 137 (381)
T ss_pred ccceeeCCCCCeEEEec---------CCCCeEEEEcCcccceeeEeeecc-----CCceEEECCCCCEEEEEeccc----
Confidence 44567888888775543 224678888855443322211111 124679999999998873210
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeec-CCCceeeeeccCCCCceEE
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDF-GTPAVYTAVEPSPDQKYVL 190 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l-~~~~~~~~~~~SpDG~~i~ 190 (744)
+...+.++|. +++.... .........+++|||++++
T Consensus 138 ------------------------------------------~~~~vsvid~~t~~~~~~~~vG~~P~~~a~~p~g~~vy 175 (381)
T COG3391 138 ------------------------------------------GNNTVSVIDAATNKVTATIPVGNTPTGVAVDPDGNKVY 175 (381)
T ss_pred ------------------------------------------CCceEEEEeCCCCeEEEEEecCCCcceEEECCCCCeEE
Confidence 1157788887 4533322 2222336889999999998
Q ss_pred EEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCc
Q 004574 191 ITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANV 270 (744)
Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~ 270 (744)
...... ..+.+.+..+....+ ...... .....++..+.++|||++ +|... ....
T Consensus 176 v~~~~~------------~~v~vi~~~~~~v~~-~~~~~~-------~~~~~~P~~i~v~~~g~~-~yV~~-~~~~---- 229 (381)
T COG3391 176 VTNSDD------------NTVSVIDTSGNSVVR-GSVGSL-------VGVGTGPAGIAVDPDGNR-VYVAN-DGSG---- 229 (381)
T ss_pred EEecCC------------CeEEEEeCCCcceec-cccccc-------cccCCCCceEEECCCCCE-EEEEe-ccCC----
Confidence 875332 368888877665554 221110 011123667899999996 44442 2111
Q ss_pred cCCccceEEeccCCCCCCCCceEe-eeecc-ceeceeeccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 271 EVSPRDIIYTQPAEPAEGEKPEIL-HKLDL-RFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 271 ~~~~~~~l~~~~~~~~~~~~~~~l-~~~~~-~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
...+.+++. ..+..... ..... .......+|+|+.++..... ...+..+|..+
T Consensus 230 ----~~~v~~id~---~~~~v~~~~~~~~~~~~~~v~~~p~g~~~yv~~~~---~~~V~vid~~~ 284 (381)
T COG3391 230 ----SNNVLKIDT---ATGNVTATDLPVGSGAPRGVAVDPAGKAAYVANSQ---GGTVSVIDGAT 284 (381)
T ss_pred ----CceEEEEeC---CCceEEEeccccccCCCCceeECCCCCEEEEEecC---CCeEEEEeCCC
Confidence 234777776 33333322 11111 34457889999988776422 34677887665
No 387
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=96.21 E-value=1.3 Score=42.91 Aligned_cols=135 Identities=16% Similarity=0.123 Sum_probs=69.4
Q ss_pred EEEEEcCCC--CeeecCCC---ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCC
Q 004574 158 QLVLGSLDG--TAKDFGTP---AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAED 232 (744)
Q Consensus 158 ~l~~~~~~G--~~~~l~~~---~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~ 232 (744)
-.+++|.++ ++..+... ..+---.|||||+.||-+.+..+ ....-|-+||...+ ..++...+.
T Consensus 92 f~~vfD~~~~~~pv~~~s~~~RHfyGHGvfs~dG~~LYATEndfd--------~~rGViGvYd~r~~-fqrvgE~~t--- 159 (366)
T COG3490 92 FAMVFDPNGAQEPVTLVSQEGRHFYGHGVFSPDGRLLYATENDFD--------PNRGVIGVYDAREG-FQRVGEFST--- 159 (366)
T ss_pred eEEEECCCCCcCcEEEecccCceeecccccCCCCcEEEeecCCCC--------CCCceEEEEecccc-cceeccccc---
Confidence 356678855 55555443 23335579999998876654431 12346888887633 233322221
Q ss_pred CCcccCCccCCCCccceecCCCceEEEEEee-----cCCCCCcc-CCccceEEeccCCCCCCCCceE--ee--eecccee
Q 004574 233 IPVCYNSVREGMRSISWRADKPSTLYWVEAQ-----DRGDANVE-VSPRDIIYTQPAEPAEGEKPEI--LH--KLDLRFR 302 (744)
Q Consensus 233 ~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~-----~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~--l~--~~~~~~~ 302 (744)
...|+.++.|.+||+. |+...-. +-+....+ .++.-.+.+++.. ++.-.++ |. .....+.
T Consensus 160 -------~GiGpHev~lm~DGrt-lvvanGGIethpdfgR~~lNldsMePSlvlld~a--tG~liekh~Lp~~l~~lSiR 229 (366)
T COG3490 160 -------HGIGPHEVTLMADGRT-LVVANGGIETHPDFGRTELNLDSMEPSLVLLDAA--TGNLIEKHTLPASLRQLSIR 229 (366)
T ss_pred -------CCcCcceeEEecCCcE-EEEeCCceecccccCccccchhhcCccEEEEecc--ccchhhhccCchhhhhccee
Confidence 2234668999999996 5554210 11111111 2233346666631 2222222 22 1133556
Q ss_pred ceeeccCCceEE
Q 004574 303 SVSWCDDSLALV 314 (744)
Q Consensus 303 ~~~~SpDg~~l~ 314 (744)
.+..-+||+.++
T Consensus 230 Hld~g~dgtvwf 241 (366)
T COG3490 230 HLDIGRDGTVWF 241 (366)
T ss_pred eeeeCCCCcEEE
Confidence 677778887443
No 388
>KOG3975 consensus Uncharacterized conserved protein [Function unknown]
Probab=96.19 E-value=0.33 Score=45.93 Aligned_cols=52 Identities=19% Similarity=0.200 Sum_probs=37.5
Q ss_pred HHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC-CC-ceeEEEEccCC
Q 004574 576 DSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA-PH-LFCCGIARSGS 628 (744)
Q Consensus 576 ~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~-p~-~~~~~v~~~~~ 628 (744)
+.+..-++++++.-.-| .||.++|||-|+++.+.+.... ++ .+..++++-|-
T Consensus 93 ~QV~HKlaFik~~~Pk~-~ki~iiGHSiGaYm~Lqil~~~k~~~~vqKa~~LFPT 146 (301)
T KOG3975|consen 93 DQVDHKLAFIKEYVPKD-RKIYIIGHSIGAYMVLQILPSIKLVFSVQKAVLLFPT 146 (301)
T ss_pred hHHHHHHHHHHHhCCCC-CEEEEEecchhHHHHHHHhhhcccccceEEEEEecch
Confidence 36777788888764333 6999999999999999998743 22 45666666653
No 389
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=96.15 E-value=0.13 Score=49.74 Aligned_cols=191 Identities=12% Similarity=0.103 Sum_probs=105.6
Q ss_pred CcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCC
Q 004574 31 AKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRR 110 (744)
Q Consensus 31 ~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~ 110 (744)
.-+...++.|-|.+|+.. ....-+.++|+.+-+--.-.. +...+...+..+..|+.|+..+-.+.+
T Consensus 217 ~~vrsiSfHPsGefllvg----------TdHp~~rlYdv~T~Qcfvsan-Pd~qht~ai~~V~Ys~t~~lYvTaSkD--- 282 (430)
T KOG0640|consen 217 EPVRSISFHPSGEFLLVG----------TDHPTLRLYDVNTYQCFVSAN-PDDQHTGAITQVRYSSTGSLYVTASKD--- 282 (430)
T ss_pred ceeeeEeecCCCceEEEe----------cCCCceeEEeccceeEeeecC-cccccccceeEEEecCCccEEEEeccC---
Confidence 357889999999999985 344567778887755432222 322233345677899999855443321
Q ss_pred CCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-CeeecCCC---ceeeeeccCCC
Q 004574 111 DPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-TAKDFGTP---AVYTAVEPSPD 185 (744)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~~~~l~~~---~~~~~~~~SpD 185 (744)
+.|-++|= ++ =++.+... ..+.+..|+.+
T Consensus 283 ----------------------------------------------G~IklwDGVS~rCv~t~~~AH~gsevcSa~Ftkn 316 (430)
T KOG0640|consen 283 ----------------------------------------------GAIKLWDGVSNRCVRTIGNAHGGSEVCSAVFTKN 316 (430)
T ss_pred ----------------------------------------------CcEEeeccccHHHHHHHHhhcCCceeeeEEEccC
Confidence 34555554 33 23333332 56778999999
Q ss_pred CceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCC-CCcccCCccCCCCccceecCCCceEEEEEeec
Q 004574 186 QKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAED-IPVCYNSVREGMRSISWRADKPSTLYWVEAQD 264 (744)
Q Consensus 186 G~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~ 264 (744)
||+|+-...+ ..+++|++.+++....-.+.+..+ ....-+++-....++...||...
T Consensus 317 ~kyiLsSG~D-------------S~vkLWEi~t~R~l~~YtGAg~tgrq~~rtqAvFNhtEdyVl~pDEas--------- 374 (430)
T KOG0640|consen 317 GKYILSSGKD-------------STVKLWEISTGRMLKEYTGAGTTGRQKHRTQAVFNHTEDYVLFPDEAS--------- 374 (430)
T ss_pred CeEEeecCCc-------------ceeeeeeecCCceEEEEecCCcccchhhhhhhhhcCccceEEcccccc---------
Confidence 9999766444 368888888665444333332211 11111111111222333444322
Q ss_pred CCCCCccCCccceEEeccCCCCCCCCceEeee--eccceeceeeccCCceEEEee
Q 004574 265 RGDANVEVSPRDIIYTQPAEPAEGEKPEILHK--LDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 265 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~--~~~~~~~~~~SpDg~~l~~~~ 317 (744)
..+..+|. -++..+.+.. .++.+..+.-||.+--++..+
T Consensus 375 -----------~slcsWda---Rtadr~~l~slgHn~a~R~i~HSP~~p~FmTcs 415 (430)
T KOG0640|consen 375 -----------NSLCSWDA---RTADRVALLSLGHNGAVRWIVHSPVEPAFMTCS 415 (430)
T ss_pred -----------Cceeeccc---cchhhhhhcccCCCCCceEEEeCCCCCceeeec
Confidence 23566665 3344444432 345566677788887666554
No 390
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=96.14 E-value=1.7 Score=46.05 Aligned_cols=214 Identities=11% Similarity=0.016 Sum_probs=117.4
Q ss_pred eeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCce
Q 004574 177 YTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPST 256 (744)
Q Consensus 177 ~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~ 256 (744)
..+.+.+++|++++...... ..+.++|........-.... ..+..++++|+++.
T Consensus 76 p~~i~v~~~~~~vyv~~~~~------------~~v~vid~~~~~~~~~~~vG-------------~~P~~~~~~~~~~~- 129 (381)
T COG3391 76 PAGVAVNPAGNKVYVTTGDS------------NTVSVIDTATNTVLGSIPVG-------------LGPVGLAVDPDGKY- 129 (381)
T ss_pred ccceeeCCCCCeEEEecCCC------------CeEEEEcCcccceeeEeeec-------------cCCceEEECCCCCE-
Confidence 34677899999888775442 36777774443322211111 12668899999997
Q ss_pred EEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCC
Q 004574 257 LYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDV 336 (744)
Q Consensus 257 l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~ 336 (744)
+|...... ....+.+++. .+............-...+++|+|+.++... .....|..+|..+.
T Consensus 130 vYV~n~~~---------~~~~vsvid~---~t~~~~~~~~vG~~P~~~a~~p~g~~vyv~~---~~~~~v~vi~~~~~-- 192 (381)
T COG3391 130 VYVANAGN---------GNNTVSVIDA---ATNKVTATIPVGNTPTGVAVDPDGNKVYVTN---SDDNTVSVIDTSGN-- 192 (381)
T ss_pred EEEEeccc---------CCceEEEEeC---CCCeEEEEEecCCCcceEEECCCCCeEEEEe---cCCCeEEEEeCCCc--
Confidence 66664221 1235788776 4443332222221226789999999777654 34557888886652
Q ss_pred cceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccch
Q 004574 337 APRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNRE 416 (744)
Q Consensus 337 ~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~ 416 (744)
...+-...........|.. +.++|||.+++...... ....+.++|..++........-+.
T Consensus 193 ~v~~~~~~~~~~~~~~P~~--i~v~~~g~~~yV~~~~~------------------~~~~v~~id~~~~~v~~~~~~~~~ 252 (381)
T COG3391 193 SVVRGSVGSLVGVGTGPAG--IAVDPDGNRVYVANDGS------------------GSNNVLKIDTATGNVTATDLPVGS 252 (381)
T ss_pred ceeccccccccccCCCCce--EEECCCCCEEEEEeccC------------------CCceEEEEeCCCceEEEecccccc
Confidence 2221110101111112222 55899999887765221 024588888887766543111111
Q ss_pred hhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceee
Q 004574 417 KYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQ 466 (744)
Q Consensus 417 ~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~ 466 (744)
. .....+.+|+|+.+.. .... ...++.+|..+.....
T Consensus 253 --------~--~~~~v~~~p~g~~~yv-~~~~--~~~V~vid~~~~~v~~ 289 (381)
T COG3391 253 --------G--APRGVAVDPAGKAAYV-ANSQ--GGTVSVIDGATDRVVK 289 (381)
T ss_pred --------C--CCCceeECCCCCEEEE-EecC--CCeEEEEeCCCCceee
Confidence 0 1122578899976444 4333 4568888866654443
No 391
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=96.13 E-value=1.1 Score=41.38 Aligned_cols=138 Identities=8% Similarity=0.034 Sum_probs=73.9
Q ss_pred eEEEEEcC-CC-CeeecCCC-ceee-eeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCC
Q 004574 157 AQLVLGSL-DG-TAKDFGTP-AVYT-AVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAED 232 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l~~~-~~~~-~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~ 232 (744)
..||+-|- .| ....+.-+ +-+. -++|+ |--++-.+. ...|..||+.-....+.......++
T Consensus 163 c~iy~tdc~~g~~~~a~sghtghilalyswn--~~m~~sgsq-------------dktirfwdlrv~~~v~~l~~~~~~~ 227 (350)
T KOG0641|consen 163 CKIYITDCGRGQGFHALSGHTGHILALYSWN--GAMFASGSQ-------------DKTIRFWDLRVNSCVNTLDNDFHDG 227 (350)
T ss_pred ceEEEeecCCCCcceeecCCcccEEEEEEec--CcEEEccCC-------------CceEEEEeeeccceeeeccCcccCC
Confidence 67888888 77 34444433 3333 45675 433322211 2367778775433222222211110
Q ss_pred CCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC-ceEeeeeccceeceeeccCCc
Q 004574 233 IPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK-PEILHKLDLRFRSVSWCDDSL 311 (744)
Q Consensus 233 ~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~l~~~~~~~~~~~~SpDg~ 311 (744)
.. ..--+..++..|.|+- |+ ... ......++++ .++. .+...+....+..+.|||.-.
T Consensus 228 gl-----essavaav~vdpsgrl-l~-sg~-----------~dssc~lydi---rg~r~iq~f~phsadir~vrfsp~a~ 286 (350)
T KOG0641|consen 228 GL-----ESSAVAAVAVDPSGRL-LA-SGH-----------ADSSCMLYDI---RGGRMIQRFHPHSADIRCVRFSPGAH 286 (350)
T ss_pred Cc-----ccceeEEEEECCCcce-ee-ecc-----------CCCceEEEEe---eCCceeeeeCCCccceeEEEeCCCce
Confidence 00 0001345677888873 32 211 1112455555 3444 445666677889999999998
Q ss_pred eEEEeeeeeccceeEEEEcCCCC
Q 004574 312 ALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 312 ~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
+++..+. ...|.+.|+++.
T Consensus 287 yllt~sy----d~~ikltdlqgd 305 (350)
T KOG0641|consen 287 YLLTCSY----DMKIKLTDLQGD 305 (350)
T ss_pred EEEEecc----cceEEEeecccc
Confidence 8887762 235778888874
No 392
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=96.05 E-value=0.094 Score=52.69 Aligned_cols=60 Identities=13% Similarity=0.242 Sum_probs=43.6
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
..+..++|+||+..+++.+..+ .+++||+..++...+..... ..+..++|.|-++
T Consensus 124 ~diydL~Ws~d~~~l~s~s~dn-------------s~~l~Dv~~G~l~~~~~dh~------------~yvqgvawDpl~q 178 (434)
T KOG1009|consen 124 DDIYDLAWSPDSNFLVSGSVDN-------------SVRLWDVHAGQLLAILDDHE------------HYVQGVAWDPLNQ 178 (434)
T ss_pred cchhhhhccCCCceeeeeeccc-------------eEEEEEeccceeEeeccccc------------cccceeecchhhh
Confidence 5677899999999999887775 68999988666555443322 2266789999887
Q ss_pred ceEEEE
Q 004574 255 STLYWV 260 (744)
Q Consensus 255 ~~l~~~ 260 (744)
+ ++-.
T Consensus 179 y-v~s~ 183 (434)
T KOG1009|consen 179 Y-VASK 183 (434)
T ss_pred h-hhhh
Confidence 6 4444
No 393
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=96.02 E-value=0.81 Score=48.43 Aligned_cols=40 Identities=18% Similarity=0.179 Sum_probs=29.6
Q ss_pred ceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccc
Q 004574 300 RFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRV 345 (744)
Q Consensus 300 ~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~ 345 (744)
.+..+.|+.||-.++..+ ....++++|+-. .++..+-++.
T Consensus 230 svTal~F~d~gL~~aVGt----s~G~v~iyDLRa--~~pl~~kdh~ 269 (703)
T KOG2321|consen 230 SVTALKFRDDGLHVAVGT----STGSVLIYDLRA--SKPLLVKDHG 269 (703)
T ss_pred cceEEEecCCceeEEeec----cCCcEEEEEccc--CCceeecccC
Confidence 477889999999999876 456799999987 4544444433
No 394
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=96.01 E-value=0.36 Score=47.16 Aligned_cols=58 Identities=5% Similarity=0.072 Sum_probs=38.5
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEE
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLI 102 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~ 102 (744)
.++..+|.||.+.|+.+. +....|+.++++|.-.+++...... ....+++.-+|++++
T Consensus 23 e~SGLTy~pd~~tLfaV~---------d~~~~i~els~~G~vlr~i~l~g~~----D~EgI~y~g~~~~vl 80 (248)
T PF06977_consen 23 ELSGLTYNPDTGTLFAVQ---------DEPGEIYELSLDGKVLRRIPLDGFG----DYEGITYLGNGRYVL 80 (248)
T ss_dssp -EEEEEEETTTTEEEEEE---------TTTTEEEEEETT--EEEEEE-SS-S----SEEEEEE-STTEEEE
T ss_pred CccccEEcCCCCeEEEEE---------CCCCEEEEEcCCCCEEEEEeCCCCC----CceeEEEECCCEEEE
Confidence 388999999999988876 5568899999887666666433321 245678887776554
No 395
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=95.92 E-value=0.52 Score=48.35 Aligned_cols=58 Identities=24% Similarity=0.209 Sum_probs=36.4
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEE
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFT 104 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~ 104 (744)
.-..+.||||+|||+. .....|.|.+.++.+.++.+.+... .+.+++|=-.-..|+..
T Consensus 205 il~~avS~Dgkylatg----------g~d~~v~Iw~~~t~ehv~~~~ghr~----~V~~L~fr~gt~~lys~ 262 (479)
T KOG0299|consen 205 ILTLAVSSDGKYLATG----------GRDRHVQIWDCDTLEHVKVFKGHRG----AVSSLAFRKGTSELYSA 262 (479)
T ss_pred eEEEEEcCCCcEEEec----------CCCceEEEecCcccchhhccccccc----ceeeeeeecCccceeee
Confidence 4467899999999993 2223444679899998887433221 34555555444445544
No 396
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=95.91 E-value=1.1 Score=43.81 Aligned_cols=18 Identities=11% Similarity=0.146 Sum_probs=15.4
Q ss_pred cccceEEecCCcEEEEEe
Q 004574 88 VFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 88 ~~~~~~wspDg~~l~~~~ 105 (744)
.++.+.|+||.+.|+.+.
T Consensus 87 nvS~LTynp~~rtLFav~ 104 (316)
T COG3204 87 NVSSLTYNPDTRTLFAVT 104 (316)
T ss_pred cccceeeCCCcceEEEec
Confidence 478899999999998873
No 397
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=95.90 E-value=0.36 Score=53.26 Aligned_cols=91 Identities=12% Similarity=-0.054 Sum_probs=62.2
Q ss_pred ceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCC
Q 004574 209 QKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEG 288 (744)
Q Consensus 209 ~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 288 (744)
..|.++|..+..+.+...+..+. +.++.|||||++ |+.++ ....|.++|+ .+
T Consensus 556 f~I~vvD~~t~kvvR~f~gh~nr------------itd~~FS~DgrW-lisas------------mD~tIr~wDl---pt 607 (910)
T KOG1539|consen 556 FSIRVVDVVTRKVVREFWGHGNR------------ITDMTFSPDGRW-LISAS------------MDSTIRTWDL---PT 607 (910)
T ss_pred eeEEEEEchhhhhhHHhhccccc------------eeeeEeCCCCcE-EEEee------------cCCcEEEEec---cC
Confidence 47888998877655554443332 668899999999 66554 2235888888 55
Q ss_pred CCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEc
Q 004574 289 EKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVC 330 (744)
Q Consensus 289 ~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~ 330 (744)
+........+....++++||.|..|+.+..+. ..||++.
T Consensus 608 ~~lID~~~vd~~~~sls~SPngD~LAT~Hvd~---~gIylWs 646 (910)
T KOG1539|consen 608 GTLIDGLLVDSPCTSLSFSPNGDFLATVHVDQ---NGIYLWS 646 (910)
T ss_pred cceeeeEecCCcceeeEECCCCCEEEEEEecC---ceEEEEE
Confidence 55444444566778899999999999887332 3466654
No 398
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=95.84 E-value=0.017 Score=56.86 Aligned_cols=122 Identities=12% Similarity=0.141 Sum_probs=78.6
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCC-ceeccccCCCc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETG-EAKPLFESPDI 83 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg-~~~~lt~~~~~ 83 (744)
++|.++|+ .+.+.|+.+.-+.+....+|+| +-.-|++. +...+||..|..-- .+..+-.+..
T Consensus 210 rsIvLyD~------R~~~Pl~KVi~~mRTN~IswnP--eafnF~~a--------~ED~nlY~~DmR~l~~p~~v~~dhv- 272 (433)
T KOG0268|consen 210 RSIVLYDL------RQASPLKKVILTMRTNTICWNP--EAFNFVAA--------NEDHNLYTYDMRNLSRPLNVHKDHV- 272 (433)
T ss_pred CceEEEec------ccCCccceeeeeccccceecCc--cccceeec--------cccccceehhhhhhcccchhhcccc-
Confidence 57889998 4455555555666788899999 45666543 56688999986632 2222221111
Q ss_pred cccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEc
Q 004574 84 CLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGS 163 (744)
Q Consensus 84 ~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 163 (744)
+ -+.++.+||-|+-++..+.+ ..|-++.
T Consensus 273 --s-AV~dVdfsptG~EfvsgsyD-------------------------------------------------ksIRIf~ 300 (433)
T KOG0268|consen 273 --S-AVMDVDFSPTGQEFVSGSYD-------------------------------------------------KSIRIFP 300 (433)
T ss_pred --e-eEEEeccCCCcchhcccccc-------------------------------------------------ceEEEee
Confidence 1 25688999999988865432 2455556
Q ss_pred C-CCCeeec--CCC-ceeeeeccCCCCceEEEEEee
Q 004574 164 L-DGTAKDF--GTP-AVYTAVEPSPDQKYVLITSMH 195 (744)
Q Consensus 164 ~-~G~~~~l--~~~-~~~~~~~~SpDG~~i~~~~~~ 195 (744)
+ .|..+.+ |.. ..+....||-|.++|+-.+++
T Consensus 301 ~~~~~SRdiYhtkRMq~V~~Vk~S~Dskyi~SGSdd 336 (433)
T KOG0268|consen 301 VNHGHSRDIYHTKRMQHVFCVKYSMDSKYIISGSDD 336 (433)
T ss_pred cCCCcchhhhhHhhhheeeEEEEeccccEEEecCCC
Confidence 6 4444433 222 567789999999999866555
No 399
>KOG4388 consensus Hormone-sensitive lipase HSL [Lipid transport and metabolism]
Probab=95.84 E-value=0.02 Score=60.08 Aligned_cols=68 Identities=15% Similarity=0.056 Sum_probs=53.9
Q ss_pred CCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCcc----ccHHHHHHHHHHHHHHhccCCC
Q 004574 666 KPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAAR----ENVMHVIWETDRWLQKYCLSNT 737 (744)
Q Consensus 666 ~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~----~~~~~~~~~~~~fl~~~l~~~~ 737 (744)
.|+.|+...-|+. .+...-+.++|+..|.++.+.++++-.|+|... .+..+..+..++=|...|+...
T Consensus 788 Pp~~i~ac~mDP~----LDD~vmfA~kLr~lG~~v~l~vle~lPHGFLnft~ls~E~~~~~~~CI~rl~~~L~~~~ 859 (880)
T KOG4388|consen 788 PPVHIVACAMDPM----LDDSVMFARKLRNLGQPVTLRVLEDLPHGFLNFTALSRETRQAAELCIERLRLVLTPPA 859 (880)
T ss_pred CCceEEEeccCcc----hhHHHHHHHHHHhcCCceeehhhhcCCccceeHHhhCHHHHHHHHHHHHHHHHHhCCCC
Confidence 5788999999987 678888999999999999999999999998742 3334556667777777776543
No 400
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=95.79 E-value=1.4 Score=45.77 Aligned_cols=135 Identities=10% Similarity=0.067 Sum_probs=76.8
Q ss_pred EEeccCCCCCCCCceEeeee-ccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCC
Q 004574 278 IYTQPAEPAEGEKPEILHKL-DLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSP 356 (744)
Q Consensus 278 l~~~~~~~~~~~~~~~l~~~-~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 356 (744)
|.+..+. .+.....++.. ...+.-+.+||--+.++..+.++ + .+-++|+++ .+++..+...-. .|..
T Consensus 145 iiih~~~--t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~-G--~VtlwDv~g--~sp~~~~~~~Hs----AP~~- 212 (673)
T KOG4378|consen 145 IIIHGTK--TKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDK-G--AVTLWDVQG--MSPIFHASEAHS----APCR- 212 (673)
T ss_pred EEEEecc--cCccccceecCCCCeEEEeecccccceeeEeeccC-C--eEEEEeccC--CCcccchhhhcc----CCcC-
Confidence 6666652 23334445544 33556788999999888776333 3 355667776 344443322111 1111
Q ss_pred ceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCcee-EEeeccchhhhhheeeeecCCcceecc
Q 004574 357 MMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKE-RIWESNREKYFETAVALVFGQGEEDIN 435 (744)
Q Consensus 357 ~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~s 435 (744)
.+++||-...|+... +....|++||....+.. +|+-.. +. ...+|+
T Consensus 213 gicfspsne~l~vsV--------------------G~Dkki~~yD~~s~~s~~~l~y~~------Pl-------stvaf~ 259 (673)
T KOG4378|consen 213 GICFSPSNEALLVSV--------------------GYDKKINIYDIRSQASTDRLTYSH------PL-------STVAFS 259 (673)
T ss_pred cceecCCccceEEEe--------------------cccceEEEeecccccccceeeecC------Cc-------ceeeec
Confidence 167899888887775 34456999998654432 232111 11 124888
Q ss_pred cCCCEEEEEEecCCCCceEEEEECCC
Q 004574 436 LNQLKILTSKESKTEITQYHILSWPL 461 (744)
Q Consensus 436 ~d~~~~~~~~~~~~~~~~i~~~~~~~ 461 (744)
++|-.|+.. +..+.|+.+|+.+
T Consensus 260 ~~G~~L~aG----~s~G~~i~YD~R~ 281 (673)
T KOG4378|consen 260 ECGTYLCAG----NSKGELIAYDMRS 281 (673)
T ss_pred CCceEEEee----cCCceEEEEeccc
Confidence 888655532 3356788888755
No 401
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=95.74 E-value=2.2 Score=41.64 Aligned_cols=73 Identities=12% Similarity=0.146 Sum_probs=50.1
Q ss_pred eEEEEEcCCC--CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCC
Q 004574 157 AQLVLGSLDG--TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDI 233 (744)
Q Consensus 157 ~~l~~~~~~G--~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~ 233 (744)
.++|.+..+| ..+...+. +.+...+||.||..++..... .++-+||+..++..++..+...
T Consensus 52 VR~wevq~~g~~~~ka~~~~~~PvL~v~WsddgskVf~g~~D-------------k~~k~wDL~S~Q~~~v~~Hd~p--- 115 (347)
T KOG0647|consen 52 VRIWEVQNSGQLVPKAQQSHDGPVLDVCWSDDGSKVFSGGCD-------------KQAKLWDLASGQVSQVAAHDAP--- 115 (347)
T ss_pred eEEEEEecCCcccchhhhccCCCeEEEEEccCCceEEeeccC-------------CceEEEEccCCCeeeeeecccc---
Confidence 4566665566 33333333 778899999999877666444 3788999999988887766544
Q ss_pred CcccCCccCCCCccceecCCCc
Q 004574 234 PVCYNSVREGMRSISWRADKPS 255 (744)
Q Consensus 234 ~~~~~~~~~~~~~~~~spDg~~ 255 (744)
++.+.|-+...+
T Consensus 116 ----------vkt~~wv~~~~~ 127 (347)
T KOG0647|consen 116 ----------VKTCHWVPGMNY 127 (347)
T ss_pred ----------eeEEEEecCCCc
Confidence 666778766543
No 402
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.70 E-value=4.1 Score=45.85 Aligned_cols=251 Identities=11% Similarity=0.114 Sum_probs=125.7
Q ss_pred EEEEEcC-CC-CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCC
Q 004574 158 QLVLGSL-DG-TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIP 234 (744)
Q Consensus 158 ~l~~~~~-~G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~ 234 (744)
.|.+++. ++ .+.-+|-+ ..+--..|.|....|+-.+.+ ..+.+||+.|=+.+.......++...
T Consensus 116 TIrIWNwqsr~~iavltGHnHYVMcAqFhptEDlIVSaSLD-------------QTVRVWDisGLRkk~~~pg~~e~~~~ 182 (1202)
T KOG0292|consen 116 TIRIWNWQSRKCIAVLTGHNHYVMCAQFHPTEDLIVSASLD-------------QTVRVWDISGLRKKNKAPGSLEDQMR 182 (1202)
T ss_pred eEEEEeccCCceEEEEecCceEEEeeccCCccceEEEeccc-------------ceEEEEeecchhccCCCCCCchhhhh
Confidence 4666676 55 44555555 555567799987766654333 37999999887766665442111110
Q ss_pred ------------c-----ccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceE--ee
Q 004574 235 ------------V-----CYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEI--LH 295 (744)
Q Consensus 235 ------------~-----~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~--l~ 295 (744)
. =++....|+.-++|.|--. | +++ +++.+ ...||.++. +.-... .-
T Consensus 183 ~~~~~~dLfg~~DaVVK~VLEGHDRGVNwaAfhpTlp--l-iVS---G~DDR-----qVKlWrmne----tKaWEvDtcr 247 (1202)
T KOG0292|consen 183 GQQGNSDLFGQTDAVVKHVLEGHDRGVNWAAFHPTLP--L-IVS---GADDR-----QVKLWRMNE----TKAWEVDTCR 247 (1202)
T ss_pred ccccchhhcCCcCeeeeeeecccccccceEEecCCcc--e-EEe---cCCcc-----eeeEEEecc----ccceeehhhh
Confidence 0 0122233344445555433 1 222 11211 124677664 111111 11
Q ss_pred eeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeee---ccccccccCCCCCCceeeCCCCCeEEEEee
Q 004574 296 KLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLF---DRVFENVYSDPGSPMMTRTSTGTNVIAKIK 372 (744)
Q Consensus 296 ~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~---~~~~~~~~~~~~~~~~~~spdg~~l~~~~~ 372 (744)
.+-..++++-|.|....|+..+ .+.. |.++|++. ....+.. ...+.-+..+|.-..++-..|+-.++|..+
T Consensus 248 gH~nnVssvlfhp~q~lIlSns--EDks--irVwDm~k--Rt~v~tfrrendRFW~laahP~lNLfAAgHDsGm~VFkle 321 (1202)
T KOG0292|consen 248 GHYNNVSSVLFHPHQDLILSNS--EDKS--IRVWDMTK--RTSVQTFRRENDRFWILAAHPELNLFAAGHDSGMIVFKLE 321 (1202)
T ss_pred cccCCcceEEecCccceeEecC--CCcc--EEEEeccc--ccceeeeeccCCeEEEEEecCCcceeeeecCCceEEEEEc
Confidence 1234567788888877777654 2233 55556554 2222222 222222233444444566778888888876
Q ss_pred ecC-----CcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEec
Q 004574 373 KEN-----DEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKES 447 (744)
Q Consensus 373 ~~~-----~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~ 447 (744)
+.. .++..+|+. ..+|+.+|+.+.+...+..-... ..+......++++|..+.++....-
T Consensus 322 RErpa~~v~~n~LfYvk---------d~~i~~~d~~t~~d~~v~~lr~~------g~~~~~~~smsYNpae~~vlics~~ 386 (1202)
T KOG0292|consen 322 RERPAYAVNGNGLFYVK---------DRFIRSYDLRTQKDTAVASLRRP------GTLWQPPRSLSYNPAENAVLICSNL 386 (1202)
T ss_pred ccCceEEEcCCEEEEEc---------cceEEeeeccccccceeEeccCC------CcccCCcceeeeccccCeEEEEecc
Confidence 421 122222222 45688999988666655543211 1111223335777777666655443
Q ss_pred CCCCceEEEE
Q 004574 448 KTEITQYHIL 457 (744)
Q Consensus 448 ~~~~~~i~~~ 457 (744)
.+....++.+
T Consensus 387 ~n~~y~L~~i 396 (1202)
T KOG0292|consen 387 DNGEYELVQI 396 (1202)
T ss_pred CCCeEEEEEe
Confidence 4444444444
No 403
>PRK13614 lipoprotein LpqB; Provisional
Probab=95.67 E-value=1.1 Score=49.44 Aligned_cols=55 Identities=11% Similarity=0.090 Sum_probs=37.8
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEE
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIF 103 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~ 103 (744)
....|+.|+||+.+|++. .....||+... ++..+.+..+.. ...+.|.++| ++..
T Consensus 344 ~~~s~avS~~g~~~A~~~---------~~~~~l~~~~~-g~~~~~~~~g~~------Lt~PS~d~~g-~vWt 398 (573)
T PRK13614 344 GPASPAESPVSQTVAFLN---------GSRTTLYTVSP-GQPARALTSGST------LTRPSFSPQD-WVWT 398 (573)
T ss_pred cccceeecCCCceEEEec---------CCCcEEEEecC-CCcceeeecCCC------ccCCcccCCC-CEEE
Confidence 456899999999999963 22357888775 456666544432 4567888888 5543
No 404
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=95.62 E-value=0.31 Score=53.25 Aligned_cols=165 Identities=8% Similarity=0.081 Sum_probs=99.6
Q ss_pred eEEEEEcCC--CCeeec---CCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCC
Q 004574 157 AQLVLGSLD--GTAKDF---GTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPA 230 (744)
Q Consensus 157 ~~l~~~~~~--G~~~~l---~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~ 230 (744)
+.|-++|++ +...++ .++ ..+..+.|++-.-.|+++...+ ..+-.||+..+.-+.......+
T Consensus 110 G~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~tep~iliSGSQD------------g~vK~~DlR~~~S~~t~~~nSE 177 (839)
T KOG0269|consen 110 GVISVWDLNKSIRNKLLTVFNEHERSANKLDFHSTEPNILISGSQD------------GTVKCWDLRSKKSKSTFRSNSE 177 (839)
T ss_pred CcEEEEecCccccchhhhHhhhhccceeeeeeccCCccEEEecCCC------------ceEEEEeeecccccccccccch
Confidence 567777773 333332 344 7788999999888888876553 3677788776655554433222
Q ss_pred CCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCC
Q 004574 231 EDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDS 310 (744)
Q Consensus 231 ~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg 310 (744)
.++++.|||--. -.|.+..++ +.|..+|++. ...-..+++.+.+.+..+.|+|++
T Consensus 178 ------------SiRDV~fsp~~~--~~F~s~~ds----------G~lqlWDlRq-p~r~~~k~~AH~GpV~c~nwhPnr 232 (839)
T KOG0269|consen 178 ------------SIRDVKFSPGYG--NKFASIHDS----------GYLQLWDLRQ-PDRCEKKLTAHNGPVLCLNWHPNR 232 (839)
T ss_pred ------------hhhceeeccCCC--ceEEEecCC----------ceEEEeeccC-chhHHHHhhcccCceEEEeecCCC
Confidence 288999999764 345543333 3466666631 112224467778888999999999
Q ss_pred ceEEEeeeeeccceeEEEEcCCCCCCcce-ee-eccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 311 LALVNETWYKTSQTRTWLVCPGSKDVAPR-VL-FDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 311 ~~l~~~~~~~~~~~~l~~~~~~~~~~~~~-~l-~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.+|+... + ...+.++++.++..... .+ |....+. +.|-|+-++.+...
T Consensus 233 ~~lATGG--R--DK~vkiWd~t~~~~~~~~tInTiapv~r---------VkWRP~~~~hLAtc 282 (839)
T KOG0269|consen 233 EWLATGG--R--DKMVKIWDMTDSRAKPKHTINTIAPVGR---------VKWRPARSYHLATC 282 (839)
T ss_pred ceeeecC--C--CccEEEEeccCCCccceeEEeecceeee---------eeeccCccchhhhh
Confidence 9999876 2 23466667766422211 12 1112221 56888887555444
No 405
>KOG4328 consensus WD40 protein [Function unknown]
Probab=95.58 E-value=3 Score=43.02 Aligned_cols=170 Identities=12% Similarity=0.074 Sum_probs=91.4
Q ss_pred eEEEEEcCCC------CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCC
Q 004574 157 AQLVLGSLDG------TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPP 229 (744)
Q Consensus 157 ~~l~~~~~~G------~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~ 229 (744)
+||-++|+.+ .....+.+ +.+..+.|+|-...-+|.+.- ...|...|+++.....+....-
T Consensus 210 G~VG~Wn~~~~~~d~d~v~~f~~hs~~Vs~l~F~P~n~s~i~ssSy------------DGtiR~~D~~~~i~e~v~s~~~ 277 (498)
T KOG4328|consen 210 GQVGLWNFGTQEKDKDGVYLFTPHSGPVSGLKFSPANTSQIYSSSY------------DGTIRLQDFEGNISEEVLSLDT 277 (498)
T ss_pred CcEEEEecCCCCCccCceEEeccCCccccceEecCCChhheeeecc------------CceeeeeeecchhhHHHhhcCc
Confidence 6777777732 12222334 678889999988654444322 2367777887776555543321
Q ss_pred CCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccC
Q 004574 230 AEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDD 309 (744)
Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpD 309 (744)
. ......+.++.+... ++|..... .+-++|++- ++.+...+.-....+.++++.|-
T Consensus 278 d----------~~~fs~~d~~~e~~~-vl~~~~~G------------~f~~iD~R~-~~s~~~~~~lh~kKI~sv~~NP~ 333 (498)
T KOG4328|consen 278 D----------NIWFSSLDFSAESRS-VLFGDNVG------------NFNVIDLRT-DGSEYENLRLHKKKITSVALNPV 333 (498)
T ss_pred c----------ceeeeeccccCCCcc-EEEeeccc------------ceEEEEeec-CCccchhhhhhhcccceeecCCC
Confidence 1 011345667777665 66664221 133333320 33444445555668899999999
Q ss_pred CceEEEeeeeeccceeEEEEcCCCC--CCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 310 SLALVNETWYKTSQTRTWLVCPGSK--DVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 310 g~~l~~~~~~~~~~~~l~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
-.+++.++..+ .. ..++|+-.- ..++...+-.+...+. ...+||+|-.|+...
T Consensus 334 ~p~~laT~s~D-~T--~kIWD~R~l~~K~sp~lst~~HrrsV~------sAyFSPs~gtl~TT~ 388 (498)
T KOG4328|consen 334 CPWFLATASLD-QT--AKIWDLRQLRGKASPFLSTLPHRRSVN------SAYFSPSGGTLLTTC 388 (498)
T ss_pred CchheeecccC-cc--eeeeehhhhcCCCCcceecccccceee------eeEEcCCCCceEeec
Confidence 98877665222 33 334454332 1222222222222111 145899998887775
No 406
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=95.53 E-value=2.7 Score=41.17 Aligned_cols=120 Identities=18% Similarity=0.248 Sum_probs=77.1
Q ss_pred CceeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCc
Q 004574 20 PEKEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNST 99 (744)
Q Consensus 20 ~~~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~ 99 (744)
...+|...|+. .++...|||.+..|+..++ ++ .|.+++....+.++....... +...+|.++.+
T Consensus 4 ~~~~l~npP~d-~IS~v~f~~~~~~LLvssW--------Dg--slrlYdv~~~~l~~~~~~~~p-----lL~c~F~d~~~ 67 (323)
T KOG1036|consen 4 NEFELENPPED-GISSVKFSPSSSDLLVSSW--------DG--SLRLYDVPANSLKLKFKHGAP-----LLDCAFADEST 67 (323)
T ss_pred cccccCCCChh-ceeeEEEcCcCCcEEEEec--------cC--cEEEEeccchhhhhheecCCc-----eeeeeccCCce
Confidence 34566654544 5899999999999999875 33 455567777777666544432 33446776554
Q ss_pred EEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-cee
Q 004574 100 LLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVY 177 (744)
Q Consensus 100 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~ 177 (744)
+++... .++|-++|+ +|+..++..+ ..+
T Consensus 68 -~~~G~~-------------------------------------------------dg~vr~~Dln~~~~~~igth~~~i 97 (323)
T KOG1036|consen 68 -IVTGGL-------------------------------------------------DGQVRRYDLNTGNEDQIGTHDEGI 97 (323)
T ss_pred -EEEecc-------------------------------------------------CceEEEEEecCCcceeeccCCCce
Confidence 443311 168999999 6688888777 777
Q ss_pred eeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCC
Q 004574 178 TAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDG 218 (744)
Q Consensus 178 ~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g 218 (744)
..+.+++--..|+-. .+...|-+||...
T Consensus 98 ~ci~~~~~~~~vIsg-------------sWD~~ik~wD~R~ 125 (323)
T KOG1036|consen 98 RCIEYSYEVGCVISG-------------SWDKTIKFWDPRN 125 (323)
T ss_pred EEEEeeccCCeEEEc-------------ccCccEEEEeccc
Confidence 777777644444333 2334788888765
No 407
>KOG4389 consensus Acetylcholinesterase/Butyrylcholinesterase [Signal transduction mechanisms]
Probab=95.50 E-value=0.031 Score=57.82 Aligned_cols=119 Identities=27% Similarity=0.362 Sum_probs=70.2
Q ss_pred EEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEE-ecCC-CCCCC----
Q 004574 495 LTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVL-AGPS-IPIIG---- 568 (744)
Q Consensus 495 l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~-~~~~-~~~~g---- 568 (744)
+..-+|.|. .++.. .-|+|+++||||.++...--+ |.+ ..|+..+-+++ ..++ .+..|
T Consensus 121 LYlNVW~P~-~~p~n---~tVlVWiyGGGF~sGt~SLdv------YdG------k~la~~envIvVs~NYRvG~FGFL~l 184 (601)
T KOG4389|consen 121 LYLNVWAPA-ADPYN---LTVLVWIYGGGFYSGTPSLDV------YDG------KFLAAVENVIVVSMNYRVGAFGFLYL 184 (601)
T ss_pred eEEEEeccC-CCCCC---ceEEEEEEcCccccCCcceee------ecc------ceeeeeccEEEEEeeeeeccceEEec
Confidence 444566674 23332 459999999998765544322 222 23454443333 2222 22222
Q ss_pred ------CCCCChHHHHHHHHHHHHHc---CCCCCCcEEEEEechHHHHHH-HHHH-hCCCceeEEEEccCCCC
Q 004574 569 ------EGDKLPNDSAEAAVEEVVRR---GVADPSRIAVGGHSYGAFMTA-HLLA-HAPHLFCCGIARSGSYN 630 (744)
Q Consensus 569 ------~g~~~~~~d~~~~~~~l~~~---~~~d~~~i~l~G~S~GG~~a~-~~~~-~~p~~~~~~v~~~~~~~ 630 (744)
.|..-+. |-+-|+.|++++ -.-|+++|.|+|.|+|+.-+. ++.+ .....|+-+|+.+|..+
T Consensus 185 ~~~~eaPGNmGl~-DQqLAl~WV~~Ni~aFGGnp~~vTLFGESAGaASv~aHLlsP~S~glF~raIlQSGS~~ 256 (601)
T KOG4389|consen 185 PGHPEAPGNMGLL-DQQLALQWVQENIAAFGGNPSRVTLFGESAGAASVVAHLLSPGSRGLFHRAILQSGSLN 256 (601)
T ss_pred CCCCCCCCccchH-HHHHHHHHHHHhHHHhCCCcceEEEeccccchhhhhheecCCCchhhHHHHHhhcCCCC
Confidence 1222234 778899999986 236899999999999987553 2221 11236888888888654
No 408
>KOG2100 consensus Dipeptidyl aminopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=95.44 E-value=2.2 Score=49.31 Aligned_cols=78 Identities=18% Similarity=0.216 Sum_probs=49.0
Q ss_pred cceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeecccee-ceeeccCCceEEEeeeee-ccce
Q 004574 247 ISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFR-SVSWCDDSLALVNETWYK-TSQT 324 (744)
Q Consensus 247 ~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~-~~~~SpDg~~l~~~~~~~-~~~~ 324 (744)
..++.|+...+++.+..+++. .++..+... .++.+..++.+...+. -+.|+.|.+.++|.+... .+..
T Consensus 345 ~~~~~d~~~~~~~~~~~~~~~--------~hi~~~~~~--~~~~~~~lt~g~w~v~~i~~~~~~~~~i~f~~~~~~~~~~ 414 (755)
T KOG2100|consen 345 PVFSSDGSSYLKVDSVSDGGY--------NHIAYLKLS--NGSEPRMLTSGNWEVTSILGYDKDSNRIYFDAYEEDPSER 414 (755)
T ss_pred ceEeecCCceeEEEeeccCCE--------EEEEEEEcC--CCCccccccccceEEEEeccccCCCceEEEEecCCCCCce
Confidence 467888866455554443331 134444431 2336666777766553 455778899999987443 5778
Q ss_pred eEEEEcCCCC
Q 004574 325 RTWLVCPGSK 334 (744)
Q Consensus 325 ~l~~~~~~~~ 334 (744)
+||.+++...
T Consensus 415 ~ly~i~~~~~ 424 (755)
T KOG2100|consen 415 HLYSISLGSG 424 (755)
T ss_pred EEEEEEcccc
Confidence 9999998874
No 409
>KOG1551 consensus Uncharacterized conserved protein [Function unknown]
Probab=95.40 E-value=0.18 Score=47.89 Aligned_cols=68 Identities=15% Similarity=0.095 Sum_probs=43.6
Q ss_pred ccccCCCCCC-----EEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHh
Q 004574 658 ITHANKIKKP-----ILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 658 ~~~~~~~~~P-----~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
..++....+| +.++..++|..+| ......+.+... .++...++ +||.-........+.++|.+-|++.
T Consensus 294 ~T~v~~fp~Pvdpsl~ivv~A~~D~Yip--r~gv~~lQ~~WP----g~eVr~~e-gGHVsayl~k~dlfRR~I~d~L~R~ 366 (371)
T KOG1551|consen 294 CTHVANFPVPVDPSLIIVVQAKEDAYIP--RTGVRSLQEIWP----GCEVRYLE-GGHVSAYLFKQDLFRRAIVDGLDRL 366 (371)
T ss_pred hchhhcCCCCCCCCeEEEEEecCCcccc--ccCcHHHHHhCC----CCEEEEee-cCceeeeehhchHHHHHHHHHHHhh
Confidence 4445555444 6778899999998 655665555443 34555555 5797554445566777788887764
No 410
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=95.36 E-value=1.9 Score=41.78 Aligned_cols=119 Identities=13% Similarity=0.064 Sum_probs=70.6
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
..+....|-|=..-++.++ ++...+-+||..+-+..-....+.. +...+|||=..
T Consensus 102 y~iss~~WyP~DtGmFtss------------SFDhtlKVWDtnTlQ~a~~F~me~~-------------VYshamSp~a~ 156 (397)
T KOG4283|consen 102 YAISSAIWYPIDTGMFTSS------------SFDHTLKVWDTNTLQEAVDFKMEGK-------------VYSHAMSPMAM 156 (397)
T ss_pred eeeeeeEEeeecCceeecc------------cccceEEEeecccceeeEEeecCce-------------eehhhcChhhh
Confidence 3455666777655554443 2334788888877654443333322 44567888765
Q ss_pred ceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceE-eeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCC
Q 004574 255 STLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEI-LHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPG 332 (744)
Q Consensus 255 ~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~ 332 (744)
. -..++.. . ...++-++|+ .+|.... |......+-.+.|||-..+++++. ..++.-++|-+--.
T Consensus 157 s-HcLiA~g-t--------r~~~VrLCDi---~SGs~sH~LsGHr~~vlaV~Wsp~~e~vLatg-saDg~irlWDiRra 221 (397)
T KOG4283|consen 157 S-HCLIAAG-T--------RDVQVRLCDI---ASGSFSHTLSGHRDGVLAVEWSPSSEWVLATG-SADGAIRLWDIRRA 221 (397)
T ss_pred c-ceEEEEe-c--------CCCcEEEEec---cCCcceeeeccccCceEEEEeccCceeEEEec-CCCceEEEEEeecc
Confidence 4 2222211 1 1124788888 6676655 444577889999999999999887 33344555554433
No 411
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=95.35 E-value=0.41 Score=51.81 Aligned_cols=79 Identities=14% Similarity=-0.092 Sum_probs=50.0
Q ss_pred EEeccCCCCCCCCceEeeeec----cceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCC
Q 004574 278 IYTQPAEPAEGEKPEILHKLD----LRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDP 353 (744)
Q Consensus 278 l~~~~~~~~~~~~~~~l~~~~----~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~ 353 (744)
|-+++. .+++.++++.+. +..-.+...|.|.+|+.+..+ ..|.++|.-++ +-....-++.+.+-+
T Consensus 620 irif~i---~sgKq~k~FKgs~~~eG~lIKv~lDPSgiY~atScsd----ktl~~~Df~sg--EcvA~m~GHsE~VTG-- 688 (1080)
T KOG1408|consen 620 IRIFDI---ESGKQVKSFKGSRDHEGDLIKVILDPSGIYLATSCSD----KTLCFVDFVSG--ECVAQMTGHSEAVTG-- 688 (1080)
T ss_pred eEEEec---cccceeeeecccccCCCceEEEEECCCccEEEEeecC----CceEEEEeccc--hhhhhhcCcchheee--
Confidence 666666 667777777653 445566778999999998733 35888887774 222211222221111
Q ss_pred CCCceeeCCCCCeEEEEe
Q 004574 354 GSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 354 ~~~~~~~spdg~~l~~~~ 371 (744)
+.|++|-+.|+..+
T Consensus 689 ----~kF~nDCkHlISvs 702 (1080)
T KOG1408|consen 689 ----VKFLNDCKHLISVS 702 (1080)
T ss_pred ----eeecccchhheeec
Confidence 45899999998876
No 412
>PRK13614 lipoprotein LpqB; Provisional
Probab=95.28 E-value=0.84 Score=50.28 Aligned_cols=143 Identities=15% Similarity=0.133 Sum_probs=80.7
Q ss_pred EEEEEcCCCCeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCee-e-----eccCCCCCC
Q 004574 158 QLVLGSLDGTAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLV-R-----ELCDLPPAE 231 (744)
Q Consensus 158 ~l~~~~~~G~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~-~-----~l~~~~~~~ 231 (744)
.++.....+..+.+........|+|.++| .|+-..... +..+.++..+|... . .+. .++-.
T Consensus 366 ~l~~~~~g~~~~~~~~g~~Lt~PS~d~~g-~vWtv~~g~-----------~~~vv~~~~~g~~~~~~~~~~~v~-~~~l~ 432 (573)
T PRK13614 366 TLYTVSPGQPARALTSGSTLTRPSFSPQD-WVWTAGPGG-----------NGRIVAYRPTGVAEGAQAPTVTLT-ADWLA 432 (573)
T ss_pred EEEEecCCCcceeeecCCCccCCcccCCC-CEEEeeCCC-----------CceEEEEecCCCcccccccceeec-ccccC
Confidence 56766664456666556667899999998 777553322 12455555443211 0 111 11111
Q ss_pred CCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee-----ccceeceee
Q 004574 232 DIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL-----DLRFRSVSW 306 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~-----~~~~~~~~~ 306 (744)
+ ..+..+..|+||-+ ++.+...++.. +|++.-+.--..|.++.|+.. .....+++|
T Consensus 433 g---------~~I~~lrvSrDG~R-~Avi~~~~g~~---------~V~va~V~R~~~G~P~~L~~~~~~~~~~~~~sl~W 493 (573)
T PRK13614 433 G---------RTVKELRVSREGVR-ALVISEQNGKS---------RVQVAGIVRNEDGTPRELTAPITLAADSDADTGAW 493 (573)
T ss_pred C---------CeeEEEEECCCccE-EEEEEEeCCcc---------EEEEEEEEeCCCCCeEEccCceecccCCCcceeEE
Confidence 0 11668899999999 77665433322 244432211134566666533 246778999
Q ss_pred ccCCceEEEeeeeeccceeEEEEcCCC
Q 004574 307 CDDSLALVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 307 SpDg~~l~~~~~~~~~~~~l~~~~~~~ 333 (744)
..++..++... ..+.....+++.+++
T Consensus 494 ~~~~sl~V~~~-~~~~~~~~~~v~v~~ 519 (573)
T PRK13614 494 VGDSTVVVTKA-SATSNVVPELLSVDA 519 (573)
T ss_pred cCCCEEEEEec-cCCCcceEEEEEeCC
Confidence 99998666654 333455677777754
No 413
>COG3946 VirJ Type IV secretory pathway, VirJ component [Intracellular trafficking and secretion]
Probab=95.26 E-value=0.061 Score=54.47 Aligned_cols=167 Identities=14% Similarity=0.089 Sum_probs=88.2
Q ss_pred CchhHHHHHhCCeEEEecCCCCCC--CCCCCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeE
Q 004574 544 TPTSSLIFLARRFAVLAGPSIPII--GEGDKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCC 621 (744)
Q Consensus 544 ~~~~~~~~~~~G~~v~~~~~~~~~--g~g~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~ 621 (744)
-......|.++|+-|+..+...+. ....++.-.|+.+.+++..++.. ..++.|+|+|+|+=+-..+-.+-|...+.
T Consensus 276 Dk~v~~~l~~~gvpVvGvdsLRYfW~~rtPe~~a~Dl~r~i~~y~~~w~--~~~~~liGySfGADvlP~~~n~L~~~~r~ 353 (456)
T COG3946 276 DKEVAEALQKQGVPVVGVDSLRYFWSERTPEQIAADLSRLIRFYARRWG--AKRVLLIGYSFGADVLPFAYNRLPPATRQ 353 (456)
T ss_pred hHHHHHHHHHCCCceeeeehhhhhhccCCHHHHHHHHHHHHHHHHHhhC--cceEEEEeecccchhhHHHHHhCCHHHHH
Confidence 345677889999999984443221 11112334489999999888754 36899999999998877777666553333
Q ss_pred EEEccCCCCCC-CCCCcccccccchhhcHHHHHhcCcccccCCCC-CCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCc
Q 004574 622 GIARSGSYNKT-LTPFGFQTEFRTLWEATNVYIEMSPITHANKIK-KPILIIHGEVDDKVGLFPMQAERFFDALKGHGAL 699 (744)
Q Consensus 622 ~v~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~ 699 (744)
.|-+..+.... ...+.+..+ -|-....-.....+..+.++. .-+++|+|.+|+-..|+ .++..+
T Consensus 354 ~v~~~~ll~l~~~~~fe~~v~---gWlg~~~~g~~~~~~~~~~l~~~~v~CiYG~~e~d~~Cp---------~l~~~~-- 419 (456)
T COG3946 354 RVRMVSLLGLGRTADFEISVE---GWLGMAGEGAGDVVPDIAKLPLARVQCIYGQEEKDTACP---------SLKAKG-- 419 (456)
T ss_pred HHHHHHHHhccccceEEEEEe---eeeccCCcCCCCcchhhhhCCcceeEEEecCccccccCC---------cchhhc--
Confidence 22222211100 001111100 010000000112333344454 45899999765443222 233333
Q ss_pred EEEEEeCCCCcccCccccHHHHHHHHHHHH
Q 004574 700 SRLVLLPFEHHVYAARENVMHVIWETDRWL 729 (744)
Q Consensus 700 ~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl 729 (744)
++.+-+||+ |.|. +..+...+.|++=+
T Consensus 420 ~~~v~lpGg-HHFd--~dy~~la~~il~~~ 446 (456)
T COG3946 420 VDTVKLPGG-HHFD--GDYEKLAKAILQGM 446 (456)
T ss_pred ceeEecCCC-cccC--ccHHHHHHHHHHHH
Confidence 477788986 5553 33445555555544
No 414
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=95.18 E-value=0.67 Score=48.87 Aligned_cols=69 Identities=10% Similarity=0.039 Sum_probs=42.4
Q ss_pred eeeeccCCCCceEEEEEeeCCccccc-------ccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccce
Q 004574 177 YTAVEPSPDQKYVLITSMHRPYSYKV-------PCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISW 249 (744)
Q Consensus 177 ~~~~~~SpDG~~i~~~~~~~~~~~~~-------~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (744)
...++|.|||+ |++........... ........++++++++++.+.+...-.+ +.+++|
T Consensus 126 ~~~l~~gpDG~-LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~a~G~rn-------------p~Gl~~ 191 (367)
T TIGR02604 126 LNSLAWGPDGW-LYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLRVVAHGFQN-------------PYGHSV 191 (367)
T ss_pred ccCceECCCCC-EEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEEEEecCcCC-------------CccceE
Confidence 55789999995 66654422110000 0111235799999999887766544222 567899
Q ss_pred ecCCCceEEEEE
Q 004574 250 RADKPSTLYWVE 261 (744)
Q Consensus 250 spDg~~~l~~~~ 261 (744)
+|+|+ ++++.
T Consensus 192 d~~G~--l~~td 201 (367)
T TIGR02604 192 DSWGD--VFFCD 201 (367)
T ss_pred CCCCC--EEEEc
Confidence 99987 65553
No 415
>PRK13613 lipoprotein LpqB; Provisional
Probab=95.17 E-value=2.7 Score=46.97 Aligned_cols=103 Identities=21% Similarity=0.149 Sum_probs=62.7
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCce-----eccccCCCccccccccceEEecCCcEEEEEec
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEA-----KPLFESPDICLNAVFGSFVWVNNSTLLIFTIP 106 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~-----~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~ 106 (744)
....++.|+||+.+|++. .....||+-++.++.. +.+.... ....+.|.++| .|... .
T Consensus 364 ~~~s~avS~~g~~~A~v~---------~~~~~l~vg~~~~~~~~~~~~~~~~~~~------~Lt~PS~d~~g-~vWtv-d 426 (599)
T PRK13613 364 PLRRVAVSRDESRAAGIS---------ADGDSVYVGSLTPGASIGVHSWGVTADG------RLTSPSWDGRG-DLWVV-D 426 (599)
T ss_pred CccceEEcCCCceEEEEc---------CCCcEEEEeccCCCCccccccceeeccC------cccCCcCcCCC-CEEEe-c
Confidence 467899999999999974 2345788877654443 2332222 24577999988 45433 2
Q ss_pred CCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCCC----ceeeeecc
Q 004574 107 SSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGTP----AVYTAVEP 182 (744)
Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~~----~~~~~~~~ 182 (744)
...+.. .-+.++.-+|+...+... ..+..+..
T Consensus 427 ~~~~~~--------------------------------------------~vl~v~~~~G~~~~V~~~~l~g~~I~~lrv 462 (599)
T PRK13613 427 RDPADP--------------------------------------------RLLWLLQGDGEPVEVRTPELDGHRVVAVRV 462 (599)
T ss_pred CCCCCc--------------------------------------------eEEEEEcCCCcEEEeeccccCCCEeEEEEE
Confidence 111100 113333335543333222 36889999
Q ss_pred CCCCceEEEEEee
Q 004574 183 SPDQKYVLITSMH 195 (744)
Q Consensus 183 SpDG~~i~~~~~~ 195 (744)
|+||-++++....
T Consensus 463 SrDG~RvAvv~~~ 475 (599)
T PRK13613 463 ARDGVRVALIVEK 475 (599)
T ss_pred CCCccEEEEEEec
Confidence 9999999999764
No 416
>KOG3967 consensus Uncharacterized conserved protein [Function unknown]
Probab=95.13 E-value=0.19 Score=46.04 Aligned_cols=104 Identities=16% Similarity=0.067 Sum_probs=54.1
Q ss_pred ceEEEEECCCCCcccccCC-cccCCCCccCCCCchhHHHHHhCCeEEEecCCCCCC----CC-----CCCChHHHHHHHH
Q 004574 513 LPCLFWAYPEDYKSKDAAG-QVRGSPNEFSGMTPTSSLIFLARRFAVLAGPSIPII----GE-----GDKLPNDSAEAAV 582 (744)
Q Consensus 513 ~p~vv~~HG~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~G~~v~~~~~~~~~----g~-----g~~~~~~d~~~~~ 582 (744)
..++|++||.|..-.++-. ++.-+.+.-.+....+.......||.|+.-+...-+ ++ +.....+.+.-+-
T Consensus 101 ~kLlVLIHGSGvVrAGQWARrLIIN~~Ld~GTQiPyi~rAv~~Gygviv~N~N~~~kfye~k~np~kyirt~veh~~yvw 180 (297)
T KOG3967|consen 101 QKLLVLIHGSGVVRAGQWARRLIINEDLDSGTQIPYIKRAVAEGYGVIVLNPNRERKFYEKKRNPQKYIRTPVEHAKYVW 180 (297)
T ss_pred cceEEEEecCceEecchHhhhhhhccccccCCcChHHHHHHHcCCcEEEeCCchhhhhhhcccCcchhccchHHHHHHHH
Confidence 4589999998743222111 111111111222233455556789988761111100 00 1111122233333
Q ss_pred HHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCc
Q 004574 583 EEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHL 618 (744)
Q Consensus 583 ~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~ 618 (744)
..+.. ...++.|+++.||+||.+++-++.+.|+.
T Consensus 181 ~~~v~--pa~~~sv~vvahsyGG~~t~~l~~~f~~d 214 (297)
T KOG3967|consen 181 KNIVL--PAKAESVFVVAHSYGGSLTLDLVERFPDD 214 (297)
T ss_pred HHHhc--ccCcceEEEEEeccCChhHHHHHHhcCCc
Confidence 33332 23457899999999999999999998753
No 417
>PRK13615 lipoprotein LpqB; Provisional
Probab=94.94 E-value=1.6 Score=48.03 Aligned_cols=145 Identities=16% Similarity=0.102 Sum_probs=81.2
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEecCCCCCC
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDP 112 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~ 112 (744)
...++.|+||+.+|++.. ...||+-+.. +..+.+.... ....+.|.++| .+...... ..
T Consensus 336 ~~s~avS~dg~~~A~v~~----------~~~l~vg~~~-~~~~~~~~~~------~Lt~PS~d~~g-~vWtv~~g-~~-- 394 (557)
T PRK13615 336 ADAATLSADGRQAAVRNA----------SGVWSVGDGD-RDAVLLDTRP------GLVAPSLDAQG-YVWSTPAS-DP-- 394 (557)
T ss_pred cccceEcCCCceEEEEcC----------CceEEEecCC-CcceeeccCC------ccccCcCcCCC-CEEEEeCC-Cc--
Confidence 368999999999999731 2356665533 4566654333 24577999998 55543211 00
Q ss_pred CCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcCCCCeeecCC----CceeeeeccCCCCce
Q 004574 113 PKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSLDGTAKDFGT----PAVYTAVEPSPDQKY 188 (744)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~G~~~~l~~----~~~~~~~~~SpDG~~ 188 (744)
..+.....+|+...+.- .+.+..+..|+||-+
T Consensus 395 --------------------------------------------~~l~~~~~~G~~~~v~v~~~~~~~I~~lrvSrDG~R 430 (557)
T PRK13615 395 --------------------------------------------RGLVAWGPDGVGHPVAVSWTATGRVVSLEVARDGAR 430 (557)
T ss_pred --------------------------------------------eEEEEecCCCceEEeeccccCCCeeEEEEeCCCccE
Confidence 12222233453333321 257889999999999
Q ss_pred EEEEEeeCCcccccccCCCcceEEE--EeCCCCeeeec-cCCCCCCCCCcccCCccCCCCccceecCCCceEEEEE
Q 004574 189 VLITSMHRPYSYKVPCARFSQKVQV--WTTDGKLVREL-CDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVE 261 (744)
Q Consensus 189 i~~~~~~~~~~~~~~~~~~~~~l~~--~~~~g~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~ 261 (744)
+++...... ..+|++ +..+++..+.| +.. .. +......+.++.|..+++ |+.+.
T Consensus 431 ~Avi~~~~g----------~~~V~va~V~R~~~~P~~L~~~p-~~------l~~~l~~v~sl~W~~~~~--laVl~ 487 (557)
T PRK13615 431 VLVQLETGA----------GPQLLVASIVRDGGVPTSLTTTP-LE------LLASPGTPLDATWVDELD--VATLT 487 (557)
T ss_pred EEEEEecCC----------CCEEEEEEEEeCCCcceEeeecc-EE------cccCcCcceeeEEcCCCE--EEEEe
Confidence 999876432 124554 22244434444 221 00 000001256789998887 65554
No 418
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=94.92 E-value=0.55 Score=50.88 Aligned_cols=58 Identities=17% Similarity=0.293 Sum_probs=33.6
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEe
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~ 105 (744)
+.+.+|.-||++|+.+-..+......-+++.||=.+++|.. .....||+|.+.++|..
T Consensus 118 V~SmsWn~dG~kIcIvYeDGavIVGsvdGNRIwgKeLkg~~---------------l~hv~ws~D~~~~Lf~~ 175 (1189)
T KOG2041|consen 118 VVSMSWNLDGTKICIVYEDGAVIVGSVDGNRIWGKELKGQL---------------LAHVLWSEDLEQALFKK 175 (1189)
T ss_pred EEEEEEcCCCcEEEEEEccCCEEEEeeccceecchhcchhe---------------ccceeecccHHHHHhhh
Confidence 55789999999998874311111111223344433332211 23558999999998864
No 419
>KOG2521 consensus Uncharacterized conserved protein [Function unknown]
Probab=94.84 E-value=1 Score=45.94 Aligned_cols=68 Identities=16% Similarity=0.131 Sum_probs=60.7
Q ss_pred CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCcccCccccHHHHHHHHHHHHHHhcc
Q 004574 665 KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHHVYAARENVMHVIWETDRWLQKYCL 734 (744)
Q Consensus 665 ~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H~~~~~~~~~~~~~~~~~fl~~~l~ 734 (744)
..+.+.+.+..|.++| .++.+++.+..+..|..++..-+.++-|..+....+..+...+.+|+.....
T Consensus 225 ~~~~ly~~s~~d~v~~--~~~ie~f~~~~~~~g~~v~s~~~~ds~H~~h~r~~p~~y~~~~~~Fl~~~~~ 292 (350)
T KOG2521|consen 225 PWNQLYLYSDNDDVLP--ADEIEKFIALRREKGVNVKSVKFKDSEHVAHFRSFPKTYLKKCSEFLRSVIS 292 (350)
T ss_pred cccceeecCCcccccc--HHHHHHHHHHHHhcCceEEEeeccCccceeeeccCcHHHHHHHHHHHHhccc
Confidence 5678888899999998 9999999899999999999999999999988788889999999999988654
No 420
>PRK13615 lipoprotein LpqB; Provisional
Probab=94.73 E-value=4.9 Score=44.30 Aligned_cols=142 Identities=15% Similarity=0.113 Sum_probs=80.0
Q ss_pred EEEEEcCCCCeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCccc
Q 004574 158 QLVLGSLDGTAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCY 237 (744)
Q Consensus 158 ~l~~~~~~G~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~ 237 (744)
.+++-...|..+.+........|+|.++| .|+-..... ...+.....+|.. ..+ ..++...
T Consensus 356 ~l~vg~~~~~~~~~~~~~~Lt~PS~d~~g-~vWtv~~g~-----------~~~l~~~~~~G~~-~~v-~v~~~~~----- 416 (557)
T PRK13615 356 VWSVGDGDRDAVLLDTRPGLVAPSLDAQG-YVWSTPASD-----------PRGLVAWGPDGVG-HPV-AVSWTAT----- 416 (557)
T ss_pred eEEEecCCCcceeeccCCccccCcCcCCC-CEEEEeCCC-----------ceEEEEecCCCce-EEe-eccccCC-----
Confidence 45555545556666555667899999999 777554332 1233334433432 222 1222211
Q ss_pred CCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEe-ee------eccceeceeeccCC
Q 004574 238 NSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEIL-HK------LDLRFRSVSWCDDS 310 (744)
Q Consensus 238 ~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l-~~------~~~~~~~~~~SpDg 310 (744)
..+..+..|+||.+ ++.+...++.. +|++.-+.- .++.++.| +. ......+++|..++
T Consensus 417 ----~~I~~lrvSrDG~R-~Avi~~~~g~~---------~V~va~V~R-~~~~P~~L~~~p~~l~~~l~~v~sl~W~~~~ 481 (557)
T PRK13615 417 ----GRVVSLEVARDGAR-VLVQLETGAGP---------QLLVASIVR-DGGVPTSLTTTPLELLASPGTPLDATWVDEL 481 (557)
T ss_pred ----CeeEEEEeCCCccE-EEEEEecCCCC---------EEEEEEEEe-CCCcceEeeeccEEcccCcCcceeeEEcCCC
Confidence 12678999999999 77765333322 255433321 23444455 32 23467789999999
Q ss_pred ceEEEeeeeeccceeEEEEcCCCC
Q 004574 311 LALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 311 ~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
+..+... .......++++.+.+.
T Consensus 482 ~laVl~~-~~~~~~~v~~v~v~g~ 504 (557)
T PRK13615 482 DVATLTL-APDGERQVELHQVGGP 504 (557)
T ss_pred EEEEEec-cCCCCceEEEEECCCc
Confidence 8655543 2234456888888864
No 421
>PF11187 DUF2974: Protein of unknown function (DUF2974); InterPro: IPR024499 This family of proteins has no known function.
Probab=94.62 E-value=0.052 Score=52.21 Aligned_cols=50 Identities=20% Similarity=0.213 Sum_probs=35.6
Q ss_pred HHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC----CCceeEEEEccCC
Q 004574 579 EAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA----PHLFCCGIARSGS 628 (744)
Q Consensus 579 ~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~----p~~~~~~v~~~~~ 628 (744)
..|++|+.+...-.+.+|.+.|||.||++|..++..- .+++..++...++
T Consensus 69 ~~A~~yl~~~~~~~~~~i~v~GHSkGGnLA~yaa~~~~~~~~~rI~~vy~fDgP 122 (224)
T PF11187_consen 69 KSALAYLKKIAKKYPGKIYVTGHSKGGNLAQYAAANCDDEIQDRISKVYSFDGP 122 (224)
T ss_pred HHHHHHHHHHHHhCCCCEEEEEechhhHHHHHHHHHccHHHhhheeEEEEeeCC
Confidence 5566666653211234699999999999999998873 2477888877765
No 422
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=94.62 E-value=4.9 Score=39.33 Aligned_cols=123 Identities=12% Similarity=0.132 Sum_probs=67.2
Q ss_pred eecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCcc
Q 004574 169 KDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSI 247 (744)
Q Consensus 169 ~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 247 (744)
+.|..- .++++++|.||.++|+.+.++. ..|+.++.+|+-++++.-....+ ..++
T Consensus 15 ~~l~g~~~e~SGLTy~pd~~tLfaV~d~~------------~~i~els~~G~vlr~i~l~g~~D------------~EgI 70 (248)
T PF06977_consen 15 KPLPGILDELSGLTYNPDTGTLFAVQDEP------------GEIYELSLDGKVLRRIPLDGFGD------------YEGI 70 (248)
T ss_dssp EE-TT--S-EEEEEEETTTTEEEEEETTT------------TEEEEEETT--EEEEEE-SS-SS------------EEEE
T ss_pred eECCCccCCccccEEcCCCCeEEEEECCC------------CEEEEEcCCCCEEEEEeCCCCCC------------ceeE
Confidence 344333 5689999999999998887764 47899999888777664322111 3367
Q ss_pred ceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCC--c---eEee-----eeccceeceeeccCCceEEEee
Q 004574 248 SWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEK--P---EILH-----KLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 248 ~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~---~~l~-----~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
++.-+|. ++..+.+ +..|++++... .+.. . +.+. ..+...-.++|.|.++.|+...
T Consensus 71 ~y~g~~~--~vl~~Er-----------~~~L~~~~~~~-~~~~~~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~k 136 (248)
T PF06977_consen 71 TYLGNGR--YVLSEER-----------DQRLYIFTIDD-DTTSLDRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAK 136 (248)
T ss_dssp EE-STTE--EEEEETT-----------TTEEEEEEE-----TT--EEEEEEEE---S---SS--EEEEEETTTTEEEEEE
T ss_pred EEECCCE--EEEEEcC-----------CCcEEEEEEec-cccccchhhceEEecccccCCCcceEEEEEcCCCCEEEEEe
Confidence 7876664 3444311 22466666521 1111 1 1111 1234467899999988887764
Q ss_pred eeeccceeEEEEcC
Q 004574 318 WYKTSQTRTWLVCP 331 (744)
Q Consensus 318 ~~~~~~~~l~~~~~ 331 (744)
.. ....||.++.
T Consensus 137 E~--~P~~l~~~~~ 148 (248)
T PF06977_consen 137 ER--KPKRLYEVNG 148 (248)
T ss_dssp ES--SSEEEEEEES
T ss_pred CC--CChhhEEEcc
Confidence 22 3346888775
No 423
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=94.61 E-value=5.1 Score=39.47 Aligned_cols=73 Identities=14% Similarity=0.202 Sum_probs=43.8
Q ss_pred eEEEEEcC-CC-CeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCC
Q 004574 157 AQLVLGSL-DG-TAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDI 233 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~ 233 (744)
++|.++++ +- ....+..+ +.+..++..|.|| |+.....+ ..+..||+-.++.--+......
T Consensus 107 G~i~iw~~~~W~~~~slK~H~~~Vt~lsiHPS~K-LALsVg~D------------~~lr~WNLV~Gr~a~v~~L~~~--- 170 (362)
T KOG0294|consen 107 GHIIIWRVGSWELLKSLKAHKGQVTDLSIHPSGK-LALSVGGD------------QVLRTWNLVRGRVAFVLNLKNK--- 170 (362)
T ss_pred CcEEEEEcCCeEEeeeecccccccceeEecCCCc-eEEEEcCC------------ceeeeehhhcCccceeeccCCc---
Confidence 56777776 33 33344444 6688899999887 66664443 2678888754432222222211
Q ss_pred CcccCCccCCCCccceecCCCc
Q 004574 234 PVCYNSVREGMRSISWRADKPS 255 (744)
Q Consensus 234 ~~~~~~~~~~~~~~~~spDg~~ 255 (744)
...+.|+|.|.+
T Consensus 171 ----------at~v~w~~~Gd~ 182 (362)
T KOG0294|consen 171 ----------ATLVSWSPQGDH 182 (362)
T ss_pred ----------ceeeEEcCCCCE
Confidence 224689999997
No 424
>KOG1214 consensus Nidogen and related basement membrane protein proteins [Cell wall/membrane/envelope biogenesis; Extracellular structures]
Probab=94.59 E-value=2 Score=47.63 Aligned_cols=163 Identities=18% Similarity=0.172 Sum_probs=92.2
Q ss_pred EEEEEcCCC-CeeecCCC--ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCC
Q 004574 158 QLVLGSLDG-TAKDFGTP--AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIP 234 (744)
Q Consensus 158 ~l~~~~~~G-~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~ 234 (744)
.|-+..|.| +.+.+.+. ....+++..--++.|+++.... .+|.+-.++|++.|.|....+..
T Consensus 1048 SI~rasL~G~Ep~ti~n~~L~SPEGiAVDh~~Rn~ywtDS~l------------D~IevA~LdG~~rkvLf~tdLVN--- 1112 (1289)
T KOG1214|consen 1048 SISRASLEGAEPETIVNSGLISPEGIAVDHIRRNMYWTDSVL------------DKIEVALLDGSERKVLFYTDLVN--- 1112 (1289)
T ss_pred ccccccccCCCCceeecccCCCccceeeeeccceeeeecccc------------chhheeecCCceeeEEEeecccC---
Confidence 344455656 66655444 2222333333367777775543 26888889999988887654432
Q ss_pred cccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeecc-ceeceeeccCCceE
Q 004574 235 VCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDL-RFRSVSWCDDSLAL 313 (744)
Q Consensus 235 ~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~-~~~~~~~SpDg~~l 313 (744)
++.+...+=+.. |||+. ++ ..+-.|-..+. ++...+.|...+. --+.+.|.|..+.|
T Consensus 1113 ---------PR~iv~D~~rgn-LYwtD-Wn--------RenPkIets~m---DG~NrRilin~DigLPNGLtfdpfs~~L 1170 (1289)
T KOG1214|consen 1113 ---------PRAIVVDPIRGN-LYWTD-WN--------RENPKIETSSM---DGENRRILINTDIGLPNGLTFDPFSKLL 1170 (1289)
T ss_pred ---------cceEEeecccCc-eeecc-cc--------ccCCcceeecc---CCccceEEeecccCCCCCceeCccccee
Confidence 667788777777 99884 11 12223445555 5555555555543 34567788888877
Q ss_pred EEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 314 VNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 314 ~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
.+. +.++.+|--+..++. + ++..-.+.+. | |+...+++.+++..
T Consensus 1171 CWv---DAGt~rleC~~p~g~-g--RR~i~~~LqY----P----F~itsy~~~fY~TD 1214 (1289)
T KOG1214|consen 1171 CWV---DAGTKRLECTLPDGT-G--RRVIQNNLQY----P----FSITSYADHFYHTD 1214 (1289)
T ss_pred eEE---ecCCcceeEecCCCC-c--chhhhhcccC----c----eeeeeccccceeec
Confidence 753 335556655555542 1 2332223322 1 33456777776664
No 425
>KOG2565 consensus Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
Probab=94.52 E-value=0.12 Score=51.64 Aligned_cols=117 Identities=15% Similarity=0.146 Sum_probs=70.1
Q ss_pred CCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhC---C------eEEEe
Q 004574 490 KDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLAR---R------FAVLA 560 (744)
Q Consensus 490 ~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---G------~~v~~ 560 (744)
.+|.++|...++|+.-+ +++..+|++ ++||.+.+.++-. ..+..|... | |.|++
T Consensus 131 IeGL~iHFlhvk~p~~k-~~k~v~PlL-l~HGwPGsv~EFy---------------kfIPlLT~p~~hg~~~d~~FEVI~ 193 (469)
T KOG2565|consen 131 IEGLKIHFLHVKPPQKK-KKKKVKPLL-LLHGWPGSVREFY---------------KFIPLLTDPKRHGNESDYAFEVIA 193 (469)
T ss_pred hcceeEEEEEecCCccc-cCCcccceE-EecCCCchHHHHH---------------hhhhhhcCccccCCccceeEEEec
Confidence 57899999999887642 233345654 5899763322111 112222222 3 67888
Q ss_pred cCCCCCCCCCCCCh-------HHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC
Q 004574 561 GPSIPIIGEGDKLP-------NDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS 628 (744)
Q Consensus 561 ~~~~~~~g~g~~~~-------~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~ 628 (744)
++ .+|+|.++. ...+++++.-|.=+ +.-++..|-|.-+|..++..+|..+|+.+.+.-+..|.
T Consensus 194 -PS--lPGygwSd~~sk~GFn~~a~ArvmrkLMlR--Lg~nkffiqGgDwGSiI~snlasLyPenV~GlHlnm~~ 263 (469)
T KOG2565|consen 194 -PS--LPGYGWSDAPSKTGFNAAATARVMRKLMLR--LGYNKFFIQGGDWGSIIGSNLASLYPENVLGLHLNMCF 263 (469)
T ss_pred -cC--CCCcccCcCCccCCccHHHHHHHHHHHHHH--hCcceeEeecCchHHHHHHHHHhhcchhhhHhhhcccc
Confidence 33 334443321 11334444333222 23378999999999999999999999998887655554
No 426
>PRK10252 entF enterobactin synthase subunit F; Provisional
Probab=94.48 E-value=0.38 Score=60.53 Aligned_cols=35 Identities=11% Similarity=-0.071 Sum_probs=29.1
Q ss_pred CcEEEEEechHHHHHHHHHHh---CCCceeEEEEccCC
Q 004574 594 SRIAVGGHSYGAFMTAHLLAH---APHLFCCGIARSGS 628 (744)
Q Consensus 594 ~~i~l~G~S~GG~~a~~~~~~---~p~~~~~~v~~~~~ 628 (744)
.++.++||||||.+|..++.+ .++.+..++++.+.
T Consensus 1133 ~p~~l~G~S~Gg~vA~e~A~~l~~~~~~v~~l~l~~~~ 1170 (1296)
T PRK10252 1133 GPYHLLGYSLGGTLAQGIAARLRARGEEVAFLGLLDTW 1170 (1296)
T ss_pred CCEEEEEechhhHHHHHHHHHHHHcCCceeEEEEecCC
Confidence 479999999999999999875 46788888877653
No 427
>PLN02606 palmitoyl-protein thioesterase
Probab=94.45 E-value=0.31 Score=48.34 Aligned_cols=55 Identities=13% Similarity=0.119 Sum_probs=42.7
Q ss_pred hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC--ceeEEEEccCCC
Q 004574 574 PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH--LFCCGIARSGSY 629 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~--~~~~~v~~~~~~ 629 (744)
..+.+..+++.|++.+... +-+.++|+|.||.++=.++.+-|+ .++-.|.+++..
T Consensus 76 ~~~Qv~~vce~l~~~~~L~-~G~naIGfSQGglflRa~ierc~~~p~V~nlISlggph 132 (306)
T PLN02606 76 LRQQASIACEKIKQMKELS-EGYNIVAESQGNLVARGLIEFCDNAPPVINYVSLGGPH 132 (306)
T ss_pred HHHHHHHHHHHHhcchhhc-CceEEEEEcchhHHHHHHHHHCCCCCCcceEEEecCCc
Confidence 4567888888888754443 579999999999999888888766 488888887654
No 428
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=94.26 E-value=0.83 Score=48.17 Aligned_cols=130 Identities=12% Similarity=0.131 Sum_probs=70.2
Q ss_pred EEEEEcC---CCCe---eecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEE-eCCCC-----eeeec
Q 004574 158 QLVLGSL---DGTA---KDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVW-TTDGK-----LVREL 224 (744)
Q Consensus 158 ~l~~~~~---~G~~---~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~-~~~g~-----~~~~l 224 (744)
+|++++- +|.. +.+.+. ....++++.+|| |++. .. .+|+.+ +.++. +.+.|
T Consensus 48 rI~~l~d~dgdG~~d~~~vfa~~l~~p~Gi~~~~~G--lyV~-~~-------------~~i~~~~d~~gdg~ad~~~~~l 111 (367)
T TIGR02604 48 RILILEDADGDGKYDKSNVFAEELSMVTGLAVAVGG--VYVA-TP-------------PDILFLRDKDGDDKADGEREVL 111 (367)
T ss_pred EEEEEEcCCCCCCcceeEEeecCCCCccceeEecCC--EEEe-CC-------------CeEEEEeCCCCCCCCCCccEEE
Confidence 6766654 4533 333333 445688899998 5554 22 156666 44332 22223
Q ss_pred cC-CCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCC--------CCccCCccceEEeccCCCCCCCCceEee
Q 004574 225 CD-LPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGD--------ANVEVSPRDIIYTQPAEPAEGEKPEILH 295 (744)
Q Consensus 225 ~~-~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~--------~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 295 (744)
.. .+.... ...+.+..+.|.|||+ ||+.....+.. ........+.++.++. ++++.+.+.
T Consensus 112 ~~~~~~~~~------~~~~~~~~l~~gpDG~--LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~p---dg~~~e~~a 180 (367)
T TIGR02604 112 LSGFGGQIN------NHHHSLNSLAWGPDGW--LYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNP---DGGKLRVVA 180 (367)
T ss_pred EEccCCCCC------cccccccCceECCCCC--EEEecccCCCceeccCCCccCcccccCceEEEEec---CCCeEEEEe
Confidence 22 211100 0113366789999996 77764422110 0011223467899887 666666555
Q ss_pred eeccceeceeeccCCceEE
Q 004574 296 KLDLRFRSVSWCDDSLALV 314 (744)
Q Consensus 296 ~~~~~~~~~~~SpDg~~l~ 314 (744)
.+-.....++|+|+|+.++
T Consensus 181 ~G~rnp~Gl~~d~~G~l~~ 199 (367)
T TIGR02604 181 HGFQNPYGHSVDSWGDVFF 199 (367)
T ss_pred cCcCCCccceECCCCCEEE
Confidence 4445567899999997644
No 429
>KOG1520 consensus Predicted alkaloid synthase/Surface mucin Hemomucin [General function prediction only]
Probab=94.17 E-value=1.8 Score=44.08 Aligned_cols=117 Identities=16% Similarity=0.165 Sum_probs=69.7
Q ss_pred eeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc-ccccccceEEecCCcEEEEEecCCCCCCCC
Q 004574 36 VSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC-LNAVFGSFVWVNNSTLLIFTIPSSRRDPPK 114 (744)
Q Consensus 36 p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~-~~~~~~~~~wspDg~~l~~~~~~~~~~~~~ 114 (744)
.++...|.-|..+ +.---||.++.+||.+..++...+.. +. ....+...++| .|+|+-...+-..
T Consensus 120 l~f~~~ggdL~Va----------DAYlGL~~V~p~g~~a~~l~~~~~G~~~k-f~N~ldI~~~g-~vyFTDSSsk~~~-- 185 (376)
T KOG1520|consen 120 IRFDKKGGDLYVA----------DAYLGLLKVGPEGGLAELLADEAEGKPFK-FLNDLDIDPEG-VVYFTDSSSKYDR-- 185 (376)
T ss_pred EEeccCCCeEEEE----------ecceeeEEECCCCCcceeccccccCeeee-ecCceeEcCCC-eEEEeccccccch--
Confidence 4666666444332 55577899999999998887665411 11 13356677744 5777633211110
Q ss_pred CCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEE
Q 004574 115 KTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLIT 192 (744)
Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~ 192 (744)
.-..- ..+.+...++++.+|. +...+.|.+. .-..+++.|||+..|+|.
T Consensus 186 --------------------rd~~~---------a~l~g~~~GRl~~YD~~tK~~~VLld~L~F~NGlaLS~d~sfvl~~ 236 (376)
T KOG1520|consen 186 --------------------RDFVF---------AALEGDPTGRLFRYDPSTKVTKVLLDGLYFPNGLALSPDGSFVLVA 236 (376)
T ss_pred --------------------hheEE---------eeecCCCccceEEecCcccchhhhhhcccccccccCCCCCCEEEEE
Confidence 00000 1112223468999998 4466666666 566689999999999988
Q ss_pred Eee
Q 004574 193 SMH 195 (744)
Q Consensus 193 ~~~ 195 (744)
...
T Consensus 237 Et~ 239 (376)
T KOG1520|consen 237 ETT 239 (376)
T ss_pred eec
Confidence 554
No 430
>PF05057 DUF676: Putative serine esterase (DUF676); InterPro: IPR007751 This domain, whose function is unknown, is found within a group of putative lipases.
Probab=94.12 E-value=0.044 Score=52.84 Aligned_cols=20 Identities=30% Similarity=0.512 Sum_probs=17.4
Q ss_pred CcEEEEEechHHHHHHHHHH
Q 004574 594 SRIAVGGHSYGAFMTAHLLA 613 (744)
Q Consensus 594 ~~i~l~G~S~GG~~a~~~~~ 613 (744)
.+|.++||||||.++-.++.
T Consensus 78 ~~IsfIgHSLGGli~r~al~ 97 (217)
T PF05057_consen 78 RKISFIGHSLGGLIARYALG 97 (217)
T ss_pred ccceEEEecccHHHHHHHHH
Confidence 58999999999999876665
No 431
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=94.12 E-value=1.4 Score=47.00 Aligned_cols=103 Identities=14% Similarity=0.156 Sum_probs=57.8
Q ss_pred CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccc
Q 004574 244 MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQ 323 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~ 323 (744)
++.+.|.-+|.+ |+.+...++ ...+++..+. +......+-...+.+..+.|.|---+|+.++ +
T Consensus 524 i~~vtWHrkGDY-latV~~~~~---------~~~VliHQLS--K~~sQ~PF~kskG~vq~v~FHPs~p~lfVaT-----q 586 (733)
T KOG0650|consen 524 IRQVTWHRKGDY-LATVMPDSG---------NKSVLIHQLS--KRKSQSPFRKSKGLVQRVKFHPSKPYLFVAT-----Q 586 (733)
T ss_pred cceeeeecCCce-EEEeccCCC---------cceEEEEecc--cccccCchhhcCCceeEEEecCCCceEEEEe-----c
Confidence 778899999998 665543222 2247776662 1112222323466777788888887777665 3
Q ss_pred eeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 324 TRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 324 ~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
..+.++|+...... +.+..+... + . .++-+|.|..|+..+
T Consensus 587 ~~vRiYdL~kqelv-KkL~tg~kw-i----S--~msihp~GDnli~gs 626 (733)
T KOG0650|consen 587 RSVRIYDLSKQELV-KKLLTGSKW-I----S--SMSIHPNGDNLILGS 626 (733)
T ss_pred cceEEEehhHHHHH-HHHhcCCee-e----e--eeeecCCCCeEEEec
Confidence 35778887653111 122221111 0 0 144678888887765
No 432
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=93.97 E-value=5.2 Score=37.15 Aligned_cols=74 Identities=19% Similarity=0.187 Sum_probs=41.0
Q ss_pred eeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECC----CCc--eec----cccCCCccccccccce
Q 004574 23 EVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAE----TGE--AKP----LFESPDICLNAVFGSF 92 (744)
Q Consensus 23 ~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~----gg~--~~~----lt~~~~~~~~~~~~~~ 92 (744)
-++-+...+.++...|.|.|...|.-++ +....|.-++.- .+- +++ +.... ....+.+--.
T Consensus 25 ~i~~l~dsqairav~fhp~g~lyavgsn--------skt~ric~yp~l~~~r~~hea~~~pp~v~~kr~-khhkgsiyc~ 95 (350)
T KOG0641|consen 25 AINILEDSQAIRAVAFHPAGGLYAVGSN--------SKTFRICAYPALIDLRHAHEAAKQPPSVLCKRN-KHHKGSIYCT 95 (350)
T ss_pred EEEEecchhheeeEEecCCCceEEeccC--------CceEEEEccccccCcccccccccCCCeEEeeec-cccCccEEEE
Confidence 4444566667889999999998887543 333445544321 111 111 11110 0111223456
Q ss_pred EEecCCcEEEEEe
Q 004574 93 VWVNNSTLLIFTI 105 (744)
Q Consensus 93 ~wspDg~~l~~~~ 105 (744)
+|||+|+.|+-.+
T Consensus 96 ~ws~~geliatgs 108 (350)
T KOG0641|consen 96 AWSPCGELIATGS 108 (350)
T ss_pred EecCccCeEEecC
Confidence 9999999887643
No 433
>KOG2183 consensus Prolylcarboxypeptidase (angiotensinase C) [Posttranslational modification, protein turnover, chaperones; General function prediction only]
Probab=93.90 E-value=0.052 Score=55.01 Aligned_cols=55 Identities=22% Similarity=0.256 Sum_probs=45.8
Q ss_pred hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC
Q 004574 574 PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS 628 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~ 628 (744)
.+.|....+.+|++........|.++|.|+||++|.++=.++|..+.++++.+.+
T Consensus 147 ALADfA~ll~~lK~~~~a~~~pvIafGGSYGGMLaAWfRlKYPHiv~GAlAaSAP 201 (492)
T KOG2183|consen 147 ALADFAELLTFLKRDLSAEASPVIAFGGSYGGMLAAWFRLKYPHIVLGALAASAP 201 (492)
T ss_pred HHHHHHHHHHHHhhccccccCcEEEecCchhhHHHHHHHhcChhhhhhhhhccCc
Confidence 3458889999998875555678999999999999999999999988887766543
No 434
>KOG2182 consensus Hydrolytic enzymes of the alpha/beta hydrolase fold [Posttranslational modification, protein turnover, chaperones; General function prediction only]
Probab=93.89 E-value=0.46 Score=49.89 Aligned_cols=127 Identities=14% Similarity=0.041 Sum_probs=78.3
Q ss_pred CCCeEEEEEEEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeEEEe--cCCCCC-
Q 004574 490 KDGVPLTATLYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFAVLA--GPSIPI- 566 (744)
Q Consensus 490 ~~g~~l~~~~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~v~~--~~~~~~- 566 (744)
..+...+=++|.+..+...+ -|+.||+-|-|- +.+.|-.... ........+.|-.|+. .+.++.
T Consensus 66 ~~~~~~Qq~~y~n~~~~~~~---gPiFLmIGGEgp--------~~~~wv~~~~--~~~~~~AkkfgA~v~~lEHRFYG~S 132 (514)
T KOG2182|consen 66 SNGKFFQQRFYNNNQWAKPG---GPIFLMIGGEGP--------ESDKWVGNEN--LTWLQWAKKFGATVFQLEHRFYGQS 132 (514)
T ss_pred chhhhhhhheeeccccccCC---CceEEEEcCCCC--------CCCCccccCc--chHHHHHHHhCCeeEEeeeeccccC
Confidence 33444444566665553222 588888877542 1111211211 1233344466888876 222221
Q ss_pred --CCCCCC---------ChHHHHHHHHHHHHHcC-CCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCC
Q 004574 567 --IGEGDK---------LPNDSAEAAVEEVVRRG-VADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSY 629 (744)
Q Consensus 567 --~g~g~~---------~~~~d~~~~~~~l~~~~-~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~ 629 (744)
.+.+.. .++.|+..+|+.+..+- .-+..+.+.+|.|+-|.+++++=.++|+++.+.|+.++++
T Consensus 133 ~P~~~~st~nlk~LSs~QALaDla~fI~~~n~k~n~~~~~~WitFGgSYsGsLsAW~R~~yPel~~GsvASSapv 207 (514)
T KOG2182|consen 133 SPIGDLSTSNLKYLSSLQALADLAEFIKAMNAKFNFSDDSKWITFGGSYSGSLSAWFREKYPELTVGSVASSAPV 207 (514)
T ss_pred CCCCCCcccchhhhhHHHHHHHHHHHHHHHHhhcCCCCCCCeEEECCCchhHHHHHHHHhCchhheeecccccce
Confidence 111111 24458888888888764 4444599999999999999999999999999999988764
No 435
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=93.81 E-value=7 Score=38.03 Aligned_cols=53 Identities=17% Similarity=0.230 Sum_probs=28.5
Q ss_pred eEEEEEcC-CCC-eeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC-CCeeee
Q 004574 157 AQLVLGSL-DGT-AKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD-GKLVRE 223 (744)
Q Consensus 157 ~~l~~~~~-~G~-~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~-g~~~~~ 223 (744)
.+|+.+|. +|+ ..+.......... ...++..|++.... ..++.+|.. |+...+
T Consensus 46 ~~l~~~d~~tG~~~W~~~~~~~~~~~-~~~~~~~v~v~~~~-------------~~l~~~d~~tG~~~W~ 101 (238)
T PF13360_consen 46 GNLYALDAKTGKVLWRFDLPGPISGA-PVVDGGRVYVGTSD-------------GSLYALDAKTGKVLWS 101 (238)
T ss_dssp SEEEEEETTTSEEEEEEECSSCGGSG-EEEETTEEEEEETT-------------SEEEEEETTTSCEEEE
T ss_pred CEEEEEECCCCCEEEEeeccccccce-eeecccccccccce-------------eeeEecccCCcceeee
Confidence 68999998 884 3333222211111 13456667666422 268888854 544444
No 436
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=93.81 E-value=2.8 Score=43.37 Aligned_cols=81 Identities=12% Similarity=0.001 Sum_probs=42.6
Q ss_pred CCccceecCCCceEEEEEeecCC--CCCccCCccceEEeccCCCCCCC-------------CceEeeeeccceeceeecc
Q 004574 244 MRSISWRADKPSTLYWVEAQDRG--DANVEVSPRDIIYTQPAEPAEGE-------------KPEILHKLDLRFRSVSWCD 308 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~--~~~~~~~~~~~l~~~~~~~~~~~-------------~~~~l~~~~~~~~~~~~Sp 308 (744)
...++|.|||. ||+..-..+. ........++.|+.++. ++. ....+..+-.....++|.|
T Consensus 116 g~~l~fgpDG~--LYvs~G~~~~~~~~~~~~~~~G~ilri~~---dG~~p~dnP~~~~~~~~~~i~A~GlRN~~~~~~d~ 190 (331)
T PF07995_consen 116 GGGLAFGPDGK--LYVSVGDGGNDDNAQDPNSLRGKILRIDP---DGSIPADNPFVGDDGADSEIYAYGLRNPFGLAFDP 190 (331)
T ss_dssp EEEEEE-TTSE--EEEEEB-TTTGGGGCSTTSSTTEEEEEET---TSSB-TTSTTTTSTTSTTTEEEE--SEEEEEEEET
T ss_pred CccccCCCCCc--EEEEeCCCCCcccccccccccceEEEecc---cCcCCCCCccccCCCceEEEEEeCCCccccEEEEC
Confidence 44689999993 6655433222 12222345677888887 332 2333444556667788999
Q ss_pred C-CceEEEeeeeeccceeEEEEc
Q 004574 309 D-SLALVNETWYKTSQTRTWLVC 330 (744)
Q Consensus 309 D-g~~l~~~~~~~~~~~~l~~~~ 330 (744)
. |+.++ .-+......+|.++.
T Consensus 191 ~tg~l~~-~d~G~~~~dein~i~ 212 (331)
T PF07995_consen 191 NTGRLWA-ADNGPDGWDEINRIE 212 (331)
T ss_dssp TTTEEEE-EEE-SSSSEEEEEE-
T ss_pred CCCcEEE-EccCCCCCcEEEEec
Confidence 8 65443 333333445666654
No 437
>cd00741 Lipase Lipase. Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation", the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site.
Probab=93.80 E-value=0.14 Score=46.24 Aligned_cols=36 Identities=17% Similarity=0.072 Sum_probs=26.8
Q ss_pred CCcEEEEEechHHHHHHHHHHhCCC----ceeEEEEccCC
Q 004574 593 PSRIAVGGHSYGAFMTAHLLAHAPH----LFCCGIARSGS 628 (744)
Q Consensus 593 ~~~i~l~G~S~GG~~a~~~~~~~p~----~~~~~v~~~~~ 628 (744)
..+|.++|||+||.+|..++..... ....++.++++
T Consensus 27 ~~~i~v~GHSlGg~lA~l~a~~~~~~~~~~~~~~~~fg~p 66 (153)
T cd00741 27 DYKIHVTGHSLGGALAGLAGLDLRGRGLGRLVRVYTFGPP 66 (153)
T ss_pred CCeEEEEEcCHHHHHHHHHHHHHHhccCCCceEEEEeCCC
Confidence 4689999999999999998876533 34445555554
No 438
>COG4287 PqaA PhoPQ-activated pathogenicity-related protein [General function prediction only]
Probab=93.76 E-value=0.38 Score=48.12 Aligned_cols=146 Identities=16% Similarity=0.189 Sum_probs=84.0
Q ss_pred HHHHHHHHHHHc-CCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEcc-CCCCCC----------CCCC-----ccc
Q 004574 577 SAEAAVEEVVRR-GVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARS-GSYNKT----------LTPF-----GFQ 639 (744)
Q Consensus 577 d~~~~~~~l~~~-~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~-~~~~~~----------~~~~-----~~~ 639 (744)
-+.++++-..+. ..+.-+++++.|-|--|..++..|-.+|+ +.++|.+. -..+.. ...| .+.
T Consensus 216 a~srAMdlAq~eL~q~~Ik~F~VTGaSKRgWttwLTAIaDpr-v~aIvp~v~D~Lni~a~L~hiyrsYGgnwpi~l~pyy 294 (507)
T COG4287 216 AVSRAMDLAQDELEQVEIKGFMVTGASKRGWTTWLTAIADPR-VFAIVPFVYDNLNIEAQLLHIYRSYGGNWPIKLAPYY 294 (507)
T ss_pred HHHHHHHHHHhhhhheeeeeEEEeccccchHHHHHHHhcCcc-hhhhhhhHHhhcccHHHHHHHHHhhCCCCCcccchhH
Confidence 344444444432 23445789999999999999988888864 44444332 111110 0000 011
Q ss_pred cccc-chhhcH--H-HHHhcCccccc-----CCCCCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCCcEEEEEeCCCCc
Q 004574 640 TEFR-TLWEAT--N-VYIEMSPITHA-----NKIKKPILIIHGEVDDKVGLFPMQAERFFDALKGHGALSRLVLLPFEHH 710 (744)
Q Consensus 640 ~~~~-~~~~~~--~-~~~~~~~~~~~-----~~~~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~H 710 (744)
.+.- ...+.+ . .+.-.+|+.+. .++..|-.++.+..|.... ++.+.-++..|.. ..-+...|++.|
T Consensus 295 aegi~erl~tp~fkqL~~IiDPlay~~try~~RLalpKyivnaSgDdff~--pDsa~lYyd~LPG---~kaLrmvPN~~H 369 (507)
T COG4287 295 AEGIDERLETPLFKQLLEIIDPLAYRNTRYQLRLALPKYIVNASGDDFFV--PDSANLYYDDLPG---EKALRMVPNDPH 369 (507)
T ss_pred hhhHHHhhcCHHHHHHHHhhcHHHHhhhhhhhhccccceeecccCCcccC--CCccceeeccCCC---ceeeeeCCCCcc
Confidence 1100 000111 1 12223555554 6788999999999998887 8999888888764 347888999999
Q ss_pred ccCccccHHHHHHHHHHHHHHh
Q 004574 711 VYAARENVMHVIWETDRWLQKY 732 (744)
Q Consensus 711 ~~~~~~~~~~~~~~~~~fl~~~ 732 (744)
... .....+.+..|+++.
T Consensus 370 ~~~----n~~i~esl~~flnrf 387 (507)
T COG4287 370 NLI----NQFIKESLEPFLNRF 387 (507)
T ss_pred hhh----HHHHHHHHHHHHHHH
Confidence 654 223334455555543
No 439
>PF07519 Tannase: Tannase and feruloyl esterase; InterPro: IPR011118 This family includes fungal tannase [] and feruloyl esterase [, ]. It also includes several bacterial homologues of unknown function.
Probab=93.76 E-value=0.58 Score=50.82 Aligned_cols=66 Identities=15% Similarity=0.337 Sum_probs=53.7
Q ss_pred CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhC-CC-------cEEEEEeCCCCcccCcc-ccHHHHHHHHHHHHHHh
Q 004574 665 KKPILIIHGEVDDKVGLFPMQAERFFDALKGH-GA-------LSRLVLLPFEHHVYAAR-ENVMHVIWETDRWLQKY 732 (744)
Q Consensus 665 ~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~-~~-------~~~~~~~~~~~H~~~~~-~~~~~~~~~~~~fl~~~ 732 (744)
--.+|+.||..|..|+ +..+.++|+++.+. +. -.+|...||.+|+.... ....+.+..+.+|.++-
T Consensus 353 GGKLI~~HG~aD~~I~--p~~ti~YY~~V~~~~g~~~~~v~dF~RlF~vPGm~HC~gG~g~~~~d~l~aL~~WVE~G 427 (474)
T PF07519_consen 353 GGKLILYHGWADPLIP--PQGTIDYYERVVARMGGALADVDDFYRLFMVPGMGHCGGGPGPDPFDALTALVDWVENG 427 (474)
T ss_pred CCeEEEEecCCCCccC--CCcHHHHHHHHHHhcccccccccceeEEEecCCCcccCCCCCCCCCCHHHHHHHHHhCC
Confidence 3569999999999999 99999999988763 22 26999999999997654 34458889999999854
No 440
>PLN02633 palmitoyl protein thioesterase family protein
Probab=93.72 E-value=0.59 Score=46.50 Aligned_cols=55 Identities=20% Similarity=0.178 Sum_probs=42.8
Q ss_pred hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC--ceeEEEEccCCC
Q 004574 574 PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH--LFCCGIARSGSY 629 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~--~~~~~v~~~~~~ 629 (744)
..+.+..+++.|++.+... +-+.++|+|.||.++=.++.+-|+ .++-.|.+++..
T Consensus 75 ~~~Qve~vce~l~~~~~l~-~G~naIGfSQGGlflRa~ierc~~~p~V~nlISlggph 131 (314)
T PLN02633 75 LTQQAEIACEKVKQMKELS-QGYNIVGRSQGNLVARGLIEFCDGGPPVYNYISLAGPH 131 (314)
T ss_pred HHHHHHHHHHHHhhchhhh-CcEEEEEEccchHHHHHHHHHCCCCCCcceEEEecCCC
Confidence 4557888888888754443 579999999999999888888766 488888887653
No 441
>PF01764 Lipase_3: Lipase (class 3); InterPro: IPR002921 Triglyceride lipases are lipolytic enzymes that hydrolyse ester linkages of triglycerides []. Lipases are widely distributed in animals, plants and prokaryotes. This family of lipases have been called Class 3 as they are not closely related to other lipase families.; GO: 0004806 triglyceride lipase activity, 0006629 lipid metabolic process; PDB: 1LGY_A 1DTE_A 1DT5_F 4DYH_B 1DU4_C 4EA6_B 1GT6_B 1EIN_A 1DT3_A 1TIB_A ....
Probab=93.70 E-value=0.14 Score=45.37 Aligned_cols=50 Identities=20% Similarity=0.201 Sum_probs=32.5
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC-------CCceeEEEEccCC
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA-------PHLFCCGIARSGS 628 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~-------p~~~~~~v~~~~~ 628 (744)
.+.+.+..+.++.. ..+|.+.|||+||.+|..++... +..+++....+|.
T Consensus 49 ~~~~~l~~~~~~~~--~~~i~itGHSLGGalA~l~a~~l~~~~~~~~~~~~~~~fg~P~ 105 (140)
T PF01764_consen 49 QILDALKELVEKYP--DYSIVITGHSLGGALASLAAADLASHGPSSSSNVKCYTFGAPR 105 (140)
T ss_dssp HHHHHHHHHHHHST--TSEEEEEEETHHHHHHHHHHHHHHHCTTTSTTTEEEEEES-S-
T ss_pred HHHHHHHHHHhccc--CccchhhccchHHHHHHHHHHhhhhcccccccceeeeecCCcc
Confidence 45555555555543 37899999999999999888652 1345555555553
No 442
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=93.70 E-value=1.9 Score=42.51 Aligned_cols=67 Identities=16% Similarity=0.176 Sum_probs=46.0
Q ss_pred CCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEE
Q 004574 28 PDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFT 104 (744)
Q Consensus 28 ~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~ 104 (744)
.+|..+...++-|+--.+.... +....|.+.++.+..-..+..+-+ ++...+.++.||+||.+|+..
T Consensus 133 ghG~sINeik~~p~~~qlvls~---------SkD~svRlwnI~~~~Cv~VfGG~e-gHrdeVLSvD~~~~gd~i~Sc 199 (385)
T KOG1034|consen 133 GHGGSINEIKFHPDRPQLVLSA---------SKDHSVRLWNIQTDVCVAVFGGVE-GHRDEVLSVDFSLDGDRIASC 199 (385)
T ss_pred ccCccchhhhcCCCCCcEEEEe---------cCCceEEEEeccCCeEEEEecccc-cccCcEEEEEEcCCCCeeecc
Confidence 4566788899999986666644 223456666877777776654432 233457789999999988754
No 443
>PF02450 LCAT: Lecithin:cholesterol acyltransferase; InterPro: IPR003386 Lecithin:cholesterol acyltransferase (LACT), also known as phosphatidylcholine-sterol acyltransferase (2.3.1.43 from EC), is involved in extracellular metabolism of plasma lipoproteins, including cholesterol. It esterifies the free cholesterol transported in plasma lipoproteins, and is activated by apolipoprotein A-I. Defects in LACT cause Norum and Fish eye diseases. This family also includes phospholipid:diacylglycerol acyltransferase (PDAT)(2.3.1.158 from EC), which is involved in triacylglycerol formation by an acyl-CoA independent pathway. The enzyme specifically transfers acyl groups from the sn-2 position of a phospholipid to diacylglycerol, thus forming an sn-1-lysophospholipid [].; GO: 0008374 O-acyltransferase activity, 0006629 lipid metabolic process
Probab=93.63 E-value=0.18 Score=53.40 Aligned_cols=81 Identities=12% Similarity=0.116 Sum_probs=52.1
Q ss_pred hhHHHHHhCCeEE----EecCCCCCCCC-C-CCChHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC--
Q 004574 546 TSSLIFLARRFAV----LAGPSIPIIGE-G-DKLPNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH-- 617 (744)
Q Consensus 546 ~~~~~~~~~G~~v----~~~~~~~~~g~-g-~~~~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~-- 617 (744)
..+..|.+.||.. ++.+ ++.+-. . .......+...|+.+.+.. .+||.|+||||||.++..++...+.
T Consensus 69 ~li~~L~~~GY~~~~~l~~~p-YDWR~~~~~~~~~~~~lk~~ie~~~~~~---~~kv~li~HSmGgl~~~~fl~~~~~~~ 144 (389)
T PF02450_consen 69 KLIENLEKLGYDRGKDLFAAP-YDWRLSPAERDEYFTKLKQLIEEAYKKN---GKKVVLIAHSMGGLVARYFLQWMPQEE 144 (389)
T ss_pred HHHHHHHhcCcccCCEEEEEe-echhhchhhHHHHHHHHHHHHHHHHHhc---CCcEEEEEeCCCchHHHHHHHhccchh
Confidence 4566788888843 2211 121111 1 1123335666666665543 4799999999999999988877643
Q ss_pred ----ceeEEEEccCCCC
Q 004574 618 ----LFCCGIARSGSYN 630 (744)
Q Consensus 618 ----~~~~~v~~~~~~~ 630 (744)
.++..|.+++++.
T Consensus 145 W~~~~i~~~i~i~~p~~ 161 (389)
T PF02450_consen 145 WKDKYIKRFISIGTPFG 161 (389)
T ss_pred hHHhhhhEEEEeCCCCC
Confidence 5899999988754
No 444
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=93.60 E-value=3.4 Score=44.26 Aligned_cols=57 Identities=21% Similarity=0.299 Sum_probs=33.4
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCcc---ccccccceEEecCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDIC---LNAVFGSFVWVNNS 98 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~---~~~~~~~~~wspDg 98 (744)
......|.|||+.+ ++. ...+.|++++..++..+.+...+... -..+...++++||=
T Consensus 31 ~Pw~maflPDG~ll-VtE---------R~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF 90 (454)
T TIGR03606 31 KPWALLWGPDNQLW-VTE---------RATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDF 90 (454)
T ss_pred CceEEEEcCCCeEE-EEE---------ecCCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCc
Confidence 46778999999643 332 11368899987766655443322110 02245577888773
No 445
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=93.58 E-value=11 Score=39.63 Aligned_cols=57 Identities=7% Similarity=0.116 Sum_probs=38.2
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceec--cccCCCccccccccceEEecCCcEE
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKP--LFESPDICLNAVFGSFVWVNNSTLL 101 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~--lt~~~~~~~~~~~~~~~wspDg~~l 101 (744)
+....|.|-+.-|..+ -++++|+..+++++...+ -++....+ . .+..+.|.+||..|
T Consensus 203 v~~a~FHPtd~nliit----------~Gk~H~~Fw~~~~~~l~k~~~~fek~ek-k-~Vl~v~F~engdvi 261 (626)
T KOG2106|consen 203 VFLATFHPTDPNLIIT----------CGKGHLYFWTLRGGSLVKRQGIFEKREK-K-FVLCVTFLENGDVI 261 (626)
T ss_pred EEEEEeccCCCcEEEE----------eCCceEEEEEccCCceEEEeeccccccc-e-EEEEEEEcCCCCEE
Confidence 5677888888887775 456889999999886422 22222211 1 46678888888866
No 446
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=93.54 E-value=8.3 Score=38.05 Aligned_cols=130 Identities=13% Similarity=0.122 Sum_probs=76.3
Q ss_pred eEEEEEcC-CC-CeeecCCC-ceeeeeccCCCCc--eEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCCC
Q 004574 157 AQLVLGSL-DG-TAKDFGTP-AVYTAVEPSPDQK--YVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPPA 230 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l~~~-~~~~~~~~SpDG~--~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~~ 230 (744)
..|.++|+ .- +...|..+ +.+..+.|+++-. .|+-.+.+ .+|-+|+.+.-+ .+.+..+...
T Consensus 63 etI~IYDm~k~~qlg~ll~HagsitaL~F~~~~S~shLlS~sdD-------------G~i~iw~~~~W~~~~slK~H~~~ 129 (362)
T KOG0294|consen 63 ETIHIYDMRKRKQLGILLSHAGSITALKFYPPLSKSHLLSGSDD-------------GHIIIWRVGSWELLKSLKAHKGQ 129 (362)
T ss_pred CcEEEEeccchhhhcceeccccceEEEEecCCcchhheeeecCC-------------CcEEEEEcCCeEEeeeecccccc
Confidence 46778888 33 55666666 7888888888765 44433222 377788766542 1222222222
Q ss_pred CCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCC
Q 004574 231 EDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDS 310 (744)
Q Consensus 231 ~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg 310 (744)
+.+++..|.|+ |+..-. ++. .|-.+++ -.|....+..-......+.|+|.|
T Consensus 130 -------------Vt~lsiHPS~K--LALsVg---~D~--------~lr~WNL---V~Gr~a~v~~L~~~at~v~w~~~G 180 (362)
T KOG0294|consen 130 -------------VTDLSIHPSGK--LALSVG---GDQ--------VLRTWNL---VRGRVAFVLNLKNKATLVSWSPQG 180 (362)
T ss_pred -------------cceeEecCCCc--eEEEEc---CCc--------eeeeehh---hcCccceeeccCCcceeeEEcCCC
Confidence 66789999998 666532 222 2555555 444444444444445558999999
Q ss_pred ceEEEeeeeeccceeEEEEcC
Q 004574 311 LALVNETWYKTSQTRTWLVCP 331 (744)
Q Consensus 311 ~~l~~~~~~~~~~~~l~~~~~ 331 (744)
.+++.... ....+|..+.
T Consensus 181 d~F~v~~~---~~i~i~q~d~ 198 (362)
T KOG0294|consen 181 DHFVVSGR---NKIDIYQLDN 198 (362)
T ss_pred CEEEEEec---cEEEEEeccc
Confidence 99887662 2234555553
No 447
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=93.54 E-value=2.3 Score=41.20 Aligned_cols=195 Identities=18% Similarity=0.189 Sum_probs=104.2
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCcee-----ccccCCCccccccccceEEecCCcEEEEEec
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAK-----PLFESPDICLNAVFGSFVWVNNSTLLIFTIP 106 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~-----~lt~~~~~~~~~~~~~~~wspDg~~l~~~~~ 106 (744)
.+...+=||-.++|.-++....-+...-..--||.++-.-++.. ++...+...+. .+.-+.|-||+..|+....
T Consensus 65 Evw~las~P~d~~ilaT~yn~~s~s~vl~~aaiw~ipe~~~~S~~~tlE~v~~Ldteavg-~i~cvew~Pns~klasm~d 143 (370)
T KOG1007|consen 65 EVWDLASSPFDQRILATVYNDTSDSGVLTGAAIWQIPEPLGQSNSSTLECVASLDTEAVG-KINCVEWEPNSDKLASMDD 143 (370)
T ss_pred ceehhhcCCCCCceEEEEEeccCCCcceeeEEEEecccccCccccchhhHhhcCCHHHhC-ceeeEEEcCCCCeeEEecc
Confidence 36677778888888777652211111123456888876555422 22222211111 2346789999999987521
Q ss_pred CCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CC-C-eeecCCC------cee
Q 004574 107 SSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DG-T-AKDFGTP------AVY 177 (744)
Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G-~-~~~l~~~------~~~ 177 (744)
.+|.++++ ++ + ...+... ...
T Consensus 144 --------------------------------------------------n~i~l~~l~ess~~vaev~ss~s~e~~~~f 173 (370)
T KOG1007|consen 144 --------------------------------------------------NNIVLWSLDESSKIVAEVLSSESAEMRHSF 173 (370)
T ss_pred --------------------------------------------------CceEEEEcccCcchheeeccccccccccee
Confidence 35666666 44 3 2222211 345
Q ss_pred eeeccCC--CCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 178 TAVEPSP--DQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 178 ~~~~~Sp--DG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
.+-+||| ||..++.+... .++-||..+.. ...|-+..+ .-++++-|.|+-+
T Consensus 174 tsg~WspHHdgnqv~tt~d~--------------tl~~~D~RT~~~~~sI~dAHg------------q~vrdlDfNpnkq 227 (370)
T KOG1007|consen 174 TSGAWSPHHDGNQVATTSDS--------------TLQFWDLRTMKKNNSIEDAHG------------QRVRDLDFNPNKQ 227 (370)
T ss_pred cccccCCCCccceEEEeCCC--------------cEEEEEccchhhhcchhhhhc------------ceeeeccCCCCce
Confidence 5778999 99999877444 57888876432 111111111 1267788999987
Q ss_pred ceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCc-eEeeeeccceeceeeccCCceEEEee
Q 004574 255 STLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKP-EILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 255 ~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
..|+-+ .+.+ -|.++|.+ +...+ ..+.....-+-.+.|.|--..|+.+.
T Consensus 228 ~~lvt~--gDdg----------yvriWD~R--~tk~pv~el~~HsHWvW~VRfn~~hdqLiLs~ 277 (370)
T KOG1007|consen 228 HILVTC--GDDG----------YVRIWDTR--KTKFPVQELPGHSHWVWAVRFNPEHDQLILSG 277 (370)
T ss_pred EEEEEc--CCCc----------cEEEEecc--CCCccccccCCCceEEEEEEecCccceEEEec
Confidence 633322 1111 25555553 22332 33443444455566667666655544
No 448
>KOG1214 consensus Nidogen and related basement membrane protein proteins [Cell wall/membrane/envelope biogenesis; Extracellular structures]
Probab=93.51 E-value=4.5 Score=45.04 Aligned_cols=178 Identities=13% Similarity=0.155 Sum_probs=101.8
Q ss_pred hhhhccceeeeeEEEEEcCCC-Ceee------cCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC
Q 004574 146 YDESLFDYYTTAQLVLGSLDG-TAKD------FGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD 217 (744)
Q Consensus 146 ~~~~~~~~~~~~~l~~~~~~G-~~~~------l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 217 (744)
..+..+.+....+|..+.++| +.++ |.-+ ..+.++.|.=-.+.|+.+.... ..|-+-.+.
T Consensus 988 ~~gt~LL~aqg~~I~~lplng~~~~K~~ak~~l~~p~~IiVGidfDC~e~mvyWtDv~g------------~SI~rasL~ 1055 (1289)
T KOG1214|consen 988 SVGTFLLYAQGQQIGYLPLNGTRLQKDAAKTLLSLPGSIIVGIDFDCRERMVYWTDVAG------------RSISRASLE 1055 (1289)
T ss_pred CCcceEEEeccceEEEeecCcchhchhhhhceEecccceeeeeecccccceEEEeecCC------------Ccccccccc
Confidence 334445555668899999987 3322 2222 4455666655555555443322 245566667
Q ss_pred CCeeeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee
Q 004574 218 GKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL 297 (744)
Q Consensus 218 g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 297 (744)
|.+.+.+....+.. +.+++..--++. +||+.. ..++|-+..+ ++.+.+.|+..
T Consensus 1056 G~Ep~ti~n~~L~S------------PEGiAVDh~~Rn-~ywtDS-----------~lD~IevA~L---dG~~rkvLf~t 1108 (1289)
T KOG1214|consen 1056 GAEPETIVNSGLIS------------PEGIAVDHIRRN-MYWTDS-----------VLDKIEVALL---DGSERKVLFYT 1108 (1289)
T ss_pred CCCCceeecccCCC------------ccceeeeeccce-eeeecc-----------ccchhheeec---CCceeeEEEee
Confidence 77666655443221 334455444565 888741 3344666666 66666666554
Q ss_pred cc-ceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEe
Q 004574 298 DL-RFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKI 371 (744)
Q Consensus 298 ~~-~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~ 371 (744)
+. .-..+...+=+..|+++-|.++ +-.|-+.++++. ..+.+...+.+--. | +.+.|..+.|.|..
T Consensus 1109 dLVNPR~iv~D~~rgnLYwtDWnRe-nPkIets~mDG~--NrRilin~DigLPN---G---Ltfdpfs~~LCWvD 1174 (1289)
T KOG1214|consen 1109 DLVNPRAIVVDPIRGNLYWTDWNRE-NPKIETSSMDGE--NRRILINTDIGLPN---G---LTFDPFSKLLCWVD 1174 (1289)
T ss_pred cccCcceEEeecccCceeecccccc-CCcceeeccCCc--cceEEeecccCCCC---C---ceeCcccceeeEEe
Confidence 33 4456777788899999887663 345778888874 33444444433211 1 44677777777764
No 449
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=93.42 E-value=7.9 Score=38.83 Aligned_cols=181 Identities=12% Similarity=0.089 Sum_probs=107.3
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceec-cccCCCccccccccceEEecCCcEEEEEecCCCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKP-LFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRR 110 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~-lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~ 110 (744)
++....+-|-.+|++-- ....-|-|+|+++|+.+. ||.+.. .+..+++|+-.-+|+....+
T Consensus 153 WVr~vavdP~n~wf~tg----------s~DrtikIwDlatg~LkltltGhi~-----~vr~vavS~rHpYlFs~ged--- 214 (460)
T KOG0285|consen 153 WVRSVAVDPGNEWFATG----------SADRTIKIWDLATGQLKLTLTGHIE-----TVRGVAVSKRHPYLFSAGED--- 214 (460)
T ss_pred eEEEEeeCCCceeEEec----------CCCceeEEEEcccCeEEEeecchhh-----eeeeeeecccCceEEEecCC---
Confidence 67777778877776663 333456666999998765 333333 46788999888777765321
Q ss_pred CCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC-CCCeeecC-CC-ceeeeeccCCCCc
Q 004574 111 DPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL-DGTAKDFG-TP-AVYTAVEPSPDQK 187 (744)
Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~G~~~~l~-~~-~~~~~~~~SpDG~ 187 (744)
.++-.+|+ ..+.-+-- -+ ..+..+...|.=.
T Consensus 215 ----------------------------------------------k~VKCwDLe~nkvIR~YhGHlS~V~~L~lhPTld 248 (460)
T KOG0285|consen 215 ----------------------------------------------KQVKCWDLEYNKVIRHYHGHLSGVYCLDLHPTLD 248 (460)
T ss_pred ----------------------------------------------CeeEEEechhhhhHHHhccccceeEEEeccccce
Confidence 47777888 33333222 22 4566677777655
Q ss_pred eEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccC-CCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCC
Q 004574 188 YVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCD-LPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRG 266 (744)
Q Consensus 188 ~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~-~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~ 266 (744)
.| ++...+ ..+.+||+.++....... +... +..+.+.|-... ++-.+
T Consensus 249 vl-~t~grD------------st~RvWDiRtr~~V~~l~GH~~~-------------V~~V~~~~~dpq-vit~S----- 296 (460)
T KOG0285|consen 249 VL-VTGGRD------------STIRVWDIRTRASVHVLSGHTNP-------------VASVMCQPTDPQ-VITGS----- 296 (460)
T ss_pred eE-EecCCc------------ceEEEeeecccceEEEecCCCCc-------------ceeEEeecCCCc-eEEec-----
Confidence 44 443332 368889988765433332 2222 333444433333 44442
Q ss_pred CCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEee
Q 004574 267 DANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 267 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
....|.++|+. .+.....++.....+..++..|+-..++.++
T Consensus 297 -------~D~tvrlWDl~--agkt~~tlt~hkksvral~lhP~e~~fASas 338 (460)
T KOG0285|consen 297 -------HDSTVRLWDLR--AGKTMITLTHHKKSVRALCLHPKENLFASAS 338 (460)
T ss_pred -------CCceEEEeeec--cCceeEeeecccceeeEEecCCchhhhhccC
Confidence 22346777764 4555666777778888888888877666544
No 450
>COG1075 LipA Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]
Probab=93.42 E-value=0.22 Score=51.59 Aligned_cols=50 Identities=18% Similarity=0.171 Sum_probs=37.6
Q ss_pred HHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCC--CceeEEEEccCCC
Q 004574 578 AEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAP--HLFCCGIARSGSY 629 (744)
Q Consensus 578 ~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p--~~~~~~v~~~~~~ 629 (744)
+.+-++.+..... .+++.++||||||.++..++...+ .+++.++.++.+.
T Consensus 113 l~~~V~~~l~~~g--a~~v~LigHS~GG~~~ry~~~~~~~~~~V~~~~tl~tp~ 164 (336)
T COG1075 113 LFAYVDEVLAKTG--AKKVNLIGHSMGGLDSRYYLGVLGGANRVASVVTLGTPH 164 (336)
T ss_pred HHHHHHHHHhhcC--CCceEEEeecccchhhHHHHhhcCccceEEEEEEeccCC
Confidence 4444444444332 378999999999999999998887 7899999988763
No 451
>TIGR03712 acc_sec_asp2 accessory Sec system protein Asp2. This protein is designated Asp2 because, along with SecY2, SecA2, and other proteins it is part of the accessory secretory protein system. The system is involved in the export of serine-rich glycoproteins important for virulence in a number of Gram-positive species, including Streptococcus gordonii and Staphylococcus aureus. This protein family is assigned to transport rather than glycosylation function, but the specific molecular role is unknown.
Probab=93.37 E-value=5.9 Score=41.95 Aligned_cols=106 Identities=16% Similarity=0.125 Sum_probs=68.6
Q ss_pred EEeCCCCCCCCCCCceEEEEECCCCCcccccCCcccCCCCccCCCCchhHHHHHhCCeE--EEecCCCCCCC---CCCCC
Q 004574 499 LYLPPGYDQSKDGPLPCLFWAYPEDYKSKDAAGQVRGSPNEFSGMTPTSSLIFLARRFA--VLAGPSIPIIG---EGDKL 573 (744)
Q Consensus 499 ~~~P~~~~~~~~~~~p~vv~~HG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~--v~~~~~~~~~g---~g~~~ 573 (744)
.+-|.+. +-|+.|++-|. +. ..++.+ .-+|-..|.- .+..++ -..| -|..+
T Consensus 281 YFnPGD~------KPPL~VYFSGy-----------R~-aEGFEg-----y~MMk~Lg~PfLL~~DpR-leGGaFYlGs~e 336 (511)
T TIGR03712 281 YFNPGDF------KPPLNVYFSGY-----------RP-AEGFEG-----YFMMKRLGAPFLLIGDPR-LEGGAFYLGSDE 336 (511)
T ss_pred ecCCcCC------CCCeEEeeccC-----------cc-cCcchh-----HHHHHhcCCCeEEeeccc-cccceeeeCcHH
Confidence 4446653 25899999873 11 122332 2244556644 444222 1222 25555
Q ss_pred hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCCCC
Q 004574 574 PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGSYN 630 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~~~ 630 (744)
..+-+.+.|+...+.-..+.+.+.|.|.|||.+-|+..+++- .-.|+|..-|+++
T Consensus 337 yE~~I~~~I~~~L~~LgF~~~qLILSGlSMGTfgAlYYga~l--~P~AIiVgKPL~N 391 (511)
T TIGR03712 337 YEQGIINVIQEKLDYLGFDHDQLILSGLSMGTFGALYYGAKL--SPHAIIVGKPLVN 391 (511)
T ss_pred HHHHHHHHHHHHHHHhCCCHHHeeeccccccchhhhhhcccC--CCceEEEcCcccc
Confidence 566788888777776667888999999999999999999875 2367888888765
No 452
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=93.28 E-value=11 Score=38.82 Aligned_cols=122 Identities=12% Similarity=0.077 Sum_probs=64.0
Q ss_pred ceEEEEeCCCCeeeeccCCC-CCC-CCCcccCCccCCCCccceecCCCceEEEEEeec-CCCCCc---cCCccceEEecc
Q 004574 209 QKVQVWTTDGKLVRELCDLP-PAE-DIPVCYNSVREGMRSISWRADKPSTLYWVEAQD-RGDANV---EVSPRDIIYTQP 282 (744)
Q Consensus 209 ~~l~~~~~~g~~~~~l~~~~-~~~-~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~-~~~~~~---~~~~~~~l~~~~ 282 (744)
..|+.++.+|...+.+.... ... .....--....|..+++++|||+. |+.+.... ..+... .....-+|+.++
T Consensus 112 p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~-l~~~~E~~l~~d~~~~~~~~~~~~ri~~~d 190 (326)
T PF13449_consen 112 PRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRT-LFAAMESPLKQDGPRANPDNGSPLRILRYD 190 (326)
T ss_pred CEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCE-EEEEECccccCCCcccccccCceEEEEEec
Confidence 57999999977666552211 100 000001112345678999999997 44443222 111110 111224577777
Q ss_pred CCCCCCCC-ceE-eee--------eccceeceeeccCCceEEEeeee---eccceeEEEEcCCC
Q 004574 283 AEPAEGEK-PEI-LHK--------LDLRFRSVSWCDDSLALVNETWY---KTSQTRTWLVCPGS 333 (744)
Q Consensus 283 ~~~~~~~~-~~~-l~~--------~~~~~~~~~~SpDg~~l~~~~~~---~~~~~~l~~~~~~~ 333 (744)
.. ..+. ..+ ... ....+..+.+-+|++.|+..... .....+||.+++..
T Consensus 191 ~~--~~~~~~~~~~y~ld~~~~~~~~~~isd~~al~d~~lLvLER~~~~~~~~~~ri~~v~l~~ 252 (326)
T PF13449_consen 191 PK--TPGEPVAEYAYPLDPPPTAPGDNGISDIAALPDGRLLVLERDFSPGTGNYKRIYRVDLSD 252 (326)
T ss_pred CC--CCCccceEEEEeCCccccccCCCCceeEEEECCCcEEEEEccCCCCccceEEEEEEEccc
Confidence 62 1121 122 111 24567788899999965554311 23567899999865
No 453
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=93.26 E-value=4.2 Score=42.06 Aligned_cols=112 Identities=13% Similarity=0.039 Sum_probs=58.5
Q ss_pred eeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe-eeeccCC------CCCCCCCcccCC
Q 004574 168 AKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL-VRELCDL------PPAEDIPVCYNS 239 (744)
Q Consensus 168 ~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~-~~~l~~~------~~~~~~~~~~~~ 239 (744)
.+.+..+ ..+..++.|||+++++-.+.. ..|.-|+...+. .+-+... .+....+.. ..
T Consensus 135 ~~~~~~H~~s~~~vals~d~~~~fsask~-------------g~i~kw~v~tgk~~~~i~~~~ev~k~~~~~~k~~r-~~ 200 (479)
T KOG0299|consen 135 FRVIGKHQLSVTSVALSPDDKRVFSASKD-------------GTILKWDVLTGKKDRYIIERDEVLKSHGNPLKESR-KG 200 (479)
T ss_pred ceeeccccCcceEEEeeccccceeecCCC-------------cceeeeehhcCcccccccccchhhhhccCCCCccc-cc
Confidence 3444444 677899999999988644333 245556654332 2112111 111111111 01
Q ss_pred ccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEe-eeeccceeceeeccC
Q 004574 240 VREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEIL-HKLDLRFRSVSWCDD 309 (744)
Q Consensus 240 ~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l-~~~~~~~~~~~~SpD 309 (744)
....+..++.|+||++ |++.. .+. ++.+++. .+.++... ......+.+++|-..
T Consensus 201 h~keil~~avS~Dgky-latgg----~d~--------~v~Iw~~---~t~ehv~~~~ghr~~V~~L~fr~g 255 (479)
T KOG0299|consen 201 HVKEILTLAVSSDGKY-LATGG----RDR--------HVQIWDC---DTLEHVKVFKGHRGAVSSLAFRKG 255 (479)
T ss_pred ccceeEEEEEcCCCcE-EEecC----CCc--------eEEEecC---cccchhhcccccccceeeeeeecC
Confidence 1122446889999997 65541 111 3557777 55665554 444556666666433
No 454
>COG3319 Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=93.19 E-value=0.32 Score=47.61 Aligned_cols=51 Identities=16% Similarity=-0.018 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHc-CCCCCCcEEEEEechHHHHHHHHHHhC---CCceeEEEEccCCCC
Q 004574 577 SAEAAVEEVVRR-GVADPSRIAVGGHSYGAFMTAHLLAHA---PHLFCCGIARSGSYN 630 (744)
Q Consensus 577 d~~~~~~~l~~~-~~~d~~~i~l~G~S~GG~~a~~~~~~~---p~~~~~~v~~~~~~~ 630 (744)
-+...++.+++. +. ..+.|.|+|+||.+|.-+|.+- -+-++-++++..+..
T Consensus 50 ~a~~yv~~Ir~~QP~---GPy~L~G~S~GG~vA~evA~qL~~~G~~Va~L~llD~~~~ 104 (257)
T COG3319 50 MAAAYVAAIRRVQPE---GPYVLLGWSLGGAVAFEVAAQLEAQGEEVAFLGLLDAVPP 104 (257)
T ss_pred HHHHHHHHHHHhCCC---CCEEEEeeccccHHHHHHHHHHHhCCCeEEEEEEeccCCC
Confidence 344445555553 32 4699999999999999888653 235666666665543
No 455
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=93.19 E-value=2.8 Score=42.22 Aligned_cols=74 Identities=15% Similarity=0.181 Sum_probs=49.0
Q ss_pred eEEEEEcC-CC-CeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCe---eeeccCCCCCC
Q 004574 157 AQLVLGSL-DG-TAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKL---VRELCDLPPAE 231 (744)
Q Consensus 157 ~~l~~~~~-~G-~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~---~~~l~~~~~~~ 231 (744)
-.|-++|+ +| ....++.......+..+|+.+.|+-.+.. ..|.+||+..+. +++-..+..+-
T Consensus 281 HTIk~WDletg~~~~~~~~~ksl~~i~~~~~~~Ll~~gssd-------------r~irl~DPR~~~gs~v~~s~~gH~nw 347 (423)
T KOG0313|consen 281 HTIKVWDLETGGLKSTLTTNKSLNCISYSPLSKLLASGSSD-------------RHIRLWDPRTGDGSVVSQSLIGHKNW 347 (423)
T ss_pred ceEEEEEeecccceeeeecCcceeEeecccccceeeecCCC-------------CceeecCCCCCCCceeEEeeecchhh
Confidence 35778888 66 66677767777788999988888766554 378899986443 22211111111
Q ss_pred CCCcccCCccCCCCccceecCCCc
Q 004574 232 DIPVCYNSVREGMRSISWRADKPS 255 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~spDg~~ 255 (744)
+..+.|+|-...
T Consensus 348 ------------Vssvkwsp~~~~ 359 (423)
T KOG0313|consen 348 ------------VSSVKWSPTNEF 359 (423)
T ss_pred ------------hhheecCCCCce
Confidence 557899999886
No 456
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=93.02 E-value=8.2 Score=42.42 Aligned_cols=60 Identities=12% Similarity=0.077 Sum_probs=40.1
Q ss_pred ccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEEEe
Q 004574 33 INFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 33 ~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~~~ 105 (744)
+..-.++-.+++|++.+. .+-||+..-.+|+-+.+++..... .......|++..++++..
T Consensus 36 v~lTc~dst~~~l~~GsS----------~G~lyl~~R~~~~~~~~~~~~~~~---~~~~~~vs~~e~lvAagt 95 (726)
T KOG3621|consen 36 VKLTCVDATEEYLAMGSS----------AGSVYLYNRHTGEMRKLKNEGATG---ITCVRSVSSVEYLVAAGT 95 (726)
T ss_pred EEEEEeecCCceEEEecc----------cceEEEEecCchhhhcccccCccc---eEEEEEecchhHhhhhhc
Confidence 444566777888888643 367999998899988887644221 123446778888777753
No 457
>PRK13613 lipoprotein LpqB; Provisional
Probab=92.96 E-value=3.8 Score=45.78 Aligned_cols=145 Identities=11% Similarity=0.056 Sum_probs=78.9
Q ss_pred eEEEEEcCCC--C----eeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCC
Q 004574 157 AQLVLGSLDG--T----AKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPA 230 (744)
Q Consensus 157 ~~l~~~~~~G--~----~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~ 230 (744)
..+++-++.+ . .+.+........|+|.++| .|+......+ ....+.+....|.. ..+.. +.-
T Consensus 385 ~~l~vg~~~~~~~~~~~~~~~~~~~~Lt~PS~d~~g-~vWtvd~~~~---------~~~vl~v~~~~G~~-~~V~~-~~l 452 (599)
T PRK13613 385 DSVYVGSLTPGASIGVHSWGVTADGRLTSPSWDGRG-DLWVVDRDPA---------DPRLLWLLQGDGEP-VEVRT-PEL 452 (599)
T ss_pred cEEEEeccCCCCccccccceeeccCcccCCcCcCCC-CEEEecCCCC---------CceEEEEEcCCCcE-EEeec-ccc
Confidence 3566666522 3 3344445667889999998 6765521110 01125555544443 22222 111
Q ss_pred CCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEee------eeccceece
Q 004574 231 EDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILH------KLDLRFRSV 304 (744)
Q Consensus 231 ~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~------~~~~~~~~~ 304 (744)
.+ ..+..+..|+||-+ ++.+....+.. +|++.-+.--..+. ..|+ .....+.++
T Consensus 453 ~g---------~~I~~lrvSrDG~R-vAvv~~~~g~~---------~v~va~V~R~~~G~-~~l~~~~~l~~~l~~v~~~ 512 (599)
T PRK13613 453 DG---------HRVVAVRVARDGVR-VALIVEKDGRR---------SLQIGRIVRDAKAV-VSVEEFRSLAPELEDVTDM 512 (599)
T ss_pred CC---------CEeEEEEECCCccE-EEEEEecCCCc---------EEEEEEEEeCCCCc-EEeeccEEeccCCCcccee
Confidence 11 12678999999999 77765432322 23333321112233 3332 233457889
Q ss_pred eeccCCceEEEeeeeeccceeEEEEcCCCC
Q 004574 305 SWCDDSLALVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 305 ~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
+|..++..++... ..+....+|++++++.
T Consensus 513 ~W~~~~sL~Vlg~-~~~~~~~v~~v~vdG~ 541 (599)
T PRK13613 513 SWAGDSQLVVLGR-EEGGVQQARYVQVDGS 541 (599)
T ss_pred EEcCCCEEEEEec-cCCCCcceEEEecCCc
Confidence 9999998666443 2334567999999985
No 458
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=92.81 E-value=16 Score=39.24 Aligned_cols=182 Identities=12% Similarity=0.043 Sum_probs=88.8
Q ss_pred eEEeccCCCCCCCCceE-eeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeee-ccccc---cccC
Q 004574 277 IIYTQPAEPAEGEKPEI-LHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLF-DRVFE---NVYS 351 (744)
Q Consensus 277 ~l~~~~~~~~~~~~~~~-l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~-~~~~~---~~~~ 351 (744)
.||.+++ +.|..-. +-...+.++.+..++-...|++... ...+--+|.-.. .....|- ..+.. ....
T Consensus 156 evYRlNL---EqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~----~g~VEfwDpR~k-srv~~l~~~~~v~s~pg~~~ 227 (703)
T KOG2321|consen 156 EVYRLNL---EQGRFLNPFETDSGELNVVSINEEHGLLACGTE----DGVVEFWDPRDK-SRVGTLDAASSVNSHPGGDA 227 (703)
T ss_pred ceEEEEc---cccccccccccccccceeeeecCccceEEeccc----CceEEEecchhh-hhheeeecccccCCCccccc
Confidence 3899988 4444332 2222355666777777777777551 112223333221 0111110 00100 0011
Q ss_pred CCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcc
Q 004574 352 DPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGE 431 (744)
Q Consensus 352 ~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~ 431 (744)
.+....+.|+-||-.+++... ..+++++|+.+.+.-.+-....+ -....
T Consensus 228 ~~svTal~F~d~gL~~aVGts---------------------~G~v~iyDLRa~~pl~~kdh~~e---~pi~~------- 276 (703)
T KOG2321|consen 228 APSVTALKFRDDGLHVAVGTS---------------------TGSVLIYDLRASKPLLVKDHGYE---LPIKK------- 276 (703)
T ss_pred cCcceEEEecCCceeEEeecc---------------------CCcEEEEEcccCCceeecccCCc---cceee-------
Confidence 112233557777877766542 12488889877665433322221 01111
Q ss_pred eecccC--CCEEEEEEecCCCCceEEEEECCCCceeeeecCCCCCCCcCCCceEEEEEEcCCCeEEEEEEEeCC
Q 004574 432 EDINLN--QLKILTSKESKTEITQYHILSWPLKKSSQITNFPHPYPTLASLQKEMIKYQRKDGVPLTATLYLPP 503 (744)
Q Consensus 432 ~~~s~d--~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~~~~~~~~~~~~~~~i~~~~~~g~~l~~~~~~P~ 503 (744)
+.|-+. ++.++ +. ....+.+|+..+|+......+..++..+..++..-+.|.+.+...++. +|+|.
T Consensus 277 l~~~~~~~q~~v~-S~----Dk~~~kiWd~~~Gk~~asiEpt~~lND~C~~p~sGm~f~Ane~~~m~~-yyiP~ 344 (703)
T KOG2321|consen 277 LDWQDTDQQNKVV-SM----DKRILKIWDECTGKPMASIEPTSDLNDFCFVPGSGMFFTANESSKMHT-YYIPS 344 (703)
T ss_pred ecccccCCCceEE-ec----chHHhhhcccccCCceeeccccCCcCceeeecCCceEEEecCCCccee-EEccc
Confidence 122222 22332 11 123466777778877665555556666777776666666667666665 55564
No 459
>PF00450 Peptidase_S10: Serine carboxypeptidase; InterPro: IPR001563 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine peptidases belong to MEROPS peptidase family S10 (clan SC). The type example is carboxypeptidase Y from Saccharomyces cerevisiae (Baker's yeast) []. All known carboxypeptidases are either metallo carboxypeptidases or serine carboxypeptidases (3.4.16.5 from EC and 3.4.16.6 from EC). The catalytic activity of the serine carboxypeptidases, like that of the trypsin family serine proteases, is provided by a charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which is itself hydrogen-bonded to a serine []. The sequences surrounding the active site serine and histidine residues are highly conserved in all the serine carboxypeptidases.; GO: 0004185 serine-type carboxypeptidase activity, 0006508 proteolysis; PDB: 1AC5_A 1WHS_B 3SC2_B 1WHT_A 1BCR_A 1BCS_A 1GXS_A 1IVY_A 1WPX_A 1YSC_A ....
Probab=92.57 E-value=0.64 Score=50.10 Aligned_cols=63 Identities=21% Similarity=0.266 Sum_probs=43.8
Q ss_pred CCCEEEEeeCCCCCCCCCHHHHHHHHHHHHhCCC----------------------cEEEEEeCCCCcccCccccHHHHH
Q 004574 665 KKPILIIHGEVDDKVGLFPMQAERFFDALKGHGA----------------------LSRLVLLPFEHHVYAARENVMHVI 722 (744)
Q Consensus 665 ~~P~l~i~G~~D~~v~~~~~~~~~~~~~l~~~~~----------------------~~~~~~~~~~~H~~~~~~~~~~~~ 722 (744)
..++||.+|..|.+++ ...++.+.+.|.-.+. +..++.+.++||+.. ...+....
T Consensus 330 ~irVLiy~Gd~D~i~n--~~Gt~~~i~~L~w~~~~~f~~~~~~~~~~~~G~~k~~~~ltf~~V~~AGHmvP-~dqP~~a~ 406 (415)
T PF00450_consen 330 GIRVLIYNGDLDLICN--FLGTERWIDNLNWSGKDGFRQWPRKVNGQVAGYVKQYGNLTFVTVRGAGHMVP-QDQPEAAL 406 (415)
T ss_dssp T-EEEEEEETT-SSS---HHHHHHHHHCTECTEEEEEEEEEEETTCSEEEEEEEETTEEEEEETT--SSHH-HHSHHHHH
T ss_pred cceeEEeccCCCEEEE--eccchhhhhccccCcccccccccccccccccceeEEeccEEEEEEcCCcccCh-hhCHHHHH
Confidence 4899999999999998 9999999888742211 246888899999865 55577777
Q ss_pred HHHHHHHH
Q 004574 723 WETDRWLQ 730 (744)
Q Consensus 723 ~~~~~fl~ 730 (744)
..+..||.
T Consensus 407 ~m~~~fl~ 414 (415)
T PF00450_consen 407 QMFRRFLK 414 (415)
T ss_dssp HHHHHHHC
T ss_pred HHHHHHhc
Confidence 77777763
No 460
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=92.42 E-value=5.2 Score=41.51 Aligned_cols=39 Identities=21% Similarity=0.173 Sum_probs=28.6
Q ss_pred eEEEEEcC-CCCeeecCCC--ceeeeeccCCCCceEEEEEee
Q 004574 157 AQLVLGSL-DGTAKDFGTP--AVYTAVEPSPDQKYVLITSMH 195 (744)
Q Consensus 157 ~~l~~~~~-~G~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~ 195 (744)
++||++.+ +|..-.+... ..+.-+.||-||+.|+-.+.+
T Consensus 103 g~lYlWelssG~LL~v~~aHYQ~ITcL~fs~dgs~iiTgskD 144 (476)
T KOG0646|consen 103 GNLYLWELSSGILLNVLSAHYQSITCLKFSDDGSHIITGSKD 144 (476)
T ss_pred CcEEEEEeccccHHHHHHhhccceeEEEEeCCCcEEEecCCC
Confidence 68999999 8855444333 677788999999987655433
No 461
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=92.35 E-value=12 Score=38.68 Aligned_cols=138 Identities=14% Similarity=0.159 Sum_probs=76.8
Q ss_pred eEEEEEcC-CCCeeecCCC--ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCC
Q 004574 157 AQLVLGSL-DGTAKDFGTP--AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDI 233 (744)
Q Consensus 157 ~~l~~~~~-~G~~~~l~~~--~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~ 233 (744)
..+-++|+ +|++.+.... +.++.+.|.|-.-.++.+..-. ..+-++|... .......-
T Consensus 266 ~TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D------------~~V~l~D~R~-----~~~s~~~w-- 326 (463)
T KOG0270|consen 266 KTVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYD------------GTVALKDCRD-----PSNSGKEW-- 326 (463)
T ss_pred ceEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEecccc------------ceEEeeeccC-----ccccCceE--
Confidence 46888899 8877766554 8899999999887777665432 2444444321 00000000
Q ss_pred CcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCce-EeeeeccceeceeeccCCce
Q 004574 234 PVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPE-ILHKLDLRFRSVSWCDDSLA 312 (744)
Q Consensus 234 ~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~l~~~~~~~~~~~~SpDg~~ 312 (744)
.....+..++|.|.... .++++. ..+.|+-+|.+ ..+++. .+--++..++.+++++.-..
T Consensus 327 -----k~~g~VEkv~w~~~se~-~f~~~t-----------ddG~v~~~D~R--~~~~~vwt~~AHd~~ISgl~~n~~~p~ 387 (463)
T KOG0270|consen 327 -----KFDGEVEKVAWDPHSEN-SFFVST-----------DDGTVYYFDIR--NPGKPVWTLKAHDDEISGLSVNIQTPG 387 (463)
T ss_pred -----EeccceEEEEecCCCce-eEEEec-----------CCceEEeeecC--CCCCceeEEEeccCCcceEEecCCCCc
Confidence 00011334567777654 444432 22346666664 333332 34445778888888888777
Q ss_pred EEEeeeeeccceeEEEEcCCC
Q 004574 313 LVNETWYKTSQTRTWLVCPGS 333 (744)
Q Consensus 313 l~~~~~~~~~~~~l~~~~~~~ 333 (744)
++.+. .....-.||-++.+.
T Consensus 388 ~l~t~-s~d~~Vklw~~~~~~ 407 (463)
T KOG0270|consen 388 LLSTA-STDKVVKLWKFDVDS 407 (463)
T ss_pred ceeec-cccceEEEEeecCCC
Confidence 76654 222444566665544
No 462
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=92.29 E-value=0.72 Score=36.74 Aligned_cols=81 Identities=14% Similarity=0.133 Sum_probs=51.0
Q ss_pred ccceecCCCceEEEEEeecCCCCC------ccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeee
Q 004574 246 SISWRADKPSTLYWVEAQDRGDAN------VEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWY 319 (744)
Q Consensus 246 ~~~~spDg~~~l~~~~~~~~~~~~------~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~ 319 (744)
++...+++.. |||+......... .+....++|+.++. .+++.+.|..+-...+.++.|+|+..++++.
T Consensus 2 dldv~~~~g~-vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp---~t~~~~vl~~~L~fpNGVals~d~~~vlv~E-- 75 (89)
T PF03088_consen 2 DLDVDQDTGT-VYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDP---STKETTVLLDGLYFPNGVALSPDESFVLVAE-- 75 (89)
T ss_dssp EEEE-TTT---EEEEES-SS--TTGHHHHHHHT---EEEEEEET---TTTEEEEEEEEESSEEEEEE-TTSSEEEEEE--
T ss_pred ceeEecCCCE-EEEEeCccccCccceeeeeecCCCCcCEEEEEC---CCCeEEEehhCCCccCeEEEcCCCCEEEEEe--
Confidence 3456777444 8888764332211 12356678999998 7788778888878889999999999998865
Q ss_pred eccceeEEEEcCCC
Q 004574 320 KTSQTRTWLVCPGS 333 (744)
Q Consensus 320 ~~~~~~l~~~~~~~ 333 (744)
....+|.++-+.+
T Consensus 76 -t~~~Ri~rywl~G 88 (89)
T PF03088_consen 76 -TGRYRILRYWLKG 88 (89)
T ss_dssp -GGGTEEEEEESSS
T ss_pred -ccCceEEEEEEeC
Confidence 2445677766554
No 463
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=92.29 E-value=12 Score=36.46 Aligned_cols=235 Identities=14% Similarity=0.003 Sum_probs=0.0
Q ss_pred CeeecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCc
Q 004574 167 TAKDFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRS 246 (744)
Q Consensus 167 ~~~~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 246 (744)
....+........++-+|||. |+|+.+... .|-..|+.+++.++..-..++. +..
T Consensus 54 ~~fpvp~G~ap~dvapapdG~-VWft~qg~g------------aiGhLdP~tGev~~ypLg~Ga~------------Phg 108 (353)
T COG4257 54 AEFPVPNGSAPFDVAPAPDGA-VWFTAQGTG------------AIGHLDPATGEVETYPLGSGAS------------PHG 108 (353)
T ss_pred ceeccCCCCCccccccCCCCc-eEEecCccc------------cceecCCCCCceEEEecCCCCC------------Cce
Q ss_pred cceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee----ccceeceeeccCCceEEEeeeeecc
Q 004574 247 ISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL----DLRFRSVSWCDDSLALVNETWYKTS 322 (744)
Q Consensus 247 ~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~----~~~~~~~~~SpDg~~l~~~~~~~~~ 322 (744)
+..-|||. ..++ +++.. |.+++- ++.+.++++-. ......+.|.++|+ |.|+. .
T Consensus 109 iv~gpdg~--~Wit---d~~~a---------I~R~dp---kt~evt~f~lp~~~a~~nlet~vfD~~G~-lWFt~----q 166 (353)
T COG4257 109 IVVGPDGS--AWIT---DTGLA---------IGRLDP---KTLEVTRFPLPLEHADANLETAVFDPWGN-LWFTG----Q 166 (353)
T ss_pred EEECCCCC--eeEe---cCcce---------eEEecC---cccceEEeecccccCCCcccceeeCCCcc-EEEee----c
Q ss_pred ceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEEEccCCCCCCCCCceEEEEec
Q 004574 323 QTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYILLNGRGFTPEGNIPFLDLFDI 402 (744)
Q Consensus 323 ~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~l~~~d~ 402 (744)
..-.=++|... ....++....+. +-.-++..|||+ +.|.+ ...+.|.++|.
T Consensus 167 ~G~yGrLdPa~---~~i~vfpaPqG~-----gpyGi~atpdGs-vwyas--------------------lagnaiaridp 217 (353)
T COG4257 167 IGAYGRLDPAR---NVISVFPAPQGG-----GPYGICATPDGS-VWYAS--------------------LAGNAIARIDP 217 (353)
T ss_pred cccceecCccc---CceeeeccCCCC-----CCcceEECCCCc-EEEEe--------------------ccccceEEccc
Q ss_pred CCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecCCCCCCCcCCCce
Q 004574 403 NTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNFPHPYPTLASLQK 482 (744)
Q Consensus 403 ~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~~~~~~~~~~~~~ 482 (744)
.++-.+++-..+.. ..+...+..+|.|...+ +....+.++++|+.... +.+++ ..-.....
T Consensus 218 ~~~~aev~p~P~~~---------~~gsRriwsdpig~~wi----ttwg~g~l~rfdPs~~s---W~eyp---LPgs~arp 278 (353)
T COG4257 218 FAGHAEVVPQPNAL---------KAGSRRIWSDPIGRAWI----TTWGTGSLHRFDPSVTS---WIEYP---LPGSKARP 278 (353)
T ss_pred ccCCcceecCCCcc---------cccccccccCccCcEEE----eccCCceeeEeCccccc---ceeee---CCCCCCCc
Q ss_pred EEEEEEcCCCeEEE
Q 004574 483 EMIKYQRKDGVPLT 496 (744)
Q Consensus 483 ~~i~~~~~~g~~l~ 496 (744)
+.+.+...+-..++
T Consensus 279 ys~rVD~~grVW~s 292 (353)
T COG4257 279 YSMRVDRHGRVWLS 292 (353)
T ss_pred ceeeeccCCcEEee
No 464
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=92.16 E-value=1.1 Score=47.91 Aligned_cols=76 Identities=20% Similarity=0.364 Sum_probs=37.3
Q ss_pred eEEEEeCCCCe-eeeccCCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCC-
Q 004574 210 KVQVWTTDGKL-VRELCDLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAE- 287 (744)
Q Consensus 210 ~l~~~~~~g~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~- 287 (744)
.|..||...+. ++++...+ +..+.||+||+. ++++. ...+++++.. .+
T Consensus 127 ~i~~yDw~~~~~i~~i~v~~---------------vk~V~Ws~~g~~-val~t-------------~~~i~il~~~-~~~ 176 (443)
T PF04053_consen 127 FICFYDWETGKLIRRIDVSA---------------VKYVIWSDDGEL-VALVT-------------KDSIYILKYN-LEA 176 (443)
T ss_dssp EEEEE-TTT--EEEEESS-E----------------EEEEE-TTSSE-EEEE--------------S-SEEEEEE--HHH
T ss_pred CEEEEEhhHcceeeEEecCC---------------CcEEEEECCCCE-EEEEe-------------CCeEEEEEec-chh
Confidence 48888887543 44443221 346789999987 77763 2236665541 01
Q ss_pred ------CC---CceEeeeeccceeceeeccCCceEEEee
Q 004574 288 ------GE---KPEILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 288 ------~~---~~~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
.| ....+.....++.+..|-.| -++|++
T Consensus 177 ~~~~~~~g~e~~f~~~~E~~~~IkSg~W~~d--~fiYtT 213 (443)
T PF04053_consen 177 VAAIPEEGVEDAFELIHEISERIKSGCWVED--CFIYTT 213 (443)
T ss_dssp HHHBTTTB-GGGEEEEEEE-S--SEEEEETT--EEEEE-
T ss_pred cccccccCchhceEEEEEecceeEEEEEEcC--EEEEEc
Confidence 01 12223333678888899877 777765
No 465
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=92.00 E-value=2 Score=47.32 Aligned_cols=169 Identities=15% Similarity=0.216 Sum_probs=102.2
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCC-cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCc
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGA-KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDI 83 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~-~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~ 83 (744)
..|-+|+|.. + +..++++.+.+.. .+....|++---.|.... .+++.-.+| |+...+.+..+.+...
T Consensus 110 G~i~vWdlnk--~-~rnk~l~~f~EH~Rs~~~ldfh~tep~iliSG-------SQDg~vK~~--DlR~~~S~~t~~~nSE 177 (839)
T KOG0269|consen 110 GVISVWDLNK--S-IRNKLLTVFNEHERSANKLDFHSTEPNILISG-------SQDGTVKCW--DLRSKKSKSTFRSNSE 177 (839)
T ss_pred CcEEEEecCc--c-ccchhhhHhhhhccceeeeeeccCCccEEEec-------CCCceEEEE--eeecccccccccccch
Confidence 3577788844 1 2345555445544 566778887776776633 224445555 6667677766655321
Q ss_pred cccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEc
Q 004574 84 CLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGS 163 (744)
Q Consensus 84 ~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 163 (744)
.+.++.|||--...+++..+. +.|-++|
T Consensus 178 ----SiRDV~fsp~~~~~F~s~~ds------------------------------------------------G~lqlWD 205 (839)
T KOG0269|consen 178 ----SIRDVKFSPGYGNKFASIHDS------------------------------------------------GYLQLWD 205 (839)
T ss_pred ----hhhceeeccCCCceEEEecCC------------------------------------------------ceEEEee
Confidence 467889998654444443211 3344445
Q ss_pred C---CCCeeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCC
Q 004574 164 L---DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNS 239 (744)
Q Consensus 164 ~---~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~ 239 (744)
+ +-=..+++.+ +.+.-..|+|++.+|+-...+ ..+-+|+..+.+.+.+.... ...|
T Consensus 206 lRqp~r~~~k~~AH~GpV~c~nwhPnr~~lATGGRD-------------K~vkiWd~t~~~~~~~~tIn--Tiap----- 265 (839)
T KOG0269|consen 206 LRQPDRCEKKLTAHNGPVLCLNWHPNREWLATGGRD-------------KMVKIWDMTDSRAKPKHTIN--TIAP----- 265 (839)
T ss_pred ccCchhHHHHhhcccCceEEEeecCCCceeeecCCC-------------ccEEEEeccCCCccceeEEe--ecce-----
Confidence 5 2245566767 888899999999888755422 36888888876555443221 1112
Q ss_pred ccCCCCccceecCCCceEEEEE
Q 004574 240 VREGMRSISWRADKPSTLYWVE 261 (744)
Q Consensus 240 ~~~~~~~~~~spDg~~~l~~~~ 261 (744)
+..+.|-|+-+++|+-++
T Consensus 266 ----v~rVkWRP~~~~hLAtcs 283 (839)
T KOG0269|consen 266 ----VGRVKWRPARSYHLATCS 283 (839)
T ss_pred ----eeeeeeccCccchhhhhh
Confidence 556899999988666654
No 466
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=91.74 E-value=20 Score=37.93 Aligned_cols=81 Identities=14% Similarity=0.078 Sum_probs=39.0
Q ss_pred eEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCceeeeecCCC-CC
Q 004574 396 FLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSSQITNFPH-PY 474 (744)
Q Consensus 396 ~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~~lt~~~~-~~ 474 (744)
.|+.+|..+|+. +|..... ... .. .+....++.++.... -+.++.+|..+|+..--..... ..
T Consensus 290 ~l~~~d~~tG~~--~W~~~~~--~~~--~~------ssp~i~g~~l~~~~~----~G~l~~~d~~tG~~~~~~~~~~~~~ 353 (377)
T TIGR03300 290 VVVALDRRSGSE--LWKNDEL--KYR--QL------TAPAVVGGYLVVGDF----EGYLHWLSREDGSFVARLKTDGSGI 353 (377)
T ss_pred eEEEEECCCCcE--EEccccc--cCC--cc------ccCEEECCEEEEEeC----CCEEEEEECCCCCEEEEEEcCCCcc
Confidence 488999988864 4543211 000 00 011123455444322 3579999988887653222222 12
Q ss_pred CCcCCCceEEEEEEcCCC
Q 004574 475 PTLASLQKEMIKYQRKDG 492 (744)
Q Consensus 475 ~~~~~~~~~~i~~~~~~g 492 (744)
...+.+..+.+.+...+|
T Consensus 354 ~~sp~~~~~~l~v~~~dG 371 (377)
T TIGR03300 354 ASPPVVVGDGLLVQTRDG 371 (377)
T ss_pred ccCCEEECCEEEEEeCCc
Confidence 222223334555555555
No 467
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=91.56 E-value=1.2 Score=35.58 Aligned_cols=41 Identities=20% Similarity=0.186 Sum_probs=31.8
Q ss_pred eeeEEEEEcC-CCCeeecCCC-ceeeeeccCCCCceEEEEEee
Q 004574 155 TTAQLVLGSL-DGTAKDFGTP-AVYTAVEPSPDQKYVLITSMH 195 (744)
Q Consensus 155 ~~~~l~~~~~-~G~~~~l~~~-~~~~~~~~SpDG~~i~~~~~~ 195 (744)
..++|+.+|. +++.+.|... ....+++.|||++.|++....
T Consensus 35 ~~GRll~ydp~t~~~~vl~~~L~fpNGVals~d~~~vlv~Et~ 77 (89)
T PF03088_consen 35 PTGRLLRYDPSTKETTVLLDGLYFPNGVALSPDESFVLVAETG 77 (89)
T ss_dssp --EEEEEEETTTTEEEEEEEEESSEEEEEE-TTSSEEEEEEGG
T ss_pred CCcCEEEEECCCCeEEEehhCCCccCeEEEcCCCCEEEEEecc
Confidence 4589999999 6677777766 667789999999999988654
No 468
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=91.46 E-value=8.4 Score=40.04 Aligned_cols=61 Identities=10% Similarity=0.169 Sum_probs=41.7
Q ss_pred ecCCCceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccce
Q 004574 170 DFGTPAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISW 249 (744)
Q Consensus 170 ~l~~~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (744)
.+..++.+..++-+|+|.+|+-.. .. .+||+|.+.++..-.+....... +.-+.|
T Consensus 77 ~~v~Pg~v~al~s~n~G~~l~ag~-i~------------g~lYlWelssG~LL~v~~aHYQ~------------ITcL~f 131 (476)
T KOG0646|consen 77 YIVLPGPVHALASSNLGYFLLAGT-IS------------GNLYLWELSSGILLNVLSAHYQS------------ITCLKF 131 (476)
T ss_pred hcccccceeeeecCCCceEEEeec-cc------------CcEEEEEeccccHHHHHHhhccc------------eeEEEE
Confidence 334457788999999999886542 22 37999999987655444333321 556789
Q ss_pred ecCCCc
Q 004574 250 RADKPS 255 (744)
Q Consensus 250 spDg~~ 255 (744)
+-||..
T Consensus 132 s~dgs~ 137 (476)
T KOG0646|consen 132 SDDGSH 137 (476)
T ss_pred eCCCcE
Confidence 999974
No 469
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=91.23 E-value=15 Score=35.62 Aligned_cols=51 Identities=16% Similarity=0.165 Sum_probs=30.1
Q ss_pred EEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCCCEEEEEEecCCCCceEEEEECCCCcee
Q 004574 397 LDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQLKILTSKESKTEITQYHILSWPLKKSS 465 (744)
Q Consensus 397 l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~~~~~~~~~~~~~~~~i~~~~~~~g~~~ 465 (744)
+..+|+.+|+. +|..... .... ....+++.+++.. . ...++.+|+.+|+..
T Consensus 185 ~~~~d~~tg~~--~w~~~~~----~~~~--------~~~~~~~~l~~~~-~---~~~l~~~d~~tG~~~ 235 (238)
T PF13360_consen 185 VVAVDLATGEK--LWSKPIS----GIYS--------LPSVDGGTLYVTS-S---DGRLYALDLKTGKVV 235 (238)
T ss_dssp EEEEETTTTEE--EEEECSS-----ECE--------CEECCCTEEEEEE-T---TTEEEEEETTTTEEE
T ss_pred EEEEECCCCCE--EEEecCC----CccC--------CceeeCCEEEEEe-C---CCEEEEEECCCCCEE
Confidence 56669988874 3533211 1111 1345566666544 2 368999999998754
No 470
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=91.17 E-value=17 Score=39.00 Aligned_cols=37 Identities=14% Similarity=-0.114 Sum_probs=20.6
Q ss_pred eecccCCCEEEEE---EecCCCCceEEEEECCCCceeeeec
Q 004574 432 EDINLNQLKILTS---KESKTEITQYHILSWPLKKSSQITN 469 (744)
Q Consensus 432 ~~~s~d~~~~~~~---~~~~~~~~~i~~~~~~~g~~~~lt~ 469 (744)
++++|.| ..+|. ..-...-..|++-.|.++.+.+|.-
T Consensus 347 psiapsg-~~~y~~~g~~~p~w~g~llv~~L~~~~l~r~~l 386 (454)
T TIGR03606 347 PTIAPSS-AYYYKGGEKGITGWENSLLIPSLKRGVIYRIKL 386 (454)
T ss_pred CCcCCce-eEEEecCcccCcccCCCEEEEEcCCCeEEEEEe
Confidence 3556665 23332 1223344567777777777777653
No 471
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=91.16 E-value=23 Score=39.16 Aligned_cols=25 Identities=8% Similarity=0.171 Sum_probs=20.2
Q ss_pred eeeeccceeceeeccCCceEEEeee
Q 004574 294 LHKLDLRFRSVSWCDDSLALVNETW 318 (744)
Q Consensus 294 l~~~~~~~~~~~~SpDg~~l~~~~~ 318 (744)
+.+....+..+.|+++|..|+...+
T Consensus 254 v~dtgm~~vgakWnh~G~vLAvcG~ 278 (1189)
T KOG2041|consen 254 VVDTGMKIVGAKWNHNGAVLAVCGN 278 (1189)
T ss_pred EEecccEeecceecCCCcEEEEccC
Confidence 4555688889999999999887663
No 472
>PF11288 DUF3089: Protein of unknown function (DUF3089); InterPro: IPR021440 This family of proteins has no known function.
Probab=91.03 E-value=0.46 Score=44.55 Aligned_cols=40 Identities=20% Similarity=0.218 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC
Q 004574 575 NDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA 615 (744)
Q Consensus 575 ~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~ 615 (744)
+.|+.++.++.+++..- ...+.|+|||.|+.+...|+.+.
T Consensus 77 y~DV~~AF~~yL~~~n~-GRPfILaGHSQGs~~l~~LL~e~ 116 (207)
T PF11288_consen 77 YSDVRAAFDYYLANYNN-GRPFILAGHSQGSMHLLRLLKEE 116 (207)
T ss_pred HHHHHHHHHHHHHhcCC-CCCEEEEEeChHHHHHHHHHHHH
Confidence 45999999998887532 25799999999999999998764
No 473
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=90.96 E-value=14 Score=38.24 Aligned_cols=46 Identities=13% Similarity=0.055 Sum_probs=30.1
Q ss_pred eeEEEEECCCCceeccc--cC--------CCccccccccceEEecCCcEEEEEecC
Q 004574 62 LRVWIADAETGEAKPLF--ES--------PDICLNAVFGSFVWVNNSTLLIFTIPS 107 (744)
Q Consensus 62 ~~l~~~~~~gg~~~~lt--~~--------~~~~~~~~~~~~~wspDg~~l~~~~~~ 107 (744)
..|+.++.+|...+++. .. .....+.++..++++|||+.|+.....
T Consensus 112 p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~~E~ 167 (326)
T PF13449_consen 112 PRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAAMES 167 (326)
T ss_pred CEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEEEEEECc
Confidence 89999998854434431 11 112234467799999999988776543
No 474
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=90.86 E-value=1.4 Score=47.96 Aligned_cols=61 Identities=7% Similarity=0.048 Sum_probs=39.4
Q ss_pred CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEee
Q 004574 244 MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNET 317 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~ 317 (744)
...++||++|++ ++--+ .+.. +..++|-+.. -+...++.+....+..++|||-+++++...
T Consensus 81 ~t~vAfS~~Gry-vatGE--cG~~------pa~kVw~la~----h~vVAEfvdHKY~vtcvaFsp~~kyvvSVG 141 (1080)
T KOG1408|consen 81 LTCVAFSQNGRY-VATGE--CGRT------PASKVWSLAF----HGVVAEFVDHKYNVTCVAFSPGNKYVVSVG 141 (1080)
T ss_pred eeEEEEcCCCcE-EEecc--cCCC------ccceeeeecc----ccchhhhhhccccceeeeecCCCcEEEeec
Confidence 557899999996 44321 1111 1223444332 345556777888999999999999998654
No 475
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=90.83 E-value=12 Score=40.52 Aligned_cols=268 Identities=13% Similarity=0.114 Sum_probs=0.0
Q ss_pred CceeeecCCCCC-cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCC
Q 004574 20 PEKEVHGYPDGA-KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNS 98 (744)
Q Consensus 20 ~~~~l~~~~~~~-~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg 98 (744)
+.++|.+..... .+....|-| |+-..... .|...+..+|+++++.+..- -..++...+...++.|+.
T Consensus 89 ee~~lk~~~aH~nAifDl~wap-ge~~lVsa---------sGDsT~r~Wdvk~s~l~G~~--~~~GH~~SvkS~cf~~~n 156 (720)
T KOG0321|consen 89 EERQLKKPLAHKNAIFDLKWAP-GESLLVSA---------SGDSTIRPWDVKTSRLVGGR--LNLGHTGSVKSECFMPTN 156 (720)
T ss_pred hhhhhcccccccceeEeeccCC-CceeEEEc---------cCCceeeeeeeccceeecce--eecccccccchhhhccCC
Q ss_pred cEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeeeeEEEEEcC--CC----------
Q 004574 99 TLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTTAQLVLGSL--DG---------- 166 (744)
Q Consensus 99 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~G---------- 166 (744)
..++.+.-++ +++.++|. ++
T Consensus 157 ~~vF~tGgRD------------------------------------------------g~illWD~R~n~~d~~e~~~~~ 188 (720)
T KOG0321|consen 157 PAVFCTGGRD------------------------------------------------GEILLWDCRCNGVDALEEFDNR 188 (720)
T ss_pred CcceeeccCC------------------------------------------------CcEEEEEEeccchhhHHHHhhh
Q ss_pred ----------CeeecCCC--------ceeee---eccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeecc
Q 004574 167 ----------TAKDFGTP--------AVYTA---VEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELC 225 (744)
Q Consensus 167 ----------~~~~l~~~--------~~~~~---~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~ 225 (744)
..++++.. ..+.. ..+..|...|+-....+. -|-+||+.....----
T Consensus 189 ~~~~~n~~ptpskp~~kr~~k~kA~s~ti~ssvTvv~fkDe~tlaSaga~D~------------~iKVWDLRk~~~~~r~ 256 (720)
T KOG0321|consen 189 IYGRHNTAPTPSKPLKKRIRKWKAASNTIFSSVTVVLFKDESTLASAGAADS------------TIKVWDLRKNYTAYRQ 256 (720)
T ss_pred hhccccCCCCCCchhhccccccccccCceeeeeEEEEEeccceeeeccCCCc------------ceEEEeeccccccccc
Q ss_pred CCCCCCCCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceecee
Q 004574 226 DLPPAEDIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVS 305 (744)
Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~ 305 (744)
........++. .....|...+....-|.++++-+ ..+.||.++. .-.+..+..++.+...-....
T Consensus 257 ep~~~~~~~t~-skrs~G~~nL~lDssGt~L~AsC-------------tD~sIy~ynm-~s~s~sP~~~~sg~~~~sf~v 321 (720)
T KOG0321|consen 257 EPRGSDKYPTH-SKRSVGQVNLILDSSGTYLFASC-------------TDNSIYFYNM-RSLSISPVAEFSGKLNSSFYV 321 (720)
T ss_pred CCCcccCccCc-ccceeeeEEEEecCCCCeEEEEe-------------cCCcEEEEec-cccCcCchhhccCcccceeee
Q ss_pred ---eccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcceEEE
Q 004574 306 ---WCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQIYIL 382 (744)
Q Consensus 306 ---~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~~~~~ 382 (744)
.|||+.+++..+++. +.|++.+... .....+..++...+.. ++|.|-...=+...
T Consensus 322 ks~lSpd~~~l~SgSsd~----~ayiw~vs~~-e~~~~~l~Ght~eVt~------V~w~pS~~t~v~Tc----------- 379 (720)
T KOG0321|consen 322 KSELSPDDCSLLSGSSDE----QAYIWVVSSP-EAPPALLLGHTREVTT------VRWLPSATTPVATC----------- 379 (720)
T ss_pred eeecCCCCceEeccCCCc----ceeeeeecCc-cCChhhhhCcceEEEE------EeeccccCCCceee-----------
Q ss_pred EccCCCCCCCCCceEEEEecCCC
Q 004574 383 LNGRGFTPEGNIPFLDLFDINTG 405 (744)
Q Consensus 383 ~~~~g~~~~~~~~~l~~~d~~~g 405 (744)
.+...+.+|++..+
T Consensus 380 ---------SdD~~~kiW~l~~~ 393 (720)
T KOG0321|consen 380 ---------SDDFRVKIWRLSNG 393 (720)
T ss_pred ---------ccCcceEEEeccCc
No 476
>PF07519 Tannase: Tannase and feruloyl esterase; InterPro: IPR011118 This family includes fungal tannase [] and feruloyl esterase [, ]. It also includes several bacterial homologues of unknown function.
Probab=90.57 E-value=0.35 Score=52.44 Aligned_cols=52 Identities=13% Similarity=0.054 Sum_probs=42.8
Q ss_pred HHHHHHHHHHHc-CCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccCC
Q 004574 577 SAEAAVEEVVRR-GVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSGS 628 (744)
Q Consensus 577 d~~~~~~~l~~~-~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~~ 628 (744)
+...+.+.|.+. ....+++-+..|.|-||.-++.+|.++|+.|.++|+.+|.
T Consensus 97 ~~~~~aK~l~~~~Yg~~p~~sY~~GcS~GGRqgl~~AQryP~dfDGIlAgaPA 149 (474)
T PF07519_consen 97 ETTVVAKALIEAFYGKAPKYSYFSGCSTGGRQGLMAAQRYPEDFDGILAGAPA 149 (474)
T ss_pred HHHHHHHHHHHHHhCCCCCceEEEEeCCCcchHHHHHHhChhhcCeEEeCCch
Confidence 445555555554 3456789999999999999999999999999999999985
No 477
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=90.52 E-value=13 Score=40.48 Aligned_cols=49 Identities=16% Similarity=0.005 Sum_probs=30.9
Q ss_pred EEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEEEcCC
Q 004574 278 IYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPG 332 (744)
Q Consensus 278 l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~ 332 (744)
|.+++-. ...+.-+|-.+...+.-+-.++||+.++.++.+ +. |.++|+.
T Consensus 195 lr~wDpr--t~~kimkLrGHTdNVr~ll~~dDGt~~ls~sSD--gt--IrlWdLg 243 (735)
T KOG0308|consen 195 LRLWDPR--TCKKIMKLRGHTDNVRVLLVNDDGTRLLSASSD--GT--IRLWDLG 243 (735)
T ss_pred eEEeccc--cccceeeeeccccceEEEEEcCCCCeEeecCCC--ce--EEeeecc
Confidence 6666542 344445565556667778899999999987632 33 4455554
No 478
>PLN02454 triacylglycerol lipase
Probab=90.47 E-value=0.49 Score=49.31 Aligned_cols=41 Identities=22% Similarity=0.272 Sum_probs=30.3
Q ss_pred hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHh
Q 004574 574 PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAH 614 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~ 614 (744)
..+++...|+.+.++..-..-+|.+.|||+||.+|+.+|..
T Consensus 208 ~r~qvl~~V~~l~~~Yp~~~~sI~vTGHSLGGALAtLaA~d 248 (414)
T PLN02454 208 ARSQLLAKIKELLERYKDEKLSIVLTGHSLGASLATLAAFD 248 (414)
T ss_pred HHHHHHHHHHHHHHhCCCCCceEEEEecCHHHHHHHHHHHH
Confidence 45577788877777543222359999999999999988854
No 479
>PF03283 PAE: Pectinacetylesterase
Probab=90.45 E-value=0.38 Score=49.98 Aligned_cols=37 Identities=14% Similarity=0.286 Sum_probs=31.5
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHH
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLA 613 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~ 613 (744)
-+.+++++|.+++.-++++|.|.|.|+||..++.-+-
T Consensus 139 i~~avl~~l~~~gl~~a~~vlltG~SAGG~g~~~~~d 175 (361)
T PF03283_consen 139 ILRAVLDDLLSNGLPNAKQVLLTGCSAGGLGAILHAD 175 (361)
T ss_pred HHHHHHHHHHHhcCcccceEEEeccChHHHHHHHHHH
Confidence 5788999999986667899999999999999876553
No 480
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=90.43 E-value=20 Score=38.82 Aligned_cols=144 Identities=14% Similarity=0.038 Sum_probs=81.6
Q ss_pred eEEEEEcCCC-CeeecCC---CceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCC
Q 004574 157 AQLVLGSLDG-TAKDFGT---PAVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAED 232 (744)
Q Consensus 157 ~~l~~~~~~G-~~~~l~~---~~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~ 232 (744)
.+|.++..+| +.+.|.. ........||-...+.+++....-. ..+...-+..+|+....+.+++......-.
T Consensus 184 aNl~L~~~~~~klEvL~yirTE~dPl~~~Fs~~~~~qi~tVE~s~s----~~g~~~~d~ciYE~~r~klqrvsvtsipL~ 259 (545)
T PF11768_consen 184 ANLHLLSCSGGKLEVLSYIRTENDPLDVEFSLNQPYQIHTVEQSIS----VKGEPSADSCIYECSRNKLQRVSVTSIPLP 259 (545)
T ss_pred ccEEEEEecCCcEEEEEEEEecCCcEEEEccCCCCcEEEEEEEecC----CCCCceeEEEEEEeecCceeEEEEEEEecC
Confidence 4577777755 6655532 2444566777755555555443210 011222344556666665555533321111
Q ss_pred CCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCce
Q 004574 233 IPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLA 312 (744)
Q Consensus 233 ~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~ 312 (744)
..+...+++|+.+++++-+. .+.|.++|. . .....++........++|.|||..
T Consensus 260 ---------s~v~~ca~sp~E~kLvlGC~-------------DgSiiLyD~---~-~~~t~~~ka~~~P~~iaWHp~gai 313 (545)
T PF11768_consen 260 ---------SQVICCARSPSEDKLVLGCE-------------DGSIILYDT---T-RGVTLLAKAEFIPTLIAWHPDGAI 313 (545)
T ss_pred ---------CcceEEecCcccceEEEEec-------------CCeEEEEEc---C-CCeeeeeeecccceEEEEcCCCcE
Confidence 11557899999998444442 224677775 2 234445555666788999999998
Q ss_pred EEEeeeeeccceeEEEEcCCCC
Q 004574 313 LVNETWYKTSQTRTWLVCPGSK 334 (744)
Q Consensus 313 l~~~~~~~~~~~~l~~~~~~~~ 334 (744)
++..+ ..++|.+.|+.-+
T Consensus 314 ~~V~s----~qGelQ~FD~ALs 331 (545)
T PF11768_consen 314 FVVGS----EQGELQCFDMALS 331 (545)
T ss_pred EEEEc----CCceEEEEEeecC
Confidence 88765 4456777776653
No 481
>PF15525 DUF4652: Domain of unknown function (DUF4652)
Probab=89.95 E-value=5.4 Score=36.24 Aligned_cols=64 Identities=14% Similarity=0.141 Sum_probs=44.4
Q ss_pred cCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCC-ccccccccceEEecCCcEEEEEe
Q 004574 39 SPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPD-ICLNAVFGSFVWVNNSTLLIFTI 105 (744)
Q Consensus 39 SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~-~~~~~~~~~~~wspDg~~l~~~~ 105 (744)
|-+|++=|++-..+.+ +.-.+-+.||+.++.++...+|--... ....| ..+.|-.|...++...
T Consensus 66 s~~~~~saciegkg~~-a~eEgiGkIYIkn~~~~~~~~L~i~~~~~k~sP--K~i~WiDD~~L~vIIG 130 (200)
T PF15525_consen 66 SENGKYSACIEGKGPE-AEEEGIGKIYIKNLNNNNWWSLQIDQNEEKYSP--KYIEWIDDNNLAVIIG 130 (200)
T ss_pred ccCCceeEEEEcCCCc-cccccceeEEEEecCCCceEEEEecCcccccCC--ceeEEecCCcEEEEEc
Confidence 7789999998764432 233667999999999999887743332 22333 3678998888776653
No 482
>PF02089 Palm_thioest: Palmitoyl protein thioesterase; InterPro: IPR002472 Neuronal ceroid lipofuscinoses (NCL) represent a group of encephalopathies that occur in 1 in 12,500 children. Mutations in the palmitoyl protein thioesterase gene causing infantile neuronal ceroid lipofuscinosis []. The most common mutation results in intracellular accumulation of the polypeptide and undetectable enzyme activity in the brain. Direct sequencing of cDNAs derived from brain RNA of INCL patients has shown a mis-sense transversion of A to T at nucleotide position 364, which results in substitution of Trp for Arg at position 122 in the protein - Arg 122 is immediately adjacent to a lipase consensus sequence that contains the putative active site Ser of PPT. The occurrence of this and two other independent mutations in the PPT gene strongly suggests that defects in this gene cause INCL.; GO: 0008474 palmitoyl-(protein) hydrolase activity, 0006464 protein modification process; PDB: 3GRO_B 1PJA_A 1EXW_A 1EH5_A 1EI9_A.
Probab=89.93 E-value=1.5 Score=43.31 Aligned_cols=53 Identities=15% Similarity=0.144 Sum_probs=35.7
Q ss_pred HHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCC-ceeEEEEccCCC
Q 004574 576 DSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPH-LFCCGIARSGSY 629 (744)
Q Consensus 576 ~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~-~~~~~v~~~~~~ 629 (744)
+.+..+++.++..+... +-+.++|+|.||.++=.++.+-|+ .++-.|.+++..
T Consensus 63 ~Qv~~vc~~l~~~p~L~-~G~~~IGfSQGgl~lRa~vq~c~~~~V~nlISlggph 116 (279)
T PF02089_consen 63 DQVEQVCEQLANDPELA-NGFNAIGFSQGGLFLRAYVQRCNDPPVHNLISLGGPH 116 (279)
T ss_dssp HHHHHHHHHHHH-GGGT-T-EEEEEETCHHHHHHHHHHH-TSS-EEEEEEES--T
T ss_pred HHHHHHHHHHhhChhhh-cceeeeeeccccHHHHHHHHHCCCCCceeEEEecCcc
Confidence 35666777777654443 579999999999999888877654 588888887653
No 483
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=89.68 E-value=20 Score=38.53 Aligned_cols=137 Identities=13% Similarity=0.105 Sum_probs=82.8
Q ss_pred eEEEEEcC-CCCee-ecCCC---ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCC
Q 004574 157 AQLVLGSL-DGTAK-DFGTP---AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAE 231 (744)
Q Consensus 157 ~~l~~~~~-~G~~~-~l~~~---~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~ 231 (744)
+.++.+++ .|+.+ ++... +.+....|+.+-..| |+.... .++-.|+....+...+.......
T Consensus 80 g~v~~ys~~~g~it~~~st~~h~~~v~~~~~~~~~~ci-yS~~ad------------~~v~~~~~~~~~~~~~~~~~~~~ 146 (541)
T KOG4547|consen 80 GSVLLYSVAGGEITAKLSTDKHYGNVNEILDAQRLGCI-YSVGAD------------LKVVYILEKEKVIIRIWKEQKPL 146 (541)
T ss_pred ccEEEEEecCCeEEEEEecCCCCCcceeeecccccCce-EecCCc------------eeEEEEecccceeeeeeccCCCc
Confidence 67888888 66543 44422 566667777655544 454332 36667777777666665442211
Q ss_pred CCCcccCCccCCCCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceE-eeeeccceeceeeccC-
Q 004574 232 DIPVCYNSVREGMRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEI-LHKLDLRFRSVSWCDD- 309 (744)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-l~~~~~~~~~~~~SpD- 309 (744)
+..+..+|||+. |+..+ .+|-+++. ++++... ++.....++.++|-.+
T Consensus 147 ------------~~sl~is~D~~~-l~~as--------------~~ik~~~~---~~kevv~~ftgh~s~v~t~~f~~~~ 196 (541)
T KOG4547|consen 147 ------------VSSLCISPDGKI-LLTAS--------------RQIKVLDI---ETKEVVITFTGHGSPVRTLSFTTLI 196 (541)
T ss_pred ------------cceEEEcCCCCE-EEecc--------------ceEEEEEc---cCceEEEEecCCCcceEEEEEEEec
Confidence 557889999986 44332 24777777 5555443 4444667778887766
Q ss_pred ----CceEEEeeeeeccceeEEEEcCCCCCCc
Q 004574 310 ----SLALVNETWYKTSQTRTWLVCPGSKDVA 337 (744)
Q Consensus 310 ----g~~l~~~~~~~~~~~~l~~~~~~~~~~~ 337 (744)
|++++... .......+|.++-.....+
T Consensus 197 ~g~~G~~vLssa-~~~r~i~~w~v~~~~kkks 227 (541)
T KOG4547|consen 197 DGIIGKYVLSSA-AAERGITVWVVEKEDKKKS 227 (541)
T ss_pred cccccceeeecc-ccccceeEEEEEcccccch
Confidence 77766554 3334567788776554333
No 484
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=89.65 E-value=2.9 Score=42.09 Aligned_cols=137 Identities=18% Similarity=0.251 Sum_probs=79.9
Q ss_pred CccceeEeecCCCCCCCCc----eeeecCCCCCcccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCc--eec
Q 004574 3 FFTGIGIHRLLPDDSLGPE----KEVHGYPDGAKINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGE--AKP 76 (744)
Q Consensus 3 ~~~~~~~~~~~~~~~~g~~----~~l~~~~~~~~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~--~~~ 76 (744)
|..+|+++.... |.- +.++ .+..++-...|||.-+-+.+.+. ....|.|.|+..+. +-.
T Consensus 232 c~~~I~lw~~~~----g~W~vd~~Pf~--gH~~SVEDLqWSptE~~vfaScS---------~DgsIrIWDiRs~~~~~~~ 296 (440)
T KOG0302|consen 232 CVKGIHLWEPST----GSWKVDQRPFT--GHTKSVEDLQWSPTEDGVFASCS---------CDGSIRIWDIRSGPKKAAV 296 (440)
T ss_pred cccceEeeeecc----CceeecCcccc--ccccchhhhccCCccCceEEeee---------cCceEEEEEecCCCcccee
Confidence 556677777644 421 2333 34557889999999998888663 23445555666653 333
Q ss_pred cccCCCccccccccceEEecCCcEEEEEecCCCCCCCCCCCCCCCCeeeecCCCcccccccccccCCCchhhhccceeee
Q 004574 77 LFESPDICLNAVFGSFVWVNNSTLLIFTIPSSRRDPPKKTMVPLGPKIQSNEQKNIIISRMTDNLLKDEYDESLFDYYTT 156 (744)
Q Consensus 77 lt~~~~~~~~~~~~~~~wspDg~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (744)
++... +..+.-+.|+-+-..|++...+ |.
T Consensus 297 ~~kAh----~sDVNVISWnr~~~lLasG~Dd--Gt--------------------------------------------- 325 (440)
T KOG0302|consen 297 STKAH----NSDVNVISWNRREPLLASGGDD--GT--------------------------------------------- 325 (440)
T ss_pred Eeecc----CCceeeEEccCCcceeeecCCC--ce---------------------------------------------
Confidence 33222 2235567899777666654211 11
Q ss_pred eEEEEEcC-C-CC-eeecCCC-ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCC
Q 004574 157 AQLVLGSL-D-GT-AKDFGTP-AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTD 217 (744)
Q Consensus 157 ~~l~~~~~-~-G~-~~~l~~~-~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 217 (744)
-.||-+.. . |+ ...++-+ ..+..+.|+|-...++..+... .+|-+||+.
T Consensus 326 ~~iwDLR~~~~~~pVA~fk~Hk~pItsieW~p~e~s~iaasg~D------------~QitiWDls 378 (440)
T KOG0302|consen 326 LSIWDLRQFKSGQPVATFKYHKAPITSIEWHPHEDSVIAASGED------------NQITIWDLS 378 (440)
T ss_pred EEEEEhhhccCCCcceeEEeccCCeeEEEeccccCceEEeccCC------------CcEEEEEee
Confidence 23443333 2 22 2233334 7788999999988877665554 378888764
No 485
>cd00519 Lipase_3 Lipase (class 3). Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation," the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site.
Probab=89.63 E-value=0.69 Score=45.09 Aligned_cols=50 Identities=14% Similarity=0.104 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhC-----CCceeEEEEccCC
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHA-----PHLFCCGIARSGS 628 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~-----p~~~~~~v~~~~~ 628 (744)
++...+..++++. ...+|.+.|||+||.+|..++... +..+.++...+|.
T Consensus 113 ~~~~~~~~~~~~~--p~~~i~vtGHSLGGaiA~l~a~~l~~~~~~~~i~~~tFg~P~ 167 (229)
T cd00519 113 QVLPELKSALKQY--PDYKIIVTGHSLGGALASLLALDLRLRGPGSDVTVYTFGQPR 167 (229)
T ss_pred HHHHHHHHHHhhC--CCceEEEEccCHHHHHHHHHHHHHHhhCCCCceEEEEeCCCC
Confidence 4444444444432 236899999999999998887652 2345655555554
No 486
>KOG3724 consensus Negative regulator of COPII vesicle formation [Intracellular trafficking, secretion, and vesicular transport]
Probab=89.21 E-value=0.42 Score=52.77 Aligned_cols=52 Identities=17% Similarity=0.158 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHcC----CCC---CCcEEEEEechHHHHHHHHHHhC---CCceeEEEEccCC
Q 004574 577 SAEAAVEEVVRRG----VAD---PSRIAVGGHSYGAFMTAHLLAHA---PHLFCCGIARSGS 628 (744)
Q Consensus 577 d~~~~~~~l~~~~----~~d---~~~i~l~G~S~GG~~a~~~~~~~---p~~~~~~v~~~~~ 628 (744)
=+.+||.++.++. ..+ |..|.|+||||||.+|..++... +..+.-++..+.+
T Consensus 158 YV~dAIk~ILslYr~~~e~~~p~P~sVILVGHSMGGiVAra~~tlkn~~~~sVntIITlssP 219 (973)
T KOG3724|consen 158 YVNDAIKYILSLYRGEREYASPLPHSVILVGHSMGGIVARATLTLKNEVQGSVNTIITLSSP 219 (973)
T ss_pred HHHHHHHHHHHHhhcccccCCCCCceEEEEeccchhHHHHHHHhhhhhccchhhhhhhhcCc
Confidence 3556666666532 223 56799999999999986665442 2334444445544
No 487
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=88.95 E-value=25 Score=34.72 Aligned_cols=59 Identities=7% Similarity=0.085 Sum_probs=39.4
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCcEEEE
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNSTLLIF 103 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~~l~~ 103 (744)
.++...|+||-+.|.-+. +...+|.-++.+|.-.+.++-..-. ....+.|.-+|+.++.
T Consensus 87 nvS~LTynp~~rtLFav~---------n~p~~iVElt~~GdlirtiPL~g~~----DpE~Ieyig~n~fvi~ 145 (316)
T COG3204 87 NVSSLTYNPDTRTLFAVT---------NKPAAIVELTKEGDLIRTIPLTGFS----DPETIEYIGGNQFVIV 145 (316)
T ss_pred cccceeeCCCcceEEEec---------CCCceEEEEecCCceEEEecccccC----ChhHeEEecCCEEEEE
Confidence 488999999998887664 4556777778777666655322111 1345688888876654
No 488
>PLN02408 phospholipase A1
Probab=88.91 E-value=0.71 Score=47.47 Aligned_cols=40 Identities=15% Similarity=0.302 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHh
Q 004574 575 NDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAH 614 (744)
Q Consensus 575 ~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~ 614 (744)
.+++.+.|..+.++..-...+|.+.|||+||.+|..+|..
T Consensus 181 r~qVl~eI~~ll~~y~~~~~sI~vTGHSLGGALAtLaA~d 220 (365)
T PLN02408 181 QEMVREEIARLLQSYGDEPLSLTITGHSLGAALATLTAYD 220 (365)
T ss_pred HHHHHHHHHHHHHhcCCCCceEEEeccchHHHHHHHHHHH
Confidence 3456666666666533223479999999999999988765
No 489
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=88.90 E-value=22 Score=35.34 Aligned_cols=72 Identities=10% Similarity=0.080 Sum_probs=44.9
Q ss_pred CCccceecCCCceEEEEEeecCCCCCccCCccceEEeccCCCCCCCCceEeeee--ccceeceeeccCCceEEEeeeeec
Q 004574 244 MRSISWRADKPSTLYWVEAQDRGDANVEVSPRDIIYTQPAEPAEGEKPEILHKL--DLRFRSVSWCDDSLALVNETWYKT 321 (744)
Q Consensus 244 ~~~~~~spDg~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~--~~~~~~~~~SpDg~~l~~~~~~~~ 321 (744)
+.-++.+-+|.. |+-.+. .-..|-+++.. ++....++-.+ ...+..++||||+.+|+.++ +.
T Consensus 184 Iacv~Ln~~Gt~-vATaSt-----------kGTLIRIFdt~--~g~~l~E~RRG~d~A~iy~iaFSp~~s~LavsS-dK- 247 (346)
T KOG2111|consen 184 IACVALNLQGTL-VATAST-----------KGTLIRIFDTE--DGTLLQELRRGVDRADIYCIAFSPNSSWLAVSS-DK- 247 (346)
T ss_pred eeEEEEcCCccE-EEEecc-----------CcEEEEEEEcC--CCcEeeeeecCCchheEEEEEeCCCccEEEEEc-CC-
Confidence 556788889975 554431 12246777773 33333334332 45678899999999999987 33
Q ss_pred cceeEEEEcC
Q 004574 322 SQTRTWLVCP 331 (744)
Q Consensus 322 ~~~~l~~~~~ 331 (744)
+.-||+.+..
T Consensus 248 gTlHiF~l~~ 257 (346)
T KOG2111|consen 248 GTLHIFSLRD 257 (346)
T ss_pred CeEEEEEeec
Confidence 5556666553
No 490
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=88.57 E-value=27 Score=34.53 Aligned_cols=87 Identities=9% Similarity=-0.031 Sum_probs=47.6
Q ss_pred cceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcc
Q 004574 299 LRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQ 378 (744)
Q Consensus 299 ~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~ 378 (744)
..++.++|+|-.+.|+... ...-+-++|+... ...+++..- .. ....++++-||..||....-
T Consensus 233 yPVNai~Fhp~~~tfaTgG----sDG~V~~Wd~~~r-Krl~q~~~~-~~------SI~slsfs~dG~~LAia~sy----- 295 (323)
T KOG1036|consen 233 YPVNAIAFHPIHGTFATGG----SDGIVNIWDLFNR-KRLKQLAKY-ET------SISSLSFSMDGSLLAIASSY----- 295 (323)
T ss_pred EEeceeEeccccceEEecC----CCceEEEccCcch-hhhhhccCC-CC------ceEEEEeccCCCeEEEEech-----
Confidence 4578899999977777654 2234667776652 222222211 11 11226799999999988632
Q ss_pred eEEEEccCCCCCCCCCceEEEEecCCCc
Q 004574 379 IYILLNGRGFTPEGNIPFLDLFDINTGS 406 (744)
Q Consensus 379 ~~~~~~~~g~~~~~~~~~l~~~d~~~g~ 406 (744)
.+- .+..+....+.+++.++.+-+
T Consensus 296 ~ye----~~~~~~~~~~~i~I~~l~d~e 319 (323)
T KOG1036|consen 296 QYE----RADTPTHERNAIFIRDLTDYE 319 (323)
T ss_pred hhh----cCCCCCCCCCceEEEeccccc
Confidence 110 111223345567776665433
No 491
>PLN02571 triacylglycerol lipase
Probab=88.50 E-value=0.73 Score=48.09 Aligned_cols=41 Identities=22% Similarity=0.275 Sum_probs=28.7
Q ss_pred hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHh
Q 004574 574 PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAH 614 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~ 614 (744)
..+++.+.+..+.++..-..-+|.+.|||+||.+|..+|..
T Consensus 206 ar~qvl~eV~~L~~~y~~e~~sI~VTGHSLGGALAtLaA~d 246 (413)
T PLN02571 206 ARDQVLNEVGRLVEKYKDEEISITICGHSLGAALATLNAVD 246 (413)
T ss_pred HHHHHHHHHHHHHHhcCcccccEEEeccchHHHHHHHHHHH
Confidence 34466667766665432112379999999999999988764
No 492
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=88.42 E-value=27 Score=34.29 Aligned_cols=71 Identities=20% Similarity=0.256 Sum_probs=38.7
Q ss_pred eeecCCCCCcccceeec--CCC-CeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCCc
Q 004574 23 EVHGYPDGAKINFVSWS--PDG-KRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNST 99 (744)
Q Consensus 23 ~l~~~~~~~~~~~p~~S--pDG-~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg~ 99 (744)
++.....-.....-.|| ||- .+||..+- ..+.++.-+|.-++.++++......-+. -.-+.-+.|.||.+
T Consensus 37 eiy~Y~ap~~lya~~Ws~~~~~~~rla~gS~----~Ee~~Nkvqiv~ld~~s~e~~~~a~fd~---~YP~tK~~wiPd~~ 109 (364)
T KOG0290|consen 37 EIYTYNAPWPLYAMNWSVRPDKKFRLAVGSF----IEEYNNKVQIVQLDEDSGELVEDANFDH---PYPVTKLMWIPDSK 109 (364)
T ss_pred eEEEecCCCceeeeccccCCCcceeEEEeee----ccccCCeeEEEEEccCCCceeccCCCCC---CCCccceEecCCcc
Confidence 34433333345566777 443 35666554 2233455667667777887665543211 11245678999886
Q ss_pred E
Q 004574 100 L 100 (744)
Q Consensus 100 ~ 100 (744)
-
T Consensus 110 g 110 (364)
T KOG0290|consen 110 G 110 (364)
T ss_pred c
Confidence 3
No 493
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=88.32 E-value=22 Score=35.89 Aligned_cols=64 Identities=14% Similarity=0.102 Sum_probs=38.3
Q ss_pred cceeEeecCCCCCCCCceeeecCCCCC-----cccceeecC-C---CCeEEEeeecccccccCCCceeEEEEECCCCcee
Q 004574 5 TGIGIHRLLPDDSLGPEKEVHGYPDGA-----KINFVSWSP-D---GKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAK 75 (744)
Q Consensus 5 ~~~~~~~~~~~~~~g~~~~l~~~~~~~-----~~~~p~~Sp-D---G~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~ 75 (744)
..|..++|.. ++..+--.+|... ....+++-. + ++.+||++. .+...|.|+|+.+|+.+
T Consensus 34 pKLv~~Dl~t----~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD--------~~~~glIV~dl~~~~s~ 101 (287)
T PF03022_consen 34 PKLVAFDLKT----NQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITD--------SGGPGLIVYDLATGKSW 101 (287)
T ss_dssp -EEEEEETTT----TCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEE--------TTTCEEEEEETTTTEEE
T ss_pred cEEEEEECCC----CcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeC--------CCcCcEEEEEccCCcEE
Confidence 4677777765 5443332244332 223455544 1 237899875 44568999999999999
Q ss_pred ccccC
Q 004574 76 PLFES 80 (744)
Q Consensus 76 ~lt~~ 80 (744)
++...
T Consensus 102 Rv~~~ 106 (287)
T PF03022_consen 102 RVLHN 106 (287)
T ss_dssp EEETC
T ss_pred EEecC
Confidence 98755
No 494
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=88.27 E-value=11 Score=39.07 Aligned_cols=42 Identities=12% Similarity=0.050 Sum_probs=23.6
Q ss_pred eeeeccCCCCceEEEEEeeCCccc-ccccCCCcceEEEEeCCCC
Q 004574 177 YTAVEPSPDQKYVLITSMHRPYSY-KVPCARFSQKVQVWTTDGK 219 (744)
Q Consensus 177 ~~~~~~SpDG~~i~~~~~~~~~~~-~~~~~~~~~~l~~~~~~g~ 219 (744)
-..++|.||| +|++......... .........+|.+++.+|+
T Consensus 116 g~~l~fgpDG-~LYvs~G~~~~~~~~~~~~~~~G~ilri~~dG~ 158 (331)
T PF07995_consen 116 GGGLAFGPDG-KLYVSVGDGGNDDNAQDPNSLRGKILRIDPDGS 158 (331)
T ss_dssp EEEEEE-TTS-EEEEEEB-TTTGGGGCSTTSSTTEEEEEETTSS
T ss_pred CccccCCCCC-cEEEEeCCCCCcccccccccccceEEEecccCc
Confidence 3468999999 6776665443211 1111123457888888875
No 495
>KOG1520 consensus Predicted alkaloid synthase/Surface mucin Hemomucin [General function prediction only]
Probab=88.06 E-value=6.6 Score=40.23 Aligned_cols=130 Identities=9% Similarity=0.067 Sum_probs=82.7
Q ss_pred ceeeeeccCCCCceEEEEEeeCCcccccccCCCcceEEEEeCCCCeeeeccCCCCCCCCCcccCCccCCCCccceecCCC
Q 004574 175 AVYTAVEPSPDQKYVLITSMHRPYSYKVPCARFSQKVQVWTTDGKLVRELCDLPPAEDIPVCYNSVREGMRSISWRADKP 254 (744)
Q Consensus 175 ~~~~~~~~SpDG~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~spDg~ 254 (744)
+...+++|...|..+++. +.. .-|+.++.+|+..+.+++.... .+. .-..++...++|.
T Consensus 115 GRPLGl~f~~~ggdL~Va-DAY------------lGL~~V~p~g~~a~~l~~~~~G--~~~------kf~N~ldI~~~g~ 173 (376)
T KOG1520|consen 115 GRPLGIRFDKKGGDLYVA-DAY------------LGLLKVGPEGGLAELLADEAEG--KPF------KFLNDLDIDPEGV 173 (376)
T ss_pred CCcceEEeccCCCeEEEE-ecc------------eeeEEECCCCCcceeccccccC--eee------eecCceeEcCCCe
Confidence 667788999999777655 321 2478888888876666544211 110 0134567788775
Q ss_pred ceEEEEEeecCCCCC------ccCCccceEEeccCCCCCCCCceEeeeeccceeceeeccCCceEEEeeeeeccceeEEE
Q 004574 255 STLYWVEAQDRGDAN------VEVSPRDIIYTQPAEPAEGEKPEILHKLDLRFRSVSWCDDSLALVNETWYKTSQTRTWL 328 (744)
Q Consensus 255 ~~l~~~~~~~~~~~~------~~~~~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~ 328 (744)
+||++..+.-+.+ ......++++.+|. .+...+.|.+.-.-.+.++.|||+..+++... ...+|.+
T Consensus 174 --vyFTDSSsk~~~rd~~~a~l~g~~~GRl~~YD~---~tK~~~VLld~L~F~NGlaLS~d~sfvl~~Et---~~~ri~r 245 (376)
T KOG1520|consen 174 --VYFTDSSSKYDRRDFVFAALEGDPTGRLFRYDP---STKVTKVLLDGLYFPNGLALSPDGSFVLVAET---TTARIKR 245 (376)
T ss_pred --EEEeccccccchhheEEeeecCCCccceEEecC---cccchhhhhhcccccccccCCCCCCEEEEEee---ccceeee
Confidence 8888765433221 12345677888887 56666677777777888999999999988752 2233444
Q ss_pred EcCCC
Q 004574 329 VCPGS 333 (744)
Q Consensus 329 ~~~~~ 333 (744)
+=+.+
T Consensus 246 ywi~g 250 (376)
T KOG1520|consen 246 YWIKG 250 (376)
T ss_pred eEecC
Confidence 44444
No 496
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=88.00 E-value=7.8 Score=42.69 Aligned_cols=37 Identities=24% Similarity=0.187 Sum_probs=28.2
Q ss_pred EEEEcC-CCCeeecCCC---ceeeeeccCCCCceEEEEEee
Q 004574 159 LVLGSL-DGTAKDFGTP---AVYTAVEPSPDQKYVLITSMH 195 (744)
Q Consensus 159 l~~~~~-~G~~~~l~~~---~~~~~~~~SpDG~~i~~~~~~ 195 (744)
++..+. .|+.+++... .++.++.|+|||+.|++...+
T Consensus 482 ~~~~~~~~g~~~rf~~~P~gaE~tG~~fspDg~tlFvniQH 522 (524)
T PF05787_consen 482 VWAYDPDTGELKRFLVGPNGAEITGPCFSPDGRTLFVNIQH 522 (524)
T ss_pred eeeccccccceeeeccCCCCcccccceECCCCCEEEEEEeC
Confidence 566666 6777777544 678899999999998876543
No 497
>PLN02324 triacylglycerol lipase
Probab=87.94 E-value=0.82 Score=47.65 Aligned_cols=41 Identities=20% Similarity=0.299 Sum_probs=29.6
Q ss_pred hHHHHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHh
Q 004574 574 PNDSAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAH 614 (744)
Q Consensus 574 ~~~d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~ 614 (744)
..+.+.+.|..+.++..-...+|.+.|||+||.+|..+|..
T Consensus 195 areqVl~eV~~L~~~Yp~e~~sItvTGHSLGGALAtLaA~d 235 (415)
T PLN02324 195 AQEQVQGELKRLLELYKNEEISITFTGHSLGAVMSVLSAAD 235 (415)
T ss_pred HHHHHHHHHHHHHHHCCCCCceEEEecCcHHHHHHHHHHHH
Confidence 44567777777776532222479999999999999988754
No 498
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=87.73 E-value=34 Score=38.35 Aligned_cols=51 Identities=24% Similarity=0.372 Sum_probs=33.9
Q ss_pred cccceeecCCCCeEEEeeecccccccCCCceeEEEEECCCCceeccccCCCccccccccceEEecCC
Q 004574 32 KINFVSWSPDGKRIAFSVRVDEEDNVSSCKLRVWIADAETGEAKPLFESPDICLNAVFGSFVWVNNS 98 (744)
Q Consensus 32 ~~~~p~~SpDG~~laf~~~~~~~~~~~~~~~~l~~~~~~gg~~~~lt~~~~~~~~~~~~~~~wspDg 98 (744)
.-...-|||.| .|||. ..+-|.++|..+-+..+...... ..+..+.|+|--
T Consensus 17 N~~A~Dw~~~G-LiAyg-----------shslV~VVDs~s~q~iqsie~h~----s~V~~VrWap~~ 67 (1062)
T KOG1912|consen 17 NRNAADWSPSG-LIAYG-----------SHSLVSVVDSRSLQLIQSIELHQ----SAVTSVRWAPAP 67 (1062)
T ss_pred cccccccCccc-eEEEe-----------cCceEEEEehhhhhhhhccccCc----cceeEEEeccCC
Confidence 35678899988 89994 44778999976655444332222 246677898743
No 499
>PF06259 Abhydrolase_8: Alpha/beta hydrolase; InterPro: IPR010427 This is a family of uncharacterised proteins found in Actinobacteria. Computational analysis suggests that they may belong to the alpha-beta hydrolase family of enzymes, as they are predicted to form the core secondary structures and catalytic machinery common to these proteins []. Genomic context suggests that they may function as lipases, controlling the concentration of their putative phospholipid substrates.
Probab=87.58 E-value=1.5 Score=40.32 Aligned_cols=50 Identities=12% Similarity=0.096 Sum_probs=35.2
Q ss_pred HHHHHHHHHHHcCCCCCCcEEEEEechHHHHHHHHHHhCCCceeEEEEccC
Q 004574 577 SAEAAVEEVVRRGVADPSRIAVGGHSYGAFMTAHLLAHAPHLFCCGIARSG 627 (744)
Q Consensus 577 d~~~~~~~l~~~~~~d~~~i~l~G~S~GG~~a~~~~~~~p~~~~~~v~~~~ 627 (744)
++.++++-|.... ....++.++|||||+.++..++...+-.+..+|++..
T Consensus 93 ~L~~f~~gl~a~~-~~~~~~tv~GHSYGS~v~G~A~~~~~~~vddvv~~GS 142 (177)
T PF06259_consen 93 RLARFLDGLRATH-GPDAHLTVVGHSYGSTVVGLAAQQGGLRVDDVVLVGS 142 (177)
T ss_pred HHHHHHHHhhhhc-CCCCCEEEEEecchhHHHHHHhhhCCCCcccEEEECC
Confidence 5666666666554 3346899999999999998888774455555555443
No 500
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=87.55 E-value=25 Score=34.06 Aligned_cols=134 Identities=10% Similarity=0.039 Sum_probs=69.9
Q ss_pred cceeceeeccCCceEEEeeeeeccceeEEEEcCCCCCCcceeeeccccccccCCCCCCceeeCCCCCeEEEEeeecCCcc
Q 004574 299 LRFRSVSWCDDSLALVNETWYKTSQTRTWLVCPGSKDVAPRVLFDRVFENVYSDPGSPMMTRTSTGTNVIAKIKKENDEQ 378 (744)
Q Consensus 299 ~~~~~~~~SpDg~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~spdg~~l~~~~~~~~~~~ 378 (744)
....+.+.|+|+++++... ...+++++.++....-...++..+..+ +.+..+||.....+|.....
T Consensus 159 ~~~ns~~~snd~~~~~~Vg----ds~~Vf~y~id~~sey~~~~~~a~t~D-----~gF~~S~s~~~~~FAv~~Qd----- 224 (344)
T KOG4532|consen 159 LTQNSLHYSNDPSWGSSVG----DSRRVFRYAIDDESEYIENIYEAPTSD-----HGFYNSFSENDLQFAVVFQD----- 224 (344)
T ss_pred cceeeeEEcCCCceEEEec----CCCcceEEEeCCccceeeeeEecccCC-----CceeeeeccCcceEEEEecC-----
Confidence 4477889999999988765 222455555444212222233333332 34445688888877776522
Q ss_pred eEEEEccCCCCCCCCCceEEEEecCCCceeEEeeccchhhhhheeeeecCCcceecccCC-CEEEEEEecCCCCceEEEE
Q 004574 379 IYILLNGRGFTPEGNIPFLDLFDINTGSKERIWESNREKYFETAVALVFGQGEEDINLNQ-LKILTSKESKTEITQYHIL 457 (744)
Q Consensus 379 ~~~~~~~~g~~~~~~~~~l~~~d~~~g~~~~l~~~~~~~~~~~~~~~~~~~~~~~~s~d~-~~~~~~~~~~~~~~~i~~~ 457 (744)
..+.+||...-.+--.+.+... +...+....+-|||-| -.|+|..+ .-.-+.++
T Consensus 225 ----------------g~~~I~DVR~~~tpm~~~sstr------p~hnGa~R~c~Fsl~g~lDLLf~sE---hfs~~hv~ 279 (344)
T KOG4532|consen 225 ----------------GTCAIYDVRNMATPMAEISSTR------PHHNGAFRVCRFSLYGLLDLLFISE---HFSRVHVV 279 (344)
T ss_pred ----------------CcEEEEEecccccchhhhcccC------CCCCCceEEEEecCCCcceEEEEec---CcceEEEE
Confidence 1367777653322212211111 1223334456677655 23444332 23457777
Q ss_pred ECCCCceeeeecCC
Q 004574 458 SWPLKKSSQITNFP 471 (744)
Q Consensus 458 ~~~~g~~~~lt~~~ 471 (744)
|+.++...++.-++
T Consensus 280 D~R~~~~~q~I~i~ 293 (344)
T KOG4532|consen 280 DTRNYVNHQVIVIP 293 (344)
T ss_pred EcccCceeeEEecC
Confidence 87777665555443
Done!