Query gi|254781093|ref|YP_003065506.1| hypothetical protein CLIBASIA_04975 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 225 No_of_seqs 143 out of 1487 Neff 7.7 Searched_HMMs 39220 Date Mon May 30 04:28:01 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781093.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG2945 Predicted hydrolase of 100.0 0 0 385.6 17.8 205 1-211 4-209 (210) 2 TIGR03100 hydr1_PEP hydrolase, 100.0 2.5E-31 6.4E-36 208.6 21.8 203 4-209 5-274 (274) 3 COG1506 DAP2 Dipeptidyl aminop 100.0 1.2E-29 2.9E-34 198.6 21.1 208 2-213 366-619 (620) 4 pfam00326 Peptidase_S9 Prolyl 100.0 2.4E-27 6.1E-32 184.7 15.2 164 49-213 6-211 (212) 5 KOG4667 consensus 100.0 2.4E-27 6.2E-32 184.6 13.9 205 3-219 12-267 (269) 6 PRK10566 esterase; Provisional 99.9 8.1E-26 2.1E-30 175.5 18.0 192 11-211 10-249 (249) 7 KOG1552 consensus 99.9 1.3E-25 3.2E-30 174.3 18.2 198 5-214 39-256 (258) 8 KOG1455 consensus 99.9 1.8E-24 4.5E-29 167.4 18.2 201 3-210 29-312 (313) 9 PRK10673 hypothetical protein; 99.9 3.3E-23 8.4E-28 159.8 17.1 183 12-210 2-255 (255) 10 pfam01738 DLH Dienelactone hyd 99.9 1.3E-22 3.3E-27 156.2 20.0 191 13-210 1-216 (216) 11 PRK10985 putative hydrolase; P 99.9 8.1E-22 2.1E-26 151.4 17.9 201 5-214 35-324 (325) 12 KOG4391 consensus 99.9 1.2E-22 3.1E-27 156.4 13.7 209 3-221 56-293 (300) 13 COG2267 PldB Lysophospholipase 99.9 2.7E-21 6.8E-26 148.3 20.5 202 3-213 11-297 (298) 14 COG0412 Dienelactone hydrolase 99.9 6.2E-21 1.6E-25 146.1 22.2 205 2-212 3-235 (236) 15 TIGR03343 biphenyl_bphD 2-hydr 99.9 3.6E-22 9.3E-27 153.5 15.8 181 13-209 20-282 (282) 16 TIGR03611 RutD pyrimidine util 99.9 1.6E-21 4E-26 149.7 17.7 178 16-209 4-257 (257) 17 PRK00870 haloalkane dehalogena 99.9 1.3E-21 3.4E-26 150.1 17.3 192 6-210 26-301 (302) 18 PRK10349 carboxylesterase BioH 99.9 9.7E-22 2.5E-26 150.9 16.4 176 16-210 4-256 (256) 19 PRK10749 lysophospholipase L2; 99.9 1.5E-20 3.9E-25 143.7 20.9 200 3-210 32-329 (330) 20 TIGR03056 bchO_mg_che_rel puta 99.9 5.2E-21 1.3E-25 146.5 17.8 180 13-208 16-278 (278) 21 PRK03592 haloalkane dehalogena 99.9 1.2E-19 3.1E-24 138.3 19.2 188 1-210 9-288 (294) 22 KOG2100 consensus 99.8 1.8E-19 4.7E-24 137.2 15.7 201 12-214 509-751 (755) 23 COG0429 Predicted hydrolase of 99.8 1.3E-18 3.4E-23 132.0 17.9 201 5-213 53-343 (345) 24 TIGR03101 hydr2_PEP hydrolase, 99.8 1.2E-18 3.1E-23 132.3 17.5 127 4-135 3-135 (266) 25 PRK11460 esterase YpfH; Provis 99.8 2E-18 5E-23 131.0 18.3 188 16-214 7-212 (230) 26 PRK03204 haloalkane dehalogena 99.8 6.4E-19 1.6E-23 133.9 15.6 183 5-207 18-285 (286) 27 COG1647 Esterase/lipase [Gener 99.8 4.1E-19 1E-23 135.1 13.1 172 27-209 17-243 (243) 28 pfam02230 Abhydrolase_2 Phosph 99.8 7.9E-18 2E-22 127.4 18.6 186 17-211 3-213 (213) 29 PRK11126 acyl-CoA thioester hy 99.8 3.1E-18 8E-23 129.8 14.9 164 24-210 1-242 (242) 30 pfam05448 AXE1 Acetyl xylan es 99.8 1.2E-17 2.9E-22 126.4 16.6 195 3-209 58-319 (320) 31 pfam02129 Peptidase_S15 X-Pro 99.8 1E-16 2.6E-21 120.6 16.7 179 10-191 1-264 (265) 32 TIGR02427 protocat_pcaD 3-oxoa 99.8 2.7E-17 6.9E-22 124.1 13.6 171 20-208 9-256 (256) 33 PRK10162 acetyl esterase; Prov 99.7 4.4E-15 1.1E-19 110.8 20.6 200 2-211 61-319 (319) 34 KOG2281 consensus 99.7 9.6E-16 2.4E-20 114.8 16.1 205 3-209 614-866 (867) 35 KOG1838 consensus 99.7 7.4E-15 1.9E-19 109.5 17.8 201 4-212 96-390 (409) 36 COG3571 Predicted hydrolase of 99.7 4.4E-15 1.1E-19 110.8 16.5 186 15-210 4-211 (213) 37 pfam00561 Abhydrolase_1 alpha/ 99.7 8E-16 2.1E-20 115.3 12.6 138 58-203 1-223 (225) 38 COG0400 Predicted esterase [Ge 99.7 7.4E-15 1.9E-19 109.5 16.6 182 17-211 10-206 (207) 39 pfam07859 Abhydrolase_3 alpha/ 99.7 8E-15 2E-19 109.3 15.5 155 28-193 1-208 (209) 40 KOG3043 consensus 99.7 9.8E-15 2.5E-19 108.7 15.4 193 12-212 27-242 (242) 41 KOG1454 consensus 99.6 6E-14 1.5E-18 104.0 18.0 196 2-211 26-325 (326) 42 KOG4178 consensus 99.6 3.9E-14 1E-18 105.1 16.9 185 13-210 32-320 (322) 43 TIGR01250 pro_imino_pep_2 prol 99.6 1E-13 2.6E-18 102.6 16.7 190 6-208 7-302 (302) 44 COG3458 Acetyl esterase (deace 99.6 4.1E-14 1E-18 105.0 14.1 195 3-209 58-316 (321) 45 COG0657 Aes Esterase/lipase [L 99.6 7.2E-13 1.8E-17 97.5 20.3 193 6-208 56-308 (312) 46 pfam03096 Ndr Ndr family. This 99.6 1.3E-12 3.3E-17 96.0 18.6 190 4-209 2-280 (285) 47 TIGR01738 bioH putative pimelo 99.6 4.9E-14 1.2E-18 104.5 10.7 165 26-207 5-248 (248) 48 COG2936 Predicted acyl esteras 99.6 2.3E-14 6E-19 106.5 9.0 128 3-132 21-157 (563) 49 pfam06342 DUF1057 Alpha/beta h 99.5 2.6E-12 6.6E-17 94.2 17.9 157 6-171 11-237 (297) 50 KOG4409 consensus 99.5 1.9E-12 4.8E-17 95.0 15.7 178 21-210 86-364 (365) 51 pfam10503 Esterase_phd Esteras 99.5 1.2E-12 3.1E-17 96.1 13.8 148 24-174 20-203 (226) 52 COG4757 Predicted alpha/beta h 99.5 3.1E-12 7.9E-17 93.7 13.6 195 3-207 7-280 (281) 53 pfam03403 PAF-AH_p_II isoform 99.5 9.3E-12 2.4E-16 90.8 16.0 192 13-213 86-354 (372) 54 PRK10115 protease 2; Provision 99.4 2.2E-11 5.6E-16 88.5 17.2 212 3-215 418-680 (686) 55 PRK05077 frsA fermentation/res 99.4 1.1E-11 2.9E-16 90.3 14.5 191 2-211 169-413 (414) 56 pfam08840 BAAT_C BAAT / Acyl-C 99.4 2E-11 5E-16 88.8 14.0 131 82-212 3-210 (211) 57 pfam08538 DUF1749 Protein of u 99.4 4E-11 1E-15 87.0 14.8 190 13-208 20-303 (303) 58 TIGR01249 pro_imino_pep_1 prol 99.4 1.3E-11 3.4E-16 89.9 11.4 183 12-209 13-308 (310) 59 KOG2564 consensus 99.4 4.9E-11 1.2E-15 86.5 14.2 195 1-210 50-327 (343) 60 PRK05371 x-prolyl-dipeptidyl a 99.4 5.4E-11 1.4E-15 86.2 14.4 159 52-212 277-524 (769) 61 pfam06500 DUF1100 Alpha/beta h 99.3 8.3E-11 2.1E-15 85.1 14.4 190 3-211 167-410 (411) 62 PRK05855 short chain dehydroge 99.3 1.8E-10 4.5E-15 83.1 16.1 182 9-210 10-292 (582) 63 KOG1515 consensus 99.3 8.8E-10 2.2E-14 78.9 18.7 194 9-210 70-335 (336) 64 pfam03583 LIP Secretory lipase 99.3 1.3E-10 3.4E-15 83.8 14.3 164 48-217 17-289 (291) 65 KOG2382 consensus 99.3 2.2E-10 5.7E-15 82.5 15.4 183 18-211 45-314 (315) 66 pfam00975 Thioesterase Thioest 99.3 5.1E-10 1.3E-14 80.3 14.7 165 28-207 3-224 (224) 67 PRK00175 metX homoserine O-ace 99.2 1.4E-09 3.5E-14 77.7 16.6 189 21-214 43-377 (379) 68 pfam12048 DUF3530 Protein of u 99.2 3.3E-09 8.5E-14 75.4 17.7 201 4-210 58-293 (294) 69 COG3509 LpqC Poly(3-hydroxybut 99.2 1.1E-10 2.9E-15 84.3 10.0 199 5-210 39-307 (312) 70 COG4188 Predicted dienelactone 99.2 2.1E-10 5.4E-15 82.7 10.3 163 24-194 70-298 (365) 71 KOG2984 consensus 99.2 2.1E-10 5.4E-15 82.6 8.8 153 51-210 65-276 (277) 72 pfam06821 DUF1234 Alpha/Beta h 99.1 1.5E-09 3.7E-14 77.6 12.6 142 28-193 1-155 (171) 73 pfam00756 Esterase Putative es 99.1 2.5E-08 6.4E-13 70.1 18.8 116 86-203 95-240 (243) 74 COG1505 Serine proteases of th 99.1 1.9E-09 4.8E-14 76.9 12.5 202 4-211 397-647 (648) 75 pfam05728 UPF0227 Uncharacteri 99.1 4.3E-09 1.1E-13 74.8 14.2 152 28-207 2-186 (187) 76 PRK06765 homoserine O-acetyltr 99.1 1.7E-09 4.3E-14 77.2 11.8 187 22-213 49-385 (387) 77 COG1770 PtrB Protease II [Amin 99.1 4.9E-09 1.3E-13 74.4 14.1 200 4-206 422-672 (682) 78 smart00824 PKS_TE Thioesterase 99.1 4E-09 1E-13 74.9 13.6 165 30-207 1-212 (212) 79 COG3208 GrsT Predicted thioest 99.1 2E-09 5E-14 76.8 11.9 167 27-208 8-234 (244) 80 TIGR01607 PST-A Plasmodium sub 99.1 3.7E-11 9.4E-16 87.2 2.8 157 48-208 65-378 (379) 81 KOG2237 consensus 99.1 1.5E-09 3.9E-14 77.4 11.1 210 3-215 443-710 (712) 82 TIGR02821 fghA_ester_D S-formy 99.1 7E-10 1.8E-14 79.5 9.2 190 14-212 29-273 (279) 83 pfam06057 VirJ Bacterial virul 99.1 3.6E-09 9.2E-14 75.2 12.0 148 47-211 19-192 (192) 84 PRK11071 esterase YqiA; Provis 99.1 1.3E-08 3.3E-13 71.9 14.2 155 26-208 2-189 (190) 85 pfam09752 DUF2048 Uncharacteri 99.1 3.2E-08 8.3E-13 69.5 16.1 173 23-209 89-337 (337) 86 COG3545 Predicted esterase of 99.0 1.3E-08 3.2E-13 72.0 13.4 161 26-210 3-179 (181) 87 KOG3253 consensus 99.0 2.9E-08 7.4E-13 69.8 14.1 178 23-213 174-377 (784) 88 COG0596 MhpC Predicted hydrola 99.0 1.5E-07 3.8E-12 65.5 17.6 169 25-208 21-280 (282) 89 KOG1553 consensus 99.0 1.9E-09 5E-14 76.8 7.2 148 6-163 219-400 (517) 90 COG1073 Hydrolases of the alph 98.9 2E-08 5E-13 70.8 11.5 193 12-212 32-299 (299) 91 KOG4627 consensus 98.9 2E-08 5E-13 70.8 11.0 172 7-191 50-247 (270) 92 KOG2624 consensus 98.9 2.1E-07 5.4E-12 64.6 15.0 203 3-211 50-399 (403) 93 pfam06028 DUF915 Alpha/beta hy 98.9 4.4E-07 1.1E-11 62.6 16.1 176 24-207 6-247 (249) 94 TIGR01836 PHA_synth_III_C poly 98.9 1E-08 2.7E-13 72.4 7.5 152 48-210 94-367 (367) 95 PRK08775 homoserine O-acetyltr 98.8 4.8E-07 1.2E-11 62.5 15.5 186 13-210 46-339 (343) 96 TIGR01840 esterase_phb esteras 98.8 7.9E-08 2E-12 67.1 10.6 150 22-177 15-218 (231) 97 COG4099 Predicted peptidase [G 98.8 2.9E-08 7.5E-13 69.7 8.3 178 7-190 167-365 (387) 98 PRK06489 hypothetical protein; 98.8 1.8E-07 4.6E-12 65.0 12.3 176 25-209 69-356 (358) 99 pfam03959 FSH1 Serine hydrolas 98.8 1.2E-06 3E-11 60.1 16.4 112 81-197 80-205 (210) 100 KOG2551 consensus 98.8 6.1E-08 1.6E-12 67.8 9.5 124 85-218 92-228 (230) 101 pfam12146 Hydrolase_4 Putative 98.8 3.5E-08 8.9E-13 69.3 8.3 73 12-89 3-78 (79) 102 COG2021 MET2 Homoserine acetyl 98.8 8.9E-07 2.3E-11 60.8 14.9 180 23-209 49-367 (368) 103 KOG3101 consensus 98.7 1.4E-07 3.5E-12 65.7 10.0 176 14-193 30-263 (283) 104 PRK07581 hypothetical protein; 98.7 3.8E-07 9.6E-12 63.1 12.1 178 23-208 39-339 (340) 105 COG2272 PnbA Carboxylesterase 98.6 2.9E-07 7.4E-12 63.8 8.9 115 16-135 84-218 (491) 106 pfam07224 Chlorophyllase Chlor 98.6 7.8E-07 2E-11 61.2 10.8 189 18-219 39-283 (307) 107 pfam05677 DUF818 Chlamydia CHL 98.6 7.5E-07 1.9E-11 61.3 10.1 138 16-157 128-301 (364) 108 KOG2112 consensus 98.5 1.9E-06 4.8E-11 58.9 11.1 118 88-209 81-203 (206) 109 pfam05705 DUF829 Eukaryotic pr 98.5 1.6E-05 4E-10 53.3 15.7 170 26-206 1-238 (239) 110 KOG3847 consensus 98.5 4.1E-06 1.1E-10 56.8 12.7 193 12-213 102-374 (399) 111 pfam11144 DUF2920 Protein of u 98.5 2.8E-05 7.1E-10 51.8 16.6 171 12-186 21-332 (403) 112 COG3243 PhaC Poly(3-hydroxyalk 98.5 7.3E-06 1.9E-10 55.3 12.9 185 19-213 100-402 (445) 113 pfam10230 DUF2305 Uncharacteri 98.5 1.5E-05 3.9E-10 53.4 13.9 105 24-133 1-115 (260) 114 PRK13604 luxD acyl transferase 98.4 9.1E-06 2.3E-10 54.7 12.3 180 5-196 11-247 (305) 115 pfam02273 Acyl_transf_2 Acyl t 98.4 1.1E-05 2.7E-10 54.3 12.5 180 5-196 6-242 (294) 116 PRK10439 enterobactin/ferric e 98.4 2.5E-05 6.3E-10 52.1 14.3 187 15-212 180-395 (398) 117 pfam11339 DUF3141 Protein of u 98.3 9.5E-05 2.4E-09 48.6 14.9 158 22-191 66-348 (581) 118 pfam00135 COesterase Carboxyle 98.3 8.3E-06 2.1E-10 55.0 9.5 116 16-132 96-225 (517) 119 COG3946 VirJ Type IV secretory 98.2 0.00012 2.9E-09 48.1 14.5 148 47-212 277-451 (456) 120 cd00707 Pancreat_lipase_like P 98.2 2E-05 5E-10 52.7 10.5 112 22-138 33-151 (275) 121 pfam10142 PhoPQ_related PhoPQ- 98.2 6.5E-06 1.7E-10 55.6 7.4 119 90-214 158-319 (360) 122 cd00312 Esterase_lipase Estera 98.2 1.9E-05 4.8E-10 52.8 8.9 115 16-132 83-211 (493) 123 COG2382 Fes Enterochelin ester 98.1 2.4E-05 6.1E-10 52.2 9.2 185 13-207 83-296 (299) 124 pfam05990 DUF900 Alpha/beta hy 98.1 0.00015 3.8E-09 47.4 12.6 176 21-203 15-227 (230) 125 COG4782 Uncharacterized protei 98.1 0.00013 3.3E-09 47.8 11.8 107 22-135 113-235 (377) 126 KOG4840 consensus 98.1 0.0001 2.6E-09 48.4 11.1 167 44-213 53-292 (299) 127 pfam07819 PGAP1 PGAP1-like pro 97.9 0.00011 2.7E-09 48.3 9.3 127 27-165 6-177 (225) 128 KOG1516 consensus 97.9 7.3E-05 1.9E-09 49.3 8.4 93 25-117 112-212 (545) 129 PRK10252 entF enterobactin syn 97.9 0.0018 4.7E-08 40.9 15.9 171 25-213 1065-1293(1293) 130 TIGR03230 lipo_lipase lipoprot 97.9 0.00017 4.4E-09 47.0 10.2 133 22-158 38-183 (442) 131 KOG3975 consensus 97.9 0.0016 4E-08 41.2 15.0 178 22-208 26-301 (301) 132 COG0627 Predicted esterase [Ge 97.8 0.00056 1.4E-08 43.9 11.2 186 23-213 52-314 (316) 133 COG4814 Uncharacterized protei 97.8 0.0017 4.4E-08 41.0 13.5 175 28-209 48-286 (288) 134 pfam00151 Lipase Lipase. 97.8 0.00028 7E-09 45.8 9.0 133 22-159 68-215 (329) 135 pfam01674 Lipase_2 Lipase (cla 97.7 0.00039 1E-08 44.9 9.4 99 25-133 2-122 (218) 136 TIGR01392 homoserO_Ac_trn homo 97.7 0.00083 2.1E-08 42.9 10.8 64 144-208 329-395 (395) 137 PRK07868 acyl-CoA synthetase; 97.6 0.00011 2.9E-09 48.1 5.5 169 12-191 50-338 (990) 138 COG3150 Predicted esterase [Ge 97.4 0.0077 2E-07 37.1 12.4 155 28-208 2-187 (191) 139 COG3319 Thioesterase domains o 97.3 0.0011 2.7E-08 42.3 7.1 97 27-134 2-103 (257) 140 pfam09994 DUF2235 Uncharacteri 97.3 0.0029 7.4E-08 39.7 9.3 74 46-119 23-102 (261) 141 PRK12467 peptide synthase; Pro 97.3 0.0026 6.5E-08 40.0 9.0 97 23-130 3691-3792(3951) 142 TIGR00976 /NonD hydrolase CocE 97.3 0.0027 6.8E-08 39.9 8.6 120 8-127 3-154 (663) 143 PRK04940 hypothetical protein; 97.2 0.017 4.4E-07 35.0 13.0 115 83-208 41-178 (179) 144 KOG2521 consensus 97.1 0.021 5.4E-07 34.5 15.1 183 21-214 35-294 (350) 145 COG1075 LipA Predicted acetylt 97.1 0.0062 1.6E-07 37.7 9.3 98 27-134 61-164 (336) 146 pfam08386 Abhydrolase_4 TAP-li 97.1 0.0014 3.5E-08 41.6 5.5 59 146-209 34-93 (103) 147 pfam02089 Palm_thioest Palmito 97.0 0.013 3.2E-07 35.8 9.6 102 28-133 8-115 (279) 148 pfam05577 Peptidase_S28 Serine 96.9 0.03 7.5E-07 33.6 11.0 110 23-135 27-149 (433) 149 pfam02450 LACT Lecithin:choles 96.7 0.0035 8.9E-08 39.2 5.2 81 45-134 66-160 (382) 150 COG2819 Predicted hydrolase of 96.7 0.025 6.5E-07 34.0 9.6 110 82-191 117-244 (264) 151 TIGR01838 PHA_synth_I poly(R)- 96.7 0.03 7.7E-07 33.5 9.8 157 48-212 211-496 (541) 152 pfam07082 DUF1350 Protein of u 96.4 0.04 1E-06 32.8 9.1 161 43-214 33-236 (250) 153 pfam06259 DUF1023 Alpha/beta h 96.3 0.021 5.5E-07 34.4 7.3 80 82-161 91-171 (177) 154 KOG1551 consensus 96.3 0.017 4.5E-07 35.0 6.6 190 7-212 96-368 (371) 155 pfam01083 Cutinase Cutinase. 96.2 0.03 7.5E-07 33.6 7.6 148 40-211 22-178 (179) 156 KOG2541 consensus 96.2 0.096 2.5E-06 30.5 10.0 101 25-133 24-127 (296) 157 KOG3724 consensus 96.2 0.098 2.5E-06 30.5 10.0 51 82-133 156-219 (973) 158 pfam11288 DUF3089 Protein of u 95.8 0.016 4E-07 35.2 4.6 65 57-121 45-116 (201) 159 cd00519 Lipase_3 Lipase (class 95.8 0.041 1.1E-06 32.7 6.6 121 23-159 61-194 (229) 160 pfam11187 DUF2974 Protein of u 95.7 0.018 4.6E-07 34.9 4.7 72 60-131 38-120 (224) 161 TIGR03502 lipase_Pla1_cef extr 95.6 0.13 3.2E-06 29.8 8.6 89 25-118 449-573 (792) 162 cd00741 Lipase Lipase. Lipase 95.5 0.07 1.8E-06 31.3 6.9 56 83-139 12-73 (153) 163 pfam06850 PHB_depo_C PHB de-po 95.5 0.01 2.6E-07 36.4 2.5 65 146-210 134-202 (203) 164 pfam01764 Lipase_3 Lipase (cla 95.4 0.024 6.2E-07 34.1 4.4 37 83-120 47-83 (141) 165 pfam07519 Tannase Tannase and 95.4 0.028 7.1E-07 33.7 4.6 66 148-213 331-406 (451) 166 KOG2183 consensus 95.4 0.17 4.3E-06 29.1 8.6 101 24-134 80-202 (492) 167 TIGR01849 PHB_depoly_PhaZ poly 95.4 0.068 1.7E-06 31.4 6.4 174 25-210 109-414 (414) 168 pfam10340 DUF2424 Protein of u 95.1 0.28 7.1E-06 27.7 14.4 119 9-137 103-238 (374) 169 COG4287 PqaA PhoPQ-activated p 95.0 0.029 7.5E-07 33.6 3.7 117 87-209 219-386 (507) 170 COG3673 Uncharacterized conser 94.6 0.35 9E-06 27.1 8.5 90 22-119 28-141 (423) 171 KOG2565 consensus 94.3 0.18 4.5E-06 28.9 6.4 93 28-127 155-257 (469) 172 pfam05057 DUF676 Putative seri 94.3 0.44 1.1E-05 26.5 9.4 85 26-118 6-93 (212) 173 KOG4569 consensus 93.8 0.088 2.2E-06 30.7 4.0 36 83-119 155-190 (336) 174 KOG4388 consensus 93.6 0.58 1.5E-05 25.8 9.8 112 10-130 382-504 (880) 175 KOG2369 consensus 93.1 0.24 6.1E-06 28.1 5.2 83 45-133 125-224 (473) 176 KOG2931 consensus 92.6 0.84 2.1E-05 24.8 16.3 190 3-208 24-304 (326) 177 KOG4540 consensus 91.1 0.43 1.1E-05 26.6 4.6 45 86-132 260-307 (425) 178 COG5153 CVT17 Putative lipase 91.1 0.43 1.1E-05 26.6 4.6 45 86-132 260-307 (425) 179 KOG3967 consensus 87.2 2.4 6E-05 22.1 6.3 195 4-214 75-294 (297) 180 KOG4389 consensus 86.6 1.4 3.5E-05 23.6 4.6 101 17-119 126-238 (601) 181 pfam00450 Peptidase_S10 Serine 86.2 2.7 6.8E-05 21.8 6.7 62 147-208 331-414 (415) 182 pfam05277 DUF726 Protein of un 84.6 2.6 6.7E-05 21.9 5.2 68 93-161 211-288 (343) 183 pfam10605 3HBOH 3HB-oligomer h 82.9 2.2 5.7E-05 22.3 4.3 45 147-191 555-602 (689) 184 COG4553 DepA Poly-beta-hydroxy 80.5 4.6 0.00012 20.4 11.7 64 148-211 341-408 (415) 185 pfam08643 DUF1776 Fungal famil 79.3 4 0.0001 20.7 4.6 32 25-63 3-34 (296) 186 PRK12838 carbamoyl phosphate s 79.2 5.1 0.00013 20.1 7.1 68 48-118 180-254 (356) 187 pfam04301 DUF452 Protein of un 78.8 2.3 5.9E-05 22.2 3.2 33 98-131 55-87 (213) 188 PRK05569 flavodoxin; Provision 78.3 5.4 0.00014 20.0 6.4 66 45-111 18-94 (141) 189 pfam08237 PE-PPE PE-PPE domain 78.1 5.5 0.00014 19.9 5.1 35 82-119 32-66 (227) 190 PRK09739 hypothetical protein; 76.8 4.6 0.00012 20.4 4.3 41 24-67 4-44 (201) 191 cd04962 GT1_like_5 This family 75.9 5 0.00013 20.2 4.3 35 27-65 3-37 (371) 192 COG1663 LpxK Tetraacyldisaccha 73.7 7.2 0.00018 19.2 4.6 47 25-72 45-91 (336) 193 PRK05654 acetyl-CoA carboxylas 71.3 8.2 0.00021 18.9 6.4 139 23-168 122-275 (288) 194 pfam02606 LpxK Tetraacyldisacc 71.2 5.8 0.00015 19.8 3.7 46 26-72 34-79 (318) 195 PRK12564 carbamoyl phosphate s 69.5 9 0.00023 18.6 7.7 68 49-118 185-259 (355) 196 COG0331 FabD (acyl-carrier-pro 68.4 6.4 0.00016 19.5 3.4 32 88-119 73-104 (310) 197 cd01533 4RHOD_Repeat_2 Member 67.7 2 5.1E-05 22.6 0.7 36 23-67 65-100 (109) 198 PRK00652 lpxK tetraacyldisacch 67.0 7.1 0.00018 19.3 3.4 47 26-73 48-94 (334) 199 PRK01906 tetraacyldisaccharide 66.7 10 0.00025 18.4 4.1 47 25-72 54-100 (339) 200 TIGR03282 methan_mark_13 putat 65.5 11 0.00028 18.2 5.6 67 28-110 21-87 (352) 201 KOG3424 consensus 65.3 11 0.00028 18.1 4.7 48 23-73 20-71 (132) 202 smart00827 PKS_AT Acyl transfe 65.0 8.7 0.00022 18.7 3.5 111 88-210 72-211 (298) 203 cd01534 4RHOD_Repeat_3 Member 63.8 12 0.0003 18.0 4.2 34 23-67 55-88 (95) 204 TIGR00128 fabD malonyl CoA-acy 63.8 9.2 0.00023 18.6 3.4 112 98-216 83-228 (295) 205 COG2939 Carboxypeptidase C (ca 63.4 9.1 0.00023 18.6 3.4 74 60-133 149-235 (498) 206 cd01518 RHOD_YceA Member of th 62.4 12 0.00032 17.8 4.8 48 10-67 31-94 (101) 207 KOG2385 consensus 62.2 12 0.00032 17.8 4.8 45 92-136 439-489 (633) 208 TIGR01369 CPSaseII_lrg carbamo 62.1 12 0.00032 17.8 4.4 81 46-135 596-682 (1089) 209 TIGR01368 CPSaseIIsmall carbam 60.4 13 0.00034 17.6 6.4 73 39-118 205-285 (383) 210 TIGR01357 aroB 3-dehydroquinat 59.5 14 0.00035 17.5 7.7 77 25-118 26-102 (361) 211 PRK00313 lpxK tetraacyldisacch 59.0 14 0.00036 17.4 4.3 44 26-70 50-93 (332) 212 PRK05568 flavodoxin; Provision 58.8 14 0.00036 17.4 6.6 64 45-110 18-92 (142) 213 CHL00197 carA carbamoyl-phosph 57.9 15 0.00038 17.3 8.1 74 39-118 203-283 (383) 214 PRK10279 hypothetical protein; 57.7 14 0.00036 17.5 3.5 22 49-74 23-44 (300) 215 KOG1282 consensus 56.5 16 0.0004 17.2 4.4 63 148-210 365-448 (454) 216 cd01522 RHOD_1 Member of the R 53.9 17 0.00044 16.9 4.3 28 23-59 63-90 (117) 217 COG0505 CarA Carbamoylphosphat 53.9 17 0.00044 16.9 7.6 65 49-115 193-264 (368) 218 pfam06441 EHN Epoxide hydrolas 53.2 18 0.00045 16.9 3.9 17 144-160 86-102 (108) 219 cd07225 Pat_PNPLA6_PNPLA7 Pata 52.9 18 0.00046 16.8 3.5 31 87-119 32-62 (306) 220 cd01526 RHOD_ThiF Member of th 50.8 19 0.00049 16.6 3.7 36 23-67 71-106 (122) 221 cd01523 RHOD_Lact_B Member of 49.9 20 0.00051 16.5 3.5 33 24-67 61-93 (100) 222 PRK13869 plasmid-partitioning 49.7 20 0.00051 16.5 4.8 38 26-66 122-159 (405) 223 pfam09142 TruB_C tRNA Pseudour 48.1 19 0.00049 16.6 3.0 23 8-30 33-55 (56) 224 COG0288 CynT Carbonic anhydras 47.6 22 0.00055 16.3 4.8 79 24-120 34-112 (207) 225 COG2885 OmpA Outer membrane pr 46.0 23 0.00059 16.2 6.2 59 46-107 102-162 (190) 226 TIGR03131 malonate_mdcH malona 45.8 23 0.00059 16.2 9.7 28 90-119 68-95 (295) 227 PRK13705 plasmid-partitioning 45.7 23 0.00059 16.1 5.0 43 21-66 102-145 (388) 228 pfam10236 DAP3 Mitochondrial r 44.8 16 0.00041 17.1 2.1 44 150-194 127-171 (274) 229 cd01447 Polysulfide_ST Polysul 44.7 24 0.00062 16.1 3.5 36 22-67 59-94 (103) 230 PRK06756 flavodoxin; Provision 44.6 24 0.00062 16.0 6.5 62 46-107 9-81 (138) 231 cd01521 RHOD_PspE2 Member of t 44.4 15 0.00038 17.3 1.9 11 49-59 82-92 (110) 232 pfam11909 NdhN NADH-quinone ox 44.0 22 0.00056 16.3 2.7 131 24-193 17-147 (154) 233 cd07227 Pat_Fungal_NTE1 Fungal 43.7 25 0.00064 16.0 3.7 18 150-167 161-178 (269) 234 pfam03283 PAE Pectinacetyleste 43.2 26 0.00065 15.9 4.2 35 83-117 138-173 (366) 235 PRK00162 glpE thiosulfate sulf 42.9 26 0.00066 15.9 3.0 11 49-59 74-84 (104) 236 TIGR03453 partition_RepA plasm 42.7 26 0.00066 15.9 4.8 31 36-66 112-142 (387) 237 pfam00698 Acyl_transf_1 Acyl t 42.4 23 0.00058 16.2 2.6 29 90-120 75-103 (319) 238 cd01444 GlpE_ST GlpE sulfurtra 42.3 26 0.00067 15.8 3.3 10 49-58 72-81 (96) 239 TIGR02347 chap_CCT_zeta T-comp 41.6 17 0.00045 16.9 1.9 53 139-193 370-426 (548) 240 pfam05576 Peptidase_S37 PS-10 41.4 25 0.00063 16.0 2.7 71 58-130 89-165 (448) 241 cd01529 4RHOD_Repeats Member o 41.2 27 0.0007 15.7 2.9 35 23-67 55-89 (96) 242 TIGR02802 Pal_lipo peptidoglyc 40.4 28 0.00072 15.7 3.3 55 46-110 19-80 (104) 243 PRK06490 glutamine amidotransf 40.0 29 0.00073 15.6 9.5 88 21-119 8-108 (243) 244 COG0607 PspE Rhodanese-related 40.0 29 0.00073 15.6 3.5 36 22-67 59-94 (110) 245 COG4947 Uncharacterized protei 39.9 29 0.00073 15.6 6.9 104 23-135 25-137 (227) 246 KOG4372 consensus 39.8 29 0.00073 15.6 3.3 21 99-119 149-169 (405) 247 COG0426 FpaA Uncharacterized f 39.6 29 0.00074 15.6 5.0 13 101-113 248-260 (388) 248 KOG2182 consensus 39.6 29 0.00074 15.6 9.7 85 49-133 110-206 (514) 249 cd01519 RHOD_HSP67B2 Member of 38.7 30 0.00077 15.5 3.7 35 23-67 65-99 (106) 250 cd01524 RHOD_Pyr_redox Member 38.3 30 0.00077 15.5 3.6 30 23-61 50-79 (90) 251 cd00884 beta_CA_cladeB Carboni 37.9 31 0.00079 15.4 5.6 81 24-117 24-104 (190) 252 CHL00174 accD acetyl-CoA carbo 37.8 31 0.00079 15.4 7.4 156 11-172 128-299 (305) 253 PRK08762 molybdopterin biosynt 37.8 31 0.00079 15.4 3.2 108 10-129 43-167 (379) 254 KOG1000 consensus 36.2 33 0.00084 15.2 11.0 147 20-192 488-640 (689) 255 TIGR01064 pyruv_kin pyruvate k 35.6 34 0.00086 15.2 5.0 27 56-94 42-68 (513) 256 TIGR01452 PGP_euk phosphoglyco 35.4 34 0.00086 15.2 4.9 51 43-110 45-99 (288) 257 COG0469 PykF Pyruvate kinase [ 35.2 34 0.00087 15.1 4.5 24 49-77 22-45 (477) 258 TIGR01302 IMP_dehydrog inosine 35.0 34 0.00087 15.1 4.9 55 48-121 242-296 (476) 259 COG2513 PrpB PEP phosphonomuta 35.0 34 0.00087 15.1 9.2 46 49-94 30-75 (289) 260 COG4567 Response regulator con 33.5 36 0.00093 15.0 5.8 67 43-109 19-92 (182) 261 pfam06792 UPF0261 Uncharacteri 32.0 39 0.00098 14.8 9.2 27 39-65 8-36 (403) 262 PTZ00071 40S ribosomal protein 31.3 40 0.001 14.8 3.8 47 24-73 23-74 (133) 263 PHA02519 plasmid partition pro 31.3 40 0.001 14.8 4.9 31 36-66 114-145 (387) 264 pfam12531 DUF3731 DNA-K relate 31.2 40 0.001 14.7 3.7 61 150-210 147-213 (249) 265 PRK07053 glutamine amidotransf 31.0 40 0.001 14.7 7.0 83 25-119 3-101 (235) 266 PRK10287 thiosulfate:cyanide s 30.9 40 0.001 14.7 2.9 31 22-61 58-88 (104) 267 pfam06309 Torsin Torsin. This 30.8 40 0.001 14.7 3.9 38 20-60 47-84 (127) 268 TIGR02493 PFLA pyruvate format 30.6 41 0.001 14.7 5.0 40 165-209 152-192 (243) 269 TIGR01717 AMP-nucleosdse AMP n 30.3 25 0.00063 16.0 1.2 39 56-110 268-307 (486) 270 cd01532 4RHOD_Repeat_1 Member 30.1 42 0.0011 14.6 4.5 32 23-61 49-80 (92) 271 PRK10510 putative outer membra 29.8 42 0.0011 14.6 6.2 12 49-60 12-23 (219) 272 pfam05292 MCD Malonyl-CoA deca 28.5 44 0.0011 14.5 2.3 42 98-139 188-233 (354) 273 cd01979 Pchlide_reductase_N Pc 28.1 45 0.0011 14.4 6.4 80 26-117 88-172 (396) 274 TIGR02193 heptsyl_trn_I lipopo 27.2 47 0.0012 14.3 3.0 24 150-173 208-233 (359) 275 pfam12525 DUF3726 Protein of u 27.0 47 0.0012 14.3 2.4 25 65-96 15-39 (80) 276 KOG2170 consensus 26.9 47 0.0012 14.3 3.7 37 21-60 105-141 (344) 277 cd01528 RHOD_2 Member of the R 26.8 47 0.0012 14.3 3.6 18 49-66 7-24 (101) 278 PRK09767 hypothetical protein; 26.7 48 0.0012 14.3 5.8 30 51-94 77-107 (116) 279 KOG2323 consensus 26.5 48 0.0012 14.2 3.7 47 25-84 21-67 (501) 280 cd07228 Pat_NTE_like_bacteria 26.3 48 0.0012 14.2 3.4 35 86-122 16-50 (175) 281 PRK06207 aspartate aminotransf 25.9 49 0.0013 14.2 5.4 39 26-64 181-219 (406) 282 pfam01656 CbiA CobQ/CobB/MinD/ 25.9 49 0.0013 14.2 2.6 29 38-66 8-36 (212) 283 PRK10437 carbonic anhydrase; P 25.7 50 0.0013 14.2 3.1 68 39-117 41-108 (220) 284 PRK08250 glutamine amidotransf 25.6 50 0.0013 14.2 6.4 82 27-119 3-102 (235) 285 KOG4207 consensus 25.5 40 0.001 14.7 1.6 24 56-92 55-78 (256) 286 pfam10561 UPF0565 Uncharacteri 25.4 45 0.0012 14.4 1.8 23 98-120 187-209 (299) 287 COG1752 RssA Predicted esteras 25.2 51 0.0013 14.1 3.9 24 48-75 28-51 (306) 288 TIGR00873 gnd 6-phosphoglucona 24.6 50 0.0013 14.1 1.9 70 147-219 136-217 (480) 289 PRK12815 carB carbamoyl phosph 24.6 52 0.0013 14.0 6.4 68 46-122 577-650 (1068) 290 COG1225 Bcp Peroxiredoxin [Pos 24.4 53 0.0013 14.0 5.6 60 2-64 11-71 (157) 291 PRK09967 putative outer membra 24.1 53 0.0014 14.0 6.6 27 46-74 71-97 (160) 292 PRK04155 chaperone protein Hch 23.8 34 0.00088 15.1 1.0 27 40-67 73-101 (288) 293 COG4822 CbiK Cobalamin biosynt 23.6 54 0.0014 13.9 6.6 123 82-213 117-259 (265) 294 PRK07411 hypothetical protein; 22.6 57 0.0015 13.8 2.5 16 48-63 52-67 (390) 295 cd03802 GT1_AviGT4_like This f 22.6 57 0.0015 13.8 4.2 37 27-65 3-44 (335) 296 TIGR02080 O_succ_thio_ly O-suc 22.5 57 0.0015 13.8 3.5 29 28-63 92-120 (383) 297 TIGR02195 heptsyl_trn_II lipop 22.4 58 0.0015 13.8 5.2 114 23-163 188-309 (361) 298 TIGR01753 flav_short flavodoxi 22.4 58 0.0015 13.8 2.3 71 39-110 8-92 (146) 299 TIGR00456 argS arginyl-tRNA sy 22.4 58 0.0015 13.8 2.6 42 27-68 119-166 (600) 300 pfam09989 DUF2229 CoA enzyme a 22.1 59 0.0015 13.7 2.2 17 49-65 17-33 (218) 301 PRK02399 hypothetical protein; 21.9 59 0.0015 13.7 9.1 27 39-65 9-37 (407) 302 cd07025 Peptidase_S66 LD-Carbo 21.5 60 0.0015 13.7 7.2 22 44-65 14-35 (282) 303 cd03822 GT1_ecORF704_like This 21.5 60 0.0015 13.7 4.1 38 27-67 2-40 (366) 304 PRK12615 galactose-6-phosphate 21.3 61 0.0015 13.6 6.9 110 47-175 16-126 (171) 305 COG0018 ArgS Arginyl-tRNA synt 20.9 62 0.0016 13.6 2.6 43 27-69 118-166 (577) 306 pfam00150 Cellulase Cellulase 20.7 62 0.0016 13.6 6.2 60 36-96 17-78 (276) 307 TIGR02302 aProt_lowcomp conser 20.7 53 0.0014 14.0 1.4 21 11-31 183-203 (899) 308 pfam05706 CDKN3 Cyclin-depende 20.5 63 0.0016 13.5 5.3 51 51-112 94-144 (207) 309 cd07210 Pat_hypo_W_succinogene 20.5 63 0.0016 13.5 4.0 17 150-166 147-163 (221) 310 KOG1153 consensus 20.5 63 0.0016 13.5 3.7 56 77-133 289-351 (501) 311 COG0777 AccD Acetyl-CoA carbox 20.4 64 0.0016 13.5 7.9 137 24-167 121-272 (294) No 1 >COG2945 Predicted hydrolase of the alpha/beta superfamily [General function prediction only] Probab=100.00 E-value=0 Score=385.58 Aligned_cols=205 Identities=60% Similarity=1.056 Sum_probs=194.9 Q ss_pred CCEEEECCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCH Q ss_conf 97089658996187999668887787999957899888899899999999999973984999853155887677852102 Q gi|254781093|r 1 MPEVVFNGPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDG 80 (225) Q Consensus 1 m~~v~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~ 80 (225) |+||+|+||.|+||++|.+++.+.+|++|+|||||+|||+|+|++++++++.|.++||+||||||||+|+|+|+||+|.| T Consensus 4 ~~~v~i~Gp~G~le~~~~~~~~~~~~iAli~HPHPl~gGtm~nkvv~~la~~l~~~G~atlRfNfRgvG~S~G~fD~GiG 83 (210) T COG2945 4 MPTVIINGPAGRLEGRYEPAKTPAAPIALICHPHPLFGGTMNNKVVQTLARALVKRGFATLRFNFRGVGRSQGEFDNGIG 83 (210) T ss_pred CCCEEECCCCCCCEECCCCCCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCC T ss_conf 77478427754211033787788775279638981226766779999999999868963896403325677786568855 Q ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCC Q ss_conf 38999998999986235467405888720368787645038621330122757443332223433477377616865502 Q gi|254781093|r 81 ELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYDFSFLAPCPSSGLIINGSNDTVA 160 (225) Q Consensus 81 E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~~~l~~~~~p~LiIhG~~D~~v 160 (225) |++|+.++++|+++++++...++|+|||||++|++++|.++|++..+++++|+.+.|+++++.|||.|+|+|||+.|+++ T Consensus 84 E~~Da~aaldW~~~~hp~s~~~~l~GfSFGa~Ia~~la~r~~e~~~~is~~p~~~~~dfs~l~P~P~~~lvi~g~~Ddvv 163 (210) T COG2945 84 ELEDAAAALDWLQARHPDSASCWLAGFSFGAYIAMQLAMRRPEILVFISILPPINAYDFSFLAPCPSPGLVIQGDADDVV 163 (210) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHCCCCCCCCEEEECCHHHHH T ss_conf 28889999999986288843453103565899999999746555503540477673013222678987306754703222 Q ss_pred CHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHH-HHC Q ss_conf 899999999998607799607999787984755858999999999999-731 Q gi|254781093|r 161 TTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLD-NSL 211 (225) Q Consensus 161 p~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~-~~L 211 (225) +++...+| +++.+.++++++++||||++|.+++.+.+.+||. +++ T Consensus 164 ~l~~~l~~------~~~~~~~~i~i~~a~HFF~gKl~~l~~~i~d~l~~r~l 209 (210) T COG2945 164 DLVAVLKW------QESIKITVITIPGADHFFHGKLIELRDTIADFLEDRRL 209 (210) T ss_pred CHHHHHHH------HCCCCCCEEEECCCCCEECCCHHHHHHHHHHHHHCCCC T ss_conf 58999876------32788855886698832100178999999987320025 No 2 >TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily. Probab=100.00 E-value=2.5e-31 Score=208.64 Aligned_cols=203 Identities=21% Similarity=0.215 Sum_probs=157.6 Q ss_pred EEECCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHH Q ss_conf 89658996187999668887787999957899888899899999999999973984999853155887677852102389 Q gi|254781093|r 4 VVFNGPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELS 83 (225) Q Consensus 4 v~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~ 83 (225) |.|+...-+|.++++.|.++....+||||+.+.+- ...++.+..+|+.|++.||.|+||||||+|+|+|.+..-..+.+ T Consensus 5 ~~~~~~g~~L~G~l~~p~~~~~~~vli~~~g~~~r-~g~~r~~v~la~~la~~g~~~~rfd~~g~g~s~g~~~~~~~~~~ 83 (274) T TIGR03100 5 LTFSCEGETLVGVLHIPGASHTTGVLIVVGGPQYR-VGSHRQFVLLARRLAEAGFPVLRFDYRGMGDSEGENLGFEGIDA 83 (274) T ss_pred EEEECCCEEEEEEECCCCCCCCCEEEEECCCCCCC-CCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCHHHHHH T ss_conf 99965992899998379988887289988997666-67642799999999978987999758888788988787255489 Q ss_pred HHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCC------------------------- Q ss_conf 9999899998623546740588872036878764503862133012275744333------------------------- Q gi|254781093|r 84 DAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYD------------------------- 138 (225) Q Consensus 84 D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~------------------------- 138 (225) |+.++++|+++..++.++|+++|+|+||++++.+|.++++|+++|++.|-..... T Consensus 84 D~~aA~d~~~~~~~~~~~ivl~G~c~Ga~~al~~a~~~~~v~glvL~nPw~~~~~~~~~~~~~~~y~~rL~~~~~w~kll 163 (274) T TIGR03100 84 DIAAAIDAFREAAPHLRRIVAWGLCDAASAALLYAPADLRVAGLVLLNPWVRTEAAQAASRIRHYYLGQLLSADFWRKLL 163 (274) T ss_pred HHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHCHHHHHHHC T ss_conf 99999999986377867379999768799999875327884379997774465443101455567898850868776641 Q ss_pred -----C-------------------------------CCCCCCCCCEEEEECCCCCCCCHH----HHHHHHHHHHHCCCC Q ss_conf -----2-------------------------------223433477377616865502899----999999998607799 Q gi|254781093|r 139 -----F-------------------------------SFLAPCPSSGLIINGSNDTVATTS----DVKDLVNKLMNQKGI 178 (225) Q Consensus 139 -----~-------------------------------~~l~~~~~p~LiIhG~~D~~vp~~----~~~~~~~~l~~~~~~ 178 (225) + ..+.....|+|+|||++|..++.- ......+.+.. .. T Consensus 164 ~g~v~~~~~~~gl~~~l~~~~~~~~~~~~~~~~~r~~~~l~~~~gpvLiV~g~~D~t~~~f~~~~~~~~~~~~l~~--~p 241 (274) T TIGR03100 164 SGEVNLGSSLRGLGDALLKARQKGDEVAHGGLAERMKAGLERFQGPVLFILSGNDLTAQEFADSVLGEPAWRGALE--DP 241 (274) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHC--CC T ss_conf 5765367776647888764210356654214789999999855998899988986740343666504899998620--89 Q ss_pred CEEEEEECCCCCCCCC--CHHHHHHHHHHHHHH Q ss_conf 6079997879847558--589999999999997 Q gi|254781093|r 179 SITHKVIPDANHFFIG--KVDELINECAHYLDN 209 (225) Q Consensus 179 ~~~~~~i~ga~H~f~~--~~~~l~~~i~~fl~~ 209 (225) .++++++|+|||.|+. +.+++.+.+.+||++ T Consensus 242 ~~~~v~L~~ADHtFS~~~~r~~v~~~t~~Wl~r 274 (274) T TIGR03100 242 GIERVEIDGADHTFSDRVWREWVAARTTEWLRR 274 (274) T ss_pred CCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 843998189998889889999999999999749 No 3 >COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism] Probab=99.97 E-value=1.2e-29 Score=198.64 Aligned_cols=208 Identities=21% Similarity=0.346 Sum_probs=163.5 Q ss_pred CEEEECCCCC-EEEEEEECCCCCCC----CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC- Q ss_conf 7089658996-18799966888778----79999578998888998999999999999739849998531558876778- Q gi|254781093|r 2 PEVVFNGPSG-RLEGRYQPSTNPNA----PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF- 75 (225) Q Consensus 2 ~~v~i~g~~G-~l~~~~~~~~~~~~----~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~- 75 (225) +.+.+++.+| .+.+++..|.+... |++|+.|+-|. .+....+....+.|+.+||+|+.+||||+++..-+| T Consensus 366 e~~~~~~~dG~~i~~~l~~P~~~~~~k~yP~i~~~hGGP~---~~~~~~~~~~~q~~~~~G~~V~~~n~RGS~GyG~~F~ 442 (620) T COG1506 366 EPVTYKSNDGETIHGWLYKPPGFDPRKKYPLIVYIHGGPS---AQVGYSFNPEIQVLASAGYAVLAPNYRGSTGYGREFA 442 (620) T ss_pred EEEEEECCCCCEEEEEEEECCCCCCCCCCCEEEEECCCCC---CCCCCCCCHHHHHHHHCCEEEEECCCCCCCCCCHHHH T ss_conf 4899867998488899980688786677888999788986---3654650588899985992999426888788523334 Q ss_pred -----CCCCHHHHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCC----------- Q ss_conf -----52102389999989999862-3546740588872036878764503862133012275744333----------- Q gi|254781093|r 76 -----DYGDGELSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYD----------- 138 (225) Q Consensus 76 -----~~g~~E~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~----------- 138 (225) +.|..+.+|+.++++|+.++ ..|..+++++|+||||+++++++...+..++.+..++..+... T Consensus 443 ~~~~~~~g~~~~~D~~~~~~~l~~~~~~d~~ri~i~G~SyGGymtl~~~~~~~~f~a~~~~~~~~~~~~~~~~~~~~~~~ 522 (620) T COG1506 443 DAIRGDWGGVDLEDLIAAVDALVKLPLVDPERIGITGGSYGGYMTLLAATKTPRFKAAVAVAGGVDWLLYFGESTEGLRF 522 (620) T ss_pred HHCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHCCCHHHCC T ss_conf 43036657675999999999999778866102599820889999999986588625999915641035664036714314 Q ss_pred ---------------------CCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC-- Q ss_conf ---------------------222343347737761686550289999999999860779960799978798475585-- Q gi|254781093|r 139 ---------------------FSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGK-- 195 (225) Q Consensus 139 ---------------------~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~-- 195 (225) .........|+|+|||+.|..||+++++.++++|+. ++.+++++++|+.+|.|... T Consensus 523 ~~~~~~~~~~~~~~~~~~~sp~~~~~~i~~P~LliHG~~D~~v~~~q~~~~~~aL~~-~g~~~~~~~~p~e~H~~~~~~~ 601 (620) T COG1506 523 DPEENGGGPPEDREKYEDRSPIFYADNIKTPLLLIHGEEDDRVPIEQAEQLVDALKR-KGKPVELVVFPDEGHGFSRPEN 601 (620) T ss_pred CHHHHCCCCCCCHHHHHHHCCHHHHHCCCCCEEEECCCCCCCCCHHHHHHHHHHHHH-CCCEEEEEEECCCCCCCCCHHH T ss_conf 878764687544588876295635402579879983278878897999999999997-7975899995799988784688 Q ss_pred HHHHHHHHHHHHHHHCCC Q ss_conf 899999999999973122 Q gi|254781093|r 196 VDELINECAHYLDNSLDE 213 (225) Q Consensus 196 ~~~l~~~i~~fl~~~L~~ 213 (225) .....+.+.+|+++++.. T Consensus 602 ~~~~~~~~~~~~~~~~~~ 619 (620) T COG1506 602 RVKVLKEILDWFKRHLKQ 619 (620) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 999999999999987457 No 4 >pfam00326 Peptidase_S9 Prolyl oligopeptidase family. Probab=99.95 E-value=2.4e-27 Score=184.66 Aligned_cols=164 Identities=23% Similarity=0.290 Sum_probs=131.5 Q ss_pred HHHHHHHCCCEEEEEEECCCCCCCCCC------CCCCHHHHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCC Q ss_conf 999999739849998531558876778------52102389999989999862-35467405888720368787645038 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNFRGIGRSEGEF------DYGDGELSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRR 121 (225) Q Consensus 49 la~~l~~~G~~vl~fd~RG~G~S~G~~------~~g~~E~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~ 121 (225) ..+.|+++||+|+.+||||++...-+| +.|..|++|+.++++|++++ ..|+++|+++|+|+||++++.++.++ T Consensus 6 ~~~~~a~~Gy~V~~~n~RGs~g~G~~~~~~~~~~~G~~~~~D~~~~v~~l~~~~~iD~~RI~v~G~S~GG~la~~~~~~~ 85 (212) T pfam00326 6 NLQLLADRGYVVAVANGRGSGGYGRAWHDAGKGDLGQNEFDDFIAAAEYLIAQGYVDPDRLAIWGGSYGGYLTGAALNQR 85 (212) T ss_pred HHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCHHHEEEEEECCCHHHHHHHHCCC T ss_conf 99999969948999889999620799999887413868799999999999986998855748976762209889986048 Q ss_pred CC-HHEEEECCCCCCCCC---------------------------CCCC---C--CCCCCEEEEECCCCCCCCHHHHHHH Q ss_conf 62-133012275744333---------------------------2223---4--3347737761686550289999999 Q gi|254781093|r 122 PE-INGFISVAPQPKSYD---------------------------FSFL---A--PCPSSGLIINGSNDTVATTSDVKDL 168 (225) Q Consensus 122 p~-i~~~v~isp~~~~~~---------------------------~~~l---~--~~~~p~LiIhG~~D~~vp~~~~~~~ 168 (225) |+ ++++|+.+|..+... .+.+ . ....|+|+|||++|++||++++.++ T Consensus 86 ~~~~~a~v~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~P~LiihG~~D~~Vp~~~s~~l 165 (212) T pfam00326 86 PDLFKAAVAVVPVVDWLTYMSDTSPFTERYMEWGNPWDNEEGYRYLSPYSPYDNVKAYPPLLLIHGLHDDRVPPAEALKL 165 (212) T ss_pred CCHHHHHCCCCCCCCHHHHCCCCCCHHHHHHCCCCCCCCHHHHHHHCCCCHHHHHCCCCCEEEEECCCCCCCCHHHHHHH T ss_conf 51333200257731388864687510467741799533999999858657455506889989996899999898999999 Q ss_pred HHHHHHCCCCCEEEEEECCCCCCCCCCH--HHHHHHHHHHHHHHCCC Q ss_conf 9998607799607999787984755858--99999999999973122 Q gi|254781093|r 169 VNKLMNQKGISITHKVIPDANHFFIGKV--DELINECAHYLDNSLDE 213 (225) Q Consensus 169 ~~~l~~~~~~~~~~~~i~ga~H~f~~~~--~~l~~~i~~fl~~~L~~ 213 (225) +++++.. ++++++.++||++|.|.+.+ ....+.+.+|++++|.. T Consensus 166 ~~~L~~~-g~~~~l~~~p~~gHg~~~~~~~~~~~~~~~~~~~~~l~~ 211 (212) T pfam00326 166 VAALQKT-GKNVLLLIFPDEGHGGGKPRNKREEYARELAFLLKVLGG 211 (212) T ss_pred HHHHHHC-CCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 9999968-999899996948979989899999999999999997189 No 5 >KOG4667 consensus Probab=99.95 E-value=2.4e-27 Score=184.62 Aligned_cols=205 Identities=21% Similarity=0.386 Sum_probs=161.4 Q ss_pred EEEECCCCC-EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCC-- Q ss_conf 089658996-18799966888778799995789988889989999999999997398499985315588767785210-- Q gi|254781093|r 3 EVVFNGPSG-RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGD-- 79 (225) Q Consensus 3 ~v~i~g~~G-~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~-- 79 (225) .|.||...+ .|.+.++..+.. -+||+|| +|-.+.++.++..+|.+|++.||.++||||||.|+|+|+|++|. T Consensus 12 ~ivi~n~~ne~lvg~lh~tgs~--e~vvlcH---GfrS~Kn~~~~~~vA~~~e~~gis~fRfDF~GnGeS~gsf~~Gn~~ 86 (269) T KOG4667 12 KIVIPNSRNEKLVGLLHETGST--EIVVLCH---GFRSHKNAIIMKNVAKALEKEGISAFRFDFSGNGESEGSFYYGNYN 86 (269) T ss_pred EEEECCCCCCHHHCCEECCCCC--EEEEEEE---CCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCC T ss_conf 9981067772111301013883--3999950---4544541589999999998637248999855787767740467663 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCC------------------------- Q ss_conf 2389999989999862354674058887203687876450386213301227574------------------------- Q gi|254781093|r 80 GELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQP------------------------- 134 (225) Q Consensus 80 ~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~------------------------- 134 (225) .|.+|+..|++|+...+.-. + +++|||-||.+++.++.+.+++..+|-++.-- T Consensus 87 ~eadDL~sV~q~~s~~nr~v-~-vi~gHSkGg~Vvl~ya~K~~d~~~viNcsGRydl~~~I~eRlg~~~l~~ike~Gfid 164 (269) T KOG4667 87 TEADDLHSVIQYFSNSNRVV-P-VILGHSKGGDVVLLYASKYHDIRNVINCSGRYDLKNGINERLGEDYLERIKEQGFID 164 (269) T ss_pred CHHHHHHHHHHHHCCCCEEE-E-EEEEECCCCHHHHHHHHHHCCCHHEEECCCCCCHHCCHHHHHCCCHHHHHHHCCCEE T ss_conf 22778999999860676678-8-998506752899998876058023687456412011266551332899998588430 Q ss_pred -----CCCCC--------CC----------CCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf -----43332--------22----------34334773776168655028999999999986077996079997879847 Q gi|254781093|r 135 -----KSYDF--------SF----------LAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF 191 (225) Q Consensus 135 -----~~~~~--------~~----------l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~ 191 (225) ..|.+ +. -.+.+.++|-+||..|++||++.+.++++.++.. .+++|+||||. T Consensus 165 ~~~rkG~y~~rvt~eSlmdrLntd~h~aclkId~~C~VLTvhGs~D~IVPve~AkefAk~i~nH-----~L~iIEgADHn 239 (269) T KOG4667 165 VGPRKGKYGYRVTEESLMDRLNTDIHEACLKIDKQCRVLTVHGSEDEIVPVEDAKEFAKIIPNH-----KLEIIEGADHN 239 (269) T ss_pred CCCCCCCCCCEECHHHHHHHHHCHHHHHHCCCCCCCCEEEEECCCCCEEECHHHHHHHHHCCCC-----CEEEECCCCCC T ss_conf 5765698672642788887872304366348676671699843678366615479999856688-----45873488767 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 5585899999999999973122100124 Q gi|254781093|r 192 FIGKVDELINECAHYLDNSLDEKFTLLK 219 (225) Q Consensus 192 f~~~~~~l~~~i~~fl~~~L~~~~~~~~ 219 (225) |.+++.++......|.+-+.+++..+.. T Consensus 240 yt~~q~~l~~lgl~f~k~r~n~s~~~~~ 267 (269) T KOG4667 240 YTGHQSQLVSLGLEFIKTRINESNLLVD 267 (269) T ss_pred CCCHHHHHHHHCCEEEEEEECCCCCCCC T ss_conf 4210546766100468753024762002 No 6 >PRK10566 esterase; Provisional Probab=99.95 E-value=8.1e-26 Score=175.46 Aligned_cols=192 Identities=16% Similarity=0.162 Sum_probs=141.5 Q ss_pred CEEEEEEECC-CC--CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCC-CCCCCC--------C Q ss_conf 6187999668-88--7787999957899888899899999999999973984999853155887-677852--------1 Q gi|254781093|r 11 GRLEGRYQPS-TN--PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRS-EGEFDY--------G 78 (225) Q Consensus 11 G~l~~~~~~~-~~--~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S-~G~~~~--------g 78 (225) +.+.++..-| +. .+-|+||+.|++| ++.+. ....++.|+++||.|+.||.+|+|.+ .|.... . T Consensus 10 ~~ip~l~~~~~~~~~~~lP~Vvf~HG~~---~~~e~--~~~~~~~la~~Gy~V~~Pd~~~hG~r~~~~~~~~~~~fw~~~ 84 (249) T PRK10566 10 AGIEVLHAYPVGQRDQPLPCVIFYHGFT---SSSLV--YSYFAVALAQAGFRVIMPDAPDHGARFSGDEARRLNQFWQIL 84 (249) T ss_pred CCCCEEEEECCCCCCCCCCEEEEECCCC---CCCCC--HHHHHHHHHHCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 7812699846877889877899968996---64001--157999999789889963688887566776111102469999 Q ss_pred CHHHHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCC------------------- Q ss_conf 023899999899998623-546740588872036878764503862133012275744333------------------- Q gi|254781093|r 79 DGELSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYD------------------- 138 (225) Q Consensus 79 ~~E~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~------------------- 138 (225) ..+++|+...++|+++++ .|+++|+++|+||||++|+.+++++|++++.+++.+...... T Consensus 85 ~~~~~e~~~~v~~l~~~g~vD~~RigI~G~S~GG~tal~ala~~p~i~aav~~~G~~~~~~l~~~~~~~~~~~~~~~~~~ 164 (249) T PRK10566 85 LQNMQEFPTLRAAIQQENWLLDDRLAVGGASMGGMTALGIMARHPEVKCAASLMGSGYFTSLARTLFPPLIPETPAQQAE 164 (249) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEECCHHHHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHCCCCCCCCCCHHHHH T ss_conf 86299999999999984887753369982388999999999619860599999566359999986074544477300667 Q ss_pred ----CCCC---------CCC-CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCC--EEEEEECCCCCCCCCCHHHHHHH Q ss_conf ----2223---------433-4773776168655028999999999986077996--07999787984755858999999 Q gi|254781093|r 139 ----FSFL---------APC-PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGIS--ITHKVIPDANHFFIGKVDELINE 202 (225) Q Consensus 139 ----~~~l---------~~~-~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~--~~~~~i~ga~H~f~~~~~~l~~~ 202 (225) .+.+ ... ..|+|++||++|++||++++++++++++.+ +.+ +++...||++|.++ .+..++ T Consensus 165 ~~~~~~~l~~~d~~~~~~~l~~rPLLl~HG~~D~~VP~~qs~~~~~aLr~~-g~~~~l~~~~~eg~gH~vs---~~~l~~ 240 (249) T PRK10566 165 FDNIIAPLAEWEVTHQLEQLADRPLLLWHGEADDVVPAAESLRLQQALRET-GLDKNLTCLWQPGVRHRIT---PEALDA 240 (249) T ss_pred HHHHHCCHHCCCHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHC-CCCCEEEEEECCCCCCCCC---HHHHHH T ss_conf 875425021048444276536998699835889987989999999999975-9980079997399998578---999999 Q ss_pred HHHHHHHHC Q ss_conf 999999731 Q gi|254781093|r 203 CAHYLDNSL 211 (225) Q Consensus 203 i~~fl~~~L 211 (225) +.+|++++| T Consensus 241 ~v~ff~~hL 249 (249) T PRK10566 241 TVAFFRQHL 249 (249) T ss_pred HHHHHHHHC T ss_conf 999999749 No 7 >KOG1552 consensus Probab=99.94 E-value=1.3e-25 Score=174.32 Aligned_cols=198 Identities=20% Similarity=0.292 Sum_probs=158.4 Q ss_pred EECCCCC-EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHH-HHHHHHHHHH-CCCEEEEEEECCCCCCCCCCCCCCHH Q ss_conf 9658996-1879996688877879999578998888998999-9999999997-39849998531558876778521023 Q gi|254781093|r 5 VFNGPSG-RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNI-VYQLFYLFQQ-RGFVSLRFNFRGIGRSEGEFDYGDGE 81 (225) Q Consensus 5 ~i~g~~G-~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~-~~~la~~l~~-~G~~vl~fd~RG~G~S~G~~~~g~~E 81 (225) ..+...| .+.+.|..+.....+++++|| ||..+.- +..+...|.. ..++++.|||+|+|.|+|+.++ ..+ T Consensus 39 ~~~t~rgn~~~~~y~~~~~~~~~~lly~h------GNa~Dlgq~~~~~~~l~~~ln~nv~~~DYSGyG~S~G~psE-~n~ 111 (258) T KOG1552 39 KVKTSRGNEIVCMYVRPPEAAHPTLLYSH------GNAADLGQMVELFKELSIFLNCNVVSYDYSGYGRSSGKPSE-RNL 111 (258) T ss_pred EEECCCCCEEEEEEECCCCCCCCEEEECC------CCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC-CCC T ss_conf 86057898889999737645561799747------74244688999999876404503899834555567897563-340 Q ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCC-----------------CCCCCC Q ss_conf 899999899998623546740588872036878764503862133012275744333-----------------222343 Q gi|254781093|r 82 LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYD-----------------FSFLAP 144 (225) Q Consensus 82 ~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~-----------------~~~l~~ 144 (225) .+|+.||.+|+++.+....+|+|.|+|+|+..++.+|++.| ++++|+.||..+..+ .+.... T Consensus 112 y~Di~avye~Lr~~~g~~~~Iil~G~SiGt~~tv~Lasr~~-~~alVL~SPf~S~~rv~~~~~~~~~~~d~f~~i~kI~~ 190 (258) T KOG1552 112 YADIKAVYEWLRNRYGSPERIILYGQSIGTVPTVDLASRYP-LAAVVLHSPFTSGMRVAFPDTKTTYCFDAFPNIEKISK 190 (258) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHCCC-CCEEEEECCCHHHHHHHCCCCCEEEEECCCCCCCCCEE T ss_conf 55499999999975599861899974687235556762188-72599946423325560148635786012211576135 Q ss_pred CCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 3477377616865502899999999998607799607999787984755858999999999999731221 Q gi|254781093|r 145 CPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSLDEK 214 (225) Q Consensus 145 ~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~~~ 214 (225) ...|+|+|||++|++||+++..+++++++.+ ++...+.|++|......+++.+.+..|+....++. T Consensus 191 i~~PVLiiHgtdDevv~~sHg~~Lye~~k~~----~epl~v~g~gH~~~~~~~~yi~~l~~f~~~~~~~~ 256 (258) T KOG1552 191 ITCPVLIIHGTDDEVVDFSHGKALYERCKEK----VEPLWVKGAGHNDIELYPEYIEHLRRFISSVLPSQ 256 (258) T ss_pred ECCCEEEEECCCCCEECCCCCHHHHHHCCCC----CCCCEEECCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 0377899945667310454358899754335----78707866987654447899999999998752557 No 8 >KOG1455 consensus Probab=99.93 E-value=1.8e-24 Score=167.41 Aligned_cols=201 Identities=22% Similarity=0.339 Sum_probs=150.7 Q ss_pred EEEECCCCC-EEEEEEECC-C-CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC- Q ss_conf 089658996-187999668-8-877879999578998888998999999999999739849998531558876778521- Q gi|254781093|r 3 EVVFNGPSG-RLEGRYQPS-T-NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYG- 78 (225) Q Consensus 3 ~v~i~g~~G-~l~~~~~~~-~-~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g- 78 (225) +-++.+++| .|-+-+..| . ..+++.+++||+ ||+ .....+..+|..|+..||.|+.+|++|+|+|+|..-+- T Consensus 29 ~~~~~n~rG~~lft~~W~p~~~~~pr~lv~~~HG---~g~-~~s~~~~~~a~~l~~~g~~v~a~D~~GhG~SdGl~~yi~ 104 (313) T KOG1455 29 ESFFTNPRGAKLFTQSWLPLSGTEPRGLVFLCHG---YGE-HSSWRYQSTAKRLAKSGFAVYAIDYEGHGRSDGLHAYVP 104 (313) T ss_pred EEEEECCCCCEEEEEECCCCCCCCCCEEEEEECC---CCC-CCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCC T ss_conf 0137747778767774356789987469999747---763-210468999999985797489754257776788744677 Q ss_pred --CHHHHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCCC--------------- Q ss_conf --023899999899998623-546740588872036878764503862-1330122757443332--------------- Q gi|254781093|r 79 --DGELSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYDF--------------- 139 (225) Q Consensus 79 --~~E~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~~--------------- 139 (225) ..-++|+.+-++.++.+. ....+++|+|+||||+|++.++.++|. ..|+|+++|.....+- T Consensus 105 ~~d~~v~D~~~~~~~i~~~~e~~~lp~FL~GeSMGGAV~Ll~~~k~p~~w~G~ilvaPmc~i~~~~kp~p~v~~~l~~l~ 184 (313) T KOG1455 105 SFDLVVDDVISFFDSIKEREENKGLPRFLFGESMGGAVALLIALKDPNFWDGAILVAPMCKISEDTKPHPPVISILTLLS 184 (313) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHH T ss_conf 57888988999998876412458997644305744699999976187610241000041225776689828999999998 Q ss_pred -------------------------------------------------------CCCCCCCCCEEEEECCCCCCCCHHH Q ss_conf -------------------------------------------------------2234334773776168655028999 Q gi|254781093|r 140 -------------------------------------------------------SFLAPCPSSGLIINGSNDTVATTSD 164 (225) Q Consensus 140 -------------------------------------------------------~~l~~~~~p~LiIhG~~D~~vp~~~ 164 (225) ..+..+..|.||+||++|.++.++. T Consensus 185 ~liP~wk~vp~~d~~~~~~kdp~~r~~~~~npl~y~g~pRl~T~~ElLr~~~~le~~l~~vtvPflilHG~dD~VTDp~~ 264 (313) T KOG1455 185 KLIPTWKIVPTKDIIDVAFKDPEKRKILRSDPLCYTGKPRLKTAYELLRVTADLEKNLNEVTVPFLILHGTDDKVTDPKV 264 (313) T ss_pred HHCCCEEECCCCCCCCCCCCCHHHHHHHHCCCCEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCHH T ss_conf 75871000587654320007899998752398153387308899999999899997332454347998658884058078 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCCCCCCCC-----CHHHHHHHHHHHHHHH Q ss_conf 999999986077996079997879847558-----5899999999999973 Q gi|254781093|r 165 VKDLVNKLMNQKGISITHKVIPDANHFFIG-----KVDELINECAHYLDNS 210 (225) Q Consensus 165 ~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-----~~~~l~~~i~~fl~~~ 210 (225) ++.|++.+..+ +.++.+|||.=|.... ..+.+...|.+||+++ T Consensus 265 Sk~Lye~A~S~---DKTlKlYpGm~H~Ll~gE~~en~e~Vf~DI~~Wl~~r 312 (313) T KOG1455 265 SKELYEKASSS---DKTLKLYPGMWHSLLSGEPDENVEIVFGDIISWLDER 312 (313) T ss_pred HHHHHHHCCCC---CCCEECCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHC T ss_conf 89999856577---8843405447778541798554999999999999733 No 9 >PRK10673 hypothetical protein; Provisional Probab=99.92 E-value=3.3e-23 Score=159.75 Aligned_cols=183 Identities=21% Similarity=0.312 Sum_probs=129.5 Q ss_pred EEEEEEECCCCC-CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCH-HHHHHHHHH Q ss_conf 187999668887-787999957899888899899999999999973984999853155887677852102-389999989 Q gi|254781093|r 12 RLEGRYQPSTNP-NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDG-ELSDAAAAL 89 (225) Q Consensus 12 ~l~~~~~~~~~~-~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~-E~~D~~aa~ 89 (225) +|...+...+++ +.+.+|++| +++|+.++ ...+++.|++ .|.|+++|.||+|+|+........ -.+|+.+.+ T Consensus 2 ~~~~~~~~~~~P~~~~PlvllH---G~~~~~~~--w~~~~~~L~~-~~rVia~DlrGhG~S~~~~~~~~~~~a~dl~~ll 75 (255) T PRK10673 2 KLNIRAQTAQNPHNNSPIVLVH---GLFGSLDN--LGVLARDLVN-DHNIIQVDMRNHGLSPRDPVMDYPAMAQDLLDTL 75 (255) T ss_pred EEEEEECCCCCCCCCCCEEEEC---CCCCCHHH--HHHHHHHHHC-CCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 0167624788999899778889---85559889--9999999835-9958998289999997988789999999999999 Q ss_pred HHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCC------------------------------ Q ss_conf 9998623546740588872036878764503862-133012275744333------------------------------ Q gi|254781093|r 90 DWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYD------------------------------ 138 (225) Q Consensus 90 ~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~------------------------------ 138 (225) |. .+ ..++.|+||||||.+|+.+|.++|+ |+++|++...+..+. T Consensus 76 d~---lg--i~~~~lvGhSmGG~va~~~A~~~P~rV~~Lvlid~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (255) T PRK10673 76 DA---LQ--IDKATFIGHSMGGKAVMALTALAPDRIDKLVAIDIAPVDYHVRRHDEIFAAINAVSESDATTRQQAAAIMR 150 (255) T ss_pred HH---HC--CCCEEEEEECHHHHHHHHHHHHCCHHCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 98---39--99647852660489999999856111467899825876534110589999999754412535999999999 Q ss_pred -------------------------------------CCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEE Q ss_conf -------------------------------------2223433477377616865502899999999998607799607 Q gi|254781093|r 139 -------------------------------------FSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISIT 181 (225) Q Consensus 139 -------------------------------------~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~ 181 (225) ...+...+.|+|+|+|++|.++|.+..+.+.+.++.+ + T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~P~L~i~G~~d~~~~~~~~~~l~~~~P~a-----~ 225 (255) T PRK10673 151 QHLNEEGVIQFLLKSFVDGEWRFNVPVLWDQYPHIVGWEKIPAWDHPALFIPGGNSPYVSEQYRDDLLAQFPQA-----R 225 (255) T ss_pred HHCCCHHHHHHHHHHHHCCHHHCCHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHCCCC-----E T ss_conf 85371789999998512411331849999878875263002677888599950789988999999999878997-----8 Q ss_pred EEEECCCCCCCC-CCHHHHHHHHHHHHHHH Q ss_conf 999787984755-85899999999999973 Q gi|254781093|r 182 HKVIPDANHFFI-GKVDELINECAHYLDNS 210 (225) Q Consensus 182 ~~~i~ga~H~f~-~~~~~l~~~i~~fl~~~ 210 (225) +++|+|++|+-+ ++-+++.+.+.+||+++ T Consensus 226 ~~~i~~aGH~~~~E~P~~~~~~i~~FL~~k 255 (255) T PRK10673 226 AHVIAGAGHWVHAEKPDAVLRAIRRYLNDK 255 (255) T ss_pred EEEECCCCCCCHHHCHHHHHHHHHHHHCCC T ss_conf 999799999027879999999999987769 No 10 >pfam01738 DLH Dienelactone hydrolase family. Probab=99.92 E-value=1.3e-22 Score=156.19 Aligned_cols=191 Identities=13% Similarity=0.211 Sum_probs=139.7 Q ss_pred EEEEEECCCC-CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC-CCCCCCCCCCC-----------C- Q ss_conf 8799966888-77879999578998888998999999999999739849998531-55887677852-----------1- Q gi|254781093|r 13 LEGRYQPSTN-PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR-GIGRSEGEFDY-----------G- 78 (225) Q Consensus 13 l~~~~~~~~~-~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R-G~G~S~G~~~~-----------g- 78 (225) +++++..|.+ .+.|.+|++|.. .|- +..+..+|+.|+++||.|+.+|+. |.+.+.+.... . T Consensus 1 ~~ay~a~P~~~gp~PaVll~h~~---~Gl--~~~~~~~A~~lA~~Gy~v~~pDlf~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (216) T pfam01738 1 FGAYLALPAGGGPGPVVVVFQEI---FGV--NNFIRDIAARLAEEGYVALCPDLYARQGPDPTDPREAARAMRGLLSKRM 75 (216) T ss_pred CCEEEEECCCCCCCCEEEEECCC---CCC--CHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCH T ss_conf 97799718999986889997178---789--9899999999997799799831235888998987999999998403478 Q ss_pred CHHHHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCC-CCCCCCCCCCCCEEEEECCC Q ss_conf 023899999899998623-5467405888720368787645038621330122757443-33222343347737761686 Q gi|254781093|r 79 DGELSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKS-YDFSFLAPCPSSGLIINGSN 156 (225) Q Consensus 79 ~~E~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~-~~~~~l~~~~~p~LiIhG~~ 156 (225) ..-..|+.++++|++++. .+..+|.++|||+||.+++.++.+ +.+.+.++..|.... .+.........|+|+++|++ T Consensus 76 ~~~~~d~~aa~~~l~~~~~~~~~rig~~Gfc~GG~~a~~~a~~-~~~~a~v~~yg~~~~~~~~~~~~~i~~Pvl~~~g~~ 154 (216) T pfam01738 76 EAVVDDLLAAINYLRGQPYVDTKKVGVVGFCLGGRLAFLLAAR-NGVDAAVSFYGGLPPEPPLDEAPAIKAPILAHFGEE 154 (216) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCHHHHHHHHHHC-CCCCEEEEEECCCCCCCCHHHCCCCCCCEEEEECCC T ss_conf 8899999999999983777788828999866489999999607-898879996068888730232024677767650377 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC---------HHHHHHHHHHHHHHH Q ss_conf 550289999999999860779960799978798475585---------899999999999973 Q gi|254781093|r 157 DTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGK---------VDELINECAHYLDNS 210 (225) Q Consensus 157 D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~---------~~~l~~~i~~fl~~~ 210 (225) |+.+|.+...++.+++... +.++++++|||++|.|... .++.-+.+.+||+++ T Consensus 155 D~~~p~~~~~~~~~~l~~~-g~~~~~~~ypga~HgF~~~~~~~y~~~aa~~aw~~~~~ff~~h 216 (216) T pfam01738 155 DPFVPAEARDLLEEALRAA-NVDHELHWYPGAGHAFAADGRESYDAAAAEDAWERTLEFFKQY 216 (216) T ss_pred CCCCCHHHHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999999863-9988999979898468899987789999999999999998559 No 11 >PRK10985 putative hydrolase; Provisional Probab=99.90 E-value=8.1e-22 Score=151.39 Aligned_cols=201 Identities=21% Similarity=0.331 Sum_probs=145.3 Q ss_pred EECCCCC-EEEEEEE--CCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC--CCC Q ss_conf 9658996-1879996--6888778799995789988889989999999999997398499985315588767785--210 Q gi|254781093|r 5 VFNGPSG-RLEGRYQ--PSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD--YGD 79 (225) Q Consensus 5 ~i~g~~G-~l~~~~~--~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~--~g~ 79 (225) .+..+|| .+.--|. +.....+|++|++| +.+|+.+...+..+++.+.++||.++++|+||++++.-+-. +-. T Consensus 35 ~l~~~DG~~i~Ldw~~~~~~~~~~p~villH---GL~G~s~s~Yi~~l~~~~~~~G~~~vv~n~RGc~g~~~~~~r~y~~ 111 (325) T PRK10985 35 RLELPDGDFVDLAWSEDPAQARHKPRLVLFH---GLEGSFNSPYAHGLLEAAQKRGWLGVVMHFRGCSGEPNRLHRIYHS 111 (325) T ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEC---CCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCEECC T ss_conf 9998999989986057887678988899988---8889998779999999999789979998077899997777673627 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCC---CCHHEEEECCCCCCCC--------------C---- Q ss_conf 238999998999986235467405888720368787645038---6213301227574433--------------3---- Q gi|254781093|r 80 GELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRR---PEINGFISVAPQPKSY--------------D---- 138 (225) Q Consensus 80 ~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~---p~i~~~v~isp~~~~~--------------~---- 138 (225) |..+|+..++++++++++. .+++++|||+||.+.+.++++. +.+.+.+++++|.+.. . T Consensus 112 g~t~Dl~~~v~~i~~~~p~-~~i~~vGfSlGgnillkyLge~g~~~~l~aav~vs~P~dl~~~~~~l~~~~~r~Y~~~~~ 190 (325) T PRK10985 112 GETEDARFFLRWLQREFGH-VPTAAVGYSLGGNMLACYLAKEGNDLPLDAAVIVSAPLMLEACSYRMEQGFSRVYQRYLL 190 (325) T ss_pred CCHHHHHHHHHHHHHHCCC-CCEEEEECCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHCCCHHHHHHHHH T ss_conf 8568999999999987899-857886201578999999874077777037998468557899999985620167999999 Q ss_pred ---------------------------------C------------------------CCCCCCCCCEEEEECCCCCCCC Q ss_conf ---------------------------------2------------------------2234334773776168655028 Q gi|254781093|r 139 ---------------------------------F------------------------SFLAPCPSSGLIINGSNDTVAT 161 (225) Q Consensus 139 ---------------------------------~------------------------~~l~~~~~p~LiIhG~~D~~vp 161 (225) | ..+....+|+|+|+..+|+++| T Consensus 191 ~~lk~~~~~k~~~~~~~~~~~~~~~~~~~tl~eFD~~~t~p~~Gf~s~~dYY~~~S~~~~l~~I~vP~L~l~A~DDP~~~ 270 (325) T PRK10985 191 NLLKANATRKLAAYPGTLPINLAQLKSVRRLREFDDLITAKIHGFADAIDYYRQCSALPLLNQIAKPTLIIHAKDDPFMT 270 (325) T ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHCCCHHHHHCCCCCEEEEECCCCCCCC T ss_conf 99999999998747441247999986179799999865565259899899998539364540277664999747789899 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEECCCCC--CCCCC----HHHHHHHHHHHHHHHCCCC Q ss_conf 99999999998607799607999787984--75585----8999999999999731221 Q gi|254781093|r 162 TSDVKDLVNKLMNQKGISITHKVIPDANH--FFIGK----VDELINECAHYLDNSLDEK 214 (225) Q Consensus 162 ~~~~~~~~~~l~~~~~~~~~~~~i~ga~H--~f~~~----~~~l~~~i~~fl~~~L~~~ 214 (225) .+..-+. +. ....+.+.+-+.+|| |+.+. .-=+.+.+.+|++..|+.+ T Consensus 271 ~~~iP~~-~~----~~~~~~l~~t~~GGHvGF~eg~~~~~~~W~~~~i~~~~~~~l~~~ 324 (325) T PRK10985 271 HEVIPKP-ES----LPPNVEYQLTEHGGHVGFIGGTLLKPQMWLESRIPDWLTTYLEAK 324 (325) T ss_pred HHHCCCH-HH----CCCCEEEEECCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 1438756-76----599748982799873653778999988466999999999997247 No 12 >KOG4391 consensus Probab=99.90 E-value=1.2e-22 Score=156.38 Aligned_cols=209 Identities=18% Similarity=0.298 Sum_probs=159.2 Q ss_pred EEEECCCCC-EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHH-HHHHCCCEEEEEEECCCCCCCCCCCCCCH Q ss_conf 089658996-1879996688877879999578998888998999999999-99973984999853155887677852102 Q gi|254781093|r 3 EVVFNGPSG-RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFY-LFQQRGFVSLRFNFRGIGRSEGEFDYGDG 80 (225) Q Consensus 3 ~v~i~g~~G-~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~-~l~~~G~~vl~fd~RG~G~S~G~~~~g~~ 80 (225) .+.+-++|- .|++++...+. .+|+++++||+. |||.++. ..++ .+.+.+++|+..+|||+|.|+|+.++ .| T Consensus 56 ~i~l~T~D~vtL~a~~~~~E~-S~pTlLyfh~NA---GNmGhr~--~i~~~fy~~l~mnv~ivsYRGYG~S~GspsE-~G 128 (300) T KOG4391 56 RIELRTRDKVTLDAYLMLSES-SRPTLLYFHANA---GNMGHRL--PIARVFYVNLKMNVLIVSYRGYGKSEGSPSE-EG 128 (300) T ss_pred EEEEECCCCEEEEEEEECCCC-CCCEEEEECCCC---CCCCCHH--HHHHHHHHHCCCEEEEEEEECCCCCCCCCCC-CC T ss_conf 899874763367644401357-885489983687---7511000--6789999972731999982034568899541-01 Q ss_pred HHHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECC--------------CCCCC-------- Q ss_conf 3899999899998623-546740588872036878764503862-13301227--------------57443-------- Q gi|254781093|r 81 ELSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVA--------------PQPKS-------- 136 (225) Q Consensus 81 E~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~is--------------p~~~~-------- 136 (225) -.-|.++|+||+-.+. -+.++++|+|.|+||++|+.+|++..+ +.++|.-. |.+.. T Consensus 129 L~lDs~avldyl~t~~~~dktkivlfGrSlGGAvai~lask~~~ri~~~ivENTF~SIp~~~i~~v~p~~~k~i~~lc~k 208 (300) T KOG4391 129 LKLDSEAVLDYLMTRPDLDKTKIVLFGRSLGGAVAIHLASKNSDRISAIIVENTFLSIPHMAIPLVFPFPMKYIPLLCYK 208 (300) T ss_pred EECCHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEEECCCHHHEEEEEEECHHCCCHHHHHHEECCCHHHHHHHHHHH T ss_conf 10048999998850766774159998325677268885003401300332311000355442002223436677789877 Q ss_pred ---CCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf ---33222343347737761686550289999999999860779960799978798475585899999999999973122 Q gi|254781093|r 137 ---YDFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSLDE 213 (225) Q Consensus 137 ---~~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~~ 213 (225) .+.+.+..|..|.|+|.|.+|+.||+-+++.+++......+ .+.++|+..|-=+--.+-+-+++.+||.+.... T Consensus 209 n~~~S~~ki~~~~~P~LFiSGlkDelVPP~~Mr~Ly~~c~S~~K---rl~eFP~gtHNDT~i~dGYfq~i~dFlaE~~~~ 285 (300) T KOG4391 209 NKWLSYRKIGQCRMPFLFISGLKDELVPPVMMRQLYELCPSRTK---RLAEFPDGTHNDTWICDGYFQAIEDFLAEVVKS 285 (300) T ss_pred HHHCCHHHHCCCCCCEEEEECCCCCCCCCHHHHHHHHHCCHHHH---HHEECCCCCCCCEEEECCHHHHHHHHHHHHCCC T ss_conf 56413213224557647850576345895999999876821223---222278986686078535899999999986127 Q ss_pred CCCCCCCC Q ss_conf 10012433 Q gi|254781093|r 214 KFTLLKSI 221 (225) Q Consensus 214 ~~~~~~~~ 221 (225) +++.-+++ T Consensus 286 ~P~~~a~~ 293 (300) T KOG4391 286 SPEEMAKT 293 (300) T ss_pred CHHHHCCC T ss_conf 96774876 No 13 >COG2267 PldB Lysophospholipase [Lipid metabolism] Probab=99.90 E-value=2.7e-21 Score=148.26 Aligned_cols=202 Identities=23% Similarity=0.361 Sum_probs=145.9 Q ss_pred EEEECCCCC-EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCC-CCC---CC Q ss_conf 089658996-1879996688877879999578998888998999999999999739849998531558876-778---52 Q gi|254781093|r 3 EVVFNGPSG-RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSE-GEF---DY 77 (225) Q Consensus 3 ~v~i~g~~G-~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~-G~~---~~ 77 (225) |-.+.+.+| ++......+..++.++||++|+. ..+..-...++..|..+||.|+.+|.||+|+|. |.- +. T Consensus 11 ~~~~~~~d~~~~~~~~~~~~~~~~g~Vvl~HG~-----~Eh~~ry~~la~~l~~~G~~V~~~D~RGhG~S~r~~rg~~~~ 85 (298) T COG2267 11 EGYFTGADGTRLRYRTWAAPEPPKGVVVLVHGL-----GEHSGRYEELADDLAARGFDVYALDLRGHGRSPRGQRGHVDS 85 (298) T ss_pred CCEEECCCCCEEEEEEECCCCCCCCEEEEECCC-----HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCHH T ss_conf 443404798467888523788988789992695-----067889999999998599879983689988899776677210 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEECCCCCCCC------------------- Q ss_conf 102389999989999862354674058887203687876450386-213301227574433------------------- Q gi|254781093|r 78 GDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP-EINGFISVAPQPKSY------------------- 137 (225) Q Consensus 78 g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~isp~~~~~------------------- 137 (225) -.--..|+.+.++.+.+.+++ .+++|+||||||.+++.++.+.+ .+.++|+.||..... T Consensus 86 f~~~~~dl~~~~~~~~~~~~~-~p~~l~gHSmGg~Ia~~~~~~~~~~i~~~vLssP~~~l~~~~~~~~~~~~~~~~~~~~ 164 (298) T COG2267 86 FADYVDDLDAFVETIAEPDPG-LPVFLLGHSMGGLIALLYLARYPPRIDGLVLSSPALGLGGAILRLILARLALKLLGRI 164 (298) T ss_pred HHHHHHHHHHHHHHHHCCCCC-CCEEEEEECCHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 899999999999998534899-9889999573699999999847856566999886423662034568899876544030 Q ss_pred ----CCCC----------C-----------------------------------------CCCCCCEEEEECCCCCCCC- Q ss_conf ----3222----------3-----------------------------------------4334773776168655028- Q gi|254781093|r 138 ----DFSF----------L-----------------------------------------APCPSSGLIINGSNDTVAT- 161 (225) Q Consensus 138 ----~~~~----------l-----------------------------------------~~~~~p~LiIhG~~D~~vp- 161 (225) .+.. + .....|+|++||++|.+|+ T Consensus 165 ~p~~~~~~~~~~~~~~~~~sr~~~~~~~~~~dP~~~~~~~~~~w~~~~~~a~~~~~~~~~~~~~~PvLll~g~~D~vv~~ 244 (298) T COG2267 165 RPKLPVDSNLLEGVLTDDLSRDPAEVAAYEADPLIGVGGPVSRWVDLALLAGRVPALRDAPAIALPVLLLQGGDDRVVDN 244 (298) T ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCC T ss_conf 65553356544565534353699999986349520268853799999999743332033434579858730377836677 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC-C-CH--HHHHHHHHHHHHHHCCC Q ss_conf 99999999998607799607999787984755-8-58--99999999999973122 Q gi|254781093|r 162 TSDVKDLVNKLMNQKGISITHKVIPDANHFFI-G-KV--DELINECAHYLDNSLDE 213 (225) Q Consensus 162 ~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~-~-~~--~~l~~~i~~fl~~~L~~ 213 (225) .+...++++++..+ +.++++++|+-|... + .. +++.+.+.+||.+.+.. T Consensus 245 ~~~~~~~~~~~~~~---~~~~~~~~g~~He~~~E~~~~r~~~~~~~~~~l~~~~~~ 297 (298) T COG2267 245 VEGLARFFERAGSP---DKELKVIPGAYHELLNEPDRAREEVLKDILAWLAEALPS 297 (298) T ss_pred HHHHHHHHHHCCCC---CCEEEECCCCCCHHHHCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 37899999965999---842777689770565277758999999999998750169 No 14 >COG0412 Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism] Probab=99.90 E-value=6.2e-21 Score=146.07 Aligned_cols=205 Identities=20% Similarity=0.316 Sum_probs=158.7 Q ss_pred CEEEECCCCCEEEEEEECCCCCCC-CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC-CCCCCCCC---- Q ss_conf 708965899618799966888778-799995789988889989999999999997398499985315-58876778---- Q gi|254781093|r 2 PEVVFNGPSGRLEGRYQPSTNPNA-PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG-IGRSEGEF---- 75 (225) Q Consensus 2 ~~v~i~g~~G~l~~~~~~~~~~~~-~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG-~G~S~G~~---- 75 (225) .+|.++.+++.+.+.+..|.+... |.||+.|-. +|- |..+.+.|++|+..||.++.+|+-+ .|.+...- T Consensus 3 ~~v~~~~~~~~~~~~~a~P~~~~~~P~VIv~hei--~Gl---~~~i~~~a~rlA~~Gy~v~~Pdl~~~~~~~~~~~~~~~ 77 (236) T COG0412 3 TDVTIPAPDGELPAYLARPAGAGGFPGVIVLHEI--FGL---NPHIRDVARRLAKAGYVVLAPDLYGRQGDPTDIEDEPA 77 (236) T ss_pred CCEEECCCCCEEEEEEECCCCCCCCCEEEEEECC--CCC---CHHHHHHHHHHHHCCCEEEECHHHCCCCCCCCCCCHHH T ss_conf 5458327985333589566778999889998335--788---87899999999958986973154446888876564189 Q ss_pred --CCC-------CHHHHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCCCCCCCC Q ss_conf --521-------023899999899998623-5467405888720368787645038621330122757443332223433 Q gi|254781093|r 76 --DYG-------DGELSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYDFSFLAPC 145 (225) Q Consensus 76 --~~g-------~~E~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~~~l~~~ 145 (225) ..+ .....|+.++++|++++. .+..+|.++||||||.+++.++.+.|++++.++..|....-....-... T Consensus 78 ~~~~~~~~~~~~~~~~~d~~a~~~~L~~~~~~~~~~ig~~GfC~GG~~a~~~a~~~~~v~a~v~fyg~~~~~~~~~~~~~ 157 (236) T COG0412 78 ELETGLVERVDPAEVLADIDAALDYLARQPQVDPKRIGVVGFCMGGGLALLAATRAPEVKAAVAFYGGLIADDTADAPKI 157 (236) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCCCCC T ss_conf 87512320478788664599999999866688886489999880379999752446687779984688777663200026 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC------------HHHHHHHHHHHHHHHCC Q ss_conf 47737761686550289999999999860779960799978798475585------------89999999999997312 Q gi|254781093|r 146 PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGK------------VDELINECAHYLDNSLD 212 (225) Q Consensus 146 ~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~------------~~~l~~~i~~fl~~~L~ 212 (225) ..|+|+++|+.|..+|.+....+.+++... +..+++.+++++.|.|.+. .++.-+.+.+|+++.+. T Consensus 158 ~~pvl~~~~~~D~~~p~~~~~~~~~~~~~~-~~~~~~~~y~ga~H~F~~~~~~~~~~y~~~aa~~a~~~~~~ff~~~~~ 235 (236) T COG0412 158 KVPVLLHLAGEDPYIPAADVDALAAALEDA-GVKVDLEIYPGAGHGFANDRADYHPGYDAAAAEDAWQRVLAFFKRLLG 235 (236) T ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 898999814767767836889999999856-886269997898621147776556777978999999999999998606 No 15 >TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself. Probab=99.90 E-value=3.6e-22 Score=153.48 Aligned_cols=181 Identities=17% Similarity=0.245 Sum_probs=123.2 Q ss_pred EEEEEECCCCCCCCEEEEECCCCCCCCCCCCH--HHHHHHHHHHHCCCEEEEEEECCCCCCCCCC-CC--CCHHHHHHHH Q ss_conf 87999668887787999957899888899899--9999999999739849998531558876778-52--1023899999 Q gi|254781093|r 13 LEGRYQPSTNPNAPIALILHPHPRFGGTMNDN--IVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF-DY--GDGELSDAAA 87 (225) Q Consensus 13 l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~--~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~-~~--g~~E~~D~~a 87 (225) +.-.|...+ .+ +.+|++|++ |++.... ....+ ..|.+.+|.|+.+|.||+|+|+-.. +. +....+|+.+ T Consensus 20 ~ri~Y~~~G-~G-~pvvllHG~---~~~~~~W~~~~~~l-~~~~~~~yrVIa~Dl~G~G~S~~~~~~~~~~~~~a~~~~~ 93 (282) T TIGR03343 20 FRIHYNEAG-NG-EAVIMLHGG---GPGAGGWSNYYRNI-GPFVDAGYRVILKDSPGFNKSDAVVMDEQRGLVNARAVKG 93 (282) T ss_pred EEEEEEEEC-CC-CEEEEECCC---CCCHHHHHHHHHHH-HHHHHCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 899999974-68-818988399---98742789999977-9999789859997189988899876555324799999999 Q ss_pred HHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCC----------------------------- Q ss_conf 899998623546740588872036878764503862-13301227574433----------------------------- Q gi|254781093|r 88 ALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSY----------------------------- 137 (225) Q Consensus 88 a~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~----------------------------- 137 (225) .++ ..+ ..++.|+||||||.+|+.+|.++|+ |+++|+++|..... T Consensus 94 lld---~L~--i~~~~lvG~S~GG~iA~~~A~~~P~rV~~LvL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (282) T TIGR03343 94 LMD---ALD--IEKAHLVGNSMGGATALNFALEYPDRIGKLILMGPGGLGPSLFAPMPMEGIKLLFKLYAEPSYETLKQM 168 (282) T ss_pred HHH---HCC--CCCEEEEEECCHHHHHHHHHHHCHHHHHEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHH T ss_conf 999---749--983799973737999999999786862478998887666212465708899999997515789999999 Q ss_pred ------CC----------------------------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHH Q ss_conf ------32----------------------------------------22343347737761686550289999999999 Q gi|254781093|r 138 ------DF----------------------------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNK 171 (225) Q Consensus 138 ------~~----------------------------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~ 171 (225) +. ..+...+.|+|+|+|++|.++|++.++++.+. T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~l~~i~~PtLvi~G~~D~~~p~~~~~~l~~~ 248 (282) T TIGR03343 169 LNVFLFDQSLITEELLQGRWENIQRQPEHLKNFLISSQKAPLSTWDVTARLGEIKAKTLVTWGRDDRFVPLDHGLKLLWN 248 (282) T ss_pred HHHHCCCHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHH T ss_conf 99853480006899999999998729788999999976401202344788721898779998378874589999999986 Q ss_pred HHHCCCCCEEEEEECCCCCCCC-CCHHHHHHHHHHHHHH Q ss_conf 8607799607999787984755-8589999999999997 Q gi|254781093|r 172 LMNQKGISITHKVIPDANHFFI-GKVDELINECAHYLDN 209 (225) Q Consensus 172 l~~~~~~~~~~~~i~ga~H~f~-~~~~~l~~~i~~fl~~ 209 (225) ++.+ +++++|+++|+-. ++.+++.+.+.+||++ T Consensus 249 ip~a-----~l~~i~~aGH~~~~E~P~~fn~~l~~FLk~ 282 (282) T TIGR03343 249 MPDA-----QLHVFSRCGHWAQWEHADAFNRLVIDFLRN 282 (282) T ss_pred CCCC-----EEEEECCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 8998-----899989999954886499999999998486 No 16 >TIGR03611 RutD pyrimidine utilization protein D. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the hydrolase, alpha/beta fold family defined by pfam00067. Probab=99.89 E-value=1.6e-21 Score=149.65 Aligned_cols=178 Identities=21% Similarity=0.322 Sum_probs=126.6 Q ss_pred EEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCC--H-HHHHHHHHHHHH Q ss_conf 9966888778799995789988889989999999999997398499985315588767785210--2-389999989999 Q gi|254781093|r 16 RYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGD--G-ELSDAAAALDWV 92 (225) Q Consensus 16 ~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~--~-E~~D~~aa~~~l 92 (225) ..+-+..+.+|++|++| +++++. .........|++. |.|+.+|+||+|.|+...+... . -.+|+.+.++. T Consensus 4 ~~~g~~~~~~P~lvllH---G~~~~~--~~w~~~~~~L~~~-~rvi~~Dl~G~G~S~~~~~~~ys~~~~a~dl~~~l~~- 76 (257) T TIGR03611 4 ELHGPPDADAPTVVLSS---GLGGSG--SYWAPQLDVLTQR-FHVVTYDHRGTGRSPEELPPGYSIAHMADDVLQLLDA- 76 (257) T ss_pred EEECCCCCCCCEEEEEC---CCCCCH--HHHHHHHHHHHCC-CEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH- T ss_conf 99589999999599979---986688--8999999998269-8699978999999999999999999999999999997- Q ss_pred HHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCC-------------------------------C- Q ss_conf 8623546740588872036878764503862-133012275744333-------------------------------2- Q gi|254781093|r 93 QSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYD-------------------------------F- 139 (225) Q Consensus 93 ~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~-------------------------------~- 139 (225) .+ ..++.|+||||||.+|+.+|.++|+ |.++|++++...... . T Consensus 77 --l~--~~~~~lvG~S~Gg~va~~~a~~~P~~V~~lil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (257) T TIGR03611 77 --LG--IERFHFVGHALGGLIGLQLALRYPERLLSLVLINAWSRPDPHTRRCFDVRIALLQHAGPEAYVHAQALFLYPAD 152 (257) T ss_pred --CC--CCCCEEEEECCCHHHHHHHHHHCHHHEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCHH T ss_conf --69--98726998477289999999969565408889888666883566678999999983277889987677516745 Q ss_pred ---------------------------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf ---------------------------------------22343347737761686550289999999999860779960 Q gi|254781093|r 140 ---------------------------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISI 180 (225) Q Consensus 140 ---------------------------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~ 180 (225) ..+.....|+|+|+|++|.++|.+.+..+.+.++. . T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~l~~i~~P~Lii~G~~D~~~p~~~~~~~~~~ip~-----~ 227 (257) T TIGR03611 153 WISENAARLAADEAHALAHFPGKANVLRRINALEAFDVSARLDRIQHPVLLIANRDDMLVPYTQSLRLAAALPN-----A 227 (257) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHCCC-----C T ss_conf 44230467768999998737768899999998862584888854899779996268876899999999987899-----8 Q ss_pred EEEEECCCCCCC-CCCHHHHHHHHHHHHHH Q ss_conf 799978798475-58589999999999997 Q gi|254781093|r 181 THKVIPDANHFF-IGKVDELINECAHYLDN 209 (225) Q Consensus 181 ~~~~i~ga~H~f-~~~~~~l~~~i~~fl~~ 209 (225) ++++||+++|+- .++.+++.+.+.+||+. T Consensus 228 ~~~~i~~~GH~~~~e~P~~~~~~l~~FLks 257 (257) T TIGR03611 228 QLKLLPYGGHASNVTDPETFNRALLDFLKT 257 (257) T ss_pred EEEEECCCCCCHHHHCHHHHHHHHHHHHCC T ss_conf 899989999710786999999999999675 No 17 >PRK00870 haloalkane dehalogenase; Provisional Probab=99.89 E-value=1.3e-21 Score=150.05 Aligned_cols=192 Identities=17% Similarity=0.297 Sum_probs=131.8 Q ss_pred ECCCCC-EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCC---CCCCCH- Q ss_conf 658996-187999668887787999957899888899899999999999973984999853155887677---852102- Q gi|254781093|r 6 FNGPSG-RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGE---FDYGDG- 80 (225) Q Consensus 6 i~g~~G-~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~---~~~g~~- 80 (225) .++.+| -|.-.|...+.+.++.+|++|++|... .........|++.||-|+++|.||+|.|+-- .++... T Consensus 26 ~~~~~~~~lr~hY~e~G~~~GppvvLlHG~p~~~-----~~Wr~~i~~La~~GyRVIapDlrG~G~SdkP~~~~~Yt~~~ 100 (302) T PRK00870 26 VDDGDGGPLRMHYVDEGPADGEPVLLLHGEPSWS-----YLYRKMIPILAAAGHRVIAPDLIGFGRSDKPTRREDYTYAR 100 (302) T ss_pred ECCCCCCEEEEEEEECCCCCCCEEEEECCCCCCH-----HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCHHH T ss_conf 3477776378999853789998899989999987-----89999999998689989993689989999999878879999 Q ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC----------------------- Q ss_conf 3899999899998623546740588872036878764503862-1330122757443----------------------- Q gi|254781093|r 81 ELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKS----------------------- 136 (225) Q Consensus 81 E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~----------------------- 136 (225) -.+|+.+.++.+ + ..++.|+||||||.+|+.+|.++|+ ++++|+++..... T Consensus 101 ~a~~l~~lld~L---g--l~~~~LVGhs~GG~vA~~~A~~~PerV~~LVl~~~~~p~~~~~~~~~~~~~~~~~~~~p~~~ 175 (302) T PRK00870 101 HVEWMRSWFEQL---D--LTGVTLVCQDWGGLIGLRLAAEHPDRFARLVVANTGLPTGDGPMPDAFWAWRAFSQESPVLP 175 (302) T ss_pred HHHHHHHHHHHH---C--CCCEEEEEEECCCHHHHHHHHHCHHHHCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCHH T ss_conf 999999999984---9--98579999731608999999969787148999888776665545189999999986283014 Q ss_pred --------------------CC--C--------------------------------CCCCCCCCCEEEEECCCCCCCCH Q ss_conf --------------------33--2--------------------------------22343347737761686550289 Q gi|254781093|r 137 --------------------YD--F--------------------------------SFLAPCPSSGLIINGSNDTVATT 162 (225) Q Consensus 137 --------------------~~--~--------------------------------~~l~~~~~p~LiIhG~~D~~vp~ 162 (225) +. + ..+..+..|+|+|+|++|.+.+. T Consensus 176 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~P~Lvi~G~~D~~~~~ 255 (302) T PRK00870 176 VGRLVNGGTVRDLSDAVRAAYDAPFPDESYKAGARAFPLLVPTSPDDPAVAANRAAWEVLERWDKPFLTAFSDSDPITGG 255 (302) T ss_pred HHHHHHCCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCH T ss_conf 99997424434489999987407898678999999878755578433357988999998723799879998278876652 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCCCCC-CCHHHHHHHHHHHHHHH Q ss_conf 9999999998607799607999787984755-85899999999999973 Q gi|254781093|r 163 SDVKDLVNKLMNQKGISITHKVIPDANHFFI-GKVDELINECAHYLDNS 210 (225) Q Consensus 163 ~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~-~~~~~l~~~i~~fl~~~ 210 (225) .. ..+.+.++.++ ..++++|++++|+-. ++-+++.+.|.+||++. T Consensus 256 ~~-~~~~~~ip~a~--~~~~~~i~~aGH~~~~E~Pe~~~~~i~~FLr~~ 301 (302) T PRK00870 256 GD-AILQKRIPGAA--GQPHTTIKGGGHFLQEDSGEELAEAIVSFIRAN 301 (302) T ss_pred HH-HHHHHHCCCCC--CCCEEEECCCCCCHHHCCHHHHHHHHHHHHHHC T ss_conf 67-99999678887--886589799988058759999999999999758 No 18 >PRK10349 carboxylesterase BioH; Provisional Probab=99.89 E-value=9.7e-22 Score=150.91 Aligned_cols=176 Identities=20% Similarity=0.255 Sum_probs=123.3 Q ss_pred EEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99668887787999957899888899899999999999973984999853155887677852102389999989999862 Q gi|254781093|r 16 RYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSL 95 (225) Q Consensus 16 ~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~ 95 (225) +|+.-.+...+.+|++| ++|++... -......|.+ .|.|+++|+||+|+|++. + ....+|..+.+ .+. T Consensus 4 i~Y~~~G~G~~plvllH---G~g~~~~~--W~~~~~~L~~-~~rvi~~DlpG~G~S~~~-~--~~~~~~~a~~l---~~~ 71 (256) T PRK10349 4 IWWQTKGQGNVHLVLLH---GWGLNAEV--WRCIDEELSS-HFTLHLVDLPGFGRSRGF-G--ALSLADMAEAV---LKQ 71 (256) T ss_pred EEEEEECCCCCCEEEEC---CCCCCHHH--HHHHHHHHHC-CCEEEEECCCCCCCCCCC-C--CCCHHHHHHHH---HHH T ss_conf 69996612899779989---99878789--9999999734-777999579888889998-8--45899999999---973 Q ss_pred CCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC-----C-----------------C-------C------ Q ss_conf 3546740588872036878764503862-1330122757443-----3-----------------3-------2------ Q gi|254781093|r 96 NPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKS-----Y-----------------D-------F------ 139 (225) Q Consensus 96 ~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~-----~-----------------~-------~------ 139 (225) .. .+..++||||||.+|+.+|.++|+ |+++|++++.+.. + + + T Consensus 72 ~~--~~~~lvG~SmGG~iA~~~A~~~P~rV~~Lvlv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (256) T PRK10349 72 AP--DKAIWLGWSLGGLVASQIALTHPERVQALVTVASSPCFSARDEWPGIKPDVLAGFQQQLSDDFQRTVERFLALQTM 149 (256) T ss_pred CC--CCEEEEEECHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 87--6737999896189999999979555542415577744445545554469999999999888899999999876531 Q ss_pred ----------------------------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCC Q ss_conf ----------------------------------------2234334773776168655028999999999986077996 Q gi|254781093|r 140 ----------------------------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGIS 179 (225) Q Consensus 140 ----------------------------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~ 179 (225) ..|.....|+|+|+|++|.++|++.++.+.+.++.+ T Consensus 150 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~d~~~~L~~i~~PtLvi~G~~D~~vp~~~~~~l~~~ip~a---- 225 (256) T PRK10349 150 GTETARQDARALKKTVLALPMPEVDVLNGGLEILKTVDLRQPLQNVSMPFLRLYGYLDGLVPRKVVPMLDKLWPHS---- 225 (256) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCC---- T ss_conf 6334578999999999807788799999999999806458787318999799983899887999999999868998---- Q ss_pred EEEEEECCCCCC-CCCCHHHHHHHHHHHHHHH Q ss_conf 079997879847-5585899999999999973 Q gi|254781093|r 180 ITHKVIPDANHF-FIGKVDELINECAHYLDNS 210 (225) Q Consensus 180 ~~~~~i~ga~H~-f~~~~~~l~~~i~~fl~~~ 210 (225) +++++++++|+ +-++.+++.+.+.+||++. T Consensus 226 -~l~ii~~~GH~p~~E~P~~f~~~ll~Fl~r~ 256 (256) T PRK10349 226 -ESYIFAKAAHAPFISHPAEFCHLLVALKQRV 256 (256) T ss_pred -EEEEECCCCCCHHHHCHHHHHHHHHHHHHHC T ss_conf -8999799998325439999999999999709 No 19 >PRK10749 lysophospholipase L2; Provisional Probab=99.89 E-value=1.5e-20 Score=143.68 Aligned_cols=200 Identities=20% Similarity=0.206 Sum_probs=141.4 Q ss_pred EEEECCCCC-EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCH-HHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC---C Q ss_conf 089658996-187999668887787999957899888899899-99999999997398499985315588767785---2 Q gi|254781093|r 3 EVVFNGPSG-RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDN-IVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD---Y 77 (225) Q Consensus 3 ~v~i~g~~G-~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~-~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~---~ 77 (225) |=.|.|.+| +|.-...... .+++++||+| |..+.- -...++..|.++||.|+.+|.||+|.|+...+ . T Consensus 32 ~~~f~g~d~~~i~y~~~~~~-~~~~~IVI~~------Gr~E~~~KY~E~~~~l~~~Gy~V~~~DhRGqG~S~Rl~~d~~~ 104 (330) T PRK10749 32 EAEFTGVDNIPVRFVRFRAQ-HHDRVVVICP------GRIESYVKYAELAYDLFHLGFDVLIIDHRGQGRSGRLLADPHR 104 (330) T ss_pred CCEEECCCCCEEEEEECCCC-CCCCEEEECC------CCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCC T ss_conf 50697569986999986688-8885599828------9741089999999999988998999778888877776789987 Q ss_pred CCH-----HHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC--------------- Q ss_conf 102-----3899999899998623546740588872036878764503862-1330122757443--------------- Q gi|254781093|r 78 GDG-----ELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKS--------------- 136 (225) Q Consensus 78 g~~-----E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~--------------- 136 (225) |.- =++|+.+.++.+....+ ..+++|+||||||.+++.++.++|. +.++|+.||.... T Consensus 105 GhV~~F~dYv~Dl~~~~~~~~~~~~-~~~~~llgHSMGG~Ia~~~l~~~~~~f~aavLSAPMlgI~~~~p~~la~~ll~~ 183 (330) T PRK10749 105 GHVNKFNDYVDDLAAFWQQEIQPGP-WRKRYILAHSMGGAILTLFLQRHPGVFDAIALCAPMFGIVIRLPSWMARQILNW 183 (330) T ss_pred EEECCHHHHHHHHHHHHHHHHCCCC-CCCEEEEEECHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHH T ss_conf 6318888899999999999854568-886389973627899999999678655559995775357675408999999998 Q ss_pred ----------C-------C---C----------------------------------------------CCCCCCCCCEE Q ss_conf ----------3-------3---2----------------------------------------------22343347737 Q gi|254781093|r 137 ----------Y-------D---F----------------------------------------------SFLAPCPSSGL 150 (225) Q Consensus 137 ----------~-------~---~----------------------------------------------~~l~~~~~p~L 150 (225) + . + ........|+| T Consensus 184 ~~~~~~~~~~y~~g~~~~~~~pf~~N~LT~d~~Ry~~~~~~~~~~P~l~lGgpT~~Wv~eal~a~~~~~~~a~~I~~PvL 263 (330) T PRK10749 184 AEGHPRIRDGYAIGTGRWRALPFAINVLTHSRERYRRNLRFYADDPELRVGGPTYHWVRESILAGEQVLAGAGDDTTPTL 263 (330) T ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 86434555445678887777865557454887899999999974966404786599999999999999864604799989 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHCCC--CCEEEEEECCCCCC-CCCC---HHHHHHHHHHHHHHH Q ss_conf 761686550289999999999860779--96079997879847-5585---899999999999973 Q gi|254781093|r 151 IINGSNDTVATTSDVKDLVNKLMNQKG--ISITHKVIPDANHF-FIGK---VDELINECAHYLDNS 210 (225) Q Consensus 151 iIhG~~D~~vp~~~~~~~~~~l~~~~~--~~~~~~~i~ga~H~-f~~~---~~~l~~~i~~fl~~~ 210 (225) +++|++|.+|+.+..+++++.+..... ..-++++|+|+-|- |.+. .+++.+.|.+|++++ T Consensus 264 il~a~~D~iVd~~a~~~f~~~~~~~~~~~~~~~l~~i~gArHEil~E~D~~R~~~l~~I~~F~~~h 329 (330) T PRK10749 264 LLQAEEERVVDNRMHDRFCELRTAAGHPCEGGRPLVIKGAYHEILFEKDAMRSVALNAIVDFFNRH 329 (330) T ss_pred EEEECCCCEECHHHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHCCCHHHHHHHHHHHHHHHHHC T ss_conf 998289811083999999999875257666870887589601675398799999999999999764 No 20 >TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity. Probab=99.88 E-value=5.2e-21 Score=146.51 Aligned_cols=180 Identities=17% Similarity=0.227 Sum_probs=128.5 Q ss_pred EEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC--CCCHH-HHHHHHHH Q ss_conf 8799966888778799995789988889989999999999997398499985315588767785--21023-89999989 Q gi|254781093|r 13 LEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD--YGDGE-LSDAAAAL 89 (225) Q Consensus 13 l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~--~g~~E-~~D~~aa~ 89 (225) ++-.|...+.+.+|++|++|+ +|++..+ ...+...|+ .+|.|+.+|.||+|.|+...+ +...+ .+|+.+.+ T Consensus 16 ~~~hy~~~G~~~gp~lvllHG---~~~~~~~--w~~~~~~L~-~~~rvi~~Dl~G~G~S~~p~~~~~~~~~~a~dl~~ll 89 (278) T TIGR03056 16 FHWHVQDMGPTAGPLLLLLHG---TGASTHS--WRDLMPPLA-RSFRVVAPDLPGHGFTRAPFRFRFTLPSMAEDLSALC 89 (278) T ss_pred EEEEEEEECCCCCCEEEEECC---CCCCHHH--HHHHHHHHH-CCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 899998558899983899899---9876889--999999984-6998999967998888999889999999999999989 Q ss_pred HHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCC------------------------------ Q ss_conf 9998623546740588872036878764503862-133012275744333------------------------------ Q gi|254781093|r 90 DWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYD------------------------------ 138 (225) Q Consensus 90 ~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~------------------------------ 138 (225) +. . +.+++.|+||||||.+|+.+|.++|+ ++++++++++...++ T Consensus 90 ~~---l--~i~~~~lvGhS~Gg~vA~~~A~~~P~~v~~lv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (278) T TIGR03056 90 AA---E--GLSPDGVIGHSAGAAIALRLALDGPVTPRMVVGINAALMPFEGMAGTLFPYMARVLACNPFTPPMMSRGAAD 164 (278) T ss_pred HH---C--CCCCEEEEECCCHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHCC T ss_conf 86---2--998659994541799999999968752558999766667742101234488998876372226888763012 Q ss_pred --------------------------------------------C----CCCCCCCCCEEEEECCCCCCCCHHHHHHHHH Q ss_conf --------------------------------------------2----2234334773776168655028999999999 Q gi|254781093|r 139 --------------------------------------------F----SFLAPCPSSGLIINGSNDTVATTSDVKDLVN 170 (225) Q Consensus 139 --------------------------------------------~----~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~ 170 (225) . ..+.....|+|+|+|++|..+|++..+++.+ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~i~~P~lvi~G~~D~~~~~~~~~~l~~ 244 (278) T TIGR03056 165 QQRVERLIRDTGSLLDKAGMTYYGRLIRSPAHVDGALSMMAQWDLAPLNRDLPRITIPLHLIAGEEDKAVPPDESKRAAT 244 (278) T ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHH T ss_conf 89999998521111367899999998549666789999998401137887787578988999857899979999999998 Q ss_pred HHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHH Q ss_conf 986077996079997879847558-58999999999999 Q gi|254781093|r 171 KLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYLD 208 (225) Q Consensus 171 ~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl~ 208 (225) .++.+ +++++++++|+.+. +-+++.+.|.+||+ T Consensus 245 ~~p~a-----~~~~i~~aGH~~~~E~Pe~v~~~i~~fle 278 (278) T TIGR03056 245 RVPTA-----TLHVVPGGGHLVHEEQADGVVGLILQAAE 278 (278) T ss_pred HCCCC-----EEEEECCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 68998-----89998999850689759999999999859 No 21 >PRK03592 haloalkane dehalogenase; Provisional Probab=99.86 E-value=1.2e-19 Score=138.27 Aligned_cols=188 Identities=20% Similarity=0.296 Sum_probs=128.2 Q ss_pred CCEEEECCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCC-CCCCC Q ss_conf 97089658996187999668887787999957899888899899999999999973984999853155887677-85210 Q gi|254781093|r 1 MPEVVFNGPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGE-FDYGD 79 (225) Q Consensus 1 m~~v~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~-~~~g~ 79 (225) |..|-++| +.-.|...+. .+.+|++|++| ++.. ....+...|++.+ .|+++|.||+|+|+.. .++.. T Consensus 9 ~~~v~v~G----~~l~y~~~G~--G~plvllHG~~---~~~~--~W~~~~~~L~~~~-rvia~Dl~G~G~S~~p~~~y~~ 76 (294) T PRK03592 9 MKRVEILG----SRMAYIETGQ--GDPIVFLHGNP---TSSY--LWRNIIPHLAGLG-RCLAPDLIGMGASDKPDIDYTF 76 (294) T ss_pred CCEEEECC----EEEEEEEECC--CCEEEEECCCC---CCHH--HHHHHHHHHHHCC-EEEEECCCCCCCCCCCCCCCCH T ss_conf 87599999----8999999846--88089989999---8789--9999999983098-8999738999988899998889 Q ss_pred -HHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCC-------------------- Q ss_conf -23899999899998623546740588872036878764503862-13301227574433-------------------- Q gi|254781093|r 80 -GELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSY-------------------- 137 (225) Q Consensus 80 -~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~-------------------- 137 (225) .-.+|+.++++.+ + ..++.|+||||||.+|+.+|.++|+ +++++++.+..... T Consensus 77 ~~~a~dl~~ll~~L---g--~~~~~lvGhS~Gg~va~~~A~~~P~rv~~lvl~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (294) T PRK03592 77 ADHRRYLDAFFDAL---D--LRNVVLVLHDWGSALGFDWARRHPDRVRGIAFMEAIVRPMTWADFPPAVRELFRALRSPG 151 (294) T ss_pred HHHHHHHHHHHHHC---C--CCCEEEEEECCCCHHHHHHHHHCCCCCCCEEEECCCCCCCCHHHCCHHHHHHHHHHHCCC T ss_conf 99999999999976---9--998799986888499999999684334626897156787553430778999999964877 Q ss_pred -----------------------CC---------------------------------------------CCCCCCCCCE Q ss_conf -----------------------32---------------------------------------------2234334773 Q gi|254781093|r 138 -----------------------DF---------------------------------------------SFLAPCPSSG 149 (225) Q Consensus 138 -----------------------~~---------------------------------------------~~l~~~~~p~ 149 (225) .+ ..+.....|+ T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~P~ 231 (294) T PRK03592 152 AGERMVLEQNVFVEKVLPGSILRKLSDEEMAVYRRPFPTPESRRPTLSWPRELPIDGEPADVVALIEEYAQWLSASDQPK 231 (294) T ss_pred CCHHHHHHHHHHHHHHCHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCE T ss_conf 20678876657898754043203589999999870048867789999999856336880578999999999874379987 Q ss_pred EEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHHHH Q ss_conf 776168655028999999999986077996079997879847558-5899999999999973 Q gi|254781093|r 150 LIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYLDNS 210 (225) Q Consensus 150 LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl~~~ 210 (225) |+|+|++|..++++..+++.+.++.. +++++++++|+.+. +-+++.+.|.+||++. T Consensus 232 L~i~G~~d~~~~~~~~~~~~~~~p~~-----~~~~i~~aGH~~~~E~Pe~v~~~i~~FL~~l 288 (294) T PRK03592 232 LLINAEPGAILTTGRIRDFARSWPNQ-----TEITVPGGLHFLQEDSPDEIGAAIAAWLRRL 288 (294) T ss_pred EEEECCCCCCCCHHHHHHHHHHCCCC-----EEEEECCCCCCHHHCCHHHHHHHHHHHHHHC T ss_conf 99970788766999999999868998-----7999899822227529999999999999746 No 22 >KOG2100 consensus Probab=99.84 E-value=1.8e-19 Score=137.20 Aligned_cols=201 Identities=21% Similarity=0.317 Sum_probs=143.7 Q ss_pred EEEEEEECCC----CCCCCEEEEECCCCCCCCCCCCHHHHHHHH-HHHHCCCEEEEEEECCCCCCCCCCC------CCCH Q ss_conf 1879996688----877879999578998888998999999999-9997398499985315588767785------2102 Q gi|254781093|r 12 RLEGRYQPST----NPNAPIALILHPHPRFGGTMNDNIVYQLFY-LFQQRGFVSLRFNFRGIGRSEGEFD------YGDG 80 (225) Q Consensus 12 ~l~~~~~~~~----~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~-~l~~~G~~vl~fd~RG~G~S~G~~~------~g~~ 80 (225) .+.+.+..|. ...=|+++..|+-|. .....++...+... .+...|++|+.+|+||.|.....|- .|.- T Consensus 509 ~~~~~~~lP~~~~~~~kyPllv~~yGGP~-sq~v~~~~~~~~~~~~~s~~g~~v~~vd~RGs~~~G~~~~~~~~~~lG~~ 587 (755) T KOG2100 509 TANAILILPPNFDPSKKYPLLVVVYGGPG-SQSVTSKFSVDWNEVVVSSRGFAVLQVDGRGSGGYGWDFRSALPRNLGDV 587 (755) T ss_pred EEEEEEECCCCCCCCCCCCEEEEECCCCC-CCCCCCEEEECCHHEEEECCCEEEEEECCCCCCCCCHHHHHHHHHHCCCC T ss_conf 38999946899888787778998268987-53123257633231587346669999847776775145788878755882 Q ss_pred HHHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCC-CH-HEEEECCCCCCCCCC------------------ Q ss_conf 389999989999862-354674058887203687876450386-21-330122757443332------------------ Q gi|254781093|r 81 ELSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRP-EI-NGFISVAPQPKSYDF------------------ 139 (225) Q Consensus 81 E~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p-~i-~~~v~isp~~~~~~~------------------ 139 (225) |+.|...++.++.+. ..|.++|+++|+|+||+++++++..++ .+ ++.++++|..+..-+ T Consensus 588 ev~D~~~~~~~~~~~~~iD~~ri~i~GwSyGGy~t~~~l~~~~~~~fkcgvavaPVtd~~~yds~~terymg~p~~~~~~ 667 (755) T KOG2100 588 EVKDQIEAVKKVLKLPFIDRSRVAIWGWSYGGYLTLKLLESDPGDVFKCGVAVAPVTDWLYYDSTYTERYMGLPSENDKG 667 (755) T ss_pred CHHHHHHHHHHHHHCCCCCHHHEEEEEECHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEECCCCCEECCCCCCCCCCC T ss_conf 11548999999985465355883898216088999998740844446762464141424740430201016898622321 Q ss_pred -------CCCCCCCCC-EEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCH--HHHHHHHHHHHHH Q ss_conf -------223433477-377616865502899999999998607799607999787984755858--9999999999997 Q gi|254781093|r 140 -------SFLAPCPSS-GLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKV--DELINECAHYLDN 209 (225) Q Consensus 140 -------~~l~~~~~p-~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~--~~l~~~i~~fl~~ 209 (225) ........| .|++||+.|+.|+.+++..++++|.. ++++.+..++|+.+|.++.+. ..+...+..|+.. T Consensus 668 y~e~~~~~~~~~~~~~~~LliHGt~DdnVh~q~s~~~~~aL~~-~gv~~~~~vypde~H~is~~~~~~~~~~~~~~~~~~ 746 (755) T KOG2100 668 YEESSVSSPANNIKTPKLLLIHGTEDDNVHFQQSAILIKALQN-AGVPFRLLVYPDENHGISYVEVISHLYEKLDRFLRD 746 (755) T ss_pred HHHCCCCCHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHH-CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 2112410012311466479997466766584669999999997-699448997389876643454058999999999998 Q ss_pred HCCCC Q ss_conf 31221 Q gi|254781093|r 210 SLDEK 214 (225) Q Consensus 210 ~L~~~ 214 (225) ++... T Consensus 747 ~~~~~ 751 (755) T KOG2100 747 CFGSP 751 (755) T ss_pred HCCCC T ss_conf 65888 No 23 >COG0429 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only] Probab=99.83 E-value=1.3e-18 Score=131.99 Aligned_cols=201 Identities=21% Similarity=0.355 Sum_probs=141.4 Q ss_pred EECCCCC--EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC--CCCH Q ss_conf 9658996--18799966888778799995789988889989999999999997398499985315588767785--2102 Q gi|254781093|r 5 VFNGPSG--RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD--YGDG 80 (225) Q Consensus 5 ~i~g~~G--~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~--~g~~ 80 (225) .+..|+| .+-+|..+|.....|.+|++| +..|+.+++.+..+++.+.++||.++.||+||++++--+-. +-.| T Consensus 53 ~v~~pdg~~~~ldw~~~p~~~~~P~vVl~H---GL~G~s~s~y~r~L~~~~~~rg~~~Vv~~~Rgcs~~~n~~p~~yh~G 129 (345) T COG0429 53 RLETPDGGFIDLDWSEDPRAAKKPLVVLFH---GLEGSSNSPYARGLMRALSRRGWLVVVFHFRGCSGEANTSPRLYHSG 129 (345) T ss_pred EEECCCCCEEEEEECCCCCCCCCCEEEEEE---CCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCEECCC T ss_conf 997499997777631486445785499994---25788867899999999996688099986534568766676144165 Q ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHC---CCCHHEEEECCCCCCC--------------------- Q ss_conf 3899999899998623546740588872036878764503---8621330122757443--------------------- Q gi|254781093|r 81 ELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR---RPEINGFISVAPQPKS--------------------- 136 (225) Q Consensus 81 E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~---~p~i~~~v~isp~~~~--------------------- 136 (225) +.+|+...++|++++++. ++++.+|+|+||.+-+.+.++ ++.+.+.+.+|-|... T Consensus 130 ~t~D~~~~l~~l~~~~~~-r~~~avG~SLGgnmLa~ylgeeg~d~~~~aa~~vs~P~Dl~~~~~~l~~~~s~~ly~r~l~ 208 (345) T COG0429 130 ETEDIRFFLDWLKARFPP-RPLYAVGFSLGGNMLANYLGEEGDDLPLDAAVAVSAPFDLEACAYRLDSGFSLRLYSRYLL 208 (345) T ss_pred CHHHHHHHHHHHHHHCCC-CCEEEEEECCCHHHHHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 126899999999985799-9669999553399999988761567652101465078889987877518635541399999 Q ss_pred --------------------------------CCCC------------------------CCCCCCCCEEEEECCCCCCC Q ss_conf --------------------------------3322------------------------23433477377616865502 Q gi|254781093|r 137 --------------------------------YDFS------------------------FLAPCPSSGLIINGSNDTVA 160 (225) Q Consensus 137 --------------------------------~~~~------------------------~l~~~~~p~LiIhG~~D~~v 160 (225) ++|+ .|....+|+||||..+|+++ T Consensus 209 ~~L~~~~~~kl~~l~~~~p~~~~~~ik~~~ti~eFD~~~Tap~~Gf~da~dYYr~aSs~~~L~~Ir~PtLii~A~DDP~~ 288 (345) T COG0429 209 RNLKRNAARKLKELEPSLPGTVLAAIKRCRTIREFDDLLTAPLHGFADAEDYYRQASSLPLLPKIRKPTLIINAKDDPFM 288 (345) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHCHHHHCCCEEEECCCCCCCHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCC T ss_conf 99999999888753753671799999862468750330322025888589999855542013566653489864788987 Q ss_pred CHHHHHHHHHHHHHCCCCCEEEEEECCCCC-CC-CCCH--H--HHHHHHHHHHHHHCCC Q ss_conf 899999999998607799607999787984-75-5858--9--9999999999973122 Q gi|254781093|r 161 TTSDVKDLVNKLMNQKGISITHKVIPDANH-FF-IGKV--D--ELINECAHYLDNSLDE 213 (225) Q Consensus 161 p~~~~~~~~~~l~~~~~~~~~~~~i~ga~H-~f-~~~~--~--~l~~~i~~fl~~~L~~ 213 (225) +++..-+.-.. ....+++.+-+-++| .| .++. . =+.+.+.+||+..+.. T Consensus 289 ~~~~iP~~~~~----~np~v~l~~t~~GGHvGfl~~~~~~~~~W~~~ri~~~l~~~~~~ 343 (345) T COG0429 289 PPEVIPKLQEM----LNPNVLLQLTEHGGHVGFLGGKLLHPQMWLEQRILDWLDPFLEA 343 (345) T ss_pred CHHHCCCCHHC----CCCCEEEEEECCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 84548862002----79864788506785578526854240666999999999878751 No 24 >TIGR03101 hydr2_PEP hydrolase, ortholog 2, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily. Probab=99.83 E-value=1.2e-18 Score=132.29 Aligned_cols=127 Identities=20% Similarity=0.319 Sum_probs=107.0 Q ss_pred EEECCCCCEEEEEEECCCCC-CCCEEEEECCCCCCCC--CCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCH Q ss_conf 89658996187999668887-7879999578998888--99899999999999973984999853155887677852102 Q gi|254781093|r 4 VVFNGPSGRLEGRYQPSTNP-NAPIALILHPHPRFGG--TMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDG 80 (225) Q Consensus 4 v~i~g~~G~l~~~~~~~~~~-~~~~vv~~Hp~p~~GG--~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~ 80 (225) .|++++.|.+-|+|++|.+. +++++|+|||+ |. |...+.+..+|+.|++.||.||||||+|+|+|.|+|....= T Consensus 3 ffl~~~~g~~F~~~h~P~~~~~~g~vl~~~PF---aeE~n~srRm~a~~A~~laa~G~~vLrfD~~G~GDSaGdf~~a~w 79 (266) T TIGR03101 3 FFLDAPHGFRFCLYHPPVAVGPRGVVIYLPPF---AEEMNKSRRMVALQARAFAAGGFGVLQIDLYGCGDSAGDFAAARW 79 (266) T ss_pred EECCCCCCCEEEEEECCCCCCCCCEEEEECCC---HHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCHH T ss_conf 02058998658997379888985079993840---765314489999999999867974996244677786455540228 Q ss_pred H--HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEECCCCCC Q ss_conf 3--89999989999862354674058887203687876450386-2133012275744 Q gi|254781093|r 81 E--LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP-EINGFISVAPQPK 135 (225) Q Consensus 81 E--~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~isp~~~ 135 (225) | .+|+.++.+|+++.+ ..++.|+|...|+..|+.+|.+.+ .++++|++.|..+ T Consensus 80 e~W~~Dv~aA~~~L~~~g--~~~v~LwGLRlGAlLA~~aA~~~~~~v~~LVLWqPV~~ 135 (266) T TIGR03101 80 DVWKEDVAAAYRWLIEQG--HPPVTLWGLRLGALLALDAANPLAAKCNRLVLWQPVVS 135 (266) T ss_pred HHHHHHHHHHHHHHHHCC--CCCEEEEEHHHHHHHHHHHHHHCCCCCCEEEEEECCCC T ss_conf 889999999999998649--99568984004089999998644567673899702357 No 25 >PRK11460 esterase YpfH; Provisional Probab=99.83 E-value=2e-18 Score=130.98 Aligned_cols=188 Identities=18% Similarity=0.254 Sum_probs=128.4 Q ss_pred EEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHC--CCEEEEEEEC-CCCCCCCC--CCC-C---CHHHHHHH Q ss_conf 99668887787999957899888899899999999999973--9849998531-55887677--852-1---02389999 Q gi|254781093|r 16 RYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQR--GFVSLRFNFR-GIGRSEGE--FDY-G---DGELSDAA 86 (225) Q Consensus 16 ~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~--G~~vl~fd~R-G~G~S~G~--~~~-g---~~E~~D~~ 86 (225) .+..|+++.+.++|++| ++|++.+| +..+++.|... ....+..|-- .++...|. |+- + ....+++. T Consensus 7 ~~~~p~~~~~~LvILLH---G~Gadg~D--L~~La~~l~~~lP~a~fv~~~ap~~~~~~~G~qWF~l~~~~~~~~~~~~~ 81 (230) T PRK11460 7 VVQSPDKPAQQLLLLFH---GVGDNPVA--MGEIGSWFAPAFPDALVVSVGGAEPSGNPAGRQWFSVQGITEDNRQARVA 81 (230) T ss_pred EECCCCCCHHHEEEEEC---CCCCCHHH--HHHHHHHHHHHCCCCEEECCCCCCCCCCCCHHCCCCCCCCCHHHHHHHHH T ss_conf 85699988245799982---67988777--99999988865899889579997567896430030268998567888999 Q ss_pred H----HHHHHH----HHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCCCCCCCCCCCCEEEEECCCC Q ss_conf 9----899998----623546740588872036878764503862-1330122757443332223433477377616865 Q gi|254781093|r 87 A----ALDWVQ----SLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYDFSFLAPCPSSGLIINGSND 157 (225) Q Consensus 87 a----a~~~l~----~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~~~~l~~~~~p~LiIhG~~D 157 (225) + ..++++ +.+.+.++++|+|||.|+.+++.++.++++ +.++|+.|+.... .........|++++||+.| T Consensus 82 ~~~~~l~~~i~~~~~~~gl~~~~i~L~GFSQGamMsL~~~l~~~~~~~giv~~SG~l~~--l~~~~~~~~pV~LiHG~~D 159 (230) T PRK11460 82 AIMPTFIETVRYWQQQSGVGASATALIGFSQGAIMALESVKAEPGLAGRVIAFSGRYAS--LPETAPTATTIHLIHGGED 159 (230) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHEEEEEECHHHHHHHHHHHCCCCCCCEEEEECCCCCC--CCCCCCCCCCEEEEECCCC T ss_conf 99999999999999983999778899997503699998886186646569996553256--6500156898899847999 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 502899999999998607799607999787984755858999999999999731221 Q gi|254781093|r 158 TVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSLDEK 214 (225) Q Consensus 158 ~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~~~ 214 (225) ++||++.+++-.+.+.+. +.++++++++|.+|.-. ++-.+...+||+++|++. T Consensus 160 ~VVP~~~~~~A~~~L~~~-G~~v~~~~~~glGH~I~---~egl~~a~~FLk~~lp~~ 212 (230) T PRK11460 160 PVIDVAHAVAAQEALIAL-GGDVTLDIAEDLGHAID---PRLMQFALDRLRYTVPKR 212 (230) T ss_pred CCCCHHHHHHHHHHHHHC-CCCEEEEECCCCCCCCC---HHHHHHHHHHHHHHCCHH T ss_conf 863889999999999987-99279986289999889---999999999998737286 No 26 >PRK03204 haloalkane dehalogenase; Provisional Probab=99.82 E-value=6.4e-19 Score=133.91 Aligned_cols=183 Identities=20% Similarity=0.268 Sum_probs=119.7 Q ss_pred EECCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC--CCCCH-H Q ss_conf 96589961879996688877879999578998888998999999999999739849998531558876778--52102-3 Q gi|254781093|r 5 VFNGPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF--DYGDG-E 81 (225) Q Consensus 5 ~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~--~~g~~-E 81 (225) +|+..+|||. |...+ ..|.+|++|++| ++. .....+...|++. |.|+.+|.||+|.|+..- ++... - T Consensus 18 ~~~~~g~rlh--Y~~~G--~GppvvLlHG~~---~~~--~~w~~~~~~La~~-~rVIa~Dl~G~G~S~~P~~~~y~~~~~ 87 (286) T PRK03204 18 WFDSSRGRIH--YVDEG--TGPPILLCHGNP---TWS--FLYRDIIVALRDR-FRCVAPDYLGFGLSERPSGFGYQIDEH 87 (286) T ss_pred EEEECCCEEE--EEEEC--CCCEEEEECCCC---CCH--HHHHHHHHHHHCC-CEEEEECCCCCCCCCCCCCCCCCHHHH T ss_conf 5986991899--99974--587699989999---767--7999999998319-689997089989888999888689999 Q ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCC---CC------------CC------- Q ss_conf 899999899998623546740588872036878764503862-13301227574---43------------33------- Q gi|254781093|r 82 LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQP---KS------------YD------- 138 (225) Q Consensus 82 ~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~---~~------------~~------- 138 (225) .+|+.+.++. .+ ..++.|+||||||.+|+.+|.++|+ |+++|++++.. .. .. T Consensus 88 a~~l~~lld~---Lg--l~~~~lvGhs~GG~va~~~A~~~P~rV~~Lvl~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 162 (286) T PRK03204 88 ARVIGEFVDH---LG--LDRYLSMGQDWGGPISMAVAVERADRVRGVVLGNTWFWPADTLAMKAFSRVMSSPPVQYAILR 162 (286) T ss_pred HHHHHHHHHH---HC--CCCEEEEEECCCHHHHHHHHHHCCHHHEEEEEECCCCCCCCHHHHHHHHHHHHCCHHHHHHHH T ss_conf 9999999997---48--997499997377899999999693540058996576687623567899998606567899985 Q ss_pred ----------------C--------CC------------------------CC----C-----CCCCEEEEECCCCCCCC Q ss_conf ----------------2--------22------------------------34----3-----34773776168655028 Q gi|254781093|r 139 ----------------F--------SF------------------------LA----P-----CPSSGLIINGSNDTVAT 161 (225) Q Consensus 139 ----------------~--------~~------------------------l~----~-----~~~p~LiIhG~~D~~vp 161 (225) . .. +. . ...|+|+|+|++|..++ T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~PtLviwG~~D~~~~ 242 (286) T PRK03204 163 RNFFVERLIPAGTEHRPSSAVMAHYRAVQPNAAARRGVAEMPKQILAARPLLARLAREVPATLGTKPTLLIWGMKDVAFR 242 (286) T ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCC T ss_conf 65689987551024588699999986107885778999996888874357899998865765389997999938998789 Q ss_pred HHHH-HHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHH Q ss_conf 9999-99999986077996079997879847558-5899999999999 Q gi|254781093|r 162 TSDV-KDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYL 207 (225) Q Consensus 162 ~~~~-~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl 207 (225) ++.. ..+.+.++ +.+++++|+++|+-.. +-+++.+.|.+|+ T Consensus 243 p~~~~~~~~~~~p-----~a~lv~ip~aGH~~~~E~Pe~va~aIleff 285 (286) T PRK03204 243 PKTIIPRLSATFP-----DHVLVELPNAKHFIQEDAPDRIAAAIIERF 285 (286) T ss_pred CHHHHHHHHHHCC-----CCEEEEECCCCCCHHHCCHHHHHHHHHHHC T ss_conf 3899999999789-----978999799986278759999999999865 No 27 >COG1647 Esterase/lipase [General function prediction only] Probab=99.82 E-value=4.1e-19 Score=135.10 Aligned_cols=172 Identities=16% Similarity=0.243 Sum_probs=135.4 Q ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC-CC-CHHHHHHHHHHHHHHHHCCCCCCEEE Q ss_conf 99995789988889989999999999997398499985315588767785-21-02389999989999862354674058 Q gi|254781093|r 27 IALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD-YG-DGELSDAAAALDWVQSLNPESKSCWI 104 (225) Q Consensus 27 ~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~-~g-~~E~~D~~aa~~~l~~~~~~~~~i~l 104 (225) .|+++| +|.||- +.+..|++.|.+.||.|..++|+|||..--.|= .+ .--.+|+.++.+++.+++ .+.|.+ T Consensus 17 AVLllH---GFTGt~--~Dvr~Lgr~L~e~GyTv~aP~ypGHG~~~e~fl~t~~~DW~~~v~d~Y~~L~~~g--y~eI~v 89 (243) T COG1647 17 AVLLLH---GFTGTP--RDVRMLGRYLNENGYTVYAPRYPGHGTLPEDFLKTTPRDWWEDVEDGYRDLKEAG--YDEIAV 89 (243) T ss_pred EEEEEE---CCCCCC--HHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHCC--CCEEEE T ss_conf 999983---368980--8899999999878955746888999999899850799999999999999998757--985899 Q ss_pred EEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCCC-------------------------------------------- Q ss_conf 887203687876450386213301227574433322-------------------------------------------- Q gi|254781093|r 105 AGYSFGAWISMQLLMRRPEINGFISVAPQPKSYDFS-------------------------------------------- 140 (225) Q Consensus 105 ~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~~-------------------------------------------- 140 (225) +|.||||..|+.+|...| +++++..+++.+..+.. T Consensus 90 ~GlSmGGv~alkla~~~p-~K~iv~m~a~~~~k~~~~iie~~l~y~~~~kk~e~k~~e~~~~e~~~~~~~~~~~~~~~~~ 168 (243) T COG1647 90 VGLSMGGVFALKLAYHYP-PKKIVPMCAPVNVKSWRIIIEGLLEYFRNAKKYEGKDQEQIDKEMKSYKDTPMTTTAQLKK 168 (243) T ss_pred EEECCHHHHHHHHHHHCC-CCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 851203699999985379-5523640477665541034578999999856514889899999999841360779999999 Q ss_pred -------CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC--CHHHHHHHHHHHHHH Q ss_conf -------234334773776168655028999999999986077996079997879847558--589999999999997 Q gi|254781093|r 141 -------FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG--KVDELINECAHYLDN 209 (225) Q Consensus 141 -------~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~--~~~~l~~~i~~fl~~ 209 (225) .+.....|+++++|.+|+.||.+.+.-+++....- +.++..+++++|..+. ..+++.+.|..||+. T Consensus 169 ~i~~~~~~~~~I~~pt~vvq~~~D~mv~~~sA~~Iy~~v~s~---~KeL~~~e~SgHVIt~D~Erd~v~e~V~~FL~~ 243 (243) T COG1647 169 LIKDARRSLDKIYSPTLVVQGRQDEMVPAESANFIYDHVESD---DKELKWLEGSGHVITLDKERDQVEEDVITFLEK 243 (243) T ss_pred HHHHHHHHHHHCCCCHHHEECCCCCCCCHHHHHHHHHHCCCC---CCEEEEECCCCCEEECCHHHHHHHHHHHHHHHC T ss_conf 999987436563014420441568888878989999830587---502678726881654526589999999998609 No 28 >pfam02230 Abhydrolase_2 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561. Probab=99.81 E-value=7.9e-18 Score=127.37 Aligned_cols=186 Identities=17% Similarity=0.215 Sum_probs=123.7 Q ss_pred EECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHH-CCCEEEEEEE--CCC---CCCCC-C-CC------CCCHHH Q ss_conf 966888778799995789988889989999999999997-3984999853--155---88767-7-85------210238 Q gi|254781093|r 17 YQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQ-RGFVSLRFNF--RGI---GRSEG-E-FD------YGDGEL 82 (225) Q Consensus 17 ~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~-~G~~vl~fd~--RG~---G~S~G-~-~~------~g~~E~ 82 (225) +.+|..+.++++|++| ++|+|.++ +..+++.+.. -++..+.++- +-. ++..+ . |+ .+.... T Consensus 3 v~~p~~p~~~llilLH---G~G~n~~~--l~~l~~~~~~~p~~~~i~p~AP~~~~~~~~g~~~~~Wf~~~~~~~~~~~~~ 77 (213) T pfam02230 3 IESPAKPATALVIFLH---GLGDDGHG--WAFAAKCMAPLPNTKFIFPHAPERPVTLNGGRRMPAWFDLVGLSPNGSEDE 77 (213) T ss_pred CCCCCCCCCCEEEEEE---CCCCCHHH--HHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCEEECCCCCCCCCCCCH T ss_conf 1287999887899997---89999778--899898750389988997069986754578976451443546795201179 Q ss_pred HHHHHH-------HHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCC-C--CCCCCCCCCEEE Q ss_conf 999998-------99998623546740588872036878764503862-133012275744333-2--223433477377 Q gi|254781093|r 83 SDAAAA-------LDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYD-F--SFLAPCPSSGLI 151 (225) Q Consensus 83 ~D~~aa-------~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~-~--~~l~~~~~p~Li 151 (225) +++... ++...+.+.+.++|+|+|||+||.+|+.++.+.|+ +.+++++|+...... + ........|+++ T Consensus 78 ~~~~~s~~~i~~~i~~~~~~~i~~~ri~l~GFSQGa~~a~~~~l~~~~~~~g~i~lsG~l~~~~~~~~~~~~~~~~pi~~ 157 (213) T pfam02230 78 AGIKNSAELIEELIDAEQKTGIPSSRIILGGFSQGGAVALYTALTLPQPLAGIIALSGALPLPPKFPQHPTALADIPILL 157 (213) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHEEEEEECHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHHHHCCCCCEEE T ss_conf 99999999999999999981999302799998877899999998094013989997488789621234455425997894 Q ss_pred EECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 616865502899999999998607799607999787984755858999999999999731 Q gi|254781093|r 152 INGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSL 211 (225) Q Consensus 152 IhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L 211 (225) .||++|++||.+.+++.++.+.. .+.+++++++++.+|..+ .+-.+.+.+||+++| T Consensus 158 ~HG~~D~vvp~~~~~~~~~~L~~-~g~~v~~~~y~~~gH~I~---~~ei~~~~~fL~k~L 213 (213) T pfam02230 158 IHGTEDPVVPLALGKLAKEYLKT-LGNTVEFHEYPGMGHSIC---PQEMQDIKSFLSKHL 213 (213) T ss_pred ECCCCCCCCCHHHHHHHHHHHHH-CCCCEEEEEECCCCCCCC---HHHHHHHHHHHHHHC T ss_conf 15899982179999999999997-799869999799999689---999999999999659 No 29 >PRK11126 acyl-CoA thioester hydrolase YfbB; Provisional Probab=99.80 E-value=3.1e-18 Score=129.79 Aligned_cols=164 Identities=21% Similarity=0.275 Sum_probs=106.1 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHH-HHHHHHHHCCCCCCE Q ss_conf 7879999578998888998999999999999739849998531558876778521023899999-899998623546740 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAA-ALDWVQSLNPESKSC 102 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~a-a~~~l~~~~~~~~~i 102 (225) +.|++|++| ++||+..+ ....+..| .+|.|+.+|.||+|+|+.....+ .++..+ +.+.+...+ .+++ T Consensus 1 G~P~lVflH---G~~~~~~~--W~~~~~~L--~~~~via~DlpGhG~S~~~~~~~---~~~~a~~l~~~l~~l~--~~~~ 68 (242) T PRK11126 1 GLPWLVFLH---GLLGSGQD--WQPVGEAL--PDYPRLYIDLPGHGGSAAISVDG---FADVSQLLSQTLVSYN--ILPF 68 (242) T ss_pred CCCEEEEEC---CCCCCHHH--HHHHHHHC--CCCCEEEEECCCCCCCCCCCCCC---HHHHHHHHHHHHHHCC--CCCE T ss_conf 998799938---87779799--99999977--99937998797989998989999---9999999999999859--9975 Q ss_pred EEEEECHHHHHHHHHHHCCC--CHHEEEECCCCCCC-------------------C----------------CC------ Q ss_conf 58887203687876450386--21330122757443-------------------3----------------32------ Q gi|254781093|r 103 WIAGYSFGAWISMQLLMRRP--EINGFISVAPQPKS-------------------Y----------------DF------ 139 (225) Q Consensus 103 ~l~G~S~Gg~val~~a~~~p--~i~~~v~isp~~~~-------------------~----------------~~------ 139 (225) .|+||||||.+|+.+|.+.+ .++++++.+..+.. + -+ T Consensus 69 ~LvGhSmGG~vAl~~A~~~~~~~~~~lil~~~~~gl~~~~~~~~r~~~~~~~a~~~~~~~~~~~l~~~~~~~~f~~l~~~ 148 (242) T PRK11126 69 WLVGYSLGGRIAMYYACQGQLPGLCGLIVEGGNPGLQNAEERQARWQSDRQWAQRFRQEPLEQVLADWYQQPVFASLNAE 148 (242) T ss_pred EEEEECHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHH T ss_conf 99986779999999997459700256674268868799899999997069999999863099999999871411579999 Q ss_pred ---------------------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf ---------------------------------22343347737761686550289999999999860779960799978 Q gi|254781093|r 140 ---------------------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIP 186 (225) Q Consensus 140 ---------------------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ 186 (225) ..+.....|+|+|+|++|... +++++. . ..++++|| T Consensus 149 ~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~P~L~i~Ge~D~~~-----~~la~~----~--~~~~~~ip 217 (242) T PRK11126 149 QRQQLVAKRSNNNGAAVAAMLEATSLAKQPDLLPALQALTFPFYYLCGERDSKF-----RALAQE----S--ALPLHVIP 217 (242) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHCCCCEEEEECCCCHHH-----HHHHHH----C--CCCEEEEC T ss_conf 999999998723679999999974321288779999748999799980897588-----999997----6--99988979 Q ss_pred CCCCCCC-CCHHHHHHHHHHHHHHH Q ss_conf 7984755-85899999999999973 Q gi|254781093|r 187 DANHFFI-GKVDELINECAHYLDNS 210 (225) Q Consensus 187 ga~H~f~-~~~~~l~~~i~~fl~~~ 210 (225) +++|.-+ ++-+++.+.+.+||+.+ T Consensus 218 ~aGH~~~~E~P~~f~~~i~~FL~~~ 242 (242) T PRK11126 218 RAGHNAHRENPAAFAASLAQILRLI 242 (242) T ss_pred CCCCCHHHHCHHHHHHHHHHHHHHC T ss_conf 9999128879999999999997109 No 30 >pfam05448 AXE1 Acetyl xylan esterase (AXE1). This family consists of several bacterial acetyl xylan esterase proteins. Acetyl xylan esterases are enzymes that hydrolyse the ester linkages of the acetyl groups in position 2 and/or 3 of the xylose moieties of natural acetylated xylan from hardwood. These enzymes are one of the accessory enzymes which are part of the xylanolytic system, together with xylanases, beta-xylosidases, alpha-arabinofuranosidases and methylglucuronidases; these are all required for the complete hydrolysis of xylan. Probab=99.79 E-value=1.2e-17 Score=126.37 Aligned_cols=195 Identities=20% Similarity=0.252 Sum_probs=140.2 Q ss_pred EEEECCCCC-EEEEEEECCCC--CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC---- Q ss_conf 089658996-18799966888--77879999578998888998999999999999739849998531558876778---- Q gi|254781093|r 3 EVVFNGPSG-RLEGRYQPSTN--PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF---- 75 (225) Q Consensus 3 ~v~i~g~~G-~l~~~~~~~~~--~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~---- 75 (225) +|.|++.+| ++.|++..|.+ .+.|++|.+|+ +||...... . ...++..||++|.+|.||.|....+. T Consensus 58 dv~f~~~~g~~I~G~l~~P~~~~~k~P~vv~~hG---yg~~~~~~~-~--~~~~a~~G~avl~~d~RGqgg~~~d~~~~~ 131 (320) T pfam05448 58 DLTFTGFGGARIKAWLVLPKHSEGKHPALVEFHG---YNGGRGDWH-D--NLHWAAAGYAVFAMDVRGQGGLSEDPGGVT 131 (320) T ss_pred EEEEECCCCCEEEEEEEECCCCCCCCCEEEEECC---CCCCCCCHH-H--CCCHHCCCCEEEEECCCCCCCCCCCCCCCC T ss_conf 9999617997699999606888898008999768---888878711-2--010103896899857888887777755557 Q ss_pred --------CCCC---HH-------HHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCC Q ss_conf --------5210---23-------89999989999862-35467405888720368787645038621330122757443 Q gi|254781093|r 76 --------DYGD---GE-------LSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKS 136 (225) Q Consensus 76 --------~~g~---~E-------~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~ 136 (225) ..|. -| ..|+.-++++++++ ..|.++|++.|.|+||.+|+.+|+-+|+|++.++..|.... T Consensus 132 ~~~~~g~~~~G~~d~~e~~yyr~~~ld~~RAvd~l~~lp~vD~~ri~v~G~SQGG~lala~AaL~~rv~~~~a~vP~l~D 211 (320) T pfam05448 132 GPTVKGHMTRGLLDGPDHYYYRRVFLDAVRAVEIVMSLPEVDEERIGVYGASQGGALALAAAALSPRIKKVVADYPFLSD 211 (320) T ss_pred CCCCCCHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHCCCCCCEEEECCCCCCC T ss_conf 86666244428778767878999998899999999818875701179997770479899787528410579868884135 Q ss_pred C-----------CC-------C-----------------------CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHC Q ss_conf 3-----------32-------2-----------------------23433477377616865502899999999998607 Q gi|254781093|r 137 Y-----------DF-------S-----------------------FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQ 175 (225) Q Consensus 137 ~-----------~~-------~-----------------------~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~ 175 (225) + .+ . +-.....|+++..|-.|++||++.....++++..+ T Consensus 212 ~~~~~~~~~~~~~y~e~~~y~~~~~~~~~~~~~~~~tL~YfD~~nfA~ri~~pv~~~~gl~D~vcPPst~fAayN~i~~~ 291 (320) T pfam05448 212 FKRAVELDLAEEPYDEIFRYFKFFDPHHEREEEVFRTLSYIDIKNLAHRIKGPVLMSVGLMDDVCPPSTVFAAYNHLTTE 291 (320) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHCCCCC T ss_conf 35464247788817889988740487642189998402033688789760776899840478988982249987155997 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 7996079997879847558589999999999997 Q gi|254781093|r 176 KGISITHKVIPDANHFFIGKVDELINECAHYLDN 209 (225) Q Consensus 176 ~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~ 209 (225) | ++.++|..+|..-. ..-.+...+||.+ T Consensus 292 K----~l~vyp~~gHe~~~--~~~~~~~~~~l~~ 319 (320) T pfam05448 292 K----EIKVYPEYGHEGGP--GFQEDQQYKFLSR 319 (320) T ss_pred E----EEEECCCCCCCCCH--HHHHHHHHHHHHC T ss_conf 3----79965777877725--6769999999862 No 31 >pfam02129 Peptidase_S15 X-Pro dipeptidyl-peptidase (S15 family). Probab=99.76 E-value=1e-16 Score=120.65 Aligned_cols=179 Identities=19% Similarity=0.245 Sum_probs=120.1 Q ss_pred CC-EEEEEEECC--CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCC-CCHHHHHH Q ss_conf 96-187999668--887787999957899888899899999999999973984999853155887677852-10238999 Q gi|254781093|r 10 SG-RLEGRYQPS--TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDY-GDGELSDA 85 (225) Q Consensus 10 ~G-~l~~~~~~~--~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~-g~~E~~D~ 85 (225) || +|.+-+..| .+.+-|++|.-.|+..-...........-...|+++||+|+..|.||.|.|+|.|+. +..|.+|. T Consensus 1 DGv~L~advy~P~~~~~~~P~il~rtpY~r~~~~~~~~~~~~~~~~~a~~GYavV~qD~RG~g~S~G~~~~~~~~E~~Dg 80 (265) T pfam02129 1 DGVRLAADIYRPAAGGGPVPVLLTRSPYGKRDPGASTLALAHPEWEFAARGYAVVVQDVRGTGGSEGVFTVGGPQEVADG 80 (265) T ss_pred CCCEEEEEEEEECCCCCCEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHCCEEEEEECCCCCCCCCCCCCCCCHHHHHHH T ss_conf 98889999997789999851899965878988766542223545779839908999877868899985036982577789 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEECCCCCCCCCC------------------------- Q ss_conf 9989999862354674058887203687876450386-21330122757443332------------------------- Q gi|254781093|r 86 AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP-EINGFISVAPQPKSYDF------------------------- 139 (225) Q Consensus 86 ~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~isp~~~~~~~------------------------- 139 (225) .++++|+.+|--...+|.+.|.||+|.+++.+|+..| .+++++.+.+..+.++. T Consensus 81 ~d~ieW~a~QpwsnG~VGm~G~SY~G~tq~~~A~~~pP~LkAi~p~~~~~d~y~~~~~~gG~~~~~~~~~W~~~~~~~~~ 160 (265) T pfam02129 81 KDVIDWLAGQPWCNGKVGMTGISYLGTTQLLAAATGPPGLKAIAPESAISDLYDHYYREGGAVRAPGGLGWEDLDLLAEA 160 (265) T ss_pred HHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHCCEECCCCCCCCCCCHHHHHHHHCCC T ss_conf 99999998598878926888256788889999850998503894145554113101207982213530079987532124 Q ss_pred -------------------------------------------------------CCCCCCCCCEEEEECCCCCCCCHHH Q ss_conf -------------------------------------------------------2234334773776168655028999 Q gi|254781093|r 140 -------------------------------------------------------SFLAPCPSSGLIINGSNDTVATTSD 164 (225) Q Consensus 140 -------------------------------------------------------~~l~~~~~p~LiIhG~~D~~vp~~~ 164 (225) ..+....+|+|+|.|..|..+. .. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~W~~~~~~~~~~~i~vP~l~v~GW~D~~~~-~~ 239 (265) T pfam02129 161 LTSRRADDGDALRARARYEAAGDELLAELDRAPPFLSWLLQTGDYDAFWQDRNYLEDADKVKAPVLLVGGWYDWNVK-NG 239 (265) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCHHHHHCCHHHHHCCCCCCEEEEECCCCCCCC-HH T ss_conf 45678655777788764045446688645433347899974899676667269656752699898998336884752-43 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 999999986077996079997879847 Q gi|254781093|r 165 VKDLVNKLMNQKGISITHKVIPDANHF 191 (225) Q Consensus 165 ~~~~~~~l~~~~~~~~~~~~i~ga~H~ 191 (225) +.+.+++++.+. .+..+++=| -+|. T Consensus 240 ~~~~~~~l~~~~-~~~~LiiGP-W~H~ 264 (265) T pfam02129 240 VIQLYEALRAPG-VKKKLILGP-WTHS 264 (265) T ss_pred HHHHHHHHHCCC-CCCEEEEEC-CCCC T ss_conf 999999986589-997599808-8778 No 32 >TIGR02427 protocat_pcaD 3-oxoadipate enol-lactonase; InterPro: IPR012790 These proteins are 3-oxoadipate enol-lactonases (also called beta-ketoadipate enol-lactone hydrolase), which catalyse the fourth step in the protocatechuate degradation to beta-ketoadipate and then to succinyl-CoA and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate, and vanillate can all be converted in one step to protocatechuate. This enzyme also acts in catechol degradation. In genomes that catabolize both catechol and protocatechuate, two forms of this enzyme may be found. All members of the seed alignment for this entry were chosen from within protocatechuate degradation operons of at least three genes of the pathway, and from genomes with the complete pathway through beta-ketoadipate.; GO: 0047570 3-oxoadipate enol-lactonase activity, 0019614 catechol catabolic process. Probab=99.76 E-value=2.7e-17 Score=124.14 Aligned_cols=171 Identities=23% Similarity=0.380 Sum_probs=123.6 Q ss_pred CCCCCCCEEEEECCCCCCCC--CCCCHHHHHHHHHHHHCCCEEEEEEECCCCCC--C-CCCCCCCHH-HHHHHHHHHHHH Q ss_conf 88877879999578998888--99899999999999973984999853155887--6-778521023-899999899998 Q gi|254781093|r 20 STNPNAPIALILHPHPRFGG--TMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRS--E-GEFDYGDGE-LSDAAAALDWVQ 93 (225) Q Consensus 20 ~~~~~~~~vv~~Hp~p~~GG--~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S--~-G~~~~g~~E-~~D~~aa~~~l~ 93 (225) ...+++|++|++- ..|- .|.++++. .....|.+||+|-||||.| + ++-++.... ..|+.+.+|.+ T Consensus 9 G~~~~~P~Lvf~N---SLGTDl~~Wd~v~~-----~L~~~f~~lryD~RGHG~Srad~~~~pYsi~dLa~Dv~aLlD~l- 79 (256) T TIGR02427 9 GAAAGAPVLVFIN---SLGTDLRMWDPVLP-----ALTADFRVLRYDKRGHGLSRADVPEGPYSIEDLADDVLALLDHL- 79 (256) T ss_pred CCCCCCCEEEECC---CHHCCHHHHHHHHH-----HCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH- T ss_conf 7788886788717---41044655689986-----20279179988508898755778888867877899999998787- Q ss_pred HHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC-----C------------------------------ Q ss_conf 623546740588872036878764503862-1330122757443-----3------------------------------ Q gi|254781093|r 94 SLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKS-----Y------------------------------ 137 (225) Q Consensus 94 ~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~-----~------------------------------ 137 (225) +..+..++|-|+||.++..+++++|+ |+++|+...+... | T Consensus 80 ----~i~~a~~cGLS~GGliaq~La~~~pdrV~~lvl~nTAakIG~~~~W~~Ri~~v~~~Gqal~al~Dav~~RWFt~~F 155 (256) T TIGR02427 80 ----GIEKAVFCGLSLGGLIAQGLAARRPDRVRALVLSNTAAKIGTPESWNARIAAVRAEGQALAALADAVLERWFTPGF 155 (256) T ss_pred ----HHHHHEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHHHHCCCCCCHHH T ss_conf ----2524112443378899999862006777798876224315797504899999997172346777421112368555 Q ss_pred ---------------------------------CC-CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEE Q ss_conf ---------------------------------32-22343347737761686550289999999999860779960799 Q gi|254781093|r 138 ---------------------------------DF-SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHK 183 (225) Q Consensus 138 ---------------------------------~~-~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~ 183 (225) || +.+....+|+|+|-|++|..+||+.++++++..+.++ +. T Consensus 156 ~~~~~~~~~~~~~Ml~~q~~~GYag~CaAirdaDf~~~~~~I~VPtL~iaG~~D~sTPP~~~r~iA~~vpGa~-----~~ 230 (256) T TIGR02427 156 REAHPARLDLYRNMLVRQPPDGYAGCCAAIRDADFRDRLGAIAVPTLVIAGDEDGSTPPELVREIADLVPGAR-----FA 230 (256) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHCCCCE-----EE T ss_conf 3210899999999986089868789999996068887762144441476367788898678999997369971-----68 Q ss_pred EECCCCCCC-CCCHHHHHHHHHHHHH Q ss_conf 978798475-5858999999999999 Q gi|254781093|r 184 VIPDANHFF-IGKVDELINECAHYLD 208 (225) Q Consensus 184 ~i~ga~H~f-~~~~~~l~~~i~~fl~ 208 (225) +|++++|.= .+.-+.+.+.+.+||+ T Consensus 231 ~i~~agHlp~~E~P~A~~~~l~~FL~ 256 (256) T TIGR02427 231 EIRGAGHLPCVEQPEAFNAALRDFLR 256 (256) T ss_pred EECCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 77788888850465899999999709 No 33 >PRK10162 acetyl esterase; Provisional Probab=99.72 E-value=4.4e-15 Score=110.80 Aligned_cols=200 Identities=17% Similarity=0.202 Sum_probs=136.8 Q ss_pred CEEEECCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHH-CCCEEEEEEECCCCCCCCCCCCCCH Q ss_conf 708965899618799966888778799995789988889989999999999997-3984999853155887677852102 Q gi|254781093|r 2 PEVVFNGPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQ-RGFVSLRFNFRGIGRSEGEFDYGDG 80 (225) Q Consensus 2 ~~v~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~-~G~~vl~fd~RG~G~S~G~~~~g~~ 80 (225) .+..|+++.|.+...++.|.....++++++|+-..--|+.+. -..+++.|++ .|+.|+.+|||=. -|-.| .. T Consensus 61 ~~~~i~~~~g~i~~RiY~p~~~~~~vily~HGGGfv~Gs~~t--hd~~~r~lA~~~g~~VvsVDYRLA--PEh~f---Pa 133 (319) T PRK10162 61 RAYMVPTKYGQVETRLFCPQPDSPATLFYLHGGGFILGNLDT--HDRIMRLLASYSQCTVIGIDYTLS--PEARF---PQ 133 (319) T ss_pred EEEEEECCCCCEEEEEECCCCCCCEEEEEECCCEEECCCHHH--HHHHHHHHHHHHCCEEEEECCCCC--CCCCC---CH T ss_conf 565665799981389964489998189998698100688789--899999999971998999138989--88988---57 Q ss_pred HHHHHHHHHHHHHHH----CCCCCCEEEEEECHHHHHHHHHHH--CC-----CCHHEEEECCCCCCCC------------ Q ss_conf 389999989999862----354674058887203687876450--38-----6213301227574433------------ Q gi|254781093|r 81 ELSDAAAALDWVQSL----NPESKSCWIAGYSFGAWISMQLLM--RR-----PEINGFISVAPQPKSY------------ 137 (225) Q Consensus 81 E~~D~~aa~~~l~~~----~~~~~~i~l~G~S~Gg~val~~a~--~~-----p~i~~~v~isp~~~~~------------ 137 (225) -++|+.+++.|+.+. +.+..+|+++|-|-||.+|+.++. ++ +.+.+++++.|..... T Consensus 134 aleD~~aa~~w~~~~a~~~g~d~~rI~v~GDSAGG~LA~a~al~lrd~~~~~~~~~~~~l~~~~~~~~~~~s~~~~~~~~ 213 (319) T PRK10162 134 AIEEIVAACCYFHQQAEDYQINMSRIGFAGDSAGAMLALASALWLRDKQIDCGKVAGVLLWYGLYGLRDSVTRRLLGGVW 213 (319) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHCEEEECCCHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCHHHHCCCC T ss_conf 99999999999997688709891104566348126899999998664468877634689856767899981466635666 Q ss_pred C-------------------------CC----CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCC Q ss_conf 3-------------------------22----234334773776168655028999999999986077996079997879 Q gi|254781093|r 138 D-------------------------FS----FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDA 188 (225) Q Consensus 138 ~-------------------------~~----~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga 188 (225) + ++ .+..---|++|+.++.|... ++.+.+++++..+ |+++++++++|. T Consensus 214 ~~lt~~~~~~~~~~yl~~~~~~~~P~~~~~~~d~~~glPPtlI~~a~~DpLr--Dd~~~ya~~L~~a-Gv~v~~~~y~G~ 290 (319) T PRK10162 214 DGLTQQDLQMYEEAYLSNDADRESPYYCLFNNDLTREVPPCFIAGAEFDPLL--DDSRLLYQTLAAH-QQPCEFKLYPGT 290 (319) T ss_pred CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCH--HHHHHHHHHHHHC-CCCEEEEEECCC T ss_conf 5558999999999855897443464347667600169998799853767103--4299999999986-997899997998 Q ss_pred CCCCCC------CHHHHHHHHHHHHHHHC Q ss_conf 847558------58999999999999731 Q gi|254781093|r 189 NHFFIG------KVDELINECAHYLDNSL 211 (225) Q Consensus 189 ~H~f~~------~~~~l~~~i~~fl~~~L 211 (225) -|.|.. ...+..+.+.+|++++| T Consensus 291 ~HgF~~~~~~l~~A~~Al~~~a~F~~~~l 319 (319) T PRK10162 291 LHAFLHYSRMMKTADEALRDGAQFFTAQL 319 (319) T ss_pred CEEHHCCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 27232264638799999999999999539 No 34 >KOG2281 consensus Probab=99.71 E-value=9.6e-16 Score=114.81 Aligned_cols=205 Identities=20% Similarity=0.318 Sum_probs=146.4 Q ss_pred EEE-ECCCCC-EEEEEEECCC----CCCCCEEEEECCCCCCCCCCCCH---HHHHHHHHHHHCCCEEEEEEECCCCCCCC Q ss_conf 089-658996-1879996688----87787999957899888899899---99999999997398499985315588767 Q gi|254781093|r 3 EVV-FNGPSG-RLEGRYQPST----NPNAPIALILHPHPRFGGTMNDN---IVYQLFYLFQQRGFVSLRFNFRGIGRSEG 73 (225) Q Consensus 3 ~v~-i~g~~G-~l~~~~~~~~----~~~~~~vv~~Hp~p~~GG~~~~~---~~~~la~~l~~~G~~vl~fd~RG~G~S~G 73 (225) |++ |+.+.| +|=|.+..|. +.+-|+++...+-|+-. .-+|. +.+.-...|+.+||.|+.+|-||+-+-.= T Consensus 614 eif~fqs~tg~~lYgmiyKPhn~~pgkkYptvl~VYGGP~VQ-lVnnsfkgi~ylR~~~LaslGy~Vv~IDnRGS~hRGl 692 (867) T KOG2281 614 EIFSFQSKTGLTLYGMIYKPHNFQPGKKYPTVLNVYGGPGVQ-LVNNSFKGIQYLRFCRLASLGYVVVFIDNRGSAHRGL 692 (867) T ss_pred HHEEEECCCCCEEEEEEECCCCCCCCCCCCEEEEECCCCEEE-EEECCCCCEEEHHHHHHHHCCEEEEEECCCCCCCCCC T ss_conf 213111388847999996454477788776499961797058-8613444312010135753755999985887401440 Q ss_pred CCC------CCCHHHHHHHHHHHHHHHHC--CCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCC------ Q ss_conf 785------21023899999899998623--5467405888720368787645038621330122757443332------ Q gi|254781093|r 74 EFD------YGDGELSDAAAALDWVQSLN--PESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYDF------ 139 (225) Q Consensus 74 ~~~------~g~~E~~D~~aa~~~l~~~~--~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~------ 139 (225) +|. .|.-|++|-...+.|+.++. .|.++|++-||||||+++++..+++|+|-.+.-.+.|+..|++ T Consensus 693 kFE~~ik~kmGqVE~eDQVeglq~Laeq~gfidmdrV~vhGWSYGGYLSlm~L~~~P~IfrvAIAGapVT~W~~YDTgYT 772 (867) T KOG2281 693 KFESHIKKKMGQVEVEDQVEGLQMLAEQTGFIDMDRVGVHGWSYGGYLSLMGLAQYPNIFRVAIAGAPVTDWRLYDTGYT 772 (867) T ss_pred HHHHHHHHCCCEEEEHHHHHHHHHHHHHCCCCCHHHEEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCEEEEECCCCCH T ss_conf 36888861277177567778999999850865612457843353028888887617540367741785113342024213 Q ss_pred -----------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC- Q ss_conf -----------------------22343347737761686550289999999999860779960799978798475585- Q gi|254781093|r 140 -----------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGK- 195 (225) Q Consensus 140 -----------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~- 195 (225) ..+.+-+.-.|+|||--|+.|...+...++.++-++ |.+.+++++|+-.|...+. T Consensus 773 ERYMg~P~~nE~gY~agSV~~~VeklpdepnRLlLvHGliDENVHF~Hts~Lvs~lvka-gKpyeL~IfP~ERHsiR~~e 851 (867) T KOG2281 773 ERYMGYPDNNEHGYGAGSVAGHVEKLPDEPNRLLLVHGLIDENVHFAHTSRLVSALVKA-GKPYELQIFPNERHSIRNPE 851 (867) T ss_pred HHHCCCCCCCHHCCCCHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHC-CCCEEEEECCCCCCCCCCCC T ss_conf 65437986301113301288887508887760799850334530346689999999867-99348997763014567775 Q ss_pred -HHHHHHHHHHHHHH Q ss_conf -89999999999997 Q gi|254781093|r 196 -VDELINECAHYLDN 209 (225) Q Consensus 196 -~~~l~~~i~~fl~~ 209 (225) .+.+...+..||.+ T Consensus 852 s~~~yE~rll~FlQ~ 866 (867) T KOG2281 852 SGIYYEARLLHFLQE 866 (867) T ss_pred CCHHHHHHHHHHHHH T ss_conf 412499999999853 No 35 >KOG1838 consensus Probab=99.69 E-value=7.4e-15 Score=109.45 Aligned_cols=201 Identities=19% Similarity=0.309 Sum_probs=132.6 Q ss_pred EEECCCCCE-EEE-EEECCCC------CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC Q ss_conf 896589961-879-9966888------77879999578998888998999999999999739849998531558876778 Q gi|254781093|r 4 VVFNGPSGR-LEG-RYQPSTN------PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF 75 (225) Q Consensus 4 v~i~g~~G~-l~~-~~~~~~~------~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~ 75 (225) +.|..+||- +.. |+.++.. ...|++|+|||.. ||++ ..-+..++..+++.||-|+.||-||+|+|+=+. T Consensus 96 eii~~~DGG~~~lDW~~~~~~~~~~~~~~~P~vvilpGlt--g~S~-~~YVr~lv~~a~~~G~r~VVfN~RG~~g~~LtT 172 (409) T KOG1838 96 EIIKTSDGGTVTLDWVENPDSRCRTDDGTDPIVVILPGLT--GGSH-ESYVRHLVHEAQRKGYRVVVFNHRGLGGSKLTT 172 (409) T ss_pred EEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEECCCC--CCCH-HHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCC T ss_conf 8999279988988622475323577789975799946888--9871-379999999998679679998788778874678 Q ss_pred CC--CCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC---CHHEEEECCCCCCCC------------- Q ss_conf 52--102389999989999862354674058887203687876450386---213301227574433------------- Q gi|254781093|r 76 DY--GDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP---EINGFISVAPQPKSY------------- 137 (225) Q Consensus 76 ~~--g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p---~i~~~v~isp~~~~~------------- 137 (225) +. -.|-.+|+.++++++++++|+. +++.+|+||||.+.+.+.++.. .+.+.++++-|-+.. T Consensus 173 pr~f~ag~t~Dl~~~v~~i~~~~P~a-~l~avG~S~Gg~iL~nYLGE~g~~~~l~~a~~v~~Pwd~~~~~~~~~~~~~~~ 251 (409) T KOG1838 173 PRLFTAGWTEDLREVVNHIKKRYPQA-PLFAVGFSMGGNILTNYLGEEGDNTPLIAAVAVCNPWDLLAASRSIETPLYRR 251 (409) T ss_pred CCEEECCCHHHHHHHHHHHHHHCCCC-CEEEEEECCHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHHCCCCHH T ss_conf 76650478789999999999748998-44899854409999998664167788405789725404566666775152067 Q ss_pred --------------------------C------------C------------------------CCCCCCCCCEEEEECC Q ss_conf --------------------------3------------2------------------------2234334773776168 Q gi|254781093|r 138 --------------------------D------------F------------------------SFLAPCPSSGLIINGS 155 (225) Q Consensus 138 --------------------------~------------~------------------------~~l~~~~~p~LiIhG~ 155 (225) + | ..+....+|.|+|+.. T Consensus 252 ~y~~~l~~~l~~~~~~~r~~~~~~~vd~d~~~~~~SvreFD~~~t~~~~gf~~~deYY~~aSs~~~v~~I~VP~L~ina~ 331 (409) T KOG1838 252 FYNRALTLNLKRIVLRHRHTLFEDPVDFDVILKSRSVREFDEALTRPMFGFKSVDEYYKKASSSNYVDKIKVPLLCINAA 331 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHCCHHHHCCCCCCCEEEEECC T ss_conf 88999987678887631366650511333103018398887655324268886899986055354345542457998568 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCC--CCCCC---HHH-HHHHHHHHHHHHCC Q ss_conf 65502899999999998607799607999787984--75585---899-99999999997312 Q gi|254781093|r 156 NDTVATTSDVKDLVNKLMNQKGISITHKVIPDANH--FFIGK---VDE-LINECAHYLDNSLD 212 (225) Q Consensus 156 ~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H--~f~~~---~~~-l~~~i~~fl~~~L~ 212 (225) +|+++|.+.. .+ +.+.+ +..+-+.+-.-++| ||.+. ..- +.+.+.+|+...+. T Consensus 332 DDPv~p~~~i-p~-~~~~~--np~v~l~~T~~GGHlgfleg~~p~~~~w~~~~l~ef~~~~~~ 390 (409) T KOG1838 332 DDPVVPEEAI-PI-DDIKS--NPNVLLVITSHGGHLGFLEGLWPSARTWMDKLLVEFLGNAIF 390 (409) T ss_pred CCCCCCCCCC-CH-HHHHC--CCCEEEEEECCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 8997880328-98-99844--995899995798646651267876542599999999998874 No 36 >COG3571 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only] Probab=99.69 E-value=4.4e-15 Score=110.80 Aligned_cols=186 Identities=20% Similarity=0.293 Sum_probs=130.5 Q ss_pred EEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC-CCCCCC--CCCCCCHHHHH-HHHHHH Q ss_conf 99966888778799995789988889989999999999997398499985315-588767--78521023899-999899 Q gi|254781093|r 15 GRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG-IGRSEG--EFDYGDGELSD-AAAALD 90 (225) Q Consensus 15 ~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG-~G~S~G--~~~~g~~E~~D-~~aa~~ 90 (225) ..++.|.++...++|+.| +.|++|++..+...|..|+.+||.|.||+|-= .-+-+| ....+.+..++ -..++. T Consensus 4 ~~~~~pag~~~~tilLaH---GAGasmdSt~m~~~a~~la~~G~~vaRfefpYma~Rrtg~rkPp~~~~t~~~~~~~~~a 80 (213) T COG3571 4 GFLFDPAGPAPVTILLAH---GAGASMDSTSMTAVAAALARRGWLVARFEFPYMAARRTGRRKPPPGSGTLNPEYIVAIA 80 (213) T ss_pred CCCCCCCCCCCEEEEEEC---CCCCCCCCHHHHHHHHHHHHCCEEEEEEECCHHHHCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 523378999877899843---78988777799999999973741677762505430545687996944557889999999 Q ss_pred HHHHHCCCCCCEEEEEECHHHHHHHHHHHCC-CCHHEEEECCCC------CCCCCCCCCCCCCCCEEEEECCCCCCCCHH Q ss_conf 9986235467405888720368787645038-621330122757------443332223433477377616865502899 Q gi|254781093|r 91 WVQSLNPESKSCWIAGYSFGAWISMQLLMRR-PEINGFISVAPQ------PKSYDFSFLAPCPSSGLIINGSNDTVATTS 163 (225) Q Consensus 91 ~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~-p~i~~~v~isp~------~~~~~~~~l~~~~~p~LiIhG~~D~~vp~~ 163 (225) -++... +..++++-|+||||.++.+++... .+|.++++++-| +.......|.....|+||.||++|++-..+ T Consensus 81 ql~~~l-~~gpLi~GGkSmGGR~aSmvade~~A~i~~L~clgYPfhppGKPe~~Rt~HL~gl~tPtli~qGtrD~fGtr~ 159 (213) T COG3571 81 QLRAGL-AEGPLIIGGKSMGGRVASMVADELQAPIDGLVCLGYPFHPPGKPEQLRTEHLTGLKTPTLITQGTRDEFGTRD 159 (213) T ss_pred HHHHCC-CCCCEEECCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCHHHHCCCCCCCEEEEECCCCCCCCHH T ss_conf 998224-6786565253556337889887636974158884576689998441103213677777478622545445899 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-----------CHHHHHHHHHHHHHHH Q ss_conf 9999999986077996079997879847558-----------5899999999999973 Q gi|254781093|r 164 DVKDLVNKLMNQKGISITHKVIPDANHFFIG-----------KVDELINECAHYLDNS 210 (225) Q Consensus 164 ~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-----------~~~~l~~~i~~fl~~~ 210 (225) ++-.+ .+ ..+++++-+.++||-..- +...+.+.|..|+++. T Consensus 160 ~Va~y--~l----s~~iev~wl~~adHDLkp~k~vsgls~~~hL~~~A~~va~~~~~l 211 (213) T COG3571 160 EVAGY--AL----SDPIEVVWLEDADHDLKPRKLVSGLSTADHLKTLAEQVAGWARRL 211 (213) T ss_pred HHHHH--HC----CCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC T ss_conf 98753--13----886589984057666563000033528999999999999998641 No 37 >pfam00561 Abhydrolase_1 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes. Probab=99.68 E-value=8e-16 Score=115.27 Aligned_cols=138 Identities=25% Similarity=0.294 Sum_probs=95.1 Q ss_pred CEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHH-HHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCC Q ss_conf 8499985315588767785210238999998999-98623546740588872036878764503862-133012275744 Q gi|254781093|r 58 FVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDW-VQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPK 135 (225) Q Consensus 58 ~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~-l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~ 135 (225) |.|+.+|.||+|.|+...... ...+|....+.. +++.+ .+++.++||||||.+++.++.++|+ ++++|+++++.. T Consensus 1 ~~vi~~D~~G~G~S~~~~~~~-~~~~~~~~di~~l~~~l~--i~~~~liGhS~Gg~va~~~a~~~p~~v~~lil~~~~~~ 77 (225) T pfam00561 1 FDVIAFDLRGFGRSSPPDLAD-YRFDDLAEDLEALLQALG--LDKVNLVGHSMGGLIALAYAAKYPDRVKALVLVGTVHP 77 (225) T ss_pred CEEEEECCCCCCCCCCCCCCC-CCHHHHHHHHHHHHHHCC--CCCEEEEEECCCCHHHHHHHHHCHHHHCEEEEECCCCC T ss_conf 979997589999996989899-899999999999999769--99779999728839999999969386028889898862 Q ss_pred CCCC---------------------------------------------------------------------------- Q ss_conf 3332---------------------------------------------------------------------------- Q gi|254781093|r 136 SYDF---------------------------------------------------------------------------- 139 (225) Q Consensus 136 ~~~~---------------------------------------------------------------------------- 139 (225) .... T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (225) T pfam00561 78 AGLSSPLTPRGNLLGLLLDNFFNRLYDSVEALLGRAIKQFQALGRPFVSDFLKQFELSSLIRFSETLALDGLLGVALGYY 157 (225) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 22321345567766766777888743226789988666665304632125889888887741113355667888876544 Q ss_pred ------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHH Q ss_conf ------2234334773776168655028999999999986077996079997879847558-589999999 Q gi|254781093|r 140 ------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINEC 203 (225) Q Consensus 140 ------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i 203 (225) ..+.....|+++|+|++|.++|++..+.+.+.++. .+++++||++|+..- +.+++.+.+ T Consensus 158 ~~~~~~~~~~~~~~Pvl~i~G~~D~~~~~~~~~~~~~~~~~-----~~~~~i~~~gH~~~~e~p~~~~~~i 223 (225) T pfam00561 158 LVWDRSAALKDIDVPTLIIWGDDDPLVPPDASEKLAALFPN-----AQLVVIDDAGHLAQLEKPDEVAELI 223 (225) T ss_pred HCCCHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHCCC-----CEEEEECCCCCCHHHHCHHHHHHHH T ss_conf 11345989854899879996079997699999999987899-----8899989999703872999999997 No 38 >COG0400 Predicted esterase [General function prediction only] Probab=99.68 E-value=7.4e-15 Score=109.48 Aligned_cols=182 Identities=23% Similarity=0.276 Sum_probs=121.1 Q ss_pred EECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC-----------CCCCCCCCCCC--HHHH Q ss_conf 9668887787999957899888899899999999999973984999853155-----------88767785210--2389 Q gi|254781093|r 17 YQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGI-----------GRSEGEFDYGD--GELS 83 (225) Q Consensus 17 ~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~-----------G~S~G~~~~g~--~E~~ 83 (225) +..++.+..+++|+|| +.||+..+ +..+.+.+.- .+..+ ..||- ...+|+|+... -+.+ T Consensus 10 i~~~~~p~~~~iilLH---G~Ggde~~--~~~~~~~~~P-~~~~i--s~rG~v~~~g~~~~f~~~~~~~~d~edl~~~~~ 81 (207) T COG0400 10 IEKPGDPAAPLLILLH---GLGGDELD--LVPLPELILP-NATLV--SPRGPVAENGGPRFFRRYDEGSFDQEDLDLETE 81 (207) T ss_pred CCCCCCCCCCEEEEEE---CCCCCHHH--HHHHHHHCCC-CCEEE--CCCCCCCCCCCCCCEEECCCCCCCHHHHHHHHH T ss_conf 3488999986899984---58997555--4311655199-97487--578874556702360433777633566888799 Q ss_pred HHHHHHHHHHH-HCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCC Q ss_conf 99998999986-23546740588872036878764503862-13301227574433322234334773776168655028 Q gi|254781093|r 84 DAAAALDWVQS-LNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYDFSFLAPCPSSGLIINGSNDTVAT 161 (225) Q Consensus 84 D~~aa~~~l~~-~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~~~~l~~~~~p~LiIhG~~D~~vp 161 (225) ...+.++.+++ .+.+.+++++.|||.|+.+++.+..+.+. .++.++.+|...............|+|++||++|++|| T Consensus 82 ~~~~~l~~~~~~~gi~~~~ii~~GfSqGA~ial~~~l~~~~~~~~ail~~g~~~~~~~~~~~~~~~pill~hG~~Dpvvp 161 (207) T COG0400 82 KLAEFLEELAEEYGIDSSRIILIGFSQGANIALSLGLTLPGLFAGAILFSGMLPLEPELLPDLAGTPILLSHGTEDPVVP 161 (207) T ss_pred HHHHHHHHHHHHHCCCHHHEEEEECCHHHHHHHHHHHHCCHHHCCCHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCC T ss_conf 99999999999839981226898517079999999985850005202207867888766532479728871467688468 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 99999999998607799607999787984755858999999999999731 Q gi|254781093|r 162 TSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSL 211 (225) Q Consensus 162 ~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L 211 (225) ...+.++.+.+.. .+.+++++.++ .+|-.. ++-.+.+.+|+.+.+ T Consensus 162 ~~~~~~l~~~l~~-~g~~v~~~~~~-~GH~i~---~e~~~~~~~wl~~~~ 206 (207) T COG0400 162 LALAEALAEYLTA-SGADVEVRWHE-GGHEIP---PEELEAARSWLANTL 206 (207) T ss_pred HHHHHHHHHHHHH-CCCCEEEEEEC-CCCCCC---HHHHHHHHHHHHHCC T ss_conf 8899999999998-19977999836-888689---999999999998606 No 39 >pfam07859 Abhydrolase_3 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes. Probab=99.66 E-value=8e-15 Score=109.28 Aligned_cols=155 Identities=20% Similarity=0.264 Sum_probs=109.7 Q ss_pred EEEECCCCCCCCCCCCH-HHHHHHHHHHHC-CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHH----CCCCCC Q ss_conf 99957899888899899-999999999973-984999853155887677852102389999989999862----354674 Q gi|254781093|r 28 ALILHPHPRFGGTMNDN-IVYQLFYLFQQR-GFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSL----NPESKS 101 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~-~~~~la~~l~~~-G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~----~~~~~~ 101 (225) +|++|+-.... .++ ....+++.|+++ ||.|+.+|||=. .+..|. ..++|+.++++|++++ +.++++ T Consensus 1 ii~iHGGg~~~---g~~~~~~~~~~~la~~~g~~v~~~dYrla--Pe~~~p---~~~~D~~~a~~~l~~~~~~~~~d~~~ 72 (209) T pfam07859 1 LVYFHGGGFVL---GSADTHDRLCRRLAARAGAVVVSVDYRLA--PEHPFP---AAIEDAYAALRWLAEHAAELGADPSR 72 (209) T ss_pred CEEECCCHHHC---CCHHHHHHHHHHHHHHCCCEEEEEECCCC--CCCCCC---HHHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 99993812221---98688999999999873988999879998--889882---79999999999999989980999542 Q ss_pred EEEEEECHHHHHHHHHHHC-----CCCHHEEEECCCCCCCCCC-----------------------------------CC Q ss_conf 0588872036878764503-----8621330122757443332-----------------------------------22 Q gi|254781093|r 102 CWIAGYSFGAWISMQLLMR-----RPEINGFISVAPQPKSYDF-----------------------------------SF 141 (225) Q Consensus 102 i~l~G~S~Gg~val~~a~~-----~p~i~~~v~isp~~~~~~~-----------------------------------~~ 141 (225) |+++|.|-||.+|+.++.. .+.++++++++|..+.... .. T Consensus 73 I~l~G~SAGG~La~~~~~~~~~~~~~~p~~~~~~~p~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (209) T pfam07859 73 IAVAGDSAGGNLAAALALRARDRGLPLPAGQVLIYPGLDLRTESESYEAREYADGPLLTRDDMDWFWRLYLPGADRDHPL 152 (209) T ss_pred EEEEEECCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC T ss_conf 69999783077999999997654898856699977822788886112222036776669999999999727877745733 Q ss_pred CCCC-------CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC Q ss_conf 3433-------477377616865502899999999998607799607999787984755 Q gi|254781093|r 142 LAPC-------PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFI 193 (225) Q Consensus 142 l~~~-------~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~ 193 (225) ..|. -.|++++||+.|.++ ++++.+++++++ .|+++++++++|.+|.|. T Consensus 153 ~sP~~~~~~~~~PP~~i~~G~~D~l~--~~s~~f~~~L~~-~Gv~~~l~~~~g~~H~F~ 208 (209) T pfam07859 153 ASPLFAADLSGLPPALVVVAEFDPLR--DEGEAYAERLRA-AGVPVELVEYPGMIHGFH 208 (209) T ss_pred CCCCCCCCCCCCCCEEEEECCCCCCH--HHHHHHHHHHHH-CCCCEEEEEECCCEEEEE T ss_conf 38443753016998799966868877--999999999998-799789999899347536 No 40 >KOG3043 consensus Probab=99.66 E-value=9.8e-15 Score=108.73 Aligned_cols=193 Identities=20% Similarity=0.260 Sum_probs=142.2 Q ss_pred EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE-CCC-CCCCCCCC------CCC---H Q ss_conf 187999668887787999957899888899899999999999973984999853-155-88767785------210---2 Q gi|254781093|r 12 RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNF-RGI-GRSEGEFD------YGD---G 80 (225) Q Consensus 12 ~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~-RG~-G~S~G~~~------~g~---~ 80 (225) -|++.+.-...++..+++|.- -||-... .....|..++..||.|+.+|| ||- ..++++.+ .+. . T Consensus 27 gldaYv~gs~~~~~~li~i~D---vfG~~~~--n~r~~Adk~A~~Gy~v~vPD~~~Gdp~~~~~~~~~~~~w~~~~~~~~ 101 (242) T KOG3043 27 GLDAYVVGSTSSKKVLIVIQD---VFGFQFP--NTREGADKVALNGYTVLVPDFFRGDPWSPSLQKSERPEWMKGHSPPK 101 (242) T ss_pred CEEEEEECCCCCCEEEEEEEE---EECCCCH--HHHHHHHHHHCCCCEEECCHHHCCCCCCCCCCHHHHHHHHHCCCCCC T ss_conf 726899668888758999986---3036647--78999998722884797252205898998887135389884479542 Q ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCC Q ss_conf 38999998999986235467405888720368787645038621330122757443332223433477377616865502 Q gi|254781093|r 81 ELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYDFSFLAPCPSSGLIINGSNDTVA 160 (225) Q Consensus 81 E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~~~l~~~~~p~LiIhG~~D~~v 160 (225) -..|+.++++|++.++ +..+|.++||+|||.++..+.+..+++.+.++..|-.. +..+......|+|++.++.|+++ T Consensus 102 ~~~~i~~v~k~lk~~g-~~kkIGv~GfCwGak~vv~~~~~~~~f~a~v~~hps~~--d~~D~~~vk~Pilfl~ae~D~~~ 178 (242) T KOG3043 102 IWKDITAVVKWLKNHG-DSKKIGVVGFCWGAKVVVTLSAKDPEFDAGVSFHPSFV--DSADIANVKAPILFLFAELDEDV 178 (242) T ss_pred CHHHHHHHHHHHHHCC-CCCEEEEEEEEECCEEEEEEECCCHHHEEEEEECCCCC--CHHHHHCCCCCEEEEEECCCCCC T ss_conf 1067899999999729-86156578885255078885043431203457447758--86677417898798730356667 Q ss_pred CHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-C-----------HHHHHHHHHHHHHHHCC Q ss_conf 8999999999986077996079997879847558-5-----------89999999999997312 Q gi|254781093|r 161 TTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-K-----------VDELINECAHYLDNSLD 212 (225) Q Consensus 161 p~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~-----------~~~l~~~i~~fl~~~L~ 212 (225) |++....|-+++.......-++++++|-.|.|.. + .++..+....|++.+|. T Consensus 179 p~~~v~~~ee~lk~~~~~~~~v~~f~g~~HGf~~~r~~~~~Ped~~~~eea~~~~~~Wf~~y~~ 242 (242) T KOG3043 179 PPKDVKAWEEKLKENPAVGSQVKTFSGVGHGFVARRANISSPEDKKAAEEAYQRFISWFKHYLA 242 (242) T ss_pred CHHHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 8778899999874076643168970786504331115779856778899999999999999639 No 41 >KOG1454 consensus Probab=99.64 E-value=6e-14 Score=103.98 Aligned_cols=196 Identities=20% Similarity=0.296 Sum_probs=124.8 Q ss_pred CEEEECCCCC--EEEEEEECCC-------CCCCCEEEEECCCCCCCCCCCC--HHHHHHHHHHHHCCCEEEEEEECCCCC Q ss_conf 7089658996--1879996688-------8778799995789988889989--999999999997398499985315588 Q gi|254781093|r 2 PEVVFNGPSG--RLEGRYQPST-------NPNAPIALILHPHPRFGGTMND--NIVYQLFYLFQQRGFVSLRFNFRGIGR 70 (225) Q Consensus 2 ~~v~i~g~~G--~l~~~~~~~~-------~~~~~~vv~~Hp~p~~GG~~~~--~~~~~la~~l~~~G~~vl~fd~RG~G~ 70 (225) .++.++-+.| ....-|.+.. ...++.+|++| +||++... +.+-.+.+.. |+.++++|+-|.|. T Consensus 26 ~~~~i~~~~g~~~~~~~w~~~~~~~~~~~~~~~~pvlllH---GF~~~~~~w~~~~~~L~~~~---~~~v~aiDl~G~g~ 99 (326) T KOG1454 26 RSTSIEIPWGPLTIRSKWIPNLDKYGSPGDKDKPPVLLLH---GFGASSFSWRRVVPLLSKAK---GLRVLAIDLPGHGY 99 (326) T ss_pred CCEEEECCCCCCCEEEEEECCCCCCCCCCCCCCCCEEEEC---CCCCCCHHHHHHCCCCHHHC---CEEEEEEECCCCCC T ss_conf 5358875788752378960751235685668899789962---65577044644315331334---81899997167887 Q ss_pred CCCCCCCCC-HHHHHHHHHH-HHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEE---ECCCCCCCCCC----- Q ss_conf 767785210-2389999989-9998623546740588872036878764503862-13301---22757443332----- Q gi|254781093|r 71 SEGEFDYGD-GELSDAAAAL-DWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFI---SVAPQPKSYDF----- 139 (225) Q Consensus 71 S~G~~~~g~-~E~~D~~aa~-~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v---~isp~~~~~~~----- 139 (225) |. .-+.+. --..+....+ +...+.. ..++.++|||+||.+|+.+|+..|+ ++.++ .+.|+....+. T Consensus 100 ~s-~~~~~~~y~~~~~v~~i~~~~~~~~--~~~~~lvghS~Gg~va~~~Aa~~P~~V~~lv~~~~~~~~~~~~~~~~~~~ 176 (326) T KOG1454 100 SS-PLPRGPLYTLRELVELIRRFVKEVF--VEPVSLVGHSLGGIVALKAAAYYPETVDSLVLLDLLGPPVYSTPKGIKGL 176 (326) T ss_pred CC-CCCCCCCCCHHHHHHHHHHHHHHHC--CCCEEEEEECHHHHHHHHHHHHCHHHHHHHHHHHEECCCCCCCCCHHHHH T ss_conf 78-8888987578999999999986322--77238985670889999999867465055642210056655666305677 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q gi|254781093|r 140 -------------------------------------------------------------------------------- 139 (225) Q Consensus 140 -------------------------------------------------------------------------------- 139 (225) T Consensus 177 ~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 256 (326) T KOG1454 177 RRLLDKFLSALELLIPLSLTEPVRLVSEGLLRCLKVVYTDPSRLLEKLLHLLSRPVKEHFHRDARLSLFLELLGFDENLL 256 (326) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHEECCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHH T ss_conf 77776442155524763110167897987642200014676534544146514654442168889999870255303478 Q ss_pred CCCCCCC-CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHHHHC Q ss_conf 2234334-773776168655028999999999986077996079997879847558-58999999999999731 Q gi|254781093|r 140 SFLAPCP-SSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYLDNSL 211 (225) Q Consensus 140 ~~l~~~~-~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl~~~L 211 (225) +.+.... .|+|+|+|++|+++|.+.++.+.+.+ ...++++|+++||.=+. +.+++.+.+..|+.+.. T Consensus 257 ~~~~~i~~~pvlii~G~~D~~~p~~~~~~~~~~~-----pn~~~~~I~~~gH~~h~e~Pe~~~~~i~~Fi~~~~ 325 (326) T KOG1454 257 SLIKKIWKCPVLIIWGDKDQIVPLELAEELKKKL-----PNAELVEIPGAGHLPHLERPEEVAALLRSFIARLR 325 (326) T ss_pred HHCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHC-----CCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 7525336875599984778724889999998538-----89669995899967545888999999999998607 No 42 >KOG4178 consensus Probab=99.64 E-value=3.9e-14 Score=105.10 Aligned_cols=185 Identities=23% Similarity=0.294 Sum_probs=130.8 Q ss_pred EEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC---CCCHH-HHHHHHH Q ss_conf 8799966888778799995789988889989999999999997398499985315588767785---21023-8999998 Q gi|254781093|r 13 LEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD---YGDGE-LSDAAAA 88 (225) Q Consensus 13 l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~---~g~~E-~~D~~aa 88 (225) +...|...+....|+++++|++|..--+.+ ..-..|+.+||.|+.+|+||+|.|+-... +..-+ ..|+.+. T Consensus 32 I~~h~~e~g~~~gP~illlHGfPe~wyswr-----~q~~~la~~~~rviA~DlrGyG~Sd~P~~~~~Yt~~~l~~di~~l 106 (322) T KOG4178 32 IRLHYVEGGPGDGPIVLLLHGFPESWYSWR-----HQIPGLASRGYRVIAPDLRGYGFSDAPPHISEYTIDELVGDIVAL 106 (322) T ss_pred EEEEEEEECCCCCCEEEEECCCCCCCHHHH-----HHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCEEHHHHHHHHHHH T ss_conf 899988404789987999806985004366-----540466406448996378888888899972301299999899999 Q ss_pred HHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCC--------C----------------------- Q ss_conf 99998623546740588872036878764503862-133012275744--------3----------------------- Q gi|254781093|r 89 LDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPK--------S----------------------- 136 (225) Q Consensus 89 ~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~--------~----------------------- 136 (225) ++.+- .+++.++||++|+.+|..++..+|+ +.++++++-+.. . T Consensus 107 ld~Lg-----~~k~~lvgHDwGaivaw~la~~~Perv~~lv~~nv~~~~p~~~~~~~~~~~f~~~~y~~~fQ~~~~~E~~ 181 (322) T KOG4178 107 LDHLG-----LKKAFLVGHDWGAIVAWRLALFYPERVDGLVTLNVPFPNPKLKPLDSSKAIFGKSYYICLFQEPGKPETE 181 (322) T ss_pred HHHHC-----CCEEEEEECCCHHHHHHHHHHHCHHHCCEEEEECCCCCCCCCCHHHHHCCCCCCCCEEEECCCCCCCCHH T ss_conf 98736-----1336889525324999999986842334479804688785444112102436754025631256773012 Q ss_pred ---------------------------------CC----------------------------------CCCCCCCCCCE Q ss_conf ---------------------------------33----------------------------------22234334773 Q gi|254781093|r 137 ---------------------------------YD----------------------------------FSFLAPCPSSG 149 (225) Q Consensus 137 ---------------------------------~~----------------------------------~~~l~~~~~p~ 149 (225) |. ...++....|+ T Consensus 182 ~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~t~edi~~~~~~f~~~g~~gplNyyrn~~r~w~a~~~~~~~i~iPv 261 (322) T KOG4178 182 LSKDDTEMLVKTFRTRKTPGPLIVPKQPNENPLWLTEEDIAFYVSKFQIDGFTGPLNYYRNFRRNWEAAPWALAKITIPV 261 (322) T ss_pred HCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCHHCCCCCCCCCCCE T ss_conf 11002577677642035677444577778761000177899886305443303554066777648132046452010426 Q ss_pred EEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHHHH Q ss_conf 776168655028999999999986077996079997879847558-5899999999999973 Q gi|254781093|r 150 LIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYLDNS 210 (225) Q Consensus 150 LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl~~~ 210 (225) ++|.|+.|.+.+.....+.+++.-. ...+.++++|++|+-+. +-+++.+.+..|+++. T Consensus 262 ~fi~G~~D~v~~~p~~~~~~rk~vp---~l~~~vv~~~~gH~vqqe~p~~v~~~i~~f~~~~ 320 (322) T KOG4178 262 LFIWGDLDPVLPYPIFGELYRKDVP---RLTERVVIEGIGHFVQQEKPQEVNQAILGFINSF 320 (322) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHC---CCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 9998547546460667789998601---3410589658765402138799999999999862 No 43 >TIGR01250 pro_imino_pep_2 proline-specific peptidases; InterPro: IPR005945 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members. One of the members of this family is the tricorn protease (TRI) interacting factor 1 from Thermoplasma acidophilum. Factor 1 (F1) is a 33.5 kDa serine peptidase of the alpha/beta-hydrolase family. Tricorn generates small peptides, which are cleaved by F1 to yield single amino acids , . ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm. Probab=99.61 E-value=1e-13 Score=102.64 Aligned_cols=190 Identities=18% Similarity=0.302 Sum_probs=126.9 Q ss_pred ECCCCCEEEEEEECCCCCC-CCEEEEECCCCCCCCCCCCHHHHHHHHHHHH-CCCEEEEEEECCCCCCCCCCCCCCHHH- Q ss_conf 6589961879996688877-8799995789988889989999999999997-398499985315588767785210238- Q gi|254781093|r 6 FNGPSGRLEGRYQPSTNPN-APIALILHPHPRFGGTMNDNIVYQLFYLFQQ-RGFVSLRFNFRGIGRSEGEFDYGDGEL- 82 (225) Q Consensus 6 i~g~~G~l~~~~~~~~~~~-~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~-~G~~vl~fd~RG~G~S~G~~~~g~~E~- 82 (225) |+-..|+ .+|..-.+.+ +.-+|++||-| |+|. .-+..+.+.+.+ .|+.|+.+|-=|+|.|+- .|+...|. T Consensus 7 ~~~~Gg~--~~~~~~~~~g~~~~Ll~lHGGP--G~~H--~Yl~~l~~~~~~~~G~~V~~YDQLGcg~Sd~-Pd~s~PE~~ 79 (302) T TIGR01250 7 ITVDGGY--VLFTKTGGEGEKIKLLVLHGGP--GSSH--EYLENLKELLKEDLGREVIMYDQLGCGYSDQ-PDDSDPEIA 79 (302) T ss_pred EEECCCE--EEEEEECCCCCCCEEEEEECCC--CCCH--HHHHHHHHHHHHCCCCEEEEEECCCCCCCCC-CCCCCCHHH T ss_conf 7206868--9998731789892189970889--8874--2379999997640782799983245777785-888773123 Q ss_pred HHH---HHHHHH---HHHH-CCCCCCEEEEEECHHHHHHHHHHHCC-C-CHHEEEECCCCCCC----------------- Q ss_conf 999---998999---9862-35467405888720368787645038-6-21330122757443----------------- Q gi|254781093|r 83 SDA---AAALDW---VQSL-NPESKSCWIAGYSFGAWISMQLLMRR-P-EINGFISVAPQPKS----------------- 136 (225) Q Consensus 83 ~D~---~aa~~~---l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~-p-~i~~~v~isp~~~~----------------- 136 (225) .++ .-.++- +|+. +.+...++|+|+|+||..|+.+|.+. + .++++|.-|+..+. T Consensus 80 ~kl~t~d~fV~E~e~vR~~L~l~~~nfyllG~SWGG~LA~~Yal~Yh~~~lkglI~ss~~~s~pey~~~~~r~~~~~Lp~ 159 (302) T TIGR01250 80 EKLWTIDYFVDELEEVREKLGLDKKNFYLLGHSWGGLLAQEYALKYHGQHLKGLIISSMLDSAPEYVKELNRLRDKKLPP 159 (302) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCH T ss_conf 45420002389999999985578652789702678999999999737898269998556567247899999877632899 Q ss_pred ---------------------------------------------------------------------------CCC-C Q ss_conf ---------------------------------------------------------------------------332-2 Q gi|254781093|r 137 ---------------------------------------------------------------------------YDF-S 140 (225) Q Consensus 137 ---------------------------------------------------------------------------~~~-~ 140 (225) ||+ + T Consensus 160 ~~~~~i~~~E~~g~~~~~~Y~~~v~~~~~~~~~~~~~~~P~~l~~~~~~~~n~~vY~~m~GpnEF~~~Gda~L~dWD~~D 239 (302) T TIGR01250 160 EVREAIKRCEASGDYDDPEYQEAVEVFYYQHLICRLRKWPEALKRLKSGMRNTEVYNIMQGPNEFTITGDANLKDWDITD 239 (302) T ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCEEECCCEEEECCHHC T ss_conf 89999999742689889899999986220124204677547899987315503667744289565351364343020103 Q ss_pred CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC-CCCCHHHHHHHHHHHHH Q ss_conf 234334773776168655028999999999986077996079997879847-55858999999999999 Q gi|254781093|r 141 FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF-FIGKVDELINECAHYLD 208 (225) Q Consensus 141 ~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~-f~~~~~~l~~~i~~fl~ 208 (225) .|....+|+|++.|+.|+.. ++.++...+.++..+ +++.|+++|. |.+..+...+.+.+||+ T Consensus 240 ~L~~I~vPTLlt~G~~D~~~-p~~~~~m~~~i~~Sr-----l~~f~~g~H~~m~e~~~~y~~~l~~Fl~ 302 (302) T TIGR01250 240 KLSEIKVPTLLTVGEFDTMT-PEAAREMQELIAKSR-----LVVFPDGSHMTMIEEPEVYFKLLSDFLR 302 (302) T ss_pred CHHHCCCCCEEEEECCCCCC-HHHHHHHHHHCCCCE-----EEECCCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 15435876003342046578-899999987607873-----8984788513753483889999998709 No 44 >COG3458 Acetyl esterase (deacetylase) [Secondary metabolites biosynthesis, transport, and catabolism] Probab=99.61 E-value=4.1e-14 Score=105.01 Aligned_cols=195 Identities=23% Similarity=0.256 Sum_probs=138.4 Q ss_pred EEEECCCCC-EEEEEEECCCC--CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCC------ Q ss_conf 089658996-18799966888--778799995789988889989999999999997398499985315588767------ Q gi|254781093|r 3 EVVFNGPSG-RLEGRYQPSTN--PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEG------ 73 (225) Q Consensus 3 ~v~i~g~~G-~l~~~~~~~~~--~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G------ 73 (225) +|.|+|-+| |+.+||..|.. ...|.+|-.|++ +|+..... ..+ -++..||+|+..|.||.|.|.- T Consensus 58 dvTf~g~~g~rI~gwlvlP~~~~~~~P~vV~fhGY---~g~~g~~~-~~l--~wa~~Gyavf~MdvRGQg~~~~dt~~~p 131 (321) T COG3458 58 DVTFTGYGGARIKGWLVLPRHEKGKLPAVVQFHGY---GGRGGEWH-DML--HWAVAGYAVFVMDVRGQGSSSQDTADPP 131 (321) T ss_pred EEEEECCCCCEEEEEEEEECCCCCCCCEEEEEEEC---CCCCCCCC-CCC--CCCCCCEEEEEEECCCCCCCCCCCCCCC T ss_conf 99972358835778998413457745569999613---67888732-001--6422451699985146777655678898 Q ss_pred ---CC----CCCC----------HHHHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCC Q ss_conf ---78----5210----------23899999899998623-546740588872036878764503862133012275744 Q gi|254781093|r 74 ---EF----DYGD----------GELSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPK 135 (225) Q Consensus 74 ---~~----~~g~----------~E~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~ 135 (225) ++ ..|. +-..|+.-+++-+.+.. .|..+|.+.|.|+||.+++.+++-+|+|++.+++-|... T Consensus 132 ~~~s~pG~mtrGilD~kd~yyyr~v~~D~~~ave~~~sl~~vde~Ri~v~G~SqGGglalaaaal~~rik~~~~~~Pfl~ 211 (321) T COG3458 132 GGPSDPGFMTRGILDRKDTYYYRGVFLDAVRAVEILASLDEVDEERIGVTGGSQGGGLALAAAALDPRIKAVVADYPFLS 211 (321) T ss_pred CCCCCCCEEEEECCCCCCCEEEEEEHHHHHHHHHHHHCCCCCCHHHEEEECCCCCCHHHHHHHHCCHHHHCCCCCCCCCC T ss_conf 98767751676203688736973013779999999826670263146773366675055410105846534345665435 Q ss_pred CCCCC------------------------------------CC-CCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 33322------------------------------------23-433477377616865502899999999998607799 Q gi|254781093|r 136 SYDFS------------------------------------FL-APCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGI 178 (225) Q Consensus 136 ~~~~~------------------------------------~l-~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~ 178 (225) .++.. .+ .....|+|+.-|=-|++||++.....++++...| T Consensus 212 df~r~i~~~~~~~ydei~~y~k~h~~~e~~v~~TL~yfD~~n~A~RiK~pvL~svgL~D~vcpPstqFA~yN~l~~~K-- 289 (321) T COG3458 212 DFPRAIELATEGPYDEIQTYFKRHDPKEAEVFETLSYFDIVNLAARIKVPVLMSVGLMDPVCPPSTQFAAYNALTTSK-- 289 (321) T ss_pred CCHHHEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHCCCCCC-- T ss_conf 651012314647089999999852846889998875666665787606656886212578889712677753036773-- Q ss_pred CEEEEEECCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 6079997879847558589999999999997 Q gi|254781093|r 179 SITHKVIPDANHFFIGKVDELINECAHYLDN 209 (225) Q Consensus 179 ~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~ 209 (225) +..+++--+|..-. .--.+.+..|++. T Consensus 290 --~i~iy~~~aHe~~p--~~~~~~~~~~l~~ 316 (321) T COG3458 290 --TIEIYPYFAHEGGP--GFQSRQQVHFLKI 316 (321) T ss_pred --EEEEEECCCCCCCC--CHHHHHHHHHHHH T ss_conf --27874033544475--2367899999876 No 45 >COG0657 Aes Esterase/lipase [Lipid metabolism] Probab=99.60 E-value=7.2e-13 Score=97.51 Aligned_cols=193 Identities=23% Similarity=0.320 Sum_probs=127.6 Q ss_pred ECCCCCE-EEE-EEEC--CCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHH Q ss_conf 6589961-879-9966--88877879999578998888998999999999999739849998531558876778521023 Q gi|254781093|r 6 FNGPSGR-LEG-RYQP--STNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGE 81 (225) Q Consensus 6 i~g~~G~-l~~-~~~~--~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E 81 (225) ++++.+. +.. +|.| ......|++|++|+-....|+.+.. -..++..+...|+.|+.+|||=. .+..|. .. T Consensus 56 ~~~~~~~~~~~~~y~p~~~~~~~~p~vly~HGGg~~~g~~~~~-~~~~~~~~~~~g~~vv~vdYrla--Pe~~~p---~~ 129 (312) T COG0657 56 LAGPSGDGVPVRVYRPDRKAAATAPVVLYLHGGGWVLGSLRTH-DALVARLAAAAGAVVVSVDYRLA--PEHPFP---AA 129 (312) T ss_pred CCCCCCCCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHH-HHHHHHHHHHCCCEEEEEEECCC--CCCCCC---CH T ss_conf 4677777750599824776678876799988975447880356-89999999876988999722678--888885---04 Q ss_pred HHHHHHHHHHHHHH----CCCCCCEEEEEECHHHHHHHHHHHCC-----CCHHEEEECCCCCCCCC-------------- Q ss_conf 89999989999862----35467405888720368787645038-----62133012275744333-------------- Q gi|254781093|r 82 LSDAAAALDWVQSL----NPESKSCWIAGYSFGAWISMQLLMRR-----PEINGFISVAPQPKSYD-------------- 138 (225) Q Consensus 82 ~~D~~aa~~~l~~~----~~~~~~i~l~G~S~Gg~val~~a~~~-----p~i~~~v~isp~~~~~~-------------- 138 (225) ++|+.+++.|++++ +.++++|+++|.|-||.+++.++... +...+.++++|...... T Consensus 130 ~~d~~~a~~~l~~~~~~~g~dp~~i~v~GdSAGG~La~~~a~~~~~~~~~~p~~~~li~P~~d~~~~~~~~~~~~~~~~~ 209 (312) T COG0657 130 LEDAYAAYRWLRANAAELGIDPSRIAVAGDSAGGHLALALALAARDRGLPLPAAQVLISPLLDLTSSAASLPGYGEADLL 209 (312) T ss_pred HHHHHHHHHHHHHCCHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCHHEEECCCCCCCCCCCHHHHHCCCCCC T ss_conf 99999999999843464277866359998584399999999975423688842220333332444454102330334431 Q ss_pred ------------------------CC-----CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCC Q ss_conf ------------------------22-----2343347737761686550289999999999860779960799978798 Q gi|254781093|r 139 ------------------------FS-----FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDAN 189 (225) Q Consensus 139 ------------------------~~-----~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~ 189 (225) .+ .+.. -.|+++++|+.|.+.+ +.+.+.++++.+ ++.++++.++|.. T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~p~~spl~~~~~~~-lPP~~i~~a~~D~l~~--~~~~~a~~L~~a-gv~~~~~~~~g~~ 285 (312) T COG0657 210 DAAAILAWFADLYLGAAPDREDPEASPLASDDLSG-LPPTLIQTAEFDPLRD--EGEAYAERLRAA-GVPVELRVYPGMI 285 (312) T ss_pred CHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-CCCEEEEECCCCCCCH--HHHHHHHHHHHC-CCCEEEEECCCCC T ss_conf 26788878999850466444585528654665358-9998998447887730--399999999966-9954888768974 Q ss_pred CCCCC--C--HHHHHHHHHHHHH Q ss_conf 47558--5--8999999999999 Q gi|254781093|r 190 HFFIG--K--VDELINECAHYLD 208 (225) Q Consensus 190 H~f~~--~--~~~l~~~i~~fl~ 208 (225) |.|.. . ..+....+.+|++ T Consensus 286 H~f~~~~~~~a~~~~~~~~~~l~ 308 (312) T COG0657 286 HGFDLLTGPEARSALRQIAAFLR 308 (312) T ss_pred CCCCCCCCHHHHHHHHHHHHHHH T ss_conf 45013588889999999999998 No 46 >pfam03096 Ndr Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (su Probab=99.56 E-value=1.3e-12 Score=95.96 Aligned_cols=190 Identities=19% Similarity=0.265 Sum_probs=128.2 Q ss_pred EEECCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHH-HHHH-----HHHHHHCCCEEEEEEECCCCCCCCCCCC Q ss_conf 896589961879996688877879999578998888998999-9999-----9999973984999853155887677852 Q gi|254781093|r 4 VVFNGPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNI-VYQL-----FYLFQQRGFVSLRFNFRGIGRSEGEFDY 77 (225) Q Consensus 4 v~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~-~~~l-----a~~l~~~G~~vl~fd~RG~G~S~G~~~~ 77 (225) -.++++-|.+...+.=....++|++|..| ..|.| ++. ++.+ ++.|. .-|.|+.+|..|+|.+.-.++. T Consensus 2 ~~v~t~~G~~~v~v~g~~~~~kp~~~t~h---dlg~n--~~s~f~~~f~~~~~~~l~-~~f~v~~iD~pGh~~ga~~~~~ 75 (285) T pfam03096 2 HIIETPCGSVHVTVYGDPEGKKPPILTYH---DLGLN--HKSCFQGLFNSESMQEIL-ENFCIYHVDAPGQEDGAASFPG 75 (285) T ss_pred CCCCCCCCEEEEEEEECCCCCCCEEEEEC---CCCCC--HHHHHHHHHCCHHHHHHH-HCCEEEEECCCCCCCCCCCCCC T ss_conf 75547985189999806789988799857---76546--287756530657689987-4267999628987789998998 Q ss_pred CC-----HH-HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCCC----------- Q ss_conf 10-----23-899999899998623546740588872036878764503862-1330122757443332----------- Q gi|254781093|r 78 GD-----GE-LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYDF----------- 139 (225) Q Consensus 78 g~-----~E-~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~~----------- 139 (225) +. -+ .+|+.++++++ ..+++.++|+|+||++++..|.++|+ +.++|++++.+...++ T Consensus 76 ~~~~ps~~~la~~l~~vld~l-----~i~~~~~~G~s~G~~i~~~fA~~~p~rv~~LvLi~~~~~~~~w~Ew~~~k~~~~ 150 (285) T pfam03096 76 GYPYPSMDDLADMLPVVLDHF-----RLKSVIGMGVGAGAYILARFALKHPERVEGLVLINPTPKAAGWIEWFYNKLSSK 150 (285) T ss_pred CCCCCCHHHHHHHHHHHHHHC-----CCCEEEEEEECHHHHHHHHHHHHCHHHEEEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 888878999999999999971-----987699984367889999999869676105799657877767789999999887 Q ss_pred --------------------C--------------------------------------CCC------CCCCCEEEEECC Q ss_conf --------------------2--------------------------------------234------334773776168 Q gi|254781093|r 140 --------------------S--------------------------------------FLA------PCPSSGLIINGS 155 (225) Q Consensus 140 --------------------~--------------------------------------~l~------~~~~p~LiIhG~ 155 (225) + ++. ....|+|+|.|+ T Consensus 151 ~L~~~Gmt~~~~d~ll~h~Fg~~~~~~n~div~~~R~~l~~~~n~~nl~~fl~a~~~R~dl~~~~~~~~~~cpvLlv~Gd 230 (285) T pfam03096 151 LLYYYGMTDSAKDYLLAHYFGKEELSNNSDIVQEYRKFLKERLNPKNLQLYLEAYNSRRDLTIERPGLEEVCPVLLVVGD 230 (285) T ss_pred HHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHCCCCCCCCEEEEECC T ss_conf 77654643789999998765766445887999999999985449899999999984777660220446877666999637 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC-CCHHHHHHHHHHHHHH Q ss_conf 65502899999999998607799607999787984755-8589999999999997 Q gi|254781093|r 156 NDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFI-GKVDELINECAHYLDN 209 (225) Q Consensus 156 ~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~-~~~~~l~~~i~~fl~~ 209 (225) +++.++ .+.....++... ..+++.+++++|.=. +.-+.+.+.+..||.- T Consensus 231 ~sp~~~--~~v~~~~~ld~~---~stllki~dcG~mv~~EqP~k~~e~i~~FLqG 280 (285) T pfam03096 231 NSPHVD--AVVECNTKLDPT---KTTLLKVADCGGLVQQEQPGKLTESFKLFLQG 280 (285) T ss_pred CCCCHH--HHHHHHHHCCCC---CCEEEEECCCCCCHHHCCCHHHHHHHHHHHHC T ss_conf 882077--899998602843---22599956778861103809999999999854 No 47 >TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH; InterPro: IPR010076 This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown. This enzyme belongs to the alpha/beta hydrolase fold family (IPR000073 from INTERPRO). Members of this family are restricted to the proteobacteria.; GO: 0009102 biotin biosynthetic process. Probab=99.55 E-value=4.9e-14 Score=104.55 Aligned_cols=165 Identities=18% Similarity=0.298 Sum_probs=118.6 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCC-CCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEE Q ss_conf 799995789988889989999999999997398499985315588-7677852102389999989999862354674058 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGR-SEGEFDYGDGELSDAAAALDWVQSLNPESKSCWI 104 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~-S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l 104 (225) .-+|+.|+| | ||..++..+.+.|. .-|.+...|.=|+|+ |.+.. ..|+.++.+=+.++. +.+-||| T Consensus 5 ~~lvLiHGW---g--~n~~vf~~l~~~L~-~hf~l~~VDLPGhG~n~~~~~------p~~l~~~a~Ai~~~~-p~~A~wL 71 (248) T TIGR01738 5 KKLVLIHGW---G--MNSEVFDDLVERLS-AHFTLHLVDLPGHGDNSTTLR------PLSLAEVADAIAAQA-PAKAIWL 71 (248) T ss_pred CEEEEEECH---H--HHHHHHHHHHHHCC-CCCEEEEEECCCCCCCCCCCC------CCCHHHHHHHHHHHC-CCCCEEE T ss_conf 414685053---4--32788899877422-675068875588887556678------047789999999738-9987387 Q ss_pred EEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC-----CC------------------C--------------------- Q ss_conf 8872036878764503862-1330122757443-----33------------------2--------------------- Q gi|254781093|r 105 AGYSFGAWISMQLLMRRPE-INGFISVAPQPKS-----YD------------------F--------------------- 139 (225) Q Consensus 105 ~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~-----~~------------------~--------------------- 139 (225) |||+||-+|+.+|++.|+ |+++|.++..+.. |+ | T Consensus 72 -GWSLGGLvA~~~A~~hP~~V~~Lv~~As~p~f~a~edwP~g~~~~~L~~f~~~L~~dy~~ti~rFLaLq~lGt~~A~~~ 150 (248) T TIGR01738 72 -GWSLGGLVALHLAATHPDKVRSLVTVASSPCFTAREDWPEGIEPDVLTKFVEQLKSDYQKTIERFLALQTLGTPDARQD 150 (248) T ss_pred -ECCHHHHHHHHHHHHCCHHHHHHHHHHHCHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf -0236789999998625278888988862201103677622367027789999999989999999987530388306899 Q ss_pred --------------------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf --------------------------------223433477377616865502899999999998607799607999787 Q gi|254781093|r 140 --------------------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPD 187 (225) Q Consensus 140 --------------------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~g 187 (225) ..|..+..|.|.|+|.+|.+||....+.+-+.++ ...-|..++++ T Consensus 151 ~~~l~~~l~~~~~Pn~~~L~~gL~~l~~~DlR~~l~~i~~P~L~~~G~~D~lvP~~~~~~l~~l~p---sG~se~~~~~~ 227 (248) T TIGR01738 151 IRRLKQDLLARPTPNTQALQAGLELLARVDLRRALQKISVPVLRLYGQLDGLVPAKVIEALDRLAP---SGQSELLVFEK 227 (248) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCHHHHHHHHHHCC---CCCCEEEEEEC T ss_conf 999999987178998888999999998631456543068844534176567771789999997467---89831212406 Q ss_pred CCCC-CCCCHHHHHHHHHHHH Q ss_conf 9847-5585899999999999 Q gi|254781093|r 188 ANHF-FIGKVDELINECAHYL 207 (225) Q Consensus 188 a~H~-f~~~~~~l~~~i~~fl 207 (225) +.|. |-.+.++++....+|+ T Consensus 228 aaHAPFlShA~~f~~~l~~f~ 248 (248) T TIGR01738 228 AAHAPFLSHAEAFCALLRDFV 248 (248) T ss_pred CCCCCCCCCHHHHHHHHHHHC T ss_conf 688860100899999998629 No 48 >COG2936 Predicted acyl esterases [General function prediction only] Probab=99.55 E-value=2.3e-14 Score=106.46 Aligned_cols=128 Identities=21% Similarity=0.377 Sum_probs=98.5 Q ss_pred EEEECCCCC-EEEE-EEECCCCCCCCEEEEEC--CCCCCCCCCCCHHHHHHHH---HHHHCCCEEEEEEECCCCCCCCCC Q ss_conf 089658996-1879-99668887787999957--8998888998999999999---999739849998531558876778 Q gi|254781093|r 3 EVVFNGPSG-RLEG-RYQPSTNPNAPIALILH--PHPRFGGTMNDNIVYQLFY---LFQQRGFVSLRFNFRGIGRSEGEF 75 (225) Q Consensus 3 ~v~i~g~~G-~l~~-~~~~~~~~~~~~vv~~H--p~p~~GG~~~~~~~~~la~---~l~~~G~~vl~fd~RG~G~S~G~~ 75 (225) +|+++..|| ||.+ +|.|.+..+.|+++..+ |+-.-.|+.-+ ....+. .++.+||+|+..|-||.|.|+|.| T Consensus 21 ~v~V~MRDGvrL~~dIy~Pa~~g~~Pvll~~~~~Py~k~~~~~~~--~~~~~p~~~~~aa~GYavV~qDvRG~~~SeG~~ 98 (563) T COG2936 21 DVMVPMRDGVRLAADIYRPAGAGPLPVLLSRTRLPYRKRNGTFGP--QLSALPQPAWFAAQGYAVVNQDVRGRGGSEGVF 98 (563) T ss_pred EEEEEECCCEEEEEEEECCCCCCCCCEEEEEECCCCCCCCCCCCC--HHHCCCCCCEEECCCEEEEEECCCCCCCCCCCC T ss_conf 256780587089998871377888866688412663344666762--112265555463276499985265565677622 Q ss_pred CCCCH-HHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEECCC Q ss_conf 52102-389999989999862354674058887203687876450386-2133012275 Q gi|254781093|r 76 DYGDG-ELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP-EINGFISVAP 132 (225) Q Consensus 76 ~~g~~-E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~isp 132 (225) +--.. |.+|-...++|+.+|--...+++.+|.||+|+..+.+|+..| .+++++..++ T Consensus 99 ~~~~~~E~~Dg~D~I~Wia~QpWsNG~Vgm~G~SY~g~tq~~~Aa~~pPaLkai~p~~~ 157 (563) T COG2936 99 DPESSREAEDGYDTIEWLAKQPWSNGNVGMLGLSYLGFTQLAAAALQPPALKAIAPTEG 157 (563) T ss_pred CEECCCCCCCHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHCCCCHHEEECCCCC T ss_conf 12014324332379999986874478013110567899999998319930236623455 No 49 >pfam06342 DUF1057 Alpha/beta hydrolase of unknown function (DUF1057). This family consists of several Caenorhabditis elegans specific proteins of unknown function. Members of this family have an alpha/beta hydrolase fold. Probab=99.53 E-value=2.6e-12 Score=94.16 Aligned_cols=157 Identities=21% Similarity=0.289 Sum_probs=113.3 Q ss_pred ECCCCCE---EEEEEECCC--CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCC--C Q ss_conf 6589961---879996688--87787999957899888899899999999999973984999853155887677852--1 Q gi|254781093|r 6 FNGPSGR---LEGRYQPST--NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDY--G 78 (225) Q Consensus 6 i~g~~G~---l~~~~~~~~--~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~--g 78 (225) ++.-.|+ +++.|.... +.+.++||.+|+-| ||.+| +.-++..|.+.|+-+++.||-|.|.++|..+. . T Consensus 11 ~~~~~~k~v~v~A~y~D~~~~gs~~GTVV~~HGsP---GSH~D--FkYi~~~l~~~~iR~IgiN~PGf~~t~~~~~~~yt 85 (297) T pfam06342 11 FQTELGKTVEVQAVYEDSLTSGSPFGTVVAFHGSP---GSHND--FKYIRSKFEDLNIRFIGVNYPGFEFTTGYPGQSHT 85 (297) T ss_pred HHHHCCCEEEEEEEEEECCCCCCCCEEEEEECCCC---CCCCC--HHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCCCC T ss_conf 76504856888889984489998752799953899---97422--67766667663917999668987678897434557 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCC--------------------- Q ss_conf 02389999989999862354674058887203687876450386213301227574433--------------------- Q gi|254781093|r 79 DGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSY--------------------- 137 (225) Q Consensus 79 ~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~--------------------- 137 (225) .-|.++...++ +.+...+ .+++++|||+|+-.|++++..+| +.++++++|+.-+. T Consensus 86 n~~r~~~~~al--l~~l~i~-~~~i~~GHSrGce~al~~~~~~p-~~g~vlin~~G~rpH~gi~P~~r~~~i~~v~~~L~ 161 (297) T pfam06342 86 NQERNSYSKAL--LEELELK-GKLIIMGHSRGCENALQTATTRP-AHGLVMINPTGFRIHKGIRPKSRMETINWVYKLLP 161 (297) T ss_pred CHHHHHHHHHH--HHHCCCC-CCEEEEEECCCHHHHHHHHHCCC-CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 57799999999--9972877-72599985552477888762587-61489864787666667681789999999999989 Q ss_pred -----------------------------------C-------CCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHH Q ss_conf -----------------------------------3-------222343347737761686550289999999999 Q gi|254781093|r 138 -----------------------------------D-------FSFLAPCPSSGLIINGSNDTVATTSDVKDLVNK 171 (225) Q Consensus 138 -----------------------------------~-------~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~ 171 (225) + ...|...+.|+|+..|.+|..|-.+.+.+.... T Consensus 162 ~~l~~~i~~~~y~~~GfkV~~Geea~~air~~~~~~f~~q~~~I~~l~~k~~k~li~fg~~D~LiEeeI~~E~~~~ 237 (297) T pfam06342 162 LFLGDSILYFYYKSVGLKVSDGEEAAAAMRSMQTFALEEQLESIDKLNEKPTKTFIVFGGKDHLVEEEIVFEALMK 237 (297) T ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHHHHHH T ss_conf 9999999999999827004774899998887730074677889999618898579997476434578999999985 No 50 >KOG4409 consensus Probab=99.51 E-value=1.9e-12 Score=94.98 Aligned_cols=178 Identities=17% Similarity=0.313 Sum_probs=116.1 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCC-CCCC--CCHHHHHHHHHHHHHHHHCC Q ss_conf 88778799995789988889989999999999997398499985315588767-7852--10238999998999986235 Q gi|254781093|r 21 TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEG-EFDY--GDGELSDAAAALDWVQSLNP 97 (225) Q Consensus 21 ~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G-~~~~--g~~E~~D~~aa~~~l~~~~~ 97 (225) ...++...|+.| +||+... .+..-...|+. .+.|..||.-|.|+|.- .|+. ...|.+=+...-+|-.+.+. T Consensus 86 ~~~~~~plVliH---GyGAg~g--~f~~Nf~~La~-~~~vyaiDllG~G~SSRP~F~~d~~~~e~~fvesiE~WR~~~~L 159 (365) T KOG4409 86 ESANKTPLVLIH---GYGAGLG--LFFRNFDDLAK-IRNVYAIDLLGFGRSSRPKFSIDPTTAEKEFVESIEQWRKKMGL 159 (365) T ss_pred CCCCCCCEEEEE---CCCHHHH--HHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 656787579971---5531678--99986666652-38237862667887779988878651119999999999997698 Q ss_pred CCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCC----C---------------------------------- Q ss_conf 46740588872036878764503862-13301227574433----3---------------------------------- Q gi|254781093|r 98 ESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSY----D---------------------------------- 138 (225) Q Consensus 98 ~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~----~---------------------------------- 138 (225) .+..|+||||||++|..+|.++|+ |+.+|+++|..... + T Consensus 160 --~KmilvGHSfGGYLaa~YAlKyPerV~kLiLvsP~Gf~~~~~~~~~~~~~~~~w~~~~~~~~~~~nPl~~LR~~Gp~G 237 (365) T KOG4409 160 --EKMILVGHSFGGYLAAKYALKYPERVEKLILVSPWGFPEKPDSEPEFTKPPPEWYKALFLVATNFNPLALLRLMGPLG 237 (365) T ss_pred --CCEEEEECCCHHHHHHHHHHHCHHHHCEEEEECCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHCCCHHHHHHHCCCCC T ss_conf --200676315027999999986857543379866656656777645315897288865356660677789998605402 Q ss_pred --------------C------CCC--------------------------------------CCCCCCEEEEECCCCCCC Q ss_conf --------------2------223--------------------------------------433477377616865502 Q gi|254781093|r 139 --------------F------SFL--------------------------------------APCPSSGLIINGSNDTVA 160 (225) Q Consensus 139 --------------~------~~l--------------------------------------~~~~~p~LiIhG~~D~~v 160 (225) + +.| -....|+++|||++|=+- T Consensus 238 p~Lv~~~~~d~~~k~~~~~~ed~l~~YiY~~n~~~psgE~~fk~l~~~~g~Ar~Pm~~r~~~l~~~~pv~fiyG~~dWmD 317 (365) T KOG4409 238 PKLVSRLRPDRFRKFPSLIEEDFLHEYIYHCNAQNPSGETAFKNLFEPGGWARRPMIQRLRELKKDVPVTFIYGDRDWMD 317 (365) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCC T ss_conf 67786550889974644221678999999854889827899999875262234069999986266897799944841002 Q ss_pred CHHHHHHHHHHHHHCCCCCEEEEEECCCCCC-CCCCHHHHHHHHHHHHHHH Q ss_conf 8999999999986077996079997879847-5585899999999999973 Q gi|254781093|r 161 TTSDVKDLVNKLMNQKGISITHKVIPDANHF-FIGKVDELINECAHYLDNS 210 (225) Q Consensus 161 p~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~-f~~~~~~l~~~i~~fl~~~ 210 (225) - .........+ ....++.++||+|+|. |-++.+...+.+..++++. T Consensus 318 ~-~~g~~~~~~~---~~~~~~~~~v~~aGHhvylDnp~~Fn~~v~~~~~~~ 364 (365) T KOG4409 318 K-NAGLEVTKSL---MKEYVEIIIVPGAGHHVYLDNPEFFNQIVLEECDKV 364 (365) T ss_pred C-HHHHHHHHHH---HCCCCEEEEECCCCCEEECCCHHHHHHHHHHHHHCC T ss_conf 3-3579899776---205550799569973365479899999999987525 No 51 >pfam10503 Esterase_phd Esterase PHB depolymerase. This family of proteins include acetyl xylan esterases (AXE), feruloyl esterases (FAE), and poly(3-hydroxybutyrate) (PHB) depolymerases. Probab=99.50 E-value=1.2e-12 Score=96.15 Aligned_cols=148 Identities=19% Similarity=0.220 Sum_probs=97.9 Q ss_pred CCCEEEEECCCCCCCCCCCCHH-HHHHHHHHHHCCCEEEEEEECCCCCCCCCC--------CCCCHHHHHHHHHHHHHHH Q ss_conf 7879999578998888998999-999999999739849998531558876778--------5210238999998999986 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNI-VYQLFYLFQQRGFVSLRFNFRGIGRSEGEF--------DYGDGELSDAAAALDWVQS 94 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~-~~~la~~l~~~G~~vl~fd~RG~G~S~G~~--------~~g~~E~~D~~aa~~~l~~ 94 (225) +.|+||.+| +.+++.++-. ...+.+...+.||.|+-++-.......+.| ..+.+|...+...++++.+ T Consensus 20 ~~PLVV~LH---Gc~q~a~~~~~~t~~~~lAd~~GFiVvyPeq~~~~n~~~cWnw~~~~~~~~g~gd~~~I~~mi~~v~~ 96 (226) T pfam10503 20 AMPLVVMLH---GCQQHADDFAAGTRMNALADEHGFAVLYPEQSLRAHAHGCWHWFEPDHDQAGRGEAATIAALADAIAD 96 (226) T ss_pred CCCEEEEEE---CCCCCHHHHHHHCCHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 858799952---89999999985557289998779289936774667899888878864556899769999999999999 Q ss_pred H-CCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEEC-CCCCCC--------C---------CC--------CCCCCCCC Q ss_conf 2-35467405888720368787645038621330122-757443--------3---------32--------22343347 Q gi|254781093|r 95 L-NPESKSCWIAGYSFGAWISMQLLMRRPEINGFISV-APQPKS--------Y---------DF--------SFLAPCPS 147 (225) Q Consensus 95 ~-~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~i-sp~~~~--------~---------~~--------~~l~~~~~ 147 (225) + ..|.+||++.|+|.||+++..+++.+|++.+.+.+ |..+-. + .. ..-...+. T Consensus 97 ~y~iD~~RVyvtGlSaGg~Ma~~La~~~PD~FAAva~~sGlp~g~a~~~~~a~~~m~~g~~~~~~~~~~~~~~~~~~~~~ 176 (226) T pfam10503 97 AHGFDAARVYVAGLSAGAALADLLAHCHPDRFAAVALHSGLALGCAADGASALDAMRRGLGPAPAAATDALSDAGAYPGL 176 (226) T ss_pred HCCCCCCCEEEEEECCHHHHHHHHHHHCCCHHEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCC T ss_conf 72998031789875848999999987586121366000366755445668799986278999806655311046788999 Q ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHH Q ss_conf 737761686550289999999999860 Q gi|254781093|r 148 SGLIINGSNDTVATTSDVKDLVNKLMN 174 (225) Q Consensus 148 p~LiIhG~~D~~vp~~~~~~~~~~l~~ 174 (225) |.+|+||+.|.+|.+.+..++.+.-.. T Consensus 177 P~~v~HG~~D~tV~p~Na~~~~~Qw~~ 203 (226) T pfam10503 177 PAIIFHGDADHTVAPKNADQLTAAFMR 203 (226) T ss_pred CEEEEECCCCCEECCCHHHHHHHHHHH T ss_conf 889996499851471029999999998 No 52 >COG4757 Predicted alpha/beta hydrolase [General function prediction only] Probab=99.46 E-value=3.1e-12 Score=93.68 Aligned_cols=195 Identities=22% Similarity=0.326 Sum_probs=125.7 Q ss_pred EEEECCCCC-EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC----C Q ss_conf 089658996-18799966888778799995789988889989999999999997398499985315588767785----2 Q gi|254781093|r 3 EVVFNGPSG-RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD----Y 77 (225) Q Consensus 3 ~v~i~g~~G-~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~----~ 77 (225) |+.+|-+|| +|.+-..|+++..++-+++.- .. | +....++.+|..++++||.|+.|||||.|+|.=.-. . T Consensus 7 e~~l~~~DG~~l~~~~~pA~~~~~g~~~va~---a~-G-v~~~fYRrfA~~a~~~Gf~Vlt~dyRG~g~S~p~~~~~~~~ 81 (281) T COG4757 7 EAHLPAPDGYSLPGQRFPADGKASGRLVVAG---AT-G-VGQYFYRRFAAAAAKAGFEVLTFDYRGIGQSRPASLSGSQW 81 (281) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEECC---CC-C-CCHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCC T ss_conf 3020258886576231368987777477326---67-7-30567579999863067318987512556777310036765 Q ss_pred CCH--HHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEE-------------------------C Q ss_conf 102--3899999899998623546740588872036878764503862133012-------------------------2 Q gi|254781093|r 78 GDG--ELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFIS-------------------------V 130 (225) Q Consensus 78 g~~--E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~-------------------------i 130 (225) +.. ...|..++++++++..+ ..+.+.+||||||.+ +-++..++...++.. + T Consensus 82 ~~~DwA~~D~~aal~~~~~~~~-~~P~y~vgHS~GGqa-~gL~~~~~k~~a~~vfG~gagwsg~m~~~~~l~~~~l~~lv 159 (281) T COG4757 82 RYLDWARLDFPAALAALKKALP-GHPLYFVGHSFGGQA-LGLLGQHPKYAAFAVFGSGAGWSGWMGLRERLGAVLLWNLV 159 (281) T ss_pred CHHHHHHCCHHHHHHHHHHHCC-CCCEEEEECCCCCEE-ECCCCCCCCCCEEEEECCCCCCCCCHHHHHCCCCEEECCCC T ss_conf 3456564125999999873377-996388621445200-01155676333046742552223300011022100241321 Q ss_pred CCCCCCCC------------------------------CCC-----------CCCCCCCEEEEECCCCCCCCHHHHHHHH Q ss_conf 75744333------------------------------222-----------3433477377616865502899999999 Q gi|254781093|r 131 APQPKSYD------------------------------FSF-----------LAPCPSSGLIINGSNDTVATTSDVKDLV 169 (225) Q Consensus 131 sp~~~~~~------------------------------~~~-----------l~~~~~p~LiIhG~~D~~vp~~~~~~~~ 169 (225) .|++..+. +.+ .+.+..|+..+-.++|+-+|......+. T Consensus 160 ~p~lt~w~g~~p~~l~G~G~d~p~~v~RdW~RwcR~p~y~fddp~~~~~~q~yaaVrtPi~~~~~~DD~w~P~As~d~f~ 239 (281) T COG4757 160 GPPLTFWKGYMPKDLLGLGSDLPGTVMRDWARWCRHPRYYFDDPAMRNYRQVYAAVRTPITFSRALDDPWAPPASRDAFA 239 (281) T ss_pred CCCHHHCCCCCCHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHH T ss_conf 35321101228386617886686258889999863864123686575799999885475466326778768878999998 Q ss_pred HHHHHCCCCCEEEEEECCC----CCC--CCCCHHHHHHHHHHHH Q ss_conf 9986077996079997879----847--5585899999999999 Q gi|254781093|r 170 NKLMNQKGISITHKVIPDA----NHF--FIGKVDELINECAHYL 207 (225) Q Consensus 170 ~~l~~~~~~~~~~~~i~ga----~H~--f~~~~~~l~~~i~~fl 207 (225) +-..++ +++.+.++-+ +|+ |.+..|.+-+...+|+ T Consensus 240 ~~y~nA---pl~~~~~~~~~~~lGH~gyfR~~~Ealwk~~L~w~ 280 (281) T COG4757 240 SFYRNA---PLEMRDLPRAEGPLGHMGYFREPFEALWKEMLGWF 280 (281) T ss_pred HHHHCC---CCCCEECCCCCCCCCCHHHHCCCHHHHHHHHHHHH T ss_conf 762147---63002137222755402554062177899998754 No 53 >pfam03403 PAF-AH_p_II isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. Three known PAF-AHs are the brain heterotrimeric PAF-AH Ib, whose catalytic beta and gamma subunits are aligned in pfam02266, the extracellular, plasma PAF-AH (pPAF-AH), and the intracellular PAF-AH isoform II (PAF-AH II). This family aligns pPAF-AH and PAF-AH II, whose similarity was previously noted. Probab=99.46 E-value=9.3e-12 Score=90.80 Aligned_cols=192 Identities=16% Similarity=0.180 Sum_probs=116.1 Q ss_pred EEEEEECCC--CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCC--------------CCCC- Q ss_conf 879996688--87787999957899888899899999999999973984999853155887--------------6778- Q gi|254781093|r 13 LEGRYQPST--NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRS--------------EGEF- 75 (225) Q Consensus 13 l~~~~~~~~--~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S--------------~G~~- 75 (225) +.+....|- +..-|++|+.| ++||++.. ...+|..|+..||.|+..|.|..-.+ +..+ T Consensus 86 ~~a~~nap~~~~~~~PvvIFSH---Glgg~R~~--yS~~~eeLAS~GyVV~aidH~d~Sa~~t~~~~~~~~~~~~~~~~~ 160 (372) T pfam03403 86 LPAIWNAPFKTGEKYPLIVFSH---GLGAFRTI--YSAICIELASHGFVVAAVEHRDRSASATYFFKDKNAAEVEEPSWI 160 (372) T ss_pred CCCCCCCCCCCCCCCCEEEECC---CCCCCHHH--HHHHHHHHHCCCEEEEECCCCCCCCEEEEECCCCCCCCCCCCCEE T ss_conf 4321478777789874899889---98762567--899999987183299971567886403775477766546654212 Q ss_pred --CCCC---------H----HHHHHHHHHHHHHHH---------------------CCCCCCEEEEEECHHHHHHHHHHH Q ss_conf --5210---------2----389999989999862---------------------354674058887203687876450 Q gi|254781093|r 76 --DYGD---------G----ELSDAAAALDWVQSL---------------------NPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 76 --~~g~---------~----E~~D~~aa~~~l~~~---------------------~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) .... . -.+|+..+++.+++. ..|.++|.++||||||+.|++++. T Consensus 161 ~~~~~~~~~~~~~r~~qv~~R~~e~~~vl~~l~~l~~g~~~~~~~~~~~~~~~l~g~lD~~~I~~~GHSfGGATa~~~l~ 240 (372) T pfam03403 161 YLRDVNAEEEFKIRNEQVGQRAQECSKALSIILQIDLGTPVENVLDSDFDLQQLKGNLDTSKIAVIGHSFGGATVIQSLS 240 (372) T ss_pred ECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHCCCCCCCEEEEEECCCHHHHHHHHH T ss_conf 01355641568888899999999999999999985258853112466545677605456463579853403799999986 Q ss_pred CCCCHHEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC-CCC---- Q ss_conf 386213301227574433322234334773776168655028999999999986077996079997879847-558---- Q gi|254781093|r 120 RRPEINGFISVAPQPKSYDFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF-FIG---- 194 (225) Q Consensus 120 ~~p~i~~~v~isp~~~~~~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~-f~~---- 194 (225) .++++++.|.+-+-.--.+-......+.|.|+|..+.=+ -.++... .+++..... .-....|.|+.|. |++ T Consensus 241 ~D~R~~agI~LDgwm~Pl~~~~~~~~~~P~l~i~se~f~--~~~n~~~-~~~~~~~~~-~~~~~ti~gs~H~sftD~~~~ 316 (372) T pfam03403 241 EDTRFRCGIALDAWMYPVGQDQYSQARQPLLFINSEKFQ--WNENIFK-MKKIYSPDK-ESKMITLKGSVHQNFSDFPFV 316 (372) T ss_pred HCCCCCEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCC--CCCCHHH-HHHHHCCCC-CCEEEEECCCCCCCCCCCHHH T ss_conf 187713798415523657632000368877998345456--7556899-998734788-724999789746786641023 Q ss_pred --C-----------------HHHHHHHHHHHHHHHCCC Q ss_conf --5-----------------899999999999973122 Q gi|254781093|r 195 --K-----------------VDELINECAHYLDNSLDE 213 (225) Q Consensus 195 --~-----------------~~~l~~~i~~fl~~~L~~ 213 (225) + .+-..+.+..||+++|.. T Consensus 317 ~p~~~~~~~~~~g~idp~~~~~i~n~~~l~Fl~~hL~~ 354 (372) T pfam03403 317 TGKIIGKFFKLKGEIDPYEAMDIANRASLAFLQKHLDL 354 (372) T ss_pred CHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 51878864267766699999999999999999987177 No 54 >PRK10115 protease 2; Provisional Probab=99.45 E-value=2.2e-11 Score=88.54 Aligned_cols=212 Identities=18% Similarity=0.164 Sum_probs=135.9 Q ss_pred EEEECCCCC-EEEE--EEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC-CCC Q ss_conf 089658996-1879--996688877879999578998888998999999999999739849998531558876778-521 Q gi|254781093|r 3 EVVFNGPSG-RLEG--RYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF-DYG 78 (225) Q Consensus 3 ~v~i~g~~G-~l~~--~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~-~~g 78 (225) .|+.+..|| ++.- .+.........-.++++++++||=+++ +.+....-.|.++|+.....|-||-|+-.-.| ..| T Consensus 418 rv~~~SkDGt~VP~siv~rk~~~~dg~~P~lLyGYG~ygis~~-p~Fs~~~l~ll~rG~v~AiahvRGGgE~G~~Wh~~G 496 (686) T PRK10115 418 HLWITARDGVEVPVSLVYHRKHFRKGHNPLLVYGYGSYGASID-ADFSFSRLSLLDRGFVYAIVHVRGGGELGQQWYEDG 496 (686) T ss_pred EEEEECCCCCEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCC-CCCCHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHH T ss_conf 9999889997831799980688767898779999764666327-862665756865897899996157887744789866 Q ss_pred CHH-----HHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCCCH-HEEEECCCCCCCCCC------------ Q ss_conf 023-----89999989999862-35467405888720368787645038621-330122757443332------------ Q gi|254781093|r 79 DGE-----LSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRPEI-NGFISVAPQPKSYDF------------ 139 (225) Q Consensus 79 ~~E-----~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p~i-~~~v~isp~~~~~~~------------ 139 (225) ..+ .+|..|+.+||.++ ...+.++++.|-|-||.+...++.++|++ +++|+--|.+....+ T Consensus 497 ~~~~K~n~f~Dfia~ae~Li~~g~t~~~~l~~~G~SaGGLLvga~~n~~Pelf~a~v~~Vp~vD~l~~~~d~~~plt~~e 576 (686) T PRK10115 497 KFLKKKNTFNDYLDACDALLKLGYGSPSLCYGMGGSAGGMLMGVAINERPELFHGVIAQVPFVDVVTTMLDESIPLTTGE 576 (686) T ss_pred HHHCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECHHHHHHHHHHHHCHHHHCEEEECCCCCCCEECCCCCCCCCCCCH T ss_conf 56459647799999999999839998302899972758888999885195763557866772232121334787888641 Q ss_pred -------------------CC---CCC-CCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCC--CCEEEEEECCCCCCCC- Q ss_conf -------------------22---343-347737761686550289999999999860779--9607999787984755- Q gi|254781093|r 140 -------------------SF---LAP-CPSSGLIINGSNDTVATTSDVKDLVNKLMNQKG--ISITHKVIPDANHFFI- 193 (225) Q Consensus 140 -------------------~~---l~~-~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~--~~~~~~~i~ga~H~f~- 193 (225) +. +.+ .=-++|+..|.+|++|++-+..+|+.+|+..+. .++-+.+--++||+-. T Consensus 577 ~~E~G~p~~~~~~~~i~~YsPy~nv~~~~YP~~l~tt~~~D~RV~~~~~~K~~A~L~~~~~~~~p~ll~~~~~aGHg~~~ 656 (686) T PRK10115 577 FEEWGNPQDPQYYEYMKSYSPYDNVTAQAYPHLLVTTGLHDSQVQYWEPAKWVAKLRELKTDDHLLLLCTDMDSGHGGKS 656 (686) T ss_pred HHHCCCCCCHHHHHHHHHCCCHHCCCCCCCCHHHEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC T ss_conf 20137968999999998659242589788986878842688887977779999999863659970899964768677888 Q ss_pred CCHHHHHHHH--HHHHHHHCCCCC Q ss_conf 8589999999--999997312210 Q gi|254781093|r 194 GKVDELINEC--AHYLDNSLDEKF 215 (225) Q Consensus 194 ~~~~~l~~~i--~~fl~~~L~~~~ 215 (225) ++...+.+.. -.||=+.++... T Consensus 657 ~~~~~~~e~A~~~aFll~~~~~~~ 680 (686) T PRK10115 657 GRFKSYEGVAMEYAFLIALAQGTL 680 (686) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 889999999999999999730669 No 55 >PRK05077 frsA fermentation/respiration switch protein; Reviewed Probab=99.42 E-value=1.1e-11 Score=90.29 Aligned_cols=191 Identities=21% Similarity=0.370 Sum_probs=132.3 Q ss_pred CEEEECCCCC-EEEEEEECCC-CCCCCEEEEECCCCCCCCCCCCH--HHHHHHH-HHHHCCCEEEEEEECCCCCCCCCCC Q ss_conf 7089658996-1879996688-87787999957899888899899--9999999-9997398499985315588767785 Q gi|254781093|r 2 PEVVFNGPSG-RLEGRYQPST-NPNAPIALILHPHPRFGGTMNDN--IVYQLFY-LFQQRGFVSLRFNFRGIGRSEGEFD 76 (225) Q Consensus 2 ~~v~i~g~~G-~l~~~~~~~~-~~~~~~vv~~Hp~p~~GG~~~~~--~~~~la~-~l~~~G~~vl~fd~RG~G~S~G~~~ 76 (225) -|+.||-++| .+.|.+|.|. ..+.|+|++| |..|.- .+.++.+ .|+-+|++.|..|--++|.|.- |. T Consensus 169 k~i~f~~~~G~~i~gfLHLP~~~~p~PtViv~-------GgLDsLQ~D~~rLF~dylaP~giAMLTlDMPSvG~Ss~-w~ 240 (414) T PRK05077 169 KELEFPVPGGKPITGFLHLPKGDGPFPTVLMC-------GGLDSLQTDYYRLFRDYLAPRGIAMLTIDMPSVGFSSK-WK 240 (414) T ss_pred EEEEEEECCCCEEEEEEECCCCCCCCCEEEEE-------CCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCC-CC T ss_conf 37888726996578998469999988759995-------67247788899999985264673379703787765363-54 Q ss_pred CCCHHHHHH----HHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEECCCCCCCCC------------ Q ss_conf 210238999----99899998623-54674058887203687876450386-2133012275744333------------ Q gi|254781093|r 77 YGDGELSDA----AAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRP-EINGFISVAPQPKSYD------------ 138 (225) Q Consensus 77 ~g~~E~~D~----~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~isp~~~~~~------------ 138 (225) -.+|. ++++.++...- .|-.+|.++||-|||.+|..+|--.| +++++++++|++...- T Consensus 241 ----LtqDsS~LHQaVL~~L~~vPWVDh~Rv~~~GfRFG~NvAvRLayLEp~rlKavv~lG~~VH~~~~~~~~~~~~P~M 316 (414) T PRK05077 241 ----LTQDSSLLHQAVLNALPNVPWVDHTRVAAFGFRFGANVAVRLAYLEPPRLKAVACLGPVVHALLSDPDRQQQVPEM 316 (414) T ss_pred ----CCCHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEECHHHHHHHHHHCCCCEEEEEEECHHHHHHHCCHHHHHHCCHH T ss_conf ----4430679999999855689863230356776420206999998626200126887262888883688888768789 Q ss_pred -------------------------C-----CCC-CCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf -------------------------2-----223-433477377616865502899999999998607799607999787 Q gi|254781093|r 139 -------------------------F-----SFL-APCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPD 187 (225) Q Consensus 139 -------------------------~-----~~l-~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~g 187 (225) + ..| ..|++|.|-+.-++|.++|.++.+-++. ....|. ...+| T Consensus 317 ylDvlASRLg~~~~~~~~L~~~l~~~SLK~QGlLgRr~~vPmLa~~~k~Dp~sP~~d~~Lia~--sS~~Gk---~~~i~- 390 (414) T PRK05077 317 YLDVLASRLGMHDASDEALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLIAS--SSADGK---LLEIP- 390 (414) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHCCHHHHCCCCCCCCCCEEEECCCCCCCCCHHHHHHHHH--HCCCCC---EEECC- T ss_conf 999999874643388899987763300765022568789762440258999798688999997--445884---56447- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 984755858999999999999731 Q gi|254781093|r 188 ANHFFIGKVDELINECAHYLDNSL 211 (225) Q Consensus 188 a~H~f~~~~~~l~~~i~~fl~~~L 211 (225) ..-.|.+. ++..+.+.+||+++| T Consensus 391 ~~pv~~~~-~~aL~~~~~Wl~~~l 413 (414) T PRK05077 391 FKPVYRNF-DKALQEIIDWLEDKL 413 (414) T ss_pred CCCHHHHH-HHHHHHHHHHHHHHC T ss_conf 86346679-999999999998643 No 56 >pfam08840 BAAT_C BAAT / Acyl-CoA thioester hydrolase C terminal. This catalytic domain is found at the C terminal of acyl-CoA thioester hydrolases and bile acid-CoA:amino acid N-acetyltransferases (BAAT). Probab=99.39 E-value=2e-11 Score=88.84 Aligned_cols=131 Identities=13% Similarity=0.186 Sum_probs=96.5 Q ss_pred HHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCC----------------C---- Q ss_conf 89999989999862-35467405888720368787645038621330122757443332----------------2---- Q gi|254781093|r 82 LSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYDF----------------S---- 140 (225) Q Consensus 82 ~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~----------------~---- 140 (225) +|=...|++|++++ ..+.++|.|+|.|+||-+|+.+|+..|+|.++|+++|....+.. . T Consensus 3 LEYFe~Ai~~L~~~p~v~~~~IGl~G~SkGaelaL~~As~~~~I~avV~i~gs~~~~~~~~~yk~~~lp~~~~~~~~~~~ 82 (211) T pfam08840 3 LEYFEEAINYLLRHPKVKGDKIGLLGISKGGELALSMASFLKQISAVVSINGSAVVYGGPLLYKDNPLPPLGFGMEKIKV 82 (211) T ss_pred HHHHHHHHHHHHHCCCCCCCCEEEEEECHHHHHHHHHHHHCCCCCEEEEECCCCCEECCCCCCCCCCCCCCCCCCCEEEE T ss_conf 69999999999959877899889999788999999998607777789997896014467320189527886656120254 Q ss_pred -------------------------CCCCCCCCEEEEECCCCCCCCHH-HHHHHHHHHHHCCC-CCEEEEEECCCCCCCC Q ss_conf -------------------------23433477377616865502899-99999999860779-9607999787984755 Q gi|254781093|r 141 -------------------------FLAPCPSSGLIINGSNDTVATTS-DVKDLVNKLMNQKG-ISITHKVIPDANHFFI 193 (225) Q Consensus 141 -------------------------~l~~~~~p~LiIhG~~D~~vp~~-~~~~~~~~l~~~~~-~~~~~~~i~ga~H~f~ 193 (225) .+.....|.|+|.|++|...|-+ .++...++++.... ..+++..|||+||++. T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~a~IpvEki~~piLli~G~dD~~wpS~~~a~~i~~rL~~~g~~~~~~~l~Yp~aGH~i~ 162 (211) T pfam08840 83 NKNGLLDIRDMFNLPLSKPALKALIPVERAKGPFLFVVGEDDHNWPSVEYAEIACERLKKHGKKVEVQVLTYPGAGHLIE 162 (211) T ss_pred CCCCCEEEHHHHCCCCCCCCCCCCCCHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCEEC T ss_conf 27874540123316554544124054755688789997567677755999999999999819998628998189998588 Q ss_pred ----------------------C-------CHHHHHHHHHHHHHHHCC Q ss_conf ----------------------8-------589999999999997312 Q gi|254781093|r 194 ----------------------G-------KVDELINECAHYLDNSLD 212 (225) Q Consensus 194 ----------------------~-------~~~~l~~~i~~fl~~~L~ 212 (225) + -+++.=+.+.+||+++|. T Consensus 163 pPY~P~~~~~~~~~~~~~~~~GG~~~~ha~Aq~~sW~~i~~Ff~~hL~ 210 (211) T pfam08840 163 PPYFPHCRASYHALVGMPVLWGGEPKAHAKAQEDSWKKIQAFFKKHLG 210 (211) T ss_pred CCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 899987223135556762553884078799999999999999998728 No 57 >pfam08538 DUF1749 Protein of unknown function (DUF1749). This is a plant and fungal family of unknown function. This family contains many hypothetical proteins. Probab=99.38 E-value=4e-11 Score=86.97 Aligned_cols=190 Identities=21% Similarity=0.250 Sum_probs=116.6 Q ss_pred EEEEEE-CCCCCCCC-EEEEECCCCCCC-CCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC--CHHHHHHHH Q ss_conf 879996-68887787-999957899888-8998999999999999739849998531558876778521--023899999 Q gi|254781093|r 13 LEGRYQ-PSTNPNAP-IALILHPHPRFG-GTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYG--DGELSDAAA 87 (225) Q Consensus 13 l~~~~~-~~~~~~~~-~vv~~Hp~p~~G-G~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g--~~E~~D~~a 87 (225) |.+.-. +.....+| .+|+.- +.| |-..-+-+..||++|...||+++.+..+-+-.. |..+ ...++|+.+ T Consensus 20 lvafE~~~~~~~~~p~~llFIG---GLtDGl~tvpY~~~La~aL~~~~wsl~q~~lsSSY~G---wG~~SL~~D~~Ei~~ 93 (303) T pfam08538 20 LVAFEFETSGGLTKPNALLFIG---GLGDGLLTVPYVQPLADALDESGWSLVQPQLSSSYGG---WGTGSLDRDDEEIQA 93 (303) T ss_pred CEEEEECCCCCCCCCCEEEEEC---CCCCCCCCCCCHHHHHHHHCCCCCEEEEEEEECCCCC---CCCCCHHHHHHHHHH T ss_conf 3589944899888997799978---7778775664579999985538837999985035467---462523300999999 Q ss_pred HHHHHHHHC---CCCCCEEEEEECHHHHHHHHHHHC-----CCCHHEEEECCCCCCCC---------------------- Q ss_conf 899998623---546740588872036878764503-----86213301227574433---------------------- Q gi|254781093|r 88 ALDWVQSLN---PESKSCWIAGYSFGAWISMQLLMR-----RPEINGFISVAPQPKSY---------------------- 137 (225) Q Consensus 88 a~~~l~~~~---~~~~~i~l~G~S~Gg~val~~a~~-----~p~i~~~v~isp~~~~~---------------------- 137 (225) .++|++... ....+|+|+|||-|..-++.+..+ ++.|.|.|+-+|...+- T Consensus 94 ~v~Ylr~~~~~~~~~~kIVLMGHSTGcQDv~~Yl~~~~~~~~~~vdGaIlQAPVSDREa~~~~~~~~~~~~~~v~~A~~~ 173 (303) T pfam08538 94 LVEYLRTTKGGTFGRRKIVLMGHSTGSQDVMHYLTSGNEMGRPKVDGGILQAPVSDREAILTVIEERKKYKDLVAVAKKM 173 (303) T ss_pred HHHHHHHHCCCCCCCCCEEEEECCCCCHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 99999860567678862899845877047999971789887664105888777463999985045058999999999999 Q ss_pred ---------------------------CC--------------CCC---------CCCCCCEEEEECCCCCCCCHHH-HH Q ss_conf ---------------------------32--------------223---------4334773776168655028999-99 Q gi|254781093|r 138 ---------------------------DF--------------SFL---------APCPSSGLIINGSNDTVATTSD-VK 166 (225) Q Consensus 138 ---------------------------~~--------------~~l---------~~~~~p~LiIhG~~D~~vp~~~-~~ 166 (225) +| ++| .....|.|++++++|++||..- -+ T Consensus 174 i~~g~~~~ilP~~~~~~~~~~tPitA~Rf~SL~s~~gdDD~FSSDL~de~L~~tfG~i~~plLvl~sg~DE~VP~~VDke 253 (303) T pfam08538 174 VDKGKGDEILPREFAEKMGWETPVSAYRFWSLASPRGDDDYFSSDLPDEDLAKTFGKVKEPLLILYSENDEFVPEWVDKE 253 (303) T ss_pred HHCCCCCCCCCHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCHH T ss_conf 96699984378524000047897149999973189998664656799899999854768736999668877278656999 Q ss_pred HHHHHHHHCCCCC---EEEEEECCCCCCCCCC-----HHHHHHHHHHHHH Q ss_conf 9999986077996---0799978798475585-----8999999999999 Q gi|254781093|r 167 DLVNKLMNQKGIS---ITHKVIPDANHFFIGK-----VDELINECAHYLD 208 (225) Q Consensus 167 ~~~~~l~~~~~~~---~~~~~i~ga~H~f~~~-----~~~l~~~i~~fl~ 208 (225) ++.++-+.+.... -.--+||||+|...+. .+.|.+.|..||+ T Consensus 254 ~Ll~RW~~a~~~~~ws~~SgiIpGAsH~v~~~~q~e~~~~Lv~rV~~flk 303 (303) T pfam08538 254 DLLARWKRFTKKKYWSSLSGIIPGATHNVGGKSAEEAQKWLVERVLSFLK 303 (303) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 99999998658984066667377986778987736899999999999749 No 58 >TIGR01249 pro_imino_pep_1 proline iminopeptidase; InterPro: IPR005944 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members. ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm. Probab=99.36 E-value=1.3e-11 Score=89.85 Aligned_cols=183 Identities=17% Similarity=0.242 Sum_probs=121.9 Q ss_pred EEEEEEECCCC-CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC-CHHHHHHHHHH Q ss_conf 18799966888-77879999578998888998999999999999739849998531558876778521-02389999989 Q gi|254781093|r 12 RLEGRYQPSTN-PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYG-DGELSDAAAAL 89 (225) Q Consensus 12 ~l~~~~~~~~~-~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g-~~E~~D~~aa~ 89 (225) -.+-+|....+ |..-.||++|+-|+-|.+-.++ +.+--.=|-.+.||-||+|+|.- ++.- .--+.|+.+=+ T Consensus 13 d~H~LYye~~GnP~G~PV~~lHGGPGsGt~~~~r------~fFdpe~~rIvL~DQRGcGkS~p-~a~~~eNtTWdLV~Di 85 (310) T TIGR01249 13 DNHQLYYEQSGNPDGKPVVFLHGGPGSGTDPECR------RFFDPETYRIVLLDQRGCGKSTP-HACLEENTTWDLVADI 85 (310) T ss_pred CEEEEEEECCCCCCCCEEEEEECCCCCCCCCCCC------CCCCCCCEEEEEEEECCCCCCCC-CCCCCCCCCHHHHHHH T ss_conf 6135422106798995489975687899883446------45376635899983078889862-4332247705667439 Q ss_pred HHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECC---------------------------------CCCC Q ss_conf 9998623546740588872036878764503862-13301227---------------------------------5744 Q gi|254781093|r 90 DWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVA---------------------------------PQPK 135 (225) Q Consensus 90 ~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~is---------------------------------p~~~ 135 (225) +-+|+.- ...+=-|+|=|.|+.+||.+|..+|+ +.++|+=+ |... T Consensus 86 EkLR~~L-~I~~W~vFGGSWGStLALaYAq~HP~~v~~lvLRgiFL~R~~e~~w~~~~G~~~~~YP~~w~~F~d~IP~~~ 164 (310) T TIGR01249 86 EKLREKL-GIKKWLVFGGSWGSTLALAYAQTHPEKVTGLVLRGIFLLREKELSWFYEGGLASMIYPDAWQRFVDSIPENE 164 (310) T ss_pred HHHHHHC-CCCCEEEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCC T ss_conf 9999862-897148853877899999986016246535565567632866789997268702347255665410587401 Q ss_pred CCCCCCC---------------------------------C--------------------------------------- Q ss_conf 3332223---------------------------------4--------------------------------------- Q gi|254781093|r 136 SYDFSFL---------------------------------A--------------------------------------- 143 (225) Q Consensus 136 ~~~~~~l---------------------------------~--------------------------------------- 143 (225) +.++..| . T Consensus 165 r~sY~~lv~ayh~~l~~~De~~~~~aAkAW~~WE~~t~~L~~~~~~~~~aed~~~~la~ArlEnHYfVNkgFl~~e~~lL 244 (310) T TIGR01249 165 RNSYEQLVNAYHDRLQSEDEETKLAAAKAWVDWESATTLLRPENEIVSTAEDAKFSLALARLENHYFVNKGFLDSENFLL 244 (310) T ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 36578999999874057537999999864666763565513471434111477899999877510220464053468888 Q ss_pred -C----CCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf -3----34773776168655028999999999986077996079997879847558589999999999997 Q gi|254781093|r 144 -P----CPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDN 209 (225) Q Consensus 144 -~----~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~ 209 (225) . ...|+.|+||==|-+||..+.-+|-++.+++ ++.++++|||. ..++.+.+++..-+++ T Consensus 245 ~ni~~i~~i~~~iv~GRyDl~cPl~~awaL~kafPea-----~L~v~~~AGHs--a~dp~~~a~lv~a~~~ 308 (310) T TIGR01249 245 DNISKIRNIPTVIVHGRYDLICPLQSAWALHKAFPEA-----ELKVVNNAGHS--AFDPNNLAALVVALET 308 (310) T ss_pred HHHHHHCCCCEEEEECCEEHCCHHCCHHHHHHCCCCC-----EEEEECCCCCC--CCCHHHHHHHHHHHHH T ss_conf 6677640687379844600002003544675218551-----46674578866--5765479999999861 No 59 >KOG2564 consensus Probab=99.36 E-value=4.9e-11 Score=86.47 Aligned_cols=195 Identities=18% Similarity=0.220 Sum_probs=119.2 Q ss_pred CCEEEECCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHH-CCCEEEEEEECCCCCCCCCCC-CC Q ss_conf 9708965899618799966888778799995789988889989999999999997-398499985315588767785-21 Q gi|254781093|r 1 MPEVVFNGPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQ-RGFVSLRFNFRGIGRSEGEFD-YG 78 (225) Q Consensus 1 m~~v~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~-~G~~vl~fd~RG~G~S~G~~~-~g 78 (225) |++|.|++.++....++.-+..+..|+.++.| +.|-++- .+..+|+.+.. .-+.|+.+|.||+|++.=+-. .- T Consensus 50 kedv~i~~~~~t~n~Y~t~~~~t~gpil~l~H---G~G~S~L--SfA~~a~el~s~~~~r~~a~DlRgHGeTk~~~e~dl 124 (343) T KOG2564 50 KEDVSIDGSDLTFNVYLTLPSATEGPILLLLH---GGGSSAL--SFAIFASELKSKIRCRCLALDLRGHGETKVENEDDL 124 (343) T ss_pred CCCCCCCCCCCEEEEEEECCCCCCCCEEEEEE---CCCCCCH--HHHHHHHHHHHHCCEEEEEEECCCCCCCCCCCHHHC T ss_conf 02466678764377888147888765799960---5864302--079999999864140498750345676546884652 Q ss_pred CHH--HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHC--CCCHHEEEECCCC------------------CCC Q ss_conf 023--899999899998623546740588872036878764503--8621330122757------------------443 Q gi|254781093|r 79 DGE--LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR--RPEINGFISVAPQ------------------PKS 136 (225) Q Consensus 79 ~~E--~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~--~p~i~~~v~isp~------------------~~~ 136 (225) ..| ..|.-+++.++-.. ...+|+|+||||||++|...|.. -|.+.|++.+--. ++. T Consensus 125 S~eT~~KD~~~~i~~~fge--~~~~iilVGHSmGGaIav~~a~~k~lpsl~Gl~viDVVEgtAmeAL~~m~~fL~~rP~~ 202 (343) T KOG2564 125 SLETMSKDFGAVIKELFGE--LPPQIILVGHSMGGAIAVHTAASKTLPSLAGLVVIDVVEGTAMEALNSMQHFLRNRPKS 202 (343) T ss_pred CHHHHHHHHHHHHHHHHCC--CCCCEEEEECCCCCHHHHHHHHHHHCHHHHCEEEEEEECHHHHHHHHHHHHHHHCCCCC T ss_conf 7888888999999998356--89866998426662332306565222233063899875257999988778887559852 Q ss_pred CC------------------------------------------------------CCCC----CCCCCCEEEEECCCCC Q ss_conf 33------------------------------------------------------2223----4334773776168655 Q gi|254781093|r 137 YD------------------------------------------------------FSFL----APCPSSGLIINGSNDT 158 (225) Q Consensus 137 ~~------------------------------------------------------~~~l----~~~~~p~LiIhG~~D~ 158 (225) +. |..| -.+|.|.|+|-...|. T Consensus 203 F~Si~~Ai~W~v~sg~~Rn~~SArVsmP~~~~~~~eGh~yvwrtdL~kte~YW~gWF~gLS~~Fl~~p~~klLilAg~d~ 282 (343) T KOG2564 203 FKSIEDAIEWHVRSGQLRNRDSARVSMPSQLKQCEEGHCYVWRTDLEKTEQYWKGWFKGLSDKFLGLPVPKLLILAGVDR 282 (343) T ss_pred CCCHHHHHHHHHCCCCCCCCCCCEEECCHHEEECCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCC T ss_conf 22346677888602510165641574432304426787379996431321567777740006752797662467722344 Q ss_pred CCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC-HHHHHHHHHHHHHHH Q ss_conf 0289999999999860779960799978798475585-899999999999973 Q gi|254781093|r 159 VATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGK-VDELINECAHYLDNS 210 (225) Q Consensus 159 ~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~-~~~l~~~i~~fl~~~ 210 (225) .-- +-.-.|-..+.++.+++..+|+-++. ...+...+.+|+.++ T Consensus 283 LDk--------dLtiGQMQGk~Q~~vL~~~GH~v~ED~P~kva~~~~~f~~Rn 327 (343) T KOG2564 283 LDK--------DLTIGQMQGKFQLQVLPLCGHFVHEDSPHKVAECLCVFWIRN 327 (343) T ss_pred CCC--------CEEEEEECCCEEEEEECCCCCEECCCCCCHHHHHHHHHHHHH T ss_conf 676--------315665325245663001271231478604799887887550 No 60 >PRK05371 x-prolyl-dipeptidyl aminopeptidase; Provisional Probab=99.36 E-value=5.4e-11 Score=86.22 Aligned_cols=159 Identities=18% Similarity=0.259 Sum_probs=116.5 Q ss_pred HHHHCCCEEEEEEECCCCCCCCCCCCC-CHHHHHHHHHHHHHHHH---CC------------CCCCEEEEEECHHHHHHH Q ss_conf 999739849998531558876778521-02389999989999862---35------------467405888720368787 Q gi|254781093|r 52 LFQQRGFVSLRFNFRGIGRSEGEFDYG-DGELSDAAAALDWVQSL---NP------------ESKSCWIAGYSFGAWISM 115 (225) Q Consensus 52 ~l~~~G~~vl~fd~RG~G~S~G~~~~g-~~E~~D~~aa~~~l~~~---~~------------~~~~i~l~G~S~Gg~val 115 (225) .|..+||+|+-.+--|.++|+|-.+.| ..|.+-..||+|||..+ +. ...++++.|-||-|.++. T Consensus 277 Yfl~RGfAvVy~~giGT~~SdG~~t~G~~~Et~s~kAVIdWLnGr~~AfT~r~~~~~vkA~WSnGkVaMtG~SYlGTL~~ 356 (769) T PRK05371 277 YFLARGFAVVYVSGIGTRGSDGCPTTGDPQEIEAMKAVIDWLNGRATAFTSRTRGHQVKADWSNGKVAMTGKSYLGTLPT 356 (769) T ss_pred EEECCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCEEEEECCCCCCCHH T ss_conf 34404228998336776778887666998897788999998758864235678984688652378300252142441044 Q ss_pred HHHHCCC-CHHEEEECCCCCCCCCC------------------------------------------------------- Q ss_conf 6450386-21330122757443332------------------------------------------------------- Q gi|254781093|r 116 QLLMRRP-EINGFISVAPQPKSYDF------------------------------------------------------- 139 (225) Q Consensus 116 ~~a~~~p-~i~~~v~isp~~~~~~~------------------------------------------------------- 139 (225) .+|.... .++.+|.+++-.+.|++ T Consensus 357 avATTGV~GLetIIpeAaISSWYdYYR~nGlV~aPgGyqGED~DvLa~~~~Sr~~~~~d~~~~~~~~~~~l~~~~~~~DR 436 (769) T PRK05371 357 AVATTGVEGLKTIIPEAAISSWYDYYRENGLVRAPGGYQGEDLDVLAELTLSRNLLAGDYLRHNEACEKLLAELTAAQDR 436 (769) T ss_pred HHHHCCCCCCEEEEECCCCCCHHHHHHCCCCEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCC T ss_conf 67624877614664321012489885438815078886664226888876134456431223307899999998750455 Q ss_pred ---------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC--CHHHHHHH Q ss_conf ---------------2234334773776168655028999999999986077996079997879847558--58999999 Q gi|254781093|r 140 ---------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG--KVDELINE 202 (225) Q Consensus 140 ---------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~--~~~~l~~~ 202 (225) .....+..+++++||-+|--|-+.++..|+++++.. +++.. ..+-..+|.+.. +..+..+. T Consensus 437 ~TGdYn~FW~~RNYl~~a~~iKa~vl~vHGLnDwNVKp~~v~~~~~AL~~~-~v~~k-l~LHQg~H~y~~~~~s~df~d~ 514 (769) T PRK05371 437 KTGDYNDFWDDRNYLKNADNIKADVLVVHGLNDWNVKPKQVYQWWDALPRN-GVPKH-LFLHQGQHVYPNNWQSIDFTDS 514 (769) T ss_pred CCCCCCHHHHHCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHC-CCCCE-EEEECCCCCCCCCCCCCHHHHH T ss_conf 667743123102552320033302899743664576888999999999858-99716-9981577689774332048999 Q ss_pred HHHHHHHHCC Q ss_conf 9999997312 Q gi|254781093|r 203 CAHYLDNSLD 212 (225) Q Consensus 203 i~~fl~~~L~ 212 (225) +-.|+..+|- T Consensus 515 mN~Wfs~~L~ 524 (769) T PRK05371 515 MNAWLTHKLL 524 (769) T ss_pred HHHHHHHHHH T ss_conf 9999988875 No 61 >pfam06500 DUF1100 Alpha/beta hydrolase of unknown function (DUF1100). This family consists of several hypothetical bacterial proteins of unknown function. Members of this family have an alpha/beta hydrolase fold. Probab=99.34 E-value=8.3e-11 Score=85.07 Aligned_cols=190 Identities=17% Similarity=0.313 Sum_probs=131.0 Q ss_pred EEEECCCCCEEEEEEECCC-CCCCCEEEEECCCCCCCCCCCCH--HHHHHHH-HHHHCCCEEEEEEECCCCCCCCCCCCC Q ss_conf 0896589961879996688-87787999957899888899899--9999999-999739849998531558876778521 Q gi|254781093|r 3 EVVFNGPSGRLEGRYQPST-NPNAPIALILHPHPRFGGTMNDN--IVYQLFY-LFQQRGFVSLRFNFRGIGRSEGEFDYG 78 (225) Q Consensus 3 ~v~i~g~~G~l~~~~~~~~-~~~~~~vv~~Hp~p~~GG~~~~~--~~~~la~-~l~~~G~~vl~fd~RG~G~S~G~~~~g 78 (225) ++.||-+.|.+.|.+|.|. ..+.|+|++| |..|+- .+.++.+ .|+-+|++.|..|--++|-|.- |. T Consensus 167 ~l~f~~~g~~i~gfLHLP~~~~P~PtViv~-------ggLDsLQ~D~~rLF~dyLaP~giAMLTlDMPSvG~Ss~-w~-- 236 (411) T pfam06500 167 QLEFPYQKGKITGFLHLPNTDKPFPVVLVS-------AGLDSLQTDMWRLFRDYLAPRDIAMLTIDMPSVGASSK-WP-- 236 (411) T ss_pred EEEEEECCCEEEEEEECCCCCCCCCEEEEE-------CCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCC-CC-- T ss_conf 888762697378998568999988869995-------77257788899999985374573379704787775464-55-- Q ss_pred CHHHHHH----HHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEECCCCCCCC--------------- Q ss_conf 0238999----99899998623-54674058887203687876450386-213301227574433--------------- Q gi|254781093|r 79 DGELSDA----AAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRP-EINGFISVAPQPKSY--------------- 137 (225) Q Consensus 79 ~~E~~D~----~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~isp~~~~~--------------- 137 (225) -.+|. ++++.++..-- .|-.+|.++||-|||.+|..++--.| .++++++++|++... T Consensus 237 --LtqDtS~LHQaVL~~L~~vPWVDh~Rv~~~GfRFG~NvAvRLayLEp~rlKav~~lG~~VH~ll~~~~~~~~~P~Myl 314 (411) T pfam06500 237 --LTEDSSCLHQAVLNALADVPWVDHHRVGLFGFRFGGNAMVRLAFLESEKVKACVCLGAPVHDLFTSPKKLQKMPKMYL 314 (411) T ss_pred --CCCCHHHHHHHHHHHCCCCCCCCCCEEEEEEEECCHHHHHHHHHHCCCCEEEEEEECHHHHHHHCCHHHHHHCCHHHH T ss_conf --555147999999985579986324245787642023699999850601013899837388887358887744768999 Q ss_pred ------------------------C---CCCC--CCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCC Q ss_conf ------------------------3---2223--4334773776168655028999999999986077996079997879 Q gi|254781093|r 138 ------------------------D---FSFL--APCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDA 188 (225) Q Consensus 138 ------------------------~---~~~l--~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga 188 (225) + ...| ..|++|.|-+.=++|.++|.++.+-++. ....|. ...+| . T Consensus 315 DvlASRLG~~~~d~~~L~~el~~~SLK~QGlL~gRr~~vPmLa~~~~~Dp~sP~~d~~Lia~--sS~~Gk---~~~i~-~ 388 (411) T pfam06500 315 DVLASRLGKSDVDDESLSGQMNAWSLKVQGFLSGRRTKTPILAGSLEGDPVSPYSDNQLVAS--FSADGK---AKKIP-S 388 (411) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHCCHHHHCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHH--HCCCCC---EEECC-C T ss_conf 99998737643788999987523104552134677589851541127999897678899998--556884---56568-9 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 84755858999999999999731 Q gi|254781093|r 189 NHFFIGKVDELINECAHYLDNSL 211 (225) Q Consensus 189 ~H~f~~~~~~l~~~i~~fl~~~L 211 (225) .-.|.+.+ +..+.+.+||+++| T Consensus 389 ~pv~~~~~-~aL~~~~~WL~~~l 410 (411) T pfam06500 389 KTIYQGYE-QSLDLAIKWLEDEL 410 (411) T ss_pred CCHHHHHH-HHHHHHHHHHHHHC T ss_conf 74255799-99999999998533 No 62 >PRK05855 short chain dehydrogenase; Validated Probab=99.34 E-value=1.8e-10 Score=83.11 Aligned_cols=182 Identities=23% Similarity=0.324 Sum_probs=119.8 Q ss_pred CCC-EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHH-HHHHHHHHCCCEEEEEEECCCCCCCCCCC-CCC-H--HH Q ss_conf 996-187999668887787999957899888899899999-99999997398499985315588767785-210-2--38 Q gi|254781093|r 9 PSG-RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVY-QLFYLFQQRGFVSLRFNFRGIGRSEGEFD-YGD-G--EL 82 (225) Q Consensus 9 ~~G-~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~-~la~~l~~~G~~vl~fd~RG~G~S~G~~~-~g~-~--E~ 82 (225) .+| +|.+ | .-+.+.+|++|+.|+|| ||..+. .++..|++. |.|++||-||.|+|+-.-. .+. - -. T Consensus 10 ~~gv~lav-~-~~G~~~~ptvvlvHGyP------D~~~vW~~v~~~La~~-f~vv~yDvRGaG~S~~P~~~~~yrl~~l~ 80 (582) T PRK05855 10 SGGVRLAV-Y-EWGDPDRPTVVLVHGYP------DSHEVWDGVAPLLAER-FRVVAYDVRGAGLSSAPKRTSAYTLARLA 80 (582) T ss_pred CCCEEEEE-E-ECCCCCCCEEEEEECCC------CCHHHHHHHHHHHCCC-EEEEEECCCCCCCCCCCCCCCCCCHHHHH T ss_conf 39979999-8-52789998489980688------8366689999985046-05995557788878999863324099999 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC----CHHEEEECCCCCCC---------------------- Q ss_conf 9999989999862354674058887203687876450386----21330122757443---------------------- Q gi|254781093|r 83 SDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP----EINGFISVAPQPKS---------------------- 136 (225) Q Consensus 83 ~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p----~i~~~v~isp~~~~---------------------- 136 (225) +|+.||+|-+ .++ +++.|+||-+|+..+..+.. +| +|..+.++|.|... T Consensus 81 ~D~~AV~dav---~p~-rpvHlv~HDWGSiq~We~vt-~~~~~~RiaS~tSiSGP~ldh~g~~~r~~~~~~~p~~~~~~~ 155 (582) T PRK05855 81 DDFAAVIDAL---SPG-RPVHVLAHDWGSIQGWEYVT-RPRLAGRIASFTSVSGPSLDHVGFWLRSRLRRPTPRSLARLL 155 (582) T ss_pred HHHHHHHHHH---CCC-CCEEEEECCCCCHHHHHHHC-CCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 8999999864---899-83488752754332322324-854245235676057887688999998751279847899999 Q ss_pred ------CC------------------------------------------------------CCC----CC----CCCCC Q ss_conf ------33------------------------------------------------------222----34----33477 Q gi|254781093|r 137 ------YD------------------------------------------------------FSF----LA----PCPSS 148 (225) Q Consensus 137 ------~~------------------------------------------------------~~~----l~----~~~~p 148 (225) |. -.+ +. ...+| T Consensus 156 ~Q~~~s~Yi~~f~~P~lpe~~~r~~~~~~~~~~l~~~e~~~~~~~~~~~~~~d~~~g~~lYRAN~~~~~~~pr~r~~~vP 235 (582) T PRK05855 156 GQLARSWYVYLFHLPVLPELLWRLGLGRAWPRLLRRVEKTPVDQIPSPTQASDGAHGVKLYRANFIRSLLAPRERYTHVP 235 (582) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 98998899999857442288875231456799986336888777778313567773365411457643478888778887 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHHHH Q ss_conf 3776168655028999999999986077996079997879847558-5899999999999973 Q gi|254781093|r 149 GLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYLDNS 210 (225) Q Consensus 149 ~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl~~~ 210 (225) +++|..++|.+|.+...+.+.... .......++ ++|.... +-+.+.+.+.+|++.. T Consensus 236 Vq~iv~~~D~~v~p~l~~~~~~w~-----~~l~rr~i~-a~HW~p~s~P~~iA~~~~efv~~~ 292 (582) T PRK05855 236 VQLIVPTGDPYVRPALYDDLARWV-----PRLWRREIK-AGHWLPMSHPQVLAAAVAELIDAV 292 (582) T ss_pred EEEEEECCCCCCCHHHHCCHHHCC-----CCCEEEEEC-CCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 799751788776977744144417-----860898603-785376679899999999999863 No 63 >KOG1515 consensus Probab=99.32 E-value=8.8e-10 Score=78.92 Aligned_cols=194 Identities=22% Similarity=0.310 Sum_probs=131.3 Q ss_pred CCCEEEEEEECCC-C---CCCCEEEEECCCCCCCCCCCCHHHHHHHHHH-HHCCCEEEEEEECCCCCCCCCCCCCCHHHH Q ss_conf 9961879996688-8---7787999957899888899899999999999-973984999853155887677852102389 Q gi|254781093|r 9 PSGRLEGRYQPST-N---PNAPIALILHPHPRFGGTMNDNIVYQLFYLF-QQRGFVSLRFNFRGIGRSEGEFDYGDGELS 83 (225) Q Consensus 9 ~~G~l~~~~~~~~-~---~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l-~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~ 83 (225) +.+.|..++..|. . ...|++|+.|+-+.-=|+.+.+....++..+ .+.+..|+.+|||=.- |=.| ....+ T Consensus 70 ~~~~l~vRly~P~~~~~~~~~p~lvyfHGGGf~~~S~~~~~y~~~~~~~a~~~~~vvvSVdYRLAP--Eh~~---Pa~y~ 144 (336) T KOG1515 70 PFTNLPVRLYRPTSSSSETKLPVLVYFHGGGFCLGSANSPAYDSFCTRLAAELNCVVVSVDYRLAP--EHPF---PAAYD 144 (336) T ss_pred CCCCEEEEEECCCCCCCCCCCEEEEEEECCCCEECCCCCCHHHHHHHHHHHHCCEEEEECCCCCCC--CCCC---CCCCH T ss_conf 899857899727777755676089999688027687887336789999997759299933764289--9989---85414 Q ss_pred HHHHHHHHHHHH-----CCCCCCEEEEEECHHHHHHHHHHHC-------CCCHHEEEECCCCCCCCCCC----------- Q ss_conf 999989999862-----3546740588872036878764503-------86213301227574433322----------- Q gi|254781093|r 84 DAAAALDWVQSL-----NPESKSCWIAGYSFGAWISMQLLMR-------RPEINGFISVAPQPKSYDFS----------- 140 (225) Q Consensus 84 D~~aa~~~l~~~-----~~~~~~i~l~G~S~Gg~val~~a~~-------~p~i~~~v~isp~~~~~~~~----------- 140 (225) |..+|+.|+.++ +.|.++|+|+|=|-||.+|..++.+ .+.+++.|++.|.....++. T Consensus 145 D~~~Al~w~~~~~~~~~~~D~~rv~l~GDSaGGNia~~va~r~~~~~~~~~ki~g~ili~P~~~~~~~~~~e~~~~~~~~ 224 (336) T KOG1515 145 DGWAALKWVLKNSWLKLGADPSRVFLAGDSAGGNIAHVVAQRAADEKLSKPKIKGQILIYPFFQGTDRTESEKQQNLNGS 224 (336) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHCCC T ss_conf 79999999997579985899661899835864899999999986435777655799998045378878778888762389 Q ss_pred --------------------------CCCCC------------CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEE Q ss_conf --------------------------23433------------4773776168655028999999999986077996079 Q gi|254781093|r 141 --------------------------FLAPC------------PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITH 182 (225) Q Consensus 141 --------------------------~l~~~------------~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~ 182 (225) ...+. -.++|++.++.|... +....++++++. .|+.+++ T Consensus 225 ~~~~~~~~~~~w~~~lP~~~~~~~~p~~np~~~~~~~d~~~~~lp~tlv~~ag~D~L~--D~~~~Y~~~Lkk-~Gv~v~~ 301 (336) T KOG1515 225 PELARPKIDKWWRLLLPNGKTDLDHPFINPVGNSLAKDLSGLGLPPTLVVVAGYDVLR--DEGLAYAEKLKK-AGVEVTL 301 (336) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCHHHH--HHHHHHHHHHHH-CCCEEEE T ss_conf 2031788999999769777787678620465565556732257996699995750404--626999999987-6981799 Q ss_pred EEECCCCCCCCC------CHHHHHHHHHHHHHHH Q ss_conf 997879847558------5899999999999973 Q gi|254781093|r 183 KVIPDANHFFIG------KVDELINECAHYLDNS 210 (225) Q Consensus 183 ~~i~ga~H~f~~------~~~~l~~~i~~fl~~~ 210 (225) +.++++.|.|.- ...++.+.+..|+++. T Consensus 302 ~~~e~~~H~~~~~~~~~~~a~~~~~~i~~fi~~~ 335 (336) T KOG1515 302 IHYEDGFHGFHILDPSSKEAHALMDAIVEFIKSN 335 (336) T ss_pred EEECCCEEEEEECCCCHHHHHHHHHHHHHHHHHC T ss_conf 9977970589964886046899999999998623 No 64 >pfam03583 LIP Secretory lipase. These lipases are expressed and secreted during the infection cycle of these pathogens. In particular, C. albicans has a large number of different lipases, possibly reflecting broad lipolytic activity, which may contribute to the persistence and virulence of C. albicans in human tissue. Probab=99.31 E-value=1.3e-10 Score=83.83 Aligned_cols=164 Identities=16% Similarity=0.149 Sum_probs=103.8 Q ss_pred HHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHH----CC-CCCCEEEEEECHHHHHHHHHHHCC- Q ss_conf 999999973984999853155887677852102389999989999862----35-467405888720368787645038- Q gi|254781093|r 48 QLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSL----NP-ESKSCWIAGYSFGAWISMQLLMRR- 121 (225) Q Consensus 48 ~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~----~~-~~~~i~l~G~S~Gg~val~~a~~~- 121 (225) .+...|.++||.|+..||-|-|. .|-.+..+-..+..+++-+++. +. ...++.++|||.||..+..++... T Consensus 17 ~~i~~~L~~Gy~Vv~pDY~Glgt---~y~~g~~~g~avLD~iRAa~~~~~~~gl~~~~~v~l~GySqGg~At~~AAela~ 93 (291) T pfam03583 17 LLIVPLLDQGWYVVVPDYEGPKS---TFTVGRQSGYAVLDSIRAALKSGDFSGITSDAKVALWGYSGGGLASGWAAELQP 93 (291) T ss_pred HHHHHHHHCCCEEEECCCCCCCC---CCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHH T ss_conf 99999998899999689899998---777743659999999999985455458898885799876767799999998716 Q ss_pred ---CCH----HEEEECCCCCCCCC-------------------------------------------------------- Q ss_conf ---621----33012275744333-------------------------------------------------------- Q gi|254781093|r 122 ---PEI----NGFISVAPQPKSYD-------------------------------------------------------- 138 (225) Q Consensus 122 ---p~i----~~~v~isp~~~~~~-------------------------------------------------------- 138 (225) |++ .|.+..+|+.+... T Consensus 94 ~YAPEL~~ni~G~a~gg~padl~~~~~~~dg~~~~g~~~~~l~Gl~~~yPel~~~l~~~l~~~g~~~~~~~~~~C~~~~~ 173 (291) T pfam03583 94 SYAPELKKNLIGAALGGFVANITATAEAVDGTVFAGLVPNALNGLANEYPDFKKILYEELNDAGREALKQGTEMCLGDAV 173 (291) T ss_pred HHCCCCCCCCEEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHCHHHHHHHHHHHHHHHHHHH T ss_conf 53800004705787147755889999872686278899999999988586567899987479899999999745278899 Q ss_pred ---------------C----CCCC--------------------CCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCC-C Q ss_conf ---------------2----2234--------------------3347737761686550289999999999860779-9 Q gi|254781093|r 139 ---------------F----SFLA--------------------PCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKG-I 178 (225) Q Consensus 139 ---------------~----~~l~--------------------~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~-~ 178 (225) + +.+. ....|+++.||..|+++|++..++++++.... + . T Consensus 174 ~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~l~~~~~~~~~~~~~P~~Pv~i~~g~~D~vvP~~~~~~l~~~~Ca~-G~a 252 (291) T pfam03583 174 IRYAGRQYFTGDNRDFEKGWDLLEDPVINKTLEDNGLLVYDKSAVPQIPVFVYHGTHDEIVPIADIDALYKNWCAW-GIA 252 (291) T ss_pred HHHHHHHHCCCCCCCCCCCHHHCCCHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHC-CCC T ss_conf 9864554314654235661111038789999998622146888799999898866888633628999999999976-998 Q ss_pred CEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 607999787984755858999999999999731221001 Q gi|254781093|r 179 SITHKVIPDANHFFIGKVDELINECAHYLDNSLDEKFTL 217 (225) Q Consensus 179 ~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~~~~~~ 217 (225) +++++.+++.+|+-.- ..-......||+++++-++.. T Consensus 253 ~V~~~~~~~~~H~~~~--~~~~~~a~~wl~dRf~G~Pa~ 289 (291) T pfam03583 253 SVEFAEDLSNGHFTEA--LVGAPAALTWLDDRFSGKPAV 289 (291) T ss_pred CEEEEECCCCCHHHHH--HCCCHHHHHHHHHHHCCCCCC T ss_conf 3799868998546577--536578999999997799999 No 65 >KOG2382 consensus Probab=99.31 E-value=2.2e-10 Score=82.51 Aligned_cols=183 Identities=19% Similarity=0.253 Sum_probs=119.8 Q ss_pred ECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHH-CCCEEEEEEECCCCCCCCCCCCCCH-HHHHHHHHHHHHHHH Q ss_conf 66888778799995789988889989999999999997-3984999853155887677852102-389999989999862 Q gi|254781093|r 18 QPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQ-RGFVSLRFNFRGIGRSEGEFDYGDG-ELSDAAAALDWVQSL 95 (225) Q Consensus 18 ~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~-~G~~vl~fd~RG~G~S~G~~~~g~~-E~~D~~aa~~~l~~~ 95 (225) ...+...+|.++++|+ +=|+..|. ..+++.|.+ .|-.++..|-|.+|.|.-.-..... -.+|+...+++.+.. T Consensus 45 ~~~~~~~~Pp~i~lHG---l~GS~~Nw--~sv~k~Ls~~l~~~v~~vd~RnHG~Sp~~~~h~~~~ma~dv~~Fi~~v~~~ 119 (315) T KOG2382 45 SSENLERAPPAIILHG---LLGSKENW--RSVAKNLSRKLGRDVYAVDVRNHGSSPKITVHNYEAMAEDVKLFIDGVGGS 119 (315) T ss_pred CCCCCCCCCCEEEECC---CCCCCCCH--HHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 3666677996687122---22678787--999998323126736887422478886635568888898999999970222 Q ss_pred CCCCCCEEEEEECHHH-HHHHHHHHCCCC-HHEEEE--CCCCCCCC---------------C------------------ Q ss_conf 3546740588872036-878764503862-133012--27574433---------------3------------------ Q gi|254781093|r 96 NPESKSCWIAGYSFGA-WISMQLLMRRPE-INGFIS--VAPQPKSY---------------D------------------ 138 (225) Q Consensus 96 ~~~~~~i~l~G~S~Gg-~val~~a~~~p~-i~~~v~--isp~~~~~---------------~------------------ 138 (225) . ...++.+.|||||| .+++..+...|. +..+|. ++|..... + T Consensus 120 ~-~~~~~~l~GHsmGG~~~~m~~t~~~p~~~~rliv~D~sP~~~~~~~~e~~e~i~~m~~~d~~~~~~~~rke~~~~l~~ 198 (315) T KOG2382 120 T-RLDPVVLLGHSMGGVKVAMAETLKKPDLIERLIVEDISPGGVGRSYGEYRELIKAMIQLDLSIGVSRGRKEALKSLIE 198 (315) T ss_pred C-CCCCCEECCCCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH T ss_conf 3-337832403576369999999875675211257883487667866245999999997035533455437799999998 Q ss_pred ---------------------------------------------CCCC--CCCCCCEEEEECCCCCCCCHHHHHHHHHH Q ss_conf ---------------------------------------------2223--43347737761686550289999999999 Q gi|254781093|r 139 ---------------------------------------------FSFL--APCPSSGLIINGSNDTVATTSDVKDLVNK 171 (225) Q Consensus 139 ---------------------------------------------~~~l--~~~~~p~LiIhG~~D~~vp~~~~~~~~~~ 171 (225) ...+ .+...|+|+|+|.++.++|.++...+.+- T Consensus 199 ~~~d~~~~~fi~~nl~~~~~~~s~~w~~nl~~i~~~~~~~~~~s~~~~l~~~~~~~pvlfi~g~~S~fv~~~~~~~~~~~ 278 (315) T KOG2382 199 VGFDNLVRQFILTNLKKSPSDGSFLWRVNLDSIASLLDEYEILSYWADLEDGPYTGPVLFIKGLQSKFVPDEHYPRMEKI 278 (315) T ss_pred HHCCHHHHHHHHHHCCCCCCCCCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHH T ss_conf 74105899999975476777870688757899999999877505510002566566436886377887582677999986 Q ss_pred HHHCCCCCEEEEEECCCCCCCC-CCHHHHHHHHHHHHHHHC Q ss_conf 8607799607999787984755-858999999999999731 Q gi|254781093|r 172 LMNQKGISITHKVIPDANHFFI-GKVDELINECAHYLDNSL 211 (225) Q Consensus 172 l~~~~~~~~~~~~i~ga~H~f~-~~~~~l~~~i~~fl~~~L 211 (225) ++ .++++.+++++|.-+ ++-+++.+.+.+|+.+.. T Consensus 279 fp-----~~e~~~ld~aGHwVh~E~P~~~~~~i~~Fl~~~~ 314 (315) T KOG2382 279 FP-----NVEVHELDEAGHWVHLEKPEEFIESISEFLEEPE 314 (315) T ss_pred CC-----CHHEEECCCCCCEEECCCHHHHHHHHHHHHCCCC T ss_conf 24-----1114650667843662797999999999741368 No 66 >pfam00975 Thioesterase Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates. There are also modules within the peptide synthetases that also share this similarity. With respect to antibiotic production, thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Thioesterases (non-integrated) have molecular masses of 25-29 kDa. Probab=99.26 E-value=5.1e-10 Score=80.31 Aligned_cols=165 Identities=16% Similarity=0.271 Sum_probs=98.9 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHH-HHHHHHHHHCCCCCCEEEEE Q ss_conf 99957899888899899999999999973984999853155887677852102389999-98999986235467405888 Q gi|254781093|r 28 ALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAA-AALDWVQSLNPESKSCWIAG 106 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~-aa~~~l~~~~~~~~~i~l~G 106 (225) ++++| .-||+. ..+..+++.|.. ...|+.+...|.+.++=... ..+++. ..++-+++..++ .+..|+| T Consensus 3 Lfc~p---~aGG~~--~~y~~la~~l~~-~~~v~~l~~pG~~~~e~~~~----~~~~~a~~~~~~i~~~~~~-gPy~L~G 71 (224) T pfam00975 3 LFCFP---PAGGSA--SYFRNLARRLPG-TAEFNAVQLPGRERGEPPLT----SIEELAEEYAEALRRIQPE-GPYALFG 71 (224) T ss_pred EEEEC---CCCCCH--HHHHHHHHHCCC-CCEEEEEECCCCCCCCCCCC----CHHHHHHHHHHHHHHHCCC-CCEEEEE T ss_conf 99977---998788--999999985899-97799973898899999999----9999999999999986799-9979996 Q ss_pred ECHHHHHHHHHHHC----CCCHHEEEECCCCCCCCC---C---------------------------------------- Q ss_conf 72036878764503----862133012275744333---2---------------------------------------- Q gi|254781093|r 107 YSFGAWISMQLLMR----RPEINGFISVAPQPKSYD---F---------------------------------------- 139 (225) Q Consensus 107 ~S~Gg~val~~a~~----~p~i~~~v~isp~~~~~~---~---------------------------------------- 139 (225) |||||.+|..+|.+ ...+..++++..++.... . T Consensus 72 hS~Gg~lA~evA~~L~~~g~~v~~l~l~d~~~P~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~d 151 (224) T pfam00975 72 HSMGGLLAFEVARRLERRGEEPAGLILSDAYAPHFGRREASHLLGDDELLAELRRIGGTPEALLEDEELLSLLLPALRAD 151 (224) T ss_pred ECCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHHHHH T ss_conf 44456999999999997799930899978899988664333334679999999984599734407999999999999999 Q ss_pred --------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHH Q ss_conf --------2234334773776168655028999999999986077996079997879847558-5899999999999 Q gi|254781093|r 140 --------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYL 207 (225) Q Consensus 140 --------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl 207 (225) ......+.|.+++.|.+|+.++.+...+..+... ...++.+.+|| ||||.. +.+++.+.|.+|| T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~W~~~~---~~~~~~~~vpG-~Hf~~~~~~~~va~~i~~~l 224 (224) T pfam00975 152 YRALENYPIHPIAAPSDATLFYGADDPLNTLDWLAAWWREYT---PGEFDTHVLPG-DHFYLNEHAEEVLETIHDWL 224 (224) T ss_pred HHHHHHCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHC---CCCCEEEEECC-CCCCCCCCHHHHHHHHHHHC T ss_conf 999984788987789998999645899888547885999738---89947999768-98822669999999999659 No 67 >PRK00175 metX homoserine O-acetyltransferase; Provisional Probab=99.25 E-value=1.4e-09 Score=77.71 Aligned_cols=189 Identities=17% Similarity=0.219 Sum_probs=121.0 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCH--------------HHHHHHHHHHHCCCEEEEEEECCC-CCCCCCCC----CCC-- Q ss_conf 887787999957899888899899--------------999999999973984999853155-88767785----210-- Q gi|254781093|r 21 TNPNAPIALILHPHPRFGGTMNDN--------------IVYQLFYLFQQRGFVSLRFNFRGI-GRSEGEFD----YGD-- 79 (225) Q Consensus 21 ~~~~~~~vv~~Hp~p~~GG~~~~~--------------~~~~la~~l~~~G~~vl~fd~RG~-G~S~G~~~----~g~-- 79 (225) ......++|+|| .+.|+.+-- -+.---+++--.-|-|+++|.-|. -+|+|.-+ .|. T Consensus 43 n~~~dNaILv~H---altG~~~~~~~~~~~~~~~GWW~~lIGpgkaiDt~kyfVI~~N~lG~c~GSTgP~s~~p~tg~~y 119 (379) T PRK00175 43 NADRSNAVLICH---ALTGDHHVAGPHPDDDGKPGWWDNMVGPGKPIDTDRFFVICVNVLGGCKGSTGPSSINPATGKPY 119 (379) T ss_pred CCCCCCEEEEEC---CCCCCHHHCCCCCCCCCCCCCHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 788987899947---87874320467766778877277635899764756469994576788878899888898889956 Q ss_pred H------HHHHHHHHHHHHHHHCCCCCCE-EEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC--------------- Q ss_conf 2------3899999899998623546740-588872036878764503862-1330122757443--------------- Q gi|254781093|r 80 G------ELSDAAAALDWVQSLNPESKSC-WIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKS--------------- 136 (225) Q Consensus 80 ~------E~~D~~aa~~~l~~~~~~~~~i-~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~--------------- 136 (225) + -+.|...+-.-+.+. ...+++ .++|.||||..|+.-|..+|+ ++.+|.++..... T Consensus 120 g~~FP~iti~D~V~aq~~Ll~~-LGI~~l~aViG~SmGG~qAlewA~~yPd~v~~ii~Ia~~ar~s~~~ia~~~~~r~aI 198 (379) T PRK00175 120 GSDFPVITIRDWVRAQARLLDA-LGITRLAAVVGGSMGGMQALQWAIDYPDRVRHALVIASSARLSAQNIAFNEVARQAI 198 (379) T ss_pred CCCCCCCCHHHHHHHHHHHHHH-CCCCEEEEEECCCHHHHHHHHHHHHCCHHHHHEEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 5789945299999999999997-398979889755808999999998495753303445202025788999999999999 Q ss_pred ----------C-C----------------------------------------------C-------------------- Q ss_conf ----------3-3----------------------------------------------2-------------------- Q gi|254781093|r 137 ----------Y-D----------------------------------------------F-------------------- 139 (225) Q Consensus 137 ----------~-~----------------------------------------------~-------------------- 139 (225) | + | T Consensus 199 ~~Dp~w~~G~Y~~~~~~P~~GL~~AR~~a~ltYrs~~~~~~rFgr~~~~~~~~~~~~~~f~vEsYL~~qg~kf~~rFDaN 278 (379) T PRK00175 199 LADPDFHGGDYYEHGVYPKRGLRVARMIGHITYLSDDELDEKFGRDLQRDALEFGFDVEFQVESYLRYQGDKFVERFDAN 278 (379) T ss_pred HHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCCEEHHHHHHHHHHHHHCCCCHH T ss_conf 85925556875546889606899999999977348999998857343565456664444005579877656554067989 Q ss_pred -----------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC-CCCC-CCCC Q ss_conf -----------------------22343347737761686550289999999999860779960799978-7984-7558 Q gi|254781093|r 140 -----------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIP-DANH-FFIG 194 (225) Q Consensus 140 -----------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~-ga~H-~f~~ 194 (225) ..|.....++|+|..+.|.+.|+++.+++++.++.+ +.++++++|+ ..+| .|-- T Consensus 279 syl~l~~a~d~~Di~~~~~gdl~~AL~~I~a~~Lvi~i~sD~lFPp~~~~~~a~~l~~~-~~~~~~~eI~S~~GHDaFLi 357 (379) T PRK00175 279 SYLLLTRALDYFDPARGHGGDLAAALAGIKARFLVVSFTTDWLFPPARSREIVDALLAA-GADVSYAEIDSPHGHDAFLL 357 (379) T ss_pred HHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCC T ss_conf 99999999985485444587899997308998899960755072899999999999866-99769999689997450155 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 58999999999999731221 Q gi|254781093|r 195 KVDELINECAHYLDNSLDEK 214 (225) Q Consensus 195 ~~~~l~~~i~~fl~~~L~~~ 214 (225) ..+++...+.+||++.-.++ T Consensus 358 e~~~~~~~i~~FL~~~~~~~ 377 (379) T PRK00175 358 DDPRYGPLVRAFLERAARER 377 (379) T ss_pred CHHHHHHHHHHHHHHHHHHC T ss_conf 88999999999999999863 No 68 >pfam12048 DUF3530 Protein of unknown function (DUF3530). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 272 to 336 amino acids in length. These proteins are distantly related to alpa/beta hydrolases so they may act as enzymes. Probab=99.23 E-value=3.3e-09 Score=75.42 Aligned_cols=201 Identities=15% Similarity=0.184 Sum_probs=141.0 Q ss_pred EEECCCCCEEEEEEECCCC-CCCCEEEEECCCCCCCCCCCCH-HHHHHHHHHHHCCCEEEEEEECCC-CC---------- Q ss_conf 8965899618799966888-7787999957899888899899-999999999973984999853155-88---------- Q gi|254781093|r 4 VVFNGPSGRLEGRYQPSTN-PNAPIALILHPHPRFGGTMNDN-IVYQLFYLFQQRGFVSLRFNFRGI-GR---------- 70 (225) Q Consensus 4 v~i~g~~G~l~~~~~~~~~-~~~~~vv~~Hp~p~~GG~~~~~-~~~~la~~l~~~G~~vl~fd~RG~-G~---------- 70 (225) +.|+..+.+..++|.+..+ ...+.+||+| ..|.+.|.+ .+..|-+.|.+.||++|....--- .. T Consensus 58 ~~L~~g~~~flaL~~p~~~~~~~GavIil~---~~g~~adwp~~i~~LR~~L~~~GW~TLSit~P~~~~~~~p~~~~~~~ 134 (294) T pfam12048 58 KWLDAGDEKFLALWRPANTGKPQGAVIILP---DTGETADWPGLIAPLRRELPDAGWATLSLTPPDPPSAAPPNRPTEAE 134 (294) T ss_pred HHHHCCCEEEEEEEECCCCCCCCCEEEEEC---CCCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCC T ss_conf 033159845666761301788750599955---89989984768999997565038437874379876445776667764 Q ss_pred ---CCC--CCCC-CC-------HH--HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC--CHHEEEECCCC Q ss_conf ---767--7852-10-------23--89999989999862354674058887203687876450386--21330122757 Q gi|254781093|r 71 ---SEG--EFDY-GD-------GE--LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP--EINGFISVAPQ 133 (225) Q Consensus 71 ---S~G--~~~~-g~-------~E--~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p--~i~~~v~isp~ 133 (225) +.| ..+. .. .| .+-+.++++++.++. ..+++|+|++.|++.++.+.+..+ .+.++|.+++. T Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~l~ari~a~~~~~~~~~--~~~ivll~~G~gA~~a~~~l~~~~~~~~daLV~I~~~ 212 (294) T pfam12048 135 EVTKAGAKQLSSPSNEELLEAQEERLFARLEALIAFAQQYG--GKNIVLLGQGTGAGWAIRYLAQKPSPAPDALVLISAY 212 (294) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC--CCEEEEEEECCCHHHHHHHHHHCCCCCCCEEEEEECC T ss_conf 45676666667743445525679999999999999997578--8448999827718999999982898779869998178 Q ss_pred CCCCCC-----CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 443332-----223433477377616865502899999999998607799607999787984755858999999999999 Q gi|254781093|r 134 PKSYDF-----SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLD 208 (225) Q Consensus 134 ~~~~~~-----~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~ 208 (225) ...... ..++....|+|=|++.+|.....+.........+..+.....-+.+++..+.+...+ ++.+.|..||+ T Consensus 213 ~p~~~~~~~L~~~la~l~iPvLDiy~~~~~~~~~~~a~~R~~~a~r~~~~~Y~Q~~L~~~~~~~~~~~-~l~~rI~Gwl~ 291 (294) T pfam12048 213 SPQSDLNKALAEQLAQLALPVLDIYSADGNPASLQAAKQRKQAAKRAKKLDYRQIQLPLLPGNPSAWQ-ELLRRIRGWLK 291 (294) T ss_pred CCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCHHHHH-HHHHHHHHHHH T ss_conf 98643255289887406887687566889989998699999999754899857866446889837899-99999999886 Q ss_pred HH Q ss_conf 73 Q gi|254781093|r 209 NS 210 (225) Q Consensus 209 ~~ 210 (225) +. T Consensus 292 ~~ 293 (294) T pfam12048 292 RL 293 (294) T ss_pred HC T ss_conf 12 No 69 >COG3509 LpqC Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=99.22 E-value=1.1e-10 Score=84.27 Aligned_cols=199 Identities=16% Similarity=0.160 Sum_probs=120.7 Q ss_pred EECCCCCEEEEEEECCCCC--CCCEEEEECCCCCCCCCCCCHH-HH---HHHHHHHHCCCEEEEEE-ECCCC------CC Q ss_conf 9658996187999668887--7879999578998888998999-99---99999997398499985-31558------87 Q gi|254781093|r 5 VFNGPSGRLEGRYQPSTNP--NAPIALILHPHPRFGGTMNDNI-VY---QLFYLFQQRGFVSLRFN-FRGIG------RS 71 (225) Q Consensus 5 ~i~g~~G~l~~~~~~~~~~--~~~~vv~~Hp~p~~GG~~~~~~-~~---~la~~l~~~G~~vl~fd-~RG~G------~S 71 (225) .|....++.+.+|+.|.+. ..|++|++| |+..|-. .. .+-+...+.||.|+.+| |.+.+ .+ T Consensus 39 s~~~~g~~r~y~l~vP~g~~~~apLvv~LH------G~~~sgag~~~~sg~d~lAd~~gFlV~yPdg~~~~wn~~~~~~~ 112 (312) T COG3509 39 SFDVNGLKRSYRLYVPPGLPSGAPLVVVLH------GSGGSGAGQLHGTGWDALADREGFLVAYPDGYDRAWNANGCGNW 112 (312) T ss_pred CCCCCCCCCCEEEECCCCCCCCCCEEEEEE------CCCCCHHHHHCCCCHHHHHCCCCCEEECCCCCCCCCCCCCCCCC T ss_conf 411278753168971898899987799983------68887678641355565524158679776766655688756556 Q ss_pred CCCC--CCCCHHHHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEEC-CCCCCCCCC-------- Q ss_conf 6778--52102389999989999862-35467405888720368787645038621330122-757443332-------- Q gi|254781093|r 72 EGEF--DYGDGELSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRPEINGFISV-APQPKSYDF-------- 139 (225) Q Consensus 72 ~G~~--~~g~~E~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~i-sp~~~~~~~-------- 139 (225) -|.- ..+..|+.++.+.++-+..+ ..|+.+|++.|+|-||.|+..++..+|++-+.+.+ +.... .+. T Consensus 113 ~~p~~~~~g~ddVgflr~lva~l~~~~gidp~RVyvtGlS~GG~Ma~~lac~~p~~faa~A~VAg~~~-~~~a~~~~rp~ 191 (312) T COG3509 113 FGPADRRRGVDDVGFLRALVAKLVNEYGIDPARVYVTGLSNGGRMANRLACEYPDIFAAIAPVAGLLA-LGVACTPPRPV 191 (312) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEEECCHHHHHHHHHHCCCCCCCCEEEEECCCC-CCCCCCCCCCH T ss_conf 88664447850788999999998774076742279982075789999987407543112013304567-77556899725 Q ss_pred ---------CCCCCC-CCCEEEEECCCCCCCCHHHHHHHHHHHHHC----------------------CCCCEEEEEECC Q ss_conf ---------223433-477377616865502899999999998607----------------------799607999787 Q gi|254781093|r 140 ---------SFLAPC-PSSGLIINGSNDTVATTSDVKDLVNKLMNQ----------------------KGISITHKVIPD 187 (225) Q Consensus 140 ---------~~l~~~-~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~----------------------~~~~~~~~~i~g 187 (225) +.+.+. .-++-|-||..|..|+...+.++...+... .+..+++..|.| T Consensus 192 ~~m~~~G~~Dp~~p~~gG~~~~g~g~~~~~v~~~~~~~~Waa~ng~~~~p~~~~~~~~~~~~~~~~~~~~~~V~~y~i~g 271 (312) T COG3509 192 SVMAFHGTADPLNPYHGGGVPIGRGQRDGVVSAADLAARWAAVNGCQAGPDTAELPDVGDGTDYDTCDGNARVELYTIDG 271 (312) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEECCCCCCCEEEEEEEC T ss_conf 68973588998888879986646343466436899999999854888897543157776562321147985268999807 Q ss_pred CCCCCCCCHH-------------HHHHHHHHHHHHH Q ss_conf 9847558589-------------9999999999973 Q gi|254781093|r 188 ANHFFIGKVD-------------ELINECAHYLDNS 210 (225) Q Consensus 188 a~H~f~~~~~-------------~l~~~i~~fl~~~ 210 (225) .+|.+..... ...+.+-+|...+ T Consensus 272 ~GH~wp~~~~~~~~~~g~~t~~~dat~~iw~Ff~~~ 307 (312) T COG3509 272 GGHTWPGGTQYGPAALGMSTRGFDATERIWRFFRQH 307 (312) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 865575777777655443345755099999999852 No 70 >COG4188 Predicted dienelactone hydrolase [General function prediction only] Probab=99.20 E-value=2.1e-10 Score=82.65 Aligned_cols=163 Identities=21% Similarity=0.273 Sum_probs=105.3 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC--CCCCCC------CC--CCCHHHHHHHHHHHHHH Q ss_conf 787999957899888899899999999999973984999853155--887677------85--21023899999899998 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGI--GRSEGE------FD--YGDGELSDAAAALDWVQ 93 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~--G~S~G~------~~--~g~~E~~D~~aa~~~l~ 93 (225) ..|+|++.||. |++.++ +..+|+.|++.||.|...+.-|. |+.+-. +. ....+..|+..+++++. T Consensus 70 ~~PlvvlshG~---Gs~~~~--f~~~A~~lAs~Gf~Va~~~hpgs~~~~~~~~~~~~~~~~p~~~~erp~dis~lLd~L~ 144 (365) T COG4188 70 LLPLVVLSHGS---GSYVTG--FAWLAEHLASYGFVVAAPDHPGSNAGGAPAAYAGPGSYAPAEWWERPLDISALLDALL 144 (365) T ss_pred CCCEEEECCCC---CCCCCC--HHHHHHHHHHCCEEEEECCCCCCCCCCCCHHHCCCCCCCHHHHHCCCCCHHHHHHHHH T ss_conf 67769836888---887531--3666777863863898526898642458731247765562545415000799999999 Q ss_pred HH---C-----CCCCCEEEEEECHHHHHHHHHHHCCCCHHE--------------------------------------- Q ss_conf 62---3-----546740588872036878764503862133--------------------------------------- Q gi|254781093|r 94 SL---N-----PESKSCWIAGYSFGAWISMQLLMRRPEING--------------------------------------- 126 (225) Q Consensus 94 ~~---~-----~~~~~i~l~G~S~Gg~val~~a~~~p~i~~--------------------------------------- 126 (225) +. - .+..+|.++||||||+.++.++..+.+... T Consensus 145 ~~~~sP~l~~~ld~~~Vgv~GhS~GG~T~m~laGA~~~~~~~~~~C~~~~~~~~~~~~~~~~~l~q~~av~~~~~~~~~r 224 (365) T COG4188 145 QLTASPALAGRLDPQRVGVLGHSFGGYTAMELAGAELDAEALLQHCESASRICLDPPGLNGRLLNQCAAVWLPRQAYDLR 224 (365) T ss_pred HHHCCCCCCCCCCCCCEEEEECCCCCHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCHHHHCCC T ss_conf 74238421344675543787226665899985035306799999865533214688776766640353436651320455 Q ss_pred ------EEECCCCCC-CCCCCCCCCCCCCEEEEECCCCCCCCHHHHHH-HHHHHHHCCCCCEEEEEECCCCCC-CCC Q ss_conf ------012275744-33322234334773776168655028999999-999986077996079997879847-558 Q gi|254781093|r 127 ------FISVAPQPK-SYDFSFLAPCPSSGLIINGSNDTVATTSDVKD-LVNKLMNQKGISITHKVIPDANHF-FIG 194 (225) Q Consensus 127 ------~v~isp~~~-~~~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~-~~~~l~~~~~~~~~~~~i~ga~H~-f~~ 194 (225) ++++.|..+ .+..+-+.....|++++-|..|.+.|+..-.. -+..+ .+....+..++++.|+ |.+ T Consensus 225 DpriravvA~~p~~~~~Fg~tgl~~v~~P~~~~a~s~D~~aP~~~~~~~~f~~l---~g~~k~~~~vp~a~h~sfl~ 298 (365) T COG4188 225 DPRIRAVVAINPALGMIFGTTGLVKVTDPVLLAAGSADGFAPPVTEQIRPFGYL---PGALKYLRLVPGATHFSFLE 298 (365) T ss_pred CCCCEEEEECCCCCCCCCCCCCCEEEECCEEEECCCCCCCCCCCCCCCCCCCCC---CCCHHHEEECCCCCCCCCCC T ss_conf 600135664167754552644221321322563145555577645553320027---76413203247986643034 No 71 >KOG2984 consensus Probab=99.16 E-value=2.1e-10 Score=82.61 Aligned_cols=153 Identities=18% Similarity=0.223 Sum_probs=105.5 Q ss_pred HHHHHCCCEEEEEEECCCCCCCCC-CCCCC-HHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEE Q ss_conf 999973984999853155887677-85210-23899999899998623546740588872036878764503862-1330 Q gi|254781093|r 51 YLFQQRGFVSLRFNFRGIGRSEGE-FDYGD-GELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGF 127 (225) Q Consensus 51 ~~l~~~G~~vl~fd~RG~G~S~G~-~~~g~-~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~ 127 (225) +.+...-+.++..|-+|+|.|--. -+.+. --..|++++++-++... -.++-++|||=||-.|+.+|++.++ |... T Consensus 65 ~l~k~l~~TivawDPpGYG~SrPP~Rkf~~~ff~~Da~~avdLM~aLk--~~~fsvlGWSdGgiTalivAak~~e~v~rm 142 (277) T KOG2984 65 SLFKPLQVTIVAWDPPGYGTSRPPERKFEVQFFMKDAEYAVDLMEALK--LEPFSVLGWSDGGITALIVAAKGKEKVNRM 142 (277) T ss_pred HCCCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC--CCCEEEEEECCCCEEEEEEECCCHHHHHHH T ss_conf 358878537998789988888997332207899872899999999727--787047631478757887502476654342 Q ss_pred EECCCCCCC----------------------------CCC---------------------------CCCCCCCCCEEEE Q ss_conf 122757443----------------------------332---------------------------2234334773776 Q gi|254781093|r 128 ISVAPQPKS----------------------------YDF---------------------------SFLAPCPSSGLII 152 (225) Q Consensus 128 v~isp~~~~----------------------------~~~---------------------------~~l~~~~~p~LiI 152 (225) |-++..+-. |.. ..+..+..|+||+ T Consensus 143 iiwga~ayvn~~~~ma~kgiRdv~kWs~r~R~P~e~~Yg~e~f~~~wa~wvD~v~qf~~~~dG~fCr~~lp~vkcPtli~ 222 (277) T KOG2984 143 IIWGAAAYVNHLGAMAFKGIRDVNKWSARGRQPYEDHYGPETFRTQWAAWVDVVDQFHSFCDGRFCRLVLPQVKCPTLIM 222 (277) T ss_pred EEECCCCEECCHHHHHHHCHHHHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCEEEE T ss_conf 26422123352367787431777555254036277741878899999999999999861689846765255345873676 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC-CCHHHHHHHHHHHHHHH Q ss_conf 16865502899999999998607799607999787984755-85899999999999973 Q gi|254781093|r 153 NGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFI-GKVDELINECAHYLDNS 210 (225) Q Consensus 153 hG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~-~~~~~l~~~i~~fl~~~ 210 (225) ||..|++|+-.++-=+-...+. .++++.|.+.|-|+ .+.++..+.+.+||++. T Consensus 223 hG~kDp~~~~~hv~fi~~~~~~-----a~~~~~peGkHn~hLrya~eFnklv~dFl~~~ 276 (277) T KOG2984 223 HGGKDPFCGDPHVCFIPVLKSL-----AKVEIHPEGKHNFHLRYAKEFNKLVLDFLKST 276 (277) T ss_pred ECCCCCCCCCCCCCCHHHHCCC-----CEEEECCCCCCCEEEECHHHHHHHHHHHHHCC T ss_conf 0786877788776514442555-----44887667775435642698879999997416 No 72 >pfam06821 DUF1234 Alpha/Beta hydrolase family of unknown function (DUF1234). The Crystal Structure Of The Yden Gene Product from B. Subtilis has been solved. The structure shows an alpha-beta hydrolase fold suggesting an enzymatic function for these proteins. Probab=99.15 E-value=1.5e-09 Score=77.57 Aligned_cols=142 Identities=18% Similarity=0.185 Sum_probs=92.5 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEE Q ss_conf 99957899888899899999999999973984999853155887677852102389999989999862354674058887 Q gi|254781093|r 28 ALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGY 107 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~ 107 (225) |+|+| +++||..+.=..-+.+.+. .+.|+... +++ .-..++-.+.++-..++. + .+++|+|| T Consensus 1 VlIvh---G~~~s~~~HWq~wl~~~~~----~~~~v~~~-------d~~--~P~~~~W~~~L~~~l~~~-d-~~~ilVaH 62 (171) T pfam06821 1 ILIVP---GYGGSGPGHWQSWWERRLP----AARRVEQA-------DWL--QPVLDDWVAALSAAVAAA-P-GPVVLVAH 62 (171) T ss_pred CEEEC---CCCCCCCHHHHHHHHHHCC----CCEEECCC-------CCC--CCCHHHHHHHHHHHHHHC-C-CCEEEEEE T ss_conf 98938---9899985367999998787----98794798-------989--989999999999999736-8-98599973 Q ss_pred CHHHHHHHHHHHCC--CCHHEEEECCCCCCCC---------CCC--CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHH Q ss_conf 20368787645038--6213301227574433---------322--2343347737761686550289999999999860 Q gi|254781093|r 108 SFGAWISMQLLMRR--PEINGFISVAPQPKSY---------DFS--FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMN 174 (225) Q Consensus 108 S~Gg~val~~a~~~--p~i~~~v~isp~~~~~---------~~~--~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~ 174 (225) |+|...++..+.+. ..|+++++++|+-... .|. .....+.|.++|..++|+++|.+.++.+++.+.. T Consensus 63 SLGc~~~l~~l~~~~~~~i~g~~LVAp~d~~~~~~~~~~~~~f~~~~~~~l~~~~~viaS~nDp~~~~~~a~~lA~~~ga 142 (171) T pfam06821 63 SLGCLAVAHWAEPQLRAKVAGALLVAPADVEPSAPRPAALATFAPLPRDPLPFPSLVVASRNDPYCPFERAESLAQAWGA 142 (171) T ss_pred CHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCHHHHCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHCCC T ss_conf 57899999998774142204489972688655666623320125576556897779998279998899999999998099 Q ss_pred CCCCCEEEEEECCCCCCCC Q ss_conf 7799607999787984755 Q gi|254781093|r 175 QKGISITHKVIPDANHFFI 193 (225) Q Consensus 175 ~~~~~~~~~~i~ga~H~f~ 193 (225) +++.+++++||-. T Consensus 143 ------~~~~~~~~GHfn~ 155 (171) T pfam06821 143 ------EFVDLGHAGHINV 155 (171) T ss_pred ------EEEECCCCCCCCC T ss_conf ------3897589887633 No 73 >pfam00756 Esterase Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is related to the pfam00135 family. Probab=99.14 E-value=2.5e-08 Score=70.13 Aligned_cols=116 Identities=15% Similarity=0.125 Sum_probs=77.3 Q ss_pred HHHHHHHHHHCC--CCCCEEEEEECHHHHHHHHHHHCCCCH-HEEEECCCCCCCCCCCCC-----------------CCC Q ss_conf 998999986235--467405888720368787645038621-330122757443332223-----------------433 Q gi|254781093|r 86 AAALDWVQSLNP--ESKSCWIAGYSFGAWISMQLLMRRPEI-NGFISVAPQPKSYDFSFL-----------------APC 145 (225) Q Consensus 86 ~aa~~~l~~~~~--~~~~i~l~G~S~Gg~val~~a~~~p~i-~~~v~isp~~~~~~~~~l-----------------~~~ 145 (225) ..++.++++..+ +.+..+++|+||||..|+.++.++|+. .++++.||....+..... ... T Consensus 95 ~el~p~i~~~y~~~~~~~~ai~G~SmGG~~Al~~al~~Pd~F~~~~~~Sg~~~~w~~~~~~~~~~~d~~~~~~~~~~~~~ 174 (243) T pfam00756 95 QELPPLLDANFPTAPDGDRALAGQSMGGLGALYLALHYPDRFGSVSSFSPINSMWGPEDDPAWQEGDPVLLAVALSANNT 174 (243) T ss_pred HHHHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCHHHHCCHHHHHHHHHCCCC T ss_conf 99999999857857876717985287699999999968253062313088667789876510221788999987322677 Q ss_pred CCCEEEEECCCCCCCC----------HHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHH Q ss_conf 4773776168655028----------999999999986077996079997879847558589999999 Q gi|254781093|r 146 PSSGLIINGSNDTVAT----------TSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINEC 203 (225) Q Consensus 146 ~~p~LiIhG~~D~~vp----------~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i 203 (225) ...+.+..|+.|.... .+..+++.+.+.. +++++++++++| +|-|.-.+..|.+.. T Consensus 175 ~~~i~~~~G~~e~~~~~~~~~~~~~~~~~n~~l~~~L~~-~gi~~~y~~~~G-gH~w~~Wr~~L~~~L 240 (243) T pfam00756 175 RLRIYLDVGTREDFLGDQLPVEILDEAAPNRELAEQLAY-RGVGGHDHVYYG-GHDWAYWRAQLIAAL 240 (243) T ss_pred CCEEEEECCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHH-CCCCCEEEEECC-CCCHHHHHHHHHHHH T ss_conf 865999758986013322462679999999799999998-799807986487-059557889999874 No 74 >COG1505 Serine proteases of the peptidase family S9A [Amino acid transport and metabolism] Probab=99.13 E-value=1.9e-09 Score=76.90 Aligned_cols=202 Identities=19% Similarity=0.194 Sum_probs=127.5 Q ss_pred EEECCCCC-EEEEEEEC-C-CCCCCCEEEEECCCCCCCCCC--CCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC-CC Q ss_conf 89658996-18799966-8-887787999957899888899--8999999999999739849998531558876778-52 Q gi|254781093|r 4 VVFNGPSG-RLEGRYQP-S-TNPNAPIALILHPHPRFGGTM--NDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF-DY 77 (225) Q Consensus 4 v~i~g~~G-~l~~~~~~-~-~~~~~~~vv~~Hp~p~~GG~~--~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~-~~ 77 (225) .+-...|| |+...+.. . +..+.|+. +|. |||.. ..+.+...-+...++|..-+.-|.||-|+-.=.| +. T Consensus 397 ~~atSkDGT~IPYFiv~K~~~~D~~pTl--l~a---YGGF~vsltP~fs~~~~~WLerGg~fv~ANIRGGGEfGp~WH~A 471 (648) T COG1505 397 FFATSKDGTRIPYFIVRKGAKKDENPTL--LYA---YGGFNISLTPRFSGSRKLWLERGGVFVLANIRGGGEFGPEWHQA 471 (648) T ss_pred EEEECCCCCCCCEEEEECCCCCCCCCEE--EEE---CCCCCCCCCCCCCHHHHHHHHCCCEEEEEECCCCCCCCHHHHHH T ss_conf 9997278854217998547777888658--995---15624455776423778887618869999614677568799997 Q ss_pred CCH-----HHHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCCHHEE-EECCCCCCCCCCCCC-------- Q ss_conf 102-----3899999899998623-5467405888720368787645038621330-122757443332223-------- Q gi|254781093|r 78 GDG-----ELSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPEINGF-ISVAPQPKSYDFSFL-------- 142 (225) Q Consensus 78 g~~-----E~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~-v~isp~~~~~~~~~l-------- 142 (225) |.. -.+|..||.+++.+++ ..+.++.+.|-|-||.+...+..++|++-+. |+-.|-..+..+..+ T Consensus 472 a~k~nrq~vfdDf~AVaedLi~rgitspe~lgi~GGSNGGLLvg~alTQrPelfgA~v~eVPllDMlRYh~l~aGasW~~ 551 (648) T COG1505 472 GMKENKQNVFDDFIAVAEDLIKRGITSPEKLGIQGGSNGGLLVGAALTQRPELFGAAVCEVPLLDMLRYHLLTAGSSWIA 551 (648) T ss_pred HHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCCEEEEEEECCCHHHHCCEEECCCHHHHHHHCCCCCCHHHHH T ss_conf 76403201367799999999981899878962005777764676663148144182443563066543401566601375 Q ss_pred -----------------CCC--------CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHH Q ss_conf -----------------433--------4773776168655028999999999986077996079997879847558589 Q gi|254781093|r 143 -----------------APC--------PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVD 197 (225) Q Consensus 143 -----------------~~~--------~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~ 197 (225) +|. --|+||-.+++|++|.|.++++++.+|.... .++-+.+--+++|.=...+. T Consensus 552 EYG~Pd~P~d~~~l~~YSPy~nl~~g~kYP~~LITTs~~DDRVHPaHArKfaa~L~e~~-~pv~~~e~t~gGH~g~~~~~ 630 (648) T COG1505 552 EYGNPDDPEDRAFLLAYSPYHNLKPGQKYPPTLITTSLHDDRVHPAHARKFAAKLQEVG-APVLLREETKGGHGGAAPTA 630 (648) T ss_pred HCCCCCCHHHHHHHHHCCCHHCCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCHH T ss_conf 51898987898998863943117765557975787035556516068899999998538-96599952477656789757 Q ss_pred H-HHH--HHHHHHHHHC Q ss_conf 9-999--9999999731 Q gi|254781093|r 198 E-LIN--ECAHYLDNSL 211 (225) Q Consensus 198 ~-l~~--~i~~fl~~~L 211 (225) + ..+ -+-.||.+.| T Consensus 631 ~~A~~~a~~~afl~r~L 647 (648) T COG1505 631 EIARELADLLAFLLRTL 647 (648) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 88999999999999861 No 75 >pfam05728 UPF0227 Uncharacterized protein family (UPF0227). Despite being classed as uncharacterized proteins, the members of this family are almost certainly enzymes that are distantly related to the pfam00561. Probab=99.13 E-value=4.3e-09 Score=74.77 Aligned_cols=152 Identities=16% Similarity=0.148 Sum_probs=97.0 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHCC--CEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEE Q ss_conf 999578998888998999999999999739--849998531558876778521023899999899998623546740588 Q gi|254781093|r 28 ALILHPHPRFGGTMNDNIVYQLFYLFQQRG--FVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIA 105 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G--~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~ 105 (225) ++++| +|..+..+.=...+.+.+++.| ..+..+|.- . .-+.+.+-++-+.+.. ..+++.|+ T Consensus 2 IlYLH---GF~SSp~S~Ka~~l~~~~~~~~~~~~~~~P~L~----------~---~P~~ai~~l~~~I~~~-~~~~~~Li 64 (187) T pfam05728 2 ILYLH---GFNSSPGSHKAQLLLQFIAQDVRFINYSTPHLP----------H---DPQQALKELEKAVAEL-GDDNPLLV 64 (187) T ss_pred EEEEC---CCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCC----------C---CHHHHHHHHHHHHHHC-CCCCEEEE T ss_conf 69956---878898778999999999851998679789999----------8---8999999999999965-89974899 Q ss_pred EECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCC------------------------CCC-------CCCCCCEEEEEC Q ss_conf 8720368787645038621330122757443332------------------------223-------433477377616 Q gi|254781093|r 106 GYSFGAWISMQLLMRRPEINGFISVAPQPKSYDF------------------------SFL-------APCPSSGLIING 154 (225) Q Consensus 106 G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~------------------------~~l-------~~~~~p~LiIhG 154 (225) |.|+||+-|..+|-+.. +++ |++.|++.-+.. ..+ ...|...|++++ T Consensus 65 GSSLGGfyAt~lae~~~-~~a-VLINPAv~p~~~l~~~lG~~~n~~t~e~~~~~~~h~~~lk~l~~~~~~~p~~~lvLlq 142 (187) T pfam05728 65 GSSLGGYYATWLGFRCG-LRQ-VLFNPAVRPYENLAGKLGEQANPYTGQTYVLEEYHDIELKCLEVFRLKNPDRCLVLLQ 142 (187) T ss_pred ECCCHHHHHHHHHHHCC-CCE-EEECCCCCHHHHHHHHCCCCCCCCCCCCEEECHHHHHHHHHHCCCCCCCCCCEEEEEE T ss_conf 42707999999998639-877-9975897755618976498767888972687289999998605333478613899985 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHH Q ss_conf 86550289999999999860779960799978798475585899999999999 Q gi|254781093|r 155 SNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYL 207 (225) Q Consensus 155 ~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl 207 (225) +.|++.+.+.+.++++. . ...+.+|+||.|.+.++.+ ..|.+|+ T Consensus 143 ~gDEvLdyr~a~~~y~~---~-----~~~i~~GGdH~f~~f~~~l-~~I~~Fl 186 (187) T pfam05728 143 KGDEVLDYRRAAAHYRP---Y-----YEIVWDGGDHKFKDFSRHL-QQIKAFK 186 (187) T ss_pred CCCCCCCHHHHHHHHCC---C-----EEEEECCCCCCCCCHHHHH-HHHHHHC T ss_conf 68844489999999579---8-----3699759988857499999-9999855 No 76 >PRK06765 homoserine O-acetyltransferase; Provisional Probab=99.12 E-value=1.7e-09 Score=77.18 Aligned_cols=187 Identities=16% Similarity=0.139 Sum_probs=117.4 Q ss_pred CCCCCEEEEECCCCCCCCCCC--------CH------HHHHHHHHHHHCCCEEEEEEECCCCC-------CCCCCC---- Q ss_conf 877879999578998888998--------99------99999999997398499985315588-------767785---- Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMN--------DN------IVYQLFYLFQQRGFVSLRFNFRGIGR-------SEGEFD---- 76 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~--------~~------~~~~la~~l~~~G~~vl~fd~RG~G~-------S~G~~~---- 76 (225) .....++++|| .+.|+.+ +. .+..--+.+--.-|-|+++|.-|.+. |.|.-+ T Consensus 49 ~~~~NaILv~H---altG~~h~ag~~~~~d~~~gWW~~liG~gk~iDt~kyfVIc~N~lG~~~g~~p~~~stGP~S~~p~ 125 (387) T PRK06765 49 RAKSNVILITH---YFSATSHAAGKYTAHDEEAGYWDGLIGPGKAIDTNKYFVISTDTLCNVQVKDPNVITTGPASINPK 125 (387) T ss_pred CCCCCEEEEEC---CCCCCHHHCCCCCCCCCCCCCHHHCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99997899948---857763322677777888771876358998657676699994577777778988787899999989 Q ss_pred CC----C----HHHHHHHHHHHHHHHHCCCCCCE-EEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC---------- Q ss_conf 21----0----23899999899998623546740-588872036878764503862-1330122757443---------- Q gi|254781093|r 77 YG----D----GELSDAAAALDWVQSLNPESKSC-WIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKS---------- 136 (225) Q Consensus 77 ~g----~----~E~~D~~aa~~~l~~~~~~~~~i-~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~---------- 136 (225) .| . --+.|...+-.-+.+. ...+++ .++|.||||..|+.-|..+|+ ++.+|.++..... T Consensus 126 tg~pyg~~FP~vTi~D~V~aq~~LLd~-LGI~kl~aViGgSmGGmqAlewa~~~Pd~v~~~i~Ia~sa~~s~~~ia~~e~ 204 (387) T PRK06765 126 TGKPYGMDFPVVTILDFVRVQKELLED-LGIARLHAVMGPSMGGMQAQEWAVHYPHMVERMIGVIGNPQNDAWTSVNLAQ 204 (387) T ss_pred CCCCCCCCCCCEEHHHHHHHHHHHHHH-HCCCEEEECCCCCHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 998665899940328899999999998-3988572114658188999999997958998898876301489999999999 Q ss_pred ---------------C---------------------------------------------------------------- Q ss_conf ---------------3---------------------------------------------------------------- Q gi|254781093|r 137 ---------------Y---------------------------------------------------------------- 137 (225) Q Consensus 137 ---------------~---------------------------------------------------------------- 137 (225) | T Consensus 205 ~r~aI~~Dp~w~~G~Y~~~~~P~~GL~~ar~~~~~~y~S~~~~~~~fgr~~~~~~~~~~~~~~~~vesyL~~~g~k~~~r 284 (387) T PRK06765 205 WAIAIRLDPNWKGGKYYGEEQPLKGLTLANRMMTMNAFDEHFYETTFPRNSIEVEPYEKVSTLTSFEKEINKATYARAEL 284 (387) T ss_pred HHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99998509434488777878952259999999998715999999884823456542100231268999999999887653 Q ss_pred -C---------------C--------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC-CCCC-C Q ss_conf -3---------------2--------22343347737761686550289999999999860779960799978-7984-7 Q gi|254781093|r 138 -D---------------F--------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIP-DANH-F 191 (225) Q Consensus 138 -~---------------~--------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~-ga~H-~ 191 (225) | . ..|.....++|+|..+.|-+.|+++.+++++.++.. +.++++++|+ .-|| . T Consensus 285 fDaNsyL~l~ra~d~~Di~~~~~~l~~aL~~I~a~~Lvi~i~sD~lFpp~e~~e~~~~l~~~-~~~~~~~eI~S~~GHDa 363 (387) T PRK06765 285 VDANHWLYLAKAVQLFDAGHGFSSLEEALSNIEANVLMIPCKQDLLQPPRYNYKMVDILQKQ-GKYAEVYEIEGINGHMA 363 (387) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHC-CCCEEEEEECCCCCCCC T ss_conf 59778999999877546556688899999738998899985773176989999999999855-99879999799998361 Q ss_pred CCCCHHHHHHHHHHHHHHHCCC Q ss_conf 5585899999999999973122 Q gi|254781093|r 192 FIGKVDELINECAHYLDNSLDE 213 (225) Q Consensus 192 f~~~~~~l~~~i~~fl~~~L~~ 213 (225) |--..+.+.+.|.+||++++.. T Consensus 364 FLie~~~~~~~i~~FL~~~~~~ 385 (387) T PRK06765 364 GVFDIHLFEKKIYEFLNRKVSS 385 (387) T ss_pred CCCCHHHHHHHHHHHHHHHHHH T ss_conf 0628999999999999876763 No 77 >COG1770 PtrB Protease II [Amino acid transport and metabolism] Probab=99.11 E-value=4.9e-09 Score=74.41 Aligned_cols=200 Identities=21% Similarity=0.202 Sum_probs=135.1 Q ss_pred EEECCCCC-E--EEEEEECC--CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC-CC Q ss_conf 89658996-1--87999668--8877879999578998888998999999999999739849998531558876778-52 Q gi|254781093|r 4 VVFNGPSG-R--LEGRYQPS--TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF-DY 77 (225) Q Consensus 4 v~i~g~~G-~--l~~~~~~~--~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~-~~ 77 (225) ++.+..+| + +.-.|... ...+.|++++ +++.||-+++- .+....-.|..+||.--.-.-||-|.-...| +. T Consensus 422 iwa~a~dgv~VPVSLvyrkd~~~~g~~p~lLy--gYGaYG~s~~p-~Fs~~~lSLlDRGfiyAIAHVRGGgelG~~WYe~ 498 (682) T COG1770 422 IWATADDGVQVPVSLVYRKDTKLDGSAPLLLY--GYGAYGISMDP-SFSIARLSLLDRGFVYAIAHVRGGGELGRAWYED 498 (682) T ss_pred EEEECCCCEEEEEEEEEECCCCCCCCCCEEEE--EECCCCCCCCC-CCCCCEEEEECCCEEEEEEEEECCCCCCHHHHHH T ss_conf 98875898084179999415667899857999--95345655786-6441242200172699999950664347578873 Q ss_pred CCHH-----HHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCCC----------- Q ss_conf 1023-----899999899998623-546740588872036878764503862-1330122757443332----------- Q gi|254781093|r 78 GDGE-----LSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYDF----------- 139 (225) Q Consensus 78 g~~E-----~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~~----------- 139 (225) |..+ ..|..|+.++|.+++ .+..+|+++|-|-||.+...++..+|+ .+++|+-.|.++..+. T Consensus 499 GK~l~K~NTf~DFIa~a~~Lv~~g~~~~~~i~a~GGSAGGmLmGav~N~~P~lf~~iiA~VPFVDvltTMlD~slPLT~~ 578 (682) T COG1770 499 GKLLNKKNTFTDFIAAARHLVKEGYTSPDRIVAIGGSAGGMLMGAVANMAPDLFAGIIAQVPFVDVLTTMLDPSLPLTVT 578 (682) T ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHHHHHHHCHHHHHHEEECCCCCCHHHHHCCCCCCCCCC T ss_conf 44553336478899999999971767824158963572348999998618465431243277333123203877898852 Q ss_pred -----------------------CCCCCCC-CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCC--EEEEEECCCCC-CC Q ss_conf -----------------------2234334-773776168655028999999999986077996--07999787984-75 Q gi|254781093|r 140 -----------------------SFLAPCP-SSGLIINGSNDTVATTSDVKDLVNKLMNQKGIS--ITHKVIPDANH-FF 192 (225) Q Consensus 140 -----------------------~~l~~~~-~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~--~~~~~i~ga~H-~f 192 (225) +-+...+ .++|++.|..|..|..=+--+|+.+++..+... +-+.+-.+++| .. T Consensus 579 E~~EWGNP~d~e~y~yikSYSPYdNV~a~~YP~ilv~~Gl~D~rV~YwEpAKWvAkLR~~~td~~plLlkt~M~aGHgG~ 658 (682) T COG1770 579 EWDEWGNPLDPEYYDYIKSYSPYDNVEAQPYPAILVTTGLNDPRVQYWEPAKWVAKLRELKTDGNPLLLKTNMDAGHGGA 658 (682) T ss_pred CHHHHCCCCCHHHHHHHHHCCCHHCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCC T ss_conf 03430797778899998625933214557888368970345786145207899998764136897179984134657877 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 58589999999999 Q gi|254781093|r 193 IGKVDELINECAHY 206 (225) Q Consensus 193 ~~~~~~l~~~i~~f 206 (225) +++.+.+.+...+| T Consensus 659 SgRf~~lee~A~eY 672 (682) T COG1770 659 SGRFQRLEEIAFEY 672 (682) T ss_pred CCCHHHHHHHHHHH T ss_conf 87368899999998 No 78 >smart00824 PKS_TE Thioesterase. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates. There are also modules within the peptide synthetases that also share this similarity. With respect to antibiotic production, thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Thioesterases (non-integrated) have molecular masses of 25-29 kDa. Probab=99.11 E-value=4e-09 Score=74.93 Aligned_cols=165 Identities=13% Similarity=0.102 Sum_probs=97.6 Q ss_pred EECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHH-HHHHHHHCCCCCCEEEEEEC Q ss_conf 95789988889989999999999997398499985315588767785210238999998-99998623546740588872 Q gi|254781093|r 30 ILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAA-LDWVQSLNPESKSCWIAGYS 108 (225) Q Consensus 30 ~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa-~~~l~~~~~~~~~i~l~G~S 108 (225) ||=|++.-||+-. .+..+|+.|. ....|+..-++|.+. |+.- ...++++... ++-+++..++ .+..|+||| T Consensus 1 vC~p~~~agg~~~--~Y~~lA~~L~-~~~~v~alq~pG~~~--~ep~--p~s~e~lA~~~~~ai~~~~p~-gPy~L~GhS 72 (212) T smart00824 1 ICFPSTAAPSGPH--EYARLAAALR-GRRDVSALPLPGFGP--GEPL--PASADALVEAQAEAVLRAAGG-RPFVLVGHS 72 (212) T ss_pred CCCCCCCCCCCHH--HHHHHHHHCC-CCCCEEEEECCCCCC--CCCC--CCCHHHHHHHHHHHHHHHCCC-CCEEEEEEC T ss_conf 9898888888899--9999998579-998689854899899--9989--889999999999999986799-987999976 Q ss_pred HHHHHHHHHHHCC----CCHHEEEECCCCC-CCCC---------------------------------CC-----CCCCC Q ss_conf 0368787645038----6213301227574-4333---------------------------------22-----23433 Q gi|254781093|r 109 FGAWISMQLLMRR----PEINGFISVAPQP-KSYD---------------------------------FS-----FLAPC 145 (225) Q Consensus 109 ~Gg~val~~a~~~----p~i~~~v~isp~~-~~~~---------------------------------~~-----~l~~~ 145 (225) |||.+|..+|.+- ..+..++++.... .... +. ...+. T Consensus 73 ~Gg~vA~E~A~~L~a~G~~v~~L~llD~~~p~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~l~a~~~~~~l~~~~~p~~~ 152 (212) T smart00824 73 SGGLLAHAVAARLEARGIPPAAVVLLDTYPPGDPAPEGWLPELLRGVFEREDSFVPMDDARLTAMGAYLRLFGGWTPGPV 152 (212) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 85899999999999779994289995888998756442047888899986514567777889986999999735789888 Q ss_pred CCCEEEEEC-CCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCC--CCCHHHHHHHHHHHH Q ss_conf 477377616-86550289999999999860779960799978798475--585899999999999 Q gi|254781093|r 146 PSSGLIING-SNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFF--IGKVDELINECAHYL 207 (225) Q Consensus 146 ~~p~LiIhG-~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f--~~~~~~l~~~i~~fl 207 (225) ..|++++.+ +.|.-++.+....|.+-. ...++.+.+|| |||+ ..+...+.+.|.+|| T Consensus 153 ~~p~ll~~a~~~~~~~~~~~~~~W~~~~----~~~~~~~~vpG-dHf~ml~e~~~~~A~~i~~wl 212 (212) T smart00824 153 AAPTLLVRASEPLAEWPDEDPDGWRAHW----PLPHTVVDVPG-DHFTMMEEHAAATARAVHDWL 212 (212) T ss_pred CCCEEEEECCCCCCCCCCHHHHHHHHCC----CCCCEEEEECC-CCHHHHHHHHHHHHHHHHHHC T ss_conf 8987999805888888821345687458----99966999789-772141430999999999769 No 79 >COG3208 GrsT Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=99.11 E-value=2e-09 Score=76.82 Aligned_cols=167 Identities=21% Similarity=0.310 Sum_probs=100.9 Q ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHH-C-C-CCCCEE Q ss_conf 999957899888899899999999999973984999853155887677852102389999989999862-3-5-467405 Q gi|254781093|r 27 IALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSL-N-P-ESKSCW 103 (225) Q Consensus 27 ~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~-~-~-~~~~i~ 103 (225) .-++|=||. ||+.. .+..+.+.|.. -+.++...+-|.|+--++ .-..|+.+.+|-+... . + ..++.+ T Consensus 8 ~~L~cfP~A--GGsa~--~fr~W~~~lp~-~iel~avqlPGR~~r~~e-----p~~~di~~Lad~la~el~~~~~d~P~a 77 (244) T COG3208 8 LRLFCFPHA--GGSAS--LFRSWSRRLPA-DIELLAVQLPGRGDRFGE-----PLLTDIESLADELANELLPPLLDAPFA 77 (244) T ss_pred CEEEEECCC--CCCHH--HHHHHHHHCCC-HHHEEEECCCCCCCCCCC-----CCCCCHHHHHHHHHHHHCCCCCCCCEE T ss_conf 249984688--98778--87777764884-143463127884012478-----611059999999998746436799764 Q ss_pred EEEECHHHHHHHHHHHCCCC----HHEE-EECCCCCC------------------------------------------- Q ss_conf 88872036878764503862----1330-12275744------------------------------------------- Q gi|254781093|r 104 IAGYSFGAWISMQLLMRRPE----INGF-ISVAPQPK------------------------------------------- 135 (225) Q Consensus 104 l~G~S~Gg~val~~a~~~p~----i~~~-v~isp~~~------------------------------------------- 135 (225) ++||||||.+|..+|.+... +.++ |+-+.++. T Consensus 78 lfGHSmGa~lAfEvArrl~~~g~~p~~lfisg~~aP~~~~~~~i~~~~D~~~l~~l~~lgG~p~e~led~El~~l~LPil 157 (244) T COG3208 78 LFGHSMGAMLAFEVARRLERAGLPPRALFISGCRAPHYDRGKQIHHLDDADFLADLVDLGGTPPELLEDPELMALFLPIL 157 (244) T ss_pred ECCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHCCHHHHHHHHHHH T ss_conf 22025047999999999997699853488856779987566776679989999999985799968945999999998999 Q ss_pred --------CCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHH Q ss_conf --------33322234334773776168655028999999999986077996079997879847558-589999999999 Q gi|254781093|r 136 --------SYDFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHY 206 (225) Q Consensus 136 --------~~~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~f 206 (225) .|.+..-.+...|+.++-|++|..|+.+.+..|.+..++ .+++++++| ||||-+ +.+++.+.+.+. T Consensus 158 RAD~~~~e~Y~~~~~~pl~~pi~~~~G~~D~~vs~~~~~~W~~~t~~----~f~l~~fdG-gHFfl~~~~~~v~~~i~~~ 232 (244) T COG3208 158 RADFRALESYRYPPPAPLACPIHAFGGEKDHEVSRDELGAWREHTKG----DFTLRVFDG-GHFFLNQQREEVLARLEQH 232 (244) T ss_pred HHHHHHHCCCCCCCCCCCCCCEEEECCCCCHHCCHHHHHHHHHHHCC----CCEEEEECC-CCEEHHHHHHHHHHHHHHH T ss_conf 88999862400689987676657752576623009999999996567----841898069-6201254199999999998 Q ss_pred HH Q ss_conf 99 Q gi|254781093|r 207 LD 208 (225) Q Consensus 207 l~ 208 (225) +. T Consensus 233 l~ 234 (244) T COG3208 233 LA 234 (244) T ss_pred HH T ss_conf 42 No 80 >TIGR01607 PST-A Plasmodium subtelomeric family (PST-A); InterPro: IPR006494 These proteins represent a paralogous family of genes found in Plasmodium falciparum and Plasmodium yoelii that are closely related to various phospholipases and lysophospholipases of plants as well as generally being related to the alpha/beta-fold superfamily of hydrolases. These genes are preferentially located in the subtelomeric regions of the chromosomes of both P. falciparum and P. yoelii. . Probab=99.10 E-value=3.7e-11 Score=87.21 Aligned_cols=157 Identities=18% Similarity=0.351 Sum_probs=107.8 Q ss_pred HHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHH---HHHH-HHHHHHHH---------H------HC----------CC Q ss_conf 9999999739849998531558876778521023---8999-99899998---------6------23----------54 Q gi|254781093|r 48 QLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGE---LSDA-AAALDWVQ---------S------LN----------PE 98 (225) Q Consensus 48 ~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E---~~D~-~aa~~~l~---------~------~~----------~~ 98 (225) .+-+.|-+.||+|...|.=|||+|+| |+.-+|- -+|+ -.+++|++ + +. .. T Consensus 65 SWIE~fN~~GYSVYgLDLQGHGESdG-~~nLrgHin~FDDlVyDvIqYmn~I~~si~~E~~dP~~y~~dd~~~~i~~~~~ 143 (379) T TIGR01607 65 SWIEKFNKNGYSVYGLDLQGHGESDG-LQNLRGHINDFDDLVYDVIQYMNRINDSIILENEDPKEYKSDDESYDIVNTKE 143 (379) T ss_pred EEEEECCCCCCEEEEEECCCCCCCCH-HHCCCCCCCCCCCEEHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCC T ss_conf 25651167976387534356762320-33067665644664503789988863321045678300234654341101143 Q ss_pred ---CCCEEEEEECHHHHHHHHHHH---CCC----------------------------------------------CHHE Q ss_conf ---674058887203687876450---386----------------------------------------------2133 Q gi|254781093|r 99 ---SKSCWIAGYSFGAWISMQLLM---RRP----------------------------------------------EING 126 (225) Q Consensus 99 ---~~~i~l~G~S~Gg~val~~a~---~~p----------------------------------------------~i~~ 126 (225) -.|++|+||||||-+||++.- +.. .|+| T Consensus 144 kivrLPmYi~G~SMGGNIaLR~LelL~Ks~EkN~~~~~D~~~~~~~~~~~~D~~N~~D~~~~D~KN~~~YN~~dkLNIKG 223 (379) T TIGR01607 144 KIVRLPMYIVGLSMGGNIALRVLELLNKSKEKNKIKIEDAEDYKKIDDLVNDFINIYDDAIDDIKNDDKYNLNDKLNIKG 223 (379) T ss_pred CEECCCHHHHHCCCCHHHHHHHHHHHCCCCHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCCCCCCCCCCCCE T ss_conf 20014235432054468999999973342100012346411222256677554411465212204588421133686003 Q ss_pred EEECCCCCCCCC-----------------------------------C---CC--------------------------- Q ss_conf 012275744333-----------------------------------2---22--------------------------- Q gi|254781093|r 127 FISVAPQPKSYD-----------------------------------F---SF--------------------------- 141 (225) Q Consensus 127 ~v~isp~~~~~~-----------------------------------~---~~--------------------------- 141 (225) .|++|+..+.-. | .. T Consensus 224 CiSLsGM~Si~~~~s~~~~kfKYfylP~~~~~S~~~Pt~r~~~~~~~ye~~PYVndl~~~DK~R~~g~IT~~~a~eLikA 303 (379) T TIGR01607 224 CISLSGMISIKEVASPDSFKFKYFYLPVMNFMSRIAPTFRISKKKLKYEKFPYVNDLIKFDKYRYDGGITFKLAYELIKA 303 (379) T ss_pred EEECCCCEEEEECCCCCCCCEEEEHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHCCCCEEECCCHHHHHHHHHHHH T ss_conf 56503425762206788731110032278888750577777877775100774011111066434773067889999999 Q ss_pred -------C--CCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC--CHHHHHHHHHHHHH Q ss_conf -------3--4334773776168655028999999999986077996079997879847558--58999999999999 Q gi|254781093|r 142 -------L--APCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG--KVDELINECAHYLD 208 (225) Q Consensus 142 -------l--~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~--~~~~l~~~i~~fl~ 208 (225) + -|-..|+|+||...|-+|-.+-+.-|+.++.-. ..|++.+++-||..+= -.|++.+.|.+|+. T Consensus 304 ~~~L~~di~yIPKdIP~LFiHSk~Dc~C~YeG~~~FyNKl~~~---NKEl~t~edMDH~iTiEPGNE~~L~KiIeWI~ 378 (379) T TIGR01607 304 TDTLNDDIDYIPKDIPILFIHSKDDCICYYEGVVLFYNKLNIS---NKELYTVEDMDHVITIEPGNEKVLKKIIEWIS 378 (379) T ss_pred HHHHHHHHHHCCCCCCEEEEEECCCCEEEECCEEEEEEECCCC---CCCCCCEECCCCEEECCCCCCCHHHHHHHHCC T ss_conf 9876532432567776467854577434456817764202567---74101100676203104787506667763127 No 81 >KOG2237 consensus Probab=99.10 E-value=1.5e-09 Score=77.44 Aligned_cols=210 Identities=17% Similarity=0.213 Sum_probs=130.5 Q ss_pred EEEECCCCCE---EEEEEECCC--CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC- Q ss_conf 0896589961---879996688--8778799995789988889989999999999997398499985315588767785- Q gi|254781093|r 3 EVVFNGPSGR---LEGRYQPST--NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD- 76 (225) Q Consensus 3 ~v~i~g~~G~---l~~~~~~~~--~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~- 76 (225) .++.+..||. +--.|-... ...+|. ++|+|+.||=+++ +.+..--..|.++|+...-.|-||-|+-+-+|- T Consensus 443 r~~~~SkDGt~VPM~Iv~kk~~k~dg~~P~--LLygYGay~isl~-p~f~~srl~lld~G~Vla~a~VRGGGe~G~~WHk 519 (712) T KOG2237 443 RIEVSSKDGTKVPMFIVYKKDIKLDGSKPL--LLYGYGAYGISLD-PSFRASRLSLLDRGWVLAYANVRGGGEYGEQWHK 519 (712) T ss_pred EEEEECCCCCCCCEEEEEECHHHHCCCCCE--EEEEECCCCEEEC-CCCCCCEEEEEECCEEEEEEEECCCCCCCCCHHH T ss_conf 999745888764269998101443079964--9998544542405-5546112588816559999762367623400010 Q ss_pred CCCH-----HHHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCCCHHE-EEECCCCCCCCC----------- Q ss_conf 2102-----389999989999862-3546740588872036878764503862133-012275744333----------- Q gi|254781093|r 77 YGDG-----ELSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRPEING-FISVAPQPKSYD----------- 138 (225) Q Consensus 77 ~g~~-----E~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~-~v~isp~~~~~~----------- 138 (225) .|.. -.+|.++..+||.++ +..++++.+.|+|-||.++..+.-.+|++-+ +|+-.|.+.... T Consensus 520 ~G~lakKqN~f~Dfia~AeyLve~gyt~~~kL~i~G~SaGGlLvga~iN~rPdLF~avia~VpfmDvL~t~~~tilplt~ 599 (712) T KOG2237 520 DGRLAKKQNSFDDFIACAEYLVENGYTQPSKLAIEGGSAGGLLVGACINQRPDLFGAVIAKVPFMDVLNTHKDTILPLTT 599 (712) T ss_pred CCCHHHHCCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCHHHHHHCCCCHHHHHHHHCCCCEEHHHHHCCCCCCCCH T ss_conf 55034310448889999999997587772223675146766266777615906765431168630023440257655606 Q ss_pred -----------------CCCCCCC--------CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCC------CCEEEEEECC Q ss_conf -----------------2223433--------47737761686550289999999999860779------9607999787 Q gi|254781093|r 139 -----------------FSFLAPC--------PSSGLIINGSNDTVATTSDVKDLVNKLMNQKG------ISITHKVIPD 187 (225) Q Consensus 139 -----------------~~~l~~~--------~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~------~~~~~~~i~g 187 (225) ++.+.+. -.++|+..+++|++|++-++.+|.++++.... .++-+.+-.+ T Consensus 600 sd~ee~g~p~~~~~~~~i~~y~pv~~i~~q~~YPS~lvtta~hD~RV~~~~~~K~vAklre~~~~~~~q~~pvll~i~~~ 679 (712) T KOG2237 600 SDYEEWGNPEDFEDLIKISPYSPVDNIKKQVQYPSMLVTTADHDDRVGPLESLKWVAKLREATCDSLKQTNPVLLRIETK 679 (712) T ss_pred HHHCCCCCHHHHHHHHEECCCCCCCCCCHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCHHCCCCEEEEEECC T ss_conf 66245698012114430055676677713213761478611277752432057899999998633520178779998538 Q ss_pred CCCCCCCCH-HHHHHH--HHHHHHHHCCCCC Q ss_conf 984755858-999999--9999997312210 Q gi|254781093|r 188 ANHFFIGKV-DELINE--CAHYLDNSLDEKF 215 (225) Q Consensus 188 a~H~f~~~~-~~l~~~--i~~fl~~~L~~~~ 215 (225) |||+--+.. ..+.+. .-.||.+.++..+ T Consensus 680 agH~~~~~~~k~~~E~a~~yaFl~K~~~~~~ 710 (712) T KOG2237 680 AGHGAEKPRFKQIEEAAFRYAFLAKMLNSDW 710 (712) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 8654677357887888999999998733765 No 82 >TIGR02821 fghA_ester_D S-formylglutathione hydrolase; InterPro: IPR014186 This entry describes proteins from bacteria, yeast, and human, with a conserved critical role in formaldehyde detoxification as S-formylglutathione hydrolase (3.1.2.12 from EC). Eukaryotic members, particularly the human protein, are better known as esterase D (3.1.1.1 from EC), an enzyme with broad specificity, although S-formylglutathione hydrolase has now been demonstrated as well.. Probab=99.10 E-value=7e-10 Score=79.49 Aligned_cols=190 Identities=19% Similarity=0.278 Sum_probs=118.7 Q ss_pred EEEEECCCCCCCC-EEEEECCCCCCCCCCCCHHHHHHHHH-HHHCCCEEEEEEE--CCCCC--CCCCCCCCCHH--HHHH Q ss_conf 7999668887787-99995789988889989999999999-9973984999853--15588--76778521023--8999 Q gi|254781093|r 14 EGRYQPSTNPNAP-IALILHPHPRFGGTMNDNIVYQLFYL-FQQRGFVSLRFNF--RGIGR--SEGEFDYGDGE--LSDA 85 (225) Q Consensus 14 ~~~~~~~~~~~~~-~vv~~Hp~p~~GG~~~~~~~~~la~~-l~~~G~~vl~fd~--RG~G~--S~G~~~~g~~E--~~D~ 85 (225) -++|.||+...++ |-|+- =..+...|.+|-.+-.-+++ -++.|+.++.+|. |+.|= -.-.||.|.|- .-|+ T Consensus 29 F~vf~PPqA~~g~kVPvL~-yLsGLTCT~en~m~Kag~~R~AAe~G~aLVaPDTSPR~~gi~gE~D~wDfG~GAGFYvDA 107 (279) T TIGR02821 29 FAVFLPPQAAAGPKVPVLY-YLSGLTCTEENFMIKAGAQRFAAEYGLALVAPDTSPRGTGIAGEDDEWDFGKGAGFYVDA 107 (279) T ss_pred ECCCCCHHHHCCCCCCEEE-ECCCCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 3146885773389972331-000013115789987778999976890797267898777776786542147987521102 Q ss_pred HH------------HHHHH----HHHCC-CCCCEEEEEECHHHHHHHHHHHCCCCHHE-EEECCCCCC----C------- Q ss_conf 99------------89999----86235-46740588872036878764503862133-012275744----3------- Q gi|254781093|r 86 AA------------ALDWV----QSLNP-ESKSCWIAGYSFGAWISMQLLMRRPEING-FISVAPQPK----S------- 136 (225) Q Consensus 86 ~a------------a~~~l----~~~~~-~~~~i~l~G~S~Gg~val~~a~~~p~i~~-~v~isp~~~----~------- 136 (225) .. +++.| .++.| |..+-.|+||||||.-||..|-|+|++-. +-+-+|-+. - T Consensus 108 t~~PWs~hYRmysYv~~ELpal~~~~fPld~~r~GI~GHSMGGhGAL~~AlknP~~fkSVSAFAPI~~P~~~pWG~KAFs 187 (279) T TIGR02821 108 TEEPWSQHYRMYSYVVQELPALVAAQFPLDGERQGITGHSMGGHGALVIALKNPDRFKSVSAFAPIVAPSRSPWGQKAFS 187 (279) T ss_pred HHCHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCHHHHH T ss_conf 20123320702355888999999985892201160135555467999998618873133124112267887988546786 Q ss_pred ------------CCCCCC----C--CCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHH Q ss_conf ------------332223----4--3347737761686550289999999999860779960799978798475585899 Q gi|254781093|r 137 ------------YDFSFL----A--PCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDE 198 (225) Q Consensus 137 ------------~~~~~l----~--~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~ 198 (225) ||.+.| . ......||=+|..|++..-.---+.++.+.+..+...++..-.|.||.|- T Consensus 188 ~YLG~D~~~W~~yDA~~L~~~~~~~r~~~~ILiDQG~aD~fL~~qL~pd~~e~Ac~~~gq~L~LR~~~GYDHSYy----- 262 (279) T TIGR02821 188 AYLGADEAAWRSYDASLLVAELGAKRFHSTILIDQGLADQFLDEQLRPDALEQACKAAGQALTLRRQAGYDHSYY----- 262 (279) T ss_pred HHCCCCHHHHHHCCHHHHHHHCCCCCCCCCCCEECCCCCHHHHHCCCHHHHHHHHHHHCCCCEEECCCCCCCHHH----- T ss_conf 532778445530386899973466304776532257774534523688899999997089724410013542168----- Q ss_pred HHHHHHHHHHHHCC Q ss_conf 99999999997312 Q gi|254781093|r 199 LINECAHYLDNSLD 212 (225) Q Consensus 199 l~~~i~~fl~~~L~ 212 (225) .|..|++.+|. T Consensus 263 ---Fi~sFi~DHl~ 273 (279) T TIGR02821 263 ---FIASFIEDHLR 273 (279) T ss_pred ---HHHHHHHHHHH T ss_conf ---78848887899 No 83 >pfam06057 VirJ Bacterial virulence protein (VirJ). This family consists of several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localized to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium. Probab=99.07 E-value=3.6e-09 Score=75.22 Aligned_cols=148 Identities=22% Similarity=0.208 Sum_probs=97.4 Q ss_pred HHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHH-HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC--- Q ss_conf 99999999739849998531558876778521023-89999989999862354674058887203687876450386--- Q gi|254781093|r 47 YQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGE-LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP--- 122 (225) Q Consensus 47 ~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E-~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p--- 122 (225) ..++++|++.|+.|+-+|-.=+=-|+-+ ..| ..|+..++++-+++- ..+++.|+|||||+-+.-.+..+.| T Consensus 19 ~~va~~L~~~GvpVvGvdSLrYFW~~kt----P~q~a~Dl~~ii~~Y~~~w-~~~~v~LiGYSFGADvlP~~~n~LP~~~ 93 (192) T pfam06057 19 KEVGSALQKQGVPVVGVDSLRYFWSERT----PEEVADDLDRIIDTYRKRW-KVKNVVLIGYSFGADVLPAAYNRLPPAT 93 (192) T ss_pred HHHHHHHHHCCCCEEEECHHHHHHCCCC----HHHHHHHHHHHHHHHHHHH-CCCEEEEEEECCCCCHHHHHHHHCCHHH T ss_conf 9999999977983655326888756599----8999999999999999985-8965999961787226665461099999 Q ss_pred --CHHEEEECCCCCCCCCCC-------------------CCCC-CCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf --213301227574433322-------------------2343-347737761686550289999999999860779960 Q gi|254781093|r 123 --EINGFISVAPQPKSYDFS-------------------FLAP-CPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISI 180 (225) Q Consensus 123 --~i~~~v~isp~~~~~~~~-------------------~l~~-~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~ 180 (225) .|..+++++|.... +|. .++. .+.+++.|+|++++-.. ++..+...+ T Consensus 94 r~~v~~v~LL~~~~~a-dfeihv~~wlg~~~~~~~~~~pei~~l~~~~v~CiyG~~E~d~~----------cp~l~~~~~ 162 (192) T pfam06057 94 KQRVRQVSLLALGKKA-DFEISVEGWLGWDGEGKADPVPDLARLPPARVQCIYGQDEKDTA----------CPSLRQRGA 162 (192) T ss_pred HHHHHEEEEECCCCCC-EEEEEEHHHCCCCCCCCCCCCHHHHCCCCCEEEEEECCCCCCCC----------CCCCCCCCC T ss_conf 8541178884468853-18998323016788775675467860998706999758878766----------856567887 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 7999787984755858999999999999731 Q gi|254781093|r 181 THKVIPDANHFFIGKVDELINECAHYLDNSL 211 (225) Q Consensus 181 ~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L 211 (225) +.+.+|| ||.|.+..+.+.+.|.+=|++++ T Consensus 163 ~~i~lpG-gHHFd~Dy~~la~~Il~~l~~r~ 192 (192) T pfam06057 163 EVVALPG-GHHFDGDYEALAKRILQGLRARL 192 (192) T ss_pred EEEECCC-CCCCCCCHHHHHHHHHHHHHHCC T ss_conf 7996589-87778759999999999998329 No 84 >PRK11071 esterase YqiA; Provisional Probab=99.06 E-value=1.3e-08 Score=71.87 Aligned_cols=155 Identities=18% Similarity=0.202 Sum_probs=99.6 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHC--CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 7999957899888899899999999999973--98499985315588767785210238999998999986235467405 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQR--GFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCW 103 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~--G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~ 103 (225) +.+|++| +|..+-...=...+.+.+++. ++.+++++.-- .-+++.+-++-+.+... ..++. T Consensus 2 ~~ilYlH---GF~SSp~S~KA~~l~~~~~~~~~~~~~~~P~L~~-------------~P~~A~~~le~li~~~~-~~~~~ 64 (190) T PRK11071 2 STLLYLH---GFNSSPRSAKATLLKNWLAQHHPDIEMIVPQLPP-------------YPADAAELLESIVLEHG-GDSLG 64 (190) T ss_pred CEEEEEC---CCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCC-------------CHHHHHHHHHHHHHHCC-CCCEE T ss_conf 7589951---7788875889999999999609996599189989-------------99999999999998568-99769 Q ss_pred EEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCC-------------------C-----CCCC-------CCCCCEEEE Q ss_conf 88872036878764503862133012275744333-------------------2-----2234-------334773776 Q gi|254781093|r 104 IAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYD-------------------F-----SFLA-------PCPSSGLII 152 (225) Q Consensus 104 l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~-------------------~-----~~l~-------~~~~p~LiI 152 (225) |+|.|+||+-|..+|.+.. +++ |++.|++.-+. + ..|. ..|...+++ T Consensus 65 lIGSSLGGfyAt~Laeky~-~~a-VLINPAV~P~~~L~~~lG~~~n~~t~e~~~~t~~~i~~L~~l~~~~i~~p~~~~lL 142 (190) T PRK11071 65 LVGSSLGGYYATWLSQCFM-LPA-VVVNPAVRPFELLTDYLGQNENPYTGQQYVLESRHIYDLKVMQIDPLESPDLIWLL 142 (190) T ss_pred EEECCHHHHHHHHHHHHHC-CCE-EEECCCCCHHHHHHHHCCCCCCCCCCCEEEECHHHHHHHHHCCCCCCCCCCEEEEE T ss_conf 9955858999999999858-987-99716768478999972987588879758971899999996245566886307999 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 16865502899999999998607799607999787984755858999999999999 Q gi|254781093|r 153 NGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLD 208 (225) Q Consensus 153 hG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~ 208 (225) ..+.|++.+...+.+++.. + ...+.+|+||.|.+.++ ....|.+||. T Consensus 143 lq~GDEvLdyr~a~~~y~~---~-----~~~i~~GGdH~f~~fe~-~l~~I~~Fl~ 189 (190) T PRK11071 143 QQTGDEVLDYRQAVAYYAS---C-----RQTVEEGGNHAFVGFER-YFNQIVDFLG 189 (190) T ss_pred ECCCCCCCCHHHHHHHHCC---C-----EEEEECCCCCCCCHHHH-HHHHHHHHHC T ss_conf 7178850379999998579---8-----17986799877314999-8999999967 No 85 >pfam09752 DUF2048 Uncharacterized conserved protein (DUF2048). The proteins in this family are conserved from plants to vertebrates. The function is unknown. Probab=99.05 E-value=3.2e-08 Score=69.47 Aligned_cols=173 Identities=18% Similarity=0.320 Sum_probs=105.3 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHH--HHHHHHHHHCCCEEEEEE--ECCCCCCCCCCC-----------CCCHHHHHHHH Q ss_conf 778799995789988889989999--999999997398499985--315588767785-----------21023899999 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIV--YQLFYLFQQRGFVSLRFN--FRGIGRSEGEFD-----------YGDGELSDAAA 87 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~--~~la~~l~~~G~~vl~fd--~RG~G~S~G~~~-----------~g~~E~~D~~a 87 (225) +.+|++|-+- |+-|+-.. ..+|.-|.+.|++++... |.|.=+-.+++. .|.+-+....+ T Consensus 89 ~~~~vcihLa------gTGD~~f~rR~~~a~pLlk~gi~siile~pyYG~RkP~~Q~~~~l~~VsDl~~mg~~~i~E~~~ 162 (337) T pfam09752 89 PKRPVCVHLA------GTGDHGFWRRRRLARPLLKEGIGSIILENPFYGLRRPKGQRRSSLRNVSDLFLMGAATILESRA 162 (337) T ss_pred CCCCEEEEEC------CCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 8885499847------8675117889887788986484068861564446698431125221187999977789999999 Q ss_pred HHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEE-ECCC--CCCCC-----------C--------------- Q ss_conf 89999862354674058887203687876450386213301-2275--74433-----------3--------------- Q gi|254781093|r 88 ALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFI-SVAP--QPKSY-----------D--------------- 138 (225) Q Consensus 88 a~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v-~isp--~~~~~-----------~--------------- 138 (225) .+.|+++++. .++.+.|+||||.+|.++++.-|..-++| ++|+ +...+ + T Consensus 163 Ll~w~~~~g~--g~~g~tG~SmGG~mAalaas~~p~Pia~vPclswsSAs~vfteGvLs~~i~W~~L~~q~~~~~~~~~~ 240 (337) T pfam09752 163 LLHWLEREGY--GPLGVTGLSMGGHMAALAASNWPKPLAVVPCLAWSSASSVFTEGVLSHSIAWDALEKQIRDLAAQVES 240 (337) T ss_pred HHHHHHHCCC--CCCEEEEECCCHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHCCCCCCHHHHHCHHHCCHHHHHH T ss_conf 9999997288--86136642331277776640589873120220467773043110203666778872422011001112 Q ss_pred -------------------------CCCCCCC-----CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCC Q ss_conf -------------------------2223433-----4773776168655028999999999986077996079997879 Q gi|254781093|r 139 -------------------------FSFLAPC-----PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDA 188 (225) Q Consensus 139 -------------------------~~~l~~~-----~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga 188 (225) ...+... +....+|..++|.+||-+....+.+--+ + .|...++| T Consensus 241 ~~~~~~~~~~~~~E~~~~m~g~mds~Thl~nf~~P~dp~lii~V~A~~D~YVPR~~v~~Lq~iWP---g--~EiR~l~g- 314 (337) T pfam09752 241 DKLDGAKERGADKEAAREMLGLMDSLTSLTNFPVPKDPSAAIFVAAKDDGYVPRESVATLQEIWP---G--SEVRWLDG- 314 (337) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHCCHHCCCCCCCCCEEEEEEECCCCEECCHHHCCHHHHCC---C--CEEEEEEC- T ss_conf 33331011213689999999876310202127998888617999825871217156360988589---9--73799706- Q ss_pred CC--CCCCCHHHHHHHHHHHHHH Q ss_conf 84--7558589999999999997 Q gi|254781093|r 189 NH--FFIGKVDELINECAHYLDN 209 (225) Q Consensus 189 ~H--~f~~~~~~l~~~i~~fl~~ 209 (225) || +|-.+...+.++|.+-+++ T Consensus 315 GHVsA~l~~q~~fR~aI~dAf~R 337 (337) T pfam09752 315 GHVSAYLLHQDAFRRAIKDALER 337 (337) T ss_pred CCEEEEEECHHHHHHHHHHHHCC T ss_conf 87788874157899999998519 No 86 >COG3545 Predicted esterase of the alpha/beta hydrolase fold [General function prediction only] Probab=99.03 E-value=1.3e-08 Score=71.97 Aligned_cols=161 Identities=18% Similarity=0.169 Sum_probs=107.6 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEE Q ss_conf 79999578998888998999999999999739849998531558876778521023899999899998623546740588 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIA 105 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~ 105 (225) .-+|||| +++||-.+. ..+.+.+.--.+-|++.- +++ ..+++|-.+.++-.... -.++++|+ T Consensus 3 ~~~lIVp---G~~~Sg~~H----Wq~~we~~l~~a~rveq~-------~w~--~P~~~dWi~~l~~~v~a--~~~~~vlV 64 (181) T COG3545 3 TDVLIVP---GYGGSGPNH----WQSRWESALPNARRVEQD-------DWE--APVLDDWIARLEKEVNA--AEGPVVLV 64 (181) T ss_pred CEEEEEC---CCCCCCHHH----HHHHHHHHCCCCHHCCCC-------CCC--CCCHHHHHHHHHHHHHC--CCCCEEEE T ss_conf 5099946---888998127----899987418322112667-------878--88799999999999960--67974999 Q ss_pred EECHHHHHHHHHHHCC-CCHHEEEECCCCCCCC-----------CCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf 8720368787645038-6213301227574433-----------322234334773776168655028999999999986 Q gi|254781093|r 106 GYSFGAWISMQLLMRR-PEINGFISVAPQPKSY-----------DFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLM 173 (225) Q Consensus 106 G~S~Gg~val~~a~~~-p~i~~~v~isp~~~~~-----------~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~ 173 (225) +||+|..++++.+.+. ..|+|+++++|+-... +.....+.+-|.++|+..+|++++++.+..++++.. T Consensus 65 AHSLGc~~v~h~~~~~~~~V~GalLVAppd~~~~~~~~~~~~tf~~~p~~~lpfps~vvaSrnDp~~~~~~a~~~a~~wg 144 (181) T COG3545 65 AHSLGCATVAHWAEHIQRQVAGALLVAPPDVSRPEIRPKHLMTFDPIPREPLPFPSVVVASRNDPYVSYEHAEDLANAWG 144 (181) T ss_pred EECCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHCC T ss_conf 84545799999998642043158995687755555440423444788666678763589726998789899999998602 Q ss_pred HCCCCCEEEEEECCCCCC----CCCCHHHHHHHHHHHHHHH Q ss_conf 077996079997879847----5585899999999999973 Q gi|254781093|r 174 NQKGISITHKVIPDANHF----FIGKVDELINECAHYLDNS 210 (225) Q Consensus 174 ~~~~~~~~~~~i~ga~H~----f~~~~~~l~~~i~~fl~~~ 210 (225) . .++.+..++|+ ..+..++......+|+.+. T Consensus 145 s------~lv~~g~~GHiN~~sG~g~wpeg~~~l~~~~s~~ 179 (181) T COG3545 145 S------ALVDVGEGGHINAESGFGPWPEGYALLAQLLSRA 179 (181) T ss_pred H------HHEECCCCCCCCHHHCCCCCHHHHHHHHHHHHHH T ss_conf 7------6545355664323306887477999999876530 No 87 >KOG3253 consensus Probab=99.00 E-value=2.9e-08 Score=69.77 Aligned_cols=178 Identities=17% Similarity=0.223 Sum_probs=110.9 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCC--CEEEEEEEC-CCCCCCCCCCCCCHH-HHHHHHHHHHH----HH Q ss_conf 77879999578998888998999999999999739--849998531-558876778521023-89999989999----86 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRG--FVSLRFNFR-GIGRSEGEFDYGDGE-LSDAAAALDWV----QS 94 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G--~~vl~fd~R-G~G~S~G~~~~g~~E-~~D~~aa~~~l----~~ 94 (225) +..|++|+.|.-|.. ++-++ -+..+.++|.-.| ..+-.||++ +.| | ...-+ .+-...+..|. .. T Consensus 174 ~~spl~i~aps~p~a-p~tSd-~~~~wqs~lsl~gevvev~tfdl~n~ig---G---~nI~h~ae~~vSf~r~kvlei~g 245 (784) T KOG3253 174 PASPLAIKAPSTPLA-PKTSD-RMWSWQSRLSLKGEVVEVPTFDLNNPIG---G---ANIKHAAEYSVSFDRYKVLEITG 245 (784) T ss_pred CCCCEEEECCCCCCC-CCCCH-HHHHHHHHHHHHCEEEEECCCCCCCCCC---C---CCHHHHHHHHHHHHHHHHHHHHC T ss_conf 577317962688878-76635-7776999875420021000245567777---7---23488888877776543332203 Q ss_pred HCCCCCCEEEEEECHHHHHHHHHHHCCC--CHHEEEECCCCCCCCC------CCCCCCCCCCEEEEECCCCCCCCHHHHH Q ss_conf 2354674058887203687876450386--2133012275744333------2223433477377616865502899999 Q gi|254781093|r 95 LNPESKSCWIAGYSFGAWISMQLLMRRP--EINGFISVAPQPKSYD------FSFLAPCPSSGLIINGSNDTVATTSDVK 166 (225) Q Consensus 95 ~~~~~~~i~l~G~S~Gg~val~~a~~~p--~i~~~v~isp~~~~~~------~~~l~~~~~p~LiIhG~~D~~vp~~~~~ 166 (225) .+ ....|+|+|+|+|+.++.+....+- +|.++|+++-+...-+ .+.|-....|+|+|.|.+|..++++.++ T Consensus 246 ef-pha~IiLvGrsmGAlVachVSpsnsdv~V~~vVCigypl~~vdgprgirDE~Lldmk~PVLFV~Gsnd~mcspn~ME 324 (784) T KOG3253 246 EF-PHAPIILVGRSMGALVACHVSPSNSDVEVDAVVCIGYPLDTVDGPRGIRDEALLDMKQPVLFVIGSNDHMCSPNSME 324 (784) T ss_pred CC-CCCCEEEEECCCCCEEEEEECCCCCCCEEEEEEEECCCCCCCCCCCCCCCHHHHHCCCCEEEEECCCCCCCCHHHHH T ss_conf 37-89855887424473016875125677347899971563567776668862566741796699836875667988999 Q ss_pred HHHHHHHHCCCCCEEEEEECCCCCCCCC-C---------HHHHHHHHHHHHHHHCCC Q ss_conf 9999986077996079997879847558-5---------899999999999973122 Q gi|254781093|r 167 DLVNKLMNQKGISITHKVIPDANHFFIG-K---------VDELINECAHYLDNSLDE 213 (225) Q Consensus 167 ~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~---------~~~l~~~i~~fl~~~L~~ 213 (225) ++.++++. ..++++|.++||.+.- + ..++...+.+||.+.++. T Consensus 325 ~vreKMqA----~~elhVI~~adhsmaipk~k~esegltqseVd~~i~~aI~efvt~ 377 (784) T KOG3253 325 EVREKMQA----EVELHVIGGADHSMAIPKRKVESEGLTQSEVDSAIAQAIKEFVTI 377 (784) T ss_pred HHHHHHHC----CCEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99998625----516999657774424776654444520999999999999999977 No 88 >COG0596 MhpC Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only] Probab=99.00 E-value=1.5e-07 Score=65.46 Aligned_cols=169 Identities=25% Similarity=0.414 Sum_probs=99.8 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHC--CCEEEEEEECCCCCCCCCCCCCCHH-HHHHHHHHHHHHHHCCCCCC Q ss_conf 87999957899888899899999999999973--9849998531558876778521023-89999989999862354674 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQR--GFVSLRFNFRGIGRSEGEFDYGDGE-LSDAAAALDWVQSLNPESKS 101 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~--G~~vl~fd~RG~G~S~G~~~~g~~E-~~D~~aa~~~l~~~~~~~~~ 101 (225) .+.+++.|+.+ ++.... ......+... .|.++.+|.||+|.|. ........ .+|+.. ++..... .+ T Consensus 21 ~~~i~~~hg~~---~~~~~~--~~~~~~~~~~~~~~~~~~~d~~g~g~s~-~~~~~~~~~~~~~~~---~~~~~~~--~~ 89 (282) T COG0596 21 GPPLVLLHGFP---GSSSVW--RPVFKVLPALAARYRVIAPDLRGHGRSD-PAGYSLSAYADDLAA---LLDALGL--EK 89 (282) T ss_pred CCEEEEECCCC---CCHHHH--HHHHHHHHHCCCCCEEEEECCCCCCCCC-CCCCCHHHHHHHHHH---HHHHHCC--CC T ss_conf 97289978987---516677--7655555540467149996788888886-544558889999999---9998288--86 Q ss_pred EEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCC-----------CC-------------C--------------CCC- Q ss_conf 0588872036878764503862-133012275744-----------33-------------3--------------222- Q gi|254781093|r 102 CWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPK-----------SY-------------D--------------FSF- 141 (225) Q Consensus 102 i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~-----------~~-------------~--------------~~~- 141 (225) +.++|||+||.+++.++.+.|+ +++++++++... .. . ... T Consensus 90 ~~l~G~S~Gg~~~~~~~~~~p~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (282) T COG0596 90 VVLVGHSMGGAVALALALRHPDRVRGLVLIGPAPPPGLLEAALRQPAGAAPLAALADLLLGLDAAAFAALLAALGLLAAL 169 (282) T ss_pred EEEEEECHHHHHHHHHHHHCHHHCCEEEEECCCCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 69999886699999999859676267999767654102311111111003444444443423244289998860025666 Q ss_pred -----------------------------------------------CCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHH Q ss_conf -----------------------------------------------343347737761686550289999999999860 Q gi|254781093|r 142 -----------------------------------------------LAPCPSSGLIINGSNDTVATTSDVKDLVNKLMN 174 (225) Q Consensus 142 -----------------------------------------------l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~ 174 (225) +.....|+|+|+|+.|.+.|......+.+.+.. T Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~l~i~g~~d~~~~~~~~~~~~~~~~~ 249 (282) T COG0596 170 AAAARAGLAEALRAPLLGAAAAAFARAARADLAAALLALLDRDLRAALARITVPTLIIHGEDDPVVPAELARRLAAALPN 249 (282) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHCCC T ss_conf 55443455554146678887766555445778887775420025667640799979997788767886889999986788 Q ss_pred CCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHH Q ss_conf 77996079997879847558-58999999999999 Q gi|254781093|r 175 QKGISITHKVIPDANHFFIG-KVDELINECAHYLD 208 (225) Q Consensus 175 ~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl~ 208 (225) ..++.++++++|+... +.+.+.+.+.+|++ T Consensus 250 ----~~~~~~~~~~gH~~~~~~p~~~~~~i~~~~~ 280 (282) T COG0596 250 ----DARLVVIPGAGHFPHLEAPEAFAAALLAFLE 280 (282) T ss_pred ----CCEEEEECCCCCCCCHHCHHHHHHHHHHHHH T ss_conf ----8569998998757311099999999999874 No 89 >KOG1553 consensus Probab=98.97 E-value=1.9e-09 Score=76.84 Aligned_cols=148 Identities=19% Similarity=0.287 Sum_probs=100.0 Q ss_pred ECCCCC-EEEEEEECCC---CC-CCCEEEEECCCCCC--CCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC Q ss_conf 658996-1879996688---87-78799995789988--88998999999999999739849998531558876778521 Q gi|254781093|r 6 FNGPSG-RLEGRYQPST---NP-NAPIALILHPHPRF--GGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYG 78 (225) Q Consensus 6 i~g~~G-~l~~~~~~~~---~~-~~~~vv~~Hp~p~~--GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g 78 (225) |..-|| .+++.+.... .+ .+..||+|-++.+| =|.|+.+ +++||.||-+|.-|.++|+|.. +. T Consensus 219 iks~dgneiDtmF~d~r~n~~~ngq~LvIC~EGNAGFYEvG~m~tP---------~~lgYsvLGwNhPGFagSTG~P-~p 288 (517) T KOG1553 219 IKSSDGNEIDTMFLDGRPNQSGNGQDLVICFEGNAGFYEVGVMNTP---------AQLGYSVLGWNHPGFAGSTGLP-YP 288 (517) T ss_pred EEECCCCCHHHEEECCCCCCCCCCCEEEEEECCCCCCEEEEEECCH---------HHHCCEEECCCCCCCCCCCCCC-CC T ss_conf 8604886213002448877788886389995587652676432476---------8728424315788756667998-76 Q ss_pred CHHHHHHHHHHHHHHH-HCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCC------------------- Q ss_conf 0238999998999986-23546740588872036878764503862133012275744333------------------- Q gi|254781093|r 79 DGELSDAAAALDWVQS-LNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYD------------------- 138 (225) Q Consensus 79 ~~E~~D~~aa~~~l~~-~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~------------------- 138 (225) ..+..-+.+|++|+.+ .+.....|+|.|||.||+.++.+|+.+|+++++|+-+..-+..+ T Consensus 289 ~n~~nA~DaVvQfAI~~Lgf~~edIilygWSIGGF~~~waAs~YPdVkavvLDAtFDDllpLAl~rMP~~~~giV~~aiR 368 (517) T KOG1553 289 VNTLNAADAVVQFAIQVLGFRQEDIILYGWSIGGFPVAWAASNYPDVKAVVLDATFDDLLPLALFRMPTFFSGIVEHAIR 368 (517) T ss_pred CCCHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHCHHHHHHHHHHHHH T ss_conf 41057889999999998189801069997520783677786428874178850310133667765130889999999999 Q ss_pred -------CCCCCCCCCCEEEEECCCCCCCCHH Q ss_conf -------2223433477377616865502899 Q gi|254781093|r 139 -------FSFLAPCPSSGLIINGSNDTVATTS 163 (225) Q Consensus 139 -------~~~l~~~~~p~LiIhG~~D~~vp~~ 163 (225) ...+....-|+++|--++|+++... T Consensus 369 nh~NLnnaell~ry~GPi~lIRRt~dEIitt~ 400 (517) T KOG1553 369 NHMNLNNAELLARYKGPIRLIRRTQDEIITTA 400 (517) T ss_pred HHCCCCHHHHHHHHCCCHHHHHHHHHHHHHCC T ss_conf 86065469999760596147665237660056 No 90 >COG1073 Hydrolases of the alpha/beta superfamily [General function prediction only] Probab=98.94 E-value=2e-08 Score=70.77 Aligned_cols=193 Identities=19% Similarity=0.244 Sum_probs=110.2 Q ss_pred EEEEEEECCC-C---CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEE----------EECCCCCCCCCC-- Q ss_conf 1879996688-8---77879999578998888998999999999999739849998----------531558876778-- Q gi|254781093|r 12 RLEGRYQPST-N---PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRF----------NFRGIGRSEGEF-- 75 (225) Q Consensus 12 ~l~~~~~~~~-~---~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~f----------d~RG~G~S~G~~-- 75 (225) .+.+.++-+. . ..-|.+++.| ++++...... ..+..++..++.++.+ +.+|++.+.+.. T Consensus 32 ~~~~~l~~p~~~~~~~~~p~v~~~h---~~~~~~~~~~--~~~~~l~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 106 (299) T COG1073 32 ALAAVLHLPPSGNEEKKLPAVVFLH---GFGSSKEQSL--GYAVLLAEKGYRVLAGDASLFGESGGDPRGLADSEGYAED 106 (299) T ss_pred CEEEEEEECCCCCCCCCCCEEEECC---CCCCCCCCCH--HHHHHHCCCCEEECCCCCCCCCCCEEEECCCCCCHHHHHH T ss_conf 1257998568767677778678715---6554214430--3556430561453024233356616882477430345555 Q ss_pred -CC--------------------------CCHHHHHHHHHHHHHHHHC--CCCCCEEEEEECHHHHHHHHHHHCC----- Q ss_conf -52--------------------------1023899999899998623--5467405888720368787645038----- Q gi|254781093|r 76 -DY--------------------------GDGELSDAAAALDWVQSLN--PESKSCWIAGYSFGAWISMQLLMRR----- 121 (225) Q Consensus 76 -~~--------------------------g~~E~~D~~aa~~~l~~~~--~~~~~i~l~G~S~Gg~val~~a~~~----- 121 (225) .. ..+...|...+..|+.... .+..++.+.|.|+|+..++...... T Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~g~s~g~~~~~~~~~~~~~~~~ 186 (299) T COG1073 107 FSAAVLLLLSEGVLDKDYRLLGASLGPRILAGLSLGGPSAGALLAWGPTRLDASRIVVWGESLGGALALLLLGANPELAR 186 (299) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHCCCCCCCCCEEEEEECCCCHHHHHHHCCCCHHHC T ss_conf 54310101222466687775412258713678993351699998636454333440588623562888865200202111 Q ss_pred CCHHEEEECCCCCCC--------------------CCCCCCCCCC-CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 621330122757443--------------------3322234334-7737761686550289999999999860779960 Q gi|254781093|r 122 PEINGFISVAPQPKS--------------------YDFSFLAPCP-SSGLIINGSNDTVATTSDVKDLVNKLMNQKGISI 180 (225) Q Consensus 122 p~i~~~v~isp~~~~--------------------~~~~~l~~~~-~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~ 180 (225) ..+..++.-.+.... ..++.+.... .|+|++||++|++||......++++.... +. T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~i~~~P~l~~~G~~D~~vp~~~~~~~~~~~~~~---~~ 263 (299) T COG1073 187 ELIDYLITPGGFAPLPAPEAPLDTLPLRAVLLLLLDPFDDAEKISPRPVLLVHGERDEVVPLRDAEDLYEAARER---PK 263 (299) T ss_pred CCCCEEECCCCCCCCCCCCCHHCCCCCCHHHHCCCCCHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHCC---CC T ss_conf 335478726864566555300013333103431557411455306786299977778725889987899974316---84 Q ss_pred EEEEECCCCCCCCC-C---HHHHHHHHHHHHHHHCC Q ss_conf 79997879847558-5---89999999999997312 Q gi|254781093|r 181 THKVIPDANHFFIG-K---VDELINECAHYLDNSLD 212 (225) Q Consensus 181 ~~~~i~ga~H~f~~-~---~~~l~~~i~~fl~~~L~ 212 (225) ....+++++|.... . ..+..+.+.+|+.+.+. T Consensus 264 ~~~~~~~~~H~~~~~~~~~~~~~~~~~~~f~~~~l~ 299 (299) T COG1073 264 KLLFVPGGGHIDLYDNPPAVEQALDKLAEFLERHLL 299 (299) T ss_pred EEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 698438965322467745889999999999997519 No 91 >KOG4627 consensus Probab=98.93 E-value=2e-08 Score=70.78 Aligned_cols=172 Identities=16% Similarity=0.242 Sum_probs=117.8 Q ss_pred CCCCCE-EEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHH Q ss_conf 589961-8799966888778799995789988889989999999999997398499985315588767785210238999 Q gi|254781093|r 7 NGPSGR-LEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDA 85 (225) Q Consensus 7 ~g~~G~-l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~ 85 (225) -|+.|+ +.-+|-+ ....++.|+.|+-=+.-|++ +-....+.-+.++||.|...+|-=+ .+++. -..-..|+ T Consensus 50 Yg~~g~q~VDIwg~--~~~~klfIfIHGGYW~~g~r--k~clsiv~~a~~~gY~vasvgY~l~--~q~ht--L~qt~~~~ 121 (270) T KOG4627 50 YGEGGRQLVDIWGS--TNQAKLFIFIHGGYWQEGDR--KMCLSIVGPAVRRGYRVASVGYNLC--PQVHT--LEQTMTQF 121 (270) T ss_pred CCCCCCEEEEEECC--CCCCCEEEEEECCHHHCCCH--HCCCCHHHHHHHCCEEEEEECCCCC--CCCCC--HHHHHHHH T ss_conf 57887658987657--77760899993661330752--1030201156646707998346767--65332--99999999 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHC--CCCHHEEEECCCCC-----------------------CCCCCC Q ss_conf 99899998623546740588872036878764503--86213301227574-----------------------433322 Q gi|254781093|r 86 AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR--RPEINGFISVAPQP-----------------------KSYDFS 140 (225) Q Consensus 86 ~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~--~p~i~~~v~isp~~-----------------------~~~~~~ 140 (225) ...++|+.+..+..+.+.+.|||-|+..|+++.+| .|.|.|+++.+..- ...++. T Consensus 122 ~~gv~filk~~~n~k~l~~gGHSaGAHLa~qav~R~r~prI~gl~l~~GvY~l~EL~~te~g~dlgLt~~~ae~~Scdl~ 201 (270) T KOG4627 122 THGVNFILKYTENTKVLTFGGHSAGAHLAAQAVMRQRSPRIWGLILLCGVYDLRELSNTESGNDLGLTERNAESVSCDLW 201 (270) T ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCHH T ss_conf 97788999865664269982552688999999987408257889887437639987277632213755000232374477 Q ss_pred CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 234334773776168655028999999999986077996079997879847 Q gi|254781093|r 141 FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF 191 (225) Q Consensus 141 ~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~ 191 (225) .+..+..+.|++.+++|.---+++.+++++.+.++ .+..+++.+|+ T Consensus 202 ~~~~v~~~ilVv~~~~espklieQnrdf~~q~~~a-----~~~~f~n~~hy 247 (270) T KOG4627 202 EYTDVTVWILVVAAEHESPKLIEQNRDFADQLRKA-----SFTLFKNYDHY 247 (270) T ss_pred HHCCCEEEEEEEEECCCCCHHHHHHHHHHHHHHHC-----CEEECCCCCHH T ss_conf 74186035667662246817888523689885012-----00013785145 No 92 >KOG2624 consensus Probab=98.89 E-value=2.1e-07 Score=64.59 Aligned_cols=203 Identities=24% Similarity=0.261 Sum_probs=136.0 Q ss_pred EEEECCCCCEEEEEEECCC-CCCCCEEEEECCCCCCCCCCCCHH----HHHHHHHHHHCCCEEEEEEECCCCCCCC---- Q ss_conf 0896589961879996688-877879999578998888998999----9999999997398499985315588767---- Q gi|254781093|r 3 EVVFNGPSGRLEGRYQPST-NPNAPIALILHPHPRFGGTMNDNI----VYQLFYLFQQRGFVSLRFNFRGIGRSEG---- 73 (225) Q Consensus 3 ~v~i~g~~G~l~~~~~~~~-~~~~~~vv~~Hp~p~~GG~~~~~~----~~~la~~l~~~G~~vl~fd~RG~G~S~G---- 73 (225) +-...+.||=+-++-..|. +.++|+|.+.|+. + .+.++.+ -..++-.|+++||.|..=|.||.-=|.. T Consensus 50 ~h~V~T~DgYiL~lhRIp~~~~~rp~Vll~HGL--l-~sS~~Wv~n~p~~sLaf~LadaGYDVWLgN~RGn~ySr~h~~l 126 (403) T KOG2624 50 EHEVTTEDGYILTLHRIPRGKKKRPVVLLQHGL--L-ASSSSWVLNGPEQSLAFLLADAGYDVWLGNNRGNTYSRKHKKL 126 (403) T ss_pred EEEEECCCCEEEEEEEECCCCCCCCCEEEEECC--C-CCCCCCEECCCCCCHHHHHHHCCCCEEEECCCCCCCCHHHEEC T ss_conf 999874687499885101788999958996034--0-0156012368543589999976885121237576004432503 Q ss_pred ------C-CCCCCHH--HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC----HHEEEECCCCCCCC--- Q ss_conf ------7-8521023--899999899998623546740588872036878764503862----13301227574433--- Q gi|254781093|r 74 ------E-FDYGDGE--LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE----INGFISVAPQPKSY--- 137 (225) Q Consensus 74 ------~-~~~g~~E--~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~----i~~~v~isp~~~~~--- 137 (225) + |+...-| ..|+-|.+||+.+.. .-.++..+|||.|+.+.....+.+|+ |+.+++++|++... T Consensus 127 ~~~~~~~FW~FS~~Em~~yDLPA~IdyIL~~T-~~~kl~yvGHSQGtt~~fv~lS~~p~~~~kI~~~~aLAP~~~~k~~~ 205 (403) T KOG2624 127 SPSSDKEFWDFSWHEMGTYDLPAMIDYILEKT-GQEKLHYVGHSQGTTTFFVMLSERPEYNKKIKSFIALAPAAFPKHIK 205 (403) T ss_pred CCCCCCCEEECCHHHHHHCCHHHHHHHHHHHC-CCCCEEEEEEECCCHHHEEHHCCCCHHHHHHHEEEEECCHHHHCCCC T ss_conf 77677352433155412257889999999715-65605899987562222001003612664445034536432201244 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q gi|254781093|r 138 -------------------------------------------------------------------------------- 137 (225) Q Consensus 138 -------------------------------------------------------------------------------- 137 (225) T Consensus 206 ~~~~~~~~~~~~~~~~~~~~fg~~~f~p~~~~~~~~~~~~C~~~~~~~~lC~~~~~~~~G~~~~~~n~~~~~~~~~h~pa 285 (403) T KOG2624 206 SLLNKFLDPFLGAFSLLPLLFGRKEFLPSNLFIKKFARKICSGSKIFADLCSNFLFLLVGWNSNNWNTTLLPVYLAHLPA 285 (403) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHCCCCHHHCCCCC T ss_conf 68887556531115678886487210503567999999984333678999999999980963676420013012104787 Q ss_pred ----------------------CCC----------------CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCC Q ss_conf ----------------------322----------------234334773776168655028999999999986077996 Q gi|254781093|r 138 ----------------------DFS----------------FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGIS 179 (225) Q Consensus 138 ----------------------~~~----------------~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~ 179 (225) |+. .+.....|+-+.+|+.|-.+.++.+..+...+..... T Consensus 286 gtSvk~~~H~~Q~~~s~~f~~yD~G~~~N~~~Y~q~~pP~Y~l~~i~~P~~l~~g~~D~l~~~~DV~~~~~~~~~~~~-- 363 (403) T KOG2624 286 GTSVKNIVHWAQIVRSGKFRKYDYGSKRNLKHYGQSTPPEYDLTNIKVPTALYYGDNDWLADPEDVLILLLVLPNSVI-- 363 (403) T ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHCCCCCC-- T ss_conf 740888999998722797214678861137661899999877444766879996687552788999999985545300-- Q ss_pred EEEEEECCCCCC--CCC--CHHHHHHHHHHHHHHHC Q ss_conf 079997879847--558--58999999999999731 Q gi|254781093|r 180 ITHKVIPDANHF--FIG--KVDELINECAHYLDNSL 211 (225) Q Consensus 180 ~~~~~i~ga~H~--f~~--~~~~l~~~i~~fl~~~L 211 (225) ....-+++-+|+ ..+ -.+++.+.|.+.+++.. T Consensus 364 ~~~~~~~~ynHlDFi~g~da~~~vy~~vi~~~~~~~ 399 (403) T KOG2624 364 KYIVPIPEYNHLDFIWGLDAKEEVYDPVIERLRLFE 399 (403) T ss_pred CCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHH T ss_conf 021257875533322046708999999999987431 No 93 >pfam06028 DUF915 Alpha/beta hydrolase of unknown function (DUF915). This family consists of several bacterial proteins of unknown function. Members of this family have an alpha/beta hydrolase fold. Probab=98.86 E-value=4.4e-07 Score=62.63 Aligned_cols=176 Identities=18% Similarity=0.220 Sum_probs=110.5 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHH-HCCC--EEEEEEEC--CCCCCCCCC-------------CCC-CHH--- Q ss_conf 7879999578998888998999999999999-7398--49998531--558876778-------------521-023--- Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQ-QRGF--VSLRFNFR--GIGRSEGEF-------------DYG-DGE--- 81 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~-~~G~--~vl~fd~R--G~G~S~G~~-------------~~g-~~E--- 81 (225) .-|+ |+.| ++||+.+. +..+.+.+. +.|. .+++.+-. |.=.-+|++ +.. .+. T Consensus 6 ~tPT-ifih---G~~g~~~s--~~~mv~~~~~k~g~~~~~l~v~V~~~G~v~~~G~~~~~~~nPiIqV~F~dN~~~~~~~ 79 (249) T pfam06028 6 MIPT-IFIH---GSGGTASS--FNFMVNQLLNKYGVGNKSLTAYVSKDGSVKVEGKISKDAKNPIIKVGFEDNRNASFDK 79 (249) T ss_pred CCCE-EEEC---CCCCCCHH--HHHHHHHHHHHCCCCCEEEEEEECCCCCEEEEEEECCCCCCCEEEEEECCCCCCCHHH T ss_conf 8986-9989---88887055--9999999987348886159999948982999877057787987999934588899899 Q ss_pred -HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHC------CCCHHEEEECCCCCCCCCC--------------- Q ss_conf -899999899998623546740588872036878764503------8621330122757443332--------------- Q gi|254781093|r 82 -LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR------RPEINGFISVAPQPKSYDF--------------- 139 (225) Q Consensus 82 -~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~------~p~i~~~v~isp~~~~~~~--------------- 139 (225) ..=+..|+.+|++++ .-+++-++||||||...+.++.. .|.+..+|+++.|.+.... T Consensus 80 qa~wl~~vl~~L~~~Y-~~~~~N~VGHSmGg~~~~~yl~~~~~~~~~P~l~K~V~Ia~pfng~~~~~~~~~~~~~~~~~p 158 (249) T pfam06028 80 QAKWLKKVMSYLKSNY-NFKKFNAVGHSMGGLAATYYLEDYGNDKSLPQLNKLVTIGGPFNGETNNDLGQTLAIVLKDGP 158 (249) T ss_pred HHHHHHHHHHHHHHHC-CCCEEEEEEECCCHHHHHHHHHHHCCCCCCCCHHEEEEECCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9999999999999854-975677998766439999999984467788623159996268575123677532111246898 Q ss_pred -------CCC-------CCCCCCEEEEECC------CCCCCCHHHHHHHHHHHHHCCCCCEEEEEECC--CCCCCCCCHH Q ss_conf -------223-------4334773776168------65502899999999998607799607999787--9847558589 Q gi|254781093|r 140 -------SFL-------APCPSSGLIINGS------NDTVATTSDVKDLVNKLMNQKGISITHKVIPD--ANHFFIGKVD 197 (225) Q Consensus 140 -------~~l-------~~~~~p~LiIhG~------~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~g--a~H~f~~~~~ 197 (225) ..+ -|....+|-|.|+ .|..||...+..+.--+. ......+-++|+| |.|.---..+ T Consensus 159 ~~~~~~y~~l~~~~~~~~p~~i~vLnI~G~~~~g~~sDg~V~~~Ss~sl~yl~~-~~~~~Y~e~~v~G~~a~HS~LheN~ 237 (249) T pfam06028 159 KNKTPMYDYLIDNYKKKIPSDLEVLNIAGDLDNGKQTDGIVPWASSLSGKYLFQ-KNAKSYTEHLITGKEAGHSKLHENP 237 (249) T ss_pred CCCCHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCCCEEEHHHHHHHHHHHC-CCCCCEEEEEEECCCCCCCCCCCCH T ss_conf 646999999998665248999789998423688998886264899999999861-5668539999958997427687888 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254781093|r 198 ELINECAHYL 207 (225) Q Consensus 198 ~l~~~i~~fl 207 (225) ++.+.|.+|| T Consensus 238 ~V~~~I~~FL 247 (249) T pfam06028 238 EVDKYVKQFL 247 (249) T ss_pred HHHHHHHHHC T ss_conf 9999999851 No 94 >TIGR01836 PHA_synth_III_C poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit; InterPro: IPR010125 This entry represents the PhaC subunit of a heterodimeric form of polyhydroxyalkanoic acid (PHA) synthase. Excepting the PhaC of Bacillus megaterium (which needs PhaR), all members require PhaE (IPR010123 from INTERPRO) for activity and are designated class III. This enzyme builds ester polymers for carbon and energy storage that accumulate in inclusions, and both this enzyme and the depolymerase associate with the inclusions. Class III enzymes polymerise short-chain-length hydroxyalkanoates.; GO: 0008415 acyltransferase activity, 0042619 poly-hydroxybutyrate biosynthetic process. Probab=98.86 E-value=1e-08 Score=72.45 Aligned_cols=152 Identities=17% Similarity=0.180 Sum_probs=112.8 Q ss_pred HHHHHHHHCCCEEEEEEECCCCCCCCCCCCC--CHHHHH-----HHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHC Q ss_conf 9999999739849998531558876778521--023899-----999899998623546740588872036878764503 Q gi|254781093|r 48 QLFYLFQQRGFVSLRFNFRGIGRSEGEFDYG--DGELSD-----AAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR 120 (225) Q Consensus 48 ~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g--~~E~~D-----~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~ 120 (225) .+-+-|.+.|+.|...|= |..|.. .--++| +..+++++++.+ +...|=|.|+|+||.+++.+|+- T Consensus 94 Slv~~LL~~G~DVYLiDW-------GyP~~~D~~ltLdDY~~~YI~~cV~~i~~~~-~~d~i~lLG~CqGGTfsL~yAaL 165 (367) T TIGR01836 94 SLVRGLLERGQDVYLIDW-------GYPDRADRYLTLDDYVNGYIDKCVDYICRTS-KLDQISLLGICQGGTFSLCYAAL 165 (367) T ss_pred HHHHHHHHCCCCEEEEEC-------CCCCHHHHCCCHHHHHHHHHHHHHHHHHHHC-CCCCCCEEEEECCHHHHHHHHHH T ss_conf 688888856984899606-------8987333114046667677899999998650-78961043143104789999972 Q ss_pred CC-C-HHEEEECCCCCCCCCC--------------------------------------C-------------------- Q ss_conf 86-2-1330122757443332--------------------------------------2-------------------- Q gi|254781093|r 121 RP-E-INGFISVAPQPKSYDF--------------------------------------S-------------------- 140 (225) Q Consensus 121 ~p-~-i~~~v~isp~~~~~~~--------------------------------------~-------------------- 140 (225) .+ . |+-+|.+.+|+..... . T Consensus 166 ~~~kPiKNLvtMv~PVDF~~p~~~l~~~~GCtlGa~~~DiDl~VdtmGNiP~elln~~flmLKPf~l~~qKY~~L~d~~~ 245 (367) T TIGR01836 166 YPDKPIKNLVTMVTPVDFETPGNLLSNWDGCTLGARHVDIDLAVDTMGNIPGELLNLTFLMLKPFSLGYQKYVNLVDILE 245 (367) T ss_pred CCCCCCCCEEEECCCCCCCCCCHHHHHCCCCEECHHHCCHHHHHHHCCCCCHHHHHHHHHHCCCHHCCCCCCCCCHHHCC T ss_conf 50478322178726625658733765416731001121255677520687677886575521511206897301021026 Q ss_pred ---------------------------------------------------CCCCCCCCEEEEECCCCCCCCHHHHHHHH Q ss_conf ---------------------------------------------------23433477377616865502899999999 Q gi|254781093|r 141 ---------------------------------------------------FLAPCPSSGLIINGSNDTVATTSDVKDLV 169 (225) Q Consensus 141 ---------------------------------------------------~l~~~~~p~LiIhG~~D~~vp~~~~~~~~ 169 (225) .|.....|+|=|.+++|..||+..+.+|- T Consensus 246 D~~~v~nFlRMEKWifDsPd~AGEafRQf~kdFYQ~N~li~G~~~iGg~~vdL~Ni~~PvLNiyA~~DHLV~P~~S~aL~ 325 (367) T TIGR01836 246 DESKVENFLRMEKWIFDSPDQAGEAFRQFVKDFYQQNGLINGEVEIGGRKVDLKNIKMPVLNIYAERDHLVPPDASKALN 325 (367) T ss_pred CHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCEECEEEECCEEECCEECCCCHHHHHHCCCCCCCCHHHHHHC T ss_conf 85898626555223217888321358999999987166142407986545321225655125643347755516999745 Q ss_pred HHHHHCCCCCEEEEEECCCCC---CCCCC-HHHHHHHHHHHHHHH Q ss_conf 998607799607999787984---75585-899999999999973 Q gi|254781093|r 170 NKLMNQKGISITHKVIPDANH---FFIGK-VDELINECAHYLDNS 210 (225) Q Consensus 170 ~~l~~~~~~~~~~~~i~ga~H---~f~~~-~~~l~~~i~~fl~~~ 210 (225) ....+ ..+.+...++ +|| .++++ +.|+.=+|.+||+++ T Consensus 326 ~~v~s--n~Dyt~~~F~-~GHiG~yvSgkaQkev~P~Ig~WL~~R 367 (367) T TIGR01836 326 DLVSS--NEDYTELSFP-GGHIGIYVSGKAQKEVPPAIGKWLQER 367 (367) T ss_pred CCCCC--CCCCEECCCC-CCEEEEEECCEEEECCCHHHHHHHHCC T ss_conf 67543--3662100479-843899977853313682688885139 No 95 >PRK08775 homoserine O-acetyltransferase; Provisional Probab=98.84 E-value=4.8e-07 Score=62.45 Aligned_cols=186 Identities=18% Similarity=0.200 Sum_probs=115.6 Q ss_pred EEEEEECCCCCCCCEEEEECCCCCCCCCCC------CH------HHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCH Q ss_conf 879996688877879999578998888998------99------999999999973984999853155887677852102 Q gi|254781093|r 13 LEGRYQPSTNPNAPIALILHPHPRFGGTMN------DN------IVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDG 80 (225) Q Consensus 13 l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~------~~------~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~ 80 (225) +...|.--+..+.+++++||... |+.+ ++ -+.---+.+--.-|-|+++|+-|.- .+.+. .- T Consensus 46 ~~~~yet~G~~~~navlv~HaLt---g~~H~ag~~~~~~~GWWd~liGpGkaiDT~ryfVic~N~lG~~--g~~~p--~i 118 (343) T PRK08775 46 LRLRYELIGPANAPVVFVAGGIS---AHRHVAATANFPEKGWVEGLVGSGRALDPARRRLLAFDFLGAD--GSLDA--PI 118 (343) T ss_pred EEEEEEEECCCCCCEEEEECCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCC--CCCCC--CC T ss_conf 37777652478998899907867---7300258998998978523538998847887699974677888--76789--98 Q ss_pred HHHHHHHHHHHHHHHCCCCCCE-EEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCC-------------------- Q ss_conf 3899999899998623546740-588872036878764503862-133012275744333-------------------- Q gi|254781093|r 81 ELSDAAAALDWVQSLNPESKSC-WIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYD-------------------- 138 (225) Q Consensus 81 E~~D~~aa~~~l~~~~~~~~~i-~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~-------------------- 138 (225) .+.|...+..-+.+. ...+++ .++|-||||+.|+.-|..+|+ ++.+|.++....... T Consensus 119 t~~D~vraq~~Lld~-LGI~~l~AVIGGSmGGMqALeWA~~yPd~V~~~i~IAt~~r~sa~~IA~~~vqR~AI~~d~~~~ 197 (343) T PRK08775 119 DTADQADAIAALLDA-LGIRRLHGFVGYSYGALVGLQFAIRHAARVRTLVVVSGAHRAHPYAAAWRALQRRAVALGQLQC 197 (343) T ss_pred CHHHHHHHHHHHHHH-CCCCEEEEEECCCHHHHHHHHHHHHCHHHHHCEEEEECCCCCCHHHHHHHHHHHHHHHCCHHCC T ss_conf 678999999999986-2988674231578899999999986929663176751475279999999999999997180015 Q ss_pred --------------------------C----------------------------------------------CCCCCCC Q ss_conf --------------------------2----------------------------------------------2234334 Q gi|254781093|r 139 --------------------------F----------------------------------------------SFLAPCP 146 (225) Q Consensus 139 --------------------------~----------------------------------------------~~l~~~~ 146 (225) | -.++... T Consensus 198 ~g~~GLalARmia~iTYrS~~~~~~RF~~~~~~~~~~~~f~VEsYL~yqG~kf~~RFDaNsYl~LtrAmD~hd~~~~~i~ 277 (343) T PRK08775 198 AEHHGLALARQFAMLSYRTPEEFEERFDAPPEVINGRVRVAAEDYLDAAGAQYVARTPVNAYLRLSESIDLHRVDPAQVT 277 (343) T ss_pred CCCCHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHCC T ss_conf 86407999999988871699999987588743447653201988999738999875690489999999875355565477 Q ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC-CCCC-CCCCCHHHHHHHHHHHHHHH Q ss_conf 7737761686550289999999999860779960799978-7984-75585899999999999973 Q gi|254781093|r 147 SSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIP-DANH-FFIGKVDELINECAHYLDNS 210 (225) Q Consensus 147 ~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~-ga~H-~f~~~~~~l~~~i~~fl~~~ 210 (225) .|+|+|-=+.|...|+++.+++++.+.. ..++++|+ ..+| .|--..+++...+.+||+.. T Consensus 278 a~~lvi~i~SD~LfP~~e~~~l~~~l~~----~~~~~eI~S~~GHDaFLiE~e~v~~il~~fL~~~ 339 (343) T PRK08775 278 VPTVVVAVEGDRLVPLADLVALVEGLGP----RGSLRVLRSPYGHDAFLKEIDRIDAILTTALRST 339 (343) T ss_pred CCEEEEEECCCCCCCHHHHHHHHHHCCC----CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 8879999578605799999999986588----9439998899983720067999999999999853 No 96 >TIGR01840 esterase_phb esterase, PHB depolymerase family; InterPro: IPR010126 This entry describes a group of lipases of the ab-hydrolase family. They include bacterial depolymerases for poly(3-hydroxybutyrate) (PHB) and related polyhydroxyalkanoates (PHA), as well as acetyl xylan esterases, and feruloyl esterases from fungi.; GO: 0005576 extracellular region. Probab=98.81 E-value=7.9e-08 Score=67.14 Aligned_cols=150 Identities=18% Similarity=0.276 Sum_probs=102.8 Q ss_pred CCCCCEEEEECCCCCCCCCCCC---HHH------HHHHHHHHHCCCEEEEEEECCCCCC----CCCC--------CCCCH Q ss_conf 8778799995789988889989---999------9999999973984999853155887----6778--------52102 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMND---NIV------YQLFYLFQQRGFVSLRFNFRGIGRS----EGEF--------DYGDG 80 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~---~~~------~~la~~l~~~G~~vl~fd~RG~G~S----~G~~--------~~g~~ 80 (225) ..++++||.+| |+... .++ +.+-+.-.+-||.+|.++--+.-.| .|=| .++.| T Consensus 15 ~~~raLV~~LH------GC~QTAs~~~~~~GdnGyNW~~~A~~YGf~~vaP~~~~~~~~~~~~sGCw~wf~~~h~~R~~G 88 (231) T TIGR01840 15 TGKRALVLALH------GCGQTASAYVIDKGDNGYNWKAAADKYGFVVVAPEQTSSNSSNKQASGCWDWFETEHRARGTG 88 (231) T ss_pred CCCCCEEEEEC------CCCCCCCHHHHHCCCCCHHHHHHHHHCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 68973477622------774111125451688762078898558968861420276665423455657777555677776 Q ss_pred HHHHHHHHHHHHHH--H-CCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEE-CC-CCCCC-----------C------- Q ss_conf 38999998999986--2-3546740588872036878764503862133012-27-57443-----------3------- Q gi|254781093|r 81 ELSDAAAALDWVQS--L-NPESKSCWIAGYSFGAWISMQLLMRRPEINGFIS-VA-PQPKS-----------Y------- 137 (225) Q Consensus 81 E~~D~~aa~~~l~~--~-~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~-is-p~~~~-----------~------- 137 (225) |..++...++.++. . ..|++||+|.|.|-||+++.-+++..|++-+-.+ .| |+... + T Consensus 89 e~~~l~~li~~~~~~~~~~IDP~rvyvTGLSaGGgmT~V~~~t~PdvFAg~A~~aG~pyG~a~~~~~~~~~~~~~~~~~~ 168 (231) T TIGR01840 89 EVVSLKQLIDAVKADTNYSIDPNRVYVTGLSAGGGMTAVLGATYPDVFAGGASNAGLPYGEASSSISALPKMSHSTAKTA 168 (231) T ss_pred CHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHHHCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCH T ss_conf 52789999999734778863885458842142389999887405635531011257874654210005778888766327 Q ss_pred ----------CCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf ----------3222343347737761686550289999999999860779 Q gi|254781093|r 138 ----------DFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKG 177 (225) Q Consensus 138 ----------~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~ 177 (225) .++.-.+.|+|.-|+||++|-+|-+.+.....++....-+ T Consensus 169 ~~~~~~~~g~~s~~~~~~pIP~~v~~G~~D~TV~~~~~~~~~d~~~~~~g 218 (231) T TIGR01840 169 AEVGRLVRGSQSEYNGPTPIPMSVVHGDADYTVLPGNAEEIVDAMLKVYG 218 (231) T ss_pred HHHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHCC T ss_conf 88878760665677778886127877785750251001268999998728 No 97 >COG4099 Predicted peptidase [General function prediction only] Probab=98.80 E-value=2.9e-08 Score=69.74 Aligned_cols=178 Identities=20% Similarity=0.262 Sum_probs=108.6 Q ss_pred CCCCC-EEEEEEECCCC----CCC-CEEEEECCCCCCCCCCCCHHHH----HHHHHHHHCCCEEEEEEECCC-CCCCCCC Q ss_conf 58996-18799966888----778-7999957899888899899999----999999973984999853155-8876778 Q gi|254781093|r 7 NGPSG-RLEGRYQPSTN----PNA-PIALILHPHPRFGGTMNDNIVY----QLFYLFQQRGFVSLRFNFRGI-GRSEGEF 75 (225) Q Consensus 7 ~g~~G-~l~~~~~~~~~----~~~-~~vv~~Hp~p~~GG~~~~~~~~----~la~~l~~~G~~vl~fd~RG~-G~S~G~~ 75 (225) +...| .|...+.-|++ ..- |.++++|+- +.+|+-+..++. .++++..+-+|=||.+-|--. -.|+-.. T Consensus 167 d~~tgneLkYrly~Pkdy~pdkky~PLvlfLHga-gq~g~dn~~~l~sg~gaiawa~pedqcfVlAPQy~~if~d~e~~t 245 (387) T COG4099 167 DESTGNELKYRLYTPKDYAPDKKYYPLVLFLHGA-GQGGSDNDKVLSSGIGAIAWAGPEDQCFVLAPQYNPIFADSEEKT 245 (387) T ss_pred CCCCCCEEEEEEECCCCCCCCCCCCCEEEEEECC-CCCCCHHHHHHHCCCCCEEEECCCCCEEEECCCCCCCCCCCCCCC T ss_conf 4256855558873566668887645379999448-877741566663586513561566743897146552124432352 Q ss_pred CCCCHHHHHHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCC-CCCCCCCCCCCCCEEEE Q ss_conf 52102389999989999862-3546740588872036878764503862-133012275744-33322234334773776 Q gi|254781093|r 76 DYGDGELSDAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPK-SYDFSFLAPCPSSGLII 152 (225) Q Consensus 76 ~~g~~E~~D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~-~~~~~~l~~~~~p~LiI 152 (225) +.-.-+.-|+ ..+-+.++ +.|.+||++.|.|+||+.++.++.+.|+ .++.+.++..-+ .+-.+.+. ..|..+. T Consensus 246 ~~~l~~~idl--i~~vlas~ynID~sRIYviGlSrG~~gt~al~~kfPdfFAaa~~iaG~~d~v~lv~~lk--~~piWvf 321 (387) T COG4099 246 LLYLIEKIDL--ILEVLASTYNIDRSRIYVIGLSRGGFGTWALAEKFPDFFAAAVPIAGGGDRVYLVRTLK--KAPIWVF 321 (387) T ss_pred CHHHHHHHHH--HHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHCCHHHHEEEEECCCCCHHHHHHHHC--CCCEEEE T ss_conf 1357899999--99998533575621389996267634568987758054421346347886065543322--5866999 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHCCCCCEEE-------EEECCCCC Q ss_conf 168655028999999999986077996079-------99787984 Q gi|254781093|r 153 NGSNDTVATTSDVKDLVNKLMNQKGISITH-------KVIPDANH 190 (225) Q Consensus 153 hG~~D~~vp~~~~~~~~~~l~~~~~~~~~~-------~~i~ga~H 190 (225) |+++|.++|.+.++-++++++.-.+ ++.+ .+.+|-+| T Consensus 322 hs~dDkv~Pv~nSrv~y~~lk~~~~-kv~Ytaf~~g~~~~eG~d~ 365 (387) T COG4099 322 HSSDDKVIPVSNSRVLYERLKALDR-KVNYTAFLEGTTVLEGVDH 365 (387) T ss_pred EECCCCCCCCCCCEEEHHHHHHHCC-CCCHHHHHHCCCCCCCCCC T ss_conf 8158873436765543687775266-3102120011233346577 No 98 >PRK06489 hypothetical protein; Provisional Probab=98.80 E-value=1.8e-07 Score=65.00 Aligned_cols=176 Identities=19% Similarity=0.181 Sum_probs=102.4 Q ss_pred CCEEEEECCCCCCCCCCCC---HH----HHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCH------HHHHHHHHHHH Q ss_conf 8799995789988889989---99----99999999973984999853155887677852102------38999998999 Q gi|254781093|r 25 APIALILHPHPRFGGTMND---NI----VYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDG------ELSDAAAALDW 91 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~---~~----~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~------E~~D~~aa~~~ 91 (225) ...|++||+. +|+... +- +..--+.|-..-|-++..|.-|.|.|.. .++|.+ -+.|...+-.- T Consensus 69 ~NaVLv~H~~---~g~~~~~~~~g~w~~liGpG~~lDt~kyfiI~~n~lG~g~Ss~-Ps~~~~~~FP~~ti~D~V~aQ~~ 144 (358) T PRK06489 69 DNAVLVLHGT---GGSGKSFLSPTFAGELFGPGQPLDATKYYIILPDGIGHGKSSK-PSDGLRAKFPRYDYADMVEAQYR 144 (358) T ss_pred CCEEEEEECC---CCCCCCCCCCCCHHHCCCCCCCCCCCCEEEEEECCCCCCCCCC-CCCCCCCCCCEEEHHHHHHHHHH T ss_conf 7659998257---7874436898608864579988788866999835888999999-99888889983222578999999 Q ss_pred HHHHCCCCCC-EEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCC-CC----------------CC-------------- Q ss_conf 9862354674-0588872036878764503862-13301227574-43----------------33-------------- Q gi|254781093|r 92 VQSLNPESKS-CWIAGYSFGAWISMQLLMRRPE-INGFISVAPQP-KS----------------YD-------------- 138 (225) Q Consensus 92 l~~~~~~~~~-i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~-~~----------------~~-------------- 138 (225) +..++....+ -.++|.||||..+++-+..+|+ +++++.++..+ .. .| T Consensus 145 Ll~e~lgI~~l~~v~G~SmGgmqa~~Wa~~yPd~v~~~ipia~~p~~~s~~N~~~~e~~~~aI~~DP~w~~G~Y~~qP~~ 224 (358) T PRK06489 145 LVTEGLGVKHLRLILGTSMGGMHAWVWGERYPDFMDALMPMASQPVEMSGRNWMWRRMLIDAIRNDPDWNGGNYTTQPRA 224 (358) T ss_pred HHHHHCCCCEEEEEECCCHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH T ss_conf 99876298668656325788899999998796898745631057442208779999999999861955458857768266 Q ss_pred ---------------------------------------------------------C---CCCCCCCCCEEEEECCCCC Q ss_conf ---------------------------------------------------------2---2234334773776168655 Q gi|254781093|r 139 ---------------------------------------------------------F---SFLAPCPSSGLIINGSNDT 158 (225) Q Consensus 139 ---------------------------------------------------------~---~~l~~~~~p~LiIhG~~D~ 158 (225) + ..|.....|+|+|...+|. T Consensus 225 gl~aa~m~~~~t~~s~~~~q~~~ptr~~~d~~vd~~~~~~~~~DANd~Ly~~kAs~~yDp~~~L~~IkA~vLvI~~a~D~ 304 (358) T PRK06489 225 MKRANTAFGIATGGGTLAYQKQAPTRAAADKAVDERLAAPVTADANDFLYQWDSSRDYNPSPDLEKIKAPVLAVNSADDE 304 (358) T ss_pred HHHHHHHHHHHHCCCHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCHHHHHHCCCCEEEEECCCCC T ss_conf 99999999999814899998758716677899999874333678116999987527479635586473358998565546 Q ss_pred CCCHHHHH--HHHHHHHHCCCCCEEEEEECC----CCCCCCCCHHHHHHHHHHHHHH Q ss_conf 02899999--999998607799607999787----9847558589999999999997 Q gi|254781093|r 159 VATTSDVK--DLVNKLMNQKGISITHKVIPD----ANHFFIGKVDELINECAHYLDN 209 (225) Q Consensus 159 ~vp~~~~~--~~~~~l~~~~~~~~~~~~i~g----a~H~f~~~~~~l~~~i~~fl~~ 209 (225) +-|+|... ...+++++ -++++||. -+|+-++...-..+.+.+||.+ T Consensus 305 ~fPPE~g~~e~~~~~v~~-----~~~~~ip~~~~t~GH~t~~~~~~~~~~L~~~L~~ 356 (358) T PRK06489 305 RNPPELGVMEAALKRVKN-----GRLVLIPASDETRGHGTTGNAKFWKAYLAELLAQ 356 (358) T ss_pred CCCHHHHHHHHHHHHCCC-----CEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 288203058999974769-----6499955776746776665767899999999841 No 99 >pfam03959 FSH1 Serine hydrolase (FSH1). This is a family of serine hydrolases. Probab=98.80 E-value=1.2e-06 Score=60.08 Aligned_cols=112 Identities=21% Similarity=0.219 Sum_probs=69.4 Q ss_pred HHHHHHHHHHHHHHHCC-CCCCEEEEEECHHHHHHHHHHH---------CCCCHHEEEECCCCCCC-CCC---CCCCCCC Q ss_conf 38999998999986235-4674058887203687876450---------38621330122757443-332---2234334 Q gi|254781093|r 81 ELSDAAAALDWVQSLNP-ESKSCWIAGYSFGAWISMQLLM---------RRPEINGFISVAPQPKS-YDF---SFLAPCP 146 (225) Q Consensus 81 E~~D~~aa~~~l~~~~~-~~~~i~l~G~S~Gg~val~~a~---------~~p~i~~~v~isp~~~~-~~~---~~l~~~~ 146 (225) +..++..+++|+.+.-. +..-..|+|||.||.+|..++. ..|.++.+|+++..... ..+ ....... T Consensus 80 ~~~~~~~sl~~l~~~i~~~gPfdGIlGFSQGa~la~~l~~~~~~~~~~~~~p~~kfaI~~sG~~~~~~~~~~~~~~~~i~ 159 (210) T pfam03959 80 EYRGLDESLDYVRDYIKENGPFDGILGFSQGAALAAILASLLEEGLPLTDHPPLKFAILISGFRPREPKYAEFYTEPPIQ 159 (210) T ss_pred CCCCHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCHHHHHCCCCCCCC T ss_conf 32799999999999998549924897514668999999999985156567998029999626888973330012467778 Q ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHH Q ss_conf 773776168655028999999999986077996079997879847558589 Q gi|254781093|r 147 SSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVD 197 (225) Q Consensus 147 ~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~ 197 (225) .|+|.|+|++|.++|.+.+..+++..... .++ +.-+.+|..-.+.+ T Consensus 160 iPtlHi~G~~D~~v~~~~s~~L~~~~~~~----~~v-~~H~gGH~vP~~~~ 205 (210) T pfam03959 160 TPSLHVIGELDTVVPEERSRKLAEACKNS----ATV-LEHPGGHFVPNKKP 205 (210) T ss_pred CCEEEEEECCCCCCCHHHHHHHHHHCCCC----CEE-EECCCCCCCCCCHH T ss_conf 75489981898777769999999972899----689-97399884979736 No 100 >KOG2551 consensus Probab=98.79 E-value=6.1e-08 Score=67.83 Aligned_cols=124 Identities=16% Similarity=0.212 Sum_probs=90.0 Q ss_pred HHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH---------CCCCHHEEEECCCCCCC----CCCCCCCCCCCCEEE Q ss_conf 99989999862354674058887203687876450---------38621330122757443----332223433477377 Q gi|254781093|r 85 AAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM---------RRPEINGFISVAPQPKS----YDFSFLAPCPSSGLI 151 (225) Q Consensus 85 ~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~---------~~p~i~~~v~isp~~~~----~~~~~l~~~~~p~Li 151 (225) +..+.+|+++.+|= =.|+|||.|+.++..+++ ..|.++-+|++|..... .......+...|.|- T Consensus 92 l~yl~~~i~enGPF---DGllGFSQGA~laa~l~~~~~~~~~~~~~P~~kF~v~~SGf~~~~~~~~~~~~~~~i~~PSLH 168 (230) T KOG2551 92 LEYLEDYIKENGPF---DGLLGFSQGAALAALLAGLGQKGLPYVKQPPFKFAVFISGFKFPSKKLDESAYKRPLSTPSLH 168 (230) T ss_pred HHHHHHHHHHHCCC---CCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEECCCCCCCHHHHHHHCCCCCCCEEE T ss_conf 99999999970997---540222144788987614443587555789847999983477786234565533677787057 Q ss_pred EECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 6168655028999999999986077996079997879847558589999999999997312210012 Q gi|254781093|r 152 INGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSLDEKFTLL 218 (225) Q Consensus 152 IhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~~~~~~~ 218 (225) |.|+.|+++|.+.+..+++....+ .++.--++|+.-+.. ...+.+.+|++..+++..+-. T Consensus 169 i~G~~D~iv~~~~s~~L~~~~~~a------~vl~HpggH~VP~~~-~~~~~i~~fi~~~~~~~~e~~ 228 (230) T KOG2551 169 IFGETDTIVPSERSEQLAESFKDA------TVLEHPGGHIVPNKA-KYKEKIADFIQSFLQEESEEH 228 (230) T ss_pred EECCCCEEECCHHHHHHHHHCCCC------EEEECCCCCCCCCCH-HHHHHHHHHHHHHHHHHHHHH T ss_conf 843655341326779999856787------689548875478710-789999999999998653653 No 101 >pfam12146 Hydrolase_4 Putative lysophospholipase. This domain is found in bacteria and eukaryotes and is approximately 110 amino acids in length. It is found in association with pfam00561. Many members are annotated as being lysophospholipases, and others as alpha-beta hydrolase fold-containing proteins. Probab=98.79 E-value=3.5e-08 Score=69.27 Aligned_cols=73 Identities=18% Similarity=0.221 Sum_probs=52.8 Q ss_pred EEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC--C-HHHHHHHHH Q ss_conf 1879996688877879999578998888998999999999999739849998531558876778521--0-238999998 Q gi|254781093|r 12 RLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYG--D-GELSDAAAA 88 (225) Q Consensus 12 ~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g--~-~E~~D~~aa 88 (225) +|......|.+++++++|++|++. ++ ......+++.|++.||.|+.+|+||+|+|+|..... . .-++|+.+. T Consensus 3 ~l~~~~w~p~~~~~~~vl~~HG~~---~~--~~~~~~~a~~l~~~G~~V~~~D~rGhG~S~g~rg~~~~~~~~v~Dl~~~ 77 (79) T pfam12146 3 KLFYRRWLPAGPPKAVVVLVHGGG---EH--SGRYAELAEELAAQGYAVYAYDHRGHGRSPGKRGHVPSFDDYVDDLDTF 77 (79) T ss_pred EEEEEECCCCCCCCEEEEEECCCC---CH--HHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 889996479999874999989995---36--7899999999996999598701275589999667879999999999998 Q ss_pred H Q ss_conf 9 Q gi|254781093|r 89 L 89 (225) Q Consensus 89 ~ 89 (225) + T Consensus 78 i 78 (79) T pfam12146 78 V 78 (79) T ss_pred H T ss_conf 6 No 102 >COG2021 MET2 Homoserine acetyltransferase [Amino acid transport and metabolism] Probab=98.76 E-value=8.9e-07 Score=60.81 Aligned_cols=180 Identities=17% Similarity=0.170 Sum_probs=110.1 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHH-----HHHHHHHHCC-------CEEEEEEECCCC-CCCCCCCC---C----C--- Q ss_conf 7787999957899888899899999-----9999999739-------849998531558-87677852---1----0--- Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVY-----QLFYLFQQRG-------FVSLRFNFRGIG-RSEGEFDY---G----D--- 79 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~-----~la~~l~~~G-------~~vl~fd~RG~G-~S~G~~~~---g----~--- 79 (225) ....+|+|||.. +|+.+.--.. -.-+.|.--| |-|+++|.-|.. +|.|.-+. | . T Consensus 49 ~~~NaVli~HaL---tG~~h~~~~~~~~~~GWW~~liGpG~~iDt~r~fvIc~NvlG~c~GStgP~s~~p~g~~yg~~FP 125 (368) T COG2021 49 EKDNAVLICHAL---TGDSHAAGTADDGEKGWWDDLIGPGKPIDTERFFVICTNVLGGCKGSTGPSSINPGGKPYGSDFP 125 (368) T ss_pred CCCCEEEEECCC---CCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 677569993565---67631246688777761887248998778021699981477888789998876877782566798 Q ss_pred -HHHHHHHHHHHHHHHHCCCCCCEE-EEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCC------------------- Q ss_conf -238999998999986235467405-88872036878764503862-13301227574433------------------- Q gi|254781093|r 80 -GELSDAAAALDWVQSLNPESKSCW-IAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSY------------------- 137 (225) Q Consensus 80 -~E~~D~~aa~~~l~~~~~~~~~i~-l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~------------------- 137 (225) --+.|...+-.-+.+.. ..+++. ++|-||||+.|+.-+..+|+ ++.++.++..+... T Consensus 126 ~~ti~D~V~aq~~ll~~L-GI~~l~avvGgSmGGMqaleWa~~yPd~V~~~i~ia~~~r~s~~~ia~~~~~r~AI~~DP~ 204 (368) T COG2021 126 VITIRDMVRAQRLLLDAL-GIKKLAAVVGGSMGGMQALEWAIRYPDRVRRAIPIATAARLSAQNIAFNEVQRQAIEADPD 204 (368) T ss_pred CCCHHHHHHHHHHHHHHC-CCCEEEEECCCCHHHHHHHHHHHHCHHHHHHHHEECCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 331899999999999862-9404610024575879999999869198754520134445788999999999999973987 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q gi|254781093|r 138 -------------------------------------------------------------------------------- 137 (225) Q Consensus 138 -------------------------------------------------------------------------------- 137 (225) T Consensus 205 ~n~G~Y~~~~~P~~GL~~AR~l~~ltYrS~~~~~~rF~r~~~~~~~~~~~~~f~vESYL~~qg~kf~~rfDaNsYL~lt~ 284 (368) T COG2021 205 WNGGDYYEGTQPERGLRLARMLAHLTYRSEEELDERFGRRLQADPLRGGGVRFAVESYLDYQGDKFVARFDANSYLYLTR 284 (368) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 55777668878314689999999997469899998846445445567886068999999998778876058403999998 Q ss_pred -----CC--------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCC-CCCCCHHHHHHHH Q ss_conf -----32--------223433477377616865502899999999998607799607999787984-7558589999999 Q gi|254781093|r 138 -----DF--------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANH-FFIGKVDELINEC 203 (225) Q Consensus 138 -----~~--------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H-~f~~~~~~l~~~i 203 (225) |. ..|+.+..|+|++-=+.|...|++..+++.+.++.... . ..+....+| .|....+.+...+ T Consensus 285 ald~~D~s~~~~~l~~al~~i~~~~lv~gi~sD~lfp~~~~~~~~~~L~~~~~--~-~~i~S~~GHDaFL~e~~~~~~~i 361 (368) T COG2021 285 ALDYHDVSRGRGDLTAALARIKAPVLVVGITSDWLFPPELQRALAEALPAAGA--L-REIDSPYGHDAFLVESEAVGPLI 361 (368) T ss_pred HHHHCCCCCCCCCHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCCCC--E-EEECCCCCCHHHHCCHHHHHHHH T ss_conf 87731887776758889745766779997244555798999999985455674--5-88458888432420465666999 Q ss_pred HHHHHH Q ss_conf 999997 Q gi|254781093|r 204 AHYLDN 209 (225) Q Consensus 204 ~~fl~~ 209 (225) ..||+. T Consensus 362 ~~fL~~ 367 (368) T COG2021 362 RKFLAL 367 (368) T ss_pred HHHHHC T ss_conf 998623 No 103 >KOG3101 consensus Probab=98.73 E-value=1.4e-07 Score=65.70 Aligned_cols=176 Identities=20% Similarity=0.315 Sum_probs=102.3 Q ss_pred EEEEECCCCC---CCCEEEEECCCCCCCCCCCCHHHHH-HHHHHHHCCCEEEEEEE--CCC---CCCCCCCCCCCHH--H Q ss_conf 7999668887---7879999578998888998999999-99999973984999853--155---8876778521023--8 Q gi|254781093|r 14 EGRYQPSTNP---NAPIALILHPHPRFGGTMNDNIVYQ-LFYLFQQRGFVSLRFNF--RGI---GRSEGEFDYGDGE--L 82 (225) Q Consensus 14 ~~~~~~~~~~---~~~~vv~~Hp~p~~GG~~~~~~~~~-la~~l~~~G~~vl~fd~--RG~---G~S~G~~~~g~~E--~ 82 (225) -++|.||..+ +.|++-.+- +...+.+|-+--. .-+...+.|++|+.+|. ||+ |+++ +||.|.|. . T Consensus 30 f~vylPp~a~~~k~~P~lf~LS---GLTCT~~Nfi~Ksg~qq~As~hgl~vV~PDTSPRG~~v~g~~e-swDFG~GAGFY 105 (283) T KOG3101 30 FGVYLPPDAPRGKRCPVLFYLS---GLTCTHENFIEKSGFQQQASKHGLAVVAPDTSPRGVEVAGDDE-SWDFGQGAGFY 105 (283) T ss_pred EEEECCCCCCCCCCCCEEEEEC---CCCCCCHHHHHHHHHHHHHHHCCEEEECCCCCCCCCCCCCCCC-CCCCCCCCEEE T ss_conf 9886588765688686689842---7744515467665677767545859988899987653579854-33346774258 Q ss_pred -----HHH---HHHHHHHHHH-----C-----CCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC------- Q ss_conf -----999---9989999862-----3-----546740588872036878764503862-1330122757443------- Q gi|254781093|r 83 -----SDA---AAALDWVQSL-----N-----PESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKS------- 136 (225) Q Consensus 83 -----~D~---~aa~~~l~~~-----~-----~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~------- 136 (225) +-. --..+|+.++ + .|..++.++||||||.-|+..+.+++. -..+.+.+|-.+- T Consensus 106 vnAt~epw~~~yrMYdYv~kELp~~l~~~~~pld~~k~~IfGHSMGGhGAl~~~Lkn~~kykSvSAFAPI~NP~~cpWGq 185 (283) T KOG3101 106 VNATQEPWAKHYRMYDYVVKELPQLLNSANVPLDPLKVGIFGHSMGGHGALTIYLKNPSKYKSVSAFAPICNPINCPWGQ 185 (283) T ss_pred EECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHCCEECCCCCCCCEEEEEECCCCCCCCEECCCCCCCCCCCCCHH T ss_conf 82451567642308999999988873215665560342054035677762799972832013010004646865484157 Q ss_pred ----------------CCCCCC----CCCCCCEEEEECCCCCCCCHHHH-HHHHHHHHHCCCCCEEEEEECCCCCCCC Q ss_conf ----------------332223----43347737761686550289999-9999998607799607999787984755 Q gi|254781093|r 137 ----------------YDFSFL----APCPSSGLIINGSNDTVATTSDV-KDLVNKLMNQKGISITHKVIPDANHFFI 193 (225) Q Consensus 137 ----------------~~~~~l----~~~~~p~LiIhG~~D~~vp~~~~-~~~~~~l~~~~~~~~~~~~i~ga~H~f~ 193 (225) ||...| ...+...||=+|+.|++.+-+.. +.+.++....+...+.+...+|-||.|- T Consensus 186 KAf~gYLG~~ka~W~~yDat~lik~y~~~~~~ilIdqG~~D~Fl~~qLlPe~l~~a~~~~~~~~v~~r~~~gyDHSYy 263 (283) T KOG3101 186 KAFTGYLGDNKAQWEAYDATHLIKNYRGVGDDILIDQGAADNFLAEQLLPENLLEACKATWQAPVVFRLQEGYDHSYY 263 (283) T ss_pred HHHHCCCCCCHHHHHHCCHHHHHHHCCCCCCCEEEECCCCCHHHHHHCCHHHHHHHHHCCCCCCEEEEEECCCCCCEE T ss_conf 776421378867876405478897468997507874386541346651848899974144656479983057775266 No 104 >PRK07581 hypothetical protein; Validated Probab=98.73 E-value=3.8e-07 Score=63.06 Aligned_cols=178 Identities=17% Similarity=0.177 Sum_probs=107.6 Q ss_pred CCCCEEEEECCCCCCCCCCCCH-HHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCC---CCH------HHHHHHHHHHHH Q ss_conf 7787999957899888899899-999999999973984999853155887677852---102------389999989999 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDN-IVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDY---GDG------ELSDAAAALDWV 92 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~-~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~---g~~------E~~D~~aa~~~l 92 (225) ....++++|| .++|+.... -+.---+.+--.-|-|+++|.-|.|.|.+..+. ..+ -+.|...+-..+ T Consensus 39 ~~~Nailv~h---~~tg~~~~~~wliGpg~~iDt~kyfVIc~N~lG~g~ss~p~~~~~~~~~~~FP~~ti~D~V~aq~~L 115 (340) T PRK07581 39 AKDNAILYPT---WYSGTHQDNEWLIGPGRALDPEKYFIIVPNMFGNGLSSSPSNTPAPYNAARFPHVTIYDNVRAQHRL 115 (340) T ss_pred CCCCEEEECC---CCCCCCCCCCEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEHHHHHHHHHH T ss_conf 8986799768---7678887787443289864767659998358788989999899997777889853627999999999 Q ss_pred HHHCCCCCCE-EEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCC--------------------------------- Q ss_conf 8623546740-588872036878764503862-13301227574433--------------------------------- Q gi|254781093|r 93 QSLNPESKSC-WIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSY--------------------------------- 137 (225) Q Consensus 93 ~~~~~~~~~i-~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~--------------------------------- 137 (225) ..+....+++ .++|.||||..|+.-|..+|+ ++.+|.++..+... T Consensus 116 l~e~LgI~~l~~ViG~SmGGmqaleWa~~yPd~v~~~v~ia~~a~~s~~~ia~~e~~r~AI~~Dp~w~~G~Y~~~P~~GL 195 (340) T PRK07581 116 LTEHFGIERLALVTGWSMGAQQTYHWAVRFPDMVERAAPICGTAKCTPHNFVFLEGLKAALTADPAFKGGWYTEKPERGL 195 (340) T ss_pred HHHHCCCCEEEEEECCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHH T ss_conf 99752988688885667899999999986969887562115665689879999999999998393557888888613439 Q ss_pred ------------------------------------------------------------CC-----------CCCCCCC Q ss_conf ------------------------------------------------------------32-----------2234334 Q gi|254781093|r 138 ------------------------------------------------------------DF-----------SFLAPCP 146 (225) Q Consensus 138 ------------------------------------------------------------~~-----------~~l~~~~ 146 (225) |. ..|.... T Consensus 196 ra~ar~~a~~~~s~~~~r~~~~~~~g~~~vEsyL~~~~~~~~~rfDANsyl~lt~a~~~~Dv~~~~~~~g~l~~AL~~Ik 275 (340) T PRK07581 196 RAAARVYAGWGFSQAFYRQEDWRAMGYSSLEDFLVGFWEGNFLPRDPNNLLAMIWTWQRGDVSRNTAFGGDLAAALGAIK 275 (340) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCC T ss_conf 99999999986555777645443127214889999999888761788899999999774564347676875999997589 Q ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC-CCCC-CCC-CC----HHHHHHHHHHHHH Q ss_conf 7737761686550289999999999860779960799978-7984-755-85----8999999999999 Q gi|254781093|r 147 SSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIP-DANH-FFI-GK----VDELINECAHYLD 208 (225) Q Consensus 147 ~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~-ga~H-~f~-~~----~~~l~~~i~~fl~ 208 (225) .++|+|-.+.|-+.|+++.+..++.++.+ ++++|+ --+| .|. +. ...+.+.+.+||. T Consensus 276 a~~Lvi~i~sD~lFPp~e~~~~a~~ip~a-----~~~~I~S~~GHdAfl~g~~~~d~~~~~~~i~~~La 339 (340) T PRK07581 276 AKTFVMPGSTDLYFPPEDCAAEQALIPNA-----ELRPIPSIWGHLAGLPGQNPADIAFIDAALKELLA 339 (340) T ss_pred CCEEEEEECCCCCCCHHHHHHHHHHCCCC-----EEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 99899973765386989999999856797-----69997999620046889997799999999999853 No 105 >COG2272 PnbA Carboxylesterase type B [Lipid metabolism] Probab=98.62 E-value=2.9e-07 Score=63.76 Aligned_cols=115 Identities=23% Similarity=0.342 Sum_probs=75.4 Q ss_pred EEECC-CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCC-CEEEEEEEC-CC-C--------CCCCCCCCCCHHHH Q ss_conf 99668-8877879999578998888998999999999999739-849998531-55-8--------87677852102389 Q gi|254781093|r 16 RYQPS-TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRG-FVSLRFNFR-GI-G--------RSEGEFDYGDGELS 83 (225) Q Consensus 16 ~~~~~-~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G-~~vl~fd~R-G~-G--------~S~G~~~~g~~E~~ 83 (225) +|.|. ..++.||.|+.|+-...+|+..++ .++. ..|+++| +.++.+||| |. | .++...++ --+. T Consensus 84 IwaP~~~a~~~PVmV~IHGG~y~~Gs~s~~-~ydg-s~La~~g~vVvVSvNYRLG~lGfL~~~~~~~~~~~~~n--~Gl~ 159 (491) T COG2272 84 IWAPEVPAEKLPVMVYIHGGGYIMGSGSEP-LYDG-SALAARGDVVVVSVNYRLGALGFLDLSSLDTEDAFASN--LGLL 159 (491) T ss_pred EECCCCCCCCCCEEEEEECCCCCCCCCCCC-CCCH-HHHHHCCCEEEEEECCCCCCCEEEEHHHCCCCCCCCCC--CCHH T ss_conf 612688888885799994661036877664-4586-88975498799985762455001200420565443355--1389 Q ss_pred HHHHHHHHHHHH----CCCCCCEEEEEECHHHHHHHHHHHCCCCHH----EEEECCCCCC Q ss_conf 999989999862----354674058887203687876450386213----3012275744 Q gi|254781093|r 84 DAAAALDWVQSL----NPESKSCWIAGYSFGAWISMQLLMRRPEIN----GFISVAPQPK 135 (225) Q Consensus 84 D~~aa~~~l~~~----~~~~~~i~l~G~S~Gg~val~~a~~~p~i~----~~v~isp~~~ 135 (225) |...+++|+++. +.|+..|-|+|.|-|++.++.+.+ .|..+ .+|+.|++.. T Consensus 160 DqilALkWV~~NIe~FGGDp~NVTl~GeSAGa~si~~Lla-~P~AkGLF~rAi~~Sg~~~ 218 (491) T COG2272 160 DQILALKWVRDNIEAFGGDPQNVTLFGESAGAASILTLLA-VPSAKGLFHRAIALSGAAS 218 (491) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHC-CCCCHHHHHHHHHHCCCCC T ss_conf 9999999999989984899631478620510799988662-7650178999998578877 No 106 >pfam07224 Chlorophyllase Chlorophyllase. This family consists of several plant specific Chlorophyllase proteins (EC:3.1.1.14). Chlorophyllase (Chlase) is the first enzyme involved in chlorophyll (Chl) degradation and catalyses the hydrolysis of ester bond to yield chlorophyllide and phytol. Probab=98.61 E-value=7.8e-07 Score=61.17 Aligned_cols=189 Identities=15% Similarity=0.237 Sum_probs=114.6 Q ss_pred ECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHH-- Q ss_conf 668887787999957899888899899999999999973984999853155887677852102389999989999862-- Q gi|254781093|r 18 QPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSL-- 95 (225) Q Consensus 18 ~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~-- 95 (225) .|.+...-|+++++| |-...|.....+-+-.+..||.|+.+-.-..- |. ..+.-|++|+.++.+|+.+- T Consensus 39 ~P~~~G~YPV~lFlH-----G~~l~NsfYsqll~HIaSHGfIvVAPQly~i~---~~-p~~~~EI~~aa~V~nWL~~GL~ 109 (307) T pfam07224 39 TPKEAGTYPVVLFLH-----GTMLSNEFYSLFFNHIASHGFIVVAPQLYRLF---PP-PSQQDEIDSAAEVANWLPLGLQ 109 (307) T ss_pred CCCCCCCCCEEEEEC-----CCEECCHHHHHHHHHHHCCCEEEECCEEEECC---CC-CCCCHHHHHHHHHHHHHHHCCH T ss_conf 357688641799850-----42212268999999876065089845100135---89-8860788889999877763054 Q ss_pred -------CCCCCCEEEEEECHHHHHHHHHHHCCC---CHHEEEECCCCCCCCCCCC-----------CCCCCCCEEEEE- Q ss_conf -------354674058887203687876450386---2133012275744333222-----------343347737761- Q gi|254781093|r 96 -------NPESKSCWIAGYSFGAWISMQLLMRRP---EINGFISVAPQPKSYDFSF-----------LAPCPSSGLIIN- 153 (225) Q Consensus 96 -------~~~~~~i~l~G~S~Gg~val~~a~~~p---~i~~~v~isp~~~~~~~~~-----------l~~~~~p~LiIh- 153 (225) .++..++.|+|||-||-+|..+|.... ...++|.+-|......... --+...|++||- T Consensus 110 ~~LP~~V~pdl~klalaGHSRGGktAFalALG~~~~lkfSALiGvDPV~G~~k~~qt~P~ILTy~p~Sfdl~iPv~VIGT 189 (307) T pfam07224 110 VVLPTGVEANLSKLALSGHSRGGKTAFALALGYSLDVTFSALIGVDPVAGTSKDDRTDPHVLTYKPNSFNLSIPVTVIGS 189 (307) T ss_pred HHCCCCCCCCHHHEEEECCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEEC T ss_conf 23887657671120331247743879999851356664346751366665466877899644237876677885699855 Q ss_pred --CCCCCCCCHH-----HHHHHHHHHHHCCCCCEEEEEECCCCCC--C-------C----------CC--HHHHHH---- Q ss_conf --6865502899-----9999999986077996079997879847--5-------5----------85--899999---- Q gi|254781093|r 154 --GSNDTVATTS-----DVKDLVNKLMNQKGISITHKVIPDANHF--F-------I----------GK--VDELIN---- 201 (225) Q Consensus 154 --G~~D~~vp~~-----~~~~~~~~l~~~~~~~~~~~~i~ga~H~--f-------~----------~~--~~~l~~---- 201 (225) |++-...|+- +-++|++..+.+. . .-+..+.+|. . . ++ .+.+++ T Consensus 190 GLG~~~~l~PpCAP~gvnH~eFf~ECk~p~---~-hFVa~dYGHmDmLDDd~~g~~g~~s~c~Ckng~~~r~pMRR~vGG 265 (307) T pfam07224 190 GLGARNNHTPPCAPTGVNHEEFFDECKENS---A-HFVATDYGHMDMLDDDRLGPIGVTSSCMCKNGFEPRAPMRRFVGG 265 (307) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHCCCE---E-EEEECCCCCCEECCCCCCCCCCHHHEEHHCCCCCCHHHHHHHHCC T ss_conf 878767678987989878899987526761---3-576447776310047886631030120102799753678875444 Q ss_pred HHHHHHHHHCCCCCCCCC Q ss_conf 999999973122100124 Q gi|254781093|r 202 ECAHYLDNSLDEKFTLLK 219 (225) Q Consensus 202 ~i~~fl~~~L~~~~~~~~ 219 (225) .+..||+..+.....-.. T Consensus 266 ivVAFL~a~l~g~~~dl~ 283 (307) T pfam07224 266 IVVAFLKAYFRDDGREYR 283 (307) T ss_pred EEHHHHHHHHCCCHHHHH T ss_conf 409999998758788999 No 107 >pfam05677 DUF818 Chlamydia CHLPS protein (DUF818). This family consists of several Chlamydia CHLPS proteins, the function of which are unknown. Probab=98.58 E-value=7.5e-07 Score=61.26 Aligned_cols=138 Identities=17% Similarity=0.256 Sum_probs=84.9 Q ss_pred EEECCCCCCCCEEEEECCCCCCCCCCCCHHHHH-----HHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 996688877879999578998888998999999-----999999739849998531558876778521023899999899 Q gi|254781093|r 16 RYQPSTNPNAPIALILHPHPRFGGTMNDNIVYQ-----LFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALD 90 (225) Q Consensus 16 ~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~-----la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~ 90 (225) ...-|...+.-..++.-++ +++++|+.+.. +-+.....|-++|.|||-|++.|.|.-+. ..-..--+|.+. T Consensus 128 ~I~~p~A~~dRWmLiSlGn---s~~lE~ra~~~~~dd~~~~~Ak~~~sNiLVfNYPGVmsS~G~~sr-~~l~kayqacvr 203 (364) T pfam05677 128 EICLPQAPPDRWMLISLGN---SDCLEERAMIGLKKSSWQRLAKLIGANILVFNYPGVMSSTGSITR-ENLAKAYQACVR 203 (364) T ss_pred EEECCCCCCCCEEEEECCC---CHHHHHHHHHHCCCHHHHHHHHHHCCCEEEEECCCCCCCCCCCCH-HHHHHHHHHHHH T ss_conf 7766889986069997478---146654788825756999999974688899807862257898898-999999999999 Q ss_pred HHHHH--CCCCCCEEEEEECHHHHHHHHHHHCCCC-----HHEEE-------ECCCCCCC--------------CC---C Q ss_conf 99862--3546740588872036878764503862-----13301-------22757443--------------33---2 Q gi|254781093|r 91 WVQSL--NPESKSCWIAGYSFGAWISMQLLMRRPE-----INGFI-------SVAPQPKS--------------YD---F 139 (225) Q Consensus 91 ~l~~~--~~~~~~i~l~G~S~Gg~val~~a~~~p~-----i~~~v-------~isp~~~~--------------~~---~ 139 (225) |++.+ ++.++.|+-.|||+||.+-..+..+++- ++-++ ++++.+.. |. . T Consensus 204 yLrD~~~Gp~AkqII~YGySlG~~vQa~AL~~~~~dgsdg~~w~~VKdR~p~sl~~~A~~~~~~ig~~~~kl~gW~i~s~ 283 (364) T pfam05677 204 YLRDKEQGPGAKQIIAYGYSLGGLVQAEALSKEVIDGSDGTSWIVVKDRGPRSLAAVANQFCGPIGKLIARLFGWNIDSV 283 (364) T ss_pred HHHCCCCCCCCCEEEEECCCCCHHHHHHHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 97537668873014541124105788888743875477882599994688531788999987368999999862554530 Q ss_pred CCCCCCCCCEEEEECCCC Q ss_conf 223433477377616865 Q gi|254781093|r 140 SFLAPCPSSGLIINGSND 157 (225) Q Consensus 140 ~~l~~~~~p~LiIhG~~D 157 (225) ..-...+.|-++|||++- T Consensus 284 k~S~~L~cPEifiy~~D~ 301 (364) T pfam05677 284 KRSEDLPCPEIFIYGTDS 301 (364) T ss_pred CCCCCCCCCEEEEECCCC T ss_conf 002568897489855876 No 108 >KOG2112 consensus Probab=98.54 E-value=1.9e-06 Score=58.85 Aligned_cols=118 Identities=17% Similarity=0.214 Sum_probs=79.8 Q ss_pred HHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEECCCCCCCCC--CCCCCC--CCCCEEEEECCCCCCCCH Q ss_conf 89999862354674058887203687876450386-2133012275744333--222343--347737761686550289 Q gi|254781093|r 88 ALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP-EINGFISVAPQPKSYD--FSFLAP--CPSSGLIINGSNDTVATT 162 (225) Q Consensus 88 a~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~isp~~~~~~--~~~l~~--~~~p~LiIhG~~D~~vp~ 162 (225) .+++-.+.+...++|.+.|+|+||++++..+...+ .+.+....++-..... +....+ ...|.+..||+.|++||. T Consensus 81 Li~~e~~~Gi~~~rI~igGfs~G~a~aL~~~~~~~~~l~G~~~~s~~~p~~~~~~~~~~~~~~~~~i~~~Hg~~d~~vp~ 160 (206) T KOG2112 81 LIDNEPANGIPSNRIGIGGFSQGGALALYSALTYPKALGGIFALSGFLPRASIGLPGWLPGVNYTPILLCHGTADPLVPF 160 (206) T ss_pred HHHHHHHCCCCCCCEEECCCCCHHHHHHHHHHCCCCCCCEEECCCCCCCCCHHHCCCCCCCCCCCHHHEECCCCCCEEEH T ss_conf 99989875998445167443742788998772144001236413332565022036776666754120023567740032 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 99999999986077996079997879847558589999999999997 Q gi|254781093|r 163 SDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDN 209 (225) Q Consensus 163 ~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~ 209 (225) .-.....+.+... +..++++.++|-.|.-.. +++ +.+..|+++ T Consensus 161 ~~g~~s~~~l~~~-~~~~~f~~y~g~~h~~~~--~e~-~~~~~~~~~ 203 (206) T KOG2112 161 RFGEKSAQFLKSL-GVRVTFKPYPGLGHSTSP--QEL-DDLKSWIKT 203 (206) T ss_pred HHHHHHHHHHHHC-CCCEEEEECCCCCCCCCH--HHH-HHHHHHHHH T ss_conf 7788889999972-973465416986552347--999-999999997 No 109 >pfam05705 DUF829 Eukaryotic protein of unknown function (DUF829). This family consists of several uncharacterized eukaryotic proteins. Probab=98.53 E-value=1.6e-05 Score=53.30 Aligned_cols=170 Identities=18% Similarity=0.221 Sum_probs=106.8 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHH-HHHHHHHHHHHH-CCCCCCEE Q ss_conf 7999957899888899899999999999973984999853155887677852102389-999989999862-35467405 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELS-DAAAALDWVQSL-NPESKSCW 103 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~-D~~aa~~~l~~~-~~~~~~i~ 103 (225) |++|+| ++.|. .++.+...++...+.|+.++++.-.-. .+-......+ -+..+++.+.+. ..+..++. T Consensus 1 Plvil~----gW~gA-~~~hl~KY~~~Y~~~g~~il~~~~~~~-----~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~il 70 (239) T pfam05705 1 PLVLLL----GWLGA-RPKHLAKYSDLYTRPGPDILVITSPPR-----DLLWPTKGLAPGLDKLLELLSESQRSEYWPIL 70 (239) T ss_pred CEEEEE----ECCCC-CHHHHHHHHHHHHCCCCEEEEEECCHH-----HHCCCCCCCHHHHHHHHHHHHHCCCCCCCCEE T ss_conf 989998----67899-699999999999728984999957989-----97031201069999999987540158998689 Q ss_pred EEEECHHHHHHHHHHH-----------CCCCHHEEEECCCCCCC-C-----------CC--------------------- Q ss_conf 8887203687876450-----------38621330122757443-3-----------32--------------------- Q gi|254781093|r 104 IAGYSFGAWISMQLLM-----------RRPEINGFISVAPQPKS-Y-----------DF--------------------- 139 (225) Q Consensus 104 l~G~S~Gg~val~~a~-----------~~p~i~~~v~isp~~~~-~-----------~~--------------------- 139 (225) +-.||.||...+.... ..|+++|.|.-|.|... + .. T Consensus 71 ~H~FSnGG~~~~~~l~~~~~~~~~~~~~~~~i~g~I~DS~Pg~~~~~~~~~a~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 150 (239) T pfam05705 71 FHVFSNGGPILYLCLLAALQDRTKFGKLLPRVKGQVWDSAPGIGHYHGAVGAFAAALPKLSKVASLLRALLLLALAVYLL 150 (239) T ss_pred EEEEECCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99960606999999999998355656656554799941899776578889999987177625777899999999999999 Q ss_pred --------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC--CCCCHH Q ss_conf --------------------2234334773776168655028999999999986077996079997879847--558589 Q gi|254781093|r 140 --------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF--FIGKVD 197 (225) Q Consensus 140 --------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~--f~~~~~ 197 (225) ....+.+.|-|.|+.+.|++++.+.+++.++..++ +|..++.+..+++.|. +..+.+ T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~lylYS~~D~li~~~dvE~~~~~~r~-~G~~V~~~~f~~S~Hv~H~r~~p~ 229 (239) T pfam05705 151 ALLILFLPSVASHSRRSLNDLANTPLPAPQLYLYSKADEVIPWRDVERHIEEARQ-RGVSVTSVCFEDSPHVGHMRKHPD 229 (239) T ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCCCHHHHHHHHCHH T ss_conf 9999728848999999986664189999669997089997299999999999997-699689997189845637761999 Q ss_pred HHHHHHHHH Q ss_conf 999999999 Q gi|254781093|r 198 ELINECAHY 206 (225) Q Consensus 198 ~l~~~i~~f 206 (225) ++.+++.+| T Consensus 230 ~Y~~~v~~f 238 (239) T pfam05705 230 RYWPKVVEF 238 (239) T ss_pred HHHHHHHHH T ss_conf 999999976 No 110 >KOG3847 consensus Probab=98.53 E-value=4.1e-06 Score=56.80 Aligned_cols=193 Identities=17% Similarity=0.206 Sum_probs=112.7 Q ss_pred EEEEEEECC---CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCC----------CCCCC-- Q ss_conf 187999668---887787999957899888899899999999999973984999853155887----------67785-- Q gi|254781093|r 12 RLEGRYQPS---TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRS----------EGEFD-- 76 (225) Q Consensus 12 ~l~~~~~~~---~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S----------~G~~~-- 76 (225) |+.+.+..+ +.+.=|++|+.| +.||++. +...++-.|+..||.|.....|-.-.+ .+... T Consensus 102 r~~~~~n~~~~tk~~k~PvvvFSH---GLggsRt--~YSa~c~~LAShG~VVaavEHRD~SA~~Ty~~~~~~~n~~lveq 176 (399) T KOG3847 102 RVPCIENAPLSTKNDKYPVVVFSH---GLGGSRT--LYSAYCTSLASHGFVVAAVEHRDRSACWTYVLKEKHENEPLVEQ 176 (399) T ss_pred CCCCCCCCCCCCCCCCCCEEEEEC---CCCCCHH--HHHHHHHHHHHCCEEEEEEECCCCCCEEEEEECCCCCCCCCCCC T ss_conf 034413588888899865899934---6663034--39887664754725999750256752247983235558765003 Q ss_pred --------CCC------HH-----HHHHHHHHHHHHHH----------------------CCCCCCEEEEEECHHHHHHH Q ss_conf --------210------23-----89999989999862----------------------35467405888720368787 Q gi|254781093|r 77 --------YGD------GE-----LSDAAAALDWVQSL----------------------NPESKSCWIAGYSFGAWISM 115 (225) Q Consensus 77 --------~g~------~E-----~~D~~aa~~~l~~~----------------------~~~~~~i~l~G~S~Gg~val 115 (225) ... -| ...+..|+.-+++. +.+..++.++||||||+.++ T Consensus 177 ~~~ir~v~~~ekef~irNeqv~~R~~Ec~~aL~il~~i~~g~~~~~~L~g~~~~~~~~K~nl~~s~~aViGHSFGgAT~i 256 (399) T KOG3847 177 WIKIRLVEANEKEFHIRNEQVGQRAQECQKALKILEQINDGGTPDNVLPGNNSDLEQLKGNLDTSQAAVIGHSFGGATSI 256 (399) T ss_pred CEEEEEECCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCCCHHHHHCCHHHHHHHHEECCCCCHHHH T ss_conf 05754300576058863788888899999999999975438982101667652199874640043342100255335453 Q ss_pred HHHHCCCCHHEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC-CCC Q ss_conf 6450386213301227574433322234334773776168655028999999999986077996079997879847-558 Q gi|254781093|r 116 QLLMRRPEINGFISVAPQPKSYDFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF-FIG 194 (225) Q Consensus 116 ~~a~~~p~i~~~v~isp~~~~~~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~-f~~ 194 (225) ...+.+.+.+..|++-.=.--.+.......+.|+|+|- -|++--.|...- .+++..+.... ...++.|+-|- |++ T Consensus 257 ~~ss~~t~FrcaI~lD~WM~Pl~~~~~~~arqP~~fin--v~~fQ~~en~~v-mKki~~~n~g~-~~it~~GsVHqnfsD 332 (399) T KOG3847 257 ASSSSHTDFRCAIALDAWMFPLDQLQYSQARQPTLFIN--VEDFQWNENLLV-MKKIESQNEGN-HVITLDGSVHQNFSD 332 (399) T ss_pred HHHCCCCCEEEEEEEEEEECCCCHHHHHHCCCCEEEEE--CCCCCCHHHHHH-HHHHHCCCCCC-EEEEECCCEECCCCC T ss_conf 35146552224656400002043334540458759998--021331467898-87531777665-699974614025556 Q ss_pred C-----------------------HHHHHHHHHHHHHHHCCC Q ss_conf 5-----------------------899999999999973122 Q gi|254781093|r 195 K-----------------------VDELINECAHYLDNSLDE 213 (225) Q Consensus 195 ~-----------------------~~~l~~~i~~fl~~~L~~ 213 (225) . .+...+....||+++++- T Consensus 333 fpfv~p~~i~k~f~~kg~~dpy~~~~~~~r~slaFLq~h~d~ 374 (399) T KOG3847 333 FPFVTPNWIGKVFKVKGETDPYEAMQIAIRASLAFLQKHLDL 374 (399) T ss_pred CCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 763047899998655777885899999999999999962433 No 111 >pfam11144 DUF2920 Protein of unknown function (DUF2920). This bacterial family of proteins has no known function. Probab=98.51 E-value=2.8e-05 Score=51.82 Aligned_cols=171 Identities=20% Similarity=0.334 Sum_probs=113.2 Q ss_pred EEEEEE-ECCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHC-CCEEEEEEECCCCCC------------------ Q ss_conf 187999-668887787999957899888899899999999999973-984999853155887------------------ Q gi|254781093|r 12 RLEGRY-QPSTNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQR-GFVSLRFNFRGIGRS------------------ 71 (225) Q Consensus 12 ~l~~~~-~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~-G~~vl~fd~RG~G~S------------------ 71 (225) +||-.+ ...+.+.+++|++.. ++||+.+......+++.+++. +.+|+..||-+.|.- T Consensus 21 KLeyri~yDdeKeikaiVfiI~---G~G~d~n~~~~~~~~~~iA~~f~va~i~V~YHci~~Rpq~ga~~~~~~~D~~~l~ 97 (403) T pfam11144 21 KLEYRISYDDEKEIKAIVFIIP---GFGADANISYLDFFREYVAKNFDVAAVSVNYHCIGNRPQYGAKFYLDDEDKEILE 97 (403) T ss_pred EEEEEEECCCCCCCEEEEEEEC---CCCCCCCHHHHHHHHHHHHHHCCEEEEEECEEEECCCCCCCCCCCCCHHHHHHHH T ss_conf 1479963388877448999967---7676755589999999999875979998362112255334632247877999999 Q ss_pred -----------------------------------C-------------------CCCC-CCCHHHHHHHHHHHHHHHHC Q ss_conf -----------------------------------6-------------------7785-21023899999899998623 Q gi|254781093|r 72 -----------------------------------E-------------------GEFD-YGDGELSDAAAALDWVQSLN 96 (225) Q Consensus 72 -----------------------------------~-------------------G~~~-~g~~E~~D~~aa~~~l~~~~ 96 (225) + ++|. +|...+.|...|+-++.... T Consensus 98 ~~l~~~~i~~~~~~~~~~~~~~~~~l~~~i~~~K~~~~l~~~~~l~ls~tl~P~~~eYQNfGIMqA~D~iNAl~~lk~~~ 177 (403) T pfam11144 98 KSLKAINIDIKPINTYDNAEEIFEYLNKKIKELKQAGILDENYKLNLSVTLIPPKNEYQNFGIMQALDIINALKYLKKRF 177 (403) T ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99998599853334614399999999999997665065562503557877548974301110889999999999999855 Q ss_pred CC---CCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCC------------------------------------ Q ss_conf 54---6740588872036878764503862-1330122757443------------------------------------ Q gi|254781093|r 97 PE---SKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKS------------------------------------ 136 (225) Q Consensus 97 ~~---~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~------------------------------------ 136 (225) +. .-++++.|.|+||++|.+.|.-.|- +.++|.-|..+.. T Consensus 178 ~~~~~~lp~I~~G~SyGGYLa~L~AKIAPw~vdgVIDNSs~a~p~~~~I~GrE~~~~~~~~~~~~~~~~~i~~~~~~KT~ 257 (403) T pfam11144 178 PKFGGNLPVILIGSSYGGYLANLIAKIAPWYVDGVIDNSSYALPLLRYIFGREIDYTEYICFADFFFFKHIRLYCFDKTY 257 (403) T ss_pred CCCCCCCCEEEECCCCHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHCCEECCCCEEECCCCCCCCCCEEEEEEECCC T ss_conf 66678888899877838999999987483202579824656451356532500364123302354435864799972231 Q ss_pred CC--------CC--------CCCC---------CCCCE-EEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf 33--------22--------2343---------34773-7761686550289999999999860779960799978 Q gi|254781093|r 137 YD--------FS--------FLAP---------CPSSG-LIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIP 186 (225) Q Consensus 137 ~~--------~~--------~l~~---------~~~p~-LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ 186 (225) |. |+ .|.+ .+.+. ...|+..|++.|.+.=.++++.+.. -|-+++++.+. T Consensus 258 Wt~n~~sp~~F~~a~~~IR~~Ln~~HL~iqs~y~~~~~yvsYHs~~D~~~p~~~K~~l~~~~~~-lgfda~l~lik 332 (403) T pfam11144 258 WTRNKNSPYYFSNARYLIRAILNKDHLIIQSNYNPNTIYVSYHSIKDELAPAKDKEELYDILKE-LGFDATLHLIK 332 (403) T ss_pred CCCCCCCCCCCCHHHHHHHHHCCHHHHHHHHCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHH-CCCCEEEEEEC T ss_conf 2357898555566889998760887999986448972899984245667897899999999997-69963898734 No 112 >COG3243 PhaC Poly(3-hydroxyalkanoate) synthetase [Lipid metabolism] Probab=98.48 E-value=7.3e-06 Score=55.32 Aligned_cols=185 Identities=18% Similarity=0.191 Sum_probs=111.6 Q ss_pred CCCCC-CCCEEEEECCCCC-C---CCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 68887-7879999578998-8---88998999999999999739849998531558876778521023899999899998 Q gi|254781093|r 19 PSTNP-NAPIALILHPHPR-F---GGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQ 93 (225) Q Consensus 19 ~~~~~-~~~~vv~~Hp~p~-~---GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~ 93 (225) |.... ....++++||+=- | -=+.++ .+.+.|.++|..|...+.|+-..+++......--.+++..+++.++ T Consensus 100 P~te~v~~~PlLiVpP~iNk~yI~DL~~~~----s~V~~l~~~g~~vfvIsw~nPd~~~~~~~~edYi~e~l~~aid~V~ 175 (445) T COG3243 100 PLTEKVLKRPLLIVPPWINKFYILDLSPEK----SLVRWLLEQGLDVFVISWRNPDASLAAKNLEDYILEGLSEAIDTVK 175 (445) T ss_pred CCCCCCCCCCEEEECCCCCCEEEEECCCCC----HHHHHHHHCCCCEEEEECCCCHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 787756788448634433752478678774----0899999759836899615860765534699999999999999999 Q ss_pred HHCCCCCCEEEEEECHHHHHHHHHHHCCC--CHHEEEECCCCCCCCC--------------------------------- Q ss_conf 62354674058887203687876450386--2133012275744333--------------------------------- Q gi|254781093|r 94 SLNPESKSCWIAGYSFGAWISMQLLMRRP--EINGFISVAPQPKSYD--------------------------------- 138 (225) Q Consensus 94 ~~~~~~~~i~l~G~S~Gg~val~~a~~~p--~i~~~v~isp~~~~~~--------------------------------- 138 (225) +... -++|-++|||.||..++.+++-.+ +|+.+.+...+.+... T Consensus 176 ~itg-~~dInliGyC~GGt~~~~a~a~~~~~~I~S~T~~ts~~Df~~~g~l~~f~~~~~~~~~~~~~~~~g~ipg~~ma~ 254 (445) T COG3243 176 DITG-QKDINLIGYCVGGTLLAAALALMAAKRIKSLTLLTSPVDFSHAGDLGIFANEATIEALDADIVQKGILPGWYMAI 254 (445) T ss_pred HHHC-CCCCCEEEECCCHHHHHHHHHHHHHCCCCEEEEEECCHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 8747-651235546145699999999744312244677732010035751001268789999986422216788089999 Q ss_pred ---------------------------C-----------------------------------------CCCCCCCCCEE Q ss_conf ---------------------------2-----------------------------------------22343347737 Q gi|254781093|r 139 ---------------------------F-----------------------------------------SFLAPCPSSGL 150 (225) Q Consensus 139 ---------------------------~-----------------------------------------~~l~~~~~p~L 150 (225) | -+|.....|++ T Consensus 255 ~F~mLrp~dliw~~fV~nyl~ge~pl~fdllyWn~Dst~~~~~~~~~~Lrn~y~~N~L~~g~~~v~G~~VdL~~It~pvy 334 (445) T COG3243 255 VFFLLRPNDLIWNYFVNNYLDGEQPLPFDLLYWNADSTRLPGAAHSEYLRNFYLENRLIRGGLEVSGTMVDLGDITCPVY 334 (445) T ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHCHHHCCCEEECCEEECHHHCCCCEE T ss_conf 99863931101688898861788887236788607786576689999999999737443065488888942232664238 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC--CCCC-----HHH---HHHHHHHHHHHHCCC Q ss_conf 76168655028999999999986077996079997879847--5585-----899---999999999973122 Q gi|254781093|r 151 IINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF--FIGK-----VDE---LINECAHYLDNSLDE 213 (225) Q Consensus 151 iIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~--f~~~-----~~~---l~~~i~~fl~~~L~~ 213 (225) .+.+++|.++|.+.++...+.+. + .++++.. +++|+ ..++ .+. +......|+...-+. T Consensus 335 ~~a~~~DhIaP~~Sv~~g~~l~~---g-~~~f~l~-~sGHIa~vVN~p~~~k~~~w~n~~~~~~~Wl~~a~~~ 402 (445) T COG3243 335 NLAAEEDHIAPWSSVYLGARLLG---G-EVTFVLS-RSGHIAGVVNPPGNAKYQYWTNLPADAEAWLSGAKEH 402 (445) T ss_pred EEECCCCCCCCHHHHHHHHHHCC---C-CEEEEEE-CCCEEEEEECCCCHHHHHCCCCCCCCHHHHHHHCCCC T ss_conf 86303465566898988987549---9-6499983-6860788857823566634689832099999861558 No 113 >pfam10230 DUF2305 Uncharacterized conserved protein (DUF2305). This family of proteins is conserved from plants to humans. The function is unknown. Probab=98.45 E-value=1.5e-05 Score=53.40 Aligned_cols=105 Identities=20% Similarity=0.341 Sum_probs=63.9 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHH---CCCEEEEEEECCCCCCCCCCCCC--CHHHHHHHHHHHHHHHHCCC Q ss_conf 78799995789988889989999999999997---39849998531558876778521--02389999989999862354 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQ---RGFVSLRFNFRGIGRSEGEFDYG--DGELSDAAAALDWVQSLNPE 98 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~---~G~~vl~fd~RG~G~S~G~~~~g--~~E~~D~~aa~~~l~~~~~~ 98 (225) ++.++++..++|+.=+- .....+.|.+ ..|.++...+.|+-.+.+....+ .+--+.+.-.++++++.-+. T Consensus 1 p~~li~~IPGNPGlv~f-----Y~~Fl~~L~~~l~~~~~V~~ishaGh~~~~~~~~~~~~~~L~~QI~hk~~fi~~~~~~ 75 (260) T pfam10230 1 PRPLILLIPGNPGLVGF-----YRTFLSLLHEKLNPSFDVLGISHAGHTLSPGNNKNERVFSLQDQIEHKIDFLRAFLPK 75 (260) T ss_pred CCEEEEEECCCCCHHHH-----HHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 96799998999975999-----9999999998619986599977887768865666688647799999999999998545 Q ss_pred --CCCEEEEEECHHHHHHHHHHHCCC---CHHEEEECCCC Q ss_conf --674058887203687876450386---21330122757 Q gi|254781093|r 99 --SKSCWIAGYSFGAWISMQLLMRRP---EINGFISVAPQ 133 (225) Q Consensus 99 --~~~i~l~G~S~Gg~val~~a~~~p---~i~~~v~isp~ 133 (225) ..+++|+|||.|+++++.+..+.+ +|.+.+++-|. T Consensus 76 ~~~~k~iLiGHSIGayi~leil~r~~~~~~I~~~~lLfPT 115 (260) T pfam10230 76 KTDVKLILIGHSIGAYIALEVLKRLSLKFRIKKCVLLFPT 115 (260) T ss_pred CCCCCEEEEECCHHHHHHHHHHHHCCCCCCEEEEEEECCC T ss_conf 6898579983346899999999854557656789876775 No 114 >PRK13604 luxD acyl transferase; Provisional Probab=98.43 E-value=9.1e-06 Score=54.73 Aligned_cols=180 Identities=18% Similarity=0.259 Sum_probs=121.1 Q ss_pred EECCCCCE-EEEEEECCCC--CCC-CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC-CCCCCCCC---CC Q ss_conf 96589961-8799966888--778-79999578998888998999999999999739849998531-55887677---85 Q gi|254781093|r 5 VFNGPSGR-LEGRYQPSTN--PNA-PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR-GIGRSEGE---FD 76 (225) Q Consensus 5 ~i~g~~G~-l~~~~~~~~~--~~~-~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R-G~G~S~G~---~~ 76 (225) +|.-.+|| +..|-++|+. +.+ .++++.- +|.-.|++ +..+|+.|...||.|+|+|-- -+|-|.|+ |+ T Consensus 11 vi~l~~~~~I~VWet~Pk~~~~~rnnTiliAs---GFarRMdh--fAglAeYLs~NGFhViRYDsLhHvGlSSG~i~eFt 85 (305) T PRK13604 11 VICLENGQSIRVWETLPKENSPKKNNTILIAS---GFARRMDH--FAGLAEYLSSNGFHVIRYDSLHHVGLSSGTIDEFT 85 (305) T ss_pred EEEECCCCEEEEEECCCCCCCCCCCCEEEEEH---HHHHHHHH--HHHHHHHHHHCCEEEEEECCCCCCCCCCCCHHHEE T ss_conf 68836997899984488667765577299802---67887788--89999998508738998456544687876562503 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCC---------C---------- Q ss_conf 210238999998999986235467405888720368787645038621330122757443---------3---------- Q gi|254781093|r 77 YGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKS---------Y---------- 137 (225) Q Consensus 77 ~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~---------~---------- 137 (225) +..| ..-+..|++|++.++. .++.++--|+-+.+|...+.... +.-+|..-..++. | T Consensus 86 MsiG-k~Sll~VidWLk~rgi--~~iGlIAaSLSARIAY~v~~eid-lsfLiTAVGVvnLR~TLEkal~yDyl~lpi~~l 161 (305) T PRK13604 86 MSIG-KNSLLTVVDWLKTRGI--QNIGLIAASLSARIAYEVINEID-LSFLITAVGVVNLRDTLEKALKYDYLQLPINEL 161 (305) T ss_pred EECC-HHHHHHHHHHHHHCCC--CHHHHHHHHHHHHHHHHHHHHCC-HHEEEEEEEEEEHHHHHHHHHCCCHHCCCHHHC T ss_conf 3013-4569999999986597--42344677678899999874212-320455435553799999984875310894469 Q ss_pred ----CCC--------------------------CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf ----322--------------------------23433477377616865502899999999998607799607999787 Q gi|254781093|r 138 ----DFS--------------------------FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPD 187 (225) Q Consensus 138 ----~~~--------------------------~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~g 187 (225) +|. .+.....|........|+-|.-+++.++.+.+...+ ..+-.+.| T Consensus 162 PedldFEghnlgs~vFvtDcf~~~Wdtl~sT~~~~~~l~iPfIAFtAn~DdWV~q~EV~~~l~~i~s~~---~klysLiG 238 (305) T PRK13604 162 PEDLDFEGHNLGSEVFVTDCFKHGWDTLDSTINKVKGLDIPFIAFTANDDSWVKQSEVIDLLDSIRSEK---CKLYSLIG 238 (305) T ss_pred CCCCCCCCCCCCCEEEEEHHHHCCCCCHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCC---CEEEEEEC T ss_conf 652454565456406760176648970676999850789867999846963440899999997178875---15888624 Q ss_pred CCCCCCCCH Q ss_conf 984755858 Q gi|254781093|r 188 ANHFFIGKV 196 (225) Q Consensus 188 a~H~f~~~~ 196 (225) .+|-.-... T Consensus 239 SsHdLgeNl 247 (305) T PRK13604 239 SSHDLGENL 247 (305) T ss_pred CCCCCCCCH T ss_conf 644321154 No 115 >pfam02273 Acyl_transf_2 Acyl transferase. This bacterial family of Acyl transferases (or myristoyl-acp-specific thioesterases) catalyse the first step in the bioluminescent fatty acid reductase system. Probab=98.42 E-value=1.1e-05 Score=54.31 Aligned_cols=180 Identities=18% Similarity=0.256 Sum_probs=121.4 Q ss_pred EECCCCCE-EEEEEECCCCC--CC-CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC-CCCCCCCC---CC Q ss_conf 96589961-87999668887--78-79999578998888998999999999999739849998531-55887677---85 Q gi|254781093|r 5 VFNGPSGR-LEGRYQPSTNP--NA-PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR-GIGRSEGE---FD 76 (225) Q Consensus 5 ~i~g~~G~-l~~~~~~~~~~--~~-~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R-G~G~S~G~---~~ 76 (225) +|.-.+|| +..|-++|+.. .+ .++++.- +|.-.|++ +..+|+.|...||.|+|+|-- -+|-|.|+ |+ T Consensus 6 vi~l~~~r~I~VWet~Pk~~~~~rnnTiliAs---GFarRMdh--fAglAeYLs~NGFhViRYDsLhHvGlSSG~i~eFt 80 (294) T pfam02273 6 VIRVNNNREIHVWETPPKENVPKRNNTIVIAS---GFARRMDH--FAGLAEYLSTNGFHVIRYDSLHHVGLSSGEIDEFT 80 (294) T ss_pred EEEECCCCEEEEEECCCCCCCCCCCCEEEEEH---HHHHHHHH--HHHHHHHHHCCCEEEEEECCCCCCCCCCCCHHHEE T ss_conf 79936997899973588667765477299803---68887788--89999998508738998456544687866552513 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCC-------------------- Q ss_conf 210238999998999986235467405888720368787645038621330122757443-------------------- Q gi|254781093|r 77 YGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKS-------------------- 136 (225) Q Consensus 77 ~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~-------------------- 136 (225) +..| ..-+..|++|++.++. .++.++--|+-+.+|...+... ++.-+|..-..++. T Consensus 81 MsiG-k~Sll~VidWLk~rgi--~~lGlIAaSLSARIAY~v~~ei-~lsfLiTAVGVvnLR~TLEkal~~Dyl~lpi~~l 156 (294) T pfam02273 81 MSVG-KNSLLTVIDWLKTRGI--NNIGLIASSLSARIAYEVISEI-NLSFLITAVGVVNLRDTLEKALGFDYLSLPIDEL 156 (294) T ss_pred EECC-HHHHHHHHHHHHHCCC--CHHHHHHHHHHHHHHHHHHHHC-CHHEEEEEEEEEEHHHHHHHHHCCCHHCCCHHHC T ss_conf 3013-4469999999986597--3334457767789999987421-3211666545553799999984875211894469 Q ss_pred ---CCCC--------------------------CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf ---3322--------------------------23433477377616865502899999999998607799607999787 Q gi|254781093|r 137 ---YDFS--------------------------FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPD 187 (225) Q Consensus 137 ---~~~~--------------------------~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~g 187 (225) .+|. .+.....|........|+-|.-+++.++...+...+ ..+-.+.| T Consensus 157 PedldFEgh~lgs~vFvtDcf~~~Wdtl~sT~~~~~~l~iPfIAFtAn~DdWV~q~EV~~~l~~i~s~~---~klysL~G 233 (294) T pfam02273 157 PEDLDFEGHKLGSEVFVRDCFENNWDTLDSTINKVANLDIPFIAFTANNDDWVKQHEVYDLLSNIRSDK---CKIYSLLG 233 (294) T ss_pred CCCCCCCCCCCCCEEEEEHHHHCCCCCHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCC---CEEEEEEC T ss_conf 652454565456406760176648970675898850689857999846963440899999997179875---15888624 Q ss_pred CCCCCCCCH Q ss_conf 984755858 Q gi|254781093|r 188 ANHFFIGKV 196 (225) Q Consensus 188 a~H~f~~~~ 196 (225) .+|-.-... T Consensus 234 SsHdLgeNl 242 (294) T pfam02273 234 SSHDLGENL 242 (294) T ss_pred CCCCCCCCH T ss_conf 644321154 No 116 >PRK10439 enterobactin/ferric enterobactin esterase; Provisional Probab=98.42 E-value=2.5e-05 Score=52.10 Aligned_cols=187 Identities=17% Similarity=0.188 Sum_probs=108.4 Q ss_pred EEEECCCC--CCCCEEEEECCCCCCCCCC-CCH-HHHHHHHHHHHCC----CEEEEEEECCCC-CCCCCCCCCCHHHHH- Q ss_conf 99966888--7787999957899888899-899-9999999999739----849998531558-876778521023899- Q gi|254781093|r 15 GRYQPSTN--PNAPIALILHPHPRFGGTM-NDN-IVYQLFYLFQQRG----FVSLRFNFRGIG-RSEGEFDYGDGELSD- 84 (225) Q Consensus 15 ~~~~~~~~--~~~~~vv~~Hp~p~~GG~~-~~~-~~~~la~~l~~~G----~~vl~fd~RG~G-~S~G~~~~g~~E~~D- 84 (225) -+|.+++. ...|++|++- |.. .++ -+...-..+.+.| ..++.+|.-+.- |++ +... +.+..+ T Consensus 180 wiy~p~~~~~~~~PLlvL~D------G~~w~~~~~~~~~Ld~l~~~g~ipp~~~v~id~~d~~~R~~-eL~~-n~~f~~~ 251 (398) T PRK10439 180 WIFTTGDAAAEERPLAVLLD------GQFWAQSMPVWPALTSLTHRGQLPPAVYVLIDAIDTTHRSQ-ELPC-NADFWLA 251 (398) T ss_pred EEECCCCCCCCCCCEEEEEC------HHHHCCCCCHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHH-HCCC-CHHHHHH T ss_conf 99648877887765699962------08950038878999999975999962999967898678776-6499-8889999 Q ss_pred H-HHHHHHHHHHC---CCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCC---------CCCC-----CCC Q ss_conf 9-99899998623---546740588872036878764503862-133012275744333---------2223-----433 Q gi|254781093|r 85 A-AAALDWVQSLN---PESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYD---------FSFL-----APC 145 (225) Q Consensus 85 ~-~aa~~~l~~~~---~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~---------~~~l-----~~~ 145 (225) + ...+-|++++. .++.+.+|+|-||||..|+.+|.+.|+ +..+++.|+-.--.+ ...+ .+. T Consensus 252 l~~eLLP~~~~~~~~~~~~~~TvVaGqS~GGLaAl~aaL~~Pe~FG~VlsqSGSfWWp~~~~~~~g~l~~~~~~~~~~~~ 331 (398) T PRK10439 252 VQQELLPQVRAIAPFSDRADRTVVAGQSFGGLSALYAGLHWPERFGCVLSQSGSFWWPHRGGQQEGVLLEQLKAGEVSAR 331 (398) T ss_pred HHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCEEEECCCCEECCCCCCCCCHHHHHHHHHCCCCCC T ss_conf 99988899998768887842359963573069999999849231276687465211167777771479999983358988 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 4773776168655028999999999986077996079997879847558589999999999997312 Q gi|254781093|r 146 PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSLD 212 (225) Q Consensus 146 ~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~ 212 (225) +....+-.|..+..+ .+..+.+++.|.. ++..+++.+++| +|-|-.....|.+.+...|...+. T Consensus 332 ~lri~l~~G~~E~~i-~~~n~~l~~~L~~-~g~~v~~~~~~G-GHD~~~WR~~L~dGL~~ll~p~~~ 395 (398) T PRK10439 332 GLRIVLEAGVREPMI-MQANQALYAQLHP-AGHSIFWRQVDG-GHDALCWRGGLIQGLIDLWQPLFH 395 (398) T ss_pred CEEEEEECCCCCHHH-HHHHHHHHHHHHH-CCCCEEEEEECC-CCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 808999666885688-9999999999997-399637998479-866788877899999998635502 No 117 >pfam11339 DUF3141 Protein of unknown function (DUF3141). This family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=98.29 E-value=9.5e-05 Score=48.61 Aligned_cols=158 Identities=23% Similarity=0.288 Sum_probs=102.4 Q ss_pred CCCCCEEEEEC--CC-CCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHH----HHHH Q ss_conf 87787999957--89-98888998999999999999739849998531558876778521023899999899----9986 Q gi|254781093|r 22 NPNAPIALILH--PH-PRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALD----WVQS 94 (225) Q Consensus 22 ~~~~~~vv~~H--p~-p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~----~l~~ 94 (225) ...+|++||-. +| |+-||.+.+..+ - ...+.|..|+-.-|+-. ...-.-++|+..+.. -+.+ T Consensus 66 ~~kRP~vViDPRAGHGpGIGGFK~DSeV---G-vAL~~GHPvYFV~F~p~-------P~pGQTl~DV~~Aea~Fv~~V~~ 134 (581) T pfam11339 66 PTKRPFVVIDPRAGHGPGIGGFKPDSEI---G-VALRAGHPCYFVGFLPD-------PEPGQTLEDVMRAEAAFLREVIE 134 (581) T ss_pred CCCCCEEEECCCCCCCCCCCCCCCCCHH---H-HHHHCCCCEEEEEECCC-------CCCCCCHHHHHHHHHHHHHHHHH T ss_conf 8889859979988889987788742088---8-99866998699987469-------98988899999999999999998 Q ss_pred HCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCC----------------------------------C-- Q ss_conf 2354674058887203687876450386213301227574433----------------------------------3-- Q gi|254781093|r 95 LNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSY----------------------------------D-- 138 (225) Q Consensus 95 ~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~----------------------------------~-- 138 (225) ++|+..+-+|+|-|.||+.++++|+.+|++.|-+.+...+.+| | T Consensus 135 ~HP~a~kP~viGNCQaGWa~~~lAA~~pdl~GPivlnGaPlSYWaG~~G~NPMRY~gGllGGsW~a~l~sDLG~G~FDGA 214 (581) T pfam11339 135 LHPDAPKPAVIGNCQAGWALLMLAALRPDLFGPIIVAGAPLSYWAGVRGKNPMRYSGGLLGGSWLTALTSDLGNGRFDGA 214 (581) T ss_pred HCCCCCCCEEEECCHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCCCCCCCHHHCCCCCCHHHHHHHHHCCCCEECHH T ss_conf 48999998689416899999999852878768766648874101677789951454045310889999876169811508 Q ss_pred -----CC----------------------------------------------------------------------CCC Q ss_conf -----22----------------------------------------------------------------------234 Q gi|254781093|r 139 -----FS----------------------------------------------------------------------FLA 143 (225) Q Consensus 139 -----~~----------------------------------------------------------------------~l~ 143 (225) |+ +|. T Consensus 215 ~LVqNFE~LNPant~W~KyY~Lya~vDtE~~RfLeFErWWgg~~~ln~~Ei~~Iv~nLFVGNrL~~g~~~~~~G~~~DLr 294 (581) T pfam11339 215 WLVQNFENLNPANTLWGKYYNLYAKVDTEAPRFLEFERWWGGHVLLNGEEIQWIVDNLFVGNRLATGELRTSDGRRIDLR 294 (581) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCEECCHHHHHHHHHHHHCCCCCCCCCEECCCCCEEEHH T ss_conf 99866550590156788765665326764244444766518514327999999998753024003582563899575343 Q ss_pred CCCCCEEEEECCCCCCCCHHHHHHHHH-------HHHHCCCCCEEEEEECCCCCC Q ss_conf 334773776168655028999999999-------986077996079997879847 Q gi|254781093|r 144 PCPSSGLIINGSNDTVATTSDVKDLVN-------KLMNQKGISITHKVIPDANHF 191 (225) Q Consensus 144 ~~~~p~LiIhG~~D~~vp~~~~~~~~~-------~l~~~~~~~~~~~~i~ga~H~ 191 (225) ....|+.|-...-|+++|++++..|.. .++ +.+..+-+.+=+..+|. T Consensus 295 ~IrsPIvvFaS~GDNITPPqQAL~WI~dlY~~~~ei~-a~gQ~IVY~~H~~vGHL 348 (581) T pfam11339 295 NIRSPIVVFCSYGDNITPPQQALNWIADLYADVEEIR-AHGQTIVYCVHESVGHL 348 (581) T ss_pred HCCCCEEEEECCCCCCCCCHHHHCCHHHHCCCHHHHH-HCCCEEEEEECCCCCCE T ss_conf 2678889993368889996577562887648999998-58987999856987736 No 118 >pfam00135 COesterase Carboxylesterase. Probab=98.29 E-value=8.3e-06 Score=54.96 Aligned_cols=116 Identities=25% Similarity=0.288 Sum_probs=70.7 Q ss_pred EEECCC---CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC-CC-CC-CCCCC-CCCCHHHHHHHHH Q ss_conf 996688---877879999578998888998999999999999739849998531-55-88-76778-5210238999998 Q gi|254781093|r 16 RYQPST---NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR-GI-GR-SEGEF-DYGDGELSDAAAA 88 (225) Q Consensus 16 ~~~~~~---~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R-G~-G~-S~G~~-~~g~~E~~D~~aa 88 (225) +|.|.. ..+-||+|..|+-....|+.... ...-...+++.++.|+.+||| |. |- |.+.. ..|+.-+.|..+| T Consensus 96 V~~P~~~~~~~~lPV~v~ihGG~f~~Gs~~~~-~~~~~~~~~~~~vIvVt~nYRLg~fGFl~~~~~~~~gN~Gl~Dq~~A 174 (517) T pfam00135 96 VYTPKLASEGKNLPVMVWIHGGGFQSGSASLD-DYDGPDLAASEDVVVVTINYRLGALGFLSTGDSELPGNAGLLDQVLA 174 (517) T ss_pred EEECCCCCCCCCCCEEEEEECCCCCCCCCCCC-CCCCHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 88789988889964899991686310678886-66645776129979998577356343556798888875327999999 Q ss_pred HHHHHHH----CCCCCCEEEEEECHHHHHHHHHH-HCCC--CHHEEEECCC Q ss_conf 9999862----35467405888720368787645-0386--2133012275 Q gi|254781093|r 89 LDWVQSL----NPESKSCWIAGYSFGAWISMQLL-MRRP--EINGFISVAP 132 (225) Q Consensus 89 ~~~l~~~----~~~~~~i~l~G~S~Gg~val~~a-~~~p--~i~~~v~isp 132 (225) ++|+++. +.|+++|-|+|+|-||..+..+. +... -..+.|+.|. T Consensus 175 L~WV~~nI~~FGGDP~~VTi~G~SAGa~sv~~ll~sp~~~gLF~~aI~qSG 225 (517) T pfam00135 175 LRWVKDNIAAFGGDPDNVTLFGESAGAASVSLLLLSPLSRGLFHRAILMSG 225 (517) T ss_pred HHHHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHHCCCCHHHHHHHHHCCC T ss_conf 999999999838898748997763288888788749441376773775168 No 119 >COG3946 VirJ Type IV secretory pathway, VirJ component [Intracellular trafficking and secretion] Probab=98.25 E-value=0.00012 Score=48.09 Aligned_cols=148 Identities=18% Similarity=0.162 Sum_probs=82.0 Q ss_pred HHHHHHHHHCCCEEEEEEE-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC--- Q ss_conf 9999999973984999853-155887677852102389999989999862354674058887203687876450386--- Q gi|254781093|r 47 YQLFYLFQQRGFVSLRFNF-RGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP--- 122 (225) Q Consensus 47 ~~la~~l~~~G~~vl~fd~-RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p--- 122 (225) -.++..|+++|+.|+-.|- |=.+ |+-+. ..-..|+..++++-+.+ ...+++.|+|||||+-+-=.+..+.| T Consensus 277 k~v~~~l~~~gvpVvGvdsLRYfW-~~rtP---e~~a~Dl~r~i~~y~~~-w~~~~~~liGySfGADvlP~~~n~L~~~~ 351 (456) T COG3946 277 KEVAEALQKQGVPVVGVDSLRYFW-SERTP---EQIAADLSRLIRFYARR-WGAKRVLLIGYSFGADVLPFAYNRLPPAT 351 (456) T ss_pred HHHHHHHHHCCCCEEEEEHHHHHH-CCCCH---HHHHHHHHHHHHHHHHH-HCCCEEEEEEECCCCHHHHHHHHHCCHHH T ss_conf 999999997799556430023551-56897---88877799999999886-37643899960466404689987588778 Q ss_pred --CHHEEEE------------------CCCCCCCCCCCCCCCC-CCCEEEEECC--CCCCCCHHHHHHHHHHHHHCCCCC Q ss_conf --2133012------------------2757443332223433-4773776168--655028999999999986077996 Q gi|254781093|r 123 --EINGFIS------------------VAPQPKSYDFSFLAPC-PSSGLIINGS--NDTVATTSDVKDLVNKLMNQKGIS 179 (225) Q Consensus 123 --~i~~~v~------------------isp~~~~~~~~~l~~~-~~p~LiIhG~--~D~~vp~~~~~~~~~~l~~~~~~~ 179 (225) .++-+.+ .+.-...-....++.. +.-+..|+|. +|+.||-. +... T Consensus 352 r~~v~~~~ll~l~~~~~fe~~v~gWlg~~~~g~~~~~~~~~~l~~~~v~CiYG~~e~d~~Cp~l------------~~~~ 419 (456) T COG3946 352 RQRVRMVSLLGLGRTADFEISVEGWLGMAGEGAGDVVPDIAKLPLARVQCIYGQEEKDTACPSL------------KAKG 419 (456) T ss_pred HHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCCCCCCHHHHHCCCCEEEEEECCCCCCCCCCCC------------HHHC T ss_conf 8899999997415543279999655303776777764255507744068874576655668764------------1114 Q ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 079997879847558589999999999997312 Q gi|254781093|r 180 ITHKVIPDANHFFIGKVDELINECAHYLDNSLD 212 (225) Q Consensus 180 ~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~ 212 (225) ++.+.+||..| |.+..+.|.+.|.+=++.+.+ T Consensus 420 ~~~v~lpGgHH-Fd~dy~~la~~il~~~~~r~~ 451 (456) T COG3946 420 VDTVKLPGGHH-FDGDYEKLAKAILQGMRLRAP 451 (456) T ss_pred CEEEECCCCCC-CCCCHHHHHHHHHHHHHHCCC T ss_conf 34674478766-675289999999998875168 No 120 >cd00707 Pancreat_lipase_like Pancreatic lipase-like enzymes. Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation," the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site. Probab=98.24 E-value=2e-05 Score=52.72 Aligned_cols=112 Identities=15% Similarity=0.269 Sum_probs=73.1 Q ss_pred CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHC-CCEEEEEEECCCCCCCCCCCCCCHH----HHHHHHHHHHHHHH- Q ss_conf 87787999957899888899899999999999973-9849998531558876778521023----89999989999862- Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQR-GFVSLRFNFRGIGRSEGEFDYGDGE----LSDAAAALDWVQSL- 95 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~-G~~vl~fd~RG~G~S~G~~~~g~~E----~~D~~aa~~~l~~~- 95 (225) .+.+|+.++.|+ |.++.+...+..+.+++.++ .++|+..|.+. + +.-.+.....- -+.+...+++|.+. T Consensus 33 n~~~pt~iiiHG---~~~~~~~~~~~~~~~a~l~~~d~NVi~VDW~~-~-a~~~Y~~a~~n~~~Vg~~va~~i~~L~~~~ 107 (275) T cd00707 33 NPSRPTRFIIHG---WTSSGEESWISDLRKAYLSRGDYNVIVVDWGR-G-ANPNYPQAVNNTRVVGAELAKFLDFLVDNT 107 (275) T ss_pred CCCCCEEEEECC---CCCCCCCCHHHHHHHHHHHCCCCEEEEEECCC-C-CCCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 788986999898---79899874799999999863893899998820-0-161099999979999999999999999742 Q ss_pred CCCCCCEEEEEECHHHHHHHHHHHCC-CCHHEEEECCCCCCCCC Q ss_conf 35467405888720368787645038-62133012275744333 Q gi|254781093|r 96 NPESKSCWIAGYSFGAWISMQLLMRR-PEINGFISVAPQPKSYD 138 (225) Q Consensus 96 ~~~~~~i~l~G~S~Gg~val~~a~~~-p~i~~~v~isp~~~~~~ 138 (225) +.+.+.+.|+|||+|+.+|-.+...- ..+..+..+-|+...+. T Consensus 108 g~~~~~ihlIGhSLGAHiaG~aG~~~~~~l~rItGLDPA~P~F~ 151 (275) T cd00707 108 GLSLENVHLIGHSLGAHVAGFAGKRLNGKLGRITGLDPAGPLFS 151 (275) T ss_pred CCCHHCEEEEEECHHHHHHHHHHHHHCCCCCEEEECCCCCCCCC T ss_conf 98821058995143577877888860774262662378765346 No 121 >pfam10142 PhoPQ_related PhoPQ-activated pathogenicity-related protein. Members of this family of bacterial proteins are involved in the virulence of some pathogenic proteobacteria. Probab=98.21 E-value=6.5e-06 Score=55.60 Aligned_cols=119 Identities=16% Similarity=0.312 Sum_probs=87.9 Q ss_pred HHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECC---------------------CCCCCC----------- Q ss_conf 999862354674058887203687876450386213301227---------------------574433----------- Q gi|254781093|r 90 DWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVA---------------------PQPKSY----------- 137 (225) Q Consensus 90 ~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~is---------------------p~~~~~----------- 137 (225) +++++.+...++.++.|.|-=|+.+..+|+-+++|.+++-+. ++...| T Consensus 158 ~~~~~~~~~i~~FvV~GaSKRGWttWltaa~D~RV~ai~P~Vid~ln~~~~~~h~~~~yG~ws~a~~dY~~~gi~~~l~t 237 (360) T pfam10142 158 EFLKTLGIKIKKFVVTGASKRGWTTWLTAAVDPRVAAIAPMVIDILNMRAQLKHQYESYGNWSEALKDYVAEGIDEQLNT 237 (360) T ss_pred HHHHHCCCCCHHEEEECCCCHHHHHHHHHHCCCCEEEEEEEEECCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHCC T ss_conf 99976488601169945452108988876417561479768870446089999999975689667776777183333089 Q ss_pred -----------CCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHH Q ss_conf -----------322234334773776168655028999999999986077996079997879847558589999999999 Q gi|254781093|r 138 -----------DFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHY 206 (225) Q Consensus 138 -----------~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~f 206 (225) ++..+.....|.++|-|+.|++-.+....-+++.++..| .+..+|+++|...+ .++.+.+..| T Consensus 238 ~~f~~L~~ivDP~~Y~~rl~~PKyiinatgDeFf~pDs~~~y~~~L~G~K----~lryvPN~~H~l~~--~~~~~~l~sf 311 (360) T pfam10142 238 PAFKQLLQIVDPYSYRQRLTIPKYIINATGDEFFVPDSTNFYFDDLPGEK----ALRYVPNASHGLIN--TKVLESLVPF 311 (360) T ss_pred HHHHHHHHHCCHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHCCCCCE----EEEECCCCCCCCCC--HHHHHHHHHH T ss_conf 78999988608275686278561898447896304684345561489864----69976889877651--2389999999 Q ss_pred HHHHCCCC Q ss_conf 99731221 Q gi|254781093|r 207 LDNSLDEK 214 (225) Q Consensus 207 l~~~L~~~ 214 (225) ..+.++.. T Consensus 312 ~~~~~~~~ 319 (360) T pfam10142 312 FKRVQAGR 319 (360) T ss_pred HHHHHCCC T ss_conf 99997599 No 122 >cd00312 Esterase_lipase Esterases and lipases (includes fungal lipases, cholinesterases, etc.) These enzymes act on carboxylic esters (EC: 3.1.1.-). The catalytic apparatus involves three residues (catalytic triad): a serine, a glutamate or aspartate and a histidine.These catalytic residues are responsible for the nucleophilic attack on the carbonyl carbon atom of the ester bond. In contrast with other alpha/beta hydrolase fold family members, p-nitrobenzyl esterase and acetylcholine esterase have a Glu instead of Asp at the active site carboxylate. Probab=98.16 E-value=1.9e-05 Score=52.82 Aligned_cols=115 Identities=25% Similarity=0.285 Sum_probs=68.5 Q ss_pred EEECCC---CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC-CC-CC-CCCCC-CCCCHHHHHHHHH Q ss_conf 996688---877879999578998888998999999999999739849998531-55-88-76778-5210238999998 Q gi|254781093|r 16 RYQPST---NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR-GI-GR-SEGEF-DYGDGELSDAAAA 88 (225) Q Consensus 16 ~~~~~~---~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R-G~-G~-S~G~~-~~g~~E~~D~~aa 88 (225) +|.|.. ..+.||+|..|+-....|+........++ ....++.|+.+||| |. |- |.++. ..|+--+.|..+| T Consensus 83 V~~P~~~~~~~~lPV~v~ihGG~f~~Gs~~~~~~~~~~--~~~~~vVvVt~nYRLg~~GFl~~~~~~~~gN~Gl~Dq~~A 160 (493) T cd00312 83 VYTPKNTKPGNSLPVMVWIHGGGFMFGSGSLYPGDGLA--REGDNVIVVSINYRLGVLGFLSTGDIELPGNYGLKDQRLA 160 (493) T ss_pred EEECCCCCCCCCCCEEEEECCCCCCCCCCCCCCCHHHH--HCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 88899988889973799974897763787777956898--6599979996346446454666898778875315789999 Q ss_pred HHHHHHH----CCCCCCEEEEEECHHHHHHHHHHHC-CC--CHHEEEECCC Q ss_conf 9999862----3546740588872036878764503-86--2133012275 Q gi|254781093|r 89 LDWVQSL----NPESKSCWIAGYSFGAWISMQLLMR-RP--EINGFISVAP 132 (225) Q Consensus 89 ~~~l~~~----~~~~~~i~l~G~S~Gg~val~~a~~-~p--~i~~~v~isp 132 (225) ++|+++. +.|+++|-|+|+|-||..+..+... .. -..++|+.|. T Consensus 161 L~WV~~nI~~FGGDP~~VTi~G~SAGa~sv~~ll~sp~~~gLF~raI~~SG 211 (493) T cd00312 161 LKWVQDNIAAFGGDPDSVTIFGESAGGASVSLLLLSPDSKGLFHRAISQSG 211 (493) T ss_pred HHHHHHHHHHHCCCCHHCEEECCCCHHHHHHHHHHCCCCHHHHHHHHHCCC T ss_conf 999999999828980671670125088899999868430388888987078 No 123 >COG2382 Fes Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism] Probab=98.14 E-value=2.4e-05 Score=52.22 Aligned_cols=185 Identities=18% Similarity=0.204 Sum_probs=103.2 Q ss_pred EEEEEECCCC---CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHC----CCEEEEEEECCCC-CCCCCCCCCCHH-HH Q ss_conf 8799966888---7787999957899888899899999999999973----9849998531558-876778521023-89 Q gi|254781093|r 13 LEGRYQPSTN---PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQR----GFVSLRFNFRGIG-RSEGEFDYGDGE-LS 83 (225) Q Consensus 13 l~~~~~~~~~---~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~----G~~vl~fd~RG~G-~S~G~~~~g~~E-~~ 83 (225) -..+|.|++- .+.|+.++.|+.-.+- .-+ +......|.+. .-.++-+++--+- |.+ ++-. +.. .+ T Consensus 83 ~~vv~lppgy~~~~k~pvl~~~DG~~~~~---~g~-i~~~~dsli~~g~i~pai~vgid~~d~~~R~~-~~~~-n~~~~~ 156 (299) T COG2382 83 RRVVYLPPGYNPLEKYPVLYLQDGQDWFR---SGR-IPRILDSLIAAGEIPPAILVGIDYIDVKKRRE-ELHC-NEAYWR 156 (299) T ss_pred EEEEEECCCCCCCCCCCEEEEECCHHHHH---CCC-HHHHHHHHHHCCCCCCCEEEECCCCCHHHHHH-HHCC-CHHHHH T ss_conf 69999689988664354799963288886---387-68999999970878884698118777899998-8256-089999 Q ss_pred H-HHHHHHHHHHHCC---CCCCEEEEEECHHHHHHHHHHHCCCCHHE-EEECCCCCCCCCCC---------CC------C Q ss_conf 9-9998999986235---46740588872036878764503862133-01227574433322---------23------4 Q gi|254781093|r 84 D-AAAALDWVQSLNP---ESKSCWIAGYSFGAWISMQLLMRRPEING-FISVAPQPKSYDFS---------FL------A 143 (225) Q Consensus 84 D-~~aa~~~l~~~~~---~~~~i~l~G~S~Gg~val~~a~~~p~i~~-~v~isp~~~~~~~~---------~l------~ 143 (225) . ....+-|+++.++ ++..-+|+|.||||.+++.++...|+..| +++-||.......+ .+ . T Consensus 157 ~L~~eLlP~v~~~yp~~~~a~~r~LaG~SlGG~vsL~agl~~Pe~FG~V~s~Sps~~~~~~~~~~~~~~~~~l~~~~a~~ 236 (299) T COG2382 157 FLAQELLPYVEERYPTSADADGRVLAGDSLGGLVSLYAGLRHPERFGHVLSQSGSFWWTPLDTQPQGEVAESLKILHAIG 236 (299) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCEEECCCCCCHHHHHHHHCCCHHHCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 99988666421217643237775784036232999999845923322232148865567544322110121100232367 Q ss_pred CCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHH Q ss_conf 3347737761686550289999999999860779960799978798475585899999999999 Q gi|254781093|r 144 PCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYL 207 (225) Q Consensus 144 ~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl 207 (225) ....-++..-++.+.+.++.. ++.+.+.. ++.+..+..++| ||-+.-..+.+.+.+...+ T Consensus 237 ~~~~~~l~~g~~~~~~~~pNr--~L~~~L~~-~g~~~~yre~~G-gHdw~~Wr~~l~~~L~~l~ 296 (299) T COG2382 237 TDERIVLTTGGEEGDFLRPNR--ALAAQLEK-KGIPYYYREYPG-GHDWAWWRPALAEGLQLLL 296 (299) T ss_pred CCCEEEEECCCCCCCCCCHHH--HHHHHHHH-CCCCCEEEECCC-CCCHHHHHHHHHHHHHHHH T ss_conf 630478604776311136659--99999986-698513665479-9764675788999999860 No 124 >pfam05990 DUF900 Alpha/beta hydrolase of unknown function (DUF900). This family consists of several hypothetical proteins of unknown function mostly found in Rhizobium species. Members of this family have an alpha/beta hydrolase fold. Probab=98.10 E-value=0.00015 Score=47.41 Aligned_cols=176 Identities=18% Similarity=0.264 Sum_probs=92.5 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC-CCCCHH------HHHHHHHHHHHH Q ss_conf 8877879999578998888998999999999999739849998531558876778-521023------899999899998 Q gi|254781093|r 21 TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF-DYGDGE------LSDAAAALDWVQ 93 (225) Q Consensus 21 ~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~-~~g~~E------~~D~~aa~~~l~ 93 (225) ..+.+-++|+.|++ .-+ -...+.++||.....|+...-+-| .|-|.|+. ++..-+ ..++...+..+. T Consensus 15 ~s~~r~vlvFVHGy---N~~-f~~a~~r~aQl~~d~~~~g~~v~F--SWPS~g~~~~Y~~D~~sa~~s~~~l~~~L~~l~ 88 (230) T pfam05990 15 KSAGKRVLVFVHGY---NNS-FEDAVYRFAQIAHDLGFPGVPVVF--TWPSGASLFGYNYDRESANYSRDALERLLRYLA 88 (230) T ss_pred CCCCCEEEEEECCC---CCC-HHHHHHHHHHHHHHCCCCCCEEEE--ECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 38998599997589---999-899999999999846999746999--688999613448799999999999999999998 Q ss_pred HHCCCCCCEEEEEECHHHHHHHHHHHC------CC----CHHEEEECCCCCCCCCC----CCCCCCCCCEEEEECCCCCC Q ss_conf 623546740588872036878764503------86----21330122757443332----22343347737761686550 Q gi|254781093|r 94 SLNPESKSCWIAGYSFGAWISMQLLMR------RP----EINGFISVAPQPKSYDF----SFLAPCPSSGLIINGSNDTV 159 (225) Q Consensus 94 ~~~~~~~~i~l~G~S~Gg~val~~a~~------~p----~i~~~v~isp~~~~~~~----~~l~~~~~p~LiIhG~~D~~ 159 (225) + .+..++|.|++||||+++.+.+..+ ++ .+..+|+.+|=...-.| ..+.+...++-+....+|.- T Consensus 89 ~-~~~~~~I~ilAHSMG~rl~~~aL~~l~~~~~~~~~~~~i~~viLaApDiD~d~F~~~~~~i~~~~~~itvy~S~~D~A 167 (230) T pfam05990 89 T-TPPVKRIHLIAHSMGTWLVMEALRQLAIEADEPDVVAKIGNVILAAPDIDVDVFKKQIERLGKLSKRFTVYVSRDDRA 167 (230) T ss_pred H-CCCCCEEEEEEECCHHHHHHHHHHHHHHHCCCCCHHHHHHCEEEECCCCCHHHHHHHHHHHHCCCCCEEEEECCCCHH T ss_conf 6-669874899985817899999999999856674044443126774688888999999999734688879998499679 Q ss_pred CCHHHHHHH-HHHHHHC-----------CCC-CEEEEEECCC---CCCCCCCHHHHHHHH Q ss_conf 289999999-9998607-----------799-6079997879---847558589999999 Q gi|254781093|r 160 ATTSDVKDL-VNKLMNQ-----------KGI-SITHKVIPDA---NHFFIGKVDELINEC 203 (225) Q Consensus 160 vp~~~~~~~-~~~l~~~-----------~~~-~~~~~~i~ga---~H~f~~~~~~l~~~i 203 (225) ...+....- ..++... .++ .+++.-+++. +|+|....+.+.+.+ T Consensus 168 L~~S~~l~g~~~RlG~~~p~~~~~~~~~~~v~vID~s~v~~~~~~gH~y~~~sp~v~~~i 227 (230) T pfam05990 168 LKLSRLISGDVPRLGAIDPDAEREELASAGVTVVDLSKVKGGDLLNHSKFASNPEVVQLI 227 (230) T ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 999999728998878688740154575569089989766899987876500799999985 No 125 >COG4782 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.08 E-value=0.00013 Score=47.82 Aligned_cols=107 Identities=21% Similarity=0.361 Sum_probs=70.2 Q ss_pred CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC-CCCC------HHHHHHHHHHHHHHH Q ss_conf 877879999578998888998999999999999739849998531558876778-5210------238999998999986 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEF-DYGD------GELSDAAAALDWVQS 94 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~-~~g~------~E~~D~~aa~~~l~~ 94 (225) ...+-++|+.|+| .-+.++ .+++.++.....|+-....=| +|-|.|++ ++.. ....++..++.++++ T Consensus 113 s~~k~vlvFvHGf---Nntf~d-av~R~aqI~~d~g~~~~pVvF--SWPS~g~l~~Yn~DreS~~~Sr~aLe~~lr~La~ 186 (377) T COG4782 113 SSAKTVLVFVHGF---NNTFED-AVYRTAQIVHDSGNDGVPVVF--SWPSRGSLLGYNYDRESTNYSRPALERLLRYLAT 186 (377) T ss_pred CCCCEEEEEECCC---CCCHHH-HHHHHHHHHHHCCCCCCEEEE--ECCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 6798599998025---873557-789999887631898206999--7478873200014555655338999999999974 Q ss_pred HCCCCCCEEEEEECHHHHHHHHHHH----CCC-----CHHEEEECCCCCC Q ss_conf 2354674058887203687876450----386-----2133012275744 Q gi|254781093|r 95 LNPESKSCWIAGYSFGAWISMQLLM----RRP-----EINGFISVAPQPK 135 (225) Q Consensus 95 ~~~~~~~i~l~G~S~Gg~val~~a~----~~p-----~i~~~v~isp~~~ 135 (225) ..+ .++|.|+.||||.++++.... +.. .++-+|+-+|-.. T Consensus 187 ~~~-~~~I~ilAHSMGtwl~~e~LrQLai~~~~~l~~ki~nViLAaPDiD 235 (377) T COG4782 187 DKP-VKRIYLLAHSMGTWLLMEALRQLAIRADRPLPAKIKNVILAAPDID 235 (377) T ss_pred CCC-CCEEEEEEECCHHHHHHHHHHHHHCCCCCCHHHHHHHEEEECCCCC T ss_conf 788-8628999854427999999999844687432256651486178877 No 126 >KOG4840 consensus Probab=98.06 E-value=0.0001 Score=48.41 Aligned_cols=167 Identities=16% Similarity=0.137 Sum_probs=101.3 Q ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHC--C Q ss_conf 99999999999739849998531558876778521023899999899998623546740588872036878764503--8 Q gi|254781093|r 44 NIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR--R 121 (225) Q Consensus 44 ~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~--~ 121 (225) .-+..|++.|-+.+|..+.+-.|.+-.-=|+|+- ....+|+..+++++...... ..|+|+|||-|..-.+.+... . T Consensus 53 ~y~~~L~~~lde~~wslVq~q~~Ssy~G~Gt~sl-k~D~edl~~l~~Hi~~~~fS-t~vVL~GhSTGcQdi~yYlTnt~~ 130 (299) T KOG4840 53 LYTTMLNRYLDENSWSLVQPQLRSSYNGYGTFSL-KDDVEDLKCLLEHIQLCGFS-TDVVLVGHSTGCQDIMYYLTNTTK 130 (299) T ss_pred CCHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCC-CCCHHHHHHHHHHHHCCCCC-CCEEEEECCCCCHHHHHHHHHCCC T ss_conf 4288998888643643664001056566565010-23589999999986215765-664997157660579999874222 Q ss_pred C-CHHEEEECCCCCCCC-CC--------------------------------------------------------CCC- Q ss_conf 6-213301227574433-32--------------------------------------------------------223- Q gi|254781093|r 122 P-EINGFISVAPQPKSY-DF--------------------------------------------------------SFL- 142 (225) Q Consensus 122 p-~i~~~v~isp~~~~~-~~--------------------------------------------------------~~l- 142 (225) + .+++.|+.+|...+- .| ++| T Consensus 131 ~r~iraaIlqApVSDrEYqf~~~~etk~l~d~l~a~~~~~grgedv~pR~~~~~~p~sa~R~~dl~~~~G~dDmFSSdLS 210 (299) T KOG4840 131 DRKIRAAILQAPVSDREYQFLEEHETKDLSDLLRAAKETIGRGEDVAPRYGGGSQPLSARRALDLFSRYGKDDMFSSDLS 210 (299) T ss_pred HHHHHHHHHHCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHCCCC T ss_conf 38899998727652155550266345789999999876522354235345888766647788889987187420001232 Q ss_pred -----------CCCCCCEEEEECCCCCCCCHHHHH-HHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf -----------433477377616865502899999-99999860779960799978798475585899999999999973 Q gi|254781093|r 143 -----------APCPSSGLIINGSNDTVATTSDVK-DLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNS 210 (225) Q Consensus 143 -----------~~~~~p~LiIhG~~D~~vp~~~~~-~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~ 210 (225) +..+.-++++.+++|++|+...-. .+..+.++.-+..-..++.. ..|-.++...++.+++..|+.+. T Consensus 211 ~D~f~~~lg~gsta~~qiif~ms~rDEyv~~~~dkk~llnR~s~~~r~~e~~~~ls-g~~~~~n~~~~~sqei~k~i~~~ 289 (299) T KOG4840 211 EDRFYMELGEGSTAGAQIIFVMSGRDEYVKADIDKKLLLNRNSRVERIPEGTHVLS-GAEDVENFLGFLSQEIEKAIGQK 289 (299) T ss_pred HHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCC-CCCCHHHHHHHHHHHHHHHHHHC T ss_conf 66899985467887756999943765446731578999876534206775541012-66417654013059999999843 Q ss_pred CCC Q ss_conf 122 Q gi|254781093|r 211 LDE 213 (225) Q Consensus 211 L~~ 213 (225) ... T Consensus 290 ~~s 292 (299) T KOG4840 290 AQS 292 (299) T ss_pred CCC T ss_conf 788 No 127 >pfam07819 PGAP1 PGAP1-like protein. The sequences found in this family are similar to PGAP1. This is an endoplasmic reticulum membrane protein with a catalytic serine containing motif that is conserved in a number of lipases. PGAP1 functions as a GPI inositol-deacylase; this deacylation is important for the efficient transport of GPI-anchored proteins from the endoplasmic reticulum to the Golgi body. Probab=97.94 E-value=0.00011 Score=48.31 Aligned_cols=127 Identities=18% Similarity=0.287 Sum_probs=70.2 Q ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHC--------CCEEEEEEECCCCCCCCCCCC--CC---HHHHHHHHHHHHHH Q ss_conf 999957899888899899999999999973--------984999853155887677852--10---23899999899998 Q gi|254781093|r 27 IALILHPHPRFGGTMNDNIVYQLFYLFQQR--------GFVSLRFNFRGIGRSEGEFDY--GD---GELSDAAAALDWVQ 93 (225) Q Consensus 27 ~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~--------G~~vl~fd~RG~G~S~G~~~~--g~---~E~~D~~aa~~~l~ 93 (225) .|++.|++ +|++.+ ++.+|..+.+. -+..+..||.. +++. |. -+.+=+..++.++. T Consensus 6 PVLFIPGN---aGSykQ--vRSlas~~~~~~~~~~~~~~ld~ftvDfnE------elsA~~G~~l~~Qaeyv~~aI~~Il 74 (225) T pfam07819 6 PVLFIPGN---AGSYKQ--VRSIASVALRKAELNDNGFHLDFFSVDFNE------ELSAFHGRTLLDQAEYLNDAIRYIL 74 (225) T ss_pred EEEEECCC---CCCHHH--HHHHHHHHHHHHHCCCCCCCCEEEEECCCC------HHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 08986899---886788--989999999987513777542089873453------5553161889999999999999999 Q ss_pred HHC----CCCCCEEEEEECHHHHHHHHHHHCCC-----CHHEEEECCCCCCCCCCCC----------C----------CC Q ss_conf 623----54674058887203687876450386-----2133012275744333222----------3----------43 Q gi|254781093|r 94 SLN----PESKSCWIAGYSFGAWISMQLLMRRP-----EINGFISVAPQPKSYDFSF----------L----------AP 144 (225) Q Consensus 94 ~~~----~~~~~i~l~G~S~Gg~val~~a~~~p-----~i~~~v~isp~~~~~~~~~----------l----------~~ 144 (225) +++ +.++.|+|+||||||.+|-.+.. .| .|..+|.++.|-....+.+ + .. T Consensus 75 ~lY~~~~~~p~sVilvGHSMGGiVAr~~l~-~~~~~~~~V~tIItLstPh~~pp~~~d~~~~~~y~~~~~~w~~~~~~~~ 153 (225) T pfam07819 75 SLYNSGRPPPTSVILIGHSMGGLVARAALT-LPNYIPDSVNTIVTLSSPHAGPPLTFDGDLLKFYALLNEYWRRNQADGD 153 (225) T ss_pred HHHCCCCCCCCCEEEEEECCCHHHHHHHHC-CCCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 985247899971699863241177877751-7433445223788803656588866888899999999999986133567 Q ss_pred CCCCE--E-EEECCCCCCCCHHHH Q ss_conf 34773--7-761686550289999 Q gi|254781093|r 145 CPSSG--L-IINGSNDTVATTSDV 165 (225) Q Consensus 145 ~~~p~--L-iIhG~~D~~vp~~~~ 165 (225) .-..+ + +--|.+|..||.+.. T Consensus 154 ~l~~v~~vSi~GG~rD~~V~s~~t 177 (225) T pfam07819 154 SLSNVGLVSITGGIRDYQVPDDYS 177 (225) T ss_pred CCCCEEEEEECCCCCCCCCCCCCC T ss_conf 666769999347888632384222 No 128 >KOG1516 consensus Probab=97.93 E-value=7.3e-05 Score=49.27 Aligned_cols=93 Identities=26% Similarity=0.320 Sum_probs=62.0 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC-CC-C-CCCCCCCC-CCHHHHHHHHHHHHHHHH----C Q ss_conf 879999578998888998999999999999739849998531-55-8-87677852-102389999989999862----3 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR-GI-G-RSEGEFDY-GDGELSDAAAALDWVQSL----N 96 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R-G~-G-~S~G~~~~-g~~E~~D~~aa~~~l~~~----~ 96 (225) .||+|+.|+-....|+.........+..+....+.++.+||| |. | -|.|+... |+--+.|...|++|+++. + T Consensus 112 ~pV~V~iHGG~~~~gs~~~~~~~~~~~~~~~~~VVvVt~~YRLG~lGF~st~d~~~~gN~gl~Dq~~AL~wv~~~I~~FG 191 (545) T KOG1516 112 LPVMVYIHGGGFQFGSASSFEIISPAYVLLLKDVVVVTINYRLGPLGFLSTGDSAAPGNLGLFDQLLALRWVKDNIPSFG 191 (545) T ss_pred CCEEEEEECCCCCCCCCHHHHHHCCHHHCCCCCEEEEEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC T ss_conf 97599980785002762112330520111568889998522003115223588778861888999999999983666668 Q ss_pred CCCCCEEEEEECHHHHHHHHH Q ss_conf 546740588872036878764 Q gi|254781093|r 97 PESKSCWIAGYSFGAWISMQL 117 (225) Q Consensus 97 ~~~~~i~l~G~S~Gg~val~~ 117 (225) .|+++|-|+|+|.||..+..+ T Consensus 192 Gdp~~VTl~G~saGa~~v~~l 212 (545) T KOG1516 192 GDPKNVTLFGHSAGAASVSLL 212 (545) T ss_pred CCCCCEEEEECCHHHHHHHHH T ss_conf 997105786415899999987 No 129 >PRK10252 entF enterobactin synthase subunit F; Provisional Probab=97.92 E-value=0.0018 Score=40.87 Aligned_cols=171 Identities=13% Similarity=0.132 Sum_probs=90.9 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHH-HHHHHHHHCCCCCCEE Q ss_conf 879999578998888998999999999999739849998531558876778521023899999-8999986235467405 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAA-ALDWVQSLNPESKSCW 103 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~a-a~~~l~~~~~~~~~i~ 103 (225) ++.+.+.||- ||+.. .+..|++.|.. .+.|+-+..+|....+-... -++++.+ -++.+++..+. .+.. T Consensus 1065 ~~plf~vhp~---gG~~~--~y~~La~~L~~-~~~vyglq~~~l~~~~~~~~----si~~~a~~y~~~i~~~qp~-GPy~ 1133 (1293) T PRK10252 1065 GPTLFCFHPA---SGFAW--QFSVLSRYLDP-QWSIIGIQSPRPDGPMQTAT----SLDEVCEDHLATLLEQQPH-GPYY 1133 (1293) T ss_pred CCCEEEECCC---CCCHH--HHHHHHHHCCC-CCCEEEEECCCCCCCCCCCC----CHHHHHHHHHHHHHHHCCC-CCEE T ss_conf 9978998999---86512--59999985289-98579987887788888999----9999999999999986899-8989 Q ss_pred EEEECHHHHHHHHHHHCC----CCHHEEEECC-CCCCC--CC--------CCCC-------------------------- Q ss_conf 888720368787645038----6213301227-57443--33--------2223-------------------------- Q gi|254781093|r 104 IAGYSFGAWISMQLLMRR----PEINGFISVA-PQPKS--YD--------FSFL-------------------------- 142 (225) Q Consensus 104 l~G~S~Gg~val~~a~~~----p~i~~~v~is-p~~~~--~~--------~~~l-------------------------- 142 (225) |.||||||.+|...|.+. .+|..++++- .++.. +. ...+ T Consensus 1134 L~GwS~GG~vA~e~A~~L~~~G~~v~~l~llDs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1213 (1293) T PRK10252 1134 LLGYSLGGTLAQGIAARLRARGEEVAFLGLLDTWPPETQNWQEKEANGLDPEVLAEINREREAFLAAQQGSLSTELFTAI 1213 (1293) T ss_pred EEEECHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 99878708999999999997799354799986799876550003434579566643446788887643134778999999 Q ss_pred ----------------CCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHH Q ss_conf ----------------4334773776168655028999999999986077996079997879847558589999999999 Q gi|254781093|r 143 ----------------APCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHY 206 (225) Q Consensus 143 ----------------~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~f 206 (225) .+...+++++..+++.--.......|.+-. ..++.+.+|| +|. +--.+.....+... T Consensus 1214 ~~~~~~~~~~~~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~W~~~~-----~~~~~~~v~~-~H~-~m~~~~~v~~~~~~ 1286 (1293) T PRK10252 1214 EGNYADAVRLLTTAHSVPFDGKATLFVAERTLQEGMSPEQAWSPWI-----AELDIYRQDC-AHV-DIISPEAFEKIGPI 1286 (1293) T ss_pred HHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHC-----CCEEEEEECC-CHH-HHCCCHHHHHHHHH T ss_conf 9999999999973789876873799980787655687310377746-----9839999358-837-87887889999999 Q ss_pred HHHHCCC Q ss_conf 9973122 Q gi|254781093|r 207 LDNSLDE 213 (225) Q Consensus 207 l~~~L~~ 213 (225) |++.|+| T Consensus 1287 l~~~L~e 1293 (1293) T PRK10252 1287 IRATLNE 1293 (1293) T ss_pred HHHHHCC T ss_conf 9998667 No 130 >TIGR03230 lipo_lipase lipoprotein lipase. Members of this protein family are lipoprotein lipase (EC 3.1.1.34), a eukaryotic triacylglycerol lipase active in plasma and similar to pancreatic and hepatic triacylglycerol lipases (EC 3.1.1.3). It is also called clearing factor. It cleaves chylomicron and VLDL triacylglycerols; it also has phospholipase A-1 activity. Probab=97.92 E-value=0.00017 Score=47.04 Aligned_cols=133 Identities=14% Similarity=0.241 Sum_probs=78.5 Q ss_pred CCCCCEEEEECCCCCCCCCCCCHHHHHHHHH-HHHC-CCEEEEEEECCCCCCCCCCCCCCH----HHHHHHHHHHHHHHH Q ss_conf 8778799995789988889989999999999-9973-984999853155887677852102----389999989999862 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMNDNIVYQLFYL-FQQR-GFVSLRFNFRGIGRSEGEFDYGDG----ELSDAAAALDWVQSL 95 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~-l~~~-G~~vl~fd~RG~G~S~G~~~~g~~----E~~D~~aa~~~l~~~ 95 (225) .+.+|+.+|.|+|-. .|...+. +..+.++ |.+. .++|+..|.++ | +...+..... --+.+...++||.+. T Consensus 38 n~s~pT~~IIHG~~~-~g~~~~W-v~~m~~all~~~~d~NVI~VDW~~-g-a~~~Y~~A~~ntr~VG~~iA~~i~~L~~~ 113 (442) T TIGR03230 38 NHETKTFIVIHGWTV-TGMFESW-VPKLVAALYEREPSANVIVVDWLS-R-AQQHYPTSAAYTKLVGKDVAKFVNWMQEE 113 (442) T ss_pred CCCCCEEEEECCCCC-CCCCCHH-HHHHHHHHHHHCCCEEEEEEECHH-H-CCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 988980999888888-8987568-999999999627981899980704-2-27235999998999999999999999985 Q ss_pred -CCCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEECCCCCCCCCCCC----CCCC-CCCEEEEECCCCC Q ss_conf -354674058887203687876450386-2133012275744333222----3433-4773776168655 Q gi|254781093|r 96 -NPESKSCWIAGYSFGAWISMQLLMRRP-EINGFISVAPQPKSYDFSF----LAPC-PSSGLIINGSNDT 158 (225) Q Consensus 96 -~~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~isp~~~~~~~~~----l~~~-~~p~LiIhG~~D~ 158 (225) +...+.+.|+|||+|+.+|-.+..... .+..+..+-|+...+.... |.+- ..=+-+||-+... T Consensus 114 ~g~~~~~vHlIGhSLGAHvAG~aG~~~~~~lgRITGLDPA~P~F~~~~~~~RLd~sDA~FVDVIHTd~~~ 183 (442) T TIGR03230 114 FNYPWDNVHLLGYSLGAHVAGIAGSLTKHKVNRITGLDPAGPTFEYADAPSTLSPDDADFVDVLHTNTRG 183 (442) T ss_pred HCCCHHHEEEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHCCCCCCCCEEEEEECCCCC T ss_conf 1998113489852242012223456406875608841687521367985544694337758688169865 No 131 >KOG3975 consensus Probab=97.92 E-value=0.0016 Score=41.24 Aligned_cols=178 Identities=14% Similarity=0.183 Sum_probs=103.6 Q ss_pred CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCE---EEEEEECCCCCCC------CCCC--CCCHHHHHHHHHHH Q ss_conf 87787999957899888899899999999999973984---9998531558876------7785--21023899999899 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFV---SLRFNFRGIGRSE------GEFD--YGDGELSDAAAALD 90 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~---vl~fd~RG~G~S~------G~~~--~g~~E~~D~~aa~~ 90 (225) ..+++.++...++|++-|. .+..++.|.+.=.. +....--|+-+-. ++.+ .-.+--+.+.--++ T Consensus 26 ~~~~~li~~IpGNPG~~gF-----Y~~F~~~L~~~l~~r~~~wtIsh~~H~~~P~sl~~~~s~~~~eifsL~~QV~HKla 100 (301) T KOG3975 26 GEDKPLIVWIPGNPGLLGF-----YTEFARHLHLNLIDRLPVWTISHAGHALMPASLREDHSHTNEEIFSLQDQVDHKLA 100 (301) T ss_pred CCCCEEEEEECCCCCCHHH-----HHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 8972389996699983257-----89999999986443364168703665668710003400355554351167788999 Q ss_pred HHHHHCCCCCCEEEEEECHHHHHHHHHHHCC-C--CHHEEEECCCCCCCC----------------------------CC Q ss_conf 9986235467405888720368787645038-6--213301227574433----------------------------32 Q gi|254781093|r 91 WVQSLNPESKSCWIAGYSFGAWISMQLLMRR-P--EINGFISVAPQPKSY----------------------------DF 139 (225) Q Consensus 91 ~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~-p--~i~~~v~isp~~~~~----------------------------~~ 139 (225) ++++.-|..++++++|||-|+|+.++..... + .+...+++-|...+. .+ T Consensus 101 Fik~~~Pk~~ki~iiGHSiGaYm~Lqil~~~k~~~~vqKa~~LFPTIerM~eSpnG~~~t~~l~~~~hv~~lt~yi~~~~ 180 (301) T KOG3975 101 FIKEYVPKDRKIYIIGHSIGAYMVLQILPSIKLVFSVQKAVLLFPTIERMHESPNGIRLTKVLRYLPHVVSLTSYIYWIL 180 (301) T ss_pred HHHHHCCCCCEEEEEECCHHHHHHHHHHHHCCCCCCEEEEEEECCHHHHHHCCCCCEEEEEEEEEEHHHHHEEEEEEEEC T ss_conf 99974888877999943526999999756414664147888854218887249986386522132134320130220200 Q ss_pred ---------------------CC------------------CCC-------------C---CCCEEEEECCCCCCCCHHH Q ss_conf ---------------------22------------------343-------------3---4773776168655028999 Q gi|254781093|r 140 ---------------------SF------------------LAP-------------C---PSSGLIINGSNDTVATTSD 164 (225) Q Consensus 140 ---------------------~~------------------l~~-------------~---~~p~LiIhG~~D~~vp~~~ 164 (225) .+ |+. | ....-+.+|+.|.-||.+. T Consensus 181 lp~~ir~~Li~~~l~~~n~p~e~l~tal~l~h~~v~rn~v~la~qEm~eV~~~d~e~~een~d~l~Fyygt~DgW~p~~~ 260 (301) T KOG3975 181 LPGFIRFILIKFMLCGSNGPQEFLSTALFLTHPQVVRNSVGLAAQEMEEVTTRDIEYCEENLDSLWFYYGTNDGWVPSHY 260 (301) T ss_pred CHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCHHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHH T ss_conf 84889999999733267882788754777515999998761046889999986899998517489999037888752678 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHH Q ss_conf 999999986077996079997879847558-58999999999999 Q gi|254781093|r 165 VKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYLD 208 (225) Q Consensus 165 ~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl~ 208 (225) ...+-+++++ .++++-+ +.-.|.|.- +.+.+.+.+.+.++ T Consensus 261 ~d~~kdd~~e---ed~~Lde-dki~HAFV~~~~q~ma~~v~d~~~ 301 (301) T KOG3975 261 YDYYKDDVPE---EDLKLDE-DKIPHAFVVKHAQYMANAVFDMIQ 301 (301) T ss_pred HHHHHHHCCH---HCEEECC-CCCCCCEEECCCHHHHHHHHHHHC T ss_conf 7888643560---0054340-359731232041899999998629 No 132 >COG0627 Predicted esterase [General function prediction only] Probab=97.81 E-value=0.00056 Score=43.94 Aligned_cols=186 Identities=18% Similarity=0.208 Sum_probs=99.1 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHH-HHHHHHHHCCCEEEEEEE--CCC--CCC--------CCCCC--------CCCHH Q ss_conf 7787999957899888899899999-999999973984999853--155--887--------67785--------21023 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVY-QLFYLFQQRGFVSLRFNF--RGI--GRS--------EGEFD--------YGDGE 81 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~-~la~~l~~~G~~vl~fd~--RG~--G~S--------~G~~~--------~g~~E 81 (225) .+-|++.++|+ ..++..+-... .+-+.....|+.++..|- |+. +-+ .|=+. .+... T Consensus 52 ~~ipV~~~l~G---~t~~~~~~~~~~g~~~~~~~~g~~~~~p~t~~~~~~~~~~vv~p~G~~~sfY~d~~~~~~~~~~~q 128 (316) T COG0627 52 RDIPVLYLLSG---LTCNEPNVYLLDGLRRQADESGWAVVTPDTSPRGAGVNISVVMPLGGGASFYSDWTQPPWASGPYQ 128 (316) T ss_pred CCCCEEEEECC---CCCCCCCEEECCCHHHHHHHCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCC T ss_conf 99877998178---888977336616544434316869965787755577776501356873550023245654456420 Q ss_pred HHHHH--HHHHHHHHHCC-CC--CCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCC------------------ Q ss_conf 89999--98999986235-46--740588872036878764503862-13301227574433------------------ Q gi|254781093|r 82 LSDAA--AALDWVQSLNP-ES--KSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSY------------------ 137 (225) Q Consensus 82 ~~D~~--aa~~~l~~~~~-~~--~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~------------------ 137 (225) .++.. ..-+.+.+..+ +. .+..++|+||||+-|+.+|.++|+ ...+.+.||.+... T Consensus 129 ~~tfl~~ELP~~~~~~f~~~~~~~~~aI~G~SMGG~GAl~lA~~~pd~f~~~sS~Sg~~~~s~~~~~~~~~~~~~g~~~~ 208 (316) T COG0627 129 WETFLTQELPALWEAAFPADGTGDGRAIAGHSMGGYGALKLALKHPDRFKSASSFSGILSPSSPWGPTLAMGDPWGGKAF 208 (316) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHCCCCCCCEEECCCCCCCCCCCCCCCHHCCCCCCCCCH T ss_conf 88999877469999866655566773148985143777777650945011310146520666554410001364355368 Q ss_pred ---------------CCCCCC-----C----------CCCCEEEEECCCCCCCC--HHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf ---------------322234-----3----------34773776168655028--999999999986077996079997 Q gi|254781093|r 138 ---------------DFSFLA-----P----------CPSSGLIINGSNDTVAT--TSDVKDLVNKLMNQKGISITHKVI 185 (225) Q Consensus 138 ---------------~~~~l~-----~----------~~~p~LiIhG~~D~~vp--~~~~~~~~~~l~~~~~~~~~~~~i 185 (225) |...+. . .+.+.++-+|..|.+.. ....+.+.+++.. .+.+.++... T Consensus 209 ~~~~G~~~~~~w~~~D~~~~~~~l~~~~~~~~~~~~~~~~~~~~d~g~ad~~~~~~~~~~~~~~~a~~~-~g~~~~~~~~ 287 (316) T COG0627 209 NAMLGPDSDPAWQENDPLSLIEKLVANANTRIWVYGGSPPELLIDNGPADFFLAANNLSTRAFAEALRA-AGIPNGVRDQ 287 (316) T ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCEECCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHH-CCCCCEEEEC T ss_conf 874188866443626836778776530353210025678875543244204555246578999999975-3899605757 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 8798475585899999999999973122 Q gi|254781093|r 186 PDANHFFIGKVDELINECAHYLDNSLDE 213 (225) Q Consensus 186 ~ga~H~f~~~~~~l~~~i~~fl~~~L~~ 213 (225) ++.+|.|.-....| +...+|+...|.. T Consensus 288 ~~G~Hsw~~w~~~l-~~~~~~~a~~l~~ 314 (316) T COG0627 288 PGGDHSWYFWASQL-ADHLPWLAGALGL 314 (316) T ss_pred CCCCCCHHHHHHHH-HHHHHHHHHHHCC T ss_conf 99977769999999-9999999988555 No 133 >COG4814 Uncharacterized protein with an alpha/beta hydrolase fold [General function prediction only] Probab=97.79 E-value=0.0017 Score=40.99 Aligned_cols=175 Identities=14% Similarity=0.143 Sum_probs=102.1 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHCC-----CEEEEEEECCCCCCCCCCCC-------------CCH----HHHHH Q ss_conf 999578998888998999999999999739-----84999853155887677852-------------102----38999 Q gi|254781093|r 28 ALILHPHPRFGGTMNDNIVYQLFYLFQQRG-----FVSLRFNFRGIGRSEGEFDY-------------GDG----ELSDA 85 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G-----~~vl~fd~RG~G~S~G~~~~-------------g~~----E~~D~ 85 (225) -|+.| ++||+.+. +-.+...|...+ --++..|--|.=.-+|..+- ... ...=+ T Consensus 48 TIfIh---GsgG~asS--~~~Mv~ql~~~~~~~~e~Lt~~V~~dgslk~tGk~~Kd~~nP~I~~gfe~n~~s~~~~s~wl 122 (288) T COG4814 48 TIFIH---GSGGTASS--LNGMVNQLLPDYKAGTESLTMTVDVDGSLKVTGKISKDAKNPIIEFGFEDNTASGLDQSKWL 122 (288) T ss_pred EEEEE---CCCCCHHH--HHHHHHHHHHCCCCCCCCEEEEECCCCCEEEEEEECCCCCCCEEEEEEECCCCCHHHHHHHH T ss_conf 69996---58887067--88999986340133456569998178837885661345778718999863867624578999 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH------CCCCHHEEEECCCCCCCCC-----------CCCC--C--- Q ss_conf 9989999862354674058887203687876450------3862133012275744333-----------2223--4--- Q gi|254781093|r 86 AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM------RRPEINGFISVAPQPKSYD-----------FSFL--A--- 143 (225) Q Consensus 86 ~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~------~~p~i~~~v~isp~~~~~~-----------~~~l--~--- 143 (225) +.++.||++.+ +...+-.+||||||.....++. ..|.++.+|+++.+.+.-. .... . T Consensus 123 k~~msyL~~~Y-~i~k~n~VGhSmGg~~~~~Y~~~yg~dks~P~lnK~V~l~gpfN~~~l~~de~v~~v~~~~~~~~~t~ 201 (288) T COG4814 123 KKAMSYLQKHY-NIPKFNAVGHSMGGLGLTYYMIDYGDDKSLPPLNKLVSLAGPFNVGNLVPDETVTDVLKDGPGLIKTP 201 (288) T ss_pred HHHHHHHHHHC-CCCEEEEEEECCCCHHHHHHHHHHCCCCCCCCHHHEEEECCCCCCCCCCCCCCHHEEECCCCCCCCCH T ss_conf 99999999756-98601031005652778999998657777845222378524333455577763210221586433767 Q ss_pred ------------CCCCCEEEEECC------CCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC--CCCCCCCCCHHHHHHHH Q ss_conf ------------334773776168------6550289999999999860779960799978--79847558589999999 Q gi|254781093|r 144 ------------PCPSSGLIINGS------NDTVATTSDVKDLVNKLMNQKGISITHKVIP--DANHFFIGKVDELINEC 203 (225) Q Consensus 144 ------------~~~~p~LiIhG~------~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~--ga~H~f~~~~~~l~~~i 203 (225) +...-+|+|.|+ .|-.||...+.....-+..+.. .+.--+++ .+.|.=--..+...+.+ T Consensus 202 y~~y~~~n~k~v~~~~evl~IaGDl~dg~~tDG~Vp~assls~~~lf~~~~k-sy~e~~~~Gk~a~Hs~lhen~~v~~yv 280 (288) T COG4814 202 YYDYIAKNYKKVSPNTEVLLIAGDLDDGKQTDGAVPWASSLSIYHLFKKNGK-SYIESLYKGKDARHSKLHENPTVAKYV 280 (288) T ss_pred HHHHHHHCCEECCCCCEEEEEECCCCCCCCCCCCEECHHHHHHHHHHCCCCC-EEEEEEEECCCCHHHCCCCCHHHHHHH T ss_conf 8999984363579984799974134668867872112376789998665864-147776307763010167883589999 Q ss_pred HHHHHH Q ss_conf 999997 Q gi|254781093|r 204 AHYLDN 209 (225) Q Consensus 204 ~~fl~~ 209 (225) ..||-. T Consensus 281 ~~FLw~ 286 (288) T COG4814 281 KNFLWE 286 (288) T ss_pred HHHHHC T ss_conf 988406 No 134 >pfam00151 Lipase Lipase. Probab=97.77 E-value=0.00028 Score=45.81 Aligned_cols=133 Identities=15% Similarity=0.163 Sum_probs=76.8 Q ss_pred CCCCCEEEEECCCCCCCCCCCC-HHHHHHHHH-HHHCCCEEEEEEECCCCCCCCCCCCCCHH----HHHHHHHHHHHHHH Q ss_conf 8778799995789988889989-999999999-99739849998531558876778521023----89999989999862 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMND-NIVYQLFYL-FQQRGFVSLRFNFRGIGRSEGEFDYGDGE----LSDAAAALDWVQSL 95 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~-~~~~~la~~-l~~~G~~vl~fd~RG~G~S~G~~~~g~~E----~~D~~aa~~~l~~~ 95 (225) .+.+|+.+|.|+| .++... .-+..+.++ |.+..++|+..|.+. | +...+.....- -+.+...+++|.+. T Consensus 68 n~~~pt~~iiHG~---~~~~~~~~w~~~~~~a~l~~~d~NVI~VDW~~-~-a~~~Y~~A~~nt~~VG~~va~~i~~L~~~ 142 (329) T pfam00151 68 NTSKKTRAIIHGF---TDKGQEESWLSDMCKNLFQVEGVNVIVVDWGS-G-STTFYRQATLNVRVVGAEVAKLLVELEEE 142 (329) T ss_pred CCCCCEEEEEECC---CCCCCCCHHHHHHHHHHHHCCCCEEEEEECHH-H-HCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8788538998234---57888514799999999744895799997857-8-58348999997999999999999999986 Q ss_pred -CCCCCCEEEEEECHHHHHHHHHHHCCC---CHHEEEECCCCCCCCCCCC----CCCC-CCCEEEEECCCCCC Q ss_conf -354674058887203687876450386---2133012275744333222----3433-47737761686550 Q gi|254781093|r 96 -NPESKSCWIAGYSFGAWISMQLLMRRP---EINGFISVAPQPKSYDFSF----LAPC-PSSGLIINGSNDTV 159 (225) Q Consensus 96 -~~~~~~i~l~G~S~Gg~val~~a~~~p---~i~~~v~isp~~~~~~~~~----l~~~-~~p~LiIhG~~D~~ 159 (225) +.+.+.+.|+|||+|+.+|-.+..+.. .+..+..+-|+...+.... |.+- ..=+-+||-+...+ T Consensus 143 ~g~~~~~vhlIGhSLGAHiaG~aG~~~~~~~~lgRItGLDPA~P~F~~~~~~~rLd~~DA~fVDvIHT~~~~~ 215 (329) T pfam00151 143 LNVSPENVHLIGHSLGAHVAGEAGRRTKGKHKLGRITGLDPAGPYFKGTPELTRLDPGDADFVDAIHTDTRPI 215 (329) T ss_pred CCCCHHHEEEEECCHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCCCHHHCCCCCCCCEEEEEECCCCCC T ss_conf 3998678588731467777678887536765212020468764213579643163856676485885288655 No 135 >pfam01674 Lipase_2 Lipase (class 2). This family consists of hypothetical C. elegans proteins and lipases. Lipases or triacylglycerol acylhydrolases hydrolyse ester bonds in triacylglycerol giving diacylglycerol, monoacylglycerol, glycerol and free fatty acids. One member, a triacylglycerol lipase, is a extracellular lipase from B. subtilis 168. Probab=97.73 E-value=0.00039 Score=44.88 Aligned_cols=99 Identities=19% Similarity=0.257 Sum_probs=56.8 Q ss_pred CCEEEEECCCCCCCCCCCCH--HHHHHHHHHHHCCCE---EEEEEECCCCCCCCCCCCCCHH---HHHHHHHHHHHHHHC Q ss_conf 87999957899888899899--999999999973984---9998531558876778521023---899999899998623 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDN--IVYQLFYLFQQRGFV---SLRFNFRGIGRSEGEFDYGDGE---LSDAAAALDWVQSLN 96 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~--~~~~la~~l~~~G~~---vl~fd~RG~G~S~G~~~~g~~E---~~D~~aa~~~l~~~~ 96 (225) .| ||+.| |+.++. ....++..|.++||. ++..+| |.+.....+...... +..+.+.++-++. . T Consensus 2 ~P-VVlVH------G~~~~a~~~w~~~~~~l~~~GY~~~evya~ty-G~~~~~~~~~~~~~~~~~~~qir~fI~~Vl~-y 72 (218) T pfam01674 2 TP-VIFVH------GNSGLAAGGWSKLRQYFKERGYTLAELYATTW-GDGNLLTSLQRAEMKCEYVKQIRRFIEAVLG-Y 72 (218) T ss_pred CC-EEEEC------CCCCCHHHHHHHHHHHHHHCCCCCHHEEECCC-CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH-H T ss_conf 89-99985------89951334499999999986998300466134-6887776521234201589999999999997-4 Q ss_pred CCCCCEEEEEECHHHHHHHHHHHCC--------------CCHHEEEECCCC Q ss_conf 5467405888720368787645038--------------621330122757 Q gi|254781093|r 97 PESKSCWIAGYSFGAWISMQLLMRR--------------PEINGFISVAPQ 133 (225) Q Consensus 97 ~~~~~i~l~G~S~Gg~val~~a~~~--------------p~i~~~v~isp~ 133 (225) ... +|-|+|||||+.++-.+..-. ..|..+|.++.+ T Consensus 73 TGa-kVDiVghSmG~~iaRk~I~GG~~~D~~~~LG~pl~~~V~tfvgiaga 122 (218) T pfam01674 73 TGA-KVDIVAHSMGVPIARKAILGGNCVDTNCDLGPPLTSLVDTFISVAGA 122 (218) T ss_pred CCC-CEEEEEECCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEECCC T ss_conf 378-03588642665434555306743125632377652210447875155 No 136 >TIGR01392 homoserO_Ac_trn homoserine O-acetyltransferase; InterPro: IPR006296 This family describes homoserine-O-acetyltransferase, an enzyme of methionine biosynthesis. The family does not contain a number of homologs that appear to represent sequences derived from within the family but to have changed substantially.; GO: 0004414 homoserine O-acetyltransferase activity, 0009086 methionine biosynthetic process. Probab=97.71 E-value=0.00083 Score=42.94 Aligned_cols=64 Identities=16% Similarity=0.264 Sum_probs=50.6 Q ss_pred CCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC-CCCC-CCCCCH-HHHHHHHHHHHH Q ss_conf 3347737761686550289999999999860779960799978-7984-755858-999999999999 Q gi|254781093|r 144 PCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIP-DANH-FFIGKV-DELINECAHYLD 208 (225) Q Consensus 144 ~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~-ga~H-~f~~~~-~~l~~~i~~fl~ 208 (225) .+..|.|||-=+.|-.-|+++.+++++.++.. ...|.+++|+ ..|| +|-=.. +++.+.|.+||+ T Consensus 329 ~~~~~~lv~s~~sD~Lfp~~~~~e~~~~l~~~-~~~V~Y~ei~S~~GHDAFL~e~d~~~~~~i~~fl~ 395 (395) T TIGR01392 329 AIKAPFLVVSITSDWLFPPAESRELAKALPAA-GLRVTYAEIESPYGHDAFLVETDEQVEELIREFLR 395 (395) T ss_pred HCCCCEEEEEEECCCCCCHHHHHHHHHHHHCC-CCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHCC T ss_conf 37988899986068766889999999998503-64066888526899860005677899999997529 No 137 >PRK07868 acyl-CoA synthetase; Validated Probab=97.64 E-value=0.00011 Score=48.13 Aligned_cols=169 Identities=22% Similarity=0.344 Sum_probs=96.0 Q ss_pred EEEEEEECCCCC-----CCCEEEEECCCCCCCCCCCCHHH-HHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHH Q ss_conf 187999668887-----78799995789988889989999-999999997398499985315588767785210238999 Q gi|254781093|r 12 RLEGRYQPSTNP-----NAPIALILHPHPRFGGTMNDNIV-YQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDA 85 (225) Q Consensus 12 ~l~~~~~~~~~~-----~~~~vv~~Hp~p~~GG~~~~~~~-~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~ 85 (225) ||. .|.|+... .+|.+++.||.- ..-+|.+-.- ....-.|.+.|+....+||-.--+-||-++. .++|- T Consensus 50 rlr-ryfp~~~~~g~~~~~p~vl~v~pmm-~sa~~~dvt~~~gav~ilh~~gldpwvidfgspd~~egg~~r---~ladh 124 (990) T PRK07868 50 RLR-RYFPPDNRPGQPPVGPPVLMVHPMM-MSADMWDVTREDGAVGILHRAGLDPWVIDFGSPDKVEGGMRR---NLADH 124 (990) T ss_pred HHH-HCCCCCCCCCCCCCCCCEEEECCCC-CCCCCEEECCCCCEEEEEECCCCCCEEEECCCCCCCCCCCCC---CHHHH T ss_conf 212-1079988988777899569836432-125302204567707871114887558735996311255222---44455 Q ss_pred HH----HHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCC--CCHHEEEECCCCCCCC---------------------- Q ss_conf 99----8999986235467405888720368787645038--6213301227574433---------------------- Q gi|254781093|r 86 AA----ALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRR--PEINGFISVAPQPKSY---------------------- 137 (225) Q Consensus 86 ~a----a~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~--p~i~~~v~isp~~~~~---------------------- 137 (225) .- ++|-+++. ..+.+.|+|||.||.-+.++|+-+ .++..+|..+.|+... T Consensus 125 vva~s~aid~v~~~--tg~dvhl~gysqggmf~yq~aayr~s~~~asiitfgspvd~~a~lp~g~pa~~~~~~a~f~adh 202 (990) T PRK07868 125 IVALSEAIDTVKDV--TGRDVHLVGYSQGGMFCYQAAAYRRSKDIASIVAFGSPVDTLAALPMGIPAGLAAAAADFMADH 202 (990) T ss_pred HHHHHHHHHHHHHH--CCCCEEEEEECCCCEEEHHHHHHHHCCCCEEEEECCCHHHHHHCCCCCCCHHHHHHHHHHHHHH T ss_conf 33498899999862--3771367410477620047778772278402786078456764287789812305678899886 Q ss_pred -----------------------------C-------------------------------------------------- Q ss_conf -----------------------------3-------------------------------------------------- Q gi|254781093|r 138 -----------------------------D-------------------------------------------------- 138 (225) Q Consensus 138 -----------------------------~-------------------------------------------------- 138 (225) | T Consensus 203 vf~rl~i~~w~ar~gfq~ldpvkt~~~r~dflrqlhdr~allpre~qrrfl~~egwvawsgpai~ellkqfi~hnrm~tg 282 (990) T PRK07868 203 VFNRLDIPSWMARTGFQMLDPVKTAKARVDFLRQLHDREALLPREQQRRFLESEGWIAWSGPAISELLKQFIAHNRMMTG 282 (990) T ss_pred HHCCCCCCHHHHHCCCEECCHHHHHHHHHHHHHHHCCHHHCCCHHHHHHHHHCCCEEECCCHHHHHHHHHHHHHHHHCCC T ss_conf 30536671176650340238788889899999974134420767877666503871643657899999999875102027 Q ss_pred -------CCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf -------22234334773776168655028999999999986077996079997879847 Q gi|254781093|r 139 -------FSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF 191 (225) Q Consensus 139 -------~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~ 191 (225) .-.|+....|+|-..|+.|++--+..++.+.++.+.+. ..|.. --++|| T Consensus 283 gf~i~g~~vtl~~i~cpvlafvge~ddigqpasvrgi~raap~a~--vye~~--~~aghf 338 (990) T PRK07868 283 GFAINGQMVTLTDITCPVLAFVGEVDDIGQPASVRGIRRAAPNAE--VYECL--IRAGHF 338 (990) T ss_pred CEEECCEEEEEECCCCCEEEEEECCCCCCCCHHHHHHHHHCCCCC--CEEEE--ECCCCE T ss_conf 657878587632057734899751123578466654776088764--12322--115725 No 138 >COG3150 Predicted esterase [General function prediction only] Probab=97.41 E-value=0.0077 Score=37.12 Aligned_cols=155 Identities=14% Similarity=0.182 Sum_probs=79.9 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEE Q ss_conf 99957899888899899999999999973984999853155887677852102389999989999862354674058887 Q gi|254781093|r 28 ALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGY 107 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~ 107 (225) ++++|++ -.|-.|.-.+.+-+.+.+ .+...+|+ .=.. ...-+++.+-++-+.++..+. ...|+|. T Consensus 2 ilYlHGF---nSSP~shka~~~~q~~~~---~~~~i~y~-----~p~l---~h~p~~a~~ele~~i~~~~~~-~p~ivGs 66 (191) T COG3150 2 ILYLHGF---NSSPGSHKAVLLLQFIDE---DVRDIEYS-----TPHL---PHDPQQALKELEKAVQELGDE-SPLIVGS 66 (191) T ss_pred EEEEECC---CCCCCCHHHHHHHHHHHC---CCCCCEEE-----CCCC---CCCHHHHHHHHHHHHHHCCCC-CCEEEEE T ss_conf 4788257---899600899999998740---15530421-----5889---989999999999999971899-8548860 Q ss_pred CHHHHHHHHHHHCCCCHHEEEECCCCCCCCC-----------------C----CCC--------CCCCCC--EEEEECCC Q ss_conf 2036878764503862133012275744333-----------------2----223--------433477--37761686 Q gi|254781093|r 108 SFGAWISMQLLMRRPEINGFISVAPQPKSYD-----------------F----SFL--------APCPSS--GLIINGSN 156 (225) Q Consensus 108 S~Gg~val~~a~~~p~i~~~v~isp~~~~~~-----------------~----~~l--------~~~~~p--~LiIhG~~ 156 (225) |+||+-|..++.+. .++++ .+.|++.-+. + .++ .....| ..++.-+- T Consensus 67 sLGGY~At~l~~~~-Girav-~~NPav~P~e~l~gylg~~en~ytg~~y~le~~hI~~l~~~~~~~l~~p~~l~lL~qtg 144 (191) T COG3150 67 SLGGYYATWLGFLC-GIRAV-VFNPAVRPYELLTGYLGRPENPYTGQEYVLESRHIATLCVLQFRELNRPRCLVLLSQTG 144 (191) T ss_pred CCHHHHHHHHHHHH-CCHHH-HCCCCCCCHHHHHHHCCCCCCCCCCCEEEEEHHHHHHHHHHHCCCCCCCCEEEEECCCC T ss_conf 52178999999870-86654-14876682366641307988887764687305448888875132037985798600144 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 5502899999999998607799607999787984755858999999999999 Q gi|254781093|r 157 DTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLD 208 (225) Q Consensus 157 D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~ 208 (225) |++.....+.+.+.... .++.+|.||.|.+....+ +.|..|+. T Consensus 145 DEvLDyr~a~a~y~~~~--------~~V~dgg~H~F~~f~~~l-~~i~aF~g 187 (191) T COG3150 145 DEVLDYRQAVAYYHPCY--------EIVWDGGDHKFKGFSRHL-QRIKAFKG 187 (191) T ss_pred CHHHHHHHHHHHHHHHH--------HEEECCCCCCCCCHHHHH-HHHHHHHC T ss_conf 07777999999753355--------345528975533356757-99999861 No 139 >COG3319 Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases [Secondary metabolites biosynthesis, transport, and catabolism] Probab=97.32 E-value=0.0011 Score=42.30 Aligned_cols=97 Identities=19% Similarity=0.211 Sum_probs=59.8 Q ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHH-HHHHHHHHCCCCCCEEEE Q ss_conf 9999578998888998999999999999739849998531558876778521023899999-899998623546740588 Q gi|254781093|r 27 IALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAA-ALDWVQSLNPESKSCWIA 105 (225) Q Consensus 27 ~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~a-a~~~l~~~~~~~~~i~l~ 105 (225) ++-++||-.+++ ..+..|+..|... ..++-..++|.+. +. ....+++|..+ .++-+++..|+. +..|. T Consensus 2 pLF~fhp~~G~~-----~~~~~L~~~l~~~-~~v~~l~a~g~~~--~~--~~~~~l~~~a~~yv~~Ir~~QP~G-Py~L~ 70 (257) T COG3319 2 PLFCFHPAGGSV-----LAYAPLAAALGPL-LPVYGLQAPGYGA--GE--QPFASLDDMAAAYVAAIRRVQPEG-PYVLL 70 (257) T ss_pred CEEEECCCCCCH-----HHHHHHHHHHCCC-CEEECCCCCCCCC--CC--CCCCCHHHHHHHHHHHHHHHCCCC-CEEEE T ss_conf 779975887717-----7878899984657-7010001676676--66--665799999999999999739999-97988 Q ss_pred EECHHHHHHHHHHHCC----CCHHEEEECCCCC Q ss_conf 8720368787645038----6213301227574 Q gi|254781093|r 106 GYSFGAWISMQLLMRR----PEINGFISVAPQP 134 (225) Q Consensus 106 G~S~Gg~val~~a~~~----p~i~~~v~isp~~ 134 (225) |||+||.+|...|.+. ..+..++++-++. T Consensus 71 G~S~GG~vA~evA~qL~~~G~~Va~L~llD~~~ 103 (257) T COG3319 71 GWSLGGAVAFEVAAQLEAQGEEVAFLGLLDAVP 103 (257) T ss_pred EECCCCHHHHHHHHHHHHCCCEEEEEEEECCCC T ss_conf 646664999999999996797089999851688 No 140 >pfam09994 DUF2235 Uncharacterized conserved protein (DUF2235). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=97.31 E-value=0.0029 Score=39.66 Aligned_cols=74 Identities=22% Similarity=0.367 Sum_probs=52.3 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCCC------CCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 9999999997398499985315588767785------2102389999989999862354674058887203687876450 Q gi|254781093|r 46 VYQLFYLFQQRGFVSLRFNFRGIGRSEGEFD------YGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 46 ~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~------~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) +.+|++.+...+-..+.+-..|+|-.....+ .|.|-.+.+..+..|+.+.+.+.++|+|+|||-|++.|=.++. T Consensus 23 V~rL~~~~~~~~~~q~~~Y~~GVGT~~~~~~~~~g~a~G~g~~~~i~~Ay~fl~~~y~~gD~I~lFGFSRGA~tAR~la~ 102 (261) T pfam09994 23 VAKLFRALDRSGPPQIVYYDPGVGTSGSRLDKALGGAFGSGLDRNVREAYRFLARNYRPGDEIYLFGFSRGAYTARALAG 102 (261) T ss_pred HHHHHHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHH T ss_conf 99999986147987699974898477752033311254402799999999999983789988999503542899999998 No 141 >PRK12467 peptide synthase; Provisional Probab=97.31 E-value=0.0026 Score=39.98 Aligned_cols=97 Identities=14% Similarity=0.149 Sum_probs=53.0 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHH-HHHHHHHHCCCCCC Q ss_conf 77879999578998888998999999999999739849998531558876778521023899999-89999862354674 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAA-ALDWVQSLNPESKS 101 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~a-a~~~l~~~~~~~~~ 101 (225) ...|.+.+.| |..|.... +..|++.| ...+.++-+..+|....+- ... .++++.+ -++.+++..|+ .+ T Consensus 3691 ~~~pplfcvh--p~~G~~~~---Y~~La~~L-~~~~pvyglq~~~l~~~~~-~~~---s~~~~a~~Y~~~ir~~qp~-GP 3759 (3951) T PRK12467 3691 TGLPALFCPH--AGFGTVFD---YEPLARQL-EGHRSVLGIQCRMLLDTNW-QDT---SLQAMAFDYINDIRQVQAK-GP 3759 (3951) T ss_pred CCCCCEEEEC--CCCCCEEE---HHHHHHHC-CCCCCEEEECCCCCCCCCC-CCC---CHHHHHHHHHHHHHHHCCC-CC T ss_conf 9998589868--88776556---88888345-8998679860788889888-987---7999999999999987899-89 Q ss_pred EEEEEECHHHHHHHHHHHC----CCCHHEEEEC Q ss_conf 0588872036878764503----8621330122 Q gi|254781093|r 102 CWIAGYSFGAWISMQLLMR----RPEINGFISV 130 (225) Q Consensus 102 i~l~G~S~Gg~val~~a~~----~p~i~~~v~i 130 (225) ..|+||||||.+|...|.+ ..+|.-++++ T Consensus 3760 y~L~GwS~GG~vA~e~A~~L~~~G~~V~~l~ll 3792 (3951) T PRK12467 3760 YHLLGWSLGGTLATLVAAELERQGQSVAFLGLV 3792 (3951) T ss_pred EEEEEECHHHHHHHHHHHHHHHCCCEEEEEEEE T ss_conf 798867770799999999999779957689998 No 142 >TIGR00976 /NonD hydrolase CocE/NonD family protein; InterPro: IPR005674 This family includes the cocaine esterase CocE, several glutaryl-7-ACA acylases, and the putative diester hydrolase NonD of Streptomyces griseus (all hydrolases). This family shows extensive, low-level similarity to a family of xaa-pro dipeptidyl-peptidases, and to many other hydrolases.; GO: 0016787 hydrolase activity, 0008152 metabolic process. Probab=97.25 E-value=0.0027 Score=39.88 Aligned_cols=120 Identities=23% Similarity=0.254 Sum_probs=82.4 Q ss_pred CCCC-EEEE-EEECCCCC---CCCEEEEECCCCCCCCCCC----------------------CHHH--HHHHHHHHHCCC Q ss_conf 8996-1879-99668887---7879999578998888998----------------------9999--999999997398 Q gi|254781093|r 8 GPSG-RLEG-RYQPSTNP---NAPIALILHPHPRFGGTMN----------------------DNIV--YQLFYLFQQRGF 58 (225) Q Consensus 8 g~~G-~l~~-~~~~~~~~---~~~~vv~~Hp~p~~GG~~~----------------------~~~~--~~la~~l~~~G~ 58 (225) ..+| +|.. +|.|.... ..|+++..+|+....+... .... ..--..+...|| T Consensus 3 ~~~g~~~~~~~~~p~~~~~~~~~p~~~~~~p~~~~~~~~~~g~~w~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 82 (663) T TIGR00976 3 TRDGTRLAADLYLPAGGGEKEPLPALLSRTPYGKDAGLAWPGNRWTFPQYRLLREGWGGTGPALTGLDKTEPAWFVAEGY 82 (663) T ss_pred CCCCCEEEEEEECCCCCCCCCCCCEEEEECCCCCHHCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHCCC T ss_conf 65540432000024455432345346752343300112567753335246776521576530011100123112321562 Q ss_pred EEEEEEECCCCCCCCCCCCC--CHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEE Q ss_conf 49998531558876778521--023899999899998623546740588872036878764503862-1330 Q gi|254781093|r 59 VSLRFNFRGIGRSEGEFDYG--DGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGF 127 (225) Q Consensus 59 ~vl~fd~RG~G~S~G~~~~g--~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~ 127 (225) .++..|.||.|.|+|.++.. ..|..|....++|+.++......+..+|.|+-+......+...+. ++.+ T Consensus 83 ~~~~~d~~g~~~~~g~~~~~~~~~~~~d~~~~~~w~~~~~w~~g~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 154 (663) T TIGR00976 83 AVVVQDTRGRGGSEGEWDLLGWSDEAEDGYDLLDWLAKQPWCDGNVGLLGLSYLGVTQWLAAALKPPALRAI 154 (663) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHHHHCCCCCEEEE T ss_conf 588730234655544423355521000267887776530124663000011135677888752277520231 No 143 >PRK04940 hypothetical protein; Provisional Probab=97.23 E-value=0.017 Score=34.98 Aligned_cols=115 Identities=15% Similarity=0.185 Sum_probs=67.4 Q ss_pred HHHHHHHHHH---HHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCC-----------CCCC------ Q ss_conf 9999989999---8623546740588872036878764503862133012275744333-----------2223------ Q gi|254781093|r 83 SDAAAALDWV---QSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYD-----------FSFL------ 142 (225) Q Consensus 83 ~D~~aa~~~l---~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~-----------~~~l------ 142 (225) .|..-++..+ .++..+..++ ++|.|+||+-|..++-... +++ |++.|...-+. +.++ T Consensus 41 ~d~~~ll~ev~k~i~~~~d~~~l-iiGssLGGyyA~~l~~~~~-~Ka-VliNPal~P~~~m~~~i~~peey~di~~k~v~ 117 (179) T PRK04940 41 HDMQHLLKEVDKMLQLSDDERPL-ICGVGLGGYWAERIGFLCG-IRQ-VIFNPNLFPEENMEGKIDRPEEYADIATKCVT 117 (179) T ss_pred HHHHHHHHHHHHHHHHCCCCCCE-EEECCCHHHHHHHHHHHCC-CCE-EEECCCCCHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 20999999999999835878837-9954723899999999829-867-99889988046556550787126888887899 Q ss_pred ---CCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHH Q ss_conf ---433477377616865502899999999998607799607999787984755858999999999999 Q gi|254781093|r 143 ---APCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLD 208 (225) Q Consensus 143 ---~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~ 208 (225) ..-+...|++--..|++.....+. +.+... .+.+.-+|.+|.|.+ .++....|.+|.. T Consensus 118 ~fr~kn~~~~LvlL~~gDEVLD~~~~a---~~l~~~----~~Ii~d~gg~H~F~~-~~~~L~~I~~F~~ 178 (179) T PRK04940 118 NFREKNRDRCLVILSRNDEVLDSQRTA---EELHPY----YEIVWDEEQTHKFKN-ISPHLQRIKAFKT 178 (179) T ss_pred HHHHCCCCCEEEEEECCCHHHHHHHHH---HHHCCC----CEEEECCCCCCCCCC-HHHHHHHHHHHHC T ss_conf 986128303899985562666099999---984546----358862899867651-9999899998751 No 144 >KOG2521 consensus Probab=97.14 E-value=0.021 Score=34.45 Aligned_cols=183 Identities=17% Similarity=0.311 Sum_probs=112.8 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC----CCCCCCCCCCCCCHHHHHHHHHHHHH-HHH Q ss_conf 8877879999578998888998999999999999739849998531----55887677852102389999989999-862 Q gi|254781093|r 21 TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR----GIGRSEGEFDYGDGELSDAAAALDWV-QSL 95 (225) Q Consensus 21 ~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R----G~G~S~G~~~~g~~E~~D~~aa~~~l-~~~ 95 (225) +...+|+|++. ++.|..+. -+...+....+.||.++++.-- ....|.+.+ -..++...+.-+ ... T Consensus 35 ~~s~k~Iv~~~----gWag~~~r-~l~ky~~~Yq~~g~~~~~~tap~~~~~~~~s~~~~-----sl~~~~~~l~~L~~~~ 104 (350) T KOG2521 35 GESEKPIVVLL----GWAGAIDR-NLMKYSKIYQDKGYIVVRITAPCPSVFLSASRRIL-----SLSLASTRLSELLSDY 104 (350) T ss_pred CCCCCCEEEEE----EECCCCCH-HHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCC-----HHHHHHHHHHHHHHHC T ss_conf 77525679995----01554404-69999998750783699956765300221136650-----1668898999876651 Q ss_pred CCCCCCEEEEEECHHHHHHHHH---HHCC--CC----HHEEEECCCCCCCC----------------------------- Q ss_conf 3546740588872036878764---5038--62----13301227574433----------------------------- Q gi|254781093|r 96 NPESKSCWIAGYSFGAWISMQL---LMRR--PE----INGFISVAPQPKSY----------------------------- 137 (225) Q Consensus 96 ~~~~~~i~l~G~S~Gg~val~~---a~~~--p~----i~~~v~isp~~~~~----------------------------- 137 (225) ..++.++...-||+||...+.. +... |. ..+.+..+-+.... T Consensus 105 ~~~~~pi~fh~FS~ng~~~~~si~~~~~~~~~~~~~~~~~~~fdS~p~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~i~ 184 (350) T KOG2521 105 NSDPCPIIFHVFSGNGVRLMYSISLQLIKHEPKAAQLSGGIIFDSAPARSSPVQLGWAVSFSSPPDDYVARWARLNYHIT 184 (350) T ss_pred CCCCCCEEEEEECCCCEEEHHHHHHHHHHCCCHHHHHCCCCEEECCCCCCCHHHHCCEECCCCCCHHHHHHHHHCCEEEE T ss_conf 47767437997247866645789998764473567645883585156524421205322001375156788873575888 Q ss_pred ------------CCC--------------------CCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf ------------322--------------------234334773776168655028999999999986077996079997 Q gi|254781093|r 138 ------------DFS--------------------FLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVI 185 (225) Q Consensus 138 ------------~~~--------------------~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i 185 (225) .+. .-...+.+.|.+.+..|.++|.+..+++.+ +.+.++..+...-. T Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~r~~~~~~~~~~~~ly~~s~~d~v~~~~~ie~f~~-~~~~~g~~v~s~~~ 263 (350) T KOG2521 185 LLTMAGNEGGAYLLGPLAEKISMSRKYHFLDRYEEQRNELPWNQLYLYSDNDDVLPADEIEKFIA-LRREKGVNVKSVKF 263 (350) T ss_pred EEEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCEEECCCCCCCCCHHHHHHHHH-HHHHCCCEEEEEEC T ss_conf 87763012101221232201602341477889976444044342364478561222888999999-99850854797421 Q ss_pred CCCCCC--CCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 879847--55858999999999999731221 Q gi|254781093|r 186 PDANHF--FIGKVDELINECAHYLDNSLDEK 214 (225) Q Consensus 186 ~ga~H~--f~~~~~~l~~~i~~fl~~~L~~~ 214 (225) .++.|. |+.+...+.+++.+|+++..... T Consensus 264 ~ds~H~~h~r~~p~~y~~~~~~Fl~~~~~~~ 294 (350) T KOG2521 264 KDSEHVAHFRSFPKTYLKKCSEFLRSVISSY 294 (350) T ss_pred CCCCCEEEECCCCHHHHHHHHHHHHHCCCCC T ss_conf 6764023400193889999999998415556 No 145 >COG1075 LipA Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only] Probab=97.12 E-value=0.0062 Score=37.65 Aligned_cols=98 Identities=18% Similarity=0.237 Sum_probs=60.0 Q ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCE---EEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 999957899888899899999999999973984---99985315588767785210238999998999986235467405 Q gi|254781093|r 27 IALILHPHPRFGGTMNDNIVYQLFYLFQQRGFV---SLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCW 103 (225) Q Consensus 27 ~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~---vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~ 103 (225) .+|+.|+. +++.++ +..+...+...|+. +..+++.+. .+..+. ....+-+.+-++-+.... ...++. T Consensus 61 pivlVhG~---~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~~---~~~~~~-~~~~~ql~~~V~~~l~~~-ga~~v~ 130 (336) T COG1075 61 PIVLVHGL---GGGYGN--FLPLDYRLAILGWLTNGVYAFELSGG---DGTYSL-AVRGEQLFAYVDEVLAKT-GAKKVN 130 (336) T ss_pred EEEEECCC---CCCHHH--HHHHHHHHHHCCCCCCCEEEEECCCC---CCCCCC-CCCHHHHHHHHHHHHHHC-CCCCEE T ss_conf 29997478---666316--77776575321420055478621556---677643-233999999999998754-886257 Q ss_pred EEEECHHHHHHHHHHHCCC---CHHEEEECCCCC Q ss_conf 8887203687876450386---213301227574 Q gi|254781093|r 104 IAGYSFGAWISMQLLMRRP---EINGFISVAPQP 134 (225) Q Consensus 104 l~G~S~Gg~val~~a~~~p---~i~~~v~isp~~ 134 (225) |+|||+||.+...++...+ .|..++.+++|- T Consensus 131 LigHS~GG~~~ry~~~~~~~~~~V~~~~tl~tp~ 164 (336) T COG1075 131 LIGHSMGGLDSRYYLGVLGGANRVASVVTLGTPH 164 (336) T ss_pred EEEECCCCHHHHHHHHHCCCCCEEEEEEEECCCC T ss_conf 8651664067789998478634056789842677 No 146 >pfam08386 Abhydrolase_4 TAP-like protein. This is a family of putative bacterial peptidases and hydrolases that bear similarity to a tripeptidyl aminopeptidase isolated from Streptomyces lividans. A member of this family is thought to be involved in the C-terminal processing of propionicin F, a bacteriocidin characterized from Propionibacterium freudenreichii. Probab=97.06 E-value=0.0014 Score=41.63 Aligned_cols=59 Identities=19% Similarity=0.268 Sum_probs=47.3 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC-CCCCHHHHHHHHHHHHHH Q ss_conf 4773776168655028999999999986077996079997879847-558589999999999997 Q gi|254781093|r 146 PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF-FIGKVDELINECAHYLDN 209 (225) Q Consensus 146 ~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~-f~~~~~~l~~~i~~fl~~ 209 (225) ..|+|+|.|+.|..+|.+.++.+.+.+... .+++++|.+|. |.....-+.+.+.+||.. T Consensus 34 ~~p~Lvv~~~~Dp~TP~~~a~~~a~~l~~s-----~lvt~~g~GH~~~~~~s~Cv~~~v~~yL~~ 93 (103) T pfam08386 34 APPVLLVQGEGDPATPYEGARALARALGNA-----VLVTVNGAGHGAYLGGNKCVDKAVDAYLLT 93 (103) T ss_pred CCCEEEECCCCCCCCHHHHHHHHHHHCCCC-----EEEEEECCCCEEECCCCHHHHHHHHHHHHC T ss_conf 999889746778886199999999977995-----599873799626217884399999999757 No 147 >pfam02089 Palm_thioest Palmitoyl protein thioesterase. Probab=96.95 E-value=0.013 Score=35.80 Aligned_cols=102 Identities=16% Similarity=0.137 Sum_probs=55.7 Q ss_pred EEEECCCCCCCCCCCCH-HHHHHHHHHHHC--CCEEEEEEECCCCCCCCCCCCCCHHHH-HHHHHHHHHHHHCCCCCCEE Q ss_conf 99957899888899899-999999999973--984999853155887677852102389-99998999986235467405 Q gi|254781093|r 28 ALILHPHPRFGGTMNDN-IVYQLFYLFQQR--GFVSLRFNFRGIGRSEGEFDYGDGELS-DAAAALDWVQSLNPESKSCW 103 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~-~~~~la~~l~~~--G~~vl~fd~RG~G~S~G~~~~g~~E~~-D~~aa~~~l~~~~~~~~~i~ 103 (225) +|+.| +.|.+..|+ .+..+.+...+. |..|..... |.+..+-....-.+.+. .+..+++-+++...-.+-+- T Consensus 8 vViwH---GlgDsc~~~~~m~~i~~~i~~~~PG~~V~~i~i-g~~~~~D~~~s~f~~v~~Qv~~vc~~l~~~p~L~~G~n 83 (279) T pfam02089 8 LVIWH---GMGDSCCNPLSMGAIKKMVEKEIPGIHVLSLEI-GKNLMEDVENSFFLNVNSQVNMVCQILAKDPKLQQGYN 83 (279) T ss_pred EEEEC---CCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEE-CCCCCHHHHCCCEECHHHHHHHHHHHHHHCHHHHCCCC T ss_conf 89984---687145681228999999997689938999996-89700442323000599999999999973956524756 Q ss_pred EEEECHHHHHHHHHHHCC--CCHHEEEECCCC Q ss_conf 888720368787645038--621330122757 Q gi|254781093|r 104 IAGYSFGAWISMQLLMRR--PEINGFISVAPQ 133 (225) Q Consensus 104 l~G~S~Gg~val~~a~~~--p~i~~~v~isp~ 133 (225) ++|||.||.++=.++.+. |.+..+|+++.+ T Consensus 84 ~IGfSQGgl~~R~~verc~~p~V~nlISlggp 115 (279) T pfam02089 84 AIGFSQGGQFLRAVAQRCPSPPMMNLISVGGQ 115 (279) T ss_pred EEEECCCHHHHHHHHHHCCCCCCCEEEEECCC T ss_conf 44365402666655654599873236870688 No 148 >pfam05577 Peptidase_S28 Serine carboxypeptidase S28. These serine proteases include several eukaryotic enzymes such as lysosomal Pro-X carboxypeptidase, dipeptidyl-peptidase II, and thymus-specific serine peptidase. Probab=96.88 E-value=0.03 Score=33.59 Aligned_cols=110 Identities=15% Similarity=0.157 Sum_probs=69.6 Q ss_pred CCCCEEEEECCC-CCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCC--CCCC-------CCHHHHHHHHHHHHH Q ss_conf 778799995789-988889989999999999997398499985315588767--7852-------102389999989999 Q gi|254781093|r 23 PNAPIALILHPH-PRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEG--EFDY-------GDGELSDAAAALDWV 92 (225) Q Consensus 23 ~~~~~vv~~Hp~-p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G--~~~~-------g~~E~~D~~aa~~~l 92 (225) +..|+.|++-+- +.-++...+..+..+|+. .|=.++..-.|=.|.|.= ..+. ...-++|+...+.++ T Consensus 27 ~ggPifly~gGE~~~~~~~~~~g~~~~lA~~---~~a~~v~lEHRYYG~S~P~~~~s~enL~yLt~~QALaD~a~Fi~~~ 103 (433) T pfam05577 27 NGGPIFLMIGGEGPESASWVRNGHWLDLAKE---FGALVFSLEHRFYGQSKPIGDLSTANLRYLSSLQALADVASFIKAM 103 (433) T ss_pred CCCCEEEEECCCCCCCCCCCCCCHHHHHHHH---HCCEEEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 9997899989987675231136289999998---3996999851035036688998811236478999999999999999 Q ss_pred HHHCC--CCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCC Q ss_conf 86235--46740588872036878764503862-133012275744 Q gi|254781093|r 93 QSLNP--ESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPK 135 (225) Q Consensus 93 ~~~~~--~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~ 135 (225) +++.. +..+++++|-||||.+|+..-.+.|. +.+.++-|.|+. T Consensus 104 k~~~~~~~~~pwI~~GGSY~G~LaAw~R~kYP~~v~ga~ASSApv~ 149 (433) T pfam05577 104 NQKFNGLSSSKWITFGGSYSGSLAAWARKKYPHLVVGAVASSAPLL 149 (433) T ss_pred HHHCCCCCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCEE T ss_conf 9863788999989987860879999999868881799996341101 No 149 >pfam02450 LACT Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol. Probab=96.71 E-value=0.0035 Score=39.19 Aligned_cols=81 Identities=15% Similarity=0.249 Sum_probs=54.1 Q ss_pred HHHHHHHHHHHCCCEE---EE---EEECCCCCCCCCCCCCCHH-HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHH Q ss_conf 9999999999739849---99---8531558876778521023-899999899998623546740588872036878764 Q gi|254781093|r 45 IVYQLFYLFQQRGFVS---LR---FNFRGIGRSEGEFDYGDGE-LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQL 117 (225) Q Consensus 45 ~~~~la~~l~~~G~~v---l~---fd~RG~G~S~G~~~~g~~E-~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~ 117 (225) +...+.+.|++.||.. ++ +|+|=. +.+ ..+ ..+++..++.+-+.+ .+++.|+|||||+.+++.. T Consensus 66 ~~~~li~~L~~~GY~~~~~l~~ApYDwR~~------p~~-~~~yf~~LK~lIE~~y~~n--g~kVvli~HSmG~~~~~~F 136 (382) T pfam02450 66 IWHKVVKNLVNIGYERNKTVSAAPYDWRLS------PAE-RDDYFKKLKQLIEEALKLS--GQKVVLIGHSMGNLLVLYF 136 (382) T ss_pred EHHHHHHHHHHCCCCCCCEEEECCCCCCCC------CHH-HHHHHHHHHHHHHHHHHHC--CCEEEEEEECCCCHHHHHH T ss_conf 099999999980987897133066332468------405-6689999999999999970--9869999757852899999 Q ss_pred HHCCC-------CHHEEEECCCCC Q ss_conf 50386-------213301227574 Q gi|254781093|r 118 LMRRP-------EINGFISVAPQP 134 (225) Q Consensus 118 a~~~p-------~i~~~v~isp~~ 134 (225) ....+ -|+++|.++++. T Consensus 137 L~~~~~~~W~dk~I~~~i~i~~~~ 160 (382) T pfam02450 137 LLWVEAEGWKDQHIDAFISLGAPL 160 (382) T ss_pred HHHCCCCCHHHHHHHHEECCCCCC T ss_conf 752531207888677523134334 No 150 >COG2819 Predicted hydrolase of the alpha/beta superfamily [General function prediction only] Probab=96.71 E-value=0.025 Score=33.98 Aligned_cols=110 Identities=16% Similarity=0.183 Sum_probs=62.1 Q ss_pred HHHHHHHH-HHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCCCC------CCCCC-CCCEEE Q ss_conf 89999989-9998623-546740588872036878764503862-13301227574433322------23433-477377 Q gi|254781093|r 82 LSDAAAAL-DWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYDFS------FLAPC-PSSGLI 151 (225) Q Consensus 82 ~~D~~aa~-~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~~~------~l~~~-~~p~Li 151 (225) ++.+...+ -|+.+.. .+..+..++||||||..++.+...+|+ ...+.++||..-..... .+..+ ..++.+ T Consensus 117 ~~fL~~~lkP~Ie~~y~~~~~~~~i~GhSlGGLfvl~aLL~~p~~F~~y~~~SPSlWw~n~~~l~~~~~~~~~~~~~i~l 196 (264) T COG2819 117 REFLTEQLKPFIEARYRTNSERTAIIGHSLGGLFVLFALLTYPDCFGRYGLISPSLWWHNEAILREIESLKLLKTKRICL 196 (264) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEEECCHHHHHHHHHHCCCCHHCEEEEECCHHHHCCHHHHCCCCCCCCCCCCCEEE T ss_conf 99999866788860063686551565204103888999862830320346616145427877752265555677764389 Q ss_pred EECC--CC---CCCCHHH---HHHHHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 6168--65---5028999---999999986077996079997879847 Q gi|254781093|r 152 INGS--ND---TVATTSD---VKDLVNKLMNQKGISITHKVIPDANHF 191 (225) Q Consensus 152 IhG~--~D---~~vp~~~---~~~~~~~l~~~~~~~~~~~~i~ga~H~ 191 (225) .-|. .| ....-+. +.+..+.+..-.+..+.+...++.+|. T Consensus 197 ~iG~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~f~~~~~~~H~ 244 (264) T COG2819 197 YIGSGELDSSRSIRMAENKQEAAELSSLLEKRTGARLVFQEEPLEHHG 244 (264) T ss_pred EECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCC T ss_conf 943666676402456667999999999876424770584446566650 No 151 >TIGR01838 PHA_synth_I poly(R)-hydroxyalkanoic acid synthase, class I; InterPro: IPR010963 This entry represents the class I subfamily of poly(R)-hydroxyalkanoate synthases, which polymerises hydroxyacyl-CoAs with three to five carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of the cell.; GO: 0008415 acyltransferase activity, 0042619 poly-hydroxybutyrate biosynthetic process, 0005737 cytoplasm. Probab=96.68 E-value=0.03 Score=33.53 Aligned_cols=157 Identities=16% Similarity=0.191 Sum_probs=95.8 Q ss_pred HHHHHHHHCCCEEEEEEECCCCCCCCC--CCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHH-----HHHHHHHC Q ss_conf 999999973984999853155887677--85210238999998999986235467405888720368-----78764503 Q gi|254781093|r 48 QLFYLFQQRGFVSLRFNFRGIGRSEGE--FDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAW-----ISMQLLMR 120 (225) Q Consensus 48 ~la~~l~~~G~~vl~fd~RG~G~S~G~--~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~-----val~~a~~ 120 (225) .+-+-|.++|+.|+...=|.-..+.-. |+. -=.+=+.+|++-+++.. .-+++-.+|||-||. +|.++|.+ T Consensus 211 SlvrWlv~QG~TVF~~SWrNPd~~~A~~tfdD--Y~~~G~~~Al~~v~~it-Ge~~~N~~GYCIGGT~Ls~alA~lAArg 287 (541) T TIGR01838 211 SLVRWLVEQGHTVFLISWRNPDAEQADLTFDD--YVRDGVIAALEVVEEIT-GEKQVNAVGYCIGGTLLSTALAYLAARG 287 (541) T ss_pred HHHHHHHHCCCCEEEEEECCCCHHHHHCCHHH--HHHHHHHHHHHHHHHHC-CCEEEEEEEEEHHHHHHHHHHHHHHHCC T ss_conf 58999995499389986006275645367478--99999999999998615-5406504541044789999999999617 Q ss_pred -CCCHHEEEEC-------------------------------CCCCCCC------------------------------- Q ss_conf -8621330122-------------------------------7574433------------------------------- Q gi|254781093|r 121 -RPEINGFISV-------------------------------APQPKSY------------------------------- 137 (225) Q Consensus 121 -~p~i~~~v~i-------------------------------sp~~~~~------------------------------- 137 (225) +.+|+..-+. +.|+... T Consensus 288 ~~~ri~SaTffTTllDF~~~G~l~~F~~E~~v~~iE~~~~~~GGP~~~l~G~~la~TFSlLR~NdL~WnY~vdnYLKG~~ 367 (541) T TIGR01838 288 EKKRIKSATFFTTLLDFSDPGELGVFVDEEIVSAIERQNNQKGGPIAYLDGRSLAVTFSLLRENDLIWNYYVDNYLKGKS 367 (541) T ss_pred CCCCHHHHHHHHHHHCHHCCCCCCEEECCHHHHHHHHHHCCCCCCEEEECCHHHHHHHHHCCCCCEEECCCCCCCCCCCC T ss_conf 89612458899998431027853112380899999998413789567506056888873124687020663043146775 Q ss_pred --CC------------------------------------------CCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf --32------------------------------------------2234334773776168655028999999999986 Q gi|254781093|r 138 --DF------------------------------------------SFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLM 173 (225) Q Consensus 138 --~~------------------------------------------~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~ 173 (225) +| -+|..+.+|+.+|-..+|-++|...++.=++.+. T Consensus 368 P~pFDLLfWN~DsTN~Pg~~h~~yLR~lYl~N~L~~~g~l~~~G~~lDL~~vk~P~y~~at~eDHIApW~~~y~G~~~lG 447 (541) T TIGR01838 368 PVPFDLLFWNSDSTNLPGKMHNFYLRNLYLQNELVSKGKLEVGGVRLDLSKVKVPVYIIATKEDHIAPWQSAYRGAALLG 447 (541) T ss_pred CCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCEEEECCCCCCCEEEEECCCCCCCCCHHHHHHHHHHC T ss_conf 78600121155011004778989987530101111688548732462034243124576134443458279999998853 Q ss_pred -HCCCCCEEEEEECCCCC--------------CCCCCHHHHHHHHHHHHHHHCC Q ss_conf -07799607999787984--------------7558589999999999997312 Q gi|254781093|r 174 -NQKGISITHKVIPDANH--------------FFIGKVDELINECAHYLDNSLD 212 (225) Q Consensus 174 -~~~~~~~~~~~i~ga~H--------------~f~~~~~~l~~~i~~fl~~~L~ 212 (225) . +.+|+ +-++|| +|++-.+++.+-...||+..-. T Consensus 448 sG----~~rFv-LgeSGHIAGvvNPP~k~KY~~WTn~N~~~~~~~~~Wl~gA~~ 496 (541) T TIGR01838 448 SG----EKRFV-LGESGHIAGVVNPPSKNKYGYWTNANDALPEDPEQWLAGAEE 496 (541) T ss_pred CC----CCEEE-ECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHCCEE T ss_conf 89----83488-526650500467887777753127766878897889751860 No 152 >pfam07082 DUF1350 Protein of unknown function (DUF1350). This family consists of several hypothetical proteins from both cyanobacteria and plants. Members of this family are typically around 250 residues in length. The function of this family is unknown but the species distribution indicates that the family may be involved in photosynthesis. Probab=96.44 E-value=0.04 Score=32.81 Aligned_cols=161 Identities=20% Similarity=0.191 Sum_probs=87.7 Q ss_pred CHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCC--HH-HHHHHHHHHHHHHHC-CC--CCCEEEEEECHHHHHHHH Q ss_conf 9999999999997398499985315588767785210--23-899999899998623-54--674058887203687876 Q gi|254781093|r 43 DNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGD--GE-LSDAAAALDWVQSLN-PE--SKSCWIAGYSFGAWISMQ 116 (225) Q Consensus 43 ~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~--~E-~~D~~aa~~~l~~~~-~~--~~~i~l~G~S~Gg~val~ 116 (225) +-....+.+.|+++||.|+.--|.- .||... -| ..+...+++.+.++. .+ .-+++=+|||+|+-+-+. T Consensus 33 ~vtYr~lLe~L~~~g~~ViAtpy~~------~fDH~~iA~~v~~~F~~~~~~L~~~~g~~~~~LPv~gvGHSlGckLhLL 106 (250) T pfam07082 33 QLTYRWLLEHLGEAGYVVIATPFVN------TFDHGAIALSVLNKFEYALERLVHRGGYPPAYLPIYGLGHSMGCKLHLL 106 (250) T ss_pred HHHHHHHHHHHHHCCCEEEEECCCC------CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCCCCHHHEE T ss_conf 8899999999987896899923678------9877999999999999999999973499844475231343323165413 Q ss_pred HHHCCC-CHHEEEECC-----------------CCCCCCCCCCCCC----------CCCCEEEEECCCCCCCCHHHHHHH Q ss_conf 450386-213301227-----------------5744333222343----------347737761686550289999999 Q gi|254781093|r 117 LLMRRP-EINGFISVA-----------------PQPKSYDFSFLAP----------CPSSGLIINGSNDTVATTSDVKDL 168 (225) Q Consensus 117 ~a~~~p-~i~~~v~is-----------------p~~~~~~~~~l~~----------~~~p~LiIhG~~D~~vp~~~~~~~ 168 (225) ..+..+ +-.+-+++| |..+ .+|..-.. .....|+|-=++|++- ++..+ T Consensus 107 i~s~~~~~R~gniliSFNN~~A~~aIPll~~l~~~l~-~EF~PsP~ET~~li~~~Y~v~rnLLIkF~~D~ID---qt~~L 182 (250) T pfam07082 107 IGSLYDVERAGNILMAFNNYPAKQAIPWMDNFGTALS-VEFTPSPTETNHLIQERYPVRRNLLIKFQNDDID---QTARL 182 (250) T ss_pred ECCCCCCCCCCCEEEECCCCCHHHCCCHHHHHCCCCC-CCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCC---CCHHH T ss_conf 2234678755313543068435642725987184335-7434687999999997058866358883798854---65999 Q ss_pred HHHHHHCCCCCEEEEEECCCCCCCCC----C--H-HH--HHHHHHHHHHHHCCCC Q ss_conf 99986077996079997879847558----5--8-99--9999999999731221 Q gi|254781093|r 169 VNKLMNQKGISITHKVIPDANHFFIG----K--V-DE--LINECAHYLDNSLDEK 214 (225) Q Consensus 169 ~~~l~~~~~~~~~~~~i~ga~H~f~~----~--~-~~--l~~~i~~fl~~~L~~~ 214 (225) .+.++...+..++....+| ||.--- + . .+ =.+++..|+++.+... T Consensus 183 ~~~L~~R~~~~~~~~~L~G-~HlTP~~q~l~wq~g~~ftP~Dai~Q~~k~~~~~d 236 (250) T pfam07082 183 RSILRAKFGDMVTALKLPG-NHLTPLGQDLKWQTGAEFSPLDALGQWIKQSLFPD 236 (250) T ss_pred HHHHHHHCCCCEEEEECCC-CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 9998632468626898789-87785667767555885683488999999988779 No 153 >pfam06259 DUF1023 Alpha/beta hydrolase of unknown function (DUF1023). Family of uncharacterized proteins found in Actinobacteria. Members of this family have an alpha/beta hydrolase fold. Probab=96.33 E-value=0.021 Score=34.43 Aligned_cols=80 Identities=14% Similarity=0.175 Sum_probs=50.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECC-CCCCCCCCCCCCCCCCCEEEEECCCCCCC Q ss_conf 89999989999862354674058887203687876450386213301227-57443332223433477377616865502 Q gi|254781093|r 82 LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVA-PQPKSYDFSFLAPCPSSGLIINGSNDTVA 160 (225) Q Consensus 82 ~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~is-p~~~~~~~~~l~~~~~p~LiIhG~~D~~v 160 (225) ..++...++-|+..+....++-++|||||+.++-.++....++.-+|+++ |.......+.|.-.+..+...-++.|.+- T Consensus 91 a~~L~~F~~gL~a~~~~~~~~tv~GHSYGStv~G~Aa~~g~~vDd~v~~GSPG~g~~~a~dL~~~~~~v~a~~a~~D~I~ 170 (177) T pfam06259 91 APRLNRLLRDLRATTVPGQHLTLFGHSYGSLVCGLALDDGSPVSDIVLYGSPGTEATSAAQLGTEPAHVWAMRGPDDWIA 170 (177) T ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHCCCCCCEEEEEECCCCCCC T ss_conf 99999999987501589997689974443678999862689865289977999899988882999870888728999705 Q ss_pred C Q ss_conf 8 Q gi|254781093|r 161 T 161 (225) Q Consensus 161 p 161 (225) - T Consensus 171 ~ 171 (177) T pfam06259 171 N 171 (177) T ss_pred C T ss_conf 6 No 154 >KOG1551 consensus Probab=96.29 E-value=0.017 Score=34.97 Aligned_cols=190 Identities=15% Similarity=0.161 Sum_probs=94.7 Q ss_pred CCCCCEEEE-EEECCCCCCCCEEEEECCCCCCCCCCCCHHH--HHHHHHHHHCCCEEEEEE--ECCCCCCCCCCC----- Q ss_conf 589961879-9966888778799995789988889989999--999999997398499985--315588767785----- Q gi|254781093|r 7 NGPSGRLEG-RYQPSTNPNAPIALILHPHPRFGGTMNDNIV--YQLFYLFQQRGFVSLRFN--FRGIGRSEGEFD----- 76 (225) Q Consensus 7 ~g~~G~l~~-~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~~--~~la~~l~~~G~~vl~fd--~RG~G~S~G~~~----- 76 (225) |+-.|+-.. +|.| + ..+++++..- |+.|+... ..|..-+..+|+++.... |.|--.-+-++. T Consensus 96 P~~~~~A~~~~liP-Q-K~~~lcl~~a------~tgdh~y~rr~~L~~p~~k~~i~tmvle~pfYgqr~p~~q~~~~Le~ 167 (371) T KOG1551 96 PPESRTARVAWLIP-Q-KMADLCLSWA------LTGDHVYTRRLVLSKPINKREIATMVLEKPFYGQRVPEEQIIHMLEY 167 (371) T ss_pred CCCCCCEEEEEECC-C-CCCCEEEEEE------ECCCCEEEEEEEECCCHHHHCCHHEEEECCCCCCCCCHHHHHHHHHH T ss_conf 97445200233336-6-7677148985------05773167523414823530202124405544566888999999998 Q ss_pred ------CCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCC--CC-------CCC--- Q ss_conf ------21023899999899998623546740588872036878764503862-133012275--74-------433--- Q gi|254781093|r 77 ------YGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAP--QP-------KSY--- 137 (225) Q Consensus 77 ------~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp--~~-------~~~--- 137 (225) .|.+-++.....+.|-... ...++.++|.||||.+|-++.+..+. |..+-++++ +. -.+ T Consensus 168 vtDlf~mG~A~I~E~~~lf~Ws~~~--g~g~~~~~g~Smgg~~a~~vgS~~q~Pva~~p~l~~~~asvs~teg~l~~~~s 245 (371) T KOG1551 168 VTDLFKMGRATIQEFVKLFTWSSAD--GLGNLNLVGRSMGGDIANQVGSLHQKPVATAPCLNSSKASVSATEGLLLQDTS 245 (371) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC--CCCCCEEEEEECCCHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 7899986089999999864643013--75663056420351788764016888750023336654526654203312047 Q ss_pred -----------------------------------------------CCCCCCCCCCC-----EEEEECCCCCCCCHHHH Q ss_conf -----------------------------------------------32223433477-----37761686550289999 Q gi|254781093|r 138 -----------------------------------------------DFSFLAPCPSS-----GLIINGSNDTVATTSDV 165 (225) Q Consensus 138 -----------------------------------------------~~~~l~~~~~p-----~LiIhG~~D~~vp~~~~ 165 (225) ++..++..|.| +.++..++|.++|-..+ T Consensus 246 ~~~~~~~~t~~~~~~~r~p~Q~~~~~~~~~srn~~~E~~~~Mr~vmd~~T~v~~fp~Pvdpsl~ivv~A~~D~Yipr~gv 325 (371) T KOG1551 246 KMKRFNQTTNKSGYTSRNPAQSYHLLSKEQSRNSRKESLIFMRGVMDECTHVANFPVPVDPSLIIVVQAKEDAYIPRTGV 325 (371) T ss_pred HHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHCCCCCCCCCEEEEEEECCCCCCCCCCC T ss_conf 77763257304456550710568998887621352889999999887601133077789987699998337752225574 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCCC--CCCCCHHHHHHHHHHHHHHHCC Q ss_conf 9999998607799607999787984--7558589999999999997312 Q gi|254781093|r 166 KDLVNKLMNQKGISITHKVIPDANH--FFIGKVDELINECAHYLDNSLD 212 (225) Q Consensus 166 ~~~~~~l~~~~~~~~~~~~i~ga~H--~f~~~~~~l~~~i~~fl~~~L~ 212 (225) ..+ +.+- .+..++ .++ .+| .|--+.++..++|.+-|++.=. T Consensus 326 ~~l-Q~~W--Pg~eVr--~~e-gGHVsayl~k~dlfRR~I~d~L~R~~k 368 (371) T KOG1551 326 RSL-QEIW--PGCEVR--YLE-GGHVSAYLFKQDLFRRAIVDGLDRLDK 368 (371) T ss_pred HHH-HHHC--CCCEEE--EEE-CCCEEEEEHHCHHHHHHHHHHHHHHHH T ss_conf 778-9748--997799--961-574320101030899999998876650 No 155 >pfam01083 Cutinase Cutinase. Probab=96.24 E-value=0.03 Score=33.59 Aligned_cols=148 Identities=14% Similarity=0.176 Sum_probs=74.6 Q ss_pred CCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHH-HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHH Q ss_conf 998999999999999739849998531558876778521023-8999998999986235467405888720368787645 Q gi|254781093|r 40 TMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGE-LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLL 118 (225) Q Consensus 40 ~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E-~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a 118 (225) +....+...+...+-..-..+--.||--..... .+..+..+ ..++...+....++-|+ .+|+|.|||.|+.++-.++ T Consensus 22 ~~G~~~~~al~~~~g~~~v~~~~V~YpA~~~~~-~~~~s~~~G~~~~~~~~~~~~~~CP~-tkiVl~GYSQGA~Vv~~a~ 99 (179) T pfam01083 22 SVGPAVVSALESAPGSTSVAVQGVNYPADLGQN-LYAGSSAAGINDAARLVNSAASKCPN-TKIVLGGYSQGAAVMDNAI 99 (179) T ss_pred CCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCC-CCCCCHHHHHHHHHHHHHHHHHHCCC-CCEEEEECCCHHHHHHHHH T ss_conf 540899999984538983257724766535666-66665789999999999999963999-7289864160689987652 Q ss_pred HCC-------CCHHEEEECCCCCCCCCCCCCCCC-CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCC Q ss_conf 038-------621330122757443332223433-477377616865502899999999998607799607999787984 Q gi|254781093|r 119 MRR-------PEINGFISVAPQPKSYDFSFLAPC-PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANH 190 (225) Q Consensus 119 ~~~-------p~i~~~v~isp~~~~~~~~~l~~~-~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H 190 (225) ... ..|.++++..-|-.......+.+. ...++.+.-..|.++.- .+.. -.-| T Consensus 100 ~~~~l~~~~~~~V~AvvlfGdP~~~~~~~~i~~~~~~k~~~~C~~GD~vC~~-------------~g~~-------~~~H 159 (179) T pfam01083 100 CGGGLPAAVADKVKAVVLFGDPSNGQGLPGISPLYASKTISYCATGDPVCDG-------------TGNN-------LPAH 159 (179) T ss_pred CCCCCCHHHHHHEEEEEEECCCCCCCCCCCCCCCCHHHHHHCCCCCCCEECC-------------CCCC-------CHHH T ss_conf 2688997787236789995377666897877986277775001699984089-------------9988-------5403 Q ss_pred CCCCCHHHHHHHHHHHHHHHC Q ss_conf 755858999999999999731 Q gi|254781093|r 191 FFIGKVDELINECAHYLDNSL 211 (225) Q Consensus 191 ~f~~~~~~l~~~i~~fl~~~L 211 (225) + .+.....+...+|+.++| T Consensus 160 ~--~Y~~~~~~~Aa~Fv~~k~ 178 (179) T pfam01083 160 L--SYGSDYTTQAAAFVASKL 178 (179) T ss_pred H--CHHHCCHHHHHHHHHHHC T ss_conf 2--632342558999999863 No 156 >KOG2541 consensus Probab=96.18 E-value=0.096 Score=30.50 Aligned_cols=101 Identities=21% Similarity=0.324 Sum_probs=60.1 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHC-CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 87999957899888899899999999999973-98499985315588767785210238999998999986235467405 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQR-GFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCW 103 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~-G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~ 103 (225) .| +|+.| +.|.+..|--+..+.+.+.+. |..+...+- |.|--.+.+- . -.+.+..|.+.+...---..-.. T Consensus 24 ~P-~ii~H---Gigd~c~~~~~~~~~q~l~~~~g~~v~~lei-g~g~~~s~l~--p-l~~Qv~~~ce~v~~m~~lsqGyn 95 (296) T KOG2541 24 VP-VIVWH---GIGDSCSSLSMANLTQLLEELPGSPVYCLEI-GDGIKDSSLM--P-LWEQVDVACEKVKQMPELSQGYN 95 (296) T ss_pred CC-EEEEE---CCCCCCCCCHHHHHHHHHHHCCCCEEEEEEE-CCCCCHHHHC--C-HHHHHHHHHHHHHCCHHCCCCEE T ss_conf 87-89980---5676535521789999998689971699980-4873143322--4-99999999999841212167637 Q ss_pred EEEECHHHHHHHHHHHC--CCCHHEEEECCCC Q ss_conf 88872036878764503--8621330122757 Q gi|254781093|r 104 IAGYSFGAWISMQLLMR--RPEINGFISVAPQ 133 (225) Q Consensus 104 l~G~S~Gg~val~~a~~--~p~i~~~v~isp~ 133 (225) ++|+|.||.++=.++.. .|.+..+|+++.| T Consensus 96 ivg~SQGglv~Raliq~cd~ppV~n~ISL~gP 127 (296) T KOG2541 96 IVGYSQGGLVARALIQFCDNPPVKNFISLGGP 127 (296) T ss_pred EEEECCCCHHHHHHHHHCCCCCCCEEEECCCC T ss_conf 99974432999999986789874315751687 No 157 >KOG3724 consensus Probab=96.17 E-value=0.098 Score=30.45 Aligned_cols=51 Identities=20% Similarity=0.285 Sum_probs=31.1 Q ss_pred HHHHHHHHHHHHHHCC--------CCCCEEEEEECHHHHHHHHHHHCCC-----CHHEEEECCCC Q ss_conf 8999998999986235--------4674058887203687876450386-----21330122757 Q gi|254781093|r 82 LSDAAAALDWVQSLNP--------ESKSCWIAGYSFGAWISMQLLMRRP-----EINGFISVAPQ 133 (225) Q Consensus 82 ~~D~~aa~~~l~~~~~--------~~~~i~l~G~S~Gg~val~~a~~~p-----~i~~~v~isp~ 133 (225) .+=+..|+.|+.+++. .+..++|+||||||.||-.++ ..+ .|..++..+.| T Consensus 156 tEYV~dAIk~ILslYr~~~e~~~p~P~sVILVGHSMGGiVAra~~-tlkn~~~~sVntIITlssP 219 (973) T KOG3724 156 TEYVNDAIKYILSLYRGEREYASPLPHSVILVGHSMGGIVARATL-TLKNEVQGSVNTIITLSSP 219 (973) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCHHHHHHHHH-HHHHHCCCHHHHHHHHCCC T ss_conf 999999999999986066556889995499982451329999998-5011015505531100576 No 158 >pfam11288 DUF3089 Protein of unknown function (DUF3089). This family of proteins has no known function. Probab=95.81 E-value=0.016 Score=35.24 Aligned_cols=65 Identities=18% Similarity=0.101 Sum_probs=42.5 Q ss_pred CCEEEEEEECCCC-CCC-CCC-CCCC--HH--HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCC Q ss_conf 9849998531558-876-778-5210--23--8999998999986235467405888720368787645038 Q gi|254781093|r 57 GFVSLRFNFRGIG-RSE-GEF-DYGD--GE--LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRR 121 (225) Q Consensus 57 G~~vl~fd~RG~G-~S~-G~~-~~g~--~E--~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~ 121 (225) -+.+..+-||-.- .|- ... ..+. .| .+|++.|.+|-.+...+.++++|+|||.|+..+..+.... T Consensus 45 ~~~vyAP~YRQat~~~~~~~~~~~~~~a~~~AY~DV~~AF~~yl~~~n~grPfILagHSQGs~h~~rLl~e~ 116 (201) T pfam11288 45 VCRVFAPRYRQATLGAFLATDRGEASAALDLAYSDVRRAFDAYLANDNGGRPFILAGHSQGALHLLRLLREE 116 (201) T ss_pred CCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHHH T ss_conf 285347168887888874144310389999889999999999999659999679998356899999999988 No 159 >cd00519 Lipase_3 Lipase (class 3). Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation," the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site. Probab=95.78 E-value=0.041 Score=32.70 Aligned_cols=121 Identities=17% Similarity=0.190 Sum_probs=58.6 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCH-----HHHHHHHHHHHHHHHCC Q ss_conf 7787999957899888899899999999999973984999853155887677852102-----38999998999986235 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDG-----ELSDAAAALDWVQSLNP 97 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~-----E~~D~~aa~~~l~~~~~ 97 (225) ..+.++|.+- ||.. +...+.+.-+.-..++. ...+.|.-..|.. -..++...+.-++++++ T Consensus 61 ~~~~ivv~fR------GT~s------~~d~~~dl~~~~~~~~~--~~~~~~~VH~Gf~~a~~~~~~~~~~~~~~~~~~~p 126 (229) T cd00519 61 DRKTIVIAFR------GTVS------LADWLTDLDFSPVPLDP--PLCSGGKVHSGFYSAYKSLYNQVLPELKSALKQYP 126 (229) T ss_pred CCCEEEEEEC------CCCC------HHHHHHHCEEEEEECCC--CCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9998999988------9998------89999871531030566--79997487271999999999999999999998789 Q ss_pred CCCCEEEEEECHHHHHHHHHHHCC------CCHHEEEECCCCCCCCCCCCC--CCCCCCEEEEECCCCCC Q ss_conf 467405888720368787645038------621330122757443332223--43347737761686550 Q gi|254781093|r 98 ESKSCWIAGYSFGAWISMQLLMRR------PEINGFISVAPQPKSYDFSFL--APCPSSGLIINGSNDTV 159 (225) Q Consensus 98 ~~~~i~l~G~S~Gg~val~~a~~~------p~i~~~v~isp~~~~~~~~~l--~~~~~p~LiIhG~~D~~ 159 (225) + .+|++.|||+||++|..++..- ..+..+..-+|.+....|... .....-..++|. +|.+ T Consensus 127 ~-~~i~vTGHSLGgalA~L~a~~l~~~~~~~~~~~~tfG~Prvgn~~fa~~~~~~~~~~~Rvv~~-~D~V 194 (229) T cd00519 127 D-YKIIVTGHSLGGALASLLALDLRLRGPGSDVTVYTFGQPRVGNAAFAEYLESTKGRVYRVVHG-NDIV 194 (229) T ss_pred C-CEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHCCCCCEEEEEEC-CCCE T ss_conf 9-569996467068999999999997179997079994799757899999997179988999978-9924 No 160 >pfam11187 DUF2974 Protein of unknown function (DUF2974). This bacterial family of proteins has no known function. Probab=95.75 E-value=0.018 Score=34.87 Aligned_cols=72 Identities=17% Similarity=0.252 Sum_probs=40.4 Q ss_pred EEEEEECCCCCCC-C---CCCCCC-HHHHHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHHHHCCC-----CHHEEE Q ss_conf 9998531558876-7---785210-23899999899998623-54674058887203687876450386-----213301 Q gi|254781093|r 60 SLRFNFRGIGRSE-G---EFDYGD-GELSDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQLLMRRP-----EINGFI 128 (225) Q Consensus 60 vl~fd~RG~G~S~-G---~~~~g~-~E~~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~a~~~p-----~i~~~v 128 (225) ++..-|||.-.|- | +|.... .++--=..|++|+.+.. ....+++|.|||-||..|.-+|+..+ +|..+. T Consensus 38 t~~VaFRGTD~TlvGWkEDfnmsy~~~vpaQ~~A~~YL~~~~~~~~~~i~l~GHSKGGNLA~YAa~~~~~~~~~rI~~vy 117 (224) T pfam11187 38 TYYVAFRGTDDTLVGWKEDFNMSFMPEVPAQRSAAKYLNKILAHYPGPIYLGGHSKGGNLAVYAAANAEPDLQDRIIKIY 117 (224) T ss_pred EEEEEEECCCCCEEEECCCCCEEECCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCHHHHHHHEEEE T ss_conf 39999988888703371136003358987899999999999987799789995481678999999719986874632894 Q ss_pred ECC Q ss_conf 227 Q gi|254781093|r 129 SVA 131 (225) Q Consensus 129 ~is 131 (225) +.- T Consensus 118 s~D 120 (224) T pfam11187 118 SFD 120 (224) T ss_pred CCC T ss_conf 258 No 161 >TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae Probab=95.62 E-value=0.13 Score=29.80 Aligned_cols=89 Identities=12% Similarity=0.190 Sum_probs=50.3 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCC------CC-CCCCC------------------C Q ss_conf 87999957899888899899999999999973984999853155887------67-78521------------------0 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRS------EG-EFDYG------------------D 79 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S------~G-~~~~g------------------~ 79 (225) -|++|+-| +-++.+.+ ...+|..|...||+++.+|.-=||.- .| .+..+ + T Consensus 449 ~PvvIyqH---GITs~Ke~--a~a~A~~la~ag~AtvAID~PLHGeR~~d~~~~~~~~t~~~~~~fiNL~~l~~aRDNlR 523 (792) T TIGR03502 449 WPVVIYQH---GITGAKEN--ALAFAGTLAAAGVATIAIDHPLHGARSFDANASGVNATNANVLAYMNLASLLVARDNLR 523 (792) T ss_pred CCEEEEEE---CCCCCCHH--HHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCHHHH T ss_conf 84699985---26775045--88875432037558998257656656668777654356898434887666322111288 Q ss_pred HHHHHH---HHHHHHHHHH--------CCCCCCEEEEEECHHHHHHHHHH Q ss_conf 238999---9989999862--------35467405888720368787645 Q gi|254781093|r 80 GELSDA---AAALDWVQSL--------NPESKSCWIAGYSFGAWISMQLL 118 (225) Q Consensus 80 ~E~~D~---~aa~~~l~~~--------~~~~~~i~l~G~S~Gg~val~~a 118 (225) .-+.|+ ++++..+... ..+..++.++|||+||-+....+ T Consensus 524 Qsv~DlL~LRasL~~~~~~~~~~~~~~~~d~skV~f~GhSLGgIvG~~f~ 573 (792) T TIGR03502 524 QSILDLLGLRLSLNGSALAGAPLSGINVIDGSKVSFLGHSLGGIVGTSFI 573 (792) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHH T ss_conf 88899899999875321134544556656666216986421200053646 No 162 >cd00741 Lipase Lipase. Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation", the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site. Probab=95.50 E-value=0.07 Score=31.34 Aligned_cols=56 Identities=13% Similarity=0.110 Sum_probs=31.9 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCC-----CCHHEEEECCCC-CCCCCC Q ss_conf 999998999986235467405888720368787645038-----621330122757-443332 Q gi|254781093|r 83 SDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRR-----PEINGFISVAPQ-PKSYDF 139 (225) Q Consensus 83 ~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~-----p~i~~~v~isp~-~~~~~~ 139 (225) ..+...+.-.++++++ .+|++.|||+||++|..++..- .....++..++| +....+ T Consensus 12 ~~~~~~~~~~~~~~p~-~~l~vtGHSLGgalA~l~a~~l~~~~~~~~~~~~tfg~Prvg~~~~ 73 (153) T cd00741 12 NLVLPLLKSALAQYPD-YKIHVTGHSLGGALAGLAGLDLRGRGLGRLVRVYTFGPPRVGNAAF 73 (153) T ss_pred HHHHHHHHHHHHHCCC-CEEEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHC T ss_conf 9999999999998899-7799996362789999999999972789752899808985655111 No 163 >pfam06850 PHB_depo_C PHB de-polymerase C-terminus. This family represents the C-terminus of bacterial poly(3-hydroxybutyrate) (PHB) de-polymerase. This degrades PHB granules to oligomers and monomers of 3-hydroxy-butyric acid. Probab=95.46 E-value=0.01 Score=36.41 Aligned_cols=65 Identities=15% Similarity=0.283 Sum_probs=48.3 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC--CCC--CHHHHHHHHHHHHHHH Q ss_conf 4773776168655028999999999986077996079997879847--558--5899999999999973 Q gi|254781093|r 146 PSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF--FIG--KVDELINECAHYLDNS 210 (225) Q Consensus 146 ~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~--f~~--~~~~l~~~i~~fl~~~ 210 (225) ..+.|.|-|++|+++.+.+..+..+-+..........++.+|+||. |++ ..+++.=.+.+|++++ T Consensus 134 ~taLltvEGe~DDIsg~GQT~AA~~LC~glp~~~k~~~~~~g~GHYGlF~G~rwr~~i~P~i~~fi~~~ 202 (203) T pfam06850 134 RVALMTVEGENDDISGLGQTKAALDLCTGIPADRKAHHMQPGVGHYGVFNGSRFREEIYPLIRDFIREY 202 (203) T ss_pred HHHHHEECCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHCCCCCCCEEEECCHHHHHHHHHHHHHHHHHC T ss_conf 100210126656667628899999986279999988870689992564156667643518999999864 No 164 >pfam01764 Lipase_3 Lipase (class 3). Probab=95.45 E-value=0.024 Score=34.11 Aligned_cols=37 Identities=14% Similarity=0.156 Sum_probs=24.9 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHC Q ss_conf 99999899998623546740588872036878764503 Q gi|254781093|r 83 SDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMR 120 (225) Q Consensus 83 ~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~ 120 (225) .++...+.-+.+++++ .++++.|||+||++|..++.. T Consensus 47 ~~i~~~l~~~~~~~~~-~~l~itGHSLGGa~A~l~a~~ 83 (141) T pfam01764 47 DQILEELKRLLEKYPD-YKIVVTGHSLGGALASLAAAD 83 (141) T ss_pred HHHHHHHHHHHHHCCC-CEEEEEECCHHHHHHHHHHHH T ss_conf 9999999999997899-769998056578999999999 No 165 >pfam07519 Tannase Tannase and feruloyl esterase. This family includes fungal tannase and feruloyl esterase. It also includes several bacterial homologues of unknown function. Probab=95.41 E-value=0.028 Score=33.74 Aligned_cols=66 Identities=11% Similarity=0.168 Sum_probs=50.7 Q ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHCCCC-------CEEEEEECCCCCCCCCC---HHHHHHHHHHHHHHHCCC Q ss_conf 7377616865502899999999998607799-------60799978798475585---899999999999973122 Q gi|254781093|r 148 SGLIINGSNDTVATTSDVKDLVNKLMNQKGI-------SITHKVIPDANHFFIGK---VDELINECAHYLDNSLDE 213 (225) Q Consensus 148 p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~-------~~~~~~i~ga~H~f~~~---~~~l~~~i~~fl~~~L~~ 213 (225) ..|+-||..|+.+|+..+.++++++...-+. -..+=++||.+|...+. .-++..++.+|.++=..+ T Consensus 331 KLi~~HG~aD~~i~~~~ti~Yy~~v~~~~g~~~~~~~dF~Rlf~vPGm~HC~gG~gp~~~d~l~aL~~WVE~G~aP 406 (451) T pfam07519 331 KLILYHGTADPSVSPASTIRYYESVVAKMGEALAATEDFYRFFLVPGMAHCGGGAGPSGVDNLTAMVDWVENGNAP 406 (451) T ss_pred CEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCC T ss_conf 0899816777775850589999999998276635433311123248853247998888768899999998389798 No 166 >KOG2183 consensus Probab=95.40 E-value=0.17 Score=29.06 Aligned_cols=101 Identities=19% Similarity=0.280 Sum_probs=62.5 Q ss_pred CCCEEEEECCCCCCCCCCCCH--------HHHHHHHHHHHCCCEEEEEEECCCCCCCC--C--CCC----C--CH--HHH Q ss_conf 787999957899888899899--------99999999997398499985315588767--7--852----1--02--389 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDN--------IVYQLFYLFQQRGFVSLRFNFRGIGRSEG--E--FDY----G--DG--ELS 83 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~--------~~~~la~~l~~~G~~vl~fd~RG~G~S~G--~--~~~----g--~~--E~~ 83 (225) ..|+..++ ||..+- .+.++|..| +-.++-...|=+|+|.= . +.. | .. .++ T Consensus 80 ~gPIffYt-------GNEGdie~Fa~ntGFm~D~Ap~~---~AllVFaEHRyYGeS~PFG~~s~k~~~hlgyLtseQALA 149 (492) T KOG2183 80 EGPIFFYT-------GNEGDIEWFANNTGFMWDLAPEL---KALLVFAEHRYYGESLPFGSQSYKDARHLGYLTSEQALA 149 (492) T ss_pred CCCEEEEE-------CCCCCHHHHHHCCCHHHHHHHHH---CCEEEEEEHHCCCCCCCCCCHHCCCHHHHCCCCHHHHHH T ss_conf 87669994-------78521888874121587643763---752797440004667788530005746541110898899 Q ss_pred HHHHHHHHHHHH-CCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCC Q ss_conf 999989999862-3546740588872036878764503862-13301227574 Q gi|254781093|r 84 DAAAALDWVQSL-NPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQP 134 (225) Q Consensus 84 D~~aa~~~l~~~-~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~ 134 (225) |....+..+++. .+...+++.+|-||||..|+..-.++|- +.|.++-|.|+ T Consensus 150 DfA~ll~~lK~~~~a~~~pvIafGGSYGGMLaAWfRlKYPHiv~GAlAaSAPv 202 (492) T KOG2183 150 DFAELLTFLKRDLSAEASPVIAFGGSYGGMLAAWFRLKYPHIVLGALAASAPV 202 (492) T ss_pred HHHHHHHHHHHCCCCCCCCEEEECCCHHHHHHHHHHHCCHHHHHHHHHCCCCE T ss_conf 99999999863223002958995472256999999711706664156516865 No 167 >TIGR01849 PHB_depoly_PhaZ polyhydroxyalkanoate depolymerase, intracellular; InterPro: IPR010915 This entry represents an intracellular depolymerase for polyhydroxyalkanoate (PHA), a carbon and energy storing polyester that accumulates in granules in many bacterial species when carbon sources are abundant but other nutrients are limiting.. Probab=95.35 E-value=0.068 Score=31.41 Aligned_cols=174 Identities=16% Similarity=0.246 Sum_probs=102.1 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEE---EECCCCCCCCCCCCCCHHHHHH-HHHHHHHHHHCCCCC Q ss_conf 879999578998888998999999999999739849998---5315588767785210238999-998999986235467 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRF---NFRGIGRSEGEFDYGDGELSDA-AAALDWVQSLNPESK 100 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~f---d~RG~G~S~G~~~~g~~E~~D~-~aa~~~l~~~~~~~~ 100 (225) .|-|+|.=|.+ |+.-. .++...++|...- .|+-- |-|-+-=|+|+|| .+|- .-++++++-.+++ T Consensus 109 dp~vLiVAPmS---GHyAT-LLR~TV~aLLP~~-dVYiTDW~~AR~VPl~aG~FD-----~~DYIdY~ief~~~lGp~-- 176 (414) T TIGR01849 109 DPKVLIVAPMS---GHYAT-LLRSTVEALLPDH-DVYITDWVDARMVPLEAGKFD-----LEDYIDYLIEFLRFLGPD-- 176 (414) T ss_pred CCCEEEECCCH---HHHHH-HHHHHHHHHCCCC-CEEEECCCCCCCCCHHHCCCC-----HHHHHHHHHHHHHHHCCC-- T ss_conf 87127856741---25666-5567899857899-678863140160305008643-----356899999999971889-- Q ss_pred CEEEEEECHHH-----HHHHHHHHCCC-CHHEEEECCCCCCCCC-------C---------------------------- Q ss_conf 40588872036-----87876450386-2133012275744333-------2---------------------------- Q gi|254781093|r 101 SCWIAGYSFGA-----WISMQLLMRRP-EINGFISVAPQPKSYD-------F---------------------------- 139 (225) Q Consensus 101 ~i~l~G~S~Gg-----~val~~a~~~p-~i~~~v~isp~~~~~~-------~---------------------------- 139 (225) ++.+++=|.=+ ++|++.....| ..+.+++++.|.+... + T Consensus 177 r~hV~aVCQP~vPvLAA~aLmae~~~P~~P~sm~lmGGPIDaR~~PT~VN~lA~~~~~eWF~~n~i~~VP~~YpG~GR~V 256 (414) T TIGR01849 177 RIHVVAVCQPAVPVLAAVALMAENEEPAQPRSMTLMGGPIDARASPTVVNELAKEKPIEWFQENLIMRVPFPYPGAGRKV 256 (414) T ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCHHHHHHHCCCCHHHHHHCCEEECCCCCCCCCCCC T ss_conf 76378874684579999999874578888840576277766455521344300126727885387640842677788543 Q ss_pred ---------------------------------------------CCCC-----------C------------------- Q ss_conf ---------------------------------------------2234-----------3------------------- Q gi|254781093|r 140 ---------------------------------------------SFLA-----------P------------------- 144 (225) Q Consensus 140 ---------------------------------------------~~l~-----------~------------------- 144 (225) +.|+ . T Consensus 257 YPGFlQLagF~SmN~dRH~~aH~~~~~~LvkgDg~~Ad~H~~FYDEYLaVmDmtAEFYLqTi~~VF~q~~Lp~G~~~~~G 336 (414) T TIGR01849 257 YPGFLQLAGFISMNLDRHVKAHKDFFAHLVKGDGDEADKHRKFYDEYLAVMDMTAEFYLQTIDVVFQQFLLPKGKFIVEG 336 (414) T ss_pred CCHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCEEECC T ss_conf 71388888688406566799999999986237525778887631245320545034455489999865314477167887 Q ss_pred --------CCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC--CCC--CHHHHHHHHHHHHHHH Q ss_conf --------34773776168655028999999999986077996079997879847--558--5899999999999973 Q gi|254781093|r 145 --------CPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF--FIG--KVDELINECAHYLDNS 210 (225) Q Consensus 145 --------~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~--f~~--~~~~l~~~i~~fl~~~ 210 (225) -.+-+|-|=|++|++.-+=+..+-.+-+.+-..-....++.||+||. |+| ..+++.=.|.+|++++ T Consensus 337 ~rVd~~~It~~ALltvEGEnDDIsg~GQT~AA~~LCtgIpe~~k~~~~~pgvGHYGvF~GsrfR~~I~P~v~~FI~~~ 414 (414) T TIGR01849 337 KRVDPKAITKVALLTVEGENDDISGLGQTKAALKLCTGIPEDMKRHYLQPGVGHYGVFSGSRFREEIYPRVREFIRRF 414 (414) T ss_pred EEECHHHCCEEEEEEECCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCHHHHHHHCC T ss_conf 431643413454344326522337403789999843798768877515688731354368400113660689987419 No 168 >pfam10340 DUF2424 Protein of unknown function (DUF2424). This is a family of proteins conserved in yeasts. The function is not known. Probab=95.06 E-value=0.28 Score=27.73 Aligned_cols=119 Identities=16% Similarity=0.184 Sum_probs=67.8 Q ss_pred CCCEEEEEEECCC---CCCCCEEEEECCCCCCCCCC----CCHH---HHHHHHHHHHCCCEEEEEEECCCC-CCCCCCCC Q ss_conf 9961879996688---87787999957899888899----8999---999999999739849998531558-87677852 Q gi|254781093|r 9 PSGRLEGRYQPST---NPNAPIALILHPHPRFGGTM----NDNI---VYQLFYLFQQRGFVSLRFNFRGIG-RSEGEFDY 77 (225) Q Consensus 9 ~~G~l~~~~~~~~---~~~~~~vv~~Hp~p~~GG~~----~~~~---~~~la~~l~~~G~~vl~fd~RG~G-~S~G~~~~ 77 (225) .|.+..=.+-.|. ....|+.++.| ||.+ .... +..+.+.|-+ .+.+..||.=.- .-++ +.+ T Consensus 103 ids~~~W~~k~p~~~~p~~DpVLiY~H-----GGGy~L~~~~~~lv~l~~l~~~~~~--~sIli~DYslT~~~~~~-~~y 174 (374) T pfam10340 103 IDSTKFWLRKVPETFDPKVDPILLYYH-----GGGFALKLIPVTLVFLNNLGKYFPD--MAILVSDYTVTANCPQS-YTY 174 (374) T ss_pred CCCCEEEEEECCCCCCCCCCCEEEEEE-----CCEEEECCCHHHHHHHHHHHHHCCC--CEEEEEECCEECCCCCC-CCC T ss_conf 776159999667778987796899982-----7755761322565699999975787--54999702000357667-866 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH---C--CCC-HHEEEECCCCCCCC Q ss_conf 102389999989999862354674058887203687876450---3--862-13301227574433 Q gi|254781093|r 78 GDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM---R--RPE-INGFISVAPQPKSY 137 (225) Q Consensus 78 g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~---~--~p~-i~~~v~isp~~~~~ 137 (225) .-.+-++.++.+|+.... ..+.|.|+|=|-||.+++.... + ++. .+.+|++||=++.. T Consensus 175 -P~Ql~e~l~~Y~~lv~~~-G~kni~LmGDSAGGNL~L~~l~yL~~~~~~~~Pk~~i~ISPWvN~t 238 (374) T pfam10340 175 -PLQVLQCLAVYDYLTLTK-GCKNVTLMGDSAGGNLVLNILLYLHKCNKVVLPKKAIAISPWLNLT 238 (374) T ss_pred -CHHHHHHHHHHHHHHHHC-CCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCC T ss_conf -168999999999998705-7616999723776237899999998639765887148864654511 No 169 >COG4287 PqaA PhoPQ-activated pathogenicity-related protein [General function prediction only] Probab=95.00 E-value=0.029 Score=33.61 Aligned_cols=117 Identities=17% Similarity=0.257 Sum_probs=76.9 Q ss_pred HHHHHHHHHC--CCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCC---------------CCCCC----------- Q ss_conf 9899998623--5467405888720368787645038621330122757---------------44333----------- Q gi|254781093|r 87 AALDWVQSLN--PESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQ---------------PKSYD----------- 138 (225) Q Consensus 87 aa~~~l~~~~--~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~---------------~~~~~----------- 138 (225) .+++-++..- ...+...+.|-|--|+.+...|-.+|++.+++....- .+-|+ T Consensus 219 rAMdlAq~eL~q~~Ik~F~VTGaSKRgWttwLTAIaDprv~aIvp~v~D~Lni~a~L~hiyrsYGgnwpi~l~pyyaegi 298 (507) T COG4287 219 RAMDLAQDELEQVEIKGFMVTGASKRGWTTWLTAIADPRVFAIVPFVYDNLNIEAQLLHIYRSYGGNWPIKLAPYYAEGI 298 (507) T ss_pred HHHHHHHHHHHHEEEEEEEEECCCCCHHHHHHHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 99998886411215424798444532188888886086301103667763160999999998627888723143676417 Q ss_pred --------CCC----------C-----CCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC Q ss_conf --------222----------3-----43347737761686550289999999999860779960799978798475585 Q gi|254781093|r 139 --------FSF----------L-----APCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGK 195 (225) Q Consensus 139 --------~~~----------l-----~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~ 195 (225) |.. + .....|..||.++.|++-+++.+.-+++.++..| -+..+|+..|...+ T Consensus 299 ~erl~tp~fkqL~~IiDPlay~~try~~RLalpKyivnaSgDdff~pDsa~lYyd~LPG~k----aLrmvPN~~H~~~n- 373 (507) T COG4287 299 DERLETPLFKQLLEIIDPLAYRNTRYQLRLALPKYIVNASGDDFFVPDSANLYYDDLPGEK----ALRMVPNDPHNLIN- 373 (507) T ss_pred HHHHCCHHHHHHHHHHCHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCEEECCCCCCE----EEEECCCCCCHHHH- T ss_conf 7763488887899861678776432112125430133166885237774301110488732----25627897201457- Q ss_pred HHHHHHHHHHHHHH Q ss_conf 89999999999997 Q gi|254781093|r 196 VDELINECAHYLDN 209 (225) Q Consensus 196 ~~~l~~~i~~fl~~ 209 (225) ..+.+.+..|+.+ T Consensus 374 -~~i~esl~~flnr 386 (507) T COG4287 374 -QFIKESLEPFLNR 386 (507) T ss_pred -HHHHHHHHHHHHH T ss_conf -8888889999888 No 170 >COG3673 Uncharacterized conserved protein [Function unknown] Probab=94.63 E-value=0.35 Score=27.09 Aligned_cols=90 Identities=22% Similarity=0.373 Sum_probs=56.9 Q ss_pred CCCCCEEEEECCCCCCCCCCCC----H--HHHHHHHHHHH-CCCEEEEEEECCCCCCCCCCC-----------------C Q ss_conf 8778799995789988889989----9--99999999997-398499985315588767785-----------------2 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMND----N--IVYQLFYLFQQ-RGFVSLRFNFRGIGRSEGEFD-----------------Y 77 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~----~--~~~~la~~l~~-~G~~vl~fd~RG~G~S~G~~~-----------------~ 77 (225) ...+.+|++.- |+.++ + -+..|+..|.+ .|..++.+---|+|- .| |+ . T Consensus 28 ds~k~lV~CfD------GT~nrfg~qp~TNVv~Ly~sl~r~d~~~qv~yYd~GVGt-~G-fdavvdvrrrl~~~~~gsmF 99 (423) T COG3673 28 DSMKRLVFCFD------GTWNRFGAQPPTNVVLLYASLQRADGVTQVIYYDEGVGT-GG-FDAVVDVRRRLEKLSGGSMF 99 (423) T ss_pred CCCCEEEEEEC------CCHHHCCCCCCCHHHHHHHHHHCCCCCEEEEEECCCCCC-CC-CHHHHHHHHHHHHHHHHHHH T ss_conf 57642899963------755325899960299999987547982489986688665-55-11467888765665447788 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 102389999989999862354674058887203687876450 Q gi|254781093|r 78 GDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 78 g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) |.|-.+.+..|..++...+-..++|+++|||-|++.|=.+|+ T Consensus 100 g~gL~~nI~~AYrFL~~~yepGD~Iy~FGFSRGAf~aRVlag 141 (423) T COG3673 100 GQGLVQNIREAYRFLIFNYEPGDEIYAFGFSRGAFSARVLAG 141 (423) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHH T ss_conf 888898799999999984687776898621521699999999 No 171 >KOG2565 consensus Probab=94.31 E-value=0.18 Score=28.90 Aligned_cols=93 Identities=24% Similarity=0.376 Sum_probs=62.0 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHC---------CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 99957899888899899999999999973---------984999853155887677852102389999989999862354 Q gi|254781093|r 28 ALILHPHPRFGGTMNDNIVYQLFYLFQQR---------GFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPE 98 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~~~~~la~~l~~~---------G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~ 98 (225) ++++|+|| ||-. .+++.-..|..- -|.|+.+..-|+|=|++.-..|-+.++-+.-.-.-+.+.+ T Consensus 155 lLl~HGwP---Gsv~--EFykfIPlLT~p~~hg~~~d~~FEVI~PSlPGygwSd~~sk~GFn~~a~ArvmrkLMlRLg-- 227 (469) T KOG2565 155 LLLLHGWP---GSVR--EFYKFIPLLTDPKRHGNESDYAFEVIAPSLPGYGWSDAPSKTGFNAAATARVMRKLMLRLG-- 227 (469) T ss_pred EEEECCCC---CHHH--HHHHHHHHHCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHC-- T ss_conf 58953798---5188--8876404205865468752136998615888766676876578549999999999999827-- Q ss_pred CCCEEEEEECHHHHHHHHHHHCCCC-HHEE Q ss_conf 6740588872036878764503862-1330 Q gi|254781093|r 99 SKSCWIAGYSFGAWISMQLLMRRPE-INGF 127 (225) Q Consensus 99 ~~~i~l~G~S~Gg~val~~a~~~p~-i~~~ 127 (225) .++.++-|--+|+-++..+|.-.|+ |.|+ T Consensus 228 ~nkffiqGgDwGSiI~snlasLyPenV~Gl 257 (469) T KOG2565 228 YNKFFIQGGDWGSIIGSNLASLYPENVLGL 257 (469) T ss_pred CCEEEEECCCHHHHHHHHHHHHCCHHHHHH T ss_conf 641676157367888877776550456676 No 172 >pfam05057 DUF676 Putative serine esterase (DUF676). This family of proteins are probably serine esterase type enzymes with an alpha/beta hydrolase fold. Probab=94.25 E-value=0.44 Score=26.53 Aligned_cols=85 Identities=16% Similarity=0.187 Sum_probs=41.3 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHC-CCEEEEEEECCCCCCCCCCCCCCHHH--HHHHHHHHHHHHHCCCCCCE Q ss_conf 7999957899888899899999999999973-98499985315588767785210238--99999899998623546740 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQR-GFVSLRFNFRGIGRSEGEFDYGDGEL--SDAAAALDWVQSLNPESKSC 102 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~-G~~vl~fd~RG~G~S~G~~~~g~~E~--~D~~aa~~~l~~~~~~~~~i 102 (225) =.+|+.| ++-||..| +..++..|.+. .-..+ -|++.-..++....|.-+. .=+..+.+++++ ..+..+| T Consensus 6 HLvVlvH---Gl~G~~~d--m~~l~~~l~~~~~~~~~--~~~~~~~n~~~T~dGI~~~G~Rla~EI~~~i~~-~~~~~kI 77 (212) T pfam05057 6 HLVVLVH---GLWGNSAD--MEYIAEQLEKDLPDPLI--VVLMSSNNQGKTFDGIDVMGERLANEVLEFVQD-KSDKKKI 77 (212) T ss_pred EEEEEEC---CCCCCHHH--HHHHHHHHHHHCCCCEE--EEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH-CCCCCEE T ss_conf 8999958---88699999--99999999987899779--998246778715248999999999999999984-6887459 Q ss_pred EEEEECHHHHHHHHHH Q ss_conf 5888720368787645 Q gi|254781093|r 103 WIAGYSFGAWISMQLL 118 (225) Q Consensus 103 ~l~G~S~Gg~val~~a 118 (225) =++|||+||-++-.+. T Consensus 78 SfIGhSLGGLi~RyAl 93 (212) T pfam05057 78 SFVGHSLGGLIARSAI 93 (212) T ss_pred EEEEECHHHHHHHHHH T ss_conf 9996442479999999 No 173 >KOG4569 consensus Probab=93.80 E-value=0.088 Score=30.73 Aligned_cols=36 Identities=28% Similarity=0.435 Sum_probs=28.1 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 9999989999862354674058887203687876450 Q gi|254781093|r 83 SDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 83 ~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) +.+.+.++-+..++++ -.|++.|||+||++|..+|. T Consensus 155 ~~~~~~~~~L~~~~~~-~~i~vTGHSLGgAlA~laa~ 190 (336) T KOG4569 155 SGLDAELRRLIELYPN-YSIWVTGHSLGGALASLAAL 190 (336) T ss_pred HHHHHHHHHHHHHCCC-CEEEEEECCCCHHHHHHHHH T ss_conf 8899999999986899-67999406870898999999 No 174 >KOG4388 consensus Probab=93.63 E-value=0.58 Score=25.81 Aligned_cols=112 Identities=13% Similarity=0.153 Sum_probs=61.6 Q ss_pred CCEEEEEEECCCCCCCCEEEEECCCCCC-CCCCCC-HHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHH Q ss_conf 9618799966888778799995789988-889989-99999999999739849998531558876778521023899999 Q gi|254781093|r 10 SGRLEGRYQPSTNPNAPIALILHPHPRF-GGTMND-NIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAA 87 (225) Q Consensus 10 ~G~l~~~~~~~~~~~~~~vv~~Hp~p~~-GG~~~~-~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~a 87 (225) .|+++.| ++|..+.+-+++-||+-+.- ..++.+ .-+..+|+ ++|+.++..||.= .-|-.|.. .++.+.- T Consensus 382 ~~~~~~w-h~P~p~S~sli~HcHGGGfVAqsSkSHE~YLr~Wa~---aL~cPiiSVdYSL--APEaPFPR---aleEv~f 452 (880) T KOG4388 382 QRSLELW-HRPAPRSRSLIVHCHGGGFVAQSSKSHEPYLRSWAQ---ALGCPIISVDYSL--APEAPFPR---ALEEVFF 452 (880) T ss_pred CCCCCCC-CCCCCCCCEEEEEECCCCEEEECCCCCCHHHHHHHH---HHCCCEEEEEECC--CCCCCCCC---HHHHHHH T ss_conf 6543467-899999861799963883461045543379999999---8099768765044--77899972---7899999 Q ss_pred HHHHHHH----HCCCCCCEEEEEECHHHH----HHHHHHHCCCC-HHEEEEC Q ss_conf 8999986----235467405888720368----78764503862-1330122 Q gi|254781093|r 88 ALDWVQS----LNPESKSCWIAGYSFGAW----ISMQLLMRRPE-INGFISV 130 (225) Q Consensus 88 a~~~l~~----~~~~~~~i~l~G~S~Gg~----val~~a~~~p~-i~~~v~i 130 (225) +.-|+.. .+....+|+++|-|-||. |++.++...-+ ..|+++. T Consensus 453 AYcW~inn~allG~TgEriv~aGDSAGgNL~~~VaLr~i~~gvRvPDGl~la 504 (880) T KOG4388 453 AYCWAINNCALLGSTGERIVLAGDSAGGNLCFTVALRAIAYGVRVPDGLMLA 504 (880) T ss_pred HHHHHHCCHHHHCCCCCEEEEECCCCCCCEEEHHHHHHHHHCCCCCCCEEEE T ss_conf 9999856788747655348984267876155014789998478789853785 No 175 >KOG2369 consensus Probab=93.06 E-value=0.24 Score=28.14 Aligned_cols=83 Identities=17% Similarity=0.222 Sum_probs=48.3 Q ss_pred HHHHHHHHHHHCCCE------EEEEEECC-CCCCCCCCCCCCHH-HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHH Q ss_conf 999999999973984------99985315-58876778521023-89999989999862354674058887203687876 Q gi|254781093|r 45 IVYQLFYLFQQRGFV------SLRFNFRG-IGRSEGEFDYGDGE-LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQ 116 (225) Q Consensus 45 ~~~~la~~l~~~G~~------vl~fd~RG-~G~S~G~~~~g~~E-~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~ 116 (225) +...+-+.|..-||- ..-+|+|= .-.| +-..+ ...++.-++.+-+.+ ..++++|++||||+.+.+. T Consensus 125 ~w~~~i~~lv~~GYe~~~~l~ga~YDwRls~~~~-----e~rd~yl~kLK~~iE~~~~~~-G~kkVvlisHSMG~l~~ly 198 (473) T KOG2369 125 YWHELIENLVGIGYERGKTLFGAPYDWRLSYHNS-----EERDQYLSKLKKKIETMYKLN-GGKKVVLISHSMGGLYVLY 198 (473) T ss_pred HHHHHHHHHHHHCCCCCCEEECCCCCHHHCCCCH-----HHHHHHHHHHHHHHHHHHHHC-CCCCEEEEECCCCCHHHHH T ss_conf 9999999987617216862550553330056875-----577799999999999999872-8974699953774179999 Q ss_pred HHHCCCC---------HHEEEECCCC Q ss_conf 4503862---------1330122757 Q gi|254781093|r 117 LLMRRPE---------INGFISVAPQ 133 (225) Q Consensus 117 ~a~~~p~---------i~~~v~isp~ 133 (225) .....+. |++++.++++ T Consensus 199 Fl~w~~~~~~~W~~k~I~sfvnig~p 224 (473) T KOG2369 199 FLKWVEAEGPAWCDKYIKSFVNIGAP 224 (473) T ss_pred HHHCCCCCCHHHHHHHHHHHHCCCCH T ss_conf 87406530045799999999705730 No 176 >KOG2931 consensus Probab=92.56 E-value=0.84 Score=24.85 Aligned_cols=190 Identities=18% Similarity=0.207 Sum_probs=111.4 Q ss_pred EEEECCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCHH-H-----HHHHHHHHHCCCEEEEEEECCCCCCCCCCC Q ss_conf 0896589961879996688877879999578998888998999-9-----999999997398499985315588767785 Q gi|254781093|r 3 EVVFNGPSGRLEGRYQPSTNPNAPIALILHPHPRFGGTMNDNI-V-----YQLFYLFQQRGFVSLRFNFRGIGRSEGEFD 76 (225) Q Consensus 3 ~v~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~~GG~~~~~~-~-----~~la~~l~~~G~~vl~fd~RG~G~S~G~~~ 76 (225) |-.|+++.|-+...+.=...+++|+.|-.| ..|-| ++. + ..-++.+.+. |.++-.|--|+-.-.-.|. T Consensus 24 e~~V~T~~G~v~V~V~Gd~~~~kpaiiTyh---DlglN--~~scFq~ff~~p~m~ei~~~-fcv~HV~~PGqe~gAp~~p 97 (326) T KOG2931 24 EHDVETAHGVVHVTVYGDPKGNKPAIITYH---DLGLN--HKSCFQGFFNFPDMAEILEH-FCVYHVDAPGQEDGAPSFP 97 (326) T ss_pred EEEECCCCCCEEEEEECCCCCCCCEEEEEC---CCCCC--HHHHHHHHHCCHHHHHHHHH-EEEEECCCCCCCCCCCCCC T ss_conf 554114666379999657877886699833---65644--47676776258858999863-1899437984313786687 Q ss_pred CC--C----HHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC-HHEEEECCCCCCCCC----------- Q ss_conf 21--0----23899999899998623546740588872036878764503862-133012275744333----------- Q gi|254781093|r 77 YG--D----GELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE-INGFISVAPQPKSYD----------- 138 (225) Q Consensus 77 ~g--~----~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~-i~~~v~isp~~~~~~----------- 138 (225) .+ . +-.+++..|++|+ ..+.++=+|---|+++-+..|..+|+ |.|+|++++-+.... T Consensus 98 ~~y~yPsmd~LAd~l~~VL~~f-----~lk~vIg~GvGAGAyIL~rFAl~hp~rV~GLvLIn~~~~a~gwiew~~~K~~s 172 (326) T KOG2931 98 EGYPYPSMDDLADMLPEVLDHF-----GLKSVIGMGVGAGAYILARFALNHPERVLGLVLINCDPCAKGWIEWAYNKVSS 172 (326) T ss_pred CCCCCCCHHHHHHHHHHHHHHC-----CCCEEEEECCCCCHHHHHHHHHCCHHHEEEEEEEECCCCCCHHHHHHHHHHHH T ss_conf 8998888899998899998761-----74437995134009999998852942556799983278973389999999877 Q ss_pred --------------------C--------------------------------------CCC--------CCCCCCEEEE Q ss_conf --------------------2--------------------------------------223--------4334773776 Q gi|254781093|r 139 --------------------F--------------------------------------SFL--------APCPSSGLII 152 (225) Q Consensus 139 --------------------~--------------------------------------~~l--------~~~~~p~LiI 152 (225) | .+| .....|+|++ T Consensus 173 ~~l~~~Gmt~~~~d~ll~H~Fg~e~~~~~~diVq~Yr~~l~~~~N~~Nl~~fl~ayn~R~DL~~~r~~~~~tlkc~vllv 252 (326) T KOG2931 173 NLLYYYGMTQGVKDYLLAHHFGKEELGNNSDIVQEYRQHLGERLNPKNLALFLNAYNGRRDLSIERPKLGTTLKCPVLLV 252 (326) T ss_pred HHHHHHCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEE T ss_conf 78776032211899999988342400154789999999987358826799999986178776434787576300247999 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHH Q ss_conf 168655028999999999986077996079997879847558-58999999999999 Q gi|254781093|r 153 NGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIG-KVDELINECAHYLD 208 (225) Q Consensus 153 hG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~-~~~~l~~~i~~fl~ 208 (225) .|+.-..+ +.+.+.-.++. ....++..+.+.+-.-.. .-..+.+...-||. T Consensus 253 vGd~Sp~~--~~vv~~n~~Ld---p~~ttllk~~d~g~l~~e~qP~kl~ea~~~Flq 304 (326) T KOG2931 253 VGDNSPHV--SAVVECNSKLD---PTYTTLLKMADCGGLVQEEQPGKLAEAFKYFLQ 304 (326) T ss_pred ECCCCCHH--HHHHHHHCCCC---CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 44898015--55665421458---654408997255785433485679999999974 No 177 >KOG4540 consensus Probab=91.09 E-value=0.43 Score=26.60 Aligned_cols=45 Identities=31% Similarity=0.486 Sum_probs=28.6 Q ss_pred HHHHHH---HHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCC Q ss_conf 998999---98623546740588872036878764503862133012275 Q gi|254781093|r 86 AAALDW---VQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAP 132 (225) Q Consensus 86 ~aa~~~---l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp 132 (225) .|++|- +++.++|+ +|||.|||+||++|..+..+. .+-.+..-|| T Consensus 260 Sa~ldI~~~v~~~Ypda-~iwlTGHSLGGa~AsLlG~~f-glP~VaFesP 307 (425) T KOG4540 260 SAALDILGAVRRIYPDA-RIWLTGHSLGGAIASLLGIRF-GLPVVAFESP 307 (425) T ss_pred HHHHHHHHHHHHHCCCC-EEEEECCCCCHHHHHHHCCCC-CCCEEEECCC T ss_conf 89999999999758774-489953661068999753004-8845883384 No 178 >COG5153 CVT17 Putative lipase essential for disintegration of autophagic bodies inside the vacuole [Intracellular trafficking and secretion / Lipid metabolism] Probab=91.09 E-value=0.43 Score=26.60 Aligned_cols=45 Identities=31% Similarity=0.486 Sum_probs=28.6 Q ss_pred HHHHHH---HHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCC Q ss_conf 998999---98623546740588872036878764503862133012275 Q gi|254781093|r 86 AAALDW---VQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAP 132 (225) Q Consensus 86 ~aa~~~---l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp 132 (225) .|++|- +++.++|+ +|||.|||+||++|..+..+. .+-.+..-|| T Consensus 260 Sa~ldI~~~v~~~Ypda-~iwlTGHSLGGa~AsLlG~~f-glP~VaFesP 307 (425) T COG5153 260 SAALDILGAVRRIYPDA-RIWLTGHSLGGAIASLLGIRF-GLPVVAFESP 307 (425) T ss_pred HHHHHHHHHHHHHCCCC-EEEEECCCCCHHHHHHHCCCC-CCCEEEECCC T ss_conf 89999999999758774-489953661068999753004-8845883384 No 179 >KOG3967 consensus Probab=87.24 E-value=2.4 Score=22.14 Aligned_cols=195 Identities=12% Similarity=0.142 Sum_probs=91.7 Q ss_pred EEECCCC---CEEEEEEECCCC--CCCCEEEEECCCCC-CCCCCCCHH----------HHHHHHHHHHCCCEEEEEEECC Q ss_conf 8965899---618799966888--77879999578998-888998999----------9999999997398499985315 Q gi|254781093|r 4 VVFNGPS---GRLEGRYQPSTN--PNAPIALILHPHPR-FGGTMNDNI----------VYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 4 v~i~g~~---G~l~~~~~~~~~--~~~~~vv~~Hp~p~-~GG~~~~~~----------~~~la~~l~~~G~~vl~fd~RG 67 (225) +.||-+. -+..-++..+.. .+....|+.|+-+. -.|-+..+. -...-++..+.||+|+..|-.- T Consensus 75 ~~ip~d~~e~E~~SFiF~s~~~lt~~~kLlVLIHGSGvVrAGQWARrLIIN~~Ld~GTQiPyi~rAv~~Gygviv~N~N~ 154 (297) T KOG3967 75 VSIPVDATESEPKSFIFMSEDALTNPQKLLVLIHGSGVVRAGQWARRLIINEDLDSGTQIPYIKRAVAEGYGVIVLNPNR 154 (297) T ss_pred EEECCCCCCCCCCCEEEECHHHHCCCCCEEEEEECCCEEECCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCEEEECCCH T ss_conf 76227988888762488776574594215999926825760457666520366345775708999987387679967751 Q ss_pred CCC-CCCC--C-CCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCCCCCC Q ss_conf 588-7677--8-52102389999989999862354674058887203687876450386213301227574433322234 Q gi|254781093|r 68 IGR-SEGE--F-DYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYDFSFLA 143 (225) Q Consensus 68 ~G~-S~G~--~-~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~~~l~ 143 (225) .-+ -++. . -+.+.+++-+..+..++.... .+..|+++-||+||+.++.+.-+.++-..+..++-.-. .+.+.. T Consensus 155 ~~kfye~k~np~kyirt~veh~~yvw~~~v~pa-~~~sv~vvahsyGG~~t~~l~~~f~~d~~v~aialTDs--~~~~p~ 231 (297) T KOG3967 155 ERKFYEKKRNPQKYIRTPVEHAKYVWKNIVLPA-KAESVFVVAHSYGGSLTLDLVERFPDDESVFAIALTDS--AMGSPQ 231 (297) T ss_pred HHHHHHCCCCCCHHCCCHHHHHHHHHHHHHCCC-CCCEEEEEEECCCCHHHHHHHHHCCCCCCEEEEEEECC--CCCCCH T ss_conf 245554235830002662788999999985415-76448999832377248999986488654578985045--445801 Q ss_pred CCCCCEEEE-----ECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 334773776-----16865502899999999998607799607999787984755858999999999999731221 Q gi|254781093|r 144 PCPSSGLII-----NGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSLDEK 214 (225) Q Consensus 144 ~~~~p~LiI-----hG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~~~ 214 (225) .+..+.|.. ...+-+-.|++++..-..++.. | -.-|-|+. --..+.|-.|++.....+ T Consensus 232 a~~~e~~~~n~c~wvasntPdt~V~~~~~~i~risA--G---------t~~HEwtS--~~aidsvFkf~ee~~e~~ 294 (297) T KOG3967 232 AKNKEYLCDNACDWVASNTPDTPVSQSKSSIRRISA--G---------TTKHEWTS--YSAIDSVFKFIEEKYEQR 294 (297) T ss_pred HCCCCHHHHHHHHHHHCCCCCCCHHHHHCCHHHHHC--C---------CCCHHHHH--CCCHHHHHHHHHHHHCCC T ss_conf 038508898876554138999636664000144406--7---------76001210--120588999988761121 No 180 >KOG4389 consensus Probab=86.58 E-value=1.4 Score=23.55 Aligned_cols=101 Identities=26% Similarity=0.373 Sum_probs=53.6 Q ss_pred EECCCCCCCCEEEEE-CCCCCCCCCCCCHHHHHHHHHHHHC-CCEEEEEEEC-CC-C--CCCCCCC-CCCHHHHHHHHHH Q ss_conf 966888778799995-7899888899899999999999973-9849998531-55-8--8767785-2102389999989 Q gi|254781093|r 17 YQPSTNPNAPIALIL-HPHPRFGGTMNDNIVYQLFYLFQQR-GFVSLRFNFR-GI-G--RSEGEFD-YGDGELSDAAAAL 89 (225) Q Consensus 17 ~~~~~~~~~~~vv~~-Hp~p~~GG~~~~~~~~~la~~l~~~-G~~vl~fd~R-G~-G--~S~G~~~-~g~~E~~D~~aa~ 89 (225) |.|...+...+|++. .+-+.+.|+..=. +++ .+.|++. ...|+.+||| |. | -=.|+.+ -|+.-+-|-+-|+ T Consensus 126 W~P~~~p~n~tVlVWiyGGGF~sGt~SLd-vYd-Gk~la~~envIvVs~NYRvG~FGFL~l~~~~eaPGNmGL~DQqLAl 203 (601) T KOG4389 126 WAPAADPYNLTVLVWIYGGGFYSGTPSLD-VYD-GKFLAAVENVIVVSMNYRVGAFGFLYLPGHPEAPGNMGLLDQQLAL 203 (601) T ss_pred ECCCCCCCCCEEEEEEECCCCCCCCCCEE-EEC-CCEEEEECCEEEEEEEEEECCCEEEECCCCCCCCCCCCHHHHHHHH T ss_conf 41578888866999997680115886425-424-6102210648999963111154477538998899740068899999 Q ss_pred HHHHHH----CCCCCCEEEEEECHHHH-HHHHHHH Q ss_conf 999862----35467405888720368-7876450 Q gi|254781093|r 90 DWVQSL----NPESKSCWIAGYSFGAW-ISMQLLM 119 (225) Q Consensus 90 ~~l~~~----~~~~~~i~l~G~S~Gg~-val~~a~ 119 (225) .|+++. +.++.++-|+|-|-|++ +.+++.+ T Consensus 204 ~WV~~Ni~aFGGnp~~vTLFGESAGaASv~aHLls 238 (601) T KOG4389 204 QWVQENIAAFGGNPSRVTLFGESAGAASVVAHLLS 238 (601) T ss_pred HHHHHHHHHHCCCCCEEEEECCCCCHHHHHHEECC T ss_conf 99997588748995337884146541333200017 No 181 >pfam00450 Peptidase_S10 Serine carboxypeptidase. Probab=86.23 E-value=2.7 Score=21.81 Aligned_cols=62 Identities=15% Similarity=0.204 Sum_probs=46.4 Q ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHC---------------------CCCCEEEEEECCCCCCCC-CCHHHHHHHHH Q ss_conf 77377616865502899999999998607---------------------799607999787984755-85899999999 Q gi|254781093|r 147 SSGLIINGSNDTVATTSDVKDLVNKLMNQ---------------------KGISITHKVIPDANHFFI-GKVDELINECA 204 (225) Q Consensus 147 ~p~LiIhG~~D~~vp~~~~~~~~~~l~~~---------------------~~~~~~~~~i~ga~H~f~-~~~~~l~~~i~ 204 (225) .++||.+|+.|-.|+....+.|.+++... .-..+++..|.+|+|.-. ..-++..+.+. T Consensus 331 irVliY~Gd~D~~~n~~G~~~~~~~l~w~~~~~f~~~~~~~~~~~aG~~~~~~nltf~~V~~AGHmvP~dqP~~a~~m~~ 410 (415) T pfam00450 331 LRVLIYSGDHDLICNFLGTQAWIDALNWSGKDGFRPWYVSVDGQVAGYTKSYGNLTFATVKGAGHMVPEDQPEAALQMFK 410 (415) T ss_pred CEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEECCCCCCCHHHCHHHHHHHHH T ss_conf 76999967867447858999999968998766750374068987889999989868999889657680039899999999 Q ss_pred HHHH Q ss_conf 9999 Q gi|254781093|r 205 HYLD 208 (225) Q Consensus 205 ~fl~ 208 (225) +||. T Consensus 411 ~fi~ 414 (415) T pfam00450 411 RFLS 414 (415) T ss_pred HHHC T ss_conf 9857 No 182 >pfam05277 DUF726 Protein of unknown function (DUF726). This family consists of several uncharacterized eukaryotic proteins. Probab=84.59 E-value=2.6 Score=21.85 Aligned_cols=68 Identities=22% Similarity=0.226 Sum_probs=36.4 Q ss_pred HHHCCCCCCEEEEEECHHHHHHH---HHHHCCCC---HHEEEECCCCCCCCCCCCCCCCC--CCEEE--EECCCCCCCC Q ss_conf 86235467405888720368787---64503862---13301227574433322234334--77377--6168655028 Q gi|254781093|r 93 QSLNPESKSCWIAGYSFGAWISM---QLLMRRPE---INGFISVAPQPKSYDFSFLAPCP--SSGLI--INGSNDTVAT 161 (225) Q Consensus 93 ~~~~~~~~~i~l~G~S~Gg~val---~~a~~~p~---i~~~v~isp~~~~~~~~~l~~~~--~p~Li--IhG~~D~~vp 161 (225) .++....+++-|+|||+|+.+-. +..+++.. |+.+++++.|..... +...++. ++..+ +|.++|=+.- T Consensus 211 ~~r~~G~RPVtLvG~SlGARvI~~cL~~La~~~~~glVe~vvl~GaP~~~~~-~~W~~~r~VVsGR~vN~Ys~~DwiL~ 288 (343) T pfam05277 211 ISRNLGVRPVTLIGFSLGARVIFYCLLELAKRKAYGLIENVILLGAPATSDA-KVWSKARSVVSGRFVNGYSKNDWLLG 288 (343) T ss_pred HHHCCCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCH-HHHHHHHHHCCCCEEEEECCCHHHHH T ss_conf 8636899985899636138999999999974667773448999778887987-89999885506608998646333067 No 183 >pfam10605 3HBOH 3HB-oligomer hydrolase (3HBOH). D-(-)-3-hydroxybutyrate oligomer hydrolase (also known as 3HB-oligomer hydrolase) functions in the degradation of poly-3-hydroxybutyrate (PHB). It catalyses the hydrolysis of D(-)-3-hydroxybutyrate oligomers (3HB-oligomers) into 3HB-monomers. Probab=82.95 E-value=2.2 Score=22.29 Aligned_cols=45 Identities=18% Similarity=0.303 Sum_probs=33.5 Q ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHC---CCCCEEEEEECCCCCC Q ss_conf 77377616865502899999999998607---7996079997879847 Q gi|254781093|r 147 SSGLIINGSNDTVATTSDVKDLVNKLMNQ---KGISITHKVIPDANHF 191 (225) Q Consensus 147 ~p~LiIhG~~D~~vp~~~~~~~~~~l~~~---~~~~~~~~~i~ga~H~ 191 (225) +|.+||||-.|..+|+.+.-+-|-.+.+. ..-.+.+.++.++-|| T Consensus 555 KPAiIV~GRsDaLlPVNhtSRpY~glN~~vEGa~S~L~Y~EVtNaqHF 602 (689) T pfam10605 555 KPAIIVHGRADALLPVNHTSRPYLGLNQQVEGAASRLSYIEVTNAQHF 602 (689) T ss_pred CCEEEEECCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCHHC T ss_conf 853898346553046666775122210021044563379996553101 No 184 >COG4553 DepA Poly-beta-hydroxyalkanoate depolymerase [Lipid metabolism] Probab=80.54 E-value=4.6 Score=20.39 Aligned_cols=64 Identities=19% Similarity=0.300 Sum_probs=43.1 Q ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCC--CCC--CHHHHHHHHHHHHHHHC Q ss_conf 73776168655028999999999986077996079997879847--558--58999999999999731 Q gi|254781093|r 148 SGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHF--FIG--KVDELINECAHYLDNSL 211 (225) Q Consensus 148 p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~--f~~--~~~~l~~~i~~fl~~~L 211 (225) -.+-|-||.|++.-.-+..+..+-+.+-..-......-|+.+|. |++ ..+++.-.+.+|+.++= T Consensus 341 aL~tvEGEnDDISg~GQTkAA~~LC~nIpe~~k~hy~qp~vGHYGVFnGsrfr~eI~Pri~dFi~~~d 408 (415) T COG4553 341 ALFTVEGENDDISGVGQTKAAHDLCSNIPEDMKQHYMQPDVGHYGVFNGSRFREEIVPRIRDFIRRYD 408 (415) T ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHCCCHHHHHHHCCCCCCCCCEECCCHHHHHHHHHHHHHHHHHC T ss_conf 37985045554434341388888885483778887517887632014350578888889999999738 No 185 >pfam08643 DUF1776 Fungal family of unknown function (DUF1776). This is a fungal family of unknown function. One of the proteins in this family has been localized to the mitochondria. Probab=79.31 E-value=4 Score=20.74 Aligned_cols=32 Identities=16% Similarity=0.388 Sum_probs=27.3 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 879999578998888998999999999999739849998 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRF 63 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~f 63 (225) +.+||+. |++++++...+|..|.++||.|+.- T Consensus 3 ~~vVli~-------Gs~~~pi~R~iA~dL~rrGf~Vfa~ 34 (296) T pfam08643 3 REVVLVA-------GSPTEPLTRSIALDLERRGFIVFVT 34 (296) T ss_pred CEEEEEE-------CCCCCCHHHHHHHHHHHCCCEEEEE T ss_conf 1299996-------6999745899999999689789999 No 186 >PRK12838 carbamoyl phosphate synthase small subunit; Reviewed Probab=79.21 E-value=5.1 Score=20.13 Aligned_cols=68 Identities=18% Similarity=0.244 Sum_probs=48.3 Q ss_pred HHHHHHHHCCCEEEEEEECCC------CCCCCCC-CCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHH Q ss_conf 999999973984999853155------8876778-5210238999998999986235467405888720368787645 Q gi|254781093|r 48 QLFYLFQQRGFVSLRFNFRGI------GRSEGEF-DYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLL 118 (225) Q Consensus 48 ~la~~l~~~G~~vl~fd~RG~------G~S~G~~-~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a 118 (225) .+-+.|.++|+.|..+.+.-. ..-+|-| ++|.|.-+++..+++-+++.. . ++=++|-|+|-.+-..++ T Consensus 180 nIlr~L~~rg~~V~VvP~~~~~~~i~~~~pDGiflSNGPGDP~~~~~~i~~vr~l~-~--~~PifGICLGHQllalA~ 254 (356) T PRK12838 180 SILRSLSKRGCNVTVLPYNASLEDIKNLNPDGIVLSNGPGDPKELQPYLPTIKDLA-S--SYPILGICLGHQLIALAL 254 (356) T ss_pred HHHHHHHHCCCEEEEECCCCCCCHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHH-C--CCCEEEECHHHHHHHHHH T ss_conf 99999997898899989988754023249748994389989688788999999997-4--988899748899999970 No 187 >pfam04301 DUF452 Protein of unknown function (DUF452). Probab=78.84 E-value=2.3 Score=22.20 Aligned_cols=33 Identities=15% Similarity=0.402 Sum_probs=23.3 Q ss_pred CCCCEEEEEECHHHHHHHHHHHCCCCHHEEEECC Q ss_conf 4674058887203687876450386213301227 Q gi|254781093|r 98 ESKSCWIAGYSFGAWISMQLLMRRPEINGFISVA 131 (225) Q Consensus 98 ~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~is 131 (225) ..+.|.|++||||-++|.++....| +...|++. T Consensus 55 ~Y~~i~LvAWSmGVwaA~~~l~~~~-~~~aiAiN 87 (213) T pfam04301 55 AYQHIRLVAWSMGVWAAERALQGIP-LKSATAIN 87 (213) T ss_pred CCCEEEEEEEEHHHHHHHHHHCCCC-CCEEEEEE T ss_conf 2876899997759999998751488-64058980 No 188 >PRK05569 flavodoxin; Provisional Probab=78.33 E-value=5.4 Score=19.97 Aligned_cols=66 Identities=17% Similarity=0.215 Sum_probs=40.6 Q ss_pred HHHHHHHHHHHCCCEEEEEEECCCC-----CCC----CCCCCCCHHH--HHHHHHHHHHHHHCCCCCCEEEEEECHHH Q ss_conf 9999999999739849998531558-----876----7785210238--99999899998623546740588872036 Q gi|254781093|r 45 IVYQLFYLFQQRGFVSLRFNFRGIG-----RSE----GEFDYGDGEL--SDAAAALDWVQSLNPESKSCWIAGYSFGA 111 (225) Q Consensus 45 ~~~~la~~l~~~G~~vl~fd~RG~G-----~S~----G~~~~g~~E~--~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg 111 (225) ....+++.+.+.|..|-.+|..-.- +|. |+.+.+..++ .++...++++.......++++++| |||- T Consensus 18 mA~~IaeG~~~~G~ev~v~~~~~~~~~di~~~d~i~~GspT~~~~~~~~~~~~~~l~~l~~~~~~~K~~a~FG-SyGW 94 (141) T PRK05569 18 LANTIADGAKEAGAEVTIKHVADAKVEDVLEADAVAFGSPSMDNNNIEQEEMAPFLDQFKLTPNENKKCILFG-SYGW 94 (141) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCCHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEEE-CCCC T ss_conf 9999999999769819999842399889971890599054558887771769999999853455898899996-6778 No 189 >pfam08237 PE-PPE PE-PPE domain. This domain is found C terminal to the PE (pfam00934) and PPE (pfam00823) domains. The secondary structure of this domain is predicted to be a mixture of alpha helices and beta strands. Probab=78.12 E-value=5.5 Score=19.93 Aligned_cols=35 Identities=23% Similarity=0.276 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 89999989999862354674058887203687876450 Q gi|254781093|r 82 LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 82 ~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) +..+.+++ +.+.....++.++|||.|+.|+..... T Consensus 32 ~~~L~~ai---~~~~~~g~~v~V~GySQgA~Vas~~~~ 66 (227) T pfam08237 32 VDNLNAAI---RSQLAAGEPVVVFGYSQGALVAALEKA 66 (227) T ss_pred HHHHHHHH---HHCCCCCCCEEEEEECCHHHHHHHHHH T ss_conf 99999999---730379995799995742899999999 No 190 >PRK09739 hypothetical protein; Provisional Probab=76.83 E-value=4.6 Score=20.40 Aligned_cols=41 Identities=12% Similarity=0.217 Sum_probs=30.6 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 78799995789988889989999999999997398499985315 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) .+-.+|++||.| .|.++.+....++.|.+.|..|-..|... T Consensus 4 kkiLIV~aHP~~---~S~~~ala~~~~~~l~~~GheV~v~DLy~ 44 (201) T PRK09739 4 ERIYLVWAHPRH---DSLTAKVADAIHQRAQERGIQVTELDLYR 44 (201) T ss_pred CEEEEEECCCCC---CCHHHHHHHHHHHHHHHCCCEEEEEEHHH T ss_conf 779999738998---65689999999999998799599997130 No 191 >cd04962 GT1_like_5 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=75.93 E-value=5 Score=20.17 Aligned_cols=35 Identities=20% Similarity=0.339 Sum_probs=26.9 Q ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 999957899888899899999999999973984999853 Q gi|254781093|r 27 IALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNF 65 (225) Q Consensus 27 ~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~ 65 (225) ++++|| |..||. ..++..+|+.|+++|..|-.+.. T Consensus 3 I~i~~~--P~~GG~--e~~v~~La~~L~~~GHeV~vit~ 37 (371) T cd04962 3 IGIVCY--PTYGGS--GVVATELGKALARRGHEVHFITS 37 (371) T ss_pred EEEECC--CCCCCH--HHHHHHHHHHHHHCCCEEEEEEC T ss_conf 999899--999869--99999999999975999999956 No 192 >COG1663 LpxK Tetraacyldisaccharide-1-P 4'-kinase [Cell envelope biogenesis, outer membrane] Probab=73.67 E-value=7.2 Score=19.24 Aligned_cols=47 Identities=30% Similarity=0.306 Sum_probs=36.5 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCC Q ss_conf 879999578998888998999999999999739849998531558876 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSE 72 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~ 72 (225) .|+-|||=++=..||+--.+++..+++.|.++|+.+- +=-||+|++. T Consensus 45 ~pvPVI~VGNltvGGtGKTP~vi~la~~l~~rG~~~g-vvSRGYgg~~ 91 (336) T COG1663 45 APVPVICVGNLTVGGTGKTPVVIWLAEALQARGVRVG-VVSRGYGGKL 91 (336) T ss_pred CCCCEEEECCEEECCCCCCHHHHHHHHHHHHCCCEEE-EEECCCCCCC T ss_conf 7887899855777789968899999999986697367-9805768887 No 193 >PRK05654 acetyl-CoA carboxylase subunit beta; Validated Probab=71.28 E-value=8.2 Score=18.88 Aligned_cols=139 Identities=16% Similarity=0.172 Sum_probs=64.7 Q ss_pred CCCCEEEEECCCCCCCCCCCCHH---HHHHHHHHHHCCCEEEEEEECCCCCCCCC-CCCCCHH---HHHHHHHHHHHHHH Q ss_conf 77879999578998888998999---99999999973984999853155887677-8521023---89999989999862 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNI---VYQLFYLFQQRGFVSLRFNFRGIGRSEGE-FDYGDGE---LSDAAAALDWVQSL 95 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~---~~~la~~l~~~G~~vl~fd~RG~G~S~G~-~~~g~~E---~~D~~aa~~~l~~~ 95 (225) ...++++.+.=+...||||...+ +..+++...+....++.|.-.| |. -.+|.-. ..-..++++-+++. T Consensus 122 ~g~~v~~~~~df~F~GGSmG~~~GEki~~a~e~A~~~~~PlI~~~~SG-----GaRMQEGi~SLmQMaKts~A~~~l~~~ 196 (288) T PRK05654 122 EGMPVVLAVMDFSFMGGSMGSVVGEKIVRAVERALEEKCPLVIFSASG-----GARMQEGLLSLMQMAKTSAALKRLSEA 196 (288) T ss_pred CCEEEEEEEECHHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCC-----CHHHCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 999999999521454266457899999999999997499789996787-----633327467899889999999999976 Q ss_pred CCCCCCEEEEEECHHHHHHHHHHHCCC----CHHEEEECCCCCCCCC-C-CCCC-C-CCCCEEEEECCCCCCCCHHHHHH Q ss_conf 354674058887203687876450386----2133012275744333-2-2234-3-34773776168655028999999 Q gi|254781093|r 96 NPESKSCWIAGYSFGAWISMQLLMRRP----EINGFISVAPQPKSYD-F-SFLA-P-CPSSGLIINGSNDTVATTSDVKD 167 (225) Q Consensus 96 ~~~~~~i~l~G~S~Gg~val~~a~~~p----~i~~~v~isp~~~~~~-~-~~l~-~-~~~p~LiIhG~~D~~vp~~~~~~ 167 (225) +.. --.++..=++||..|. .|+... +..++|..+.+-..-. . ..|. . ...-.|+=||--|.+|+-.+.++ T Consensus 197 ~lp-yI~vlt~PttGGvtAS-fa~lgDiiiaEp~A~IgFAG~RVIeqti~~~LP~~FQtae~ll~~G~iD~iv~R~~lk~ 274 (288) T PRK05654 197 GLP-YISVLTDPTTGGVSAS-FAMLGDIIIAEPKALIGFAGPRVIEQTVREKLPEGFQRAEFLLEHGAIDMIVHRRELRD 274 (288) T ss_pred CCC-EEEEECCCCCCCEEEE-ECCCCCEEEEECCCEEEECCCHHHHHHCCCCCCCCHHHHHHHHHCCCCCEEECHHHHHH T ss_conf 996-8999668985894443-02478779980584587315389998508989974011899997799636645899999 Q ss_pred H Q ss_conf 9 Q gi|254781093|r 168 L 168 (225) Q Consensus 168 ~ 168 (225) . T Consensus 275 ~ 275 (288) T PRK05654 275 T 275 (288) T ss_pred H T ss_conf 9 No 194 >pfam02606 LpxK Tetraacyldisaccharide-1-P 4'-kinase. This family consists of tetraacyldisaccharide-1-P 4'-kinase also known as Lipid-A 4'-kinase or Lipid A biosynthesis protein LpxK, EC:2.7.1.130. This enzyme catalyses the reaction: ATP + 2,3-bis(3-hydroxytetradecanoyl)-D -glucosaminyl-(beta-D-1,6)-2,3-bis(3-hydroxytetradecanoyl)-D-glu cosam inyl beta-phosphate <= ADP + 2,3,2',3'-tetrakis(3-hydroxytetradecanoyl)-D- glucosaminyl-1,6-beta-D-glucosamine 1,4'-bisphosphate. This enzyme is involved in the synthesis of lipid A portion of the bacterial lipopolysaccharide layer (LPS). The family contains a P-loop motif at the N terminus. Probab=71.22 E-value=5.8 Score=19.79 Aligned_cols=46 Identities=28% Similarity=0.387 Sum_probs=35.8 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCC Q ss_conf 79999578998888998999999999999739849998531558876 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSE 72 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~ 72 (225) ++.|||=++=..|||--.+.+..+++.|.++|+.+.- =-||+|++. T Consensus 34 ~vpVI~VGNit~GGtGKTP~v~~l~~~l~~~g~~~~i-lSRGYg~~~ 79 (318) T pfam02606 34 PVPVIVVGNITVGGTGKTPLVIALAELLRARGLRPGV-LSRGYGGKS 79 (318) T ss_pred CCCEEEECCEEECCCCHHHHHHHHHHHHHHCCCCEEE-ECCCCCCCC T ss_conf 9999999898458878589999999999976994478-326767657 No 195 >PRK12564 carbamoyl phosphate synthase small subunit; Reviewed Probab=69.49 E-value=9 Score=18.64 Aligned_cols=68 Identities=18% Similarity=0.207 Sum_probs=43.7 Q ss_pred HHHHHHHCCCEEEEEEECCC------CCCCCCC-CCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHH Q ss_conf 99999973984999853155------8876778-5210238999998999986235467405888720368787645 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNFRGI------GRSEGEF-DYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLL 118 (225) Q Consensus 49 la~~l~~~G~~vl~fd~RG~------G~S~G~~-~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a 118 (225) +-+.|.++|+.+..+-+.-. ..-+|-| ++|.|.-+++..+++-+++.-. .++=++|-|+|=.+-..+. T Consensus 185 Ilr~L~~rg~~v~VvP~~~~~~~I~~~~pDGIflSNGPGDP~~~~~~i~~ik~li~--~~~PifGICLGHQllalA~ 259 (355) T PRK12564 185 ILRELAERGCRVTVVPATTTAEEILALNPDGVFLSNGPGDPAALDYAIEMIKELLE--GKIPIFGICLGHQLLALAL 259 (355) T ss_pred HHHHHHHCCCEEEEECCCCCHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHHC--CCCCEEEECHHHHHHHHHC T ss_conf 99999987986999689787657641488889967999996887999999999852--6987899888799999971 No 196 >COG0331 FabD (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism] Probab=68.38 E-value=6.4 Score=19.54 Aligned_cols=32 Identities=19% Similarity=0.218 Sum_probs=21.2 Q ss_pred HHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 89999862354674058887203687876450 Q gi|254781093|r 88 ALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 88 a~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) +.+-++++.....+.+++|||+|=+.|+.++. T Consensus 73 ~~~~l~~~~~~~~p~~~aGHSlGEysAla~ag 104 (310) T COG0331 73 AYRVLAEQGLGVKPDFVAGHSLGEYSALAAAG 104 (310) T ss_pred HHHHHHHHCCCCCCCEEECCCHHHHHHHHHCC T ss_conf 99999984588887554024386899999805 No 197 >cd01533 4RHOD_Repeat_2 Member of the Rhodanese Homology Domain superfamily, repeat 2. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 2nd repeat which does contain the putative catalytic Cys residue. Probab=67.66 E-value=2 Score=22.58 Aligned_cols=36 Identities=19% Similarity=0.180 Sum_probs=17.3 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 778799995789988889989999999999997398499985315 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) ...+++++|+ +|++. ...++.|.++||.--.+|++| T Consensus 65 ~~~~ivv~C~-----~G~RS----~~aa~~L~~~G~~n~V~~l~G 100 (109) T cd01533 65 PRTPIVVNCA-----GRTRS----IIGAQSLINAGLPNPVAALRN 100 (109) T ss_pred CCCEEEEECC-----CCHHH----HHHHHHHHHCCCCCCCEECCC T ss_conf 9985999889-----98499----999999998699886017058 No 198 >PRK00652 lpxK tetraacyldisaccharide 4'-kinase; Reviewed Probab=67.00 E-value=7.1 Score=19.27 Aligned_cols=47 Identities=26% Similarity=0.269 Sum_probs=36.3 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCC Q ss_conf 799995789988889989999999999997398499985315588767 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEG 73 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G 73 (225) |+-|||=++=.-||+--.+++..+++.|.+.|+.+.- =.||+|++.- T Consensus 48 ~vpVI~VGNitvGGtGKTP~v~~la~~l~~~g~~~~I-lSRGYg~~~~ 94 (334) T PRK00652 48 PVPVISVGNLTVGGNGKTPVVIWLAEQLQARGVKVGV-VSRGYGGKSK 94 (334) T ss_pred CCCEEEEECEEECCCCHHHHHHHHHHHHHHCCCCEEE-EECCCCCCCC T ss_conf 9989999088878877799999999999976993678-7346676567 No 199 >PRK01906 tetraacyldisaccharide 4'-kinase; Provisional Probab=66.68 E-value=10 Score=18.36 Aligned_cols=47 Identities=21% Similarity=0.291 Sum_probs=36.7 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCC Q ss_conf 879999578998888998999999999999739849998531558876 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSE 72 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~ 72 (225) .++-|||=++=.-|||--.+++..+++.|.++|+.+... .||+|++. T Consensus 54 ~~vPVI~VGNitvGGTGKTP~vi~L~~~L~~~G~k~~Il-SRGYg~~~ 100 (339) T PRK01906 54 VGVPVVVVGNVTVGGTGKTPTVIALVDALRAAGFTPGVV-SRGYGAKV 100 (339) T ss_pred CCCCEEEECCEEECCCCHHHHHHHHHHHHHHCCCCEEEE-ECCCCCCC T ss_conf 899999986887688757799999999999769955998-54645556 No 200 >TIGR03282 methan_mark_13 putative methanogenesis marker 13 metalloprotein. Members of this protein family, to date, are found in a completed prokaryotic genome if and only if the species is one of the archaeal methanogens. The exact function is unknown, but likely is linked to methanogenesis or a process closely connected to it. This metal cluster-binding family is related to nitrogenase structural protein NifD and accessory protein NifE, among others. Probab=65.46 E-value=11 Score=18.15 Aligned_cols=67 Identities=15% Similarity=0.297 Sum_probs=42.1 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEE Q ss_conf 99957899888899899999999999973984999853155887677852102389999989999862354674058887 Q gi|254781093|r 28 ALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGY 107 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~ 107 (225) ++++|+=+ |.+ ++.++.|..-|+.|+.-.. ++-.|=+|. .+-+..+++++.+.. .++.+.++|- T Consensus 21 viILHGPs--GCc------Fr~aRLLE~DGvrV~TTam-----~endfiFG~--~e~L~~vl~~~~e~f-~p~~igVVGT 84 (352) T TIGR03282 21 VIILHGPS--GCC------FRTARLLEEDGVRVFTTGM-----DENDFVFGA--SEKLVKVIRYAEEKF-KPELIGVVGT 84 (352) T ss_pred EEEEECCC--CCC------CCCCHHHHCCCCEEEEECC-----CCCCEEECC--HHHHHHHHHHHHHHH-CCCEEEEEEE T ss_conf 89960898--655------4300233217946876203-----666426645--899999999999872-9644899740 Q ss_pred CHH Q ss_conf 203 Q gi|254781093|r 108 SFG 110 (225) Q Consensus 108 S~G 110 (225) |-- T Consensus 85 CaS 87 (352) T TIGR03282 85 CAS 87 (352) T ss_pred CHH T ss_conf 225 No 201 >KOG3424 consensus Probab=65.26 E-value=11 Score=18.13 Aligned_cols=48 Identities=25% Similarity=0.409 Sum_probs=29.6 Q ss_pred CCCCEEEEECCCCCCCCCCC-CHHHHHHHHHHHHCCCEEEEEEEC---CCCCCCC Q ss_conf 77879999578998888998-999999999999739849998531---5588767 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMN-DNIVYQLFYLFQQRGFVSLRFNFR---GIGRSEG 73 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~-~~~~~~la~~l~~~G~~vl~fd~R---G~G~S~G 73 (225) ...-++.++||. . -+-. +.+-..+|+....---+|+.|+|| |.|+|.| T Consensus 20 RKqmvvdvlHPG--~-a~vsK~EIrEKlA~mYkt~~d~V~vfgfrt~fGggkstg 71 (132) T KOG3424 20 RKQMVVDVLHPG--K-ANVSKTEIREKLAKMYKTTPDAVFVFGFRTHFGGGKSTG 71 (132) T ss_pred HHHEEEEEECCC--C-CCCCHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCE T ss_conf 221468885389--8-998989999999999647877089988641147865440 No 202 >smart00827 PKS_AT Acyl transferase domain in polyketide synthase (PKS) enzymes. Probab=65.01 E-value=8.7 Score=18.73 Aligned_cols=111 Identities=16% Similarity=0.220 Sum_probs=49.7 Q ss_pred HHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCC---H----------------HEEEECCCCCCC--------CCCC Q ss_conf 899998623546740588872036878764503862---1----------------330122757443--------3322 Q gi|254781093|r 88 ALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPE---I----------------NGFISVAPQPKS--------YDFS 140 (225) Q Consensus 88 a~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~---i----------------~~~v~isp~~~~--------~~~~ 140 (225) ..+.+++.+.. +-+++|||+|=+.|+.++..-.- + .+.+++...... .+.- T Consensus 72 l~~~l~~~Gi~--Pd~v~GhSlGE~aAl~aaG~l~~edal~lv~~R~~~m~~~~~~G~M~aV~~~~~~v~~~~~~~~~~v 149 (298) T smart00827 72 LARLWRSWGVR--PDAVVGHSLGEIAAAYVAGVLSLEDAARLVAARGRLMQALPGGGAMLAVGLSEEEVEELLAGYGGRV 149 (298) T ss_pred HHHHHHHCCCC--CCEEEECCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHCCCEE T ss_conf 99999985998--7778655798999999849998999999999999999971678824665689999999987258907 Q ss_pred CCCCCCCC-EEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC-HHHHHHHHHHHHHHH Q ss_conf 23433477-37761686550289999999999860779960799978798475585-899999999999973 Q gi|254781093|r 141 FLAPCPSS-GLIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGK-VDELINECAHYLDNS 210 (225) Q Consensus 141 ~l~~~~~p-~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~-~~~l~~~i~~fl~~~ 210 (225) .++....| -.+|-|+.+.+ ..+.+.+.. +++ ....++ .+|.||.. ...+.+...++|++. T Consensus 150 ~iA~~Nsp~q~VisG~~~~i------~~~~~~l~~-~g~--~~~~L~-v~~afHS~lm~~a~~~~~~~l~~~ 211 (298) T smart00827 150 SVAAVNGPSSVVLSGDEDAV------DELAAALEA-RGI--RARRLK-VDHAFHSPHMDPILDEFREALAGI 211 (298) T ss_pred EEEEECCCCCEEEECCHHHH------HHHHHHHHH-CCC--CEEECC-CCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 99985289828983789999------999999985-798--269817-988555885999999999998369 No 203 >cd01534 4RHOD_Repeat_3 Member of the Rhodanese Homology Domain superfamily, repeat 3. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 3rd repeat which does not contain the putative catalytic Cys residue. Probab=63.80 E-value=12 Score=17.96 Aligned_cols=34 Identities=21% Similarity=0.162 Sum_probs=22.3 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 778799995789988889989999999999997398499985315 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) ...+++++|+ .|.+. ...|+.|.++||.|. +..| T Consensus 55 k~~~ivl~C~-----~G~RS----~~AA~~L~~~G~~v~--~L~G 88 (95) T cd01534 55 RGARIVLADD-----DGVRA----DMTASWLAQMGWEVY--VLEG 88 (95) T ss_pred CCCEEEEECC-----CCCCH----HHHHHHHHHCCCCEE--EECC T ss_conf 8983999859-----99879----999999998699769--9578 No 204 >TIGR00128 fabD malonyl CoA-acyl carrier protein transacylase; InterPro: IPR004410 Malonyl CoA-acyl carrier protein transacylase 2.3.1.39 from EC is involved in fatty acid biosynthesis and transfers the malonyl moeity from coenzyme A to acyl-carrier protein.; GO: 0004314 [acyl-carrier-protein] S-malonyltransferase activity, 0006633 fatty acid biosynthetic process. Probab=63.78 E-value=9.2 Score=18.58 Aligned_cols=112 Identities=19% Similarity=0.155 Sum_probs=59.3 Q ss_pred CCCCEEEEEECHHHHHHHHHHHCCCCHHEEEE----------CCCC-------------------CCCCCC----CCCCC Q ss_conf 46740588872036878764503862133012----------2757-------------------443332----22343 Q gi|254781093|r 98 ESKSCWIAGYSFGAWISMQLLMRRPEINGFIS----------VAPQ-------------------PKSYDF----SFLAP 144 (225) Q Consensus 98 ~~~~i~l~G~S~Gg~val~~a~~~p~i~~~v~----------isp~-------------------~~~~~~----~~l~~ 144 (225) ..++-+++|||+|=|-|+.+|.--.=-.++=+ ++|. +....- =.++. T Consensus 83 ~~~P~f~AGHSLGEYsAL~aAGalDF~~al~LV~~RGe~M~~a~p~g~G~MaAv~Gld~e~l~~~~~~~~~eG~~V~~AN 162 (295) T TIGR00128 83 GLKPDFVAGHSLGEYSALVAAGALDFETALKLVKKRGELMQEAVPEGGGAMAAVLGLDEEQLAQALEEATEEGEVVDLAN 162 (295) T ss_pred CCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEEE T ss_conf 84653462463478999998535148889999998899888515788874167654788899999987754796678876 Q ss_pred CCCCE-EEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf 34773-7761686550289999999999860779960799978798475585899999999999973122100 Q gi|254781093|r 145 CPSSG-LIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSLDEKFT 216 (225) Q Consensus 145 ~~~p~-LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~~~~~ 216 (225) .-.|. .||-|..|++ |.+...++....+| .+-...+-|+-|. .......+....+|++..-..++ T Consensus 163 ~N~P~QvVIsG~k~~v---E~a~~~~k~~~gak--r~~PL~VSgafHs--~~M~~a~~~f~~~L~~~~~~~~~ 228 (295) T TIGR00128 163 FNSPGQVVISGTKDEV---EAAAALLKTEAGAK--RALPLEVSGAFHS--RLMKPAAEKFAETLEACQFNDAT 228 (295) T ss_pred CCCCCCEEEECCHHHH---HHHHHHHHHHCCCE--EEECCCCCCHHHH--HHHHHHHHHHHHHHHHHCCCCCC T ss_conf 3789617983567899---99999725443870--7520457705577--66556899999999874268775 No 205 >COG2939 Carboxypeptidase C (cathepsin A) [Amino acid transport and metabolism] Probab=63.36 E-value=9.1 Score=18.61 Aligned_cols=74 Identities=19% Similarity=0.151 Sum_probs=39.4 Q ss_pred EEEEE-ECCCCCCCCCCCCC----CHHHHHHHHHHHHHHHHCC----CCCCEEEEEECHHHHHHHHHHHCCC----CHHE Q ss_conf 99985-31558876778521----0238999998999986235----4674058887203687876450386----2133 Q gi|254781093|r 60 SLRFN-FRGIGRSEGEFDYG----DGELSDAAAALDWVQSLNP----ESKSCWIAGYSFGAWISMQLLMRRP----EING 126 (225) Q Consensus 60 vl~fd-~RG~G~S~G~~~~g----~~E~~D~~aa~~~l~~~~~----~~~~i~l~G~S~Gg~val~~a~~~p----~i~~ 126 (225) .+-.| --|.|-|.+.-+++ .+--+|+..+.+.+.+..+ ..++.+|+|-||||.=....|..-. ...+ T Consensus 149 LvFiDqPvGTGfS~a~~~e~~~d~~~~~~D~~~~~~~f~~~fp~~~r~~~~~~L~GESYgg~yip~~A~~L~~~~~~~~~ 228 (498) T COG2939 149 LVFIDQPVGTGFSRALGDEKKKDFEGAGKDVYSFLRLFFDKFPHYARLLSPKFLAGESYGGHYIPVFAHELLEDNIALNG 228 (498) T ss_pred EEEEECCCCCCCCCCCCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCC T ss_conf 69971676667454554211223100211299999999999898765047606730452102568999999872205577 Q ss_pred EEECCCC Q ss_conf 0122757 Q gi|254781093|r 127 FISVAPQ 133 (225) Q Consensus 127 ~v~isp~ 133 (225) .+.+++. T Consensus 229 ~~nlssv 235 (498) T COG2939 229 NVNLSSV 235 (498) T ss_pred CEEEEEE T ss_conf 2674523 No 206 >cd01518 RHOD_YceA Member of the Rhodanese Homology Domain superfamily. This CD includes Escherichia coli YceA, Bacillus subtilis YbfQ, and similar uncharacterized proteins. Probab=62.40 E-value=12 Score=17.80 Aligned_cols=48 Identities=19% Similarity=0.443 Sum_probs=28.5 Q ss_pred CCEEEEEEECCC----------------CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 961879996688----------------8778799995789988889989999999999997398499985315 Q gi|254781093|r 10 SGRLEGRYQPST----------------NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 10 ~G~l~~~~~~~~----------------~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) .|.+++....|- ...++++|+|+ +|.+. ...++.|.++||.=+ +|..| T Consensus 31 ~Ghi~gA~nip~~~f~~~~~~~~~~~~~~k~k~ivvyC~-----~G~RS----~~Aa~~L~~~Gf~~V-~~L~G 94 (101) T cd01518 31 IGHFKGAVNPDVDTFREFPFWLDENLDLLKGKKVLMYCT-----GGIRC----EKASAYLKERGFKNV-YQLKG 94 (101) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHCCCCCCEEEEECC-----CCHHH----HHHHHHHHHCCCCCE-EEECC T ss_conf 785303185883024442799987212143876999859-----98279----999999998498706-89663 No 207 >KOG2385 consensus Probab=62.18 E-value=12 Score=17.78 Aligned_cols=45 Identities=18% Similarity=0.341 Sum_probs=28.2 Q ss_pred HHHHCCCCCCEEEEEECHHHHHHHHH---HHCCCC---HHEEEECCCCCCC Q ss_conf 98623546740588872036878764---503862---1330122757443 Q gi|254781093|r 92 VQSLNPESKSCWIAGYSFGAWISMQL---LMRRPE---INGFISVAPQPKS 136 (225) Q Consensus 92 l~~~~~~~~~i~l~G~S~Gg~val~~---a~~~p~---i~~~v~isp~~~~ 136 (225) ++.+....++|-|+|||+|+.+-... .++.-+ |+.+++.+.|+.. T Consensus 439 L~~r~qG~RPVTLVGFSLGARvIf~CL~~Lakkke~~iIEnViL~GaPv~~ 489 (633) T KOG2385 439 LCKRSQGNRPVTLVGFSLGARVIFECLLELAKKKEVGIIENVILFGAPVPT 489 (633) T ss_pred HHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCC T ss_conf 987415898625763032157999999998641454513235660687657 No 208 >TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=62.14 E-value=12 Score=17.78 Aligned_cols=81 Identities=19% Similarity=0.351 Sum_probs=48.6 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCC---HH---HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 9999999997398499985315588767785210---23---89999989999862354674058887203687876450 Q gi|254781093|r 46 VYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGD---GE---LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 46 ~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~---~E---~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) .+..+++|+++||-++-+||-==.=|+ +||... -| .||+. +-++...++ -|++ ||||.+++.+|. T Consensus 596 ~Vh~~~aLr~~Gye~ImiN~NPETVST-DyD~sDrLYFEplt~E~Vm---~I~e~E~~~--GVIV---q~GGQtp~nlA~ 666 (1089) T TIGR01369 596 CVHAVLALREAGYETIMINYNPETVST-DYDTSDRLYFEPLTFEDVM---NIIELEKPE--GVIV---QFGGQTPLNLAK 666 (1089) T ss_pred HHHHHHHHHHCCCEEEEEECCCCCCCC-CHHHHHHCCCCCCCCCCCE---EHEEECCCC--EEEE---ECCCHHHHHHHH T ss_conf 899999998729959999778997436-6665101158763503100---110005866--7999---748732678999 Q ss_pred CCCCHHEEEECCCCCC Q ss_conf 3862133012275744 Q gi|254781093|r 120 RRPEINGFISVAPQPK 135 (225) Q Consensus 120 ~~p~i~~~v~isp~~~ 135 (225) +..+..|+-.++..+. T Consensus 667 ~L~~~GG~~iLGTS~~ 682 (1089) T TIGR01369 667 ELEEAGGVPILGTSPE 682 (1089) T ss_pred HHHHCCCCEEECCCHH T ss_conf 9997089317368857 No 209 >TIGR01368 CPSaseIIsmall carbamoyl-phosphate synthase, small subunit; InterPro: IPR006274 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents the small subunit of the glutamine-dependent form (6.3.5.5 from EC) of carbamoyl phosphate synthase, CPSase II. The small subunit catalyses the hydrolysis of glutamine to ammonia, which in turn used by the large chain to synthesize carbamoyl phosphate. The C-terminal domain of the small subunit of CPSase has glutamine amidotransferase activity. Note that the sequence from the mammalian urea cycle form has lost the active site Cys, resulting in an ammonia-dependent form, CPSase I (6.3.4.16 from EC). CPSases of pyrimidine biosynthesis, arginine biosynthesis, and the urea cycle may be encoded by one or by several genes, depending on the species. ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=60.36 E-value=13 Score=17.59 Aligned_cols=73 Identities=16% Similarity=0.244 Sum_probs=52.9 Q ss_pred CCCCCHHHHHHHHHHHHCCCEEEEEEECCC------CCCCCCC-CCCCHHH-HHHHHHHHHHHHHCCCCCCEEEEEECHH Q ss_conf 899899999999999973984999853155------8876778-5210238-9999989999862354674058887203 Q gi|254781093|r 39 GTMNDNIVYQLFYLFQQRGFVSLRFNFRGI------GRSEGEF-DYGDGEL-SDAAAALDWVQSLNPESKSCWIAGYSFG 110 (225) Q Consensus 39 G~~~~~~~~~la~~l~~~G~~vl~fd~RG~------G~S~G~~-~~g~~E~-~D~~aa~~~l~~~~~~~~~i~l~G~S~G 110 (225) |-+.| +-+.|.++|+.|.-+-|-=. -.-+|=| |+|.|.- +++..+++-+++..-...++ +|-||| T Consensus 205 G~K~n-----Ilr~L~~rG~~v~vVP~~~~~~~i~~~~PDGiflSNGPGDPH~~~~~~i~~~~~l~~~~~P~--fGICLG 277 (383) T TIGR01368 205 GVKQN-----ILRRLVKRGCEVTVVPYDTDAEEIKKYNPDGIFLSNGPGDPHAAVEPAIETVRELLEAKIPI--FGICLG 277 (383) T ss_pred CCCHH-----HHHHHHHCCCEEEEECCCCCHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC--CCCCHH T ss_conf 88755-----89899766987999678998899986578889990879880567899999999998448971--002666 Q ss_pred HHHHHHHH Q ss_conf 68787645 Q gi|254781093|r 111 AWISMQLL 118 (225) Q Consensus 111 g~val~~a 118 (225) =.+-.+++ T Consensus 278 HQllALA~ 285 (383) T TIGR01368 278 HQLLALAF 285 (383) T ss_pred HHHHHHHC T ss_conf 89999973 No 210 >TIGR01357 aroB 3-dehydroquinate synthase; InterPro: IPR016037 The 3-dehydroquinate synthase (4.2.3.4 from EC) domain is present in isolation in various bacterial 3-dehydroquinate synthases and also present as a domain in the pentafunctional AROM polypeptide (P07547 from SWISSPROT) . 3-dehydroquinate (DHQ) synthase catalyses the formation of dehydroquinate (DHQ) and orthophosphate from 3-deoxy-D-arabino heptulosonic 7 phosphate . This reaction is part of the shikimate pathway which is involved in the biosynthesis of aromatic amino acids.; GO: 0003856 3-dehydroquinate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=59.50 E-value=14 Score=17.49 Aligned_cols=77 Identities=18% Similarity=0.238 Sum_probs=48.5 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEE Q ss_conf 87999957899888899899999999999973984999853155887677852102389999989999862354674058 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWI 104 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l 104 (225) ..++++++++ -.......+.+.++++|+.|+.+.+- +|+- .-.++-+....+++.+++.+-+...+ T Consensus 26 ~~~~~~~d~~------v~~~~~~~~~~~~~~~g~~v~~~~~p-----~GE~---~K~l~~~~~l~~~L~~~~~~R~~~i~ 91 (361) T TIGR01357 26 SKLVIVTDET------VADLYAEKLLEALQSAGFNVLVLTVP-----DGEE---SKSLETVQRLYDQLLEAGLDRSSTII 91 (361) T ss_pred CEEEEEECHH------HHHHHHHHHHHHHHHCCCCEEEEEEC-----CCCC---CCCHHHHHHHHHHHHHCCCCCCCEEE T ss_conf 5079986745------68888889999998538752688707-----8987---45488899999999965889564788 Q ss_pred EEECHHHHHHHHHH Q ss_conf 88720368787645 Q gi|254781093|r 105 AGYSFGAWISMQLL 118 (225) Q Consensus 105 ~G~S~Gg~val~~a 118 (225) +.||.++..+| T Consensus 92 ---~~GGGvvgDla 102 (361) T TIGR01357 92 ---ALGGGVVGDLA 102 (361) T ss_pred ---EECCHHHHHHH T ss_conf ---85453778789 No 211 >PRK00313 lpxK tetraacyldisaccharide 4'-kinase; Provisional Probab=59.01 E-value=14 Score=17.44 Aligned_cols=44 Identities=25% Similarity=0.357 Sum_probs=34.6 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCC Q ss_conf 799995789988889989999999999997398499985315588 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGR 70 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~ 70 (225) ++-|||=++=.-||+--.+++..+++.|.++|+.+.. =-||+|+ T Consensus 50 ~vPVI~VGNitvGGTGKTP~v~~La~~L~~~G~~~~I-iSRGYg~ 93 (332) T PRK00313 50 PVPVIVVGNITVGGTGKTPLILWLIEHCRRRGLRVGV-VSRGYGA 93 (332) T ss_pred CCCEEEECCEEECCCCHHHHHHHHHHHHHHCCCCEEE-EECCCCC T ss_conf 9989998787358877779999999999977996589-8646567 No 212 >PRK05568 flavodoxin; Provisional Probab=58.83 E-value=14 Score=17.43 Aligned_cols=64 Identities=16% Similarity=0.224 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHCCCEEEEEEECCCC-----CCC----CCCCCCCHHH--HHHHHHHHHHHHHCCCCCCEEEEEECHH Q ss_conf 9999999999739849998531558-----876----7785210238--9999989999862354674058887203 Q gi|254781093|r 45 IVYQLFYLFQQRGFVSLRFNFRGIG-----RSE----GEFDYGDGEL--SDAAAALDWVQSLNPESKSCWIAGYSFG 110 (225) Q Consensus 45 ~~~~la~~l~~~G~~vl~fd~RG~G-----~S~----G~~~~g~~E~--~D~~aa~~~l~~~~~~~~~i~l~G~S~G 110 (225) ....+++.+.+.|..|-.+|..-.- ++. |+.+.+..++ .++...++.+... ...++++++| ||| T Consensus 18 mA~~Iaegl~~~Gv~V~~~~~~~~~~~di~~~d~i~~GspT~~~~~~~~~~~~~~~~~~~~~-~~gK~~a~FG-SyG 92 (142) T PRK05568 18 MANLIAEGAKENGLEVKLLNVSEASVDDVENADVVALGSPAMGCEVLEEGEMEPFVEEISGL-VKGKKTLLFG-SYG 92 (142) T ss_pred HHHHHHHHHHHCCCEEEEEEECCCCHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHHHH-CCCCEEEEEE-ECC T ss_conf 99999999986698399998004998899739968994776487667736679999998866-5999899999-434 No 213 >CHL00197 carA carbamoyl-phosphate synthase arginine-specific small subunit; Provisional Probab=57.87 E-value=15 Score=17.33 Aligned_cols=74 Identities=18% Similarity=0.162 Sum_probs=49.3 Q ss_pred CCCCCHHHHHHHHHHHHCCCEEEEEEECCCC------CCCCCC-CCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHH Q ss_conf 8998999999999999739849998531558------876778-521023899999899998623546740588872036 Q gi|254781093|r 39 GTMNDNIVYQLFYLFQQRGFVSLRFNFRGIG------RSEGEF-DYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGA 111 (225) Q Consensus 39 G~~~~~~~~~la~~l~~~G~~vl~fd~RG~G------~S~G~~-~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg 111 (225) |-..| +-+.|.++|+.+..+-+.-.- .-+|-| ++|.|.-+++..+++-+++.-.. .++=++|-|+|- T Consensus 203 GiK~n-----ILr~L~~~g~~v~VvP~~~~~~~I~~~~PDGIfLSNGPGDP~~~~~~i~~i~~li~~-~~iPifGICLGH 276 (383) T CHL00197 203 GVKYN-----ILRRLKSFGCEITIVPATSTYQDIMAYQPDGILLSNGPGDPSTVHYGIKTVKKLIKK-ANIPIFGICMGH 276 (383) T ss_pred CCCHH-----HHHHHHHCCCEEEEECCCCCCCHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHC-CCCCEEEECHHH T ss_conf 77188-----999999789989998999851134256888899789999978989999999999964-899789955889 Q ss_pred HHHHHHH Q ss_conf 8787645 Q gi|254781093|r 112 WISMQLL 118 (225) Q Consensus 112 ~val~~a 118 (225) .+-.+++ T Consensus 277 QlLalA~ 283 (383) T CHL00197 277 QILSLAL 283 (383) T ss_pred HHHHHHC T ss_conf 9999973 No 214 >PRK10279 hypothetical protein; Provisional Probab=57.66 E-value=14 Score=17.45 Aligned_cols=22 Identities=18% Similarity=0.320 Sum_probs=10.2 Q ss_pred HHHHHHHCCCEEEEEEECCCCCCCCC Q ss_conf 99999973984999853155887677 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNFRGIGRSEGE 74 (225) Q Consensus 49 la~~l~~~G~~vl~fd~RG~G~S~G~ 74 (225) .-++|.+.|+. +|+= .|-|-|. T Consensus 23 vl~aLee~gI~---id~I-~GtS~GA 44 (300) T PRK10279 23 VINALKKVGIE---IDIV-AGCSIGS 44 (300) T ss_pred HHHHHHHCCCC---CCEE-EEECHHH T ss_conf 99999986999---7789-8407999 No 215 >KOG1282 consensus Probab=56.54 E-value=16 Score=17.19 Aligned_cols=63 Identities=17% Similarity=0.202 Sum_probs=44.6 Q ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHCCC-------C-------------CEEEEEECCCCCCCCC-CHHHHHHHHHHH Q ss_conf 737761686550289999999999860779-------9-------------6079997879847558-589999999999 Q gi|254781093|r 148 SGLIINGSNDTVATTSDVKDLVNKLMNQKG-------I-------------SITHKVIPDANHFFIG-KVDELINECAHY 206 (225) Q Consensus 148 p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~-------~-------------~~~~~~i~ga~H~f~~-~~~~l~~~i~~f 206 (225) ++||-.|+.|-+||.-..+.|.+.+.-... . .+.+..|.||+|.-.- +.+.....+..| T Consensus 365 rvliysGD~D~~~p~~gt~~~i~~L~~~~~~~~~pW~~~~~qvaG~~~~Y~~ltf~tVrGaGH~VP~~~p~~al~m~~~f 444 (454) T KOG1282 365 RVLIYSGDHDLVVPFLGTQAWIKSLNLSITDEWRPWYHKGGQVAGYTKTYGGLTFATVRGAGHMVPYDKPESALIMFQRF 444 (454) T ss_pred EEEEEECCCCEECCCHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEECCEEEEEEECCCCCCCCCCCHHHHHHHHHH T ss_conf 79999688550376450599999626835457607753797133368985687799994775457878847899999999 Q ss_pred HHHH Q ss_conf 9973 Q gi|254781093|r 207 LDNS 210 (225) Q Consensus 207 l~~~ 210 (225) |... T Consensus 445 l~g~ 448 (454) T KOG1282 445 LNGQ 448 (454) T ss_pred HCCC T ss_conf 7599 No 216 >cd01522 RHOD_1 Member of the Rhodanese Homology Domain superfamily, subgroup 1. This CD includes the putative rhodanese-related sulfurtransferases of several uncharacterized proteins. Probab=53.93 E-value=17 Score=16.93 Aligned_cols=28 Identities=21% Similarity=0.335 Sum_probs=17.5 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCE Q ss_conf 7787999957899888899899999999999973984 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFV 59 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~ 59 (225) ..++++++|+ .|.+. ...|+.|.+.||. T Consensus 63 ~d~~ivv~C~-----sG~RS----~~Aa~~L~~~Gf~ 90 (117) T cd01522 63 KDRPVLLLCR-----SGNRS----IAAAEAAAQAGFT 90 (117) T ss_pred CCCEEEEECC-----CCCHH----HHHHHHHHHCCCC T ss_conf 9986999889-----98159----9999999985987 No 217 >COG0505 CarA Carbamoylphosphate synthase small subunit [Amino acid transport and metabolism / Nucleotide transport and metabolism] Probab=53.88 E-value=17 Score=16.93 Aligned_cols=65 Identities=17% Similarity=0.181 Sum_probs=31.8 Q ss_pred HHHHHHHCCCEEEEEEECCC------CCCCCCC-CCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHH Q ss_conf 99999973984999853155------8876778-5210238999998999986235467405888720368787 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNFRGI------GRSEGEF-DYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISM 115 (225) Q Consensus 49 la~~l~~~G~~vl~fd~RG~------G~S~G~~-~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val 115 (225) +-+.|.++|+.+..+-|.=. =.-+|-| |+|.|.-+++..+++.+++.- . ..+=++|-|+|=.+-. T Consensus 193 Ilr~L~~rg~~vtVVP~~t~~eeIl~~~pDGiflSNGPGDP~~~~~~i~~ik~l~-~-~~iPifGICLGHQlla 264 (368) T COG0505 193 ILRELVKRGCRVTVVPADTSAEEILALNPDGIFLSNGPGDPAPLDYAIETIKELL-G-TKIPIFGICLGHQLLA 264 (368) T ss_pred HHHHHHHCCCEEEEECCCCCHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHH-C-CCCCEEEECHHHHHHH T ss_conf 9999997798499976989999998518997997489988067789999999986-4-6898378868889999 No 218 >pfam06441 EHN Epoxide hydrolase N terminus. This family represents the N-terminal region of the eukaryotic epoxide hydrolase protein. Epoxide hydrolases (EC:3.3.2.3) comprise a group of functionally related enzymes that catalyse the addition of water to oxirane compounds (epoxides), thereby usually generating vicinal trans-diols. EHs have been found in all types of living organisms, including mammals, invertebrates, plants, fungi and bacteria. In animals, the major interest in EH is directed towards their detoxification capacity for epoxides since they are important safeguards against the cytotoxic and genotoxic potential of oxirane derivatives that are often reactive electrophiles because of the high tension of the three-membered ring system and the strong polarisation of the C--O bonds. This is of significant relevance because epoxides are frequent intermediary metabolites which arise during the biotransformation of foreign compounds. This family is often found in conjunction with pf Probab=53.22 E-value=18 Score=16.86 Aligned_cols=17 Identities=18% Similarity=0.182 Sum_probs=11.9 Q ss_pred CCCCCEEEEECCCCCCC Q ss_conf 33477377616865502 Q gi|254781093|r 144 PCPSSGLIINGSNDTVA 160 (225) Q Consensus 144 ~~~~p~LiIhG~~D~~v 160 (225) +-..|.|++||.=..+. T Consensus 86 ~~aiPLll~HGWPgSf~ 102 (108) T pfam06441 86 PDAIPLLLTHGWPGSFL 102 (108) T ss_pred CCCEEEEEECCCCCCHH T ss_conf 99706899589997187 No 219 >cd07225 Pat_PNPLA6_PNPLA7 Patatin-like phospholipase domain containing protein 6 and protein 7. Patatin-like phospholipase domain containing protein 6 (PNPLA6) and protein 7 (PNPLA7) are 60% identical to each other. PNPLA6 is commonly known as Neuropathy Target Esterase (NTE). NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. PNPLA7 is an insulin-regulated phospholipase that is homologous to Neuropathy Target Esterase (NTE or PNPLA6) and is also known as NTE-related esterase (NRE). Human NRE is predominantly expressed in prostate, white adipose, and pancreatic tissue. NRE Probab=52.90 E-value=18 Score=16.83 Aligned_cols=31 Identities=16% Similarity=0.234 Sum_probs=13.8 Q ss_pred HHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 989999862354674058887203687876450 Q gi|254781093|r 87 AALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 87 aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) -+++.+.+.+...+ ++.|-|+|+.++..+|+ T Consensus 32 GVLkaLeE~GI~id--~I~GtS~GAivgalyA~ 62 (306) T cd07225 32 GVIKALEEAGIPVD--MVGGTSIGAFIGALYAE 62 (306) T ss_pred HHHHHHHHCCCCCC--EEEECHHHHHHHHHHHC T ss_conf 99999998699931--99830398999999987 No 220 >cd01526 RHOD_ThiF Member of the Rhodanese Homology Domain superfamily. This CD includes several putative molybdopterin synthase sulfurylases including the molybdenum cofactor biosynthetic protein (CnxF) of Aspergillus nidulans and the molybdenum cofactor synthesis protein 3 (MOCS3) of Homo sapiens. These rhodanese-like domains are found C-terminal of the ThiF and MoeZ_MoeB domains. Probab=50.81 E-value=19 Score=16.63 Aligned_cols=36 Identities=14% Similarity=0.252 Sum_probs=17.8 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 778799995789988889989999999999997398499985315 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) ...+++++|+ +|... ...|+.|.++||.--..|.+| T Consensus 71 ~d~~Ivv~C~-----~G~rS----~~Aa~~L~~~G~~~~V~~l~G 106 (122) T cd01526 71 KDSPIYVVCR-----RGNDS----QTAVRKLKELGLERFVRDIIG 106 (122) T ss_pred CCCCEEEECC-----CCCHH----HHHHHHHHHCCCCCCEEECCC T ss_conf 9987899849-----99659----999999998199864787648 No 221 >cd01523 RHOD_Lact_B Member of the Rhodanese Homology Domain superfamily. This CD includes predicted proteins with rhodanese-like domains found N-terminal of the metallo-beta-lactamase domain. Probab=49.92 E-value=20 Score=16.55 Aligned_cols=33 Identities=15% Similarity=0.183 Sum_probs=18.4 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 78799995789988889989999999999997398499985315 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) .+.++|+|. .|.. ....|+.|+++||.| .|..| T Consensus 61 d~~ivv~C~-----~G~r----S~~aa~~L~~~G~~v--~~L~G 93 (100) T cd01523 61 DQEVTVICA-----KEGS----SQFVAELLAERGYDV--DYLAG 93 (100) T ss_pred CCEEEEEEC-----CCHH----HHHHHHHHHHCCCEE--EEECC T ss_conf 983999928-----9656----999999999879837--98366 No 222 >PRK13869 plasmid-partitioning protein RepA; Provisional Probab=49.71 E-value=20 Score=16.53 Aligned_cols=38 Identities=13% Similarity=0.222 Sum_probs=27.0 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 79999578998888998999999999999739849998531 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR 66 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R 66 (225) .++-+.. .=||..-.-....+|+.|+..|+-||.+|.= T Consensus 122 kVIaVaN---~KGGVGKTTtav~LA~~LA~~G~RVLlIDLD 159 (405) T PRK13869 122 QVIAVTN---FKGGSGKTTTSAHLAQYLALQGYRVLAVDLD 159 (405) T ss_pred EEEEEEC---CCCCCHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 2899978---8877659999999999999779988999645 No 223 >pfam09142 TruB_C tRNA Pseudouridine synthase II, C terminal. The C terminal domain of tRNA Pseudouridine synthase II adopts a PUA (pfam01472) fold, with a four-stranded mixed beta-sheet flanked by one alpha-helix on each side. It allows for binding of the enzyme to RNA, as well as stabilisation of the RNA molecule. Probab=48.12 E-value=19 Score=16.63 Aligned_cols=23 Identities=22% Similarity=0.459 Sum_probs=16.6 Q ss_pred CCCCEEEEEEECCCCCCCCEEEE Q ss_conf 89961879996688877879999 Q gi|254781093|r 8 GPSGRLEGRYQPSTNPNAPIALI 30 (225) Q Consensus 8 g~~G~l~~~~~~~~~~~~~~vv~ 30 (225) +|||++-+++...++..++++|+ T Consensus 33 ~PdG~liAL~~~~g~~a~~v~Vf 55 (56) T pfam09142 33 APDGRLIALLEERGGRARPVVVF 55 (56) T ss_pred CCCCCEEEEEECCCCCEEEEEEE T ss_conf 99996899984479965057875 No 224 >COG0288 CynT Carbonic anhydrase [Inorganic ion transport and metabolism] Probab=47.59 E-value=22 Score=16.33 Aligned_cols=79 Identities=19% Similarity=0.173 Sum_probs=51.3 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 78799995789988889989999999999997398499985315588767785210238999998999986235467405 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCW 103 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~ 103 (225) ..|-++++ |++|+|+. ++.....|.+=+ |-.|..|..--.++ .++.++++|+...- +.+.|+ T Consensus 34 Q~P~~lii-------~C~DSRv~---~e~i~~~~pGdl-fV~RNaGniV~~~~------~~~l~sleyAv~~L-~v~~Ii 95 (207) T COG0288 34 QSPKALII-------TCSDSRVP---PELITGLGPGDL-FVIRNAGNIVTHPD------GSVLRSLEYAVYVL-GVKEII 95 (207) T ss_pred CCCCEEEE-------EECCCCCC---HHHHHCCCCCCE-EEEEECCCCCCCCC------CCHHHHHHHHHHHC-CCCEEE T ss_conf 99857999-------87057888---899838998657-89963053258886------31467899999974-998899 Q ss_pred EEEECHHHHHHHHHHHC Q ss_conf 88872036878764503 Q gi|254781093|r 104 IAGYSFGAWISMQLLMR 120 (225) Q Consensus 104 l~G~S~Gg~val~~a~~ 120 (225) ++||+==|++.+.+... T Consensus 96 V~GH~~CGav~aa~~~~ 112 (207) T COG0288 96 VCGHTDCGAVKAALDDQ 112 (207) T ss_pred EECCCCCHHHHHCCCCC T ss_conf 96689847877025341 No 225 >COG2885 OmpA Outer membrane protein and related peptidoglycan-associated (lipo)proteins [Cell envelope biogenesis, outer membrane] Probab=45.97 E-value=23 Score=16.17 Aligned_cols=59 Identities=20% Similarity=0.233 Sum_probs=28.1 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHH--HHHHHHHHHHHHHHCCCCCCEEEEEE Q ss_conf 999999999739849998531558876778521023--89999989999862354674058887 Q gi|254781093|r 46 VYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGE--LSDAAAALDWVQSLNPESKSCWIAGY 107 (225) Q Consensus 46 ~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E--~~D~~aa~~~l~~~~~~~~~i~l~G~ 107 (225) +..+|+.|.+.. -.++.-.||.+|.|+-.+ +.. ..-+.+|.+||.+++....+|...|| T Consensus 102 L~~~a~~L~~~p--~~~i~V~GHTD~~Gs~~y-N~~LS~rRA~aV~~~L~~~Gv~~~~i~~~G~ 162 (190) T COG2885 102 LDELAKYLKKNP--ITRILVEGHTDSTGSDEY-NQALSERRAEAVADYLVSQGVVADRISTVGY 162 (190) T ss_pred HHHHHHHHHHCC--CCEEEEEECCCCCCCHHH-HHHHHHHHHHHHHHHHHHCCCCCCCEEEEEC T ss_conf 999999998789--977999755788798678-7688899999999999984997011789861 No 226 >TIGR03131 malonate_mdcH malonate decarboxylase, epsilon subunit. Members of this protein family are the epsilon subunit of malonate decarboxylase. This subunit has malonyl-CoA/dephospho-CoA acyltransferase activity. Malonate decarboxylase may be a soluble enzyme, or linked to membrane subunits and active as a sodium pump. The epsilon subunit is closely related to the malonyl CoA-acyl carrier protein (ACP) transacylase family described by TIGR00128, but acts on an ACP subunit of malonate decarboxylase that has an unusual coenzyme A derivative as its prothetic group. Probab=45.83 E-value=23 Score=16.16 Aligned_cols=28 Identities=25% Similarity=0.402 Sum_probs=19.1 Q ss_pred HHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 999862354674058887203687876450 Q gi|254781093|r 90 DWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 90 ~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) +.+++.+. ++-+++|||+|-+.|+.++. T Consensus 68 ~~l~~~g~--~P~~v~GHSlGE~aA~~~AG 95 (295) T TIGR03131 68 RALLALLP--RPSAVAGYSVGEYAAAVVAG 95 (295) T ss_pred HHHHHCCC--CCCEEEECCHHHHHHHHHCC T ss_conf 99996499--88889767775899999839 No 227 >PRK13705 plasmid-partitioning protein SopA; Provisional Probab=45.66 E-value=23 Score=16.14 Aligned_cols=43 Identities=14% Similarity=0.184 Sum_probs=28.5 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE-EC Q ss_conf 88778799995789988889989999999999997398499985-31 Q gi|254781093|r 21 TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFN-FR 66 (225) Q Consensus 21 ~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd-~R 66 (225) ++....++-+.. .=||..-.-....+|+.|+..|+-||..| += T Consensus 102 ~g~~~~VIAVaN---qKGGvGKTTTavnLA~~LAl~G~RVLlID~LD 145 (388) T PRK13705 102 EDEFPPVIGVAA---HKGGVYKTSVSVHLAQDLALKGLRVLLVEGND 145 (388) T ss_pred CCCCCEEEEEEC---CCCCCCHHHHHHHHHHHHHHCCCEEEEEECCC T ss_conf 999872899952---78885599999999999997799089995878 No 228 >pfam10236 DAP3 Mitochondrial ribosomal death-associated protein 3. This is a family of conserved proteins which were originally described as death-associated-protein-3 (DAP-3). The proteins carry a P-loop DNA-binding motif, and induce apoptosis. DAP3 has been shown to be a pro-apoptotic factor in the mitochondrial matrix and to be crucial for mitochondrial biogenesis and so has also been designated as MRP-S29 (mitochondrial ribosomal protein subunit 29). Probab=44.77 E-value=16 Score=17.10 Aligned_cols=44 Identities=20% Similarity=0.149 Sum_probs=33.9 Q ss_pred EEEECCCCCCCCHHHHHHHHHHHHHCCC-CCEEEEEECCCCCCCCC Q ss_conf 7761686550289999999999860779-96079997879847558 Q gi|254781093|r 150 LIINGSNDTVATTSDVKDLVNKLMNQKG-ISITHKVIPDANHFFIG 194 (225) Q Consensus 150 LiIhG~~D~~vp~~~~~~~~~~l~~~~~-~~~~~~~i~ga~H~f~~ 194 (225) |+-+|-++.-........+.+.+..+.+ .++ +..|+|.+|+|.. T Consensus 127 L~~~gi~~~~~a~~~~~al~~EL~~~s~~~pv-LvavDg~n~~~~~ 171 (274) T pfam10236 127 LASLGINDAKSAWDVFQALWKELTAQSKAPPV-LVAVDGFNALFGP 171 (274) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCE-EEEECCHHHHHCC T ss_conf 99862347024689999999999834799988-9996585675576 No 229 >cd01447 Polysulfide_ST Polysulfide-sulfurtransferase - Rhodanese Homology Domain. This domain is believed to serve as a polysulfide binding and transferase domain in anaerobic gram-negative bacteria, functioning in oxidative phosphorylation with polysulfide-sulfur as a terminal electron acceptor. The active site contains the same conserved cysteine that is the catalytic residue in other Rhodanese Homology Domain proteins. Probab=44.71 E-value=24 Score=16.05 Aligned_cols=36 Identities=17% Similarity=0.241 Sum_probs=21.6 Q ss_pred CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 8778799995789988889989999999999997398499985315 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) ...++++++|. .|.+ ....++.|.++||.-+ +|+.| T Consensus 59 ~~~~~iv~~C~-----~G~R----S~~aa~~L~~~Gy~~v-~~l~G 94 (103) T cd01447 59 AEDKPFVFYCA-----SGWR----SALAGKTLQDMGLKPV-YNIEG 94 (103) T ss_pred CCCCEEEEECC-----CCCH----HHHHHHHHHHCCCCCE-EECCC T ss_conf 88887999869-----9805----9999999998598443-88457 No 230 >PRK06756 flavodoxin; Provisional Probab=44.58 E-value=24 Score=16.04 Aligned_cols=62 Identities=16% Similarity=0.231 Sum_probs=39.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCC------CC----CCCCCCCHHHH-HHHHHHHHHHHHCCCCCCEEEEEE Q ss_conf 9999999997398499985315588------76----77852102389-999989999862354674058887 Q gi|254781093|r 46 VYQLFYLFQQRGFVSLRFNFRGIGR------SE----GEFDYGDGELS-DAAAALDWVQSLNPESKSCWIAGY 107 (225) Q Consensus 46 ~~~la~~l~~~G~~vl~fd~RG~G~------S~----G~~~~g~~E~~-D~~aa~~~l~~~~~~~~~i~l~G~ 107 (225) ...+++.+.+.|..|-.+|..-.-. .. |.++.|.+++. +....++.+.......++++++|. T Consensus 9 A~~Ia~g~~~~G~~V~~~~~~~~~~~~~l~~~d~~ilGspT~g~g~lp~~~~~f~~~l~~~~l~gk~~a~FGs 81 (138) T PRK06756 9 ADHIAGVIRETENEIEVIDIMDSPEASILEQYDGIILGAYTWGDGDLPDDFLDFYDAMDSIDLTGKKAAVFGS 81 (138) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCCHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEEC T ss_conf 9999999997699079997026773768975897999888768783777899999998626447976999933 No 231 >cd01521 RHOD_PspE2 Member of the Rhodanese Homology Domain superfamily. This CD includes the putative rhodanese-like protein, Psp2, of Yersinia pestis biovar Medievalis and other similar uncharacterized proteins. Probab=44.36 E-value=15 Score=17.29 Aligned_cols=11 Identities=18% Similarity=0.229 Sum_probs=5.5 Q ss_pred HHHHHHHCCCE Q ss_conf 99999973984 Q gi|254781093|r 49 LFYLFQQRGFV 59 (225) Q Consensus 49 la~~l~~~G~~ 59 (225) .|..|.++||. T Consensus 82 aA~~L~~~Gf~ 92 (110) T cd01521 82 AALKLAELGFP 92 (110) T ss_pred HHHHHHHCCCC T ss_conf 99999977997 No 232 >pfam11909 NdhN NADH-quinone oxidoreductase cyanobacterial subunit N. The proton-pumping NADH:ubiquinone oxidoreductase catalyzes the electron transfer from NADH to ubiquinone linked with proton translocation across the membrane. It is the largest, most complex and least understood of the respiratory chain enzymes and is referred to as Complex I. The subunit composition of the enzyme varies between groups of organisms. Complex I originating from mammalian mitochondria contains 45 different proteins, whereas in bacteria, the corresponding complex NDH-1 consists of 14 different polypeptides. Homologues of these 14 proteins are found among subunits of the mitochondrial complex I, and therefore bacterial NDH-1 might be considered a model proton-pumping NADH dehydrogenase with a minimal set of subunits. Escherichia coli NDH-1 readily disintegrates into 3 subcomplexes: a water-soluble NADH dehydrogenase fragment (NuoE, -F, and -G),the connecting fragment (NuoB, -C, -D, and -I), and the membra Probab=43.96 E-value=22 Score=16.31 Aligned_cols=131 Identities=18% Similarity=0.260 Sum_probs=64.6 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 78799995789988889989999999999997398499985315588767785210238999998999986235467405 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCW 103 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~ 103 (225) ...++|+.. ..||... ++-++|...||.++...-||-|+-+ .|+.+.+. .++-. T Consensus 17 ~gaLavy~P---lEGG~Eg-----Ry~RRLRa~GY~t~~~SArGLGD~e-----------------ayL~~~HG-VRPpH 70 (154) T pfam11909 17 AGALAVYAP---LEGGYEG-----RYLRRLRAAGYRTLITSARGLGDPE-----------------AFLTQLHG-VRPPH 70 (154) T ss_pred CCCEEEECC---CCCCHHH-----HHHHHHHHCCCEEEEECCCCCCCHH-----------------HHHHHCCC-CCCCC T ss_conf 595089716---7776047-----8899998568158884156778999-----------------99721368-79987 Q ss_pred EEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 88872036878764503862133012275744333222343347737761686550289999999999860779960799 Q gi|254781093|r 104 IAGYSFGAWISMQLLMRRPEINGFISVAPQPKSYDFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQKGISITHK 183 (225) Q Consensus 104 l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~ 183 (225) | |+.-=| +...+ +-+-.-||...+.++.|.+..++ |++.=-+-++.+-++...+.. +.+ ....+.++ T Consensus 71 L-Gk~~ig--------r~~av-G~v~~vpP~~~y~Le~Lp~~~kg-lvLWliEG~vLS~~El~~L~~-L~~-~ePrlKvV 137 (154) T pfam11909 71 L-GKQSIG--------RNAAV-GEVYYVPPQLGYYLEELPPKAKG-LVLWLLEGQVLSKSELEYLCD-LPQ-REPRLKVV 137 (154) T ss_pred C-CCCCCC--------CCCCC-CEEEEECCHHHHHHHHCCCCCCC-EEEEEEECEECCHHHHHHHHH-CCC-CCCCEEEE T ss_conf 7-864345--------67653-30488581665677636987775-799998340447999999984-022-37863899 Q ss_pred EECCCCCCCC Q ss_conf 9787984755 Q gi|254781093|r 184 VIPDANHFFI 193 (225) Q Consensus 184 ~i~ga~H~f~ 193 (225) +--|.+-.|. T Consensus 138 vEmGg~R~~r 147 (154) T pfam11909 138 VEMGGDRSFR 147 (154) T ss_pred EEECCCEEEE T ss_conf 9817731587 No 233 >cd07227 Pat_Fungal_NTE1 Fungal patatin-like phospholipase domain containing protein 6. These are fungal Neuropathy Target Esterase (NTE), commonly referred to as NTE1. Patatin-like phospholipase. NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. This family includes NTE1 from fungi. Probab=43.65 E-value=25 Score=15.95 Aligned_cols=18 Identities=11% Similarity=0.302 Sum_probs=10.9 Q ss_pred EEEECCCCCCCCHHHHHH Q ss_conf 776168655028999999 Q gi|254781093|r 150 LIINGSNDTVATTSDVKD 167 (225) Q Consensus 150 LiIhG~~D~~vp~~~~~~ 167 (225) ++|=|--=+-+|++.+++ T Consensus 161 ~lvDGGv~nnlPvd~~~~ 178 (269) T cd07227 161 MLLDGGYMDNLPVSPMRS 178 (269) T ss_pred EEECCCCCCCHHHHHHHH T ss_conf 997188066626999998 No 234 >pfam03283 PAE Pectinacetylesterase. Probab=43.18 E-value=26 Score=15.91 Aligned_cols=35 Identities=20% Similarity=0.162 Sum_probs=25.8 Q ss_pred HHHHHHHHHHHHHC-CCCCCEEEEEECHHHHHHHHH Q ss_conf 99999899998623-546740588872036878764 Q gi|254781093|r 83 SDAAAALDWVQSLN-PESKSCWIAGYSFGAWISMQL 117 (225) Q Consensus 83 ~D~~aa~~~l~~~~-~~~~~i~l~G~S~Gg~val~~ 117 (225) .=+.|+++.+...+ ..++.++|.|-|-||..++.- T Consensus 138 rI~~avi~dLl~~Gl~~A~~viLsG~SAGGLa~~lh 173 (366) T pfam03283 138 RIWEAVIDDLLSKGMKKAKQVILSGCSAGGLAAILH 173 (366) T ss_pred HHHHHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHH T ss_conf 999999999998444105647872467511777766 No 235 >PRK00162 glpE thiosulfate sulfurtransferase; Validated Probab=42.87 E-value=26 Score=15.88 Aligned_cols=11 Identities=27% Similarity=0.307 Sum_probs=5.1 Q ss_pred HHHHHHHCCCE Q ss_conf 99999973984 Q gi|254781093|r 49 LFYLFQQRGFV 59 (225) Q Consensus 49 la~~l~~~G~~ 59 (225) .|+.|.++||. T Consensus 74 aa~~L~~~Gf~ 84 (104) T PRK00162 74 AAQYLLQQGFD 84 (104) T ss_pred HHHHHHHCCCC T ss_conf 99999973764 No 236 >TIGR03453 partition_RepA plasmid partitioning protein RepA. Members of this family are the RepA (or ParA) protein involved in replicon partitioning. All known examples occur in bacterial species with two or more replicons, on a plasmid or the smaller chromosome. Note that an apparent exception may be seen as a pseudomolecule from assembly of an incompletely sequenced genome. Members of this family belong to a larger family that also includes the enzyme cobyrinic acid a,c-diamide synthase, but assignment of that name to members of this family would be in error. Probab=42.68 E-value=26 Score=15.86 Aligned_cols=31 Identities=19% Similarity=0.274 Sum_probs=24.1 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 8888998999999999999739849998531 Q gi|254781093|r 36 RFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR 66 (225) Q Consensus 36 ~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R 66 (225) .=||..-.-....+|+.|+.+|+-||.+|+= T Consensus 112 ~KGGVGKTTtav~LA~~LA~~G~RVLvIDLD 142 (387) T TIGR03453 112 FKGGSGKTTTSAHLAQYLALRGYRVLAIDLD 142 (387) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 8876569999999999999779988999537 No 237 >pfam00698 Acyl_transf_1 Acyl transferase domain. Probab=42.39 E-value=23 Score=16.23 Aligned_cols=29 Identities=14% Similarity=0.146 Sum_probs=18.8 Q ss_pred HHHHHHCCCCCCEEEEEECHHHHHHHHHHHC Q ss_conf 9998623546740588872036878764503 Q gi|254781093|r 90 DWVQSLNPESKSCWIAGYSFGAWISMQLLMR 120 (225) Q Consensus 90 ~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~ 120 (225) +.+++.+ .++-+++|||+|=+.|+.++.- T Consensus 75 ~~l~~~G--i~P~~v~GHSlGE~aA~~~aG~ 103 (319) T pfam00698 75 ALLRSYG--VTPAAVVGHSLGEIAAAVVAGA 103 (319) T ss_pred HHHHHCC--CCCCEEEECCCCHHHHHHHCCC T ss_conf 0033229--8766686637048999987289 No 238 >cd01444 GlpE_ST GlpE sulfurtransferase (ST) and homologs are members of the Rhodanese Homology Domain superfamily. Unlike other rhodanese sulfurtransferases, GlpE is a single domain protein but indications are that it functions as a dimer. The active site contains a catalytically active cysteine. Probab=42.29 E-value=26 Score=15.83 Aligned_cols=10 Identities=30% Similarity=0.567 Sum_probs=4.3 Q ss_pred HHHHHHHCCC Q ss_conf 9999997398 Q gi|254781093|r 49 LFYLFQQRGF 58 (225) Q Consensus 49 la~~l~~~G~ 58 (225) .++.|.+.|| T Consensus 72 aa~~L~~~G~ 81 (96) T cd01444 72 LAQALREAGF 81 (96) T ss_pred HHHHHHHHCC T ss_conf 9999998289 No 239 >TIGR02347 chap_CCT_zeta T-complex protein 1, zeta subunit; InterPro: IPR012722 Members of this eukaryotic family are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT zeta chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=41.56 E-value=17 Score=16.90 Aligned_cols=53 Identities=26% Similarity=0.461 Sum_probs=36.2 Q ss_pred CCCCCCCCCC---EEEEECCCCCCCCHHHHHH-HHHHHHHCCCCCEEEEEECCCCCCCC Q ss_conf 2223433477---3776168655028999999-99998607799607999787984755 Q gi|254781093|r 139 FSFLAPCPSS---GLIINGSNDTVATTSDVKD-LVNKLMNQKGISITHKVIPDANHFFI 193 (225) Q Consensus 139 ~~~l~~~~~p---~LiIhG~~D~~vp~~~~~~-~~~~l~~~~~~~~~~~~i~ga~H~f~ 193 (225) |.|...|..| +++|.|.+|..+ ++..+ ..+.|+.-++.=-+--+++||+=|.- T Consensus 370 yTFiEe~~~PkScTiLIkGpn~~ti--~QiKDAvRDGLRAv~N~i~D~cv~~GAGAFE~ 426 (548) T TIGR02347 370 YTFIEEVKNPKSCTILIKGPNDHTI--KQIKDAVRDGLRAVKNAIEDKCVVPGAGAFEI 426 (548) T ss_pred CEEEEECCCCCEEEEEEECCCHHHH--HHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH T ss_conf 0022103878616887328864467--76467665225764545413551356179999 No 240 >pfam05576 Peptidase_S37 PS-10 peptidase S37. These serine proteases have been found in Streptomyces species. Probab=41.41 E-value=25 Score=15.98 Aligned_cols=71 Identities=14% Similarity=0.119 Sum_probs=36.2 Q ss_pred CEEEEEEECCCCCCCC---CCCCC--CHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC-CHHEEEEC Q ss_conf 8499985315588767---78521--02389999989999862354674058887203687876450386-21330122 Q gi|254781093|r 58 FVSLRFNFRGIGRSEG---EFDYG--DGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP-EINGFISV 130 (225) Q Consensus 58 ~~vl~fd~RG~G~S~G---~~~~g--~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p-~i~~~v~i 130 (225) -+-+...||=.+.|.= +|+.- .....|...++.-++..++. +=+-.|-|-||..|+..=.-.| ++.+.|.. T Consensus 89 ~NqvsvE~RfF~~SrP~p~DWs~Lti~qAA~D~Hrv~~A~k~iY~g--kWiSTG~SKGGmTa~y~rrFyP~DvD~tVaY 165 (448) T pfam05576 89 GNQVSMEYRYFTPSRPAPADWSKLDIWQAASDQHRIFKALKPLYSK--NWISTGGSKGGMTATYYERFYPRDMDGVVAY 165 (448) T ss_pred CCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCC--CCEECCCCCCCEEEEEEEEECCCCCCEEEEE T ss_conf 6446888741158989988802300966404678999998864578--6354366778534687732188777856651 No 241 >cd01529 4RHOD_Repeats Member of the Rhodanese Homology Domain superfamily. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. Only the second and most of the fourth repeats contain the putative catalytic Cys residue. This CD aligns the 1st , 2nd, 3rd, and 4th repeats. Probab=41.15 E-value=27 Score=15.72 Aligned_cols=35 Identities=11% Similarity=-0.004 Sum_probs=21.0 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 778799995789988889989999999999997398499985315 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) ...+++|+|+ +|.+ ....++.|.++||.-+ ++..| T Consensus 55 ~~~~ivv~C~-----~G~R----s~~Aa~~L~~~G~~~V-~~L~G 89 (96) T cd01529 55 RATRYVLTCD-----GSLL----ARFAAQELLALGGKPV-ALLDG 89 (96) T ss_pred CCCEEEEECC-----CCCH----HHHHHHHHHHCCCCCE-EEECC T ss_conf 9997999869-----9846----9999999998589880-99059 No 242 >TIGR02802 Pal_lipo peptidoglycan-associated lipoprotein; InterPro: IPR014169 Members in this entry represent the peptidoglycan-associated lipoprotein (Pal or OprL) protein of the Tol-Pal system. The system appears to be involved both in the maintenance of outer membrane integrity and in the import of certain organic molecules as nutrients. Members of this family contain a hydrodrophobic lipoprotein signal sequence, a conserved N-terminal cleavage and modification site, a poorly conserved low-complexity region, together comprising about 65 amino acids, and a well-conserved C-terminal domain. The entry represents only the conserved C-terminal domain.. Probab=40.42 E-value=28 Score=15.65 Aligned_cols=55 Identities=22% Similarity=0.254 Sum_probs=32.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCCCC-CCHHH------HHHHHHHHHHHHHCCCCCCEEEEEECHH Q ss_conf 99999999973984999853155887677852-10238------9999989999862354674058887203 Q gi|254781093|r 46 VYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDY-GDGEL------SDAAAALDWVQSLNPESKSCWIAGYSFG 110 (225) Q Consensus 46 ~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~-g~~E~------~D~~aa~~~l~~~~~~~~~i~l~G~S~G 110 (225) +..=|+.|.+..=.-+.. ||+.|+ |.-|. .=+.||-+||..++....+|=++ ||| T Consensus 19 l~~hA~~L~~~p~~~V~i--------EGH~DERGTrEYNlALGeRRA~AV~~yL~~~GV~~~q~~~v--SYG 80 (104) T TIGR02802 19 LDAHAQYLKKNPSVRVTI--------EGHTDERGTREYNLALGERRANAVKDYLQAKGVSASQIETV--SYG 80 (104) T ss_pred HHHHHHHHHHCCCCEEEE--------EEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEE--ECC T ss_conf 999899998679937888--------63078888877777788999999999999738961100463--026 No 243 >PRK06490 glutamine amidotransferase; Provisional Probab=39.98 E-value=29 Score=15.61 Aligned_cols=88 Identities=19% Similarity=0.200 Sum_probs=44.9 Q ss_pred CCCCCCEEEEEC-CCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE-ECCCC--CCCCCCCC-----CCH----HHHHHHH Q ss_conf 887787999957-89988889989999999999997398499985-31558--87677852-----102----3899999 Q gi|254781093|r 21 TNPNAPIALILH-PHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFN-FRGIG--RSEGEFDY-----GDG----ELSDAAA 87 (225) Q Consensus 21 ~~~~~~~vv~~H-p~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd-~RG~G--~S~G~~~~-----g~~----E~~D~~a 87 (225) ....++++||.| ++..- -.+.+.|.++||..-.+. +.|-- .+--.|+. |.. +..=+.. T Consensus 8 ~~~~k~vLviqH~~~e~p---------G~i~~~L~~~G~~~~i~r~~~gd~lP~~l~~~dglvVlGGpmsa~D~~p~l~~ 78 (243) T PRK06490 8 ARDKRPILIVLHQERSTP---------GRVGQLLRERGYPLDIRRPRLGDPLPETLEDHAGAVIFGGPMSANDPDDFIRR 78 (243) T ss_pred CCCCCCEEEEECCCCCCC---------HHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHH T ss_conf 378985799966899997---------29999999789858999678999899975460589992899998998717999 Q ss_pred HHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 89999862354674058887203687876450 Q gi|254781093|r 88 ALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 88 a~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) -.+|+++-.....+ ++|.|+|+.+-..+.. T Consensus 79 e~~~I~~~l~~~~P--~LGICLGaQLlA~alG 108 (243) T PRK06490 79 EIDWISVPLKENKP--FLGICLGAQMLARHLG 108 (243) T ss_pred HHHHHHHHHHCCCC--EEEECHHHHHHHHHHC T ss_conf 99999999986998--8998884999999839 No 244 >COG0607 PspE Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism] Probab=39.97 E-value=29 Score=15.61 Aligned_cols=36 Identities=19% Similarity=0.395 Sum_probs=23.7 Q ss_pred CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 8778799995789988889989999999999997398499985315 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) ...++++|+|. .|.+. ...++.|..+||... +|+.| T Consensus 59 ~~~~~ivv~C~-----~G~rS----~~aa~~L~~~G~~~~-~~l~g 94 (110) T COG0607 59 PDDDPIVVYCA-----SGVRS----AAAAAALKLAGFTNV-YNLDG 94 (110) T ss_pred CCCCEEEEECC-----CCCCH----HHHHHHHHHCCCCCC-EECCC T ss_conf 67986999999-----97558----999999998699751-00478 No 245 >COG4947 Uncharacterized protein conserved in bacteria [Function unknown] Probab=39.93 E-value=29 Score=15.60 Aligned_cols=104 Identities=21% Similarity=0.357 Sum_probs=55.5 Q ss_pred CCCCEEEEECCCCCCCCCCCC----HHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHH----HHHHHHHHHHHHH Q ss_conf 778799995789988889989----99999999999739849998531558876778521023----8999998999986 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMND----NIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGE----LSDAAAALDWVQS 94 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~----~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E----~~D~~aa~~~l~~ 94 (225) ...|++|+- .-||.-+. -.+..+|.. -+.|-.- .|...|. .||. |=.+... ++--.|--.|+++ T Consensus 25 aG~pVvvFp----ts~Grf~eyed~G~v~ala~f-ie~G~vQ-lft~~gl-dsES-f~a~h~~~adr~~rH~AyerYv~e 96 (227) T COG4947 25 AGIPVVVFP----TSGGRFNEYEDFGMVDALASF-IEEGLVQ-LFTLSGL-DSES-FLATHKNAADRAERHRAYERYVIE 96 (227) T ss_pred CCCCEEEEE----CCCCCCHHHHHCCCHHHHHHH-HHCCCEE-EEEECCC-CHHH-HHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 897379976----699851234530028999999-8608279-9984364-3676-765237878888988999999998 Q ss_pred HCCCCCCEEEEEECHHHHHHHHHHHCCCCH-HEEEECCCCCC Q ss_conf 235467405888720368787645038621-33012275744 Q gi|254781093|r 95 LNPESKSCWIAGYSFGAWISMQLLMRRPEI-NGFISVAPQPK 135 (225) Q Consensus 95 ~~~~~~~i~l~G~S~Gg~val~~a~~~p~i-~~~v~isp~~~ 135 (225) +....+ ..+.|-|||++.|+...-+.|++ .++|++|..-. T Consensus 97 Ealpgs-~~~sgcsmGayhA~nfvfrhP~lftkvialSGvYd 137 (227) T COG4947 97 EALPGS-TIVSGCSMGAYHAANFVFRHPHLFTKVIALSGVYD 137 (227) T ss_pred HHCCCC-CCCCCCCHHHHHHHHHHEECHHHHHHHEEECCEEE T ss_conf 664797-43153353566543131018167553301201301 No 246 >KOG4372 consensus Probab=39.83 E-value=29 Score=15.59 Aligned_cols=21 Identities=14% Similarity=0.162 Sum_probs=15.2 Q ss_pred CCCEEEEEECHHHHHHHHHHH Q ss_conf 674058887203687876450 Q gi|254781093|r 99 SKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 99 ~~~i~l~G~S~Gg~val~~a~ 119 (225) .++|=.+|||+||.++-.+.+ T Consensus 149 i~kISfvghSLGGLvar~AIg 169 (405) T KOG4372 149 IEKISFVGHSLGGLVARYAIG 169 (405) T ss_pred CCEEEEEEEECCCEEEEEEEE T ss_conf 430004554157722467877 No 247 >COG0426 FpaA Uncharacterized flavoproteins [Energy production and conversion] Probab=39.61 E-value=29 Score=15.57 Aligned_cols=13 Identities=8% Similarity=-0.350 Sum_probs=5.4 Q ss_pred CEEEEEECHHHHH Q ss_conf 4058887203687 Q gi|254781093|r 101 SCWIAGYSFGAWI 113 (225) Q Consensus 101 ~i~l~G~S~Gg~v 113 (225) ++.|+--||=|.. T Consensus 248 ~V~l~Y~smyg~T 260 (388) T COG0426 248 KVDLIYDSMYGNT 260 (388) T ss_pred EEEEEEECCCCCH T ss_conf 1799984223787 No 248 >KOG2182 consensus Probab=39.60 E-value=29 Score=15.57 Aligned_cols=85 Identities=15% Similarity=0.099 Sum_probs=55.7 Q ss_pred HHHHHHHCCCEEEEEEECCCCCCCCCC--CC-------CCHHHHHHHHHHHHHHHHC-C-CCCCEEEEEECHHHHHHHHH Q ss_conf 999999739849998531558876778--52-------1023899999899998623-5-46740588872036878764 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNFRGIGRSEGEF--DY-------GDGELSDAAAALDWVQSLN-P-ESKSCWIAGYSFGAWISMQL 117 (225) Q Consensus 49 la~~l~~~G~~vl~fd~RG~G~S~G~~--~~-------g~~E~~D~~aa~~~l~~~~-~-~~~~i~l~G~S~Gg~val~~ 117 (225) ..+...+.|=.|+....|=+|.|.=.- +. ....++|+...+.-+..+. . +..+-+.+|-||-|.+++.. T Consensus 110 ~~~~AkkfgA~v~~lEHRFYG~S~P~~~~st~nlk~LSs~QALaDla~fI~~~n~k~n~~~~~~WitFGgSYsGsLsAW~ 189 (514) T KOG2182 110 WLQWAKKFGATVFQLEHRFYGQSSPIGDLSTSNLKYLSSLQALADLAEFIKAMNAKFNFSDDSKWITFGGSYSGSLSAWF 189 (514) T ss_pred HHHHHHHHCCEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHHHH T ss_conf 99999982982688640003667788877640056660898999899999998765277789976997887700899999 Q ss_pred HHCCCC-HHEEEECCCC Q ss_conf 503862-1330122757 Q gi|254781093|r 118 LMRRPE-INGFISVAPQ 133 (225) Q Consensus 118 a~~~p~-i~~~v~isp~ 133 (225) =...|+ +.|.|+-|.| T Consensus 190 R~~yPel~~GsvASSap 206 (514) T KOG2182 190 REKYPELTVGSVASSAP 206 (514) T ss_pred HHHCCHHHEEECCCCCC T ss_conf 87583454343014653 No 249 >cd01519 RHOD_HSP67B2 Member of the Rhodanese Homology Domain superfamily. This CD includes the heat shock protein 67B2 of Drosophila melanogaster and other similar proteins, many of which are uncharacterized. Probab=38.66 E-value=30 Score=15.48 Aligned_cols=35 Identities=14% Similarity=0.230 Sum_probs=19.2 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 778799995789988889989999999999997398499985315 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) ...+++++|+ +|... ...+..|.++||.-+ .||.| T Consensus 65 ~~~~iv~yC~-----~G~rS----~~a~~~l~~~Gy~~v-~~~~G 99 (106) T cd01519 65 KDKELIFYCK-----AGVRS----KAAAELARSLGYENV-GNYPG 99 (106) T ss_pred CCCCEEEECC-----CCCHH----HHHHHHHHHCCCCCE-EECCC T ss_conf 8996899839-----99739----999999998699885-88669 No 250 >cd01524 RHOD_Pyr_redox Member of the Rhodanese Homology Domain superfamily. Included in this CD are the Lactococcus lactis NADH oxidase, Bacillus cereus NADH dehydrogenase, and Bacteroides thetaiotaomicron pyridine nucleotide-disulphide oxidoreductase, and similar rhodanese-like domains found C-terminal of the pyridine nucleotide-disulphide oxidoreductase (Pyr-redox) domain and the Pyr-redox dimerization domain. Probab=38.33 E-value=30 Score=15.45 Aligned_cols=30 Identities=20% Similarity=0.231 Sum_probs=15.9 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEE Q ss_conf 778799995789988889989999999999997398499 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSL 61 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl 61 (225) ..++++++|. .|.+. ...++.|.+.||.|. T Consensus 50 k~~~ivv~C~-----~G~rS----~~aa~~L~~~G~~v~ 79 (90) T cd01524 50 KDKEIIVYCA-----VGLRG----YIAARILTQNGFKVK 79 (90) T ss_pred CCCEEEEECC-----CCCHH----HHHHHHHHHCCCCEE T ss_conf 5870999889-----98059----999999998699889 No 251 >cd00884 beta_CA_cladeB Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. CAs are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionarily distinct families of CAs (the alpha-, beta-, and gamma-CAs) which show no significant sequence identity or structural similarity. Within the beta-CA family there are four evolutionarily distinct clades (A through D). The beta-CAs are multimeric enzymes (forming dimers,tetramers,hexamers and octamers) which are present in higher plants, algae, fungi, archaea and prokaryotes. Probab=37.86 E-value=31 Score=15.41 Aligned_cols=81 Identities=22% Similarity=0.301 Sum_probs=45.7 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 78799995789988889989999999999997398499985315588767785210238999998999986235467405 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCW 103 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~ 103 (225) ..|-++++ |++|.|+.- ..+-..|.+=+ |-.|..|.=-..++. ......+.++++|+.... ..+.|+ T Consensus 24 Q~P~a~vi-------~CsDSRV~p---e~if~~~~Gdl-FVvRnagNiVpp~~~-~~~~~~~~aslEyAv~~L-~v~~Iv 90 (190) T cd00884 24 QSPKALFI-------ACSDSRVVP---ALITQTQPGEL-FVVRNVGNLVPPYEP-DGGFHGTSAAIEYAVAVL-KVEHIV 90 (190) T ss_pred CCCCEEEE-------EEECCCCCH---HHHHCCCCCCE-EEEEECCCCCCCCCC-CCCCHHHHHHHHHHHHHC-CCCEEE T ss_conf 99967999-------865358999---99828999767-998412233587767-766303789999999963-986799 Q ss_pred EEEECHHHHHHHHH Q ss_conf 88872036878764 Q gi|254781093|r 104 IAGYSFGAWISMQL 117 (225) Q Consensus 104 l~G~S~Gg~val~~ 117 (225) ++|||-=|++...+ T Consensus 91 V~GHs~CGav~Aa~ 104 (190) T cd00884 91 VCGHSDCGGIRALL 104 (190) T ss_pred EECCCCCHHHHHHH T ss_conf 94587766999886 No 252 >CHL00174 accD acetyl-CoA carboxylase beta subunit; Reviewed Probab=37.78 E-value=31 Score=15.40 Aligned_cols=156 Identities=16% Similarity=0.153 Sum_probs=75.3 Q ss_pred CEEEEEEECCCC-CCCCEEEEECCCCCCCCCCCCHH---HHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHH---H Q ss_conf 618799966888-77879999578998888998999---9999999997398499985315588767785210238---9 Q gi|254781093|r 11 GRLEGRYQPSTN-PNAPIALILHPHPRFGGTMNDNI---VYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGEL---S 83 (225) Q Consensus 11 G~l~~~~~~~~~-~~~~~vv~~Hp~p~~GG~~~~~~---~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~---~ 83 (225) |.-++.+.-.+. .+.++++.+-=+...||+|..-+ +..+++.-.+....++.|.-.|--| -.+|.--+ . T Consensus 128 g~~dAvv~g~G~I~g~~vvv~vmDF~FmGGSMGsvvGEki~ra~e~A~~~k~PlIi~saSGGAR----MQEGilSLMQMa 203 (305) T CHL00174 128 GLTDAVQTGIGQLNGIPVALGVMDFQFMGGSMGSVVGEKITRLIEYATNESLPLIIVCASGGAR----MQEGSLSLMQMA 203 (305) T ss_pred CCCEEEEEEEEEECCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCC----CCCCHHHHHHHH T ss_conf 9980799999999997799999724011565207889999999999997399789990787634----346277898779 Q ss_pred HHHHHHH-HHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC----CHHEEEECCCCCCCCC-C-CCCC-C-CCCCEEEEEC Q ss_conf 9999899-99862354674058887203687876450386----2133012275744333-2-2234-3-3477377616 Q gi|254781093|r 84 DAAAALD-WVQSLNPESKSCWIAGYSFGAWISMQLLMRRP----EINGFISVAPQPKSYD-F-SFLA-P-CPSSGLIING 154 (225) Q Consensus 84 D~~aa~~-~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p----~i~~~v~isp~~~~~~-~-~~l~-~-~~~p~LiIhG 154 (225) -..+++. ++.+.+.. --.+|.-=++||..|. .|+... +..++|.-+.+-..-. . ..|. . ...-.|+=|| T Consensus 204 Kts~Al~~~~~~~~lp-yIsvlt~PTtGGVtAS-fA~lgDiiiAEP~AlIGFAG~RVIeqTi~~~LPegFQtaEfllehG 281 (305) T CHL00174 204 KISSALYDYQSNKKLF-YISILTSPTTGGVTAS-FGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAEYLFHKG 281 (305) T ss_pred HHHHHHHHHHHCCCCC-EEEEECCCCCCCCEEE-ECCCCCEEEECCCCEEEECCCHHHHHHHCCCCCCCCCCHHHHHHCC T ss_conf 9999999998457873-8999737887780124-1025665897588667605617888861898998632269999779 Q ss_pred CCCCCCCHHHHHHHHHHH Q ss_conf 865502899999999998 Q gi|254781093|r 155 SNDTVATTSDVKDLVNKL 172 (225) Q Consensus 155 ~~D~~vp~~~~~~~~~~l 172 (225) --|.+|+-.+.++-...+ T Consensus 282 ~iD~IV~R~~lk~~l~~l 299 (305) T CHL00174 282 LFDLIVPRNLLKGVLSEL 299 (305) T ss_pred CCCEEECHHHHHHHHHHH T ss_conf 971676589999999999 No 253 >PRK08762 molybdopterin biosynthesis protein MoeB; Validated Probab=37.76 E-value=31 Score=15.40 Aligned_cols=108 Identities=17% Similarity=0.138 Sum_probs=52.1 Q ss_pred CCEEEEEEECC-CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCC---CCCCCCCCCCCHHHHHH Q ss_conf 96187999668-887787999957899888899899999999999973984999853155---88767785210238999 Q gi|254781093|r 10 SGRLEGRYQPS-TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGI---GRSEGEFDYGDGELSDA 85 (225) Q Consensus 10 ~G~l~~~~~~~-~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~---G~S~G~~~~g~~E~~D~ 85 (225) .|.|+...... ....++++|+|+ +|.+ -...|+.|.++||.-+ +|..|- |+.+|..=+-..++.|- T Consensus 43 ~~~le~~~~~~~pdkd~~ivvyC~-----sG~R----S~~Aa~~L~~~Gy~~V-~~l~GG~~aW~~aGlPvErp~~~~~~ 112 (379) T PRK08762 43 QGFLELQPATHLPDHDREIVLICQ-----SGKR----SAHAAADLRELGYRRV-ASVAGGTSAWRDAGLPLERPTLLTDE 112 (379) T ss_pred HHHHHHHHHHCCCCCCCEEEEECC-----CCHH----HHHHHHHHHHCCCCCC-EECCCCHHHHHHCCCCCCCCCCCCHH T ss_conf 244554133018899985999999-----9879----9999999997699660-88657499999779985558868811 Q ss_pred HHHH-HHHHHH-----------CCCCCCEEEEEE-CHHHHHHHHHHHCCCCHHEEEE Q ss_conf 9989-999862-----------354674058887-2036878764503862133012 Q gi|254781093|r 86 AAAL-DWVQSL-----------NPESKSCWIAGY-SFGAWISMQLLMRRPEINGFIS 129 (225) Q Consensus 86 ~aa~-~~l~~~-----------~~~~~~i~l~G~-S~Gg~val~~a~~~p~i~~~v~ 129 (225) +..+ +|-|+. .....++.++|- -+|+.++..+|+.. |..+.+ T Consensus 113 ~~~~eRYsRqi~lp~iG~~GQ~kL~~a~VlivG~GGLGs~~a~yLA~aG--VG~i~l 167 (379) T PRK08762 113 QDFLERYSRHLRLPQVGEEGQRRLARARVLLIGAGGLGSPAAFYLAAAG--VGHLRI 167 (379) T ss_pred HHHHHHHHHEECCCCCCHHHHHHHHHCCEEEECCCHHHHHHHHHHHHHC--CCEEEE T ss_conf 3034431201156226999999997397899888755799999999837--975897 No 254 >KOG1000 consensus Probab=36.20 E-value=33 Score=15.25 Aligned_cols=147 Identities=11% Similarity=0.032 Sum_probs=59.4 Q ss_pred CCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 88877879999578998888998999999999999739849998531558876778521023899999899998623546 Q gi|254781093|r 20 STNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPES 99 (225) Q Consensus 20 ~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~ 99 (225) ++.++.-.+|+|| ++.++..+-..+.+++...+|+| | +.. ..+..++. +-. |.-+. T Consensus 488 ~d~~~~KflVFaH---------H~~vLd~Iq~~~~~r~vg~IRID--G------st~--s~~R~ll~---qsF--Q~see 543 (689) T KOG1000 488 PDAPPRKFLVFAH---------HQIVLDTIQVEVNKRKVGSIRID--G------STP--SHRRTLLC---QSF--QTSEE 543 (689) T ss_pred CCCCCCEEEEEEH---------HHHHHHHHHHHHHHCCCCEEEEC--C------CCC--CHHHHHHH---HHH--CCCCC T ss_conf 5688735999941---------48799999999987479758856--9------898--43478999---873--14631 Q ss_pred CCEEEEEECHHHHHHHHHHHCCCCHHEEEECCCCCCCC------CCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf 74058887203687876450386213301227574433------322234334773776168655028999999999986 Q gi|254781093|r 100 KSCWIAGYSFGAWISMQLLMRRPEINGFISVAPQPKSY------DFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLM 173 (225) Q Consensus 100 ~~i~l~G~S~Gg~val~~a~~~p~i~~~v~isp~~~~~------~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~ 173 (225) -++++.+---+|. .+.+.+.+.-+.+-+-+.|..-.. ....-..+-.-.|+--|+.|++.=+.--.+ .+.+. T Consensus 544 v~VAvlsItA~gv-GLt~tAa~~VVFaEL~wnPgvLlQAEDRaHRiGQkssV~v~ylvAKgT~Ddy~Wp~l~~K-L~vl~ 621 (689) T KOG1000 544 VRVAVLSITAAGV-GLTLTAASVVVFAELHWNPGVLLQAEDRAHRIGQKSSVFVQYLVAKGTADDYMWPMLQQK-LDVLG 621 (689) T ss_pred EEEEEEEEEECCC-CEEEECCCEEEEEEECCCCCEEEECHHHHHHCCCCCEEEEEEEEECCCHHHHHHHHHHHH-HHHHH T ss_conf 4799999863144-414651325899982378754775146566405665113899995484577778999999-87886 Q ss_pred HCCCCCEEEEEECCCCCCC Q ss_conf 0779960799978798475 Q gi|254781093|r 174 NQKGISITHKVIPDANHFF 192 (225) Q Consensus 174 ~~~~~~~~~~~i~ga~H~f 192 (225) ..--...++...+..+-.| T Consensus 622 s~gl~s~~~~~~e~~g~s~ 640 (689) T KOG1000 622 SVGLSSDTFRTAEKMGLSF 640 (689) T ss_pred HCCCCCCCCEEECCCCEEE T ss_conf 6256756441201156010 No 255 >TIGR01064 pyruv_kin pyruvate kinase; InterPro: IPR001697 Pyruvate kinase (2.7.1.40 from EC) (PK) catalyses the final step in glycolysis , the conversion of phosphoenolpyruvate to pyruvate with concomitant phosphorylation of ADP to ATP:ADP + phosphoenolpyruvate = ATP + pyruvate The enzyme, which is found in all living organisms, requires both magnesium and potassium ions for its activity. In vertebrates, there are four tissue-specific isozymes: L (liver), R (red cells), M1 (muscle, heart and brain), and M2 (early foetal tissue). In plants, PK exists as cytoplasmic and plastid isozymes, while most bacteria and lower eukaryotes have one form, except in certain bacteria, such as Escherichia coli, that have two isozymes. All isozymes appear to be tetramers of identical subunits of ~500 residues. PK helps control the rate of glycolysis, along with phosphofructokinase (IPR000023 from INTERPRO) and hexokinase (IPR001312 from INTERPRO). PK possesses allosteric sites for numerous effectors, yet the isozymes respond differently, in keeping with their different tissue distributions . The activity of L-type (liver) PK is increased by fructose-1,6-bisphosphate (F1,6BP) and lowered by ATP and alanine (gluconeogenic precursor), therefore when glucose levels are high, glycolysis is promoted, and when levels are low, gluconeogenesis is promoted. L-type PK is also hormonally regulated, being activated by insulin and inhibited by glucagon, which covalently modifies the PK enzyme. M1-type (muscle, brain) PK is inhibited by ATP, but F1,6BP and alanine have no effect, which correlates with the function of muscle and brain, as opposed to the liver. The structure of several pyruvate kinases from various organisms have been determined , . The protein comprises three-four domains: a small N-terminal helical domain (absent in bacterial PK), a beta/alpha-barrel domain, a beta-barrel domain (inserted within the beta/alpha-barrel domain), and a 3-layer alpha/beta/alpha sandwich domain. ; GO: 0000287 magnesium ion binding, 0004743 pyruvate kinase activity, 0030955 potassium ion binding, 0006096 glycolysis. Probab=35.57 E-value=34 Score=15.19 Aligned_cols=27 Identities=26% Similarity=0.478 Sum_probs=18.5 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 398499985315588767785210238999998999986 Q gi|254781093|r 56 RGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQS 94 (225) Q Consensus 56 ~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~ 94 (225) +|.+|.|+|| |-|+| ||-...++-+|+ T Consensus 42 aGmnVaRlNF-----SHG~~-------E~h~~~i~~vR~ 68 (513) T TIGR01064 42 AGMNVARLNF-----SHGSH-------EEHGKRIENVRE 68 (513) T ss_pred HCCCEEEECC-----CCCCH-------HHHHHHHHHHHH T ss_conf 0886355426-----79898-------899999999999 No 256 >TIGR01452 PGP_euk phosphoglycolate/pyridoxal phosphate phosphatase family; InterPro: IPR006349 This family of sequences represent 2-phosphoglycolate phosphatase which is limited to the eukaryotic lineage. PGP is an essential enzyme in the glycolate salvage pathway in higher organisms (photorespiration in plants). Phosphoglycolate results from the oxidase activity of RubisCO in the Calvin cycle when concentrations of carbon dioxide are low relative to oxygen. In mammals, PGP is found in many tissues, notably in red blood cells where P-glycolate is and important activator of the hydrolysis of 2,3-bisphosphoglycerate, a major modifier of the oxygen affinity of hemoglobin. The PGPase enzyme described here is a member of the Haloacid dehalogenase superfamily of hydrolase enzymes. Unlike the bacterial PGP equivalog (IPR006346 from INTERPRO), which is a member of class (subfamily) I, this enzyme is a member of class (subfamily) II . These two enzymes have almost certainly arisen from convergent evolution (although these two ancestors may themselves have diverged from a more distant HAD superfamily progenitor). One of the sequences in the group comes from Chlamydomonas reinhardtii, a photosynthetic alga. The enzyme has been purified and characterised and these data are fully consistent with the assignment of function as a PGPase involved in photorespiration . Another, from Homo sapiens, is supported as a member of the family indirectly. Biochemical characterization of partially purified PGP's from various tissues including red blood cells have been performed while one gene for PGP has been localized to chromosome 16p13.3 . The Homo sapiens sequence maps to chromosome 22. There is indeed a related gene on chromosome 16 (and it is expressed, since EST's are found) which shows 46 0dentity and 59603b08ositives by BLAST2 (E=1e-66). Presumably, these two genes are isoforms and have the same catalytic function while being expressed in different tissues and may be differently regulated. The sequence from Caenorhabditis elegans, is only supported by sequence similarity. This family is closely related to a family of bacterial sequences including the Escherichia coli NagD and Bacillus subtilus AraL genes which are characterised by the ability to hydrolyze para-nitrophenylphosphate (pNPPases or NPPases). The Chlamydomonas reinhardtii PGPase does not catalyze this reaction and so presumably these two groups have different functions and substrate specificities. Many of the genes in this family have been annotated as pNPPases due to this association.; GO: 0016791 phosphoric monoester hydrolase activity, 0008152 metabolic process. Probab=35.43 E-value=34 Score=15.17 Aligned_cols=51 Identities=22% Similarity=0.294 Sum_probs=33.7 Q ss_pred CHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC----CCCCEEEEEECHH Q ss_conf 9999999999997398499985315588767785210238999998999986235----4674058887203 Q gi|254781093|r 43 DNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNP----ESKSCWIAGYSFG 110 (225) Q Consensus 43 ~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~----~~~~i~l~G~S~G 110 (225) .|.=..++..|+++||+.+.=+. | .-+.+|.++|+++.| +.++|+|+| --| T Consensus 45 tksRa~ya~kfa~LG~n~~ae~l---------~-------ssa~~aA~~L~~rlPG~~~~~k~VyviG-e~G 99 (288) T TIGR01452 45 TKSRAEYAKKFAKLGYNSVAEEL---------F-------SSALVAAALLRRRLPGAIDANKKVYVIG-EEG 99 (288) T ss_pred CHHHHHHHHHHHHCCCCCHHHHH---------C-------CHHHHHHHHHHHHCCCCCCCCCEEEEEC-CCC T ss_conf 41048999999751776024552---------2-------6789999999873778888886289977-702 No 257 >COG0469 PykF Pyruvate kinase [Carbohydrate transport and metabolism] Probab=35.19 E-value=34 Score=15.15 Aligned_cols=24 Identities=25% Similarity=0.519 Sum_probs=17.7 Q ss_pred HHHHHHHCCCEEEEEEECCCCCCCCCCCC Q ss_conf 99999973984999853155887677852 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNFRGIGRSEGEFDY 77 (225) Q Consensus 49 la~~l~~~G~~vl~fd~RG~G~S~G~~~~ 77 (225) .-+.|.+.|.+|+|+|| |.|++.+ T Consensus 22 ~l~~li~aG~nV~RlNf-----SHG~~e~ 45 (477) T COG0469 22 MLEKLIEAGMNVVRLNF-----SHGDHEE 45 (477) T ss_pred HHHHHHHCCCCEEEEEC-----CCCCHHH T ss_conf 99999970586898856-----8898689 No 258 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=35.05 E-value=34 Score=15.13 Aligned_cols=55 Identities=7% Similarity=0.044 Sum_probs=39.2 Q ss_pred HHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCC Q ss_conf 99999997398499985315588767785210238999998999986235467405888720368787645038 Q gi|254781093|r 48 QLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRR 121 (225) Q Consensus 48 ~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~ 121 (225) .-+..|.+.|..+|..|. |+|+ -.-....+.|+++.+|+.+-| ||.||...+++. T Consensus 242 ~R~~~L~~AGvDv~viDs-----shGh-------s~~vl~~ik~~k~~Yp~~~ii-------aGNVaT~~~a~~ 296 (476) T TIGR01302 242 ERAEALVEAGVDVIVIDS-----SHGH-------SIYVLDSIKKIKKTYPDLDII-------AGNVATAEQAKA 296 (476) T ss_pred HHHHHHHHCCCCEEEEEC-----CCCC-------CHHHHHHHHHHHHHCCEEEEE-------ECCCCCHHHHHH T ss_conf 999999965965899816-----6545-------378999999998638805799-------434411788988 No 259 >COG2513 PrpB PEP phosphonomutase and related enzymes [Carbohydrate transport and metabolism] Probab=35.04 E-value=34 Score=15.13 Aligned_cols=46 Identities=24% Similarity=0.264 Sum_probs=28.9 Q ss_pred HHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9999997398499985315588767785210238999998999986 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQS 94 (225) Q Consensus 49 la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~ 94 (225) -|+...+.||.++.-.--|+-.|-|..|.+..-.+++...+..+.+ T Consensus 30 sA~la~~aGF~al~~sg~~vA~slG~pD~~~~t~~e~~~~vrrI~~ 75 (289) T COG2513 30 SALLAERAGFKALYLSGAGVAASLGLPDLGITTLDEVLADARRITD 75 (289) T ss_pred HHHHHHHCCCEEEEECCHHHHHHCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 9999997697489725488898617986341659999999999986 No 260 >COG4567 Response regulator consisting of a CheY-like receiver domain and a Fis-type HTH domain [Signal transduction mechanisms / Transcription] Probab=33.47 E-value=36 Score=14.98 Aligned_cols=67 Identities=13% Similarity=0.184 Sum_probs=36.1 Q ss_pred CHHHHHHHHHHHHCCCEEEEEEECCCCC--CCC-CCCCCCH--HHH--HHHHHHHHHHHHCCCCCCEEEEEECH Q ss_conf 9999999999997398499985315588--767-7852102--389--99998999986235467405888720 Q gi|254781093|r 43 DNIVYQLFYLFQQRGFVSLRFNFRGIGR--SEG-EFDYGDG--ELS--DAAAALDWVQSLNPESKSCWIAGYSF 109 (225) Q Consensus 43 ~~~~~~la~~l~~~G~~vl~fd~RG~G~--S~G-~~~~g~~--E~~--D~~aa~~~l~~~~~~~~~i~l~G~S~ 109 (225) .+....|++++.++||.|..-.-..-+- ..- .+.+..- .+. -=.++++-+++..+|.+-++|.||+- T Consensus 19 ~~f~~~LaRa~e~RGf~v~~a~~~~eal~~art~~PayAvvDlkL~~gsGL~~i~~lr~~~~d~rivvLTGy~s 92 (182) T COG4567 19 TPFLRTLARAMERRGFAVVTAESVEEALAAARTAPPAYAVVDLKLGDGSGLAVIEALRERRADMRIVVLTGYAS 92 (182) T ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHCCCCCEEEEEECCHH T ss_conf 69999999998606862576144999999985179852889864068874689999982497643898845117 No 261 >pfam06792 UPF0261 Uncharacterized protein family (UPF0261). Probab=31.98 E-value=39 Score=14.83 Aligned_cols=27 Identities=22% Similarity=0.308 Sum_probs=17.6 Q ss_pred CCCCCHH--HHHHHHHHHHCCCEEEEEEE Q ss_conf 8998999--99999999973984999853 Q gi|254781093|r 39 GTMNDNI--VYQLFYLFQQRGFVSLRFNF 65 (225) Q Consensus 39 G~~~~~~--~~~la~~l~~~G~~vl~fd~ 65 (225) |+.|.|- +.-+.+.+.++|..|+..|. T Consensus 8 gT~DTKg~E~~ylr~~i~~~G~~v~~iDv 36 (403) T pfam06792 8 GTLDTKGEELRFLRDLIEAQGLEVLVIDV 36 (403) T ss_pred ECCCCCHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 70367839999999999987995899981 No 262 >PTZ00071 40S ribosomal protein S24; Provisional Probab=31.31 E-value=40 Score=14.76 Aligned_cols=47 Identities=30% Similarity=0.511 Sum_probs=26.4 Q ss_pred CCCEEEEECCCCCCCCCCC-CHHHHHHHHHHHHC-CCEEEEEEEC---CCCCCCC Q ss_conf 7879999578998888998-99999999999973-9849998531---5588767 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMN-DNIVYQLFYLFQQR-GFVSLRFNFR---GIGRSEG 73 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~-~~~~~~la~~l~~~-G~~vl~fd~R---G~G~S~G 73 (225) ..-++-++||. - .+-. +.+-..||+.+.-. -=.|+.|.|| |-|.|.| T Consensus 23 KQ~vvdvlHPg--~-~~vsK~eirekLAkmyK~~d~d~V~vfGfrT~FGGGkStG 74 (133) T PTZ00071 23 KQFVVEVLHPG--K-GTVSKEDIKKRLAKLYKVADENTIVLFGFKTKFGGGRTTG 74 (133) T ss_pred EEEEEEEECCC--C-CCCCHHHHHHHHHHHHCCCCCCEEEEECCEECCCCCCEEE T ss_conf 68899960499--8-9869999999999982889998799950272478963102 No 263 >PHA02519 plasmid partition protein SopA; Reviewed Probab=31.30 E-value=40 Score=14.76 Aligned_cols=31 Identities=13% Similarity=0.109 Sum_probs=23.9 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCEEEEEE-EC Q ss_conf 88889989999999999997398499985-31 Q gi|254781093|r 36 RFGGTMNDNIVYQLFYLFQQRGFVSLRFN-FR 66 (225) Q Consensus 36 ~~GG~~~~~~~~~la~~l~~~G~~vl~fd-~R 66 (225) .=||..-.-....||+.|+.+||-||.+| += T Consensus 114 ~KGGVGKTTTavnLA~~LAl~G~RVL~ID~lD 145 (387) T PHA02519 114 HKGGVYKTSSAVHTAQWLALQGHRVLLIEGND 145 (387) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 68877699999999999997699689995988 No 264 >pfam12531 DUF3731 DNA-K related protein. This domain family is found in bacteria, and is approximately 250 amino acids in length. There are two conserved sequence motifs: RPG and WRR. The proteins in this family are frequently annotated as DNA-K related proteins however there is little accompanying literature to confirm this. Probab=31.16 E-value=40 Score=14.75 Aligned_cols=61 Identities=11% Similarity=0.025 Sum_probs=36.8 Q ss_pred EEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEC------CCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 7761686550289999999999860779960799978------798475585899999999999973 Q gi|254781093|r 150 LIINGSNDTVATTSDVKDLVNKLMNQKGISITHKVIP------DANHFFIGKVDELINECAHYLDNS 210 (225) Q Consensus 150 LiIhG~~D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~------ga~H~f~~~~~~l~~~i~~fl~~~ 210 (225) -.+||+.+.+||++.+..|.+++....-...+...+. -.|.-..+-.+++.+.|.++|+.. T Consensus 147 ~p~YGs~h~Vvp~~~a~~wl~~lL~~dwkk~~~~afA~~qlaR~TgDR~rDl~~~~R~~v~~~L~~~ 213 (249) T pfam12531 147 TPFYGSAHNVVPPETAEQWLEALLALDWKKEEPAAFAAVQLARLTGDRARDLPEALRERVLEKLKAS 213 (249) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 7565762455699999999999984076646133899999988608955567999999999998737 No 265 >PRK07053 glutamine amidotransferase; Provisional Probab=30.96 E-value=40 Score=14.73 Aligned_cols=83 Identities=16% Similarity=0.111 Sum_probs=41.9 Q ss_pred CCEEEEEC-CCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCC----CCCCCCC-----CCHH-HHH-----HHHH Q ss_conf 87999957-89988889989999999999997398499985315588----7677852-----1023-899-----9998 Q gi|254781093|r 25 APIALILH-PHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGR----SEGEFDY-----GDGE-LSD-----AAAA 88 (225) Q Consensus 25 ~~~vv~~H-p~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~----S~G~~~~-----g~~E-~~D-----~~aa 88 (225) +.++|+-| ++-.- -.+.+.|.++|+..-.++... |+ +--.||. |..- .+| +..- T Consensus 3 k~ilvlqH~~~E~p---------G~i~~~l~~~g~~~~~~~~~~-~~~~p~~~~~~d~livlGGpms~~d~~~~Pwl~~e 72 (235) T PRK07053 3 KTAVAIRHVAFEDL---------GSFEQVLGERGYRVRYVDVGV-DDLEVLDALEPDLLVVLGGPIGVYDDALYPFLAPE 72 (235) T ss_pred CEEEEEECCCCCCC---------HHHHHHHHHCCCEEEEEECCC-CCCCCCCCCCCCEEEEECCCCCCCCCCCCCCHHHH T ss_conf 53999968998898---------699999997899599996788-98899996667999993899898877668558999 Q ss_pred HHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 9999862354674058887203687876450 Q gi|254781093|r 89 LDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 89 ~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) .+++++-... .+=++|.|+|+.+-..+.. T Consensus 73 ~~lIr~a~~~--~~PvLGIClG~QLlA~AlG 101 (235) T PRK07053 73 IALLRQRLAA--GLPTLGICLGAQLIARALG 101 (235) T ss_pred HHHHHHHHHC--CCCEEEECHHHHHHHHHHC T ss_conf 9999999986--9988998473899999709 No 266 >PRK10287 thiosulfate:cyanide sulfurtransferase; Provisional Probab=30.92 E-value=40 Score=14.72 Aligned_cols=31 Identities=13% Similarity=0.274 Sum_probs=20.3 Q ss_pred CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEE Q ss_conf 8778799995789988889989999999999997398499 Q gi|254781093|r 22 NPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSL 61 (225) Q Consensus 22 ~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl 61 (225) ....+++|+|+ .|.+. ...++.|.++||.-+ T Consensus 58 dK~~~IvvyC~-----sG~RS----~~A~~~L~~~Gy~nV 88 (104) T PRK10287 58 DKNDTVKVYCN-----AGRQS----GQAKEILSEMGYTHV 88 (104) T ss_pred CCCCCEEEECC-----CCHHH----HHHHHHHHHCCCCCE T ss_conf 77993899869-----98389----999999997699737 No 267 >pfam06309 Torsin Torsin. This family consists of several eukaryotic torsin proteins. Torsion dystonia is an autosomal dominant movement disorder characterized by involuntary, repetitive muscle contractions and twisted postures. The most severe early-onset form of dystonia has been linked to mutations in the human DYT1 (TOR1A) gene encoding a protein termed torsinA. While causative genetic alterations have been identified, the function of torsin proteins and the molecular mechanism underlying dystonia remain unknown. Phylogenetic analysis of the torsin protein family indicates these proteins share distant sequence similarity with the large and diverse family of (pfam00004) proteins. It has been suggested that torsins play a role in effectively managing protein folding and that possible breakdown in a neuroprotective mechanism that is, in part, mediated by torsins may be responsible for the neuronal dysfunction associated with dystonia. Probab=30.83 E-value=40 Score=14.71 Aligned_cols=38 Identities=21% Similarity=0.324 Sum_probs=24.5 Q ss_pred CCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEE Q ss_conf 88877879999578998888998999999999999739849 Q gi|254781093|r 20 STNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVS 60 (225) Q Consensus 20 ~~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~v 60 (225) ...+.+|.|+-+|+++ |+--|-+...+|+.|=..|... T Consensus 47 ~~~p~KpLVlSfHG~t---GtGKn~vs~liA~~Ly~~G~~S 84 (127) T pfam06309 47 NPKPRKPLVLSFHGWT---GTGKNFVAEIIADNLYRDGLRS 84 (127) T ss_pred CCCCCCCEEEEECCCC---CCCHHHHHHHHHHHHHHCCCCC T ss_conf 8999997488701899---9879899999999987543478 No 268 >TIGR02493 PFLA pyruvate formate-lyase 1-activating enzyme; InterPro: IPR012838 Members of this family are iron-sulphur proteins with a radical-SAM domain. A single glycine residue in 2.3.1.54 from EC, formate C-acetyltransferase (formate-pyruvate lyase), is oxidized to the corresponding radical by transfer of a proton from its CH2 to AdoMet with concomitant cleavage of the latter. The reaction requires Fe2+. The first stage is reduction of the AdoMet to give methionine and the 5'-deoxyadenosin-5-yl radical, which then abstracts a hydrogen radical from the glycine residue .; GO: 0043365 [formate-C-acetyltransferase]-activating enzyme activity. Probab=30.61 E-value=41 Score=14.69 Aligned_cols=40 Identities=15% Similarity=0.359 Sum_probs=23.6 Q ss_pred HHHHHHHHHHC-CCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 99999998607-7996079997879847558589999999999997 Q gi|254781093|r 165 VKDLVNKLMNQ-KGISITHKVIPDANHFFIGKVDELINECAHYLDN 209 (225) Q Consensus 165 ~~~~~~~l~~~-~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~ 209 (225) ..++++.+... +..++.++++|| |++..+++ +.+.+|++. T Consensus 152 tl~Fa~~L~~~~kP~WiRYVlVPG----yTD~~eDi-~~l~~fv~~ 192 (243) T TIGR02493 152 TLEFAKYLAKRNKPIWIRYVLVPG----YTDSEEDI-EALAEFVKT 192 (243) T ss_pred HHHHHHHHHHCCCCEEEEEEECCC----CCCCHHHH-HHHHHHHHH T ss_conf 589999999658988999986588----77998999-999999974 No 269 >TIGR01717 AMP-nucleosdse AMP nucleosidase; InterPro: IPR011271 This entry represents the AMP nucleosidase from proteobacteria but also including sequences from Corynebacterium, a Gram-positive organism. The Escherichia coli protein is the most well studied .. Probab=30.28 E-value=25 Score=16.00 Aligned_cols=39 Identities=31% Similarity=0.551 Sum_probs=19.9 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEE-EECHH Q ss_conf 39849998531558876778521023899999899998623546740588-87203 Q gi|254781093|r 56 RGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIA-GYSFG 110 (225) Q Consensus 56 ~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~-G~S~G 110 (225) .|=+.=.+|- |+|-|--+ -+-|..||+. | .+||| |||-| T Consensus 268 ~G~GITlvNI-GVGPSNAK------tITDHLAVLR------P---~~WlMiGHCaG 307 (486) T TIGR01717 268 DGDGITLVNI-GVGPSNAK------TITDHLAVLR------P---HAWLMIGHCAG 307 (486) T ss_pred CCCCEEEEEC-CCCCCCHH------HHHHHHHHCC------C---CCEEEECCCCC T ss_conf 6993289960-74886212------3541022118------8---74287222304 No 270 >cd01532 4RHOD_Repeat_1 Member of the Rhodanese Homology Domain superfamily, repeat 1. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 1st repeat which does not contain the putative catalytic Cys residue. Probab=30.08 E-value=42 Score=14.64 Aligned_cols=32 Identities=16% Similarity=0.340 Sum_probs=20.7 Q ss_pred CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEE Q ss_conf 778799995789988889989999999999997398499 Q gi|254781093|r 23 PNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSL 61 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl 61 (225) ...|++|+|+ +|... .....++.|.++||.-+ T Consensus 49 kd~~Ivvyc~-----~g~~~--~s~~Aa~~L~~~Gy~~V 80 (92) T cd01532 49 RDTPIVVYGE-----GGGED--LAPRAARRLSELGYTDV 80 (92) T ss_pred CCCEEEEEEC-----CCCCH--HHHHHHHHHHHCCCCCE T ss_conf 9982999969-----99962--99999999998699686 No 271 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=29.79 E-value=42 Score=14.61 Aligned_cols=12 Identities=8% Similarity=0.036 Sum_probs=5.9 Q ss_pred HHHHHHHCCCEE Q ss_conf 999999739849 Q gi|254781093|r 49 LFYLFQQRGFVS 60 (225) Q Consensus 49 la~~l~~~G~~v 60 (225) ++-.|.-.||.+ T Consensus 12 ~a~~l~lsgC~t 23 (219) T PRK10510 12 VSGALAVSGCTT 23 (219) T ss_pred HHHHHHHHCCCC T ss_conf 999987413567 No 272 >pfam05292 MCD Malonyl-CoA decarboxylase (MCD). This family consists of several eukaryotic malonyl-CoA decarboxylase (MLYCD) proteins. Malonyl-CoA, in addition to being an intermediate in the de novo synthesis of fatty acids, is an inhibitor of carnitine palmitoyltransferase I, the enzyme that regulates the transfer of long-chain fatty acyl-CoA into mitochondria, where they are oxidized. After exercise, malonyl-CoA decarboxylase participates with acetyl-CoA carboxylase in regulating the concentration of malonyl-CoA in liver and adipose tissue, as well as in muscle. Malonyl-CoA decarboxylase is regulated by AMP-activated protein kinase (AMPK). Probab=28.54 E-value=44 Score=14.47 Aligned_cols=42 Identities=19% Similarity=0.383 Sum_probs=31.4 Q ss_pred CCCCEEEEEECHHHH----HHHHHHHCCCCHHEEEECCCCCCCCCC Q ss_conf 467405888720368----787645038621330122757443332 Q gi|254781093|r 98 ESKSCWIAGYSFGAW----ISMQLLMRRPEINGFISVAPQPKSYDF 139 (225) Q Consensus 98 ~~~~i~l~G~S~Gg~----val~~a~~~p~i~~~v~isp~~~~~~~ 139 (225) ...+-.|.|-|||-+ |+-.+-...|.++.|+.+||.+....+ T Consensus 188 SN~Q~GL~GIsfGNfLIKrVV~eL~~E~P~LktF~TLSPIPGF~~W 233 (354) T pfam05292 188 SNCQPGLAGINLGNFLIKRVITLLKKDMPHVSTFATLSPIPGFMQW 233 (354) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHH T ss_conf 2665433456447789999999999758787325646886267999 No 273 >cd01979 Pchlide_reductase_N Pchlide_reductase_N: N protein of the NB protein complex of Protochlorophyllide (Pchlide)_reductase. Pchlide reductase catalyzes the reductive formation of chlorophyllide (chlide) from protochlorophyllide (pchlide) during biosynthesis of chlorophylls and bacteriochlorophylls. This group contains both the light-independent Pchlide reductase (DPOR) and light-dependent Pchlide reductase (LPOR). Angiosperms contain only LPOR, cyanobacteria, algae and gymnosperms contain both DPOR and LPOR, primitive anoxygenic photosynthetic bacteria contain only DPOR. NB is structurally similar to the FeMo protein of nitrogenase, forming an N2B2 heterotetramer. N and B are homologous to the FeMo alpha and beta subunits respectively. Also in common with nitrogenase in vitro DPOR activity requires ATP hydrolysis and dithoionite or ferredoxin as electron donor. The NB protein complex may serve as a catalytic site for Pchlide reduction similar to MoFe for nitrogen reduction. Probab=28.05 E-value=45 Score=14.42 Aligned_cols=80 Identities=20% Similarity=0.193 Sum_probs=43.2 Q ss_pred CEEEEECCCCCCCCCCCCHH----HHHHHHHHHHC-CCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 79999578998888998999----99999999973-98499985315588767785210238999998999986235467 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNI----VYQLFYLFQQR-GFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESK 100 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~----~~~la~~l~~~-G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~ 100 (225) |.+|++- ||.-.-+ +..+|+.|... |.-|+ +|||+|- +++|+. ||-.-+.|.+..+-+...+.+ T Consensus 88 p~~iFlv------gtCpseVIk~DLe~~A~rls~~~~v~Vv--~vs~sGi-ettFTQ--GEDa~LaAlvp~~P~~~~~~~ 156 (396) T cd01979 88 PSVIFLI------GSCTTEVIKMDLEGAAPRLSAEIGVPIL--VASASGL-DYTFTQ--GEDTVLAALVPRCPEKPSPER 156 (396) T ss_pred CCEEEEE------CCCCHHHHHHHHHHHHHHHHHCCCCEEE--EEECCCC-CCCHHH--HHHHHHHHHHHHCCCCCCCCC T ss_conf 9479997------2585777752499998875326795599--9526875-475767--999999999864743237888 Q ss_pred CEEEEEECHHHHHHHHH Q ss_conf 40588872036878764 Q gi|254781093|r 101 SCWIAGYSFGAWISMQL 117 (225) Q Consensus 101 ~i~l~G~S~Gg~val~~ 117 (225) ++.++| |+--.+.-+. T Consensus 157 ~LvlvG-sl~d~vedq~ 172 (396) T cd01979 157 SLVLVG-SLPDIVEDQL 172 (396) T ss_pred CEEEEE-ECCCHHHHHH T ss_conf 648997-0581689999 No 274 >TIGR02193 heptsyl_trn_I lipopolysaccharide heptosyltransferase I; InterPro: IPR011908 This family consists of examples of ADP-heptose:LPS heptosyltransferase I, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria .; GO: 0016757 transferase activity transferring glycosyl groups, 0009103 lipopolysaccharide biosynthetic process. Probab=27.22 E-value=47 Score=14.33 Aligned_cols=24 Identities=13% Similarity=0.336 Sum_probs=9.8 Q ss_pred EEEECC--CCCCCCHHHHHHHHHHHH Q ss_conf 776168--655028999999999986 Q gi|254781093|r 150 LIINGS--NDTVATTSDVKDLVNKLM 173 (225) Q Consensus 150 LiIhG~--~D~~vp~~~~~~~~~~l~ 173 (225) +++|++ +|..=|.+.=.++.+.|. T Consensus 208 vllHaTSR~dK~WP~~~W~~l~~~L~ 233 (359) T TIGR02193 208 VLLHATSRDDKKWPEERWRELARLLL 233 (359) T ss_pred EEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 98214450023586789999999970 No 275 >pfam12525 DUF3726 Protein of unknown function (DUF3726). This domain family is found in bacteria and eukaryotes, and is approximately 80 amino acids in length. There is a single completely conserved residue E that may be functionally important. Probab=27.05 E-value=47 Score=14.31 Aligned_cols=25 Identities=44% Similarity=0.783 Sum_probs=17.6 Q ss_pred ECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 31558876778521023899999899998623 Q gi|254781093|r 65 FRGIGRSEGEFDYGDGELSDAAAALDWVQSLN 96 (225) Q Consensus 65 ~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~ 96 (225) +||.|.|- |+.+|+..++.|++..+ T Consensus 15 ~~g~G~sw-------G~aee~a~~v~~Le~~g 39 (80) T pfam12525 15 FRGAGRSW-------GEAEEAANMVAWLEMHG 39 (80) T ss_pred HHHCCCCC-------CHHHHHHHHHHHHHHCC T ss_conf 97539981-------13999999999999959 No 276 >KOG2170 consensus Probab=26.88 E-value=47 Score=14.29 Aligned_cols=37 Identities=24% Similarity=0.362 Sum_probs=21.7 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEE Q ss_conf 8877879999578998888998999999999999739849 Q gi|254781093|r 21 TNPNAPIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVS 60 (225) Q Consensus 21 ~~~~~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~v 60 (225) ..+.+|.++=+|+++ |+--|-|...+|+-+-+.|-.+ T Consensus 105 ~~p~KPLvLSfHG~t---GTGKN~Va~iiA~n~~~~Gl~S 141 (344) T KOG2170 105 PNPRKPLVLSFHGWT---GTGKNYVAEIIAENLYRGGLRS 141 (344) T ss_pred CCCCCCEEEEECCCC---CCCHHHHHHHHHHHHHHCCCCC T ss_conf 999987589830899---8756489999999987511256 No 277 >cd01528 RHOD_2 Member of the Rhodanese Homology Domain superfamily, subgroup 2. Subgroup 2 includes uncharacterized putative rhodanese-related domains. Probab=26.82 E-value=47 Score=14.29 Aligned_cols=18 Identities=11% Similarity=0.051 Sum_probs=7.6 Q ss_pred HHHHHHHCCCEEEEEEEC Q ss_conf 999999739849998531 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNFR 66 (225) Q Consensus 49 la~~l~~~G~~vl~fd~R 66 (225) +.+.|.+..=.++.+|-| T Consensus 7 l~~~l~~~~~~~~liDVR 24 (101) T cd01528 7 LAEWLADEREEPVLIDVR 24 (101) T ss_pred HHHHHHCCCCCEEEEECC T ss_conf 999997399983999788 No 278 >PRK09767 hypothetical protein; Provisional Probab=26.66 E-value=48 Score=14.27 Aligned_cols=30 Identities=23% Similarity=0.361 Sum_probs=20.2 Q ss_pred HHHHHCCCEEEEEEECCCCCCCCCCCCCCHH-HHHHHHHHHHHHH Q ss_conf 9999739849998531558876778521023-8999998999986 Q gi|254781093|r 51 YLFQQRGFVSLRFNFRGIGRSEGEFDYGDGE-LSDAAAALDWVQS 94 (225) Q Consensus 51 ~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E-~~D~~aa~~~l~~ 94 (225) ..|.+.||.|+||.- .| ..++.+|++-+.+ T Consensus 77 ~~L~~~G~~VlRF~N--------------~dV~~~~~~Vl~~I~~ 107 (116) T PRK09767 77 SWLESQGWTVLRFWN--------------NEIDCNEEAVLEIILQ 107 (116) T ss_pred HHHHHCCCEEEEEEH--------------HHHHHHHHHHHHHHHH T ss_conf 999988997999768--------------9988789999999999 No 279 >KOG2323 consensus Probab=26.47 E-value=48 Score=14.25 Aligned_cols=47 Identities=21% Similarity=0.374 Sum_probs=27.9 Q ss_pred CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHH Q ss_conf 879999578998888998999999999999739849998531558876778521023899 Q gi|254781093|r 25 APIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSD 84 (225) Q Consensus 25 ~~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D 84 (225) +-+-++|-= |-..+ ....-+.|..+|.++.|+|| |.|++++ ..|+.| T Consensus 21 ~~t~ivcti-----gP~s~--s~e~l~~m~~aGmniarlNf-----ShGs~~~-h~~tid 67 (501) T KOG2323 21 RKTKIVCTI-----GPASR--SVEMLRKLLKAGMNIARLNF-----SHGSHEY-HQETID 67 (501) T ss_pred CCEEEEECC-----CCCCC--HHHHHHHHHHCCCCEEEEEC-----CCCCHHH-HHHHHH T ss_conf 100147516-----78656--09999999977885799876-----8897678-999998 No 280 >cd07228 Pat_NTE_like_bacteria Bacterial patatin-like phospholipase domain containing protein 6. Bacterial patatin-like phospholipase domain containing protein 6. PNPLA6 is commonly known as Neuropathy Target Esterase (NTE). NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. This group includes YCHK and rssA from Escherichia coli as well as Ylbk from Bacillus amyloliquefaciens. Probab=26.34 E-value=48 Score=14.23 Aligned_cols=35 Identities=23% Similarity=0.176 Sum_probs=20.7 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCC Q ss_conf 9989999862354674058887203687876450386 Q gi|254781093|r 86 AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRP 122 (225) Q Consensus 86 ~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p 122 (225) .-+++++++.+.+.+ .+.|-|.|+.+++.++..+. T Consensus 16 ~Gvl~~L~e~gi~~d--~i~GtSaGAi~aa~~a~g~~ 50 (175) T cd07228 16 IGVLRALEEEGIEID--IIAGSSIGALVGALYAAGHL 50 (175) T ss_pred HHHHHHHHHCCCCCC--EEEEECHHHHHHHHHHCCCC T ss_conf 999999998699867--89864489999999981898 No 281 >PRK06207 aspartate aminotransferase; Provisional Probab=25.93 E-value=49 Score=14.19 Aligned_cols=39 Identities=15% Similarity=0.145 Sum_probs=26.2 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 799995789988889989999999999997398499985 Q gi|254781093|r 26 PIALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFN 64 (225) Q Consensus 26 ~~vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd 64 (225) ..+|+|-||-.-|..+....+..+++...+.|+.++.=+ T Consensus 181 k~iil~nP~NPTG~v~s~e~l~~l~~la~~~~i~visDE 219 (406) T PRK06207 181 RVFLFSNPNNPAGVVYSPEEIAQIAALARRYGATVIADQ 219 (406) T ss_pred EEEEECCCCCCCCHHCCHHHHHHHHHHHHHCCEEEEEEC T ss_conf 699988997985721339999999998755795999964 No 282 >pfam01656 CbiA CobQ/CobB/MinD/ParA nucleotide binding domain. This family consists of various cobyrinic acid a,c-diamide synthases. These include CbiA and CbiP from S.typhimurium, and CobQ from R. capsulatus. These amidases catalyse amidations to various side chains of hydrogenobyrinic acid or cobyrinic acid a,c-diamide in the biosynthesis of cobalamin (vitamin B12) from uroporphyrinogen III. Vitamin B12 is an important cofactor and an essential nutrient for many plants and animals and is primarily produced by bacteria. The family also contains dethiobiotin synthetases as well as the plasmid partitioning proteins of the MinD/ParA family. Probab=25.87 E-value=49 Score=14.18 Aligned_cols=29 Identities=21% Similarity=0.398 Sum_probs=22.7 Q ss_pred CCCCCCHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 88998999999999999739849998531 Q gi|254781093|r 38 GGTMNDNIVYQLFYLFQQRGFVSLRFNFR 66 (225) Q Consensus 38 GG~~~~~~~~~la~~l~~~G~~vl~fd~R 66 (225) ||.--.-+...+|..|++.|+.|+.+|.- T Consensus 8 GGVGKTT~a~nLA~~la~~G~rVlliD~D 36 (212) T pfam01656 8 GGVGKTTLAANLARALAKRGYRVLLIDLD 36 (212) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 98069999999999999789978998389 No 283 >PRK10437 carbonic anhydrase; Provisional Probab=25.68 E-value=50 Score=14.16 Aligned_cols=68 Identities=12% Similarity=0.059 Sum_probs=36.6 Q ss_pred CCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHH Q ss_conf 8998999999999999739849998531558876778521023899999899998623546740588872036878764 Q gi|254781093|r 39 GTMNDNIVYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQL 117 (225) Q Consensus 39 G~~~~~~~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~ 117 (225) |+.|.|+. ...+....=+-+ |=.|..|.=--..| ..+.++++|+...- ..+.|+++|||-=|.+...+ T Consensus 41 gCsDSRV~---p~~I~~~~PGel-FV~RNIaNlV~~~d------~~~~aaIEYAV~~L-~V~~IIVcGHs~CGgv~A~l 108 (220) T PRK10437 41 GCSDSRVP---AERLTGLEPGEL-FVHRNVANLVIHTD------LNCLSVVQYAVDVL-EVEHIIICGHYGCGGVQAAV 108 (220) T ss_pred EECCCCCC---HHHHCCCCCCCE-EEEEECCCCCCCCC------CHHHHHHHHHHHHC-CCCEEEEECCCCCCHHHHHH T ss_conf 73377889---999738998667-89963345478888------40899999999974-98879995587640578886 No 284 >PRK08250 glutamine amidotransferase; Provisional Probab=25.59 E-value=50 Score=14.15 Aligned_cols=82 Identities=17% Similarity=0.276 Sum_probs=39.7 Q ss_pred EEEEEC-CCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE-CCCCCC--CCCCCC-----CC-H--HH-HH-----HHHH Q ss_conf 999957-899888899899999999999973984999853-155887--677852-----10-2--38-99-----9998 Q gi|254781093|r 27 IALILH-PHPRFGGTMNDNIVYQLFYLFQQRGFVSLRFNF-RGIGRS--EGEFDY-----GD-G--EL-SD-----AAAA 88 (225) Q Consensus 27 ~vv~~H-p~p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~-RG~G~S--~G~~~~-----g~-~--E~-~D-----~~aa 88 (225) +.|+-| |+-+-| .+...|.++|+.+-..+. +|---. -..||. |. + +. ++ +..- T Consensus 3 ilviqH~~~E~pG---------~i~~~l~~~G~~~~~~~~~~g~~lP~~~~~~d~liVlGGpms~~~~~e~~p~l~l~~E 73 (235) T PRK08250 3 VHFIIHESFESPG---------AYLKWAEDRGYTISWSRVYAGEALPENADGFDMLIVLGGPQSPRTTREECPYFDSKAE 73 (235) T ss_pred EEEEECCCCCCCC---------HHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCHHHH T ss_conf 9999818998970---------7899999789969999668999899981025889997899877773212787470999 Q ss_pred HHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 9999862354674058887203687876450 Q gi|254781093|r 89 LDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 89 ~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) .+++++......+ ++|.|+|+.+-..+.. T Consensus 74 ~~lir~a~~~~~P--vlGIClG~QliA~AlG 102 (235) T PRK08250 74 QHLINQAITARKA--VVGVCLGSQLIGEALG 102 (235) T ss_pred HHHHHHHHHCCCC--EEEEEHHHHHHHHHCC T ss_conf 9999999986998--8998167799999809 No 285 >KOG4207 consensus Probab=25.48 E-value=40 Score=14.74 Aligned_cols=24 Identities=38% Similarity=0.567 Sum_probs=15.9 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 3984999853155887677852102389999989999 Q gi|254781093|r 56 RGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWV 92 (225) Q Consensus 56 ~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l 92 (225) .||+-++|.|+ -+++|+.+++|++ T Consensus 55 RgFaFVrf~~k-------------~daedA~damDG~ 78 (256) T KOG4207 55 RGFAFVRFHDK-------------RDAEDALDAMDGA 78 (256) T ss_pred CCEEEEEEEEC-------------CHHHHHHHHHCCE T ss_conf 65068986411-------------1177899752350 No 286 >pfam10561 UPF0565 Uncharacterized protein family UPF0565. This family of proteins has no known function. Probab=25.43 E-value=45 Score=14.40 Aligned_cols=23 Identities=22% Similarity=0.346 Sum_probs=16.7 Q ss_pred CCCCEEEEEECHHHHHHHHHHHC Q ss_conf 46740588872036878764503 Q gi|254781093|r 98 ESKSCWIAGYSFGAWISMQLLMR 120 (225) Q Consensus 98 ~~~~i~l~G~S~Gg~val~~a~~ 120 (225) +..++.|+|||-|..|--++... T Consensus 187 ~~~~l~LIGFSKGCvVLNQll~E 209 (299) T pfam10561 187 DKPKLVLIGFSKGCVVLNQLIYE 209 (299) T ss_pred CCCCEEEEEECCCHHHHHHHHHH T ss_conf 67986999847625799999999 No 287 >COG1752 RssA Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only] Probab=25.17 E-value=51 Score=14.10 Aligned_cols=24 Identities=17% Similarity=0.405 Sum_probs=14.1 Q ss_pred HHHHHHHHCCCEEEEEEECCCCCCCCCC Q ss_conf 9999999739849998531558876778 Q gi|254781093|r 48 QLFYLFQQRGFVSLRFNFRGIGRSEGEF 75 (225) Q Consensus 48 ~la~~l~~~G~~vl~fd~RG~G~S~G~~ 75 (225) -..++|.+.| +.||.= +|-|.|.. T Consensus 28 GVl~aL~e~g---i~~~~i-aGtS~GAi 51 (306) T COG1752 28 GVLKALEEAG---IPIDVI-AGTSAGAI 51 (306) T ss_pred HHHHHHHHCC---CCCCEE-EEECHHHH T ss_conf 9999999769---998879-87388899 No 288 >TIGR00873 gnd 6-phosphogluconate dehydrogenase, decarboxylating; InterPro: IPR006113 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) , . Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved . The protein is a homodimer in which the monomers act independently : each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet . NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket . This model does not specify whether the cofactor is NADP only, NAD only, or both.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0050661 NADP binding, 0006098 pentose-phosphate shunt. Probab=24.60 E-value=50 Score=14.13 Aligned_cols=70 Identities=19% Similarity=0.182 Sum_probs=42.2 Q ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHCC----C-CCEEEEEECCCCCCC---CC---C-HHHHHHHHHHHHHHHCCCC Q ss_conf 773776168655028999999999986077----9-960799978798475---58---5-8999999999999731221 Q gi|254781093|r 147 SSGLIINGSNDTVATTSDVKDLVNKLMNQK----G-ISITHKVIPDANHFF---IG---K-VDELINECAHYLDNSLDEK 214 (225) Q Consensus 147 ~p~LiIhG~~D~~vp~~~~~~~~~~l~~~~----~-~~~~~~~i~ga~H~f---~~---~-~~~l~~~i~~fl~~~L~~~ 214 (225) -|-|.=-|+.+- .+.+..+++++..+. + ...+|+==.||+||- || + .-+|.-+.=.||++.|+-+ T Consensus 136 GPSiMPGG~~~A---y~~~~PIl~~IAAkv~dPdg~PCC~wiG~~GAGHyVKMVHNGIEYGDMQLI~EAY~~lK~~L~L~ 212 (480) T TIGR00873 136 GPSIMPGGSAEA---YPLVAPILQKIAAKVDDPDGEPCCTWIGPDGAGHYVKMVHNGIEYGDMQLIAEAYDILKEGLGLS 212 (480) T ss_pred CCCCCCCCCHHH---HHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 887788879878---99996788765542168532942243786798430532576711777989999999998651888 Q ss_pred CCCCC Q ss_conf 00124 Q gi|254781093|r 215 FTLLK 219 (225) Q Consensus 215 ~~~~~ 219 (225) .+-.. T Consensus 213 ~~~i~ 217 (480) T TIGR00873 213 NEEIA 217 (480) T ss_pred HHHHH T ss_conf 88999 No 289 >PRK12815 carB carbamoyl phosphate synthase large subunit; Reviewed Probab=24.57 E-value=52 Score=14.03 Aligned_cols=68 Identities=16% Similarity=0.224 Sum_probs=42.9 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC---CHH---HHHHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 999999999739849998531558876778521---023---89999989999862354674058887203687876450 Q gi|254781093|r 46 VYQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYG---DGE---LSDAAAALDWVQSLNPESKSCWIAGYSFGAWISMQLLM 119 (225) Q Consensus 46 ~~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g---~~E---~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~ 119 (225) .+..+++|.+.||-++..|+.---=|+ +||.. .-| .+|+...++ ...++ -+++ +|||..++.+|. T Consensus 577 ~vha~~aLr~~G~etImiN~NPETVST-DyD~sDrLYFEPlt~E~V~~I~~---~E~p~--gViv---qfGGQt~lnla~ 647 (1068) T PRK12815 577 CVHAAFALKKEGYETIMINNNPETVST-DYDTADRLYFEPITLEDILNVAE---AENIK--GVIV---QFGGQTAINLAK 647 (1068) T ss_pred HHHHHHHHHHCCCEEEEECCCCCCCCC-CCCCCCCEEECCCCHHHHHHHHH---HHCCC--EEEE---ECCCCCHHHHHH T ss_conf 899999999689748996278431236-74557735753687999999999---85899--7999---648823679999 Q ss_pred CCC Q ss_conf 386 Q gi|254781093|r 120 RRP 122 (225) Q Consensus 120 ~~p 122 (225) ... T Consensus 648 ~L~ 650 (1068) T PRK12815 648 PLE 650 (1068) T ss_pred HHH T ss_conf 999 No 290 >COG1225 Bcp Peroxiredoxin [Posttranslational modification, protein turnover, chaperones] Probab=24.37 E-value=53 Score=14.01 Aligned_cols=60 Identities=22% Similarity=0.257 Sum_probs=34.3 Q ss_pred CEEEECCCCCEEEEEEECCCCCCCCEEEEECCCCC-CCCCCCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 70896589961879996688877879999578998-8889989999999999997398499985 Q gi|254781093|r 2 PEVVFNGPSGRLEGRYQPSTNPNAPIALILHPHPR-FGGTMNDNIVYQLFYLFQQRGFVSLRFN 64 (225) Q Consensus 2 ~~v~i~g~~G~l~~~~~~~~~~~~~~vv~~Hp~p~-~GG~~~~~~~~~la~~l~~~G~~vl~fd 64 (225) |+..+|+.+|.. +...+-..+++|++..|--. -|.+..-..+++....|.+.|..|+-.. T Consensus 11 PdF~Lp~~~g~~---v~Lsd~~Gk~VVLyFYPk~~TpgCT~Ea~~Frd~~~ef~~~~a~V~GIS 71 (157) T COG1225 11 PDFELPDQDGET---VSLSDLRGKPVVLYFYPKDFTPGCTTEACDFRDLLEEFEKLGAVVLGIS 71 (157) T ss_pred CCEEEECCCCCE---EEHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 875835699999---8548836982899987889998516999999998999986797899980 No 291 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=24.12 E-value=53 Score=13.98 Aligned_cols=27 Identities=19% Similarity=0.329 Sum_probs=11.7 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCC Q ss_conf 99999999973984999853155887677 Q gi|254781093|r 46 VYQLFYLFQQRGFVSLRFNFRGIGRSEGE 74 (225) Q Consensus 46 ~~~la~~l~~~G~~vl~fd~RG~G~S~G~ 74 (225) +..+|+.|.+.+...++.. ||-++.|+ T Consensus 71 L~~~A~~L~~~p~~~v~I~--GHTD~~G~ 97 (160) T PRK09967 71 IQTMAAKLASTGLTHARMD--GHTDNYGE 97 (160) T ss_pred HHHHHHHHHHCCCCEEEEE--EECCCCCC T ss_conf 9999999997899629998--74799899 No 292 >PRK04155 chaperone protein HchA; Provisional Probab=23.80 E-value=34 Score=15.12 Aligned_cols=27 Identities=22% Similarity=0.212 Sum_probs=12.3 Q ss_pred CCCCHHHHHHHH--HHHHCCCEEEEEEECC Q ss_conf 998999999999--9997398499985315 Q gi|254781093|r 40 TMNDNIVYQLFY--LFQQRGFVSLRFNFRG 67 (225) Q Consensus 40 ~~~~~~~~~la~--~l~~~G~~vl~fd~RG 67 (225) +-|++| ..|-. -|.+.||..=.....| T Consensus 73 TGNHPV-E~llPm~HL~~AGF~~dvaT~sG 101 (288) T PRK04155 73 TGNHPV-ETLLPMYHLDKAGFEFDVATLSG 101 (288) T ss_pred CCCCCH-HHHHHHHHHHHCCCCEEEECCCC T ss_conf 689608-88777888875687268852789 No 293 >COG4822 CbiK Cobalamin biosynthesis protein CbiK, Co2+ chelatase [Coenzyme metabolism] Probab=23.63 E-value=54 Score=13.92 Aligned_cols=123 Identities=11% Similarity=0.187 Sum_probs=58.0 Q ss_pred HHHHHHHHHHHHHHCC---CCCCEEEEEE-----CHHHHHHHHHHHCCCCHHEE--EECCCCCCCCC-CCCCC-----CC Q ss_conf 8999998999986235---4674058887-----20368787645038621330--12275744333-22234-----33 Q gi|254781093|r 82 LSDAAAALDWVQSLNP---ESKSCWIAGY-----SFGAWISMQLLMRRPEINGF--ISVAPQPKSYD-FSFLA-----PC 145 (225) Q Consensus 82 ~~D~~aa~~~l~~~~~---~~~~i~l~G~-----S~Gg~val~~a~~~p~i~~~--v~isp~~~~~~-~~~l~-----~~ 145 (225) .+|....++-++++.| ....++++|| |+-++..+....+..+...+ .++-..+..-. ...|. .+ T Consensus 117 k~DYe~~v~aik~~~ppl~k~e~~vlmgHGt~h~s~~~YacLd~~~~~~~f~~v~v~~ve~yP~~d~vi~~l~~~~~~~v 196 (265) T COG4822 117 KNDYEICVEAIKDQIPPLNKDEILVLMGHGTDHHSNAAYACLDHVLDEYGFDNVFVAAVEGYPLVDTVIEYLRKNGIKEV 196 (265) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHCCCCEE T ss_conf 10599999999976688676727999835887517789999999998648885699982478758999999997687648 Q ss_pred -CCCEEEEECCC---CCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf -47737761686---550289999999999860779960799978798475585899999999999973122 Q gi|254781093|r 146 -PSSGLIINGSN---DTVATTSDVKDLVNKLMNQKGISITHKVIPDANHFFIGKVDELINECAHYLDNSLDE 213 (225) Q Consensus 146 -~~p~LiIhG~~---D~~vp~~~~~~~~~~l~~~~~~~~~~~~i~ga~H~f~~~~~~l~~~i~~fl~~~L~~ 213 (225) -.|.+++-|+. |.--.-+ ..|...+. +.|..++.. +.|-+ ..+++.....+.|+..++. T Consensus 197 ~L~PlMlvAG~Ha~nDMasdde--dswk~il~-~~G~~v~~~-l~GLG-----E~~~iq~ifi~Hik~aie~ 259 (265) T COG4822 197 HLIPLMLVAGDHAKNDMASDDE--DSWKNILE-KNGFKVEVY-LHGLG-----ENPAIQAIFIDHIKDAIER 259 (265) T ss_pred EEEEEEEEECHHHHHHHCCCCH--HHHHHHHH-HCCCEEEEE-EECCC-----CCHHHHHHHHHHHHHHHHH T ss_conf 8865688604022321112634--78999998-679546887-60577-----7678999999999999853 No 294 >PRK07411 hypothetical protein; Validated Probab=22.64 E-value=57 Score=13.81 Aligned_cols=16 Identities=13% Similarity=0.287 Sum_probs=9.4 Q ss_pred HHHHHHHHCCCEEEEE Q ss_conf 9999999739849998 Q gi|254781093|r 48 QLFYLFQQRGFVSLRF 63 (225) Q Consensus 48 ~la~~l~~~G~~vl~f 63 (225) .++..|++.|++-+.. T Consensus 52 p~~~yLaaaGvG~i~i 67 (390) T PRK07411 52 PLLLYLAAAGIGRIGI 67 (390) T ss_pred HHHHHHHHCCCCEEEE T ss_conf 9999999838975999 No 295 >cd03802 GT1_AviGT4_like This family is most closely related to the GT1 family of glycosyltransferases. aviGT4 in Streptomyces viridochromogenes has been shown to be involved in biosynthesis of oligosaccharide antibiotic avilamycin A. Inactivation of aviGT4 resulted in a mutant that accumulated a novel avilamycin derivative lacking the terminal eurekanate residue. Probab=22.57 E-value=57 Score=13.80 Aligned_cols=37 Identities=35% Similarity=0.523 Sum_probs=25.7 Q ss_pred EEEEECCC-----CCCCCCCCCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 99995789-----9888899899999999999973984999853 Q gi|254781093|r 27 IALILHPH-----PRFGGTMNDNIVYQLFYLFQQRGFVSLRFNF 65 (225) Q Consensus 27 ~vv~~Hp~-----p~~GG~~~~~~~~~la~~l~~~G~~vl~fd~ 65 (225) +++++..+ |.+||. .+.+..+++.|+++|+.|..+.. T Consensus 3 I~~v~~~~~p~pP~~~GG~--e~~~~~La~~L~~~Gh~V~v~~~ 44 (335) T cd03802 3 IALVAPPREPVPPPAYGGT--ERVVAALTEGLVARGHEVTLFAS 44 (335) T ss_pred EEEECCCCCCCCCCCCCCH--HHHHHHHHHHHHHCCCEEEEEEC T ss_conf 6998884003699998979--99999999999976998999962 No 296 >TIGR02080 O_succ_thio_ly O-succinylhomoserine (thiol)-lyase; InterPro: IPR011821 Proteins in this entry are important in the interconversion of the sulphur-containing amino acids L-cysteine and L-methionine in bacteria. In bacteria, inorganic sulphur is reduced to sulphide, which is then used to displace the acetyl group of O-acetyl-L-serine to form the essential amino acid L-cysteine. Biosynthesis of L-methionine then occurs via the transsulphuration pathway which involves the sequential action of two pyridoxal 5'-phosphate (PLP) enzymes, cystathionine gamma-synthase (CGS) and cystathionine beta-lyase (CBL). Firstly CGS catalyses the formation of the intermediate cystathione from a homoserine ester and L-cysteine. This intermediate is then converted by CBL to L-homoserine, which is subsequently methylated by methionine synthase to form L-methionine. For more information see . The CGS enzyme (P00935 from SWISSPROT) from Escherichia coli is, like other CGS enzymes, a homotetramer composed of two catalytically active dimers . The active site of each dimer is found at the interface between the monomers and involves residues from each monomer. Catalysis occurs by alpha,gamma-elimination/replacement reactions.. Probab=22.51 E-value=57 Score=13.79 Aligned_cols=29 Identities=34% Similarity=0.797 Sum_probs=21.2 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEE Q ss_conf 999578998888998999999999999739849998 Q gi|254781093|r 28 ALILHPHPRFGGTMNDNIVYQLFYLFQQRGFVSLRF 63 (225) Q Consensus 28 vv~~Hp~p~~GG~~~~~~~~~la~~l~~~G~~vl~f 63 (225) +|+. ||=.|||| ++|-..++++|.--++| T Consensus 92 LlvA-PHDCYGGt------YRLl~~lA~kG~~~v~f 120 (383) T TIGR02080 92 LLVA-PHDCYGGT------YRLLNALAKKGQFKVQF 120 (383) T ss_pred EEEE-CCCCCCCH------HHHHHHHHHCCCEEEEE T ss_conf 6880-43267704------88886452078338998 No 297 >TIGR02195 heptsyl_trn_II lipopolysaccharide heptosyltransferase II; InterPro: IPR011910 This family consists of examples of ADP-heptose:LPS heptosyltransferase II, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria.; GO: 0016757 transferase activity transferring glycosyl groups, 0009103 lipopolysaccharide biosynthetic process. Probab=22.42 E-value=58 Score=13.78 Aligned_cols=114 Identities=18% Similarity=0.169 Sum_probs=57.5 Q ss_pred CCCCEEEEECCCCCCCC--CCCCHHHHHHHHHHHHCC-CEEEEEEECCCCCCCCCCCCCCHHHHH-HHHHHHHHHHHCCC Q ss_conf 77879999578998888--998999999999999739-849998531558876778521023899-99989999862354 Q gi|254781093|r 23 PNAPIALILHPHPRFGG--TMNDNIVYQLFYLFQQRG-FVSLRFNFRGIGRSEGEFDYGDGELSD-AAAALDWVQSLNPE 98 (225) Q Consensus 23 ~~~~~vv~~Hp~p~~GG--~~~~~~~~~la~~l~~~G-~~vl~fd~RG~G~S~G~~~~g~~E~~D-~~aa~~~l~~~~~~ 98 (225) +.+|++.+|. -.-||= .+=+..+..||+.+-+.| |.|..|= |.-+ .-. .+| ..+...-..++. T Consensus 188 P~rP~ialCP-GAEfGpAKRWP~~HyA~LA~~~~~~Ggy~V~lFG------S~kD--~~~--~~~fI~~~~~~~~~~~-- 254 (361) T TIGR02195 188 PERPIIALCP-GAEFGPAKRWPAEHYAELAKKLIAQGGYQVVLFG------SKKD--KPV--GDEFIEALAPGELREY-- 254 (361) T ss_pred CCCCEEEECC-CCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEC------CCCC--HHH--HHHHHHHHHHHHHHHH-- T ss_conf 7887687587-7667755678538999999999870691899842------8455--789--9999998634468899-- Q ss_pred CCCEEEEE-ECHHHHHHHHHHHCCCCHHEEEECCCCCCCCCCC--CC-CCCCCCEEEEECCCCCCCCHH Q ss_conf 67405888-7203687876450386213301227574433322--23-433477377616865502899 Q gi|254781093|r 99 SKSCWIAG-YSFGAWISMQLLMRRPEINGFISVAPQPKSYDFS--FL-APCPSSGLIINGSNDTVATTS 163 (225) Q Consensus 99 ~~~i~l~G-~S~Gg~val~~a~~~p~i~~~v~isp~~~~~~~~--~l-~~~~~p~LiIhG~~D~~vp~~ 163 (225) .++ |+| -|+-=+|.+ |+.+-++...|-. +. +.+..|+--++|..|.-..|- T Consensus 255 ~~N--LaG~T~L~EAvdL------------ia~a~avV~NDSGLMHVAAAL~rPlVAlYGsTsP~fTPP 309 (361) T TIGR02195 255 CDN--LAGETSLDEAVDL------------IALAKAVVSNDSGLMHVAAALNRPLVALYGSTSPDFTPP 309 (361) T ss_pred HHH--CCCCCCHHHHHHH------------HHHCCEEEECCHHHHHHHHHCCCCEEEEECCCCCCCCCC T ss_conf 741--0478888899999------------871560121560679999962798899736856888887 No 298 >TIGR01753 flav_short flavodoxin; InterPro: IPR010087 Flavodoxins are small redox-active proteins with a flavin mononucleotide (FMN) prosthetic group. They function as electron transfer agents in a variety of microbial metabolic processes, including nitrogen fixation by nitrogenase , sulphite reduction , and light-dependent NADP+ reduction during photosynthesis . This entry describes the short chain type. Many of these are involved in sulphite reduction.; GO: 0010181 FMN binding, 0006118 electron transport. Probab=22.41 E-value=58 Score=13.78 Aligned_cols=71 Identities=17% Similarity=0.133 Sum_probs=42.9 Q ss_pred CCCCCHHHH-HHHHHHHHCCCEEEEEEECCCCC----C--C----CCCCCCCH--H-HHHHHHHHHHHHHHCCCCCCEEE Q ss_conf 899899999-99999997398499985315588----7--6----77852102--3-89999989999862354674058 Q gi|254781093|r 39 GTMNDNIVY-QLFYLFQQRGFVSLRFNFRGIGR----S--E----GEFDYGDG--E-LSDAAAALDWVQSLNPESKSCWI 104 (225) Q Consensus 39 G~~~~~~~~-~la~~l~~~G~~vl~fd~RG~G~----S--~----G~~~~g~~--E-~~D~~aa~~~l~~~~~~~~~i~l 104 (225) +|-|.-.+. .+++.|...|.-|=+++-.=.-. + + |.++.|.+ | -+|..--++-+.+.....+++++ T Consensus 8 ~TGNTE~~A~~I~e~l~~~g~~V~~~~~~d~d~~~~~~~~D~~~~G~~t~Gd~~Lel~ddf~pF~e~~~~~~~~gK~va~ 87 (146) T TIGR01753 8 MTGNTEEMANIIEEGLKEAGADVDLLEVADVDKEDLLSGYDVVLLGCSTWGDEDLELQDDFEPFFEELEDIDLGGKKVAL 87 (146) T ss_pred CCCHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 88128999999999998568862578520178012446588788713647988997442354689889987407977999 Q ss_pred EEECHH Q ss_conf 887203 Q gi|254781093|r 105 AGYSFG 110 (225) Q Consensus 105 ~G~S~G 110 (225) +| |+| T Consensus 88 FG-Sgd 92 (146) T TIGR01753 88 FG-SGD 92 (146) T ss_pred EE-CCC T ss_conf 70-688 No 299 >TIGR00456 argS arginyl-tRNA synthetase; InterPro: IPR001278 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Arginyl-tRNA synthetase (6.1.1.19 from EC) has been crystallized and preliminary X-ray crystallographic analysis of yeast arginyl-tRNA synthetase-yeast tRNAArg complexes is available . ; GO: 0000166 nucleotide binding, 0004814 arginine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006420 arginyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=22.37 E-value=58 Score=13.77 Aligned_cols=42 Identities=17% Similarity=0.243 Sum_probs=29.5 Q ss_pred EEEEEC--CCC---CCCCCCCCHHH-HHHHHHHHHCCCEEEEEEECCC Q ss_conf 999957--899---88889989999-9999999973984999853155 Q gi|254781093|r 27 IALILH--PHP---RFGGTMNDNIV-YQLFYLFQQRGFVSLRFNFRGI 68 (225) Q Consensus 27 ~vv~~H--p~p---~~GG~~~~~~~-~~la~~l~~~G~~vl~fd~RG~ 68 (225) -+|+=| |+| .|=|+..|-++ ..|++.|.-+||.|.|-||=|= T Consensus 119 ki~~EfsSaNp~gplH~GHlR~aiIGD~L~r~l~~~G~~V~r~yyVND 166 (600) T TIGR00456 119 KIIIEFSSANPAGPLHIGHLRNAIIGDSLARILEFLGYDVIREYYVND 166 (600) T ss_pred EEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 799987658889761100002577788999999875970798520054 No 300 >pfam09989 DUF2229 CoA enzyme activase uncharacterized domain (DUF2229). Members of this family include various bacterial hypothetical proteins, as well as CoA enzyme activases. The exact function of this domain has not, as yet, been defined. Probab=22.07 E-value=59 Score=13.74 Aligned_cols=17 Identities=24% Similarity=0.309 Sum_probs=12.8 Q ss_pred HHHHHHHCCCEEEEEEE Q ss_conf 99999973984999853 Q gi|254781093|r 49 LFYLFQQRGFVSLRFNF 65 (225) Q Consensus 49 la~~l~~~G~~vl~fd~ 65 (225) +...|.++|+.|+..+. T Consensus 17 w~~fF~~LG~~VV~S~~ 33 (218) T pfam09989 17 WFTFFTELGFEVVLSPP 33 (218) T ss_pred HHHHHHHCCCEEEECCC T ss_conf 99999985967998998 No 301 >PRK02399 hypothetical protein; Provisional Probab=21.95 E-value=59 Score=13.72 Aligned_cols=27 Identities=15% Similarity=0.244 Sum_probs=17.2 Q ss_pred CCCCCHH--HHHHHHHHHHCCCEEEEEEE Q ss_conf 8998999--99999999973984999853 Q gi|254781093|r 39 GTMNDNI--VYQLFYLFQQRGFVSLRFNF 65 (225) Q Consensus 39 G~~~~~~--~~~la~~l~~~G~~vl~fd~ 65 (225) |+.|.|- +.-+.+.+.++|..++..|. T Consensus 9 gT~DTK~~E~~ylr~~i~~~G~~v~~iDv 37 (407) T PRK02399 9 GTLDTKGEELAYVKDLIEAAGLEVVTVDV 37 (407) T ss_pred ECCCCCHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 72477829999999999986996799983 No 302 >cd07025 Peptidase_S66 LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein. LD-carboxypeptidase (Muramoyltetrapeptide carboxypeptidase; EC 3.4.17.13; Merops family S66; initially described as Carboxypeptidase II) family also includes the microcin c7 self-immunity protein (MccF) as well as uncharacterized proteins including hypothetical proteins. LD-carboxypeptidase hydrolyzes the amide bond that links the dibasic amino acids to C-terminal D-amino acids. The physiological substrates of LD-carboxypeptidase are tetrapeptide fragments (such as UDP-MurNAc-tetrapeptides) that are produced when bacterial cell walls are degraded; they contain an L-configured residue (L-lysine or meso-diaminopimelic acid residue) as the penultimate residue and D-alanine as the ultimate residue. A possible role of LD-carboxypeptidase is in peptidoglycan recycling whereby the resulting tripeptide (precursor for murein synthesis) can be reconverted into peptidoglycan by attachment o Probab=21.51 E-value=60 Score=13.67 Aligned_cols=22 Identities=5% Similarity=-0.066 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHCCCEEEEEEE Q ss_conf 9999999999973984999853 Q gi|254781093|r 44 NIVYQLFYLFQQRGFVSLRFNF 65 (225) Q Consensus 44 ~~~~~la~~l~~~G~~vl~fd~ 65 (225) ..+...-+.|.+.||.+...+. T Consensus 14 ~~~~~~~~~L~~~G~~v~~~~~ 35 (282) T cd07025 14 ERLERAIARLESLGLEVVVGPH 35 (282) T ss_pred HHHHHHHHHHHHCCCEEEECCC T ss_conf 9999999999968998987876 No 303 >cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides. Probab=21.46 E-value=60 Score=13.66 Aligned_cols=38 Identities=29% Similarity=0.353 Sum_probs=25.7 Q ss_pred EEEEECCCC-CCCCCCCCHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 999957899-88889989999999999997398499985315 Q gi|254781093|r 27 IALILHPHP-RFGGTMNDNIVYQLFYLFQQRGFVSLRFNFRG 67 (225) Q Consensus 27 ~vv~~Hp~p-~~GG~~~~~~~~~la~~l~~~G~~vl~fd~RG 67 (225) +++++ |+| ..||- ...+..+++.|+++|..|..+.++. T Consensus 2 I~~i~-~ypP~~GGi--~~~~~~La~~L~~~Gh~V~v~~~~~ 40 (366) T cd03822 2 IALVS-PYPPRKCGI--ATFTTDLVNALSARGPDVLVVSVAA 40 (366) T ss_pred EEEEE-CCCCCCCCH--HHHHHHHHHHHHHCCCEEEEEECCC T ss_conf 89991-899999838--9999999999986799899995888 No 304 >PRK12615 galactose-6-phosphate isomerase subunit LacB; Reviewed Probab=21.32 E-value=61 Score=13.64 Aligned_cols=110 Identities=11% Similarity=0.141 Sum_probs=60.1 Q ss_pred HHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHH-HHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHCCCCHH Q ss_conf 999999997398499985315588767785210238999-9989999862354674058887203687876450386213 Q gi|254781093|r 47 YQLFYLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDA-AAALDWVQSLNPESKSCWIAGYSFGAWISMQLLMRRPEIN 125 (225) Q Consensus 47 ~~la~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~-~aa~~~l~~~~~~~~~i~l~G~S~Gg~val~~a~~~p~i~ 125 (225) ..+...|.+.||.|. |+ | .++...-..-|. ..|.+.+.+.. ...-|.++|.-.|-.+ +|-+.+.|+ T Consensus 16 ~~i~~~L~~~G~eV~--D~-G------t~~~~svDYPd~a~~Va~~V~~~~-~~~GIliCGTGiGmsI---aANK~~GIR 82 (171) T PRK12615 16 MAVSDFLKSKGYDVI--DC-G------TYDHTRTHYPIFGKKVGEAVVNGQ-ADLGVCICGTGVGINN---AVNKVPGIR 82 (171) T ss_pred HHHHHHHHHCCCEEE--EC-C------CCCCCCCCCHHHHHHHHHHHHCCC-CCEEEEEECCCHHHHH---HHHCCCCEE T ss_conf 999999998899789--77-9------999887882699999999997699-7646998178576999---975479738 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHC Q ss_conf 30122757443332223433477377616865502899999999998607 Q gi|254781093|r 126 GFISVAPQPKSYDFSFLAPCPSSGLIINGSNDTVATTSDVKDLVNKLMNQ 175 (225) Q Consensus 126 ~~v~isp~~~~~~~~~l~~~~~p~LiIhG~~D~~vp~~~~~~~~~~l~~~ 175 (225) +.++..+...... ...-...+|.+-+ .++..+.+.++.+..-.. T Consensus 83 AAlc~d~~~A~~a---R~HNnANVL~lGa---R~i~~~~A~~iv~~fL~t 126 (171) T PRK12615 83 SALVRDMTTALYA---KEELNANVIGFGG---KITGELLMCDIIDAFIKA 126 (171) T ss_pred EEEECCHHHHHHH---HHHCCCEEEEECH---HHCCHHHHHHHHHHHHCC T ss_conf 9996888999999---9845972899763---025999999999999779 No 305 >COG0018 ArgS Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=20.87 E-value=62 Score=13.59 Aligned_cols=43 Identities=19% Similarity=0.198 Sum_probs=29.7 Q ss_pred EEEEEC--CCC---CCCCCCCCHHH-HHHHHHHHHCCCEEEEEEECCCC Q ss_conf 999957--899---88889989999-99999999739849998531558 Q gi|254781093|r 27 IALILH--PHP---RFGGTMNDNIV-YQLFYLFQQRGFVSLRFNFRGIG 69 (225) Q Consensus 27 ~vv~~H--p~p---~~GG~~~~~~~-~~la~~l~~~G~~vl~fd~RG~G 69 (225) .+++=| |+| .|=|+..|-++ ..+|+.|...||.|.|-||=|-+ T Consensus 118 kV~iE~sSaNptkplHiGHlR~aiiGDsLaril~~~Gy~V~r~~yvnD~ 166 (577) T COG0018 118 KVVIEYSSANPTGPLHIGHLRNAIIGDSLARILEFLGYDVTRENYVNDW 166 (577) T ss_pred EEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCH T ss_conf 7999874799999852332353688999999999839976678667857 No 306 >pfam00150 Cellulase Cellulase (glycosyl hydrolase family 5). Probab=20.71 E-value=62 Score=13.56 Aligned_cols=60 Identities=13% Similarity=0.118 Sum_probs=32.9 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCEEEEEEEC--CCCCCCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 8888998999999999999739849998531--558876778521023899999899998623 Q gi|254781093|r 36 RFGGTMNDNIVYQLFYLFQQRGFVSLRFNFR--GIGRSEGEFDYGDGELSDAAAALDWVQSLN 96 (225) Q Consensus 36 ~~GG~~~~~~~~~la~~l~~~G~~vl~fd~R--G~G~S~G~~~~g~~E~~D~~aa~~~l~~~~ 96 (225) .++|+.. .....+-+.+.+.|+.++|.-+. .+....+.+.....-++.+..+++|+.+++ T Consensus 17 ~~~~~~~-~~~~~~~~~i~~~GfN~vRlP~~~~~~~~~~~~~~~~~~~l~~ld~vv~~a~~~g 78 (276) T pfam00150 17 TFWGNPF-PDTKDMIDEVKAAGFNTVRIPVSWEALYPGEPGYKIDPAWLNRVDEVVDAAIDNG 78 (276) T ss_pred CCCCCCC-CCHHHHHHHHHHCCCCEEEEEEEHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 6888886-5289999999985999597410999931677888669899999999999999889 No 307 >TIGR02302 aProt_lowcomp conserved hypothetical protein TIGR02302; InterPro: IPR012683 Members of this family are bacterial proteins from the alphaproteobacteria. Each has 2-3 predicted transmembrane helices near the N-terminus and a long C-terminal region that includes stretches of Gln/Gly-rich low complexity sequence, predicted to be outside the membrane. Bradyrhizobium japonicum contains two tandem copies that are, together, homologous to the single members found in other species. . Probab=20.69 E-value=53 Score=13.98 Aligned_cols=21 Identities=10% Similarity=0.167 Sum_probs=17.1 Q ss_pred CEEEEEEECCCCCCCCEEEEE Q ss_conf 618799966888778799995 Q gi|254781093|r 11 GRLEGRYQPSTNPNAPIALIL 31 (225) Q Consensus 11 G~l~~~~~~~~~~~~~~vv~~ 31 (225) =|++||++||.-++++.+++. T Consensus 183 iRiDAWVTPP~YTgraPIfLT 203 (899) T TIGR02302 183 IRIDAWVTPPVYTGRAPIFLT 203 (899) T ss_pred EEECCCCCCCCCCCCCCEEEC T ss_conf 651331177776788876731 No 308 >pfam05706 CDKN3 Cyclin-dependent kinase inhibitor 3 (CDKN3). This family consists of cyclin-dependent kinase inhibitor 3 or kinase associated phosphatase proteins from several mammalian species. The cyclin-dependent kinase (Cdk)-associated protein phosphatase (KAP) is a human dual specificity protein phosphatase that dephosphorylates Cdk2 on threonine 160 in a cyclin-dependent manner. Probab=20.48 E-value=63 Score=13.54 Aligned_cols=51 Identities=18% Similarity=0.302 Sum_probs=29.1 Q ss_pred HHHHHCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHH Q ss_conf 99997398499985315588767785210238999998999986235467405888720368 Q gi|254781093|r 51 YLFQQRGFVSLRFNFRGIGRSEGEFDYGDGELSDAAAALDWVQSLNPESKSCWIAGYSFGAW 112 (225) Q Consensus 51 ~~l~~~G~~vl~fd~RG~G~S~G~~~~g~~E~~D~~aa~~~l~~~~~~~~~i~l~G~S~Gg~ 112 (225) ....+.|+.|+.+-++ |.+..+.+-+...++++...-.+.+++.+ |++||. T Consensus 94 d~yq~~G~~vhH~Pip---------Dg~~P~~~~c~~ileEl~~~L~~grk~li--hc~ggl 144 (207) T pfam05706 94 DLYQQCGIITHHHPIA---------DGGTPDIASCCEIMEELATCLKNNRKTLI--HSVGGL 144 (207) T ss_pred HHHHHCCCEEEECCCC---------CCCCCCHHHHHHHHHHHHHHHHCCCEEEE--EEECCC T ss_conf 9999769168854778---------99998789999999999999983984899--984465 No 309 >cd07210 Pat_hypo_W_succinogenes_WS1459_like Hypothetical patatin similar to WS1459 of Wolinella succinogenes. Patatin-like phospholipase. This family predominantly consists of bacterial patatin glycoproteins. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein, but it also has the enzymatic activity of a lipid acyl hydrolase, catalyzing the cleavage of fatty acids from membrane lipids. Members of this family have also been found in vertebrates. Probab=20.48 E-value=63 Score=13.54 Aligned_cols=17 Identities=6% Similarity=0.157 Sum_probs=8.8 Q ss_pred EEEECCCCCCCCHHHHH Q ss_conf 77616865502899999 Q gi|254781093|r 150 LIINGSNDTVATTSDVK 166 (225) Q Consensus 150 LiIhG~~D~~vp~~~~~ 166 (225) ..+=|--=+-+|++..+ T Consensus 147 ~yvDGGv~dn~Pv~~~~ 163 (221) T cd07210 147 PFVDGGVADRLPFDALR 163 (221) T ss_pred EEEECCCCCCCCHHHHH T ss_conf 99856857771866897 No 310 >KOG1153 consensus Probab=20.45 E-value=63 Score=13.53 Aligned_cols=56 Identities=20% Similarity=0.380 Sum_probs=31.7 Q ss_pred CCCHHHHHHHHHHHHHHHHC-----CCCCCEEEEEECHHHH--HHHHHHHCCCCHHEEEECCCC Q ss_conf 21023899999899998623-----5467405888720368--787645038621330122757 Q gi|254781093|r 77 YGDGELSDAAAALDWVQSLN-----PESKSCWIAGYSFGAW--ISMQLLMRRPEINGFISVAPQ 133 (225) Q Consensus 77 ~g~~E~~D~~aa~~~l~~~~-----~~~~~i~l~G~S~Gg~--val~~a~~~p~i~~~v~isp~ 133 (225) +|.|++.|+..-++|+.+.+ ....+. ++-.|.||. .++..|-...--.|++...++ T Consensus 289 dGsGt~Sdvi~GvE~~~k~h~~~k~~~~k~s-v~NlSlGg~~S~aLn~AV~~A~~~Gi~fa~AA 351 (501) T KOG1153 289 DGSGTVSDVIKGVEFVVKHHEKKKKKEGKKS-VANLSLGGFRSAALNMAVNAASERGIHFAVAA 351 (501) T ss_pred CCCEEHHHHHHHHHHHHHHHHHHHCCCCCCE-EEEEECCCCCCHHHHHHHHHHHHCCEEEEECC T ss_conf 8957399887677999987666403468971-79983378630889999998864674999717 No 311 >COG0777 AccD Acetyl-CoA carboxylase beta subunit [Lipid metabolism] Probab=20.36 E-value=64 Score=13.52 Aligned_cols=137 Identities=15% Similarity=0.177 Sum_probs=57.6 Q ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHH---HHHHCCCEEEEEEECCCCCCCCC-CCCCCH---HHHHHHHHHHHHHHHC Q ss_conf 7879999578998888998999999999---99973984999853155887677-852102---3899999899998623 Q gi|254781093|r 24 NAPIALILHPHPRFGGTMNDNIVYQLFY---LFQQRGFVSLRFNFRGIGRSEGE-FDYGDG---ELSDAAAALDWVQSLN 96 (225) Q Consensus 24 ~~~~vv~~Hp~p~~GG~~~~~~~~~la~---~l~~~G~~vl~fd~RG~G~S~G~-~~~g~~---E~~D~~aa~~~l~~~~ 96 (225) ..|+++.+.-+...||||..-+-..+.+ ...+.-...+.|.-.| |. -.+|.- ...-..|++.-+++.+ T Consensus 121 G~pvv~av~df~FmgGSmGsVvGeki~ra~E~A~e~k~P~v~f~aSG-----GARMQEg~lSLMQMaktsaAl~~l~ea~ 195 (294) T COG0777 121 GLPVVLAVMDFAFMGGSMGSVVGEKITRAIERAIEDKLPLVLFSASG-----GARMQEGILSLMQMAKTSAALKRLSEAG 195 (294) T ss_pred CEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCC-----CHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 86789999742442455058999999999999998579889995676-----4657688999999999999999998759 Q ss_pred CCCCCEEEEEECHHHHHHHHHHHCCC---CHHEEEECCCCCCCCCCCCCCCCC-----CCEEEEECCCCCCCCHHHHHH Q ss_conf 54674058887203687876450386---213301227574433322234334-----773776168655028999999 Q gi|254781093|r 97 PESKSCWIAGYSFGAWISMQLLMRRP---EINGFISVAPQPKSYDFSFLAPCP-----SSGLIINGSNDTVATTSDVKD 167 (225) Q Consensus 97 ~~~~~i~l~G~S~Gg~val~~a~~~p---~i~~~v~isp~~~~~~~~~l~~~~-----~p~LiIhG~~D~~vp~~~~~~ 167 (225) ... -.+|..=-+||..|..+...+- +..++|..+.+-.. ....-...| .--|+=||--|.+|+-.+.+. T Consensus 196 lpy-IsVLt~PTtGGVsASfA~lGDi~iAEP~AlIGFAGpRVI-EQTire~LPegfQ~aEfLlehG~iD~iv~R~elr~ 272 (294) T COG0777 196 LPY-ISVLTDPTTGGVSASFAMLGDIIIAEPGALIGFAGPRVI-EQTIREKLPEGFQTAEFLLEHGMIDMIVHRDELRT 272 (294) T ss_pred CCE-EEEECCCCCCCHHHHHHHCCCEEECCCCCCCCCCCCHHH-HHHHCCCCCCCHHHHHHHHHCCCCCEEECHHHHHH T ss_conf 966-999558986664676775267465176300015763244-65644316862346899997587305633588999 Done!