Query psy3685
Match_columns 1042
No_of_seqs 1050 out of 5931
Neff 9.4
Searched_HMMs 46136
Date Fri Aug 16 22:58:34 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3685.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3685hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2182|consensus 100.0 2.7E-66 5.8E-71 554.9 27.7 313 727-1041 48-365 (514)
2 KOG2183|consensus 100.0 6.1E-66 1.3E-70 534.0 22.7 291 727-1019 42-342 (492)
3 PF05577 Peptidase_S28: Serine 100.0 3.1E-61 6.7E-66 558.2 16.2 269 737-1005 1-272 (434)
4 PF05576 Peptidase_S37: PS-10 99.9 9.9E-27 2.1E-31 245.4 16.0 268 732-1033 33-307 (448)
5 KOG1074|consensus 99.9 5.3E-28 1.1E-32 270.2 6.5 206 504-711 605-936 (958)
6 KOG2462|consensus 99.9 2.4E-25 5.2E-30 221.3 4.8 134 451-607 130-265 (279)
7 KOG1074|consensus 99.9 6.4E-25 1.4E-29 245.7 8.2 215 370-584 605-933 (958)
8 KOG2462|consensus 99.9 3.5E-25 7.5E-30 220.1 3.0 127 503-630 129-261 (279)
9 KOG3608|consensus 99.9 1.6E-24 3.5E-29 218.8 7.7 213 371-610 135-377 (467)
10 KOG3608|consensus 99.9 2.6E-23 5.5E-28 210.2 6.3 213 263-476 135-377 (467)
11 KOG3623|consensus 99.8 5.1E-20 1.1E-24 202.4 5.2 102 317-418 211-330 (1007)
12 KOG3623|consensus 99.7 1E-18 2.2E-23 192.4 4.5 102 63-164 211-330 (1007)
13 KOG3576|consensus 99.5 5.9E-15 1.3E-19 138.6 1.4 109 503-611 116-238 (267)
14 KOG3576|consensus 99.5 1.2E-14 2.6E-19 136.6 2.3 109 451-584 117-238 (267)
15 PLN02298 hydrolase, alpha/beta 99.1 3.3E-10 7.1E-15 127.4 13.2 113 764-885 58-170 (330)
16 COG2267 PldB Lysophospholipase 99.1 2.2E-10 4.7E-15 124.9 10.2 111 766-888 35-146 (298)
17 KOG1455|consensus 99.1 6.2E-10 1.3E-14 114.8 12.5 113 767-889 56-169 (313)
18 PLN02385 hydrolase; alpha/beta 99.1 5E-10 1.1E-14 126.8 11.9 111 765-885 87-198 (349)
19 PRK00870 haloalkane dehalogena 99.0 1.1E-09 2.4E-14 121.4 12.6 107 763-884 44-150 (302)
20 PLN02824 hydrolase, alpha/beta 99.0 1.1E-09 2.3E-14 121.2 11.3 110 764-884 28-137 (294)
21 KOG4178|consensus 99.0 9.2E-10 2E-14 115.6 9.9 118 752-885 32-149 (322)
22 PHA02857 monoglyceride lipase; 99.0 1.9E-09 4E-14 118.1 13.0 109 765-885 25-133 (276)
23 PLN03086 PRLI-interacting fact 99.0 5.2E-10 1.1E-14 127.2 8.5 87 91-181 455-552 (567)
24 PRK10749 lysophospholipase L2; 99.0 1.4E-09 3.1E-14 121.8 11.6 114 764-885 53-167 (330)
25 PLN03086 PRLI-interacting fact 99.0 1.4E-09 3E-14 123.8 9.4 141 452-606 408-561 (567)
26 KOG2564|consensus 99.0 5E-09 1.1E-13 105.6 12.3 166 732-918 44-212 (343)
27 TIGR01607 PST-A Plasmodium sub 98.9 3.2E-09 7E-14 118.6 10.2 115 767-886 23-187 (332)
28 TIGR02240 PHA_depoly_arom poly 98.9 3.9E-09 8.4E-14 115.5 9.3 115 751-885 13-127 (276)
29 TIGR03101 hydr2_PEP hydrolase, 98.9 1.1E-08 2.4E-13 108.9 12.3 111 765-887 25-137 (266)
30 PF12697 Abhydrolase_6: Alpha/ 98.9 7E-09 1.5E-13 109.1 10.1 103 768-886 1-103 (228)
31 PHA02768 hypothetical protein; 98.9 1.2E-09 2.6E-14 82.0 2.7 49 585-634 5-53 (55)
32 TIGR01250 pro_imino_pep_2 prol 98.8 1.3E-08 2.8E-13 111.7 11.4 106 765-884 25-131 (288)
33 TIGR02427 protocat_pcaD 3-oxoa 98.8 4.8E-09 1.1E-13 112.3 7.6 104 764-885 12-115 (251)
34 TIGR01840 esterase_phb esteras 98.8 2E-08 4.3E-13 104.8 11.6 122 762-886 10-132 (212)
35 PRK10673 acyl-CoA esterase; Pr 98.8 1.3E-08 2.8E-13 110.0 10.5 100 763-881 14-113 (255)
36 PLN02652 hydrolase; alpha/beta 98.8 1.6E-08 3.4E-13 114.9 11.5 108 765-886 136-247 (395)
37 PLN02211 methyl indole-3-aceta 98.8 1.9E-08 4.1E-13 109.3 11.3 107 763-884 16-122 (273)
38 PRK11126 2-succinyl-6-hydroxy- 98.8 1.3E-08 2.8E-13 108.9 10.1 101 764-884 1-102 (242)
39 TIGR03056 bchO_mg_che_rel puta 98.8 1.2E-08 2.7E-13 111.5 9.6 105 764-885 27-131 (278)
40 PLN02965 Probable pheophorbida 98.8 1.6E-08 3.5E-13 109.1 10.1 101 768-883 6-106 (255)
41 PRK03592 haloalkane dehalogena 98.8 1.7E-08 3.6E-13 111.7 10.3 104 764-885 26-129 (295)
42 PHA00733 hypothetical protein 98.8 3.1E-09 6.7E-14 98.7 3.6 58 552-610 67-124 (128)
43 TIGR03611 RutD pyrimidine util 98.8 2.9E-08 6.3E-13 106.9 11.1 105 763-884 11-115 (257)
44 KOG4409|consensus 98.8 2E-08 4.2E-13 105.9 8.8 108 763-884 88-195 (365)
45 PLN02511 hydrolase 98.8 2.3E-08 5.1E-13 114.1 10.3 112 763-886 98-212 (388)
46 PLN03087 BODYGUARD 1 domain co 98.7 6.5E-08 1.4E-12 111.3 12.8 108 764-885 200-310 (481)
47 PRK03204 haloalkane dehalogena 98.7 7.9E-08 1.7E-12 105.5 12.0 104 764-884 33-136 (286)
48 PLN03084 alpha/beta hydrolase 98.7 3.8E-08 8.3E-13 111.1 9.7 108 764-885 126-233 (383)
49 PLN02679 hydrolase, alpha/beta 98.7 4.2E-08 9.2E-13 111.2 9.7 103 765-884 88-191 (360)
50 TIGR03343 biphenyl_bphD 2-hydr 98.7 3.4E-08 7.4E-13 108.4 8.7 107 763-883 28-135 (282)
51 TIGR01249 pro_imino_pep_1 prol 98.7 3.6E-08 7.8E-13 109.4 8.7 79 794-884 52-130 (306)
52 PRK06489 hypothetical protein; 98.7 4.6E-08 1E-12 111.2 9.6 112 765-883 69-188 (360)
53 TIGR03695 menH_SHCHC 2-succiny 98.7 4.8E-08 1E-12 104.4 9.3 104 765-884 1-105 (251)
54 PHA02768 hypothetical protein; 98.7 8.3E-09 1.8E-13 77.6 1.8 42 559-601 6-47 (55)
55 PLN02894 hydrolase, alpha/beta 98.6 1.9E-07 4.1E-12 107.1 13.2 109 763-884 103-211 (402)
56 PRK08775 homoserine O-acetyltr 98.6 6.4E-08 1.4E-12 109.2 8.9 106 764-884 57-173 (343)
57 KOG3993|consensus 98.6 1.2E-08 2.6E-13 107.5 2.8 188 504-707 267-484 (500)
58 PRK10985 putative hydrolase; P 98.6 1E-07 2.2E-12 106.5 10.2 111 764-886 57-170 (324)
59 PHA00733 hypothetical protein 98.6 3.3E-08 7.2E-13 91.8 4.9 58 418-476 67-124 (128)
60 PRK10349 carboxylesterase BioH 98.6 7.9E-08 1.7E-12 103.8 8.8 98 762-882 10-107 (256)
61 KOG3993|consensus 98.6 1E-08 2.2E-13 108.1 0.9 106 343-449 267-381 (500)
62 PLN02578 hydrolase 98.6 1.2E-07 2.5E-12 107.5 9.2 101 764-882 85-185 (354)
63 PRK10566 esterase; Provisional 98.6 3.2E-07 7E-12 98.6 12.2 109 763-877 25-135 (249)
64 PRK07581 hypothetical protein; 98.6 1.2E-07 2.7E-12 107.0 8.8 139 739-884 12-159 (339)
65 TIGR01738 bioH putative pimelo 98.5 1.1E-07 2.4E-12 101.4 6.7 95 766-883 5-99 (245)
66 TIGR02821 fghA_ester_D S-formy 98.5 1.2E-06 2.6E-11 95.4 12.9 120 763-886 40-175 (275)
67 TIGR03100 hydr1_PEP hydrolase, 98.5 9.7E-07 2.1E-11 96.1 11.4 108 765-885 27-135 (274)
68 TIGR01392 homoserO_Ac_trn homo 98.5 8.1E-07 1.7E-11 100.7 11.2 115 764-885 30-163 (351)
69 PRK14875 acetoin dehydrogenase 98.4 1E-06 2.2E-11 101.0 12.2 104 763-884 129-232 (371)
70 KOG4391|consensus 98.4 6.2E-07 1.4E-11 87.1 8.3 190 754-966 70-284 (300)
71 PLN02980 2-oxoglutarate decarb 98.4 5.4E-07 1.2E-11 119.9 10.6 121 752-883 1356-1479(1655)
72 TIGR03230 lipo_lipase lipoprot 98.4 1.7E-06 3.7E-11 97.7 12.8 114 758-882 34-152 (442)
73 PLN02442 S-formylglutathione h 98.4 2.1E-06 4.5E-11 93.7 12.3 144 738-885 20-179 (283)
74 PF10503 Esterase_phd: Esteras 98.4 2E-06 4.3E-11 88.4 11.0 118 764-884 15-132 (220)
75 PF00561 Abhydrolase_1: alpha/ 98.4 7.2E-07 1.6E-11 94.3 8.2 77 797-883 2-78 (230)
76 PRK05077 frsA fermentation/res 98.4 2.5E-06 5.5E-11 97.9 12.2 109 764-886 193-302 (414)
77 cd00707 Pancreat_lipase_like P 98.3 1.3E-06 2.8E-11 94.5 9.0 114 759-881 30-144 (275)
78 COG1506 DAP2 Dipeptidyl aminop 98.3 1.2E-06 2.6E-11 106.2 8.3 120 766-895 395-518 (620)
79 TIGR00976 /NonD putative hydro 98.3 2E-06 4.4E-11 103.1 10.2 111 764-885 21-133 (550)
80 KOG2382|consensus 98.3 6.9E-06 1.5E-10 86.9 12.8 124 764-901 51-177 (315)
81 TIGR01836 PHA_synth_III_C poly 98.3 3.9E-06 8.4E-11 95.0 11.7 108 765-887 63-174 (350)
82 PRK00175 metX homoserine O-ace 98.3 5E-06 1.1E-10 95.0 12.0 114 765-885 48-183 (379)
83 KOG1454|consensus 98.3 2.4E-06 5.2E-11 94.3 9.0 110 764-887 57-169 (326)
84 PHA00616 hypothetical protein 98.3 2.6E-07 5.7E-12 66.0 0.8 33 682-714 1-33 (44)
85 PRK13604 luxD acyl transferase 98.2 4.5E-06 9.8E-11 89.4 10.3 106 764-885 36-142 (307)
86 PTZ00472 serine carboxypeptida 98.2 8E-06 1.7E-10 94.7 13.1 127 763-894 75-226 (462)
87 KOG1552|consensus 98.2 4.6E-06 9.9E-11 85.0 8.4 104 765-884 60-163 (258)
88 PRK05855 short chain dehydroge 98.2 3E-06 6.5E-11 103.6 8.1 106 763-883 23-130 (582)
89 PLN02872 triacylglycerol lipas 98.1 2.8E-06 6E-11 96.3 5.5 115 764-884 73-197 (395)
90 KOG1146|consensus 98.1 7.7E-06 1.7E-10 99.2 9.3 121 559-708 1229-1354(1406)
91 PF13465 zf-H2C2_2: Zinc-finge 98.1 4E-07 8.6E-12 58.5 -0.9 24 670-693 2-25 (26)
92 PF13465 zf-H2C2_2: Zinc-finge 98.1 1.4E-06 3E-11 56.0 1.5 24 600-623 1-25 (26)
93 PRK11460 putative hydrolase; P 98.1 2.4E-05 5.2E-10 82.7 11.8 117 762-883 13-137 (232)
94 TIGR03502 lipase_Pla1_cef extr 98.1 1E-05 2.2E-10 97.2 9.7 103 764-869 448-575 (792)
95 PF12146 Hydrolase_4: Putative 98.1 5E-06 1.1E-10 70.5 4.8 69 759-837 7-79 (79)
96 COG1647 Esterase/lipase [Gener 98.0 6.2E-05 1.3E-09 74.5 12.2 111 762-889 12-123 (243)
97 PF12695 Abhydrolase_5: Alpha/ 98.0 1.5E-05 3.3E-10 77.4 8.1 93 767-882 1-93 (145)
98 PRK11071 esterase YqiA; Provis 98.0 1.4E-05 3E-10 81.4 7.9 82 766-872 2-84 (190)
99 PLN00021 chlorophyllase 98.0 2.2E-05 4.9E-10 86.1 9.9 103 763-882 50-164 (313)
100 PHA00732 hypothetical protein 98.0 4.7E-06 1E-10 69.7 2.9 45 559-608 2-47 (79)
101 PHA00616 hypothetical protein 98.0 2.6E-06 5.7E-11 61.0 0.9 33 585-617 1-34 (44)
102 COG0596 MhpC Predicted hydrola 97.9 2.6E-05 5.7E-10 83.6 8.7 103 765-885 21-124 (282)
103 PF00326 Peptidase_S9: Prolyl 97.9 7.8E-06 1.7E-10 85.5 4.4 94 790-889 10-104 (213)
104 KOG2281|consensus 97.9 2E-05 4.3E-10 88.5 7.2 118 765-887 642-765 (867)
105 PHA00732 hypothetical protein 97.9 7.6E-06 1.6E-10 68.5 2.9 48 531-583 1-49 (79)
106 PRK10162 acetyl esterase; Prov 97.9 5E-05 1.1E-09 84.4 9.5 107 764-885 80-196 (318)
107 KOG2984|consensus 97.9 2.7E-05 5.9E-10 75.1 6.2 102 767-881 45-146 (277)
108 PF05677 DUF818: Chlamydia CHL 97.8 8.1E-05 1.8E-09 78.9 9.9 97 765-871 137-237 (365)
109 COG0429 Predicted hydrolase of 97.8 6.2E-05 1.4E-09 79.5 8.4 111 764-886 74-187 (345)
110 PRK10115 protease 2; Provision 97.8 4.4E-05 9.6E-10 93.3 8.4 115 764-887 444-562 (686)
111 KOG1146|consensus 97.8 1.2E-05 2.6E-10 97.6 3.1 103 505-611 1229-1354(1406)
112 PF11144 DUF2920: Protein of u 97.7 0.00041 8.8E-09 76.2 13.2 138 749-886 19-221 (403)
113 KOG1838|consensus 97.7 0.00047 1E-08 75.9 12.9 112 763-886 123-237 (409)
114 COG3509 LpqC Poly(3-hydroxybut 97.6 0.00048 1E-08 71.7 10.8 112 761-877 57-172 (312)
115 PF02129 Peptidase_S15: X-Pro 97.6 0.00014 3.1E-09 79.0 7.5 113 765-888 20-140 (272)
116 PF00450 Peptidase_S10: Serine 97.6 0.00054 1.2E-08 79.8 12.7 134 754-891 28-188 (415)
117 PF07859 Abhydrolase_3: alpha/ 97.6 0.0002 4.4E-09 74.6 8.2 104 768-886 1-112 (211)
118 PF09752 DUF2048: Uncharacteri 97.5 0.00052 1.1E-08 74.0 10.2 138 730-875 53-201 (348)
119 PLN02209 serine carboxypeptida 97.4 0.001 2.2E-08 76.3 11.9 123 763-890 66-218 (437)
120 PF00096 zf-C2H2: Zinc finger, 97.4 5E-05 1.1E-09 47.5 0.6 23 683-705 1-23 (23)
121 PF00756 Esterase: Putative es 97.4 0.00032 6.9E-09 75.4 7.2 53 834-887 101-153 (251)
122 PF08538 DUF1749: Protein of u 97.4 0.0018 3.9E-08 68.9 12.3 104 766-882 34-146 (303)
123 PRK06765 homoserine O-acetyltr 97.4 0.00044 9.5E-09 78.6 8.2 154 740-901 28-214 (389)
124 PLN03016 sinapoylglucose-malat 97.4 0.0011 2.5E-08 75.9 11.5 123 763-890 64-216 (433)
125 COG0400 Predicted esterase [Ge 97.4 0.00034 7.3E-09 71.2 6.4 120 762-886 15-136 (207)
126 TIGR01838 PHA_synth_I poly(R)- 97.3 0.0018 4E-08 75.8 13.3 105 765-886 189-304 (532)
127 PF00096 zf-C2H2: Zinc finger, 97.3 9.4E-05 2E-09 46.2 1.0 22 586-607 1-22 (23)
128 PF05448 AXE1: Acetyl xylan es 97.3 0.00087 1.9E-08 73.8 9.1 111 765-881 83-206 (320)
129 PF07819 PGAP1: PGAP1-like pro 97.3 0.0025 5.4E-08 66.6 12.0 58 830-887 64-126 (225)
130 PF05605 zf-Di19: Drought indu 97.2 0.00039 8.4E-09 54.1 4.1 48 505-555 3-53 (54)
131 PF02230 Abhydrolase_2: Phosph 97.2 0.00055 1.2E-08 71.6 6.7 120 763-887 12-143 (216)
132 PF05605 zf-Di19: Drought indu 97.2 0.00029 6.4E-09 54.8 3.3 48 559-609 3-53 (54)
133 PF12756 zf-C2H2_2: C2H2 type 97.2 0.00016 3.4E-09 65.2 1.7 73 533-608 1-73 (100)
134 PF06500 DUF1100: Alpha/beta h 97.2 0.00034 7.3E-09 77.6 4.5 111 764-888 189-300 (411)
135 COG2021 MET2 Homoserine acetyl 97.2 0.0035 7.6E-08 67.8 11.9 184 751-940 33-245 (368)
136 PF10230 DUF2305: Uncharacteri 97.2 0.0029 6.3E-08 68.1 11.5 116 766-886 3-124 (266)
137 COG0657 Aes Esterase/lipase [L 97.2 0.0015 3.2E-08 72.8 9.5 111 762-887 76-194 (312)
138 cd00312 Esterase_lipase Estera 97.2 0.0012 2.6E-08 78.7 9.4 112 763-885 93-214 (493)
139 KOG2100|consensus 97.1 0.0012 2.6E-08 81.1 8.7 117 766-887 527-647 (755)
140 PF03403 PAF-AH_p_II: Platelet 97.1 0.0012 2.6E-08 74.6 7.9 117 765-885 100-263 (379)
141 PF12756 zf-C2H2_2: C2H2 type 97.1 0.00026 5.6E-09 63.8 2.1 22 373-394 2-23 (100)
142 COG0412 Dienelactone hydrolase 97.1 0.0026 5.5E-08 67.0 9.8 124 753-882 16-144 (236)
143 PF00975 Thioesterase: Thioest 97.0 0.0016 3.4E-08 68.8 7.6 81 766-868 2-85 (229)
144 KOG4667|consensus 97.0 0.0023 5E-08 63.0 7.7 98 763-872 31-128 (269)
145 PF12740 Chlorophyllase2: Chlo 97.0 0.0026 5.7E-08 66.4 8.7 103 765-885 17-131 (259)
146 PF06057 VirJ: Bacterial virul 97.0 0.0013 2.9E-08 64.7 5.9 96 767-880 4-99 (192)
147 COG2945 Predicted hydrolase of 96.8 0.0054 1.2E-07 59.7 8.6 110 762-885 25-138 (210)
148 PF13894 zf-C2H2_4: C2H2-type 96.8 0.00054 1.2E-08 43.2 1.3 23 586-608 1-23 (24)
149 PRK07868 acyl-CoA synthetase; 96.8 0.0077 1.7E-07 77.9 12.8 111 763-886 65-179 (994)
150 PF13894 zf-C2H2_4: C2H2-type 96.8 0.00051 1.1E-08 43.3 1.1 24 683-706 1-24 (24)
151 PRK10439 enterobactin/ferric e 96.7 0.016 3.4E-07 66.4 13.2 52 834-885 272-324 (411)
152 PLN02733 phosphatidylcholine-s 96.7 0.0027 5.9E-08 72.6 6.9 88 786-887 113-204 (440)
153 KOG4627|consensus 96.7 0.0037 8.1E-08 61.0 6.6 103 762-882 64-170 (270)
154 PF13912 zf-C2H2_6: C2H2-type 96.7 0.00054 1.2E-08 44.7 0.7 26 682-707 1-26 (27)
155 PF01738 DLH: Dienelactone hyd 96.7 0.0022 4.7E-08 67.2 5.6 113 764-882 13-130 (218)
156 KOG1282|consensus 96.7 0.023 4.9E-07 64.8 13.6 137 749-889 56-218 (454)
157 PF13912 zf-C2H2_6: C2H2-type 96.6 0.00092 2E-08 43.6 1.3 25 585-609 1-25 (27)
158 PF03096 Ndr: Ndr family; Int 96.5 0.0087 1.9E-07 63.2 8.2 110 764-885 22-135 (283)
159 PF06342 DUF1057: Alpha/beta h 96.5 0.038 8.3E-07 57.7 12.7 98 767-882 37-135 (297)
160 COG1770 PtrB Protease II [Amin 96.4 0.0072 1.6E-07 69.8 7.4 142 735-885 404-563 (682)
161 COG4099 Predicted peptidase [G 96.4 0.02 4.3E-07 59.4 9.7 46 838-884 259-304 (387)
162 KOG1553|consensus 96.4 0.022 4.7E-07 60.0 9.9 79 785-878 259-339 (517)
163 PRK05371 x-prolyl-dipeptidyl a 96.3 0.0099 2.1E-07 73.5 8.8 87 789-884 274-373 (767)
164 PF05728 UPF0227: Uncharacteri 96.3 0.0094 2E-07 60.0 7.0 42 824-872 41-82 (187)
165 COG2819 Predicted hydrolase of 96.3 0.01 2.2E-07 61.8 7.3 48 839-887 128-175 (264)
166 PF12715 Abhydrolase_7: Abhydr 96.2 0.016 3.5E-07 63.5 8.6 127 753-883 103-259 (390)
167 PF01674 Lipase_2: Lipase (cla 96.2 0.016 3.5E-07 59.7 8.0 92 765-870 2-96 (219)
168 smart00355 ZnF_C2H2 zinc finge 96.2 0.0033 7.2E-08 40.3 1.9 25 683-707 1-25 (26)
169 KOG1515|consensus 96.2 0.016 3.5E-07 63.7 8.3 110 763-887 88-210 (336)
170 PF09237 GAGA: GAGA factor; I 96.1 0.0037 8E-08 45.8 2.2 33 678-710 20-52 (54)
171 KOG2931|consensus 95.9 0.053 1.1E-06 56.6 10.2 109 764-884 45-157 (326)
172 COG5189 SFP1 Putative transcri 95.9 0.0031 6.8E-08 64.8 1.3 24 502-525 347-372 (423)
173 COG3571 Predicted hydrolase of 95.9 0.065 1.4E-06 50.4 9.7 106 767-886 16-126 (213)
174 COG5189 SFP1 Putative transcri 95.9 0.0033 7.1E-08 64.7 1.2 67 529-606 347-419 (423)
175 COG4757 Predicted alpha/beta h 95.8 0.018 4E-07 57.6 6.2 92 768-868 33-124 (281)
176 PF06821 Ser_hydrolase: Serine 95.8 0.064 1.4E-06 53.3 10.1 91 768-885 1-92 (171)
177 COG3458 Acetyl esterase (deace 95.6 0.067 1.4E-06 55.1 9.2 104 764-872 82-199 (321)
178 smart00355 ZnF_C2H2 zinc finge 95.4 0.0095 2.1E-07 38.1 1.7 19 588-606 3-21 (26)
179 COG3208 GrsT Predicted thioest 95.4 0.023 5E-07 58.1 5.2 58 796-867 34-92 (244)
180 COG2939 Carboxypeptidase C (ca 95.3 0.017 3.6E-07 65.1 4.5 99 764-867 100-216 (498)
181 KOG1283|consensus 95.3 0.054 1.2E-06 56.8 7.7 105 763-872 29-145 (414)
182 COG2272 PnbA Carboxylesterase 95.2 0.031 6.7E-07 63.0 5.9 110 765-885 94-218 (491)
183 PF10340 DUF2424: Protein of u 95.2 0.12 2.5E-06 57.3 10.3 121 751-887 105-238 (374)
184 PRK04940 hypothetical protein; 95.1 0.062 1.3E-06 53.0 7.1 87 767-878 1-87 (180)
185 TIGR01839 PHA_synth_II poly(R) 95.0 0.44 9.6E-06 55.7 14.9 87 787-886 239-330 (560)
186 COG3319 Thioesterase domains o 95.0 0.065 1.4E-06 56.6 7.5 58 796-868 27-84 (257)
187 PRK10252 entF enterobactin syn 95.0 0.082 1.8E-06 71.4 10.6 97 765-882 1068-1169(1296)
188 PF00135 COesterase: Carboxyle 94.9 0.041 9E-07 66.4 6.7 112 765-885 125-246 (535)
189 KOG3847|consensus 94.9 0.07 1.5E-06 55.9 7.0 116 765-884 118-275 (399)
190 KOG2231|consensus 94.8 0.037 8E-07 64.9 5.4 30 855-884 463-492 (669)
191 PF06259 Abhydrolase_8: Alpha/ 94.7 0.24 5.3E-06 49.1 10.2 59 824-884 86-144 (177)
192 PF09237 GAGA: GAGA factor; I 94.7 0.017 3.7E-07 42.5 1.5 26 585-610 24-49 (54)
193 PF07224 Chlorophyllase: Chlor 94.7 0.072 1.6E-06 54.7 6.4 104 764-884 45-156 (307)
194 PF13909 zf-H2C2_5: C2H2-type 94.7 0.013 2.9E-07 36.8 0.8 23 586-609 1-23 (24)
195 COG4188 Predicted dienelactone 94.6 0.088 1.9E-06 57.4 7.3 103 765-871 71-181 (365)
196 COG0627 Predicted esterase [Ge 94.6 0.11 2.3E-06 56.9 8.0 125 763-887 52-190 (316)
197 PLN02213 sinapoylglucose-malat 94.6 0.24 5.2E-06 55.0 11.0 87 796-887 2-99 (319)
198 PF12874 zf-met: Zinc-finger o 94.5 0.013 2.7E-07 37.3 0.4 23 683-705 1-23 (25)
199 KOG2231|consensus 94.5 0.045 9.7E-07 64.2 5.0 47 145-194 184-236 (669)
200 COG5048 FOG: Zn-finger [Genera 94.4 0.036 7.9E-07 65.1 4.5 157 557-718 288-454 (467)
201 KOG4840|consensus 94.4 0.037 8E-07 54.9 3.6 103 766-881 37-141 (299)
202 cd00741 Lipase Lipase. Lipase 94.3 0.095 2.1E-06 51.3 6.5 56 829-887 11-70 (153)
203 KOG2237|consensus 94.3 0.069 1.5E-06 61.5 5.8 136 738-885 441-585 (712)
204 PRK04860 hypothetical protein; 94.2 0.028 6.1E-07 54.4 2.3 36 681-720 118-153 (160)
205 PF12874 zf-met: Zinc-finger o 94.2 0.022 4.8E-07 36.2 1.0 23 586-608 1-23 (25)
206 PF00151 Lipase: Lipase; Inte 94.2 0.056 1.2E-06 59.8 4.9 99 761-872 67-173 (331)
207 PRK04860 hypothetical protein; 94.1 0.028 6.1E-07 54.4 2.1 37 585-625 119-156 (160)
208 PLN00413 triacylglycerol lipas 94.1 0.2 4.3E-06 56.8 8.9 48 847-894 282-337 (479)
209 PLN03037 lipase class 3 family 94.1 0.19 4.1E-06 57.4 8.8 67 826-895 298-369 (525)
210 PLN02310 triacylglycerol lipas 93.9 0.14 3.1E-06 57.1 7.4 68 825-895 188-259 (405)
211 KOG3724|consensus 93.9 0.12 2.6E-06 60.8 6.9 35 851-886 184-222 (973)
212 PF13909 zf-H2C2_5: C2H2-type 93.7 0.038 8.3E-07 34.7 1.4 22 505-527 1-22 (24)
213 PF07082 DUF1350: Protein of u 93.6 0.28 6.1E-06 50.7 8.3 105 763-881 13-122 (250)
214 COG4947 Uncharacterized protei 93.5 0.13 2.8E-06 49.0 5.2 114 761-887 23-139 (227)
215 KOG2785|consensus 93.4 0.12 2.7E-06 55.6 5.4 56 424-479 166-250 (390)
216 KOG2785|consensus 93.3 0.1 2.2E-06 56.2 4.7 82 197-284 3-90 (390)
217 PF05990 DUF900: Alpha/beta hy 93.2 0.26 5.7E-06 51.8 7.7 89 763-866 16-110 (233)
218 PF01764 Lipase_3: Lipase (cla 93.2 0.085 1.8E-06 50.6 3.8 53 831-886 49-107 (140)
219 PF08840 BAAT_C: BAAT / Acyl-C 93.1 0.13 2.8E-06 53.3 5.2 36 849-885 22-57 (213)
220 cd00519 Lipase_3 Lipase (class 93.1 0.13 2.9E-06 54.2 5.3 42 825-869 107-148 (229)
221 smart00824 PKS_TE Thioesterase 93.1 0.39 8.4E-06 49.4 8.9 71 795-880 25-98 (212)
222 PF12171 zf-C2H2_jaz: Zinc-fin 93.0 0.04 8.7E-07 35.8 0.7 22 586-607 2-23 (27)
223 PF12171 zf-C2H2_jaz: Zinc-fin 93.0 0.028 6.1E-07 36.5 0.0 22 683-704 2-23 (27)
224 PF05057 DUF676: Putative seri 92.8 0.39 8.5E-06 50.0 8.3 44 824-868 54-97 (217)
225 PLN02571 triacylglycerol lipas 92.7 0.29 6.3E-06 54.9 7.4 40 825-869 207-246 (413)
226 COG1505 Serine proteases of th 92.7 0.079 1.7E-06 60.7 2.9 116 764-886 420-537 (648)
227 PLN02162 triacylglycerol lipas 92.6 0.3 6.6E-06 55.1 7.3 50 847-896 276-333 (475)
228 PLN02454 triacylglycerol lipas 92.4 0.28 6.2E-06 54.9 6.8 68 827-895 207-281 (414)
229 KOG3101|consensus 92.2 0.14 3.1E-06 50.5 3.6 124 765-890 44-181 (283)
230 PF02450 LCAT: Lecithin:choles 92.1 0.21 4.5E-06 57.1 5.6 59 825-887 99-163 (389)
231 COG2382 Fes Enterochelin ester 92.0 0.42 9E-06 50.8 7.1 124 752-889 83-217 (299)
232 PF03583 LIP: Secretory lipase 91.9 0.67 1.5E-05 50.6 9.0 88 788-885 20-114 (290)
233 COG1075 LipA Predicted acetylt 91.8 0.4 8.6E-06 53.6 7.3 38 849-886 127-166 (336)
234 PLN02719 triacylglycerol lipas 91.5 0.34 7.5E-06 55.3 6.3 69 825-895 276-355 (518)
235 KOG2565|consensus 91.5 0.18 3.9E-06 54.3 3.7 109 765-886 153-266 (469)
236 COG5236 Uncharacterized conser 91.4 0.14 3E-06 53.7 2.8 117 585-744 151-273 (493)
237 COG5236 Uncharacterized conser 91.3 0.2 4.4E-06 52.5 3.8 127 143-287 151-306 (493)
238 PLN02753 triacylglycerol lipas 91.3 0.44 9.6E-06 54.7 6.8 69 825-895 290-369 (531)
239 PLN02408 phospholipase A1 90.5 0.42 9.1E-06 52.9 5.6 64 826-894 182-250 (365)
240 PF11187 DUF2974: Protein of u 90.4 0.51 1.1E-05 49.1 5.9 49 831-883 70-122 (224)
241 PLN02324 triacylglycerol lipas 90.3 0.58 1.3E-05 52.5 6.6 65 826-895 197-275 (415)
242 COG3946 VirJ Type IV secretory 90.2 0.81 1.8E-05 50.2 7.2 59 787-862 280-339 (456)
243 COG5048 FOG: Zn-finger [Genera 89.9 0.25 5.3E-06 58.0 3.6 53 8-60 289-348 (467)
244 KOG2624|consensus 89.8 0.61 1.3E-05 52.7 6.3 118 763-885 71-200 (403)
245 PF01083 Cutinase: Cutinase; 89.7 1.4 3E-05 44.2 8.2 109 768-886 8-124 (179)
246 COG2936 Predicted acyl esteras 89.6 0.54 1.2E-05 54.7 5.8 83 794-886 79-161 (563)
247 COG4782 Uncharacterized protei 89.2 1.4 3E-05 48.0 8.1 96 762-867 113-209 (377)
248 TIGR00622 ssl1 transcription f 89.2 0.74 1.6E-05 41.2 5.0 107 8-150 1-110 (112)
249 PLN02934 triacylglycerol lipas 89.2 0.83 1.8E-05 52.3 6.7 50 847-896 319-376 (515)
250 PLN02761 lipase class 3 family 89.2 0.81 1.7E-05 52.6 6.6 70 826-896 272-353 (527)
251 COG3150 Predicted esterase [Ge 89.0 1.2 2.5E-05 42.9 6.4 79 768-870 2-80 (191)
252 KOG3967|consensus 88.8 3.8 8.2E-05 40.8 10.0 99 760-873 96-214 (297)
253 KOG2369|consensus 88.7 0.25 5.5E-06 55.4 2.2 59 824-885 160-226 (473)
254 PF13913 zf-C2HC_2: zinc-finge 88.5 0.28 6.1E-06 31.1 1.5 21 683-704 3-23 (25)
255 PLN02802 triacylglycerol lipas 88.4 0.72 1.6E-05 52.8 5.6 64 826-894 312-380 (509)
256 TIGR00622 ssl1 transcription f 87.7 1.6 3.5E-05 39.0 6.2 21 450-470 14-34 (112)
257 KOG2482|consensus 87.3 0.82 1.8E-05 48.3 4.7 143 332-474 131-357 (423)
258 PF13913 zf-C2HC_2: zinc-finge 87.2 0.36 7.9E-06 30.6 1.3 19 587-606 4-22 (25)
259 KOG2482|consensus 85.9 0.65 1.4E-05 49.0 3.1 50 263-312 280-357 (423)
260 COG5153 CVT17 Putative lipase 85.6 0.92 2E-05 46.9 4.0 49 831-885 261-309 (425)
261 KOG4540|consensus 85.6 0.92 2E-05 46.9 4.0 49 831-885 261-309 (425)
262 TIGR03712 acc_sec_asp2 accesso 84.8 7.1 0.00015 44.5 10.8 128 765-909 289-426 (511)
263 smart00451 ZnF_U1 U1-like zinc 84.8 0.49 1.1E-05 32.8 1.2 23 682-704 3-25 (35)
264 PF06028 DUF915: Alpha/beta hy 84.7 2.5 5.4E-05 44.9 7.0 61 825-888 82-147 (255)
265 PF11339 DUF3141: Protein of u 84.2 9 0.00019 44.0 11.2 106 765-885 69-176 (581)
266 smart00451 ZnF_U1 U1-like zinc 83.6 0.6 1.3E-05 32.3 1.2 23 585-607 3-25 (35)
267 PF07519 Tannase: Tannase and 83.5 1.3 2.7E-05 52.0 4.6 111 768-886 31-152 (474)
268 KOG1551|consensus 83.1 2.5 5.5E-05 43.5 5.8 130 732-874 80-220 (371)
269 PLN02517 phosphatidylcholine-s 82.5 2.3 4.9E-05 49.7 5.9 57 826-885 193-264 (642)
270 COG3545 Predicted esterase of 82.3 2.2 4.9E-05 41.5 4.9 55 832-887 43-97 (181)
271 KOG4388|consensus 82.0 6.8 0.00015 45.0 9.1 97 766-876 397-503 (880)
272 PLN02847 triacylglycerol lipas 81.7 2.7 5.8E-05 49.1 6.1 22 847-868 249-270 (633)
273 KOG2112|consensus 81.4 3.1 6.8E-05 41.8 5.7 59 825-885 71-129 (206)
274 KOG2893|consensus 81.3 0.61 1.3E-05 46.4 0.7 45 146-193 13-58 (341)
275 KOG1516|consensus 80.1 2.6 5.6E-05 50.9 5.8 115 765-885 112-233 (545)
276 KOG3975|consensus 79.4 15 0.00033 37.9 9.8 109 749-867 15-128 (301)
277 KOG2893|consensus 77.8 0.82 1.8E-05 45.5 0.4 41 400-443 13-53 (341)
278 cd00350 rubredoxin_like Rubred 77.4 1.5 3.2E-05 30.0 1.5 11 9-19 2-12 (33)
279 PF03959 FSH1: Serine hydrolas 77.1 5.2 0.00011 41.4 6.2 112 765-886 4-147 (212)
280 KOG4569|consensus 76.4 3.4 7.4E-05 46.1 4.9 58 826-890 155-218 (336)
281 COG4814 Uncharacterized protei 71.7 4.9 0.00011 41.6 4.1 39 832-873 122-160 (288)
282 PF12048 DUF3530: Protein of u 71.4 52 0.0011 36.3 12.5 49 850-898 194-243 (310)
283 KOG2186|consensus 71.4 1.9 4.1E-05 43.9 1.2 46 9-56 4-49 (276)
284 PF02089 Palm_thioest: Palmito 70.7 13 0.00027 39.8 7.1 67 822-893 58-125 (279)
285 PLN02633 palmitoyl protein thi 67.1 36 0.00079 36.9 9.6 112 765-893 26-140 (314)
286 PLN02748 tRNA dimethylallyltra 65.5 20 0.00044 41.6 8.0 85 763-857 20-120 (468)
287 COG4049 Uncharacterized protei 64.9 2.3 5E-05 32.1 0.2 31 677-707 12-42 (65)
288 KOG4173|consensus 64.2 2.6 5.6E-05 41.3 0.4 49 504-554 79-129 (253)
289 KOG4173|consensus 61.7 3.1 6.7E-05 40.8 0.5 98 513-612 48-173 (253)
290 smart00834 CxxC_CXXC_SSSS Puta 60.4 4.9 0.00011 28.9 1.3 29 9-43 6-34 (41)
291 PF11288 DUF3089: Protein of u 60.0 17 0.00037 37.0 5.4 44 825-870 73-116 (207)
292 PF06524 NOA36: NOA36 protein; 59.9 5.8 0.00013 40.5 2.0 25 115-139 208-232 (314)
293 COG1107 Archaea-specific RecJ- 59.6 1E+02 0.0022 36.0 11.8 137 765-938 418-563 (715)
294 COG0846 SIR2 NAD-dependent pro 58.3 25 0.00054 37.1 6.5 102 674-803 114-217 (250)
295 TIGR01849 PHB_depoly_PhaZ poly 58.3 39 0.00085 38.6 8.5 89 784-889 120-213 (406)
296 PRK00091 miaA tRNA delta(2)-is 57.3 36 0.00079 37.3 7.8 40 766-810 5-46 (307)
297 PLN02606 palmitoyl-protein thi 56.5 93 0.002 33.8 10.4 43 850-892 96-140 (306)
298 KOG2186|consensus 55.4 7.5 0.00016 39.8 1.9 47 559-607 4-50 (276)
299 TIGR02098 MJ0042_CXXC MJ0042 f 55.1 7.3 0.00016 27.5 1.4 13 9-21 3-15 (38)
300 TIGR02605 CxxC_CxxC_SSSS putat 54.9 6.6 0.00014 30.1 1.2 30 8-43 5-34 (52)
301 COG2888 Predicted Zn-ribbon RN 54.6 10 0.00022 29.5 2.1 35 682-721 27-61 (61)
302 COG4049 Uncharacterized protei 54.5 3.8 8.3E-05 30.9 -0.2 22 36-57 18-39 (65)
303 PRK00398 rpoP DNA-directed RNA 54.4 7.2 0.00016 29.0 1.3 30 7-45 2-31 (46)
304 PF09723 Zn-ribbon_8: Zinc rib 54.3 6.8 0.00015 28.5 1.1 30 8-43 5-34 (42)
305 smart00659 RPOLCX RNA polymera 54.3 8.6 0.00019 28.3 1.6 27 8-44 2-28 (44)
306 PF09949 DUF2183: Uncharacteri 54.0 65 0.0014 28.7 7.4 32 847-878 63-96 (100)
307 PF09986 DUF2225: Uncharacteri 53.9 6.4 0.00014 40.7 1.3 13 9-21 6-18 (214)
308 PF12013 DUF3505: Protein of u 53.6 31 0.00067 31.3 5.6 26 504-529 80-109 (109)
309 smart00531 TFIIE Transcription 53.5 6.7 0.00015 37.8 1.3 35 35-72 99-133 (147)
310 PF02273 Acyl_transf_2: Acyl t 52.7 67 0.0014 33.5 8.1 106 764-885 29-135 (294)
311 TIGR00373 conserved hypothetic 52.3 6.9 0.00015 38.2 1.2 16 36-51 110-125 (158)
312 PF12013 DUF3505: Protein of u 52.0 19 0.0004 32.7 3.9 25 452-476 81-109 (109)
313 smart00614 ZnF_BED BED zinc fi 51.1 10 0.00022 28.9 1.6 23 684-706 20-48 (50)
314 PF13717 zinc_ribbon_4: zinc-r 50.8 11 0.00024 26.4 1.7 12 10-21 4-15 (36)
315 KOG2906|consensus 50.3 12 0.00025 32.3 2.0 75 612-692 21-103 (105)
316 KOG3253|consensus 50.0 33 0.00072 40.1 6.1 112 763-885 174-287 (784)
317 PLN02840 tRNA dimethylallyltra 49.9 49 0.0011 37.8 7.5 84 764-857 20-119 (421)
318 PF10571 UPF0547: Uncharacteri 49.7 13 0.00028 23.9 1.7 10 506-515 16-25 (26)
319 PF13719 zinc_ribbon_5: zinc-r 48.6 12 0.00027 26.3 1.6 10 10-19 4-13 (37)
320 PRK14890 putative Zn-ribbon RN 47.8 14 0.0003 28.8 1.9 35 682-721 25-59 (59)
321 PLN02165 adenylate isopentenyl 47.5 57 0.0012 36.0 7.3 43 763-810 41-85 (334)
322 PF00301 Rubredoxin: Rubredoxi 46.1 15 0.00032 27.5 1.8 34 479-512 2-42 (47)
323 PRK06266 transcription initiat 45.8 10 0.00022 37.9 1.2 29 35-71 117-145 (178)
324 PF06792 UPF0261: Uncharacteri 44.2 1.9E+02 0.0042 32.8 10.8 152 766-923 2-173 (403)
325 KOG2807|consensus 44.0 24 0.00052 37.7 3.5 31 115-150 344-374 (378)
326 cd00730 rubredoxin Rubredoxin; 43.3 23 0.0005 26.9 2.5 8 505-512 35-42 (50)
327 COG0324 MiaA tRNA delta(2)-iso 43.1 80 0.0017 34.5 7.5 71 784-859 17-103 (308)
328 PF03604 DNA_RNApol_7kD: DNA d 43.0 18 0.00039 24.6 1.7 11 9-19 1-11 (32)
329 PF06524 NOA36: NOA36 protein; 42.5 18 0.00038 37.2 2.3 26 314-339 207-232 (314)
330 TIGR00174 miaA tRNA isopenteny 42.1 78 0.0017 34.3 7.3 81 767-857 1-97 (287)
331 smart00734 ZnF_Rad18 Rad18-lik 42.1 18 0.00038 23.3 1.5 20 683-703 2-21 (26)
332 PF09538 FYDLN_acid: Protein o 42.0 17 0.00037 32.8 1.9 9 64-72 11-19 (108)
333 KOG2541|consensus 42.0 1.5E+02 0.0033 31.4 8.8 112 765-892 24-136 (296)
334 COG5151 SSL1 RNA polymerase II 40.7 29 0.00063 36.5 3.6 48 480-527 364-411 (421)
335 PHA00626 hypothetical protein 40.3 16 0.00034 28.0 1.1 12 62-73 23-34 (59)
336 PF02892 zf-BED: BED zinc fing 40.2 16 0.00035 26.9 1.2 26 680-705 14-44 (45)
337 PRK14556 pyrH uridylate kinase 39.6 95 0.0021 32.8 7.3 95 765-867 136-238 (249)
338 KOG3043|consensus 39.6 13 0.00028 38.0 0.8 88 790-884 63-155 (242)
339 smart00531 TFIIE Transcription 39.2 16 0.00034 35.3 1.4 13 559-571 100-112 (147)
340 PF09538 FYDLN_acid: Protein o 39.2 23 0.00049 32.0 2.2 9 506-514 28-36 (108)
341 COG3243 PhaC Poly(3-hydroxyalk 39.0 92 0.002 35.2 7.3 86 787-885 131-218 (445)
342 PRK04023 DNA polymerase II lar 38.8 27 0.00057 43.5 3.4 49 531-623 626-674 (1121)
343 PRK00464 nrdR transcriptional 38.7 12 0.00027 36.1 0.5 8 65-72 31-38 (154)
344 PRK02399 hypothetical protein; 38.6 1.5E+02 0.0032 33.6 8.9 33 850-882 98-134 (406)
345 PF03283 PAE: Pectinacetyleste 38.6 68 0.0015 36.1 6.5 52 835-887 143-199 (361)
346 PF14353 CpXC: CpXC protein 37.3 11 0.00023 35.4 -0.1 21 35-55 38-58 (128)
347 COG5441 Uncharacterized conser 36.9 1.7E+02 0.0036 31.3 8.2 134 786-924 20-172 (401)
348 KOG1202|consensus 36.9 6.6E+02 0.014 32.7 14.3 37 829-868 2164-2201(2376)
349 cd01412 SIRT5_Af1_CobB SIRT5_A 36.7 92 0.002 32.5 6.8 99 675-804 102-200 (224)
350 PRK08118 topology modulation p 36.6 1.4E+02 0.0031 29.3 7.8 36 767-807 3-38 (167)
351 COG1996 RPC10 DNA-directed RNA 36.2 16 0.00035 27.5 0.7 29 7-44 5-33 (49)
352 PF05705 DUF829: Eukaryotic pr 35.3 1.6E+02 0.0035 30.9 8.5 105 766-887 1-115 (240)
353 PF05443 ROS_MUCR: ROS/MUCR tr 34.8 15 0.00032 34.4 0.4 29 683-714 73-101 (132)
354 COG0068 HypF Hydrogenase matur 34.6 8.2 0.00018 45.9 -1.6 79 63-152 102-182 (750)
355 COG1576 Uncharacterized conser 33.0 58 0.0012 31.3 3.9 43 795-860 67-109 (155)
356 TIGR01206 lysW lysine biosynth 32.3 32 0.0007 26.6 1.8 32 8-46 2-33 (54)
357 COG1592 Rubrerythrin [Energy p 32.1 24 0.00052 34.4 1.4 8 532-539 135-142 (166)
358 COG1592 Rubrerythrin [Energy p 31.8 26 0.00057 34.2 1.6 22 558-592 134-156 (166)
359 PRK14729 miaA tRNA delta(2)-is 31.4 1.5E+02 0.0032 32.5 7.3 82 765-857 4-101 (300)
360 TIGR00373 conserved hypothetic 31.4 22 0.00047 34.8 1.0 16 532-547 110-125 (158)
361 PF14353 CpXC: CpXC protein 30.9 51 0.0011 30.8 3.4 21 289-309 38-58 (128)
362 PF08274 PhnA_Zn_Ribbon: PhnA 30.6 25 0.00055 23.5 0.8 11 33-43 17-27 (30)
363 PF15269 zf-C2H2_7: Zinc-finge 30.5 29 0.00063 25.1 1.2 23 451-473 20-42 (54)
364 PRK14905 triosephosphate isome 30.5 2.1E+02 0.0045 32.2 8.5 123 795-935 168-294 (355)
365 PRK04023 DNA polymerase II lar 30.4 43 0.00094 41.7 3.3 9 425-433 627-635 (1121)
366 COG4957 Predicted transcriptio 29.7 29 0.00063 31.9 1.3 25 683-710 77-101 (148)
367 PRK00103 rRNA large subunit me 29.6 60 0.0013 31.6 3.6 47 795-863 67-113 (157)
368 PF05443 ROS_MUCR: ROS/MUCR tr 29.6 25 0.00053 33.0 0.9 23 586-611 73-95 (132)
369 cd01413 SIR2_Af2 SIR2_Af2: Arc 28.9 1.4E+02 0.0029 31.2 6.4 101 675-803 106-206 (222)
370 KOG1280|consensus 28.5 23 0.00049 38.2 0.5 35 585-619 79-116 (381)
371 KOG4389|consensus 28.3 1.4E+02 0.0029 34.5 6.4 109 767-884 137-255 (601)
372 cd01407 SIR2-fam SIR2 family o 28.1 1.6E+02 0.0034 30.6 6.7 102 674-803 101-202 (218)
373 PF10081 Abhydrolase_9: Alpha/ 27.5 2.2E+02 0.0048 30.5 7.5 108 766-884 33-147 (289)
374 TIGR02300 FYDLN_acid conserved 27.3 47 0.001 30.6 2.2 11 531-541 26-36 (129)
375 KOG1280|consensus 27.0 51 0.0011 35.7 2.8 39 680-718 77-117 (381)
376 PRK09678 DNA-binding transcrip 26.9 29 0.00063 28.6 0.8 13 36-48 28-42 (72)
377 PRK06266 transcription initiat 26.6 31 0.00067 34.5 1.1 16 63-78 118-133 (178)
378 PF08237 PE-PPE: PE-PPE domain 26.5 1.3E+02 0.0028 31.3 5.8 46 821-867 21-66 (225)
379 TIGR00515 accD acetyl-CoA carb 26.4 1.3E+02 0.0028 32.6 5.8 17 849-865 194-212 (285)
380 PF12760 Zn_Tnp_IS1595: Transp 26.4 46 0.001 24.7 1.7 10 61-70 36-45 (46)
381 TIGR00246 tRNA_RlmH_YbeA rRNA 26.3 76 0.0016 30.8 3.6 47 794-863 64-110 (153)
382 PRK14138 NAD-dependent deacety 26.0 1.3E+02 0.0029 31.7 5.8 101 676-803 113-213 (244)
383 PF15269 zf-C2H2_7: Zinc-finge 25.8 45 0.00099 24.1 1.5 23 143-165 20-42 (54)
384 KOG2593|consensus 25.8 30 0.00066 38.7 0.9 11 697-707 188-198 (436)
385 COG1773 Rubredoxin [Energy pro 25.8 52 0.0011 25.5 1.9 12 479-490 4-15 (55)
386 PRK09678 DNA-binding transcrip 25.0 27 0.0006 28.8 0.3 41 36-77 2-44 (72)
387 PF01972 SDH_sah: Serine dehyd 24.6 2.8E+02 0.006 29.6 7.5 82 786-878 39-125 (285)
388 PF09845 DUF2072: Zn-ribbon co 24.4 46 0.00099 30.9 1.6 27 585-621 1-28 (131)
389 PTZ00409 Sir2 (Silent Informat 24.4 2.1E+02 0.0046 30.8 7.0 105 675-803 130-234 (271)
390 PRK00481 NAD-dependent deacety 23.4 1.8E+02 0.0038 30.8 6.2 99 675-804 115-213 (242)
391 PF02590 SPOUT_MTase: Predicte 23.4 77 0.0017 30.8 3.1 52 794-869 66-117 (155)
392 COG1997 RPL43A Ribosomal prote 23.1 42 0.00091 28.5 1.1 11 63-73 54-64 (89)
393 PF02176 zf-TRAF: TRAF-type zi 23.1 60 0.0013 25.5 2.0 7 50-56 25-31 (60)
394 TIGR01384 TFS_arch transcripti 23.0 1E+02 0.0022 27.5 3.7 11 683-693 91-101 (104)
395 PF01221 Dynein_light: Dynein 23.0 63 0.0014 28.0 2.2 29 830-862 36-66 (89)
396 cd01410 SIRT7 SIRT7: Eukaryoti 22.9 2.7E+02 0.0058 28.6 7.1 103 675-803 88-190 (206)
397 KOG2960|consensus 22.6 86 0.0019 31.6 3.2 33 849-881 77-110 (328)
398 KOG1701|consensus 22.5 68 0.0015 35.8 2.8 38 506-543 276-314 (468)
399 PF09994 DUF2235: Uncharacteri 22.5 4.8E+02 0.01 28.2 9.4 20 848-867 91-110 (277)
400 PF12242 Eno-Rase_NADH_b: NAD( 22.3 2.9E+02 0.0062 23.3 5.6 42 829-870 20-61 (78)
401 PF05191 ADK_lid: Adenylate ki 22.1 40 0.00087 23.6 0.6 8 588-595 4-11 (36)
402 PRK13938 phosphoheptose isomer 21.9 3.5E+02 0.0075 27.5 7.7 23 849-871 46-68 (196)
403 COG4474 Uncharacterized protei 21.1 7.7E+02 0.017 24.2 9.0 31 848-878 42-77 (180)
404 PF04083 Abhydro_lipase: Parti 20.8 79 0.0017 25.4 2.2 23 763-785 41-63 (63)
405 KOG2029|consensus 20.7 1.2E+02 0.0025 35.8 4.2 22 847-868 524-545 (697)
406 TIGR00162 conserved hypothetic 20.7 8.8E+02 0.019 24.4 11.0 39 764-802 14-55 (188)
407 COG3962 Acetolactate synthase 20.3 81 0.0018 35.8 2.8 43 763-808 461-503 (617)
408 COG4957 Predicted transcriptio 20.1 50 0.0011 30.5 1.0 23 586-611 77-99 (148)
409 PF14446 Prok-RING_1: Prokaryo 20.1 66 0.0014 24.9 1.5 25 683-719 6-30 (54)
No 1
>KOG2182|consensus
Probab=100.00 E-value=2.7e-66 Score=554.87 Aligned_cols=313 Identities=40% Similarity=0.706 Sum_probs=278.0
Q ss_pred cCCCCcchhhhhcccCCCCCcccceEEEEEecccccCCCcEEEEECCCCCCccccc--ccccHHHHHHHcCCeEEEEeee
Q psy3685 727 EKLPSDQWFEQKLDHFNIEEPRTWKQRYFVNSTWYKHGGPIFLMIGGEGEASAKWM--VEGTWLDYAHNHNALAVQVEHR 804 (1042)
Q Consensus 727 ~~~~~~~~f~q~~dhf~~~~~~tf~qry~~~~~~~~~~~pi~~~~gg~g~~~~~~~--~~~~~~~la~~~~a~~~~~ehR 804 (1042)
..+++++||+|++|||+..+..+|+|||+.|+.|++++|||||||||||+.++.|+ ..+.+..||+++||.|+.||||
T Consensus 48 ~~~~~~~~~~Q~lDhF~~~~~~~~Qq~~y~n~~~~~~~gPiFLmIGGEgp~~~~wv~~~~~~~~~~AkkfgA~v~~lEHR 127 (514)
T KOG2182|consen 48 PANVEQSTFTQKLDHFDSSNGKFFQQRFYNNNQWAKPGGPIFLMIGGEGPESDKWVGNENLTWLQWAKKFGATVFQLEHR 127 (514)
T ss_pred cccccccchhhhhhhhhcchhhhhhhheeeccccccCCCceEEEEcCCCCCCCCccccCcchHHHHHHHhCCeeEEeeee
Confidence 35788999999999996444445555555555566999999999999999998885 3447889999999999999999
Q ss_pred ecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 805 FYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 805 ~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
|||+|.|..++++.||+|||++|||+|+|+||+.++.+++..++.|||+|||||+|+||||+|++|||++.|||||||||
T Consensus 128 FYG~S~P~~~~st~nlk~LSs~QALaDla~fI~~~n~k~n~~~~~~WitFGgSYsGsLsAW~R~~yPel~~GsvASSapv 207 (514)
T KOG2182|consen 128 FYGQSSPIGDLSTSNLKYLSSLQALADLAEFIKAMNAKFNFSDDSKWITFGGSYSGSLSAWFREKYPELTVGSVASSAPV 207 (514)
T ss_pred ccccCCCCCCCcccchhhhhHHHHHHHHHHHHHHHHhhcCCCCCCCeEEECCCchhHHHHHHHHhCchhheeecccccce
Confidence 99999999999999999999999999999999999999987666799999999999999999999999999999999999
Q ss_pred ccccCChhhHHHHHHHhhccChhhHHHHHHHHHHHHHHhcCCchHHHHHHHhcCCCCCCC-CChhcHHHHHHHHHHHhhh
Q psy3685 885 RAVVDFPEYFGVVADALATVSTECVKAVQTATHTISKMLKSPSDAKYLTEQFKLCTPLDI-NNAKDVSSFVESLADNIAG 963 (1042)
Q Consensus 885 ~~~~~~~~~~~~v~~~~~~~~~~c~~~i~~~~~~~~~~~~~~~~~~~l~~~f~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 963 (1042)
+|++||+||+++|++.++.++++|.++|+.+|..|+.++.+.+++++|++.|++|++++. .+..|+..|+.+|+.+|++
T Consensus 208 ~A~~DF~EY~~VVe~s~~~~~~~C~~ai~~~f~~~~~l~~t~~gr~~Lk~~Fnl~~~f~~~~s~~d~~~ff~nv~~~Fqg 287 (514)
T KOG2182|consen 208 LAKVDFYEYLMVVEESLRRYSPECADAIKEGFKSMEELLLTKGGRQALKSLFNLCPPFDNNVSDTDQHNFFSNVYSNFQG 287 (514)
T ss_pred eEEecHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHhcccHHHHHHHHhccCCccccchhHHHHHHHHHHHHHhhhh
Confidence 999999999999999999999999999999999999999999999999999999999952 2445788999999999999
Q ss_pred hhccCCCCCCC-CCCCCHHHHHHHhhcC-CCCHHHHHHHHHHHHHhcCCCCCccccccHHHHHHHhhcCCCCCCCCCCCC
Q psy3685 964 VVQYNKDNRES-SNHITIDDLCALLMDK-KQDPVARYAAVNDLILSKEKKKNPCVDYKYEKMIEDMKNTSWDSEMANGGK 1041 (1042)
Q Consensus 964 ~~~y~~~~~~~-~~~~~~~~~C~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~ 1041 (1042)
++||.++..++ ...+.|+++|++|++. ..+++.++..++.+++.. .+..|+|.+|+.||..|+++.+..+.+.++|
T Consensus 288 vvQY~gd~~~~~~~~~~i~~~C~~l~n~t~~d~v~~~~~~~~~~~~~--~~~~c~~~~Y~~~i~~~~n~~~~~~~~~a~r 365 (514)
T KOG2182|consen 288 VVQYSGDNSNATASGLGIPAMCDILNNKTPGDDVVAVNKYMNWFNNG--FGYGCLDNTYNGMISYLKNSTEPGEDAAADR 365 (514)
T ss_pred heeecCCCCcccccccChhHHHHHhhcCCCCchHHHHHHHHHHHHhc--cCCCcCCccHHHHHHHhhcccCcCcccccch
Confidence 99999998544 4568999999999985 478899999988888763 3467999999999999998777666666666
No 2
>KOG2183|consensus
Probab=100.00 E-value=6.1e-66 Score=534.03 Aligned_cols=291 Identities=31% Similarity=0.537 Sum_probs=263.4
Q ss_pred cCCCCcchhhhhcccCCCCCcccceEEEEEecccccCC-CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeee
Q psy3685 727 EKLPSDQWFEQKLDHFNIEEPRTWKQRYFVNSTWYKHG-GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRF 805 (1042)
Q Consensus 727 ~~~~~~~~f~q~~dhf~~~~~~tf~qry~~~~~~~~~~-~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~ 805 (1042)
+-++++.||.|+||||++.++.||.|||+||+++|+++ ||||||+|+||++..+..++|++.++|.+++|+||++||||
T Consensus 42 ~~~ye~~yf~q~LDHFsF~~~~tF~qRylin~~fw~~g~gPIffYtGNEGdie~Fa~ntGFm~D~Ap~~~AllVFaEHRy 121 (492)
T KOG2183|consen 42 EYNYETRYFQQPLDHFSFTDNKTFDQRYLINDDFWKKGEGPIFFYTGNEGDIEWFANNTGFMWDLAPELKALLVFAEHRY 121 (492)
T ss_pred cccceeEEeecccccccccCccceeeEEEEecccccCCCCceEEEeCCcccHHHHHhccchHHhhhHhhCceEEEeehhc
Confidence 55788999999999999999999999999999999877 99999999999999988999999999999999999999999
Q ss_pred cccccCCCCCCc---ccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 806 YGKSHPLSDLSV---ESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 806 ~G~S~p~~~~~~---~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
||+|.|+++.+. ++|.|||+||||||+|++|+.++.+.+ +...|||+||||||||||||||+||||+|.||+|+||
T Consensus 122 YGeS~PFG~~s~k~~~hlgyLtseQALADfA~ll~~lK~~~~-a~~~pvIafGGSYGGMLaAWfRlKYPHiv~GAlAaSA 200 (492)
T KOG2183|consen 122 YGESLPFGSQSYKDARHLGYLTSEQALADFAELLTFLKRDLS-AEASPVIAFGGSYGGMLAAWFRLKYPHIVLGALAASA 200 (492)
T ss_pred cccCCCCcchhccChhhhccccHHHHHHHHHHHHHHHhhccc-cccCcEEEecCchhhHHHHHHHhcChhhhhhhhhccC
Confidence 999999998654 569999999999999999999999854 4567999999999999999999999999999999999
Q ss_pred ccccc---cCChhhHHHHHHHhhccChhhHHHHHHHHHHHHHHhcCCchHHHHHHHhcCCCCCCCCChhcHHHHHHHHHH
Q psy3685 883 PLRAV---VDFPEYFGVVADALATVSTECVKAVQTATHTISKMLKSPSDAKYLTEQFKLCTPLDINNAKDVSSFVESLAD 959 (1042)
Q Consensus 883 ~~~~~---~~~~~~~~~v~~~~~~~~~~c~~~i~~~~~~~~~~~~~~~~~~~l~~~f~~~~~~~~~~~~~~~~~~~~~~~ 959 (1042)
||... +|-..|+..|.+.+...+++|...|+++|..|+.+..++.+.++|...|++|.+++ +|..++.+|++..+.
T Consensus 201 Pvl~f~d~vp~~~f~~ivT~~F~~as~~C~~~I~~sW~ai~~l~~~~nG~q~Ls~~f~lc~~ln-~d~~~l~d~l~ea~~ 279 (492)
T KOG2183|consen 201 PVLYFEDTVPKDVFYRIVTRDFKDASPNCRNTIRKSWDAIDRLAAKDNGLQILSKAFKLCKPLN-DDIGDLKDYLREAYE 279 (492)
T ss_pred ceEeecCCCCcchhhhHHHHHHHhhcHHHHHHHHHHHHHHHHHhcCcchHHHHHHHhhhccccc-ccHHHHHHHHHHHHH
Confidence 99755 56778999999999999999999999999999999999999999999999999998 689999999999999
Q ss_pred Hhhhh-hccCCCCCCCCCCCCHHHHHHHhhcCCCCHHHHHHHHHHHH--HhcCCCCCcccccc
Q psy3685 960 NIAGV-VQYNKDNRESSNHITIDDLCALLMDKKQDPVARYAAVNDLI--LSKEKKKNPCVDYK 1019 (1042)
Q Consensus 960 ~~~~~-~~y~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~c~~~~ 1019 (1042)
.++|+ ++|+.++..++|++||+++|+.|+....+..+.+.++...+ +++|+|...|+|++
T Consensus 280 ylAMVdYPy~t~Fl~pLPa~PV~~~C~~i~~~~~~~~~ll~~i~a~~~~yyNytg~~~C~d~s 342 (492)
T KOG2183|consen 280 YLAMVDYPYPTSFLAPLPAWPVKVVCKYINAPGPNDSDLLDRIFAAVNLYYNYTGSEKCYDIS 342 (492)
T ss_pred HHHHhcCCCCccccCcCCCCcHHHHHHHhccCCCChHHHHHHHHHHhhheeccCCCcchhccc
Confidence 99999 99999999999999999999999866544344444443332 23456788899997
No 3
>PF05577 Peptidase_S28: Serine carboxypeptidase S28; InterPro: IPR008758 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine peptidases belong to MEROPS peptidase family S28 (clan SC). The predicted active site residues for members of this family and family S10 occur in the same order in the sequence: S, D, H. These serine proteases include several eukaryotic enzymes such as lysosomal Pro-X carboxypeptidase, dipeptidyl-peptidase II, and thymus-specific serine peptidase [, , , ].; GO: 0008236 serine-type peptidase activity, 0006508 proteolysis; PDB: 3N2Z_B 3JYH_A 3N0T_C.
Probab=100.00 E-value=3.1e-61 Score=558.19 Aligned_cols=269 Identities=46% Similarity=0.802 Sum_probs=219.6
Q ss_pred hhcccCCCCCcccceEEEEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCC
Q psy3685 737 QKLDHFNIEEPRTWKQRYFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLS 816 (1042)
Q Consensus 737 q~~dhf~~~~~~tf~qry~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~ 816 (1042)
|+|||||+.+++||+||||+|++||+++||||||+||||+++..++..|++.+||+++||+||+|||||||+|.|+++++
T Consensus 1 Q~lDHf~~~~~~tf~qRY~~n~~~~~~~gpifl~~ggE~~~~~~~~~~~~~~~lA~~~~a~~v~lEHRyYG~S~P~~~~s 80 (434)
T PF05577_consen 1 QPLDHFNPSNNGTFSQRYWVNDQYYKPGGPIFLYIGGEGPIEPFWINNGFMWELAKEFGALVVALEHRYYGKSQPFGDLS 80 (434)
T ss_dssp EES-SS-SSTT-EEEEEEEEE-TT--TTSEEEEEE--SS-HHHHHHH-HHHHHHHHHHTEEEEEE--TTSTTB-TTGGGG
T ss_pred CCCCCCCCCCCCeEEEEEEEEhhhcCCCCCEEEEECCCCccchhhhcCChHHHHHHHcCCcEEEeehhhhcCCCCccccc
Confidence 89999999889999999999999999999999999999999998888899999999999999999999999999999999
Q ss_pred cccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccccCChhhHHH
Q psy3685 817 VESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVVDFPEYFGV 896 (1042)
Q Consensus 817 ~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~~~~~~~~~ 896 (1042)
++||+|||++|||+|+|+||++++.++...++.|||+|||||||+||||+|+|||++|+|||||||||.+++||++|+++
T Consensus 81 ~~nL~yLt~~QALaD~a~F~~~~~~~~~~~~~~pwI~~GgSY~G~Laaw~r~kyP~~~~ga~ASSapv~a~~df~~y~~~ 160 (434)
T PF05577_consen 81 TENLRYLTSEQALADLAYFIRYVKKKYNTAPNSPWIVFGGSYGGALAAWFRLKYPHLFDGAWASSAPVQAKVDFWEYFEV 160 (434)
T ss_dssp GSTTTC-SHHHHHHHHHHHHHHHHHHTTTGCC--EEEEEETHHHHHHHHHHHH-TTT-SEEEEET--CCHCCTTTHHHHH
T ss_pred hhhHHhcCHHHHHHHHHHHHHHHHHhhcCCCCCCEEEECCcchhHHHHHHHhhCCCeeEEEEeccceeeeecccHHHHHH
Confidence 99999999999999999999999988866677899999999999999999999999999999999999999999999999
Q ss_pred HHHHhhccChhhHHHHHHHHHHHHHHhcCCchHHHHHHHhcCCCCCCCCChhcHHHHHHHHHHHhhhhhccCCCCC--CC
Q psy3685 897 VADALATVSTECVKAVQTATHTISKMLKSPSDAKYLTEQFKLCTPLDINNAKDVSSFVESLADNIAGVVQYNKDNR--ES 974 (1042)
Q Consensus 897 v~~~~~~~~~~c~~~i~~~~~~~~~~~~~~~~~~~l~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~--~~ 974 (1042)
|++.+...+++|.++|++|++.|++++.++.+.++|++.|++|.+++..+..|+..|+..++.++++++||+++.. .+
T Consensus 161 v~~~~~~~~~~C~~~i~~a~~~i~~~~~~~~~~~~l~~~f~~~~~~~~~~~~d~~~~~~~~~~~~~~~~qy~~~~~~~~~ 240 (434)
T PF05577_consen 161 VTESLRKYGPNCYDAIRAAFDQIDKLLKTGNGRQQLKKKFKLCFPLDDKNDDDFAYFFSSIADAFQGMVQYPYPGNFNSP 240 (434)
T ss_dssp HHHHHHCCSCCHHHHHHHHHHHHHHHCCTCHHHHHHHHHCTBSS---TCHCHHHHHHHHHHHHHHHHHT--SS-EESSSE
T ss_pred HHHHHHhhccHHHHHHHHHHHHHHHHhhcccHHHHHHHHhhhccccccccchHHHHHHHHHHHHHHHHHhcCCCcccccC
Confidence 9999998888899999999999999999988888999999999999865557888899999999999999987654 23
Q ss_pred CCCCCHHHHHHHhhcCCC-CHHHHHHHHHHHH
Q psy3685 975 SNHITIDDLCALLMDKKQ-DPVARYAAVNDLI 1005 (1042)
Q Consensus 975 ~~~~~~~~~C~~~~~~~~-~~~~~~~~~~~~~ 1005 (1042)
.+.+++..+|+.|++... +....++.+..++
T Consensus 241 ~~~~~i~~~C~~l~~~~~~~~~~~~~~~~~~~ 272 (434)
T PF05577_consen 241 LPAWPIRQLCDSLTNASWPDEVLRLAALAQWY 272 (434)
T ss_dssp E-SSHHHHHHHHCHTSSSHHHHHHHHHHHHHH
T ss_pred CCCcchHHHhhhhcccccCchhHHHHHHHHHH
Confidence 456789999999987643 3334455555554
No 4
>PF05576 Peptidase_S37: PS-10 peptidase S37; InterPro: IPR008761 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. These group of serine peptidases belong to MEROPS peptidase family S37 (clan SC). The members of this group of secreted peptidases are restricted to bacteria. In Streptomyces lividans the peptidase removes tripeptides from the N terminus of extracellular proteins (tripeptidyl aminopeptidase,Tap) [, ].
Probab=99.94 E-value=9.9e-27 Score=245.37 Aligned_cols=268 Identities=19% Similarity=0.297 Sum_probs=187.8
Q ss_pred cchhhhhcccCCCCCcccceEEEEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccC
Q psy3685 732 DQWFEQKLDHFNIEEPRTWKQRYFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHP 811 (1042)
Q Consensus 732 ~~~f~q~~dhf~~~~~~tf~qry~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p 811 (1042)
..+|+|||||.+ ++.+||+||..+..+... .|+||++.|.+.... ....+++..++|+.|.+||||||.|.|
T Consensus 33 vl~y~QPvDH~~-P~~gtF~QRvtLlHk~~d--rPtV~~T~GY~~~~~-----p~r~Ept~Lld~NQl~vEhRfF~~SrP 104 (448)
T PF05576_consen 33 VLRYTQPVDHRH-PEKGTFQQRVTLLHKDFD--RPTVLYTEGYNVSTS-----PRRSEPTQLLDGNQLSVEHRFFGPSRP 104 (448)
T ss_pred EEeeecCCCCCC-CCCCceEEEEEEEEcCCC--CCeEEEecCcccccC-----ccccchhHhhccceEEEEEeeccCCCC
Confidence 457999999977 678899999999886554 799999999987532 223488999999999999999999999
Q ss_pred CCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc-cCC
Q psy3685 812 LSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV-VDF 890 (1042)
Q Consensus 812 ~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~-~~~ 890 (1042)
.+ .+|.|||++||.+|.+.++++++..|. .+||..|+|-|||.|.++|..||+.|+|.|+..||.... .+-
T Consensus 105 ~p----~DW~~Lti~QAA~D~Hri~~A~K~iY~----~kWISTG~SKGGmTa~y~rrFyP~DVD~tVaYVAP~~~~~~eD 176 (448)
T PF05576_consen 105 EP----ADWSYLTIWQAASDQHRIVQAFKPIYP----GKWISTGGSKGGMTAVYYRRFYPDDVDGTVAYVAPNDVVNRED 176 (448)
T ss_pred CC----CCcccccHhHhhHHHHHHHHHHHhhcc----CCceecCcCCCceeEEEEeeeCCCCCCeeeeeecccccCcccc
Confidence 75 789999999999999999999999995 389999999999999999999999999999999999753 443
Q ss_pred hhhHHHHHHHhhccChhhHHHHHHHHHHHHHHhcCCchHHHHHHHhcCCCCCCCCChhc-HHHHHHHHHHHhhhhhccCC
Q psy3685 891 PEYFGVVADALATVSTECVKAVQTATHTISKMLKSPSDAKYLTEQFKLCTPLDINNAKD-VSSFVESLADNIAGVVQYNK 969 (1042)
Q Consensus 891 ~~~~~~v~~~~~~~~~~c~~~i~~~~~~~~~~~~~~~~~~~l~~~f~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~y~~ 969 (1042)
..|...+.+. ..++|.+.|...-..+- .. ........+.+-.-.++..+.+.. ...|-..+.+...+++||+.
T Consensus 177 ~~y~~Fl~~V---Gt~eCR~~l~~~Qre~L--~R-R~~l~~~~~~yAa~~g~TF~~vG~~dra~E~~VLe~~faFWQy~~ 250 (448)
T PF05576_consen 177 SRYDRFLEKV---GTAECRDKLNDFQREAL--KR-RDELLPRYEAYAAENGLTFRTVGSLDRAYEYAVLEYPFAFWQYGT 250 (448)
T ss_pred hhHHHHHHhc---CCHHHHHHHHHHHHHHH--hh-HHHHHHHHHHHHHHcCCEEeecCcHHHHHHHHHhhhhhHhhccCC
Confidence 4444443332 25789988877643332 11 111111111111222222223222 35677889999999999985
Q ss_pred CCCCCCCCCCHHHHHHHhhcC--CCCHHHHHHHHHHHH---HhcCCCCCccccccHHHHHHHhhcCCCC
Q psy3685 970 DNRESSNHITIDDLCALLMDK--KQDPVARYAAVNDLI---LSKEKKKNPCVDYKYEKMIEDMKNTSWD 1033 (1042)
Q Consensus 970 ~~~~~~~~~~~~~~C~~~~~~--~~~~~~~~~~~~~~~---~~~~~~~~~c~~~~~~~~~~~~~~~~~~ 1033 (1042)
...|+.|.+. +.+..++++.+..+. ...+.+-.+-.++-||. ...|+..+++
T Consensus 251 -----------~~~C~~IP~~~~~AsddeL~~~l~~isg~s~ysDq~l~~y~pyyyQA-~teLG~p~~~ 307 (448)
T PF05576_consen 251 -----------PADCASIPADAKTASDDELFDFLDAISGFSFYSDQGLEPYTPYYYQA-GTELGYPGYD 307 (448)
T ss_pred -----------ccchhcCCCCcCCCCHHHHHHHHHhhcCccccccCCcccccChHHHH-HhhcCCCCCC
Confidence 3357777642 345555555444432 22222333455666664 4555555544
No 5
>KOG1074|consensus
Probab=99.94 E-value=5.3e-28 Score=270.22 Aligned_cols=206 Identities=23% Similarity=0.481 Sum_probs=166.0
Q ss_pred ccccCCCCcccCCchhHHHHHHHhcCC-ccccCccCcccCCchhHHHHHHHhcCc-----ccccC---cCCcccCCchhh
Q psy3685 504 RYKCDLCSKEFSSKGNLTEHMKQHQGI-RYKCDLCSKEFSRKDNLTEHKEIHQGI-----RYKCD---LCSKEFSRKDNL 574 (1042)
Q Consensus 504 ~~~C~~C~~~f~~~~~L~~H~~~H~~~-~~~C~~C~k~f~~~~~L~~H~~~H~~~-----~~~C~---~C~k~F~~~~~L 574 (1042)
|-+|-+|-+...-++.|+.|.|+|+|+ ||+|.+||++|.++.+|+.|+-+|... ++.|+ +|-+.|.....|
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccc
Confidence 468999999999999999999999998 999999999999999999999999765 49999 999999999999
Q ss_pred hhHhhhhcCC--------------cccCCccccccCCchhHHHHhhhc----------------CC-C----cccCCCCC
Q psy3685 575 TEHKEIHQGI--------------RYKCDLCSKKFSRKLNLTKHMKIH----------------QG-I----RYKCDLCS 619 (1042)
Q Consensus 575 ~~H~~~H~~~--------------~~~C~~C~k~F~~~~~L~~H~~~H----------------~~-~----~~~C~~C~ 619 (1042)
..|+++|.+. .=+|..|.+.|.....+..++--| ++ . +..+..|+
T Consensus 685 pQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~ 764 (958)
T KOG1074|consen 685 PQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCG 764 (958)
T ss_pred cceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccccccc
Confidence 9999999842 147999999998888888877655 22 1 46788999
Q ss_pred cccccCCCCcc--------------------------cccCCcCcc-cC--------Ccccc------------------
Q psy3685 620 KEFSRKDIPYE--------------------------VTVKEESYE-GD--------SINVE------------------ 646 (1042)
Q Consensus 620 k~F~~~~~l~~--------------------------~~~~~~~~~-c~--------~~~~~------------------ 646 (1042)
..+.....+.. .++.+++.. ++ .+++.
T Consensus 765 ~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~ 844 (958)
T KOG1074|consen 765 RELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATK 844 (958)
T ss_pred cccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccccccc
Confidence 98876543322 223333331 11 00000
Q ss_pred -----------------------------ccCCCcccccccccccccccccccccccccccCCccccCCCCCCccCCcch
Q psy3685 647 -----------------------------ENNGNDAVTRDCLEFFKTEQNTTLGKLNKILHQDIRYKCDLCSKDFSRKDN 697 (1042)
Q Consensus 647 -----------------------------~~~~~~~~~~~C~~~f~~~~~~~l~~h~~~h~~~~~~~C~~C~k~F~~~~~ 697 (1042)
.+..+..+|.+|++.|.. .++|..|+|+|+++|||.|.+|+++|+.+++
T Consensus 845 ~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsS--SsALqiH~rTHtg~KPF~C~fC~~aFttrgn 922 (958)
T KOG1074|consen 845 TNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSS--SAALEIHMRTHTGPKPFFCHFCEEAFTTRGN 922 (958)
T ss_pred cccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccc--hHHHHHhhhcCCCCCCccchhhhhhhhhhhh
Confidence 001344678999999999 9999999999999999999999999999999
Q ss_pred HHHHHhhccCCCCc
Q psy3685 698 LTEHKEIHQGIRVS 711 (1042)
Q Consensus 698 L~~H~~~H~g~~~~ 711 (1042)
|+.||.+|....|+
T Consensus 923 LKvHMgtH~w~q~~ 936 (958)
T KOG1074|consen 923 LKVHMGTHMWVQPP 936 (958)
T ss_pred hhhhhccccccCCC
Confidence 99999999887663
No 6
>KOG2462|consensus
Probab=99.91 E-value=2.4e-25 Score=221.27 Aligned_cols=134 Identities=33% Similarity=0.702 Sum_probs=118.9
Q ss_pred cccCCCCCcccCCchhHHHHHHhhcCCcccCCcCCccccCcccccccccccCCccccCCCCcccCCchhHHHHHHHhcCC
Q psy3685 451 RYKCDLCSKEFSRKGNLTKHMKLHQGIRYKCDLCLKEFSRKDNLTDRKIHQGIRYKCDLCSKEFSSKGNLTEHMKQHQGI 530 (1042)
Q Consensus 451 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~l~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~ 530 (1042)
+|+|+.|++.+.+.++|.+|.++|-... ..+.+.|++|||.|.+-..|+.|+|+|+ .
T Consensus 130 r~~c~eCgk~ysT~snLsrHkQ~H~~~~----------------------s~ka~~C~~C~K~YvSmpALkMHirTH~-l 186 (279)
T KOG2462|consen 130 RYKCPECGKSYSTSSNLSRHKQTHRSLD----------------------SKKAFSCKYCGKVYVSMPALKMHIRTHT-L 186 (279)
T ss_pred ceeccccccccccccccchhhccccccc----------------------ccccccCCCCCceeeehHHHhhHhhccC-C
Confidence 6889999999999999999998885321 1246889999999999999999999997 6
Q ss_pred ccccCccCcccCCchhHHHHHHHhcCc-ccccCcCCcccCCchhhhhHhhhhcCC-cccCCccccccCCchhHHHHhhh
Q psy3685 531 RYKCDLCSKEFSRKDNLTEHKEIHQGI-RYKCDLCSKEFSRKDNLTEHKEIHQGI-RYKCDLCSKKFSRKLNLTKHMKI 607 (1042)
Q Consensus 531 ~~~C~~C~k~f~~~~~L~~H~~~H~~~-~~~C~~C~k~F~~~~~L~~H~~~H~~~-~~~C~~C~k~F~~~~~L~~H~~~ 607 (1042)
+++|.+|||.|...+-|+-|+|+|+|+ ||.|+.|+|+|..+++|+.||++|.+. +|+|..|+|+|..++.|.+|...
T Consensus 187 ~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 187 PCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred CcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 899999999999999999999999997 899999999999999999999999986 89999999999999999999753
No 7
>KOG1074|consensus
Probab=99.91 E-value=6.4e-25 Score=245.74 Aligned_cols=215 Identities=24% Similarity=0.527 Sum_probs=133.9
Q ss_pred ccccCCCCCccCCchhHHHHHHhhcCC-ccccCccCccccCcchHHhHhhhhcCc-----eeccC---ccccccCCchhH
Q psy3685 370 RYKCDLCSKEFSRKGNLTKHMKLHQGI-RYKCDLCLKEFSRKDNLTEHKEIHQGI-----RYKCD---LCSKKFSRKLNL 440 (1042)
Q Consensus 370 ~~~C~~C~k~f~~~~~L~~H~~~h~~~-~~~C~~C~k~f~~~~~L~~H~~~h~~~-----~~~C~---~C~~~f~~~~~L 440 (1042)
|.+|-+|-+..+-++.|+.|.|+|+|+ ||+|.+||+.|.++.+|+.|+-+|..+ ++.|+ +|-+.|.....|
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccc
Confidence 456777777777777777777777665 677777777777777777777666543 36677 677777777777
Q ss_pred HHHHhhcCCC--------------cccCCCCCcccCCchhHHHHHHhhcC-----------------C----cccCCcCC
Q psy3685 441 TKHMKIHQGI--------------RYKCDLCSKEFSRKGNLTKHMKLHQG-----------------I----RYKCDLCL 485 (1042)
Q Consensus 441 ~~H~~~H~~~--------------~~~C~~C~k~f~~~~~L~~H~~~H~~-----------------~----~~~C~~C~ 485 (1042)
..|+++|.+. .-+|..|.+.|.....+..++-.|.+ . +..+..|+
T Consensus 685 pQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~ 764 (958)
T KOG1074|consen 685 PQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCG 764 (958)
T ss_pred cceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccccccc
Confidence 7777776521 24566777777666666666655411 0 23445555
Q ss_pred ccccCcccccc-------------------------cccccCCcc-ccCCCCcccCCchhHH----HH---H--------
Q psy3685 486 KEFSRKDNLTD-------------------------RKIHQGIRY-KCDLCSKEFSSKGNLT----EH---M-------- 524 (1042)
Q Consensus 486 ~~f~~~~~l~~-------------------------~~~~~~~~~-~C~~C~~~f~~~~~L~----~H---~-------- 524 (1042)
..+.....+.. .....+++. .+.+++..-...-... .- +
T Consensus 765 ~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~ 844 (958)
T KOG1074|consen 765 RELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATK 844 (958)
T ss_pred cccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccccccc
Confidence 54433322211 000111333 4444443322211100 00 0
Q ss_pred ---HHhc-------------------C------CccccCccCcccCCchhHHHHHHHhcCc-ccccCcCCcccCCchhhh
Q psy3685 525 ---KQHQ-------------------G------IRYKCDLCSKEFSRKDNLTEHKEIHQGI-RYKCDLCSKEFSRKDNLT 575 (1042)
Q Consensus 525 ---~~H~-------------------~------~~~~C~~C~k~f~~~~~L~~H~~~H~~~-~~~C~~C~k~F~~~~~L~ 575 (1042)
.++. + ....|.+|++.|.+.++|+.|+|+|+++ ||.|.+|++.|..+.+|+
T Consensus 845 ~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLK 924 (958)
T KOG1074|consen 845 TNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLK 924 (958)
T ss_pred cccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhh
Confidence 0000 0 1267999999999999999999999986 899999999999999999
Q ss_pred hHhhhhcCC
Q psy3685 576 EHKEIHQGI 584 (1042)
Q Consensus 576 ~H~~~H~~~ 584 (1042)
.||.+|...
T Consensus 925 vHMgtH~w~ 933 (958)
T KOG1074|consen 925 VHMGTHMWV 933 (958)
T ss_pred hhhcccccc
Confidence 999999764
No 8
>KOG2462|consensus
Probab=99.90 E-value=3.5e-25 Score=220.12 Aligned_cols=127 Identities=32% Similarity=0.698 Sum_probs=120.9
Q ss_pred CccccCCCCcccCCchhHHHHHHHhcCC----ccccCccCcccCCchhHHHHHHHhcCcccccCcCCcccCCchhhhhHh
Q psy3685 503 IRYKCDLCSKEFSSKGNLTEHMKQHQGI----RYKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKEFSRKDNLTEHK 578 (1042)
Q Consensus 503 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~----~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H~ 578 (1042)
..|+|+.||+.+.+.++|.+|...|-.. .+.|++|+|.|.+..+|+.|+|+|+ .+++|.+|||.|.+..-|+-|+
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~-l~c~C~iCGKaFSRPWLLQGHi 207 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT-LPCECGICGKAFSRPWLLQGHI 207 (279)
T ss_pred CceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC-CCcccccccccccchHHhhccc
Confidence 4699999999999999999999999653 6999999999999999999999997 6899999999999999999999
Q ss_pred hhhcCC-cccCCccccccCCchhHHHHhhhcCCC-cccCCCCCcccccCCCCcc
Q psy3685 579 EIHQGI-RYKCDLCSKKFSRKLNLTKHMKIHQGI-RYKCDLCSKEFSRKDIPYE 630 (1042)
Q Consensus 579 ~~H~~~-~~~C~~C~k~F~~~~~L~~H~~~H~~~-~~~C~~C~k~F~~~~~l~~ 630 (1042)
|+|||+ ||.|+.|+|+|..+++|+.||++|.+. +|+|..|+|+|++++.|..
T Consensus 208 RTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnK 261 (279)
T KOG2462|consen 208 RTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNK 261 (279)
T ss_pred ccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHH
Confidence 999998 999999999999999999999999997 9999999999999988875
No 9
>KOG3608|consensus
Probab=99.90 E-value=1.6e-24 Score=218.84 Aligned_cols=213 Identities=27% Similarity=0.578 Sum_probs=186.8
Q ss_pred ccc--CCCCCccCCchhHHHHHHhhcCC--------------cccc--CccCccccCcchHHhHhhhhcCc-eeccCccc
Q psy3685 371 YKC--DLCSKEFSRKGNLTKHMKLHQGI--------------RYKC--DLCLKEFSRKDNLTEHKEIHQGI-RYKCDLCS 431 (1042)
Q Consensus 371 ~~C--~~C~k~f~~~~~L~~H~~~h~~~--------------~~~C--~~C~k~f~~~~~L~~H~~~h~~~-~~~C~~C~ 431 (1042)
|.| +.|+..|.+...+..|+..|..- .+.| ..|.+.|.++..|++|.+.|+++ ...|+.||
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg 214 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCG 214 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHH
Confidence 556 46999999999999998877421 2445 56999999999999999999986 68899999
Q ss_pred cccCCchhHHHHHhhcCCC---cccCCCCCcccCCchhHHHHHHhhcCCcccCCcCCccccCcccccccccccCCccccC
Q psy3685 432 KKFSRKLNLTKHMKIHQGI---RYKCDLCSKEFSRKGNLTKHMKLHQGIRYKCDLCLKEFSRKDNLTDRKIHQGIRYKCD 508 (1042)
Q Consensus 432 ~~f~~~~~L~~H~~~H~~~---~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~l~~~~~~~~~~~~C~ 508 (1042)
.-|.++..|-.|.+..+.. +|.|..|.|.|.+...|..|+..|-. -|+|+
T Consensus 215 ~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn---------------------------~ykCp 267 (467)
T KOG3608|consen 215 ELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN---------------------------CYKCP 267 (467)
T ss_pred HHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh---------------------------ccccc
Confidence 9999999999998876643 89999999999999999999998853 58999
Q ss_pred CCCcccCCchhHHHHHHHhcC--CccccCccCcccCCchhHHHHHHHhcCcccccCc--CCcccCCchhhhhHhhhhc-C
Q psy3685 509 LCSKEFSSKGNLTEHMKQHQG--IRYKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDL--CSKEFSRKDNLTEHKEIHQ-G 583 (1042)
Q Consensus 509 ~C~~~f~~~~~L~~H~~~H~~--~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~--C~k~F~~~~~L~~H~~~H~-~ 583 (1042)
.|+.+....++|.+||+.-+. +||+|+.|++.|.+.+.|.+|..+|....|+|+. |..+|++..+|++|++.++ |
T Consensus 268 lCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg 347 (467)
T KOG3608|consen 268 LCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHSKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEG 347 (467)
T ss_pred ccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhccccceecCCCCCcHHHHHHHHHHHHHHHhccC
Confidence 999999999999999986544 3999999999999999999999999977899988 9999999999999998776 4
Q ss_pred C---cccCCccccccCCchhHHHHhhhcCC
Q psy3685 584 I---RYKCDLCSKKFSRKLNLTKHMKIHQG 610 (1042)
Q Consensus 584 ~---~~~C~~C~k~F~~~~~L~~H~~~H~~ 610 (1042)
. +|.|-.|++.|++-.+|..|++.-|+
T Consensus 348 ~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~ 377 (467)
T KOG3608|consen 348 NNPILYACHCCDRFFTSGKSLSAHLMKKHG 377 (467)
T ss_pred CCCCceeeecchhhhccchhHHHHHHHhhc
Confidence 3 79999999999999999999887666
No 10
>KOG3608|consensus
Probab=99.88 E-value=2.6e-23 Score=210.20 Aligned_cols=213 Identities=25% Similarity=0.570 Sum_probs=182.7
Q ss_pred ccc--CCCccccCCchHHHHHHHhhcC--------------CcccCC--CCCccccccccccccccccccC-cccCCccc
Q psy3685 263 YKC--DLCSKKFSRKLNLTKHMKIHQG--------------IRYKCD--LCSKEFSRKGNLTKHMKLHQGI-RYKCDLCL 323 (1042)
Q Consensus 263 ~~C--~~C~~~f~~~~~L~~H~~~h~~--------------~~~~C~--~C~~~f~~~~~L~~H~~~h~~~-~~~C~~C~ 323 (1042)
|.| ..|+..|.+...+..|+-.|.. ..+.|. .|-+.+.++..|++|++.|+++ ...|+.||
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg 214 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCG 214 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHH
Confidence 445 4677888887777777766632 124554 5888899999999999999887 57899999
Q ss_pred ccccCccccccccccccc---ccccCCcccccccCchHHHHHHhhccccccccCCCCCccCCchhHHHHHHhhcC--Ccc
Q psy3685 324 KEFSRKDNLTEHKEIHQG---IRYKCDLCSKKFSRKLNLTKHMKIHQGIRYKCDLCSKEFSRKGNLTKHMKLHQG--IRY 398 (1042)
Q Consensus 324 ~~f~~~~~l~~H~~~h~~---~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~C~~C~k~f~~~~~L~~H~~~h~~--~~~ 398 (1042)
.-|+++..|-.|.+..+. .+|+|..|.|.|.+...|..|+..|-. -|+|+.|+.+....++|..|++.-+. +||
T Consensus 215 ~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn-~ykCplCdmtc~~~ssL~~H~r~rHs~dkpf 293 (467)
T KOG3608|consen 215 ELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN-CYKCPLCDMTCSSASSLTTHIRYRHSKDKPF 293 (467)
T ss_pred HHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh-cccccccccCCCChHHHHHHHHhhhccCCCc
Confidence 999999999999876654 379999999999999999999998865 69999999999999999999987654 599
Q ss_pred ccCccCccccCcchHHhHhhhhcCceeccCc--cccccCCchhHHHHHhhcCC----CcccCCCCCcccCCchhHHHHHH
Q psy3685 399 KCDLCLKEFSRKDNLTEHKEIHQGIRYKCDL--CSKKFSRKLNLTKHMKIHQG----IRYKCDLCSKEFSRKGNLTKHMK 472 (1042)
Q Consensus 399 ~C~~C~k~f~~~~~L~~H~~~h~~~~~~C~~--C~~~f~~~~~L~~H~~~H~~----~~~~C~~C~k~f~~~~~L~~H~~ 472 (1042)
+|+.|++.|.+.+.|..|..+|+...|+|+. |..+|.+...|++|++.+++ .+|.|..|++.|++-.+|..|+.
T Consensus 294 KCd~Cd~~c~~esdL~kH~~~HS~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~ 373 (467)
T KOG3608|consen 294 KCDECDTRCVRESDLAKHVQVHSKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLM 373 (467)
T ss_pred cccchhhhhccHHHHHHHHHhccccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHH
Confidence 9999999999999999999999988899998 99999999999999998763 27999999999999999999987
Q ss_pred hhcC
Q psy3685 473 LHQG 476 (1042)
Q Consensus 473 ~H~~ 476 (1042)
.-++
T Consensus 374 kkH~ 377 (467)
T KOG3608|consen 374 KKHG 377 (467)
T ss_pred Hhhc
Confidence 6555
No 11
>KOG3623|consensus
Probab=99.79 E-value=5.1e-20 Score=202.45 Aligned_cols=102 Identities=30% Similarity=0.712 Sum_probs=92.5
Q ss_pred ccCCcccccccCcccccccccccccc---cccCCcccccccCchHHHHHHhhccc--------------cccccCCCCCc
Q psy3685 317 YKCDLCLKEFSRKDNLTEHKEIHQGI---RYKCDLCSKKFSRKLNLTKHMKIHQG--------------IRYKCDLCSKE 379 (1042)
Q Consensus 317 ~~C~~C~~~f~~~~~l~~H~~~h~~~---~~~C~~C~k~f~~~~~L~~H~~~h~~--------------~~~~C~~C~k~ 379 (1042)
..|++|.+.+.....|++|++..+.+ .|.|..|..+|..+..|.+||.+|.. +.|+|..|||.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 68999999999999999998765443 69999999999999999999998853 26999999999
Q ss_pred cCCchhHHHHHHhhcCC-ccccCccCccccCcchHHhHhh
Q psy3685 380 FSRKGNLTKHMKLHQGI-RYKCDLCLKEFSRKDNLTEHKE 418 (1042)
Q Consensus 380 f~~~~~L~~H~~~h~~~-~~~C~~C~k~f~~~~~L~~H~~ 418 (1042)
|..+-.|+.|+|+|.|+ ||.|+.|+|.|+....+..|+.
T Consensus 291 FKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 291 FKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 99999999999999986 9999999999999999988863
No 12
>KOG3623|consensus
Probab=99.73 E-value=1e-18 Score=192.36 Aligned_cols=102 Identities=32% Similarity=0.688 Sum_probs=91.8
Q ss_pred cccCccCccccCchhhhhHhhhccC---CcccCCCCCccccCcchhhhhhhhccC--------------CccccCccCcc
Q psy3685 63 YKCNLCSKEFSRNCYLTEHMEQHQG---IRYKCDLCSKDFSYKSSLIEHKKLHQG--------------IRYKCNLCSKE 125 (1042)
Q Consensus 63 ~~C~~C~~~f~~~~~L~~H~~~H~~---~~~~C~~C~~~f~~~~~L~~H~~~h~~--------------~~~~C~~C~~~ 125 (1042)
..|+.|++.+.....|+.|++--+. ..|.|..|..+|.++..|.+|+.+|.. ..|+|..|+|.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 5799999999999999999875433 468999999999999999999999853 15999999999
Q ss_pred ccCchhHHHHHhHhcCC-ccccCCCCCCCCChhHHHHHHH
Q psy3685 126 FSRNCYLTEHMEQHQGI-RYKCDLCSKDFSYKSSLIEHKK 164 (1042)
Q Consensus 126 F~~~~~L~~H~~~h~~~-~~~C~~C~~~F~~~~~L~~H~~ 164 (1042)
|+.+..|+.|+|+|.|+ ||.|+-|+|+|+...++..||-
T Consensus 291 FKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 291 FKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 99999999999999986 9999999999999999998874
No 13
>KOG3576|consensus
Probab=99.49 E-value=5.9e-15 Score=138.62 Aligned_cols=109 Identities=31% Similarity=0.682 Sum_probs=91.0
Q ss_pred CccccCCCCcccCCchhHHHHHHHhcCC-ccccCccCcccCCchhHHHHHHHhcCc-ccccCcCCcccCCchhhhhHhhh
Q psy3685 503 IRYKCDLCSKEFSSKGNLTEHMKQHQGI-RYKCDLCSKEFSRKDNLTEHKEIHQGI-RYKCDLCSKEFSRKDNLTEHKEI 580 (1042)
Q Consensus 503 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~-~~~C~~C~k~f~~~~~L~~H~~~H~~~-~~~C~~C~k~F~~~~~L~~H~~~ 580 (1042)
..|.|.+|+|.|.-..-|.+||+-|... ++-|..|||.|...-.|++|+|+|+|. ||+|..|+|+|+++-+|..|++.
T Consensus 116 d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~k 195 (267)
T KOG3576|consen 116 DSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKK 195 (267)
T ss_pred CeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHH
Confidence 4688999999998888888898888876 788888999998888889999988886 88899999999888888888765
Q ss_pred hcCC------------cccCCccccccCCchhHHHHhhhcCCC
Q psy3685 581 HQGI------------RYKCDLCSKKFSRKLNLTKHMKIHQGI 611 (1042)
Q Consensus 581 H~~~------------~~~C~~C~k~F~~~~~L~~H~~~H~~~ 611 (1042)
-+|. -|.|+.||.+-.....+..|++.|+..
T Consensus 196 vhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 196 VHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 4442 278888888888888888888888753
No 14
>KOG3576|consensus
Probab=99.47 E-value=1.2e-14 Score=136.60 Aligned_cols=109 Identities=30% Similarity=0.697 Sum_probs=97.6
Q ss_pred cccCCCCCcccCCchhHHHHHHhhcCCcccCCcCCccccCcccccccccccCCccccCCCCcccCCchhHHHHHHHhcCC
Q psy3685 451 RYKCDLCSKEFSRKGNLTKHMKLHQGIRYKCDLCLKEFSRKDNLTDRKIHQGIRYKCDLCSKEFSSKGNLTEHMKQHQGI 530 (1042)
Q Consensus 451 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~C~~~f~~~~~l~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~ 530 (1042)
.|.|.+|+|.|..+..|.+|++-|... +++-|..||+.|...-+|++|+|+|+|.
T Consensus 117 ~ftCrvCgK~F~lQRmlnrh~kch~~v-------------------------kr~lct~cgkgfndtfdlkrh~rthtgv 171 (267)
T KOG3576|consen 117 SFTCRVCGKKFGLQRMLNRHLKCHSDV-------------------------KRHLCTFCGKGFNDTFDLKRHTRTHTGV 171 (267)
T ss_pred eeeeehhhhhhhHHHHHHHHhhhccHH-------------------------HHHHHhhccCcccchhhhhhhhccccCc
Confidence 688999999999999999999988653 3688999999999999999999999998
Q ss_pred -ccccCccCcccCCchhHHHHHHHhcCc------------ccccCcCCcccCCchhhhhHhhhhcCC
Q psy3685 531 -RYKCDLCSKEFSRKDNLTEHKEIHQGI------------RYKCDLCSKEFSRKDNLTEHKEIHQGI 584 (1042)
Q Consensus 531 -~~~C~~C~k~f~~~~~L~~H~~~H~~~------------~~~C~~C~k~F~~~~~L~~H~~~H~~~ 584 (1042)
||+|..|+|+|..+-+|..|++.-+|. .|.|+.||.+-.....+..|+..|+..
T Consensus 172 rpykc~~c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 172 RPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred cccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 999999999999999999998776653 389999999999999999999988764
No 15
>PLN02298 hydrolase, alpha/beta fold family protein
Probab=99.13 E-value=3.3e-10 Score=127.41 Aligned_cols=113 Identities=17% Similarity=0.233 Sum_probs=86.5
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
..|+||+++|.|... .|....+...|+.. |+.|+++|+||||.|.+... ...+.+..++|+..+++.++...
T Consensus 58 ~~~~VvllHG~~~~~-~~~~~~~~~~L~~~-Gy~V~~~D~rGhG~S~~~~~------~~~~~~~~~~D~~~~i~~l~~~~ 129 (330)
T PLN02298 58 PRALIFMVHGYGNDI-SWTFQSTAIFLAQM-GFACFALDLEGHGRSEGLRA------YVPNVDLVVEDCLSFFNSVKQRE 129 (330)
T ss_pred CceEEEEEcCCCCCc-ceehhHHHHHHHhC-CCEEEEecCCCCCCCCCccc------cCCCHHHHHHHHHHHHHHHHhcc
Confidence 356899999987543 23333344456664 89999999999999953211 12378889999999999987653
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
..+ +.|++++|+|+||++|.++..++|+.|.|+|+.++++.
T Consensus 130 ~~~-~~~i~l~GhSmGG~ia~~~a~~~p~~v~~lvl~~~~~~ 170 (330)
T PLN02298 130 EFQ-GLPRFLYGESMGGAICLLIHLANPEGFDGAVLVAPMCK 170 (330)
T ss_pred cCC-CCCEEEEEecchhHHHHHHHhcCcccceeEEEeccccc
Confidence 222 24899999999999999999999999999999987664
No 16
>COG2267 PldB Lysophospholipase [Lipid metabolism]
Probab=99.11 E-value=2.2e-10 Score=124.93 Aligned_cols=111 Identities=20% Similarity=0.197 Sum_probs=90.5
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCC-CHHHHHhHHHHHHHHHhhhcC
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYL-SAEQALADLAYFITSMNTLYS 844 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~l-t~~q~l~D~~~~~~~~~~~~~ 844 (1042)
-+||+++|.++....+. .++..|+.+ |..|+++||||+|.|.. ...... +.++.++|+..|++.+....
T Consensus 35 g~Vvl~HG~~Eh~~ry~--~la~~l~~~-G~~V~~~D~RGhG~S~r------~~rg~~~~f~~~~~dl~~~~~~~~~~~- 104 (298)
T COG2267 35 GVVVLVHGLGEHSGRYE--ELADDLAAR-GFDVYALDLRGHGRSPR------GQRGHVDSFADYVDDLDAFVETIAEPD- 104 (298)
T ss_pred cEEEEecCchHHHHHHH--HHHHHHHhC-CCEEEEecCCCCCCCCC------CCcCCchhHHHHHHHHHHHHHHHhccC-
Confidence 48999999988766433 356667776 99999999999999963 122223 47889999999999988753
Q ss_pred CCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccccc
Q psy3685 845 LPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVV 888 (1042)
Q Consensus 845 ~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~ 888 (1042)
...|++++|||+||.||+.+...+|..|+|+|+||+.+....
T Consensus 105 --~~~p~~l~gHSmGg~Ia~~~~~~~~~~i~~~vLssP~~~l~~ 146 (298)
T COG2267 105 --PGLPVFLLGHSMGGLIALLYLARYPPRIDGLVLSSPALGLGG 146 (298)
T ss_pred --CCCCeEEEEeCcHHHHHHHHHHhCCccccEEEEECccccCCh
Confidence 346999999999999999999999999999999998887663
No 17
>KOG1455|consensus
Probab=99.10 E-value=6.2e-10 Score=114.78 Aligned_cols=113 Identities=23% Similarity=0.299 Sum_probs=89.7
Q ss_pred EEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCC-CHHHHHhHHHHHHHHHhhhcCC
Q psy3685 767 IFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYL-SAEQALADLAYFITSMNTLYSL 845 (1042)
Q Consensus 767 i~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~l-t~~q~l~D~~~~~~~~~~~~~~ 845 (1042)
+|++++|.|+.+. |.+..+...||.. |..|+++||+|+|.|.-. -.|. +.+-+++|+..|...++..-..
T Consensus 56 lv~~~HG~g~~~s-~~~~~~a~~l~~~-g~~v~a~D~~GhG~SdGl-------~~yi~~~d~~v~D~~~~~~~i~~~~e~ 126 (313)
T KOG1455|consen 56 LVFLCHGYGEHSS-WRYQSTAKRLAKS-GFAVYAIDYEGHGRSDGL-------HAYVPSFDLVVDDVISFFDSIKEREEN 126 (313)
T ss_pred EEEEEcCCcccch-hhHHHHHHHHHhC-CCeEEEeeccCCCcCCCC-------cccCCcHHHHHHHHHHHHHHHhhcccc
Confidence 7888899887643 4555567777776 999999999999999421 2455 7888999999999976554322
Q ss_pred CCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccccC
Q psy3685 846 PAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVVD 889 (1042)
Q Consensus 846 ~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~~ 889 (1042)
.+-|.+++|+||||++|..+..+.|+..+|+|++++...+..+
T Consensus 127 -~~lp~FL~GeSMGGAV~Ll~~~k~p~~w~G~ilvaPmc~i~~~ 169 (313)
T KOG1455|consen 127 -KGLPRFLFGESMGGAVALLIALKDPNFWDGAILVAPMCKISED 169 (313)
T ss_pred -CCCCeeeeecCcchHHHHHHHhhCCcccccceeeecccccCCc
Confidence 3468999999999999999999999999999999876655443
No 18
>PLN02385 hydrolase; alpha/beta fold family protein
Probab=99.08 E-value=5e-10 Score=126.78 Aligned_cols=111 Identities=19% Similarity=0.197 Sum_probs=84.3
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccC-CCHHHHHhHHHHHHHHHhhhc
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQY-LSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~-lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
.|+||+++|.|.....|. ..+...|++. |..||++|+||||.|.... .+ .+.+..++|+..++..+....
T Consensus 87 ~~~iv~lHG~~~~~~~~~-~~~~~~l~~~-g~~v~~~D~~G~G~S~~~~-------~~~~~~~~~~~dv~~~l~~l~~~~ 157 (349)
T PLN02385 87 KAAVCFCHGYGDTCTFFF-EGIARKIASS-GYGVFAMDYPGFGLSEGLH-------GYIPSFDDLVDDVIEHYSKIKGNP 157 (349)
T ss_pred CeEEEEECCCCCccchHH-HHHHHHHHhC-CCEEEEecCCCCCCCCCCC-------CCcCCHHHHHHHHHHHHHHHHhcc
Confidence 468999999876543221 2344566664 8999999999999995321 23 377888999999998876542
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.. +..|++++|+||||++|+.+..++|+.|.|+|..++...
T Consensus 158 ~~-~~~~~~LvGhSmGG~val~~a~~~p~~v~glVLi~p~~~ 198 (349)
T PLN02385 158 EF-RGLPSFLFGQSMGGAVALKVHLKQPNAWDGAILVAPMCK 198 (349)
T ss_pred cc-CCCCEEEEEeccchHHHHHHHHhCcchhhheeEeccccc
Confidence 22 234899999999999999999999999999998886543
No 19
>PRK00870 haloalkane dehalogenase; Provisional
Probab=99.04 E-value=1.1e-09 Score=121.41 Aligned_cols=107 Identities=14% Similarity=0.122 Sum_probs=81.2
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+++|.||+++|.+.....|. .++..|++. +..||++|+||||.|.+.+.. ...+.+..++|++.+++.+..
T Consensus 44 ~~~~~lvliHG~~~~~~~w~--~~~~~L~~~-gy~vi~~Dl~G~G~S~~~~~~-----~~~~~~~~a~~l~~~l~~l~~- 114 (302)
T PRK00870 44 ADGPPVLLLHGEPSWSYLYR--KMIPILAAA-GHRVIAPDLIGFGRSDKPTRR-----EDYTYARHVEWMRSWFEQLDL- 114 (302)
T ss_pred CCCCEEEEECCCCCchhhHH--HHHHHHHhC-CCEEEEECCCCCCCCCCCCCc-----ccCCHHHHHHHHHHHHHHcCC-
Confidence 35788999999765555443 234456544 789999999999999653211 234677888888888876531
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.+++++|+||||++|+.+..+||+.|.++|..++.+
T Consensus 115 ------~~v~lvGhS~Gg~ia~~~a~~~p~~v~~lvl~~~~~ 150 (302)
T PRK00870 115 ------TDVTLVCQDWGGLIGLRLAAEHPDRFARLVVANTGL 150 (302)
T ss_pred ------CCEEEEEEChHHHHHHHHHHhChhheeEEEEeCCCC
Confidence 379999999999999999999999999999877543
No 20
>PLN02824 hydrolase, alpha/beta fold family protein
Probab=99.02 E-value=1.1e-09 Score=121.19 Aligned_cols=110 Identities=19% Similarity=0.211 Sum_probs=85.9
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
++|.||+++|-+.....|.. ++..|++. +.||++|.||||.|.+.+......-...++++..+|++.|++.+..
T Consensus 28 ~~~~vlllHG~~~~~~~w~~--~~~~L~~~--~~vi~~DlpG~G~S~~~~~~~~~~~~~~~~~~~a~~l~~~l~~l~~-- 101 (294)
T PLN02824 28 SGPALVLVHGFGGNADHWRK--NTPVLAKS--HRVYAIDLLGYGYSDKPNPRSAPPNSFYTFETWGEQLNDFCSDVVG-- 101 (294)
T ss_pred CCCeEEEECCCCCChhHHHH--HHHHHHhC--CeEEEEcCCCCCCCCCCccccccccccCCHHHHHHHHHHHHHHhcC--
Confidence 46889999998776665653 45667765 5999999999999965321111111356889999999999987642
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.+++++|+|+||++|+.+..++|+.|.+.|..+++.
T Consensus 102 -----~~~~lvGhS~Gg~va~~~a~~~p~~v~~lili~~~~ 137 (294)
T PLN02824 102 -----DPAFVICNSVGGVVGLQAAVDAPELVRGVMLINISL 137 (294)
T ss_pred -----CCeEEEEeCHHHHHHHHHHHhChhheeEEEEECCCc
Confidence 379999999999999999999999999999887655
No 21
>KOG4178|consensus
Probab=99.02 E-value=9.2e-10 Score=115.60 Aligned_cols=118 Identities=19% Similarity=0.272 Sum_probs=96.9
Q ss_pred EEEEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhH
Q psy3685 752 QRYFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALAD 831 (1042)
Q Consensus 752 qry~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D 831 (1042)
-||++-.. +..+||++|+++|.-+.+..|... +..||.. |..|||+|.||||.|.+-. .....|++..++|
T Consensus 32 I~~h~~e~-g~~~gP~illlHGfPe~wyswr~q--~~~la~~-~~rviA~DlrGyG~Sd~P~-----~~~~Yt~~~l~~d 102 (322)
T KOG4178|consen 32 IRLHYVEG-GPGDGPIVLLLHGFPESWYSWRHQ--IPGLASR-GYRVIAPDLRGYGFSDAPP-----HISEYTIDELVGD 102 (322)
T ss_pred EEEEEEee-cCCCCCEEEEEccCCccchhhhhh--hhhhhhc-ceEEEecCCCCCCCCCCCC-----CcceeeHHHHHHH
Confidence 46666555 567899999999998888877763 5567776 6899999999999996533 2334588888999
Q ss_pred HHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 832 LAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 832 ~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
+..|+..+.. .+++++||+||+++|-++++.||+.|.|.|..+.|-.
T Consensus 103 i~~lld~Lg~-------~k~~lvgHDwGaivaw~la~~~Perv~~lv~~nv~~~ 149 (322)
T KOG4178|consen 103 IVALLDHLGL-------KKAFLVGHDWGAIVAWRLALFYPERVDGLVTLNVPFP 149 (322)
T ss_pred HHHHHHHhcc-------ceeEEEeccchhHHHHHHHHhChhhcceEEEecCCCC
Confidence 9999998862 4899999999999999999999999999998876655
No 22
>PHA02857 monoglyceride lipase; Provisional
Probab=99.02 E-value=1.9e-09 Score=118.06 Aligned_cols=109 Identities=16% Similarity=0.202 Sum_probs=84.0
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcC
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYS 844 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~ 844 (1042)
.||||+++|.+.....|.. +...|++. |..|+++|+||||.|.+. ..+ .-+....+.|+..++..++..+
T Consensus 25 ~~~v~llHG~~~~~~~~~~--~~~~l~~~-g~~via~D~~G~G~S~~~-~~~-----~~~~~~~~~d~~~~l~~~~~~~- 94 (276)
T PHA02857 25 KALVFISHGAGEHSGRYEE--LAENISSL-GILVFSHDHIGHGRSNGE-KMM-----IDDFGVYVRDVVQHVVTIKSTY- 94 (276)
T ss_pred CEEEEEeCCCccccchHHH--HHHHHHhC-CCEEEEccCCCCCCCCCc-cCC-----cCCHHHHHHHHHHHHHHHHhhC-
Confidence 4689999998766554443 45556664 899999999999999652 111 2255666788888887776554
Q ss_pred CCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 845 LPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 845 ~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
+..|++++|+|+||++|..+..++|+.+.|+|+.++++.
T Consensus 95 --~~~~~~lvG~S~GG~ia~~~a~~~p~~i~~lil~~p~~~ 133 (276)
T PHA02857 95 --PGVPVFLLGHSMGATISILAAYKNPNLFTAMILMSPLVN 133 (276)
T ss_pred --CCCCEEEEEcCchHHHHHHHHHhCccccceEEEeccccc
Confidence 235899999999999999999999999999999987664
No 23
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.02 E-value=5.2e-10 Score=127.21 Aligned_cols=87 Identities=16% Similarity=0.466 Sum_probs=40.7
Q ss_pred cCCCCCccccCcchhhhhhhhccCCccccCccCccccCchhHHHHHhHhcCC-ccccCCCCCCCC----------ChhHH
Q psy3685 91 KCDLCSKDFSYKSSLIEHKKLHQGIRYKCNLCSKEFSRNCYLTEHMEQHQGI-RYKCDLCSKDFS----------YKSSL 159 (1042)
Q Consensus 91 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~F~~~~~L~~H~~~h~~~-~~~C~~C~~~F~----------~~~~L 159 (1042)
.|+.|++.|. ...|..|+++|+ .++.|+ |++.+ .+..|..|+..|... ++.|+.|++.|. ..+.|
T Consensus 455 ~C~~Cgk~f~-~s~LekH~~~~H-kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~L 530 (567)
T PLN03086 455 HCEKCGQAFQ-QGEMEKHMKVFH-EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGM 530 (567)
T ss_pred cCCCCCCccc-hHHHHHHHHhcC-CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhH
Confidence 4555555543 344455555443 345555 55433 334555555444332 455555555542 12345
Q ss_pred HHHHHhcCCCccccCCCCcccc
Q psy3685 160 IEHKKLHQGIRYKCNLCSKEYF 181 (1042)
Q Consensus 160 ~~H~~~h~~~~~~C~~C~~~f~ 181 (1042)
..|.......++.|..|++.+.
T Consensus 531 t~HE~~CG~rt~~C~~Cgk~Vr 552 (567)
T PLN03086 531 SEHESICGSRTAPCDSCGRSVM 552 (567)
T ss_pred HHHHHhcCCcceEccccCCeee
Confidence 5555554444555555554443
No 24
>PRK10749 lysophospholipase L2; Provisional
Probab=99.01 E-value=1.4e-09 Score=121.82 Aligned_cols=114 Identities=18% Similarity=0.125 Sum_probs=85.0
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCccccc-CCCHHHHHhHHHHHHHHHhhh
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQ-YLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~-~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
++|+||+++|.+.....|. .++..+++ .|..|+++|+||+|.|.+..+ ..... ..+.+..++|+..+++.+...
T Consensus 53 ~~~~vll~HG~~~~~~~y~--~~~~~l~~-~g~~v~~~D~~G~G~S~~~~~--~~~~~~~~~~~~~~~d~~~~~~~~~~~ 127 (330)
T PRK10749 53 HDRVVVICPGRIESYVKYA--ELAYDLFH-LGYDVLIIDHRGQGRSGRLLD--DPHRGHVERFNDYVDDLAAFWQQEIQP 127 (330)
T ss_pred CCcEEEEECCccchHHHHH--HHHHHHHH-CCCeEEEEcCCCCCCCCCCCC--CCCcCccccHHHHHHHHHHHHHHHHhc
Confidence 4578999999875543222 23445655 489999999999999964211 11111 238899999999999887554
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
+ +..|++++|+||||++|+.+..++|+.|.|+|.++++..
T Consensus 128 ~---~~~~~~l~GhSmGG~ia~~~a~~~p~~v~~lvl~~p~~~ 167 (330)
T PRK10749 128 G---PYRKRYALAHSMGGAILTLFLQRHPGVFDAIALCAPMFG 167 (330)
T ss_pred C---CCCCeEEEEEcHHHHHHHHHHHhCCCCcceEEEECchhc
Confidence 3 235899999999999999999999999999998876543
No 25
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.97 E-value=1.4e-09 Score=123.83 Aligned_cols=141 Identities=18% Similarity=0.436 Sum_probs=70.3
Q ss_pred ccCCCCCcccCCchhHHHHHHhhcCCcccCCc--CCccccCcccccccccccCCccccCCCCcccCCchhHHHHHHHhcC
Q psy3685 452 YKCDLCSKEFSRKGNLTKHMKLHQGIRYKCDL--CLKEFSRKDNLTDRKIHQGIRYKCDLCSKEFSSKGNLTEHMKQHQG 529 (1042)
Q Consensus 452 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~--C~~~f~~~~~l~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~ 529 (1042)
-.|+.|...... ..|..|.....-..-.|+. |+..|.+. .+. ..+.|+.|++.|. ...|..|++.|+
T Consensus 408 V~C~NC~~~i~l-~~l~lHe~~C~r~~V~Cp~~~Cg~v~~r~-el~-------~H~~C~~Cgk~f~-~s~LekH~~~~H- 476 (567)
T PLN03086 408 VECRNCKHYIPS-RSIALHEAYCSRHNVVCPHDGCGIVLRVE-EAK-------NHVHCEKCGQAFQ-QGEMEKHMKVFH- 476 (567)
T ss_pred EECCCCCCccch-hHHHHHHhhCCCcceeCCcccccceeecc-ccc-------cCccCCCCCCccc-hHHHHHHHHhcC-
Confidence 457777665543 3444555432222223442 44444211 111 1245666666663 455666666554
Q ss_pred CccccCccCcccCCchhHHHHHHHhcCc-ccccCcCCcccCC----------chhhhhHhhhhcCCcccCCccccccCCc
Q psy3685 530 IRYKCDLCSKEFSRKDNLTEHKEIHQGI-RYKCDLCSKEFSR----------KDNLTEHKEIHQGIRYKCDLCSKKFSRK 598 (1042)
Q Consensus 530 ~~~~C~~C~k~f~~~~~L~~H~~~H~~~-~~~C~~C~k~F~~----------~~~L~~H~~~H~~~~~~C~~C~k~F~~~ 598 (1042)
.++.|+ ||+.+ .+..|..|+++|.+. ++.|..|++.|.. .+.|..|+.+....++.|..||+.+..+
T Consensus 477 kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~CG~rt~~C~~Cgk~Vrlr 554 (567)
T PLN03086 477 EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESICGSRTAPCDSCGRSVMLK 554 (567)
T ss_pred CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhcCCcceEccccCCeeeeh
Confidence 456666 66433 445666666555543 5666666666531 2355556555533455666666555432
Q ss_pred hhHHHHhh
Q psy3685 599 LNLTKHMK 606 (1042)
Q Consensus 599 ~~L~~H~~ 606 (1042)
.|..|+.
T Consensus 555 -dm~~H~~ 561 (567)
T PLN03086 555 -EMDIHQI 561 (567)
T ss_pred -hHHHHHH
Confidence 3445544
No 26
>KOG2564|consensus
Probab=98.97 E-value=5e-09 Score=105.64 Aligned_cols=166 Identities=19% Similarity=0.214 Sum_probs=107.3
Q ss_pred cchhhhhcccCCCCCcccceEEEEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccC
Q psy3685 732 DQWFEQKLDHFNIEEPRTWKQRYFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHP 811 (1042)
Q Consensus 732 ~~~f~q~~dhf~~~~~~tf~qry~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p 811 (1042)
..||+-..|---.....||+--|-.+. .+.||||++++|-|.....|.. +..+|+......++++|.|++|+|+-
T Consensus 44 s~yFdekedv~i~~~~~t~n~Y~t~~~---~t~gpil~l~HG~G~S~LSfA~--~a~el~s~~~~r~~a~DlRgHGeTk~ 118 (343)
T KOG2564|consen 44 SDYFDEKEDVSIDGSDLTFNVYLTLPS---ATEGPILLLLHGGGSSALSFAI--FASELKSKIRCRCLALDLRGHGETKV 118 (343)
T ss_pred HHhhccccccccCCCcceEEEEEecCC---CCCccEEEEeecCcccchhHHH--HHHHHHhhcceeEEEeeccccCcccc
Confidence 344776666422222336665333333 5589999999998876665553 56788888889999999999999964
Q ss_pred CCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHH--HhCCCeEEEEEeecccccccc-
Q psy3685 812 LSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLR--YKYPHLVHGAMSASGPLRAVV- 888 (1042)
Q Consensus 812 ~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~--~~yP~~~~g~i~ssa~~~~~~- 888 (1042)
-+ =..|+.|-.+-|+..+|.++=.+ ...++|++|||+||+||+..+ ..-|. +.|.+- +..+.
T Consensus 119 ~~------e~dlS~eT~~KD~~~~i~~~fge----~~~~iilVGHSmGGaIav~~a~~k~lps-l~Gl~v----iDVVEg 183 (343)
T KOG2564|consen 119 EN------EDDLSLETMSKDFGAVIKELFGE----LPPQIILVGHSMGGAIAVHTAASKTLPS-LAGLVV----IDVVEG 183 (343)
T ss_pred CC------hhhcCHHHHHHHHHHHHHHHhcc----CCCceEEEeccccchhhhhhhhhhhchh-hhceEE----EEEech
Confidence 22 23477777889999888765432 235899999999999999876 35666 444441 12111
Q ss_pred CChhhHHHHHHHhhccChhhHHHHHHHHHH
Q psy3685 889 DFPEYFGVVADALATVSTECVKAVQTATHT 918 (1042)
Q Consensus 889 ~~~~~~~~v~~~~~~~~~~c~~~i~~~~~~ 918 (1042)
..-++...+...|+ ..|+-.+.|.+|+.+
T Consensus 184 tAmeAL~~m~~fL~-~rP~~F~Si~~Ai~W 212 (343)
T KOG2564|consen 184 TAMEALNSMQHFLR-NRPKSFKSIEDAIEW 212 (343)
T ss_pred HHHHHHHHHHHHHh-cCCccccchhhHHHH
Confidence 12234444444443 246666777776554
No 27
>TIGR01607 PST-A Plasmodium subtelomeric family (PST-A). These genes are preferentially located in the subtelomeric regions of the chromosomes of both P. falciparum and P. yoelii.
Probab=98.92 E-value=3.2e-09 Score=118.61 Aligned_cols=115 Identities=16% Similarity=0.167 Sum_probs=83.6
Q ss_pred EEEEECCCCCCcc-ccc----------------------c-cccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccC
Q psy3685 767 IFLMIGGEGEASA-KWM----------------------V-EGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQY 822 (1042)
Q Consensus 767 i~~~~gg~g~~~~-~~~----------------------~-~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~ 822 (1042)
+|++++|.|+... .+. + ..++..|+++ |..|+++||||+|.|.... ....+
T Consensus 23 ~v~i~HG~~eh~~~~~~~~~~~~~~~~~~~~~~~~ry~~y~~~~~~~l~~~-G~~V~~~D~rGHG~S~~~~----~~~g~ 97 (332)
T TIGR01607 23 IIVLIHGLKSHLRLQFLKINAKIVNNDRAVLIDTDNYYIYKDSWIENFNKN-GYSVYGLDLQGHGESDGLQ----NLRGH 97 (332)
T ss_pred EEEEECCCchhhhhhhhhcCcccCCCCeeEEEcCCcceEeeHHHHHHHHHC-CCcEEEecccccCCCcccc----ccccc
Confidence 9999999887764 111 1 1344556665 9999999999999985321 11223
Q ss_pred -CCHHHHHhHHHHHHHHHhhhc-----------------CCCCCCcEEEEccchhHHHHHHHHHhCCC--------eEEE
Q psy3685 823 -LSAEQALADLAYFITSMNTLY-----------------SLPAHTKWIAFGGSYPGALAAWLRYKYPH--------LVHG 876 (1042)
Q Consensus 823 -lt~~q~l~D~~~~~~~~~~~~-----------------~~~~~~~~i~~G~Sygg~la~~~~~~yP~--------~~~g 876 (1042)
-+.+..++|+..+++.++... ..+.+.|+|++||||||+|++.+..++|+ .+.|
T Consensus 98 ~~~~~~~v~Dl~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~GhSmGg~i~~~~~~~~~~~~~~~~~~~i~g 177 (332)
T TIGR01607 98 INCFDDLVYDVIQYMNRINDSIILENETKSDDESYDIVNTKENRLPMYIIGLSMGGNIALRLLELLGKSNENNDKLNIKG 177 (332)
T ss_pred hhhHHHHHHHHHHHHHHhhhhhccccccccccccccccccccCCCceeEeeccCccHHHHHHHHHhccccccccccccce
Confidence 378889999999999886521 01114699999999999999999887764 5889
Q ss_pred EEeecccccc
Q psy3685 877 AMSASGPLRA 886 (1042)
Q Consensus 877 ~i~ssa~~~~ 886 (1042)
+|++|+++..
T Consensus 178 ~i~~s~~~~i 187 (332)
T TIGR01607 178 CISLSGMISI 187 (332)
T ss_pred EEEeccceEE
Confidence 9988877643
No 28
>TIGR02240 PHA_depoly_arom poly(3-hydroxyalkanoate) depolymerase. This family consists of the polyhydroxyalkanoic acid (PHA) depolymerase of Pseudomonas oleovorans, Pseudomonas putida BM01, and related species. This enzyme is part of polyester storage and mobilization system as in many bacteria. However, species containing this enzyme are unusual in their capacity to produce aromatic polyesters when grown on carbon sources such as benzoic acid or phenylacetic acid.
Probab=98.89 E-value=3.9e-09 Score=115.46 Aligned_cols=115 Identities=17% Similarity=0.091 Sum_probs=84.7
Q ss_pred eEEEEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHh
Q psy3685 751 KQRYFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALA 830 (1042)
Q Consensus 751 ~qry~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~ 830 (1042)
+.+|++-. ..+++|.||+++|.|.....|. .++..|++ +..||++|+||||.|.... ...+.+...+
T Consensus 13 ~~~~~~~~--~~~~~~plvllHG~~~~~~~w~--~~~~~L~~--~~~vi~~Dl~G~G~S~~~~-------~~~~~~~~~~ 79 (276)
T TIGR02240 13 SIRTAVRP--GKEGLTPLLIFNGIGANLELVF--PFIEALDP--DLEVIAFDVPGVGGSSTPR-------HPYRFPGLAK 79 (276)
T ss_pred EEEEEEec--CCCCCCcEEEEeCCCcchHHHH--HHHHHhcc--CceEEEECCCCCCCCCCCC-------CcCcHHHHHH
Confidence 46666521 1234567788888766555443 33444554 5799999999999995321 1246777888
Q ss_pred HHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 831 DLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 831 D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
|+..||.++.. .+++++|+||||++|+.++.++|+.|.+.|..+++..
T Consensus 80 ~~~~~i~~l~~-------~~~~LvG~S~GG~va~~~a~~~p~~v~~lvl~~~~~~ 127 (276)
T TIGR02240 80 LAARMLDYLDY-------GQVNAIGVSWGGALAQQFAHDYPERCKKLILAATAAG 127 (276)
T ss_pred HHHHHHHHhCc-------CceEEEEECHHHHHHHHHHHHCHHHhhheEEeccCCc
Confidence 88888887632 3799999999999999999999999999998887764
No 29
>TIGR03101 hydr2_PEP hydrolase, ortholog 2, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily.
Probab=98.89 E-value=1.1e-08 Score=108.88 Aligned_cols=111 Identities=11% Similarity=-0.112 Sum_probs=82.3
Q ss_pred CcEEEEECCCCCCcccccc--cccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 765 GPIFLMIGGEGEASAKWMV--EGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~--~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
.|+||+++|.|.....+.. ..+...|++. |..|+.+|+|+||.|.... ...+.++.++|+..+++.++..
T Consensus 25 ~~~VlllHG~g~~~~~~~~~~~~la~~La~~-Gy~Vl~~Dl~G~G~S~g~~-------~~~~~~~~~~Dv~~ai~~L~~~ 96 (266)
T TIGR03101 25 RGVVIYLPPFAEEMNKSRRMVALQARAFAAG-GFGVLQIDLYGCGDSAGDF-------AAARWDVWKEDVAAAYRWLIEQ 96 (266)
T ss_pred ceEEEEECCCcccccchhHHHHHHHHHHHHC-CCEEEEECCCCCCCCCCcc-------ccCCHHHHHHHHHHHHHHHHhc
Confidence 4789999987653221111 1123345543 8999999999999995321 1236778889999999888764
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
. ..|++++|+|+||.+|..+..++|+.+.+.|..++++...
T Consensus 97 ~----~~~v~LvG~SmGG~vAl~~A~~~p~~v~~lVL~~P~~~g~ 137 (266)
T TIGR03101 97 G----HPPVTLWGLRLGALLALDAANPLAAKCNRLVLWQPVVSGK 137 (266)
T ss_pred C----CCCEEEEEECHHHHHHHHHHHhCccccceEEEeccccchH
Confidence 2 2489999999999999999999999999999888666544
No 30
>PF12697 Abhydrolase_6: Alpha/beta hydrolase family; PDB: 3LLC_A 3A2N_E 3A2M_A 3A2L_A 3AFI_F 3C5V_A 3C5W_P 3E0X_A 2ZJF_A 3QYJ_A ....
Probab=98.87 E-value=7e-09 Score=109.09 Aligned_cols=103 Identities=26% Similarity=0.319 Sum_probs=81.4
Q ss_pred EEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCC
Q psy3685 768 FLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPA 847 (1042)
Q Consensus 768 ~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~ 847 (1042)
||+++|-|.....|.. ++..|+ -|..||++|.||+|.|.+..+ ....+.++.++|+.++++.+..
T Consensus 1 vv~~hG~~~~~~~~~~--~~~~l~--~~~~v~~~d~~G~G~s~~~~~-----~~~~~~~~~~~~l~~~l~~~~~------ 65 (228)
T PF12697_consen 1 VVFLHGFGGSSESWDP--LAEALA--RGYRVIAFDLPGHGRSDPPPD-----YSPYSIEDYAEDLAELLDALGI------ 65 (228)
T ss_dssp EEEE-STTTTGGGGHH--HHHHHH--TTSEEEEEECTTSTTSSSHSS-----GSGGSHHHHHHHHHHHHHHTTT------
T ss_pred eEEECCCCCCHHHHHH--HHHHHh--CCCEEEEEecCCccccccccc-----cCCcchhhhhhhhhhccccccc------
Confidence 5777777665554443 455665 389999999999999976432 3346788889999999887654
Q ss_pred CCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 848 HTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 848 ~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
.+++++|+|+||.+|+.+..++|+.|.|+|..+++...
T Consensus 66 -~~~~lvG~S~Gg~~a~~~a~~~p~~v~~~vl~~~~~~~ 103 (228)
T PF12697_consen 66 -KKVILVGHSMGGMIALRLAARYPDRVKGLVLLSPPPPL 103 (228)
T ss_dssp -SSEEEEEETHHHHHHHHHHHHSGGGEEEEEEESESSSH
T ss_pred -ccccccccccccccccccccccccccccceeecccccc
Confidence 38999999999999999999999999999999888753
No 31
>PHA02768 hypothetical protein; Provisional
Probab=98.86 E-value=1.2e-09 Score=82.02 Aligned_cols=49 Identities=18% Similarity=0.481 Sum_probs=40.7
Q ss_pred cccCCccccccCCchhHHHHhhhcCCCcccCCCCCcccccCCCCcccccC
Q psy3685 585 RYKCDLCSKKFSRKLNLTKHMKIHQGIRYKCDLCSKEFSRKDIPYEVTVK 634 (1042)
Q Consensus 585 ~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~l~~~~~~ 634 (1042)
-|+|+.||+.|.+.++|..||++|+ .||+|..|++.|++++.|.++..+
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~-k~~kc~~C~k~f~~~s~l~~~~~~ 53 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN-TNLKLSNCKRISLRTGEYIEIKAE 53 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC-CcccCCcccceecccceeEEEecc
Confidence 3788999999999999999999988 688999999999888887775543
No 32
>TIGR01250 pro_imino_pep_2 proline-specific peptidases, Bacillus coagulans-type subfamily. This model describes a subfamily of the alpha/beta fold family of hydrolases. Characterized members include prolinases (Pro-Xaa dipeptidase, EC 3.4.13.8), prolyl aminopeptidases (EC 3.4.11.5), and a leucyl aminopeptidase
Probab=98.84 E-value=1.3e-08 Score=111.70 Aligned_cols=106 Identities=18% Similarity=0.198 Sum_probs=76.6
Q ss_pred CcEEEEE-CCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 765 GPIFLMI-GGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 765 ~pi~~~~-gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
+|.||++ ||.|.....|. .+..+....|..||++|+||+|.|...++ .-.+.++++.++|+..++..+..
T Consensus 25 ~~~vl~~hG~~g~~~~~~~---~~~~~l~~~g~~vi~~d~~G~G~s~~~~~----~~~~~~~~~~~~~~~~~~~~~~~-- 95 (288)
T TIGR01250 25 KIKLLLLHGGPGMSHEYLE---NLRELLKEEGREVIMYDQLGCGYSDQPDD----SDELWTIDYFVDELEEVREKLGL-- 95 (288)
T ss_pred CCeEEEEcCCCCccHHHHH---HHHHHHHhcCCEEEEEcCCCCCCCCCCCc----ccccccHHHHHHHHHHHHHHcCC--
Confidence 4445555 55555433222 24455555589999999999999964321 11246888899998888775532
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.+++++|+||||.+|.++...+|+.|.+.|..+++.
T Consensus 96 -----~~~~liG~S~Gg~ia~~~a~~~p~~v~~lvl~~~~~ 131 (288)
T TIGR01250 96 -----DKFYLLGHSWGGMLAQEYALKYGQHLKGLIISSMLD 131 (288)
T ss_pred -----CcEEEEEeehHHHHHHHHHHhCccccceeeEecccc
Confidence 369999999999999999999999999999776543
No 33
>TIGR02427 protocat_pcaD 3-oxoadipate enol-lactonase. Members of this family are 3-oxoadipate enol-lactonase. Note that the substrate is known as 3-oxoadipate enol-lactone, 2-oxo-2,3-dihydrofuran-5-acetate, 4,5-Dihydro-5-oxofuran-2-acetate, and 5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the catalyzes the fourth step in the protocatechuate degradation to beta-ketoadipate and then to succinyl-CoA and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate, and vanillate all can be converted in one step to protocatechuate. This enzyme also acts in catechol degradation. In genomes that catabolize both catechol and protocatechuate, two forms of this enzyme may be found. All members of the seed alignment for this model were chosen from within protocatechuate degradation operons of at least three genes of the pathway, from genomes with the complete pathway through beta-ketoadipate.
Probab=98.84 E-value=4.8e-09 Score=112.34 Aligned_cols=104 Identities=17% Similarity=0.225 Sum_probs=81.0
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
+.|+||++||.|.....|.. ++..|+. +..|+++|+||+|.|.+.. ...+.++.++|+..+++.+..
T Consensus 12 ~~~~li~~hg~~~~~~~~~~--~~~~l~~--~~~v~~~d~~G~G~s~~~~-------~~~~~~~~~~~~~~~i~~~~~-- 78 (251)
T TIGR02427 12 GAPVLVFINSLGTDLRMWDP--VLPALTP--DFRVLRYDKRGHGLSDAPE-------GPYSIEDLADDVLALLDHLGI-- 78 (251)
T ss_pred CCCeEEEEcCcccchhhHHH--HHHHhhc--ccEEEEecCCCCCCCCCCC-------CCCCHHHHHHHHHHHHHHhCC--
Confidence 67899999998776654432 3444443 7899999999999994321 235788888999888876532
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.+++++|+|+||++|..++.++|+.|.++|+.+++..
T Consensus 79 -----~~v~liG~S~Gg~~a~~~a~~~p~~v~~li~~~~~~~ 115 (251)
T TIGR02427 79 -----ERAVFCGLSLGGLIAQGLAARRPDRVRALVLSNTAAK 115 (251)
T ss_pred -----CceEEEEeCchHHHHHHHHHHCHHHhHHHhhccCccc
Confidence 3799999999999999999999999999998776543
No 34
>TIGR01840 esterase_phb esterase, PHB depolymerase family. This model describes a subfamily among lipases of the ab-hydrolase family. This subfamily includes bacterial depolymerases for poly(3-hydroxybutyrate) (PHB) and related polyhydroxyalkanoates (PHA), as well as acetyl xylan esterases, feruloyl esterases, and others from fungi.
Probab=98.83 E-value=2e-08 Score=104.83 Aligned_cols=122 Identities=20% Similarity=0.290 Sum_probs=86.2
Q ss_pred cCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCC-CCcccccCCCHHHHHhHHHHHHHHHh
Q psy3685 762 KHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSD-LSVESLQYLSAEQALADLAYFITSMN 840 (1042)
Q Consensus 762 ~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~-~~~~~~~~lt~~q~l~D~~~~~~~~~ 840 (1042)
....|+||+++|.|.....+....-+..+|++.|+.||+.|.|++|.+....+ ....... .......|+..++..+.
T Consensus 10 ~~~~P~vv~lHG~~~~~~~~~~~~~~~~~a~~~g~~Vv~Pd~~g~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~~~ 87 (212)
T TIGR01840 10 TGPRALVLALHGCGQTASAYVIDWGWKAAADRYGFVLVAPEQTSYNSSNNCWDWFFTHHRA--RGTGEVESLHQLIDAVK 87 (212)
T ss_pred CCCCCEEEEeCCCCCCHHHHhhhcChHHHHHhCCeEEEecCCcCccccCCCCCCCCccccC--CCCccHHHHHHHHHHHH
Confidence 44579999998877554434322225678888899999999999875532111 0000000 01234678888888888
Q ss_pred hhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 841 TLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 841 ~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
..+..+. .+++++|+|+||.+|+++...+|+.+.|+++.|+++..
T Consensus 88 ~~~~id~-~~i~l~G~S~Gg~~a~~~a~~~p~~~~~~~~~~g~~~~ 132 (212)
T TIGR01840 88 ANYSIDP-NRVYVTGLSAGGGMTAVLGCTYPDVFAGGASNAGLPYG 132 (212)
T ss_pred HhcCcCh-hheEEEEECHHHHHHHHHHHhCchhheEEEeecCCccc
Confidence 8775533 48999999999999999999999999999888877643
No 35
>PRK10673 acyl-CoA esterase; Provisional
Probab=98.83 E-value=1.3e-08 Score=109.96 Aligned_cols=100 Identities=16% Similarity=0.089 Sum_probs=77.0
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+++|.||+++|.+.....|. .+...|++ +..||++|.||+|.|.+.+ .+|.++..+|+.+++..+..
T Consensus 14 ~~~~~iv~lhG~~~~~~~~~--~~~~~l~~--~~~vi~~D~~G~G~s~~~~--------~~~~~~~~~d~~~~l~~l~~- 80 (255)
T PRK10673 14 HNNSPIVLVHGLFGSLDNLG--VLARDLVN--DHDIIQVDMRNHGLSPRDP--------VMNYPAMAQDLLDTLDALQI- 80 (255)
T ss_pred CCCCCEEEECCCCCchhHHH--HHHHHHhh--CCeEEEECCCCCCCCCCCC--------CCCHHHHHHHHHHHHHHcCC-
Confidence 45678888877654433332 23445555 5799999999999997642 25788889999999987522
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeec
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSAS 881 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ss 881 (1042)
.+++++|+|+||++|+.+..++|+.|.+.|..+
T Consensus 81 ------~~~~lvGhS~Gg~va~~~a~~~~~~v~~lvli~ 113 (255)
T PRK10673 81 ------EKATFIGHSMGGKAVMALTALAPDRIDKLVAID 113 (255)
T ss_pred ------CceEEEEECHHHHHHHHHHHhCHhhcceEEEEe
Confidence 369999999999999999999999999988754
No 36
>PLN02652 hydrolase; alpha/beta fold family protein
Probab=98.83 E-value=1.6e-08 Score=114.92 Aligned_cols=108 Identities=21% Similarity=0.240 Sum_probs=82.4
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCC-CHHHHHhHHHHHHHHHhhhc
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYL-SAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~l-t~~q~l~D~~~~~~~~~~~~ 843 (1042)
.|+||+++|.+.....| ..+...|+++ |+.|+++|+|+||.|... ..|. +.++.++|+..+++.+..++
T Consensus 136 ~~~Vl~lHG~~~~~~~~--~~~a~~L~~~-Gy~V~~~D~rGhG~S~~~-------~~~~~~~~~~~~Dl~~~l~~l~~~~ 205 (395)
T PLN02652 136 RGILIIIHGLNEHSGRY--LHFAKQLTSC-GFGVYAMDWIGHGGSDGL-------HGYVPSLDYVVEDTEAFLEKIRSEN 205 (395)
T ss_pred ceEEEEECCchHHHHHH--HHHHHHHHHC-CCEEEEeCCCCCCCCCCC-------CCCCcCHHHHHHHHHHHHHHHHHhC
Confidence 46899999987653333 3455566654 899999999999999532 1233 67888999999999998765
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCC---eEEEEEeecccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPH---LVHGAMSASGPLRA 886 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~---~~~g~i~ssa~~~~ 886 (1042)
. ..|++++|+|+||.+|+++. .+|+ .|.|.|+.|+.+..
T Consensus 206 ~---~~~i~lvGhSmGG~ial~~a-~~p~~~~~v~glVL~sP~l~~ 247 (395)
T PLN02652 206 P---GVPCFLFGHSTGGAVVLKAA-SYPSIEDKLEGIVLTSPALRV 247 (395)
T ss_pred C---CCCEEEEEECHHHHHHHHHH-hccCcccccceEEEECccccc
Confidence 2 35899999999999998776 4675 79999998876643
No 37
>PLN02211 methyl indole-3-acetate methyltransferase
Probab=98.82 E-value=1.9e-08 Score=109.34 Aligned_cols=107 Identities=17% Similarity=0.191 Sum_probs=80.2
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+++|.||+++|.|.....|. .+...|+++ |..|+++|+|++|.|.+.+. ...+.++.++|+..+++.+..
T Consensus 16 ~~~p~vvliHG~~~~~~~w~--~~~~~L~~~-g~~vi~~dl~g~G~s~~~~~------~~~~~~~~~~~l~~~i~~l~~- 85 (273)
T PLN02211 16 RQPPHFVLIHGISGGSWCWY--KIRCLMENS-GYKVTCIDLKSAGIDQSDAD------SVTTFDEYNKPLIDFLSSLPE- 85 (273)
T ss_pred CCCCeEEEECCCCCCcCcHH--HHHHHHHhC-CCEEEEecccCCCCCCCCcc------cCCCHHHHHHHHHHHHHhcCC-
Confidence 45789999999876544443 334445544 88999999999998844221 235777778888888775421
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
..+++++|+||||++++.+..++|+.|.+.|..++.+
T Consensus 86 -----~~~v~lvGhS~GG~v~~~~a~~~p~~v~~lv~~~~~~ 122 (273)
T PLN02211 86 -----NEKVILVGHSAGGLSVTQAIHRFPKKICLAVYVAATM 122 (273)
T ss_pred -----CCCEEEEEECchHHHHHHHHHhChhheeEEEEecccc
Confidence 2489999999999999999999999999999876643
No 38
>PRK11126 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase; Provisional
Probab=98.82 E-value=1.3e-08 Score=108.91 Aligned_cols=101 Identities=16% Similarity=0.114 Sum_probs=77.6
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
|+|+||+++|-+.....|.. ++..| . +..|+++|.||||.|.+... .+.++..+|+..+++.+.
T Consensus 1 ~~p~vvllHG~~~~~~~w~~--~~~~l--~-~~~vi~~D~~G~G~S~~~~~--------~~~~~~~~~l~~~l~~~~--- 64 (242)
T PRK11126 1 GLPWLVFLHGLLGSGQDWQP--VGEAL--P-DYPRLYIDLPGHGGSAAISV--------DGFADVSRLLSQTLQSYN--- 64 (242)
T ss_pred CCCEEEEECCCCCChHHHHH--HHHHc--C-CCCEEEecCCCCCCCCCccc--------cCHHHHHHHHHHHHHHcC---
Confidence 57889999888776655543 33344 2 68999999999999965321 167788888888887642
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCe-EEEEEeecccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHL-VHGAMSASGPL 884 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~-~~g~i~ssa~~ 884 (1042)
..+++++|+||||.+|..+..+||+. |.+.|..+++.
T Consensus 65 ----~~~~~lvG~S~Gg~va~~~a~~~~~~~v~~lvl~~~~~ 102 (242)
T PRK11126 65 ----ILPYWLVGYSLGGRIAMYYACQGLAGGLCGLIVEGGNP 102 (242)
T ss_pred ----CCCeEEEEECHHHHHHHHHHHhCCcccccEEEEeCCCC
Confidence 24899999999999999999999765 99988776554
No 39
>TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (PFAM model pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity.
Probab=98.81 E-value=1.2e-08 Score=111.54 Aligned_cols=105 Identities=17% Similarity=0.167 Sum_probs=80.6
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
++|+||+++|.|.....|. .++..|++ ++.|+++|+||+|.|.+... ...+++...+|++++++.+.
T Consensus 27 ~~~~vv~~hG~~~~~~~~~--~~~~~l~~--~~~vi~~D~~G~G~S~~~~~------~~~~~~~~~~~l~~~i~~~~--- 93 (278)
T TIGR03056 27 AGPLLLLLHGTGASTHSWR--DLMPPLAR--SFRVVAPDLPGHGFTRAPFR------FRFTLPSMAEDLSALCAAEG--- 93 (278)
T ss_pred CCCeEEEEcCCCCCHHHHH--HHHHHHhh--CcEEEeecCCCCCCCCCccc------cCCCHHHHHHHHHHHHHHcC---
Confidence 4789999988776555443 34556665 57999999999999965321 13478888899998887542
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
..+++++|+||||++|+.+..++|+.+.+.|+.+++..
T Consensus 94 ----~~~~~lvG~S~Gg~~a~~~a~~~p~~v~~~v~~~~~~~ 131 (278)
T TIGR03056 94 ----LSPDGVIGHSAGAAIALRLALDGPVTPRMVVGINAALM 131 (278)
T ss_pred ----CCCceEEEECccHHHHHHHHHhCCcccceEEEEcCccc
Confidence 13789999999999999999999999998887766543
No 40
>PLN02965 Probable pheophorbidase
Probab=98.80 E-value=1.6e-08 Score=109.08 Aligned_cols=101 Identities=17% Similarity=0.125 Sum_probs=76.6
Q ss_pred EEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCC
Q psy3685 768 FLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPA 847 (1042)
Q Consensus 768 ~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~ 847 (1042)
||+++|.|.....|. .++..|+.. +..|+++|+||||.|....+ ...|.++..+|+..+++.+..
T Consensus 6 vvllHG~~~~~~~w~--~~~~~L~~~-~~~via~Dl~G~G~S~~~~~------~~~~~~~~a~dl~~~l~~l~~------ 70 (255)
T PLN02965 6 FVFVHGASHGAWCWY--KLATLLDAA-GFKSTCVDLTGAGISLTDSN------TVSSSDQYNRPLFALLSDLPP------ 70 (255)
T ss_pred EEEECCCCCCcCcHH--HHHHHHhhC-CceEEEecCCcCCCCCCCcc------ccCCHHHHHHHHHHHHHhcCC------
Confidence 778888775544444 345556543 78999999999999942211 245678888999999886531
Q ss_pred CCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccc
Q psy3685 848 HTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGP 883 (1042)
Q Consensus 848 ~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~ 883 (1042)
..+++++|+||||++|..+..+||+.|.++|..++.
T Consensus 71 ~~~~~lvGhSmGG~ia~~~a~~~p~~v~~lvl~~~~ 106 (255)
T PLN02965 71 DHKVILVGHSIGGGSVTEALCKFTDKISMAIYVAAA 106 (255)
T ss_pred CCCEEEEecCcchHHHHHHHHhCchheeEEEEEccc
Confidence 138999999999999999999999999999876654
No 41
>PRK03592 haloalkane dehalogenase; Provisional
Probab=98.80 E-value=1.7e-08 Score=111.68 Aligned_cols=104 Identities=14% Similarity=0.117 Sum_probs=82.2
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
++|.||+++|-+.....|. .++..|++. +.||++|.||||.|.+.+. ..+.+...+|+..+++.+..
T Consensus 26 ~g~~vvllHG~~~~~~~w~--~~~~~L~~~--~~via~D~~G~G~S~~~~~-------~~~~~~~a~dl~~ll~~l~~-- 92 (295)
T PRK03592 26 EGDPIVFLHGNPTSSYLWR--NIIPHLAGL--GRCLAPDLIGMGASDKPDI-------DYTFADHARYLDAWFDALGL-- 92 (295)
T ss_pred CCCEEEEECCCCCCHHHHH--HHHHHHhhC--CEEEEEcCCCCCCCCCCCC-------CCCHHHHHHHHHHHHHHhCC--
Confidence 4677888888766655554 356677776 3999999999999954321 14778888999999887642
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.+++++|+|+||.+|..+..+||+.|.+.|..++++.
T Consensus 93 -----~~~~lvGhS~Gg~ia~~~a~~~p~~v~~lil~~~~~~ 129 (295)
T PRK03592 93 -----DDVVLVGHDWGSALGFDWAARHPDRVRGIAFMEAIVR 129 (295)
T ss_pred -----CCeEEEEECHHHHHHHHHHHhChhheeEEEEECCCCC
Confidence 3899999999999999999999999999998887543
No 42
>PHA00733 hypothetical protein
Probab=98.79 E-value=3.1e-09 Score=98.68 Aligned_cols=58 Identities=22% Similarity=0.394 Sum_probs=35.0
Q ss_pred HHhcCcccccCcCCcccCCchhhhhHhhhhcCCcccCCccccccCCchhHHHHhhhcCC
Q psy3685 552 EIHQGIRYKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKKFSRKLNLTKHMKIHQG 610 (1042)
Q Consensus 552 ~~H~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ 610 (1042)
..+...||.|+.||+.|.+...|..|++.|. .+|.|++|++.|.....|..|+...|+
T Consensus 67 ~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~-~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 67 TSKAVSPYVCPLCLMPFSSSVSLKQHIRYTE-HSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred ccCCCCCccCCCCCCcCCCHHHHHHHHhcCC-cCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 3334446666666666666666666666552 256666666666666666666665554
No 43
>TIGR03611 RutD pyrimidine utilization protein D. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the hydrolase, alpha/beta fold family defined by pfam00067.
Probab=98.77 E-value=2.9e-08 Score=106.95 Aligned_cols=105 Identities=16% Similarity=0.227 Sum_probs=80.7
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+++|+||+++|.+.....|.. ++..|++ +..|+++|.||+|.|.... -...+.++.++|+.++++.+..
T Consensus 11 ~~~~~iv~lhG~~~~~~~~~~--~~~~l~~--~~~vi~~D~~G~G~S~~~~------~~~~~~~~~~~~~~~~i~~~~~- 79 (257)
T TIGR03611 11 ADAPVVVLSSGLGGSGSYWAP--QLDVLTQ--RFHVVTYDHRGTGRSPGEL------PPGYSIAHMADDVLQLLDALNI- 79 (257)
T ss_pred CCCCEEEEEcCCCcchhHHHH--HHHHHHh--ccEEEEEcCCCCCCCCCCC------cccCCHHHHHHHHHHHHHHhCC-
Confidence 457889999888766554542 2334443 6899999999999995421 1235788899999999876532
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.+++++|+|+||++|..++.++|+.|.++|..++..
T Consensus 80 ------~~~~l~G~S~Gg~~a~~~a~~~~~~v~~~i~~~~~~ 115 (257)
T TIGR03611 80 ------ERFHFVGHALGGLIGLQLALRYPERLLSLVLINAWS 115 (257)
T ss_pred ------CcEEEEEechhHHHHHHHHHHChHHhHHheeecCCC
Confidence 379999999999999999999999999988776543
No 44
>KOG4409|consensus
Probab=98.77 E-value=2e-08 Score=105.86 Aligned_cols=108 Identities=20% Similarity=0.246 Sum_probs=77.1
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
.+.+-+|+++|.|+..+.|..+ +..||+ .-.|+++|..|+|.|.-. .++.+.-. -.++.++-++++.. .
T Consensus 88 ~~~~plVliHGyGAg~g~f~~N--f~~La~--~~~vyaiDllG~G~SSRP-~F~~d~~~--~e~~fvesiE~WR~----~ 156 (365)
T KOG4409|consen 88 ANKTPLVLIHGYGAGLGLFFRN--FDDLAK--IRNVYAIDLLGFGRSSRP-KFSIDPTT--AEKEFVESIEQWRK----K 156 (365)
T ss_pred cCCCcEEEEeccchhHHHHHHh--hhhhhh--cCceEEecccCCCCCCCC-CCCCCccc--chHHHHHHHHHHHH----H
Confidence 4456689999999988887753 778888 569999999999999431 22221100 11244444444433 3
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.++ .+.||+|||+||-||+-|++|||+.|.-.|..|+.-
T Consensus 157 ~~L---~KmilvGHSfGGYLaa~YAlKyPerV~kLiLvsP~G 195 (365)
T KOG4409|consen 157 MGL---EKMILVGHSFGGYLAAKYALKYPERVEKLILVSPWG 195 (365)
T ss_pred cCC---cceeEeeccchHHHHHHHHHhChHhhceEEEecccc
Confidence 333 389999999999999999999999999888777443
No 45
>PLN02511 hydrolase
Probab=98.76 E-value=2.3e-08 Score=114.11 Aligned_cols=112 Identities=16% Similarity=0.178 Sum_probs=81.8
Q ss_pred CCCcEEEEECCCCC-CcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 763 HGGPIFLMIGGEGE-ASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~-~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
+++|+||+++|.+. ....|.. .++.. +.+.|..||++|+||+|.|..... ++ ......+|+..++++++.
T Consensus 98 ~~~p~vvllHG~~g~s~~~y~~-~~~~~-~~~~g~~vv~~d~rG~G~s~~~~~------~~-~~~~~~~Dl~~~i~~l~~ 168 (388)
T PLN02511 98 ADAPVLILLPGLTGGSDDSYVR-HMLLR-ARSKGWRVVVFNSRGCADSPVTTP------QF-YSASFTGDLRQVVDHVAG 168 (388)
T ss_pred CCCCEEEEECCCCCCCCCHHHH-HHHHH-HHHCCCEEEEEecCCCCCCCCCCc------CE-EcCCchHHHHHHHHHHHH
Confidence 46789999977643 3332332 12222 334589999999999999954211 11 123456899999999998
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHHHhCCCe--EEEEEeecccccc
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLRYKYPHL--VHGAMSASGPLRA 886 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~--~~g~i~ssa~~~~ 886 (1042)
.+. +.+++++|+|+||++++.+..++|+. |.|+|+.|+|...
T Consensus 169 ~~~---~~~~~lvG~SlGg~i~~~yl~~~~~~~~v~~~v~is~p~~l 212 (388)
T PLN02511 169 RYP---SANLYAAGWSLGANILVNYLGEEGENCPLSGAVSLCNPFDL 212 (388)
T ss_pred HCC---CCCEEEEEechhHHHHHHHHHhcCCCCCceEEEEECCCcCH
Confidence 763 35899999999999999999999998 8899988888753
No 46
>PLN03087 BODYGUARD 1 domain containing hydrolase; Provisional
Probab=98.74 E-value=6.5e-08 Score=111.31 Aligned_cols=108 Identities=18% Similarity=0.161 Sum_probs=78.1
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHH--HcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHH-HHHHHHh
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAH--NHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLA-YFITSMN 840 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~--~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~-~~~~~~~ 840 (1042)
.+|.||+++|.+.....|.. .++..+++ ..+..||++|.||||.|....+ ...+.++.++|++ .|++.+.
T Consensus 200 ~k~~VVLlHG~~~s~~~W~~-~~~~~L~~~~~~~yrVia~Dl~G~G~S~~p~~------~~ytl~~~a~~l~~~ll~~lg 272 (481)
T PLN03087 200 AKEDVLFIHGFISSSAFWTE-TLFPNFSDAAKSTYRLFAVDLLGFGRSPKPAD------SLYTLREHLEMIERSVLERYK 272 (481)
T ss_pred CCCeEEEECCCCccHHHHHH-HHHHHHHHHhhCCCEEEEECCCCCCCCcCCCC------CcCCHHHHHHHHHHHHHHHcC
Confidence 35678888887765555543 12234443 2378999999999999953221 2357777777774 5655432
Q ss_pred hhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 841 TLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 841 ~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
..+++++|+|+||++|..++.+||+.|.+.|..+++..
T Consensus 273 -------~~k~~LVGhSmGG~iAl~~A~~~Pe~V~~LVLi~~~~~ 310 (481)
T PLN03087 273 -------VKSFHIVAHSLGCILALALAVKHPGAVKSLTLLAPPYY 310 (481)
T ss_pred -------CCCEEEEEECHHHHHHHHHHHhChHhccEEEEECCCcc
Confidence 24899999999999999999999999999998877654
No 47
>PRK03204 haloalkane dehalogenase; Provisional
Probab=98.71 E-value=7.9e-08 Score=105.46 Aligned_cols=104 Identities=13% Similarity=0.195 Sum_probs=74.0
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
.+|.||+++|.+.....|. .++..|++ +..||++|+||||.|....+. ..+.+...+|+..+++.+
T Consensus 33 ~~~~iv~lHG~~~~~~~~~--~~~~~l~~--~~~vi~~D~~G~G~S~~~~~~------~~~~~~~~~~~~~~~~~~---- 98 (286)
T PRK03204 33 TGPPILLCHGNPTWSFLYR--DIIVALRD--RFRCVAPDYLGFGLSERPSGF------GYQIDEHARVIGEFVDHL---- 98 (286)
T ss_pred CCCEEEEECCCCccHHHHH--HHHHHHhC--CcEEEEECCCCCCCCCCCCcc------ccCHHHHHHHHHHHHHHh----
Confidence 4678888888664433333 23334444 579999999999999543211 135566666666666553
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
+ ..+++++|+||||++|..+...+|+.|.++|..+++.
T Consensus 99 ~---~~~~~lvG~S~Gg~va~~~a~~~p~~v~~lvl~~~~~ 136 (286)
T PRK03204 99 G---LDRYLSMGQDWGGPISMAVAVERADRVRGVVLGNTWF 136 (286)
T ss_pred C---CCCEEEEEECccHHHHHHHHHhChhheeEEEEECccc
Confidence 2 1479999999999999999999999999999876654
No 48
>PLN03084 alpha/beta hydrolase fold protein; Provisional
Probab=98.71 E-value=3.8e-08 Score=111.05 Aligned_cols=108 Identities=14% Similarity=0.093 Sum_probs=85.1
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
++|+||+++|.+.....|. .++..|++ +..||++|.||||.|.+.... .....++++.++|+..|++.+..
T Consensus 126 ~~~~ivllHG~~~~~~~w~--~~~~~L~~--~~~Via~DlpG~G~S~~p~~~---~~~~ys~~~~a~~l~~~i~~l~~-- 196 (383)
T PLN03084 126 NNPPVLLIHGFPSQAYSYR--KVLPVLSK--NYHAIAFDWLGFGFSDKPQPG---YGFNYTLDEYVSSLESLIDELKS-- 196 (383)
T ss_pred CCCeEEEECCCCCCHHHHH--HHHHHHhc--CCEEEEECCCCCCCCCCCccc---ccccCCHHHHHHHHHHHHHHhCC--
Confidence 4789999999876655554 34556665 689999999999999643211 11235888899999999987642
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.+++++|+|+||++|..+...||+.|.++|..++++.
T Consensus 197 -----~~~~LvG~s~GG~ia~~~a~~~P~~v~~lILi~~~~~ 233 (383)
T PLN03084 197 -----DKVSLVVQGYFSPPVVKYASAHPDKIKKLILLNPPLT 233 (383)
T ss_pred -----CCceEEEECHHHHHHHHHHHhChHhhcEEEEECCCCc
Confidence 3799999999999999999999999999999987753
No 49
>PLN02679 hydrolase, alpha/beta fold family protein
Probab=98.70 E-value=4.2e-08 Score=111.23 Aligned_cols=103 Identities=17% Similarity=0.158 Sum_probs=78.5
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcC
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYS 844 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~ 844 (1042)
+|.||+++|.|.....|.. ++..|++ +..||++|+||||.|.+.++ . ..+++...+|+..|++.+..
T Consensus 88 gp~lvllHG~~~~~~~w~~--~~~~L~~--~~~via~Dl~G~G~S~~~~~-----~-~~~~~~~a~~l~~~l~~l~~--- 154 (360)
T PLN02679 88 GPPVLLVHGFGASIPHWRR--NIGVLAK--NYTVYAIDLLGFGASDKPPG-----F-SYTMETWAELILDFLEEVVQ--- 154 (360)
T ss_pred CCeEEEECCCCCCHHHHHH--HHHHHhc--CCEEEEECCCCCCCCCCCCC-----c-cccHHHHHHHHHHHHHHhcC---
Confidence 5889999998876655543 3445655 68999999999999965322 1 24777788888888876532
Q ss_pred CCCCCcEEEEccchhHHHHHHHHH-hCCCeEEEEEeecccc
Q psy3685 845 LPAHTKWIAFGGSYPGALAAWLRY-KYPHLVHGAMSASGPL 884 (1042)
Q Consensus 845 ~~~~~~~i~~G~Sygg~la~~~~~-~yP~~~~g~i~ssa~~ 884 (1042)
.+++++|+|+||.+|..+.. .+|+.|.+.|..+++.
T Consensus 155 ----~~~~lvGhS~Gg~ia~~~a~~~~P~rV~~LVLi~~~~ 191 (360)
T PLN02679 155 ----KPTVLIGNSVGSLACVIAASESTRDLVRGLVLLNCAG 191 (360)
T ss_pred ----CCeEEEEECHHHHHHHHHHHhcChhhcCEEEEECCcc
Confidence 38999999999999988776 6899999999777653
No 50
>TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself.
Probab=98.70 E-value=3.4e-08 Score=108.42 Aligned_cols=107 Identities=15% Similarity=0.173 Sum_probs=74.1
Q ss_pred CCCcEEEEECCCCCCcccccc-cccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMV-EGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~-~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
+++|.||+++|-|.....|.. -..+..+++. +..||++|.||+|.|.+.... .. .+. ...+|+.+|++.+..
T Consensus 28 g~~~~ivllHG~~~~~~~~~~~~~~~~~l~~~-~~~vi~~D~~G~G~S~~~~~~--~~---~~~-~~~~~l~~~l~~l~~ 100 (282)
T TIGR03343 28 GNGEAVIMLHGGGPGAGGWSNYYRNIGPFVDA-GYRVILKDSPGFNKSDAVVMD--EQ---RGL-VNARAVKGLMDALDI 100 (282)
T ss_pred CCCCeEEEECCCCCchhhHHHHHHHHHHHHhC-CCEEEEECCCCCCCCCCCcCc--cc---ccc-hhHHHHHHHHHHcCC
Confidence 356788889887655444431 1123344443 799999999999999653210 00 011 124677777765422
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccc
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGP 883 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~ 883 (1042)
.+++++|+||||++|..+..+||+.|.++|..+++
T Consensus 101 -------~~~~lvG~S~Gg~ia~~~a~~~p~~v~~lvl~~~~ 135 (282)
T TIGR03343 101 -------EKAHLVGNSMGGATALNFALEYPDRIGKLILMGPG 135 (282)
T ss_pred -------CCeeEEEECchHHHHHHHHHhChHhhceEEEECCC
Confidence 37999999999999999999999999999987754
No 51
>TIGR01249 pro_imino_pep_1 proline iminopeptidase, Neisseria-type subfamily. This model represents one of two related families of proline iminopeptidase in the alpha/beta fold hydrolase family. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members.
Probab=98.69 E-value=3.6e-08 Score=109.42 Aligned_cols=79 Identities=25% Similarity=0.337 Sum_probs=62.7
Q ss_pred cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCe
Q psy3685 794 HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHL 873 (1042)
Q Consensus 794 ~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~ 873 (1042)
.+..||++|+||||.|.+.... .-.+.++.++|+..+++.+.. .+++++|+||||++|+.+..+||+.
T Consensus 52 ~~~~vi~~D~~G~G~S~~~~~~-----~~~~~~~~~~dl~~l~~~l~~-------~~~~lvG~S~GG~ia~~~a~~~p~~ 119 (306)
T TIGR01249 52 ETYRIVLFDQRGCGKSTPHACL-----EENTTWDLVADIEKLREKLGI-------KNWLVFGGSWGSTLALAYAQTHPEV 119 (306)
T ss_pred cCCEEEEECCCCCCCCCCCCCc-----ccCCHHHHHHHHHHHHHHcCC-------CCEEEEEECHHHHHHHHHHHHChHh
Confidence 3689999999999999753211 123677788888888766531 3799999999999999999999999
Q ss_pred EEEEEeecccc
Q psy3685 874 VHGAMSASGPL 884 (1042)
Q Consensus 874 ~~g~i~ssa~~ 884 (1042)
|.+.|..++.+
T Consensus 120 v~~lvl~~~~~ 130 (306)
T TIGR01249 120 VTGLVLRGIFL 130 (306)
T ss_pred hhhheeecccc
Confidence 99988877654
No 52
>PRK06489 hypothetical protein; Provisional
Probab=98.69 E-value=4.6e-08 Score=111.16 Aligned_cols=112 Identities=14% Similarity=0.159 Sum_probs=73.0
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHH------HcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHH-H
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAH------NHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFI-T 837 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~------~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~-~ 837 (1042)
+|.||+++|-+.....|....+...|.. .-+..||++|.||||.|....+....+....++++.++|+..++ +
T Consensus 69 gpplvllHG~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~Via~Dl~GhG~S~~p~~~~~~~~~~~~~~~~a~~~~~~l~~ 148 (360)
T PRK06489 69 DNAVLVLHGTGGSGKSFLSPTFAGELFGPGQPLDASKYFIILPDGIGHGKSSKPSDGLRAAFPRYDYDDMVEAQYRLVTE 148 (360)
T ss_pred CCeEEEeCCCCCchhhhccchhHHHhcCCCCcccccCCEEEEeCCCCCCCCCCCCcCCCCCCCcccHHHHHHHHHHHHHH
Confidence 6888888887655444432223323310 12579999999999999532221111112246777777766654 3
Q ss_pred HHhhhcCCCCCCcE-EEEccchhHHHHHHHHHhCCCeEEEEEeeccc
Q psy3685 838 SMNTLYSLPAHTKW-IAFGGSYPGALAAWLRYKYPHLVHGAMSASGP 883 (1042)
Q Consensus 838 ~~~~~~~~~~~~~~-i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~ 883 (1042)
.+. . .++ +++|+||||++|..+..+||+.|.+.|..++.
T Consensus 149 ~lg----i---~~~~~lvG~SmGG~vAl~~A~~~P~~V~~LVLi~s~ 188 (360)
T PRK06489 149 GLG----V---KHLRLILGTSMGGMHAWMWGEKYPDFMDALMPMASQ 188 (360)
T ss_pred hcC----C---CceeEEEEECHHHHHHHHHHHhCchhhheeeeeccC
Confidence 332 1 255 68999999999999999999999999976543
No 53
>TIGR03695 menH_SHCHC 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase. This protein catalyzes the formation of SHCHC, or (1 R,6 R)-2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate, by elmination of pyruvate from 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate (SEPHCHC). Note that SHCHC synthase activity previously was attributed to MenD, which in fact is SEPHCHC synthase.
Probab=98.69 E-value=4.8e-08 Score=104.39 Aligned_cols=104 Identities=20% Similarity=0.318 Sum_probs=76.1
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhH-HHHHHHHHhhhc
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALAD-LAYFITSMNTLY 843 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D-~~~~~~~~~~~~ 843 (1042)
+|+||+++|.+.....|. .+...|+ .+..|+++|+||+|.|.... .....+.++++.| +..+++.+.
T Consensus 1 ~~~vv~~hG~~~~~~~~~--~~~~~L~--~~~~v~~~d~~g~G~s~~~~-----~~~~~~~~~~~~~~~~~~~~~~~--- 68 (251)
T TIGR03695 1 KPVLVFLHGFLGSGADWQ--ALIELLG--PHFRCLAIDLPGHGSSQSPD-----EIERYDFEEAAQDILATLLDQLG--- 68 (251)
T ss_pred CCEEEEEcCCCCchhhHH--HHHHHhc--ccCeEEEEcCCCCCCCCCCC-----ccChhhHHHHHHHHHHHHHHHcC---
Confidence 477888888766554443 3444555 37899999999999994321 1234567777777 555554431
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
..+++++|+|+||.+|+.+..++|+.|.++|+.+++.
T Consensus 69 ----~~~~~l~G~S~Gg~ia~~~a~~~~~~v~~lil~~~~~ 105 (251)
T TIGR03695 69 ----IEPFFLVGYSMGGRIALYYALQYPERVQGLILESGSP 105 (251)
T ss_pred ----CCeEEEEEeccHHHHHHHHHHhCchheeeeEEecCCC
Confidence 2489999999999999999999999999999877654
No 54
>PHA02768 hypothetical protein; Provisional
Probab=98.67 E-value=8.3e-09 Score=77.58 Aligned_cols=42 Identities=19% Similarity=0.490 Sum_probs=20.6
Q ss_pred cccCcCCcccCCchhhhhHhhhhcCCcccCCccccccCCchhH
Q psy3685 559 YKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKKFSRKLNL 601 (1042)
Q Consensus 559 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L 601 (1042)
|+|+.||+.|.+.++|..|+++|+ .+|+|..|++.|.+.+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~-k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN-TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC-CcccCCcccceeccccee
Confidence 445555555555555555555544 344555555555444443
No 55
>PLN02894 hydrolase, alpha/beta fold family protein
Probab=98.65 E-value=1.9e-07 Score=107.12 Aligned_cols=109 Identities=18% Similarity=0.223 Sum_probs=73.9
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+++|+||+++|.|.....|.. .+..|++ +..|+++|+|++|.|...+ . .+.+.+++.++++..+..+...
T Consensus 103 ~~~p~vvllHG~~~~~~~~~~--~~~~L~~--~~~vi~~D~rG~G~S~~~~-~-----~~~~~~~~~~~~~~~i~~~~~~ 172 (402)
T PLN02894 103 EDAPTLVMVHGYGASQGFFFR--NFDALAS--RFRVIAIDQLGWGGSSRPD-F-----TCKSTEETEAWFIDSFEEWRKA 172 (402)
T ss_pred CCCCEEEEECCCCcchhHHHH--HHHHHHh--CCEEEEECCCCCCCCCCCC-c-----ccccHHHHHHHHHHHHHHHHHH
Confidence 457889999888765544432 3456665 4799999999999995321 1 1223444443333333222222
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
... .+++++|+|+||.||+.+..+||+.|.++|..+++.
T Consensus 173 l~~---~~~~lvGhS~GG~la~~~a~~~p~~v~~lvl~~p~~ 211 (402)
T PLN02894 173 KNL---SNFILLGHSFGGYVAAKYALKHPEHVQHLILVGPAG 211 (402)
T ss_pred cCC---CCeEEEEECHHHHHHHHHHHhCchhhcEEEEECCcc
Confidence 222 389999999999999999999999999988776543
No 56
>PRK08775 homoserine O-acetyltransferase; Provisional
Probab=98.64 E-value=6.4e-08 Score=109.22 Aligned_cols=106 Identities=14% Similarity=0.171 Sum_probs=73.7
Q ss_pred CCcEEEEECCCCCCccccc-----ccccHHHHHH---H---cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHH
Q psy3685 764 GGPIFLMIGGEGEASAKWM-----VEGTWLDYAH---N---HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADL 832 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~-----~~~~~~~la~---~---~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~ 832 (1042)
+.|+||+.||.+....... ..+++..+.. . -+..||++|+||+|.|.+. ..+++...+|+
T Consensus 57 ~~p~vll~g~~~~~~~~~~~~~~~~~~~w~~~v~~~~~L~~~~~~Vi~~Dl~G~g~s~~~---------~~~~~~~a~dl 127 (343)
T PRK08775 57 GAPVVFVAGGISAHRHVAATATFPEKGWWEGLVGSGRALDPARFRLLAFDFIGADGSLDV---------PIDTADQADAI 127 (343)
T ss_pred CCCEEEEecCCCcccccccccCCCCCCcchhccCCCCccCccccEEEEEeCCCCCCCCCC---------CCCHHHHHHHH
Confidence 5688888888776542000 0012222221 1 1579999999999988432 12456667888
Q ss_pred HHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 833 AYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 833 ~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
+.|++.+..+ .+++++|+||||++|..+..+||+.|.+.|..++..
T Consensus 128 ~~ll~~l~l~------~~~~lvG~SmGG~vA~~~A~~~P~~V~~LvLi~s~~ 173 (343)
T PRK08775 128 ALLLDALGIA------RLHAFVGYSYGALVGLQFASRHPARVRTLVVVSGAH 173 (343)
T ss_pred HHHHHHcCCC------cceEEEEECHHHHHHHHHHHHChHhhheEEEECccc
Confidence 8888876321 246899999999999999999999999999776544
No 57
>KOG3993|consensus
Probab=98.64 E-value=1.2e-08 Score=107.52 Aligned_cols=188 Identities=21% Similarity=0.337 Sum_probs=114.6
Q ss_pred ccccCCCCcccCCchhHHHHHH--HhcCCccccCccCcccCCchhHHHHHHHhcCccccc--Cc--CCcccCCchhhhhH
Q psy3685 504 RYKCDLCSKEFSSKGNLTEHMK--QHQGIRYKCDLCSKEFSRKDNLTEHKEIHQGIRYKC--DL--CSKEFSRKDNLTEH 577 (1042)
Q Consensus 504 ~~~C~~C~~~f~~~~~L~~H~~--~H~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C--~~--C~k~F~~~~~L~~H 577 (1042)
-|.|..|...|...-.|.+|.= +-+ ..|+|+.|+|.|.-..+|..|.|-|.-.+=-= .. =.+.-.+. .+.+-
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~-vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~r-ae~~e 344 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVH-VEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETR-AEVQE 344 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEE-eeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhh-hhhhh
Confidence 4888888888888888888741 111 25888888888888888888888874321000 00 00000000 01000
Q ss_pred hhhhcC-----CcccCCccccccCCchhHHHHhhhcCCCc------------------ccCCCCCcccccCCCCcccccC
Q psy3685 578 KEIHQG-----IRYKCDLCSKKFSRKLNLTKHMKIHQGIR------------------YKCDLCSKEFSRKDIPYEVTVK 634 (1042)
Q Consensus 578 ~~~H~~-----~~~~C~~C~k~F~~~~~L~~H~~~H~~~~------------------~~C~~C~k~F~~~~~l~~~~~~ 634 (1042)
..-.| .-|.|.+|+|+|++...|++|+-+|+..+ +-|..|+-.+...+ +|..
T Consensus 345 -a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~----~~g~ 419 (500)
T KOG3993|consen 345 -AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASD----SHGD 419 (500)
T ss_pred -ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhccccccccccccccccccc----cccc
Confidence 00022 14999999999999999999998886421 22444443333211 1211
Q ss_pred CcCcccCCccccccCCCcccccccccccccccccccccccccccCCccccCCCCCCccCCcchHHHHHhh-ccC
Q psy3685 635 EESYEGDSINVEENNGNDAVTRDCLEFFKTEQNTTLGKLNKILHQDIRYKCDLCSKDFSRKDNLTEHKEI-HQG 707 (1042)
Q Consensus 635 ~~~~~c~~~~~~~~~~~~~~~~~C~~~f~~~~~~~l~~h~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~-H~g 707 (1042)
+..+.-. ...-..+.+|+..+++ +..-..+.+....+..|.|.+|--+|...-+|.+|... |..
T Consensus 420 ~vl~~a~-------sael~~pp~~~~ppss--s~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hps 484 (500)
T KOG3993|consen 420 EVLYVAG-------SAELELPPYDGSPPSS--SGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPS 484 (500)
T ss_pred ceeeeec-------cccccCCCCCCCCccc--CCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChH
Confidence 2111111 1122345578877777 66677777777778889999999999999999999754 533
No 58
>PRK10985 putative hydrolase; Provisional
Probab=98.63 E-value=1e-07 Score=106.53 Aligned_cols=111 Identities=18% Similarity=0.164 Sum_probs=77.5
Q ss_pred CCcEEEEECCCCCCcc-cccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 764 GGPIFLMIGGEGEASA-KWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~-~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
..|+||+++|.++... .+. ..++..|++ .|..|+.++.||+|.|.... .. ..+.. .++|++.+++.++..
T Consensus 57 ~~p~vll~HG~~g~~~~~~~-~~~~~~l~~-~G~~v~~~d~rG~g~~~~~~---~~---~~~~~-~~~D~~~~i~~l~~~ 127 (324)
T PRK10985 57 HKPRLVLFHGLEGSFNSPYA-HGLLEAAQK-RGWLGVVMHFRGCSGEPNRL---HR---IYHSG-ETEDARFFLRWLQRE 127 (324)
T ss_pred CCCEEEEeCCCCCCCcCHHH-HHHHHHHHH-CCCEEEEEeCCCCCCCccCC---cc---eECCC-chHHHHHHHHHHHHh
Confidence 4689999988754422 222 223444554 49999999999999763210 01 11111 368999999988876
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCe--EEEEEeecccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHL--VHGAMSASGPLRA 886 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~--~~g~i~ssa~~~~ 886 (1042)
++ ..|++++|+|+||++++.+..++++. +.++|+.++|...
T Consensus 128 ~~---~~~~~~vG~S~GG~i~~~~~~~~~~~~~~~~~v~i~~p~~~ 170 (324)
T PRK10985 128 FG---HVPTAAVGYSLGGNMLACLLAKEGDDLPLDAAVIVSAPLML 170 (324)
T ss_pred CC---CCCEEEEEecchHHHHHHHHHhhCCCCCccEEEEEcCCCCH
Confidence 53 35899999999999988888887754 8888888888753
No 59
>PHA00733 hypothetical protein
Probab=98.63 E-value=3.3e-08 Score=91.84 Aligned_cols=58 Identities=24% Similarity=0.482 Sum_probs=33.8
Q ss_pred hhhcCceeccCccccccCCchhHHHHHhhcCCCcccCCCCCcccCCchhHHHHHHhhcC
Q psy3685 418 EIHQGIRYKCDLCSKKFSRKLNLTKHMKIHQGIRYKCDLCSKEFSRKGNLTKHMKLHQG 476 (1042)
Q Consensus 418 ~~h~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~C~~C~k~f~~~~~L~~H~~~H~~ 476 (1042)
..+...||.|+.|++.|.+...|..|++.| ..+|.|+.|++.|.....|..|+...++
T Consensus 67 ~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h-~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 67 TSKAVSPYVCPLCLMPFSSSVSLKQHIRYT-EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred ccCCCCCccCCCCCCcCCCHHHHHHHHhcC-CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 333444566666666666666666666554 2356666666666666666666655443
No 60
>PRK10349 carboxylesterase BioH; Provisional
Probab=98.62 E-value=7.9e-08 Score=103.83 Aligned_cols=98 Identities=17% Similarity=0.200 Sum_probs=71.1
Q ss_pred cCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 762 KHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 762 ~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
+.|.|.||+++|.|.....|.. ++..|++ +..|+++|+||+|.|...+ ..++++.++|+.++
T Consensus 10 G~g~~~ivllHG~~~~~~~w~~--~~~~L~~--~~~vi~~Dl~G~G~S~~~~--------~~~~~~~~~~l~~~------ 71 (256)
T PRK10349 10 GQGNVHLVLLHGWGLNAEVWRC--IDEELSS--HFTLHLVDLPGFGRSRGFG--------ALSLADMAEAVLQQ------ 71 (256)
T ss_pred CCCCCeEEEECCCCCChhHHHH--HHHHHhc--CCEEEEecCCCCCCCCCCC--------CCCHHHHHHHHHhc------
Confidence 3344568888887766665642 4555654 4799999999999995321 24566665555431
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
. ..+++++|+|+||.+|..++.++|+.|.+.|..++
T Consensus 72 --~---~~~~~lvGhS~Gg~ia~~~a~~~p~~v~~lili~~ 107 (256)
T PRK10349 72 --A---PDKAIWLGWSLGGLVASQIALTHPERVQALVTVAS 107 (256)
T ss_pred --C---CCCeEEEEECHHHHHHHHHHHhChHhhheEEEecC
Confidence 1 13799999999999999999999999999986544
No 61
>KOG3993|consensus
Probab=98.60 E-value=1e-08 Score=108.07 Aligned_cols=106 Identities=29% Similarity=0.544 Sum_probs=62.4
Q ss_pred cccCCcccccccCchHHHHHHh--hccccccccCCCCCccCCchhHHHHHHhhcCCccccCcc---Cc-cccCcchHHhH
Q psy3685 343 RYKCDLCSKKFSRKLNLTKHMK--IHQGIRYKCDLCSKEFSRKGNLTKHMKLHQGIRYKCDLC---LK-EFSRKDNLTEH 416 (1042)
Q Consensus 343 ~~~C~~C~k~f~~~~~L~~H~~--~h~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~C~~C---~k-~f~~~~~L~~H 416 (1042)
.|.|..|...|.+.-.|.+|.- +-+ ..|+|+.|+|.|+-..+|..|.|+|...+-.=..= .+ .-.+....++-
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~-vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVH-VEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEE-eeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 3788888888888888888852 211 24888888888888888888888886542110000 00 00000001110
Q ss_pred hhh---hcCceeccCccccccCCchhHHHHHhhcCC
Q psy3685 417 KEI---HQGIRYKCDLCSKKFSRKLNLTKHMKIHQG 449 (1042)
Q Consensus 417 ~~~---h~~~~~~C~~C~~~f~~~~~L~~H~~~H~~ 449 (1042)
.+. -+..-|.|.+|+|+|.....|++|+.+|+.
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~ 381 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQR 381 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhc
Confidence 110 011247888888888888888888777764
No 62
>PLN02578 hydrolase
Probab=98.59 E-value=1.2e-07 Score=107.52 Aligned_cols=101 Identities=20% Similarity=0.197 Sum_probs=77.0
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
.||.||+++|.|.....|.. ++..|++ +..||++|.||+|.|.... ...+.+...+|+..|++.+..
T Consensus 85 ~g~~vvliHG~~~~~~~w~~--~~~~l~~--~~~v~~~D~~G~G~S~~~~-------~~~~~~~~a~~l~~~i~~~~~-- 151 (354)
T PLN02578 85 EGLPIVLIHGFGASAFHWRY--NIPELAK--KYKVYALDLLGFGWSDKAL-------IEYDAMVWRDQVADFVKEVVK-- 151 (354)
T ss_pred CCCeEEEECCCCCCHHHHHH--HHHHHhc--CCEEEEECCCCCCCCCCcc-------cccCHHHHHHHHHHHHHHhcc--
Confidence 34557888888776555543 3456665 5799999999999995321 124677777899999887642
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
.+++++|+|+||++|..++.+||+.|.+.|..++
T Consensus 152 -----~~~~lvG~S~Gg~ia~~~A~~~p~~v~~lvLv~~ 185 (354)
T PLN02578 152 -----EPAVLVGNSLGGFTALSTAVGYPELVAGVALLNS 185 (354)
T ss_pred -----CCeEEEEECHHHHHHHHHHHhChHhcceEEEECC
Confidence 3799999999999999999999999999886543
No 63
>PRK10566 esterase; Provisional
Probab=98.59 E-value=3.2e-07 Score=98.61 Aligned_cols=109 Identities=12% Similarity=0.039 Sum_probs=74.2
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccC--CCHHHHHhHHHHHHHHHh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQY--LSAEQALADLAYFITSMN 840 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~--lt~~q~l~D~~~~~~~~~ 840 (1042)
+..|+||+++|.+.....|. .+...|+++ |..|+++|.|++|.|.+... ...+.. -...++++|++.++.++.
T Consensus 25 ~~~p~vv~~HG~~~~~~~~~--~~~~~l~~~-G~~v~~~d~~g~G~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~ 99 (249)
T PRK10566 25 TPLPTVFFYHGFTSSKLVYS--YFAVALAQA-GFRVIMPDAPMHGARFSGDE--ARRLNHFWQILLQNMQEFPTLRAAIR 99 (249)
T ss_pred CCCCEEEEeCCCCcccchHH--HHHHHHHhC-CCEEEEecCCcccccCCCcc--ccchhhHHHHHHHHHHHHHHHHHHHH
Confidence 34689999988765433222 234456654 89999999999998754211 111111 123456788888888876
Q ss_pred hhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEE
Q psy3685 841 TLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGA 877 (1042)
Q Consensus 841 ~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~ 877 (1042)
..... +..+++++|+|+||.+|+++..++|++..++
T Consensus 100 ~~~~~-~~~~i~v~G~S~Gg~~al~~~~~~~~~~~~~ 135 (249)
T PRK10566 100 EEGWL-LDDRLAVGGASMGGMTALGIMARHPWVKCVA 135 (249)
T ss_pred hcCCc-CccceeEEeecccHHHHHHHHHhCCCeeEEE
Confidence 64322 2358999999999999999999999976544
No 64
>PRK07581 hypothetical protein; Validated
Probab=98.57 E-value=1.2e-07 Score=106.95 Aligned_cols=139 Identities=11% Similarity=0.055 Sum_probs=78.2
Q ss_pred cccCCCCCcccce-EEEEEecccc-c-CCCcEEEEECCCCCCcccccc-cccHHHHHHHcCCeEEEEeeeecccccCCCC
Q psy3685 739 LDHFNIEEPRTWK-QRYFVNSTWY-K-HGGPIFLMIGGEGEASAKWMV-EGTWLDYAHNHNALAVQVEHRFYGKSHPLSD 814 (1042)
Q Consensus 739 ~dhf~~~~~~tf~-qry~~~~~~~-~-~~~pi~~~~gg~g~~~~~~~~-~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~ 814 (1042)
|..|-.+++.+-. .|..+....- . .+.|+||+.||.+.....|.. -+....|+. .+..||++|.||||.|.+..+
T Consensus 12 ~~~~~~~~g~~~~~~~l~y~~~G~~~~~~~~~vll~~~~~~~~~~~~~~~~~~~~l~~-~~~~vi~~D~~G~G~S~~~~~ 90 (339)
T PRK07581 12 LGDVELQSGATLPDARLAYKTYGTLNAAKDNAILYPTWYSGTHQDNEWLIGPGRALDP-EKYFIIIPNMFGNGLSSSPSN 90 (339)
T ss_pred eCCeEecCCCCcCCceEEEEecCccCCCCCCEEEEeCCCCCCcccchhhccCCCccCc-CceEEEEecCCCCCCCCCCCC
Confidence 3455555544432 3433332211 1 234788888877643332211 000013332 267999999999999964322
Q ss_pred ----CCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCc-EEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 815 ----LSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTK-WIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 815 ----~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~-~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.+.+.+...+ ..+|++.....+....+. .+ ++++|+||||++|..+...||+.|.+.|..++..
T Consensus 91 ~~~~~~~~~~~~~~---~~~~~~~~~~~l~~~lgi---~~~~~lvG~S~GG~va~~~a~~~P~~V~~Lvli~~~~ 159 (339)
T PRK07581 91 TPAPFNAARFPHVT---IYDNVRAQHRLLTEKFGI---ERLALVVGWSMGAQQTYHWAVRYPDMVERAAPIAGTA 159 (339)
T ss_pred CCCCCCCCCCCcee---HHHHHHHHHHHHHHHhCC---CceEEEEEeCHHHHHHHHHHHHCHHHHhhheeeecCC
Confidence 2222222222 334444433323222222 36 5899999999999999999999999988665543
No 65
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=98.53 E-value=1.1e-07 Score=101.37 Aligned_cols=95 Identities=18% Similarity=0.106 Sum_probs=69.1
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCC
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSL 845 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~ 845 (1042)
|.||+++|.|.....|. .++..|++ +..|+++|+||+|.|.+.. ..+.+++++|+..++
T Consensus 5 ~~iv~~HG~~~~~~~~~--~~~~~l~~--~~~vi~~d~~G~G~s~~~~--------~~~~~~~~~~~~~~~--------- 63 (245)
T TIGR01738 5 VHLVLIHGWGMNAEVFR--CLDEELSA--HFTLHLVDLPGHGRSRGFG--------PLSLADAAEAIAAQA--------- 63 (245)
T ss_pred ceEEEEcCCCCchhhHH--HHHHhhcc--CeEEEEecCCcCccCCCCC--------CcCHHHHHHHHHHhC---------
Confidence 77888888776555443 33445554 5899999999999986532 235555555554332
Q ss_pred CCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccc
Q psy3685 846 PAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGP 883 (1042)
Q Consensus 846 ~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~ 883 (1042)
..+++++|+|+||.+|+.+..+||+.|.++|..++.
T Consensus 64 --~~~~~lvG~S~Gg~~a~~~a~~~p~~v~~~il~~~~ 99 (245)
T TIGR01738 64 --PDPAIWLGWSLGGLVALHIAATHPDRVRALVTVASS 99 (245)
T ss_pred --CCCeEEEEEcHHHHHHHHHHHHCHHhhheeeEecCC
Confidence 138999999999999999999999999998865443
No 66
>TIGR02821 fghA_ester_D S-formylglutathione hydrolase. This model describes a protein family from bacteria, yeast, and human, with a conserved critical role in formaldehyde detoxification as S-formylglutathione hydrolase (EC 3.1.2.12). Members in eukaryotes such as the human protein are better known as esterase D (EC 3.1.1.1), an enzyme with broad specificity, although S-formylglutathione hydrolase has now been demonstrated as well.
Probab=98.48 E-value=1.2e-06 Score=95.42 Aligned_cols=120 Identities=16% Similarity=0.157 Sum_probs=77.6
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEee--eecccccCC------------CCCCcccc--cCCCHH
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEH--RFYGKSHPL------------SDLSVESL--QYLSAE 826 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~eh--R~~G~S~p~------------~~~~~~~~--~~lt~~ 826 (1042)
...|+|++++|.+.....|...+.+..+|.+.|..||+.|. |+.|.+.-. -|.+...+ .+-..+
T Consensus 40 ~~~P~vvllHG~~~~~~~~~~~~~~~~la~~~g~~Vv~Pd~~~~g~~~~~~~~~w~~g~~~~~~~d~~~~~~~~~~~~~~ 119 (275)
T TIGR02821 40 GPVPVLWYLSGLTCTHENFMIKAGAQRFAAEHGLALVAPDTSPRGTGIAGEDDAWDFGKGAGFYVDATEEPWSQHYRMYS 119 (275)
T ss_pred CCCCEEEEccCCCCCccHHHhhhHHHHHHhhcCcEEEEeCCCCCcCCCCCCcccccccCCccccccCCcCcccccchHHH
Confidence 34799999999887666666555566788888999999997 555533100 00000000 011112
Q ss_pred HHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 827 QALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 827 q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
.+++|+..++ ...+... ..+++++|+|+||.+|+++..++|+.+.++++.|+....
T Consensus 120 ~~~~~l~~~~---~~~~~~~-~~~~~~~G~S~GG~~a~~~a~~~p~~~~~~~~~~~~~~~ 175 (275)
T TIGR02821 120 YIVQELPALV---AAQFPLD-GERQGITGHSMGGHGALVIALKNPDRFKSVSAFAPIVAP 175 (275)
T ss_pred HHHHHHHHHH---HhhCCCC-CCceEEEEEChhHHHHHHHHHhCcccceEEEEECCccCc
Confidence 2234444443 3334332 248999999999999999999999999999988776543
No 67
>TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily.
Probab=98.45 E-value=9.7e-07 Score=96.06 Aligned_cols=108 Identities=17% Similarity=0.074 Sum_probs=77.8
Q ss_pred CcEEEEECCCCCCcccccc-cccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 765 GPIFLMIGGEGEASAKWMV-EGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~-~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
.+|||+.||.+.....+.. ..+...|+++ |..++++|.||+|.|.+. ..+.++..+|+..+++.++...
T Consensus 27 ~~vv~i~gg~~~~~g~~~~~~~la~~l~~~-G~~v~~~Dl~G~G~S~~~---------~~~~~~~~~d~~~~~~~l~~~~ 96 (274)
T TIGR03100 27 TGVLIVVGGPQYRVGSHRQFVLLARRLAEA-GFPVLRFDYRGMGDSEGE---------NLGFEGIDADIAAAIDAFREAA 96 (274)
T ss_pred CeEEEEeCCccccCCchhHHHHHHHHHHHC-CCEEEEeCCCCCCCCCCC---------CCCHHHHHHHHHHHHHHHHhhC
Confidence 3577777876544332221 2233445554 899999999999998532 1356778899999999998764
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
. ...+++++|+|+||.+|+.++.. |+.|.|+|..|+++.
T Consensus 97 ~--g~~~i~l~G~S~Gg~~a~~~a~~-~~~v~~lil~~p~~~ 135 (274)
T TIGR03100 97 P--HLRRIVAWGLCDAASAALLYAPA-DLRVAGLVLLNPWVR 135 (274)
T ss_pred C--CCCcEEEEEECHHHHHHHHHhhh-CCCccEEEEECCccC
Confidence 2 22469999999999999988765 567999999987754
No 68
>TIGR01392 homoserO_Ac_trn homoserine O-acetyltransferase. This family describes homoserine-O-acetyltransferase, an enzyme of methionine biosynthesis. This model has been rebuilt to identify sequences more broadly, including a number of sequences suggested to be homoserine O-acetyltransferase based on proximity to other Met biosynthesis genes.
Probab=98.45 E-value=8.1e-07 Score=100.66 Aligned_cols=115 Identities=16% Similarity=0.159 Sum_probs=78.0
Q ss_pred CCcEEEEECCCCCCcccc-----cccccHHHHH---H---HcCCeEEEEeeee--cccccCCCCCCcc-----cccCCCH
Q psy3685 764 GGPIFLMIGGEGEASAKW-----MVEGTWLDYA---H---NHNALAVQVEHRF--YGKSHPLSDLSVE-----SLQYLSA 825 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~-----~~~~~~~~la---~---~~~a~~~~~ehR~--~G~S~p~~~~~~~-----~~~~lt~ 825 (1042)
++|+||+++|.+...... .+.|++..+. + .-+..||++|+|| +|.|.|.+..... +....|+
T Consensus 30 ~~~~vll~Hg~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~vi~~D~~G~~~g~s~~~~~~~~~~~~~~~~~~~~~ 109 (351)
T TIGR01392 30 RSNAVLVCHALTGDAHVAGYHDDGDPGWWDDLIGPGRAIDTDRYFVVCSNVLGGCYGSTGPSSINPGGRPYGSDFPLITI 109 (351)
T ss_pred CCCEEEEcCCcCcchhhcccCCCCCCCchhhccCCCCCcCCCceEEEEecCCCCCCCCCCCCCCCCCCCcCCCCCCCCcH
Confidence 467899999987633110 0123333332 1 2368999999999 6777653211111 1123578
Q ss_pred HHHHhHHHHHHHHHhhhcCCCCCCc-EEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 826 EQALADLAYFITSMNTLYSLPAHTK-WIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~~~~~~~~-~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
++.++|++.+++.+. . .+ ++++|+||||++|..+..+||+.|.++|..+++..
T Consensus 110 ~~~~~~~~~~~~~l~----~---~~~~~l~G~S~Gg~ia~~~a~~~p~~v~~lvl~~~~~~ 163 (351)
T TIGR01392 110 RDDVKAQKLLLDHLG----I---EQIAAVVGGSMGGMQALEWAIDYPERVRAIVVLATSAR 163 (351)
T ss_pred HHHHHHHHHHHHHcC----C---CCceEEEEECHHHHHHHHHHHHChHhhheEEEEccCCc
Confidence 888888888887652 2 25 99999999999999999999999999998776554
No 69
>PRK14875 acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional
Probab=98.45 E-value=1e-06 Score=100.98 Aligned_cols=104 Identities=18% Similarity=0.179 Sum_probs=77.2
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+++|.||+++|.|.....|.. +...|++ +..|+++|.|++|.|.+.. ...+.++..+|+..++..+.
T Consensus 129 ~~~~~vl~~HG~~~~~~~~~~--~~~~l~~--~~~v~~~d~~g~G~s~~~~-------~~~~~~~~~~~~~~~~~~~~-- 195 (371)
T PRK14875 129 GDGTPVVLIHGFGGDLNNWLF--NHAALAA--GRPVIALDLPGHGASSKAV-------GAGSLDELAAAVLAFLDALG-- 195 (371)
T ss_pred CCCCeEEEECCCCCccchHHH--HHHHHhc--CCEEEEEcCCCCCCCCCCC-------CCCCHHHHHHHHHHHHHhcC--
Confidence 446888889887765554442 2334554 4799999999999995432 13466777777777775432
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
..+++++|+|+||.+|..++.++|+.+.+.|+.+++.
T Consensus 196 -----~~~~~lvG~S~Gg~~a~~~a~~~~~~v~~lv~~~~~~ 232 (371)
T PRK14875 196 -----IERAHLVGHSMGGAVALRLAARAPQRVASLTLIAPAG 232 (371)
T ss_pred -----CccEEEEeechHHHHHHHHHHhCchheeEEEEECcCC
Confidence 1379999999999999999999999999999877653
No 70
>KOG4391|consensus
Probab=98.44 E-value=6.2e-07 Score=87.05 Aligned_cols=190 Identities=23% Similarity=0.243 Sum_probs=109.9
Q ss_pred EEEecccccCCCcEEEEECCC-CCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHH
Q psy3685 754 YFVNSTWYKHGGPIFLMIGGE-GEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADL 832 (1042)
Q Consensus 754 y~~~~~~~~~~~pi~~~~gg~-g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~ 832 (1042)
||+..+- +.|.+||.+|. |...-... .+..+-.+++.+|+.++.||||+|.-.+ |.+.| +| |.
T Consensus 70 ~~~~~E~---S~pTlLyfh~NAGNmGhr~~---i~~~fy~~l~mnv~ivsYRGYG~S~Gsp--sE~GL-~l-------Ds 133 (300)
T KOG4391|consen 70 YLMLSES---SRPTLLYFHANAGNMGHRLP---IARVFYVNLKMNVLIVSYRGYGKSEGSP--SEEGL-KL-------DS 133 (300)
T ss_pred eeecccC---CCceEEEEccCCCcccchhh---HHHHHHHHcCceEEEEEeeccccCCCCc--cccce-ec-------cH
Confidence 5555333 67889988554 44432211 2334455679999999999999994332 12221 22 33
Q ss_pred HHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEee----ccccccccCChhhHHHHHHHhhccChhh
Q psy3685 833 AYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSA----SGPLRAVVDFPEYFGVVADALATVSTEC 908 (1042)
Q Consensus 833 ~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~s----sa~~~~~~~~~~~~~~v~~~~~~~~~~c 908 (1042)
...+.++-..-.+ +..++|+||-|.||++|..++.++-+.+.|+|.- |-|-.++.-+.-|.. +. .+.-|
T Consensus 134 ~avldyl~t~~~~-dktkivlfGrSlGGAvai~lask~~~ri~~~ivENTF~SIp~~~i~~v~p~~~---k~---i~~lc 206 (300)
T KOG4391|consen 134 EAVLDYLMTRPDL-DKTKIVLFGRSLGGAVAIHLASKNSDRISAIIVENTFLSIPHMAIPLVFPFPM---KY---IPLLC 206 (300)
T ss_pred HHHHHHHhcCccC-CcceEEEEecccCCeeEEEeeccchhheeeeeeechhccchhhhhheeccchh---hH---HHHHH
Confidence 3445555443222 4469999999999999999999999999998843 222222222111111 11 13447
Q ss_pred HHHHHHHHHHHHH-----HhcCC-----chHHHHHHHhcCCCCCC----------CCChhcHHHHHHHHHHHhhhhhc
Q psy3685 909 VKAVQTATHTISK-----MLKSP-----SDAKYLTEQFKLCTPLD----------INNAKDVSSFVESLADNIAGVVQ 966 (1042)
Q Consensus 909 ~~~i~~~~~~~~~-----~~~~~-----~~~~~l~~~f~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~ 966 (1042)
.+++......|.+ ++.++ ..+..+.++|.+|+.-. -+|.--.+.++..|.+.++..++
T Consensus 207 ~kn~~~S~~ki~~~~~P~LFiSGlkDelVPP~~Mr~Ly~~c~S~~Krl~eFP~gtHNDT~i~dGYfq~i~dFlaE~~~ 284 (300)
T KOG4391|consen 207 YKNKWLSYRKIGQCRMPFLFISGLKDELVPPVMMRQLYELCPSRTKRLAEFPDGTHNDTWICDGYFQAIEDFLAEVVK 284 (300)
T ss_pred HHhhhcchhhhccccCceEEeecCccccCCcHHHHHHHHhCchhhhhheeCCCCccCceEEeccHHHHHHHHHHHhcc
Confidence 7777777666652 11122 22335677888887632 12222334466666666666544
No 71
>PLN02980 2-oxoglutarate decarboxylase/ hydro-lyase/ magnesium ion binding / thiamin pyrophosphate binding
Probab=98.43 E-value=5.4e-07 Score=119.86 Aligned_cols=121 Identities=17% Similarity=0.195 Sum_probs=87.5
Q ss_pred EEEEEeccccc--CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCc-ccccCCCHHHH
Q psy3685 752 QRYFVNSTWYK--HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSV-ESLQYLSAEQA 828 (1042)
Q Consensus 752 qry~~~~~~~~--~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~-~~~~~lt~~q~ 828 (1042)
-+||+....+. +++|+||+++|.+.....|.. ++..|++ +..||++|+||||.|...+.... ..-..++++..
T Consensus 1356 ~~~~i~~~~~G~~~~~~~vVllHG~~~s~~~w~~--~~~~L~~--~~rVi~~Dl~G~G~S~~~~~~~~~~~~~~~si~~~ 1431 (1655)
T PLN02980 1356 FSCLIKVHEVGQNAEGSVVLFLHGFLGTGEDWIP--IMKAISG--SARCISIDLPGHGGSKIQNHAKETQTEPTLSVELV 1431 (1655)
T ss_pred eEEEEEEEecCCCCCCCeEEEECCCCCCHHHHHH--HHHHHhC--CCEEEEEcCCCCCCCCCccccccccccccCCHHHH
Confidence 46777655453 246889999988776665542 3445554 47999999999999964322111 11124578888
Q ss_pred HhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccc
Q psy3685 829 LADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGP 883 (1042)
Q Consensus 829 l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~ 883 (1042)
.+|++.+++.+.. .+++++|+||||++|..+..+||+.|.++|..++.
T Consensus 1432 a~~l~~ll~~l~~-------~~v~LvGhSmGG~iAl~~A~~~P~~V~~lVlis~~ 1479 (1655)
T PLN02980 1432 ADLLYKLIEHITP-------GKVTLVGYSMGARIALYMALRFSDKIEGAVIISGS 1479 (1655)
T ss_pred HHHHHHHHHHhCC-------CCEEEEEECHHHHHHHHHHHhChHhhCEEEEECCC
Confidence 8888888876431 38999999999999999999999999999876653
No 72
>TIGR03230 lipo_lipase lipoprotein lipase. Members of this protein family are lipoprotein lipase (EC 3.1.1.34), a eukaryotic triacylglycerol lipase active in plasma and similar to pancreatic and hepatic triacylglycerol lipases (EC 3.1.1.3). It is also called clearing factor. It cleaves chylomicron and VLDL triacylglycerols; it also has phospholipase A-1 activity.
Probab=98.43 E-value=1.7e-06 Score=97.68 Aligned_cols=114 Identities=16% Similarity=0.126 Sum_probs=80.7
Q ss_pred cccccCCCcEEEEECCCCCCcccccccccHHHHHHH-----cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHH
Q psy3685 758 STWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHN-----HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADL 832 (1042)
Q Consensus 758 ~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~-----~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~ 832 (1042)
+..+++.+|.+|+++|.+.... + ..+..++++. -++.||++|.|++|.|..... . . .+..+-+|+
T Consensus 34 ~s~Fn~~~ptvIlIHG~~~s~~-~--~~w~~~l~~al~~~~~d~nVI~VDw~g~g~s~y~~a--~---~--~t~~vg~~l 103 (442)
T TIGR03230 34 DCNFNHETKTFIVIHGWTVTGM-F--ESWVPKLVAALYEREPSANVIVVDWLSRAQQHYPTS--A---A--YTKLVGKDV 103 (442)
T ss_pred hcCcCCCCCeEEEECCCCcCCc-c--hhhHHHHHHHHHhccCCCEEEEEECCCcCCCCCccc--c---c--cHHHHHHHH
Confidence 3457788999999999875321 1 1122234332 268999999999998732211 1 1 235666889
Q ss_pred HHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 833 AYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 833 ~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
+.||+.+.....++ -.++.++|+|+||.+|..+...+|+.|.+.++..+
T Consensus 104 a~lI~~L~~~~gl~-l~~VhLIGHSLGAhIAg~ag~~~p~rV~rItgLDP 152 (442)
T TIGR03230 104 AKFVNWMQEEFNYP-WDNVHLLGYSLGAHVAGIAGSLTKHKVNRITGLDP 152 (442)
T ss_pred HHHHHHHHHhhCCC-CCcEEEEEECHHHHHHHHHHHhCCcceeEEEEEcC
Confidence 99999887654432 24899999999999999999999999888776553
No 73
>PLN02442 S-formylglutathione hydrolase
Probab=98.39 E-value=2.1e-06 Score=93.74 Aligned_cols=144 Identities=11% Similarity=0.074 Sum_probs=84.4
Q ss_pred hcccCCCCCcccceEEEEEecccc-cCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCC---
Q psy3685 738 KLDHFNIEEPRTWKQRYFVNSTWY-KHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLS--- 813 (1042)
Q Consensus 738 ~~dhf~~~~~~tf~qry~~~~~~~-~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~--- 813 (1042)
.+.++.+.-++...-+.|+=+ .. ..+-|||++++|.+.....|...+-+..++...|..||+.|-.+.|.-.+.+
T Consensus 20 ~~~~~s~~l~~~~~~~vy~P~-~~~~~~~Pvv~~lHG~~~~~~~~~~~~~~~~~~~~~g~~Vv~pd~~~~g~~~~~~~~~ 98 (283)
T PLN02442 20 RYKHFSSTLGCSMTFSVYFPP-ASDSGKVPVLYWLSGLTCTDENFIQKSGAQRAAAARGIALVAPDTSPRGLNVEGEADS 98 (283)
T ss_pred EEEEeccccCCceEEEEEcCC-cccCCCCCEEEEecCCCcChHHHHHhhhHHHHHhhcCeEEEecCCCCCCCCCCCCccc
Confidence 444555444444443333333 22 2356899999887654444544333456666779999999976555111100
Q ss_pred -----------CCCccccc-CCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeec
Q psy3685 814 -----------DLSVESLQ-YLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSAS 881 (1042)
Q Consensus 814 -----------~~~~~~~~-~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ss 881 (1042)
+...+.++ +-..+..++++...+...-... +..+++++|+|+||.+|+++..+||+.+.++++.|
T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~---~~~~~~i~G~S~GG~~a~~~a~~~p~~~~~~~~~~ 175 (283)
T PLN02442 99 WDFGVGAGFYLNATQEKWKNWRMYDYVVKELPKLLSDNFDQL---DTSRASIFGHSMGGHGALTIYLKNPDKYKSVSAFA 175 (283)
T ss_pred cccCCCcceeeccccCCCcccchhhhHHHHHHHHHHHHHHhc---CCCceEEEEEChhHHHHHHHHHhCchhEEEEEEEC
Confidence 00111110 0112234455555554432222 22479999999999999999999999999988888
Q ss_pred cccc
Q psy3685 882 GPLR 885 (1042)
Q Consensus 882 a~~~ 885 (1042)
++..
T Consensus 176 ~~~~ 179 (283)
T PLN02442 176 PIAN 179 (283)
T ss_pred CccC
Confidence 7654
No 74
>PF10503 Esterase_phd: Esterase PHB depolymerase
Probab=98.38 E-value=2e-06 Score=88.37 Aligned_cols=118 Identities=18% Similarity=0.234 Sum_probs=79.5
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
.-|+||++||-+.....+....-+.+||++.|+.|++.|.-.-....-.-+... ....-...+ .+.++.+|+++...+
T Consensus 15 ~~PLVv~LHG~~~~a~~~~~~s~~~~lAd~~GfivvyP~~~~~~~~~~cw~w~~-~~~~~g~~d-~~~i~~lv~~v~~~~ 92 (220)
T PF10503_consen 15 PVPLVVVLHGCGQSAEDFAAGSGWNALADREGFIVVYPEQSRRANPQGCWNWFS-DDQQRGGGD-VAFIAALVDYVAARY 92 (220)
T ss_pred CCCEEEEeCCCCCCHHHHHhhcCHHHHhhcCCeEEEcccccccCCCCCcccccc-cccccCccc-hhhHHHHHHhHhhhc
Confidence 358999999987665544444456789999999999998542211110000000 000111111 345777788888888
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
..+. .+|.+.|.|-||++|..+...|||+|.|+...|++.
T Consensus 93 ~iD~-~RVyv~G~S~Gg~ma~~la~~~pd~faa~a~~sG~~ 132 (220)
T PF10503_consen 93 NIDP-SRVYVTGLSNGGMMANVLACAYPDLFAAVAVVSGVP 132 (220)
T ss_pred ccCC-CceeeEEECHHHHHHHHHHHhCCccceEEEeecccc
Confidence 7655 589999999999999999999999999866666554
No 75
>PF00561 Abhydrolase_1: alpha/beta hydrolase fold A web page of Esterases and alpha/beta hydrolases.; InterPro: IPR000073 The alpha/beta hydrolase fold [] is common to a number of hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is an alpha/beta-sheet (rather than a barrel), containing 8 strands connected by helices []. The enzymes are believed to have diverged from a common ancestor, preserving the arrangement of the catalytic residues. All have a catalytic triad, the elements of which are borne on loops, which are the best conserved structural features of the fold. Esterase (EST) from Pseudomonas putida is a member of the alpha/beta hydrolase fold superfamily of enzymes []. In most of the family members the beta-strands are parallels, but some have an inversion of the first strands, which gives it an antiparallel orientation. The catalytic triad residues are presented on loops. One of these is the nucleophile elbow and is the most conserved feature of the fold. Some other members lack one or all of the catalytic residues. Some members are therefore inactive but others are involved in surface recognition. The ESTHER database [] gathers and annotates all the published information related to gene and protein sequences of this superfamily []. This entry represents fold-1 of alpha/beta hydrolase.; PDB: 2VAT_E 2VAX_C 2VAV_H 2PSJ_A 2PSH_B 2PSE_A 2PSF_A 2PSD_A 2EDA_A 1CIJ_A ....
Probab=98.38 E-value=7.2e-07 Score=94.31 Aligned_cols=77 Identities=23% Similarity=0.237 Sum_probs=60.2
Q ss_pred eEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEE
Q psy3685 797 LAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHG 876 (1042)
Q Consensus 797 ~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g 876 (1042)
.||.++.||+|.|.|-. ...+...+.+...+|+..+++.+.. .+|+++|+||||++++.+...||+.|.+
T Consensus 2 ~vi~~d~rG~g~S~~~~---~~~~~~~~~~~~~~~~~~~~~~l~~-------~~~~~vG~S~Gg~~~~~~a~~~p~~v~~ 71 (230)
T PF00561_consen 2 DVILFDLRGFGYSSPHW---DPDFPDYTTDDLAADLEALREALGI-------KKINLVGHSMGGMLALEYAAQYPERVKK 71 (230)
T ss_dssp EEEEEECTTSTTSSSCC---GSGSCTHCHHHHHHHHHHHHHHHTT-------SSEEEEEETHHHHHHHHHHHHSGGGEEE
T ss_pred EEEEEeCCCCCCCCCCc---cCCcccccHHHHHHHHHHHHHHhCC-------CCeEEEEECCChHHHHHHHHHCchhhcC
Confidence 58999999999998720 1123445666666666666664432 3699999999999999999999999999
Q ss_pred EEeeccc
Q psy3685 877 AMSASGP 883 (1042)
Q Consensus 877 ~i~ssa~ 883 (1042)
.|+.+++
T Consensus 72 lvl~~~~ 78 (230)
T PF00561_consen 72 LVLISPP 78 (230)
T ss_dssp EEEESES
T ss_pred cEEEeee
Confidence 9999987
No 76
>PRK05077 frsA fermentation/respiration switch protein; Reviewed
Probab=98.35 E-value=2.5e-06 Score=97.87 Aligned_cols=109 Identities=16% Similarity=0.145 Sum_probs=72.8
Q ss_pred CCcEEEEECCCCCCc-ccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 764 GGPIFLMIGGEGEAS-AKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~-~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
..|+||+.||.++.. ..|. .+...|++. |..|+.+|.||+|.|...+ ++ . ..+... ..+++++...
T Consensus 193 ~~P~Vli~gG~~~~~~~~~~--~~~~~La~~-Gy~vl~~D~pG~G~s~~~~-~~-~-----d~~~~~---~avld~l~~~ 259 (414)
T PRK05077 193 PFPTVLVCGGLDSLQTDYYR--LFRDYLAPR-GIAMLTIDMPSVGFSSKWK-LT-Q-----DSSLLH---QAVLNALPNV 259 (414)
T ss_pred CccEEEEeCCcccchhhhHH--HHHHHHHhC-CCEEEEECCCCCCCCCCCC-cc-c-----cHHHHH---HHHHHHHHhC
Confidence 468999999876542 2222 123345554 9999999999999995431 11 0 011111 2334444332
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
-.. +..+++++|+|+||.+|..++..+|+.|.|+|+.++++..
T Consensus 260 ~~v-d~~ri~l~G~S~GG~~Al~~A~~~p~ri~a~V~~~~~~~~ 302 (414)
T PRK05077 260 PWV-DHTRVAAFGFRFGANVAVRLAYLEPPRLKAVACLGPVVHT 302 (414)
T ss_pred ccc-CcccEEEEEEChHHHHHHHHHHhCCcCceEEEEECCccch
Confidence 111 2358999999999999999999999999999999988753
No 77
>cd00707 Pancreat_lipase_like Pancreatic lipase-like enzymes. Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation," the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site.
Probab=98.34 E-value=1.3e-06 Score=94.50 Aligned_cols=114 Identities=12% Similarity=0.111 Sum_probs=78.7
Q ss_pred ccccCCCcEEEEECCCCCCc-ccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHH
Q psy3685 759 TWYKHGGPIFLMIGGEGEAS-AKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFIT 837 (1042)
Q Consensus 759 ~~~~~~~pi~~~~gg~g~~~-~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~ 837 (1042)
..+++++|++|+++|.+... ..|.. .+...+..+.+..||++|.++++.+. +. ....+++.+.+|++.+++
T Consensus 30 ~~f~~~~p~vilIHG~~~~~~~~~~~-~l~~~ll~~~~~nVi~vD~~~~~~~~-y~------~a~~~~~~v~~~la~~l~ 101 (275)
T cd00707 30 SNFNPSRPTRFIIHGWTSSGEESWIS-DLRKAYLSRGDYNVIVVDWGRGANPN-YP------QAVNNTRVVGAELAKFLD 101 (275)
T ss_pred cCCCCCCCcEEEEcCCCCCCCCcHHH-HHHHHHHhcCCCEEEEEECccccccC-hH------HHHHhHHHHHHHHHHHHH
Confidence 34567889999999976543 33322 11222333447899999999873321 10 011245566678899999
Q ss_pred HHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeec
Q psy3685 838 SMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSAS 881 (1042)
Q Consensus 838 ~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ss 881 (1042)
.+......+ ..+++++|+|+||.+|.++...+|+.|.+.|+-.
T Consensus 102 ~L~~~~g~~-~~~i~lIGhSlGa~vAg~~a~~~~~~v~~iv~LD 144 (275)
T cd00707 102 FLVDNTGLS-LENVHLIGHSLGAHVAGFAGKRLNGKLGRITGLD 144 (275)
T ss_pred HHHHhcCCC-hHHEEEEEecHHHHHHHHHHHHhcCccceeEEec
Confidence 887764332 2479999999999999999999999888888764
No 78
>COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]
Probab=98.30 E-value=1.2e-06 Score=106.18 Aligned_cols=120 Identities=19% Similarity=0.214 Sum_probs=82.7
Q ss_pred cEEEEE-CCCCCCcccccccccHHHHHHHcCCeEEEEeee---ecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 766 PIFLMI-GGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR---FYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 766 pi~~~~-gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR---~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
|+||++ ||+..... +.....+..||.+ |++|++...| |||+.... .-...+... .++|+...++.+.
T Consensus 395 P~i~~~hGGP~~~~~-~~~~~~~q~~~~~-G~~V~~~n~RGS~GyG~~F~~--~~~~~~g~~----~~~D~~~~~~~l~- 465 (620)
T COG1506 395 PLIVYIHGGPSAQVG-YSFNPEIQVLASA-GYAVLAPNYRGSTGYGREFAD--AIRGDWGGV----DLEDLIAAVDALV- 465 (620)
T ss_pred CEEEEeCCCCccccc-cccchhhHHHhcC-CeEEEEeCCCCCCccHHHHHH--hhhhccCCc----cHHHHHHHHHHHH-
Confidence 899999 66533333 4444556667776 9999999999 66665321 112223333 4566666666443
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccccCChhhHH
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVVDFPEYFG 895 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~~~~~~~~ 895 (1042)
+.+..+..++.++|+||||-+++|...+.| .+.++|+..+++.....+..+..
T Consensus 466 ~~~~~d~~ri~i~G~SyGGymtl~~~~~~~-~f~a~~~~~~~~~~~~~~~~~~~ 518 (620)
T COG1506 466 KLPLVDPERIGITGGSYGGYMTLLAATKTP-RFKAAVAVAGGVDWLLYFGESTE 518 (620)
T ss_pred hCCCcChHHeEEeccChHHHHHHHHHhcCc-hhheEEeccCcchhhhhccccch
Confidence 334434458999999999999999999999 89999999988877766555444
No 79
>TIGR00976 /NonD putative hydrolase, CocE/NonD family. This model represents a protein subfamily that includes the cocaine esterase CocE, several glutaryl-7-ACA acylases, and the putative diester hydrolase NonD of Streptomyces griseus (all hydrolases). This family shows extensive, low-level similarity to a family of xaa-pro dipeptidyl-peptidases, and local similarity by PSI-BLAST to many other hydrolases.
Probab=98.30 E-value=2e-06 Score=103.15 Aligned_cols=111 Identities=14% Similarity=0.081 Sum_probs=82.3
Q ss_pred CCcEEEEECCCCCCccc-c-cccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 764 GGPIFLMIGGEGEASAK-W-MVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~-~-~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
..|+||+++|.|..... + ........|+.+ |+.||.+|.||+|.|.-. +..++ .+.++|+..+|+++..
T Consensus 21 ~~P~Il~~~gyg~~~~~~~~~~~~~~~~l~~~-Gy~vv~~D~RG~g~S~g~-------~~~~~-~~~~~D~~~~i~~l~~ 91 (550)
T TIGR00976 21 PVPVILSRTPYGKDAGLRWGLDKTEPAWFVAQ-GYAVVIQDTRGRGASEGE-------FDLLG-SDEAADGYDLVDWIAK 91 (550)
T ss_pred CCCEEEEecCCCCchhhccccccccHHHHHhC-CcEEEEEeccccccCCCc-------eEecC-cccchHHHHHHHHHHh
Confidence 36899999988754320 1 111234456665 999999999999999421 12223 5678999999999876
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
+-- .+.++.++|+||||.+|.+++..+|..+.|+|+.++...
T Consensus 92 q~~--~~~~v~~~G~S~GG~~a~~~a~~~~~~l~aiv~~~~~~d 133 (550)
T TIGR00976 92 QPW--CDGNVGMLGVSYLAVTQLLAAVLQPPALRAIAPQEGVWD 133 (550)
T ss_pred CCC--CCCcEEEEEeChHHHHHHHHhccCCCceeEEeecCcccc
Confidence 521 235899999999999999999999999999998776654
No 80
>KOG2382|consensus
Probab=98.30 E-value=6.9e-06 Score=86.92 Aligned_cols=124 Identities=18% Similarity=0.235 Sum_probs=89.6
Q ss_pred CCcEEEEECCC-CCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 764 GGPIFLMIGGE-GEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 764 ~~pi~~~~gg~-g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
.+|-+++++|- |+. ..|. .+-..|+..+++.++++|.|.+|.|--. .-.+.+.+.+|+..||...+..
T Consensus 51 ~~Pp~i~lHGl~GS~-~Nw~--sv~k~Ls~~l~~~v~~vd~RnHG~Sp~~--------~~h~~~~ma~dv~~Fi~~v~~~ 119 (315)
T KOG2382|consen 51 RAPPAIILHGLLGSK-ENWR--SVAKNLSRKLGRDVYAVDVRNHGSSPKI--------TVHNYEAMAEDVKLFIDGVGGS 119 (315)
T ss_pred CCCceEEecccccCC-CCHH--HHHHHhcccccCceEEEecccCCCCccc--------cccCHHHHHHHHHHHHHHcccc
Confidence 35656666665 444 3344 3566899999999999999999998332 2334677889999999988754
Q ss_pred cCCCCCCcEEEEccchhH-HHHHHHHHhCCCeEEEEE-eeccccccccCChhhHHHHHHHh
Q psy3685 843 YSLPAHTKWIAFGGSYPG-ALAAWLRYKYPHLVHGAM-SASGPLRAVVDFPEYFGVVADAL 901 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg-~la~~~~~~yP~~~~g~i-~ssa~~~~~~~~~~~~~~v~~~~ 901 (1042)
+ ...|++++|||+|| .+++.+..++|+.+--+| --.+|......+.++.+.+....
T Consensus 120 ~---~~~~~~l~GHsmGG~~~~m~~t~~~p~~~~rliv~D~sP~~~~~~~~e~~e~i~~m~ 177 (315)
T KOG2382|consen 120 T---RLDPVVLLGHSMGGVKVAMAETLKKPDLIERLIVEDISPGGVGRSYGEYRELIKAMI 177 (315)
T ss_pred c---ccCCceecccCcchHHHHHHHHHhcCcccceeEEEecCCccCCcccchHHHHHHHHH
Confidence 3 23589999999999 888999999999977555 44555544445566666665544
No 81
>TIGR01836 PHA_synth_III_C poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit. This model represents the PhaC subunit of a heterodimeric form of polyhydroxyalkanoic acid (PHA) synthase. Excepting the PhaC of Bacillus megaterium (which needs PhaR), all members require PhaE (TIGR01834) for activity and are designated class III. This enzyme builds ester polymers for carbon and energy storage that accumulate in inclusions, and both this enzyme and the depolymerase associate with the inclusions. Class III enzymes polymerize short-chain-length hydroxyalkanoates.
Probab=98.29 E-value=3.9e-06 Score=95.00 Aligned_cols=108 Identities=8% Similarity=0.095 Sum_probs=78.0
Q ss_pred CcEEEEECCCCCCcccc---cccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHH-hHHHHHHHHHh
Q psy3685 765 GPIFLMIGGEGEASAKW---MVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQAL-ADLAYFITSMN 840 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~---~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l-~D~~~~~~~~~ 840 (1042)
.| ||+++|.......+ ....++..|+++ |..|+.+|.|++|.|.. ..+.+..+ +|+..+++++.
T Consensus 63 ~p-vl~v~~~~~~~~~~d~~~~~~~~~~L~~~-G~~V~~~D~~g~g~s~~----------~~~~~d~~~~~~~~~v~~l~ 130 (350)
T TIGR01836 63 TP-LLIVYALVNRPYMLDLQEDRSLVRGLLER-GQDVYLIDWGYPDRADR----------YLTLDDYINGYIDKCVDYIC 130 (350)
T ss_pred Cc-EEEeccccccceeccCCCCchHHHHHHHC-CCeEEEEeCCCCCHHHh----------cCCHHHHHHHHHHHHHHHHH
Confidence 45 67776653221111 123455556664 89999999999987632 23555554 45788888887
Q ss_pred hhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 841 TLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 841 ~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
...+ ..+++++|+|+||++++.+...+|+.|.+.|+.++|+...
T Consensus 131 ~~~~---~~~i~lvGhS~GG~i~~~~~~~~~~~v~~lv~~~~p~~~~ 174 (350)
T TIGR01836 131 RTSK---LDQISLLGICQGGTFSLCYAALYPDKIKNLVTMVTPVDFE 174 (350)
T ss_pred HHhC---CCcccEEEECHHHHHHHHHHHhCchheeeEEEeccccccC
Confidence 7653 3589999999999999999999999999999999988643
No 82
>PRK00175 metX homoserine O-acetyltransferase; Provisional
Probab=98.26 E-value=5e-06 Score=95.05 Aligned_cols=114 Identities=16% Similarity=0.115 Sum_probs=76.4
Q ss_pred CcEEEEECCCCCCccccc-------ccccHHHHHH------HcCCeEEEEeeeec-ccc-cCCCCCCc--c----cccCC
Q psy3685 765 GPIFLMIGGEGEASAKWM-------VEGTWLDYAH------NHNALAVQVEHRFY-GKS-HPLSDLSV--E----SLQYL 823 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~-------~~~~~~~la~------~~~a~~~~~ehR~~-G~S-~p~~~~~~--~----~~~~l 823 (1042)
+|.||+++|.+.....|. ..|++..+.. .-+..||++|.|+. |.| .|.+.... . +..-.
T Consensus 48 ~p~vvl~HG~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~vi~~Dl~G~~~~s~~~~~~~~~~~~~~~~~~~~~ 127 (379)
T PRK00175 48 SNAVLICHALTGDHHVAGPHSPDDPKPGWWDNMVGPGKPIDTDRYFVICSNVLGGCKGSTGPSSINPDTGKPYGSDFPVI 127 (379)
T ss_pred CCEEEEeCCcCCchhhcccccccCCCCcchhhccCCCCccCccceEEEeccCCCCCCCCCCCCCCCCCCCCcccCCCCcC
Confidence 688999988876543221 1133333321 12679999999983 444 44321100 0 11235
Q ss_pred CHHHHHhHHHHHHHHHhhhcCCCCCCc-EEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 824 SAEQALADLAYFITSMNTLYSLPAHTK-WIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 824 t~~q~l~D~~~~~~~~~~~~~~~~~~~-~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
|++...+|+..|++.+.. .+ .+++|+||||++|..+..+||+.|.+.|..++...
T Consensus 128 ~~~~~~~~~~~~l~~l~~-------~~~~~lvG~S~Gg~ia~~~a~~~p~~v~~lvl~~~~~~ 183 (379)
T PRK00175 128 TIRDWVRAQARLLDALGI-------TRLAAVVGGSMGGMQALEWAIDYPDRVRSALVIASSAR 183 (379)
T ss_pred CHHHHHHHHHHHHHHhCC-------CCceEEEEECHHHHHHHHHHHhChHhhhEEEEECCCcc
Confidence 888888899888887543 25 58999999999999999999999999997776553
No 83
>KOG1454|consensus
Probab=98.26 E-value=2.4e-06 Score=94.29 Aligned_cols=110 Identities=20% Similarity=0.217 Sum_probs=82.0
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
++|-||+++|.|+....|.. .+..|++..|..|+++|..|+|.|.+.+..+. | .+.|.+..|..+-.++
T Consensus 57 ~~~pvlllHGF~~~~~~w~~--~~~~L~~~~~~~v~aiDl~G~g~~s~~~~~~~----y-----~~~~~v~~i~~~~~~~ 125 (326)
T KOG1454|consen 57 DKPPVLLLHGFGASSFSWRR--VVPLLSKAKGLRVLAIDLPGHGYSSPLPRGPL----Y-----TLRELVELIRRFVKEV 125 (326)
T ss_pred CCCcEEEeccccCCcccHhh--hccccccccceEEEEEecCCCCcCCCCCCCCc----e-----ehhHHHHHHHHHHHhh
Confidence 57889999999987666664 36678888788999999999997665543222 1 2345555555544444
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEE---eeccccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAM---SASGPLRAV 887 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i---~ssa~~~~~ 887 (1042)
. ..|++++|+||||.+|..++..||+.|.+.| ...+++...
T Consensus 126 ~---~~~~~lvghS~Gg~va~~~Aa~~P~~V~~lv~~~~~~~~~~~~ 169 (326)
T KOG1454|consen 126 F---VEPVSLVGHSLGGIVALKAAAYYPETVDSLVLLDLLGPPVYST 169 (326)
T ss_pred c---CcceEEEEeCcHHHHHHHHHHhCcccccceeeecccccccccC
Confidence 2 2479999999999999999999999999999 666666543
No 84
>PHA00616 hypothetical protein
Probab=98.26 E-value=2.6e-07 Score=65.97 Aligned_cols=33 Identities=21% Similarity=0.350 Sum_probs=31.2
Q ss_pred cccCCCCCCccCCcchHHHHHhhccCCCCcccC
Q psy3685 682 RYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDIN 714 (1042)
Q Consensus 682 ~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~ 714 (1042)
||+|+.||+.|.++++|.+|++.|+|++|+.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 799999999999999999999999999998775
No 85
>PRK13604 luxD acyl transferase; Provisional
Probab=98.25 E-value=4.5e-06 Score=89.43 Aligned_cols=106 Identities=12% Similarity=0.057 Sum_probs=74.9
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeec-ccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFY-GKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~-G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
..|++++.+|.+..... +..+...|+++ |..|+.+|.|+. |+|.. ++...|+.-.+.|+...|++++..
T Consensus 36 ~~~~vIi~HGf~~~~~~--~~~~A~~La~~-G~~vLrfD~rg~~GeS~G-------~~~~~t~s~g~~Dl~aaid~lk~~ 105 (307)
T PRK13604 36 KNNTILIASGFARRMDH--FAGLAEYLSSN-GFHVIRYDSLHHVGLSSG-------TIDEFTMSIGKNSLLTVVDWLNTR 105 (307)
T ss_pred CCCEEEEeCCCCCChHH--HHHHHHHHHHC-CCEEEEecCCCCCCCCCC-------ccccCcccccHHHHHHHHHHHHhc
Confidence 34788889999875332 33344455554 999999999986 99822 222333333579999999999875
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
. ..+++++|+|+||++|...+...| +.+.|+.|+...
T Consensus 106 ~----~~~I~LiG~SmGgava~~~A~~~~--v~~lI~~sp~~~ 142 (307)
T PRK13604 106 G----INNLGLIAASLSARIAYEVINEID--LSFLITAVGVVN 142 (307)
T ss_pred C----CCceEEEEECHHHHHHHHHhcCCC--CCEEEEcCCccc
Confidence 2 247999999999999866666544 777887776554
No 86
>PTZ00472 serine carboxypeptidase (CBP1); Provisional
Probab=98.24 E-value=8e-06 Score=94.70 Aligned_cols=127 Identities=19% Similarity=0.215 Sum_probs=82.8
Q ss_pred CCCcEEEEE-CCCCCCccc--ccccc----------c-HHHHHHHcCCeEEEEee-eecccccCCCCCCcccccCCCHHH
Q psy3685 763 HGGPIFLMI-GGEGEASAK--WMVEG----------T-WLDYAHNHNALAVQVEH-RFYGKSHPLSDLSVESLQYLSAEQ 827 (1042)
Q Consensus 763 ~~~pi~~~~-gg~g~~~~~--~~~~~----------~-~~~la~~~~a~~~~~eh-R~~G~S~p~~~~~~~~~~~lt~~q 827 (1042)
.+.|++|++ ||+|..+.. +...| + ....+....|.||++|+ +|.|.|..... + ...+.++
T Consensus 75 ~~~Pl~lwlnGGPG~ss~~G~f~E~GP~~i~~~~~~~~~n~~sW~~~~~~l~iDqP~G~G~S~~~~~----~-~~~~~~~ 149 (462)
T PTZ00472 75 PEAPVLLWMTGGPGCSSMFALLAENGPCLMNETTGDIYNNTYSWNNEAYVIYVDQPAGVGFSYADKA----D-YDHNESE 149 (462)
T ss_pred CCCCEEEEECCCCcHHHHHhhhccCCCeEEeCCCCceeECCcccccccCeEEEeCCCCcCcccCCCC----C-CCCChHH
Confidence 467999999 999865421 11111 1 12334445689999996 79999975321 1 1246799
Q ss_pred HHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhC----------CCeEEEEEeeccccccccCChhhH
Q psy3685 828 ALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKY----------PHLVHGAMSASGPLRAVVDFPEYF 894 (1042)
Q Consensus 828 ~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~y----------P~~~~g~i~ssa~~~~~~~~~~~~ 894 (1042)
+.+|+.+|++.+-..+....+.|++++|.||||..+..++.+- +=.+.|.+...+.+....++..|.
T Consensus 150 ~a~d~~~~l~~f~~~~p~~~~~~~~i~GeSygG~y~p~~a~~i~~~n~~~~~~~inLkGi~IGNg~~dp~~q~~~~~ 226 (462)
T PTZ00472 150 VSEDMYNFLQAFFGSHEDLRANDLFVVGESYGGHYAPATAYRINMGNKKGDGLYINLAGLAVGNGLTDPYTQYASYP 226 (462)
T ss_pred HHHHHHHHHHHHHHhCccccCCCEEEEeecchhhhHHHHHHHHHhhccccCCceeeeEEEEEeccccChhhhcccHH
Confidence 9999999999887776544457999999999998775555331 112567666666665554444333
No 87
>KOG1552|consensus
Probab=98.19 E-value=4.6e-06 Score=84.97 Aligned_cols=104 Identities=17% Similarity=0.185 Sum_probs=83.1
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcC
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYS 844 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~ 844 (1042)
.+++||.+|.....+ ....++..|+..++.+|+.+|.+|||.|.-.+. .. ...+|+....++++..++
T Consensus 60 ~~~lly~hGNa~Dlg--q~~~~~~~l~~~ln~nv~~~DYSGyG~S~G~ps--E~--------n~y~Di~avye~Lr~~~g 127 (258)
T KOG1552|consen 60 HPTLLYSHGNAADLG--QMVELFKELSIFLNCNVVSYDYSGYGRSSGKPS--ER--------NLYADIKAVYEWLRNRYG 127 (258)
T ss_pred ceEEEEcCCcccchH--HHHHHHHHHhhcccceEEEEecccccccCCCcc--cc--------cchhhHHHHHHHHHhhcC
Confidence 479999988733222 333456778888899999999999999954331 22 367999999999999986
Q ss_pred CCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 845 LPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 845 ~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
++.++|++|.|.|...+..++.++| ++|.|.-|+-.
T Consensus 128 --~~~~Iil~G~SiGt~~tv~Lasr~~--~~alVL~SPf~ 163 (258)
T KOG1552|consen 128 --SPERIILYGQSIGTVPTVDLASRYP--LAAVVLHSPFT 163 (258)
T ss_pred --CCceEEEEEecCCchhhhhHhhcCC--cceEEEeccch
Confidence 3469999999999999999999999 89999888544
No 88
>PRK05855 short chain dehydrogenase; Validated
Probab=98.17 E-value=3e-06 Score=103.61 Aligned_cols=106 Identities=14% Similarity=0.094 Sum_probs=75.7
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+++|+||+++|.+.....|.. ++..|++ +..||++|+||||.|.+.. .....|.+..++|+..+++.+..
T Consensus 23 ~~~~~ivllHG~~~~~~~w~~--~~~~L~~--~~~Vi~~D~~G~G~S~~~~-----~~~~~~~~~~a~dl~~~i~~l~~- 92 (582)
T PRK05855 23 PDRPTVVLVHGYPDNHEVWDG--VAPLLAD--RFRVVAYDVRGAGRSSAPK-----RTAAYTLARLADDFAAVIDAVSP- 92 (582)
T ss_pred CCCCeEEEEcCCCchHHHHHH--HHHHhhc--ceEEEEecCCCCCCCCCCC-----cccccCHHHHHHHHHHHHHHhCC-
Confidence 457899999888766555543 3445543 6899999999999996432 11246889999999999987642
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHh--CCCeEEEEEeeccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYK--YPHLVHGAMSASGP 883 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~--yP~~~~g~i~ssa~ 883 (1042)
..|++++|+||||++|..+... +|..+.+.++.++|
T Consensus 93 -----~~~~~lvGhS~Gg~~a~~~a~~~~~~~~v~~~~~~~~~ 130 (582)
T PRK05855 93 -----DRPVHLLAHDWGSIQGWEAVTRPRAAGRIASFTSVSGP 130 (582)
T ss_pred -----CCcEEEEecChHHHHHHHHHhCccchhhhhhheeccCC
Confidence 1379999999999888665544 56666655555544
No 89
>PLN02872 triacylglycerol lipase
Probab=98.11 E-value=2.8e-06 Score=96.28 Aligned_cols=115 Identities=21% Similarity=0.115 Sum_probs=77.6
Q ss_pred CCcEEEEECCCCCCccccccc----ccHHHHHHHcCCeEEEEeeeecccccCCCCCCcc---cccCCCHHHHHhHHHHHH
Q psy3685 764 GGPIFLMIGGEGEASAKWMVE----GTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVE---SLQYLSAEQALADLAYFI 836 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~----~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~---~~~~lt~~q~l~D~~~~~ 836 (1042)
++|+||+++|-+..+..|... ++...||++ |..|+..+.||+|.|..-.+++.. -|.+---+.|+.|+.++|
T Consensus 73 ~~~~Vll~HGl~~ss~~w~~~~~~~sla~~La~~-GydV~l~n~RG~~~s~gh~~~~~~~~~fw~~s~~e~a~~Dl~a~i 151 (395)
T PLN02872 73 RGPPVLLQHGLFMAGDAWFLNSPEQSLGFILADH-GFDVWVGNVRGTRWSYGHVTLSEKDKEFWDWSWQELALYDLAEMI 151 (395)
T ss_pred CCCeEEEeCcccccccceeecCcccchHHHHHhC-CCCcccccccccccccCCCCCCccchhccCCcHHHHHHHHHHHHH
Confidence 367888888876665656432 334456765 899999999998877432222222 233333356668999999
Q ss_pred HHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC---eEEEEEeecccc
Q psy3685 837 TSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH---LVHGAMSASGPL 884 (1042)
Q Consensus 837 ~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~---~~~g~i~ssa~~ 884 (1042)
+++...- ..+++++|+|+||+++. ....+|+ .|.++++.++++
T Consensus 152 d~i~~~~----~~~v~~VGhS~Gg~~~~-~~~~~p~~~~~v~~~~~l~P~~ 197 (395)
T PLN02872 152 HYVYSIT----NSKIFIVGHSQGTIMSL-AALTQPNVVEMVEAAALLCPIS 197 (395)
T ss_pred HHHHhcc----CCceEEEEECHHHHHHH-HHhhChHHHHHHHHHHHhcchh
Confidence 9987532 24899999999999886 4557887 455666555443
No 90
>KOG1146|consensus
Probab=98.11 E-value=7.7e-06 Score=99.20 Aligned_cols=121 Identities=12% Similarity=0.111 Sum_probs=70.7
Q ss_pred cccCcCCcccCCchhhhhHhhhhcCC-----cccCCccccccCCchhHHHHhhhcCCCcccCCCCCcccccCCCCccccc
Q psy3685 559 YKCDLCSKEFSRKDNLTEHKEIHQGI-----RYKCDLCSKKFSRKLNLTKHMKIHQGIRYKCDLCSKEFSRKDIPYEVTV 633 (1042)
Q Consensus 559 ~~C~~C~k~F~~~~~L~~H~~~H~~~-----~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~l~~~~~ 633 (1042)
..|..|...|.....+..|-+---.+ -+.|..|++.|.....+. |+-. ..+|.|..|...|.....|....
T Consensus 1229 l~~~~~e~~f~~~~~~~~~a~~~~~~~~~sGe~~c~~~~~~~~~~~~~~-~l~~--~~~~~~~~~~~~~~~~~~l~~~~- 1304 (1406)
T KOG1146|consen 1229 LLPNALEQPFPQEPEPTATAPPKPPELPASGEGECGAVDELLTPSFGIS-TLDV--THRYLCRQCKMAFDGEAPLTAHQ- 1304 (1406)
T ss_pred ccHHhhhcCccCcccccccCCCCCCcCcCCCcchhhhccccccCcccee-eccc--chhHHHHHHHhhhcchhHHHHHH-
Confidence 44666777777666665553321111 266777777777666665 4422 12577777777776554443311
Q ss_pred CCcCcccCCccccccCCCcccccccccccccccccccccccccccCCccccCCCCCCccCCcchHHHHHhhccCC
Q psy3685 634 KEESYEGDSINVEENNGNDAVTRDCLEFFKTEQNTTLGKLNKILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGI 708 (1042)
Q Consensus 634 ~~~~~~c~~~~~~~~~~~~~~~~~C~~~f~~~~~~~l~~h~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~ 708 (1042)
+.|.+.++. ...+.-|+..+.--++| |..|...|+....|.+||++-.++
T Consensus 1305 ----------------------~k~~~~~~~--~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~~ 1354 (1406)
T KOG1146|consen 1305 ----------------------RKFCFAGRG--SGGSMPPPLRVPDCTYH-CLACEVLLSGREALQIHMRSSAHR 1354 (1406)
T ss_pred ----------------------HHHHhccCc--cccCCCCcccCcccccc-chHHHhhcchhHHHHHHHHHhhhc
Confidence 022233332 23333344444444567 999999999999999999984333
No 91
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.10 E-value=4e-07 Score=58.55 Aligned_cols=24 Identities=33% Similarity=0.650 Sum_probs=11.5
Q ss_pred cccccccccCCccccCCCCCCccC
Q psy3685 670 LGKLNKILHQDIRYKCDLCSKDFS 693 (1042)
Q Consensus 670 l~~h~~~h~~~~~~~C~~C~k~F~ 693 (1042)
|.+|+++|++++||.|++|+++|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 344444444444444444444443
No 92
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.09 E-value=1.4e-06 Score=56.04 Aligned_cols=24 Identities=54% Similarity=1.082 Sum_probs=15.4
Q ss_pred hHHHHhhhcCCC-cccCCCCCcccc
Q psy3685 600 NLTKHMKIHQGI-RYKCDLCSKEFS 623 (1042)
Q Consensus 600 ~L~~H~~~H~~~-~~~C~~C~k~F~ 623 (1042)
+|++||++|+|+ ||+|++|+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 356666666665 666666666664
No 93
>PRK11460 putative hydrolase; Provisional
Probab=98.09 E-value=2.4e-05 Score=82.65 Aligned_cols=117 Identities=13% Similarity=0.019 Sum_probs=72.4
Q ss_pred cCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCC--------CCCcccccCCCHHHHHhHHH
Q psy3685 762 KHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLS--------DLSVESLQYLSAEQALADLA 833 (1042)
Q Consensus 762 ~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~--------~~~~~~~~~lt~~q~l~D~~ 833 (1042)
.+..|+||+++|.|.....|.. +...|++.. ..+.++..|+++.+.+.. ..+.++ ..-.++.+++++.
T Consensus 13 ~~~~~~vIlLHG~G~~~~~~~~--l~~~l~~~~-~~~~~i~~~g~~~~~~~~g~~W~~~~~~~~~~-~~~~~~~~~~~l~ 88 (232)
T PRK11460 13 KPAQQLLLLFHGVGDNPVAMGE--IGSWFAPAF-PDALVVSVGGPEPSGNGAGRQWFSVQGITEDN-RQARVAAIMPTFI 88 (232)
T ss_pred CCCCcEEEEEeCCCCChHHHHH--HHHHHHHHC-CCCEEECCCCCCCcCCCCCcccccCCCCCccc-hHHHHHHHHHHHH
Confidence 3457899999999887655442 344565542 223344445543221110 011111 1113445566666
Q ss_pred HHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccc
Q psy3685 834 YFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGP 883 (1042)
Q Consensus 834 ~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~ 883 (1042)
++++++...+..+ ..+++++|+|+||++|+++..++|+.+.++|+.|+.
T Consensus 89 ~~i~~~~~~~~~~-~~~i~l~GfS~Gg~~al~~a~~~~~~~~~vv~~sg~ 137 (232)
T PRK11460 89 ETVRYWQQQSGVG-ASATALIGFSQGAIMALEAVKAEPGLAGRVIAFSGR 137 (232)
T ss_pred HHHHHHHHhcCCC-hhhEEEEEECHHHHHHHHHHHhCCCcceEEEEeccc
Confidence 6677666665443 348999999999999999999999998888877754
No 94
>TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae
Probab=98.08 E-value=1e-05 Score=97.19 Aligned_cols=103 Identities=17% Similarity=0.176 Sum_probs=73.0
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCC--C----CcccccCC----------CHHH
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSD--L----SVESLQYL----------SAEQ 827 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~--~----~~~~~~~l----------t~~q 827 (1042)
+.|+|++++|.+.....|. .+...|+++ |..||++|+|+||+|....+ . +...+.|+ ..+|
T Consensus 448 g~P~VVllHG~~g~~~~~~--~lA~~La~~-Gy~VIaiDlpGHG~S~~~~~~~~~~a~~~~~~~y~Nl~~l~~aRDn~rQ 524 (792)
T TIGR03502 448 GWPVVIYQHGITGAKENAL--AFAGTLAAA-GVATIAIDHPLHGARSFDANASGVNATNANVLAYMNLASLLVARDNLRQ 524 (792)
T ss_pred CCcEEEEeCCCCCCHHHHH--HHHHHHHhC-CcEEEEeCCCCCCccccccccccccccccCccceeccccccccccCHHH
Confidence 3479999999887665444 234455554 78999999999999943211 1 11223342 5699
Q ss_pred HHhHHHHHHHHHh------hhc---CCCCCCcEEEEccchhHHHHHHHHHh
Q psy3685 828 ALADLAYFITSMN------TLY---SLPAHTKWIAFGGSYPGALAAWLRYK 869 (1042)
Q Consensus 828 ~l~D~~~~~~~~~------~~~---~~~~~~~~i~~G~Sygg~la~~~~~~ 869 (1042)
++.|+..++..++ ..+ +..+..||+++|||+||.+++.+...
T Consensus 525 ~v~Dll~L~~~l~~~~~~~~~~~~~~~~~~~~V~~lGHSLGgiig~~~~~~ 575 (792)
T TIGR03502 525 SILDLLGLRLSLNGSALAGAPLSGINVIDGSKVSFLGHSLGGIVGTSFIAY 575 (792)
T ss_pred HHHHHHHHHHHHhcccccccccccccCCCCCcEEEEecCHHHHHHHHHHHh
Confidence 9999999999988 221 11234699999999999999998854
No 95
>PF12146 Hydrolase_4: Putative lysophospholipase; InterPro: IPR022742 This domain is found in bacteria and eukaryotes and is approximately 110 amino acids in length. Many members are annotated as being lysophospholipases, and others as alpha-beta hydrolase fold-containing proteins.
Probab=98.06 E-value=5e-06 Score=70.51 Aligned_cols=69 Identities=22% Similarity=0.220 Sum_probs=53.2
Q ss_pred ccccCCC---cEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCC-CHHHHHhHHHH
Q psy3685 759 TWYKHGG---PIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYL-SAEQALADLAY 834 (1042)
Q Consensus 759 ~~~~~~~---pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~l-t~~q~l~D~~~ 834 (1042)
+.|.|.+ .||++++|.++.+. .+..+...|+++ |..|+++||||+|+|.+ ...++ +.++.++|+..
T Consensus 7 ~~w~p~~~~k~~v~i~HG~~eh~~--ry~~~a~~L~~~-G~~V~~~D~rGhG~S~g-------~rg~~~~~~~~v~D~~~ 76 (79)
T PF12146_consen 7 RRWKPENPPKAVVVIVHGFGEHSG--RYAHLAEFLAEQ-GYAVFAYDHRGHGRSEG-------KRGHIDSFDDYVDDLHQ 76 (79)
T ss_pred EEecCCCCCCEEEEEeCCcHHHHH--HHHHHHHHHHhC-CCEEEEECCCcCCCCCC-------cccccCCHHHHHHHHHH
Confidence 3465544 49999999988766 454567777775 99999999999999953 33444 78999999999
Q ss_pred HHH
Q psy3685 835 FIT 837 (1042)
Q Consensus 835 ~~~ 837 (1042)
|++
T Consensus 77 ~~~ 79 (79)
T PF12146_consen 77 FIQ 79 (79)
T ss_pred HhC
Confidence 874
No 96
>COG1647 Esterase/lipase [General function prediction only]
Probab=98.03 E-value=6.2e-05 Score=74.48 Aligned_cols=111 Identities=15% Similarity=0.111 Sum_probs=80.9
Q ss_pred cCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 762 KHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 762 ~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
.+|+-.+|+++|..+....... +...|.+. |..+.+.-.||+|-. .+.+-.-|.+.=+.|+..-.+.++.
T Consensus 12 ~~G~~AVLllHGFTGt~~Dvr~--Lgr~L~e~-GyTv~aP~ypGHG~~-------~e~fl~t~~~DW~~~v~d~Y~~L~~ 81 (243)
T COG1647 12 EGGNRAVLLLHGFTGTPRDVRM--LGRYLNEN-GYTVYAPRYPGHGTL-------PEDFLKTTPRDWWEDVEDGYRDLKE 81 (243)
T ss_pred ccCCEEEEEEeccCCCcHHHHH--HHHHHHHC-CceEecCCCCCCCCC-------HHHHhcCCHHHHHHHHHHHHHHHHH
Confidence 4556778888887443322221 22234444 999999999999965 2334444666677777777777763
Q ss_pred h-cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccccC
Q psy3685 842 L-YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVVD 889 (1042)
Q Consensus 842 ~-~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~~ 889 (1042)
. | ..|.+.|-|+||.+|+|++..|| +.++|..|||+.+..+
T Consensus 82 ~gy-----~eI~v~GlSmGGv~alkla~~~p--~K~iv~m~a~~~~k~~ 123 (243)
T COG1647 82 AGY-----DEIAVVGLSMGGVFALKLAYHYP--PKKIVPMCAPVNVKSW 123 (243)
T ss_pred cCC-----CeEEEEeecchhHHHHHHHhhCC--ccceeeecCCcccccc
Confidence 3 3 36999999999999999999999 9999999999987655
No 97
>PF12695 Abhydrolase_5: Alpha/beta hydrolase family; PDB: 3D0K_B 2I3D_B 3DOH_B 3DOI_B 3PFB_A 3S2Z_B 3PFC_A 3QM1_A 3PF8_B 3PF9_A ....
Probab=98.01 E-value=1.5e-05 Score=77.36 Aligned_cols=93 Identities=18% Similarity=0.190 Sum_probs=66.0
Q ss_pred EEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCC
Q psy3685 767 IFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLP 846 (1042)
Q Consensus 767 i~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~ 846 (1042)
+||+++|.|.....| ..+...|++. |..++.+|.|+.|.+.-. .++..+++.+...+.
T Consensus 1 ~vv~~HG~~~~~~~~--~~~~~~l~~~-G~~v~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~-- 58 (145)
T PF12695_consen 1 VVVLLHGWGGSRRDY--QPLAEALAEQ-GYAVVAFDYPGHGDSDGA-----------------DAVERVLADIRAGYP-- 58 (145)
T ss_dssp EEEEECTTTTTTHHH--HHHHHHHHHT-TEEEEEESCTTSTTSHHS-----------------HHHHHHHHHHHHHHC--
T ss_pred CEEEECCCCCCHHHH--HHHHHHHHHC-CCEEEEEecCCCCccchh-----------------HHHHHHHHHHHhhcC--
Confidence 467777766544433 3456677777 999999999999987221 144444444323222
Q ss_pred CCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
+..+++++|+|+||.+|+++..++ ..+.++|+.++
T Consensus 59 ~~~~i~l~G~S~Gg~~a~~~~~~~-~~v~~~v~~~~ 93 (145)
T PF12695_consen 59 DPDRIILIGHSMGGAIAANLAARN-PRVKAVVLLSP 93 (145)
T ss_dssp TCCEEEEEEETHHHHHHHHHHHHS-TTESEEEEESE
T ss_pred CCCcEEEEEEccCcHHHHHHhhhc-cceeEEEEecC
Confidence 235899999999999999999999 66888888876
No 98
>PRK11071 esterase YqiA; Provisional
Probab=98.01 E-value=1.4e-05 Score=81.42 Aligned_cols=82 Identities=15% Similarity=0.067 Sum_probs=57.6
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHH-cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcC
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHN-HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYS 844 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~-~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~ 844 (1042)
|.||+++|.++....|....+...+++. .+..++++|.|++| +++.+++..+++.+ .
T Consensus 2 p~illlHGf~ss~~~~~~~~~~~~l~~~~~~~~v~~~dl~g~~------------------~~~~~~l~~l~~~~----~ 59 (190)
T PRK11071 2 STLLYLHGFNSSPRSAKATLLKNWLAQHHPDIEMIVPQLPPYP------------------ADAAELLESLVLEH----G 59 (190)
T ss_pred CeEEEECCCCCCcchHHHHHHHHHHHHhCCCCeEEeCCCCCCH------------------HHHHHHHHHHHHHc----C
Confidence 6799999987765555432222233332 36789999999875 23556666666542 1
Q ss_pred CCCCCcEEEEccchhHHHHHHHHHhCCC
Q psy3685 845 LPAHTKWIAFGGSYPGALAAWLRYKYPH 872 (1042)
Q Consensus 845 ~~~~~~~i~~G~Sygg~la~~~~~~yP~ 872 (1042)
..+++++|+|+||.+|.+++.++|.
T Consensus 60 ---~~~~~lvG~S~Gg~~a~~~a~~~~~ 84 (190)
T PRK11071 60 ---GDPLGLVGSSLGGYYATWLSQCFML 84 (190)
T ss_pred ---CCCeEEEEECHHHHHHHHHHHHcCC
Confidence 2479999999999999999999994
No 99
>PLN00021 chlorophyllase
Probab=98.00 E-value=2.2e-05 Score=86.14 Aligned_cols=103 Identities=15% Similarity=0.085 Sum_probs=64.8
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
...|+||+++|.+.....| ..+...||.. |..||+.|+++++.+.. .. .+.|..+++.++...
T Consensus 50 g~~PvVv~lHG~~~~~~~y--~~l~~~Las~-G~~VvapD~~g~~~~~~-----~~---------~i~d~~~~~~~l~~~ 112 (313)
T PLN00021 50 GTYPVLLFLHGYLLYNSFY--SQLLQHIASH-GFIVVAPQLYTLAGPDG-----TD---------EIKDAAAVINWLSSG 112 (313)
T ss_pred CCCCEEEEECCCCCCcccH--HHHHHHHHhC-CCEEEEecCCCcCCCCc-----hh---------hHHHHHHHHHHHHhh
Confidence 3569999999887654422 2345566665 89999999887543211 00 123333333333321
Q ss_pred c-------CCCCCCcEEEEccchhHHHHHHHHHhCCCe-----EEEEEeecc
Q psy3685 843 Y-------SLPAHTKWIAFGGSYPGALAAWLRYKYPHL-----VHGAMSASG 882 (1042)
Q Consensus 843 ~-------~~~~~~~~i~~G~Sygg~la~~~~~~yP~~-----~~g~i~ssa 882 (1042)
. ...+-.+++++|+|+||.+|..+..++|+. |.+.|+..+
T Consensus 113 l~~~l~~~~~~d~~~v~l~GHS~GG~iA~~lA~~~~~~~~~~~v~ali~ldP 164 (313)
T PLN00021 113 LAAVLPEGVRPDLSKLALAGHSRGGKTAFALALGKAAVSLPLKFSALIGLDP 164 (313)
T ss_pred hhhhcccccccChhheEEEEECcchHHHHHHHhhccccccccceeeEEeecc
Confidence 1 001224799999999999999999999964 566666543
No 100
>PHA00732 hypothetical protein
Probab=97.97 E-value=4.7e-06 Score=69.75 Aligned_cols=45 Identities=27% Similarity=0.569 Sum_probs=21.8
Q ss_pred cccCcCCcccCCchhhhhHhhh-hcCCcccCCccccccCCchhHHHHhhhc
Q psy3685 559 YKCDLCSKEFSRKDNLTEHKEI-HQGIRYKCDLCSKKFSRKLNLTKHMKIH 608 (1042)
Q Consensus 559 ~~C~~C~k~F~~~~~L~~H~~~-H~~~~~~C~~C~k~F~~~~~L~~H~~~H 608 (1042)
|.|+.||++|.+...|+.|++. |++ +.|+.||++|. .|..|++++
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~~--~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHTL--TKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccCC--CccCCCCCEeC---Chhhhhccc
Confidence 4455555555555555555542 332 34555555554 345555443
No 101
>PHA00616 hypothetical protein
Probab=97.95 E-value=2.6e-06 Score=60.97 Aligned_cols=33 Identities=18% Similarity=0.411 Sum_probs=24.6
Q ss_pred cccCCccccccCCchhHHHHhhhcCCC-cccCCC
Q psy3685 585 RYKCDLCSKKFSRKLNLTKHMKIHQGI-RYKCDL 617 (1042)
Q Consensus 585 ~~~C~~C~k~F~~~~~L~~H~~~H~~~-~~~C~~ 617 (1042)
||+|+.||+.|..+++|..|++.|||+ |+.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 577777777777777777777777776 666654
No 102
>COG0596 MhpC Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
Probab=97.93 E-value=2.6e-05 Score=83.61 Aligned_cols=103 Identities=21% Similarity=0.242 Sum_probs=70.7
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcC-CeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHN-ALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~-a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
+|.+++++|.+.....|.. ....++.... ..++++|.|++|.|. .. ..+.....+|+..|+..+..
T Consensus 21 ~~~i~~~hg~~~~~~~~~~--~~~~~~~~~~~~~~~~~d~~g~g~s~--~~-------~~~~~~~~~~~~~~~~~~~~-- 87 (282)
T COG0596 21 GPPLVLLHGFPGSSSVWRP--VFKVLPALAARYRVIAPDLRGHGRSD--PA-------GYSLSAYADDLAALLDALGL-- 87 (282)
T ss_pred CCeEEEeCCCCCchhhhHH--HHHHhhccccceEEEEecccCCCCCC--cc-------cccHHHHHHHHHHHHHHhCC--
Confidence 4567777666554444443 1122333211 799999999999996 10 11222236777777764432
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.+++++|+||||.++..+..++|+.+.++|..++++.
T Consensus 88 -----~~~~l~G~S~Gg~~~~~~~~~~p~~~~~~v~~~~~~~ 124 (282)
T COG0596 88 -----EKVVLVGHSMGGAVALALALRHPDRVRGLVLIGPAPP 124 (282)
T ss_pred -----CceEEEEecccHHHHHHHHHhcchhhheeeEecCCCC
Confidence 2599999999999999999999999999998887764
No 103
>PF00326 Peptidase_S9: Prolyl oligopeptidase family This family belongs to family S9 of the peptidase classification.; InterPro: IPR001375 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain covers the active site serine of the serine peptidases belonging to MEROPS peptidase family S9 (prolyl oligopeptidase family, clan SC). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Examples of protein families containing this domain are: Prolyl endopeptidase (3.4.21.26 from EC) (PE) (also called post-proline cleaving enzyme). PE is an enzyme that cleaves peptide bonds on the C-terminal side of prolyl residues. The sequence of PE has been obtained from a mammalian species (pig) and from bacteria (Flavobacterium meningosepticum and Aeromonas hydrophila); there is a high degree of sequence conservation between these sequences. Escherichia coli protease II (3.4.21.83 from EC) (oligopeptidase B) (gene prtB) which cleaves peptide bonds on the C-terminal side of lysyl and argininyl residues. Dipeptidyl peptidase IV (3.4.14.5 from EC) (DPP IV). DPP IV is an enzyme that removes N-terminal dipeptides sequentially from polypeptides having unsubstituted N-termini provided that the penultimate residue is proline. Saccharomyces cerevisiae (Baker's yeast) vacuolar dipeptidyl aminopeptidases A and B (DPAP A and DPAP B), encoded by the STE13 and DAP2 genes respectively. DPAP A is responsible for the proteolytic maturation of the alpha-factor precursor. Acylamino-acid-releasing enzyme (3.4.19.1 from EC) (acyl-peptide hydrolase). This enzyme catalyses the hydrolysis of the amino-terminal peptide bond of an N-acetylated protein to generate a N-acetylated amino acid and a protein with a free amino-terminus. These proteins belong to MEROPS peptidase families S9A, S9B and S9C.; GO: 0008236 serine-type peptidase activity, 0006508 proteolysis; PDB: 2AJ8_D 1ORV_D 2AJB_C 2BUC_D 1ORW_D 2AJC_D 2AJD_C 2BUA_A 2HU8_B 3O4J_B ....
Probab=97.93 E-value=7.8e-06 Score=85.46 Aligned_cols=94 Identities=17% Similarity=0.096 Sum_probs=67.0
Q ss_pred HHHHcCCeEEEEeeeecccccC-CCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH
Q psy3685 790 YAHNHNALAVQVEHRFYGKSHP-LSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY 868 (1042)
Q Consensus 790 la~~~~a~~~~~ehR~~G~S~p-~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~ 868 (1042)
||.+ |+.|+.++.|+.|.... + ......-.-...++|+...++++...... +..++.++|+||||.+|+++..
T Consensus 10 la~~-Gy~v~~~~~rGs~g~g~~~----~~~~~~~~~~~~~~D~~~~i~~l~~~~~i-D~~ri~i~G~S~GG~~a~~~~~ 83 (213)
T PF00326_consen 10 LASQ-GYAVLVPNYRGSGGYGKDF----HEAGRGDWGQADVDDVVAAIEYLIKQYYI-DPDRIGIMGHSYGGYLALLAAT 83 (213)
T ss_dssp HHTT-T-EEEEEE-TTSSSSHHHH----HHTTTTGTTHHHHHHHHHHHHHHHHTTSE-EEEEEEEEEETHHHHHHHHHHH
T ss_pred HHhC-CEEEEEEcCCCCCccchhH----HHhhhccccccchhhHHHHHHHHhccccc-cceeEEEEcccccccccchhhc
Confidence 4444 99999999998653211 0 01112223456789999999999877633 2358999999999999999999
Q ss_pred hCCCeEEEEEeeccccccccC
Q psy3685 869 KYPHLVHGAMSASGPLRAVVD 889 (1042)
Q Consensus 869 ~yP~~~~g~i~ssa~~~~~~~ 889 (1042)
.+|+.+.++|+.++++.....
T Consensus 84 ~~~~~f~a~v~~~g~~d~~~~ 104 (213)
T PF00326_consen 84 QHPDRFKAAVAGAGVSDLFSY 104 (213)
T ss_dssp HTCCGSSEEEEESE-SSTTCS
T ss_pred ccceeeeeeeccceecchhcc
Confidence 999999999998877755433
No 104
>KOG2281|consensus
Probab=97.91 E-value=2e-05 Score=88.52 Aligned_cols=118 Identities=14% Similarity=0.122 Sum_probs=77.0
Q ss_pred CcEEEEE-CCCCCC--cccccccc-c-HHHHHHHcCCeEEEEeeeecccc-cCCCCCCcccccCCCHHHHHhHHHHHHHH
Q psy3685 765 GPIFLMI-GGEGEA--SAKWMVEG-T-WLDYAHNHNALAVQVEHRFYGKS-HPLSDLSVESLQYLSAEQALADLAYFITS 838 (1042)
Q Consensus 765 ~pi~~~~-gg~g~~--~~~~~~~~-~-~~~la~~~~a~~~~~ehR~~G~S-~p~~~~~~~~~~~lt~~q~l~D~~~~~~~ 838 (1042)
=|+|+++ ||++.- ...|...- + +..||. +|..||.+|-||--.- .-|+.-=..++++.. ++|-++=++.
T Consensus 642 Yptvl~VYGGP~VQlVnnsfkgi~ylR~~~Las-lGy~Vv~IDnRGS~hRGlkFE~~ik~kmGqVE----~eDQVeglq~ 716 (867)
T KOG2281|consen 642 YPTVLNVYGGPGVQLVNNSFKGIQYLRFCRLAS-LGYVVVFIDNRGSAHRGLKFESHIKKKMGQVE----VEDQVEGLQM 716 (867)
T ss_pred CceEEEEcCCCceEEeeccccceehhhhhhhhh-cceEEEEEcCCCccccchhhHHHHhhccCeee----ehhhHHHHHH
Confidence 4788888 998753 22222211 1 223454 5999999999984221 111111112344443 4555555566
Q ss_pred HhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 839 MNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 839 ~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
+...+++.+-.+|.+-|.||||-||+.+..+||+.+..|||.+++..+.
T Consensus 717 Laeq~gfidmdrV~vhGWSYGGYLSlm~L~~~P~IfrvAIAGapVT~W~ 765 (867)
T KOG2281|consen 717 LAEQTGFIDMDRVGVHGWSYGGYLSLMGLAQYPNIFRVAIAGAPVTDWR 765 (867)
T ss_pred HHHhcCcccchheeEeccccccHHHHHHhhcCcceeeEEeccCcceeee
Confidence 6666655444589999999999999999999999999999887666554
No 105
>PHA00732 hypothetical protein
Probab=97.90 E-value=7.6e-06 Score=68.53 Aligned_cols=48 Identities=25% Similarity=0.490 Sum_probs=29.4
Q ss_pred ccccCccCcccCCchhHHHHHHH-hcCcccccCcCCcccCCchhhhhHhhhhcC
Q psy3685 531 RYKCDLCSKEFSRKDNLTEHKEI-HQGIRYKCDLCSKEFSRKDNLTEHKEIHQG 583 (1042)
Q Consensus 531 ~~~C~~C~k~f~~~~~L~~H~~~-H~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ 583 (1042)
||.|+.|++.|.+..+|+.|++. |+ ++.|+.||++|. .|..|++++.+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~--~~~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT--LTKCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC--CCccCCCCCEeC---ChhhhhcccCC
Confidence 45666666666666666666663 43 346777777665 46666655544
No 106
>PRK10162 acetyl esterase; Provisional
Probab=97.86 E-value=5e-05 Score=84.44 Aligned_cols=107 Identities=12% Similarity=0.048 Sum_probs=70.7
Q ss_pred CCcEEEEECCCCCCccc-ccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh-
Q psy3685 764 GGPIFLMIGGEGEASAK-WMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT- 841 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~-~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~- 841 (1042)
+.|+|||++|-|-..+. .....+...||...|+.||.+|.|--.+. |+ ..+++|+...++++..
T Consensus 80 ~~p~vv~~HGGg~~~g~~~~~~~~~~~la~~~g~~Vv~vdYrlape~-~~-------------p~~~~D~~~a~~~l~~~ 145 (318)
T PRK10162 80 SQATLFYLHGGGFILGNLDTHDRIMRLLASYSGCTVIGIDYTLSPEA-RF-------------PQAIEEIVAVCCYFHQH 145 (318)
T ss_pred CCCEEEEEeCCcccCCCchhhhHHHHHHHHHcCCEEEEecCCCCCCC-CC-------------CCcHHHHHHHHHHHHHh
Confidence 46899999664422221 11223466788888999999998853321 11 1256777776666643
Q ss_pred --hcCCCCCCcEEEEccchhHHHHHHHHHhC------CCeEEEEEeeccccc
Q psy3685 842 --LYSLPAHTKWIAFGGSYPGALAAWLRYKY------PHLVHGAMSASGPLR 885 (1042)
Q Consensus 842 --~~~~~~~~~~i~~G~Sygg~la~~~~~~y------P~~~~g~i~ssa~~~ 885 (1042)
+++.+ ..+++++|.|.||.||+++.+.. |..+.|.|..++.+.
T Consensus 146 ~~~~~~d-~~~i~l~G~SaGG~la~~~a~~~~~~~~~~~~~~~~vl~~p~~~ 196 (318)
T PRK10162 146 AEDYGIN-MSRIGFAGDSAGAMLALASALWLRDKQIDCGKVAGVLLWYGLYG 196 (318)
T ss_pred HHHhCCC-hhHEEEEEECHHHHHHHHHHHHHHhcCCCccChhheEEECCccC
Confidence 34432 24899999999999999988765 356777777766554
No 107
>KOG2984|consensus
Probab=97.86 E-value=2.7e-05 Score=75.11 Aligned_cols=102 Identities=20% Similarity=0.193 Sum_probs=68.8
Q ss_pred EEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCC
Q psy3685 767 IFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLP 846 (1042)
Q Consensus 767 i~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~ 846 (1042)
|+++-|--|+....|.. .+..+-+.+...||+||.||||.|.|.+ +..-++-...|..+-+.-++..
T Consensus 45 iLlipGalGs~~tDf~p--ql~~l~k~l~~TivawDPpGYG~SrPP~-------Rkf~~~ff~~Da~~avdLM~aL---- 111 (277)
T KOG2984|consen 45 ILLIPGALGSYKTDFPP--QLLSLFKPLQVTIVAWDPPGYGTSRPPE-------RKFEVQFFMKDAEYAVDLMEAL---- 111 (277)
T ss_pred eEecccccccccccCCH--HHHhcCCCCceEEEEECCCCCCCCCCCc-------ccchHHHHHHhHHHHHHHHHHh----
Confidence 55555666766555543 2445555556789999999999999964 2333444445544444433332
Q ss_pred CCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeec
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSAS 881 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ss 881 (1042)
+-.|+-++|.|-||..|...+.|||+.|.-.|.-+
T Consensus 112 k~~~fsvlGWSdGgiTalivAak~~e~v~rmiiwg 146 (277)
T KOG2984|consen 112 KLEPFSVLGWSDGGITALIVAAKGKEKVNRMIIWG 146 (277)
T ss_pred CCCCeeEeeecCCCeEEEEeeccChhhhhhheeec
Confidence 11489999999999999999999999998544333
No 108
>PF05677 DUF818: Chlamydia CHLPS protein (DUF818); InterPro: IPR008536 This family of unknown function includes several Chlamydia CHLPS proteins and Legionella SidB proteins.
Probab=97.83 E-value=8.1e-05 Score=78.92 Aligned_cols=97 Identities=19% Similarity=0.117 Sum_probs=73.5
Q ss_pred CcEEEEECCCCCCccc-cc---ccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHh
Q psy3685 765 GPIFLMIGGEGEASAK-WM---VEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMN 840 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~-~~---~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~ 840 (1042)
+-++|+..|.|..-.. .. ....+..+|++.+|+|+.+-.||+|.|. +..|.+..+.|....+++++
T Consensus 137 ~RWiL~s~GNg~~~E~~~~~~~~~~~~~~~ak~~~aNvl~fNYpGVg~S~----------G~~s~~dLv~~~~a~v~yL~ 206 (365)
T PF05677_consen 137 QRWILVSNGNGECYENRAMLDYKDDWIQRFAKELGANVLVFNYPGVGSST----------GPPSRKDLVKDYQACVRYLR 206 (365)
T ss_pred CcEEEEEcCChHHhhhhhhhccccHHHHHHHHHcCCcEEEECCCccccCC----------CCCCHHHHHHHHHHHHHHHH
Confidence 3477777776654322 11 2245788999999999999999999994 23466889999999999998
Q ss_pred hhcCCCCCCcEEEEccchhHHHHHHHHHhCC
Q psy3685 841 TLYSLPAHTKWIAFGGSYPGALAAWLRYKYP 871 (1042)
Q Consensus 841 ~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP 871 (1042)
.+..++.-..+|++|+|.||+++|....+.+
T Consensus 207 d~~~G~ka~~Ii~yG~SLGG~Vqa~AL~~~~ 237 (365)
T PF05677_consen 207 DEEQGPKAKNIILYGHSLGGGVQAEALKKEV 237 (365)
T ss_pred hcccCCChheEEEeeccccHHHHHHHHHhcc
Confidence 7665544457999999999999998655543
No 109
>COG0429 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=97.80 E-value=6.2e-05 Score=79.52 Aligned_cols=111 Identities=22% Similarity=0.302 Sum_probs=78.2
Q ss_pred CCcEEEEECCC-CCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 764 GGPIFLMIGGE-GEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 764 ~~pi~~~~gg~-g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
-.|.||.++|- |+....++. +++..+.++ |..+|.+.-|+.|.+.-. +. .-|-+-+ -+|++.|+..++..
T Consensus 74 ~~P~vVl~HGL~G~s~s~y~r-~L~~~~~~r-g~~~Vv~~~Rgcs~~~n~---~p--~~yh~G~--t~D~~~~l~~l~~~ 144 (345)
T COG0429 74 KKPLVVLFHGLEGSSNSPYAR-GLMRALSRR-GWLVVVFHFRGCSGEANT---SP--RLYHSGE--TEDIRFFLDWLKAR 144 (345)
T ss_pred CCceEEEEeccCCCCcCHHHH-HHHHHHHhc-CCeEEEEecccccCCccc---Cc--ceecccc--hhHHHHHHHHHHHh
Confidence 35878888664 655544333 555555555 899999999999988321 11 0111111 19999999999886
Q ss_pred cCCCCCCcEEEEccchhH-HHHHHHHHhCCCe-EEEEEeecccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPG-ALAAWLRYKYPHL-VHGAMSASGPLRA 886 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg-~la~~~~~~yP~~-~~g~i~ssa~~~~ 886 (1042)
+ ...|+..+|.|.|| |||-|+..+--+. +.||++.|+|...
T Consensus 145 ~---~~r~~~avG~SLGgnmLa~ylgeeg~d~~~~aa~~vs~P~Dl 187 (345)
T COG0429 145 F---PPRPLYAVGFSLGGNMLANYLGEEGDDLPLDAAVAVSAPFDL 187 (345)
T ss_pred C---CCCceEEEEecccHHHHHHHHHhhccCcccceeeeeeCHHHH
Confidence 5 34699999999999 8898888655443 6899999999864
No 110
>PRK10115 protease 2; Provisional
Probab=97.79 E-value=4.4e-05 Score=93.26 Aligned_cols=115 Identities=17% Similarity=0.177 Sum_probs=79.9
Q ss_pred CCcEEEEE-CCCCCCcccccccccHHHHHHHcCCeEEEEeeeec---ccccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 764 GGPIFLMI-GGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFY---GKSHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 764 ~~pi~~~~-gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~---G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
..|+|||+ ||+|..... .....+..|+.+ |+.|+..--||= |+..-- .-..+.-...++|+...++++
T Consensus 444 ~~P~ll~~hGg~~~~~~p-~f~~~~~~l~~r-G~~v~~~n~RGs~g~G~~w~~------~g~~~~k~~~~~D~~a~~~~L 515 (686)
T PRK10115 444 HNPLLVYGYGSYGASIDA-DFSFSRLSLLDR-GFVYAIVHVRGGGELGQQWYE------DGKFLKKKNTFNDYLDACDAL 515 (686)
T ss_pred CCCEEEEEECCCCCCCCC-CccHHHHHHHHC-CcEEEEEEcCCCCccCHHHHH------hhhhhcCCCcHHHHHHHHHHH
Confidence 35999999 666654321 222334567776 999999999973 433211 001122224678888888877
Q ss_pred hhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 840 NTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 840 ~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
... +..+..++.++|+||||.|++|+.-.+|+++.|+|+..+++...
T Consensus 516 v~~-g~~d~~rl~i~G~S~GG~l~~~~~~~~Pdlf~A~v~~vp~~D~~ 562 (686)
T PRK10115 516 LKL-GYGSPSLCYGMGGSAGGMLMGVAINQRPELFHGVIAQVPFVDVV 562 (686)
T ss_pred HHc-CCCChHHeEEEEECHHHHHHHHHHhcChhheeEEEecCCchhHh
Confidence 655 33344689999999999999999999999999999998777654
No 111
>KOG1146|consensus
Probab=97.78 E-value=1.2e-05 Score=97.60 Aligned_cols=103 Identities=17% Similarity=0.232 Sum_probs=73.3
Q ss_pred cccCCCCcccCCchhHHHHHHHhcC-----CccccCccCcccCCchhHHHHHHHhcCcccccCcCCcccCCchhhhhHhh
Q psy3685 505 YKCDLCSKEFSSKGNLTEHMKQHQG-----IRYKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKEFSRKDNLTEHKE 579 (1042)
Q Consensus 505 ~~C~~C~~~f~~~~~L~~H~~~H~~-----~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H~~ 579 (1042)
..|..|...|.....+..|-+--.. -.+.|..|++.|...-.+. |+- ...+|.|.+|...|...+.|..|.+
T Consensus 1229 l~~~~~e~~f~~~~~~~~~a~~~~~~~~~sGe~~c~~~~~~~~~~~~~~-~l~--~~~~~~~~~~~~~~~~~~~l~~~~~ 1305 (1406)
T KOG1146|consen 1229 LLPNALEQPFPQEPEPTATAPPKPPELPASGEGECGAVDELLTPSFGIS-TLD--VTHRYLCRQCKMAFDGEAPLTAHQR 1305 (1406)
T ss_pred ccHHhhhcCccCcccccccCCCCCCcCcCCCcchhhhccccccCcccee-ecc--cchhHHHHHHHhhhcchhHHHHHHH
Confidence 4577777777777776665432211 1477888888888777776 543 2236888888888888888888874
Q ss_pred hh-----------------cCC-cccCCccccccCCchhHHHHhhhcCCC
Q psy3685 580 IH-----------------QGI-RYKCDLCSKKFSRKLNLTKHMKIHQGI 611 (1042)
Q Consensus 580 ~H-----------------~~~-~~~C~~C~k~F~~~~~L~~H~~~H~~~ 611 (1042)
.- ... .| |..|...|.....|..||++-+++
T Consensus 1306 k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~~ 1354 (1406)
T KOG1146|consen 1306 KFCFAGRGSGGSMPPPLRVPDCTYH-CLACEVLLSGREALQIHMRSSAHR 1354 (1406)
T ss_pred HHHhccCccccCCCCcccCcccccc-chHHHhhcchhHHHHHHHHHhhhc
Confidence 32 112 35 999999999999999999976543
No 112
>PF11144 DUF2920: Protein of unknown function (DUF2920); InterPro: IPR022605 This bacterial family of proteins has no known function.
Probab=97.71 E-value=0.00041 Score=76.25 Aligned_cols=138 Identities=20% Similarity=0.159 Sum_probs=86.7
Q ss_pred cceEEEEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccC------CCCCCc-----
Q psy3685 749 TWKQRYFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHP------LSDLSV----- 817 (1042)
Q Consensus 749 tf~qry~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p------~~~~~~----- 817 (1042)
+=+..|-+...--++..-|||++||.|+........-....+|++++.++|.+..=++|.=.. +++...
T Consensus 19 ~sKLEyri~ydd~Ke~kaIvfiI~GfG~dan~~~~d~~r~~iA~~fnvv~I~V~YHCf~~R~q~~A~~~~~~~D~~iLk~ 98 (403)
T PF11144_consen 19 ESKLEYRISYDDEKEIKAIVFIIPGFGADANSNYLDFMREYIAKKFNVVVISVNYHCFCNRPQYGAKFYFDDIDKEILKK 98 (403)
T ss_pred cceeeEEeecCCCCCceEEEEEeCCcCCCcchHHHHHHHHHHHHhCCEEEEEeeeeheeeccccCchhcCCHHHHHHHHH
Confidence 334445554333455566999999999865544444456678999999999999666652211 111000
Q ss_pred ----ccc-----cC-CCHHHHHhHHHHHHH-------------------------------------------HHhhhcC
Q psy3685 818 ----ESL-----QY-LSAEQALADLAYFIT-------------------------------------------SMNTLYS 844 (1042)
Q Consensus 818 ----~~~-----~~-lt~~q~l~D~~~~~~-------------------------------------------~~~~~~~ 844 (1042)
-++ .. -+..+++..+.+.|. +++..+.
T Consensus 99 ~L~~i~i~~~~i~~~~~~~~~~~~L~~~I~~lK~~~~L~~d~kl~ls~tl~P~n~EYQN~GIMqAiD~INAl~~l~k~~~ 178 (403)
T PF11144_consen 99 SLEKINIDSESINTYDNAEQIYELLNQNITELKEQGILPQDYKLNLSCTLIPPNGEYQNFGIMQAIDIINALLDLKKIFP 178 (403)
T ss_pred HHHHcCccccccccchhHHHHHHHHHHHHHHHHhcCCCCCCcEEeEEEEecCCchhhhhhHHHHHHHHHHHHHHHHHhhh
Confidence 011 11 133333333333333 3333333
Q ss_pred CCC-CCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 845 LPA-HTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 845 ~~~-~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
..+ +.|+|++|+||||-||...+.-=|.+|+|.|=.|+.+..
T Consensus 179 ~~~~~lp~I~~G~s~G~yla~l~~k~aP~~~~~~iDns~~~~p 221 (403)
T PF11144_consen 179 KNGGGLPKIYIGSSHGGYLAHLCAKIAPWLFDGVIDNSSYALP 221 (403)
T ss_pred cccCCCcEEEEecCcHHHHHHHHHhhCccceeEEEecCccccc
Confidence 222 359999999999999999999999999999988877754
No 113
>KOG1838|consensus
Probab=97.66 E-value=0.00047 Score=75.88 Aligned_cols=112 Identities=17% Similarity=0.195 Sum_probs=79.3
Q ss_pred CCCcEEEEECCCC-CCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 763 HGGPIFLMIGGEG-EASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 763 ~~~pi~~~~gg~g-~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
.+.|++|++.|-. +....++. .+...|.+.|..+|.+-|||.|.|.-+ +.. ..|.- --+|+.+++++++.
T Consensus 123 ~~~P~vvilpGltg~S~~~YVr--~lv~~a~~~G~r~VVfN~RG~~g~~Lt----Tpr--~f~ag-~t~Dl~~~v~~i~~ 193 (409)
T KOG1838|consen 123 GTDPIVVILPGLTGGSHESYVR--HLVHEAQRKGYRVVVFNHRGLGGSKLT----TPR--LFTAG-WTEDLREVVNHIKK 193 (409)
T ss_pred CCCcEEEEecCCCCCChhHHHH--HHHHHHHhCCcEEEEECCCCCCCCccC----CCc--eeecC-CHHHHHHHHHHHHH
Confidence 4569999997763 33333332 255667777999999999999988432 111 11111 13899999999999
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHH-HhCCCe-EEEEEeecccccc
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLR-YKYPHL-VHGAMSASGPLRA 886 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~-~~yP~~-~~g~i~ssa~~~~ 886 (1042)
.|. .+|.+++|.|+||+|-.-|. ..-.+. +.||++.|.|.++
T Consensus 194 ~~P---~a~l~avG~S~Gg~iL~nYLGE~g~~~~l~~a~~v~~Pwd~ 237 (409)
T KOG1838|consen 194 RYP---QAPLFAVGFSMGGNILTNYLGEEGDNTPLIAAVAVCNPWDL 237 (409)
T ss_pred hCC---CCceEEEEecchHHHHHHHhhhccCCCCceeEEEEeccchh
Confidence 984 46999999999998776655 444444 7888888888875
No 114
>COG3509 LpqC Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=97.58 E-value=0.00048 Score=71.67 Aligned_cols=112 Identities=16% Similarity=0.265 Sum_probs=76.3
Q ss_pred ccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCC--CC--cccccCCCHHHHHhHHHHHH
Q psy3685 761 YKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSD--LS--VESLQYLSAEQALADLAYFI 836 (1042)
Q Consensus 761 ~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~--~~--~~~~~~lt~~q~l~D~~~~~ 836 (1042)
...+.|++|+++|-+..........-+..||++.|.+|++.| +|..+.+.+- .. .+. +-.-.+ -+.+++.+|
T Consensus 57 ~~~~apLvv~LHG~~~sgag~~~~sg~d~lAd~~gFlV~yPd--g~~~~wn~~~~~~~~~p~~-~~~g~d-dVgflr~lv 132 (312)
T COG3509 57 LPSGAPLVVVLHGSGGSGAGQLHGTGWDALADREGFLVAYPD--GYDRAWNANGCGNWFGPAD-RRRGVD-DVGFLRALV 132 (312)
T ss_pred CCCCCCEEEEEecCCCChHHhhcccchhhhhcccCcEEECcC--ccccccCCCcccccCCccc-ccCCcc-HHHHHHHHH
Confidence 344568999997765443333332236789999999999996 5555542110 00 000 011122 247778888
Q ss_pred HHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEE
Q psy3685 837 TSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGA 877 (1042)
Q Consensus 837 ~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~ 877 (1042)
..+..+|..++ .+|++.|-|-||.+|.++.-.||+++.|+
T Consensus 133 a~l~~~~gidp-~RVyvtGlS~GG~Ma~~lac~~p~~faa~ 172 (312)
T COG3509 133 AKLVNEYGIDP-ARVYVTGLSNGGRMANRLACEYPDIFAAI 172 (312)
T ss_pred HHHHHhcCcCc-ceEEEEeeCcHHHHHHHHHhcCcccccce
Confidence 88889987754 58999999999999999999999998854
No 115
>PF02129 Peptidase_S15: X-Pro dipeptidyl-peptidase (S15 family); InterPro: IPR000383 This entry represents a domain found peptidases Xaa-Pro dipeptidyl-peptidase and glutaryl-7-aminocephalosporanic-acid acylase, which belong to MEROPS peptidase families S15 and S45 respectively []. It is also found in hydrolases from the CocE/NonD family. Cocaine esterase (CocE) hydrolyzes cocaine endowing the bacteria with the ability to utilise cocaine as a sole source of carbon and energy []. ; GO: 0004177 aminopeptidase activity, 0006508 proteolysis; PDB: 1LNS_A 3PUI_A 3PUH_B 1JU3_A 3I2I_A 3I2G_A 1JU4_A 3I2K_A 3IDA_A 3I2H_A ....
Probab=97.58 E-value=0.00014 Score=79.02 Aligned_cols=113 Identities=17% Similarity=0.083 Sum_probs=76.2
Q ss_pred CcEEEEECCCCCCcc-ccccccc-------HHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHH
Q psy3685 765 GPIFLMIGGEGEASA-KWMVEGT-------WLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFI 836 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~-~~~~~~~-------~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~ 836 (1042)
-|+||...+.|.... ....... ...++++ |+.||..|.||.|.|. ..+..+ .++-.+|....|
T Consensus 20 ~P~il~~tpY~~~~~~~~~~~~~~~~~~~~~~~~~~~-GY~vV~~D~RG~g~S~-------G~~~~~-~~~e~~D~~d~I 90 (272)
T PF02129_consen 20 FPVILTRTPYGKGDQTASDLAGANPGPPSARRPFAER-GYAVVVQDVRGTGGSE-------GEFDPM-SPNEAQDGYDTI 90 (272)
T ss_dssp EEEEEEEESSTCTC-HHHHHHTTCHHSHGGGHHHHHT-T-EEEEEE-TTSTTS--------S-B-TT-SHHHHHHHHHHH
T ss_pred ccEEEEccCcCCCCCcccchhhhhcccchhHHHHHhC-CCEEEEECCcccccCC-------CccccC-ChhHHHHHHHHH
Confidence 378999888874321 1111111 1126666 9999999999999993 233333 666779999999
Q ss_pred HHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccccc
Q psy3685 837 TSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVV 888 (1042)
Q Consensus 837 ~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~ 888 (1042)
+.+..+ .- .+.+|-++|.||+|..+.+.+..-|..+.|.+..+++.....
T Consensus 91 ~W~~~Q-pw-s~G~VGm~G~SY~G~~q~~~A~~~~p~LkAi~p~~~~~d~~~ 140 (272)
T PF02129_consen 91 EWIAAQ-PW-SNGKVGMYGISYGGFTQWAAAARRPPHLKAIVPQSGWSDLYR 140 (272)
T ss_dssp HHHHHC-TT-EEEEEEEEEETHHHHHHHHHHTTT-TTEEEEEEESE-SBTCC
T ss_pred HHHHhC-CC-CCCeEEeeccCHHHHHHHHHHhcCCCCceEEEecccCCcccc
Confidence 999776 22 234899999999999999999988888888888777665443
No 116
>PF00450 Peptidase_S10: Serine carboxypeptidase; InterPro: IPR001563 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of serine peptidases belong to MEROPS peptidase family S10 (clan SC). The type example is carboxypeptidase Y from Saccharomyces cerevisiae (Baker's yeast) []. All known carboxypeptidases are either metallo carboxypeptidases or serine carboxypeptidases (3.4.16.5 from EC and 3.4.16.6 from EC). The catalytic activity of the serine carboxypeptidases, like that of the trypsin family serine proteases, is provided by a charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which is itself hydrogen-bonded to a serine []. The sequences surrounding the active site serine and histidine residues are highly conserved in all the serine carboxypeptidases.; GO: 0004185 serine-type carboxypeptidase activity, 0006508 proteolysis; PDB: 1AC5_A 1WHS_B 3SC2_B 1WHT_A 1BCR_A 1BCS_A 1GXS_A 1IVY_A 1WPX_A 1YSC_A ....
Probab=97.56 E-value=0.00054 Score=79.76 Aligned_cols=134 Identities=19% Similarity=0.207 Sum_probs=83.0
Q ss_pred EEEecc-cccCCCcEEEEE-CCCCCCccc---------ccc-cc----cHHHHHHHcCCeEEEEeee-ecccccCCCCCC
Q psy3685 754 YFVNST-WYKHGGPIFLMI-GGEGEASAK---------WMV-EG----TWLDYAHNHNALAVQVEHR-FYGKSHPLSDLS 816 (1042)
Q Consensus 754 y~~~~~-~~~~~~pi~~~~-gg~g~~~~~---------~~~-~~----~~~~la~~~~a~~~~~ehR-~~G~S~p~~~~~ 816 (1042)
||+-.. .-....||+|++ ||+|..+.. .+. .+ .....+....|.||++|++ +.|-|....
T Consensus 28 yw~~~s~~~~~~~Pl~~wlnGGPG~SS~~g~f~e~GP~~~~~~~~~~l~~n~~sW~~~an~l~iD~PvGtGfS~~~~--- 104 (415)
T PF00450_consen 28 YWFFESRNDPEDDPLILWLNGGPGCSSMWGLFGENGPFRINPDGPYTLEDNPYSWNKFANLLFIDQPVGTGFSYGND--- 104 (415)
T ss_dssp EEEEE-SSGGCSS-EEEEEE-TTTB-THHHHHCTTSSEEEETTSTSEEEE-TT-GGGTSEEEEE--STTSTT-EESS---
T ss_pred EEEEEeCCCCCCccEEEEecCCceeccccccccccCceEEeecccccccccccccccccceEEEeecCceEEeeccc---
Confidence 555433 323568999988 999865532 111 11 2334556667999999965 899997643
Q ss_pred cccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHH----HHHHHhC------CCeEEEEEeecccccc
Q psy3685 817 VESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALA----AWLRYKY------PHLVHGAMSASGPLRA 886 (1042)
Q Consensus 817 ~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la----~~~~~~y------P~~~~g~i~ssa~~~~ 886 (1042)
......+.+++.+|+..|++.+-..++.-...|+.++|-||||..+ ..+.... +=.+.|.+..++.+..
T Consensus 105 -~~~~~~~~~~~a~~~~~fl~~f~~~~p~~~~~~~yi~GESYgG~yvP~~a~~i~~~~~~~~~~~inLkGi~IGng~~dp 183 (415)
T PF00450_consen 105 -PSDYVWNDDQAAEDLYEFLQQFFQKFPEYRSNPLYIAGESYGGHYVPALASYILQQNKKGDQPKINLKGIAIGNGWIDP 183 (415)
T ss_dssp -GGGGS-SHHHHHHHHHHHHHHHHHHSGGGTTSEEEEEEETTHHHHHHHHHHHHHHHTCC--STTSEEEEEEEESE-SBH
T ss_pred -cccccchhhHHHHHHHHHHHHhhhhhhhccCCCEEEEccccccccchhhHHhhhhccccccccccccccceecCccccc
Confidence 2224458999999999999998777765455699999999999654 4444444 3347888888877766
Q ss_pred ccCCh
Q psy3685 887 VVDFP 891 (1042)
Q Consensus 887 ~~~~~ 891 (1042)
...+.
T Consensus 184 ~~~~~ 188 (415)
T PF00450_consen 184 RIQYN 188 (415)
T ss_dssp HHHHH
T ss_pred cccce
Confidence 54433
No 117
>PF07859 Abhydrolase_3: alpha/beta hydrolase fold A web page of Esterases and alpha/beta hydrolases.; InterPro: IPR013094 The alpha/beta hydrolase fold [] is common to a number of hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is an alpha/beta-sheet (rather than a barrel), containing 8 strands connected by helices []. The enzymes are believed to have diverged from a common ancestor, preserving the arrangement of the catalytic residues. All have a catalytic triad, the elements of which are borne on loops, which are the best conserved structural features of the fold. Esterase (EST) from Pseudomonas putida is a member of the alpha/beta hydrolase fold superfamily of enzymes []. In most of the family members the beta-strands are parallels, but some have an inversion of the first strands, which gives it an antiparallel orientation. The catalytic triad residues are presented on loops. One of these is the nucleophile elbow and is the most conserved feature of the fold. Some other members lack one or all of the catalytic residues. Some members are therefore inactive but others are involved in surface recognition. The ESTHER database [] gathers and annotates all the published information related to gene and protein sequences of this superfamily []. This entry represents the catalytic domain fold-3 of alpha/beta hydrolase. ; GO: 0016787 hydrolase activity, 0008152 metabolic process; PDB: 3D7R_B 2C7B_B 3ZWQ_B 2YH2_B 3BXP_A 3D3N_A 1LZK_A 1LZL_A 2O7V_A 2O7R_A ....
Probab=97.56 E-value=0.0002 Score=74.63 Aligned_cols=104 Identities=17% Similarity=0.082 Sum_probs=70.1
Q ss_pred EEEE-CCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh---c
Q psy3685 768 FLMI-GGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL---Y 843 (1042)
Q Consensus 768 ~~~~-gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~---~ 843 (1042)
|||+ ||-+..........+...+|.+.|+.|+.++.|-..+. +..++++|+...++++... +
T Consensus 1 v~~~HGGg~~~g~~~~~~~~~~~la~~~g~~v~~~~Yrl~p~~--------------~~p~~~~D~~~a~~~l~~~~~~~ 66 (211)
T PF07859_consen 1 VVYIHGGGWVMGSKESHWPFAARLAAERGFVVVSIDYRLAPEA--------------PFPAALEDVKAAYRWLLKNADKL 66 (211)
T ss_dssp EEEE--STTTSCGTTTHHHHHHHHHHHHTSEEEEEE---TTTS--------------STTHHHHHHHHHHHHHHHTHHHH
T ss_pred CEEECCcccccCChHHHHHHHHHHHhhccEEEEEeeccccccc--------------cccccccccccceeeeccccccc
Confidence 4555 55433322222334566789888999999999964221 3456899999999998776 3
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCCe----EEEEEeecccccc
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPHL----VHGAMSASGPLRA 886 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~~----~~g~i~ssa~~~~ 886 (1042)
..+ ..+++++|.|-||.||+.+.....+. +.|.++.+++...
T Consensus 67 ~~d-~~~i~l~G~SAGg~la~~~~~~~~~~~~~~~~~~~~~~p~~d~ 112 (211)
T PF07859_consen 67 GID-PERIVLIGDSAGGHLALSLALRARDRGLPKPKGIILISPWTDL 112 (211)
T ss_dssp TEE-EEEEEEEEETHHHHHHHHHHHHHHHTTTCHESEEEEESCHSST
T ss_pred ccc-ccceEEeecccccchhhhhhhhhhhhcccchhhhhcccccccc
Confidence 332 24899999999999999988755543 8888888875544
No 118
>PF09752 DUF2048: Uncharacterized conserved protein (DUF2048); InterPro: IPR019149 This family of proteins has no known function.
Probab=97.50 E-value=0.00052 Score=74.05 Aligned_cols=138 Identities=17% Similarity=0.262 Sum_probs=98.2
Q ss_pred CCcchhhhhcccCC----CCCcccceEEEEEecccc-cCCCcEEEEECCCCCCccccccccc-HHHHHHHcCCeEEEEee
Q psy3685 730 PSDQWFEQKLDHFN----IEEPRTWKQRYFVNSTWY-KHGGPIFLMIGGEGEASAKWMVEGT-WLDYAHNHNALAVQVEH 803 (1042)
Q Consensus 730 ~~~~~f~q~~dhf~----~~~~~tf~qry~~~~~~~-~~~~pi~~~~gg~g~~~~~~~~~~~-~~~la~~~~a~~~~~eh 803 (1042)
..++.|.-|+++.= +.+.++=.-+|++ .+-| .+..||.+.+.|-|+. .+|....+ ...|+++ |-.-+.+|-
T Consensus 53 ~~eG~F~SP~~~~~~~~lP~es~~a~~~~~~-P~~~~~~~rp~~IhLagTGDh-~f~rR~~l~a~pLl~~-gi~s~~le~ 129 (348)
T PF09752_consen 53 IREGEFRSPLAFYLPGLLPEESRTARFQLLL-PKRWDSPYRPVCIHLAGTGDH-GFWRRRRLMARPLLKE-GIASLILEN 129 (348)
T ss_pred EEEeEeCCchhhhccccCChhHhheEEEEEE-CCccccCCCceEEEecCCCcc-chhhhhhhhhhHHHHc-CcceEEEec
Confidence 45788999987641 1222232223433 4444 4568999999999985 44554444 6789998 999999999
Q ss_pred eecccccCCCCCCccccc-----CCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEE
Q psy3685 804 RFYGKSHPLSDLSVESLQ-----YLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVH 875 (1042)
Q Consensus 804 R~~G~S~p~~~~~~~~~~-----~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~ 875 (1042)
+|||.=+|.+.. ...|. ++-..+.+.+.+.++.+++.+ +. .|+.+.|-|+||.+|+..+...|.-|.
T Consensus 130 Pyyg~RkP~~Q~-~s~l~~VsDl~~~g~~~i~E~~~Ll~Wl~~~-G~---~~~g~~G~SmGG~~A~laa~~~p~pv~ 201 (348)
T PF09752_consen 130 PYYGQRKPKDQR-RSSLRNVSDLFVMGRATILESRALLHWLERE-GY---GPLGLTGISMGGHMAALAASNWPRPVA 201 (348)
T ss_pred ccccccChhHhh-cccccchhHHHHHHhHHHHHHHHHHHHHHhc-CC---CceEEEEechhHhhHHhhhhcCCCcee
Confidence 999999985421 11111 123367788888999998887 32 489999999999999999999999754
No 119
>PLN02209 serine carboxypeptidase
Probab=97.40 E-value=0.001 Score=76.26 Aligned_cols=123 Identities=15% Similarity=0.192 Sum_probs=78.5
Q ss_pred CCCcEEEEE-CCCCCCccc--cccccc----------------HHHHHHHcCCeEEEEe-eeecccccCCCCCCcccccC
Q psy3685 763 HGGPIFLMI-GGEGEASAK--WMVEGT----------------WLDYAHNHNALAVQVE-HRFYGKSHPLSDLSVESLQY 822 (1042)
Q Consensus 763 ~~~pi~~~~-gg~g~~~~~--~~~~~~----------------~~~la~~~~a~~~~~e-hR~~G~S~p~~~~~~~~~~~ 822 (1042)
...|++|++ ||+|..+.. +...|. ....+....|.|+++| .-|.|-|+...+ ..+
T Consensus 66 ~~~Pl~lWlnGGPG~SS~~g~f~e~GP~~~~~~~~~~~~~~l~~n~~sW~~~anllfiDqPvGtGfSy~~~~-----~~~ 140 (437)
T PLN02209 66 QEDPLIIWLNGGPGCSCLSGLFFENGPLALKNKVYNGSVPSLVSTTYSWTKTANIIFLDQPVGSGFSYSKTP-----IER 140 (437)
T ss_pred CCCCEEEEECCCCcHHHhhhHHHhcCCceeccCCCCCCcccceeCCCchhhcCcEEEecCCCCCCccCCCCC-----CCc
Confidence 357999999 999865431 111111 1223555578999999 568898864322 113
Q ss_pred CCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHH----HHHHHHHhC-----C-CeEEEEEeeccccccccCC
Q psy3685 823 LSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGA----LAAWLRYKY-----P-HLVHGAMSASGPLRAVVDF 890 (1042)
Q Consensus 823 lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~----la~~~~~~y-----P-~~~~g~i~ssa~~~~~~~~ 890 (1042)
-+.+++++|+..|++.+-..++.-.+.|+.++|-||||. ||..+.... | =.+.|.+...+.+.+..+.
T Consensus 141 ~~~~~~a~~~~~fl~~f~~~~p~~~~~~~yi~GESYaG~yvP~~a~~i~~~~~~~~~~~inl~Gi~igng~td~~~q~ 218 (437)
T PLN02209 141 TSDTSEVKKIHEFLQKWLIKHPQFLSNPFYVVGDSYSGMIVPALVHEISKGNYICCNPPINLQGYVLGNPITHIEFEQ 218 (437)
T ss_pred cCCHHHHHHHHHHHHHHHHhCccccCCCEEEEecCcCceehHHHHHHHHhhcccccCCceeeeeEEecCcccChhhhh
Confidence 345566699999998877666543456899999999995 555554433 1 1356777666666554443
No 120
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.39 E-value=5e-05 Score=47.47 Aligned_cols=23 Identities=52% Similarity=1.011 Sum_probs=21.8
Q ss_pred ccCCCCCCccCCcchHHHHHhhc
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEIH 705 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~H 705 (1042)
|.|+.|++.|+++.+|..|+++|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 78999999999999999999986
No 121
>PF00756 Esterase: Putative esterase; InterPro: IPR000801 This family contains several seemingly unrelated proteins, including human esterase D; mycobacterial antigen 85, which is responsible for the high affinity of mycobacteria to fibronectin; Corynebacterium glutamicum major secreted protein PS1; and hypothetical proteins from Escherichia coli, yeast, mycobacteria and Haemophilus influenzae.; PDB: 3LS2_A 1VA5_B 1DQZ_B 3HRH_A 1DQY_A 2GZR_A 2GZS_A 3GFF_A 1R88_A 3E4D_D ....
Probab=97.38 E-value=0.00032 Score=75.40 Aligned_cols=53 Identities=23% Similarity=0.269 Sum_probs=43.5
Q ss_pred HHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 834 YFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 834 ~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
+++..+...|...++. +++.|.|+||..|.++.++||+++.++++.|+.+...
T Consensus 101 el~p~i~~~~~~~~~~-~~i~G~S~GG~~Al~~~l~~Pd~F~~~~~~S~~~~~~ 153 (251)
T PF00756_consen 101 ELIPYIEANYRTDPDR-RAIAGHSMGGYGALYLALRHPDLFGAVIAFSGALDPS 153 (251)
T ss_dssp HHHHHHHHHSSEEECC-EEEEEETHHHHHHHHHHHHSTTTESEEEEESEESETT
T ss_pred cchhHHHHhcccccce-eEEeccCCCcHHHHHHHHhCccccccccccCcccccc
Confidence 5566677777654433 9999999999999999999999999999999765544
No 122
>PF08538 DUF1749: Protein of unknown function (DUF1749); InterPro: IPR013744 This is a plant and fungal family of unknown function. This family contains many hypothetical proteins. ; PDB: 2Q0X_B.
Probab=97.37 E-value=0.0018 Score=68.91 Aligned_cols=104 Identities=16% Similarity=0.064 Sum_probs=63.9
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHc---CCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNH---NALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~---~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
.+||+|||-|..-. ...++..||+.+ +..||.+..+- | ...+++-+.++=++|++.+|++++..
T Consensus 34 ~~llfIGGLtDGl~---tvpY~~~La~aL~~~~wsl~q~~LsS---S-------y~G~G~~SL~~D~~eI~~~v~ylr~~ 100 (303)
T PF08538_consen 34 NALLFIGGLTDGLL---TVPYLPDLAEALEETGWSLFQVQLSS---S-------YSGWGTSSLDRDVEEIAQLVEYLRSE 100 (303)
T ss_dssp SEEEEE--TT--TT----STCHHHHHHHHT-TT-EEEEE--GG---G-------BTTS-S--HHHHHHHHHHHHHHHHHH
T ss_pred cEEEEECCCCCCCC---CCchHHHHHHHhccCCeEEEEEEecC---c-------cCCcCcchhhhHHHHHHHHHHHHHHh
Confidence 38999999865422 334455666554 67788777663 1 12344558889999999999999987
Q ss_pred cCC-CCCCcEEEEccchhHHHHHHHHHhCC-----CeEEEEEeecc
Q psy3685 843 YSL-PAHTKWIAFGGSYPGALAAWLRYKYP-----HLVHGAMSASG 882 (1042)
Q Consensus 843 ~~~-~~~~~~i~~G~Sygg~la~~~~~~yP-----~~~~g~i~ssa 882 (1042)
-+. .+..+|||+|||-|---...|..+.. ..|+|+|.-++
T Consensus 101 ~~g~~~~~kIVLmGHSTGcQdvl~Yl~~~~~~~~~~~VdG~ILQAp 146 (303)
T PF08538_consen 101 KGGHFGREKIVLMGHSTGCQDVLHYLSSPNPSPSRPPVDGAILQAP 146 (303)
T ss_dssp S------S-EEEEEECCHHHHHHHHHHH-TT---CCCEEEEEEEEE
T ss_pred hccccCCccEEEEecCCCcHHHHHHHhccCccccccceEEEEEeCC
Confidence 321 23458999999999988888776553 66999997653
No 123
>PRK06765 homoserine O-acetyltransferase; Provisional
Probab=97.36 E-value=0.00044 Score=78.56 Aligned_cols=154 Identities=16% Similarity=0.148 Sum_probs=89.6
Q ss_pred ccCCCCCcccceEEEEEeccccc----CCCcEEEEECCCCCCcc-------cccccccHHHHHH------HcCCeEEEEe
Q psy3685 740 DHFNIEEPRTWKQRYFVNSTWYK----HGGPIFLMIGGEGEASA-------KWMVEGTWLDYAH------NHNALAVQVE 802 (1042)
Q Consensus 740 dhf~~~~~~tf~qry~~~~~~~~----~~~pi~~~~gg~g~~~~-------~~~~~~~~~~la~------~~~a~~~~~e 802 (1042)
.-|..++.++-++ .-|-.+-|. ...+|||+.++.+..+. .....|++..|.- --...||..+
T Consensus 28 ~~f~l~~G~~l~~-~~~~Y~t~G~ln~~~~n~vlv~h~~tg~~h~~~~~~~~~~~~gww~~~iG~g~~lDt~~yfvi~~n 106 (389)
T PRK06765 28 KEFTTEGGRTIPD-VQMGYETYGTLNRAKSNVILITHYFSATSHAAGKYTADDEESGYWDGLIGPGKAIDTNKYFVISTD 106 (389)
T ss_pred CCEEccCCCCcCC-ceEEEEeccccCCCCCCEEEEeCCCCCchhhcccccccCCCcccHHhccCCCCCcCCCceEEEEec
Confidence 3466566666532 333334343 23589999988754221 0111344433311 1246788888
Q ss_pred eeeccc--------ccCCC---CCC--cc-cccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcE-EEEccchhHHHHHHHH
Q psy3685 803 HRFYGK--------SHPLS---DLS--VE-SLQYLSAEQALADLAYFITSMNTLYSLPAHTKW-IAFGGSYPGALAAWLR 867 (1042)
Q Consensus 803 hR~~G~--------S~p~~---~~~--~~-~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~-i~~G~Sygg~la~~~~ 867 (1042)
.-|=|. +.|.. +.. .. +.-.+|+++.++|+..+++.+.. .++ +++|+||||++|..+.
T Consensus 107 ~lG~~~~~~p~~g~tgp~s~~p~tg~~~~~~fP~~t~~d~~~~~~~ll~~lgi-------~~~~~vvG~SmGG~ial~~a 179 (389)
T PRK06765 107 TLCNVQVKDPNVITTGPASINPKTGKPYGMDFPVVTILDFVRVQKELIKSLGI-------ARLHAVMGPSMGGMQAQEWA 179 (389)
T ss_pred ccCCCcCCCCCCCCCCCCCCCcCCCCccCCCCCcCcHHHHHHHHHHHHHHcCC-------CCceEEEEECHHHHHHHHHH
Confidence 876443 22221 100 00 12236888888888888876432 256 5999999999999999
Q ss_pred HhCCCeEEEEEeeccccccccCC-hhhHHHHHHHh
Q psy3685 868 YKYPHLVHGAMSASGPLRAVVDF-PEYFGVVADAL 901 (1042)
Q Consensus 868 ~~yP~~~~g~i~ssa~~~~~~~~-~~~~~~v~~~~ 901 (1042)
.+||+.|.++|..++......-. ..+.+...+.+
T Consensus 180 ~~~P~~v~~lv~ia~~~~~~~~~~~~~~~~~~~ai 214 (389)
T PRK06765 180 VHYPHMVERMIGVIGNPQNDAWTSVNVLQNWAEAI 214 (389)
T ss_pred HHChHhhheEEEEecCCCCChhHHHHHHHHHHHHH
Confidence 99999999988765544332222 34555444444
No 124
>PLN03016 sinapoylglucose-malate O-sinapoyltransferase
Probab=97.36 E-value=0.0011 Score=75.90 Aligned_cols=123 Identities=15% Similarity=0.135 Sum_probs=75.2
Q ss_pred CCCcEEEEE-CCCCCCcc--cccccccH----------------HHHHHHcCCeEEEEe-eeecccccCCCCCCcccccC
Q psy3685 763 HGGPIFLMI-GGEGEASA--KWMVEGTW----------------LDYAHNHNALAVQVE-HRFYGKSHPLSDLSVESLQY 822 (1042)
Q Consensus 763 ~~~pi~~~~-gg~g~~~~--~~~~~~~~----------------~~la~~~~a~~~~~e-hR~~G~S~p~~~~~~~~~~~ 822 (1042)
...|++|++ ||+|..+. .+...|.. ...+....|.|+++| .-|.|-|+...+. .+
T Consensus 64 ~~~P~~lWlnGGPG~SS~~g~~~e~GP~~~~~~~~~~~~~~l~~n~~sW~~~anllfiDqPvGtGfSy~~~~~-----~~ 138 (433)
T PLN03016 64 KEDPLLIWLNGGPGCSCLGGIIFENGPVGLKFEVFNGSAPSLFSTTYSWTKMANIIFLDQPVGSGFSYSKTPI-----DK 138 (433)
T ss_pred ccCCEEEEEcCCCcHHHHHHHHHhcCCceeeccccCCCCCceeeCCCchhhcCcEEEecCCCCCCccCCCCCC-----Cc
Confidence 467999999 99986552 12222211 223444569999999 5688998643221 12
Q ss_pred CCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHH----HHHHHHHhC------CCeEEEEEeeccccccccCC
Q psy3685 823 LSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGA----LAAWLRYKY------PHLVHGAMSASGPLRAVVDF 890 (1042)
Q Consensus 823 lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~----la~~~~~~y------P~~~~g~i~ssa~~~~~~~~ 890 (1042)
.+.+++++|+.+|++.+-..++.-...|+.++|-||||. ||..+.... +=.+.|.+...+.+.+..++
T Consensus 139 ~~d~~~a~~~~~fl~~f~~~~p~~~~~~~yi~GESYaG~yvP~la~~i~~~n~~~~~~~inLkGi~iGNg~t~~~~~~ 216 (433)
T PLN03016 139 TGDISEVKRTHEFLQKWLSRHPQYFSNPLYVVGDSYSGMIVPALVQEISQGNYICCEPPINLQGYMLGNPVTYMDFEQ 216 (433)
T ss_pred cCCHHHHHHHHHHHHHHHHhChhhcCCCEEEEccCccceehHHHHHHHHhhcccccCCcccceeeEecCCCcCchhhh
Confidence 333334489999998876665432346899999999996 444443332 22466766666655554433
No 125
>COG0400 Predicted esterase [General function prediction only]
Probab=97.35 E-value=0.00034 Score=71.22 Aligned_cols=120 Identities=20% Similarity=0.202 Sum_probs=71.8
Q ss_pred cCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeee--ecccccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 762 KHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR--FYGKSHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 762 ~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR--~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
+|+.|+||+++|.|.....+.. +...+.. ++.+|.+-=+ .+|.-.-+.-.....+..=..+...+.+++|+..+
T Consensus 15 ~p~~~~iilLHG~Ggde~~~~~--~~~~~~P--~~~~is~rG~v~~~g~~~~f~~~~~~~~d~edl~~~~~~~~~~l~~~ 90 (207)
T COG0400 15 DPAAPLLILLHGLGGDELDLVP--LPELILP--NATLVSPRGPVAENGGPRFFRRYDEGSFDQEDLDLETEKLAEFLEEL 90 (207)
T ss_pred CCCCcEEEEEecCCCChhhhhh--hhhhcCC--CCeEEcCCCCccccCcccceeecCCCccchhhHHHHHHHHHHHHHHH
Confidence 4677899999999854333222 2222222 3444443211 00100000000001111113344556666777777
Q ss_pred hhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 840 NTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 840 ~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
...++.+. .++|++|.|=|++||+.+.+++|+++.|+|+.|+.+..
T Consensus 91 ~~~~gi~~-~~ii~~GfSqGA~ial~~~l~~~~~~~~ail~~g~~~~ 136 (207)
T COG0400 91 AEEYGIDS-SRIILIGFSQGANIALSLGLTLPGLFAGAILFSGMLPL 136 (207)
T ss_pred HHHhCCCh-hheEEEecChHHHHHHHHHHhCchhhccchhcCCcCCC
Confidence 77777653 58999999999999999999999999999998876643
No 126
>TIGR01838 PHA_synth_I poly(R)-hydroxyalkanoic acid synthase, class I. This model represents the class I subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs with three to five carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=97.35 E-value=0.0018 Score=75.78 Aligned_cols=105 Identities=10% Similarity=0.037 Sum_probs=69.5
Q ss_pred CcEEEEECCCCCCcccccc-----cccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCH-HHHHhHHHHHHHH
Q psy3685 765 GPIFLMIGGEGEASAKWMV-----EGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSA-EQALADLAYFITS 838 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~-----~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~-~q~l~D~~~~~~~ 838 (1042)
.| ||++.|.. ...++. .+++..|+++ |..||++|.|++|.|... ++. +-+.+++...|..
T Consensus 189 ~P-lLiVp~~i--~k~yilDL~p~~Slv~~L~~q-Gf~V~~iDwrgpg~s~~~----------~~~ddY~~~~i~~al~~ 254 (532)
T TIGR01838 189 TP-LLIVPPWI--NKYYILDLRPQNSLVRWLVEQ-GHTVFVISWRNPDASQAD----------KTFDDYIRDGVIAALEV 254 (532)
T ss_pred Cc-EEEECccc--ccceeeecccchHHHHHHHHC-CcEEEEEECCCCCccccc----------CChhhhHHHHHHHHHHH
Confidence 45 66666643 222322 2456666665 999999999999987321 112 2344456666666
Q ss_pred HhhhcCCCCCCcEEEEccchhHHHH----HHHHHhC-CCeEEEEEeecccccc
Q psy3685 839 MNTLYSLPAHTKWIAFGGSYPGALA----AWLRYKY-PHLVHGAMSASGPLRA 886 (1042)
Q Consensus 839 ~~~~~~~~~~~~~i~~G~Sygg~la----~~~~~~y-P~~~~g~i~ssa~~~~ 886 (1042)
+....+ ..+++++|+|+||+|+ +++.... |+.|.+++..++++..
T Consensus 255 v~~~~g---~~kv~lvG~cmGGtl~a~ala~~aa~~~~~rv~slvll~t~~Df 304 (532)
T TIGR01838 255 VEAITG---EKQVNCVGYCIGGTLLSTALAYLAARGDDKRIKSATFFTTLLDF 304 (532)
T ss_pred HHHhcC---CCCeEEEEECcCcHHHHHHHHHHHHhCCCCccceEEEEecCcCC
Confidence 665442 3589999999999985 3455554 8889999988888754
No 127
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.29 E-value=9.4e-05 Score=46.22 Aligned_cols=22 Identities=55% Similarity=1.076 Sum_probs=12.2
Q ss_pred ccCCccccccCCchhHHHHhhh
Q psy3685 586 YKCDLCSKKFSRKLNLTKHMKI 607 (1042)
Q Consensus 586 ~~C~~C~k~F~~~~~L~~H~~~ 607 (1042)
|+|++|++.|.++..|+.|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4455555555555555555554
No 128
>PF05448 AXE1: Acetyl xylan esterase (AXE1); InterPro: IPR008391 This family consists of several bacterial acetyl xylan esterase proteins. Acetyl xylan esterases are enzymes that hydrolyse the ester linkages of the acetyl groups in position 2 and/or 3 of the xylose moieties of natural acetylated xylan from hardwood. These enzymes are one of the accessory enzymes which are part of the xylanolytic system, together with xylanases, beta-xylosidases, alpha-arabinofuranosidases and methylglucuronidases; these are all required for the complete hydrolysis of xylan [].; PDB: 1VLQ_H 3M81_E 3M82_D 3M83_C 3FCY_A 1ODS_F 1ODT_C 1L7A_A 3FYT_A 2XLB_F ....
Probab=97.28 E-value=0.00087 Score=73.80 Aligned_cols=111 Identities=19% Similarity=0.133 Sum_probs=66.7
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCC-CCCC-c----------cc-ccCCCHHHHHhH
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPL-SDLS-V----------ES-LQYLSAEQALAD 831 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~-~~~~-~----------~~-~~~lt~~q~l~D 831 (1042)
-|+||..+|.|.....+.. ...+|.. |..++++|-||.|...+. ...+ . ++ ...+-...++.|
T Consensus 83 ~Pavv~~hGyg~~~~~~~~---~~~~a~~-G~~vl~~d~rGqg~~~~d~~~~~~~~~~g~~~~g~~~~~e~~yyr~~~~D 158 (320)
T PF05448_consen 83 LPAVVQFHGYGGRSGDPFD---LLPWAAA-GYAVLAMDVRGQGGRSPDYRGSSGGTLKGHITRGIDDNPEDYYYRRVYLD 158 (320)
T ss_dssp EEEEEEE--TT--GGGHHH---HHHHHHT-T-EEEEE--TTTSSSS-B-SSBSSS-SSSSTTTTTTS-TTT-HHHHHHHH
T ss_pred cCEEEEecCCCCCCCCccc---ccccccC-CeEEEEecCCCCCCCCCCccccCCCCCccHHhcCccCchHHHHHHHHHHH
Confidence 4788998888876444332 3356665 999999999999933221 1111 0 11 112223456788
Q ss_pred HHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeec
Q psy3685 832 LAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSAS 881 (1042)
Q Consensus 832 ~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ss 881 (1042)
...-+..+...-.. +..+++++|+|.||.||++.+...|. |.++++..
T Consensus 159 ~~ravd~l~slpev-D~~rI~v~G~SqGG~lal~~aaLd~r-v~~~~~~v 206 (320)
T PF05448_consen 159 AVRAVDFLRSLPEV-DGKRIGVTGGSQGGGLALAAAALDPR-VKAAAADV 206 (320)
T ss_dssp HHHHHHHHHTSTTE-EEEEEEEEEETHHHHHHHHHHHHSST--SEEEEES
T ss_pred HHHHHHHHHhCCCc-CcceEEEEeecCchHHHHHHHHhCcc-ccEEEecC
Confidence 88888887764322 22479999999999999999999998 66666554
No 129
>PF07819 PGAP1: PGAP1-like protein; InterPro: IPR012908 The sequences found in this family are similar to PGAP1 (Q765A7 from SWISSPROT). This is an endoplasmic reticulum membrane protein with a catalytic serine-containing motif that is conserved in a number of lipases. PGAP1 functions as a GPI inositol-deacylase; this deacylation is important for the efficient transport of GPI-anchored proteins from the endoplasmic reticulum to the Golgi body [].; GO: 0016788 hydrolase activity, acting on ester bonds, 0006505 GPI anchor metabolic process, 0006886 intracellular protein transport, 0031227 intrinsic to endoplasmic reticulum membrane
Probab=97.27 E-value=0.0025 Score=66.57 Aligned_cols=58 Identities=24% Similarity=0.165 Sum_probs=41.0
Q ss_pred hHHHHHHHHHhhhc--CCCCCCcEEEEccchhHHHHHHHHHhCC---CeEEEEEeeccccccc
Q psy3685 830 ADLAYFITSMNTLY--SLPAHTKWIAFGGSYPGALAAWLRYKYP---HLVHGAMSASGPLRAV 887 (1042)
Q Consensus 830 ~D~~~~~~~~~~~~--~~~~~~~~i~~G~Sygg~la~~~~~~yP---~~~~g~i~ssa~~~~~ 887 (1042)
+-+++.|+.+...+ ...+..++|++||||||.+|..+....+ +.|.+.|.-++|....
T Consensus 64 ~~~~~~i~~i~~~~~~~~~~~~~vilVgHSmGGlvar~~l~~~~~~~~~v~~iitl~tPh~g~ 126 (225)
T PF07819_consen 64 EFLAEAIKYILELYKSNRPPPRSVILVGHSMGGLVARSALSLPNYDPDSVKTIITLGTPHRGS 126 (225)
T ss_pred HHHHHHHHHHHHhhhhccCCCCceEEEEEchhhHHHHHHHhccccccccEEEEEEEcCCCCCc
Confidence 33444455554444 2234468999999999999988776544 5799999999998654
No 130
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.24 E-value=0.00039 Score=54.11 Aligned_cols=48 Identities=31% Similarity=0.642 Sum_probs=25.3
Q ss_pred cccCCCCcccCCchhHHHHHHH-hcCC--ccccCccCcccCCchhHHHHHHHhc
Q psy3685 505 YKCDLCSKEFSSKGNLTEHMKQ-HQGI--RYKCDLCSKEFSRKDNLTEHKEIHQ 555 (1042)
Q Consensus 505 ~~C~~C~~~f~~~~~L~~H~~~-H~~~--~~~C~~C~k~f~~~~~L~~H~~~H~ 555 (1042)
|.|++|++ ..+...|..|... |.+. .+.|++|...+. .+|..|+..++
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 56666666 3445566666543 3222 456666665433 35666655543
No 131
>PF02230 Abhydrolase_2: Phospholipase/Carboxylesterase; InterPro: IPR003140 This entry represents the alpha/beta hydrolase domain found in phospholipases [], carboxylesterases [] and thioesterases.; GO: 0016787 hydrolase activity; PDB: 3U0V_A 1AUR_A 1AUO_B 1FJ2_B 3CN9_A 3CN7_A.
Probab=97.23 E-value=0.00055 Score=71.55 Aligned_cols=120 Identities=18% Similarity=0.215 Sum_probs=63.2
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHH-HHcCCeEEEEeee--e----ccc----ccCCCCCCccc-ccCCCHHHHHh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYA-HNHNALAVQVEHR--F----YGK----SHPLSDLSVES-LQYLSAEQALA 830 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la-~~~~a~~~~~ehR--~----~G~----S~p~~~~~~~~-~~~lt~~q~l~ 830 (1042)
+..|+||++||.|.....+.. ...+. ..-+..+|+..=+ . .|. ..+....+.+. ...-.++++++
T Consensus 12 ~~~~lvi~LHG~G~~~~~~~~---~~~~~~~~~~~~~i~p~ap~~~~~~~~g~~~~~Wf~~~~~~~~~~~~~~~i~~s~~ 88 (216)
T PF02230_consen 12 KAKPLVILLHGYGDSEDLFAL---LAELNLALPNTRFISPRAPSRPVTVPGGYRMPAWFDIYDFDPEGPEDEAGIEESAE 88 (216)
T ss_dssp T-SEEEEEE--TTS-HHHHHH---HHHHHTCSTTEEEEEE---EEE-GGGTT-EEE-SS-BSCSSSSSEB-HHHHHHHHH
T ss_pred CCceEEEEECCCCCCcchhHH---HHhhcccCCceEEEeccCCCCCcccccccCCCceeeccCCCcchhhhHHHHHHHHH
Confidence 457899999999876532221 11111 1123344443221 1 232 12222222222 11113445555
Q ss_pred HHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 831 DLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 831 D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
-+..||....... . +..+++++|.|-||++|..+.+.+|+.+.|+|+-|+.+...
T Consensus 89 ~l~~li~~~~~~~-i-~~~ri~l~GFSQGa~~al~~~l~~p~~~~gvv~lsG~~~~~ 143 (216)
T PF02230_consen 89 RLDELIDEEVAYG-I-DPSRIFLGGFSQGAAMALYLALRYPEPLAGVVALSGYLPPE 143 (216)
T ss_dssp HHHHHHHHHHHTT----GGGEEEEEETHHHHHHHHHHHCTSSTSSEEEEES---TTG
T ss_pred HHHHHHHHHHHcC-C-ChhheehhhhhhHHHHHHHHHHHcCcCcCEEEEeecccccc
Confidence 5666666544332 3 23589999999999999999999999999999999877543
No 132
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.22 E-value=0.00029 Score=54.77 Aligned_cols=48 Identities=29% Similarity=0.632 Sum_probs=21.6
Q ss_pred cccCcCCcccCCchhhhhHhhh-hcCC--cccCCccccccCCchhHHHHhhhcC
Q psy3685 559 YKCDLCSKEFSRKDNLTEHKEI-HQGI--RYKCDLCSKKFSRKLNLTKHMKIHQ 609 (1042)
Q Consensus 559 ~~C~~C~k~F~~~~~L~~H~~~-H~~~--~~~C~~C~k~F~~~~~L~~H~~~H~ 609 (1042)
|.|+.|++. .+...|..|... |... .+.|++|...+. .+|..||+.+|
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 445555552 223445555332 2222 355555555433 25555555444
No 133
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.18 E-value=0.00016 Score=65.17 Aligned_cols=73 Identities=23% Similarity=0.479 Sum_probs=16.6
Q ss_pred ccCccCcccCCchhHHHHHHHhcCcccccCcCCcccCCchhhhhHhhhhcCCcccCCccccccCCchhHHHHhhhc
Q psy3685 533 KCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKKFSRKLNLTKHMKIH 608 (1042)
Q Consensus 533 ~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H~~~H 608 (1042)
+|..|+..|.+...|..|+...++.... ....+.....+..+.+.-....+.|.+|++.|.....|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP---DQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ---------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccccccc---cccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 3666666666666666666554443211 112222333444444333333567777777777777777777654
No 134
>PF06500 DUF1100: Alpha/beta hydrolase of unknown function (DUF1100); InterPro: IPR010520 Proteins in this entry display esterase activity toward pNP-butyrate []. This entry also includes 2,6-dihydropseudooxynicotine hydrolase which has a role in nicotine catabolism by cleaving a C-C bond in 2,6-dihydroxypseudooxyicotine [, ].; PDB: 3OUR_A 3MVE_B 2JBW_C.
Probab=97.18 E-value=0.00034 Score=77.64 Aligned_cols=111 Identities=20% Similarity=0.200 Sum_probs=64.6
Q ss_pred CCcEEEEECCCCCCcccccccccHHH-HHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLD-YAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~-la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
.-|+||++||.......+.. .+.+ +++ .|.+++.+|-++-|.|...+ ++.+ +-...|++-|. +..+. .
T Consensus 189 p~P~VIv~gGlDs~qeD~~~--l~~~~l~~-rGiA~LtvDmPG~G~s~~~~-l~~D---~~~l~~aVLd~---L~~~p-~ 257 (411)
T PF06500_consen 189 PYPTVIVCGGLDSLQEDLYR--LFRDYLAP-RGIAMLTVDMPGQGESPKWP-LTQD---SSRLHQAVLDY---LASRP-W 257 (411)
T ss_dssp -EEEEEEE--TTS-GGGGHH--HHHCCCHH-CT-EEEEE--TTSGGGTTT--S-S----CCHHHHHHHHH---HHHST-T
T ss_pred CCCEEEEeCCcchhHHHHHH--HHHHHHHh-CCCEEEEEccCCCcccccCC-CCcC---HHHHHHHHHHH---HhcCC-c
Confidence 34899999998766543221 1223 344 49999999999999983211 2211 11233444333 22211 1
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVV 888 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~ 888 (1042)
+ +..+|.++|.|+||.+|.-++...|+.+.|.|+.+|+|...+
T Consensus 258 V---D~~RV~~~G~SfGGy~AvRlA~le~~RlkavV~~Ga~vh~~f 300 (411)
T PF06500_consen 258 V---DHTRVGAWGFSFGGYYAVRLAALEDPRLKAVVALGAPVHHFF 300 (411)
T ss_dssp E---EEEEEEEEEETHHHHHHHHHHHHTTTT-SEEEEES---SCGG
T ss_pred c---ChhheEEEEeccchHHHHHHHHhcccceeeEeeeCchHhhhh
Confidence 1 235899999999999999999999999999999999986654
No 135
>COG2021 MET2 Homoserine acetyltransferase [Amino acid transport and metabolism]
Probab=97.17 E-value=0.0035 Score=67.82 Aligned_cols=184 Identities=16% Similarity=0.077 Sum_probs=98.8
Q ss_pred eEEEEEeccccc----CCCcEEEEECCCCCCc---cc--ccccccHHHHHHH------cCCeEEEEeeee--cccccCCC
Q psy3685 751 KQRYFVNSTWYK----HGGPIFLMIGGEGEAS---AK--WMVEGTWLDYAHN------HNALAVQVEHRF--YGKSHPLS 813 (1042)
Q Consensus 751 ~qry~~~~~~~~----~~~pi~~~~gg~g~~~---~~--~~~~~~~~~la~~------~~a~~~~~ehR~--~G~S~p~~ 813 (1042)
-+.++|-.+-|. ...++||+.+|..... .. ....|++.++..- ...-||.+.-=| +|-|.|.+
T Consensus 33 l~~~~vay~T~Gtln~~~~NaVli~HaLtG~~h~~~~~~~~~~GWW~~liGpG~~iDt~r~fvIc~NvlG~c~GStgP~s 112 (368)
T COG2021 33 LSDARVAYETYGTLNAEKDNAVLICHALTGDSHAAGTADDGEKGWWDDLIGPGKPIDTERFFVICTNVLGGCKGSTGPSS 112 (368)
T ss_pred ccCcEEEEEecccccccCCceEEEeccccCcccccccCCCCCCccHHHhcCCCCCCCccceEEEEecCCCCCCCCCCCCC
Confidence 345677766553 2356888888774421 11 1112455544221 134566666544 35556643
Q ss_pred CCCcccccCCC--HHHHHhHHHHHHHHHhhhcCCCCCCcE-EEEccchhHHHHHHHHHhCCCeEEEEEeeccccccccCC
Q psy3685 814 DLSVESLQYLS--AEQALADLAYFITSMNTLYSLPAHTKW-IAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVVDF 890 (1042)
Q Consensus 814 ~~~~~~~~~lt--~~q~l~D~~~~~~~~~~~~~~~~~~~~-i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~~~ 890 (1042)
.... .--|-+ ..-.|.|.+.+-+.+...++. .+| .++|||||||.|.-....||+.|..+|.-++..+.....
T Consensus 113 ~~p~-g~~yg~~FP~~ti~D~V~aq~~ll~~LGI---~~l~avvGgSmGGMqaleWa~~yPd~V~~~i~ia~~~r~s~~~ 188 (368)
T COG2021 113 INPG-GKPYGSDFPVITIRDMVRAQRLLLDALGI---KKLAAVVGGSMGGMQALEWAIRYPDRVRRAIPIATAARLSAQN 188 (368)
T ss_pred cCCC-CCccccCCCcccHHHHHHHHHHHHHhcCc---ceEeeeeccChHHHHHHHHHHhChHHHhhhheecccccCCHHH
Confidence 2111 101111 122345555555555555443 355 589999999999999999999999877666666555555
Q ss_pred hhhHHHHHHHhhcc--------ChhhHHHHHHHHHHHHHH-hcCCchHHHHHHHhcCCC
Q psy3685 891 PEYFGVVADALATV--------STECVKAVQTATHTISKM-LKSPSDAKYLTEQFKLCT 940 (1042)
Q Consensus 891 ~~~~~~v~~~~~~~--------~~~c~~~i~~~~~~~~~~-~~~~~~~~~l~~~f~~~~ 940 (1042)
..|.++..+++... ...+.. ..+......+ ..+..+.+.+.+.|+.-.
T Consensus 189 ia~~~~~r~AI~~DP~~n~G~Y~~~~~P--~~GL~~AR~l~~ltYrS~~~~~~rF~r~~ 245 (368)
T COG2021 189 IAFNEVQRQAIEADPDWNGGDYYEGTQP--ERGLRLARMLAHLTYRSEEELDERFGRRL 245 (368)
T ss_pred HHHHHHHHHHHHhCCCccCCCccCCCCc--chhHHHHHHHHHHHccCHHHHHHHhcccc
Confidence 56666666666431 111221 1222222221 124445566888888733
No 136
>PF10230 DUF2305: Uncharacterised conserved protein (DUF2305); InterPro: IPR019363 This entry contains proteins that have no known function.
Probab=97.17 E-value=0.0029 Score=68.06 Aligned_cols=116 Identities=16% Similarity=0.145 Sum_probs=84.6
Q ss_pred cEEEEE-CCCCCCcccccccccHHHHHHH--cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 766 PIFLMI-GGEGEASAKWMVEGTWLDYAHN--HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 766 pi~~~~-gg~g~~~~~~~~~~~~~~la~~--~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
++||++ |++|-+.. |..|+..|.+. .+..|+++-|.|+-.+...++.+ .+-...+.++=++--..|++.+...
T Consensus 3 ~li~~IPGNPGlv~f---Y~~Fl~~L~~~l~~~~~i~~ish~Gh~~~~~~~~~~-~~~~~~sL~~QI~hk~~~i~~~~~~ 78 (266)
T PF10230_consen 3 PLIVFIPGNPGLVEF---YEEFLSALYEKLNPQFEILGISHAGHSTSPSNSKFS-PNGRLFSLQDQIEHKIDFIKELIPQ 78 (266)
T ss_pred EEEEEECCCCChHHH---HHHHHHHHHHhCCCCCeeEEecCCCCcCCccccccc-CCCCccCHHHHHHHHHHHHHHHhhh
Confidence 567777 77775533 44567777766 48899999999997775543222 3456668888788888888877665
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCC---CeEEEEEeecccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYP---HLVHGAMSASGPLRA 886 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP---~~~~g~i~ssa~~~~ 886 (1042)
.. .++.++|++|||.|+-|+.-+..++| ..|.+++.-=+.+..
T Consensus 79 ~~-~~~~~liLiGHSIGayi~levl~r~~~~~~~V~~~~lLfPTi~~ 124 (266)
T PF10230_consen 79 KN-KPNVKLILIGHSIGAYIALEVLKRLPDLKFRVKKVILLFPTIED 124 (266)
T ss_pred hc-CCCCcEEEEeCcHHHHHHHHHHHhccccCCceeEEEEeCCcccc
Confidence 43 13469999999999999999999999 667777766665543
No 137
>COG0657 Aes Esterase/lipase [Lipid metabolism]
Probab=97.16 E-value=0.0015 Score=72.79 Aligned_cols=111 Identities=20% Similarity=0.172 Sum_probs=73.7
Q ss_pred cCCCcEEEEECCCCCCccccc-ccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHh
Q psy3685 762 KHGGPIFLMIGGEGEASAKWM-VEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMN 840 (1042)
Q Consensus 762 ~~~~pi~~~~gg~g~~~~~~~-~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~ 840 (1042)
..+.|+|||++|-|=...... ...+...++...|+.||.++.|---+- +++ .+++|+..-+.++.
T Consensus 76 ~~~~p~vly~HGGg~~~g~~~~~~~~~~~~~~~~g~~vv~vdYrlaPe~-~~p-------------~~~~d~~~a~~~l~ 141 (312)
T COG0657 76 AATAPVVLYLHGGGWVLGSLRTHDALVARLAAAAGAVVVSVDYRLAPEH-PFP-------------AALEDAYAAYRWLR 141 (312)
T ss_pred CCCCcEEEEEeCCeeeecChhhhHHHHHHHHHHcCCEEEecCCCCCCCC-CCC-------------chHHHHHHHHHHHH
Confidence 346899999966443322222 224677889999999999999975443 222 26677666666665
Q ss_pred hh---cCCCCCCcEEEEccchhHHHHHHHHHhCCC----eEEEEEeeccccccc
Q psy3685 841 TL---YSLPAHTKWIAFGGSYPGALAAWLRYKYPH----LVHGAMSASGPLRAV 887 (1042)
Q Consensus 841 ~~---~~~~~~~~~i~~G~Sygg~la~~~~~~yP~----~~~g~i~ssa~~~~~ 887 (1042)
.. ++.+. .+++++|.|-||.||+++.+.--+ ...+.+..++.+...
T Consensus 142 ~~~~~~g~dp-~~i~v~GdSAGG~La~~~a~~~~~~~~~~p~~~~li~P~~d~~ 194 (312)
T COG0657 142 ANAAELGIDP-SRIAVAGDSAGGHLALALALAARDRGLPLPAAQVLISPLLDLT 194 (312)
T ss_pred hhhHhhCCCc-cceEEEecCcccHHHHHHHHHHHhcCCCCceEEEEEecccCCc
Confidence 43 44433 479999999999999998865443 355666666555443
No 138
>cd00312 Esterase_lipase Esterases and lipases (includes fungal lipases, cholinesterases, etc.) These enzymes act on carboxylic esters (EC: 3.1.1.-). The catalytic apparatus involves three residues (catalytic triad): a serine, a glutamate or aspartate and a histidine.These catalytic residues are responsible for the nucleophilic attack on the carbonyl carbon atom of the ester bond. In contrast with other alpha/beta hydrolase fold family members, p-nitrobenzyl esterase and acetylcholine esterase have a Glu instead of Asp at the active site carboxylate.
Probab=97.15 E-value=0.0012 Score=78.67 Aligned_cols=112 Identities=21% Similarity=0.192 Sum_probs=72.7
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcC-CeEEEEeee----ecccccCCCCCCcccccCCCHHHHHhHHHHHHH
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHN-ALAVQVEHR----FYGKSHPLSDLSVESLQYLSAEQALADLAYFIT 837 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~-a~~~~~ehR----~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~ 837 (1042)
...|||||++|-|-..+.... .....++.+.+ ..||.+..| +|+.+. +. -.....+|.|....++
T Consensus 93 ~~~pv~v~ihGG~~~~g~~~~-~~~~~~~~~~~~~~vv~~~yRlg~~g~~~~~---~~------~~~~n~g~~D~~~al~ 162 (493)
T cd00312 93 NSLPVMVWIHGGGFMFGSGSL-YPGDGLAREGDNVIVVSINYRLGVLGFLSTG---DI------ELPGNYGLKDQRLALK 162 (493)
T ss_pred CCCCEEEEEcCCccccCCCCC-CChHHHHhcCCCEEEEEecccccccccccCC---CC------CCCcchhHHHHHHHHH
Confidence 456999999664322221111 13456676655 899999999 222221 10 1223346777777777
Q ss_pred HHhhh---cCCCCCCcEEEEccchhHHHHHHHHHh--CCCeEEEEEeeccccc
Q psy3685 838 SMNTL---YSLPAHTKWIAFGGSYPGALAAWLRYK--YPHLVHGAMSASGPLR 885 (1042)
Q Consensus 838 ~~~~~---~~~~~~~~~i~~G~Sygg~la~~~~~~--yP~~~~g~i~ssa~~~ 885 (1042)
+++.. ++.++ .+|.++|+|.||.+++++... .+.++.++|+.|+++.
T Consensus 163 wv~~~i~~fggd~-~~v~~~G~SaG~~~~~~~~~~~~~~~lf~~~i~~sg~~~ 214 (493)
T cd00312 163 WVQDNIAAFGGDP-DSVTIFGESAGGASVSLLLLSPDSKGLFHRAISQSGSAL 214 (493)
T ss_pred HHHHHHHHhCCCc-ceEEEEeecHHHHHhhhHhhCcchhHHHHHHhhhcCCcc
Confidence 77654 44433 489999999999999988875 4678999998887764
No 139
>KOG2100|consensus
Probab=97.10 E-value=0.0012 Score=81.12 Aligned_cols=117 Identities=16% Similarity=0.121 Sum_probs=75.8
Q ss_pred cEEEEE-CCCCCCcccc-cccccHHHHHHHcCCeEEEEeeeecccccCC-CCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 766 PIFLMI-GGEGEASAKW-MVEGTWLDYAHNHNALAVQVEHRFYGKSHPL-SDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 766 pi~~~~-gg~g~~~~~~-~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~-~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
|++|.+ ||+++....- ...++...++-..|+.|+.+|-||-|...+. ...-..+|+.. -+.|....++.+...
T Consensus 527 Pllv~~yGGP~sq~v~~~~~~~~~~~~~s~~g~~v~~vd~RGs~~~G~~~~~~~~~~lG~~----ev~D~~~~~~~~~~~ 602 (755)
T KOG2100|consen 527 PLLVVVYGGPGSQSVTSKFSVDWNEVVVSSRGFAVLQVDGRGSGGYGWDFRSALPRNLGDV----EVKDQIEAVKKVLKL 602 (755)
T ss_pred CEEEEecCCCCcceeeeeEEecHHHHhhccCCeEEEEEcCCCcCCcchhHHHHhhhhcCCc----chHHHHHHHHHHHhc
Confidence 688777 8887432211 1223444567788999999999987654331 00011223322 346666666666555
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCC-CeEEEEEeeccccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYP-HLVHGAMSASGPLRAV 887 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP-~~~~g~i~ssa~~~~~ 887 (1042)
. ..+..++.++|+||||-+++++...+| +++..++|-+++....
T Consensus 603 ~-~iD~~ri~i~GwSyGGy~t~~~l~~~~~~~fkcgvavaPVtd~~ 647 (755)
T KOG2100|consen 603 P-FIDRSRVAIWGWSYGGYLTLKLLESDPGDVFKCGVAVAPVTDWL 647 (755)
T ss_pred c-cccHHHeEEeccChHHHHHHHHhhhCcCceEEEEEEecceeeee
Confidence 4 334568999999999999999999999 6666667766555444
No 140
>PF03403 PAF-AH_p_II: Platelet-activating factor acetylhydrolase, isoform II; PDB: 3F98_B 3F97_B 3D59_A 3F96_A 3D5E_B 3F9C_A.
Probab=97.09 E-value=0.0012 Score=74.63 Aligned_cols=117 Identities=16% Similarity=0.122 Sum_probs=59.9
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccc--cCCCCCC-----------------------ccc
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKS--HPLSDLS-----------------------VES 819 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S--~p~~~~~-----------------------~~~ 819 (1042)
=|||||.+|.|+... .++.+..+||-. |+.|+++|||+--.+ .+..+-. .+.
T Consensus 100 ~PvvIFSHGlgg~R~--~yS~~~~eLAS~-GyVV~aieHrDgSa~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (379)
T PF03403_consen 100 FPVVIFSHGLGGSRT--SYSAICGELASH-GYVVAAIEHRDGSAPATYFMRDGSGAEVEPYVVEYLEEEWIPLRDFDPEE 176 (379)
T ss_dssp EEEEEEE--TT--TT--TTHHHHHHHHHT-T-EEEEE---SS-SSEEEE-SSHHHHHHT---------EEEE-----GGG
T ss_pred CCEEEEeCCCCcchh--hHHHHHHHHHhC-CeEEEEeccCCCceeEEEeccCCCccccccccccccccceeccccccchh
Confidence 489999999988765 577788899998 999999999974211 1101100 000
Q ss_pred ccCCC---HHHHHhHHHHHHHHHhhhcCCC-------------------CCCcEEEEccchhHHHHHHHHHhCCCeEEEE
Q psy3685 820 LQYLS---AEQALADLAYFITSMNTLYSLP-------------------AHTKWIAFGGSYPGALAAWLRYKYPHLVHGA 877 (1042)
Q Consensus 820 ~~~lt---~~q~l~D~~~~~~~~~~~~~~~-------------------~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~ 877 (1042)
..-+. +++-++|+...+..+....... +-.+++++|||+||+.|+....+-+. +.++
T Consensus 177 ~~~~R~~QL~~R~~Ei~~~l~~L~~i~~G~~~~~~l~~~~~l~~~~grlD~~~i~~~GHSFGGATa~~~l~~d~r-~~~~ 255 (379)
T PF03403_consen 177 EFELRNAQLRQRVAEIQFVLDALEEINSGDPVENVLPSSFDLSQFKGRLDLSRIGLAGHSFGGATALQALRQDTR-FKAG 255 (379)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHTT-----SS--SS-GGGGTT-EEEEEEEEEEETHHHHHHHHHHHH-TT---EE
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccCCccCHHHHhhhcchhheeeeecCchHHHHHHHHhhccC-cceE
Confidence 00111 1122555555555554311100 01359999999999999987766644 6666
Q ss_pred Eeeccccc
Q psy3685 878 MSASGPLR 885 (1042)
Q Consensus 878 i~ssa~~~ 885 (1042)
|+--+.+.
T Consensus 256 I~LD~W~~ 263 (379)
T PF03403_consen 256 ILLDPWMF 263 (379)
T ss_dssp EEES---T
T ss_pred EEeCCccc
Confidence 66555443
No 141
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.08 E-value=0.00026 Score=63.75 Aligned_cols=22 Identities=36% Similarity=0.756 Sum_probs=0.0
Q ss_pred cCCCCCccCCchhHHHHHHhhc
Q psy3685 373 CDLCSKEFSRKGNLTKHMKLHQ 394 (1042)
Q Consensus 373 C~~C~k~f~~~~~L~~H~~~h~ 394 (1042)
|..|+..|.+...|..||...+
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H 23 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKH 23 (100)
T ss_dssp ----------------------
T ss_pred cccccccccccccccccccccc
Confidence 4455555555555555554433
No 142
>COG0412 Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=97.08 E-value=0.0026 Score=67.05 Aligned_cols=124 Identities=14% Similarity=0.074 Sum_probs=82.8
Q ss_pred EEEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeee-cccccCCCCCCcccc----cCCCHHH
Q psy3685 753 RYFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRF-YGKSHPLSDLSVESL----QYLSAEQ 827 (1042)
Q Consensus 753 ry~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~-~G~S~p~~~~~~~~~----~~lt~~q 827 (1042)
-|+.....-.+- |+||++++-....+... .+...||.+ |+.+++.|.=+ -|.+.+..+....-. .-.+..+
T Consensus 16 ~~~a~P~~~~~~-P~VIv~hei~Gl~~~i~--~~a~rlA~~-Gy~v~~Pdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 91 (236)
T COG0412 16 AYLARPAGAGGF-PGVIVLHEIFGLNPHIR--DVARRLAKA-GYVVLAPDLYGRQGDPTDIEDEPAELETGLVERVDPAE 91 (236)
T ss_pred EEEecCCcCCCC-CEEEEEecccCCchHHH--HHHHHHHhC-CcEEEechhhccCCCCCcccccHHHHhhhhhccCCHHH
Confidence 455555443322 89999988776666322 345567776 99999998543 244444332111111 1245589
Q ss_pred HHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 828 ALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 828 ~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
.++|+...+.++...-.. ...++.++|.||||.+|..+..+.| .|.|+|+--+
T Consensus 92 ~~~d~~a~~~~L~~~~~~-~~~~ig~~GfC~GG~~a~~~a~~~~-~v~a~v~fyg 144 (236)
T COG0412 92 VLADIDAALDYLARQPQV-DPKRIGVVGFCMGGGLALLAATRAP-EVKAAVAFYG 144 (236)
T ss_pred HHHHHHHHHHHHHhCCCC-CCceEEEEEEcccHHHHHHhhcccC-CccEEEEecC
Confidence 999999999999876422 2347999999999999999999999 6666665443
No 143
>PF00975 Thioesterase: Thioesterase domain; InterPro: IPR001031 Thioesterase domains often occur integrated in or associated with peptide synthetases which are involved in the non-ribosomal synthesis of peptide antibiotics []. Thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates.; GO: 0016788 hydrolase activity, acting on ester bonds, 0009058 biosynthetic process; PDB: 2RON_A 2K2Q_B 3LCR_B 2HFJ_B 1MNQ_A 1MN6_B 1MNA_B 2HFK_B 2H7Y_B 2H7X_A ....
Probab=97.01 E-value=0.0016 Score=68.84 Aligned_cols=81 Identities=22% Similarity=0.199 Sum_probs=55.7
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHcCC---eEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNHNA---LAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a---~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
|||++-+|.|.... +..||+.+.. .|++++.++.+...| ..-++++.++++++-|.....
T Consensus 2 ~lf~~p~~gG~~~~-------y~~la~~l~~~~~~v~~i~~~~~~~~~~---------~~~si~~la~~y~~~I~~~~~- 64 (229)
T PF00975_consen 2 PLFCFPPAGGSASS-------YRPLARALPDDVIGVYGIEYPGRGDDEP---------PPDSIEELASRYAEAIRARQP- 64 (229)
T ss_dssp EEEEESSTTCSGGG-------GHHHHHHHTTTEEEEEEECSTTSCTTSH---------EESSHHHHHHHHHHHHHHHTS-
T ss_pred eEEEEcCCccCHHH-------HHHHHHhCCCCeEEEEEEecCCCCCCCC---------CCCCHHHHHHHHHHHhhhhCC-
Confidence 66666677664432 3456666554 489999888763322 235888888888877765432
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHH
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRY 868 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~ 868 (1042)
+.|++++|+|+||.||.-++.
T Consensus 65 -----~gp~~L~G~S~Gg~lA~E~A~ 85 (229)
T PF00975_consen 65 -----EGPYVLAGWSFGGILAFEMAR 85 (229)
T ss_dssp -----SSSEEEEEETHHHHHHHHHHH
T ss_pred -----CCCeeehccCccHHHHHHHHH
Confidence 239999999999999977663
No 144
>KOG4667|consensus
Probab=97.00 E-value=0.0023 Score=63.02 Aligned_cols=98 Identities=13% Similarity=0.096 Sum_probs=70.4
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
++..|+|+.+|.-+.-.......+...|+++ |..++-+|-||-|+|.- ..-.+|.+++ .+|+..+++++...
T Consensus 31 gs~e~vvlcHGfrS~Kn~~~~~~vA~~~e~~-gis~fRfDF~GnGeS~g--sf~~Gn~~~e-----adDL~sV~q~~s~~ 102 (269)
T KOG4667|consen 31 GSTEIVVLCHGFRSHKNAIIMKNVAKALEKE-GISAFRFDFSGNGESEG--SFYYGNYNTE-----ADDLHSVIQYFSNS 102 (269)
T ss_pred CCceEEEEeeccccccchHHHHHHHHHHHhc-CceEEEEEecCCCCcCC--ccccCcccch-----HHHHHHHHHHhccC
Confidence 3456889999986654433444444555555 99999999999999943 2223444443 29999999998763
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCC
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPH 872 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~ 872 (1042)
- ..--|++|||-||.++..++.||++
T Consensus 103 n----r~v~vi~gHSkGg~Vvl~ya~K~~d 128 (269)
T KOG4667|consen 103 N----RVVPVILGHSKGGDVVLLYASKYHD 128 (269)
T ss_pred c----eEEEEEEeecCccHHHHHHHHhhcC
Confidence 1 1123789999999999999999999
No 145
>PF12740 Chlorophyllase2: Chlorophyllase enzyme; InterPro: IPR010821 This family consists of several chlorophyllase proteins (3.1.1.14 from EC). Chlorophyllase (Chlase) is the first enzyme involved in chlorophyll degradation and catalyses the hydrolysis of the ester bond to yield chlorophyllide and phytol [, , ].; GO: 0047746 chlorophyllase activity, 0015996 chlorophyll catabolic process
Probab=96.99 E-value=0.0026 Score=66.41 Aligned_cols=103 Identities=17% Similarity=0.164 Sum_probs=69.7
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc-
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY- 843 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~- 843 (1042)
=||+|+++|.+ ....| ++.+...+|-- |..||+.+....+. + ...--+++++++|.++....
T Consensus 17 yPVv~f~~G~~-~~~s~-Ys~ll~hvASh-GyIVV~~d~~~~~~--~------------~~~~~~~~~~~vi~Wl~~~L~ 79 (259)
T PF12740_consen 17 YPVVLFLHGFL-LINSW-YSQLLEHVASH-GYIVVAPDLYSIGG--P------------DDTDEVASAAEVIDWLAKGLE 79 (259)
T ss_pred cCEEEEeCCcC-CCHHH-HHHHHHHHHhC-ceEEEEecccccCC--C------------CcchhHHHHHHHHHHHHhcch
Confidence 48999999998 44444 66667777764 99999999433211 1 11123456666666654422
Q ss_pred ---C---CCCCCcEEEEccchhHHHHHHHHHhC-----CCeEEEEEeeccccc
Q psy3685 844 ---S---LPAHTKWIAFGGSYPGALAAWLRYKY-----PHLVHGAMSASGPLR 885 (1042)
Q Consensus 844 ---~---~~~~~~~i~~G~Sygg~la~~~~~~y-----P~~~~g~i~ssa~~~ 885 (1042)
. .++-.++-+.|||-||-+|.-+.+.+ +..+.|+|+.. ||.
T Consensus 80 ~~l~~~v~~D~s~l~l~GHSrGGk~Af~~al~~~~~~~~~~~~ali~lD-PVd 131 (259)
T PF12740_consen 80 SKLPLGVKPDFSKLALAGHSRGGKVAFAMALGNASSSLDLRFSALILLD-PVD 131 (259)
T ss_pred hhccccccccccceEEeeeCCCCHHHHHHHhhhcccccccceeEEEEec-ccc
Confidence 1 12334799999999999999998888 67898998876 444
No 146
>PF06057 VirJ: Bacterial virulence protein (VirJ); InterPro: IPR010333 This entry contains several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localised to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium [].
Probab=96.97 E-value=0.0013 Score=64.71 Aligned_cols=96 Identities=18% Similarity=0.223 Sum_probs=70.8
Q ss_pred EEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCC
Q psy3685 767 IFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLP 846 (1042)
Q Consensus 767 i~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~ 846 (1042)
++|++-|.|.... ...++...|+++ |..||.++-+-|--+ --|.+|+.+|++.+|+.+...++.
T Consensus 4 ~~v~~SGDgGw~~--~d~~~a~~l~~~-G~~VvGvdsl~Yfw~------------~rtP~~~a~Dl~~~i~~y~~~w~~- 67 (192)
T PF06057_consen 4 LAVFFSGDGGWRD--LDKQIAEALAKQ-GVPVVGVDSLRYFWS------------ERTPEQTAADLARIIRHYRARWGR- 67 (192)
T ss_pred EEEEEeCCCCchh--hhHHHHHHHHHC-CCeEEEechHHHHhh------------hCCHHHHHHHHHHHHHHHHHHhCC-
Confidence 4566666665543 344566677777 999999986444333 237899999999999999998853
Q ss_pred CCCcEEEEccchhHHHHHHHHHhCCCeEEEEEee
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSA 880 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~s 880 (1042)
.+|||+|-|+|+.+.-..--+-|..+..-|+.
T Consensus 68 --~~vvLiGYSFGADvlP~~~nrLp~~~r~~v~~ 99 (192)
T PF06057_consen 68 --KRVVLIGYSFGADVLPFIYNRLPAALRARVAQ 99 (192)
T ss_pred --ceEEEEeecCCchhHHHHHhhCCHHHHhheeE
Confidence 58999999999988888777888665554433
No 147
>COG2945 Predicted hydrolase of the alpha/beta superfamily [General function prediction only]
Probab=96.83 E-value=0.0054 Score=59.72 Aligned_cols=110 Identities=17% Similarity=0.131 Sum_probs=73.9
Q ss_pred cCCCcEEEEECCCCCCcccccccccHHHHHH---HcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHH
Q psy3685 762 KHGGPIFLMIGGEGEASAKWMVEGTWLDYAH---NHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITS 838 (1042)
Q Consensus 762 ~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~---~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~ 838 (1042)
.+..||.|..+-- +..+.-.++-....+|+ +.|..++=++.|+.|.|.-.-|-... -++|.+..+++
T Consensus 25 ~~~~~iAli~HPH-Pl~gGtm~nkvv~~la~~l~~~G~atlRfNfRgVG~S~G~fD~GiG---------E~~Da~aaldW 94 (210)
T COG2945 25 TPAAPIALICHPH-PLFGGTMNNKVVQTLARALVKRGFATLRFNFRGVGRSQGEFDNGIG---------ELEDAAAALDW 94 (210)
T ss_pred CCCCceEEecCCC-ccccCccCCHHHHHHHHHHHhCCceEEeecccccccccCcccCCcc---------hHHHHHHHHHH
Confidence 3567887777432 11111122223333333 35999999999999999543222222 35899999999
Q ss_pred HhhhcCCCCCCcE-EEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 839 MNTLYSLPAHTKW-IAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 839 ~~~~~~~~~~~~~-i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
++... ++++. .+.|.|+||.+|+.++++-|+.. +.|+-++|+.
T Consensus 95 ~~~~h---p~s~~~~l~GfSFGa~Ia~~la~r~~e~~-~~is~~p~~~ 138 (210)
T COG2945 95 LQARH---PDSASCWLAGFSFGAYIAMQLAMRRPEIL-VFISILPPIN 138 (210)
T ss_pred HHhhC---CCchhhhhcccchHHHHHHHHHHhccccc-ceeeccCCCC
Confidence 99876 33455 78899999999999999998843 4555666666
No 148
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.82 E-value=0.00054 Score=43.19 Aligned_cols=23 Identities=39% Similarity=0.980 Sum_probs=12.1
Q ss_pred ccCCccccccCCchhHHHHhhhc
Q psy3685 586 YKCDLCSKKFSRKLNLTKHMKIH 608 (1042)
Q Consensus 586 ~~C~~C~k~F~~~~~L~~H~~~H 608 (1042)
|.|++|++.|.+...|+.|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 45555555555555555555554
No 149
>PRK07868 acyl-CoA synthetase; Validated
Probab=96.81 E-value=0.0077 Score=77.90 Aligned_cols=111 Identities=13% Similarity=0.106 Sum_probs=73.4
Q ss_pred CCCcEEEEECCCCCCccccccc---ccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVE---GTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~---~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
..+|.+|+++|.+.....|... .++..|+++ |..|+++| +|.|.+.+. . ...+.+..+.++..++..+
T Consensus 65 ~~~~plllvhg~~~~~~~~d~~~~~s~v~~L~~~-g~~v~~~d---~G~~~~~~~----~-~~~~l~~~i~~l~~~l~~v 135 (994)
T PRK07868 65 PVGPPVLMVHPMMMSADMWDVTRDDGAVGILHRA-GLDPWVID---FGSPDKVEG----G-MERNLADHVVALSEAIDTV 135 (994)
T ss_pred CCCCcEEEECCCCCCccceecCCcccHHHHHHHC-CCEEEEEc---CCCCChhHc----C-ccCCHHHHHHHHHHHHHHH
Confidence 3567888898876665555432 235556665 78999999 576643211 0 1245555555555555554
Q ss_pred hhhcCCCCCCcEEEEccchhHHHHHHHHHh-CCCeEEEEEeecccccc
Q psy3685 840 NTLYSLPAHTKWIAFGGSYPGALAAWLRYK-YPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 840 ~~~~~~~~~~~~i~~G~Sygg~la~~~~~~-yP~~~~g~i~ssa~~~~ 886 (1042)
+..- ..++.++|+|+||++|+.+... .|+.|.+.|..++|+..
T Consensus 136 ~~~~----~~~v~lvG~s~GG~~a~~~aa~~~~~~v~~lvl~~~~~d~ 179 (994)
T PRK07868 136 KDVT----GRDVHLVGYSQGGMFCYQAAAYRRSKDIASIVTFGSPVDT 179 (994)
T ss_pred HHhh----CCceEEEEEChhHHHHHHHHHhcCCCccceEEEEeccccc
Confidence 4332 1379999999999999887764 45689999988888754
No 150
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.81 E-value=0.00051 Score=43.31 Aligned_cols=24 Identities=33% Similarity=0.890 Sum_probs=20.3
Q ss_pred ccCCCCCCccCCcchHHHHHhhcc
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEIHQ 706 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~H~ 706 (1042)
|.|++|++.|.++.+|..|+++|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 689999999999999999999873
No 151
>PRK10439 enterobactin/ferric enterobactin esterase; Provisional
Probab=96.74 E-value=0.016 Score=66.38 Aligned_cols=52 Identities=15% Similarity=0.307 Sum_probs=41.4
Q ss_pred HHHHHHhhhcCCCC-CCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 834 YFITSMNTLYSLPA-HTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 834 ~~~~~~~~~~~~~~-~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
+++..+...|.... ...+++.|.||||..|.++.++||+.|.++++-|+.+.
T Consensus 272 eLlP~I~~~y~~~~d~~~~~IaG~S~GGl~AL~~al~~Pd~Fg~v~s~Sgs~w 324 (411)
T PRK10439 272 ELLPQVRAIAPFSDDADRTVVAGQSFGGLAALYAGLHWPERFGCVLSQSGSFW 324 (411)
T ss_pred HHHHHHHHhCCCCCCccceEEEEEChHHHHHHHHHHhCcccccEEEEecccee
Confidence 44455666665422 24689999999999999999999999999999997764
No 152
>PLN02733 phosphatidylcholine-sterol O-acyltransferase
Probab=96.72 E-value=0.0027 Score=72.63 Aligned_cols=88 Identities=10% Similarity=0.032 Sum_probs=64.3
Q ss_pred cHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHH
Q psy3685 786 TWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAW 865 (1042)
Q Consensus 786 ~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~ 865 (1042)
++..|++ .| -++..+.+++|-+.... ...++.++|++.+|+.+....+ ..|++++||||||.+|..
T Consensus 113 li~~L~~-~G-Y~~~~dL~g~gYDwR~~---------~~~~~~~~~Lk~lIe~~~~~~g---~~kV~LVGHSMGGlva~~ 178 (440)
T PLN02733 113 MIEQLIK-WG-YKEGKTLFGFGYDFRQS---------NRLPETMDGLKKKLETVYKASG---GKKVNIISHSMGGLLVKC 178 (440)
T ss_pred HHHHHHH-cC-CccCCCcccCCCCcccc---------ccHHHHHHHHHHHHHHHHHHcC---CCCEEEEEECHhHHHHHH
Confidence 3444555 35 45578899988764321 1256778999999998877653 358999999999999999
Q ss_pred HHHhCCCe----EEEEEeeccccccc
Q psy3685 866 LRYKYPHL----VHGAMSASGPLRAV 887 (1042)
Q Consensus 866 ~~~~yP~~----~~g~i~ssa~~~~~ 887 (1042)
|...+|+. |...|+.++|....
T Consensus 179 fl~~~p~~~~k~I~~~I~la~P~~Gs 204 (440)
T PLN02733 179 FMSLHSDVFEKYVNSWIAIAAPFQGA 204 (440)
T ss_pred HHHHCCHhHHhHhccEEEECCCCCCC
Confidence 99999975 56667777776543
No 153
>KOG4627|consensus
Probab=96.72 E-value=0.0037 Score=61.01 Aligned_cols=103 Identities=20% Similarity=0.219 Sum_probs=67.3
Q ss_pred cCCCcEEEEE-CCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHh
Q psy3685 762 KHGGPIFLMI-GGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMN 840 (1042)
Q Consensus 762 ~~~~pi~~~~-gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~ 840 (1042)
....|+|+|+ ||.+..... ...-.+..-|.+.|+.|+.+ +||.+. +.-|.+|.+.|+.+++.++.
T Consensus 64 ~~~~klfIfIHGGYW~~g~r-k~clsiv~~a~~~gY~vasv---gY~l~~----------q~htL~qt~~~~~~gv~fil 129 (270)
T KOG4627|consen 64 TNQAKLFIFIHGGYWQEGDR-KMCLSIVGPAVRRGYRVASV---GYNLCP----------QVHTLEQTMTQFTHGVNFIL 129 (270)
T ss_pred CCCccEEEEEecchhhcCch-hcccchhhhhhhcCeEEEEe---ccCcCc----------ccccHHHHHHHHHHHHHHHH
Confidence 3457899999 776544321 11112445566668777775 466652 23488999999999999988
Q ss_pred hhcCCCCCCcEEE-EccchhHHHHHH--HHHhCCCeEEEEEeecc
Q psy3685 841 TLYSLPAHTKWIA-FGGSYPGALAAW--LRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 841 ~~~~~~~~~~~i~-~G~Sygg~la~~--~~~~yP~~~~g~i~ssa 882 (1042)
+.+. ++++|+ -|||-|+-||+. +|++.|- |+|++.+++
T Consensus 130 k~~~---n~k~l~~gGHSaGAHLa~qav~R~r~pr-I~gl~l~~G 170 (270)
T KOG4627|consen 130 KYTE---NTKVLTFGGHSAGAHLAAQAVMRQRSPR-IWGLILLCG 170 (270)
T ss_pred Hhcc---cceeEEEcccchHHHHHHHHHHHhcCch-HHHHHHHhh
Confidence 8773 345555 567999999866 5555554 566665553
No 154
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.71 E-value=0.00054 Score=44.68 Aligned_cols=26 Identities=42% Similarity=0.772 Sum_probs=24.0
Q ss_pred cccCCCCCCccCCcchHHHHHhhccC
Q psy3685 682 RYKCDLCSKDFSRKDNLTEHKEIHQG 707 (1042)
Q Consensus 682 ~~~C~~C~k~F~~~~~L~~H~~~H~g 707 (1042)
||.|+.|++.|.+..+|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 68999999999999999999998854
No 155
>PF01738 DLH: Dienelactone hydrolase family; InterPro: IPR002925 Dienelactone hydrolases play a crucial role in chlorocatechol degradation via the modified ortho cleavage pathway. Enzymes induced in 4-fluorobenzoate-utilizing bacteria have been classified into three groups on the basis of their specificity towards cis- and trans-dienelactone []. Some proteins contain repeated small fragments of this domain (for example rat kan-1 protein).; GO: 0016787 hydrolase activity; PDB: 1GGV_A 1ZIY_A 1ZI6_A 1ZIC_A 1ZJ5_A 1ZI8_A 1ZJ4_A 1ZI9_A 1ZIX_A 3F67_A.
Probab=96.70 E-value=0.0022 Score=67.22 Aligned_cols=113 Identities=19% Similarity=0.168 Sum_probs=69.8
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccc-cCCCCCC-cccccCC---CHHHHHhHHHHHHHH
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKS-HPLSDLS-VESLQYL---SAEQALADLAYFITS 838 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S-~p~~~~~-~~~~~~l---t~~q~l~D~~~~~~~ 838 (1042)
..|.||++++-....+. ...+...||++ |..|++.|. |.|.. .|.+... ...+..+ ..+++++|+...+.+
T Consensus 13 ~~~~Vvv~~d~~G~~~~--~~~~ad~lA~~-Gy~v~~pD~-f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~~ 88 (218)
T PF01738_consen 13 PRPAVVVIHDIFGLNPN--IRDLADRLAEE-GYVVLAPDL-FGGRGAPPSDPEEAFAAMRELFAPRPEQVAADLQAAVDY 88 (218)
T ss_dssp SEEEEEEE-BTTBS-HH--HHHHHHHHHHT-T-EEEEE-C-CCCTS--CCCHHCHHHHHHHCHHHSHHHHHHHHHHHHHH
T ss_pred CCCEEEEEcCCCCCchH--HHHHHHHHHhc-CCCEEeccc-ccCCCCCccchhhHHHHHHHHHhhhHHHHHHHHHHHHHH
Confidence 35778888766554431 12345567776 999999995 44444 2322111 1122222 267899999999999
Q ss_pred HhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 839 MNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 839 ~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
++..-.. ...+|.++|.||||.+|..+.... ..+.|+|+.-+
T Consensus 89 l~~~~~~-~~~kig~vGfc~GG~~a~~~a~~~-~~~~a~v~~yg 130 (218)
T PF01738_consen 89 LRAQPEV-DPGKIGVVGFCWGGKLALLLAARD-PRVDAAVSFYG 130 (218)
T ss_dssp HHCTTTC-EEEEEEEEEETHHHHHHHHHHCCT-TTSSEEEEES-
T ss_pred HHhcccc-CCCcEEEEEEecchHHhhhhhhhc-cccceEEEEcC
Confidence 8876422 235899999999999999999887 45666666554
No 156
>KOG1282|consensus
Probab=96.67 E-value=0.023 Score=64.78 Aligned_cols=137 Identities=19% Similarity=0.214 Sum_probs=86.9
Q ss_pred cceEEEEEecccccC-CCcEEEEE-CCCCCCc---------ccccc-cc---cHHHHHHHcCCeEEEEeee-ecccccCC
Q psy3685 749 TWKQRYFVNSTWYKH-GGPIFLMI-GGEGEAS---------AKWMV-EG---TWLDYAHNHNALAVQVEHR-FYGKSHPL 812 (1042)
Q Consensus 749 tf~qry~~~~~~~~~-~~pi~~~~-gg~g~~~---------~~~~~-~~---~~~~la~~~~a~~~~~ehR-~~G~S~p~ 812 (1042)
.-..-||+-..--.| ..|+||++ ||+|-.+ ++.+. +| .....+....|+|++||.+ |.|-|+-.
T Consensus 56 ~~~LFYwf~eS~~~P~~dPlvLWLnGGPGCSSl~G~~~E~GPf~v~~~G~tL~~N~ySWnk~aNiLfLd~PvGvGFSYs~ 135 (454)
T KOG1282|consen 56 GRQLFYWFFESENNPETDPLVLWLNGGPGCSSLGGLFEENGPFRVKYNGKTLYLNPYSWNKEANILFLDQPVGVGFSYSN 135 (454)
T ss_pred CceEEEEEEEccCCCCCCCEEEEeCCCCCccchhhhhhhcCCeEEcCCCCcceeCCccccccccEEEEecCCcCCccccC
Confidence 344556655443444 48999999 9997432 11111 11 1223456667999999996 78888642
Q ss_pred CCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhH----HHHHHHHHhC-----CC-eEEEEEeecc
Q psy3685 813 SDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPG----ALAAWLRYKY-----PH-LVHGAMSASG 882 (1042)
Q Consensus 813 ~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg----~la~~~~~~y-----P~-~~~g~i~ssa 882 (1042)
.. ..+. .+.+.+..|+-+|+...-.+++--.+.++.+.|-|||| +||..+.... |. -+.|.+..-+
T Consensus 136 ~~---~~~~-~~D~~~A~d~~~FL~~wf~kfPey~~~~fyI~GESYAG~YVP~La~~I~~~N~~~~~~~iNLkG~~IGNg 211 (454)
T KOG1282|consen 136 TS---SDYK-TGDDGTAKDNYEFLQKWFEKFPEYKSNDFYIAGESYAGHYVPALAQEILKGNKKCCKPNINLKGYAIGNG 211 (454)
T ss_pred CC---CcCc-CCcHHHHHHHHHHHHHHHHhChhhcCCCeEEecccccceehHHHHHHHHhccccccCCcccceEEEecCc
Confidence 21 1111 46778889999998877667654455689999999999 7887777654 32 3556665555
Q ss_pred ccccccC
Q psy3685 883 PLRAVVD 889 (1042)
Q Consensus 883 ~~~~~~~ 889 (1042)
.+....|
T Consensus 212 ~td~~~~ 218 (454)
T KOG1282|consen 212 LTDPEID 218 (454)
T ss_pred ccCcccc
Confidence 5544433
No 157
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.63 E-value=0.00092 Score=43.58 Aligned_cols=25 Identities=36% Similarity=0.701 Sum_probs=16.5
Q ss_pred cccCCccccccCCchhHHHHhhhcC
Q psy3685 585 RYKCDLCSKKFSRKLNLTKHMKIHQ 609 (1042)
Q Consensus 585 ~~~C~~C~k~F~~~~~L~~H~~~H~ 609 (1042)
||.|+.|++.|.....|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4666666666666666666666664
No 158
>PF03096 Ndr: Ndr family; InterPro: IPR004142 This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted []. The precise molecular and cellular function of members of this family is still unknown, yet they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (Common sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases (IPR000073 from INTERPRO), suggesting that this family may have an enzymatic function.; PDB: 2QMQ_A 2XMR_B 2XMQ_B 2XMS_A.
Probab=96.50 E-value=0.0087 Score=63.20 Aligned_cols=110 Identities=17% Similarity=0.143 Sum_probs=71.7
Q ss_pred CCcEEEEECCCCCCccc----ccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 764 GGPIFLMIGGEGEASAK----WMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~----~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
+.|+||-.+.-|-.... +.+..-+.++.+ +..++=+|-+|...--|. + .++..|.|+||..+++.+++.++
T Consensus 22 ~kp~ilT~HDvGlNh~scF~~ff~~~~m~~i~~--~f~i~Hi~aPGqe~ga~~--~-p~~y~yPsmd~LAe~l~~Vl~~f 96 (283)
T PF03096_consen 22 NKPAILTYHDVGLNHKSCFQGFFNFEDMQEILQ--NFCIYHIDAPGQEEGAAT--L-PEGYQYPSMDQLAEMLPEVLDHF 96 (283)
T ss_dssp TS-EEEEE--TT--HHHHCHHHHCSHHHHHHHT--TSEEEEEE-TTTSTT--------TT-----HHHHHCTHHHHHHHH
T ss_pred CCceEEEeccccccchHHHHHHhcchhHHHHhh--ceEEEEEeCCCCCCCccc--c-cccccccCHHHHHHHHHHHHHhC
Confidence 47888888888754322 223344566666 468999999988654331 1 24578999999999999999988
Q ss_pred hhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 840 NTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 840 ~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.-+ .||-||-..|+.|=+.|+++||+.|.|.|..+....
T Consensus 97 ~lk-------~vIg~GvGAGAnIL~rfAl~~p~~V~GLiLvn~~~~ 135 (283)
T PF03096_consen 97 GLK-------SVIGFGVGAGANILARFALKHPERVLGLILVNPTCT 135 (283)
T ss_dssp T----------EEEEEETHHHHHHHHHHHHSGGGEEEEEEES---S
T ss_pred Ccc-------EEEEEeeccchhhhhhccccCccceeEEEEEecCCC
Confidence 764 499999999999999999999999999998875543
No 159
>PF06342 DUF1057: Alpha/beta hydrolase of unknown function (DUF1057); InterPro: IPR010463 This entry consists of proteins of unknown function which have an alpha/beta hydrolase fold.
Probab=96.49 E-value=0.038 Score=57.73 Aligned_cols=98 Identities=14% Similarity=0.144 Sum_probs=70.8
Q ss_pred EEE-EECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCC
Q psy3685 767 IFL-MIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSL 845 (1042)
Q Consensus 767 i~~-~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~ 845 (1042)
.+| +-|-+|+... |.+ +...-.+.|-.+|.+-.+|+|.+...++++..| ++-+.|+.++.....+
T Consensus 37 TVv~~hGsPGSH~D-FkY---i~~~l~~~~iR~I~iN~PGf~~t~~~~~~~~~n----------~er~~~~~~ll~~l~i 102 (297)
T PF06342_consen 37 TVVAFHGSPGSHND-FKY---IRPPLDEAGIRFIGINYPGFGFTPGYPDQQYTN----------EERQNFVNALLDELGI 102 (297)
T ss_pred eEEEecCCCCCccc-hhh---hhhHHHHcCeEEEEeCCCCCCCCCCCcccccCh----------HHHHHHHHHHHHHcCC
Confidence 344 4466677654 333 556666679999999999999997666555444 5666777777777665
Q ss_pred CCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 846 PAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 846 ~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
. .++|.+|||.|+..|+.++...| +.|++..++
T Consensus 103 ~--~~~i~~gHSrGcenal~la~~~~--~~g~~lin~ 135 (297)
T PF06342_consen 103 K--GKLIFLGHSRGCENALQLAVTHP--LHGLVLINP 135 (297)
T ss_pred C--CceEEEEeccchHHHHHHHhcCc--cceEEEecC
Confidence 4 48999999999999999999998 335444333
No 160
>COG1770 PtrB Protease II [Amino acid transport and metabolism]
Probab=96.39 E-value=0.0072 Score=69.78 Aligned_cols=142 Identities=17% Similarity=0.190 Sum_probs=92.9
Q ss_pred hhhhccc-CCCCCcccceEEEEEecc---------cc------cCCCcEEEEE-CCCCCCcccccccccHHHHHHHcCCe
Q psy3685 735 FEQKLDH-FNIEEPRTWKQRYFVNST---------WY------KHGGPIFLMI-GGEGEASAKWMVEGTWLDYAHNHNAL 797 (1042)
Q Consensus 735 f~q~~dh-f~~~~~~tf~qry~~~~~---------~~------~~~~pi~~~~-gg~g~~~~~~~~~~~~~~la~~~~a~ 797 (1042)
.+|+|-= ||+ ..-+.+|.|+... .| .+.+|++||. |-.|...+.... .....|.+ .+.
T Consensus 404 kqqeV~~g~dp--~~Y~s~riwa~a~dgv~VPVSLvyrkd~~~~g~~p~lLygYGaYG~s~~p~Fs-~~~lSLlD--RGf 478 (682)
T COG1770 404 KQQEVPGGFDP--EDYVSRRIWATADDGVQVPVSLVYRKDTKLDGSAPLLLYGYGAYGISMDPSFS-IARLSLLD--RGF 478 (682)
T ss_pred EeccCCCCCCh--hHeEEEEEEEEcCCCcEeeEEEEEecccCCCCCCcEEEEEeccccccCCcCcc-cceeeeec--Cce
Confidence 6677654 664 3467899998721 12 2457899999 666765443332 22344555 478
Q ss_pred EEEEee-eecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEE
Q psy3685 798 AVQVEH-RFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHG 876 (1042)
Q Consensus 798 ~~~~eh-R~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g 876 (1042)
|+++=| ||=|.= - ..-.++=+.|+-.....|+.+..++|..+- ......+++.|||.||+|.....-.-|+++.|
T Consensus 479 iyAIAHVRGGgel-G--~~WYe~GK~l~K~NTf~DFIa~a~~Lv~~g-~~~~~~i~a~GGSAGGmLmGav~N~~P~lf~~ 554 (682)
T COG1770 479 VYAIAHVRGGGEL-G--RAWYEDGKLLNKKNTFTDFIAAARHLVKEG-YTSPDRIVAIGGSAGGMLMGAVANMAPDLFAG 554 (682)
T ss_pred EEEEEEeeccccc-C--hHHHHhhhhhhccccHHHHHHHHHHHHHcC-cCCccceEEeccCchhHHHHHHHhhChhhhhh
Confidence 999998 553321 1 111233344555556677777777765542 22334799999999999999999999999999
Q ss_pred EEeeccccc
Q psy3685 877 AMSASGPLR 885 (1042)
Q Consensus 877 ~i~ssa~~~ 885 (1042)
+||-++-|.
T Consensus 555 iiA~VPFVD 563 (682)
T COG1770 555 IIAQVPFVD 563 (682)
T ss_pred eeecCCccc
Confidence 999886664
No 161
>COG4099 Predicted peptidase [General function prediction only]
Probab=96.38 E-value=0.02 Score=59.42 Aligned_cols=46 Identities=20% Similarity=0.242 Sum_probs=37.8
Q ss_pred HHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 838 SMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 838 ~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.+...|+.+. +++.++|-|.||+-+-.+.++|||.++|||..++.-
T Consensus 259 vlas~ynID~-sRIYviGlSrG~~gt~al~~kfPdfFAaa~~iaG~~ 304 (387)
T COG4099 259 VLASTYNIDR-SRIYVIGLSRGGFGTWALAEKFPDFFAAAVPIAGGG 304 (387)
T ss_pred HHhhccCccc-ceEEEEeecCcchhhHHHHHhCchhhheeeeecCCC
Confidence 4555676644 689999999999999999999999999999776543
No 162
>KOG1553|consensus
Probab=96.35 E-value=0.022 Score=60.05 Aligned_cols=79 Identities=20% Similarity=0.276 Sum_probs=52.6
Q ss_pred ccHHHHHHHcCCeEEEEeeeeccccc--CCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHH
Q psy3685 785 GTWLDYAHNHNALAVQVEHRFYGKSH--PLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGAL 862 (1042)
Q Consensus 785 ~~~~~la~~~~a~~~~~ehR~~G~S~--p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~l 862 (1042)
|.+..-++ .|+.|+.|.|+|||.|. |++. ....|++-+.+|+... .+.+ ..-+|++|.|-||.-
T Consensus 259 G~m~tP~~-lgYsvLGwNhPGFagSTG~P~p~---------n~~nA~DaVvQfAI~~---Lgf~-~edIilygWSIGGF~ 324 (517)
T KOG1553|consen 259 GVMNTPAQ-LGYSVLGWNHPGFAGSTGLPYPV---------NTLNAADAVVQFAIQV---LGFR-QEDIILYGWSIGGFP 324 (517)
T ss_pred eeecChHH-hCceeeccCCCCccccCCCCCcc---------cchHHHHHHHHHHHHH---cCCC-ccceEEEEeecCCch
Confidence 44433343 49999999999999994 3321 1223444445554332 2222 246999999999999
Q ss_pred HHHHHHhCCCeEEEEE
Q psy3685 863 AAWLRYKYPHLVHGAM 878 (1042)
Q Consensus 863 a~~~~~~yP~~~~g~i 878 (1042)
++|.+.-||| |.|.|
T Consensus 325 ~~waAs~YPd-Vkavv 339 (517)
T KOG1553|consen 325 VAWAASNYPD-VKAVV 339 (517)
T ss_pred HHHHhhcCCC-ceEEE
Confidence 9999999999 44444
No 163
>PRK05371 x-prolyl-dipeptidyl aminopeptidase; Provisional
Probab=96.33 E-value=0.0099 Score=73.49 Aligned_cols=87 Identities=14% Similarity=-0.008 Sum_probs=65.4
Q ss_pred HHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCC-C------------CCCcEEEEc
Q psy3685 789 DYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSL-P------------AHTKWIAFG 855 (1042)
Q Consensus 789 ~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~-~------------~~~~~i~~G 855 (1042)
.++.+ |++||+.|-||.|.|.-.. ..+ ..+..+|....|+++...... . .+.+|.++|
T Consensus 274 ~~~~r-GYaVV~~D~RGtg~SeG~~-------~~~-~~~E~~D~~~vIeWl~~~~~~~~d~~~~~~~kq~WsnGkVGm~G 344 (767)
T PRK05371 274 YFLPR-GFAVVYVSGIGTRGSDGCP-------TTG-DYQEIESMKAVIDWLNGRATAYTDRTRGKEVKADWSNGKVAMTG 344 (767)
T ss_pred HHHhC-CeEEEEEcCCCCCCCCCcC-------ccC-CHHHHHHHHHHHHHHhhCCccccccccccccccCCCCCeeEEEE
Confidence 35555 9999999999999994321 111 245668999999998743110 0 135899999
Q ss_pred cchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 856 GSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 856 ~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.||||.++.+.+...|..+.++|+.++..
T Consensus 345 ~SY~G~~~~~aAa~~pp~LkAIVp~a~is 373 (767)
T PRK05371 345 KSYLGTLPNAVATTGVEGLETIIPEAAIS 373 (767)
T ss_pred EcHHHHHHHHHHhhCCCcceEEEeeCCCC
Confidence 99999999999999999999999876553
No 164
>PF05728 UPF0227: Uncharacterised protein family (UPF0227); InterPro: IPR008886 Despite being classed as uncharacterised proteins, the members of this family are almost certainly enzymes in that they contain a domain distantly related to IPR000073 from INTERPRO. One of the members of this family YqiA has been shown to be a esterase []. Other members, which include the Escherichia coli (strain K12) YcfP protein are uncharacterised.
Probab=96.32 E-value=0.0094 Score=59.97 Aligned_cols=42 Identities=26% Similarity=0.407 Sum_probs=33.0
Q ss_pred CHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC
Q psy3685 824 SAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH 872 (1042)
Q Consensus 824 t~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~ 872 (1042)
+.++|++.+..+|..... ..|+++|.|+||..|+|++.+|+-
T Consensus 41 ~p~~a~~~l~~~i~~~~~-------~~~~liGSSlGG~~A~~La~~~~~ 82 (187)
T PF05728_consen 41 FPEEAIAQLEQLIEELKP-------ENVVLIGSSLGGFYATYLAERYGL 82 (187)
T ss_pred CHHHHHHHHHHHHHhCCC-------CCeEEEEEChHHHHHHHHHHHhCC
Confidence 567787777777765432 239999999999999999999963
No 165
>COG2819 Predicted hydrolase of the alpha/beta superfamily [General function prediction only]
Probab=96.31 E-value=0.01 Score=61.77 Aligned_cols=48 Identities=17% Similarity=0.144 Sum_probs=39.4
Q ss_pred HhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 839 MNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 839 ~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
+...|.... .+-.+||+||||.++....++||+.+..+++.|+.+++.
T Consensus 128 Ie~~y~~~~-~~~~i~GhSlGGLfvl~aLL~~p~~F~~y~~~SPSlWw~ 175 (264)
T COG2819 128 IEARYRTNS-ERTAIIGHSLGGLFVLFALLTYPDCFGRYGLISPSLWWH 175 (264)
T ss_pred HhcccccCc-ccceeeeecchhHHHHHHHhcCcchhceeeeecchhhhC
Confidence 344454433 468999999999999999999999999999999888764
No 166
>PF12715 Abhydrolase_7: Abhydrolase family; PDB: 3NUZ_C 3G8Y_A.
Probab=96.22 E-value=0.016 Score=63.47 Aligned_cols=127 Identities=15% Similarity=0.145 Sum_probs=66.0
Q ss_pred EEEEecccccCCCcEEEEECCCCCCccc--------------cc--ccccHHHHHHHcCCeEEEEeeeecccccCCCCCC
Q psy3685 753 RYFVNSTWYKHGGPIFLMIGGEGEASAK--------------WM--VEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLS 816 (1042)
Q Consensus 753 ry~~~~~~~~~~~pi~~~~gg~g~~~~~--------------~~--~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~ 816 (1042)
=|++-.+-.++.-|.||+++|-|..... +. ..++-.+||++ |..|+++|-+++|+..+.....
T Consensus 103 aylLvPd~~~~p~PAVL~lHgHg~~Ke~~~g~~gv~~~~~~~~~~~~~~~g~~LAk~-GYVvla~D~~g~GER~~~e~~~ 181 (390)
T PF12715_consen 103 AYLLVPDGAKGPFPAVLCLHGHGGGKEKMAGEDGVSPDLKDDYDDPKQDYGDQLAKR-GYVVLAPDALGFGERGDMEGAA 181 (390)
T ss_dssp EEEEEETT--S-EEEEEEE--TT--HHHHCT---SSGCG--STTSTTT-HHHHHHTT-TSEEEEE--TTSGGG-SSCCCT
T ss_pred EEEEecCCCCCCCCEEEEeCCCCCCcccccCCcccccccchhhccccccHHHHHHhC-CCEEEEEccccccccccccccc
Confidence 3444444435556899999776543211 11 11235678887 9999999999999987654322
Q ss_pred c-ccccC-------------CCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 817 V-ESLQY-------------LSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 817 ~-~~~~~-------------lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
. .+..+ +....+.+|+ ..++++...-.. +..++.++|.|+||..+.|+...-|.+ .++|+++.
T Consensus 182 ~~~~~~~~~la~~~l~lG~S~~G~~~~ddm-r~lDfL~slpeV-D~~RIG~~GfSmGg~~a~~LaALDdRI-ka~v~~~~ 258 (390)
T PF12715_consen 182 QGSNYDCQALARNLLMLGRSLAGLMAWDDM-RALDFLASLPEV-DPDRIGCMGFSMGGYRAWWLAALDDRI-KATVANGY 258 (390)
T ss_dssp TTTS--HHHHHHHHHHTT--HHHHHHHHHH-HHHHHHCT-TTE-EEEEEEEEEEGGGHHHHHHHHHH-TT---EEEEES-
T ss_pred cccchhHHHHHHHHHHcCcCHHHHHHHHHH-HHHHHHhcCccc-CccceEEEeecccHHHHHHHHHcchhh-HhHhhhhh
Confidence 1 01000 0111222222 344555443211 234799999999999988888888775 66666654
Q ss_pred c
Q psy3685 883 P 883 (1042)
Q Consensus 883 ~ 883 (1042)
.
T Consensus 259 l 259 (390)
T PF12715_consen 259 L 259 (390)
T ss_dssp B
T ss_pred h
Confidence 3
No 167
>PF01674 Lipase_2: Lipase (class 2); InterPro: IPR002918 Lipases or triacylglycerol acylhydrolases hydrolyse ester bonds in triacylglycerol giving diacylglycerol, monoacylglycerol, glycerol and free fatty acids []. This group of lipases has been called class 2 as they are not clearly related to other lipase families, and includes LipA and LipB from Bacillus subtilis [] and uncharacterised proteins from Caenorhabditis.; PDB: 2VTV_B 2X76_A 2X5X_A 2QXU_A 3QMM_A 1I6W_A 3D2C_J 2QXT_B 1R50_A 1T2N_A ....
Probab=96.18 E-value=0.016 Score=59.69 Aligned_cols=92 Identities=20% Similarity=0.228 Sum_probs=48.7
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCe---EEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNAL---AVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~---~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
.||||+-|..+.....|.. +...|+.+ |+. |+++.. |.....+. ..+.. .+.+. +..++.||..+..
T Consensus 2 ~PVVlVHG~~~~~~~~w~~--~~~~l~~~-GY~~~~vya~ty---g~~~~~~~--~~~~~-~~~~~-~~~l~~fI~~Vl~ 71 (219)
T PF01674_consen 2 RPVVLVHGTGGNAYSNWST--LAPYLKAA-GYCDSEVYALTY---GSGNGSPS--VQNAH-MSCES-AKQLRAFIDAVLA 71 (219)
T ss_dssp --EEEE--TTTTTCGGCCH--HHHHHHHT-T--CCCEEEE-----S-CCHHTH--HHHHH-B-HHH-HHHHHHHHHHHHH
T ss_pred CCEEEECCCCcchhhCHHH--HHHHHHHc-CCCcceeEeccC---CCCCCCCc--ccccc-cchhh-HHHHHHHHHHHHH
Confidence 5766555544434454543 23445544 655 677763 33221110 11111 23333 4899999999876
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHHHhC
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLRYKY 870 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~~~y 870 (1042)
.- +. +|=++|||+||+||.|+.+..
T Consensus 72 ~T---Ga-kVDIVgHS~G~~iaR~yi~~~ 96 (219)
T PF01674_consen 72 YT---GA-KVDIVGHSMGGTIARYYIKGG 96 (219)
T ss_dssp HH---T---EEEEEETCHHHHHHHHHHHC
T ss_pred hh---CC-EEEEEEcCCcCHHHHHHHHHc
Confidence 54 33 899999999999999998754
No 168
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.16 E-value=0.0033 Score=40.28 Aligned_cols=25 Identities=40% Similarity=0.777 Sum_probs=22.8
Q ss_pred ccCCCCCCccCCcchHHHHHhhccC
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEIHQG 707 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~H~g 707 (1042)
|.|+.|+++|....+|..|+++|..
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~~ 25 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHXX 25 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhcc
Confidence 6899999999999999999998853
No 169
>KOG1515|consensus
Probab=96.15 E-value=0.016 Score=63.66 Aligned_cols=110 Identities=15% Similarity=0.117 Sum_probs=73.6
Q ss_pred CCCcEEEEECCCCCCcc---cccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 763 HGGPIFLMIGGEGEASA---KWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~---~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
..-|+|||.+|-|-+.. .-.+.++...+|.+.++.||.+|.|=-=+. |++ . +.+|...-+.++
T Consensus 88 ~~~p~lvyfHGGGf~~~S~~~~~y~~~~~~~a~~~~~vvvSVdYRLAPEh-~~P---a----------~y~D~~~Al~w~ 153 (336)
T KOG1515|consen 88 TKLPVLVYFHGGGFCLGSANSPAYDSFCTRLAAELNCVVVSVDYRLAPEH-PFP---A----------AYDDGWAALKWV 153 (336)
T ss_pred cCceEEEEEeCCccEeCCCCCchhHHHHHHHHHHcCeEEEecCcccCCCC-CCC---c----------cchHHHHHHHHH
Confidence 45789999955443222 223566788899999999999998874332 222 1 445555555555
Q ss_pred hhh----cCCCCCCcEEEEccchhHHHHHHHHHh------CCCeEEEEEeeccccccc
Q psy3685 840 NTL----YSLPAHTKWIAFGGSYPGALAAWLRYK------YPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 840 ~~~----~~~~~~~~~i~~G~Sygg~la~~~~~~------yP~~~~g~i~ssa~~~~~ 887 (1042)
... +.. +-.++++.|-|-||+||+.+.++ -|-.+.|.|...+..++.
T Consensus 154 ~~~~~~~~~~-D~~rv~l~GDSaGGNia~~va~r~~~~~~~~~ki~g~ili~P~~~~~ 210 (336)
T KOG1515|consen 154 LKNSWLKLGA-DPSRVFLAGDSAGGNIAHVVAQRAADEKLSKPKIKGQILIYPFFQGT 210 (336)
T ss_pred HHhHHHHhCC-CcccEEEEccCccHHHHHHHHHHHhhccCCCcceEEEEEEecccCCC
Confidence 442 222 22479999999999999887653 246688998888666654
No 170
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.15 E-value=0.0037 Score=45.85 Aligned_cols=33 Identities=21% Similarity=0.389 Sum_probs=24.7
Q ss_pred cCCccccCCCCCCccCCcchHHHHHhhccCCCC
Q psy3685 678 HQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRV 710 (1042)
Q Consensus 678 ~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~ 710 (1042)
..+.|-.|++|+..+++..+|++|++++++.+|
T Consensus 20 ~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred ccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 346789999999999999999999999999886
No 171
>KOG2931|consensus
Probab=95.93 E-value=0.053 Score=56.59 Aligned_cols=109 Identities=13% Similarity=0.085 Sum_probs=78.7
Q ss_pred CCcEEEEECCCCCCccc----ccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 764 GGPIFLMIGGEGEASAK----WMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~----~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
++|+|+-.++-|-.... +.+..-+.++.+. +-++-++-+|.=.--|. + .++..|.|+|+..+|+..++.++
T Consensus 45 ~kpaiiTyhDlglN~~scFq~ff~~p~m~ei~~~--fcv~HV~~PGqe~gAp~--~-p~~y~yPsmd~LAd~l~~VL~~f 119 (326)
T KOG2931|consen 45 NKPAIITYHDLGLNHKSCFQGFFNFPDMAEILEH--FCVYHVDAPGQEDGAPS--F-PEGYPYPSMDDLADMLPEVLDHF 119 (326)
T ss_pred CCceEEEecccccchHhHhHHhhcCHhHHHHHhh--eEEEecCCCccccCCcc--C-CCCCCCCCHHHHHHHHHHHHHhc
Confidence 47786666777754433 2222335555554 67888887776433221 1 24567899999999999998887
Q ss_pred hhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 840 NTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 840 ~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.-+ -+|-||-.-|+.|-+.|++++|+.|-|.|..+..-
T Consensus 120 ~lk-------~vIg~GvGAGAyIL~rFAl~hp~rV~GLvLIn~~~ 157 (326)
T KOG2931|consen 120 GLK-------SVIGMGVGAGAYILARFALNHPERVLGLVLINCDP 157 (326)
T ss_pred Ccc-------eEEEecccccHHHHHHHHhcChhheeEEEEEecCC
Confidence 653 49999999999999999999999999999876544
No 172
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.91 E-value=0.0031 Score=64.83 Aligned_cols=24 Identities=38% Similarity=0.861 Sum_probs=14.3
Q ss_pred CCccccCC--CCcccCCchhHHHHHH
Q psy3685 502 GIRYKCDL--CSKEFSSKGNLTEHMK 525 (1042)
Q Consensus 502 ~~~~~C~~--C~~~f~~~~~L~~H~~ 525 (1042)
++||+|++ |+|++++..-|+.|+.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~l 372 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHML 372 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhh
Confidence 45666654 6666666666666654
No 173
>COG3571 Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]
Probab=95.91 E-value=0.065 Score=50.41 Aligned_cols=106 Identities=17% Similarity=0.203 Sum_probs=65.7
Q ss_pred EEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeee-----cccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 767 IFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRF-----YGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 767 i~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~-----~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
+||+.+|.|.....-..+.....||.+ |.+|+-+|..| +|.-+|.+...+-|-. +...|..+..
T Consensus 16 tilLaHGAGasmdSt~m~~~a~~la~~-G~~vaRfefpYma~Rrtg~rkPp~~~~t~~~~----------~~~~~aql~~ 84 (213)
T COG3571 16 TILLAHGAGASMDSTSMTAVAAALARR-GWLVARFEFPYMAARRTGRRKPPPGSGTLNPE----------YIVAIAQLRA 84 (213)
T ss_pred EEEEecCCCCCCCCHHHHHHHHHHHhC-ceeEEEeecchhhhccccCCCCcCccccCCHH----------HHHHHHHHHh
Confidence 667779998765544444555667776 78888888655 4655554433222211 1122222332
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
. + ...|.|+=|.||||-+|+.....---.|+|.+.-+=|+.+
T Consensus 85 ~--l-~~gpLi~GGkSmGGR~aSmvade~~A~i~~L~clgYPfhp 126 (213)
T COG3571 85 G--L-AEGPLIIGGKSMGGRVASMVADELQAPIDGLVCLGYPFHP 126 (213)
T ss_pred c--c-cCCceeeccccccchHHHHHHHhhcCCcceEEEecCccCC
Confidence 2 2 2248999999999999988875444448888888777643
No 174
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.87 E-value=0.0033 Score=64.70 Aligned_cols=67 Identities=24% Similarity=0.496 Sum_probs=37.7
Q ss_pred CCccccCc--cCcccCCchhHHHHHHHhc-Cc---ccccCcCCcccCCchhhhhHhhhhcCCcccCCccccccCCchhHH
Q psy3685 529 GIRYKCDL--CSKEFSRKDNLTEHKEIHQ-GI---RYKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKKFSRKLNLT 602 (1042)
Q Consensus 529 ~~~~~C~~--C~k~f~~~~~L~~H~~~H~-~~---~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~ 602 (1042)
++||+|++ |.|.+++...|+-|+.--+ .. +-.=++=-..| .-..+||.|++|+|.|+....|+
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F-----------~~~~KPYrCevC~KRYKNlNGLK 415 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF-----------SAKDKPYRCEVCDKRYKNLNGLK 415 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc-----------cccCCceeccccchhhccCccce
Confidence 45777765 7777777777777765321 10 00000000111 11234788888888888888888
Q ss_pred HHhh
Q psy3685 603 KHMK 606 (1042)
Q Consensus 603 ~H~~ 606 (1042)
-|++
T Consensus 416 YHr~ 419 (423)
T COG5189 416 YHRK 419 (423)
T ss_pred eccc
Confidence 7754
No 175
>COG4757 Predicted alpha/beta hydrolase [General function prediction only]
Probab=95.83 E-value=0.018 Score=57.63 Aligned_cols=92 Identities=20% Similarity=0.238 Sum_probs=63.7
Q ss_pred EEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCC
Q psy3685 768 FLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPA 847 (1042)
Q Consensus 768 ~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~ 847 (1042)
++.-|+.|....++. . +.++|...|..|+..|.|+-|.|.|.. ++...++|+ |=|..|+...++.++... +
T Consensus 33 ~~va~a~Gv~~~fYR--r-fA~~a~~~Gf~Vlt~dyRG~g~S~p~~-~~~~~~~~~--DwA~~D~~aal~~~~~~~---~ 103 (281)
T COG4757 33 LVVAGATGVGQYFYR--R-FAAAAAKAGFEVLTFDYRGIGQSRPAS-LSGSQWRYL--DWARLDFPAALAALKKAL---P 103 (281)
T ss_pred EEecccCCcchhHhH--H-HHHHhhccCceEEEEecccccCCCccc-cccCccchh--hhhhcchHHHHHHHHhhC---C
Confidence 444466665444333 2 345555669999999999999999963 333345555 457788888888888754 3
Q ss_pred CCcEEEEccchhHHHHHHHHH
Q psy3685 848 HTKWIAFGGSYPGALAAWLRY 868 (1042)
Q Consensus 848 ~~~~i~~G~Sygg~la~~~~~ 868 (1042)
.-|...+||||||-+...+.+
T Consensus 104 ~~P~y~vgHS~GGqa~gL~~~ 124 (281)
T COG4757 104 GHPLYFVGHSFGGQALGLLGQ 124 (281)
T ss_pred CCceEEeeccccceeeccccc
Confidence 358999999999977665543
No 176
>PF06821 Ser_hydrolase: Serine hydrolase; InterPro: IPR010662 This family contains a number of hypothetical bacterial proteins of unknown function, which may be cytosolic. The Crystal Structure Of The Yden Gene Product Swiss:P96671 from B. Subtilis has been solved. The structure shows an alpha-beta hydrolase fold suggesting an enzymatic function for these proteins [].; GO: 0016787 hydrolase activity; PDB: 3BDV_B 2QS9_A 1UXO_A.
Probab=95.81 E-value=0.064 Score=53.34 Aligned_cols=91 Identities=15% Similarity=0.082 Sum_probs=59.0
Q ss_pred EEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCC
Q psy3685 768 FLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPA 847 (1042)
Q Consensus 768 ~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~ 847 (1042)
|++++|.++... ..++.-|++++... +-++.... .. -|+.+++..+.......
T Consensus 1 v~IvhG~~~s~~----~HW~~wl~~~l~~~-~~V~~~~~--~~-------------------P~~~~W~~~l~~~i~~~- 53 (171)
T PF06821_consen 1 VLIVHGYGGSPP----DHWQPWLERQLENS-VRVEQPDW--DN-------------------PDLDEWVQALDQAIDAI- 53 (171)
T ss_dssp EEEE--TTSSTT----TSTHHHHHHHHTTS-EEEEEC----TS---------------------HHHHHHHHHHCCHC--
T ss_pred CEEeCCCCCCCc----cHHHHHHHHhCCCC-eEEecccc--CC-------------------CCHHHHHHHHHHHHhhc-
Confidence 577888876533 23445677776555 55554332 11 35667777777765432
Q ss_pred CCcEEEEccchhHHHHHHHH-HhCCCeEEEEEeeccccc
Q psy3685 848 HTKWIAFGGSYPGALAAWLR-YKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 848 ~~~~i~~G~Sygg~la~~~~-~~yP~~~~g~i~ssa~~~ 885 (1042)
+.++|++|||.|...++.+. ...+..|.|++..|++-.
T Consensus 54 ~~~~ilVaHSLGc~~~l~~l~~~~~~~v~g~lLVAp~~~ 92 (171)
T PF06821_consen 54 DEPTILVAHSLGCLTALRWLAEQSQKKVAGALLVAPFDP 92 (171)
T ss_dssp TTTEEEEEETHHHHHHHHHHHHTCCSSEEEEEEES--SC
T ss_pred CCCeEEEEeCHHHHHHHHHHhhcccccccEEEEEcCCCc
Confidence 35799999999999998888 889999999999998765
No 177
>COG3458 Acetyl esterase (deacetylase) [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=95.59 E-value=0.067 Score=55.14 Aligned_cols=104 Identities=18% Similarity=0.140 Sum_probs=62.8
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccC---CCCCC--------cccc---cCCCHHHHH
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHP---LSDLS--------VESL---QYLSAEQAL 829 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p---~~~~~--------~~~~---~~lt~~q~l 829 (1042)
.-|.||--+|.+...+.|-. +..||-. |..+++.+-||=|.|.. -+-.+ ..++ ..+-..-..
T Consensus 82 ~~P~vV~fhGY~g~~g~~~~---~l~wa~~-Gyavf~MdvRGQg~~~~dt~~~p~~~s~pG~mtrGilD~kd~yyyr~v~ 157 (321)
T COG3458 82 KLPAVVQFHGYGGRGGEWHD---MLHWAVA-GYAVFVMDVRGQGSSSQDTADPPGGPSDPGFMTRGILDRKDTYYYRGVF 157 (321)
T ss_pred ccceEEEEeeccCCCCCccc---ccccccc-ceeEEEEecccCCCccccCCCCCCCCcCCceeEeecccCCCceEEeeeh
Confidence 35766655666554443332 4456655 89999999999998833 11111 0111 111123345
Q ss_pred hHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC
Q psy3685 830 ADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH 872 (1042)
Q Consensus 830 ~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~ 872 (1042)
.|++..+..+......+ ..++.+.|+|-||.||+.....-|-
T Consensus 158 ~D~~~ave~~~sl~~vd-e~Ri~v~G~SqGGglalaaaal~~r 199 (321)
T COG3458 158 LDAVRAVEILASLDEVD-EERIGVTGGSQGGGLALAAAALDPR 199 (321)
T ss_pred HHHHHHHHHHhccCccc-hhheEEeccccCchhhhhhhhcChh
Confidence 67777776666554332 3589999999999999876665554
No 178
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.38 E-value=0.0095 Score=38.09 Aligned_cols=19 Identities=42% Similarity=0.831 Sum_probs=7.0
Q ss_pred CCccccccCCchhHHHHhh
Q psy3685 588 CDLCSKKFSRKLNLTKHMK 606 (1042)
Q Consensus 588 C~~C~k~F~~~~~L~~H~~ 606 (1042)
|+.|++.|.....|..|++
T Consensus 3 C~~C~~~f~~~~~l~~H~~ 21 (26)
T smart00355 3 CPECGKVFKSKSALKEHMR 21 (26)
T ss_pred CCCCcchhCCHHHHHHHHH
Confidence 3333333333333333333
No 179
>COG3208 GrsT Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=95.38 E-value=0.023 Score=58.14 Aligned_cols=58 Identities=16% Similarity=0.074 Sum_probs=36.3
Q ss_pred CeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcC-CCCCCcEEEEccchhHHHHHHHH
Q psy3685 796 ALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYS-LPAHTKWIAFGGSYPGALAAWLR 867 (1042)
Q Consensus 796 a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~-~~~~~~~i~~G~Sygg~la~~~~ 867 (1042)
-.+++++.+|.|.=.-. -++.|++.++..+..+.. ...+.||.+|||||||+||=-++
T Consensus 34 iel~avqlPGR~~r~~e--------------p~~~di~~Lad~la~el~~~~~d~P~alfGHSmGa~lAfEvA 92 (244)
T COG3208 34 IELLAVQLPGRGDRFGE--------------PLLTDIESLADELANELLPPLLDAPFALFGHSMGAMLAFEVA 92 (244)
T ss_pred hheeeecCCCcccccCC--------------cccccHHHHHHHHHHHhccccCCCCeeecccchhHHHHHHHH
Confidence 36788888887753211 133444444444444433 23457999999999999995544
No 180
>COG2939 Carboxypeptidase C (cathepsin A) [Amino acid transport and metabolism]
Probab=95.34 E-value=0.017 Score=65.13 Aligned_cols=99 Identities=22% Similarity=0.209 Sum_probs=61.1
Q ss_pred CCcEEEEE-CCCCCCcc---------cccccc---cH--HHHHHHcCCeEEEEe-eeecccccCCCCCCcccccCCCHHH
Q psy3685 764 GGPIFLMI-GGEGEASA---------KWMVEG---TW--LDYAHNHNALAVQVE-HRFYGKSHPLSDLSVESLQYLSAEQ 827 (1042)
Q Consensus 764 ~~pi~~~~-gg~g~~~~---------~~~~~~---~~--~~la~~~~a~~~~~e-hR~~G~S~p~~~~~~~~~~~lt~~q 827 (1042)
..||+|++ ||+|..+. .-+..+ .. ......-.|.+|+|| .-+.|-|.-..+- .-.+.+-
T Consensus 100 ~rPvi~wlNGGPGcSS~~g~l~elGP~rI~~~~~P~~~~NP~SW~~~adLvFiDqPvGTGfS~a~~~e-----~~~d~~~ 174 (498)
T COG2939 100 NRPVIFWLNGGPGCSSVTGLLGELGPKRIQSGTSPSYPDNPGSWLDFADLVFIDQPVGTGFSRALGDE-----KKKDFEG 174 (498)
T ss_pred CCceEEEecCCCChHhhhhhhhhcCCeeeeCCCCCCCCCCccccccCCceEEEecCcccCcccccccc-----cccchhc
Confidence 47888888 99975432 111111 11 233444578999999 5688888642111 2234555
Q ss_pred HHhHHHHHHHHHhhhcCCCCC--CcEEEEccchhHHHHHHHH
Q psy3685 828 ALADLAYFITSMNTLYSLPAH--TKWIAFGGSYPGALAAWLR 867 (1042)
Q Consensus 828 ~l~D~~~~~~~~~~~~~~~~~--~~~i~~G~Sygg~la~~~~ 867 (1042)
|=+|+..|.+.+-..+....+ .||+|+|-||||.-++.++
T Consensus 175 ~~~D~~~~~~~f~~~fp~~~r~~~~~~L~GESYgg~yip~~A 216 (498)
T COG2939 175 AGKDVYSFLRLFFDKFPHYARLLSPKFLAGESYGGHYIPVFA 216 (498)
T ss_pred cchhHHHHHHHHHHHHHHHhhhcCceeEeeccccchhhHHHH
Confidence 667777776665544422122 4899999999998776665
No 181
>KOG1283|consensus
Probab=95.33 E-value=0.054 Score=56.79 Aligned_cols=105 Identities=22% Similarity=0.194 Sum_probs=72.1
Q ss_pred CCCcEEEEE-CCCCCCcccccc---c------ccHHHHHHHcCCeEEEEeee-ecccccCCCCCCcccccCC-CHHHHHh
Q psy3685 763 HGGPIFLMI-GGEGEASAKWMV---E------GTWLDYAHNHNALAVQVEHR-FYGKSHPLSDLSVESLQYL-SAEQALA 830 (1042)
Q Consensus 763 ~~~pi~~~~-gg~g~~~~~~~~---~------~~~~~la~~~~a~~~~~ehR-~~G~S~p~~~~~~~~~~~l-t~~q~l~ 830 (1042)
...|.+|.+ ||+|+.+..+-+ . +-...+.....|.++++|-+ +-|-|+-.+. -.|. +.+|+..
T Consensus 29 s~~pl~lwlqGgpGaSstG~GNFeE~GPl~~~~~~r~~TWlk~adllfvDnPVGaGfSyVdg~-----~~Y~~~~~qia~ 103 (414)
T KOG1283|consen 29 SERPLALWLQGGPGASSTGFGNFEELGPLDLDGSPRDWTWLKDADLLFVDNPVGAGFSYVDGS-----SAYTTNNKQIAL 103 (414)
T ss_pred cCCCeeEEecCCCCCCCcCccchhhcCCcccCCCcCCchhhhhccEEEecCCCcCceeeecCc-----ccccccHHHHHH
Confidence 567999988 999876654322 1 22333455567889999975 6676654321 1233 7899999
Q ss_pred HHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC
Q psy3685 831 DLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH 872 (1042)
Q Consensus 831 D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~ 872 (1042)
|+.++++.+-....--...|..+|--||||-+|+-+.+---+
T Consensus 104 Dl~~llk~f~~~h~e~~t~P~~If~ESYGGKma~k~al~l~~ 145 (414)
T KOG1283|consen 104 DLVELLKGFFTNHPEFKTVPLYIFCESYGGKMAAKFALELDD 145 (414)
T ss_pred HHHHHHHHHHhcCccccccceEEEEhhcccchhhhhhhhHHH
Confidence 999998876544333345699999999999999988764433
No 182
>COG2272 PnbA Carboxylesterase type B [Lipid metabolism]
Probab=95.20 E-value=0.031 Score=62.99 Aligned_cols=110 Identities=21% Similarity=0.196 Sum_probs=67.1
Q ss_pred CcEEEEECCC----CCCcccccccccHHHHHHHcCCeEEEEeeee----c--ccccCCCCCCcccccCCCHHHHHhHHHH
Q psy3685 765 GPIFLMIGGE----GEASAKWMVEGTWLDYAHNHNALAVQVEHRF----Y--GKSHPLSDLSVESLQYLSAEQALADLAY 834 (1042)
Q Consensus 765 ~pi~~~~gg~----g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~----~--G~S~p~~~~~~~~~~~lt~~q~l~D~~~ 834 (1042)
.||+|||+|= |+.+.. .+. -..||++-+..||.+.+|= | ..+.-..+-..+| -.|.|...
T Consensus 94 ~PVmV~IHGG~y~~Gs~s~~-~yd--gs~La~~g~vVvVSvNYRLG~lGfL~~~~~~~~~~~~~n-------~Gl~Dqil 163 (491)
T COG2272 94 LPVMVYIHGGGYIMGSGSEP-LYD--GSALAARGDVVVVSVNYRLGALGFLDLSSLDTEDAFASN-------LGLLDQIL 163 (491)
T ss_pred CcEEEEEeccccccCCCccc-ccC--hHHHHhcCCEEEEEeCcccccceeeehhhcccccccccc-------ccHHHHHH
Confidence 5999999553 222221 233 3478998669999999982 1 1111111111122 35677777
Q ss_pred HHHHHhhh---cCCCCCCcEEEEccchhHHHHHHHHH--hCCCeEEEEEeeccccc
Q psy3685 835 FITSMNTL---YSLPAHTKWIAFGGSYPGALAAWLRY--KYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 835 ~~~~~~~~---~~~~~~~~~i~~G~Sygg~la~~~~~--~yP~~~~g~i~ssa~~~ 885 (1042)
-+++++.. |+.++ ..|-|||.|-||+.++++.. ....++.-||+-|++..
T Consensus 164 ALkWV~~NIe~FGGDp-~NVTl~GeSAGa~si~~Lla~P~AkGLF~rAi~~Sg~~~ 218 (491)
T COG2272 164 ALKWVRDNIEAFGGDP-QNVTLFGESAGAASILTLLAVPSAKGLFHRAIALSGAAS 218 (491)
T ss_pred HHHHHHHHHHHhCCCc-cceEEeeccchHHHHHHhhcCccchHHHHHHHHhCCCCC
Confidence 77777654 45544 36999999999988777543 33345666777777665
No 183
>PF10340 DUF2424: Protein of unknown function (DUF2424); InterPro: IPR019436 Sterol homeostasis in eukaryotic cells relies on the reciprocal interconversion of free sterols and steryl esters. In Saccharomyces cerevisiae (Baker's yeast) sterol acetylation requires the acetyltransferase Atf2, whereas deacetylation requires Say1, a membrane-anchored deacetylase with a putative active site in the ER lumen. Lack of Say1 results in the secretion of acetylated sterols into the culture medium, indicating that the substrate specificity of Say1 determines whether acetylated sterols are secreted from the cells or whether they are deacetylated and retained. In S. cerevisiae cells lacking Say1 or Atf2 are sensitive against the plant-derived allylbenzene eugenol and both Say1 and Atf2 affect pregnenolone toxicity, indicating that lipid acetylation acts as a detoxification pathway []. Homologues of Say1 are present in the mammalian genome and can functionally substitute for Say1 in yeast demonstrating that part of this pathway has been evolutionarily conserved [].
Probab=95.18 E-value=0.12 Score=57.28 Aligned_cols=121 Identities=17% Similarity=0.165 Sum_probs=63.1
Q ss_pred eEEEEEecc--cccC-CCcEEEEECCCCCCcccc-cccccHHHHHHHc-CCeEEEEeeeecc---cccCCCCCCcccccC
Q psy3685 751 KQRYFVNST--WYKH-GGPIFLMIGGEGEASAKW-MVEGTWLDYAHNH-NALAVQVEHRFYG---KSHPLSDLSVESLQY 822 (1042)
Q Consensus 751 ~qry~~~~~--~~~~-~~pi~~~~gg~g~~~~~~-~~~~~~~~la~~~-~a~~~~~ehR~~G---~S~p~~~~~~~~~~~ 822 (1042)
+|-||+-.. -.+| ..|||+|++|=|-.-... ..-.++..+...+ ...++++|.--.. .+.+++
T Consensus 105 ~~s~Wlvk~P~~~~pk~DpVlIYlHGGGY~l~~~p~qi~~L~~i~~~l~~~SILvLDYsLt~~~~~~~~yP--------- 175 (374)
T PF10340_consen 105 SQSYWLVKAPNRFKPKSDPVLIYLHGGGYFLGTTPSQIEFLLNIYKLLPEVSILVLDYSLTSSDEHGHKYP--------- 175 (374)
T ss_pred cceEEEEeCCcccCCCCCcEEEEEcCCeeEecCCHHHHHHHHHHHHHcCCCeEEEEeccccccccCCCcCc---------
Confidence 355777652 3444 479999997744221100 0001122222221 4577888854322 221221
Q ss_pred CCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH--hCCCe---EEEEEeeccccccc
Q psy3685 823 LSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY--KYPHL---VHGAMSASGPLRAV 887 (1042)
Q Consensus 823 lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~--~yP~~---~~g~i~ssa~~~~~ 887 (1042)
- .|.++++..+++.+.. +...++|+|-|.||.||.-+.+ +.++. =.++|+.|+-|...
T Consensus 176 ---t-QL~qlv~~Y~~Lv~~~---G~~nI~LmGDSAGGnL~Ls~LqyL~~~~~~~~Pk~~iLISPWv~l~ 238 (374)
T PF10340_consen 176 ---T-QLRQLVATYDYLVESE---GNKNIILMGDSAGGNLALSFLQYLKKPNKLPYPKSAILISPWVNLV 238 (374)
T ss_pred ---h-HHHHHHHHHHHHHhcc---CCCeEEEEecCccHHHHHHHHHHHhhcCCCCCCceeEEECCCcCCc
Confidence 1 2233333334433222 2357999999999999976653 22111 24789999888766
No 184
>PRK04940 hypothetical protein; Provisional
Probab=95.11 E-value=0.062 Score=53.02 Aligned_cols=87 Identities=14% Similarity=0.180 Sum_probs=51.6
Q ss_pred EEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCC
Q psy3685 767 IFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLP 846 (1042)
Q Consensus 767 i~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~ 846 (1042)
+|||++|..+.... +. .-|.++. .+ ..+-|+. +.| -.+.++|++-+...|..+... +
T Consensus 1 ~IlYlHGF~SS~~S----~~--~Ka~~l~-~~-~p~~~~~--~l~----------~~~P~~a~~~l~~~i~~~~~~-~-- 57 (180)
T PRK04940 1 MIIYLHGFDSTSPG----NH--EKVLQLQ-FI-DPDVRLI--SYS----------TLHPKHDMQHLLKEVDKMLQL-S-- 57 (180)
T ss_pred CEEEeCCCCCCCCc----cH--HHHHhhe-ee-CCCCeEE--ECC----------CCCHHHHHHHHHHHHHHhhhc-c--
Confidence 58999998765331 11 2333433 11 3333443 222 135677777666666543221 0
Q ss_pred CCCcEEEEccchhHHHHHHHHHhCCCeEEEEE
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRYKYPHLVHGAM 878 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i 878 (1042)
...+++++|.|.||..|.|++.+|- +.+++
T Consensus 58 ~~~~~~liGSSLGGyyA~~La~~~g--~~aVL 87 (180)
T PRK04940 58 DDERPLICGVGLGGYWAERIGFLCG--IRQVI 87 (180)
T ss_pred CCCCcEEEEeChHHHHHHHHHHHHC--CCEEE
Confidence 1137999999999999999999986 34444
No 185
>TIGR01839 PHA_synth_II poly(R)-hydroxyalkanoic acid synthase, class II. This model represents the class II subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs, typically with six to fourteen carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=95.04 E-value=0.44 Score=55.68 Aligned_cols=87 Identities=11% Similarity=0.067 Sum_probs=66.0
Q ss_pred HHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHH-
Q psy3685 787 WLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAW- 865 (1042)
Q Consensus 787 ~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~- 865 (1042)
+..+.-+.|..|+.++-|.-|.+ .++++.+.-++.+.+-|+.++..- +..++.++|.++||.|++.
T Consensus 239 lVr~lv~qG~~VflIsW~nP~~~----------~r~~~ldDYv~~i~~Ald~V~~~t---G~~~vnl~GyC~GGtl~a~~ 305 (560)
T TIGR01839 239 FVQYCLKNQLQVFIISWRNPDKA----------HREWGLSTYVDALKEAVDAVRAIT---GSRDLNLLGACAGGLTCAAL 305 (560)
T ss_pred HHHHHHHcCCeEEEEeCCCCChh----------hcCCCHHHHHHHHHHHHHHHHHhc---CCCCeeEEEECcchHHHHHH
Confidence 44444455999999998884432 367777777777777777776653 3358999999999998886
Q ss_pred ---HHHhCCC-eEEEEEeecccccc
Q psy3685 866 ---LRYKYPH-LVHGAMSASGPLRA 886 (1042)
Q Consensus 866 ---~~~~yP~-~~~g~i~ssa~~~~ 886 (1042)
+..++|+ .|..++...+|+..
T Consensus 306 ~a~~aA~~~~~~V~sltllatplDf 330 (560)
T TIGR01839 306 VGHLQALGQLRKVNSLTYLVSLLDS 330 (560)
T ss_pred HHHHHhcCCCCceeeEEeeeccccc
Confidence 8899996 69998888888864
No 186
>COG3319 Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=95.02 E-value=0.065 Score=56.58 Aligned_cols=58 Identities=17% Similarity=0.183 Sum_probs=41.9
Q ss_pred CeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH
Q psy3685 796 ALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY 868 (1042)
Q Consensus 796 a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~ 868 (1042)
..++.++-|++|.-.+ ..-++|+.++.+..-|+.+. +..|++|.|.|+||++|.-++.
T Consensus 27 ~~v~~l~a~g~~~~~~---------~~~~l~~~a~~yv~~Ir~~Q------P~GPy~L~G~S~GG~vA~evA~ 84 (257)
T COG3319 27 LPVYGLQAPGYGAGEQ---------PFASLDDMAAAYVAAIRRVQ------PEGPYVLLGWSLGGAVAFEVAA 84 (257)
T ss_pred ceeeccccCccccccc---------ccCCHHHHHHHHHHHHHHhC------CCCCEEEEeeccccHHHHHHHH
Confidence 4588888888873211 23378888888887777654 3359999999999999976653
No 187
>PRK10252 entF enterobactin synthase subunit F; Provisional
Probab=95.02 E-value=0.082 Score=71.41 Aligned_cols=97 Identities=10% Similarity=-0.019 Sum_probs=65.7
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHc--CCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNH--NALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~--~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+|-+++++|.|..... +..+++.+ +..|+.++.+++|.+.+. ..++++..+|++..+..+.
T Consensus 1068 ~~~l~~lh~~~g~~~~------~~~l~~~l~~~~~v~~~~~~g~~~~~~~---------~~~l~~la~~~~~~i~~~~-- 1130 (1296)
T PRK10252 1068 GPTLFCFHPASGFAWQ------FSVLSRYLDPQWSIYGIQSPRPDGPMQT---------ATSLDEVCEAHLATLLEQQ-- 1130 (1296)
T ss_pred CCCeEEecCCCCchHH------HHHHHHhcCCCCcEEEEECCCCCCCCCC---------CCCHHHHHHHHHHHHHhhC--
Confidence 4445566555543322 23444433 578999999999865332 2378888888888877542
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHh---CCCeEEEEEeecc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYK---YPHLVHGAMSASG 882 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~---yP~~~~g~i~ssa 882 (1042)
...|++++|+|+||++|..++.+ .|+.+...+...+
T Consensus 1131 ----~~~p~~l~G~S~Gg~vA~e~A~~l~~~~~~v~~l~l~~~ 1169 (1296)
T PRK10252 1131 ----PHGPYHLLGYSLGGTLAQGIAARLRARGEEVAFLGLLDT 1169 (1296)
T ss_pred ----CCCCEEEEEechhhHHHHHHHHHHHHcCCceeEEEEecC
Confidence 12489999999999999998875 5777877665443
No 188
>PF00135 COesterase: Carboxylesterase family The prints entry is specific to acetylcholinesterase; InterPro: IPR002018 Higher eukaryotes have many distinct esterases. Among the different types are those which act on carboxylic esters (3.1.1 from EC). Carboxyl-esterases have been classified into three categories (A, B and C) on the basis of differential patterns of inhibition by organophosphates. The sequence of a number of type-B carboxylesterases indicates [, , ] that the majority are evolutionary related. As is the case for lipases and serine proteases, the catalytic apparatus of esterases involves three residues (catalytic triad): a serine, a glutamate or aspartate and a histidine.; PDB: 3B3Q_A 1CLE_B 1GQS_A 2VJD_A 1HBJ_A 2C5G_A 1U65_A 2WG1_A 1FSS_A 3M3D_A ....
Probab=94.92 E-value=0.041 Score=66.40 Aligned_cols=112 Identities=20% Similarity=0.153 Sum_probs=63.3
Q ss_pred CcEEEEECCCCCCccccc-ccccHHHHHHHcCCeEEEEeee----ecccccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 765 GPIFLMIGGEGEASAKWM-VEGTWLDYAHNHNALAVQVEHR----FYGKSHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~-~~~~~~~la~~~~a~~~~~ehR----~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
-||+|||+|-|-..+.-. .......++...+..||.+-.| ||-.+ ++.... +...+|-|...-++++
T Consensus 125 lPV~v~ihGG~f~~G~~~~~~~~~~~~~~~~~vivVt~nYRlg~~Gfl~~---~~~~~~-----~gN~Gl~Dq~~AL~WV 196 (535)
T PF00135_consen 125 LPVMVWIHGGGFMFGSGSFPPYDGASLAASKDVIVVTINYRLGAFGFLSL---GDLDAP-----SGNYGLLDQRLALKWV 196 (535)
T ss_dssp EEEEEEE--STTTSSCTTSGGGHTHHHHHHHTSEEEEE----HHHHH-BS---SSTTSH-----BSTHHHHHHHHHHHHH
T ss_pred cceEEEeecccccCCCcccccccccccccCCCEEEEEecccccccccccc---cccccC-----chhhhhhhhHHHHHHH
Confidence 599999966443322210 1111234555669999999999 22211 111111 1234666777667776
Q ss_pred hhh---cCCCCCCcEEEEccchhHHHHHHHHHh--CCCeEEEEEeeccccc
Q psy3685 840 NTL---YSLPAHTKWIAFGGSYPGALAAWLRYK--YPHLVHGAMSASGPLR 885 (1042)
Q Consensus 840 ~~~---~~~~~~~~~i~~G~Sygg~la~~~~~~--yP~~~~g~i~ssa~~~ 885 (1042)
+.. |++++ ..|.|+|.|-||+.+.+...- -..++..||+.|+.+.
T Consensus 197 ~~nI~~FGGDp-~~VTl~G~SAGa~sv~~~l~sp~~~~LF~raI~~SGs~~ 246 (535)
T PF00135_consen 197 QDNIAAFGGDP-DNVTLFGQSAGAASVSLLLLSPSSKGLFHRAILQSGSAL 246 (535)
T ss_dssp HHHGGGGTEEE-EEEEEEEETHHHHHHHHHHHGGGGTTSBSEEEEES--TT
T ss_pred HhhhhhcccCC-cceeeeeecccccccceeeeccccccccccccccccccc
Confidence 655 44443 379999999988877766543 2369999999998654
No 189
>KOG3847|consensus
Probab=94.89 E-value=0.07 Score=55.94 Aligned_cols=116 Identities=18% Similarity=0.166 Sum_probs=67.7
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccC--CCCCC-----ccccc------------CCCH
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHP--LSDLS-----VESLQ------------YLSA 825 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p--~~~~~-----~~~~~------------~lt~ 825 (1042)
-|||++.+|-|+... +++.+-.+||-. |..|.++|||-.-.+.- ..+.+ .+.+. .+..
T Consensus 118 ~PvvvFSHGLggsRt--~YSa~c~~LASh-G~VVaavEHRD~SA~~Ty~~~~~~~n~~lveq~~~ir~v~~~ekef~irN 194 (399)
T KOG3847|consen 118 YPVVVFSHGLGGSRT--LYSAYCTSLASH-GFVVAAVEHRDRSACWTYVLKEKHENEPLVEQWIKIRLVEANEKEFHIRN 194 (399)
T ss_pred ccEEEEecccccchh--hHHHHhhhHhhC-ceEEEEeecccCcceeEEEecccccCCcccccceEeeeeccCceeEEeeC
Confidence 499999999998766 455566678876 99999999997654421 11110 11111 1223
Q ss_pred HHHHhHHHHHHHHHhhhc----------CCC-------------CCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecc
Q psy3685 826 EQALADLAYFITSMNTLY----------SLP-------------AHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASG 882 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~----------~~~-------------~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa 882 (1042)
||...-..+-.++++-.. .++ ..++++++|||+||+.++...... ..+.-||+--|
T Consensus 195 eqv~~R~~Ec~~aL~il~~i~~g~~~~~~L~g~~~~~~~~K~nl~~s~~aViGHSFGgAT~i~~ss~~-t~FrcaI~lD~ 273 (399)
T KOG3847|consen 195 EQVGQRAQECQKALKILEQINDGGTPDNVLPGNNSDLEQLKGNLDTSQAAVIGHSFGGATSIASSSSH-TDFRCAIALDA 273 (399)
T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCchhcccCccccHHHHhcchhhhhhhheeccccchhhhhhhccc-cceeeeeeeee
Confidence 444333333322221110 000 113589999999999988766654 34666776655
Q ss_pred cc
Q psy3685 883 PL 884 (1042)
Q Consensus 883 ~~ 884 (1042)
-+
T Consensus 274 WM 275 (399)
T KOG3847|consen 274 WM 275 (399)
T ss_pred ee
Confidence 44
No 190
>KOG2231|consensus
Probab=94.82 E-value=0.037 Score=64.91 Aligned_cols=30 Identities=13% Similarity=-0.025 Sum_probs=18.1
Q ss_pred ccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 855 GGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 855 G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
+.+..-.+..|...+.|....|...|.+.+
T Consensus 463 ~~ps~~~~ss~r~~~~~~ss~~~~~s~~~~ 492 (669)
T KOG2231|consen 463 SSPSSPYNSSTRSTAQPSSSLGPQSSLPSL 492 (669)
T ss_pred CCCcchhhhhcccccCCccccCccccchhh
Confidence 344455677777777777666655554333
No 191
>PF06259 Abhydrolase_8: Alpha/beta hydrolase; InterPro: IPR010427 This is a family of uncharacterised proteins found in Actinobacteria. Computational analysis suggests that they may belong to the alpha-beta hydrolase family of enzymes, as they are predicted to form the core secondary structures and catalytic machinery common to these proteins []. Genomic context suggests that they may function as lipases, controlling the concentration of their putative phospholipid substrates.
Probab=94.71 E-value=0.24 Score=49.07 Aligned_cols=59 Identities=15% Similarity=0.151 Sum_probs=45.7
Q ss_pred CHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccc
Q psy3685 824 SAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 824 t~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~ 884 (1042)
-.+.+-.+|+.|+..|+... .++..+.++|||||+.++.+.....+-.++-.|..++|-
T Consensus 86 ~A~~ga~~L~~f~~gl~a~~--~~~~~~tv~GHSYGS~v~G~A~~~~~~~vddvv~~GSPG 144 (177)
T PF06259_consen 86 YARAGAPRLARFLDGLRATH--GPDAHLTVVGHSYGSTVVGLAAQQGGLRVDDVVLVGSPG 144 (177)
T ss_pred HHHHHHHHHHHHHHHhhhhc--CCCCCEEEEEecchhHHHHHHhhhCCCCcccEEEECCCC
Confidence 35667889999999888765 345689999999999999998877666677666555554
No 192
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=94.71 E-value=0.017 Score=42.51 Aligned_cols=26 Identities=23% Similarity=0.533 Sum_probs=15.0
Q ss_pred cccCCccccccCCchhHHHHhhhcCC
Q psy3685 585 RYKCDLCSKKFSRKLNLTKHMKIHQG 610 (1042)
Q Consensus 585 ~~~C~~C~k~F~~~~~L~~H~~~H~~ 610 (1042)
|-.|++|+..+.+..+|++|+.++|+
T Consensus 24 PatCP~C~a~~~~srnLrRHle~~H~ 49 (54)
T PF09237_consen 24 PATCPICGAVIRQSRNLRRHLEIRHF 49 (54)
T ss_dssp -EE-TTT--EESSHHHHHHHHHHHTT
T ss_pred CCCCCcchhhccchhhHHHHHHHHhc
Confidence 66677777777777777777766554
No 193
>PF07224 Chlorophyllase: Chlorophyllase; InterPro: IPR010821 This family consists of several chlorophyllase proteins (3.1.1.14 from EC). Chlorophyllase (Chlase) is the first enzyme involved in chlorophyll degradation and catalyses the hydrolysis of the ester bond to yield chlorophyllide and phytol [, , ].; GO: 0047746 chlorophyllase activity, 0015996 chlorophyll catabolic process
Probab=94.69 E-value=0.072 Score=54.70 Aligned_cols=104 Identities=20% Similarity=0.208 Sum_probs=63.5
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
+=|||++.+|..-... .|+.++..+|- .|..||+.+ .+-. ..| ....-++|.++++.++..-+
T Consensus 45 ~yPVilF~HG~~l~ns--~Ys~lL~HIAS-HGfIVVAPQ-l~~~-~~p------------~~~~Ei~~aa~V~~WL~~gL 107 (307)
T PF07224_consen 45 TYPVILFLHGFNLYNS--FYSQLLAHIAS-HGFIVVAPQ-LYTL-FPP------------DGQDEIKSAASVINWLPEGL 107 (307)
T ss_pred CccEEEEeechhhhhH--HHHHHHHHHhh-cCeEEEech-hhcc-cCC------------CchHHHHHHHHHHHHHHhhh
Confidence 3478888888754322 34444555555 376776655 3322 222 12234688888888876542
Q ss_pred --CCC-----CCCcEEEEccchhHHHHHHHHHhC-CCeEEEEEeecccc
Q psy3685 844 --SLP-----AHTKWIAFGGSYPGALAAWLRYKY-PHLVHGAMSASGPL 884 (1042)
Q Consensus 844 --~~~-----~~~~~i~~G~Sygg~la~~~~~~y-P~~~~g~i~ssa~~ 884 (1042)
.++ +-.++.+.|||.||-.|-.+++.| .++=.+|+..--||
T Consensus 108 ~~~Lp~~V~~nl~klal~GHSrGGktAFAlALg~a~~lkfsaLIGiDPV 156 (307)
T PF07224_consen 108 QHVLPENVEANLSKLALSGHSRGGKTAFALALGYATSLKFSALIGIDPV 156 (307)
T ss_pred hhhCCCCcccccceEEEeecCCccHHHHHHHhcccccCchhheeccccc
Confidence 122 224799999999999998888887 55544444444444
No 194
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=94.65 E-value=0.013 Score=36.82 Aligned_cols=23 Identities=52% Similarity=0.870 Sum_probs=12.3
Q ss_pred ccCCccccccCCchhHHHHhhhcC
Q psy3685 586 YKCDLCSKKFSRKLNLTKHMKIHQ 609 (1042)
Q Consensus 586 ~~C~~C~k~F~~~~~L~~H~~~H~ 609 (1042)
|+|+.|+.... +..|.+|+++|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666666555 556666666554
No 195
>COG4188 Predicted dienelactone hydrolase [General function prediction only]
Probab=94.59 E-value=0.088 Score=57.45 Aligned_cols=103 Identities=13% Similarity=0.011 Sum_probs=61.9
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeee--cccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRF--YGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~--~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
-|||++.+|.|+.-..+.+ ....||.. |.+|.+|+|.+ .|..........+ +.-.-...=..|+-.++.++...
T Consensus 71 ~PlvvlshG~Gs~~~~f~~--~A~~lAs~-Gf~Va~~~hpgs~~~~~~~~~~~~~~-~~p~~~~erp~dis~lLd~L~~~ 146 (365)
T COG4188 71 LPLVVLSHGSGSYVTGFAW--LAEHLASY-GFVVAAPDHPGSNAGGAPAAYAGPGS-YAPAEWWERPLDISALLDALLQL 146 (365)
T ss_pred CCeEEecCCCCCCccchhh--hHHHHhhC-ceEEEeccCCCcccccCChhhcCCcc-cchhhhhcccccHHHHHHHHHHh
Confidence 5899999999887332222 23345554 99999999987 4544322111000 11112223347888888887766
Q ss_pred -----cCC-CCCCcEEEEccchhHHHHHHHHHhCC
Q psy3685 843 -----YSL-PAHTKWIAFGGSYPGALAAWLRYKYP 871 (1042)
Q Consensus 843 -----~~~-~~~~~~i~~G~Sygg~la~~~~~~yP 871 (1042)
+.. -+-.+|.++|+||||..++...--..
T Consensus 147 ~~sP~l~~~ld~~~Vgv~GhS~GG~T~m~laGA~~ 181 (365)
T COG4188 147 TASPALAGRLDPQRVGVLGHSFGGYTAMELAGAEL 181 (365)
T ss_pred hcCcccccccCccceEEEecccccHHHHHhccccc
Confidence 211 11247999999999988876654333
No 196
>COG0627 Predicted esterase [General function prediction only]
Probab=94.58 E-value=0.11 Score=56.93 Aligned_cols=125 Identities=18% Similarity=0.088 Sum_probs=74.9
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEE--EeeeecccccCCCC-CCcccccCCCHHHH-----HhHHHH
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQ--VEHRFYGKSHPLSD-LSVESLQYLSAEQA-----LADLAY 834 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~--~ehR~~G~S~p~~~-~~~~~~~~lt~~q~-----l~D~~~ 834 (1042)
..=||+++++|.....+.+...+-+...|.+.|..++. .+-||.|.-.+.-. +....=-|+...|. --+...
T Consensus 52 ~~ipV~~~l~G~t~~~~~~~~~~g~~~~a~~~g~~~~~p~t~~~~~~~~~~vv~p~G~~~sfY~d~~~~~~~~~~~q~~t 131 (316)
T COG0627 52 RDIPVLYLLSGLTCNEPNVYLLDGLRRQADESGWAVVTPDTSPRGAGVNISVVMPLGGGASFYSDWTQPPWASGPYQWET 131 (316)
T ss_pred CCCCEEEEeCCCCCCCCceEeccchhhhhhhcCeEEecCCCCcccCCCCccccccCCCccceecccccCccccCccchhH
Confidence 34579999999876654455445566778887888888 56677765544100 00000012222221 012222
Q ss_pred HH-----HHHhhhcCCCCC-CcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 835 FI-----TSMNTLYSLPAH-TKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 835 ~~-----~~~~~~~~~~~~-~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
|| ..+.+.++...+ ..-.++|+||||.=|..+++++|+.+..+.+-|+.+...
T Consensus 132 fl~~ELP~~~~~~f~~~~~~~~~aI~G~SMGG~GAl~lA~~~pd~f~~~sS~Sg~~~~s 190 (316)
T COG0627 132 FLTQELPALWEAAFPADGTGDGRAIAGHSMGGYGALKLALKHPDRFKSASSFSGILSPS 190 (316)
T ss_pred HHHhhhhHHHHHhcCcccccCCceeEEEeccchhhhhhhhhCcchhceecccccccccc
Confidence 22 112222222111 257889999999999999999999999888888887665
No 197
>PLN02213 sinapoylglucose-malate O-sinapoyltransferase/ carboxypeptidase
Probab=94.58 E-value=0.24 Score=54.99 Aligned_cols=87 Identities=13% Similarity=0.014 Sum_probs=56.0
Q ss_pred CeEEEEeee-ecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHH----HHHHHHHhC
Q psy3685 796 ALAVQVEHR-FYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGA----LAAWLRYKY 870 (1042)
Q Consensus 796 a~~~~~ehR-~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~----la~~~~~~y 870 (1042)
|.|+++|.+ |.|-|...++. .+-+.++++.|+..|++.+-..++.-.+.|+.++|-||||. ||..+....
T Consensus 2 aNvLfiDqPvGvGfSy~~~~~-----~~~~d~~~a~d~~~fL~~Ff~~~p~~~~~~fyI~GESYaG~YiP~la~~I~~~n 76 (319)
T PLN02213 2 ANIIFLDQPVGSGFSYSKTPI-----DKTGDISEVKRTHEFLQKWLSRHPQYFSNPLYVVGDSYSGMIVPALVQEISQGN 76 (319)
T ss_pred ccEEEecCCCCCCCCCCCCCC-----CccccHHHHHHHHHHHHHHHHhCcccccCCeEEEeeccccchHHHHHHHHHhhc
Confidence 689999988 99999653221 12333344499999998876666543457999999999995 444443333
Q ss_pred C-----C-eEEEEEeeccccccc
Q psy3685 871 P-----H-LVHGAMSASGPLRAV 887 (1042)
Q Consensus 871 P-----~-~~~g~i~ssa~~~~~ 887 (1042)
. . .+.|.+...+.+.+.
T Consensus 77 ~~~~~~~inLkGi~IGNg~t~~~ 99 (319)
T PLN02213 77 YICCEPPINLQGYMLGNPVTYMD 99 (319)
T ss_pred ccccCCceeeeEEEeCCCCCCcc
Confidence 1 1 355655555555443
No 198
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.51 E-value=0.013 Score=37.33 Aligned_cols=23 Identities=35% Similarity=0.869 Sum_probs=21.2
Q ss_pred ccCCCCCCccCCcchHHHHHhhc
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEIH 705 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~H 705 (1042)
|.|++|++.|++...|+.|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 67999999999999999999865
No 199
>KOG2231|consensus
Probab=94.45 E-value=0.045 Score=64.23 Aligned_cols=47 Identities=23% Similarity=0.508 Sum_probs=25.2
Q ss_pred ccCCCCCCCCChhHHHHHHHhcCCCccccCCC------Ccccccccchhhhhcccc
Q psy3685 145 KCDLCSKDFSYKSSLIEHKKLHQGIRYKCNLC------SKEYFFKRNLTEHKKLHQ 194 (1042)
Q Consensus 145 ~C~~C~~~F~~~~~L~~H~~~h~~~~~~C~~C------~~~f~~~~~L~~H~~~h~ 194 (1042)
.|..|...|-....|.+|++.++ |.|..| +..|..-..|..|.+.++
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h---~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDH---EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred cchhhhhhhccHHHHHHhhccce---eheeecCcccccchhcccchHHHHHhhhcC
Confidence 46666666666666666665544 334444 233445555666655544
No 200
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.44 E-value=0.036 Score=65.09 Aligned_cols=157 Identities=25% Similarity=0.340 Sum_probs=97.0
Q ss_pred cccccCcCCcccCCchhhhhHhh--hhcCC---cccCC--ccccccCCchhHHHHhhhcCCC-cccCCCCCcccccCCCC
Q psy3685 557 IRYKCDLCSKEFSRKDNLTEHKE--IHQGI---RYKCD--LCSKKFSRKLNLTKHMKIHQGI-RYKCDLCSKEFSRKDIP 628 (1042)
Q Consensus 557 ~~~~C~~C~k~F~~~~~L~~H~~--~H~~~---~~~C~--~C~k~F~~~~~L~~H~~~H~~~-~~~C~~C~k~F~~~~~l 628 (1042)
.++.|..|...|.....|..|.+ .|+++ ++.|+ .|++.|.+...+..|..+|++. ++.+..-.........+
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 35778888888888888888888 67766 68888 7888888888888888888886 45554332222221111
Q ss_pred cccccCCcCcccCCccccccCCCcccccccccccccccccccccccccccCCc--cccCCCCCCccCCcchHHHHHhhcc
Q psy3685 629 YEVTVKEESYEGDSINVEENNGNDAVTRDCLEFFKTEQNTTLGKLNKILHQDI--RYKCDLCSKDFSRKDNLTEHKEIHQ 706 (1042)
Q Consensus 629 ~~~~~~~~~~~c~~~~~~~~~~~~~~~~~C~~~f~~~~~~~l~~h~~~h~~~~--~~~C~~C~k~F~~~~~L~~H~~~H~ 706 (1042)
.....+ .................-..|...+.. ...+..|...|.... .+.+..|.+.|.....|..|++.|+
T Consensus 368 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (467)
T COG5048 368 NNEPPQ---SLQQYKDLKNDKKSETLSNSCIRNFKR--DSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHT 442 (467)
T ss_pred CCCCcc---chhhccCccCCccccccccchhhhhcc--ccccccccccccccCCcCCCCCcchhhccCcccccccccccc
Confidence 110000 000000000000001111245555555 555555655555554 5789999999999999999999999
Q ss_pred CCCCcccCCCCC
Q psy3685 707 GIRVSDINHGTL 718 (1042)
Q Consensus 707 g~~~~~c~~c~~ 718 (1042)
...+..|.....
T Consensus 443 ~~~~~~~~~~~~ 454 (467)
T COG5048 443 NHAPLLCSILKS 454 (467)
T ss_pred cCCceeeccccc
Confidence 888877766554
No 201
>KOG4840|consensus
Probab=94.41 E-value=0.037 Score=54.88 Aligned_cols=103 Identities=15% Similarity=0.114 Sum_probs=67.4
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCC
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSL 845 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~ 845 (1042)
--+||+||-|..-..-.+..-....-.+.+..+|.+-.|- . ...++..|..|-.+|+..++.++...-
T Consensus 37 ~~vvfiGGLgdgLl~~~y~~~L~~~lde~~wslVq~q~~S----s------y~G~Gt~slk~D~edl~~l~~Hi~~~~-- 104 (299)
T KOG4840|consen 37 VKVVFIGGLGDGLLICLYTTMLNRYLDENSWSLVQPQLRS----S------YNGYGTFSLKDDVEDLKCLLEHIQLCG-- 104 (299)
T ss_pred EEEEEEcccCCCccccccHHHHHHHHhhccceeeeeeccc----c------ccccccccccccHHHHHHHHHHhhccC--
Confidence 4789999998765543343333334445577777776552 1 122344577788899999998765431
Q ss_pred CCCCcEEEEccchhHHHHHHHH--HhCCCeEEEEEeec
Q psy3685 846 PAHTKWIAFGGSYPGALAAWLR--YKYPHLVHGAMSAS 881 (1042)
Q Consensus 846 ~~~~~~i~~G~Sygg~la~~~~--~~yP~~~~g~i~ss 881 (1042)
....|||+|+|-|-.=-.+|. ..-|..+.+||+-+
T Consensus 105 -fSt~vVL~GhSTGcQdi~yYlTnt~~~r~iraaIlqA 141 (299)
T KOG4840|consen 105 -FSTDVVLVGHSTGCQDIMYYLTNTTKDRKIRAAILQA 141 (299)
T ss_pred -cccceEEEecCccchHHHHHHHhccchHHHHHHHHhC
Confidence 234899999999997766665 45566677777553
No 202
>cd00741 Lipase Lipase. Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation", the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site.
Probab=94.34 E-value=0.095 Score=51.25 Aligned_cols=56 Identities=18% Similarity=0.070 Sum_probs=37.6
Q ss_pred HhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC----eEEEEEeeccccccc
Q psy3685 829 LADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH----LVHGAMSASGPLRAV 887 (1042)
Q Consensus 829 l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~----~~~g~i~ssa~~~~~ 887 (1042)
..++...++.....+ ++.+++++|||.||+||..+...++. .+...++-++|....
T Consensus 11 ~~~i~~~~~~~~~~~---p~~~i~v~GHSlGg~lA~l~a~~~~~~~~~~~~~~~~fg~p~~~~ 70 (153)
T cd00741 11 ANLVLPLLKSALAQY---PDYKIHVTGHSLGGALAGLAGLDLRGRGLGRLVRVYTFGPPRVGN 70 (153)
T ss_pred HHHHHHHHHHHHHHC---CCCeEEEEEcCHHHHHHHHHHHHHHhccCCCceEEEEeCCCcccc
Confidence 334444444444333 34689999999999999998877765 466677776665443
No 203
>KOG2237|consensus
Probab=94.26 E-value=0.069 Score=61.47 Aligned_cols=136 Identities=14% Similarity=0.090 Sum_probs=87.2
Q ss_pred hcccCCCCCccc-ceEEEEEeccccc--CCCcEEEEE-CCCCCC-cccccccccHHHHHHHcCCeEEEEeeeecc---cc
Q psy3685 738 KLDHFNIEEPRT-WKQRYFVNSTWYK--HGGPIFLMI-GGEGEA-SAKWMVEGTWLDYAHNHNALAVQVEHRFYG---KS 809 (1042)
Q Consensus 738 ~~dhf~~~~~~t-f~qry~~~~~~~~--~~~pi~~~~-gg~g~~-~~~~~~~~~~~~la~~~~a~~~~~ehR~~G---~S 809 (1042)
-+-+|-++.++| -++ ++|-.+.|+ +..|.+||. ||.|.. .+.|..+.. -|.+ .|..+.+..-||=| ++
T Consensus 441 ~~r~~~~SkDGt~VPM-~Iv~kk~~k~dg~~P~LLygYGay~isl~p~f~~srl--~lld-~G~Vla~a~VRGGGe~G~~ 516 (712)
T KOG2237|consen 441 VERIEVSSKDGTKVPM-FIVYKKDIKLDGSKPLLLYGYGAYGISLDPSFRASRL--SLLD-RGWVLAYANVRGGGEYGEQ 516 (712)
T ss_pred EEEEEEecCCCCccce-EEEEechhhhcCCCceEEEEecccceeecccccccee--EEEe-cceEEEEEeeccCcccccc
Confidence 344565555555 444 444466665 467999999 777753 455554433 2334 58888888889844 33
Q ss_pred cCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh-cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 810 HPLSDLSVESLQYLSAEQALADLAYFITSMNTL-YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 810 ~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~-~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.--+. +.+.-...++|+..=+++|-.+ |. ...+.-+.|+|-||.|++...-..|+|+.++||-.+.+.
T Consensus 517 WHk~G------~lakKqN~f~Dfia~AeyLve~gyt--~~~kL~i~G~SaGGlLvga~iN~rPdLF~avia~VpfmD 585 (712)
T KOG2237|consen 517 WHKDG------RLAKKQNSFDDFIACAEYLVENGYT--QPSKLAIEGGSAGGLLVGACINQRPDLFGAVIAKVPFMD 585 (712)
T ss_pred hhhcc------chhhhcccHHHHHHHHHHHHHcCCC--CccceeEecccCccchhHHHhccCchHhhhhhhcCccee
Confidence 22111 1122233456666666665433 33 234899999999999999999999999999998876664
No 204
>PRK04860 hypothetical protein; Provisional
Probab=94.21 E-value=0.028 Score=54.42 Aligned_cols=36 Identities=17% Similarity=0.375 Sum_probs=32.2
Q ss_pred ccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCC
Q psy3685 681 IRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGE 720 (1042)
Q Consensus 681 ~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~ 720 (1042)
-+|.|. |++ ...++.+|.++|+|+++|.|..|+...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l 153 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETL 153 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCcee
Confidence 379998 998 789999999999999999999999753
No 205
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.18 E-value=0.022 Score=36.20 Aligned_cols=23 Identities=35% Similarity=0.889 Sum_probs=15.3
Q ss_pred ccCCccccccCCchhHHHHhhhc
Q psy3685 586 YKCDLCSKKFSRKLNLTKHMKIH 608 (1042)
Q Consensus 586 ~~C~~C~k~F~~~~~L~~H~~~H 608 (1042)
|.|++|++.|.....|+.|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 45677777777777777776543
No 206
>PF00151 Lipase: Lipase; InterPro: IPR013818 Triglyceride lipases (3.1.1.3 from EC) are lipolytic enzymes that hydrolyse ester linkages of triglycerides []. Lipases are widely distributed in animals, plants and prokaryotes. At least three tissue-specific isozymes exist in higher vertebrates, pancreatic, hepatic and gastric/lingual. These lipases are closely related to each other and to lipoprotein lipase (3.1.1.34 from EC), which hydrolyses triglycerides of chylomicrons and very low density lipoproteins (VLDL) []. The most conserved region in all these proteins is centred around a serine residue which has been shown [] to participate, with an histidine and an aspartic acid residue, in a charge relay system. Such a region is also present in lipases of prokaryotic origin and in lecithin-cholesterol acyltransferase (2.3.1.43 from EC) (LCAT) [], which catalyzes fatty acid transfer between phosphatidylcholine and cholesterol.; PDB: 1LPB_B 1LPA_B 1N8S_A 1GPL_A 1W52_X 2PVS_B 2OXE_B 1BU8_A 2PPL_A 1ETH_A ....
Probab=94.17 E-value=0.056 Score=59.84 Aligned_cols=99 Identities=13% Similarity=0.201 Sum_probs=59.0
Q ss_pred ccCCCcEEEEECCCCCCc--ccccccccHHHHHHH--cCCeEEEEeeeecccccCCCCCCcccccCC----CHHHHHhHH
Q psy3685 761 YKHGGPIFLMIGGEGEAS--AKWMVEGTWLDYAHN--HNALAVQVEHRFYGKSHPLSDLSVESLQYL----SAEQALADL 832 (1042)
Q Consensus 761 ~~~~~pi~~~~gg~g~~~--~~~~~~~~~~~la~~--~~a~~~~~ehR~~G~S~p~~~~~~~~~~~l----t~~q~l~D~ 832 (1042)
+++..|++|+++|..+.. ..|.. .....|-+. -+++||++|....-.. .|. .+..+-+-+
T Consensus 67 fn~~~pt~iiiHGw~~~~~~~~~~~-~~~~all~~~~~d~NVI~VDWs~~a~~-----------~Y~~a~~n~~~vg~~l 134 (331)
T PF00151_consen 67 FNPSKPTVIIIHGWTGSGSSESWIQ-DMIKALLQKDTGDYNVIVVDWSRGASN-----------NYPQAVANTRLVGRQL 134 (331)
T ss_dssp --TTSEEEEEE--TT-TT-TTTHHH-HHHHHHHCC--S-EEEEEEE-HHHHSS------------HHHHHHHHHHHHHHH
T ss_pred cCCCCCeEEEEcCcCCcccchhHHH-HHHHHHHhhccCCceEEEEcchhhccc-----------cccchhhhHHHHHHHH
Confidence 467899999999986654 33433 122223333 3789999997643211 122 234445566
Q ss_pred HHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC
Q psy3685 833 AYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH 872 (1042)
Q Consensus 833 ~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~ 872 (1042)
+.||..|....+.+. .++-++|+|.|+-+|.........
T Consensus 135 a~~l~~L~~~~g~~~-~~ihlIGhSLGAHvaG~aG~~~~~ 173 (331)
T PF00151_consen 135 AKFLSFLINNFGVPP-ENIHLIGHSLGAHVAGFAGKYLKG 173 (331)
T ss_dssp HHHHHHHHHHH---G-GGEEEEEETCHHHHHHHHHHHTTT
T ss_pred HHHHHHHHhhcCCCh-hHEEEEeeccchhhhhhhhhhccC
Confidence 677777765544433 489999999999999999988887
No 207
>PRK04860 hypothetical protein; Provisional
Probab=94.12 E-value=0.028 Score=54.43 Aligned_cols=37 Identities=27% Similarity=0.810 Sum_probs=26.0
Q ss_pred cccCCccccccCCchhHHHHhhhcCCC-cccCCCCCcccccC
Q psy3685 585 RYKCDLCSKKFSRKLNLTKHMKIHQGI-RYKCDLCSKEFSRK 625 (1042)
Q Consensus 585 ~~~C~~C~k~F~~~~~L~~H~~~H~~~-~~~C~~C~k~F~~~ 625 (1042)
+|.|. |++ ...++++|.++|+|+ +|.|..|++.|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 57776 776 666777777777776 67777777777543
No 208
>PLN00413 triacylglycerol lipase
Probab=94.08 E-value=0.2 Score=56.76 Aligned_cols=48 Identities=25% Similarity=0.292 Sum_probs=32.9
Q ss_pred CCCcEEEEccchhHHHHHHHHH----hC----CCeEEEEEeeccccccccCChhhH
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRY----KY----PHLVHGAMSASGPLRAVVDFPEYF 894 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~----~y----P~~~~g~i~ssa~~~~~~~~~~~~ 894 (1042)
++.++++.|||.||+||+.++. .. ...+.+.+..++|--...+|..|+
T Consensus 282 p~~kliVTGHSLGGALAtLaA~~L~~~~~~~~~~ri~~VYTFG~PRVGN~~FA~~~ 337 (479)
T PLN00413 282 PTSKFILSGHSLGGALAILFTAVLIMHDEEEMLERLEGVYTFGQPRVGDEDFGIFM 337 (479)
T ss_pred CCCeEEEEecCHHHHHHHHHHHHHHhccchhhccccceEEEeCCCCCccHHHHHHH
Confidence 3468999999999999998763 11 223456677777766555555554
No 209
>PLN03037 lipase class 3 family protein; Provisional
Probab=94.07 E-value=0.19 Score=57.44 Aligned_cols=67 Identities=22% Similarity=0.277 Sum_probs=45.9
Q ss_pred HHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH----hCCCe-EEEEEeeccccccccCChhhHH
Q psy3685 826 EQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY----KYPHL-VHGAMSASGPLRAVVDFPEYFG 895 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~----~yP~~-~~g~i~ssa~~~~~~~~~~~~~ 895 (1042)
+|+++++..+++.++.. .++.++++.|||.||+||+..+. ..|++ -..++..++|--....|.++++
T Consensus 298 eQVl~eV~rLv~~Yk~~---ge~~SItVTGHSLGGALAtLaA~DIa~~~p~~~~VtvyTFGsPRVGN~aFA~~~~ 369 (525)
T PLN03037 298 EQVMEEVKRLVNFFKDR---GEEVSLTITGHSLGGALALLNAYEAARSVPALSNISVISFGAPRVGNLAFKEKLN 369 (525)
T ss_pred HHHHHHHHHHHHhcccc---CCcceEEEeccCHHHHHHHHHHHHHHHhCCCCCCeeEEEecCCCccCHHHHHHHH
Confidence 78888888888766532 12357999999999999987763 45665 3345666666655555555544
No 210
>PLN02310 triacylglycerol lipase
Probab=93.92 E-value=0.14 Score=57.14 Aligned_cols=68 Identities=22% Similarity=0.292 Sum_probs=45.6
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH----hCCCeEEEEEeeccccccccCChhhHH
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY----KYPHLVHGAMSASGPLRAVVDFPEYFG 895 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~----~yP~~~~g~i~ssa~~~~~~~~~~~~~ 895 (1042)
.+|+++.+..++..+... .++.++++.|||.||+||+..+. ..|..-..++..++|--....|.++++
T Consensus 188 ~~qVl~eV~~L~~~y~~~---~e~~sI~vTGHSLGGALAtLaA~dl~~~~~~~~v~vyTFGsPRVGN~~Fa~~~~ 259 (405)
T PLN02310 188 SEQVMQEVKRLVNFYRGK---GEEVSLTVTGHSLGGALALLNAYEAATTIPDLFVSVISFGAPRVGNIAFKEKLN 259 (405)
T ss_pred HHHHHHHHHHHHHhhccc---CCcceEEEEcccHHHHHHHHHHHHHHHhCcCcceeEEEecCCCcccHHHHHHHH
Confidence 467777777666654321 13358999999999999987763 356655567777777766555555543
No 211
>KOG3724|consensus
Probab=93.88 E-value=0.12 Score=60.79 Aligned_cols=35 Identities=29% Similarity=0.320 Sum_probs=24.0
Q ss_pred EEEEccchhHHHHHHHHHhCCCeEEEE----Eeecccccc
Q psy3685 851 WIAFGGSYPGALAAWLRYKYPHLVHGA----MSASGPLRA 886 (1042)
Q Consensus 851 ~i~~G~Sygg~la~~~~~~yP~~~~g~----i~ssa~~~~ 886 (1042)
||++||||||++|..+ ..+|+.+.|+ |.-|+|..+
T Consensus 184 VILVGHSMGGiVAra~-~tlkn~~~~sVntIITlssPH~a 222 (973)
T KOG3724|consen 184 VILVGHSMGGIVARAT-LTLKNEVQGSVNTIITLSSPHAA 222 (973)
T ss_pred EEEEeccchhHHHHHH-HhhhhhccchhhhhhhhcCcccC
Confidence 9999999999988654 3455555554 455556544
No 212
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=93.70 E-value=0.038 Score=34.71 Aligned_cols=22 Identities=55% Similarity=0.955 Sum_probs=10.0
Q ss_pred cccCCCCcccCCchhHHHHHHHh
Q psy3685 505 YKCDLCSKEFSSKGNLTEHMKQH 527 (1042)
Q Consensus 505 ~~C~~C~~~f~~~~~L~~H~~~H 527 (1042)
|+|+.|+.... +..|.+|++.|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhh
Confidence 34555555554 44555555544
No 213
>PF07082 DUF1350: Protein of unknown function (DUF1350); InterPro: IPR010765 This family consists of several hypothetical proteins from both cyanobacteria and plants. Members of this family are typically around 250 residues in length. The function of this family is unknown but the species distribution indicates that the family may be involved in photosynthesis.
Probab=93.64 E-value=0.28 Score=50.67 Aligned_cols=105 Identities=12% Similarity=0.057 Sum_probs=64.4
Q ss_pred CCCc--EEEEECCCCCC-cccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCC-CHHHHHhHHHHHHHH
Q psy3685 763 HGGP--IFLMIGGEGEA-SAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYL-SAEQALADLAYFITS 838 (1042)
Q Consensus 763 ~~~p--i~~~~gg~g~~-~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~l-t~~q~l~D~~~~~~~ 838 (1042)
|..| ||-++||.--. .+...|.-+...||++ |++||++-... + +..+ -..+++..+..-.+.
T Consensus 13 P~~P~gvihFiGGaf~ga~P~itYr~lLe~La~~-Gy~ViAtPy~~----------t---fDH~~~A~~~~~~f~~~~~~ 78 (250)
T PF07082_consen 13 PPRPKGVIHFIGGAFVGAAPQITYRYLLERLADR-GYAVIATPYVV----------T---FDHQAIAREVWERFERCLRA 78 (250)
T ss_pred CCCCCEEEEEcCcceeccCcHHHHHHHHHHHHhC-CcEEEEEecCC----------C---CcHHHHHHHHHHHHHHHHHH
Confidence 4456 89999998332 3445566677889987 99999975322 1 1111 222333333333333
Q ss_pred HhhhcCCC-CCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeec
Q psy3685 839 MNTLYSLP-AHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSAS 881 (1042)
Q Consensus 839 ~~~~~~~~-~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ss 881 (1042)
+....... ...|++-+|||+|.-|-+.+...|+..-+|-|+.|
T Consensus 79 L~~~~~~~~~~lP~~~vGHSlGcklhlLi~s~~~~~r~gniliS 122 (250)
T PF07082_consen 79 LQKRGGLDPAYLPVYGVGHSLGCKLHLLIGSLFDVERAGNILIS 122 (250)
T ss_pred HHHhcCCCcccCCeeeeecccchHHHHHHhhhccCcccceEEEe
Confidence 33332222 22589999999999999999888876655555444
No 214
>COG4947 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=93.51 E-value=0.13 Score=48.98 Aligned_cols=114 Identities=25% Similarity=0.197 Sum_probs=68.1
Q ss_pred ccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccc---cCCCHHHHHhHHHHHHH
Q psy3685 761 YKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESL---QYLSAEQALADLAYFIT 837 (1042)
Q Consensus 761 ~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~---~~lt~~q~l~D~~~~~~ 837 (1042)
-..|.||+++----|.. ..+...|.+..||..+.+-.|.+ |-. +.+..+.+ -.-+.++|-.--|.+ +
T Consensus 23 GHaG~pVvvFpts~Grf-~eyed~G~v~ala~fie~G~vQl----ft~----~gldsESf~a~h~~~adr~~rH~Aye-r 92 (227)
T COG4947 23 GHAGIPVVVFPTSGGRF-NEYEDFGMVDALASFIEEGLVQL----FTL----SGLDSESFLATHKNAADRAERHRAYE-R 92 (227)
T ss_pred cCCCCcEEEEecCCCcc-hhhhhcccHHHHHHHHhcCcEEE----EEe----cccchHhHhhhcCCHHHHHHHHHHHH-H
Confidence 34688999887333332 22344567777887754443332 111 12222221 011344443333333 2
Q ss_pred HHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 838 SMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 838 ~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
++.++. ++. .-|+-|.|+||..|+-+-.++|+++.+.||-|++-.+.
T Consensus 93 Yv~eEa-lpg--s~~~sgcsmGayhA~nfvfrhP~lftkvialSGvYdar 139 (227)
T COG4947 93 YVIEEA-LPG--STIVSGCSMGAYHAANFVFRHPHLFTKVIALSGVYDAR 139 (227)
T ss_pred HHHHhh-cCC--CccccccchhhhhhhhhheeChhHhhhheeecceeeHH
Confidence 333332 232 46889999999999999999999999999999988766
No 215
>KOG2785|consensus
Probab=93.36 E-value=0.12 Score=55.57 Aligned_cols=56 Identities=29% Similarity=0.510 Sum_probs=46.0
Q ss_pred eeccCccccccCCchhHHHHHhhcCCC------------------------cccCCCCC---cccCCchhHHHHHH--hh
Q psy3685 424 RYKCDLCSKKFSRKLNLTKHMKIHQGI------------------------RYKCDLCS---KEFSRKGNLTKHMK--LH 474 (1042)
Q Consensus 424 ~~~C~~C~~~f~~~~~L~~H~~~H~~~------------------------~~~C~~C~---k~f~~~~~L~~H~~--~H 474 (1042)
|-.|-.|++.+.+...-..||..+++. -+.|-.|+ +.|.+..+.+.||. .|
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K~H 245 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDKGH 245 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhccC
Confidence 578999999999999999999888763 27799999 99999999999995 35
Q ss_pred cCCcc
Q psy3685 475 QGIRY 479 (1042)
Q Consensus 475 ~~~~~ 479 (1042)
...+|
T Consensus 246 Ckl~y 250 (390)
T KOG2785|consen 246 CKLPY 250 (390)
T ss_pred cccCC
Confidence 44443
No 216
>KOG2785|consensus
Probab=93.30 E-value=0.1 Score=56.15 Aligned_cols=82 Identities=26% Similarity=0.421 Sum_probs=42.5
Q ss_pred cccCCCcCcccCCcchhHhHhhh--hcCccccccccccCCcccCCccccccCCcchHHHH-h---hhcCCCccccCCCcc
Q psy3685 197 RYNCDLCSKEFSRKDNLTDHKKM--HQGNLTKHMKLHQGIRYKCDLCLKEFSRKDNLTEH-K---EIHQGIRYKCDLCSK 270 (1042)
Q Consensus 197 ~~~C~~C~~~f~~~~~L~~H~~~--h~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H-~---~~h~~~~~~C~~C~~ 270 (1042)
.|.|.-|...|.+...-+.|.++ |.-+|.+....-.+. =-..|..+..-..- . ..-.+.++.|..|.+
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPI------taE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k 76 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPI------TAEEFNEKVLSDDSEKEENLEEAESVVYCEACNK 76 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCc------CHHHHhHHHhhhhhhhhhhhhhcccceehHHhhc
Confidence 36788888888888888888765 444443322111111 01112111110000 0 011233677777777
Q ss_pred ccCCchHHHHHHHh
Q psy3685 271 KFSRKLNLTKHMKI 284 (1042)
Q Consensus 271 ~f~~~~~L~~H~~~ 284 (1042)
.|.+......|++.
T Consensus 77 ~~~s~~a~~~hl~S 90 (390)
T KOG2785|consen 77 SFASPKAHENHLKS 90 (390)
T ss_pred cccChhhHHHHHHH
Confidence 77777777777654
No 217
>PF05990 DUF900: Alpha/beta hydrolase of unknown function (DUF900); InterPro: IPR010297 This domain is associated with proteins of unknown function, which are hydrolase-like.
Probab=93.24 E-value=0.26 Score=51.80 Aligned_cols=89 Identities=13% Similarity=0.109 Sum_probs=54.5
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCC--eEEEEeeeecccccCCCCCCcccccCC----CHHHHHhHHHHHH
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNA--LAVQVEHRFYGKSHPLSDLSVESLQYL----SAEQALADLAYFI 836 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a--~~~~~ehR~~G~S~p~~~~~~~~~~~l----t~~q~l~D~~~~~ 836 (1042)
+++.|+||++|.......... -+.+++..++. .+|.+--+..|. -+.|. +.+.+..+++.|+
T Consensus 16 ~~~~vlvfVHGyn~~f~~a~~--r~aql~~~~~~~~~~i~FsWPS~g~----------~~~Y~~d~~~a~~s~~~l~~~L 83 (233)
T PF05990_consen 16 PDKEVLVFVHGYNNSFEDALR--RAAQLAHDLGFPGVVILFSWPSDGS----------LLGYFYDRESARFSGPALARFL 83 (233)
T ss_pred CCCeEEEEEeCCCCCHHHHHH--HHHHHHHHhCCCceEEEEEcCCCCC----------hhhhhhhhhhHHHHHHHHHHHH
Confidence 567899999998765332221 13345554432 444444333332 12233 5666777888888
Q ss_pred HHHhhhcCCCCCCcEEEEccchhHHHHHHH
Q psy3685 837 TSMNTLYSLPAHTKWIAFGGSYPGALAAWL 866 (1042)
Q Consensus 837 ~~~~~~~~~~~~~~~i~~G~Sygg~la~~~ 866 (1042)
..+.... +..++.+++||||+.+..-.
T Consensus 84 ~~L~~~~---~~~~I~ilaHSMG~rv~~~a 110 (233)
T PF05990_consen 84 RDLARAP---GIKRIHILAHSMGNRVLLEA 110 (233)
T ss_pred HHHHhcc---CCceEEEEEeCchHHHHHHH
Confidence 8776652 34589999999999776554
No 218
>PF01764 Lipase_3: Lipase (class 3); InterPro: IPR002921 Triglyceride lipases are lipolytic enzymes that hydrolyse ester linkages of triglycerides []. Lipases are widely distributed in animals, plants and prokaryotes. This family of lipases have been called Class 3 as they are not closely related to other lipase families.; GO: 0004806 triglyceride lipase activity, 0006629 lipid metabolic process; PDB: 1LGY_A 1DTE_A 1DT5_F 4DYH_B 1DU4_C 4EA6_B 1GT6_B 1EIN_A 1DT3_A 1TIB_A ....
Probab=93.24 E-value=0.085 Score=50.64 Aligned_cols=53 Identities=15% Similarity=0.122 Sum_probs=30.4
Q ss_pred HHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCC----C--eEEEEEeecccccc
Q psy3685 831 DLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYP----H--LVHGAMSASGPLRA 886 (1042)
Q Consensus 831 D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP----~--~~~g~i~ssa~~~~ 886 (1042)
.+..-+..+..++. +.++++.|||.||+||+.+...+. . ...-.++.++|-..
T Consensus 49 ~~~~~l~~~~~~~~---~~~i~itGHSLGGalA~l~a~~l~~~~~~~~~~~~~~~fg~P~~~ 107 (140)
T PF01764_consen 49 QILDALKELVEKYP---DYSIVITGHSLGGALASLAAADLASHGPSSSSNVKCYTFGAPRVG 107 (140)
T ss_dssp HHHHHHHHHHHHST---TSEEEEEEETHHHHHHHHHHHHHHHCTTTSTTTEEEEEES-S--B
T ss_pred HHHHHHHHHHhccc---CccchhhccchHHHHHHHHHHhhhhcccccccceeeeecCCcccc
Confidence 33334444444443 358999999999999977665432 1 33455555555543
No 219
>PF08840 BAAT_C: BAAT / Acyl-CoA thioester hydrolase C terminal; InterPro: IPR014940 Acyl-CoA thioesterases are a group of enzymes that catalyse the hydrolysis of acyl-CoAs to the free fatty acid and coenzyme A (CoASH), providing the potential to regulate intracellular levels of acyl-CoAs, free fatty acids and CoASH. Bile acid-CoA:amino acid N-acetyltransferase (BAAT) is involved in bile acid metabolism and may also act as an acyl-CoA thioesterase that regulates intracellular levels of free fatty acids []. This entry represents a catalytic domain is found at the C terminus of acyl-CoA thioester hydrolases and bile acid-CoA:amino acid N-acetyltransferases. ; PDB: 3K2I_B 3HLK_B.
Probab=93.12 E-value=0.13 Score=53.34 Aligned_cols=36 Identities=22% Similarity=0.190 Sum_probs=29.8
Q ss_pred CcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 849 TKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 849 ~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.++.++|.|.||.||..++..|| .|.++|+.+++..
T Consensus 22 ~~Igi~G~SkGaelALllAs~~~-~i~avVa~~ps~~ 57 (213)
T PF08840_consen 22 DKIGIIGISKGAELALLLASRFP-QISAVVAISPSSV 57 (213)
T ss_dssp SSEEEEEETHHHHHHHHHHHHSS-SEEEEEEES--SB
T ss_pred CCEEEEEECHHHHHHHHHHhcCC-CccEEEEeCCcee
Confidence 48999999999999999999999 6887776665553
No 220
>cd00519 Lipase_3 Lipase (class 3). Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation," the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site.
Probab=93.12 E-value=0.13 Score=54.17 Aligned_cols=42 Identities=26% Similarity=0.220 Sum_probs=29.3
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHh
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYK 869 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~ 869 (1042)
...+..++...+..+..++ ++.++++.|||.||++|+.+...
T Consensus 107 ~~~~~~~~~~~~~~~~~~~---p~~~i~vtGHSLGGaiA~l~a~~ 148 (229)
T cd00519 107 YKSLYNQVLPELKSALKQY---PDYKIIVTGHSLGGALASLLALD 148 (229)
T ss_pred HHHHHHHHHHHHHHHHhhC---CCceEEEEccCHHHHHHHHHHHH
Confidence 3344555566666655554 34689999999999999876653
No 221
>smart00824 PKS_TE Thioesterase. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates. There are also modules within the peptide synthetases that also share this similarity. With respect to antibiotic production, thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Thioesterases (non-integrated) have molecular masses of 25-29 kDa.
Probab=93.10 E-value=0.39 Score=49.36 Aligned_cols=71 Identities=15% Similarity=0.083 Sum_probs=45.0
Q ss_pred CCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHh---CC
Q psy3685 795 NALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYK---YP 871 (1042)
Q Consensus 795 ~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~---yP 871 (1042)
...++.++.+++|.+.+.. .+.+..++++...+. ... ...|++++|+|+||.+|..+... .+
T Consensus 25 ~~~v~~~~~~g~~~~~~~~---------~~~~~~~~~~~~~l~---~~~---~~~~~~l~g~s~Gg~~a~~~a~~l~~~~ 89 (212)
T smart00824 25 RRDVSALPLPGFGPGEPLP---------ASADALVEAQAEAVL---RAA---GGRPFVLVGHSSGGLLAHAVAARLEARG 89 (212)
T ss_pred CccEEEecCCCCCCCCCCC---------CCHHHHHHHHHHHHH---Hhc---CCCCeEEEEECHHHHHHHHHHHHHHhCC
Confidence 3578999999988764432 134444444444333 222 23589999999999999777765 34
Q ss_pred CeEEEEEee
Q psy3685 872 HLVHGAMSA 880 (1042)
Q Consensus 872 ~~~~g~i~s 880 (1042)
+.+.+.+..
T Consensus 90 ~~~~~l~~~ 98 (212)
T smart00824 90 IPPAAVVLL 98 (212)
T ss_pred CCCcEEEEE
Confidence 556555544
No 222
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.00 E-value=0.04 Score=35.78 Aligned_cols=22 Identities=45% Similarity=0.853 Sum_probs=15.1
Q ss_pred ccCCccccccCCchhHHHHhhh
Q psy3685 586 YKCDLCSKKFSRKLNLTKHMKI 607 (1042)
Q Consensus 586 ~~C~~C~k~F~~~~~L~~H~~~ 607 (1042)
|.|.+|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777654
No 223
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=92.98 E-value=0.028 Score=36.50 Aligned_cols=22 Identities=36% Similarity=0.807 Sum_probs=20.5
Q ss_pred ccCCCCCCccCCcchHHHHHhh
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEI 704 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~ 704 (1042)
|.|+.|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 7899999999999999999875
No 224
>PF05057 DUF676: Putative serine esterase (DUF676); InterPro: IPR007751 This domain, whose function is unknown, is found within a group of putative lipases.
Probab=92.85 E-value=0.39 Score=49.97 Aligned_cols=44 Identities=20% Similarity=0.040 Sum_probs=28.7
Q ss_pred CHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH
Q psy3685 824 SAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY 868 (1042)
Q Consensus 824 t~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~ 868 (1042)
.++.....+++.|.......... ..|++.+|||+||.++.++..
T Consensus 54 gI~~~g~rL~~eI~~~~~~~~~~-~~~IsfIgHSLGGli~r~al~ 97 (217)
T PF05057_consen 54 GIDVCGERLAEEILEHIKDYESK-IRKISFIGHSLGGLIARYALG 97 (217)
T ss_pred hhHHHHHHHHHHHHHhccccccc-cccceEEEecccHHHHHHHHH
Confidence 45555556665555544443221 258999999999999987654
No 225
>PLN02571 triacylglycerol lipase
Probab=92.67 E-value=0.29 Score=54.89 Aligned_cols=40 Identities=18% Similarity=0.180 Sum_probs=29.7
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHh
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYK 869 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~ 869 (1042)
-+|+++++..+++....+ ..++++.|||+||+||+.++..
T Consensus 207 r~qvl~eV~~L~~~y~~e-----~~sI~VTGHSLGGALAtLaA~d 246 (413)
T PLN02571 207 RDQVLNEVGRLVEKYKDE-----EISITICGHSLGAALATLNAVD 246 (413)
T ss_pred HHHHHHHHHHHHHhcCcc-----cccEEEeccchHHHHHHHHHHH
Confidence 377888887777544321 2479999999999999987753
No 226
>COG1505 Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]
Probab=92.67 E-value=0.079 Score=60.72 Aligned_cols=116 Identities=17% Similarity=0.124 Sum_probs=81.4
Q ss_pred CCcEEEEE-CCCCCC-cccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhh
Q psy3685 764 GGPIFLMI-GGEGEA-SAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNT 841 (1042)
Q Consensus 764 ~~pi~~~~-gg~g~~-~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~ 841 (1042)
.+|.+||- ||.... .+ .+.+.+.-|-++ |+.-|..-.||=|+=-|--.... .-.+=+.+.+|++.+.+.|.+
T Consensus 420 ~~pTll~aYGGF~vsltP--~fs~~~~~WLer-Gg~~v~ANIRGGGEfGp~WH~Aa---~k~nrq~vfdDf~AVaedLi~ 493 (648)
T COG1505 420 ENPTLLYAYGGFNISLTP--RFSGSRKLWLER-GGVFVLANIRGGGEFGPEWHQAG---MKENKQNVFDDFIAVAEDLIK 493 (648)
T ss_pred CCceEEEeccccccccCC--ccchhhHHHHhc-CCeEEEEecccCCccCHHHHHHH---hhhcchhhhHHHHHHHHHHHH
Confidence 58999998 887643 33 344556556665 88888889999777655321000 112334478999999988766
Q ss_pred hcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 842 LYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 842 ~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
.- ...-.++-+.|||=||.|.+-..-.+|+++.|+|.-.+.+.+
T Consensus 494 rg-itspe~lgi~GgSNGGLLvg~alTQrPelfgA~v~evPllDM 537 (648)
T COG1505 494 RG-ITSPEKLGIQGGSNGGLLVGAALTQRPELFGAAVCEVPLLDM 537 (648)
T ss_pred hC-CCCHHHhhhccCCCCceEEEeeeccChhhhCceeeccchhhh
Confidence 52 222237889999999999999999999999998887755443
No 227
>PLN02162 triacylglycerol lipase
Probab=92.56 E-value=0.3 Score=55.12 Aligned_cols=50 Identities=28% Similarity=0.299 Sum_probs=35.1
Q ss_pred CCCcEEEEccchhHHHHHHHHH----h-C---CCeEEEEEeeccccccccCChhhHHH
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRY----K-Y---PHLVHGAMSASGPLRAVVDFPEYFGV 896 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~----~-y---P~~~~g~i~ssa~~~~~~~~~~~~~~ 896 (1042)
++.++++.|||.||+||+.++. . . ++.+.+.+..++|--...+|.+|++.
T Consensus 276 p~~kliVTGHSLGGALAtLaAa~L~~~~~~~l~~~~~~vYTFGqPRVGn~~FA~~~~~ 333 (475)
T PLN02162 276 KNLKYILTGHSLGGALAALFPAILAIHGEDELLDKLEGIYTFGQPRVGDEDFGEFMKG 333 (475)
T ss_pred CCceEEEEecChHHHHHHHHHHHHHHccccccccccceEEEeCCCCccCHHHHHHHHh
Confidence 3468999999999999988642 1 2 22345678888887766666666554
No 228
>PLN02454 triacylglycerol lipase
Probab=92.37 E-value=0.28 Score=54.92 Aligned_cols=68 Identities=18% Similarity=0.180 Sum_probs=42.0
Q ss_pred HHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHh-----C--CCeEEEEEeeccccccccCChhhHH
Q psy3685 827 QALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYK-----Y--PHLVHGAMSASGPLRAVVDFPEYFG 895 (1042)
Q Consensus 827 q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~-----y--P~~~~g~i~ssa~~~~~~~~~~~~~ 895 (1042)
.+.+++...|+.+...|.. ...++++.|||+||+||+.++.. + ++.-..+|..++|-.....|.++++
T Consensus 207 S~r~qvl~~V~~l~~~Yp~-~~~sI~vTGHSLGGALAtLaA~di~~~g~~~~~~~V~~~TFGsPRVGN~~Fa~~~~ 281 (414)
T PLN02454 207 SARSQLLAKIKELLERYKD-EKLSIVLTGHSLGASLATLAAFDIVENGVSGADIPVTAIVFGSPQVGNKEFNDRFK 281 (414)
T ss_pred HHHHHHHHHHHHHHHhCCC-CCceEEEEecCHHHHHHHHHHHHHHHhcccccCCceEEEEeCCCcccCHHHHHHHH
Confidence 3555566666666666632 11359999999999999988743 1 2222334666666655555555544
No 229
>KOG3101|consensus
Probab=92.16 E-value=0.14 Score=50.53 Aligned_cols=124 Identities=21% Similarity=0.240 Sum_probs=69.9
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCC----cccccCCCHHH-HHhH----HHHH
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLS----VESLQYLSAEQ-ALAD----LAYF 835 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~----~~~~~~lt~~q-~l~D----~~~~ 835 (1042)
-|+++|+.|..-....++....+.+-|.+.|-.||+.|--=.|--.-.++-| .+-=-|++.-| -.+. +.++
T Consensus 44 ~P~lf~LSGLTCT~~Nfi~Ksg~qq~As~hgl~vV~PDTSPRG~~v~g~~eswDFG~GAGFYvnAt~epw~~~yrMYdYv 123 (283)
T KOG3101|consen 44 CPVLFYLSGLTCTHENFIEKSGFQQQASKHGLAVVAPDTSPRGVEVAGDDESWDFGQGAGFYVNATQEPWAKHYRMYDYV 123 (283)
T ss_pred CceEEEecCCcccchhhHhhhhHHHhHhhcCeEEECCCCCCCccccCCCcccccccCCceeEEecccchHhhhhhHHHHH
Confidence 5899999997655555554455567777889899988864444332222111 11112331111 1111 1223
Q ss_pred HHHHhhhc-----CCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccccCC
Q psy3685 836 ITSMNTLY-----SLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVVDF 890 (1042)
Q Consensus 836 ~~~~~~~~-----~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~~~ 890 (1042)
.+++-+.+ .++ -.++-+|||||||-=|....+|.|..+.. |+.=||+-.-.+-
T Consensus 124 ~kELp~~l~~~~~pld-~~k~~IfGHSMGGhGAl~~~Lkn~~kykS-vSAFAPI~NP~~c 181 (283)
T KOG3101|consen 124 VKELPQLLNSANVPLD-PLKVGIFGHSMGGHGALTIYLKNPSKYKS-VSAFAPICNPINC 181 (283)
T ss_pred HHHHHHHhcccccccc-chhcceeccccCCCceEEEEEcCcccccc-eeccccccCcccC
Confidence 33333222 121 13688999999999888889999996654 4444566444443
No 230
>PF02450 LCAT: Lecithin:cholesterol acyltransferase; InterPro: IPR003386 Lecithin:cholesterol acyltransferase (LACT), also known as phosphatidylcholine-sterol acyltransferase (2.3.1.43 from EC), is involved in extracellular metabolism of plasma lipoproteins, including cholesterol. It esterifies the free cholesterol transported in plasma lipoproteins, and is activated by apolipoprotein A-I. Defects in LACT cause Norum and Fish eye diseases. This family also includes phospholipid:diacylglycerol acyltransferase (PDAT)(2.3.1.158 from EC), which is involved in triacylglycerol formation by an acyl-CoA independent pathway. The enzyme specifically transfers acyl groups from the sn-2 position of a phospholipid to diacylglycerol, thus forming an sn-1-lysophospholipid [].; GO: 0008374 O-acyltransferase activity, 0006629 lipid metabolic process
Probab=92.13 E-value=0.21 Score=57.13 Aligned_cols=59 Identities=15% Similarity=0.210 Sum_probs=46.5
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC------eEEEEEeeccccccc
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH------LVHGAMSASGPLRAV 887 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~------~~~g~i~ssa~~~~~ 887 (1042)
.++...++...|+.+.... +.||+|+||||||.++.+|....+. .|.+.|+.|+|....
T Consensus 99 ~~~~~~~lk~~ie~~~~~~----~~kv~li~HSmGgl~~~~fl~~~~~~~W~~~~i~~~i~i~~p~~Gs 163 (389)
T PF02450_consen 99 RDEYFTKLKQLIEEAYKKN----GKKVVLIAHSMGGLVARYFLQWMPQEEWKDKYIKRFISIGTPFGGS 163 (389)
T ss_pred HHHHHHHHHHHHHHHHHhc----CCcEEEEEeCCCchHHHHHHHhccchhhHHhhhhEEEEeCCCCCCC
Confidence 4566677777777765432 3599999999999999999988864 488999999888654
No 231
>COG2382 Fes Enterochelin esterase and related enzymes [Inorganic ion transport and metabolism]
Probab=92.03 E-value=0.42 Score=50.77 Aligned_cols=124 Identities=15% Similarity=0.157 Sum_probs=72.1
Q ss_pred EEEEEecccccC--CCcEEEEECCCCCCcccccccc----cHHHHHHH---cCCeEEEEeeeecccccCCCCCCcc-ccc
Q psy3685 752 QRYFVNSTWYKH--GGPIFLMIGGEGEASAKWMVEG----TWLDYAHN---HNALAVQVEHRFYGKSHPLSDLSVE-SLQ 821 (1042)
Q Consensus 752 qry~~~~~~~~~--~~pi~~~~gg~g~~~~~~~~~~----~~~~la~~---~~a~~~~~ehR~~G~S~p~~~~~~~-~~~ 821 (1042)
.++++-...|.+ .-||++...|.- |...| .+..|..+ -.+++|.+|- .|.... ..-
T Consensus 83 ~~vv~lppgy~~~~k~pvl~~~DG~~-----~~~~g~i~~~~dsli~~g~i~pai~vgid~---------~d~~~R~~~~ 148 (299)
T COG2382 83 RRVVYLPPGYNPLEKYPVLYLQDGQD-----WFRSGRIPRILDSLIAAGEIPPAILVGIDY---------IDVKKRREEL 148 (299)
T ss_pred eEEEEeCCCCCccccccEEEEeccHH-----HHhcCChHHHHHHHHHcCCCCCceEEecCC---------CCHHHHHHHh
Confidence 466666666654 357888888762 22222 22233222 3566666551 111111 111
Q ss_pred CCCHHHHHhHHHHHHHHHhhhcCCCCC-CcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccccC
Q psy3685 822 YLSAEQALADLAYFITSMNTLYSLPAH-TKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAVVD 889 (1042)
Q Consensus 822 ~lt~~q~l~D~~~~~~~~~~~~~~~~~-~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~~~ 889 (1042)
+...+...+=..+++-+++..|+.... ..-+|.|-|+||.+|.+..+.||+.|-=+++.|+.+...++
T Consensus 149 ~~n~~~~~~L~~eLlP~v~~~yp~~~~a~~r~L~G~SlGG~vsL~agl~~Pe~FG~V~s~Sps~~~~~~ 217 (299)
T COG2382 149 HCNEAYWRFLAQELLPYVEERYPTSADADGRVLAGDSLGGLVSLYAGLRHPERFGHVLSQSGSFWWTPL 217 (299)
T ss_pred cccHHHHHHHHHHhhhhhhccCcccccCCCcEEeccccccHHHHHHHhcCchhhceeeccCCccccCcc
Confidence 222222222233455666666654221 24689999999999999999999999888888888876654
No 232
>PF03583 LIP: Secretory lipase ; InterPro: IPR005152 This entry represents a family of secreted lipases. Family members include the LIP lipases from Candida albicans, which are expressed and secreted during the infection cycle of these pathogens [].; GO: 0004806 triglyceride lipase activity, 0016042 lipid catabolic process
Probab=91.87 E-value=0.67 Score=50.58 Aligned_cols=88 Identities=19% Similarity=0.206 Sum_probs=57.4
Q ss_pred HHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHH
Q psy3685 788 LDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLR 867 (1042)
Q Consensus 788 ~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~ 867 (1042)
..+.. .|..|++-|+-|.|. |+-+- -+.-.++-|...-.+.+....++..+.+|+++|.|=||.=+.|.+
T Consensus 20 ~~~L~-~GyaVv~pDY~Glg~--~y~~~-------~~~a~avLD~vRAA~~~~~~~gl~~~~~v~l~GySqGG~Aa~~AA 89 (290)
T PF03583_consen 20 AAWLA-RGYAVVAPDYEGLGT--PYLNG-------RSEAYAVLDAVRAARNLPPKLGLSPSSRVALWGYSQGGQAALWAA 89 (290)
T ss_pred HHHHH-CCCEEEecCCCCCCC--cccCc-------HhHHHHHHHHHHHHHhcccccCCCCCCCEEEEeeCccHHHHHHHH
Confidence 33444 499999999999887 43110 133345555555555444433444567999999999998887766
Q ss_pred H----hCCCe---EEEEEeeccccc
Q psy3685 868 Y----KYPHL---VHGAMSASGPLR 885 (1042)
Q Consensus 868 ~----~yP~~---~~g~i~ssa~~~ 885 (1042)
. --|++ +.|+++.+.|..
T Consensus 90 ~l~~~YApeL~~~l~Gaa~gg~~~d 114 (290)
T PF03583_consen 90 ELAPSYAPELNRDLVGAAAGGPPAD 114 (290)
T ss_pred HHhHHhCcccccceeEEeccCCccC
Confidence 3 34777 678887776653
No 233
>COG1075 LipA Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold [General function prediction only]
Probab=91.83 E-value=0.4 Score=53.57 Aligned_cols=38 Identities=16% Similarity=0.089 Sum_probs=33.2
Q ss_pred CcEEEEccchhHHHHHHHHHhCC--CeEEEEEeecccccc
Q psy3685 849 TKWIAFGGSYPGALAAWLRYKYP--HLVHGAMSASGPLRA 886 (1042)
Q Consensus 849 ~~~i~~G~Sygg~la~~~~~~yP--~~~~g~i~ssa~~~~ 886 (1042)
.++.++|||+||.++.|+...+| ..|...++-+.|-..
T Consensus 127 ~~v~LigHS~GG~~~ry~~~~~~~~~~V~~~~tl~tp~~G 166 (336)
T COG1075 127 KKVNLIGHSMGGLDSRYYLGVLGGANRVASVVTLGTPHHG 166 (336)
T ss_pred CceEEEeecccchhhHHHHhhcCccceEEEEEEeccCCCC
Confidence 58999999999999999999999 888888888776643
No 234
>PLN02719 triacylglycerol lipase
Probab=91.55 E-value=0.34 Score=55.33 Aligned_cols=69 Identities=22% Similarity=0.256 Sum_probs=39.6
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHh-----CC------CeEEEEEeeccccccccCChhh
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYK-----YP------HLVHGAMSASGPLRAVVDFPEY 893 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~-----yP------~~~~g~i~ssa~~~~~~~~~~~ 893 (1042)
-+|+++.+..+++..... . .++.++++.|||.||+||+..+.. ++ ..-..++..++|-....+|.++
T Consensus 276 ReQVl~eV~rL~~~Ypd~-~-ge~~sItVTGHSLGGALAtLaA~Dl~~~gln~~~~~~~~pVtvyTFGsPRVGN~~Fa~~ 353 (518)
T PLN02719 276 REQVLTEVKRLVERYGDE-E-GEELSITVTGHSLGGALAVLSAYDVAEMGLNRTRKGKVIPVTAFTYGGPRVGNIRFKER 353 (518)
T ss_pred HHHHHHHHHHHHHHCCcc-c-CCcceEEEecCcHHHHHHHHHHHHHHHhcccccccccccceEEEEecCCCccCHHHHHH
Confidence 355666655554433211 0 123479999999999999886632 11 1123356666666555555555
Q ss_pred HH
Q psy3685 894 FG 895 (1042)
Q Consensus 894 ~~ 895 (1042)
++
T Consensus 354 ~~ 355 (518)
T PLN02719 354 IE 355 (518)
T ss_pred HH
Confidence 54
No 235
>KOG2565|consensus
Probab=91.49 E-value=0.18 Score=54.31 Aligned_cols=109 Identities=17% Similarity=0.158 Sum_probs=70.2
Q ss_pred CcEEEEECCCCCCcccccccccH-----HHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTW-----LDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~-----~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
-|++|+-|=+|+...++.--.+. ......+-.-||+.-.+|||=|.-.... -++ | +-.|.+++.+
T Consensus 153 ~PlLl~HGwPGsv~EFykfIPlLT~p~~hg~~~d~~FEVI~PSlPGygwSd~~sk~------GFn---~-~a~ArvmrkL 222 (469)
T KOG2565|consen 153 KPLLLLHGWPGSVREFYKFIPLLTDPKRHGNESDYAFEVIAPSLPGYGWSDAPSKT------GFN---A-AATARVMRKL 222 (469)
T ss_pred cceEEecCCCchHHHHHhhhhhhcCccccCCccceeEEEeccCCCCcccCcCCccC------Ccc---H-HHHHHHHHHH
Confidence 47766667678766543210000 0122334567999999999999432211 111 2 2334444444
Q ss_pred hhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 840 NTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 840 ~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
-...+. .+.++=||-||..|++.++..||+.|.|..++-+++..
T Consensus 223 MlRLg~---nkffiqGgDwGSiI~snlasLyPenV~GlHlnm~~~~s 266 (469)
T KOG2565|consen 223 MLRLGY---NKFFIQGGDWGSIIGSNLASLYPENVLGLHLNMCFVNS 266 (469)
T ss_pred HHHhCc---ceeEeecCchHHHHHHHHHhhcchhhhHhhhcccccCC
Confidence 333333 48999999999999999999999999999988777754
No 236
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=91.41 E-value=0.14 Score=53.67 Aligned_cols=117 Identities=20% Similarity=0.328 Sum_probs=71.1
Q ss_pred cccCCc--cccccCCchhHHHHhhhcCCCcccCCCCCcccccCCCCcccccCCcCcccCCccccccCCCccccccccccc
Q psy3685 585 RYKCDL--CSKKFSRKLNLTKHMKIHQGIRYKCDLCSKEFSRKDIPYEVTVKEESYEGDSINVEENNGNDAVTRDCLEFF 662 (1042)
Q Consensus 585 ~~~C~~--C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~l~~~~~~~~~~~c~~~~~~~~~~~~~~~~~C~~~f 662 (1042)
.|.|+. |.........|+.|.+.-|+ .+-|.+|-+. + +.|.|. -..|
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~-~~~C~~C~~n---K----------k~F~~E-----------------~~lF 199 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG-FVLCSECIGN---K----------KDFWNE-----------------IRLF 199 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC-cEEhHhhhcC---c----------ccCccc-----------------eeee
Confidence 366643 55555556677778777665 3456666432 0 112222 1233
Q ss_pred ccccccccccccccccCCccc----cCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhh
Q psy3685 663 KTEQNTTLGKLNKILHQDIRY----KCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQK 738 (1042)
Q Consensus 663 ~~~~~~~l~~h~~~h~~~~~~----~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~ 738 (1042)
++ +.|..|...-..+.-| .|..|.+-|-.-..|.+|+|..+ |+-+.|++.+.+.+.. ..-|+|.
T Consensus 200 ~~---~~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QY--------FK~Y~~L 267 (493)
T COG5236 200 RS---STLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRH-EACHICDMVGPIRYQY--------FKSYEDL 267 (493)
T ss_pred ec---ccccccccCCccccCcCCCchhhhccceecChHHHHHHHHhhh-hhhhhhhccCccchhh--------hhCHHHH
Confidence 33 5666666554444334 59999999999999999999643 6778888776321111 2226889
Q ss_pred cccCCC
Q psy3685 739 LDHFNI 744 (1042)
Q Consensus 739 ~dhf~~ 744 (1042)
-|||-.
T Consensus 268 e~HF~~ 273 (493)
T COG5236 268 EAHFRN 273 (493)
T ss_pred HHHhhc
Confidence 999963
No 237
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=91.34 E-value=0.2 Score=52.54 Aligned_cols=127 Identities=24% Similarity=0.491 Sum_probs=81.3
Q ss_pred ccccCC--CCCCCCChhHHHHHHHhcCCCccccCCCC---ccc------ccccchhhhhccccCc-c----ccCCCcCcc
Q psy3685 143 RYKCDL--CSKDFSYKSSLIEHKKLHQGIRYKCNLCS---KEY------FFKRNLTEHKKLHQGI-R----YNCDLCSKE 206 (1042)
Q Consensus 143 ~~~C~~--C~~~F~~~~~L~~H~~~h~~~~~~C~~C~---~~f------~~~~~L~~H~~~h~~~-~----~~C~~C~~~ 206 (1042)
.|.|+. |..+......|+.|.+..++ .+-|.+|- +.| .++..|+.|...-..+ - -.|..|...
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~-~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG-FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC-cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence 577864 77777777888888887655 35666663 233 3566677776543221 2 358888888
Q ss_pred cCCcchhHhHhhhhcCccccccccccCCcccCCccccc-------cCCcchHHHHhhhcCCCccccCC--Cc----cccC
Q psy3685 207 FSRKDNLTDHKKMHQGNLTKHMKLHQGIRYKCDLCLKE-------FSRKDNLTEHKEIHQGIRYKCDL--CS----KKFS 273 (1042)
Q Consensus 207 f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~C~~C~~~-------f~~~~~L~~H~~~h~~~~~~C~~--C~----~~f~ 273 (1042)
|-+-+.|..|++..|. .|-+|++. |.+..+|..|.+.-+ |.|.. |- ..|.
T Consensus 230 FYdDDEL~~HcR~~HE--------------~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h---y~ct~qtc~~~k~~vf~ 292 (493)
T COG5236 230 FYDDDELRRHCRLRHE--------------ACHICDMVGPIRYQYFKSYEDLEAHFRNAH---YCCTFQTCRVGKCYVFP 292 (493)
T ss_pred ecChHHHHHHHHhhhh--------------hhhhhhccCccchhhhhCHHHHHHHhhcCc---eEEEEEEEecCcEEEec
Confidence 8888888888875442 36666653 677777888865432 44422 21 3577
Q ss_pred CchHHHHHHHhhcC
Q psy3685 274 RKLNLTKHMKIHQG 287 (1042)
Q Consensus 274 ~~~~L~~H~~~h~~ 287 (1042)
....|..|+..-++
T Consensus 293 ~~~el~~h~~~~h~ 306 (493)
T COG5236 293 YHTELLEHLTRFHK 306 (493)
T ss_pred cHHHHHHHHHHHhh
Confidence 78888888755443
No 238
>PLN02753 triacylglycerol lipase
Probab=91.28 E-value=0.44 Score=54.66 Aligned_cols=69 Identities=20% Similarity=0.222 Sum_probs=42.1
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHh-----CC------CeEEEEEeeccccccccCChhh
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYK-----YP------HLVHGAMSASGPLRAVVDFPEY 893 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~-----yP------~~~~g~i~ssa~~~~~~~~~~~ 893 (1042)
.+|+++.+..++...+.+ ..++.++++.|||.||+||+..+.- ++ ..-..++..++|--....|..+
T Consensus 290 reQVl~eVkrLl~~Y~~e--~~~~~sItVTGHSLGGALAtLaA~Dla~~g~n~~~~~~~~pV~vyTFGsPRVGN~aFA~~ 367 (531)
T PLN02753 290 REQILTEVKRLVEEHGDD--DDSDLSITVTGHSLGGALAILSAYDIAEMGLNRSKKGKVIPVTVLTYGGPRVGNVRFKDR 367 (531)
T ss_pred HHHHHHHHHHHHHHcccc--cCCCceEEEEccCHHHHHHHHHHHHHHHhcccccccCccCceEEEEeCCCCccCHHHHHH
Confidence 466677766666544321 0123589999999999999887642 21 1112366666666655556555
Q ss_pred HH
Q psy3685 894 FG 895 (1042)
Q Consensus 894 ~~ 895 (1042)
++
T Consensus 368 ~~ 369 (531)
T PLN02753 368 ME 369 (531)
T ss_pred HH
Confidence 54
No 239
>PLN02408 phospholipase A1
Probab=90.49 E-value=0.42 Score=52.90 Aligned_cols=64 Identities=20% Similarity=0.245 Sum_probs=37.9
Q ss_pred HHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH----hCCCe-EEEEEeeccccccccCChhhH
Q psy3685 826 EQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY----KYPHL-VHGAMSASGPLRAVVDFPEYF 894 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~----~yP~~-~~g~i~ssa~~~~~~~~~~~~ 894 (1042)
+|+++.+..+++ +|.. ...++++.|||.||+||+..+. .+++. ...+++.++|-.....|.+++
T Consensus 182 ~qVl~eI~~ll~----~y~~-~~~sI~vTGHSLGGALAtLaA~dl~~~~~~~~~V~v~tFGsPRVGN~~Fa~~~ 250 (365)
T PLN02408 182 EMVREEIARLLQ----SYGD-EPLSLTITGHSLGAALATLTAYDIKTTFKRAPMVTVISFGGPRVGNRSFRRQL 250 (365)
T ss_pred HHHHHHHHHHHH----hcCC-CCceEEEeccchHHHHHHHHHHHHHHhcCCCCceEEEEcCCCCcccHHHHHHH
Confidence 455666555554 3321 1236999999999999977664 33332 233566666655544444444
No 240
>PF11187 DUF2974: Protein of unknown function (DUF2974); InterPro: IPR024499 This family of proteins has no known function.
Probab=90.40 E-value=0.51 Score=49.08 Aligned_cols=49 Identities=20% Similarity=0.276 Sum_probs=34.2
Q ss_pred HHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCC----CeEEEEEeeccc
Q psy3685 831 DLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYP----HLVHGAMSASGP 883 (1042)
Q Consensus 831 D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP----~~~~g~i~ssa~ 883 (1042)
+.+.++..+...+. .++++.|||.||+||.+....-+ +.|..+++.-+|
T Consensus 70 ~A~~yl~~~~~~~~----~~i~v~GHSkGGnLA~yaa~~~~~~~~~rI~~vy~fDgP 122 (224)
T PF11187_consen 70 SALAYLKKIAKKYP----GKIYVTGHSKGGNLAQYAAANCDDEIQDRISKVYSFDGP 122 (224)
T ss_pred HHHHHHHHHHHhCC----CCEEEEEechhhHHHHHHHHHccHHHhhheeEEEEeeCC
Confidence 44555666555553 26999999999999999988744 446666655544
No 241
>PLN02324 triacylglycerol lipase
Probab=90.32 E-value=0.58 Score=52.45 Aligned_cols=65 Identities=12% Similarity=0.118 Sum_probs=39.2
Q ss_pred HHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhC--------------CCeEEEEEeeccccccccCCh
Q psy3685 826 EQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKY--------------PHLVHGAMSASGPLRAVVDFP 891 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~y--------------P~~~~g~i~ssa~~~~~~~~~ 891 (1042)
+|+++++..++.. |.. .+..+++.|||.||+||+..+.-. +..-..++..++|--....|.
T Consensus 197 eqVl~eV~~L~~~----Yp~-e~~sItvTGHSLGGALAtLaA~dl~~~~~n~~~~~~~~~~~~V~v~TFGsPRVGN~~Fa 271 (415)
T PLN02324 197 EQVQGELKRLLEL----YKN-EEISITFTGHSLGAVMSVLSAADLVYGKKNKINISLQKKQVPITVFAFGSPRIGDHNFK 271 (415)
T ss_pred HHHHHHHHHHHHH----CCC-CCceEEEecCcHHHHHHHHHHHHHHHhcccccccccccCCCceEEEEecCCCcCCHHHH
Confidence 5666666665543 321 124799999999999998877421 112233556666655555555
Q ss_pred hhHH
Q psy3685 892 EYFG 895 (1042)
Q Consensus 892 ~~~~ 895 (1042)
++++
T Consensus 272 ~~~~ 275 (415)
T PLN02324 272 NLVD 275 (415)
T ss_pred HHHH
Confidence 5554
No 242
>COG3946 VirJ Type IV secretory pathway, VirJ component [Intracellular trafficking and secretion]
Probab=90.17 E-value=0.81 Score=50.22 Aligned_cols=59 Identities=19% Similarity=0.229 Sum_probs=44.8
Q ss_pred HHHHHHHcCCeEEEEee-eecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHH
Q psy3685 787 WLDYAHNHNALAVQVEH-RFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGAL 862 (1042)
Q Consensus 787 ~~~la~~~~a~~~~~eh-R~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~l 862 (1042)
...|+++ |.-||.+|- |||= .--|.||+.+|+..+|+++...++. .+++|+|-|+|.-+
T Consensus 280 ~~~l~~~-gvpVvGvdsLRYfW-------------~~rtPe~~a~Dl~r~i~~y~~~w~~---~~~~liGySfGADv 339 (456)
T COG3946 280 AEALQKQ-GVPVVGVDSLRYFW-------------SERTPEQIAADLSRLIRFYARRWGA---KRVLLIGYSFGADV 339 (456)
T ss_pred HHHHHHC-CCceeeeehhhhhh-------------ccCCHHHHHHHHHHHHHHHHHhhCc---ceEEEEeecccchh
Confidence 3345554 888888873 5542 2348999999999999999998843 58999999999743
No 243
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=89.94 E-value=0.25 Score=57.99 Aligned_cols=53 Identities=38% Similarity=0.625 Sum_probs=29.5
Q ss_pred ccccCCCccccccccchhhhhh--hccCc---ccccC--CCccccccccchhhhhhhccC
Q psy3685 8 GYKCDLCSKEFSRKYNLTDHKK--IHQGI---RYKCN--LCSKEFSRKYNLTDHKKIHQG 60 (1042)
Q Consensus 8 ~~~C~~C~~~f~~~~~L~~H~~--~h~~~---~~~C~--~C~k~f~~~~~L~~H~~~h~~ 60 (1042)
++.|..|...|.....|..|.+ .|+++ ++.|+ .|++.|.....+..|..+|.+
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 348 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS 348 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccC
Confidence 4555555555555555555555 45443 45555 455555555555555555554
No 244
>KOG2624|consensus
Probab=89.79 E-value=0.61 Score=52.66 Aligned_cols=118 Identities=16% Similarity=0.082 Sum_probs=83.5
Q ss_pred CCCcEEEEECCCCCCcccccccc----cHHHHHHHcCCeEEEEeeeecccccCCCCCCcc----cccCCCHHH-HHhHHH
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEG----TWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVE----SLQYLSAEQ-ALADLA 833 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~----~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~----~~~~lt~~q-~l~D~~ 833 (1042)
..+|+||+.+|--..+..|+.+| +...||++ |..|-..--||---|.---.++.. -|. .+.++ +..|+.
T Consensus 71 ~~rp~Vll~HGLl~sS~~Wv~n~p~~sLaf~Lada-GYDVWLgN~RGn~ySr~h~~l~~~~~~~FW~-FS~~Em~~yDLP 148 (403)
T KOG2624|consen 71 KKRPVVLLQHGLLASSSSWVLNGPEQSLAFLLADA-GYDVWLGNNRGNTYSRKHKKLSPSSDKEFWD-FSWHEMGTYDLP 148 (403)
T ss_pred CCCCcEEEeeccccccccceecCccccHHHHHHHc-CCceeeecCcCcccchhhcccCCcCCcceee-cchhhhhhcCHH
Confidence 45799999999987788887543 45567776 888888888875555332223321 122 24444 899999
Q ss_pred HHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC---eEEEEEeeccccc
Q psy3685 834 YFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH---LVHGAMSASGPLR 885 (1042)
Q Consensus 834 ~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~---~~~g~i~ssa~~~ 885 (1042)
+.|.++...- +..++..+|||=|++..-.+...-|+ .|..++|-++++-
T Consensus 149 A~IdyIL~~T---~~~kl~yvGHSQGtt~~fv~lS~~p~~~~kI~~~~aLAP~~~ 200 (403)
T KOG2624|consen 149 AMIDYILEKT---GQEKLHYVGHSQGTTTFFVMLSERPEYNKKIKSFIALAPAAF 200 (403)
T ss_pred HHHHHHHHhc---cccceEEEEEEccchhheehhcccchhhhhhheeeeecchhh
Confidence 9999987764 34589999999999887666665554 5777777776663
No 245
>PF01083 Cutinase: Cutinase; InterPro: IPR000675 Aerial plant organs are protected by a cuticle composed of an insoluble polymeric structural compound, cutin, which is a polyester composed of hydroxy and hydroxyepoxy fatty acids []. Plant pathogenic fungi produce extracellular degradative enzymes [] that play an important role in pathogenesis. They include cutinase, which hydrolyses cutin, facilitating fungus penetration through the cuticle. Inhibition of the enzyme can prevent fungal infection through intact cuticles. Cutin monomers released from the cuticle by small amounts of cutinase on fungal spore surfaces can greatly increase the amount of cutinase secreted by the spore, the mechanism for which process is as yet unknown [, ]. Cutinase is a serine esterase containing the classical Ser, His, Asp triad of serine hydrolases []. The protein belongs to the alpha-beta class, with a central beta-sheet of 5 parallel strands covered by 5 helices on either side of the sheet. The active site cleft is partly covered by 2 thin bridges formed by amino acid side chains, by contrast with the hydrophobic lid possessed by other lipases []. The protein also contains 2 disulphide bridges, which are essential for activity, their cleavage resulting in complete loss of enzymatic activity []. Two cutinase-like proteins (MtCY39.35 and MtCY339.08c) have been found in the genome of the bacteria Mycobacterium tuberculosis.; GO: 0016787 hydrolase activity, 0008152 metabolic process; PDB: 1XZK_A 1XZA_A 1CUD_C 1XZI_A 1XZH_A 1CUF_A 1FFD_A 2CUT_A 1FFA_A 1CUA_A ....
Probab=89.72 E-value=1.4 Score=44.24 Aligned_cols=109 Identities=17% Similarity=0.095 Sum_probs=68.5
Q ss_pred EEEECCCCCCcc-cccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCC-CHHHHHhHHHHHHHHHhhhcCC
Q psy3685 768 FLMIGGEGEASA-KWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYL-SAEQALADLAYFITSMNTLYSL 845 (1042)
Q Consensus 768 ~~~~gg~g~~~~-~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~l-t~~q~l~D~~~~~~~~~~~~~~ 845 (1042)
||+.-|.++... .+....+...|+...++..+.++.--|-.+.. ...|. +..+..+++...|+.+....
T Consensus 8 vi~aRGT~E~~g~~~~g~~~~~~l~~~~g~~~~~~~~V~YpA~~~-------~~~y~~S~~~G~~~~~~~i~~~~~~C-- 78 (179)
T PF01083_consen 8 VIFARGTGEPPGVGRVGPPFADALQAQPGGTSVAVQGVEYPASLG-------PNSYGDSVAAGVANLVRLIEEYAARC-- 78 (179)
T ss_dssp EEEE--TTSSTTTCCCHHHHHHHHHHHCTTCEEEEEE--S---SC-------GGSCHHHHHHHHHHHHHHHHHHHHHS--
T ss_pred EEEecCCCCCCCCccccHHHHHHHHhhcCCCeeEEEecCCCCCCC-------cccccccHHHHHHHHHHHHHHHHHhC--
Confidence 444455544322 22222344567777887777666333333211 11233 78888999999999887775
Q ss_pred CCCCcEEEEccchhHHHHHHHHHh------CCCeEEEEEeecccccc
Q psy3685 846 PAHTKWIAFGGSYPGALAAWLRYK------YPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 846 ~~~~~~i~~G~Sygg~la~~~~~~------yP~~~~g~i~ssa~~~~ 886 (1042)
++.++||.|-|-|++++..+... ..+.|.|+|..+-|...
T Consensus 79 -P~~kivl~GYSQGA~V~~~~~~~~~l~~~~~~~I~avvlfGdP~~~ 124 (179)
T PF01083_consen 79 -PNTKIVLAGYSQGAMVVGDALSGDGLPPDVADRIAAVVLFGDPRRG 124 (179)
T ss_dssp -TTSEEEEEEETHHHHHHHHHHHHTTSSHHHHHHEEEEEEES-TTTB
T ss_pred -CCCCEEEEecccccHHHHHHHHhccCChhhhhhEEEEEEecCCccc
Confidence 35799999999999999888655 55778888888877763
No 246
>COG2936 Predicted acyl esterases [General function prediction only]
Probab=89.59 E-value=0.54 Score=54.74 Aligned_cols=83 Identities=16% Similarity=0.015 Sum_probs=63.9
Q ss_pred cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCe
Q psy3685 794 HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHL 873 (1042)
Q Consensus 794 ~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~ 873 (1042)
.|++||..|-||-|.|.- .+.-+.. |-++|..+.|.++..+-= .|.+|-.+|-||+|....+.+...|--
T Consensus 79 ~GYavV~qDvRG~~~SeG-------~~~~~~~-~E~~Dg~D~I~Wia~QpW--sNG~Vgm~G~SY~g~tq~~~Aa~~pPa 148 (563)
T COG2936 79 QGYAVVNQDVRGRGGSEG-------VFDPESS-REAEDGYDTIEWLAKQPW--SNGNVGMLGLSYLGFTQLAAAALQPPA 148 (563)
T ss_pred CceEEEEecccccccCCc-------ccceecc-ccccchhHHHHHHHhCCc--cCCeeeeecccHHHHHHHHHHhcCCch
Confidence 499999999999999932 2222223 677999999999876421 345899999999999999999888887
Q ss_pred EEEEEeecccccc
Q psy3685 874 VHGAMSASGPLRA 886 (1042)
Q Consensus 874 ~~g~i~ssa~~~~ 886 (1042)
+.+++.-++.+..
T Consensus 149 Lkai~p~~~~~D~ 161 (563)
T COG2936 149 LKAIAPTEGLVDR 161 (563)
T ss_pred heeeccccccccc
Confidence 8887766666653
No 247
>COG4782 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=89.24 E-value=1.4 Score=47.99 Aligned_cols=96 Identities=11% Similarity=0.015 Sum_probs=59.4
Q ss_pred cCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcc-cccCCCHHHHHhHHHHHHHHHh
Q psy3685 762 KHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVE-SLQYLSAEQALADLAYFITSMN 840 (1042)
Q Consensus 762 ~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~-~~~~lt~~q~l~D~~~~~~~~~ 840 (1042)
..++-|+|+++|.+..-..-.+. ..+++...|+..+.+ +-|+|....-++ |..--+.+|+-.+++.+|+.+.
T Consensus 113 s~~k~vlvFvHGfNntf~dav~R--~aqI~~d~g~~~~pV-----vFSWPS~g~l~~Yn~DreS~~~Sr~aLe~~lr~La 185 (377)
T COG4782 113 SSAKTVLVFVHGFNNTFEDAVYR--TAQIVHDSGNDGVPV-----VFSWPSRGSLLGYNYDRESTNYSRPALERLLRYLA 185 (377)
T ss_pred cCCCeEEEEEcccCCchhHHHHH--HHHHHhhcCCCcceE-----EEEcCCCCeeeecccchhhhhhhHHHHHHHHHHHH
Confidence 35678999999987643322221 223333333332221 235564321111 1222388999999999999998
Q ss_pred hhcCCCCCCcEEEEccchhHHHHHHHH
Q psy3685 841 TLYSLPAHTKWIAFGGSYPGALAAWLR 867 (1042)
Q Consensus 841 ~~~~~~~~~~~i~~G~Sygg~la~~~~ 867 (1042)
.+-. ...+.+++||||+.|.+-+.
T Consensus 186 ~~~~---~~~I~ilAHSMGtwl~~e~L 209 (377)
T COG4782 186 TDKP---VKRIYLLAHSMGTWLLMEAL 209 (377)
T ss_pred hCCC---CceEEEEEecchHHHHHHHH
Confidence 8753 34799999999999987654
No 248
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.21 E-value=0.74 Score=41.17 Aligned_cols=107 Identities=21% Similarity=0.419 Sum_probs=56.2
Q ss_pred ccccCCCccccccccchhhhhhhccCcccccCCCccccccccchhhhhhhccC---CccccCccCccccCchhhhhHhhh
Q psy3685 8 GYKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSKEFSRKYNLTDHKKIHQG---IRYKCNLCSKEFSRNCYLTEHMEQ 84 (1042)
Q Consensus 8 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k~f~~~~~L~~H~~~h~~---~~~~C~~C~~~f~~~~~L~~H~~~ 84 (1042)
+|.|+.|+-..- ..|-.|++|+-...+..+|.+-. |+- .+|. .. .+.
T Consensus 1 GY~CPrC~skvC-------------~LP~~CpiCgLtLVss~HLARSy--HHLfPl~~f~---------ev-~~~----- 50 (112)
T TIGR00622 1 GYFCPQCRAKVC-------------ELPVECPICGLTLILSTHLARSY--HHLFPLKAFQ---------EI-PLE----- 50 (112)
T ss_pred CccCCCCCCCcc-------------CCCCcCCcCCCEEeccchHHHhh--hccCCCcccc---------cc-ccc-----
Confidence 478888854332 23778888888887777776542 332 1221 00 000
Q ss_pred ccCCcccCCCCCccccCcchhhhhhhhccCCccccCccCccccCchhHHHHHhHhcCCccccCCCC
Q psy3685 85 HQGIRYKCDLCSKDFSYKSSLIEHKKLHQGIRYKCNLCSKEFSRNCYLTEHMEQHQGIRYKCDLCS 150 (1042)
Q Consensus 85 H~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~F~~~~~L~~H~~~h~~~~~~C~~C~ 150 (1042)
.....-.|--|.+.|........= ..-...+|+|+.|...|--.-+.-.|...|. |+-|.
T Consensus 51 ~~~~~~~C~~C~~~f~~~~~~~~~-~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~-----CPGC~ 110 (112)
T TIGR00622 51 EYNGSRFCFGCQGPFPKPPVSPFD-ELKDSHRYVCAVCKNVFCVDCDVFVHESLHC-----CPGCI 110 (112)
T ss_pred ccCCCCcccCcCCCCCCccccccc-ccccccceeCCCCCCccccccchhhhhhccC-----CcCCC
Confidence 000112366666666554311100 0112236777777777777777777765553 66554
No 249
>PLN02934 triacylglycerol lipase
Probab=89.17 E-value=0.83 Score=52.30 Aligned_cols=50 Identities=18% Similarity=0.218 Sum_probs=34.9
Q ss_pred CCCcEEEEccchhHHHHHHHHHh----C----CCeEEEEEeeccccccccCChhhHHH
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRYK----Y----PHLVHGAMSASGPLRAVVDFPEYFGV 896 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~~----y----P~~~~g~i~ssa~~~~~~~~~~~~~~ 896 (1042)
++.++++.|||.||+||+.++.. . +..+...+..++|--...+|..|++.
T Consensus 319 p~~kIvVTGHSLGGALAtLaA~~L~l~~~~~~l~~~~~vYTFGsPRVGN~~FA~~~~~ 376 (515)
T PLN02934 319 KNAKFVVTGHSLGGALAILFPTVLVLQEETEVMKRLLGVYTFGQPRIGNRQLGKFMEA 376 (515)
T ss_pred CCCeEEEeccccHHHHHHHHHHHHHHhcccccccCceEEEEeCCCCccCHHHHHHHHH
Confidence 44689999999999999888621 1 12345677888887766666555544
No 250
>PLN02761 lipase class 3 family protein
Probab=89.17 E-value=0.81 Score=52.56 Aligned_cols=70 Identities=23% Similarity=0.186 Sum_probs=39.7
Q ss_pred HHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHh-----C-------CCeEEEEEeeccccccccCChhh
Q psy3685 826 EQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYK-----Y-------PHLVHGAMSASGPLRAVVDFPEY 893 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~-----y-------P~~~~g~i~ssa~~~~~~~~~~~ 893 (1042)
+|+++.+..++........ .++.++++.|||.||+||+..+.- + +..-..+++.++|=-...+|.++
T Consensus 272 ~qVl~eV~rL~~~Y~~~~k-~e~~sItVTGHSLGGALAtLaA~DIa~~gln~~~~~~~~~PVtv~TFGsPRVGN~~FA~~ 350 (527)
T PLN02761 272 EQVLAEVKRLVEYYGTEEE-GHEISITVTGHSLGASLALVSAYDIAELNLNHVPENNYKIPITVFSFSGPRVGNLRFKER 350 (527)
T ss_pred HHHHHHHHHHHHhcccccC-CCCceEEEeccchHHHHHHHHHHHHHHhccccccccccCCceEEEEcCCCCcCCHHHHHH
Confidence 5666666555543321111 133579999999999999877731 1 11112355666665555555555
Q ss_pred HHH
Q psy3685 894 FGV 896 (1042)
Q Consensus 894 ~~~ 896 (1042)
++.
T Consensus 351 ~d~ 353 (527)
T PLN02761 351 CDE 353 (527)
T ss_pred HHh
Confidence 543
No 251
>COG3150 Predicted esterase [General function prediction only]
Probab=88.97 E-value=1.2 Score=42.86 Aligned_cols=79 Identities=20% Similarity=0.206 Sum_probs=51.7
Q ss_pred EEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCC
Q psy3685 768 FLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPA 847 (1042)
Q Consensus 768 ~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~ 847 (1042)
|||++|..+ ++..-.. .+-.+ -+.-+.|..+.|.|... -.+.||++-+...|.....+
T Consensus 2 ilYlHGFnS-SP~shka----~l~~q----~~~~~~~~i~y~~p~l~--------h~p~~a~~ele~~i~~~~~~----- 59 (191)
T COG3150 2 ILYLHGFNS-SPGSHKA----VLLLQ----FIDEDVRDIEYSTPHLP--------HDPQQALKELEKAVQELGDE----- 59 (191)
T ss_pred eEEEecCCC-CcccHHH----HHHHH----HHhccccceeeecCCCC--------CCHHHHHHHHHHHHHHcCCC-----
Confidence 899999876 4422211 11111 13345566677766431 25788988888888765432
Q ss_pred CCcEEEEccchhHHHHHHHHHhC
Q psy3685 848 HTKWIAFGGSYPGALAAWLRYKY 870 (1042)
Q Consensus 848 ~~~~i~~G~Sygg~la~~~~~~y 870 (1042)
...++|-|.||-.|.|+...+
T Consensus 60 --~p~ivGssLGGY~At~l~~~~ 80 (191)
T COG3150 60 --SPLIVGSSLGGYYATWLGFLC 80 (191)
T ss_pred --CceEEeecchHHHHHHHHHHh
Confidence 367899999999999998765
No 252
>KOG3967|consensus
Probab=88.82 E-value=3.8 Score=40.83 Aligned_cols=99 Identities=21% Similarity=0.362 Sum_probs=55.8
Q ss_pred cccCCCcEEEEECCCCCCcc-cccc---------ccc---HHHHHHHcCCeEEEEee----eecc-cccCCCCCCccccc
Q psy3685 760 WYKHGGPIFLMIGGEGEASA-KWMV---------EGT---WLDYAHNHNALAVQVEH----RFYG-KSHPLSDLSVESLQ 821 (1042)
Q Consensus 760 ~~~~~~pi~~~~gg~g~~~~-~~~~---------~~~---~~~la~~~~a~~~~~eh----R~~G-~S~p~~~~~~~~~~ 821 (1042)
.-....-++|+|+|.|.+.. .|.. .|. +..-|-+.|.-||.+.- |||- +-.| +.
T Consensus 96 ~lt~~~kLlVLIHGSGvVrAGQWARrLIIN~~Ld~GTQiPyi~rAv~~Gygviv~N~N~~~kfye~k~np--------~k 167 (297)
T KOG3967|consen 96 ALTNPQKLLVLIHGSGVVRAGQWARRLIINEDLDSGTQIPYIKRAVAEGYGVIVLNPNRERKFYEKKRNP--------QK 167 (297)
T ss_pred HhcCccceEEEEecCceEecchHhhhhhhccccccCCcChHHHHHHHcCCcEEEeCCchhhhhhhcccCc--------ch
Confidence 33434458889999886542 2321 222 33346666777777643 3552 2222 33
Q ss_pred CC--CHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCe
Q psy3685 822 YL--SAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHL 873 (1042)
Q Consensus 822 ~l--t~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~ 873 (1042)
|. .++-|+.=..++|.-. .-..++++.|||||.+.+-+..++|+.
T Consensus 168 yirt~veh~~yvw~~~v~pa-------~~~sv~vvahsyGG~~t~~l~~~f~~d 214 (297)
T KOG3967|consen 168 YIRTPVEHAKYVWKNIVLPA-------KAESVFVVAHSYGGSLTLDLVERFPDD 214 (297)
T ss_pred hccchHHHHHHHHHHHhccc-------CcceEEEEEeccCChhHHHHHHhcCCc
Confidence 44 2233322222222111 113589999999999999999999975
No 253
>KOG2369|consensus
Probab=88.72 E-value=0.25 Score=55.41 Aligned_cols=59 Identities=17% Similarity=0.150 Sum_probs=45.6
Q ss_pred CHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCe--------EEEEEeeccccc
Q psy3685 824 SAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHL--------VHGAMSASGPLR 885 (1042)
Q Consensus 824 t~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~--------~~g~i~ssa~~~ 885 (1042)
-.+|.+.+++.+|+.+-+.. +..|+|++||||||.+..+|-..+|+. +.+.|+-+||..
T Consensus 160 ~rd~yl~kLK~~iE~~~~~~---G~kkVvlisHSMG~l~~lyFl~w~~~~~~~W~~k~I~sfvnig~p~l 226 (473)
T KOG2369|consen 160 ERDQYLSKLKKKIETMYKLN---GGKKVVLISHSMGGLYVLYFLKWVEAEGPAWCDKYIKSFVNIGAPWL 226 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHc---CCCceEEEecCCccHHHHHHHhcccccchhHHHHHHHHHHccCchhc
Confidence 45778899999998877665 335999999999999999999999994 344445555543
No 254
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=88.53 E-value=0.28 Score=31.10 Aligned_cols=21 Identities=33% Similarity=0.892 Sum_probs=18.1
Q ss_pred ccCCCCCCccCCcchHHHHHhh
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEI 704 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~ 704 (1042)
..|+.||+.| ..+.|.+|+++
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4799999999 68899999864
No 255
>PLN02802 triacylglycerol lipase
Probab=88.35 E-value=0.72 Score=52.79 Aligned_cols=64 Identities=19% Similarity=0.195 Sum_probs=38.3
Q ss_pred HHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH----hCCCe-EEEEEeeccccccccCChhhH
Q psy3685 826 EQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY----KYPHL-VHGAMSASGPLRAVVDFPEYF 894 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~----~yP~~-~~g~i~ssa~~~~~~~~~~~~ 894 (1042)
+|+++++..+++ .|.. ++..+++.|||.||+||+..+. ..|+. ...++..++|-....+|.+++
T Consensus 312 eqVl~eV~~Ll~----~Y~~-e~~sI~VTGHSLGGALAtLaA~dL~~~~~~~~pV~vyTFGsPRVGN~aFA~~~ 380 (509)
T PLN02802 312 ESVVGEVRRLME----KYKG-EELSITVTGHSLGAALALLVADELATCVPAAPPVAVFSFGGPRVGNRAFADRL 380 (509)
T ss_pred HHHHHHHHHHHH----hCCC-CcceEEEeccchHHHHHHHHHHHHHHhCCCCCceEEEEcCCCCcccHHHHHHH
Confidence 455555555544 3322 2247999999999999976553 44543 223566666665555555554
No 256
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=87.74 E-value=1.6 Score=39.05 Aligned_cols=21 Identities=14% Similarity=0.487 Sum_probs=12.6
Q ss_pred CcccCCCCCcccCCchhHHHH
Q psy3685 450 IRYKCDLCSKEFSRKGNLTKH 470 (1042)
Q Consensus 450 ~~~~C~~C~k~f~~~~~L~~H 470 (1042)
.|-.|++|+-.......|.+-
T Consensus 14 LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLARS 34 (112)
T ss_pred CCCcCCcCCCEEeccchHHHh
Confidence 356666666666666665543
No 257
>KOG2482|consensus
Probab=87.28 E-value=0.82 Score=48.31 Aligned_cols=143 Identities=22% Similarity=0.399 Sum_probs=83.0
Q ss_pred ccccccccccc--cccCCccccccc-CchHHHHHHhh-cc---c-------------------cccccCCCCCccCCchh
Q psy3685 332 LTEHKEIHQGI--RYKCDLCSKKFS-RKLNLTKHMKI-HQ---G-------------------IRYKCDLCSKEFSRKGN 385 (1042)
Q Consensus 332 l~~H~~~h~~~--~~~C~~C~k~f~-~~~~L~~H~~~-h~---~-------------------~~~~C~~C~k~f~~~~~ 385 (1042)
|..|++-..+. ..+|-.|...+. .++..-.|+-. |. | .++.|-.|.+.|+.+..
T Consensus 131 LeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdknt 210 (423)
T KOG2482|consen 131 LEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNT 210 (423)
T ss_pred HHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHH
Confidence 44444333333 467888987664 45556666532 32 1 14789999999999999
Q ss_pred HHHHHHhhcCC---c--------ccc--CccCccccC-cchH-----HhHhhh---------hcCce--eccCccccccC
Q psy3685 386 LTKHMKLHQGI---R--------YKC--DLCLKEFSR-KDNL-----TEHKEI---------HQGIR--YKCDLCSKKFS 435 (1042)
Q Consensus 386 L~~H~~~h~~~---~--------~~C--~~C~k~f~~-~~~L-----~~H~~~---------h~~~~--~~C~~C~~~f~ 435 (1042)
|+.||+.-... | |.= ..=||+..- .+.+ ..+... +...+ ..|-.|.....
T Consensus 211 LkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~e 290 (423)
T KOG2482|consen 211 LKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYE 290 (423)
T ss_pred HHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchh
Confidence 99999753211 1 000 001111100 0001 011100 00112 58999999888
Q ss_pred CchhHHHHHhhcCCC----------------------------cccCCCCCcccCCchhHHHHHHhh
Q psy3685 436 RKLNLTKHMKIHQGI----------------------------RYKCDLCSKEFSRKGNLTKHMKLH 474 (1042)
Q Consensus 436 ~~~~L~~H~~~H~~~----------------------------~~~C~~C~k~f~~~~~L~~H~~~H 474 (1042)
+...|..||++-+.. .-.|-.|.-.|.....|..||.-+
T Consensus 291 n~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 291 NPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred hHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 899999999865431 135888888999999999998644
No 258
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=87.18 E-value=0.36 Score=30.62 Aligned_cols=19 Identities=42% Similarity=0.843 Sum_probs=10.8
Q ss_pred cCCccccccCCchhHHHHhh
Q psy3685 587 KCDLCSKKFSRKLNLTKHMK 606 (1042)
Q Consensus 587 ~C~~C~k~F~~~~~L~~H~~ 606 (1042)
.|+.||++| ....|.+|++
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 456666666 4555555554
No 259
>KOG2482|consensus
Probab=85.90 E-value=0.65 Score=49.04 Aligned_cols=50 Identities=26% Similarity=0.507 Sum_probs=36.5
Q ss_pred cccCCCccccCCchHHHHHHHhhcCC----------------------------cccCCCCCcccccccccccccccc
Q psy3685 263 YKCDLCSKKFSRKLNLTKHMKIHQGI----------------------------RYKCDLCSKEFSRKGNLTKHMKLH 312 (1042)
Q Consensus 263 ~~C~~C~~~f~~~~~L~~H~~~h~~~----------------------------~~~C~~C~~~f~~~~~L~~H~~~h 312 (1042)
..|-.|.....+...|..||+.-+.- .-.|..|.-.|..+..|..||..+
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 46777777777777777777654320 135888999999999999998654
No 260
>COG5153 CVT17 Putative lipase essential for disintegration of autophagic bodies inside the vacuole [Intracellular trafficking and secretion / Lipid metabolism]
Probab=85.58 E-value=0.92 Score=46.95 Aligned_cols=49 Identities=16% Similarity=0.249 Sum_probs=33.9
Q ss_pred HHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 831 DLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 831 D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
+...+...+++.| +++.+++.|||.||++|+.+...|-=- +||.++|-.
T Consensus 261 a~ldI~~~v~~~Y---pda~iwlTGHSLGGa~AsLlG~~fglP---~VaFesPGd 309 (425)
T COG5153 261 AALDILGAVRRIY---PDARIWLTGHSLGGAIASLLGIRFGLP---VVAFESPGD 309 (425)
T ss_pred HHHHHHHHHHHhC---CCceEEEeccccchHHHHHhccccCCc---eEEecCchh
Confidence 3334455566667 457899999999999999998877432 344555543
No 261
>KOG4540|consensus
Probab=85.58 E-value=0.92 Score=46.95 Aligned_cols=49 Identities=16% Similarity=0.249 Sum_probs=33.9
Q ss_pred HHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 831 DLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 831 D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
+...+...+++.| +++.+++.|||.||++|+.+...|-=- +||.++|-.
T Consensus 261 a~ldI~~~v~~~Y---pda~iwlTGHSLGGa~AsLlG~~fglP---~VaFesPGd 309 (425)
T KOG4540|consen 261 AALDILGAVRRIY---PDARIWLTGHSLGGAIASLLGIRFGLP---VVAFESPGD 309 (425)
T ss_pred HHHHHHHHHHHhC---CCceEEEeccccchHHHHHhccccCCc---eEEecCchh
Confidence 3334455566667 457899999999999999998877432 344555543
No 262
>TIGR03712 acc_sec_asp2 accessory Sec system protein Asp2. This protein is designated Asp2 because, along with SecY2, SecA2, and other proteins it is part of the accessory secretory protein system. The system is involved in the export of serine-rich glycoproteins important for virulence in a number of Gram-positive species, including Streptococcus gordonii and Staphylococcus aureus. This protein family is assigned to transport rather than glycosylation function, but the specific molecular role is unknown.
Probab=84.83 E-value=7.1 Score=44.45 Aligned_cols=128 Identities=16% Similarity=0.118 Sum_probs=80.8
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCe-EEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNAL-AVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~-~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
.|..||-.|.-++.++-. .-+-+.+||. +++-|.|-=|.+. ||-.+..-+-+...|+......
T Consensus 289 PPL~VYFSGyR~aEGFEg-----y~MMk~Lg~PfLL~~DpRleGGaF-----------YlGs~eyE~~I~~~I~~~L~~L 352 (511)
T TIGR03712 289 PPLNVYFSGYRPAEGFEG-----YFMMKRLGAPFLLIGDPRLEGGAF-----------YLGSDEYEQGIINVIQEKLDYL 352 (511)
T ss_pred CCeEEeeccCcccCcchh-----HHHHHhcCCCeEEeecccccccee-----------eeCcHHHHHHHHHHHHHHHHHh
Confidence 489999988766544222 1345566664 5666888777763 4545555556666777666665
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHH-hCCCeEE--------EEEeeccccccccCChhhHHHHHHHhhccChhhH
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRY-KYPHLVH--------GAMSASGPLRAVVDFPEYFGVVADALATVSTECV 909 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~-~yP~~~~--------g~i~ssa~~~~~~~~~~~~~~v~~~~~~~~~~c~ 909 (1042)
++.. .-+||-|-|||..=|++|.. ..|+-|. |.||+-+-+..-.+|+.-++++.......+..-.
T Consensus 353 gF~~-~qLILSGlSMGTfgAlYYga~l~P~AIiVgKPL~NLGtiA~n~rL~RP~~F~TslDvl~~~~g~~s~~~i 426 (511)
T TIGR03712 353 GFDH-DQLILSGLSMGTFGALYYGAKLSPHAIIVGKPLVNLGTIASRMRLDRPDEFGTALDILLLNTGGTSSEDV 426 (511)
T ss_pred CCCH-HHeeeccccccchhhhhhcccCCCceEEEcCcccchhhhhccccccCCCCCchHHHhHHhhcCCCCHHHH
Confidence 6543 47999999999999999884 5677543 3344444444445666666666655444444333
No 263
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=84.81 E-value=0.49 Score=32.77 Aligned_cols=23 Identities=22% Similarity=0.544 Sum_probs=21.1
Q ss_pred cccCCCCCCccCCcchHHHHHhh
Q psy3685 682 RYKCDLCSKDFSRKDNLTEHKEI 704 (1042)
Q Consensus 682 ~~~C~~C~k~F~~~~~L~~H~~~ 704 (1042)
+|.|+.|++.|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 68999999999999999999864
No 264
>PF06028 DUF915: Alpha/beta hydrolase of unknown function (DUF915); InterPro: IPR010315 This family consists of bacterial proteins of unknown function, which are hydrolase-like.; PDB: 3LP5_A 3FLE_A 3DS8_A.
Probab=84.66 E-value=2.5 Score=44.87 Aligned_cols=61 Identities=15% Similarity=0.261 Sum_probs=43.8
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC-----eEEEEEeecccccccc
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH-----LVHGAMSASGPLRAVV 888 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~-----~~~g~i~ssa~~~~~~ 888 (1042)
..+-..=+..++.+|+..|.. ..+=++||||||+.++.|...|-. .+.-.|+-++|.....
T Consensus 82 ~~~qa~wl~~vl~~L~~~Y~~---~~~N~VGHSmGg~~~~~yl~~~~~~~~~P~l~K~V~Ia~pfng~~ 147 (255)
T PF06028_consen 82 YKKQAKWLKKVLKYLKKKYHF---KKFNLVGHSMGGLSWTYYLENYGNDKNLPKLNKLVTIAGPFNGIL 147 (255)
T ss_dssp HHHHHHHHHHHHHHHHHCC-----SEEEEEEETHHHHHHHHHHHHCTTGTTS-EEEEEEEES--TTTTT
T ss_pred HHHHHHHHHHHHHHHHHhcCC---CEEeEEEECccHHHHHHHHHHhccCCCCcccceEEEeccccCccc
Confidence 444456677778888888865 478999999999999999888643 3677777788887654
No 265
>PF11339 DUF3141: Protein of unknown function (DUF3141); InterPro: IPR024501 This family of proteins appears to be predominantly expressed in Proteobacteria. Their function is unknown.
Probab=84.23 E-value=9 Score=43.99 Aligned_cols=106 Identities=20% Similarity=0.206 Sum_probs=70.5
Q ss_pred CcEEEEE--CCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 765 GPIFLMI--GGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 765 ~pi~~~~--gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
.|+|++. .|-|+.-+.|... --...|-.-|--|+++- |+ ..|.++. |++..+.-.+.|++.+...
T Consensus 69 rP~vViDPRAGHGpGIGGFK~d-SevG~AL~~GHPvYFV~--F~--p~P~pgQ--------Tl~DV~~ae~~Fv~~V~~~ 135 (581)
T PF11339_consen 69 RPFVVIDPRAGHGPGIGGFKPD-SEVGVALRAGHPVYFVG--FF--PEPEPGQ--------TLEDVMRAEAAFVEEVAER 135 (581)
T ss_pred CCeEEeCCCCCCCCCccCCCcc-cHHHHHHHcCCCeEEEE--ec--CCCCCCC--------cHHHHHHHHHHHHHHHHHh
Confidence 5777776 5555544444431 22234445476777664 22 2333321 4444555566788887776
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
... ..|.+++|--=||..++.++..+|+++.-.|..+||+.
T Consensus 136 hp~--~~kp~liGnCQgGWa~~mlAA~~Pd~~gplvlaGaPls 176 (581)
T PF11339_consen 136 HPD--APKPNLIGNCQGGWAAMMLAALRPDLVGPLVLAGAPLS 176 (581)
T ss_pred CCC--CCCceEEeccHHHHHHHHHHhcCcCccCceeecCCCcc
Confidence 532 23899999999999999999999999999999999994
No 266
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=83.55 E-value=0.6 Score=32.34 Aligned_cols=23 Identities=26% Similarity=0.735 Sum_probs=18.1
Q ss_pred cccCCccccccCCchhHHHHhhh
Q psy3685 585 RYKCDLCSKKFSRKLNLTKHMKI 607 (1042)
Q Consensus 585 ~~~C~~C~k~F~~~~~L~~H~~~ 607 (1042)
+|.|++|++.|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 57788888888888888888753
No 267
>PF07519 Tannase: Tannase and feruloyl esterase; InterPro: IPR011118 This family includes fungal tannase [] and feruloyl esterase [, ]. It also includes several bacterial homologues of unknown function.
Probab=83.53 E-value=1.3 Score=51.98 Aligned_cols=111 Identities=17% Similarity=0.168 Sum_probs=64.1
Q ss_pred EEEECCCCCCcccccccc--cHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHH--------HHHH
Q psy3685 768 FLMIGGEGEASAKWMVEG--TWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLA--------YFIT 837 (1042)
Q Consensus 768 ~~~~gg~g~~~~~~~~~~--~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~--------~~~~ 837 (1042)
|+.+||-|-. ....... .....+-..|++++.-|- ||..+... ....+..-.+++.|.+ .+.+
T Consensus 31 ~~~~GgGG~~-G~i~~~~~~~~~~~~~~~G~A~~~TD~-Gh~~~~~~-----~~~~~~~n~~~~~dfa~ra~h~~~~~aK 103 (474)
T PF07519_consen 31 FLQVGGGGFA-GGINYADGKASMATALARGYATASTDS-GHQGSAGS-----DDASFGNNPEALLDFAYRALHETTVVAK 103 (474)
T ss_pred eEEECCCeee-CcccccccccccchhhhcCeEEEEecC-CCCCCccc-----ccccccCCHHHHHHHHhhHHHHHHHHHH
Confidence 7778774432 2212111 112334445788888773 33222110 0111223444555554 3444
Q ss_pred HHhh-hcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeecccccc
Q psy3685 838 SMNT-LYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRA 886 (1042)
Q Consensus 838 ~~~~-~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~ 886 (1042)
.+-+ -|+.+. ..-...|+|-||--+...+++||+.++|+||.++.++.
T Consensus 104 ~l~~~~Yg~~p-~~sY~~GcS~GGRqgl~~AQryP~dfDGIlAgaPA~~~ 152 (474)
T PF07519_consen 104 ALIEAFYGKAP-KYSYFSGCSTGGRQGLMAAQRYPEDFDGILAGAPAINW 152 (474)
T ss_pred HHHHHHhCCCC-CceEEEEeCCCcchHHHHHHhChhhcCeEEeCCchHHH
Confidence 4433 344433 35678999999999999999999999999999877654
No 268
>KOG1551|consensus
Probab=83.07 E-value=2.5 Score=43.52 Aligned_cols=130 Identities=15% Similarity=0.168 Sum_probs=75.1
Q ss_pred cchhhhh-cccCC---CCCcccceEEEEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecc
Q psy3685 732 DQWFEQK-LDHFN---IEEPRTWKQRYFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYG 807 (1042)
Q Consensus 732 ~~~f~q~-~dhf~---~~~~~tf~qry~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G 807 (1042)
|+.|..| .|-|. +.+.+|=.-|+++- +.-+|+-|.+.|.|+...+-.+ -+...+.++ +-.-+.+|-.|||
T Consensus 80 EG~fasp~a~~~p~~mP~~~~~A~~~~liP----QK~~~KOG~~a~tgdh~y~rr~-~L~~p~~k~-~i~tmvle~pfYg 153 (371)
T KOG1551|consen 80 EGVFASPAASNWPKPMPPESRTARVAWLIP----QKMADLCLSWALTGDHVYTRRL-VLSKPINKR-EIATMVLEKPFYG 153 (371)
T ss_pred hhhhcCchhhhCccCCCCcccceeeeeecc----cCcCCeeEEEeecCCceeEeee-eecCchhhh-cchheeeeccccc
Confidence 5666555 34443 23445555555554 3346888888888776543221 122234444 5566889999999
Q ss_pred cccCCCCCCcccccCCCHHHHHhHH----HHHHHHHhhhcC---CCCCCcEEEEccchhHHHHHHHHHhCCCeE
Q psy3685 808 KSHPLSDLSVESLQYLSAEQALADL----AYFITSMNTLYS---LPAHTKWIAFGGSYPGALAAWLRYKYPHLV 874 (1042)
Q Consensus 808 ~S~p~~~~~~~~~~~lt~~q~l~D~----~~~~~~~~~~~~---~~~~~~~i~~G~Sygg~la~~~~~~yP~~~ 874 (1042)
.-.|... ....-.-+.|+ ++.|++....+. ..+-.+.-+.|-|+||.+|.......|.-|
T Consensus 154 qr~p~~q-------~~~~Le~vtDlf~mG~A~I~E~~~lf~Ws~~~g~g~~~~~g~Smgg~~a~~vgS~~q~Pv 220 (371)
T KOG1551|consen 154 QRVPEEQ-------IIHMLEYVTDLFKMGRATIQEFVKLFTWSSADGLGNLNLVGRSMGGDIANQVGSLHQKPV 220 (371)
T ss_pred ccCCHHH-------HHHHHHHHHHHHHhhHHHHHHHHHhcccccccCcccceeeeeecccHHHHhhcccCCCCc
Confidence 9998642 11111122332 233444444432 112247899999999999999888666654
No 269
>PLN02517 phosphatidylcholine-sterol O-acyltransferase
Probab=82.48 E-value=2.3 Score=49.73 Aligned_cols=57 Identities=18% Similarity=0.106 Sum_probs=40.0
Q ss_pred HHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCC---------------CeEEEEEeeccccc
Q psy3685 826 EQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYP---------------HLVHGAMSASGPLR 885 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP---------------~~~~g~i~ssa~~~ 885 (1042)
++...++...|+.+.... ++.||||+||||||.++.+|...-+ ..|.+.|+.|+|..
T Consensus 193 d~YF~rLK~lIE~ay~~n---ggkKVVLV~HSMGglv~lyFL~wv~~~~~~gG~gG~~W~dKyI~s~I~Iagp~l 264 (642)
T PLN02517 193 DQTLSRLKSNIELMVATN---GGKKVVVVPHSMGVLYFLHFMKWVEAPAPMGGGGGPGWCAKHIKAVMNIGGPFL 264 (642)
T ss_pred hHHHHHHHHHHHHHHHHc---CCCeEEEEEeCCchHHHHHHHHhccccccccCCcchHHHHHHHHHheecccccC
Confidence 566677778887665442 3469999999999999998765432 23556677776664
No 270
>COG3545 Predicted esterase of the alpha/beta hydrolase fold [General function prediction only]
Probab=82.30 E-value=2.2 Score=41.53 Aligned_cols=55 Identities=16% Similarity=0.089 Sum_probs=37.9
Q ss_pred HHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccccc
Q psy3685 832 LAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLRAV 887 (1042)
Q Consensus 832 ~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~~~ 887 (1042)
++..|..+.+..+..+ .|+|+++||.|+.+++-+...--.-|.|++..++|-...
T Consensus 43 ~~dWi~~l~~~v~a~~-~~~vlVAHSLGc~~v~h~~~~~~~~V~GalLVAppd~~~ 97 (181)
T COG3545 43 LDDWIARLEKEVNAAE-GPVVLVAHSLGCATVAHWAEHIQRQVAGALLVAPPDVSR 97 (181)
T ss_pred HHHHHHHHHHHHhccC-CCeEEEEecccHHHHHHHHHhhhhccceEEEecCCCccc
Confidence 3444555544443332 479999999999888877755555899999888766433
No 271
>KOG4388|consensus
Probab=82.03 E-value=6.8 Score=45.03 Aligned_cols=97 Identities=13% Similarity=0.091 Sum_probs=54.5
Q ss_pred cEEEEECCCCCCcccc-cccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh--
Q psy3685 766 PIFLMIGGEGEASAKW-MVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL-- 842 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~-~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~-- 842 (1042)
.+||-.+|-|-+.... ....+..+||.++|+-||.+|.---- --||+- |++.+-.-.-++-..
T Consensus 397 sli~HcHGGGfVAqsSkSHE~YLr~Wa~aL~cPiiSVdYSLAP-EaPFPR-------------aleEv~fAYcW~inn~a 462 (880)
T KOG4388|consen 397 SLIVHCHGGGFVAQSSKSHEPYLRSWAQALGCPIISVDYSLAP-EAPFPR-------------ALEEVFFAYCWAINNCA 462 (880)
T ss_pred eEEEEecCCceeeeccccccHHHHHHHHHhCCCeEEeeeccCC-CCCCCc-------------HHHHHHHHHHHHhcCHH
Confidence 3777776655442221 23345779999999999999953322 234442 333322111122111
Q ss_pred cCCCCCCcEEEEccchhHHHHHHH-------HHhCCCeEEE
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWL-------RYKYPHLVHG 876 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~-------~~~yP~~~~g 876 (1042)
..+....++++.|.|.||+|..-. -.+-||-+..
T Consensus 463 llG~TgEriv~aGDSAGgNL~~~VaLr~i~~gvRvPDGl~l 503 (880)
T KOG4388|consen 463 LLGSTGERIVLAGDSAGGNLCFTVALRAIAYGVRVPDGLML 503 (880)
T ss_pred HhCcccceEEEeccCCCcceeehhHHHHHHhCCCCCCceEE
Confidence 112234689999999999875332 2456765543
No 272
>PLN02847 triacylglycerol lipase
Probab=81.66 E-value=2.7 Score=49.12 Aligned_cols=22 Identities=36% Similarity=0.371 Sum_probs=18.6
Q ss_pred CCCcEEEEccchhHHHHHHHHH
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRY 868 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~ 868 (1042)
++.++++.|||+||++|+.+..
T Consensus 249 PdYkLVITGHSLGGGVAALLAi 270 (633)
T PLN02847 249 PDFKIKIVGHSLGGGTAALLTY 270 (633)
T ss_pred CCCeEEEeccChHHHHHHHHHH
Confidence 4468999999999999988664
No 273
>KOG2112|consensus
Probab=81.39 E-value=3.1 Score=41.78 Aligned_cols=59 Identities=25% Similarity=0.284 Sum_probs=43.3
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
..++.+-++.+|.+--.. +. +.+.+++-|.|+||++|.++...||..+.|.++.|+-..
T Consensus 71 ~~~aa~~i~~Li~~e~~~-Gi-~~~rI~igGfs~G~a~aL~~~~~~~~~l~G~~~~s~~~p 129 (206)
T KOG2112|consen 71 LHRAADNIANLIDNEPAN-GI-PSNRIGIGGFSQGGALALYSALTYPKALGGIFALSGFLP 129 (206)
T ss_pred HHHHHHHHHHHHHHHHHc-CC-CccceeEcccCchHHHHHHHHhccccccceeeccccccc
Confidence 344555555555543332 22 224799999999999999999999999999998887665
No 274
>KOG2893|consensus
Probab=81.27 E-value=0.61 Score=46.42 Aligned_cols=45 Identities=33% Similarity=0.801 Sum_probs=30.0
Q ss_pred cCCCCCCCCChhHHHHHHHhcCCCccccCCCCcccccccchhhh-hccc
Q psy3685 146 CDLCSKDFSYKSSLIEHKKLHQGIRYKCNLCSKEYFFKRNLTEH-KKLH 193 (1042)
Q Consensus 146 C~~C~~~F~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H-~~~h 193 (1042)
|=+|++.|.....|+.|++..+ |+|.+|.|...+--.|..| |++|
T Consensus 13 cwycnrefddekiliqhqkakh---fkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAKH---FKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhhcc---ceeeeehhhhccCCCceeehhhhh
Confidence 6677777777777777766533 7777777776666666666 3444
No 275
>KOG1516|consensus
Probab=80.11 E-value=2.6 Score=50.93 Aligned_cols=115 Identities=21% Similarity=0.122 Sum_probs=63.3
Q ss_pred CcEEEEECCCCCCccccc--ccccHHHHHHHcCCeEEEEeeee--ccc-ccCCCCCCcccccCCCHHHHHhHHHHHHHHH
Q psy3685 765 GPIFLMIGGEGEASAKWM--VEGTWLDYAHNHNALAVQVEHRF--YGK-SHPLSDLSVESLQYLSAEQALADLAYFITSM 839 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~--~~~~~~~la~~~~a~~~~~ehR~--~G~-S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~ 839 (1042)
-||+||++|.|-..+... .......++...+..||.+..|= .|- |.. +.....|+......+||.=+..-|..
T Consensus 112 ~pV~V~iHGG~~~~gs~~~~~~~~~~~~~~~~~VVvVt~~YRLG~lGF~st~-d~~~~gN~gl~Dq~~AL~wv~~~I~~- 189 (545)
T KOG1516|consen 112 LPVMVYIHGGGFQFGSASSFEIISPAYVLLLKDVVVVTINYRLGPLGFLSTG-DSAAPGNLGLFDQLLALRWVKDNIPS- 189 (545)
T ss_pred CCEEEEEeCCceeeccccchhhcCchhccccCCEEEEEecccceeceeeecC-CCCCCCcccHHHHHHHHHHHHHHHHh-
Confidence 599999966543222110 11112223333356777777772 120 111 11224666655444444333333332
Q ss_pred hhhcCCCCCCcEEEEccchhHHHHHHHHH--hCCCeEEEEEeeccccc
Q psy3685 840 NTLYSLPAHTKWIAFGGSYPGALAAWLRY--KYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 840 ~~~~~~~~~~~~i~~G~Sygg~la~~~~~--~yP~~~~g~i~ssa~~~ 885 (1042)
+++++ .+|.++|+|.||+++..+.. .--.++..+|..|+...
T Consensus 190 ---FGGdp-~~vTl~G~saGa~~v~~l~~Sp~s~~LF~~aI~~SG~~~ 233 (545)
T KOG1516|consen 190 ---FGGDP-KNVTLFGHSAGAASVSLLTLSPHSRGLFHKAISMSGNAL 233 (545)
T ss_pred ---cCCCC-CeEEEEeechhHHHHHHHhcCHhhHHHHHHHHhhccccc
Confidence 34433 47999999999999876553 22377888888877764
No 276
>KOG3975|consensus
Probab=79.38 E-value=15 Score=37.91 Aligned_cols=109 Identities=16% Similarity=0.109 Sum_probs=51.0
Q ss_pred cceEEEEEecccccCCCcEEEEE-CCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCC----CCCcccccCC
Q psy3685 749 TWKQRYFVNSTWYKHGGPIFLMI-GGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLS----DLSVESLQYL 823 (1042)
Q Consensus 749 tf~qry~~~~~~~~~~~pi~~~~-gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~----~~~~~~~~~l 823 (1042)
-|.-.+|+.... ...|+|+++ |++|.. ++ ++.+...|-..+++.+-+|+--.-|--.+.. +-+..|-.-.
T Consensus 15 i~~~~~~v~~~~--~~~~li~~IpGNPG~~-gF--Y~~F~~~L~~~l~~r~~~wtIsh~~H~~~P~sl~~~~s~~~~eif 89 (301)
T KOG3975|consen 15 ILTLKPWVTKSG--EDKPLIVWIPGNPGLL-GF--YTEFARHLHLNLIDRLPVWTISHAGHALMPASLREDHSHTNEEIF 89 (301)
T ss_pred ceeeeeeeccCC--CCceEEEEecCCCCch-hH--HHHHHHHHHHhcccccceeEEeccccccCCccccccccccccccc
Confidence 355556665432 457888888 666532 21 2233334444444322244444444333321 1111111122
Q ss_pred CHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHH
Q psy3685 824 SAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLR 867 (1042)
Q Consensus 824 t~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~ 867 (1042)
+.+.-++--.+||+. + .+.+.+++++|||-|+-+..-+-
T Consensus 90 sL~~QV~HKlaFik~----~-~Pk~~ki~iiGHSiGaYm~Lqil 128 (301)
T KOG3975|consen 90 SLQDQVDHKLAFIKE----Y-VPKDRKIYIIGHSIGAYMVLQIL 128 (301)
T ss_pred chhhHHHHHHHHHHH----h-CCCCCEEEEEecchhHHHHHHHh
Confidence 222222333344432 2 24567999999998876554443
No 277
>KOG2893|consensus
Probab=77.75 E-value=0.82 Score=45.53 Aligned_cols=41 Identities=32% Similarity=0.711 Sum_probs=18.5
Q ss_pred cCccCccccCcchHHhHhhhhcCceeccCccccccCCchhHHHH
Q psy3685 400 CDLCLKEFSRKDNLTEHKEIHQGIRYKCDLCSKKFSRKLNLTKH 443 (1042)
Q Consensus 400 C~~C~k~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H 443 (1042)
|-+|++.|.....|..|++.. -|+|.+|-|+..+--.|..|
T Consensus 13 cwycnrefddekiliqhqkak---hfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKAK---HFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeecccccchhhhhhhhhhhc---cceeeeehhhhccCCCceee
Confidence 444444444444444444322 24455555444444444444
No 278
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=77.35 E-value=1.5 Score=30.02 Aligned_cols=11 Identities=27% Similarity=0.951 Sum_probs=6.0
Q ss_pred cccCCCccccc
Q psy3685 9 YKCDLCSKEFS 19 (1042)
Q Consensus 9 ~~C~~C~~~f~ 19 (1042)
|+|..||..+.
T Consensus 2 ~~C~~CGy~y~ 12 (33)
T cd00350 2 YVCPVCGYIYD 12 (33)
T ss_pred EECCCCCCEEC
Confidence 45566655544
No 279
>PF03959 FSH1: Serine hydrolase (FSH1); InterPro: IPR005645 This entry represents proteins belonging to the AB hydrolase family. It consists of serine hydrolases of unknown specificity [, ] and includes uncharacterised proteins.; PDB: 1YCD_A.
Probab=77.06 E-value=5.2 Score=41.44 Aligned_cols=112 Identities=19% Similarity=0.099 Sum_probs=47.9
Q ss_pred CcEEEEECCCCCCcccccc--cccHHHHHHHcCCeEEEEeeeecccccC-CCCC----------Ccc----------ccc
Q psy3685 765 GPIFLMIGGEGEASAKWMV--EGTWLDYAHNHNALAVQVEHRFYGKSHP-LSDL----------SVE----------SLQ 821 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~--~~~~~~la~~~~a~~~~~ehR~~G~S~p-~~~~----------~~~----------~~~ 821 (1042)
.|=||.++|.|.....+.. ..+...|.+ .++.++++|=++-=...| ..+. ... ...
T Consensus 4 k~riLcLHG~~~na~if~~q~~~l~~~l~~-~~~ef~f~dgP~~~~~~~~~~~~~~~~~~~~~~~~~~~~W~~~~~~~~~ 82 (212)
T PF03959_consen 4 KPRILCLHGYGQNAEIFRQQTSALRKALKK-LDFEFVFVDGPHEVPPGPGIEPFSSEAESAFGDPGPFYSWWDPDDDDHE 82 (212)
T ss_dssp --EEEEE--TT--HHHHHHHTHHHHHHHHH-TT-EEEEE--SEE---GGG-SS---HHHHHHHHTT--EESS---S-SGG
T ss_pred CceEEEeCCCCcCHHHHHHHHHHHHHHHhh-CcEEEEEecCCcccCCcccccccccccccccCCCCcceeeeecCCCccc
Confidence 4679999998876554432 223333443 256788888554321100 0110 001 112
Q ss_pred CCCHHHHHhHHHHHHHHHhhhcCCCCCCcE-EEEccchhHHHHHHHHHhC--------CCeEEEEEeecccccc
Q psy3685 822 YLSAEQALADLAYFITSMNTLYSLPAHTKW-IAFGGSYPGALAAWLRYKY--------PHLVHGAMSASGPLRA 886 (1042)
Q Consensus 822 ~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~-i~~G~Sygg~la~~~~~~y--------P~~~~g~i~ssa~~~~ 886 (1042)
+..++++|+.+..+++. . .|+ -++|.|-||++|+.+.... +-.+.-+|..|+....
T Consensus 83 ~~~~~~sl~~l~~~i~~---~------GPfdGvlGFSQGA~lAa~ll~~~~~~~~~~~~~~~kf~V~~sg~~p~ 147 (212)
T PF03959_consen 83 YEGLDESLDYLRDYIEE---N------GPFDGVLGFSQGAALAALLLALQQRGRPDGAHPPFKFAVFISGFPPP 147 (212)
T ss_dssp G---HHHHHHHHHHHHH---H---------SEEEEETHHHHHHHHHHHHHHHHST--T----SEEEEES----E
T ss_pred ccCHHHHHHHHHHHHHh---c------CCeEEEEeecHHHHHHHHHHHHHHhhcccccCCCceEEEEEcccCCC
Confidence 44456666666665553 1 254 6899999999999887432 1234556655555543
No 280
>KOG4569|consensus
Probab=76.36 E-value=3.4 Score=46.09 Aligned_cols=58 Identities=22% Similarity=0.157 Sum_probs=34.4
Q ss_pred HHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHH----HhCCC--eEEEEEeeccccccccCC
Q psy3685 826 EQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLR----YKYPH--LVHGAMSASGPLRAVVDF 890 (1042)
Q Consensus 826 ~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~----~~yP~--~~~g~i~ssa~~~~~~~~ 890 (1042)
++..+|+..++. .+ ++..+++.|||.||+||..++ ...+. .-.+.+.-++|--...+|
T Consensus 155 ~~~~~~~~~L~~----~~---~~~~i~vTGHSLGgAlA~laa~~i~~~~~~~~~~v~v~tFG~PRvGn~~f 218 (336)
T KOG4569|consen 155 SGLDAELRRLIE----LY---PNYSIWVTGHSLGGALASLAALDLVKNGLKTSSPVKVYTFGQPRVGNLAF 218 (336)
T ss_pred HHHHHHHHHHHH----hc---CCcEEEEecCChHHHHHHHHHHHHHHcCCCCCCceEEEEecCCCcccHHH
Confidence 344455444444 33 245899999999999997665 33432 344566666554443333
No 281
>COG4814 Uncharacterized protein with an alpha/beta hydrolase fold [General function prediction only]
Probab=71.70 E-value=4.9 Score=41.56 Aligned_cols=39 Identities=23% Similarity=0.338 Sum_probs=29.6
Q ss_pred HHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCCe
Q psy3685 832 LAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPHL 873 (1042)
Q Consensus 832 ~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~ 873 (1042)
+...+.+|...|+.+ ++=++|||+||.-.+.|...|-+.
T Consensus 122 lk~~msyL~~~Y~i~---k~n~VGhSmGg~~~~~Y~~~yg~d 160 (288)
T COG4814 122 LKKAMSYLQKHYNIP---KFNAVGHSMGGLGLTYYMIDYGDD 160 (288)
T ss_pred HHHHHHHHHHhcCCc---eeeeeeeccccHHHHHHHHHhcCC
Confidence 444566778888653 567899999999999888887654
No 282
>PF12048 DUF3530: Protein of unknown function (DUF3530); InterPro: IPR022529 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 272 to 336 amino acids in length. These proteins are distantly related to alpa/beta hydrolases so they may act as enzymes.
Probab=71.40 E-value=52 Score=36.27 Aligned_cols=49 Identities=12% Similarity=0.028 Sum_probs=33.4
Q ss_pred cEEEEccchhHHHHHHHHHhCCC-eEEEEEeeccccccccCChhhHHHHH
Q psy3685 850 KWIAFGGSYPGALAAWLRYKYPH-LVHGAMSASGPLRAVVDFPEYFGVVA 898 (1042)
Q Consensus 850 ~~i~~G~Sygg~la~~~~~~yP~-~~~g~i~ssa~~~~~~~~~~~~~~v~ 898 (1042)
.+|++|++.|+.+++-|....|. .+++.|..++-...........+.+.
T Consensus 194 ~ivlIg~G~gA~~~~~~la~~~~~~~daLV~I~a~~p~~~~n~~l~~~la 243 (310)
T PF12048_consen 194 NIVLIGHGTGAGWAARYLAEKPPPMPDALVLINAYWPQPDRNPALAEQLA 243 (310)
T ss_pred eEEEEEeChhHHHHHHHHhcCCCcccCeEEEEeCCCCcchhhhhHHHHhh
Confidence 49999999999888887776665 47888877766544433333334433
No 283
>KOG2186|consensus
Probab=71.38 E-value=1.9 Score=43.90 Aligned_cols=46 Identities=24% Similarity=0.621 Sum_probs=29.4
Q ss_pred cccCCCccccccccchhhhhhhccCcccccCCCccccccccchhhhhh
Q psy3685 9 YKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSKEFSRKYNLTDHKK 56 (1042)
Q Consensus 9 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k~f~~~~~L~~H~~ 56 (1042)
|.|+.||.+.. +..|.+|+-..++.-|.|-.|++.|.. .+...|..
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccCCeeEEeeccccccc-chhhhhhh
Confidence 66777777766 345555776666666677777777765 55555554
No 284
>PF02089 Palm_thioest: Palmitoyl protein thioesterase; InterPro: IPR002472 Neuronal ceroid lipofuscinoses (NCL) represent a group of encephalopathies that occur in 1 in 12,500 children. Mutations in the palmitoyl protein thioesterase gene causing infantile neuronal ceroid lipofuscinosis []. The most common mutation results in intracellular accumulation of the polypeptide and undetectable enzyme activity in the brain. Direct sequencing of cDNAs derived from brain RNA of INCL patients has shown a mis-sense transversion of A to T at nucleotide position 364, which results in substitution of Trp for Arg at position 122 in the protein - Arg 122 is immediately adjacent to a lipase consensus sequence that contains the putative active site Ser of PPT. The occurrence of this and two other independent mutations in the PPT gene strongly suggests that defects in this gene cause INCL.; GO: 0008474 palmitoyl-(protein) hydrolase activity, 0006464 protein modification process; PDB: 3GRO_B 1PJA_A 1EXW_A 1EH5_A 1EI9_A.
Probab=70.73 E-value=13 Score=39.85 Aligned_cols=67 Identities=18% Similarity=0.196 Sum_probs=44.9
Q ss_pred CCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhCCC-eEEEEEeeccccccccCChhh
Q psy3685 822 YLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYPH-LVHGAMSASGPLRAVVDFPEY 893 (1042)
Q Consensus 822 ~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP~-~~~g~i~ssa~~~~~~~~~~~ 893 (1042)
+..+..-++.+.+.++...+.- .=+-++|.|=||.+..-+.+++|+ -|.-.|+-++|...+..++..
T Consensus 58 f~~v~~Qv~~vc~~l~~~p~L~-----~G~~~IGfSQGgl~lRa~vq~c~~~~V~nlISlggph~Gv~g~p~c 125 (279)
T PF02089_consen 58 FGNVNDQVEQVCEQLANDPELA-----NGFNAIGFSQGGLFLRAYVQRCNDPPVHNLISLGGPHMGVFGLPFC 125 (279)
T ss_dssp HSHHHHHHHHHHHHHHH-GGGT-----T-EEEEEETCHHHHHHHHHHH-TSS-EEEEEEES--TT-BSS-TCH
T ss_pred HHHHHHHHHHHHHHHhhChhhh-----cceeeeeeccccHHHHHHHHHCCCCCceeEEEecCcccccccCCcc
Confidence 3456666666666665543321 148999999999999999999986 599999999999888665543
No 285
>PLN02633 palmitoyl protein thioesterase family protein
Probab=67.13 E-value=36 Score=36.88 Aligned_cols=112 Identities=19% Similarity=0.207 Sum_probs=67.3
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHH-cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhc
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHN-HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLY 843 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~-~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~ 843 (1042)
-| ||+-+|-|..-... .-+-+.+|+.+ -|.-+..+|- |.+ ..+=-+..+.+=++.+.+-+......
T Consensus 26 ~P-~ViwHG~GD~c~~~-g~~~~~~l~~~~~g~~~~~i~i---g~~-------~~~s~~~~~~~Qve~vce~l~~~~~l- 92 (314)
T PLN02633 26 VP-FIMLHGIGTQCSDA-TNANFTQLLTNLSGSPGFCLEI---GNG-------VGDSWLMPLTQQAEIACEKVKQMKEL- 92 (314)
T ss_pred CC-eEEecCCCcccCCc-hHHHHHHHHHhCCCCceEEEEE---CCC-------ccccceeCHHHHHHHHHHHHhhchhh-
Confidence 35 44456776543221 11224455554 3555555553 444 11111224444445555555543322
Q ss_pred CCCCCCcEEEEccchhHHHHHHHHHhCCC--eEEEEEeeccccccccCChhh
Q psy3685 844 SLPAHTKWIAFGGSYPGALAAWLRYKYPH--LVHGAMSASGPLRAVVDFPEY 893 (1042)
Q Consensus 844 ~~~~~~~~i~~G~Sygg~la~~~~~~yP~--~~~g~i~ssa~~~~~~~~~~~ 893 (1042)
.+ -+-++|.|=||.++.-+..+.|+ -|.-.|+-++|-..+..++.+
T Consensus 93 --~~--G~naIGfSQGGlflRa~ierc~~~p~V~nlISlggph~Gv~g~p~C 140 (314)
T PLN02633 93 --SQ--GYNIVGRSQGNLVARGLIEFCDGGPPVYNYISLAGPHAGISSLPRC 140 (314)
T ss_pred --hC--cEEEEEEccchHHHHHHHHHCCCCCCcceEEEecCCCCCeeCCCCC
Confidence 11 38999999999999999999998 499999999999887665554
No 286
>PLN02748 tRNA dimethylallyltransferase
Probab=65.51 E-value=20 Score=41.57 Aligned_cols=85 Identities=12% Similarity=0.169 Sum_probs=54.6
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEe--eeecccccCCCCCCccc--------c------cCCCHH
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVE--HRFYGKSHPLSDLSVES--------L------QYLSAE 826 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~e--hR~~G~S~p~~~~~~~~--------~------~~lt~~ 826 (1042)
+.++||+++|-.|+. -+.+...||..+++-||..| +-|-|...-+...+.+. + ...|+.
T Consensus 20 ~~~~~i~i~GptgsG-----Ks~la~~la~~~~~eii~~DsmQVYrgLdIgTaKpt~eE~~~VpHHLid~v~p~e~ysv~ 94 (468)
T PLN02748 20 GKAKVVVVMGPTGSG-----KSKLAVDLASHFPVEIINADSMQVYSGLDVLTNKVPLHEQKGVPHHLLGVISPSVEFTAK 94 (468)
T ss_pred CCCCEEEEECCCCCC-----HHHHHHHHHHhcCeeEEcCchheeeCCcchhcCCCCHHHHcCCCCeeEeecCCCCcCcHH
Confidence 456688887766654 33567799999999999999 56777664443333221 1 234666
Q ss_pred HHHhHHHHHHHHHhhhcCCCCCCcEEEEccc
Q psy3685 827 QALADLAYFITSMNTLYSLPAHTKWIAFGGS 857 (1042)
Q Consensus 827 q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~S 857 (1042)
+...|....|..+...- ..-|++|||
T Consensus 95 ~F~~~A~~~I~~I~~rg-----k~PIlVGGT 120 (468)
T PLN02748 95 DFRDHAVPLIEEILSRN-----GLPVIVGGT 120 (468)
T ss_pred HHHHHHHHHHHHHHhcC-----CCeEEEcCh
Confidence 66677777776665432 235788887
No 287
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=64.87 E-value=2.3 Score=32.07 Aligned_cols=31 Identities=19% Similarity=0.428 Sum_probs=26.2
Q ss_pred ccCCccccCCCCCCccCCcchHHHHHhhccC
Q psy3685 677 LHQDIRYKCDLCSKDFSRKDNLTEHKEIHQG 707 (1042)
Q Consensus 677 h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g 707 (1042)
-.||.-++|+.||+.|+...+..+|+..-+|
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 4577789999999999999999999865444
No 288
>KOG4173|consensus
Probab=64.17 E-value=2.6 Score=41.32 Aligned_cols=49 Identities=20% Similarity=0.565 Sum_probs=33.8
Q ss_pred ccccCC--CCcccCCchhHHHHHHHhcCCccccCccCcccCCchhHHHHHHHh
Q psy3685 504 RYKCDL--CSKEFSSKGNLTEHMKQHQGIRYKCDLCSKEFSRKDNLTEHKEIH 554 (1042)
Q Consensus 504 ~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~C~~C~k~f~~~~~L~~H~~~H 554 (1042)
.|.|++ |...|.+......|...-++ -.|.+|.+.|++..-|..|+..-
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~--~sCs~C~r~~Pt~hLLd~HI~E~ 129 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHG--NSCSFCKRAFPTGHLLDAHILEW 129 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhccc--chhHHHHHhCCchhhhhHHHHHH
Confidence 467765 77777777777777644333 36888888888888888887543
No 289
>KOG4173|consensus
Probab=61.67 E-value=3.1 Score=40.81 Aligned_cols=98 Identities=23% Similarity=0.542 Sum_probs=73.3
Q ss_pred ccCCchhHHHHHHHhc---------CC----ccccCc--cCcccCCchhHHHHHHHhcCcccccCcCCcccCCchhhhhH
Q psy3685 513 EFSSKGNLTEHMKQHQ---------GI----RYKCDL--CSKEFSRKDNLTEHKEIHQGIRYKCDLCSKEFSRKDNLTEH 577 (1042)
Q Consensus 513 ~f~~~~~L~~H~~~H~---------~~----~~~C~~--C~k~f~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H 577 (1042)
-|...++.++|+-... ++ .|.|++ |...|........|...-++ -.|.+|.+.|.+...|..|
T Consensus 48 pFf~~G~v~r~l~~~~V~~d~~d~~~~~~~~~~~cqvagc~~~~d~lD~~E~hY~~~h~--~sCs~C~r~~Pt~hLLd~H 125 (253)
T KOG4173|consen 48 PFFEDGDVQRHLYLQDVIMDVADVPEKPRVPAFACQVAGCCQVFDALDDYEHHYHTLHG--NSCSFCKRAFPTGHLLDAH 125 (253)
T ss_pred cchhcccHHHHHhHhhheeeccccccccccccccccccchHHHHhhhhhHHHhhhhccc--chhHHHHHhCCchhhhhHH
Confidence 3556677888864321 11 478887 88899998888888654443 3699999999999999999
Q ss_pred hhh-h---------cCC-cccC--CccccccCCchhHHHHhhhcCCCc
Q psy3685 578 KEI-H---------QGI-RYKC--DLCSKKFSRKLNLTKHMKIHQGIR 612 (1042)
Q Consensus 578 ~~~-H---------~~~-~~~C--~~C~k~F~~~~~L~~H~~~H~~~~ 612 (1042)
+.. | .|. -|+| +-|+..|.+...-+.||..-|.-|
T Consensus 126 I~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk~P 173 (253)
T KOG4173|consen 126 ILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHKYP 173 (253)
T ss_pred HHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhccCC
Confidence 753 3 233 3888 569999999999999987665543
No 290
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=60.45 E-value=4.9 Score=28.90 Aligned_cols=29 Identities=21% Similarity=0.602 Sum_probs=14.5
Q ss_pred cccCCCccccccccchhhhhhhccCcccccCCCcc
Q psy3685 9 YKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSK 43 (1042)
Q Consensus 9 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k 43 (1042)
|+|+.||..|....... .+..-.|+.||.
T Consensus 6 y~C~~Cg~~fe~~~~~~------~~~~~~CP~Cg~ 34 (41)
T smart00834 6 YRCEDCGHTFEVLQKIS------DDPLATCPECGG 34 (41)
T ss_pred EEcCCCCCEEEEEEecC------CCCCCCCCCCCC
Confidence 56666666665432221 123445666665
No 291
>PF11288 DUF3089: Protein of unknown function (DUF3089); InterPro: IPR021440 This family of proteins has no known function.
Probab=60.01 E-value=17 Score=37.01 Aligned_cols=44 Identities=16% Similarity=0.030 Sum_probs=29.3
Q ss_pred HHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhC
Q psy3685 825 AEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKY 870 (1042)
Q Consensus 825 ~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~y 870 (1042)
.+-|-+|+..--+++...++ ++.|+||.|||=|+++...+...+
T Consensus 73 ~~~ay~DV~~AF~~yL~~~n--~GRPfILaGHSQGs~~l~~LL~e~ 116 (207)
T PF11288_consen 73 FDLAYSDVRAAFDYYLANYN--NGRPFILAGHSQGSMHLLRLLKEE 116 (207)
T ss_pred HHhhHHHHHHHHHHHHHhcC--CCCCEEEEEeChHHHHHHHHHHHH
Confidence 34456666654444444443 345999999999998887776554
No 292
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=59.86 E-value=5.8 Score=40.53 Aligned_cols=25 Identities=20% Similarity=0.271 Sum_probs=12.6
Q ss_pred CccccCccCccccCchhHHHHHhHh
Q psy3685 115 IRYKCNLCSKEFSRNCYLTEHMEQH 139 (1042)
Q Consensus 115 ~~~~C~~C~~~F~~~~~L~~H~~~h 139 (1042)
.++.|+.|+........|..-.+.|
T Consensus 208 k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 208 KPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred CCCCCCCCCCcccccccceeeeecc
Confidence 4555666655555444444444433
No 293
>COG1107 Archaea-specific RecJ-like exonuclease, contains DnaJ-type Zn finger domain [DNA replication, recombination, and repair]
Probab=59.65 E-value=1e+02 Score=36.03 Aligned_cols=137 Identities=12% Similarity=0.192 Sum_probs=79.7
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcC
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYS 844 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~ 844 (1042)
-|+||++++-+.... -.....++.+|+.+|.+||=|=|.- +++++ +...+
T Consensus 418 lPL~VlvDnGsTeED-----ipA~~~~k~Ygi~ivVVDHH~Pde~------------------vvD~y------vd~HV- 467 (715)
T COG1107 418 LPLLVLVDNGSTEED-----IPAIKQLKAYGIDIVVVDHHYPDEA------------------VVDEY------VDVHV- 467 (715)
T ss_pred cceEEEEcCCCcccc-----cHHHHHHHhcCCCEEEEcCCCCcch------------------hhhhh------hhhcc-
Confidence 588998865322211 2245778899999999999885543 11221 11111
Q ss_pred CCCCCcEEEEccch---hHHHHHHH-HHhCCCeEE-----EEEeeccccccccCChhhHHHHHHHhhccChhhHHHHHHH
Q psy3685 845 LPAHTKWIAFGGSY---PGALAAWL-RYKYPHLVH-----GAMSASGPLRAVVDFPEYFGVVADALATVSTECVKAVQTA 915 (1042)
Q Consensus 845 ~~~~~~~i~~G~Sy---gg~la~~~-~~~yP~~~~-----g~i~ssa~~~~~~~~~~~~~~v~~~~~~~~~~c~~~i~~~ 915 (1042)
+|. +.||-| .|||++-+ |+-+|+.-. -|||.-+=......+++|.+...+.. +...-...|..+
T Consensus 468 ----NPy-~vGgd~~itaG~L~vEiArmInp~v~d~ikHLPaVA~~gD~a~ape~~~Ylela~~~g--yd~e~L~~ia~a 540 (715)
T COG1107 468 ----NPY-LVGGDSNITAGMLCVEIARMINPEVEDDIKHLPAVAGVGDRAKAPEAEQYLELAAERG--YDREDLEKIALA 540 (715)
T ss_pred ----Chh-hcCCCcCcchhHHHHHHHHHcCcchhhhhhcCcceeeecccccChhHHHHHHHHHhcC--CCHHHHHHHHHH
Confidence 132 233333 58888874 588997632 23332222222345667777766543 567777888887
Q ss_pred HHHHHHHhcCCchHHHHHHHhcC
Q psy3685 916 THTISKMLKSPSDAKYLTEQFKL 938 (1042)
Q Consensus 916 ~~~~~~~~~~~~~~~~l~~~f~~ 938 (1042)
.+.-.-++.-..++..+..+++.
T Consensus 541 vd~EaFylrf~~gr~ii~dIL~~ 563 (715)
T COG1107 541 VDYEAFYLRFMDGRGIIADILGT 563 (715)
T ss_pred HhHHHHHhhhcccchHHHHHhhc
Confidence 76555555555566667777764
No 294
>COG0846 SIR2 NAD-dependent protein deacetylases, SIR2 family [Transcription]
Probab=58.35 E-value=25 Score=37.13 Aligned_cols=102 Identities=14% Similarity=0.214 Sum_probs=64.9
Q ss_pred cccccCCccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhhcccCCCCCcccceEE
Q psy3685 674 NKILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQKLDHFNIEEPRTWKQR 753 (1042)
Q Consensus 674 ~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~~dhf~~~~~~tf~qr 753 (1042)
..+|..-....|..|+..|........ ...+ .+..|+.|+.+.. .|-.-||-.+++. . |
T Consensus 114 i~lHGsl~~~~C~~C~~~~~~~~~~~~---~~~~-~~p~C~~Cg~~~l-------rP~VV~fGE~lp~-~------~--- 172 (250)
T COG0846 114 IELHGSLKRVRCSKCGNQYYDEDVIKF---IEDG-LIPRCPKCGGPVL-------RPDVVWFGEPLPA-S------F--- 172 (250)
T ss_pred EEeccceeeeEeCCCcCccchhhhhhh---cccC-CCCcCccCCCccc-------cCCEEEeCCCCCH-H------H---
Confidence 345666678899999998874441111 1222 5889999997511 2345677777664 1 1
Q ss_pred EEEecccc--cCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEee
Q psy3685 754 YFVNSTWY--KHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEH 803 (1042)
Q Consensus 754 y~~~~~~~--~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~eh 803 (1042)
.+..+ -...-+|+.+|+.|.+.+... +..++++.|+.++.+..
T Consensus 173 ---~~~~~~~~~~~d~liviGTSl~V~Paa~----~p~~~~~~g~~~i~iN~ 217 (250)
T COG0846 173 ---LDEALEALKEADLLIVIGTSLKVYPAAG----LPELAKRRGAKVIEINL 217 (250)
T ss_pred ---HHHHHHHhccCCEEEEECcceEEcChhh----hhHHHHhcCCEEEEECC
Confidence 11111 145668999999887766322 33478889999999987
No 295
>TIGR01849 PHB_depoly_PhaZ polyhydroxyalkanoate depolymerase, intracellular. This model represents an intracellular depolymerase for polyhydroxyalkanoate (PHA), a carbon and energy storing polyester that accumulates in granules in many bacterial species when carbon sources are abundant but other nutrients are limiting. This family is named for PHAs generally, rather than polyhydroxybutyrate (PHB) specificially as in Ralstonia eutropha H16, to avoid overcalling chemical specificity in other species. Note that this family lacks the classic GXSXG lipase motif and instead shows weak similarity to some
Probab=58.26 E-value=39 Score=38.55 Aligned_cols=89 Identities=13% Similarity=0.022 Sum_probs=57.8
Q ss_pred cccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHH
Q psy3685 784 EGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALA 863 (1042)
Q Consensus 784 ~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la 863 (1042)
.+++..|.. |..|+.+|-+.-+.. | ..-+.++.+.-++-+.++|+.+ + .++-++|.+.||.++
T Consensus 120 RS~V~~Ll~--g~dVYl~DW~~p~~v-p------~~~~~f~ldDYi~~l~~~i~~~----G----~~v~l~GvCqgG~~~ 182 (406)
T TIGR01849 120 RSTVEALLP--DHDVYITDWVNARMV-P------LSAGKFDLEDYIDYLIEFIRFL----G----PDIHVIAVCQPAVPV 182 (406)
T ss_pred HHHHHHHhC--CCcEEEEeCCCCCCC-c------hhcCCCCHHHHHHHHHHHHHHh----C----CCCcEEEEchhhHHH
Confidence 344444555 889999998776643 1 1124566666664444444333 1 248899999999985
Q ss_pred HHHH-----HhCCCeEEEEEeeccccccccC
Q psy3685 864 AWLR-----YKYPHLVHGAMSASGPLRAVVD 889 (1042)
Q Consensus 864 ~~~~-----~~yP~~~~g~i~ssa~~~~~~~ 889 (1042)
+.+. ...|..+..++...+|+....+
T Consensus 183 laa~Al~a~~~~p~~~~sltlm~~PID~~~~ 213 (406)
T TIGR01849 183 LAAVALMAENEPPAQPRSMTLMGGPIDARAS 213 (406)
T ss_pred HHHHHHHHhcCCCCCcceEEEEecCccCCCC
Confidence 4433 3448889999999999986544
No 296
>PRK00091 miaA tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed
Probab=57.31 E-value=36 Score=37.34 Aligned_cols=40 Identities=10% Similarity=0.247 Sum_probs=29.0
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeee--eccccc
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR--FYGKSH 810 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR--~~G~S~ 810 (1042)
+||+++|..|+. -+.+...||+.+++.+|..|-+ |.|.+.
T Consensus 5 ~~i~i~GptgsG-----Kt~la~~la~~~~~~iis~Ds~Qvy~~l~i 46 (307)
T PRK00091 5 KVIVIVGPTASG-----KTALAIELAKRLNGEIISADSMQVYRGMDI 46 (307)
T ss_pred eEEEEECCCCcC-----HHHHHHHHHHhCCCcEEeccccceeecccc
Confidence 567777776654 3345678999999999999985 555554
No 297
>PLN02606 palmitoyl-protein thioesterase
Probab=56.51 E-value=93 Score=33.79 Aligned_cols=43 Identities=21% Similarity=0.149 Sum_probs=38.5
Q ss_pred cEEEEccchhHHHHHHHHHhCCCe--EEEEEeeccccccccCChh
Q psy3685 850 KWIAFGGSYPGALAAWLRYKYPHL--VHGAMSASGPLRAVVDFPE 892 (1042)
Q Consensus 850 ~~i~~G~Sygg~la~~~~~~yP~~--~~g~i~ssa~~~~~~~~~~ 892 (1042)
-+-++|.|=||.++.-+.++.|+. |.-.|+-++|...+..++.
T Consensus 96 G~naIGfSQGglflRa~ierc~~~p~V~nlISlggph~Gv~g~p~ 140 (306)
T PLN02606 96 GYNIVAESQGNLVARGLIEFCDNAPPVINYVSLGGPHAGVAAIPK 140 (306)
T ss_pred ceEEEEEcchhHHHHHHHHHCCCCCCcceEEEecCCcCCcccCcc
Confidence 389999999999999999999983 9999999999988877663
No 298
>KOG2186|consensus
Probab=55.39 E-value=7.5 Score=39.80 Aligned_cols=47 Identities=21% Similarity=0.620 Sum_probs=31.1
Q ss_pred cccCcCCcccCCchhhhhHhhhhcCCcccCCccccccCCchhHHHHhhh
Q psy3685 559 YKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKKFSRKLNLTKHMKI 607 (1042)
Q Consensus 559 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H~~~ 607 (1042)
|.|.+||.+-..+ .+.+|+-.-++.-|.|-.|++.|.. .+.+.|.+-
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~kC 50 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHTKC 50 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccCCeeEEeeccccccc-chhhhhhhh
Confidence 5677777765543 4556776666666777777777776 666666653
No 299
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=55.07 E-value=7.3 Score=27.53 Aligned_cols=13 Identities=23% Similarity=0.605 Sum_probs=6.8
Q ss_pred cccCCCccccccc
Q psy3685 9 YKCDLCSKEFSRK 21 (1042)
Q Consensus 9 ~~C~~C~~~f~~~ 21 (1042)
++|+.|+..|.-.
T Consensus 3 ~~CP~C~~~~~v~ 15 (38)
T TIGR02098 3 IQCPNCKTSFRVV 15 (38)
T ss_pred EECCCCCCEEEeC
Confidence 4555565555433
No 300
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=54.95 E-value=6.6 Score=30.09 Aligned_cols=30 Identities=20% Similarity=0.579 Sum_probs=15.8
Q ss_pred ccccCCCccccccccchhhhhhhccCcccccCCCcc
Q psy3685 8 GYKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSK 43 (1042)
Q Consensus 8 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k 43 (1042)
.|+|+.|+..|....... ...+-.|+.||.
T Consensus 5 ey~C~~Cg~~fe~~~~~~------~~~~~~CP~Cg~ 34 (52)
T TIGR02605 5 EYRCTACGHRFEVLQKMS------DDPLATCPECGG 34 (52)
T ss_pred EEEeCCCCCEeEEEEecC------CCCCCCCCCCCC
Confidence 366777776666442221 123345666664
No 301
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=54.57 E-value=10 Score=29.46 Aligned_cols=35 Identities=9% Similarity=0.070 Sum_probs=24.6
Q ss_pred cccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCC
Q psy3685 682 RYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEP 721 (1042)
Q Consensus 682 ~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~ 721 (1042)
.|.|+.||+.-..+..--+ .=..+|.|+.||+..|
T Consensus 27 ~F~CPnCGe~~I~Rc~~CR-----k~g~~Y~Cp~CGF~GP 61 (61)
T COG2888 27 KFPCPNCGEVEIYRCAKCR-----KLGNPYRCPKCGFEGP 61 (61)
T ss_pred EeeCCCCCceeeehhhhHH-----HcCCceECCCcCccCC
Confidence 4899999987776554322 1245899999998543
No 302
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=54.53 E-value=3.8 Score=30.94 Aligned_cols=22 Identities=23% Similarity=0.548 Sum_probs=11.8
Q ss_pred cccCCCccccccccchhhhhhh
Q psy3685 36 YKCNLCSKEFSRKYNLTDHKKI 57 (1042)
Q Consensus 36 ~~C~~C~k~f~~~~~L~~H~~~ 57 (1042)
++||-||+.|....+..+|...
T Consensus 18 lrCPRC~~~FR~~K~Y~RHVNK 39 (65)
T COG4049 18 LRCPRCGMVFRRRKDYIRHVNK 39 (65)
T ss_pred eeCCchhHHHHHhHHHHHHhhH
Confidence 4555555555555555555443
No 303
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=54.38 E-value=7.2 Score=29.03 Aligned_cols=30 Identities=23% Similarity=0.540 Sum_probs=16.6
Q ss_pred CccccCCCccccccccchhhhhhhccCcccccCCCcccc
Q psy3685 7 IGYKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSKEF 45 (1042)
Q Consensus 7 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k~f 45 (1042)
+.|+|+.||..|.....- ....|+.||..+
T Consensus 2 ~~y~C~~CG~~~~~~~~~---------~~~~Cp~CG~~~ 31 (46)
T PRK00398 2 AEYKCARCGREVELDEYG---------TGVRCPYCGYRI 31 (46)
T ss_pred CEEECCCCCCEEEECCCC---------CceECCCCCCeE
Confidence 356777777766543211 145677776544
No 304
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=54.29 E-value=6.8 Score=28.53 Aligned_cols=30 Identities=23% Similarity=0.656 Sum_probs=16.0
Q ss_pred ccccCCCccccccccchhhhhhhccCcccccCCCcc
Q psy3685 8 GYKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSK 43 (1042)
Q Consensus 8 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k 43 (1042)
.|+|..||..|....... ...+-.|+.||.
T Consensus 5 ey~C~~Cg~~fe~~~~~~------~~~~~~CP~Cg~ 34 (42)
T PF09723_consen 5 EYRCEECGHEFEVLQSIS------EDDPVPCPECGS 34 (42)
T ss_pred EEEeCCCCCEEEEEEEcC------CCCCCcCCCCCC
Confidence 366777766665443222 123455666665
No 305
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=54.26 E-value=8.6 Score=28.33 Aligned_cols=27 Identities=26% Similarity=0.497 Sum_probs=16.0
Q ss_pred ccccCCCccccccccchhhhhhhccCcccccCCCccc
Q psy3685 8 GYKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSKE 44 (1042)
Q Consensus 8 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k~ 44 (1042)
.|.|..||..|... .+.+.+|+.||..
T Consensus 2 ~Y~C~~Cg~~~~~~----------~~~~irC~~CG~r 28 (44)
T smart00659 2 IYICGECGRENEIK----------SKDVVRCRECGYR 28 (44)
T ss_pred EEECCCCCCEeecC----------CCCceECCCCCce
Confidence 36677777766533 2345667777643
No 306
>PF09949 DUF2183: Uncharacterized conserved protein (DUF2183); InterPro: IPR019236 This domain, found in various bacterial and fungal proteins, has no known function.
Probab=53.98 E-value=65 Score=28.68 Aligned_cols=32 Identities=19% Similarity=0.239 Sum_probs=25.1
Q ss_pred CCCcEEEEccchhHHHH--HHHHHhCCCeEEEEE
Q psy3685 847 AHTKWIAFGGSYPGALA--AWLRYKYPHLVHGAM 878 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la--~~~~~~yP~~~~g~i 878 (1042)
++.+||++|-|=-+-.. +-++.+||+.|.|..
T Consensus 63 P~~kfiLIGDsgq~DpeiY~~ia~~~P~~i~ai~ 96 (100)
T PF09949_consen 63 PERKFILIGDSGQHDPEIYAEIARRFPGRILAIY 96 (100)
T ss_pred CCCcEEEEeeCCCcCHHHHHHHHHHCCCCEEEEE
Confidence 44699999999877544 447789999998865
No 307
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=53.88 E-value=6.4 Score=40.67 Aligned_cols=13 Identities=46% Similarity=1.045 Sum_probs=5.8
Q ss_pred cccCCCccccccc
Q psy3685 9 YKCDLCSKEFSRK 21 (1042)
Q Consensus 9 ~~C~~C~~~f~~~ 21 (1042)
++||+|++.|.++
T Consensus 6 ~~CPvC~~~F~~~ 18 (214)
T PF09986_consen 6 ITCPVCGKEFKTK 18 (214)
T ss_pred eECCCCCCeeeee
Confidence 3444444444444
No 308
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=53.60 E-value=31 Score=31.27 Aligned_cols=26 Identities=27% Similarity=0.630 Sum_probs=20.5
Q ss_pred cccc----CCCCcccCCchhHHHHHHHhcC
Q psy3685 504 RYKC----DLCSKEFSSKGNLTEHMKQHQG 529 (1042)
Q Consensus 504 ~~~C----~~C~~~f~~~~~L~~H~~~H~~ 529 (1042)
-|.| ..|+....+...+++|++.++|
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 3778 8888888888888888877664
No 309
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=53.51 E-value=6.7 Score=37.82 Aligned_cols=35 Identities=20% Similarity=0.442 Sum_probs=16.5
Q ss_pred ccccCCCccccccccchhhhhhhccCCccccCccCccc
Q psy3685 35 RYKCNLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSKEF 72 (1042)
Q Consensus 35 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~C~~C~~~f 72 (1042)
-|.|+.|+..|.....+..- .....|.|+.||...
T Consensus 99 ~Y~Cp~C~~~y~~~ea~~~~---d~~~~f~Cp~Cg~~l 133 (147)
T smart00531 99 YYKCPNCQSKYTFLEANQLL---DMDGTFTCPRCGEEL 133 (147)
T ss_pred EEECcCCCCEeeHHHHHHhc---CCCCcEECCCCCCEE
Confidence 46666666666543332210 011236666666543
No 310
>PF02273 Acyl_transf_2: Acyl transferase; InterPro: IPR003157 LuxD proteins are bacterial acyl transferases. Together with an acyl-protein synthetase (LuxE) and reductase (LuxC), they form a multienzyme complex. This complex channels activated fatty acids into the aldehyde substrate for the luciferase-catalyzed bacterial bioluminescence reaction [, ]. ; GO: 0016746 transferase activity, transferring acyl groups, 0006631 fatty acid metabolic process; PDB: 1THT_B.
Probab=52.75 E-value=67 Score=33.47 Aligned_cols=106 Identities=13% Similarity=0.065 Sum_probs=55.4
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeec-ccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFY-GKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~-G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
.+|+||+..|.|.....+. |+...|+.. |..||=.|+-.+ |.| .+++.-+|+.-..+|+..++.+++.
T Consensus 29 ~~~tiliA~Gf~rrmdh~a--gLA~YL~~N-GFhViRyDsl~HvGlS-------sG~I~eftms~g~~sL~~V~dwl~~- 97 (294)
T PF02273_consen 29 RNNTILIAPGFARRMDHFA--GLAEYLSAN-GFHVIRYDSLNHVGLS-------SGDINEFTMSIGKASLLTVIDWLAT- 97 (294)
T ss_dssp -S-EEEEE-TT-GGGGGGH--HHHHHHHTT-T--EEEE---B--------------------HHHHHHHHHHHHHHHHH-
T ss_pred cCCeEEEecchhHHHHHHH--HHHHHHhhC-CeEEEeccccccccCC-------CCChhhcchHHhHHHHHHHHHHHHh-
Confidence 3589999999987654333 344445544 778887776554 665 4567788999999999999999984
Q ss_pred cCCCCCCcEEEEccchhHHHHHHHHHhCCCeEEEEEeeccccc
Q psy3685 843 YSLPAHTKWIAFGGSYPGALAAWLRYKYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 843 ~~~~~~~~~i~~G~Sygg~la~~~~~~yP~~~~g~i~ssa~~~ 885 (1042)
.+. ..+=++-.|.+|-+|--.+. -++ +.-+|.+-++|+
T Consensus 98 ~g~---~~~GLIAaSLSaRIAy~Va~-~i~-lsfLitaVGVVn 135 (294)
T PF02273_consen 98 RGI---RRIGLIAASLSARIAYEVAA-DIN-LSFLITAVGVVN 135 (294)
T ss_dssp TT------EEEEEETTHHHHHHHHTT-TS---SEEEEES--S-
T ss_pred cCC---CcchhhhhhhhHHHHHHHhh-ccC-cceEEEEeeeee
Confidence 332 35888999999999877666 344 444444446664
No 311
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=52.29 E-value=6.9 Score=38.21 Aligned_cols=16 Identities=13% Similarity=0.301 Sum_probs=8.1
Q ss_pred cccCCCccccccccch
Q psy3685 36 YKCNLCSKEFSRKYNL 51 (1042)
Q Consensus 36 ~~C~~C~k~f~~~~~L 51 (1042)
|.|+.|+..|+.-..+
T Consensus 110 Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 110 FICPNMCVRFTFNEAM 125 (158)
T ss_pred EECCCCCcEeeHHHHH
Confidence 5555555555544444
No 312
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=51.98 E-value=19 Score=32.74 Aligned_cols=25 Identities=28% Similarity=0.615 Sum_probs=22.4
Q ss_pred ccC----CCCCcccCCchhHHHHHHhhcC
Q psy3685 452 YKC----DLCSKEFSRKGNLTKHMKLHQG 476 (1042)
Q Consensus 452 ~~C----~~C~k~f~~~~~L~~H~~~H~~ 476 (1042)
|.| ..|+..+.+...+.+|.+.++|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 889 9999999999999999988764
No 313
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=51.14 E-value=10 Score=28.86 Aligned_cols=23 Identities=43% Similarity=0.816 Sum_probs=17.9
Q ss_pred cCCCCCCccCCc-----chHHHHHh-hcc
Q psy3685 684 KCDLCSKDFSRK-----DNLTEHKE-IHQ 706 (1042)
Q Consensus 684 ~C~~C~k~F~~~-----~~L~~H~~-~H~ 706 (1042)
.|..|++.++.. ++|.+|++ +|.
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 688888888766 58999998 665
No 314
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=50.78 E-value=11 Score=26.38 Aligned_cols=12 Identities=17% Similarity=0.661 Sum_probs=6.1
Q ss_pred ccCCCccccccc
Q psy3685 10 KCDLCSKEFSRK 21 (1042)
Q Consensus 10 ~C~~C~~~f~~~ 21 (1042)
.|+.|+..|.-.
T Consensus 4 ~Cp~C~~~y~i~ 15 (36)
T PF13717_consen 4 TCPNCQAKYEID 15 (36)
T ss_pred ECCCCCCEEeCC
Confidence 455555555433
No 315
>KOG2906|consensus
Probab=50.35 E-value=12 Score=32.30 Aligned_cols=75 Identities=17% Similarity=0.261 Sum_probs=34.7
Q ss_pred cccCCCCCcccccCCCCcc----cccCCcCcccCCccccccCCCcccccccccccccccccccccccccccCCcc----c
Q psy3685 612 RYKCDLCSKEFSRKDIPYE----VTVKEESYEGDSINVEENNGNDAVTRDCLEFFKTEQNTTLGKLNKILHQDIR----Y 683 (1042)
Q Consensus 612 ~~~C~~C~k~F~~~~~l~~----~~~~~~~~~c~~~~~~~~~~~~~~~~~C~~~f~~~~~~~l~~h~~~h~~~~~----~ 683 (1042)
.|.|..|+..|.-...+.. .-.+-..+.......+.+......|+.|+- ..+.-.-+.+-..+.| |
T Consensus 21 rf~C~tCpY~~~I~~ei~~r~~~~~Kevd~vlgg~~a~~nv~~t~~~Cp~Cgh------~rayF~qlQtRSADEPmT~FY 94 (105)
T KOG2906|consen 21 RFSCRTCPYVFPISREISSRKYPKLKEVDDVLGGDEAWENVDQTEATCPTCGH------ERAYFMQLQTRSADEPMTTFY 94 (105)
T ss_pred eEEcCCCCceeeEeeeeeccccCchhhhhhhcCCcccccchhhccCcCCCCCC------CceEEEEeeeccCCCcHhHhh
Confidence 5777777777754422221 000001111111122233344556667764 2222333444555555 7
Q ss_pred cCCCCCCcc
Q psy3685 684 KCDLCSKDF 692 (1042)
Q Consensus 684 ~C~~C~k~F 692 (1042)
+|--|+..-
T Consensus 95 kC~~C~~~W 103 (105)
T KOG2906|consen 95 KCCKCKHRW 103 (105)
T ss_pred hhhcccccc
Confidence 888887653
No 316
>KOG3253|consensus
Probab=50.04 E-value=33 Score=40.10 Aligned_cols=112 Identities=13% Similarity=0.054 Sum_probs=65.1
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhh
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTL 842 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~ 842 (1042)
+..|+++++.|-+.. + -...+++.|-.++.-..=.+|+..|-.-.+-+ -+++.++++-+..|.++...+
T Consensus 174 ~~spl~i~aps~p~a-p--~tSd~~~~wqs~lsl~gevvev~tfdl~n~ig--------G~nI~h~ae~~vSf~r~kvle 242 (784)
T KOG3253|consen 174 PASPLAIKAPSTPLA-P--KTSDRMWSWQSRLSLKGEVVEVPTFDLNNPIG--------GANIKHAAEYSVSFDRYKVLE 242 (784)
T ss_pred cCCceEEeccCCCCC-C--ccchHHHhHHHHHhhhceeeeeccccccCCCC--------CcchHHHHHHHHHHhhhhhhh
Confidence 567888888554411 0 11223333333333333444554444333322 278899999999998854433
Q ss_pred c-CCCCCCcEEEEccchhHHHHHHHHH-hCCCeEEEEEeeccccc
Q psy3685 843 Y-SLPAHTKWIAFGGSYPGALAAWLRY-KYPHLVHGAMSASGPLR 885 (1042)
Q Consensus 843 ~-~~~~~~~~i~~G~Sygg~la~~~~~-~yP~~~~g~i~ssa~~~ 885 (1042)
. ...+..++||+|-|+|..++-.... .+-..|+|.|-.+=|+.
T Consensus 243 i~gefpha~IiLvGrsmGAlVachVSpsnsdv~V~~vVCigypl~ 287 (784)
T KOG3253|consen 243 ITGEFPHAPIILVGRSMGALVACHVSPSNSDVEVDAVVCIGYPLD 287 (784)
T ss_pred hhccCCCCceEEEecccCceeeEEeccccCCceEEEEEEeccccc
Confidence 2 1123468999999999777766654 33344777776666664
No 317
>PLN02840 tRNA dimethylallyltransferase
Probab=49.86 E-value=49 Score=37.83 Aligned_cols=84 Identities=14% Similarity=0.158 Sum_probs=50.0
Q ss_pred CCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeee--ecccc----cCCCC-CC--------c-ccccCCCHHH
Q psy3685 764 GGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR--FYGKS----HPLSD-LS--------V-ESLQYLSAEQ 827 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR--~~G~S----~p~~~-~~--------~-~~~~~lt~~q 827 (1042)
.++||+++|..|+. -+.+...||+.+++.+|.+|-. |.|.. .|... .. . +--...++.+
T Consensus 20 ~~~vi~I~GptgsG-----KTtla~~La~~~~~~iis~Ds~qvYr~~~IgTaKpt~eE~~~V~Hhlidil~p~e~ySv~~ 94 (421)
T PLN02840 20 KEKVIVISGPTGAG-----KSRLALELAKRLNGEIISADSVQVYRGLDVGSAKPSLSERKEVPHHLIDILHPSDDYSVGA 94 (421)
T ss_pred CCeEEEEECCCCCC-----HHHHHHHHHHHCCCCeEeccccceecceeEEcCCCCHHHHcCCCeEeEeecCCCCceeHHH
Confidence 35577777776654 3345679999999999999863 33433 34211 00 0 0012346667
Q ss_pred HHhHHHHHHHHHhhhcCCCCCCcEEEEccc
Q psy3685 828 ALADLAYFITSMNTLYSLPAHTKWIAFGGS 857 (1042)
Q Consensus 828 ~l~D~~~~~~~~~~~~~~~~~~~~i~~G~S 857 (1042)
-+.|....|+.+... + ..-|++||+
T Consensus 95 F~~~A~~~I~~i~~r----g-kiPIvVGGT 119 (421)
T PLN02840 95 FFDDARRATQDILNR----G-RVPIVAGGT 119 (421)
T ss_pred HHHHHHHHHHHHHhc----C-CCEEEEcCc
Confidence 777777777766543 2 234778887
No 318
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=49.66 E-value=13 Score=23.92 Aligned_cols=10 Identities=30% Similarity=0.843 Sum_probs=5.6
Q ss_pred ccCCCCcccC
Q psy3685 506 KCDLCSKEFS 515 (1042)
Q Consensus 506 ~C~~C~~~f~ 515 (1042)
.|+.||..|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 4666665553
No 319
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=48.57 E-value=12 Score=26.31 Aligned_cols=10 Identities=30% Similarity=0.896 Sum_probs=5.0
Q ss_pred ccCCCccccc
Q psy3685 10 KCDLCSKEFS 19 (1042)
Q Consensus 10 ~C~~C~~~f~ 19 (1042)
.|+.|+..|.
T Consensus 4 ~CP~C~~~f~ 13 (37)
T PF13719_consen 4 TCPNCQTRFR 13 (37)
T ss_pred ECCCCCceEE
Confidence 4555555554
No 320
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=47.81 E-value=14 Score=28.83 Aligned_cols=35 Identities=9% Similarity=0.174 Sum_probs=22.3
Q ss_pred cccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCC
Q psy3685 682 RYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEP 721 (1042)
Q Consensus 682 ~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~ 721 (1042)
.|.|+.||+.-..+..--+ .-..+|.|+.||+..|
T Consensus 25 ~F~CPnCG~~~I~RC~~CR-----k~~~~Y~CP~CGF~GP 59 (59)
T PRK14890 25 KFLCPNCGEVIIYRCEKCR-----KQSNPYTCPKCGFEGP 59 (59)
T ss_pred EeeCCCCCCeeEeechhHH-----hcCCceECCCCCCcCc
Confidence 4889999987444432211 1146899999998543
No 321
>PLN02165 adenylate isopentenyltransferase
Probab=47.49 E-value=57 Score=36.03 Aligned_cols=43 Identities=12% Similarity=0.225 Sum_probs=30.7
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeee--eccccc
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR--FYGKSH 810 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR--~~G~S~ 810 (1042)
+.++||+++|-.|+. -+.+...||+.+++.+|..|-+ |-|...
T Consensus 41 ~~g~iivIiGPTGSG-----KStLA~~LA~~l~~eIIsaDs~QvYkgldI 85 (334)
T PLN02165 41 CKDKVVVIMGATGSG-----KSRLSVDLATRFPSEIINSDKMQVYDGLKI 85 (334)
T ss_pred CCCCEEEEECCCCCc-----HHHHHHHHHHHcCCceecCChheeECCccc
Confidence 456788888876654 3345678999999999998866 445544
No 322
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=46.05 E-value=15 Score=27.49 Aligned_cols=34 Identities=21% Similarity=0.603 Sum_probs=15.2
Q ss_pred ccCCcCCccccCcccccccccccC-------CccccCCCCc
Q psy3685 479 YKCDLCLKEFSRKDNLTDRKIHQG-------IRYKCDLCSK 512 (1042)
Q Consensus 479 ~~C~~C~~~f~~~~~l~~~~~~~~-------~~~~C~~C~~ 512 (1042)
|+|.+|+..+.....-....+..+ ..|.|++|+.
T Consensus 2 y~C~~CgyvYd~~~Gd~~~~i~pGt~F~~Lp~~w~CP~C~a 42 (47)
T PF00301_consen 2 YQCPVCGYVYDPEKGDPENGIPPGTPFEDLPDDWVCPVCGA 42 (47)
T ss_dssp EEETTTSBEEETTTBBGGGTB-TT--GGGS-TT-B-TTTSS
T ss_pred cCCCCCCEEEcCCcCCcccCcCCCCCHHHCCCCCcCcCCCC
Confidence 556666655544433332333332 2366777764
No 323
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=45.81 E-value=10 Score=37.89 Aligned_cols=29 Identities=17% Similarity=0.547 Sum_probs=14.7
Q ss_pred ccccCCCccccccccchhhhhhhccCCccccCccCcc
Q psy3685 35 RYKCNLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSKE 71 (1042)
Q Consensus 35 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~C~~C~~~ 71 (1042)
-|.|+.|+..|+.-..+. .-|.|+.||..
T Consensus 117 ~Y~Cp~C~~rytf~eA~~--------~~F~Cp~Cg~~ 145 (178)
T PRK06266 117 FFFCPNCHIRFTFDEAME--------YGFRCPQCGEM 145 (178)
T ss_pred EEECCCCCcEEeHHHHhh--------cCCcCCCCCCC
Confidence 355555555555444432 13556666554
No 324
>PF06792 UPF0261: Uncharacterised protein family (UPF0261); InterPro: IPR008322 The proteins in this entry are functionally uncharacterised.
Probab=44.22 E-value=1.9e+02 Score=32.78 Aligned_cols=152 Identities=20% Similarity=0.196 Sum_probs=75.8
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCccc-----------ccCC-CHHHHHhHHH
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVES-----------LQYL-SAEQALADLA 833 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~-----------~~~l-t~~q~l~D~~ 833 (1042)
|.|+++|-.-.- ...-.+..+.-++.|..++.+|-=-.|.+....|.+.+. +.-. +=.+|++=.+
T Consensus 2 ~tI~iigT~DTK---~~E~~yl~~~i~~~G~~v~~iDvg~~~~~~~~~di~~~eVa~~~g~~~~~~~~~~dRg~ai~~M~ 78 (403)
T PF06792_consen 2 KTIAIIGTLDTK---GEELLYLRDQIEAQGVEVLLIDVGTLGEPSFPPDISREEVARAAGDSIEAVRSSGDRGEAIEAMA 78 (403)
T ss_pred CEEEEEEccCCC---HHHHHHHHHHHHHCCCcEEEEEcCCCCCCCCCCCcCHHHHHHhcCCChHHhhccCCHHHHHHHHH
Confidence 445666543222 222233555555578888888865555443333443322 1111 2223333332
Q ss_pred HHH-HHHhhhcCCCCCCcEEEEccchhHHHHHHH-H---HhCCCeEEEEEeeccccccc---cCChhhHHHHHHHhhccC
Q psy3685 834 YFI-TSMNTLYSLPAHTKWIAFGGSYPGALAAWL-R---YKYPHLVHGAMSASGPLRAV---VDFPEYFGVVADALATVS 905 (1042)
Q Consensus 834 ~~~-~~~~~~~~~~~~~~~i~~G~Sygg~la~~~-~---~~yP~~~~g~i~ssa~~~~~---~~~~~~~~~v~~~~~~~~ 905 (1042)
.=. ..+...+....-.-+|.+|||-|..|++-. | ..+|.++...+||. -+... .|..-.+.++ + +.+.+
T Consensus 79 ~ga~~~v~~l~~~g~i~Gvi~~GGs~GT~lat~aMr~LPiG~PKlmVST~ASG-d~~~yvg~sDI~mm~SVv-D-iaGlN 155 (403)
T PF06792_consen 79 RGAARFVSDLYDEGKIDGVIGIGGSGGTALATAAMRALPIGFPKLMVSTMASG-DTSPYVGESDITMMYSVV-D-IAGLN 155 (403)
T ss_pred HHHHHHHHHHHhcCCccEEEEecCCccHHHHHHHHHhCCCCCCeEEEEccCCC-CcccccCcCCEEEeeecc-c-cccCC
Confidence 222 223333321111248999999999998764 4 46789999999864 22211 1211111111 1 22334
Q ss_pred hhhHHHHHHHHHHHHHHh
Q psy3685 906 TECVKAVQTATHTISKML 923 (1042)
Q Consensus 906 ~~c~~~i~~~~~~~~~~~ 923 (1042)
+-....+.+|...|..+.
T Consensus 156 ~isr~vL~NAA~Ai~GM~ 173 (403)
T PF06792_consen 156 SISRRVLSNAAGAIAGMA 173 (403)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 555666666666666666
No 325
>KOG2807|consensus
Probab=43.97 E-value=24 Score=37.67 Aligned_cols=31 Identities=32% Similarity=0.771 Sum_probs=22.7
Q ss_pred CccccCccCccccCchhHHHHHhHhcCCccccCCCC
Q psy3685 115 IRYKCNLCSKEFSRNCYLTEHMEQHQGIRYKCDLCS 150 (1042)
Q Consensus 115 ~~~~C~~C~~~F~~~~~L~~H~~~h~~~~~~C~~C~ 150 (1042)
..|.|+.|...|-.-.+.-.|...| .|+.|.
T Consensus 344 ~~y~C~~Ck~~FCldCDv~iHesLh-----~CpgCe 374 (378)
T KOG2807|consen 344 GRYRCESCKNVFCLDCDVFIHESLH-----NCPGCE 374 (378)
T ss_pred CcEEchhccceeeccchHHHHhhhh-----cCCCcC
Confidence 3688888888888777777776665 366665
No 326
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=43.29 E-value=23 Score=26.91 Aligned_cols=8 Identities=25% Similarity=1.103 Sum_probs=4.9
Q ss_pred cccCCCCc
Q psy3685 505 YKCDLCSK 512 (1042)
Q Consensus 505 ~~C~~C~~ 512 (1042)
|.|++|+.
T Consensus 35 w~CP~C~a 42 (50)
T cd00730 35 WVCPVCGA 42 (50)
T ss_pred CCCCCCCC
Confidence 56666653
No 327
>COG0324 MiaA tRNA delta(2)-isopentenylpyrophosphate transferase [Translation, ribosomal structure and biogenesis]
Probab=43.14 E-value=80 Score=34.45 Aligned_cols=71 Identities=15% Similarity=0.141 Sum_probs=44.4
Q ss_pred cccHHHHHHHcCCeEEEEeee--ecccc----cCCCC-CC--------c-ccccCCCHHHHHhHHHHHHHHHhhhcCCCC
Q psy3685 784 EGTWLDYAHNHNALAVQVEHR--FYGKS----HPLSD-LS--------V-ESLQYLSAEQALADLAYFITSMNTLYSLPA 847 (1042)
Q Consensus 784 ~~~~~~la~~~~a~~~~~ehR--~~G~S----~p~~~-~~--------~-~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~ 847 (1042)
+.+..+||+++|+-||.+|-. |-|.- +|... +. . +--...|+.+...|+...|..+...-
T Consensus 17 T~lai~LAk~~~~eIIs~DSmQvYr~mdIGTAKps~~e~~~vpHhliDi~~p~e~ysa~~f~~~a~~~i~~i~~rg---- 92 (308)
T COG0324 17 TALAIALAKRLGGEIISLDSMQVYRGLDIGTAKPSLEELAGVPHHLIDIRDPTESYSAAEFQRDALAAIDDILARG---- 92 (308)
T ss_pred HHHHHHHHHHcCCcEEecchhhhcCCCcccCCCCCHHHHcCCCEEEecccCccccccHHHHHHHHHHHHHHHHhCC----
Confidence 456789999999999999964 44333 34211 00 0 00124577777777777777766542
Q ss_pred CCcEEEEccchh
Q psy3685 848 HTKWIAFGGSYP 859 (1042)
Q Consensus 848 ~~~~i~~G~Syg 859 (1042)
..-|++|||.-
T Consensus 93 -k~pIlVGGTgl 103 (308)
T COG0324 93 -KLPILVGGTGL 103 (308)
T ss_pred -CCcEEEccHHH
Confidence 13488898853
No 328
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=43.04 E-value=18 Score=24.56 Aligned_cols=11 Identities=36% Similarity=0.905 Sum_probs=5.7
Q ss_pred cccCCCccccc
Q psy3685 9 YKCDLCSKEFS 19 (1042)
Q Consensus 9 ~~C~~C~~~f~ 19 (1042)
|.|..|+..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 45556655554
No 329
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=42.53 E-value=18 Score=37.22 Aligned_cols=26 Identities=23% Similarity=0.370 Sum_probs=13.7
Q ss_pred cCcccCCcccccccCccccccccccc
Q psy3685 314 GIRYKCDLCLKEFSRKDNLTEHKEIH 339 (1042)
Q Consensus 314 ~~~~~C~~C~~~f~~~~~l~~H~~~h 339 (1042)
+.++.|+.|+........|..-.++|
T Consensus 207 ~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 207 GKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred CCCCCCCCCCCcccccccceeeeecc
Confidence 34556666666555555554444443
No 330
>TIGR00174 miaA tRNA isopentenyltransferase (miaA). Catalyzes the first step in the modification of an adenosine near the anticodon to 2-methylthio-N6-isopentyladenosine.
Probab=42.13 E-value=78 Score=34.32 Aligned_cols=81 Identities=15% Similarity=0.180 Sum_probs=46.4
Q ss_pred EEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeee--ecccccCCCCCCccc--------c------cCCCHHHHHh
Q psy3685 767 IFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR--FYGKSHPLSDLSVES--------L------QYLSAEQALA 830 (1042)
Q Consensus 767 i~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR--~~G~S~p~~~~~~~~--------~------~~lt~~q~l~ 830 (1042)
||+++|..|+. -+.+...||+++++.+|.+|-+ |-|.+.-+..-+.+. + ...++.+...
T Consensus 1 vi~i~G~t~~G-----Ks~la~~l~~~~~~~iis~Ds~qvY~~l~IgTakp~~~e~~~v~hhlid~~~~~~~~~v~~f~~ 75 (287)
T TIGR00174 1 VIFIMGPTAVG-----KSQLAIQLAKKLNAEIISVDSMQIYKGMDIGTAKPSLQEREGIPHHLIDILDPSESYSAADFQT 75 (287)
T ss_pred CEEEECCCCCC-----HHHHHHHHHHhCCCcEEEechhheeeeccccCCCCCHHHHcCccEEEEEEechhheEcHHHHHH
Confidence 45666766543 2345678999999999999874 445443222111110 1 2335555556
Q ss_pred HHHHHHHHHhhhcCCCCCCcEEEEccc
Q psy3685 831 DLAYFITSMNTLYSLPAHTKWIAFGGS 857 (1042)
Q Consensus 831 D~~~~~~~~~~~~~~~~~~~~i~~G~S 857 (1042)
|....|..+... + ...|++|||
T Consensus 76 ~a~~~i~~~~~~----g-~~pi~vGGT 97 (287)
T TIGR00174 76 LALNAIADITAR----G-KIPLLVGGT 97 (287)
T ss_pred HHHHHHHHHHhC----C-CCEEEEcCc
Confidence 666666555432 2 245888987
No 331
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=42.06 E-value=18 Score=23.27 Aligned_cols=20 Identities=15% Similarity=0.614 Sum_probs=16.4
Q ss_pred ccCCCCCCccCCcchHHHHHh
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKE 703 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~ 703 (1042)
..|++|++.+ ....+++|+.
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 3699999999 5678888875
No 332
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=41.99 E-value=17 Score=32.75 Aligned_cols=9 Identities=33% Similarity=1.047 Sum_probs=3.7
Q ss_pred ccCccCccc
Q psy3685 64 KCNLCSKEF 72 (1042)
Q Consensus 64 ~C~~C~~~f 72 (1042)
.|+.||+.|
T Consensus 11 ~Cp~CG~kF 19 (108)
T PF09538_consen 11 TCPSCGAKF 19 (108)
T ss_pred cCCCCcchh
Confidence 344444444
No 333
>KOG2541|consensus
Probab=41.99 E-value=1.5e+02 Score=31.38 Aligned_cols=112 Identities=16% Similarity=0.129 Sum_probs=63.6
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcC
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYS 844 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~ 844 (1042)
-| |++.+|-|..........+...+.+--|..++.+|- |=| ..+. -++...+-++-+-+-+..+..
T Consensus 24 ~P-~ii~HGigd~c~~~~~~~~~q~l~~~~g~~v~~lei-g~g----~~~s-----~l~pl~~Qv~~~ce~v~~m~~--- 89 (296)
T KOG2541|consen 24 VP-VIVWHGIGDSCSSLSMANLTQLLEELPGSPVYCLEI-GDG----IKDS-----SLMPLWEQVDVACEKVKQMPE--- 89 (296)
T ss_pred CC-EEEEeccCcccccchHHHHHHHHHhCCCCeeEEEEe-cCC----cchh-----hhccHHHHHHHHHHHHhcchh---
Confidence 45 555677766544333334444556656888888883 222 0010 111222222222222222221
Q ss_pred CCCCCcEEEEccchhHHHHHHHHHhCCC-eEEEEEeeccccccccCChh
Q psy3685 845 LPAHTKWIAFGGSYPGALAAWLRYKYPH-LVHGAMSASGPLRAVVDFPE 892 (1042)
Q Consensus 845 ~~~~~~~i~~G~Sygg~la~~~~~~yP~-~~~g~i~ssa~~~~~~~~~~ 892 (1042)
++ .-+.++|-|=||.+|..+...=|+ -|.-.|+-++|-..+...+.
T Consensus 90 ls--qGynivg~SQGglv~Raliq~cd~ppV~n~ISL~gPhaG~~~~p~ 136 (296)
T KOG2541|consen 90 LS--QGYNIVGYSQGGLVARALIQFCDNPPVKNFISLGGPHAGIYGIPR 136 (296)
T ss_pred cc--CceEEEEEccccHHHHHHHHhCCCCCcceeEeccCCcCCccCCCC
Confidence 21 248999999999999988877664 47788988888877655444
No 334
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=40.70 E-value=29 Score=36.54 Aligned_cols=48 Identities=25% Similarity=0.500 Sum_probs=29.0
Q ss_pred cCCcCCccccCcccccccccccCCccccCCCCcccCCchhHHHHHHHh
Q psy3685 480 KCDLCLKEFSRKDNLTDRKIHQGIRYKCDLCSKEFSSKGNLTEHMKQH 527 (1042)
Q Consensus 480 ~C~~C~~~f~~~~~l~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H 527 (1042)
.|-.|.-.|+....-.........+|+|+.|...|-..-+.-.|...|
T Consensus 364 ~Cf~CQ~~fp~~~~~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 364 HCFVCQGPFPKPPVSPFDESTSSGRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred cceeccCCCCCCCCCcccccccccceechhhhhhhhhhhHHHHHHHHh
Confidence 366666666654443333333345677777777777766666665554
No 335
>PHA00626 hypothetical protein
Probab=40.25 E-value=16 Score=28.03 Aligned_cols=12 Identities=33% Similarity=0.553 Sum_probs=5.7
Q ss_pred ccccCccCcccc
Q psy3685 62 RYKCNLCSKEFS 73 (1042)
Q Consensus 62 ~~~C~~C~~~f~ 73 (1042)
+|.|+.||..|+
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 444555544443
No 336
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=40.23 E-value=16 Score=26.88 Aligned_cols=26 Identities=31% Similarity=0.558 Sum_probs=16.0
Q ss_pred CccccCCCCCCccCCc----chHHHHH-hhc
Q psy3685 680 DIRYKCDLCSKDFSRK----DNLTEHK-EIH 705 (1042)
Q Consensus 680 ~~~~~C~~C~k~F~~~----~~L~~H~-~~H 705 (1042)
+...+|..|++.+... ++|.+|+ +.|
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 4456899999988875 6799998 444
No 337
>PRK14556 pyrH uridylate kinase; Provisional
Probab=39.62 E-value=95 Score=32.81 Aligned_cols=95 Identities=17% Similarity=0.103 Sum_probs=51.9
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEE--eeeecccccCCCCCCcccccCCCHHHHH------hHHHHHH
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQV--EHRFYGKSHPLSDLSVESLQYLSAEQAL------ADLAYFI 836 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~--ehR~~G~S~p~~~~~~~~~~~lt~~q~l------~D~~~~~ 836 (1042)
|-|+++.||.| .+.+.+......+|..++|..+.+ +=-|.=.+.|-.+-....+..++.++++ .|...+.
T Consensus 136 g~vvi~~gg~G--~p~~StD~lAallA~~l~Ad~Lii~TdVDGVYd~DP~~~p~A~~i~~I~~~e~~~~~l~vmd~~A~~ 213 (249)
T PRK14556 136 GRVLIFAGGTG--NPFVTTDTTASLRAVEIGADALLKATTVNGVYDKDPNKYSDAKRFDKVTFSEVVSKELNVMDLGAFT 213 (249)
T ss_pred CCEEEEECCCC--CCcCCcHHHHHHHHHHcCCCEEEEEeCCCccCCCCCCCCCCceEeeEEchhhhcccchHhHHHHHHH
Confidence 45888899876 344445556667899999865553 2222212234222122233334443333 2332222
Q ss_pred HHHhhhcCCCCCCcEEEEccchhHHHHHHHH
Q psy3685 837 TSMNTLYSLPAHTKWIAFGGSYPGALAAWLR 867 (1042)
Q Consensus 837 ~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~ 867 (1042)
... + .+.|+++|.+.-.+.|..++.
T Consensus 214 ~a~--~----~gIpi~I~ng~~~~~L~~~l~ 238 (249)
T PRK14556 214 QCR--D----FGIPIYVFDLTQPNALVDAVL 238 (249)
T ss_pred HHH--H----CCCcEEEECCCCchHHHHHHc
Confidence 222 1 235899999988898888775
No 338
>KOG3043|consensus
Probab=39.59 E-value=13 Score=37.97 Aligned_cols=88 Identities=17% Similarity=0.127 Sum_probs=60.4
Q ss_pred HHHHcCCeEEEEeeeecccccCCC-C-CCcc---cccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHH
Q psy3685 790 YAHNHNALAVQVEHRFYGKSHPLS-D-LSVE---SLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAA 864 (1042)
Q Consensus 790 la~~~~a~~~~~ehR~~G~S~p~~-~-~~~~---~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~ 864 (1042)
+|.. |..|+..|. |.| .|.. + ...+ =++-++.+-+..|+..|+++++... ...++=++|.-|||.++.
T Consensus 63 ~A~~-Gy~v~vPD~-~~G--dp~~~~~~~~~~~~w~~~~~~~~~~~~i~~v~k~lk~~g---~~kkIGv~GfCwGak~vv 135 (242)
T KOG3043|consen 63 VALN-GYTVLVPDF-FRG--DPWSPSLQKSERPEWMKGHSPPKIWKDITAVVKWLKNHG---DSKKIGVVGFCWGAKVVV 135 (242)
T ss_pred HhcC-CcEEEcchh-hcC--CCCCCCCChhhhHHHHhcCCcccchhHHHHHHHHHHHcC---CcceeeEEEEeecceEEE
Confidence 4444 777777773 335 2211 1 1111 2677899999999999999999654 235889999999999999
Q ss_pred HHHHhCCCeEEEEEeecccc
Q psy3685 865 WLRYKYPHLVHGAMSASGPL 884 (1042)
Q Consensus 865 ~~~~~yP~~~~g~i~ssa~~ 884 (1042)
.+-.++|+..+|+.+-..++
T Consensus 136 ~~~~~~~~f~a~v~~hps~~ 155 (242)
T KOG3043|consen 136 TLSAKDPEFDAGVSFHPSFV 155 (242)
T ss_pred EeeccchhheeeeEecCCcC
Confidence 99999995544544444444
No 339
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=39.21 E-value=16 Score=35.30 Aligned_cols=13 Identities=31% Similarity=0.848 Sum_probs=7.3
Q ss_pred cccCcCCcccCCc
Q psy3685 559 YKCDLCSKEFSRK 571 (1042)
Q Consensus 559 ~~C~~C~k~F~~~ 571 (1042)
|.|+.|+..|...
T Consensus 100 Y~Cp~C~~~y~~~ 112 (147)
T smart00531 100 YKCPNCQSKYTFL 112 (147)
T ss_pred EECcCCCCEeeHH
Confidence 5566666555543
No 340
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=39.18 E-value=23 Score=32.00 Aligned_cols=9 Identities=44% Similarity=1.095 Sum_probs=3.6
Q ss_pred ccCCCCccc
Q psy3685 506 KCDLCSKEF 514 (1042)
Q Consensus 506 ~C~~C~~~f 514 (1042)
.|+.||..|
T Consensus 28 vCP~CG~~~ 36 (108)
T PF09538_consen 28 VCPKCGTEF 36 (108)
T ss_pred cCCCCCCcc
Confidence 344444333
No 341
>COG3243 PhaC Poly(3-hydroxyalkanoate) synthetase [Lipid metabolism]
Probab=38.99 E-value=92 Score=35.23 Aligned_cols=86 Identities=13% Similarity=0.079 Sum_probs=58.7
Q ss_pred HHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHH-hHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHH
Q psy3685 787 WLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQAL-ADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAW 865 (1042)
Q Consensus 787 ~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l-~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~ 865 (1042)
+..++-+.|-.|+.++-|+=..+ ++-.+.++-+ +++.+-|+.++..- +...+=++|.+-||++++-
T Consensus 131 ~V~~l~~~g~~vfvIsw~nPd~~----------~~~~~~edYi~e~l~~aid~v~~it---g~~~InliGyCvGGtl~~~ 197 (445)
T COG3243 131 LVRWLLEQGLDVFVISWRNPDAS----------LAAKNLEDYILEGLSEAIDTVKDIT---GQKDINLIGYCVGGTLLAA 197 (445)
T ss_pred HHHHHHHcCCceEEEeccCchHh----------hhhccHHHHHHHHHHHHHHHHHHHh---CccccceeeEecchHHHHH
Confidence 33444455889999987763322 2245566666 77777777766654 2247999999999999999
Q ss_pred HHHhCCCe-EEEEEeeccccc
Q psy3685 866 LRYKYPHL-VHGAMSASGPLR 885 (1042)
Q Consensus 866 ~~~~yP~~-~~g~i~ssa~~~ 885 (1042)
+...+|.. |..+..-.+|+.
T Consensus 198 ala~~~~k~I~S~T~lts~~D 218 (445)
T COG3243 198 ALALMAAKRIKSLTLLTSPVD 218 (445)
T ss_pred HHHhhhhcccccceeeecchh
Confidence 99999988 655555555553
No 342
>PRK04023 DNA polymerase II large subunit; Validated
Probab=38.85 E-value=27 Score=43.49 Aligned_cols=49 Identities=29% Similarity=0.737 Sum_probs=28.4
Q ss_pred ccccCccCcccCCchhHHHHHHHhcCcccccCcCCcccCCchhhhhHhhhhcCCcccCCccccccCCchhHHHHhhhcCC
Q psy3685 531 RYKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKEFSRKDNLTEHKEIHQGIRYKCDLCSKKFSRKLNLTKHMKIHQG 610 (1042)
Q Consensus 531 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~C~~C~k~F~~~~~L~~H~~~H~~ 610 (1042)
...|+.||... ..+.|+.||.. +...+.|+.|+....
T Consensus 626 ~RfCpsCG~~t---------------~~frCP~CG~~--------------Te~i~fCP~CG~~~~-------------- 662 (1121)
T PRK04023 626 RRKCPSCGKET---------------FYRRCPFCGTH--------------TEPVYRCPRCGIEVE-------------- 662 (1121)
T ss_pred CccCCCCCCcC---------------CcccCCCCCCC--------------CCcceeCccccCcCC--------------
Confidence 45677777652 23567777765 333466777755432
Q ss_pred CcccCCCCCcccc
Q psy3685 611 IRYKCDLCSKEFS 623 (1042)
Q Consensus 611 ~~~~C~~C~k~F~ 623 (1042)
++.|+.||..-.
T Consensus 663 -~y~CPKCG~El~ 674 (1121)
T PRK04023 663 -EDECEKCGREPT 674 (1121)
T ss_pred -CCcCCCCCCCCC
Confidence 145777776543
No 343
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=38.67 E-value=12 Score=36.07 Aligned_cols=8 Identities=50% Similarity=1.240 Sum_probs=2.9
Q ss_pred cCccCccc
Q psy3685 65 CNLCSKEF 72 (1042)
Q Consensus 65 C~~C~~~f 72 (1042)
|+.||++|
T Consensus 31 c~~c~~~f 38 (154)
T PRK00464 31 CLACGKRF 38 (154)
T ss_pred ccccCCcc
Confidence 33333333
No 344
>PRK02399 hypothetical protein; Provisional
Probab=38.64 E-value=1.5e+02 Score=33.58 Aligned_cols=33 Identities=30% Similarity=0.304 Sum_probs=27.3
Q ss_pred cEEEEccchhHHHHHHHH----HhCCCeEEEEEeecc
Q psy3685 850 KWIAFGGSYPGALAAWLR----YKYPHLVHGAMSASG 882 (1042)
Q Consensus 850 ~~i~~G~Sygg~la~~~~----~~yP~~~~g~i~ssa 882 (1042)
-+|.+|||.|..|++-.- ...|.++...+||+.
T Consensus 98 gviglGGs~GT~lat~aMr~LPiG~PKlmVSTmAsg~ 134 (406)
T PRK02399 98 GVIGLGGSGGTALATPAMRALPIGVPKLMVSTMASGD 134 (406)
T ss_pred EEEEecCcchHHHHHHHHHhCCCCCCeEEEEccccCC
Confidence 589999999999987644 467899999998875
No 345
>PF03283 PAE: Pectinacetylesterase
Probab=38.57 E-value=68 Score=36.14 Aligned_cols=52 Identities=21% Similarity=0.255 Sum_probs=30.7
Q ss_pred HHHHHhhhcCCCCCCcEEEEccchhHHH----HHHHHHhCCC-eEEEEEeeccccccc
Q psy3685 835 FITSMNTLYSLPAHTKWIAFGGSYPGAL----AAWLRYKYPH-LVHGAMSASGPLRAV 887 (1042)
Q Consensus 835 ~~~~~~~~~~~~~~~~~i~~G~Sygg~l----a~~~~~~yP~-~~~g~i~ssa~~~~~ 887 (1042)
+++.+... ++++...+||.|.|-||.= +-++|..+|. ...-+++-|+.....
T Consensus 143 vl~~l~~~-gl~~a~~vlltG~SAGG~g~~~~~d~~~~~lp~~~~v~~~~DsG~f~d~ 199 (361)
T PF03283_consen 143 VLDDLLSN-GLPNAKQVLLTGCSAGGLGAILHADYVRDRLPSSVKVKCLSDSGFFLDN 199 (361)
T ss_pred HHHHHHHh-cCcccceEEEeccChHHHHHHHHHHHHHHHhccCceEEEeccccccccc
Confidence 34444444 2333347899999988843 3457889994 344455566555433
No 346
>PF14353 CpXC: CpXC protein
Probab=37.34 E-value=11 Score=35.44 Aligned_cols=21 Identities=29% Similarity=0.735 Sum_probs=10.6
Q ss_pred ccccCCCccccccccchhhhh
Q psy3685 35 RYKCNLCSKEFSRKYNLTDHK 55 (1042)
Q Consensus 35 ~~~C~~C~k~f~~~~~L~~H~ 55 (1042)
.|.||.||..|.-...+.-|-
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D 58 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHD 58 (128)
T ss_pred EEECCCCCCceecCCCEEEEc
Confidence 355555555555444444443
No 347
>COG5441 Uncharacterized conserved protein [Function unknown]
Probab=36.93 E-value=1.7e+02 Score=31.31 Aligned_cols=134 Identities=19% Similarity=0.185 Sum_probs=71.4
Q ss_pred cHHHHHHHcCCeEEEEeeeecccccCCCCCCcccc------------cCCCHHHHHhHHH-HHHHHHhhhcCCCCCCcEE
Q psy3685 786 TWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESL------------QYLSAEQALADLA-YFITSMNTLYSLPAHTKWI 852 (1042)
Q Consensus 786 ~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~------------~~lt~~q~l~D~~-~~~~~~~~~~~~~~~~~~i 852 (1042)
++.++-+..|+.++.+|----+.-.-..|.|.+.. .--.--+|++-.+ .|.+.+.... +-.-+|
T Consensus 20 ylad~I~~aG~~~v~vDvs~~~~~~~~~dis~~~VA~~hp~~~qAv~~~~Drg~AiaaMa~A~~r~l~sR~---dV~gmi 96 (401)
T COG5441 20 YLADLIEAAGGSPVLVDVSTLRNPTSEVDISAEDVAGAHPGGRQAVLDGNDRGSAIAAMAEAFVRFLSSRG---DVAGMI 96 (401)
T ss_pred HHHHHHHHcCCCeEEEEeeccCCCCCCcccCHHHHhhhCCCcceeEeccCchhHHHHHHHHHHHHHhhccc---chhhee
Confidence 46677777888999888544321111123333221 1112223444333 3445444432 223589
Q ss_pred EEccchhHHHHHHHHH----hCCCeEEEEEeeccccccccCChhhH--HHHHHHhhccChhhHHHHHHHHHHHHHHhc
Q psy3685 853 AFGGSYPGALAAWLRY----KYPHLVHGAMSASGPLRAVVDFPEYF--GVVADALATVSTECVKAVQTATHTISKMLK 924 (1042)
Q Consensus 853 ~~G~Sygg~la~~~~~----~yP~~~~g~i~ssa~~~~~~~~~~~~--~~v~~~~~~~~~~c~~~i~~~~~~~~~~~~ 924 (1042)
-+|||-|.+|++-... .-|.++...+||+-..- -++..+.. -.|++ +...++-|...+.+|...|..+..
T Consensus 97 g~GGsgGT~lit~~m~~LPlgvPK~mVST~Asgdv~p-yv~~sDi~Mm~svtd-~aGLNrisR~VLsnAa~aiag~a~ 172 (401)
T COG5441 97 GMGGSGGTALITPAMRRLPLGVPKVMVSTLASGDVAP-YVGVSDIIMMYSVTD-MAGLNRISRVVLSNAAHAIAGMAK 172 (401)
T ss_pred ecCCCcchHhhhhHHHhcCcCCcceeeeeeeccCcCC-CCChhHeeeeeeccc-cccccHHHHHHHHHHHHHHHHHHh
Confidence 9999999998866554 45677888888874332 11111100 01111 223456677777777777766654
No 348
>KOG1202|consensus
Probab=36.92 E-value=6.6e+02 Score=32.73 Aligned_cols=37 Identities=24% Similarity=0.352 Sum_probs=26.2
Q ss_pred HhHHH-HHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHH
Q psy3685 829 LADLA-YFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRY 868 (1042)
Q Consensus 829 l~D~~-~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~ 868 (1042)
|+|+| .+|+.+++.- +..|+-+.|-|||.-||--|+.
T Consensus 2164 ies~A~~yirqirkvQ---P~GPYrl~GYSyG~~l~f~ma~ 2201 (2376)
T KOG1202|consen 2164 IESLAAYYIRQIRKVQ---PEGPYRLAGYSYGACLAFEMAS 2201 (2376)
T ss_pred HHHHHHHHHHHHHhcC---CCCCeeeeccchhHHHHHHHHH
Confidence 34444 5577776543 4469999999999999876663
No 349
>cd01412 SIRT5_Af1_CobB SIRT5_Af1_CobB: Eukaryotic, archaeal and prokaryotic group (class3) which includes human sirtuin SIRT5, Archaeoglobus fulgidus Sir2-Af1, and E. coli CobB; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation. Sir2 proteins have been shown to regulate gene silencing, DNA repair, metabolic enzymes, and life span. CobB is a bacterial sirtuin that deacetylates acetyl-CoA synthetase at an active site lysine to stimulate its enzymatic activity.
Probab=36.73 E-value=92 Score=32.46 Aligned_cols=99 Identities=13% Similarity=0.150 Sum_probs=54.0
Q ss_pred ccccCCccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhhcccCCCCCcccceEEE
Q psy3685 675 KILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQKLDHFNIEEPRTWKQRY 754 (1042)
Q Consensus 675 ~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~~dhf~~~~~~tf~qry 754 (1042)
.+|..-....|..|++.+..... ...+....|+.|+-.-.. ...||.+.++. .|.+
T Consensus 102 e~HG~~~~~~C~~C~~~~~~~~~-------~~~~~~p~C~~Cgg~lrp--------~Vv~fge~~p~-------~~~~-- 157 (224)
T cd01412 102 ELHGSLFRVRCSSCGYVGENNEE-------IPEEELPRCPKCGGLLRP--------GVVWFGESLPL-------ALLE-- 157 (224)
T ss_pred eeCCCcCccccCCCCCCCCcchh-------hhccCCCCCCCCCCccCC--------ceEECCCCCHH-------HHHH--
Confidence 34555556789999998865521 123456789999742222 22355554422 1111
Q ss_pred EEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeee
Q psy3685 755 FVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR 804 (1042)
Q Consensus 755 ~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR 804 (1042)
..+.+. ...+||.+|-.+.+.+.+. +...|..-|+.+|.+...
T Consensus 158 --~~~~~~-~~dl~lvlGTsl~v~p~~~----l~~~~~~~~~~~i~iN~~ 200 (224)
T cd01412 158 --AVEALA-KADLFLVIGTSGVVYPAAG----LPEEAKERGARVIEINPE 200 (224)
T ss_pred --HHHHHH-cCCEEEEECcCccchhHHH----HHHHHHHCCCeEEEECCC
Confidence 001122 3568888886555544222 334566668888887654
No 350
>PRK08118 topology modulation protein; Reviewed
Probab=36.59 E-value=1.4e+02 Score=29.33 Aligned_cols=36 Identities=14% Similarity=0.229 Sum_probs=25.8
Q ss_pred EEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecc
Q psy3685 767 IFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYG 807 (1042)
Q Consensus 767 i~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G 807 (1042)
-|+++|..|+. -+.+...|++.++..++.+|.-+++
T Consensus 3 rI~I~G~~GsG-----KSTlak~L~~~l~~~~~~lD~l~~~ 38 (167)
T PRK08118 3 KIILIGSGGSG-----KSTLARQLGEKLNIPVHHLDALFWK 38 (167)
T ss_pred EEEEECCCCCC-----HHHHHHHHHHHhCCCceecchhhcc
Confidence 36777776654 2345678999999999999876654
No 351
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=36.17 E-value=16 Score=27.46 Aligned_cols=29 Identities=24% Similarity=0.662 Sum_probs=16.6
Q ss_pred CccccCCCccccccccchhhhhhhccCcccccCCCccc
Q psy3685 7 IGYKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSKE 44 (1042)
Q Consensus 7 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k~ 44 (1042)
..|+|-.||+.|.. .-......|+.||..
T Consensus 5 ~~Y~C~~Cg~~~~~---------~~~~~~irCp~Cg~r 33 (49)
T COG1996 5 MEYKCARCGREVEL---------DQETRGIRCPYCGSR 33 (49)
T ss_pred EEEEhhhcCCeeeh---------hhccCceeCCCCCcE
Confidence 45777777777721 112234667777743
No 352
>PF05705 DUF829: Eukaryotic protein of unknown function (DUF829); InterPro: IPR008547 This signature identifies Transmembrane protein 53, that have no known function but are predicted to be integral membrane proteins.
Probab=35.31 E-value=1.6e+02 Score=30.92 Aligned_cols=105 Identities=13% Similarity=0.101 Sum_probs=53.4
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCC
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSL 845 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~ 845 (1042)
|+||++|=-|+....... +.++-.+-|+.++.+-.+.-.-..|. -.+..+++.+...+.. ...
T Consensus 1 plvvl~gW~gA~~~hl~K---Y~~~Y~~~g~~il~~~~~~~~~~~~~----------~~~~~~~~~l~~~l~~----~~~ 63 (240)
T PF05705_consen 1 PLVVLLGWMGAKPKHLAK---YSDLYQDPGFDILLVTSPPADFFWPS----------KRLAPAADKLLELLSD----SQS 63 (240)
T ss_pred CEEEEEeCCCCCHHHHHH---HHHHHHhcCCeEEEEeCCHHHHeeec----------cchHHHHHHHHHHhhh----hcc
Confidence 677777644432221110 22222225888888765443333221 1223333333333322 221
Q ss_pred CCCCcEEEEccchhHHHHHHHHH-----h--CCCe---EEEEEeeccccccc
Q psy3685 846 PAHTKWIAFGGSYPGALAAWLRY-----K--YPHL---VHGAMSASGPLRAV 887 (1042)
Q Consensus 846 ~~~~~~i~~G~Sygg~la~~~~~-----~--yP~~---~~g~i~ssa~~~~~ 887 (1042)
.+..++++-..|-||......-. . +... +.|.|-.|+|....
T Consensus 64 ~~~~~il~H~FSnGG~~~~~~l~~~~~~~~~~~~~~~~i~g~I~DS~P~~~~ 115 (240)
T PF05705_consen 64 ASPPPILFHSFSNGGSFLYSQLLEAYQSRKKFGKLLPRIKGIIFDSCPGIPT 115 (240)
T ss_pred CCCCCEEEEEEECchHHHHHHHHHHHHhcccccccccccceeEEeCCCCccc
Confidence 12238999999998866655433 1 1222 78999999986544
No 353
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=34.85 E-value=15 Score=34.43 Aligned_cols=29 Identities=28% Similarity=0.580 Sum_probs=18.6
Q ss_pred ccCCCCCCccCCcchHHHHHhhccCCCCcccC
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEIHQGIRVSDIN 714 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~ 714 (1042)
-.|-+|||.|. .|++|++.|.|..|=++-
T Consensus 73 i~clecGk~~k---~LkrHL~~~~gltp~eYR 101 (132)
T PF05443_consen 73 IICLECGKKFK---TLKRHLRTHHGLTPEEYR 101 (132)
T ss_dssp EE-TBT--EES---BHHHHHHHTT-S-HHHHH
T ss_pred eEEccCCcccc---hHHHHHHHccCCCHHHHH
Confidence 57999999996 469999999998764443
No 354
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=34.58 E-value=8.2 Score=45.85 Aligned_cols=79 Identities=18% Similarity=0.318 Sum_probs=43.8
Q ss_pred cccCccCccccCchhhhhHhhhccCCcc-cCCCCCccccCcchhhhhhhhcc-CCccccCccCccccCchhHHHHHhHhc
Q psy3685 63 YKCNLCSKEFSRNCYLTEHMEQHQGIRY-KCDLCSKDFSYKSSLIEHKKLHQ-GIRYKCNLCSKEFSRNCYLTEHMEQHQ 140 (1042)
Q Consensus 63 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~-~C~~C~~~f~~~~~L~~H~~~h~-~~~~~C~~C~~~F~~~~~L~~H~~~h~ 140 (1042)
-.|+.|-+...+..+-..+ -|| .|..||-.|+-..+|---...-+ ..--.|+.|.+.+..+..-+-|
T Consensus 102 a~C~~Cl~Ei~dp~~rrY~------YPF~~CT~CGPRfTIi~alPYDR~nTsM~~F~lC~~C~~EY~dP~nRRfH----- 170 (750)
T COG0068 102 ATCEDCLEEIFDPNSRRYL------YPFINCTNCGPRFTIIEALPYDRENTSMADFPLCPFCDKEYKDPLNRRFH----- 170 (750)
T ss_pred hhhHHHHHHhcCCCCccee------ccccccCCCCcceeeeccCCCCcccCccccCcCCHHHHHHhcCccccccc-----
Confidence 3577775554433322111 122 47888877766554422111111 1123589998888877664433
Q ss_pred CCccccCCCCCC
Q psy3685 141 GIRYKCDLCSKD 152 (1042)
Q Consensus 141 ~~~~~C~~C~~~ 152 (1042)
..|..|+.||-.
T Consensus 171 AQp~aCp~CGP~ 182 (750)
T COG0068 171 AQPIACPKCGPH 182 (750)
T ss_pred cccccCcccCCC
Confidence 468889999864
No 355
>COG1576 Uncharacterized conserved protein [Function unknown]
Probab=33.00 E-value=58 Score=31.31 Aligned_cols=43 Identities=19% Similarity=0.202 Sum_probs=26.6
Q ss_pred CCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhH
Q psy3685 795 NALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPG 860 (1042)
Q Consensus 795 ~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg 860 (1042)
|+.||++|-||=- +++ +.+|.++..++.. +..=.+++|||+|=
T Consensus 67 ~~~vi~Ld~~Gk~---------------~sS----e~fA~~l~~~~~~----G~~i~f~IGG~~Gl 109 (155)
T COG1576 67 GSYVVLLDIRGKA---------------LSS----EEFADFLERLRDD----GRDISFLIGGADGL 109 (155)
T ss_pred CCeEEEEecCCCc---------------CCh----HHHHHHHHHHHhc----CCeEEEEEeCcccC
Confidence 6799999988621 223 3344555544433 12247999999994
No 356
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=32.26 E-value=32 Score=26.60 Aligned_cols=32 Identities=22% Similarity=0.495 Sum_probs=17.4
Q ss_pred ccccCCCccccccccchhhhhhhccCcccccCCCccccc
Q psy3685 8 GYKCDLCSKEFSRKYNLTDHKKIHQGIRYKCNLCSKEFS 46 (1042)
Q Consensus 8 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~k~f~ 46 (1042)
.|+|+.||..+.-.... .+....|+.||..+.
T Consensus 2 ~~~CP~CG~~iev~~~~-------~GeiV~Cp~CGaele 33 (54)
T TIGR01206 2 QFECPDCGAEIELENPE-------LGELVICDECGAELE 33 (54)
T ss_pred ccCCCCCCCEEecCCCc-------cCCEEeCCCCCCEEE
Confidence 35677777766432211 134556777776553
No 357
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=32.11 E-value=24 Score=34.42 Aligned_cols=8 Identities=25% Similarity=1.032 Sum_probs=3.3
Q ss_pred cccCccCc
Q psy3685 532 YKCDLCSK 539 (1042)
Q Consensus 532 ~~C~~C~k 539 (1042)
|.|++||.
T Consensus 135 ~vC~vCGy 142 (166)
T COG1592 135 WVCPVCGY 142 (166)
T ss_pred EEcCCCCC
Confidence 44444443
No 358
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=31.84 E-value=26 Score=34.15 Aligned_cols=22 Identities=23% Similarity=0.711 Sum_probs=15.5
Q ss_pred ccccCcCCcccCCchhhhhHhhhhcCC-cccCCccc
Q psy3685 558 RYKCDLCSKEFSRKDNLTEHKEIHQGI-RYKCDLCS 592 (1042)
Q Consensus 558 ~~~C~~C~k~F~~~~~L~~H~~~H~~~-~~~C~~C~ 592 (1042)
-|.|++||..+. |+ |-+||+||
T Consensus 134 ~~vC~vCGy~~~-------------ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYTHE-------------GEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCccc-------------CCCCCcCCCCC
Confidence 478888877543 33 67788887
No 359
>PRK14729 miaA tRNA delta(2)-isopentenylpyrophosphate transferase; Provisional
Probab=31.41 E-value=1.5e+02 Score=32.49 Aligned_cols=82 Identities=13% Similarity=0.133 Sum_probs=48.2
Q ss_pred CcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeee--ecccccCCCCCCccc--------c------cCCCHHHH
Q psy3685 765 GPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR--FYGKSHPLSDLSVES--------L------QYLSAEQA 828 (1042)
Q Consensus 765 ~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR--~~G~S~p~~~~~~~~--------~------~~lt~~q~ 828 (1042)
+|||+++|--|+. -+.+..+||++ ++-||..|-+ |-|...-+..-+.+. + ...|+.+-
T Consensus 4 ~~ii~I~GpTasG-----KS~LAl~LA~~-~~eIIsaDS~QvYr~ldIgTaKpt~eE~~~i~Hhlid~~~p~e~~sv~~f 77 (300)
T PRK14729 4 NKIVFIFGPTAVG-----KSNILFHFPKG-KAEIINVDSIQVYKEFDIASCKPSKELRKHIKHHLVDFLEPIKEYNLGIF 77 (300)
T ss_pred CcEEEEECCCccC-----HHHHHHHHHHh-CCcEEeccHHHHHCCCceecCCCCHHHHcCCCeeeeeccCCCCceeHHHH
Confidence 4677777655443 34567899999 6799999965 334332222212111 1 23466666
Q ss_pred HhHHHHHHHHHhhhcCCCCCCcEEEEccc
Q psy3685 829 LADLAYFITSMNTLYSLPAHTKWIAFGGS 857 (1042)
Q Consensus 829 l~D~~~~~~~~~~~~~~~~~~~~i~~G~S 857 (1042)
..|....|+.+... + ..-|++|||
T Consensus 78 ~~~a~~~i~~i~~~----g-k~PilvGGT 101 (300)
T PRK14729 78 YKEALKIIKELRQQ----K-KIPIFVGGS 101 (300)
T ss_pred HHHHHHHHHHHHHC----C-CCEEEEeCc
Confidence 77777777765443 2 234888887
No 360
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=31.35 E-value=22 Score=34.76 Aligned_cols=16 Identities=13% Similarity=0.418 Sum_probs=7.8
Q ss_pred cccCccCcccCCchhH
Q psy3685 532 YKCDLCSKEFSRKDNL 547 (1042)
Q Consensus 532 ~~C~~C~k~f~~~~~L 547 (1042)
|.|+.|+..|+...++
T Consensus 110 Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 110 FICPNMCVRFTFNEAM 125 (158)
T ss_pred EECCCCCcEeeHHHHH
Confidence 4455555555444444
No 361
>PF14353 CpXC: CpXC protein
Probab=30.90 E-value=51 Score=30.85 Aligned_cols=21 Identities=24% Similarity=0.507 Sum_probs=11.3
Q ss_pred cccCCCCCccccccccccccc
Q psy3685 289 RYKCDLCSKEFSRKGNLTKHM 309 (1042)
Q Consensus 289 ~~~C~~C~~~f~~~~~L~~H~ 309 (1042)
.+.|+.||..|.-...+..|-
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D 58 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHD 58 (128)
T ss_pred EEECCCCCCceecCCCEEEEc
Confidence 455666666555555554443
No 362
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=30.58 E-value=25 Score=23.49 Aligned_cols=11 Identities=27% Similarity=0.616 Sum_probs=5.2
Q ss_pred CcccccCCCcc
Q psy3685 33 GIRYKCNLCSK 43 (1042)
Q Consensus 33 ~~~~~C~~C~k 43 (1042)
+..|.|+.|+.
T Consensus 17 ~~~~vCp~C~~ 27 (30)
T PF08274_consen 17 GELLVCPECGH 27 (30)
T ss_dssp SSSEEETTTTE
T ss_pred CCEEeCCcccc
Confidence 33455555543
No 363
>PF15269 zf-C2H2_7: Zinc-finger
Probab=30.54 E-value=29 Score=25.07 Aligned_cols=23 Identities=39% Similarity=0.710 Sum_probs=17.4
Q ss_pred cccCCCCCcccCCchhHHHHHHh
Q psy3685 451 RYKCDLCSKEFSRKGNLTKHMKL 473 (1042)
Q Consensus 451 ~~~C~~C~k~f~~~~~L~~H~~~ 473 (1042)
.|+|-+|..+...++.|-+||+.
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 36777888888888888888764
No 364
>PRK14905 triosephosphate isomerase/PTS system glucose/sucrose-specific transporter subunit IIB; Provisional
Probab=30.52 E-value=2.1e+02 Score=32.16 Aligned_cols=123 Identities=19% Similarity=0.246 Sum_probs=64.9
Q ss_pred CCeEEEEeeeec-cc-ccCCCCCCcccccCCCHHHHHhHHHHHHHH-HhhhcCCCCCCcEEEEccchhHHHHHHHHHhCC
Q psy3685 795 NALAVQVEHRFY-GK-SHPLSDLSVESLQYLSAEQALADLAYFITS-MNTLYSLPAHTKWIAFGGSYPGALAAWLRYKYP 871 (1042)
Q Consensus 795 ~a~~~~~ehR~~-G~-S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~-~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~yP 871 (1042)
.-.+|+.|...- |. -.| -|.+|| ++...+|+. +...+....+.--||+|||--..-+.-+.. -|
T Consensus 168 ~~~vIAYEPvWAIGTgg~~-----------as~~~~-~~~~~~Ir~~l~~~~~~~~~~v~ILYGGSV~~~N~~~l~~-~~ 234 (355)
T PRK14905 168 PHLFIAYEPVWAIGEGGIP-----------ASAEYA-DEKHAIIKQCLFELFAEESKKIPVLYGGSVNLENANELIM-KP 234 (355)
T ss_pred CceEEEECChHHhCCCCCC-----------CCHHHH-HHHHHHHHHHHHHHhccccCceeEEEeCcCCHHHHHHHhc-CC
Confidence 447889887754 42 111 245666 445555554 454443221112489999988777666633 44
Q ss_pred CeEEEEEeeccccccccCChhhHHHHHHHhhccChhhHHHHHHHHHH-HHHHhcCCchHHHHHHH
Q psy3685 872 HLVHGAMSASGPLRAVVDFPEYFGVVADALATVSTECVKAVQTATHT-ISKMLKSPSDAKYLTEQ 935 (1042)
Q Consensus 872 ~~~~g~i~ssa~~~~~~~~~~~~~~v~~~~~~~~~~c~~~i~~~~~~-~~~~~~~~~~~~~l~~~ 935 (1042)
+ ++|++..+|.+ +...|...+..........-.+-|..-+.. ..+++..-+|.+.+...
T Consensus 235 ~-iDG~LVG~asl----~~~~f~~Ii~~~~~~~~~~~~~~~~~~~~~~a~~ii~~lGG~~NI~~v 294 (355)
T PRK14905 235 H-IDGLFIGRSAW----DAQCFHALIADALKALAGSKIDPIIHKFSEIAIQLIDHLGGKDNISAL 294 (355)
T ss_pred C-CCEEEechhhc----cHHHHHHHHHHHHHhccCCcccHHHHhHHHHHHHHHHHhCCHHhhccc
Confidence 4 78888777776 334555555544433333333333332322 23344444555555544
No 365
>PRK04023 DNA polymerase II large subunit; Validated
Probab=30.37 E-value=43 Score=41.75 Aligned_cols=9 Identities=44% Similarity=0.977 Sum_probs=4.5
Q ss_pred eccCccccc
Q psy3685 425 YKCDLCSKK 433 (1042)
Q Consensus 425 ~~C~~C~~~ 433 (1042)
..|+.||..
T Consensus 627 RfCpsCG~~ 635 (1121)
T PRK04023 627 RKCPSCGKE 635 (1121)
T ss_pred ccCCCCCCc
Confidence 345555544
No 366
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=29.67 E-value=29 Score=31.94 Aligned_cols=25 Identities=28% Similarity=0.404 Sum_probs=21.6
Q ss_pred ccCCCCCCccCCcchHHHHHhhccCCCC
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEIHQGIRV 710 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~H~g~~~ 710 (1042)
..|-++||.|. +|++|+.+|.|..|
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCCH
Confidence 46999999995 79999999998766
No 367
>PRK00103 rRNA large subunit methyltransferase; Provisional
Probab=29.64 E-value=60 Score=31.63 Aligned_cols=47 Identities=17% Similarity=0.070 Sum_probs=27.1
Q ss_pred CCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHH
Q psy3685 795 NALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALA 863 (1042)
Q Consensus 795 ~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la 863 (1042)
++.+|+||-+|- -+|+ +++|.++......- ...-++++||+||=.=+
T Consensus 67 ~~~~i~LDe~Gk---------------~~sS----~~fA~~l~~~~~~g---~~~i~F~IGGa~G~~~~ 113 (157)
T PRK00103 67 GARVIALDERGK---------------QLSS----EEFAQELERWRDDG---RSDVAFVIGGADGLSPA 113 (157)
T ss_pred CCEEEEEcCCCC---------------cCCH----HHHHHHHHHHHhcC---CccEEEEEcCccccCHH
Confidence 567999997762 2333 33455554432211 11357999999994433
No 368
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=29.64 E-value=25 Score=32.98 Aligned_cols=23 Identities=39% Similarity=0.889 Sum_probs=15.7
Q ss_pred ccCCccccccCCchhHHHHhhhcCCC
Q psy3685 586 YKCDLCSKKFSRKLNLTKHMKIHQGI 611 (1042)
Q Consensus 586 ~~C~~C~k~F~~~~~L~~H~~~H~~~ 611 (1042)
-.|-+||+.|+. |++|++.|+|.
T Consensus 73 i~clecGk~~k~---LkrHL~~~~gl 95 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGL 95 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S
T ss_pred eEEccCCcccch---HHHHHHHccCC
Confidence 579999998874 59999999885
No 369
>cd01413 SIR2_Af2 SIR2_Af2: Archaeal and prokaryotic group which includes Archaeoglobus fulgidus Sir2-Af2, Sulfolobus solfataricus ssSir2, and several bacterial homologs; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation. Sir2 proteins have been shown to regulate gene silencing, DNA repair, metabolic enzymes, and life span. The Sir2 homolog from the archaea Sulfolobus solftaricus deacetylates the non-specific DNA protein Alba to mediate transcription repression.
Probab=28.92 E-value=1.4e+02 Score=31.18 Aligned_cols=101 Identities=12% Similarity=0.065 Sum_probs=55.9
Q ss_pred ccccCCccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhhcccCCCCCcccceEEE
Q psy3685 675 KILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQKLDHFNIEEPRTWKQRY 754 (1042)
Q Consensus 675 ~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~~dhf~~~~~~tf~qry 754 (1042)
.+|..-....|..|++.|....- +......+..|+.|+-.-. |-..||...|+. ..+ .
T Consensus 106 elHG~l~~~~C~~C~~~~~~~~~-----~~~~~~~~p~C~~Cgg~lr--------P~Vv~fgE~lp~------~~~-~-- 163 (222)
T cd01413 106 ELHGTLQTAYCVNCGSKYDLEEV-----KYAKKHEVPRCPKCGGIIR--------PDVVLFGEPLPQ------ALL-R-- 163 (222)
T ss_pred EccCCcCcceECCCCCCcchhHH-----HHhccCCCCcCCCCCCccC--------CCEEECCCCCCH------HHH-H--
Confidence 34555556789999998865432 1112234568999974221 223456554442 011 0
Q ss_pred EEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEee
Q psy3685 755 FVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEH 803 (1042)
Q Consensus 755 ~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~eh 803 (1042)
....+-...-+||.+|-.+.+.+.. .+..+|++.|+.+|.+..
T Consensus 164 --~a~~~~~~~Dl~lvvGTSl~V~p~~----~l~~~a~~~g~~~i~iN~ 206 (222)
T cd01413 164 --EAIEAAKEADLFIVLGSSLVVYPAN----LLPLIAKENGAKLVIVNA 206 (222)
T ss_pred --HHHHHHhcCCEEEEEccCCEeccHh----HHHHHHHHcCCeEEEEcC
Confidence 0111112345888898877665522 245678877888877753
No 370
>KOG1280|consensus
Probab=28.53 E-value=23 Score=38.21 Aligned_cols=35 Identities=17% Similarity=0.406 Sum_probs=23.1
Q ss_pred cccCCccccccCCchhHHHHhhhcCCC-c--ccCCCCC
Q psy3685 585 RYKCDLCSKKFSRKLNLTKHMKIHQGI-R--YKCDLCS 619 (1042)
Q Consensus 585 ~~~C~~C~k~F~~~~~L~~H~~~H~~~-~--~~C~~C~ 619 (1042)
.|.|++|++.=.+...|..|+..-|.+ + -.|++|+
T Consensus 79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 378888888877888888887654443 2 2355544
No 371
>KOG4389|consensus
Probab=28.26 E-value=1.4e+02 Score=34.51 Aligned_cols=109 Identities=24% Similarity=0.278 Sum_probs=55.3
Q ss_pred EEEEE--CCCCCCccc-ccccccHHHHHHHcCCeEEEEeeee--cccc-cCCCCCCcccccCCCHHHHHhHHHHHHHHHh
Q psy3685 767 IFLMI--GGEGEASAK-WMVEGTWLDYAHNHNALAVQVEHRF--YGKS-HPLSDLSVESLQYLSAEQALADLAYFITSMN 840 (1042)
Q Consensus 767 i~~~~--gg~g~~~~~-~~~~~~~~~la~~~~a~~~~~ehR~--~G~S-~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~ 840 (1042)
|+|++ ||..+.++. -+++|- .||..-+-.||.+-.|= +|-= .|...-..+|++.|.-.-||.=+.+=|..
T Consensus 137 VlVWiyGGGF~sGt~SLdvYdGk--~la~~envIvVs~NYRvG~FGFL~l~~~~eaPGNmGl~DQqLAl~WV~~Ni~a-- 212 (601)
T KOG4389|consen 137 VLVWIYGGGFYSGTPSLDVYDGK--FLAAVENVIVVSMNYRVGAFGFLYLPGHPEAPGNMGLLDQQLALQWVQENIAA-- 212 (601)
T ss_pred EEEEEEcCccccCCcceeeeccc--eeeeeccEEEEEeeeeeccceEEecCCCCCCCCccchHHHHHHHHHHHHhHHH--
Confidence 77777 555333332 234443 35555577888888882 1111 11222234677776444443322222222
Q ss_pred hhcCCCCCCcEEEEccchhHH-HHHHHHHhCC---CeEEEEEeecccc
Q psy3685 841 TLYSLPAHTKWIAFGGSYPGA-LAAWLRYKYP---HLVHGAMSASGPL 884 (1042)
Q Consensus 841 ~~~~~~~~~~~i~~G~Sygg~-la~~~~~~yP---~~~~g~i~ssa~~ 884 (1042)
+++++ .++-|||-|.|.+ ++|. +.-| .++.-||+-|+.+
T Consensus 213 --FGGnp-~~vTLFGESAGaASv~aH--LlsP~S~glF~raIlQSGS~ 255 (601)
T KOG4389|consen 213 --FGGNP-SRVTLFGESAGAASVVAH--LLSPGSRGLFHRAILQSGSL 255 (601)
T ss_pred --hCCCc-ceEEEeccccchhhhhhe--ecCCCchhhHHHHHhhcCCC
Confidence 34333 4789999999874 3332 2223 3455555555544
No 372
>cd01407 SIR2-fam SIR2 family of proteins includes silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation, where the acetyl group from the lysine epsilon-amino group is transferred to the ADP-ribose moiety of NAD+, producing nicotinamide and the novel metabolite O-acetyl-ADP-ribose. Sir2 proteins, also known as sirtuins, are found in all eukaryotes and many archaea and prokaryotes and have been shown to regulate gene silencing, DNA repair, metabolic enzymes, and life span. The most-studied function, gene silencing, involves the inactivation of chromosome domains containing key regulatory genes by packaging them into a specialized chromatin structure that is inaccessible to DNA-binding proteins. The oligomerization state of Sir2 appears to be organism-dependent, sometimes occurring as a monomer and sometimes as a multimer.
Probab=28.05 E-value=1.6e+02 Score=30.57 Aligned_cols=102 Identities=17% Similarity=0.162 Sum_probs=52.7
Q ss_pred cccccCCccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhhcccCCCCCcccceEE
Q psy3685 674 NKILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQKLDHFNIEEPRTWKQR 753 (1042)
Q Consensus 674 ~~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~~dhf~~~~~~tf~qr 753 (1042)
..+|..-....|..|++.+... .+... . ....+..|+.|+..-... .-||...++.. |.+
T Consensus 101 ~elHG~~~~~~C~~C~~~~~~~-~~~~~--~-~~~~~p~C~~Cg~~lrP~--------Vv~fgE~~p~~-------~~~- 160 (218)
T cd01407 101 IELHGSLFRVRCTKCGKEYPRD-ELQAD--I-DREEVPRCPKCGGLLRPD--------VVFFGESLPEE-------LDE- 160 (218)
T ss_pred EECcCCcCcceeCCCcCCCcHH-HHhHh--h-ccCCCCcCCCCCCccCCC--------eEECCCCCcHH-------HHH-
Confidence 3345555567899999988644 22211 1 123467899998543332 22444333210 110
Q ss_pred EEEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEee
Q psy3685 754 YFVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEH 803 (1042)
Q Consensus 754 y~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~eh 803 (1042)
...+-...-+||.+|-.+.+.+.+. +...|++.|+.+|.+..
T Consensus 161 ----a~~~~~~~Dl~lvlGTSl~V~p~~~----l~~~~~~~~~~~i~iN~ 202 (218)
T cd01407 161 ----AAEALAKADLLLVIGTSLQVYPAAG----LPLYAPERGAPVVIINL 202 (218)
T ss_pred ----HHHHHhcCCEEEEeCCCcccccHHH----HHHHHHHCCCeEEEECC
Confidence 0001111457888886655544222 33445556888887763
No 373
>PF10081 Abhydrolase_9: Alpha/beta-hydrolase family; InterPro: IPR012037 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=27.46 E-value=2.2e+02 Score=30.54 Aligned_cols=108 Identities=14% Similarity=0.127 Sum_probs=58.9
Q ss_pred cEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCC-HHHHHhHHHHHHHHHhhhc-
Q psy3685 766 PIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLS-AEQALADLAYFITSMNTLY- 843 (1042)
Q Consensus 766 pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt-~~q~l~D~~~~~~~~~~~~- 843 (1042)
-+|++..--|+ .|++...+..+...+++.+..+-..|--.. .=+.||+ -+.+.+-...++..+....
T Consensus 33 ~~lvV~~pTGt---GWVdp~a~~a~E~l~~GD~A~va~QYSylP--------Sw~sfl~dr~~a~~a~~aL~~aV~~~~~ 101 (289)
T PF10081_consen 33 KVLVVATPTGT---GWVDPWAVDALEYLYGGDVAIVAMQYSYLP--------SWLSFLVDRDAAREAARALFEAVYARWS 101 (289)
T ss_pred ceEEEEcCCCC---CccCHHHHhHHHHHhCCCeEEEEecccccc--------chHHHhcccchHHHHHHHHHHHHHHHHH
Confidence 35555543332 366655555666667777666665552222 1234553 4445555556666654433
Q ss_pred CCC--CCCcEEEEccchhHHHHHHHHHhCC---CeEEEEEeecccc
Q psy3685 844 SLP--AHTKWIAFGGSYPGALAAWLRYKYP---HLVHGAMSASGPL 884 (1042)
Q Consensus 844 ~~~--~~~~~i~~G~Sygg~la~~~~~~yP---~~~~g~i~ssa~~ 884 (1042)
.++ ...+++++|-|.|..-+.-.....+ +.++|++-+++|-
T Consensus 102 ~lP~~~RPkL~l~GeSLGa~g~~~af~~~~~~~~~vdGalw~GpP~ 147 (289)
T PF10081_consen 102 TLPEDRRPKLYLYGESLGAYGGEAAFDGLDDLRDRVDGALWVGPPF 147 (289)
T ss_pred hCCcccCCeEEEeccCccccchhhhhccHHHhhhhcceEEEeCCCC
Confidence 222 3357999999988754433332222 3378888777654
No 374
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=27.27 E-value=47 Score=30.57 Aligned_cols=11 Identities=9% Similarity=0.135 Sum_probs=4.6
Q ss_pred ccccCccCccc
Q psy3685 531 RYKCDLCSKEF 541 (1042)
Q Consensus 531 ~~~C~~C~k~f 541 (1042)
|-.|+.||..|
T Consensus 26 p~vcP~cg~~~ 36 (129)
T TIGR02300 26 PAVSPYTGEQF 36 (129)
T ss_pred CccCCCcCCcc
Confidence 33444444443
No 375
>KOG1280|consensus
Probab=27.01 E-value=51 Score=35.66 Aligned_cols=39 Identities=10% Similarity=0.050 Sum_probs=29.7
Q ss_pred CccccCCCCCCccCCcchHHHHHhh-ccCCC-CcccCCCCC
Q psy3685 680 DIRYKCDLCSKDFSRKDNLTEHKEI-HQGIR-VSDINHGTL 718 (1042)
Q Consensus 680 ~~~~~C~~C~k~F~~~~~L~~H~~~-H~g~~-~~~c~~c~~ 718 (1042)
+..|.|+.|++.=.+...|..|+.. |.... ...|+.|+.
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~~ 117 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCAA 117 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcceeeecccc
Confidence 4479999999999999999999864 65543 246667764
No 376
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=26.90 E-value=29 Score=28.64 Aligned_cols=13 Identities=31% Similarity=0.631 Sum_probs=6.7
Q ss_pred cccC--CCccccccc
Q psy3685 36 YKCN--LCSKEFSRK 48 (1042)
Q Consensus 36 ~~C~--~C~k~f~~~ 48 (1042)
+.|. .||.+|...
T Consensus 28 ~qC~N~eCg~tF~t~ 42 (72)
T PRK09678 28 HQCQNVNCSATFITY 42 (72)
T ss_pred eecCCCCCCCEEEEE
Confidence 4555 555555443
No 377
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=26.56 E-value=31 Score=34.46 Aligned_cols=16 Identities=19% Similarity=0.503 Sum_probs=8.2
Q ss_pred cccCccCccccCchhh
Q psy3685 63 YKCNLCSKEFSRNCYL 78 (1042)
Q Consensus 63 ~~C~~C~~~f~~~~~L 78 (1042)
|.|+.|+..|+...++
T Consensus 118 Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 118 FFCPNCHIRFTFDEAM 133 (178)
T ss_pred EECCCCCcEEeHHHHh
Confidence 5555555555544443
No 378
>PF08237 PE-PPE: PE-PPE domain; InterPro: IPR013228 The human pathogen Mycobacterium tuberculosis harbours a large number of genes that encode proteins whose N-termini contain the characteristic motifs Pro-Glu (PE) or Pro-Pro-Glu (PPE). A subgroup of the PE proteins contains polymorphic GC-rich sequences (PGRS), while a subgroup of the PPE proteins contains major polymorphic tandem repeats (MPTR). The function of most of these proteins remains unknown []. However, the PE_PGRS proteins from Mycobacterium marinum are secreted by components of the ESX-5 system that belongs to the recently defined type VII secretion systems []. It has also been reported that the PE_PGRS family of proteins contains multiple calcium-binding and glycine-rich sequence motifs GGXGXD/NXUX. This sequence repeat constitutes a calcium-binding parallel beta-roll or parallel beta-helix structure and is found in RTX toxins secreted by many Gram-negative bacteria []. This domain is found C-terminal to the PE (IPR000084 from INTERPRO) and PPE (IPR000030 from INTERPRO) domains. The secondary structure of this domain is predicted to be a mixture of alpha helices and beta strands [].
Probab=26.54 E-value=1.3e+02 Score=31.35 Aligned_cols=46 Identities=13% Similarity=0.010 Sum_probs=27.8
Q ss_pred cCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHH
Q psy3685 821 QYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLR 867 (1042)
Q Consensus 821 ~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~ 867 (1042)
.-+|.++.+++-+.-+...-..... ...+++++|-|.|+.+|+...
T Consensus 21 ~~~t~~~Sv~~G~~~L~~ai~~~~~-~~~~vvV~GySQGA~Va~~~~ 66 (225)
T PF08237_consen 21 GSPTYDESVAEGVANLDAAIRAAIA-AGGPVVVFGYSQGAVVASNVL 66 (225)
T ss_pred CCCccchHHHHHHHHHHHHHHhhcc-CCCCEEEEEECHHHHHHHHHH
Confidence 3346666666655444332222111 335899999999998886544
No 379
>TIGR00515 accD acetyl-CoA carboxylase, carboxyl transferase, beta subunit. The enzyme acetyl-CoA carboxylase contains a biotin carboxyl carrier protein or domain, a biotin carboxylase, and a carboxyl transferase. This model represents the beta chain of the carboxyl transferase for cases in which the architecture of the protein is as in E. coli, in which the carboxyltransferase portion consists of two non-identical subnits, alpha and beta.
Probab=26.41 E-value=1.3e+02 Score=32.60 Aligned_cols=17 Identities=12% Similarity=0.200 Sum_probs=12.3
Q ss_pred CcE--EEEccchhHHHHHH
Q psy3685 849 TKW--IAFGGSYPGALAAW 865 (1042)
Q Consensus 849 ~~~--i~~G~Sygg~la~~ 865 (1042)
.|+ |++|-+|||+.++.
T Consensus 194 vP~IsVv~gpt~GG~aas~ 212 (285)
T TIGR00515 194 LPYISVLTDPTTGGVSASF 212 (285)
T ss_pred CCEEEEEeCCcchHHHHHH
Confidence 355 57899999975554
No 380
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=26.35 E-value=46 Score=24.71 Aligned_cols=10 Identities=50% Similarity=1.208 Sum_probs=5.7
Q ss_pred CccccCccCc
Q psy3685 61 IRYKCNLCSK 70 (1042)
Q Consensus 61 ~~~~C~~C~~ 70 (1042)
..|+|..|++
T Consensus 36 ~~~~C~~C~~ 45 (46)
T PF12760_consen 36 GRYRCKACRK 45 (46)
T ss_pred CeEECCCCCC
Confidence 4566666653
No 381
>TIGR00246 tRNA_RlmH_YbeA rRNA large subunit m3Psi methyltransferase RlmH. This protein, in the SPOUT methyltransferase family, previously designated YbeA in E. coli, was shown to be responsible for a further modification, a methylation, to a pseudouridine base in ribosomal large subunit RNA.
Probab=26.30 E-value=76 Score=30.79 Aligned_cols=47 Identities=15% Similarity=0.029 Sum_probs=27.8
Q ss_pred cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHH
Q psy3685 794 HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALA 863 (1042)
Q Consensus 794 ~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la 863 (1042)
.++.+|+||-||-= +|+ .++|.++...... +..-++++||+||=.=+
T Consensus 64 ~~~~~i~LDe~Gk~---------------~sS----~~fA~~l~~~~~~----g~~i~FvIGGa~G~~~~ 110 (153)
T TIGR00246 64 GKAHVVTLDIPGKP---------------WTT----PQLADTLEKWKTD----GRDVTLLIGGPEGLSPT 110 (153)
T ss_pred CCCeEEEEcCCCCc---------------CCH----HHHHHHHHHHhcc----CCeEEEEEcCCCcCCHH
Confidence 36789999987621 222 3445555443221 12367999999995444
No 382
>PRK14138 NAD-dependent deacetylase; Provisional
Probab=26.03 E-value=1.3e+02 Score=31.72 Aligned_cols=101 Identities=12% Similarity=0.132 Sum_probs=56.0
Q ss_pred cccCCccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhhcccCCCCCcccceEEEE
Q psy3685 676 ILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQKLDHFNIEEPRTWKQRYF 755 (1042)
Q Consensus 676 ~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~~dhf~~~~~~tf~qry~ 755 (1042)
+|..-....|..|++.|....-+. .-....+..|+.|+-.-. |-..||...++. ..+.
T Consensus 113 lHG~~~~~~C~~C~~~~~~~~~~~----~~~~~~~p~Cp~Cgg~lr--------P~Vv~FgE~~p~------~~~~---- 170 (244)
T PRK14138 113 LHGNVEEYYCVRCGKRYTVEDVIE----KLEKSDVPRCDDCSGLIR--------PNIVFFGEALPQ------DALR---- 170 (244)
T ss_pred ccCCcCeeEECCCCCcccHHHHHH----HHhcCCCCCCCCCCCeEC--------CCEEECCCcCCH------HHHH----
Confidence 455555678999999987543332 111224668999974211 234456555432 0110
Q ss_pred EecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEee
Q psy3685 756 VNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEH 803 (1042)
Q Consensus 756 ~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~eh 803 (1042)
...-+-...-+||.+|-...+.+.. .+..+|.+.|+.+|.+..
T Consensus 171 -~~~~~~~~aDl~lviGTSl~V~pa~----~l~~~~~~~g~~~i~iN~ 213 (244)
T PRK14138 171 -EAIRLSSKASLMIVMGSSLVVYPAA----ELPLITVRSGGKLVIVNL 213 (244)
T ss_pred -HHHHHHhcCCEEEEeCcCCeeecHh----HHHHHHHHcCCeEEEEcC
Confidence 0111112345888888776665521 134567777888887764
No 383
>PF15269 zf-C2H2_7: Zinc-finger
Probab=25.84 E-value=45 Score=24.11 Aligned_cols=23 Identities=39% Similarity=0.638 Sum_probs=18.7
Q ss_pred ccccCCCCCCCCChhHHHHHHHh
Q psy3685 143 RYKCDLCSKDFSYKSSLIEHKKL 165 (1042)
Q Consensus 143 ~~~C~~C~~~F~~~~~L~~H~~~ 165 (1042)
.|+|-.|..+...++.|..||+-
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 37788888888888888888864
No 384
>KOG2593|consensus
Probab=25.78 E-value=30 Score=38.71 Aligned_cols=11 Identities=9% Similarity=0.151 Sum_probs=4.9
Q ss_pred hHHHHHhhccC
Q psy3685 697 NLTEHKEIHQG 707 (1042)
Q Consensus 697 ~L~~H~~~H~g 707 (1042)
.|..+++.=.|
T Consensus 188 pi~d~Lk~~e~ 198 (436)
T KOG2593|consen 188 PIIDLLKELEG 198 (436)
T ss_pred HHHHHHHHhhc
Confidence 34445444344
No 385
>COG1773 Rubredoxin [Energy production and conversion]
Probab=25.77 E-value=52 Score=25.49 Aligned_cols=12 Identities=17% Similarity=0.789 Sum_probs=6.3
Q ss_pred ccCCcCCccccC
Q psy3685 479 YKCDLCLKEFSR 490 (1042)
Q Consensus 479 ~~C~~C~~~f~~ 490 (1042)
|+|..|+..|.-
T Consensus 4 ~~C~~CG~vYd~ 15 (55)
T COG1773 4 WRCSVCGYVYDP 15 (55)
T ss_pred eEecCCceEecc
Confidence 455555555543
No 386
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=25.01 E-value=27 Score=28.80 Aligned_cols=41 Identities=17% Similarity=0.328 Sum_probs=18.6
Q ss_pred cccCCCccccccccchhhhhhhccCCccccC--ccCccccCchh
Q psy3685 36 YKCNLCSKEFSRKYNLTDHKKIHQGIRYKCN--LCSKEFSRNCY 77 (1042)
Q Consensus 36 ~~C~~C~k~f~~~~~L~~H~~~h~~~~~~C~--~C~~~f~~~~~ 77 (1042)
+.|+.||..-.-..+-..... -....+.|. .||.+|.....
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~-~~~~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT-TKERYHQCQNVNCSATFITYES 44 (72)
T ss_pred ccCCCCCCccEEEEChhcChh-hheeeeecCCCCCCCEEEEEEE
Confidence 456666655422222111111 111345666 67776665444
No 387
>PF01972 SDH_sah: Serine dehydrogenase proteinase; InterPro: IPR002825 This family of archaebacterial proteins, formerly known as DUF114, has been found to be a serine dehydrogenase proteinase distantly related to ClpP proteinases that belong to the serine proteinase superfamily. The family belong to MEROPS peptidase family S49; they are mostly unassigned peptidases but include the archaean signal peptide peptidase 1 []. The family has a catalytic triad of Ser, Asp, His residues, which shows an altered residue ordering compared with the ClpP proteinases but similar to that of the carboxypeptidase clan []. ; GO: 0016021 integral to membrane
Probab=24.63 E-value=2.8e+02 Score=29.65 Aligned_cols=82 Identities=12% Similarity=0.092 Sum_probs=52.2
Q ss_pred cHHHHHHHcCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHH
Q psy3685 786 TWLDYAHNHNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAW 865 (1042)
Q Consensus 786 ~~~~la~~~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~ 865 (1042)
++.++.++-|..||.+=||-=-.|. + ...-.+|+ .++|...+.+.++..- ++.++.++=++.||.+-|-
T Consensus 39 ~i~~ie~kr~srvI~~Ihrqe~~~~-~---giPi~~~I----~i~dse~v~raI~~~~---~~~~IdLii~TpGG~v~AA 107 (285)
T PF01972_consen 39 LIREIEEKRGSRVITLIHRQERVSF-L---GIPIYRYI----DIDDSEFVLRAIREAP---KDKPIDLIIHTPGGLVDAA 107 (285)
T ss_pred HHHHHHHHhCCEEEEEEEeccccce-e---ccccceeE----cHhhHHHHHHHHHhcC---CCCceEEEEECCCCcHHHH
Confidence 4667888889999999997311111 0 01112344 3467777777776542 3457888889999965543
Q ss_pred H-----HHhCCCeEEEEE
Q psy3685 866 L-----RYKYPHLVHGAM 878 (1042)
Q Consensus 866 ~-----~~~yP~~~~g~i 878 (1042)
. -..+|..+.+.|
T Consensus 108 ~~I~~~l~~~~~~v~v~V 125 (285)
T PF01972_consen 108 EQIARALREHPAKVTVIV 125 (285)
T ss_pred HHHHHHHHhCCCCEEEEE
Confidence 3 257888888776
No 388
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=24.39 E-value=46 Score=30.88 Aligned_cols=27 Identities=19% Similarity=0.538 Sum_probs=16.7
Q ss_pred cccCCccccccCCch-hHHHHhhhcCCCcccCCCCCcc
Q psy3685 585 RYKCDLCSKKFSRKL-NLTKHMKIHQGIRYKCDLCSKE 621 (1042)
Q Consensus 585 ~~~C~~C~k~F~~~~-~L~~H~~~H~~~~~~C~~C~k~ 621 (1042)
|++|..||+.|...+ .|. --|+.||-.
T Consensus 1 PH~Ct~Cg~~f~dgs~eil----------~GCP~CGg~ 28 (131)
T PF09845_consen 1 PHQCTKCGRVFEDGSKEIL----------SGCPECGGN 28 (131)
T ss_pred CcccCcCCCCcCCCcHHHH----------ccCcccCCc
Confidence 467777777777655 222 137778753
No 389
>PTZ00409 Sir2 (Silent Information Regulator) protein; Provisional
Probab=24.38 E-value=2.1e+02 Score=30.77 Aligned_cols=105 Identities=13% Similarity=0.086 Sum_probs=54.7
Q ss_pred ccccCCccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhhcccCCCCCcccceEEE
Q psy3685 675 KILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQKLDHFNIEEPRTWKQRY 754 (1042)
Q Consensus 675 ~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~~dhf~~~~~~tf~qry 754 (1042)
.+|..-....|..|++.+.....+...+..-..+.+..|+ |+-.-. |-.-||...|+. ..+.+
T Consensus 130 ElHG~l~~~~C~~C~~~~~~~~~~~~~~~~~~~~~~P~C~-Cgg~lr--------P~VV~FGE~lp~------~~~~~-- 192 (271)
T PTZ00409 130 PLHGSVFEARCCTCRKTIQLNKIMLQKTSHFMHQLPPECP-CGGIFK--------PNVILFGEVIPK------SLLKQ-- 192 (271)
T ss_pred EeccCcCcceeCCCCCCcccCHHHHhhhhhhccCCCCCCC-CCCccc--------CcEEEeCCcCCH------HHHHH--
Confidence 3444455678999998887544333221111112256787 863211 223456555442 01110
Q ss_pred EEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEee
Q psy3685 755 FVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEH 803 (1042)
Q Consensus 755 ~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~eh 803 (1042)
-.+ +-...-+||.+|-.+.+.+... +..+|.+.|+.||.+..
T Consensus 193 --a~~-~~~~aDlllviGTSl~V~pa~~----l~~~a~~~g~~vi~IN~ 234 (271)
T PTZ00409 193 --AEK-EIDKCDLLLVVGTSSSVSTATN----LCYRAHRKKKKIVEVNI 234 (271)
T ss_pred --HHH-HHHcCCEEEEECCCCcccCHHH----HHHHHHHcCCCEEEECC
Confidence 001 1122447888888776655221 44567888998888764
No 390
>PRK00481 NAD-dependent deacetylase; Provisional
Probab=23.38 E-value=1.8e+02 Score=30.76 Aligned_cols=99 Identities=13% Similarity=0.124 Sum_probs=52.6
Q ss_pred ccccCCccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhhcccCCCCCcccceEEE
Q psy3685 675 KILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQKLDHFNIEEPRTWKQRY 754 (1042)
Q Consensus 675 ~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~~dhf~~~~~~tf~qry 754 (1042)
.+|..-....|..|++.+.....+. .+ +..|+.|+-.-.. ...||.+.++.- .|..
T Consensus 115 elHG~~~~~~C~~C~~~~~~~~~~~-------~~-~p~C~~Cgg~lrP--------~Vv~fge~~~~~------~~~~-- 170 (242)
T PRK00481 115 ELHGSLLRARCTKCGQTYDLDEYLK-------PE-PPRCPKCGGILRP--------DVVLFGEMLPEL------AIDE-- 170 (242)
T ss_pred eccCCcCceeeCCCCCCcChhhhcc-------CC-CCCCCCCCCccCC--------CeEECCCCCCHH------HHHH--
Confidence 3455556678999998886544321 12 4459999733222 233565555320 1100
Q ss_pred EEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeee
Q psy3685 755 FVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHR 804 (1042)
Q Consensus 755 ~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR 804 (1042)
..+.+. ..-+||.+|-...+.+.+. +...|..-|+.+|.+-.-
T Consensus 171 --a~~~~~-~~dl~lviGTsl~V~p~~~----l~~~~~~~~~~~i~iN~~ 213 (242)
T PRK00481 171 --AYEALE-EADLFIVIGTSLVVYPAAG----LPYEAREHGAKTVEINLE 213 (242)
T ss_pred --HHHHHh-cCCEEEEECCCceEcCHhH----HHHHHHHCCCeEEEECCC
Confidence 001111 2467888886555533222 335567778888887644
No 391
>PF02590 SPOUT_MTase: Predicted SPOUT methyltransferase; InterPro: IPR003742 This family of proteins are predicted to be SPOUT methyltransferases []. ; GO: 0008168 methyltransferase activity, 0006364 rRNA processing, 0005737 cytoplasm; PDB: 1VH0_E 4FAK_A 1TO0_G 1O6D_A 1NS5_B.
Probab=23.36 E-value=77 Score=30.84 Aligned_cols=52 Identities=17% Similarity=0.144 Sum_probs=26.7
Q ss_pred cCCeEEEEeeeecccccCCCCCCcccccCCCHHHHHhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHh
Q psy3685 794 HNALAVQVEHRFYGKSHPLSDLSVESLQYLSAEQALADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYK 869 (1042)
Q Consensus 794 ~~a~~~~~ehR~~G~S~p~~~~~~~~~~~lt~~q~l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~ 869 (1042)
-++.+|+||-+|-- +|+ .++|.++......- ...-++++||||| ++--++.+
T Consensus 66 ~~~~~i~Ld~~Gk~---------------~sS----~~fA~~l~~~~~~g---~~~i~F~IGG~~G--~~~~~~~~ 117 (155)
T PF02590_consen 66 PNDYVILLDERGKQ---------------LSS----EEFAKKLERWMNQG---KSDIVFIIGGADG--LSEEVRKR 117 (155)
T ss_dssp TTSEEEEE-TTSEE-----------------H----HHHHHHHHHHHHTT---S-EEEEEE-BTTB----HHHHHH
T ss_pred CCCEEEEEcCCCcc---------------CCh----HHHHHHHHHHHhcC---CceEEEEEecCCC--CCHHHHhh
Confidence 37889999977622 233 34455555443321 1124799999998 54444443
No 392
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=23.14 E-value=42 Score=28.53 Aligned_cols=11 Identities=36% Similarity=1.144 Sum_probs=5.6
Q ss_pred cccCccCcccc
Q psy3685 63 YKCNLCSKEFS 73 (1042)
Q Consensus 63 ~~C~~C~~~f~ 73 (1042)
+.|..|+..|.
T Consensus 54 W~C~kCg~~fA 64 (89)
T COG1997 54 WKCRKCGAKFA 64 (89)
T ss_pred EEcCCCCCeec
Confidence 45555555543
No 393
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=23.12 E-value=60 Score=25.48 Aligned_cols=7 Identities=43% Similarity=0.700 Sum_probs=3.0
Q ss_pred chhhhhh
Q psy3685 50 NLTDHKK 56 (1042)
Q Consensus 50 ~L~~H~~ 56 (1042)
.|..|..
T Consensus 25 ~l~~H~~ 31 (60)
T PF02176_consen 25 ELDDHLE 31 (60)
T ss_dssp CHHHHHH
T ss_pred HHHHHHH
Confidence 4444444
No 394
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=23.04 E-value=1e+02 Score=27.55 Aligned_cols=11 Identities=36% Similarity=0.930 Sum_probs=8.1
Q ss_pred ccCCCCCCccC
Q psy3685 683 YKCDLCSKDFS 693 (1042)
Q Consensus 683 ~~C~~C~k~F~ 693 (1042)
|.|..|+..++
T Consensus 91 y~C~~C~~~w~ 101 (104)
T TIGR01384 91 YKCTKCGYVWR 101 (104)
T ss_pred EEeCCCCCeeE
Confidence 77888887664
No 395
>PF01221 Dynein_light: Dynein light chain type 1 ; InterPro: IPR001372 Dynein is a multisubunit microtubule-dependent motor enzyme that acts as the force generating protein of eukaryotic cilia and flagella. The cytoplasmic isoform of dynein acts as a motor for the intracellular retrograde motility of vesicles and organelles along microtubules. Dynein is composed of a number of ATP-binding large subunits (see IPR004273 from INTERPRO), intermediate size subunits and small subunits. Among the small subunits, there is a family of highly conserved proteins which make up this family [, ]. Both type 1 (DLC1) and 2 (DLC2) dynein light chains have a similar two-layer alpha-beta core structure consisting of beta-alpha(2)-beta-X-beta(2) [, ].; GO: 0007017 microtubule-based process, 0005875 microtubule associated complex; PDB: 1F95_A 1F96_A 1F3C_A 3P8M_B 2XQQ_C 1RE6_A 1CMI_A 1PWK_A 1PWJ_A 4DS1_C ....
Probab=23.01 E-value=63 Score=28.02 Aligned_cols=29 Identities=24% Similarity=0.626 Sum_probs=19.7
Q ss_pred hHHHHHHHH-HhhhcCCCCCCcE-EEEccchhHHH
Q psy3685 830 ADLAYFITS-MNTLYSLPAHTKW-IAFGGSYPGAL 862 (1042)
Q Consensus 830 ~D~~~~~~~-~~~~~~~~~~~~~-i~~G~Sygg~l 862 (1042)
.|+|.+|+. +...|+ ..| +++|.|||+.+
T Consensus 36 ~eiA~~iK~~lD~~yG----~~Wh~IVG~~Fg~~~ 66 (89)
T PF01221_consen 36 KEIAEFIKQELDKKYG----PTWHCIVGKSFGSSV 66 (89)
T ss_dssp HHHHHHHHHHHHHHHS----S-EEEEEESEEEEEE
T ss_pred HHHHHHHHHHHhcccC----CceEEEECCcEEEEE
Confidence 667777754 666674 367 78899999643
No 396
>cd01410 SIRT7 SIRT7: Eukaryotic and prokaryotic group (class4) which includes human sirtuin SIRT6, SIRT7, and several bacterial homologs; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation. Sir2 proteins have been shown to regulate gene silencing, DNA repair, metabolic enzymes, and life span.
Probab=22.88 E-value=2.7e+02 Score=28.61 Aligned_cols=103 Identities=13% Similarity=0.057 Sum_probs=56.1
Q ss_pred ccccCCccccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCCCCCCccccCCCCcchhhhhcccCCCCCcccceEEE
Q psy3685 675 KILHQDIRYKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLGEPSGVMFEKLPSDQWFEQKLDHFNIEEPRTWKQRY 754 (1042)
Q Consensus 675 ~~h~~~~~~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~~~~~~~~~~~~~~~~f~q~~dhf~~~~~~tf~qry 754 (1042)
.+|..-....|..|+..+.....+. .+.....+..|+.|+-.-. |..-||.+.++.- .|.
T Consensus 88 elHG~~~~~~C~~C~~~~~~~~~~~---~~~~~~~~p~C~~Cgg~lr--------P~VV~FgE~lp~~------~~~--- 147 (206)
T cd01410 88 ELHGNMFIEVCKSCGPEYVRDDVVE---TRGDKETGRRCHACGGILK--------DTIVDFGERLPPE------NWM--- 147 (206)
T ss_pred EecCCcCcccCCCCCCccchHHHHH---HhhcCCCCCcCCCCcCccC--------CcEEECCCCCCHH------HHH---
Confidence 3444455678999998876443322 2223344678999974321 3345666554421 010
Q ss_pred EEecccccCCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEee
Q psy3685 755 FVNSTWYKHGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEH 803 (1042)
Q Consensus 755 ~~~~~~~~~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~eh 803 (1042)
....+-...-+||.+|-.+.+.+... +...|.+.|+.+|.+..
T Consensus 148 --~a~~~~~~aDlllviGTSl~V~pa~~----l~~~~~~~g~~vi~iN~ 190 (206)
T cd01410 148 --GAAAAACRADLFLCLGTSLQVTPAAN----LPLKAARAGGRLVIVNL 190 (206)
T ss_pred --HHHHHHhcCCEEEEECcCceehhHHH----HHHHHHhcCCeEEEECC
Confidence 01111123458888887766654211 34567777888887653
No 397
>KOG2960|consensus
Probab=22.58 E-value=86 Score=31.62 Aligned_cols=33 Identities=21% Similarity=0.307 Sum_probs=26.3
Q ss_pred CcEEEEccchhHHHHHHHH-HhCCCeEEEEEeec
Q psy3685 849 TKWIAFGGSYPGALAAWLR-YKYPHLVHGAMSAS 881 (1042)
Q Consensus 849 ~~~i~~G~Sygg~la~~~~-~~yP~~~~g~i~ss 881 (1042)
+-+|++|..-.|.-||++. .+.|++-.+.|-+|
T Consensus 77 sDvviVGAGSaGLsAAY~I~~~rPdlkvaIIE~S 110 (328)
T KOG2960|consen 77 SDVVIVGAGSAGLSAAYVIAKNRPDLKVAIIESS 110 (328)
T ss_pred cceEEECCCccccceeeeeeccCCCceEEEEEee
Confidence 4688899888888888866 59999988888555
No 398
>KOG1701|consensus
Probab=22.51 E-value=68 Score=35.85 Aligned_cols=38 Identities=26% Similarity=0.606 Sum_probs=18.5
Q ss_pred ccCCCCcccCCchhHHHHHHH-hcCCccccCccCcccCC
Q psy3685 506 KCDLCSKEFSSKGNLTEHMKQ-HQGIRYKCDLCSKEFSR 543 (1042)
Q Consensus 506 ~C~~C~~~f~~~~~L~~H~~~-H~~~~~~C~~C~k~f~~ 543 (1042)
.|..|+|...-...--+=|.. -|-.-|+|..|++...-
T Consensus 276 iC~~C~K~V~g~~~ac~Am~~~fHv~CFtC~~C~r~L~G 314 (468)
T KOG1701|consen 276 ICAFCHKTVSGQGLAVEAMDQLFHVQCFTCRTCRRQLAG 314 (468)
T ss_pred hhhhcCCcccCcchHHHHhhhhhcccceehHhhhhhhcc
Confidence 566677665554443333321 12224666666554443
No 399
>PF09994 DUF2235: Uncharacterized alpha/beta hydrolase domain (DUF2235); InterPro: IPR018712 This domain has no known function.
Probab=22.46 E-value=4.8e+02 Score=28.15 Aligned_cols=20 Identities=20% Similarity=0.026 Sum_probs=17.1
Q ss_pred CCcEEEEccchhHHHHHHHH
Q psy3685 848 HTKWIAFGGSYPGALAAWLR 867 (1042)
Q Consensus 848 ~~~~i~~G~Sygg~la~~~~ 867 (1042)
...+++||.|-|+.+|.-+.
T Consensus 91 gd~I~lfGFSRGA~~AR~~a 110 (277)
T PF09994_consen 91 GDRIYLFGFSRGAYTARAFA 110 (277)
T ss_pred cceEEEEecCccHHHHHHHH
Confidence 34799999999999997766
No 400
>PF12242 Eno-Rase_NADH_b: NAD(P)H binding domain of trans-2-enoyl-CoA reductase; PDB: 3ZU5_A 3ZU3_A 3ZU4_A 3ZU2_A 3S8M_A.
Probab=22.30 E-value=2.9e+02 Score=23.26 Aligned_cols=42 Identities=14% Similarity=0.140 Sum_probs=25.6
Q ss_pred HhHHHHHHHHHhhhcCCCCCCcEEEEccchhHHHHHHHHHhC
Q psy3685 829 LADLAYFITSMNTLYSLPAHTKWIAFGGSYPGALAAWLRYKY 870 (1042)
Q Consensus 829 l~D~~~~~~~~~~~~~~~~~~~~i~~G~Sygg~la~~~~~~y 870 (1042)
.+.+.+-|++++..-....-.+|+++|.|-|=.||+....-|
T Consensus 20 ~~~V~~qI~yvk~~~~~~GpK~VLViGaStGyGLAsRIa~aF 61 (78)
T PF12242_consen 20 ARNVENQIEYVKSQGKINGPKKVLVIGASTGYGLASRIAAAF 61 (78)
T ss_dssp HHHHHHHHHHHHHC---TS-SEEEEES-SSHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhcCCCCCCceEEEEecCCcccHHHHHHHHh
Confidence 344555566665543222224799999999988998877665
No 401
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=22.12 E-value=40 Score=23.64 Aligned_cols=8 Identities=25% Similarity=1.094 Sum_probs=3.5
Q ss_pred CCcccccc
Q psy3685 588 CDLCSKKF 595 (1042)
Q Consensus 588 C~~C~k~F 595 (1042)
|+.||+.|
T Consensus 4 C~~Cg~~Y 11 (36)
T PF05191_consen 4 CPKCGRIY 11 (36)
T ss_dssp ETTTTEEE
T ss_pred cCCCCCcc
Confidence 44444444
No 402
>PRK13938 phosphoheptose isomerase; Provisional
Probab=21.94 E-value=3.5e+02 Score=27.53 Aligned_cols=23 Identities=13% Similarity=0.072 Sum_probs=20.8
Q ss_pred CcEEEEccchhHHHHHHHHHhCC
Q psy3685 849 TKWIAFGGSYPGALAAWLRYKYP 871 (1042)
Q Consensus 849 ~~~i~~G~Sygg~la~~~~~~yP 871 (1042)
.+++++|-+-+|.+|..+..+..
T Consensus 46 ~rI~i~G~G~S~~~A~~fa~~L~ 68 (196)
T PRK13938 46 ARVFMCGNGGSAADAQHFAAELT 68 (196)
T ss_pred CEEEEEeCcHHHHHHHHHHHHcC
Confidence 58999999999999999998875
No 403
>COG4474 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.11 E-value=7.7e+02 Score=24.22 Aligned_cols=31 Identities=23% Similarity=0.457 Sum_probs=23.0
Q ss_pred CCcEEEEccchhHHH-----HHHHHHhCCCeEEEEE
Q psy3685 848 HTKWIAFGGSYPGAL-----AAWLRYKYPHLVHGAM 878 (1042)
Q Consensus 848 ~~~~i~~G~Sygg~l-----a~~~~~~yP~~~~g~i 878 (1042)
...|++|-|+.|=-+ |.-+...||++-.|.|
T Consensus 42 GleW~litGqLG~E~WA~Evv~eLk~eyp~ik~avi 77 (180)
T COG4474 42 GLEWVLITGQLGFELWAAEVVIELKEEYPHIKLAVI 77 (180)
T ss_pred CceEEEEeccccHHHHHHHHHHHHHhhCCCeeEEEE
Confidence 358999999988654 3446789999776665
No 404
>PF04083 Abhydro_lipase: Partial alpha/beta-hydrolase lipase region; InterPro: IPR006693 The alpha/beta hydrolase fold is common to several hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is similar: an alpha/beta sheet, not barrel, of eight beta-sheets connected by alpha-helices []. This entry represents the N-terminal part of an alpha/beta hydrolase domain found in a number of lipases.; GO: 0006629 lipid metabolic process; PDB: 1K8Q_B 1HLG_B.
Probab=20.85 E-value=79 Score=25.42 Aligned_cols=23 Identities=26% Similarity=0.533 Sum_probs=12.6
Q ss_pred CCCcEEEEECCCCCCcccccccc
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEG 785 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~ 785 (1042)
+.+|+||+.+|....+..|+.+|
T Consensus 41 ~~k~pVll~HGL~~ss~~wv~ng 63 (63)
T PF04083_consen 41 KKKPPVLLQHGLLQSSDDWVLNG 63 (63)
T ss_dssp TT--EEEEE--TT--GGGGCSS-
T ss_pred CCCCcEEEECCcccChHHHHcCC
Confidence 45789999999988888887543
No 405
>KOG2029|consensus
Probab=20.69 E-value=1.2e+02 Score=35.85 Aligned_cols=22 Identities=27% Similarity=0.203 Sum_probs=17.6
Q ss_pred CCCcEEEEccchhHHHHHHHHH
Q psy3685 847 AHTKWIAFGGSYPGALAAWLRY 868 (1042)
Q Consensus 847 ~~~~~i~~G~Sygg~la~~~~~ 868 (1042)
.+.|+|.+|||+||.+|--+.+
T Consensus 524 ~~RPivwI~HSmGGLl~K~lLl 545 (697)
T KOG2029|consen 524 DDRPIVWIGHSMGGLLAKKLLL 545 (697)
T ss_pred CCCceEEEecccchHHHHHHHH
Confidence 3568999999999999865543
No 406
>TIGR00162 conserved hypothetical protein TIGR00162. This ortholog set includes MJ1210 from Methanococcus jannaschii and AF0525 from Archaeoglobus fulgidus, but not MJ0106 or AF1251.
Probab=20.67 E-value=8.8e+02 Score=24.43 Aligned_cols=39 Identities=15% Similarity=0.344 Sum_probs=26.7
Q ss_pred CCcEEEEECCCCCCcc--cccccccHHHHHHHcCC-eEEEEe
Q psy3685 764 GGPIFLMIGGEGEASA--KWMVEGTWLDYAHNHNA-LAVQVE 802 (1042)
Q Consensus 764 ~~pi~~~~gg~g~~~~--~~~~~~~~~~la~~~~a-~~~~~e 802 (1042)
+.+|+|++|.+-+..+ .|.....+.+||+++|+ .||.+.
T Consensus 14 ~~~illl~g~e~~~~p~~~~e~a~~vld~a~~~gv~~iitLg 55 (188)
T TIGR00162 14 GTDLIILVGNTQSLSPEGQYELVNAIIDVAKKYGARMIYTLG 55 (188)
T ss_pred CCCEEEEEcCCCCCChhhHHHHHHHHHHHHHHcCCCEEEEec
Confidence 4578888887745443 23455668889999998 566654
No 407
>COG3962 Acetolactate synthase [Amino acid transport and metabolism]
Probab=20.29 E-value=81 Score=35.83 Aligned_cols=43 Identities=16% Similarity=0.290 Sum_probs=29.3
Q ss_pred CCCcEEEEECCCCCCcccccccccHHHHHHHcCCeEEEEeeeeccc
Q psy3685 763 HGGPIFLMIGGEGEASAKWMVEGTWLDYAHNHNALAVQVEHRFYGK 808 (1042)
Q Consensus 763 ~~~pi~~~~gg~g~~~~~~~~~~~~~~la~~~~a~~~~~ehR~~G~ 808 (1042)
|+.+|++++|. |+..+ +++.+...+..-.+-+|+.+|-|+||=
T Consensus 461 pdreV~vmVGD-GSymM--lnSEL~Tsv~~g~Ki~Vvl~DN~GyGC 503 (617)
T COG3962 461 PDREVYVMVGD-GSYMM--LNSELATSVMLGKKIIVVLLDNRGYGC 503 (617)
T ss_pred CCCeEEEEEcc-cchhh--hhHHHHHHHHcCCeEEEEEECCCCcch
Confidence 56779999976 33322 454444445555577999999999994
No 408
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=20.10 E-value=50 Score=30.49 Aligned_cols=23 Identities=35% Similarity=0.613 Sum_probs=19.2
Q ss_pred ccCCccccccCCchhHHHHhhhcCCC
Q psy3685 586 YKCDLCSKKFSRKLNLTKHMKIHQGI 611 (1042)
Q Consensus 586 ~~C~~C~k~F~~~~~L~~H~~~H~~~ 611 (1042)
..|-.+||.|+ +|++|+.+|.|.
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gm 99 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGL 99 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCC
Confidence 46888999885 799999999875
No 409
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=20.08 E-value=66 Score=24.87 Aligned_cols=25 Identities=24% Similarity=0.426 Sum_probs=17.3
Q ss_pred ccCCCCCCccCCcchHHHHHhhccCCCCcccCCCCCC
Q psy3685 683 YKCDLCSKDFSRKDNLTEHKEIHQGIRVSDINHGTLG 719 (1042)
Q Consensus 683 ~~C~~C~k~F~~~~~L~~H~~~H~g~~~~~c~~c~~~ 719 (1042)
-+|+.||+.|.. ++-...|+.|+.+
T Consensus 6 ~~C~~Cg~~~~~------------~dDiVvCp~Cgap 30 (54)
T PF14446_consen 6 CKCPVCGKKFKD------------GDDIVVCPECGAP 30 (54)
T ss_pred ccChhhCCcccC------------CCCEEECCCCCCc
Confidence 478888888863 3445678888764
Done!