Query psy12907
Match_columns 949
No_of_seqs 417 out of 2097
Neff 5.2
Searched_HMMs 46136
Date Fri Aug 16 22:48:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12907.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12907hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03059 beta-galactosidase; P 100.0 4E-128 8E-133 1143.3 48.8 545 173-935 26-726 (840)
2 KOG0496|consensus 100.0 1E-117 3E-122 1018.7 48.8 549 174-938 17-640 (649)
3 PF01301 Glyco_hydro_35: Glyco 100.0 1.1E-83 2.4E-88 709.8 20.4 307 183-494 1-312 (319)
4 COG1874 LacA Beta-galactosidas 100.0 8.5E-42 1.9E-46 402.2 10.1 271 178-463 2-304 (673)
5 PF01301 Glyco_hydro_35: Glyco 100.0 2.1E-29 4.7E-34 278.7 9.2 146 17-162 125-274 (319)
6 PLN03059 beta-galactosidase; P 100.0 6.7E-29 1.4E-33 297.1 11.8 131 17-161 160-299 (840)
7 KOG0496|consensus 99.9 5.5E-22 1.2E-26 230.3 9.4 131 18-159 151-287 (649)
8 PF02449 Glyco_hydro_42: Beta- 99.8 7.3E-20 1.6E-24 206.6 13.6 242 198-457 2-341 (374)
9 PF13364 BetaGal_dom4_5: Beta- 99.0 8.7E-10 1.9E-14 105.3 8.5 65 845-917 32-101 (111)
10 PF02836 Glyco_hydro_2_C: Glyc 98.9 2.2E-08 4.8E-13 110.1 17.6 202 177-418 1-213 (298)
11 PF00150 Cellulase: Cellulase 98.6 6.7E-07 1.4E-11 95.7 14.5 157 186-364 3-172 (281)
12 PRK10150 beta-D-glucuronidase; 98.6 6.5E-06 1.4E-10 99.4 23.8 153 178-360 279-445 (604)
13 PRK09525 lacZ beta-D-galactosi 98.3 4.1E-05 8.8E-10 97.7 20.7 140 173-353 334-479 (1027)
14 PRK10340 ebgA cryptic beta-D-g 98.2 4.2E-05 9E-10 97.6 20.8 192 173-413 318-515 (1021)
15 smart00633 Glyco_10 Glycosyl h 98.0 0.00011 2.4E-09 79.6 14.9 113 229-366 3-127 (254)
16 PLN02161 beta-amylase 97.7 9.5E-05 2.1E-09 86.1 9.3 84 205-291 116-207 (531)
17 PF03198 Glyco_hydro_72: Gluca 97.7 0.00049 1.1E-08 76.6 14.3 160 177-374 11-200 (314)
18 PLN02801 beta-amylase 97.7 0.00022 4.9E-09 83.1 12.2 141 205-353 36-228 (517)
19 COG3250 LacZ Beta-galactosidas 97.7 0.00035 7.5E-09 86.9 14.2 118 177-335 286-409 (808)
20 PF13204 DUF4038: Protein of u 97.7 0.0012 2.7E-08 73.2 17.1 235 182-442 3-274 (289)
21 PLN02705 beta-amylase 97.7 0.00029 6.3E-09 83.3 12.1 117 205-329 267-414 (681)
22 PLN02803 beta-amylase 97.7 0.00013 2.8E-09 85.4 8.9 81 205-291 106-197 (548)
23 PLN00197 beta-amylase; Provisi 97.7 0.00014 3.1E-09 85.3 9.2 81 205-291 126-217 (573)
24 PLN02905 beta-amylase 97.6 0.00036 7.8E-09 82.8 12.2 117 205-329 285-432 (702)
25 TIGR03356 BGL beta-galactosida 97.5 0.00027 5.8E-09 82.5 8.2 97 206-315 54-151 (427)
26 PF01373 Glyco_hydro_14: Glyco 97.4 0.00022 4.7E-09 81.8 5.7 75 207-284 17-96 (402)
27 COG1874 LacA Beta-galactosidas 97.3 4.4E-05 9.5E-10 92.5 -0.8 147 16-165 135-292 (673)
28 COG3693 XynA Beta-1,4-xylanase 97.0 0.011 2.4E-07 66.1 13.9 174 215-411 55-258 (345)
29 PF00331 Glyco_hydro_10: Glyco 96.7 0.0065 1.4E-07 68.4 9.6 199 193-411 11-241 (320)
30 COG2730 BglC Endoglucanase [Ca 96.3 0.015 3.3E-07 67.5 9.8 114 205-335 67-193 (407)
31 PF13364 BetaGal_dom4_5: Beta- 96.3 0.015 3.3E-07 55.8 8.1 79 510-588 24-110 (111)
32 PF00232 Glyco_hydro_1: Glycos 96.1 0.0078 1.7E-07 70.8 6.0 98 206-316 58-157 (455)
33 PRK09852 cryptic 6-phospho-bet 95.9 0.0076 1.7E-07 71.4 4.5 98 206-315 71-170 (474)
34 PF14488 DUF4434: Domain of un 95.8 0.17 3.7E-06 52.1 13.5 127 201-353 15-151 (166)
35 PRK15014 6-phospho-beta-glucos 95.5 0.016 3.4E-07 68.9 5.4 97 207-315 70-168 (477)
36 PRK00870 haloalkane dehalogena 95.5 0.0068 1.5E-07 66.2 2.1 70 601-673 36-149 (302)
37 PLN02998 beta-glucosidase 95.5 0.016 3.4E-07 69.2 5.1 104 206-317 82-186 (497)
38 PLN02814 beta-glucosidase 95.2 0.02 4.4E-07 68.3 5.1 101 206-314 77-178 (504)
39 PRK09593 arb 6-phospho-beta-gl 94.9 0.032 7E-07 66.3 5.5 103 206-316 73-177 (478)
40 PRK09589 celA 6-phospho-beta-g 94.8 0.032 6.8E-07 66.3 5.1 103 206-316 67-171 (476)
41 PF07745 Glyco_hydro_53: Glyco 94.8 0.7 1.5E-05 52.7 15.5 103 209-334 27-136 (332)
42 PLN02849 beta-glucosidase 94.8 0.031 6.7E-07 66.8 4.9 103 206-316 79-182 (503)
43 COG3623 SgaU Putative L-xylulo 94.8 0.51 1.1E-05 51.1 13.3 39 374-413 220-259 (287)
44 PRK13511 6-phospho-beta-galact 94.4 0.046 1E-06 64.8 5.1 97 206-311 54-151 (469)
45 TIGR01233 lacG 6-phospho-beta- 94.4 0.052 1.1E-06 64.4 5.4 96 206-310 53-149 (467)
46 PF02837 Glyco_hydro_2_N: Glyc 94.1 0.1 2.3E-06 52.4 6.3 72 516-587 63-141 (167)
47 PF02638 DUF187: Glycosyl hydr 93.7 0.2 4.4E-06 56.4 8.3 117 204-331 17-162 (311)
48 COG3867 Arabinogalactan endo-1 93.5 2.4 5.1E-05 47.6 15.6 116 207-334 64-182 (403)
49 PF14871 GHL6: Hypothetical gl 93.0 0.45 9.7E-06 47.4 8.5 72 210-288 4-85 (132)
50 PLN02965 Probable pheophorbida 92.7 0.059 1.3E-06 57.4 1.9 58 614-673 5-106 (255)
51 PRK10349 carboxylesterase BioH 92.5 0.051 1.1E-06 57.6 1.2 57 614-673 15-108 (256)
52 PLN02211 methyl indole-3-aceta 92.3 0.1 2.2E-06 57.0 3.3 63 612-676 18-124 (273)
53 COG1649 Uncharacterized protei 91.7 0.78 1.7E-05 53.7 9.6 141 204-352 62-227 (418)
54 PRK10749 lysophospholipase L2; 91.7 0.088 1.9E-06 58.9 1.8 44 601-644 43-86 (330)
55 TIGR02240 PHA_depoly_arom poly 91.4 0.07 1.5E-06 57.4 0.7 60 613-675 26-127 (276)
56 COG2723 BglB Beta-glucosidase/ 91.4 0.25 5.4E-06 58.3 5.1 96 207-314 60-157 (460)
57 PRK10673 acyl-CoA esterase; Pr 90.8 0.13 2.7E-06 54.0 1.8 58 612-672 16-114 (255)
58 PLN02824 hydrolase, alpha/beta 90.6 0.12 2.5E-06 56.2 1.4 59 613-674 30-137 (294)
59 PRK14582 pgaB outer membrane N 89.9 4.5 9.7E-05 50.4 14.2 99 206-317 334-457 (671)
60 PF12697 Abhydrolase_6: Alpha/ 89.8 0.13 2.9E-06 51.0 1.0 59 615-676 1-103 (228)
61 PRK03592 haloalkane dehalogena 89.7 0.15 3.2E-06 55.4 1.3 59 613-674 28-128 (295)
62 PHA02857 monoglyceride lipase; 89.4 0.2 4.3E-06 53.7 2.0 60 615-676 28-134 (276)
63 smart00642 Aamy Alpha-amylase 89.0 0.99 2.1E-05 46.4 6.7 65 206-270 19-95 (166)
64 TIGR03611 RutD pyrimidine util 88.9 0.17 3.6E-06 52.1 1.0 71 603-676 3-117 (257)
65 PF02837 Glyco_hydro_2_N: Glyc 88.6 1 2.2E-05 45.3 6.4 69 846-919 66-136 (167)
66 PRK09936 hypothetical protein; 88.5 1.1 2.4E-05 50.1 6.9 58 201-264 33-91 (296)
67 PLN02679 hydrolase, alpha/beta 88.4 0.2 4.4E-06 56.8 1.4 58 613-673 89-190 (360)
68 TIGR03343 biphenyl_bphD 2-hydr 88.1 0.25 5.4E-06 52.7 1.7 58 613-672 31-134 (282)
69 PRK11126 2-succinyl-6-hydroxy- 88.0 0.25 5.4E-06 51.5 1.6 57 614-673 4-101 (242)
70 TIGR03056 bchO_mg_che_rel puta 88.0 0.33 7.2E-06 51.1 2.5 58 613-673 29-129 (278)
71 TIGR00542 hxl6Piso_put hexulos 88.0 10 0.00022 41.5 14.2 129 205-360 15-151 (279)
72 PLN02385 hydrolase; alpha/beta 87.9 0.28 6E-06 55.2 2.0 29 645-675 170-198 (349)
73 TIGR01738 bioH putative pimelo 87.5 0.19 4.2E-06 50.8 0.4 58 613-673 5-99 (245)
74 PRK13210 putative L-xylulose 5 87.2 38 0.00082 36.8 17.9 169 206-421 16-191 (284)
75 TIGR01250 pro_imino_pep_2 prol 86.9 0.34 7.3E-06 50.5 1.8 33 612-644 25-58 (288)
76 PLN03084 alpha/beta hydrolase 86.8 0.3 6.5E-06 56.6 1.5 67 603-673 119-231 (383)
77 TIGR03695 menH_SHCHC 2-succiny 86.6 0.36 7.8E-06 48.7 1.7 30 614-644 3-32 (251)
78 PRK08775 homoserine O-acetyltr 85.3 0.27 5.8E-06 55.3 0.0 28 645-674 146-173 (343)
79 PLN02578 hydrolase 84.9 0.4 8.7E-06 54.2 1.2 56 614-672 88-185 (354)
80 TIGR02427 protocat_pcaD 3-oxoa 83.9 0.45 9.7E-06 48.2 1.0 32 612-644 13-44 (251)
81 PLN02894 hydrolase, alpha/beta 82.8 0.56 1.2E-05 54.4 1.3 58 613-673 106-210 (402)
82 PF13200 DUF4015: Putative gly 82.4 3.5 7.7E-05 46.9 7.4 108 205-316 12-136 (316)
83 PRK09441 cytoplasmic alpha-amy 82.4 1.9 4.1E-05 51.4 5.5 67 199-265 8-101 (479)
84 PRK06489 hypothetical protein; 82.3 0.53 1.1E-05 53.4 0.8 16 614-629 71-86 (360)
85 PRK03204 haloalkane dehalogena 81.6 0.85 1.9E-05 49.9 2.1 66 602-673 27-135 (286)
86 PF01261 AP_endonuc_2: Xylose 80.1 8.4 0.00018 39.1 8.6 126 212-364 1-134 (213)
87 KOG4178|consensus 80.0 0.92 2E-05 51.4 1.7 60 614-675 46-149 (322)
88 PLN02652 hydrolase; alpha/beta 79.6 0.94 2E-05 52.7 1.7 63 613-675 137-246 (395)
89 PRK14875 acetoin dehydrogenase 79.5 1.1 2.3E-05 50.1 2.1 69 601-673 121-231 (371)
90 PF14307 Glyco_tran_WbsX: Glyc 79.3 16 0.00034 41.9 11.3 55 9-69 21-77 (345)
91 PRK01060 endonuclease IV; Prov 79.1 38 0.00082 37.0 13.8 95 207-328 13-109 (281)
92 PF00128 Alpha-amylase: Alpha 78.5 2.5 5.3E-05 45.5 4.4 56 209-264 7-71 (316)
93 PRK09856 fructoselysine 3-epim 77.1 76 0.0016 34.4 15.4 124 207-357 14-144 (275)
94 PLN02298 hydrolase, alpha/beta 76.9 1.5 3.3E-05 48.7 2.2 30 645-676 142-171 (330)
95 PRK10150 beta-D-glucuronidase; 76.8 5.5 0.00012 48.9 7.2 68 519-586 63-137 (604)
96 TIGR01249 pro_imino_pep_1 prol 74.2 1.3 2.7E-05 48.9 0.7 28 645-674 103-130 (306)
97 TIGR02402 trehalose_TreZ malto 73.8 5.5 0.00012 48.4 6.1 53 209-264 114-179 (542)
98 TIGR01515 branching_enzym alph 72.1 7.5 0.00016 48.0 6.7 57 209-265 159-226 (613)
99 TIGR01392 homoserO_Ac_trn homo 72.0 1.4 3E-05 49.7 0.5 27 645-673 135-161 (351)
100 smart00812 Alpha_L_fucos Alpha 71.6 18 0.0004 42.2 9.4 109 197-316 75-192 (384)
101 PRK13209 L-xylulose 5-phosphat 71.3 88 0.0019 34.1 14.2 168 207-421 22-196 (283)
102 PLN02447 1,4-alpha-glucan-bran 70.4 9 0.0002 48.4 6.9 61 205-265 250-320 (758)
103 PF05913 DUF871: Bacterial pro 70.0 4.2 9.2E-05 47.0 3.7 67 197-269 5-71 (357)
104 TIGR02403 trehalose_treC alpha 69.7 7.1 0.00015 47.5 5.7 57 207-265 28-95 (543)
105 TIGR02631 xylA_Arthro xylose i 67.1 81 0.0018 36.9 13.4 93 204-316 30-127 (382)
106 PLN03087 BODYGUARD 1 domain co 67.0 2.3 4.9E-05 51.0 0.8 59 613-673 202-308 (481)
107 TIGR02104 pulA_typeI pullulana 66.8 9.2 0.0002 47.1 6.0 57 208-265 166-249 (605)
108 KOG4409|consensus 66.6 2.4 5.1E-05 48.7 0.8 59 612-673 90-194 (365)
109 TIGR03234 OH-pyruv-isom hydrox 65.6 1.4E+02 0.0031 32.0 14.2 42 207-262 15-56 (254)
110 PRK12313 glycogen branching en 65.3 12 0.00025 46.4 6.5 53 212-264 177-239 (633)
111 PRK10933 trehalose-6-phosphate 64.7 13 0.00027 45.5 6.5 56 207-265 34-101 (551)
112 PLN02960 alpha-amylase 64.6 13 0.00028 47.6 6.6 57 209-265 420-486 (897)
113 PLN02511 hydrolase 64.5 3.6 7.8E-05 47.6 1.8 31 614-644 102-134 (388)
114 cd00019 AP2Ec AP endonuclease 64.4 1.6E+02 0.0034 32.2 14.4 97 206-330 10-107 (279)
115 TIGR01531 glyc_debranch glycog 62.2 28 0.00061 46.7 9.2 114 182-302 103-236 (1464)
116 PRK05402 glycogen branching en 62.0 15 0.00032 46.4 6.5 50 212-264 272-334 (726)
117 PRK09989 hypothetical protein; 62.0 1.3E+02 0.0029 32.5 13.2 42 208-263 17-58 (258)
118 COG3934 Endo-beta-mannanase [C 61.5 7.3 0.00016 46.4 3.5 155 184-351 4-168 (587)
119 PF02679 ComA: (2R)-phospho-3- 61.5 22 0.00048 39.2 7.0 52 205-266 83-134 (244)
120 PRK09505 malS alpha-amylase; R 61.5 15 0.00033 46.0 6.5 58 208-265 232-312 (683)
121 PF04909 Amidohydro_2: Amidohy 61.0 27 0.00059 37.0 7.6 146 205-367 83-235 (273)
122 PRK12568 glycogen branching en 61.0 16 0.00035 46.1 6.5 55 210-267 274-341 (730)
123 PRK09997 hydroxypyruvate isome 60.9 1.6E+02 0.0034 31.9 13.5 50 197-263 9-58 (258)
124 COG2267 PldB Lysophospholipase 60.9 4 8.6E-05 45.8 1.2 64 613-678 35-146 (298)
125 TIGR02456 treS_nterm trehalose 59.7 13 0.00029 45.0 5.5 57 206-265 28-96 (539)
126 PRK10785 maltodextrin glucosid 58.8 19 0.0004 44.5 6.5 57 208-264 181-245 (598)
127 PF02065 Melibiase: Melibiase; 58.7 83 0.0018 37.1 11.5 120 189-309 41-178 (394)
128 TIGR03101 hydr2_PEP hydrolase, 57.7 4 8.8E-05 45.2 0.6 29 645-675 107-135 (266)
129 PF01229 Glyco_hydro_39: Glyco 55.6 18 0.00039 43.3 5.6 125 194-334 27-166 (486)
130 TIGR03849 arch_ComA phosphosul 55.5 80 0.0017 34.8 9.9 51 206-266 71-121 (237)
131 PRK14706 glycogen branching en 55.4 21 0.00046 44.4 6.3 53 212-264 174-236 (639)
132 PRK05855 short chain dehydroge 54.8 6.5 0.00014 46.6 1.7 31 613-644 26-56 (582)
133 cd06593 GH31_xylosidase_YicI Y 54.6 37 0.0008 38.0 7.5 69 203-271 21-92 (308)
134 PRK14705 glycogen branching en 54.4 23 0.00049 47.2 6.5 54 211-264 771-834 (1224)
135 smart00518 AP2Ec AP endonuclea 54.3 2.6E+02 0.0057 30.3 13.9 92 208-328 12-104 (273)
136 PF06832 BiPBP_C: Penicillin-B 54.3 23 0.00049 32.4 4.8 68 519-589 17-84 (89)
137 cd06602 GH31_MGAM_SI_GAA This 53.3 1.2E+02 0.0026 34.8 11.5 67 205-272 23-93 (339)
138 COG0296 GlgB 1,4-alpha-glucan 52.8 26 0.00056 43.5 6.3 56 205-264 164-233 (628)
139 cd06565 GH20_GcnA-like Glycosy 52.7 2.2E+02 0.0048 32.1 13.3 56 205-264 16-79 (301)
140 cd06592 GH31_glucosidase_KIAA1 48.0 45 0.00097 37.5 6.9 69 200-271 24-96 (303)
141 TIGR02100 glgX_debranch glycog 47.6 27 0.00059 43.9 5.5 55 211-265 189-265 (688)
142 cd04908 ACT_Bt0572_1 N-termina 47.1 58 0.0013 27.9 6.0 55 205-263 12-66 (66)
143 PLN02361 alpha-amylase 47.0 40 0.00087 39.8 6.5 56 209-264 32-95 (401)
144 PRK14510 putative bifunctional 46.9 28 0.0006 46.6 5.7 55 210-264 191-266 (1221)
145 cd06589 GH31 The enzymes of gl 46.2 37 0.00081 37.3 5.8 65 204-269 22-90 (265)
146 TIGR02401 trehalose_TreY malto 46.0 42 0.00091 43.0 6.8 62 206-267 16-87 (825)
147 PF13199 Glyco_hydro_66: Glyco 45.7 45 0.00098 41.0 6.8 83 204-286 116-213 (559)
148 KOG0626|consensus 45.3 41 0.00088 40.9 6.2 114 207-328 92-208 (524)
149 COG1647 Esterase/lipase [Gener 45.0 12 0.00027 40.7 1.8 57 614-673 17-118 (243)
150 PRK13398 3-deoxy-7-phosphohept 44.9 83 0.0018 35.1 8.2 74 184-265 21-98 (266)
151 PLN02980 2-oxoglutarate decarb 44.6 11 0.00024 51.8 1.6 41 603-644 1362-1402(1655)
152 PRK11071 esterase YqiA; Provis 43.9 8.9 0.00019 39.9 0.5 55 614-673 3-92 (190)
153 TIGR03100 hydr1_PEP hydrolase, 43.6 10 0.00022 41.5 0.9 29 645-676 108-136 (274)
154 PF03659 Glyco_hydro_71: Glyco 43.4 1E+02 0.0022 36.2 9.1 53 204-265 15-67 (386)
155 PRK14507 putative bifunctional 43.4 45 0.00097 45.9 6.8 59 206-267 758-829 (1693)
156 TIGR02102 pullulan_Gpos pullul 42.8 44 0.00095 44.3 6.5 21 245-265 555-575 (1111)
157 PRK12677 xylose isomerase; Pro 42.4 1.7E+02 0.0037 34.3 10.7 92 206-316 31-126 (384)
158 PRK07581 hypothetical protein; 42.2 7.6 0.00016 43.4 -0.4 28 645-674 132-159 (339)
159 PRK10340 ebgA cryptic beta-D-g 42.1 83 0.0018 41.5 8.8 67 521-587 109-181 (1021)
160 PLN00196 alpha-amylase; Provis 42.0 55 0.0012 38.9 6.7 57 209-265 47-112 (428)
161 PRK14511 maltooligosyl trehalo 41.6 54 0.0012 42.3 6.8 61 205-268 19-92 (879)
162 PF01055 Glyco_hydro_31: Glyco 41.4 64 0.0014 37.9 7.1 69 205-274 42-112 (441)
163 PF02449 Glyco_hydro_42: Beta- 41.3 79 0.0017 36.4 7.7 108 48-158 212-335 (374)
164 PRK03705 glycogen debranching 40.4 45 0.00098 41.8 5.9 55 211-265 184-262 (658)
165 PF14307 Glyco_tran_WbsX: Glyc 40.0 3.8E+02 0.0082 30.8 12.9 142 198-364 49-197 (345)
166 cd06591 GH31_xylosidase_XylS X 39.9 57 0.0012 37.0 6.1 66 204-270 22-91 (319)
167 KOG1454|consensus 39.8 17 0.00036 41.5 1.9 64 612-677 58-170 (326)
168 COG1306 Uncharacterized conser 39.5 47 0.001 37.8 5.1 61 205-265 76-144 (400)
169 COG3589 Uncharacterized conser 38.4 48 0.001 38.2 5.1 68 196-270 6-74 (360)
170 PF01791 DeoC: DeoC/LacD famil 37.3 14 0.00029 39.9 0.7 53 209-264 79-131 (236)
171 smart00518 AP2Ec AP endonuclea 36.2 4.4E+02 0.0095 28.5 12.1 63 202-266 41-106 (273)
172 PF01120 Alpha_L_fucos: Alpha- 36.0 97 0.0021 35.6 7.3 75 198-278 86-168 (346)
173 COG2159 Predicted metal-depend 35.8 1.4E+02 0.003 33.7 8.3 70 190-267 97-166 (293)
174 PF14587 Glyco_hydr_30_2: O-Gl 35.7 3.1E+02 0.0067 32.5 11.2 118 234-366 93-228 (384)
175 cd06598 GH31_transferase_CtsZ 35.5 77 0.0017 35.9 6.3 67 204-270 22-95 (317)
176 cd06599 GH31_glycosidase_Aec37 34.9 92 0.002 35.3 6.8 67 205-271 28-99 (317)
177 PRK00042 tpiA triosephosphate 34.4 1.7E+02 0.0038 32.4 8.6 90 212-313 79-196 (250)
178 PRK10076 pyruvate formate lyas 34.3 2.2E+02 0.0048 30.6 9.2 125 206-360 54-211 (213)
179 PRK09525 lacZ beta-D-galactosi 33.9 84 0.0018 41.5 7.0 65 520-584 119-190 (1027)
180 PF02055 Glyco_hydro_30: O-Gly 33.8 1.5E+02 0.0033 36.0 8.7 152 247-415 155-341 (496)
181 cd02742 GH20_hexosaminidase Be 33.6 2.5E+02 0.0053 31.7 9.9 59 203-264 13-91 (303)
182 cd06603 GH31_GANC_GANAB_alpha 33.6 80 0.0017 36.1 6.1 67 205-272 23-91 (339)
183 PRK10985 putative hydrolase; P 33.4 23 0.00049 39.7 1.7 31 614-644 60-92 (324)
184 COG0366 AmyA Glycosidases [Car 32.8 58 0.0013 38.2 5.0 55 210-264 33-96 (505)
185 PF00975 Thioesterase: Thioest 32.8 17 0.00037 37.7 0.5 30 614-643 2-31 (229)
186 KOG1689|consensus 32.7 30 0.00064 36.2 2.1 41 881-922 131-172 (221)
187 PRK12858 tagatose 1,6-diphosph 32.1 41 0.0009 38.8 3.5 64 198-264 99-162 (340)
188 cd06600 GH31_MGAM-like This fa 31.9 88 0.0019 35.5 6.0 67 204-271 22-90 (317)
189 PF01261 AP_endonuc_2: Xylose 31.7 1.9E+02 0.0042 29.1 8.0 105 205-336 26-137 (213)
190 KOG2230|consensus 31.6 2.1E+02 0.0046 35.4 9.0 113 181-332 327-444 (867)
191 PF11324 DUF3126: Protein of u 31.5 54 0.0012 29.1 3.3 21 544-564 25-45 (63)
192 cd06601 GH31_lyase_GLase GLase 30.4 2.4E+02 0.0051 32.5 9.1 66 204-270 22-89 (332)
193 cd06563 GH20_chitobiase-like T 29.1 6E+02 0.013 29.4 12.2 59 203-264 15-105 (357)
194 TIGR01838 PHA_synth_I poly(R)- 28.2 39 0.00085 41.3 2.6 66 612-678 188-306 (532)
195 COG2890 HemK Methylase of poly 28.2 3.9E+02 0.0084 30.0 10.2 125 202-366 141-266 (280)
196 TIGR01836 PHA_synth_III_C poly 27.9 31 0.00068 39.0 1.6 31 645-677 144-174 (350)
197 PRK09856 fructoselysine 3-epim 27.9 83 0.0018 34.1 4.8 60 205-268 89-153 (275)
198 PLN02877 alpha-amylase/limit d 27.5 1.1E+02 0.0024 40.0 6.4 20 245-264 466-485 (970)
199 cd06595 GH31_xylosidase_XylS-l 27.4 1.5E+02 0.0032 33.3 6.7 66 204-269 23-98 (292)
200 smart00854 PGA_cap Bacterial c 27.3 8E+02 0.017 26.4 12.4 51 199-262 57-107 (239)
201 KOG4039|consensus 27.2 95 0.0021 33.2 4.7 84 195-284 98-184 (238)
202 PRK01060 endonuclease IV; Prov 27.0 6.1E+02 0.013 27.6 11.4 63 203-266 44-111 (281)
203 PRK08673 3-deoxy-7-phosphohept 26.9 1.7E+02 0.0037 33.9 7.2 75 184-265 87-164 (335)
204 cd06604 GH31_glucosidase_II_Ma 26.2 1.3E+02 0.0028 34.3 6.2 67 204-271 22-90 (339)
205 TIGR02103 pullul_strch alpha-1 25.9 1.2E+02 0.0026 39.6 6.2 20 245-264 404-423 (898)
206 cd06564 GH20_DspB_LnbB-like Gl 25.7 5.8E+02 0.013 29.0 11.2 140 197-358 10-196 (326)
207 cd07381 MPP_CapA CapA and rela 25.5 7.1E+02 0.015 26.6 11.3 131 209-360 67-212 (239)
208 COG1735 Php Predicted metal-de 25.5 2.4E+02 0.0053 32.3 7.8 121 209-364 51-172 (316)
209 PLN02784 alpha-amylase 25.3 1.1E+02 0.0023 39.7 5.5 56 209-264 524-587 (894)
210 PRK09267 flavodoxin FldA; Vali 25.1 4.2E+02 0.0092 26.7 9.1 73 187-262 45-117 (169)
211 TIGR00677 fadh2_euk methylenet 24.7 2E+02 0.0043 32.3 7.1 108 194-316 132-252 (281)
212 cd06562 GH20_HexA_HexB-like Be 24.5 9.7E+02 0.021 27.6 12.8 62 203-264 15-89 (348)
213 TIGR00587 nfo apurinic endonuc 24.2 4E+02 0.0086 29.4 9.3 86 208-316 13-100 (274)
214 PRK07868 acyl-CoA synthetase; 23.8 39 0.00085 44.1 1.6 63 613-676 68-179 (994)
215 PRK12677 xylose isomerase; Pro 23.8 2.8E+02 0.006 32.6 8.3 59 207-266 68-136 (384)
216 PLN02763 hydrolase, hydrolyzin 23.7 3.2E+02 0.007 36.1 9.4 67 205-272 200-268 (978)
217 PRK05077 frsA fermentation/res 23.0 45 0.00098 39.2 1.7 28 645-674 273-300 (414)
218 cd00311 TIM Triosephosphate is 22.9 1.5E+02 0.0033 32.6 5.6 48 213-266 78-125 (242)
219 PRK13210 putative L-xylulose 5 22.8 1.2E+02 0.0025 33.1 4.8 60 205-265 93-153 (284)
220 PTZ00372 endonuclease 4-like p 22.7 1.1E+03 0.023 28.4 12.7 82 183-266 150-240 (413)
221 cd00537 MTHFR Methylenetetrahy 22.3 2.9E+02 0.0062 30.5 7.8 104 197-315 138-250 (274)
222 KOG2024|consensus 22.3 1E+02 0.0023 34.6 4.1 50 508-557 69-129 (297)
223 PRK09875 putative hydrolase; P 22.3 2.5E+02 0.0054 31.8 7.3 62 206-284 34-95 (292)
224 COG1523 PulA Type II secretory 21.7 1.3E+02 0.0028 38.1 5.4 53 212-264 206-284 (697)
225 cd06597 GH31_transferase_CtsY 21.6 2E+02 0.0044 33.0 6.5 74 198-271 13-111 (340)
226 TIGR02635 RhaI_grampos L-rhamn 21.3 1.4E+03 0.03 27.0 14.6 87 205-316 39-128 (378)
227 PF13380 CoA_binding_2: CoA bi 20.8 1.9E+02 0.0041 28.0 5.3 45 202-262 62-106 (116)
228 PRK08227 autoinducer 2 aldolas 20.6 2.6E+02 0.0056 31.4 6.8 47 211-263 99-145 (264)
229 PRK09997 hydroxypyruvate isome 20.4 1.3E+02 0.0027 32.7 4.4 60 206-265 85-144 (258)
230 COG3684 LacD Tagatose-1,6-bisp 20.1 74 0.0016 35.6 2.5 60 201-265 106-167 (306)
231 COG5309 Exo-beta-1,3-glucanase 20.1 1.3E+03 0.029 26.3 15.4 190 206-448 63-279 (305)
No 1
>PLN03059 beta-galactosidase; Provisional
Probab=100.00 E-value=3.7e-128 Score=1143.26 Aligned_cols=545 Identities=25% Similarity=0.442 Sum_probs=440.6
Q ss_pred ceeEEeecCCceEeCCeeeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHH
Q psy12907 173 RKFEIDYERDTFVKDGKPFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLE 252 (949)
Q Consensus 173 ~~f~v~~d~~~f~~dGkpf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~ 252 (949)
...+|+||+++|+|||+|++++||++||||+||++|+|||+||||+|+|||+||||||+|||+||+|||+|++||++||+
T Consensus 26 ~~~~v~~d~~~f~idG~p~~i~sG~iHY~R~~p~~W~d~L~k~Ka~GlNtV~tYV~Wn~HEp~~G~~dF~G~~DL~~Fl~ 105 (840)
T PLN03059 26 GSASVSYDHRAFIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIK 105 (840)
T ss_pred ceeEEEEeCCEEEECCEEEEEEEeCcccCcCCHHHHHHHHHHHHHcCCCeEEEEecccccCCCCCeeeccchHHHHHHHH
Confidence 35689999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHCCceEEecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHH--hccccCCCceEEeccc
Q psy12907 253 LAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRIT--RFLYGNGGPIILVQVE 330 (949)
Q Consensus 253 la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~--~~~~~~GGpII~vQIE 330 (949)
+|||+||+|||||||||||||++||+|.||++ .|+|++||+||+|+++|++|+++|+++|+ ++|+++||||||+|||
T Consensus 106 la~e~GLyvilRpGPYIcAEw~~GGlP~WL~~-~~~i~~Rs~d~~fl~~v~~~~~~l~~~l~~~~l~~~~GGPIImvQIE 184 (840)
T PLN03059 106 VVQAAGLYVHLRIGPYICAEWNFGGFPVWLKY-VPGIEFRTDNGPFKAAMQKFTEKIVDMMKSEKLFEPQGGPIILSQIE 184 (840)
T ss_pred HHHHcCCEEEecCCcceeeeecCCCCchhhhc-CCCcccccCCHHHHHHHHHHHHHHHHHHhhcceeecCCCcEEEEEec
Confidence 99999999999999999999999999999998 89999999999999999999999999996 7899999999999999
Q ss_pred cccCCc-----cchHHHHHHHHHHHHHhcCCeeEEEeeCCCCccccccccCCeeeeecCCCCCChHHHHHHhhhcCCCCC
Q psy12907 331 NEMGSY-----TCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGYLKCTVPGVYATVDFGAATNVSSAFAAMRTVSPHGP 405 (949)
Q Consensus 331 NEyG~y-----~~d~~Y~~~L~~~~~~~lG~~vplft~Dg~~~~~l~~~~~g~~~t~nfg~~~~~~~~f~~~~~~~p~~P 405 (949)
||||+| ++|++||+||+++++++ |++|||||||++. ..++++.|+| | ..++.|.. .++.+|
T Consensus 185 NEYGs~~~~~~~~d~~Yl~~l~~~~~~~-Gi~VPl~t~dg~~------~~~~v~~t~N-g---~~~~~f~~---~~~~~P 250 (840)
T PLN03059 185 NEYGPVEWEIGAPGKAYTKWAADMAVKL-GTGVPWVMCKQED------APDPVIDTCN-G---FYCENFKP---NKDYKP 250 (840)
T ss_pred ccccceecccCcchHHHHHHHHHHHHHc-CCCcceEECCCCC------CCccceecCC-C---chhhhccc---CCCCCC
Confidence 999998 78999999999999987 9999999999862 3578999998 4 23455653 344579
Q ss_pred eeeeecCCCcccccCCCCCCCChHHHHHHHHHHHHcCce-EEEeeccCCCCCCCCCCCCCC---CC-C------------
Q psy12907 406 LVNSEYYPGWLTHWGEKIATVDTDPVVKTLQIMLEMKAN-VNFYMFYGGTNFGFTAGESRA---YL-P------------ 468 (949)
Q Consensus 406 ~~~~Ef~~GWfd~WG~~~~~~~~~~~~~~l~~~l~~gaS-vN~YMfhGGTNFG~~~Gan~~---y~-~------------ 468 (949)
+||+|||+|||+|||++++.+++++++..++++|+.|+| +|||||||||||||||||+.. |. +
T Consensus 251 ~m~tE~w~GWf~~wG~~~~~r~~~d~a~~~~~~l~~g~S~~N~YMfhGGTNFG~~~Ga~~~~TSYDYdAPL~E~G~~t~p 330 (840)
T PLN03059 251 KMWTEAWTGWYTEFGGAVPNRPAEDLAFSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLPREP 330 (840)
T ss_pred cEEeccCchhHhhcCCCCCcCCHHHHHHHHHHHHHcCCeeEEeeeccCcCCcccccCCCccccccccCCccccccCcchh
Confidence 999999999999999999999999999999999999999 699999999999999999842 10 0
Q ss_pred --------------CCCCCCCCCC---------------C--------CCCCc---ccccc------------------c
Q psy12907 469 --------------APSLPPPKPA---------------P--------KADYG---QIILS------------------P 490 (949)
Q Consensus 469 --------------~p~~~~P~~~---------------p--------~~~yg---~v~l~------------------~ 490 (949)
.+ +++|... + +.+|+ .++++ .
T Consensus 331 Ky~~lr~l~~~~~~~~-~~l~~~~p~~~~lg~~~ea~~y~~~~~caaFl~n~~~~~~~~v~f~g~~y~lp~~Svsilpd~ 409 (840)
T PLN03059 331 KWGHLRDLHKAIKLCE-PALVSVDPTVTSLGSNQEAHVFKSKSACAAFLANYDTKYSVKVTFGNGQYDLPPWSVSILPDC 409 (840)
T ss_pred HHHHHHHHHHHHHhcC-ccccCCCCceeccCCceeEEEccCccchhhheeccCCCCceeEEECCcccccCccceeecccc
Confidence 10 1122111 2 45566 56666 4
Q ss_pred ccchhhhhhc------c-CCCc-------------cc---CCCCccccc-------CCCceeEEEEEecCCCCCC-----
Q psy12907 491 AASIFDKVAQ------T-LPPL-------------TS---AFPLSFEAL-------DQSFGFVLYETIIPDARFP----- 535 (949)
Q Consensus 491 ~~~L~~~l~~------~-~~~~-------------~s---~~P~tmE~l-------gQ~~GyvlY~t~i~~~~~~----- 535 (949)
..+||+.+.. . ..++ .+ +.|++||+| +|.+||++|+|.+......
T Consensus 410 ~~~lfnta~v~~q~~~~~~~~~~~~~~w~~~~e~~~~~~~~~~~~~e~l~e~~n~t~d~~dYlwY~t~i~~~~~~~~~~~ 489 (840)
T PLN03059 410 KTAVFNTARLGAQSSQMKMNPVGSTFSWQSYNEETASAYTDDTTTMDGLWEQINVTRDATDYLWYMTEVHIDPDEGFLKT 489 (840)
T ss_pred cceeeeccccccccceeecccccccccceeecccccccccCCCcchhhHHHhhcccCCCCceEEEEEEEeecCCcccccc
Confidence 4466665542 1 1122 33 349999999 9999999999999653321
Q ss_pred -CCcccccCCcceEEEEEECCEEEEEEEecc---eeeee--ccCCCC-CeEEEEEEeCCCCcCCCCC-CCCcCcccccCC
Q psy12907 536 -DPALLTISGLRDRGQVFVDEKLVTILYRNK---MLSTP--IMARPG-QKLSILVENMGRINYGSYL-HDPKGWKVYDCG 607 (949)
Q Consensus 536 -~~~~L~i~~vrDrA~Vfvdg~~vG~l~r~~---~~~i~--~~~~~~-~~LdILVENmGRvNyG~~~-~D~KGI~~~~~g 607 (949)
...+|++.+++|||+|||||+++|+++++. ...++ +....+ ++|+||||||||+|||+.+ .++|||
T Consensus 490 ~~~~~L~v~~~~d~~~vFVNg~~~Gt~~~~~~~~~~~~~~~v~l~~g~n~L~iLse~vG~~NyG~~le~~~kGI------ 563 (840)
T PLN03059 490 GQYPVLTIFSAGHALHVFINGQLAGTVYGELSNPKLTFSQNVKLTVGINKISLLSVAVGLPNVGLHFETWNAGV------ 563 (840)
T ss_pred CCCceEEEcccCcEEEEEECCEEEEEEEeecCCcceEEecccccCCCceEEEEEEEeCCCCccCcccccccccc------
Confidence 123479999999999999999999998852 22332 123333 5899999999999999998 679999
Q ss_pred CCccCCCeEECCCCCCchhhhhhhhhhcccccccccccCCcchhhcccccccccCceeeccccccccccccccccccch-
Q psy12907 608 PKHIRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLSYLGGFLAQKFAEHTVLRPRVVSLFLCNSFYDTSVFNYSETYA- 686 (949)
Q Consensus 608 ~~~~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~dt~~f~~~~~~~- 686 (949)
.++|. |+.. ..| ...+.+| +
T Consensus 564 ----~g~V~----------------i~g~--------~~g---------~~dls~~----------------------~W 584 (840)
T PLN03059 564 ----LGPVT----------------LKGL--------NEG---------TRDLSGW----------------------KW 584 (840)
T ss_pred ----cccEE----------------Eecc--------cCC---------ceecccC----------------------cc
Confidence 99998 3100 012 4566677 6
Q ss_pred hhcccchhhhh--hhhcCCCcccccchhhhhhHHHHHHHHHhhccccccccccccccceeeeeeecccccccccchhhhH
Q psy12907 687 IFRILPCVLLK--KMVTGSLSVENTDVDIQNSIEFMIRKYKLLTNKHLNIETHKKFIKTFVWSVLLYGCETWILNKYETE 764 (949)
Q Consensus 687 ~~~~~~~~~l~--~~~~~~~~~~~~d~~i~~s~~Fmv~~ld~Ls~~dL~~rl~l~~~~~~V~~~l~~gs~~~~~~~~~~~ 764 (949)
.| ++. |+ ++-+.
T Consensus 585 ~y---~lg-L~GE~~~i~-------------------------------------------------------------- 598 (840)
T PLN03059 585 SY---KIG-LKGEALSLH-------------------------------------------------------------- 598 (840)
T ss_pred cc---ccC-ccceecccc--------------------------------------------------------------
Confidence 66 666 32 11000
Q ss_pred HHHHHHHHhhhhhhhhhccchhhhcccccccccccccccCCCCCCccccccccCCCCCCCcccccCCCccccccCCCCCC
Q psy12907 765 RLEAMEMWMKNITEAYTRKTENILLQEHHFHHECNILFYNEPNNGILSEVLLDSKPLSPWSMTQYPLSNLSWIDTAPATN 844 (949)
Q Consensus 765 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 844 (949)
+ .++.. ...|.... ...
T Consensus 599 ---------------------------------------~-----------~~~~~------------~~~W~~~~-~~~ 615 (840)
T PLN03059 599 ---------------------------------------T-----------ITGSS------------SVEWVEGS-LLA 615 (840)
T ss_pred ---------------------------------------c-----------cCCCC------------Cccccccc-ccc
Confidence 0 00000 01232111 011
Q ss_pred CCCCceEEEEEEEccCCCCCCCC-eEEecCCCcceEEEEcCcccccccCCC--------------------------CCe
Q psy12907 845 TTKLPVFYTATFTLNAEHPKPLD-GYVDMSNWAKGVVFINEHNLGKYWTTL--------------------------GPQ 897 (949)
Q Consensus 845 ~~~~p~~~~~~~~~~~~~~~~~d-t~~~~~~w~kG~~~vng~nlGryw~~~--------------------------gpq 897 (949)
....|+||+++|+++ ++.| |||||++||||+|||||+||||||+ + |||
T Consensus 616 ~~~p~twYK~~Fd~p----~g~Dpv~LDm~gmGKG~aWVNG~nIGRYW~-~~a~~~gC~~c~y~g~~~~~kc~~~cggP~ 690 (840)
T PLN03059 616 QKQPLTWYKTTFDAP----GGNDPLALDMSSMGKGQIWINGQSIGRHWP-AYTAHGSCNGCNYAGTFDDKKCRTNCGEPS 690 (840)
T ss_pred CCCCceEEEEEEeCC----CCCCCEEEecccCCCeeEEECCcccccccc-cccccCCCccccccccccchhhhccCCCce
Confidence 122399999999997 5789 9999999999999999999999996 5 999
Q ss_pred eeEE-eccCCCCCCCCCceEEEEEeeCCCCCceEEeeec
Q psy12907 898 LTLY-LPAPFIKPYPEVNRITVLELQAPPSDLKVKFTTE 935 (949)
Q Consensus 898 ~tly-~P~~~l~~~~g~n~i~~~e~~~~~~~~~v~~~~~ 935 (949)
|||| ||++|| |+|.|+|||||+++.. ...|+|.+.
T Consensus 691 q~lYHVPr~~L--k~g~N~lViFEe~gg~-p~~I~~~~~ 726 (840)
T PLN03059 691 QRWYHVPRSWL--KPSGNLLIVFEEWGGN-PAGISLVKR 726 (840)
T ss_pred eEEEeCcHHHh--ccCCceEEEEEecCCC-CCceEEEEe
Confidence 9999 999999 9999999999998743 456777654
No 2
>KOG0496|consensus
Probab=100.00 E-value=1.3e-117 Score=1018.72 Aligned_cols=549 Identities=34% Similarity=0.586 Sum_probs=451.3
Q ss_pred eeEEeecCCceEeCCeeeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHH
Q psy12907 174 KFEIDYERDTFVKDGKPFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLEL 253 (949)
Q Consensus 174 ~f~v~~d~~~f~~dGkpf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~l 253 (949)
++.|+||+++|++||+|+.++||++||+|++|++|+|+|+|+|++|+|+|+||||||.|||+||+|||+|+.||++||++
T Consensus 17 ~~~v~yd~~~~~idG~r~~~isGsIHY~R~~pe~W~~~i~k~k~~Gln~IqtYVfWn~Hep~~g~y~FsG~~DlvkFikl 96 (649)
T KOG0496|consen 17 SFNVTYDKRSLLIDGQRFILISGSIHYPRSTPEMWPDLIKKAKAGGLNVIQTYVFWNLHEPSPGKYDFSGRYDLVKFIKL 96 (649)
T ss_pred eeEEeccccceeecCCeeEEEEeccccccCChhhhHHHHHHHHhcCCceeeeeeecccccCCCCcccccchhHHHHHHHH
Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHCCceEEecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEecccccc
Q psy12907 254 AVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEM 333 (949)
Q Consensus 254 a~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEy 333 (949)
|+++||+|+||+||||||||++||+|.||.. .|.+.+||+|++|++++++|+++|+++|++++++|||||||+||||||
T Consensus 97 ~~~~GLyv~LRiGPyIcaEw~~GG~P~wL~~-~pg~~~Rt~nepfk~~~~~~~~~iv~~mk~L~~~qGGPIIl~QIENEY 175 (649)
T KOG0496|consen 97 IHKAGLYVILRIGPYICAEWNFGGLPWWLRN-VPGIVFRTDNEPFKAEMERWTTKIVPMMKKLFASQGGPIILVQIENEY 175 (649)
T ss_pred HHHCCeEEEecCCCeEEecccCCCcchhhhh-CCceEEecCChHHHHHHHHHHHHHHHHHHHHHhhcCCCEEEEEeechh
Confidence 9999999999999999999999999977776 899999999999999999999999999999999999999999999999
Q ss_pred CCccchHHHHHHHHHHHHHhcCCeeEEEeeCCCCcccccc-ccC--Ce-eeeecCCCCCChHHHHHHhhhcCCCCCeeee
Q psy12907 334 GSYTCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGYLKC-TVP--GV-YATVDFGAATNVSSAFAAMRTVSPHGPLVNS 409 (949)
Q Consensus 334 G~y~~d~~Y~~~L~~~~~~~lG~~vplft~Dg~~~~~l~~-~~~--g~-~~t~nfg~~~~~~~~f~~~~~~~p~~P~~~~ 409 (949)
| .|..++.+..+++++-++.++++++++.++++| +.+ +. ..++| +....+.-.+..+|++|+|++
T Consensus 176 G------~~~~~~~~~~k~y~~w~a~m~~~l~~gvpw~mCk~~dapd~~in~cn-----g~~c~~~f~~pn~~~kP~~wt 244 (649)
T KOG0496|consen 176 G------NYLRALGAEGKSYLKWAAVLATSLGTGVPWVMCKQDDAPDPGINTCN-----GFYCGDTFKRPNSPNKPLVWT 244 (649)
T ss_pred h------HHHHHHHHHHHHhhccceEEEEecCCCCceeEecCCCCCCccccccC-----CccchhhhccCCCCCCCceec
Confidence 9 788999999999999999999999999999999 332 22 22222 222234455678999999999
Q ss_pred ecCCCcccccCCCCCCCChHHHHHHHHHHHHcC-ceEEEeeccCCCCCCCCCC---CCCC--------------------
Q psy12907 410 EYYPGWLTHWGEKIATVDTDPVVKTLQIMLEMK-ANVNFYMFYGGTNFGFTAG---ESRA-------------------- 465 (949)
Q Consensus 410 Ef~~GWfd~WG~~~~~~~~~~~~~~l~~~l~~g-aSvN~YMfhGGTNFG~~~G---an~~-------------------- 465 (949)
|||+|||++||++++.|++++++..+..+++.| +++||||||||||||++|| |+..
T Consensus 245 E~wtgwf~~wGg~~~~R~~e~ia~~va~fls~ggs~vNyYM~hGGTNFGrt~G~~~atsy~~dap~dgl~~~pk~ghlk~ 324 (649)
T KOG0496|consen 245 ENWTGWFTHWGGPHPCRPVEDIALSVARFLSKGGSSVNYYMYHGGTNFGRTNGPFIATSYDYDAPLDGLLRQPKYGHLKP 324 (649)
T ss_pred ccccchhhhhCCCCCCCCHHHHHHHHHHHHhcCccceEEEEeecccCCCcccCcccccccccccccchhhcCCCcccccc
Confidence 999999999999999999999999999999988 5689999999999999999 7630
Q ss_pred -----------------------------------CCCCCCCCCCCCCCCCCCcccccccccchhhhhhcc----CCCcc
Q psy12907 466 -----------------------------------YLPAPSLPPPKPAPKADYGQIILSPAASIFDKVAQT----LPPLT 506 (949)
Q Consensus 466 -----------------------------------y~~~p~~~~P~~~p~~~yg~v~l~~~~~L~~~l~~~----~~~~~ 506 (949)
+.+.|.+++|...|+..|++++++-..++...+..+ .....
T Consensus 325 ~hts~d~~ep~lv~gd~~~~kyg~~~~~C~~Fl~n~~~~~~~~v~f~~~~y~~~~~slsilpdck~~~~nta~~~~~~~~ 404 (649)
T KOG0496|consen 325 LHTSYDYCEPALVAGDITTAKYGNLREACAAFLSNNNGAPAAPVPFNKPKYRLPPWSLSILPDCKTVVYNTAKVMAQWIS 404 (649)
T ss_pred chhhhhhcCccccccCcccccccchhhHHHHHHhcCCCCCCCccccCCCccccCceeEEechhhcchhhhcccccccccc
Confidence 122455566777788888888887766665544222 12456
Q ss_pred cCCCCcccccCCCceeEEEEEecCCCCCCCCcccccC-CcceEEEEEECCEEEEEEEecc-eeeeeccCCC-----CCeE
Q psy12907 507 SAFPLSFEALDQSFGFVLYETIIPDARFPDPALLTIS-GLRDRGQVFVDEKLVTILYRNK-MLSTPIMARP-----GQKL 579 (949)
Q Consensus 507 s~~P~tmE~lgQ~~GyvlY~t~i~~~~~~~~~~L~i~-~vrDrA~Vfvdg~~vG~l~r~~-~~~i~~~~~~-----~~~L 579 (949)
+.+|+++|..+|.+|++||++.++.+.. .+..++|+ .++|++||||||+++|++.++. ...+++.... .++|
T Consensus 405 ~~e~~~~~~~~~~~~~ll~~~~~t~d~s-d~t~~~i~ls~g~~~hVfvNg~~~G~~~g~~~~~~~~~~~~~~l~~g~n~l 483 (649)
T KOG0496|consen 405 FTEPIPSEAVGQSFGGLLEQTNLTKDKS-DTTSLKIPLSLGHALHVFVNGEFAGSLHGNNEKIKLNLSQPVGLKAGENKL 483 (649)
T ss_pred ccCCCccccccCcceEEEEEEeeccccC-CCceEeecccccceEEEEECCEEeeeEeccccceeEEeecccccccCcceE
Confidence 7899999999999999999998876542 23446788 9999999999999999999963 4444443222 2789
Q ss_pred EEEEEeCCCCcCCCCCCCCcCcccccCCCCccCCCeEECCCCCCchhhhhhhhhhcccccccccccCCcchhhccccccc
Q psy12907 580 SILVENMGRINYGSYLHDPKGWKVYDCGPKHIRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLSYLGGFLAQKFAEHTV 659 (949)
Q Consensus 580 dILVENmGRvNyG~~~~D~KGI~~~~~g~~~~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is~lg~~l~q~f~~~~~ 659 (949)
+|||||+||+|||...+|+||| +|+|. |. | ..
T Consensus 484 ~iL~~~~G~~n~G~~e~~~~Gi----------~g~v~----------------l~------------g----------~~ 515 (649)
T KOG0496|consen 484 ALLSENVGLPNYGHFENDFKGI----------LGPVY----------------LN------------G----------LI 515 (649)
T ss_pred EEEEEecCCCCcCccccccccc----------ccceE----------------Ee------------e----------ee
Confidence 9999999999999777899999 99998 41 1 11
Q ss_pred ccCceeeccccccccccccccccccchhhcccchhh-hhhhhcCCCcccccchhhhhhHHHHHHHHHhhccccccccccc
Q psy12907 660 LRPRVVSLFLCNSFYDTSVFNYSETYAIFRILPCVL-LKKMVTGSLSVENTDVDIQNSIEFMIRKYKLLTNKHLNIETHK 738 (949)
Q Consensus 660 ~~~~v~sl~lcn~~~dt~~f~~~~~~~~~~~~~~~~-l~~~~~~~~~~~~~d~~i~~s~~Fmv~~ld~Ls~~dL~~rl~l 738 (949)
...| ..+ |+.+ |+.-.+..+
T Consensus 516 ~l~~----------------------~~w---~~~~gl~ge~~~~~---------------------------------- 536 (649)
T KOG0496|consen 516 DLTW----------------------TKW---PYKVGLKGEKLGLH---------------------------------- 536 (649)
T ss_pred ccce----------------------eec---ceecccccchhhcc----------------------------------
Confidence 2234 333 2220 111000000
Q ss_pred cccceeeeeeecccccccccchhhhHHHHHHHHHhhhhhhhhhccchhhhcccccccccccccccCCCCCCccccccccC
Q psy12907 739 KFIKTFVWSVLLYGCETWILNKYETERLEAMEMWMKNITEAYTRKTENILLQEHHFHHECNILFYNEPNNGILSEVLLDS 818 (949)
Q Consensus 739 ~~~~~~V~~~l~~gs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 818 (949)
+.++
T Consensus 537 ----------------------------------------------------------------------------~~~~ 540 (649)
T KOG0496|consen 537 ----------------------------------------------------------------------------TEEG 540 (649)
T ss_pred ----------------------------------------------------------------------------cccc
Confidence 0001
Q ss_pred CCCCCCcccccCCCccccccCCCCCCCCCCceEEEEEEEccCCCCCCCCeEEecCCCcceEEEEcCcccccccCCCCCee
Q psy12907 819 KPLSPWSMTQYPLSNLSWIDTAPATNTTKLPVFYTATFTLNAEHPKPLDGYVDMSNWAKGVVFINEHNLGKYWTTLGPQL 898 (949)
Q Consensus 819 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~dt~~~~~~w~kG~~~vng~nlGryw~~~gpq~ 898 (949)
.... .|.... ....++|.+|.++|.++++ +.||||||++||||+|||||+||||||+..|||+
T Consensus 541 ~~~v------------~w~~~~--~~~~k~P~~w~k~f~~p~g---~~~t~Ldm~g~GKG~vwVNG~niGRYW~~~G~Q~ 603 (649)
T KOG0496|consen 541 SSKV------------KWKKLS--NTATKQPLTWYKTFDIPSG---SEPTALDMNGWGKGQVWVNGQNIGRYWPSFGPQR 603 (649)
T ss_pred cccc------------ceeecc--CcccCCCeEEEEEecCCCC---CCCeEEecCCCcceEEEECCcccccccCCCCCce
Confidence 0111 222111 1122379999999999864 5699999999999999999999999999899999
Q ss_pred eEEeccCCCCCCCCCceEEEEEeeCCCCCceEEeeecccc
Q psy12907 899 TLYLPAPFIKPYPEVNRITVLELQAPPSDLKVKFTTEHKV 938 (949)
Q Consensus 899 tly~P~~~l~~~~g~n~i~~~e~~~~~~~~~v~~~~~~~~ 938 (949)
|||||++|| |++.|.|||||++. .....|+|++.+..
T Consensus 604 ~yhvPr~~L--k~~~N~lvvfEee~-~~p~~i~~~~~~~~ 640 (649)
T KOG0496|consen 604 TYHVPRSWL--KPSGNLLVVFEEEG-GDPNGISFVTRPVL 640 (649)
T ss_pred EEECcHHHh--CcCCceEEEEEecc-CCCccceEEEeEee
Confidence 999999999 77999999999998 44457999888766
No 3
>PF01301 Glyco_hydro_35: Glycosyl hydrolases family 35; InterPro: IPR001944 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 35 GH35 from CAZY comprises enzymes with only one known activity; beta-galactosidase (3.2.1.23 from EC). Mammalian beta-galactosidase is a lysosomal enzyme (gene GLB1) which cleaves the terminal galactose from gangliosides, glycoproteins, and glycosaminoglycans and whose deficiency is the cause of the genetic disease Gm(1) gangliosidosis (Morquio disease type B).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3OGS_A 3OGV_A 3OGR_A 3OG2_A 1TG7_A 1XC6_A 3THC_C 3THD_D 3D3A_A 4E8D_B ....
Probab=100.00 E-value=1.1e-83 Score=709.77 Aligned_cols=307 Identities=47% Similarity=0.856 Sum_probs=246.5
Q ss_pred ceEeCCeeeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE
Q psy12907 183 TFVKDGKPFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL 262 (949)
Q Consensus 183 ~f~~dGkpf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI 262 (949)
+|+|||+|++++|||+||||+|+++|+|+|+||||+|+|||++||+||+|||+||+|||+|.+||++||++|+|+||+||
T Consensus 1 ~~~~~g~~~~~~~Ge~hy~r~p~~~W~~~l~k~ka~G~n~v~~yv~W~~he~~~g~~df~g~~dl~~f~~~a~~~gl~vi 80 (319)
T PF01301_consen 1 SFLIDGKPFFILSGEFHYFRIPPEYWRDRLQKMKAAGLNTVSTYVPWNLHEPEEGQFDFTGNRDLDRFLDLAQENGLYVI 80 (319)
T ss_dssp CEEETTEEE-EEEEEE-GGGS-GGGHHHHHHHHHHTT-SEEEEE--HHHHSSBTTB---SGGG-HHHHHHHHHHTT-EEE
T ss_pred CeEECCEEEEEEEeeeccccCChhHHHHHHHHHHhCCcceEEEeccccccCCCCCcccccchhhHHHHHHHHHHcCcEEE
Confidence 58999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCccchHHH
Q psy12907 263 LRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSYTCDKEH 342 (949)
Q Consensus 263 LRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y~~d~~Y 342 (949)
|||||||||||++||+|.||++ ++++++||+||.|+++|++|+++|+++++++|+++||||||+|||||||++++|++|
T Consensus 81 lrpGpyi~aE~~~gG~P~Wl~~-~~~~~~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~GGpII~vQvENEyg~~~~~~~Y 159 (319)
T PF01301_consen 81 LRPGPYICAEWDNGGLPAWLLR-KPDIRLRTNDPPFLEAVERWYRALAKIIKPLQYTNGGPIIMVQVENEYGSYGTDRAY 159 (319)
T ss_dssp EEEES---TTBGGGG--GGGGG-STTS-SSSS-HHHHHHHHHHHHHHHHHHGGGBGGGTSSEEEEEESSSGGCTSS-HHH
T ss_pred ecccceecccccchhhhhhhhc-cccccccccchhHHHHHHHHHHHHHHHHHhhhhcCCCceehhhhhhhhCCCcccHhH
Confidence 9999999999999999999999 779999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhcCCe-eEEEeeCCCCcccccc-ccCC--eeeeecCCCCCChHHHHHHhhhcCCCCCeeeeecCCCcccc
Q psy12907 343 MIWLRDQMKYYVRDA-AVLYTTDGGGVGYLKC-TVPG--VYATVDFGAATNVSSAFAAMRTVSPHGPLVNSEYYPGWLTH 418 (949)
Q Consensus 343 ~~~L~~~~~~~lG~~-vplft~Dg~~~~~l~~-~~~g--~~~t~nfg~~~~~~~~f~~~~~~~p~~P~~~~Ef~~GWfd~ 418 (949)
|+.|++++++. |++ ++++|+|+++..+..| .+|+ ++++.+|+++.++.+.|..+++.+|++|+|++|||+|||++
T Consensus 160 ~~~l~~~~~~~-g~~~~~~~t~d~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~p~~P~~~~E~~~Gwf~~ 238 (319)
T PF01301_consen 160 MEALKDAYRDW-GIDPVLLYTTDGPWGSWLPDGGLPGADIYATDNFPPGDNPDEYFGDQRSFQPNQPLMCTEFWGGWFDH 238 (319)
T ss_dssp HHHHHHHHHHT-T-SSSBEEEEESSSHCCHCCC-TTTGSCEEEEEETTTSSHHHHHHHHHHHHTTS--EEEEEESS---B
T ss_pred HHHHHHHHHHh-hCccceeeccCCCcccccccCCCCcceEEeccccCCCchHHHHHhhhhhcCCCCCeEEEEeccccccc
Confidence 99999999998 776 7889999987777777 7888 89999999876667889999999999999999999999999
Q ss_pred cCCCCCCCChHHHHHHHHHHHHcCceEEEeeccCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCcccccccccch
Q psy12907 419 WGEKIATVDTDPVVKTLQIMLEMKANVNFYMFYGGTNFGFTAGESRAYLPAP-SLPPPKPAPKADYGQIILSPAASI 494 (949)
Q Consensus 419 WG~~~~~~~~~~~~~~l~~~l~~gaSvN~YMfhGGTNFG~~~Gan~~y~~~p-~~~~P~~~p~~~yg~v~l~~~~~L 494 (949)
||++++.+++++++..++++++.|+++||||||||||||+++|++..- .| .......+|..++|.++ .+...|
T Consensus 239 WG~~~~~~~~~~~~~~l~~~l~~g~~~nyYM~hGGTNfG~~~ga~~~~--~p~~TSYDY~ApI~E~G~~~-~Ky~~l 312 (319)
T PF01301_consen 239 WGGPHYTRPAEDVAADLARMLSKGNSLNYYMFHGGTNFGFWAGANYYG--QPDITSYDYDAPIDEYGQLT-PKYYEL 312 (319)
T ss_dssp TTS--HHHHHHHHHHHHHHHHHHCSEEEEEECE--B--TT-B-EETTT--EEB-SB--TT-SB-TTS-B--HHHHHH
T ss_pred cCCCCccCCHHHHHHHHHHHHHhhcccceeeccccCCccccccCCCCC--CCCcccCCcCCccCcCCCcC-HHHHHH
Confidence 999999999999999999999999999999999999999999998531 11 12345567888999888 544433
No 4
>COG1874 LacA Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=100.00 E-value=8.5e-42 Score=402.17 Aligned_cols=271 Identities=26% Similarity=0.426 Sum_probs=204.0
Q ss_pred eecCCceEeCCeeeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEE-eecCCccCCCCCeeeccCchhHHHHHHHHHH
Q psy12907 178 DYERDTFVKDGKPFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVST-YVEWSFHEPSPGQFQFTGDQDLEYFLELAVR 256 (949)
Q Consensus 178 ~~d~~~f~~dGkpf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~t-Yv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e 256 (949)
.+++..+++||++++++||++||+|+|++.|.|||+|||++|+|+|++ |+.||+|||++|+|||+ ++|+. |+++|++
T Consensus 2 ~~~~~~~~~dg~~~~l~gG~y~p~~~p~~~w~ddl~~mk~~G~N~V~ig~faW~~~eP~eG~fdf~-~~D~~-~l~~a~~ 79 (673)
T COG1874 2 SYDGYSFIRDGRRILLYGGDYYPERWPRETWMDDLRKMKALGLNTVRIGYFAWNLHEPEEGKFDFT-WLDEI-FLERAYK 79 (673)
T ss_pred cccccceeeCCceeEEeccccChHHCCHHHHHHHHHHHHHhCCCeeEeeeEEeeccCccccccCcc-cchHH-HHHHHHh
Confidence 356888999999999999999999999999999999999999999999 99999999999999999 78999 9999999
Q ss_pred CCceEEecCCc-ccccccCCCCccccccccCCCeeee---------cCCHhHHHHHHHHHHHHHHHHHhccccCCCceEE
Q psy12907 257 EDLLVLLRPGP-YICAERDFGGLPSWLMTIKPNITLR---------SKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIIL 326 (949)
Q Consensus 257 ~GL~VILRpGP-YIcAEw~~GGlP~WL~~~~p~~~~R---------t~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~ 326 (949)
.||+||||||| ..|++|..++.|+||.. ++.-..| .++|.|++++++ |+++|+++.+.++++||+
T Consensus 80 ~Gl~vil~t~P~g~~P~Wl~~~~PeiL~~-~~~~~~~~~g~r~~~~~~~~~Yr~~~~~----i~~~irer~~~~~~~v~~ 154 (673)
T COG1874 80 AGLYVILRTGPTGAPPAWLAKKYPEILAV-DENGRVRSDGARENICPVSPVYREYLDR----ILQQIRERLYGNGPAVIT 154 (673)
T ss_pred cCceEEEecCCCCCCchHHhcCChhheEe-cCCCcccCCCcccccccccHHHHHHHHH----HHHHHHHHHhccCCceeE
Confidence 99999999999 99999999999999987 5553333 467889988887 788888888899999999
Q ss_pred eccccccCCcc-----chHHHHHHHHHHHHHhcCCeeEEEeeCCCCcccccc-ccCC--eeeeec-CCCCCChHHHHHHh
Q psy12907 327 VQVENEMGSYT-----CDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGYLKC-TVPG--VYATVD-FGAATNVSSAFAAM 397 (949)
Q Consensus 327 vQIENEyG~y~-----~d~~Y~~~L~~~~~~~lG~~vplft~Dg~~~~~l~~-~~~g--~~~t~n-fg~~~~~~~~f~~~ 397 (949)
||++||||++. |++.+..||++.+ | .+...+.++...+.. +... .+.+.+ ++.. .....+-..
T Consensus 155 w~~dneY~~~~~~~~~~~~~f~~wLk~~y----g---~l~~ln~~w~t~~ws~t~~~~~~i~~p~~~~e~-~~~~~~ld~ 226 (673)
T COG1874 155 WQNDNEYGGHPCYCDYCQAAFRLWLKKGY----G---SLDNLNEAWGTSFWSHTYKDFDEIMSPNPFGEL-PLPGLYLDY 226 (673)
T ss_pred EEccCccCCccccccccHHHHHHHHHhCc----c---hHHhhhhhhhhhhcccccccHHhhcCCCCcccc-CCccchhhH
Confidence 99999999954 4556666666554 2 222233222222211 2111 111122 3320 111111222
Q ss_pred hhcCCCC----CeeeeecCCCcc-cccCCCCCCCC-hHHHHHHHHHHHHcCceEEEeeccCCCCCC------CCCCCC
Q psy12907 398 RTVSPHG----PLVNSEYYPGWL-THWGEKIATVD-TDPVVKTLQIMLEMKANVNFYMFYGGTNFG------FTAGES 463 (949)
Q Consensus 398 ~~~~p~~----P~~~~Ef~~GWf-d~WG~~~~~~~-~~~~~~~l~~~l~~gaSvN~YMfhGGTNFG------~~~Gan 463 (949)
+++...+ +..+.|.+-+|| +.|..+....+ .+.-...+.+.+....+-||||+|+|++|+ |++|+.
T Consensus 227 ~~f~~e~~~~~~~~~~~~~~~~~P~~pvt~nl~~~~~~~~~~~~~~~ld~~swdny~~~~~~~~~~~~~h~l~r~~~~ 304 (673)
T COG1874 227 RRFESEQILEFVREEGEAIKAYFPNRPVTPNLLAAFKKFDAYKWEKVLDFASWDNYPAWHRGRDFTKFIHDLFRNGKQ 304 (673)
T ss_pred hhhhhhhhHHHHHHHHHHHHHhCCCCCCChhHhhhhhhcchHHHHHhcChhhhhhhhhhccccchhhhhHHHHHhhcc
Confidence 3343333 778888899999 88876655444 344444567777777667999999999999 999988
No 5
>PF01301 Glyco_hydro_35: Glycosyl hydrolases family 35; InterPro: IPR001944 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 35 GH35 from CAZY comprises enzymes with only one known activity; beta-galactosidase (3.2.1.23 from EC). Mammalian beta-galactosidase is a lysosomal enzyme (gene GLB1) which cleaves the terminal galactose from gangliosides, glycoproteins, and glycosaminoglycans and whose deficiency is the cause of the genetic disease Gm(1) gangliosidosis (Morquio disease type B).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3OGS_A 3OGV_A 3OGR_A 3OG2_A 1TG7_A 1XC6_A 3THC_C 3THD_D 3D3A_A 4E8D_B ....
Probab=99.96 E-value=2.1e-29 Score=278.75 Aligned_cols=146 Identities=34% Similarity=0.568 Sum_probs=122.3
Q ss_pred hhhhcccCCccC-CCcEEEEeeccCCCCCCCCHHHHHHHHHHHHHhCCCeeEEEeeCCCCCCccc-cCCCC--ceeeccC
Q psy12907 17 TNLQKTRPTTYA-PGEIFPFQVENEMGSYTCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGYLK-CTVPG--VYATVDF 92 (949)
Q Consensus 17 ~~~~~iap~qit-GGPIImvQVENEYGsyg~D~~YM~~L~d~~r~~Gi~~VPL~T~Dg~~~~~~~-G~lpg--v~~tvnf 92 (949)
.++++++|+|++ ||||||||||||||++++|++||+.|++++++.|++.++++++|+++..... +.+|+ ++.+.+|
T Consensus 125 ~~~~~~~~~~~~~GGpII~vQvENEyg~~~~~~~Y~~~l~~~~~~~g~~~~~~~t~d~~~~~~~~~~~~~g~~~~~~~~~ 204 (319)
T PF01301_consen 125 ALAKIIKPLQYTNGGPIIMVQVENEYGSYGTDRAYMEALKDAYRDWGIDPVLLYTTDGPWGSWLPDGGLPGADIYATDNF 204 (319)
T ss_dssp HHHHHHGGGBGGGTSSEEEEEESSSGGCTSS-HHHHHHHHHHHHHTT-SSSBEEEEESSSHCCHCCC-TTTGSCEEEEEE
T ss_pred HHHHHHHhhhhcCCCceehhhhhhhhCCCcccHhHHHHHHHHHHHhhCccceeeccCCCcccccccCCCCcceEEecccc
Confidence 468999999999 9999999999999999999999999999999999987889999987654444 47888 8999999
Q ss_pred CCCCCHHHHHHHhhhcCCCCCceeeeeCCccccccCCCCccCChHHHHHHHHHHHHcccceeeeccccce
Q psy12907 93 GAATNVSSAFAAMRTVSPHGPLVNSEYYPGWLTHWGEKIATVDTDPVVKTLQIMLEMKANVNFYIPFSPF 162 (949)
Q Consensus 93 g~~~~~~~~~~~l~~~~P~~P~m~~Ef~~GWFD~WG~~~~~~~~~~~~~~l~~~L~~GaS~NlYM~h~~~ 162 (949)
.+..++.+++..+++.+|++|+|++|||+||||+||++++.+|+++++..++++++.|.++||||+||+.
T Consensus 205 ~~~~~~~~~~~~~~~~~p~~P~~~~E~~~Gwf~~WG~~~~~~~~~~~~~~l~~~l~~g~~~nyYM~hGGT 274 (319)
T PF01301_consen 205 PPGDNPDEYFGDQRSFQPNQPLMCTEFWGGWFDHWGGPHYTRPAEDVAADLARMLSKGNSLNYYMFHGGT 274 (319)
T ss_dssp TTTSSHHHHHHHHHHHHTTS--EEEEEESS---BTTS--HHHHHHHHHHHHHHHHHHCSEEEEEECE--B
T ss_pred CCCchHHHHHhhhhhcCCCCCeEEEEeccccccccCCCCccCCHHHHHHHHHHHHHhhcccceeeccccC
Confidence 8765567889999999999999999999999999999999999999999999999999999999999774
No 6
>PLN03059 beta-galactosidase; Provisional
Probab=99.95 E-value=6.7e-29 Score=297.06 Aligned_cols=131 Identities=17% Similarity=0.244 Sum_probs=113.9
Q ss_pred hhhhccc--CCccC-CCcEEEEeeccCCCCC-----CCCHHHHHHHHHHHHHhCCCeeEEEeeCCCCCCccccCCCCcee
Q psy12907 17 TNLQKTR--PTTYA-PGEIFPFQVENEMGSY-----TCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGYLKCTVPGVYA 88 (949)
Q Consensus 17 ~~~~~ia--p~qit-GGPIImvQVENEYGsy-----g~D~~YM~~L~d~~r~~Gi~~VPL~T~Dg~~~~~~~G~lpgv~~ 88 (949)
.++++++ ++|++ ||||||+|||||||+| ++|++||++|+++++++|++ |||||||++. .+++++.
T Consensus 160 ~l~~~l~~~~l~~~~GGPIImvQIENEYGs~~~~~~~~d~~Yl~~l~~~~~~~Gi~-VPl~t~dg~~------~~~~v~~ 232 (840)
T PLN03059 160 KIVDMMKSEKLFEPQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAADMAVKLGTG-VPWVMCKQED------APDPVID 232 (840)
T ss_pred HHHHHHhhcceeecCCCcEEEEEecccccceecccCcchHHHHHHHHHHHHHcCCC-cceEECCCCC------CCcccee
Confidence 4567784 78888 9999999999999998 78999999999999999998 9999999762 4578999
Q ss_pred eccCCCCCCHHHHHHHhhhcCCCCCceeeeeCCccccccCCCCccCChHHHHHHHHHHHHcccc-eeeeccccc
Q psy12907 89 TVDFGAATNVSSAFAAMRTVSPHGPLVNSEYYPGWLTHWGEKIATVDTDPVVKTLQIMLEMKAN-VNFYIPFSP 161 (949)
Q Consensus 89 tvnfg~~~~~~~~~~~l~~~~P~~P~m~~Ef~~GWFD~WG~~~~~~~~~~~~~~l~~~L~~GaS-~NlYM~h~~ 161 (949)
|+| +. .++.|. ..++.+|+|++|||+|||||||++++.+++++++..++++|+.|+| +|||||||+
T Consensus 233 t~N-g~---~~~~f~---~~~~~~P~m~tE~w~GWf~~wG~~~~~r~~~d~a~~~~~~l~~g~S~~N~YMfhGG 299 (840)
T PLN03059 233 TCN-GF---YCENFK---PNKDYKPKMWTEAWTGWYTEFGGAVPNRPAEDLAFSVARFIQNGGSFINYYMYHGG 299 (840)
T ss_pred cCC-Cc---hhhhcc---cCCCCCCcEEeccCchhHhhcCCCCCcCCHHHHHHHHHHHHHcCCeeEEeeeccCc
Confidence 999 32 234443 2455689999999999999999999999999999999999999999 799999955
No 7
>KOG0496|consensus
Probab=99.86 E-value=5.5e-22 Score=230.29 Aligned_cols=131 Identities=29% Similarity=0.447 Sum_probs=100.7
Q ss_pred hhhcccCCccC-CCcEEEEeeccCCCCCCCCHHHHHHHHHHHHHhCCCeeEEEeeCCCCCCcccc---CCCC-ceeeccC
Q psy12907 18 NLQKTRPTTYA-PGEIFPFQVENEMGSYTCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGYLKC---TVPG-VYATVDF 92 (949)
Q Consensus 18 ~~~~iap~qit-GGPIImvQVENEYGsyg~D~~YM~~L~d~~r~~Gi~~VPL~T~Dg~~~~~~~G---~lpg-v~~tvnf 92 (949)
++++++|++++ ||||||+||||||| .|.+++++..+++-..++.++++++.....+.| ..|+ +..+||
T Consensus 151 iv~~mk~L~~~qGGPIIl~QIENEYG------~~~~~~~~~~k~y~~w~a~m~~~l~~gvpw~mCk~~dapd~~in~cn- 223 (649)
T KOG0496|consen 151 IVPMMKKLFASQGGPIILVQIENEYG------NYLRALGAEGKSYLKWAAVLATSLGTGVPWVMCKQDDAPDPGINTCN- 223 (649)
T ss_pred HHHHHHHHHhhcCCCEEEEEeechhh------HHHHHHHHHHHHhhccceEEEEecCCCCceeEecCCCCCCccccccC-
Confidence 55666699999 99999999999999 566777777777665557778877654323223 2333 233333
Q ss_pred CCCCCHHHHHHHhhhcCCCCCceeeeeCCccccccCCCCccCChHHHHHHHHHHHHcc-cceeeeccc
Q psy12907 93 GAATNVSSAFAAMRTVSPHGPLVNSEYYPGWLTHWGEKIATVDTDPVVKTLQIMLEMK-ANVNFYIPF 159 (949)
Q Consensus 93 g~~~~~~~~~~~l~~~~P~~P~m~~Ef~~GWFD~WG~~~~~~~~~~~~~~l~~~L~~G-aS~NlYM~h 159 (949)
. ....+...+..+|++|.|++|||+|||++||++++.+++++++..+..+++.| +++|+||+|
T Consensus 224 --g--~~c~~~f~~pn~~~kP~~wtE~wtgwf~~wGg~~~~R~~e~ia~~va~fls~ggs~vNyYM~h 287 (649)
T KOG0496|consen 224 --G--FYCGDTFKRPNSPNKPLVWTENWTGWFTHWGGPHPCRPVEDIALSVARFLSKGGSSVNYYMYH 287 (649)
T ss_pred --C--ccchhhhccCCCCCCCceecccccchhhhhCCCCCCCCHHHHHHHHHHHHhcCccceEEEEee
Confidence 1 12234445578999999999999999999999999999999999999999999 689999999
No 8
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=99.81 E-value=7.3e-20 Score=206.58 Aligned_cols=242 Identities=23% Similarity=0.392 Sum_probs=147.3
Q ss_pred cCCCCCChhhHHHHHHHHHHcCCCEEEE-eecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCC
Q psy12907 198 FHYFRSPRYYWQDRLRKLRAAGLNAVST-YVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFG 276 (949)
Q Consensus 198 ~HY~R~p~~~W~d~L~k~Ka~GlNtV~t-Yv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~G 276 (949)
+++.++|++.|+++|++||++|+|+|++ .+.|+..||+||+|||+. |+++|++|+++||+|||+.. .+
T Consensus 2 y~pe~~~~e~~~~d~~~m~~~G~n~vri~~~~W~~lEP~eG~ydF~~---lD~~l~~a~~~Gi~viL~~~--------~~ 70 (374)
T PF02449_consen 2 YYPEQWPEEEWEEDLRLMKEAGFNTVRIGEFSWSWLEPEEGQYDFSW---LDRVLDLAAKHGIKVILGTP--------TA 70 (374)
T ss_dssp --GGGS-CCHHHHHHHHHHHHT-SEEEE-CCEHHHH-SBTTB---HH---HHHHHHHHHCTT-EEEEEEC--------TT
T ss_pred CCcccCCHHHHHHHHHHHHHcCCCEEEEEEechhhccCCCCeeecHH---HHHHHHHHHhccCeEEEEec--------cc
Confidence 5788999999999999999999999996 678999999999999986 99999999999999999854 57
Q ss_pred CccccccccCCCeeee----------------cCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCC---c-
Q psy12907 277 GLPSWLMTIKPNITLR----------------SKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGS---Y- 336 (949)
Q Consensus 277 GlP~WL~~~~p~~~~R----------------t~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~---y- 336 (949)
..|.||.+++|++... .++|.|++++++++++|+++.+.+ ..||++||+||+|. |
T Consensus 71 ~~P~Wl~~~~Pe~~~~~~~g~~~~~g~~~~~~~~~p~yr~~~~~~~~~l~~~y~~~-----p~vi~~~i~NE~~~~~~~~ 145 (374)
T PF02449_consen 71 APPAWLYDKYPEILPVDADGRRRGFGSRQHYCPNSPAYREYARRFIRALAERYGDH-----PAVIGWQIDNEPGYHRCYS 145 (374)
T ss_dssp TS-HHHHCCSGCCC-B-TTTSBEECCCSTT-HCCHHHHHHHHHHHHHHHHHHHTTT-----TTEEEEEECCSTTCTS--S
T ss_pred ccccchhhhcccccccCCCCCcCccCCccccchhHHHHHHHHHHHHHHHHhhcccc-----ceEEEEEeccccCcCcCCC
Confidence 8999999888886431 367899999999888887777643 47999999999976 2
Q ss_pred -cchHHHHHHHHHHHHHh------cCCe-----------e--EEEeeCCCCc----------------------cccccc
Q psy12907 337 -TCDKEHMIWLRDQMKYY------VRDA-----------A--VLYTTDGGGV----------------------GYLKCT 374 (949)
Q Consensus 337 -~~d~~Y~~~L~~~~~~~------lG~~-----------v--plft~Dg~~~----------------------~~l~~~ 374 (949)
.|.++|.+||++++... =|.+ | |..+.+.... ..++..
T Consensus 146 ~~~~~~f~~wLk~kY~ti~~LN~aWgt~~ws~~~~~f~~v~~P~~~~~~~~~~~~~D~~rF~~~~~~~~~~~~~~~ir~~ 225 (374)
T PF02449_consen 146 PACQAAFRQWLKEKYGTIEALNRAWGTAFWSQRYSSFDEVPPPRPTSSPENPAQWLDWYRFQSDRVAEFFRWQADIIREY 225 (374)
T ss_dssp HHHHHHHHHHHHHHHSSHHHHHHHHTTTGGG---SSGGG---S-S-SS---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHhCCHHHHHHHHcCCcccCccCcHHhcCCCCCCCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45688999999887420 0111 1 1112111000 000111
Q ss_pred cCCeeeeecCCCC-------------------------------C---ChHHHHHHhhhcCCCCCeeeeecCCCcccccC
Q psy12907 375 VPGVYATVDFGAA-------------------------------T---NVSSAFAAMRTVSPHGPLVNSEYYPGWLTHWG 420 (949)
Q Consensus 375 ~~g~~~t~nfg~~-------------------------------~---~~~~~f~~~~~~~p~~P~~~~Ef~~GWfd~WG 420 (949)
.|+...|.|+... . ...-..+..|...+++|.+++|..+| -..|+
T Consensus 226 ~p~~~vt~n~~~~~~~~~d~~~~a~~~D~~~~d~Y~~~~~~~~~~~~~~~a~~~dl~R~~~~~kpf~v~E~~~g-~~~~~ 304 (374)
T PF02449_consen 226 DPDHPVTTNFMGSWFNGIDYFKWAKYLDVVSWDSYPDGSFDFYDDDPYSLAFNHDLMRSLAKGKPFWVMEQQPG-PVNWR 304 (374)
T ss_dssp STT-EEE-EE-TT---SS-HHHHGGGSSSEEEEE-HHHHHTTTT--TTHHHHHHHHHHHHTTT--EEEEEE--S---SSS
T ss_pred CCCceEEeCccccccCcCCHHHHHhhCCcceeccccCcccCCCCCCHHHHHHHHHHHHhhcCCCceEeecCCCC-CCCCc
Confidence 2333333222110 0 01112234566678899999999999 66676
Q ss_pred CCCCCCChHHHHHHHHHHHHcCce-EEEeeccCCCCCC
Q psy12907 421 EKIATVDTDPVVKTLQIMLEMKAN-VNFYMFYGGTNFG 457 (949)
Q Consensus 421 ~~~~~~~~~~~~~~l~~~l~~gaS-vN~YMfhGGTNFG 457 (949)
.......+..+....-.-++.|+. +.|+=+ ..--+|
T Consensus 305 ~~~~~~~pg~~~~~~~~~~A~Ga~~i~~~~w-r~~~~g 341 (374)
T PF02449_consen 305 PYNRPPRPGELRLWSWQAIAHGADGILFWQW-RQSRFG 341 (374)
T ss_dssp SS-----TTHHHHHHHHHHHTT-S-EEEC-S-B--SSS
T ss_pred cCCCCCCCCHHHHHHHHHHHHhCCeeEeeec-cCCCCC
Confidence 655555566676666677889987 566555 444444
No 9
>PF13364 BetaGal_dom4_5: Beta-galactosidase jelly roll domain; PDB: 1TG7_A 1XC6_A 3OGS_A 3OGV_A 3OGR_A 3OG2_A.
Probab=99.01 E-value=8.7e-10 Score=105.27 Aligned_cols=65 Identities=29% Similarity=0.433 Sum_probs=52.7
Q ss_pred CCCCceEEEEEEEccCCCCCCCCeEEe-----cCCCcceEEEEcCcccccccCCCCCeeeEEeccCCCCCCCCCceEE
Q psy12907 845 TTKLPVFYTATFTLNAEHPKPLDGYVD-----MSNWAKGVVFINEHNLGKYWTTLGPQLTLYLPAPFIKPYPEVNRIT 917 (949)
Q Consensus 845 ~~~~p~~~~~~~~~~~~~~~~~dt~~~-----~~~w~kG~~~vng~nlGryw~~~gpq~tly~P~~~l~~~~g~n~i~ 917 (949)
...++.|||++|+.. ..|+.|. ...+.+..|||||++|||||+.+|||.|+.||+.+| +.+ |.|+
T Consensus 32 ~~~g~~~Yrg~F~~~-----~~~~~~~~l~~~~g~~~~~~vwVNG~~~G~~~~~~g~q~tf~~p~~il--~~~-n~v~ 101 (111)
T PF13364_consen 32 FHAGYLWYRGTFTGT-----GQDTSLTPLNIQGGNAFRASVWVNGWFLGSYWPGIGPQTTFSVPAGIL--KYG-NNVL 101 (111)
T ss_dssp SSSCEEEEEEEEETT-----TEEEEEE-EEECSSTTEEEEEEETTEEEEEEETTTECCEEEEE-BTTB--TTC-EEEE
T ss_pred cCCCCEEEEEEEeCC-----CcceeEEEEeccCCCceEEEEEECCEEeeeecCCCCccEEEEeCceee--cCC-CEEE
Confidence 346799999999754 3455555 577999999999999999998899999999999999 776 5554
No 10
>PF02836 Glyco_hydro_2_C: Glycosyl hydrolases family 2, TIM barrel domain; InterPro: IPR006103 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. Beta-galactosidase from E. coli has a TIM-barrel-like core surrounded by four other largely beta domains [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3CMG_A 3FN9_C 1YQ2_A 3K4D_B 3LPG_B 3LPF_A 3K4A_B 3K46_B 3GM8_A 3DEC_A ....
Probab=98.95 E-value=2.2e-08 Score=110.12 Aligned_cols=202 Identities=20% Similarity=0.237 Sum_probs=121.0
Q ss_pred EeecCCceEeCCeeeEEeccccCCC------CCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHH
Q psy12907 177 IDYERDTFVKDGKPFRYVSGSFHYF------RSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYF 250 (949)
Q Consensus 177 v~~d~~~f~~dGkpf~i~sGe~HY~------R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~f 250 (949)
|.+.++.|+|||+|+.+.+...|.. .++.+.|+.+|++||+||+|+|++ .|-|. =.+|
T Consensus 1 vev~~~~~~lNGk~~~l~Gv~~h~~~~~~g~a~~~~~~~~d~~l~k~~G~N~iR~-----~h~p~-----------~~~~ 64 (298)
T PF02836_consen 1 VEVKDGGFYLNGKPIFLRGVNRHQDYPGLGRAMPDEAMERDLELMKEMGFNAIRT-----HHYPP-----------SPRF 64 (298)
T ss_dssp EEEETTEEEETTEEE-EEEEEE-S-BTTTBT---HHHHHHHHHHHHHTT-SEEEE-----TTS-------------SHHH
T ss_pred CEEECCEEEECCEEEEEEEEeeCcCcccccccCCHHHHHHHHHHHHhcCcceEEc-----ccccC-----------cHHH
Confidence 3456999999999999999887754 256899999999999999999999 34442 2579
Q ss_pred HHHHHHCCceEEecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccc
Q psy12907 251 LELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVE 330 (949)
Q Consensus 251 l~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIE 330 (949)
+++|.++||.|+.-+ | ..+.=.|-.. . .......+|.+.+.+.+-+++++.+.+.| ..||||=+-
T Consensus 65 ~~~cD~~GilV~~e~-~-------~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~v~~~~NH-----PSIi~W~~g 129 (298)
T PF02836_consen 65 YDLCDELGILVWQEI-P-------LEGHGSWQDF-G-NCNYDADDPEFRENAEQELREMVRRDRNH-----PSIIMWSLG 129 (298)
T ss_dssp HHHHHHHT-EEEEE--S--------BSCTSSSST-S-CTSCTTTSGGHHHHHHHHHHHHHHHHTT------TTEEEEEEE
T ss_pred HHHHhhcCCEEEEec-c-------ccccCccccC-C-ccccCCCCHHHHHHHHHHHHHHHHcCcCc-----CchheeecC
Confidence 999999999998653 1 1111112111 0 01235678999998888788877777665 479999999
Q ss_pred cccCCccchHHHHHHHHHHHHHhcCCeeEEEeeCCCCccccccccCCeeeeecCCCCC---ChHHHHHHhhh--cCCCCC
Q psy12907 331 NEMGSYTCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGYLKCTVPGVYATVDFGAAT---NVSSAFAAMRT--VSPHGP 405 (949)
Q Consensus 331 NEyG~y~~d~~Y~~~L~~~~~~~lG~~vplft~Dg~~~~~l~~~~~g~~~t~nfg~~~---~~~~~f~~~~~--~~p~~P 405 (949)
||- ....+++.|.+.+++. --+-|+..+.+.....+ .........+... .....-..+.. ..+++|
T Consensus 130 NE~----~~~~~~~~l~~~~k~~-DptRpv~~~~~~~~~~~----~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~kP 200 (298)
T PF02836_consen 130 NES----DYREFLKELYDLVKKL-DPTRPVTYASNGWDPYV----DDIIFDIYSGWYNGYGDPEDFEKYLEDWYKYPDKP 200 (298)
T ss_dssp ESS----HHHHHHHHHHHHHHHH--TTSEEEEETGTSGGST----SSCEECSETTTSSSCCHHHHHHHHHHHHHHHCTS-
T ss_pred ccC----ccccchhHHHHHHHhc-CCCCceeeccccccccc----ccccccccccccCCcccHHHHHHHHHhccccCCCC
Confidence 997 4567788888888875 33344433322111111 1111111111111 11221122222 368899
Q ss_pred eeeeecCCCcccc
Q psy12907 406 LVNSEYYPGWLTH 418 (949)
Q Consensus 406 ~~~~Ef~~GWfd~ 418 (949)
++.+||....+..
T Consensus 201 ~i~sEyg~~~~~~ 213 (298)
T PF02836_consen 201 IIISEYGADAYNS 213 (298)
T ss_dssp EEEEEESEBBSST
T ss_pred eEehhcccccccc
Confidence 9999998766543
No 11
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=98.59 E-value=6.7e-07 Score=95.67 Aligned_cols=157 Identities=18% Similarity=0.295 Sum_probs=108.4
Q ss_pred eCCeeeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccC-CCCCe-eeccCchhHHHHHHHHHHCCceEEe
Q psy12907 186 KDGKPFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHE-PSPGQ-FQFTGDQDLEYFLELAVREDLLVLL 263 (949)
Q Consensus 186 ~dGkpf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HE-p~~G~-fdF~g~~DL~~fl~la~e~GL~VIL 263 (949)
.+|+++.+.+-+.|... +..-++.+++||++|+|+|++.+.|...+ +.++. ++=+....|+++|+.|+++||+|||
T Consensus 3 ~~G~~v~~~G~n~~w~~--~~~~~~~~~~~~~~G~n~VRi~v~~~~~~~~~~~~~~~~~~~~~ld~~v~~a~~~gi~vil 80 (281)
T PF00150_consen 3 QNGKPVNWRGFNTHWYN--PSITEADFDQLKALGFNTVRIPVGWEAYQEPNPGYNYDETYLARLDRIVDAAQAYGIYVIL 80 (281)
T ss_dssp TTSEBEEEEEEEETTSG--GGSHHHHHHHHHHTTESEEEEEEESTSTSTTSTTTSBTHHHHHHHHHHHHHHHHTT-EEEE
T ss_pred CCCCeEEeeeeecccCC--CCCHHHHHHHHHHCCCCEEEeCCCHHHhcCCCCCccccHHHHHHHHHHHHHHHhCCCeEEE
Confidence 47899999888888321 12788999999999999999999995544 67764 6666678899999999999999998
Q ss_pred cCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCccch----
Q psy12907 264 RPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSYTCD---- 339 (949)
Q Consensus 264 RpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y~~d---- 339 (949)
.+- ..|.|...... -........+++.+...++.++.. ..+|++++|=||......+
T Consensus 81 d~h----------~~~~w~~~~~~--------~~~~~~~~~~~~~~~~~la~~y~~-~~~v~~~el~NEP~~~~~~~~w~ 141 (281)
T PF00150_consen 81 DLH----------NAPGWANGGDG--------YGNNDTAQAWFKSFWRALAKRYKD-NPPVVGWELWNEPNGGNDDANWN 141 (281)
T ss_dssp EEE----------ESTTCSSSTST--------TTTHHHHHHHHHHHHHHHHHHHTT-TTTTEEEESSSSGCSTTSTTTTS
T ss_pred Eec----------cCccccccccc--------cccchhhHHHHHhhhhhhccccCC-CCcEEEEEecCCccccCCccccc
Confidence 742 12667432110 011222444555555556555433 3469999999999875322
Q ss_pred -------HHHHHHHHHHHHHhcCCeeEEEeeC
Q psy12907 340 -------KEHMIWLRDQMKYYVRDAAVLYTTD 364 (949)
Q Consensus 340 -------~~Y~~~L~~~~~~~lG~~vplft~D 364 (949)
.++.+.+.+.+|+. +-+.+++...
T Consensus 142 ~~~~~~~~~~~~~~~~~Ir~~-~~~~~i~~~~ 172 (281)
T PF00150_consen 142 AQNPADWQDWYQRAIDAIRAA-DPNHLIIVGG 172 (281)
T ss_dssp HHHTHHHHHHHHHHHHHHHHT-TSSSEEEEEE
T ss_pred cccchhhhhHHHHHHHHHHhc-CCcceeecCC
Confidence 46667777778876 7766665554
No 12
>PRK10150 beta-D-glucuronidase; Provisional
Probab=98.57 E-value=6.5e-06 Score=99.41 Aligned_cols=153 Identities=18% Similarity=0.146 Sum_probs=101.3
Q ss_pred eecCCceEeCCeeeEEeccccCCC------CCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHH
Q psy12907 178 DYERDTFVKDGKPFRYVSGSFHYF------RSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFL 251 (949)
Q Consensus 178 ~~d~~~f~~dGkpf~i~sGe~HY~------R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl 251 (949)
+.++..|+|||+|+++.+...|.. .++.+.|+.+++.||++|+|+|++- |-|. =.+|+
T Consensus 279 ~~~~~~f~lNG~pv~lrG~~~h~~~~~~G~a~~~~~~~~d~~l~K~~G~N~vR~s-----h~p~-----------~~~~~ 342 (604)
T PRK10150 279 AVKGGQFLINGKPFYFKGFGKHEDADIRGKGLDEVLNVHDHNLMKWIGANSFRTS-----HYPY-----------SEEML 342 (604)
T ss_pred EEeCCEEEECCEEEEEEeeeccCCCCccCCcCCHHHHHHHHHHHHHCCCCEEEec-----cCCC-----------CHHHH
Confidence 445889999999999998877743 2567889999999999999999992 4442 13799
Q ss_pred HHHHHCCceEEecCCcccccccCCCCcccccc-------ccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCce
Q psy12907 252 ELAVREDLLVLLRPGPYICAERDFGGLPSWLM-------TIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPI 324 (949)
Q Consensus 252 ~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~-------~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpI 324 (949)
++|-+.||+|+--.. .-|+..|.. +..+.-..-..+|.+.++..+-+++++.+.+.| ..|
T Consensus 343 ~~cD~~GllV~~E~p--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~mv~r~~NH-----PSI 409 (604)
T PRK10150 343 DLADRHGIVVIDETP--------AVGLNLSFGAGLEAGNKPKETYSEEAVNGETQQAHLQAIRELIARDKNH-----PSV 409 (604)
T ss_pred HHHHhcCcEEEEecc--------cccccccccccccccccccccccccccchhHHHHHHHHHHHHHHhccCC-----ceE
Confidence 999999999986532 111111211 101111112345667666665555555554443 579
Q ss_pred EEeccccccCCc-cchHHHHHHHHHHHHHhcCCeeEE
Q psy12907 325 ILVQVENEMGSY-TCDKEHMIWLRDQMKYYVRDAAVL 360 (949)
Q Consensus 325 I~vQIENEyG~y-~~d~~Y~~~L~~~~~~~lG~~vpl 360 (949)
|||=+-||-..- ....+|++.+.+.+++. --+-|.
T Consensus 410 i~Ws~gNE~~~~~~~~~~~~~~l~~~~k~~-DptR~v 445 (604)
T PRK10150 410 VMWSIANEPASREQGAREYFAPLAELTRKL-DPTRPV 445 (604)
T ss_pred EEEeeccCCCccchhHHHHHHHHHHHHHhh-CCCCce
Confidence 999999997532 23457778888888865 333343
No 13
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=98.25 E-value=4.1e-05 Score=97.68 Aligned_cols=140 Identities=24% Similarity=0.246 Sum_probs=93.8
Q ss_pred ceeEEeecCCceEeCCeeeEEeccccCCC------CCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchh
Q psy12907 173 RKFEIDYERDTFVKDGKPFRYVSGSFHYF------RSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQD 246 (949)
Q Consensus 173 ~~f~v~~d~~~f~~dGkpf~i~sGe~HY~------R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~D 246 (949)
|++++ ++..|+|||+|+.+-+...|.. +++++.++.+|+.||++|+|+|++- |-|.
T Consensus 334 R~iei--~~~~f~LNGkpi~lrGvn~h~~~p~~G~a~t~e~~~~di~lmK~~g~NaVR~s-----HyP~----------- 395 (1027)
T PRK09525 334 RKVEI--ENGLLKLNGKPLLIRGVNRHEHHPEHGQVMDEETMVQDILLMKQHNFNAVRCS-----HYPN----------- 395 (1027)
T ss_pred EEEEE--ECCEEEECCEEEEEEEeEccccCcccCccCCHHHHHHHHHHHHHCCCCEEEec-----CCCC-----------
Confidence 34444 5889999999999999887732 4678999999999999999999992 4442
Q ss_pred HHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEE
Q psy12907 247 LEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIIL 326 (949)
Q Consensus 247 L~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~ 326 (949)
=.+|.++|-|.||+|+--.. . |+ .|-.|. . . -.++|.|.+++.+=+++++.+.+ |-..|||
T Consensus 396 ~p~fydlcDe~GilV~dE~~-~---e~-hg~~~~---~-~-----~~~dp~~~~~~~~~~~~mV~Rdr-----NHPSIi~ 456 (1027)
T PRK09525 396 HPLWYELCDRYGLYVVDEAN-I---ET-HGMVPM---N-R-----LSDDPRWLPAMSERVTRMVQRDR-----NHPSIII 456 (1027)
T ss_pred CHHHHHHHHHcCCEEEEecC-c---cc-cCCccc---c-C-----CCCCHHHHHHHHHHHHHHHHhCC-----CCCEEEE
Confidence 14789999999999986632 1 11 111111 0 0 13577777765544444444444 3357999
Q ss_pred eccccccCCccchHHHHHHHHHHHHHh
Q psy12907 327 VQVENEMGSYTCDKEHMIWLRDQMKYY 353 (949)
Q Consensus 327 vQIENEyG~y~~d~~Y~~~L~~~~~~~ 353 (949)
|=+-||-|. + .....+.+.+++.
T Consensus 457 WSlgNE~~~-g---~~~~~l~~~~k~~ 479 (1027)
T PRK09525 457 WSLGNESGH-G---ANHDALYRWIKSN 479 (1027)
T ss_pred EeCccCCCc-C---hhHHHHHHHHHhh
Confidence 999999653 1 2234555666643
No 14
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=98.25 E-value=4.2e-05 Score=97.61 Aligned_cols=192 Identities=17% Similarity=0.177 Sum_probs=114.8
Q ss_pred ceeEEeecCCceEeCCeeeEEeccccCCC-----C-CChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchh
Q psy12907 173 RKFEIDYERDTFVKDGKPFRYVSGSFHYF-----R-SPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQD 246 (949)
Q Consensus 173 ~~f~v~~d~~~f~~dGkpf~i~sGe~HY~-----R-~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~D 246 (949)
|++++ ++..|+|||+|+.+.+...|.. | ++++.|+.+|+.||++|+|+|++- |-|.
T Consensus 318 R~iei--~~~~f~lNGkpi~lrGvnrh~~~p~~G~a~~~e~~~~dl~lmK~~g~NavR~s-----HyP~----------- 379 (1021)
T PRK10340 318 RDIKV--RDGLFWINNRYVKLHGVNRHDNDHRKGRAVGMDRVEKDIQLMKQHNINSVRTA-----HYPN----------- 379 (1021)
T ss_pred EEEEE--ECCEEEECCEEEEEEEeecCCCCcccCccCCHHHHHHHHHHHHHCCCCEEEec-----CCCC-----------
Confidence 34444 5889999999999999877633 2 568899999999999999999983 4442
Q ss_pred HHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEE
Q psy12907 247 LEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIIL 326 (949)
Q Consensus 247 L~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~ 326 (949)
=.+|+++|-|.||+|+--. |..|..|.. .. +...-+++|.+.++..+-+++++.+.+ |-..|||
T Consensus 380 ~~~fydlcDe~GllV~dE~-~~e~~g~~~-------~~---~~~~~~~~p~~~~~~~~~~~~mV~Rdr-----NHPSIi~ 443 (1021)
T PRK10340 380 DPRFYELCDIYGLFVMAET-DVESHGFAN-------VG---DISRITDDPQWEKVYVDRIVRHIHAQK-----NHPSIII 443 (1021)
T ss_pred CHHHHHHHHHCCCEEEECC-cccccCccc-------cc---ccccccCCHHHHHHHHHHHHHHHHhCC-----CCCEEEE
Confidence 2378999999999998764 332322211 00 011124577776554333444444443 3458999
Q ss_pred eccccccCCccchHHHHHHHHHHHHHhcCCeeEEEeeCCCCccccccccCCeeeeecCCCCCChHHHHHHhhhcCCCCCe
Q psy12907 327 VQVENEMGSYTCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGYLKCTVPGVYATVDFGAATNVSSAFAAMRTVSPHGPL 406 (949)
Q Consensus 327 vQIENEyG~y~~d~~Y~~~L~~~~~~~lG~~vplft~Dg~~~~~l~~~~~g~~~t~nfg~~~~~~~~f~~~~~~~p~~P~ 406 (949)
|=+-||-+. +. .+..+.+.+|+. --+-| .+..+.. ....-++++. .++ .. +......+..+++|+
T Consensus 444 WslGNE~~~-g~---~~~~~~~~~k~~-DptR~-v~~~~~~----~~~~~Dv~~~-~Y~---~~-~~~~~~~~~~~~kP~ 508 (1021)
T PRK10340 444 WSLGNESGY-GC---NIRAMYHAAKAL-DDTRL-VHYEEDR----DAEVVDVIST-MYT---RV-ELMNEFGEYPHPKPR 508 (1021)
T ss_pred EECccCccc-cH---HHHHHHHHHHHh-CCCce-EEeCCCc----Cccccceecc-ccC---CH-HHHHHHHhCCCCCcE
Confidence 999999753 21 235667777754 22222 3332211 0111233322 122 11 223333344467999
Q ss_pred eeeecCC
Q psy12907 407 VNSEYYP 413 (949)
Q Consensus 407 ~~~Ef~~ 413 (949)
+.+||-.
T Consensus 509 i~~Ey~h 515 (1021)
T PRK10340 509 ILCEYAH 515 (1021)
T ss_pred EEEchHh
Confidence 9999853
No 15
>smart00633 Glyco_10 Glycosyl hydrolase family 10.
Probab=97.99 E-value=0.00011 Score=79.55 Aligned_cols=113 Identities=21% Similarity=0.317 Sum_probs=84.2
Q ss_pred CCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHH
Q psy12907 229 WSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQ 308 (949)
Q Consensus 229 Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~ 308 (949)
|...||++|+|||+. ++++++.|+++||.| |..+-+ |.. ..|.|+.. .+ .+...+++++|+++
T Consensus 3 W~~~ep~~G~~n~~~---~D~~~~~a~~~gi~v--~gH~l~---W~~-~~P~W~~~-~~-------~~~~~~~~~~~i~~ 65 (254)
T smart00633 3 WDSTEPSRGQFNFSG---ADAIVNFAKENGIKV--RGHTLV---WHS-QTPDWVFN-LS-------KETLLARLENHIKT 65 (254)
T ss_pred cccccCCCCccChHH---HHHHHHHHHHCCCEE--EEEEEe---ecc-cCCHhhhc-CC-------HHHHHHHHHHHHHH
Confidence 899999999999985 899999999999998 332222 544 68999875 22 33556777777777
Q ss_pred HHHHHHhccccCCCceEEeccccccCC----------c--cchHHHHHHHHHHHHHhcCCeeEEEeeCCC
Q psy12907 309 LFPRITRFLYGNGGPIILVQVENEMGS----------Y--TCDKEHMIWLRDQMKYYVRDAAVLYTTDGG 366 (949)
Q Consensus 309 l~~~l~~~~~~~GGpII~vQIENEyG~----------y--~~d~~Y~~~L~~~~~~~lG~~vplft~Dg~ 366 (949)
++.+.+ |.|..++|=||--. + ..+.+|+...-+.+|+. .-++.||.+|..
T Consensus 66 v~~ry~-------g~i~~wdV~NE~~~~~~~~~~~~~w~~~~G~~~i~~af~~ar~~-~P~a~l~~Ndy~ 127 (254)
T smart00633 66 VVGRYK-------GKIYAWDVVNEALHDNGSGLRRSVWYQILGEDYIEKAFRYAREA-DPDAKLFYNDYN 127 (254)
T ss_pred HHHHhC-------CcceEEEEeeecccCCCcccccchHHHhcChHHHHHHHHHHHHh-CCCCEEEEeccC
Confidence 766653 56889999999422 1 12347888888888887 778999999854
No 16
>PLN02161 beta-amylase
Probab=97.73 E-value=9.5e-05 Score=86.07 Aligned_cols=84 Identities=21% Similarity=0.285 Sum_probs=64.9
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCC-CCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCC----Ccc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEP-SPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFG----GLP 279 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp-~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~G----GlP 279 (949)
++.-+..|+++|++|+..|.+-|.|.+.|. .|++|||+| ..+++++++++||++.+--.=--|+---.+ -||
T Consensus 116 ~~al~~~L~~LK~~GVdGVmvDVWWGiVE~~~p~~YdWsg---Y~~l~~mvr~~GLKlq~vmSFHqCGGNvGd~~~IpLP 192 (531)
T PLN02161 116 LKALTVSLKALKLAGVHGIAVEVWWGIVERFSPLEFKWSL---YEELFRLISEAGLKLHVALCFHSNMHLFGGKGGISLP 192 (531)
T ss_pred HHHHHHHHHHHHHcCCCEEEEEeeeeeeecCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEecccCCCCCCccCccCC
Confidence 456778999999999999999999999998 799999999 888999999999998433222333221111 289
Q ss_pred ccccc---cCCCeee
Q psy12907 280 SWLMT---IKPNITL 291 (949)
Q Consensus 280 ~WL~~---~~p~~~~ 291 (949)
.|+.+ ++|++..
T Consensus 193 ~WV~~~g~~~pDi~f 207 (531)
T PLN02161 193 LWIREIGDVNKDIYY 207 (531)
T ss_pred HHHHhhhccCCCceE
Confidence 99975 3677654
No 17
>PF03198 Glyco_hydro_72: Glucanosyltransferase; InterPro: IPR004886 This family is a group of yeast glycolipid proteins anchored to the membrane. It includes Candida albicans (Yeast) pH-regulated protein, which is required for apical growth and plays a role in morphogenesis and Saccharomyces cerevisiae glycolipid anchored surface protein.; PDB: 2W61_A 2W62_A 2W63_A.
Probab=97.72 E-value=0.00049 Score=76.62 Aligned_cols=160 Identities=14% Similarity=0.210 Sum_probs=84.8
Q ss_pred EeecCCceE--eCCeeeEEeccccCCCCC-----------ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccC
Q psy12907 177 IDYERDTFV--KDGKPFRYVSGSFHYFRS-----------PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTG 243 (949)
Q Consensus 177 v~~d~~~f~--~dGkpf~i~sGe~HY~R~-----------p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g 243 (949)
|...++.|. .+|++|+|.+=++-+--. .++.|+.++..||++|+|||++|- .+-
T Consensus 11 I~ikG~kff~~~~g~~F~ikGVaYQp~~~~~~~~~~DPLad~~~C~rDi~~l~~LgiNtIRVY~-------------vdp 77 (314)
T PF03198_consen 11 IEIKGNKFFYSKNGTRFFIKGVAYQPGGSSEPSNYIDPLADPEACKRDIPLLKELGINTIRVYS-------------VDP 77 (314)
T ss_dssp EEEETTEEEETTT--B--EEEEE----------SS--GGG-HHHHHHHHHHHHHHT-SEEEES----------------T
T ss_pred EEEECCEeEECCCCCEEEEeeEEcccCCCCCCccCcCcccCHHHHHHhHHHHHHcCCCEEEEEE-------------eCC
Confidence 444577787 788888888776654333 257999999999999999999982 233
Q ss_pred chhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCCCeeeecCCH--hHHHHHHHHHHHHHHHHHhccccCC
Q psy12907 244 DQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDE--VYQHYVNKWFAQLFPRITRFLYGNG 321 (949)
Q Consensus 244 ~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp--~y~~~v~r~~~~l~~~l~~~~~~~G 321 (949)
..|=++++++.+++|+|||+--+. |...+-..+| .|-...-.-+.+++..++++.
T Consensus 78 ~~nHd~CM~~~~~aGIYvi~Dl~~-------------------p~~sI~r~~P~~sw~~~l~~~~~~vid~fa~Y~---- 134 (314)
T PF03198_consen 78 SKNHDECMSAFADAGIYVILDLNT-------------------PNGSINRSDPAPSWNTDLLDRYFAVIDAFAKYD---- 134 (314)
T ss_dssp TS--HHHHHHHHHTT-EEEEES-B-------------------TTBS--TTS------HHHHHHHHHHHHHHTT-T----
T ss_pred CCCHHHHHHHHHhCCCEEEEecCC-------------------CCccccCCCCcCCCCHHHHHHHHHHHHHhccCC----
Confidence 457889999999999999988432 1222223334 333222222334666676664
Q ss_pred CceEEeccccccCCc-------cchHHHHHHHHHHHHHhcCC-eeEE--EeeCCCC-----ccccccc
Q psy12907 322 GPIILVQVENEMGSY-------TCDKEHMIWLRDQMKYYVRD-AAVL--YTTDGGG-----VGYLKCT 374 (949)
Q Consensus 322 GpII~vQIENEyG~y-------~~d~~Y~~~L~~~~~~~lG~-~vpl--ft~Dg~~-----~~~l~~~ 374 (949)
+++++=+-||--.- .+-|+..+.+|+-+++. +. .+|+ -++|... ..++.|.
T Consensus 135 -N~LgFf~GNEVin~~~~t~aap~vKAavRD~K~Yi~~~-~~R~IPVGYsaaD~~~~r~~~a~Yl~Cg 200 (314)
T PF03198_consen 135 -NTLGFFAGNEVINDASNTNAAPYVKAAVRDMKAYIKSK-GYRSIPVGYSAADDAEIRQDLANYLNCG 200 (314)
T ss_dssp -TEEEEEEEESSS-STT-GGGHHHHHHHHHHHHHHHHHS-SS----EEEEE---TTTHHHHHHHTTBT
T ss_pred -ceEEEEecceeecCCCCcccHHHHHHHHHHHHHHHHhc-CCCCCceeEEccCChhHHHHHHHHhcCC
Confidence 79999999997532 12345555555555554 55 4455 4556542 2346673
No 18
>PLN02801 beta-amylase
Probab=97.72 E-value=0.00022 Score=83.12 Aligned_cols=141 Identities=23% Similarity=0.441 Sum_probs=95.6
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCC-CCCeeeccCchhHHHHHHHHHHCCceEE--ecCCcccccccCCC-----
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEP-SPGQFQFTGDQDLEYFLELAVREDLLVL--LRPGPYICAERDFG----- 276 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp-~~G~fdF~g~~DL~~fl~la~e~GL~VI--LRpGPYIcAEw~~G----- 276 (949)
++.-+..|+++|++|+..|.+-|.|.+.|. .|++|||+| -.+++++++++||++. +.+ --|+- +-|
T Consensus 36 ~~~l~~~L~~LK~~GVdGVmvDVWWGiVE~~~P~~YdWsg---Y~~l~~mvr~~GLKlq~vmSF--HqCGG-NVGD~~~I 109 (517)
T PLN02801 36 EEGLEKQLKRLKEAGVDGVMVDVWWGIVESKGPKQYDWSA---YRSLFELVQSFGLKIQAIMSF--HQCGG-NVGDAVNI 109 (517)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeeeccCCCCccCcHH---HHHHHHHHHHcCCeEEEEEEe--cccCC-CCCCcccc
Confidence 567889999999999999999999999998 599999999 8889999999999984 543 23422 111
Q ss_pred Cccccccc---cCCCeeeec----CCHhHH----------------HHHHHHHHHHHHHHHhccccCCCceEEecc----
Q psy12907 277 GLPSWLMT---IKPNITLRS----KDEVYQ----------------HYVNKWFAQLFPRITRFLYGNGGPIILVQV---- 329 (949)
Q Consensus 277 GlP~WL~~---~~p~~~~Rt----~dp~y~----------------~~v~r~~~~l~~~l~~~~~~~GGpII~vQI---- 329 (949)
-||.|+.+ ++|++.... .|+.|+ +.-+.|++..-..+++++ .+|.|..|||
T Consensus 110 pLP~WV~~~g~~~pDi~ftDr~G~rn~EyLSlg~D~~pvl~GRTplq~Y~Dfm~SFr~~F~~~l--~~~~I~eI~VGlGP 187 (517)
T PLN02801 110 PIPQWVRDVGDSDPDIFYTNRSGNRNKEYLSIGVDNLPLFHGRTAVEMYSDYMKSFRENMADFL--EAGVIIDIEVGLGP 187 (517)
T ss_pred cCCHHHHHhhccCCCceeecCCCCcCcceeeeccCcccccCCCCHHHHHHHHHHHHHHHHHHhc--cCCeeEEEEEcccc
Confidence 28999975 367764421 122222 222345555555666664 3468888888
Q ss_pred --ccccCCc--------------cc-hHHHHHHHHHHHHHh
Q psy12907 330 --ENEMGSY--------------TC-DKEHMIWLRDQMKYY 353 (949)
Q Consensus 330 --ENEyG~y--------------~~-d~~Y~~~L~~~~~~~ 353 (949)
|=.|=|| .| |+--+..|++.+.+.
T Consensus 188 ~GELRYPSYp~~~gW~fpGiGEFQCYDky~~~~l~~aA~~~ 228 (517)
T PLN02801 188 AGELRYPSYPETQGWVFPGIGEFQCYDKYLKADFKEAATEA 228 (517)
T ss_pred cccccCCCCcCCCCCCCCCcceeeeccHHHHHHHHHHHHhc
Confidence 4445454 12 444455666666654
No 19
>COG3250 LacZ Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]
Probab=97.71 E-value=0.00035 Score=86.86 Aligned_cols=118 Identities=24% Similarity=0.311 Sum_probs=89.7
Q ss_pred EeecCCceEeCCeeeEEeccccCCC-----CCCh-hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHH
Q psy12907 177 IDYERDTFVKDGKPFRYVSGSFHYF-----RSPR-YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYF 250 (949)
Q Consensus 177 v~~d~~~f~~dGkpf~i~sGe~HY~-----R~p~-~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~f 250 (949)
|.++.+.|.|||+|+++-+-.-|.+ |... +.-+++|++||++|+|+|+|- |-|+. .+|
T Consensus 286 iei~~~~~~iNGkpvf~kGvnrHe~~~~~G~~~~~~~~~~dl~lmk~~n~N~vRts-----HyP~~-----------~~~ 349 (808)
T COG3250 286 VEIKDGLLLINGKPVFIRGVNRHEDDPILGRVTDEDAMERDLKLMKEANMNSVRTS-----HYPNS-----------EEF 349 (808)
T ss_pred EEEECCeEEECCeEEEEeeeecccCCCccccccCHHHHHHHHHHHHHcCCCEEEec-----CCCCC-----------HHH
Confidence 4566889999999999999988866 3333 448899999999999999996 66653 569
Q ss_pred HHHHHHCCceEEecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccc
Q psy12907 251 LELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVE 330 (949)
Q Consensus 251 l~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIE 330 (949)
+++|.++||+||--+ ..||-. .| ++|.|++.+..=.++++.+.+.| ..||||=+-
T Consensus 350 ydLcDelGllV~~Ea----~~~~~~--~~--------------~~~~~~k~~~~~i~~mver~knH-----PSIiiWs~g 404 (808)
T COG3250 350 YDLCDELGLLVIDEA----MIETHG--MP--------------DDPEWRKEVSEEVRRMVERDRNH-----PSIIIWSLG 404 (808)
T ss_pred HHHHHHhCcEEEEec----chhhcC--CC--------------CCcchhHHHHHHHHHHHHhccCC-----CcEEEEecc
Confidence 999999999999774 222321 11 67788887776666666666544 479999999
Q ss_pred cccCC
Q psy12907 331 NEMGS 335 (949)
Q Consensus 331 NEyG~ 335 (949)
||-|.
T Consensus 405 NE~~~ 409 (808)
T COG3250 405 NESGH 409 (808)
T ss_pred ccccC
Confidence 99764
No 20
>PF13204 DUF4038: Protein of unknown function (DUF4038); PDB: 3KZS_D.
Probab=97.69 E-value=0.0012 Score=73.15 Aligned_cols=235 Identities=19% Similarity=0.234 Sum_probs=111.8
Q ss_pred CceE-eCCeeeEEeccccCC---CCCChhhHHHHHHHHHHcCCCEEEEeec--CCcc-C-------C----CCCeeeccC
Q psy12907 182 DTFV-KDGKPFRYVSGSFHY---FRSPRYYWQDRLRKLRAAGLNAVSTYVE--WSFH-E-------P----SPGQFQFTG 243 (949)
Q Consensus 182 ~~f~-~dGkpf~i~sGe~HY---~R~p~~~W~d~L~k~Ka~GlNtV~tYv~--Wn~H-E-------p----~~G~fdF~g 243 (949)
+.|. -||+||+.++ .-.+ .|...+.|+.-|+..|+-|||+|++=++ |.-+ . | .++.+||+.
T Consensus 3 r~f~~~dG~Pff~lg-dT~W~~~~~~~~~e~~~yL~~r~~qgFN~iq~~~l~~~~~~~~~n~~~~~~~~~~~~~~~d~~~ 81 (289)
T PF13204_consen 3 RHFVYADGTPFFWLG-DTAWSLFHRLTREEWEQYLDTRKEQGFNVIQMNVLPQWDGYNTPNRYGFAPFPDEDPGQFDFTR 81 (289)
T ss_dssp SSEEETTS-B--EEE-EE-TTHHHH--HHHHHHHHHHHHHTT--EEEEES-SSSS-B----TTS-BS-SSTT------TT
T ss_pred ceEecCCCCEEeehh-HHHHHHhhCCCHHHHHHHHHHHHHCCCCEEEEEeCCCcccccccccCCCcCCCCCCccccCCCC
Confidence 4455 6999999987 3332 2677899999999999999999998765 3322 1 1 223367764
Q ss_pred -----chhHHHHHHHHHHCCceEEecC---CcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHh
Q psy12907 244 -----DQDLEYFLELAVREDLLVLLRP---GPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITR 315 (949)
Q Consensus 244 -----~~DL~~fl~la~e~GL~VILRp---GPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~ 315 (949)
...+++.|+.|.+.||.+-|-| +||.-.-|-+| |..+. .+.+++|.+.|+.+++.
T Consensus 82 ~N~~YF~~~d~~i~~a~~~Gi~~~lv~~wg~~~~~~~Wg~~--~~~m~---------------~e~~~~Y~~yv~~Ry~~ 144 (289)
T PF13204_consen 82 PNPAYFDHLDRRIEKANELGIEAALVPFWGCPYVPGTWGFG--PNIMP---------------PENAERYGRYVVARYGA 144 (289)
T ss_dssp ----HHHHHHHHHHHHHHTT-EEEEESS-HHHHH---------TTSS----------------HHHHHHHHHHHHHHHTT
T ss_pred CCHHHHHHHHHHHHHHHHCCCeEEEEEEECCcccccccccc--ccCCC---------------HHHHHHHHHHHHHHHhc
Confidence 2679999999999999984432 34443444443 11111 36788999999999987
Q ss_pred ccccCCCceEEeccccccCCccchHHHHHHHHHHHHHhcCCeeEE-EeeCCCCcccccc-ccCC-----eeeeecCCCCC
Q psy12907 316 FLYGNGGPIILVQVENEMGSYTCDKEHMIWLRDQMKYYVRDAAVL-YTTDGGGVGYLKC-TVPG-----VYATVDFGAAT 388 (949)
Q Consensus 316 ~~~~~GGpII~vQIENEyG~y~~d~~Y~~~L~~~~~~~lG~~vpl-ft~Dg~~~~~l~~-~~~g-----~~~t~nfg~~~ 388 (949)
+- +|| +=|-||+-....+.++.+.+.+.+++.-.-. +. ++.-+. ...-.. .-.. .+.+.+-....
T Consensus 145 ~~-----Nvi-W~l~gd~~~~~~~~~~w~~~~~~i~~~dp~~-L~T~H~~~~-~~~~~~~~~~~Wldf~~~Qsgh~~~~~ 216 (289)
T PF13204_consen 145 YP-----NVI-WILGGDYFDTEKTRADWDAMARGIKENDPYQ-LITIHPCGR-TSSPDWFHDEPWLDFNMYQSGHNRYDQ 216 (289)
T ss_dssp -S-----SEE-EEEESSS--TTSSHHHHHHHHHHHHHH--SS--EEEEE-BT-EBTHHHHTT-TT--SEEEB--S--TT-
T ss_pred CC-----CCE-EEecCccCCCCcCHHHHHHHHHHHHhhCCCC-cEEEeCCCC-CCcchhhcCCCcceEEEeecCCCcccc
Confidence 63 355 5588898222567788888888888763333 22 222211 000000 0000 11111100000
Q ss_pred ChHHHH--HHhhhcCCCCCeeeeec-CCCcccccCCCCCCCChHHHHHHHHH-HHHcC
Q psy12907 389 NVSSAF--AAMRTVSPHGPLVNSEY-YPGWLTHWGEKIATVDTDPVVKTLQI-MLEMK 442 (949)
Q Consensus 389 ~~~~~f--~~~~~~~p~~P~~~~Ef-~~GWfd~WG~~~~~~~~~~~~~~l~~-~l~~g 442 (949)
...... +..-+..|.+|.++.|- |-|-...+.......+++++.+.+=+ +|+.+
T Consensus 217 ~~~~~~~~~~~~~~~p~KPvin~Ep~YEg~~~~~~~~~~~~~~~dvrr~aw~svlaGa 274 (289)
T PF13204_consen 217 DNWYYLPEEFDYRRKPVKPVINGEPCYEGIPYSRWGYNGRFSAEDVRRRAWWSVLAGA 274 (289)
T ss_dssp -THHHH--HHHHTSSS---EEESS---BT-BTTSS-TS-B--HHHHHHHHHHHHHCT-
T ss_pred hHHHHHhhhhhhhhCCCCCEEcCcccccCCCCCcCcccCCCCHHHHHHHHHHHHhcCC
Confidence 111111 11124568999999993 34443333223445677887776644 45433
No 21
>PLN02705 beta-amylase
Probab=97.67 E-value=0.00029 Score=83.32 Aligned_cols=117 Identities=12% Similarity=0.252 Sum_probs=84.1
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCC-CCCeeeccCchhHHHHHHHHHHCCceE--EecCCcccccccCCC-----
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEP-SPGQFQFTGDQDLEYFLELAVREDLLV--LLRPGPYICAERDFG----- 276 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp-~~G~fdF~g~~DL~~fl~la~e~GL~V--ILRpGPYIcAEw~~G----- 276 (949)
++.-+..|+++|++|+..|.+-|.|.+.|. .|++|||+| ..+++++++++||++ ||.+ --|+- +-|
T Consensus 267 ~~al~a~L~aLK~aGVdGVmvDVWWGiVE~~~P~~YdWsg---Y~~L~~mvr~~GLKlqvVmSF--HqCGG-NVGD~~~I 340 (681)
T PLN02705 267 PEGVRQELSHMKSLNVDGVVVDCWWGIVEGWNPQKYVWSG---YRELFNIIREFKLKLQVVMAF--HEYGG-NASGNVMI 340 (681)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeEeecCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEe--eccCC-CCCCcccc
Confidence 577889999999999999999999999998 699999999 888999999999997 4543 23433 111
Q ss_pred Cccccccc---cCCCeeeec----CCHh----------------HHHHHHHHHHHHHHHHHhccccCCCceEEecc
Q psy12907 277 GLPSWLMT---IKPNITLRS----KDEV----------------YQHYVNKWFAQLFPRITRFLYGNGGPIILVQV 329 (949)
Q Consensus 277 GlP~WL~~---~~p~~~~Rt----~dp~----------------y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQI 329 (949)
-||.|+.+ ++|++.... .|+. -++.-..|++..-..+++++ .+|-|.-|||
T Consensus 341 PLP~WV~e~g~~nPDifftDr~G~rn~EyLSlg~D~~pvl~GRTplq~Y~DFM~SFr~~F~~fl--~~g~I~eI~V 414 (681)
T PLN02705 341 SLPQWVLEIGKDNQDIFFTDREGRRNTECLSWSIDKERVLKGRTGIEVYFDFMRSFRSEFDDLF--VEGLITAVEI 414 (681)
T ss_pred cCCHHHHHhcccCCCceeecCCCCcccceeeeecCcccccCCCCHHHHHHHHHHHHHHHHHHhc--cCCceeEEEe
Confidence 29999984 467765421 0111 12333445555556666665 3467888888
No 22
>PLN02803 beta-amylase
Probab=97.66 E-value=0.00013 Score=85.41 Aligned_cols=81 Identities=21% Similarity=0.484 Sum_probs=64.7
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCC-CCCeeeccCchhHHHHHHHHHHCCceEE--ecCCcccccccCCC-----
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEP-SPGQFQFTGDQDLEYFLELAVREDLLVL--LRPGPYICAERDFG----- 276 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp-~~G~fdF~g~~DL~~fl~la~e~GL~VI--LRpGPYIcAEw~~G----- 276 (949)
++.-+..|+++|++|+..|.+-|.|.+.|. .|++|||+| ..+++++++++||++. +.+ --|+- +-|
T Consensus 106 ~~~l~~~L~~LK~~GVdGVmvDVWWGiVE~~~p~~YdWsg---Y~~l~~mvr~~GLKlq~vmSF--HqCGG-NVGD~~~I 179 (548)
T PLN02803 106 PRAMNASLMALRSAGVEGVMVDAWWGLVEKDGPMKYNWEG---YAELVQMVQKHGLKLQVVMSF--HQCGG-NVGDSCSI 179 (548)
T ss_pred HHHHHHHHHHHHHcCCCEEEEEeeeeeeccCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEe--cccCC-CCCCcccc
Confidence 466788999999999999999999999998 599999999 8889999999999984 543 23422 111
Q ss_pred Cccccccc---cCCCeee
Q psy12907 277 GLPSWLMT---IKPNITL 291 (949)
Q Consensus 277 GlP~WL~~---~~p~~~~ 291 (949)
-||.|+.+ ++|++..
T Consensus 180 pLP~WV~e~~~~~pDi~f 197 (548)
T PLN02803 180 PLPPWVLEEMSKNPDLVY 197 (548)
T ss_pred cCCHHHHHhhhcCCCceE
Confidence 28999975 3677654
No 23
>PLN00197 beta-amylase; Provisional
Probab=97.65 E-value=0.00014 Score=85.32 Aligned_cols=81 Identities=30% Similarity=0.594 Sum_probs=65.4
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCC-CCCeeeccCchhHHHHHHHHHHCCceEE--ecCCcccccccCCC-----
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEP-SPGQFQFTGDQDLEYFLELAVREDLLVL--LRPGPYICAERDFG----- 276 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp-~~G~fdF~g~~DL~~fl~la~e~GL~VI--LRpGPYIcAEw~~G----- 276 (949)
++.-+..|+++|++|+..|.+-|.|.+.|. .|++|||+| ..+++++++++||++. +.+ --|+- +-|
T Consensus 126 ~~~l~~~L~~LK~~GVdGVmvDvWWGiVE~~~p~~YdWsg---Y~~L~~mvr~~GLKlq~VmSF--HqCGG-NVGD~~~I 199 (573)
T PLN00197 126 RKAMKASLQALKSAGVEGIMMDVWWGLVERESPGVYNWGG---YNELLEMAKRHGLKVQAVMSF--HQCGG-NVGDSCTI 199 (573)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeeeccCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEe--cccCC-CCCCcccc
Confidence 567889999999999999999999999998 799999999 8889999999999984 543 23432 111
Q ss_pred Cccccccc---cCCCeee
Q psy12907 277 GLPSWLMT---IKPNITL 291 (949)
Q Consensus 277 GlP~WL~~---~~p~~~~ 291 (949)
-||.|+.+ ++|++..
T Consensus 200 pLP~WV~~~g~~dpDiff 217 (573)
T PLN00197 200 PLPKWVVEEVDKDPDLAY 217 (573)
T ss_pred cCCHHHHHhhccCCCcee
Confidence 28999975 3677654
No 24
>PLN02905 beta-amylase
Probab=97.63 E-value=0.00036 Score=82.79 Aligned_cols=117 Identities=18% Similarity=0.386 Sum_probs=83.1
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCC-CCCeeeccCchhHHHHHHHHHHCCceEE--ecCCcccccccCCC-----
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEP-SPGQFQFTGDQDLEYFLELAVREDLLVL--LRPGPYICAERDFG----- 276 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp-~~G~fdF~g~~DL~~fl~la~e~GL~VI--LRpGPYIcAEw~~G----- 276 (949)
++.-+..|+++|++|+..|.+-|.|.+.|. .|++|||+| ..+++++++++||++. |.+ --|+- +-|
T Consensus 285 ~~al~a~L~aLK~aGVdGVmvDVWWGiVE~~gP~~YdWsg---Y~~L~~mvr~~GLKlqvVMSF--HqCGG-NVGD~~~I 358 (702)
T PLN02905 285 PDGLLKQLRILKSINVDGVKVDCWWGIVEAHAPQEYNWNG---YKRLFQMVRELKLKLQVVMSF--HECGG-NVGDDVCI 358 (702)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeeeecCCCCcCCcHH---HHHHHHHHHHcCCeEEEEEEe--cccCC-CCCCcccc
Confidence 456788999999999999999999999998 799999999 8889999999999984 543 33432 111
Q ss_pred Cccccccc---cCCCeeeec----CCHh----------------HHHHHHHHHHHHHHHHHhccccCCCceEEecc
Q psy12907 277 GLPSWLMT---IKPNITLRS----KDEV----------------YQHYVNKWFAQLFPRITRFLYGNGGPIILVQV 329 (949)
Q Consensus 277 GlP~WL~~---~~p~~~~Rt----~dp~----------------y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQI 329 (949)
-||.|+.+ ++|++.... .|+. -++.-..|++..-..+++++ .+|.|.-|||
T Consensus 359 PLP~WV~e~g~~nPDifftDrsG~rn~EyLSlg~D~~pvl~GRTplq~Y~DFM~SFr~~F~~fl--~~g~I~eI~V 432 (702)
T PLN02905 359 PLPHWVAEIGRSNPDIFFTDREGRRNPECLSWGIDKERILRGRTALEVYFDYMRSFRVEFDEFF--EDGVISMVEV 432 (702)
T ss_pred cCCHHHHHhhhcCCCceEecCCCCccCceeeeecccccccCCCCHHHHHHHHHHHHHHHHHHHh--cCCceEEEEe
Confidence 28999975 367775421 0111 12333445555555666665 3467888888
No 25
>TIGR03356 BGL beta-galactosidase.
Probab=97.47 E-value=0.00027 Score=82.45 Aligned_cols=97 Identities=11% Similarity=0.156 Sum_probs=76.3
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC-CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS-PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~-~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
..|+++++.||++|+|++++-|.|...+|+ +|++|.++..-.+++|+.|.++||.+|+--= .=.+|.||.+
T Consensus 54 ~~y~eDi~l~~~~G~~~~R~si~Wsri~p~g~~~~n~~~~~~y~~~i~~l~~~gi~pivtL~--------Hfd~P~~l~~ 125 (427)
T TIGR03356 54 HRYEEDVALMKELGVDAYRFSIAWPRIFPEGTGPVNPKGLDFYDRLVDELLEAGIEPFVTLY--------HWDLPQALED 125 (427)
T ss_pred HhHHHHHHHHHHcCCCeEEcccchhhcccCCCCCcCHHHHHHHHHHHHHHHHcCCeeEEeec--------cCCccHHHHh
Confidence 458899999999999999999999999999 7899988888899999999999999886521 1358999986
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHHh
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRITR 315 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~ 315 (949)
+ .+- .++...++-.+|.+.+++++++
T Consensus 126 ~-gGw----~~~~~~~~f~~ya~~~~~~~~d 151 (427)
T TIGR03356 126 R-GGW----LNRDTAEWFAEYAAVVAERLGD 151 (427)
T ss_pred c-CCC----CChHHHHHHHHHHHHHHHHhCC
Confidence 3 332 2355556666666666666654
No 26
>PF01373 Glyco_hydro_14: Glycosyl hydrolase family 14; InterPro: IPR001554 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 14 GH14 from CAZY comprises enzymes with only one known activity; beta-amylase (3.2.1.2 from EC). A Glu residue has been proposed as a catalytic residue, but it is not known if it is the nucleophile or the proton donor. Beta-amylase [, ] is an enzyme that hydrolyses 1,4-alpha-glucosidic linkages in starch-type polysaccharide substrates so as to remove successive maltose units from the non-reducing ends of the chains. Beta-amylase is present in certain bacteria as well as in plants. Three highly conserved sequence regions are found in all known beta-amylases. The first of these regions is located in the N-terminal section of the enzymes and contains an aspartate which is known [] to be involved in the catalytic mechanism. The second, located in a more central location, is centred around a glutamate which is also involved [] in the catalytic mechanism. The 3D structure of a complex of soybean beta-amylase with an inhibitor (alpha-cyclodextrin) has been determined to 3.0A resolution by X-ray diffraction []. The enzyme folds into large and small domains: the large domain has a (beta alpha)8 super-secondary structural core, while the smaller is formed from two long loops extending from the beta-3 and beta-4 strands of the (beta alpha)8 fold []. The interface of the two domains, together with shorter loops from the (beta alpha)8 core, form a deep cleft, in which the inhibitor binds []. Two maltose molecules also bind in the cleft, one sharing a binding site with alpha-cyclodextrin, and the other sitting more deeply in the cleft [].; GO: 0016161 beta-amylase activity, 0000272 polysaccharide catabolic process; PDB: 1FA2_A 2DQX_A 1WDP_A 1UKP_C 1BYC_A 1BYA_A 1Q6C_A 1V3I_A 1BTC_A 1BYB_A ....
Probab=97.36 E-value=0.00022 Score=81.76 Aligned_cols=75 Identities=25% Similarity=0.443 Sum_probs=53.7
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCC-CCeeeccCchhHHHHHHHHHHCCceEEecCCccccc----ccCCCCcccc
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPS-PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICA----ERDFGGLPSW 281 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~-~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcA----Ew~~GGlP~W 281 (949)
.-+..|+++|++|+..|.+.|.|.+.|.+ |++|||++ -.++++++++.||++.+-..=--|+ ..-+=-||.|
T Consensus 17 ~~~~~L~~LK~~GV~GVmvdvWWGiVE~~~p~~ydWs~---Y~~l~~~vr~~GLk~~~vmsfH~cGgNvgD~~~IpLP~W 93 (402)
T PF01373_consen 17 ALEAQLRALKSAGVDGVMVDVWWGIVEGEGPQQYDWSG---YRELFEMVRDAGLKLQVVMSFHQCGGNVGDDCNIPLPSW 93 (402)
T ss_dssp HHHHHHHHHHHTTEEEEEEEEEHHHHTGSSTTB---HH---HHHHHHHHHHTT-EEEEEEE-S-BSSSTTSSSEB-S-HH
T ss_pred HHHHHHHHHHHcCCcEEEEEeEeeeeccCCCCccCcHH---HHHHHHHHHHcCCeEEEEEeeecCCCCCCCccCCcCCHH
Confidence 56788999999999999999999999997 99999999 8899999999999985432222331 1111138999
Q ss_pred ccc
Q psy12907 282 LMT 284 (949)
Q Consensus 282 L~~ 284 (949)
+..
T Consensus 94 v~~ 96 (402)
T PF01373_consen 94 VWE 96 (402)
T ss_dssp HHH
T ss_pred HHh
Confidence 974
No 27
>COG1874 LacA Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=97.30 E-value=4.4e-05 Score=92.54 Aligned_cols=147 Identities=12% Similarity=0.024 Sum_probs=89.5
Q ss_pred chhhhcccCCccC-CCcEEEEeeccCCCCCCCCHHHHHHHHHHHHHhCCCeeEEEeeCCCCCC-ccccCCCC--ceeecc
Q psy12907 16 WTNLQKTRPTTYA-PGEIFPFQVENEMGSYTCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVG-YLKCTVPG--VYATVD 91 (949)
Q Consensus 16 ~~~~~~iap~qit-GGPIImvQVENEYGsyg~D~~YM~~L~d~~r~~Gi~~VPL~T~Dg~~~~-~~~G~lpg--v~~tvn 91 (949)
..+.++|+++.++ |++||++|++||||+++|+.+|.+.+...+.+.+-. .+-..+.++.. +.+.+... ...+.+
T Consensus 135 ~~i~~~irer~~~~~~~v~~w~~dneY~~~~~~~~~~~~~f~~wLk~~yg--~l~~ln~~w~t~~ws~t~~~~~~i~~p~ 212 (673)
T COG1874 135 DRILQQIRERLYGNGPAVITWQNDNEYGGHPCYCDYCQAAFRLWLKKGYG--SLDNLNEAWGTSFWSHTYKDFDEIMSPN 212 (673)
T ss_pred HHHHHHHHHHHhccCCceeEEEccCccCCccccccccHHHHHHHHHhCcc--hHHhhhhhhhhhhcccccccHHhhcCCC
Confidence 3477889999988 899999999999999888888888887766555543 23333332221 21222221 111122
Q ss_pred -CCCCCCHHHHHHHhhhcCCCC----CceeeeeCCccc-cccCCCCccCC-hHHHHHHHHHHHHcccceeeeccccceee
Q psy12907 92 -FGAATNVSSAFAAMRTVSPHG----PLVNSEYYPGWL-THWGEKIATVD-TDPVVKTLQIMLEMKANVNFYIPFSPFFF 164 (949)
Q Consensus 92 -fg~~~~~~~~~~~l~~~~P~~----P~m~~Ef~~GWF-D~WG~~~~~~~-~~~~~~~l~~~L~~GaS~NlYM~h~~~~~ 164 (949)
++.. .....+-..+.+...+ +....|.+-+|| +.|..++.... .+.-...+++.+....+-|+||+|.+..+
T Consensus 213 ~~~e~-~~~~~~ld~~~f~~e~~~~~~~~~~~~~~~~~P~~pvt~nl~~~~~~~~~~~~~~~ld~~swdny~~~~~~~~~ 291 (673)
T COG1874 213 PFGEL-PLPGLYLDYRRFESEQILEFVREEGEAIKAYFPNRPVTPNLLAAFKKFDAYKWEKVLDFASWDNYPAWHRGRDF 291 (673)
T ss_pred Ccccc-CCccchhhHhhhhhhhhHHHHHHHHHHHHHhCCCCCCChhHhhhhhhcchHHHHHhcChhhhhhhhhhccccch
Confidence 2220 0011112223344444 777888888888 88887655444 34444556777777777899999966544
Q ss_pred c
Q psy12907 165 K 165 (949)
Q Consensus 165 ~ 165 (949)
.
T Consensus 292 ~ 292 (673)
T COG1874 292 T 292 (673)
T ss_pred h
Confidence 4
No 28
>COG3693 XynA Beta-1,4-xylanase [Carbohydrate transport and metabolism]
Probab=96.99 E-value=0.011 Score=66.09 Aligned_cols=174 Identities=17% Similarity=0.203 Sum_probs=120.9
Q ss_pred HHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCCCeeeecC
Q psy12907 215 LRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSK 294 (949)
Q Consensus 215 ~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~ 294 (949)
.|+.+.=|-+.-.=|+..||++|.|+|+. -++..+.|+++||.+- -=+.| |-+ -.|.|+...+ -+
T Consensus 55 ~re~n~iTpenemKwe~i~p~~G~f~Fe~---AD~ia~FAr~h~m~lh--GHtLv---W~~-q~P~W~~~~e------~~ 119 (345)
T COG3693 55 ARECNQITPENEMKWEAIEPERGRFNFEA---ADAIANFARKHNMPLH--GHTLV---WHS-QVPDWLFGDE------LS 119 (345)
T ss_pred HhhhcccccccccccccccCCCCccCccc---hHHHHHHHHHcCCeec--cceee---ecc-cCCchhhccc------cC
Confidence 44444445556667999999999999997 6789999999999862 11112 444 7899997621 34
Q ss_pred CHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccC----Cc--------cchHHHHHHHHHHHHHhcCCeeEEEe
Q psy12907 295 DEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMG----SY--------TCDKEHMIWLRDQMKYYVRDAAVLYT 362 (949)
Q Consensus 295 dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG----~y--------~~d~~Y~~~L~~~~~~~lG~~vplft 362 (949)
-++..+.+++++..++.+.+. -|+.|-|=||-= ++ ....+|+.+.-+.+|+. .-+..||-
T Consensus 120 ~~~~~~~~e~hI~tV~~rYkg-------~~~sWDVVNE~vdd~g~~R~s~w~~~~~gpd~I~~aF~~Area-dP~AkL~~ 191 (345)
T COG3693 120 KEALAKMVEEHIKTVVGRYKG-------SVASWDVVNEAVDDQGSLRRSAWYDGGTGPDYIKLAFHIAREA-DPDAKLVI 191 (345)
T ss_pred hHHHHHHHHHHHHHHHHhccC-------ceeEEEecccccCCCchhhhhhhhccCCccHHHHHHHHHHHhh-CCCceEEe
Confidence 578899999999998888763 488999999952 21 34579999999999986 88999999
Q ss_pred eCCCCc-------------ccccc---ccCCeeeeecCCCC-CChHHHHHHhhhcCC-CCCeeeeec
Q psy12907 363 TDGGGV-------------GYLKC---TVPGVYATVDFGAA-TNVSSAFAAMRTVSP-HGPLVNSEY 411 (949)
Q Consensus 363 ~Dg~~~-------------~~l~~---~~~g~~~t~nfg~~-~~~~~~f~~~~~~~p-~~P~~~~Ef 411 (949)
+|.... ..|.. .++|+-..++|+.+ ......+..+.++.. +-|.+++|+
T Consensus 192 NDY~ie~~~~kr~~~~nlI~~LkekG~pIDgiG~QsH~~~~~~~~~~~~~a~~~~~k~Gl~i~VTEL 258 (345)
T COG3693 192 NDYSIEGNPAKRNYVLNLIEELKEKGAPIDGIGIQSHFSGDGPSIEKMRAALLKFSKLGLPIYVTEL 258 (345)
T ss_pred ecccccCChHHHHHHHHHHHHHHHCCCCccceeeeeeecCCCCCHHHHHHHHHHHhhcCCCceEEEe
Confidence 997421 01111 23455555555432 234555555555533 569999995
No 29
>PF00331 Glyco_hydro_10: Glycosyl hydrolase family 10; InterPro: IPR001000 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 10 GH10 from CAZY comprises enzymes with a number of known activities; xylanase (3.2.1.8 from EC); endo-1,3-beta-xylanase (3.2.1.32 from EC); cellobiohydrolase (3.2.1.91 from EC). These enzymes were formerly known as cellulase family F. The microbial degradation of cellulose and xylans requires several types of enzymes such as endoglucanases (3.2.1.4 from EC), cellobiohydrolases (3.2.1.91 from EC) (exoglucanases), or xylanases (3.2.1.8 from EC) [, ]. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family F [] or as the glycosyl hydrolases family 10 []. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1UQZ_A 1UQY_A 1UR2_A 1UR1_A 2CNC_A 1OD8_A 1E0W_A 1E0V_A 1V0M_A 1E0X_B ....
Probab=96.70 E-value=0.0065 Score=68.41 Aligned_cols=199 Identities=19% Similarity=0.208 Sum_probs=124.5
Q ss_pred EeccccCCCCCChhhHHHHHHHHHHcCCCEEEEe--ecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccc
Q psy12907 193 YVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTY--VEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYIC 270 (949)
Q Consensus 193 i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tY--v~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIc 270 (949)
.++.+++..++..+. ..+.+-..-||.|..- .-|...||++|+|+|+. .+++++.|+++||.|--- |
T Consensus 11 ~~G~av~~~~~~~~~---~~~~~~~~~Fn~~t~eN~~Kw~~~e~~~g~~~~~~---~D~~~~~a~~~g~~vrGH-----~ 79 (320)
T PF00331_consen 11 PFGAAVNAQQLEDDP---RYRELFAKHFNSVTPENEMKWGSIEPEPGRFNFES---ADAILDWARENGIKVRGH-----T 79 (320)
T ss_dssp EEEEEEBGGGHTHHH---HHHHHHHHH-SEEEESSTTSHHHHESBTTBEE-HH---HHHHHHHHHHTT-EEEEE-----E
T ss_pred CEEEEechhHcCCcH---HHHHHHHHhCCeeeeccccchhhhcCCCCccCccc---hhHHHHHHHhcCcceeee-----e
Confidence 567778777776542 3444445668988764 66999999999999974 999999999999998422 1
Q ss_pred cccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCc--------------
Q psy12907 271 AERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSY-------------- 336 (949)
Q Consensus 271 AEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y-------------- 336 (949)
-=|.. ..|.|+... +... ....+...+++++++++++.+.+.. |.|.+|=|=||-=.-
T Consensus 80 LvW~~-~~P~w~~~~-~~~~-~~~~~~~~~~l~~~I~~v~~~y~~~-----g~i~~WDVvNE~i~~~~~~~~~r~~~~~~ 151 (320)
T PF00331_consen 80 LVWHS-QTPDWVFNL-ANGS-PDEKEELRARLENHIKTVVTRYKDK-----GRIYAWDVVNEAIDDDGNPGGLRDSPWYD 151 (320)
T ss_dssp EEESS-SS-HHHHTS-TTSS-BHHHHHHHHHHHHHHHHHHHHTTTT-----TTESEEEEEES-B-TTSSSSSBCTSHHHH
T ss_pred EEEcc-cccceeeec-cCCC-cccHHHHHHHHHHHHHHHHhHhccc-----cceEEEEEeeecccCCCccccccCChhhh
Confidence 11544 789999872 1000 0001237788888888877776532 789999999985211
Q ss_pred cchHHHHHHHHHHHHHhcCCeeEEEeeCCCCccc------------ccc---ccCCeeeeecCCCCCChHHHHHHhhhcC
Q psy12907 337 TCDKEHMIWLRDQMKYYVRDAAVLYTTDGGGVGY------------LKC---TVPGVYATVDFGAATNVSSAFAAMRTVS 401 (949)
Q Consensus 337 ~~d~~Y~~~L~~~~~~~lG~~vplft~Dg~~~~~------------l~~---~~~g~~~t~nfg~~~~~~~~f~~~~~~~ 401 (949)
....+|+...-+.+++. -.++.||-+|...... +.. .++|+--..++..+....+..+.++++.
T Consensus 152 ~lG~~yi~~aF~~A~~~-~P~a~L~~NDy~~~~~~k~~~~~~lv~~l~~~gvpIdgIG~Q~H~~~~~~~~~i~~~l~~~~ 230 (320)
T PF00331_consen 152 ALGPDYIADAFRAAREA-DPNAKLFYNDYNIESPAKRDAYLNLVKDLKARGVPIDGIGLQSHFDAGYPPEQIWNALDRFA 230 (320)
T ss_dssp HHTTCHHHHHHHHHHHH-HTTSEEEEEESSTTSTHHHHHHHHHHHHHHHTTHCS-EEEEEEEEETTSSHHHHHHHHHHHH
T ss_pred cccHhHHHHHHHHHHHh-CCCcEEEeccccccchHHHHHHHHHHHHHHhCCCccceechhhccCCCCCHHHHHHHHHHHH
Confidence 12357888888888877 6688999998753322 001 1344433334433333334444455442
Q ss_pred -CCCCeeeeec
Q psy12907 402 -PHGPLVNSEY 411 (949)
Q Consensus 402 -p~~P~~~~Ef 411 (949)
-+-|..+||+
T Consensus 231 ~~Gl~i~ITEl 241 (320)
T PF00331_consen 231 SLGLPIHITEL 241 (320)
T ss_dssp TTTSEEEEEEE
T ss_pred HcCCceEEEee
Confidence 2579999997
No 30
>COG2730 BglC Endoglucanase [Carbohydrate transport and metabolism]
Probab=96.33 E-value=0.015 Score=67.51 Aligned_cols=114 Identities=18% Similarity=0.189 Sum_probs=69.9
Q ss_pred hhhH-----HHHHHHHHHcCCCEEEEeecCCccCCC----CCeeeccCchhHHHHHHHHHHCCceEEec----CCccccc
Q psy12907 205 RYYW-----QDRLRKLRAAGLNAVSTYVEWSFHEPS----PGQFQFTGDQDLEYFLELAVREDLLVLLR----PGPYICA 271 (949)
Q Consensus 205 ~~~W-----~d~L~k~Ka~GlNtV~tYv~Wn~HEp~----~G~fdF~g~~DL~~fl~la~e~GL~VILR----pGPYIcA 271 (949)
...| ++.+..||.+|||+|++++.|..+++. |...+=+-..-|++.|+-|++.||+|++- ||.-.|-
T Consensus 67 ~~~w~~~~~~~~~~~ik~~G~n~VRiPi~~~~~~~~~~~~p~~~~~~~~~~ld~~I~~a~~~gi~V~iD~H~~~~~~~~~ 146 (407)
T COG2730 67 ESHWGNFITEEDFDQIKSAGFNAVRIPIGYWALQATDGDNPYLIGLTQLKILDEAINWAKKLGIYVLIDLHGYPGGNNGH 146 (407)
T ss_pred hhccchhhhhhHHHHHHHcCCcEEEcccchhhhhccCCCCCCeecchHHHHHHHHHHHHHhcCeeEEEEecccCCCCCCc
Confidence 5567 899999999999999999994444553 33331111126889999999999999987 3332321
Q ss_pred ccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCC
Q psy12907 272 ERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGS 335 (949)
Q Consensus 272 Ew~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~ 335 (949)
| ..|....++. .+.....+.++.+.|+.++.. .-.||++|+=||.-.
T Consensus 147 ~------~s~~~~~~~~----------~~~~~~~~~~~w~~ia~~f~~-~~~VIg~~~~NEP~~ 193 (407)
T COG2730 147 E------HSGYTSDYKE----------ENENVEATIDIWKFIANRFKN-YDTVIGFELINEPNG 193 (407)
T ss_pred C------cccccccccc----------cchhHHHHHHHHHHHHHhccC-CCceeeeeeecCCcc
Confidence 1 1222221111 112222333344445544322 458999999999863
No 31
>PF13364 BetaGal_dom4_5: Beta-galactosidase jelly roll domain; PDB: 1TG7_A 1XC6_A 3OGS_A 3OGV_A 3OGR_A 3OG2_A.
Probab=96.31 E-value=0.015 Score=55.80 Aligned_cols=79 Identities=20% Similarity=0.237 Sum_probs=52.0
Q ss_pred CCcccccCCCceeEEEEEecCCCCCCCCcc-ccc-CCcceEEEEEECCEEEEEEE-ec-cee--eeecc-CCC-CCeEEE
Q psy12907 510 PLSFEALDQSFGFVLYETIIPDARFPDPAL-LTI-SGLRDRGQVFVDEKLVTILY-RN-KML--STPIM-ARP-GQKLSI 581 (949)
Q Consensus 510 P~tmE~lgQ~~GyvlY~t~i~~~~~~~~~~-L~i-~~vrDrA~Vfvdg~~vG~l~-r~-~~~--~i~~~-~~~-~~~LdI 581 (949)
.+.+-+.|...|+++||+++........+. |.+ .+-.-|++|||||+++|.-. .. .+. ++|.. .+. .+.|.+
T Consensus 24 ~l~~~~~g~~~g~~~Yrg~F~~~~~~~~~~~l~~~~g~~~~~~vwVNG~~~G~~~~~~g~q~tf~~p~~il~~~n~v~~v 103 (111)
T PF13364_consen 24 VLYASDYGFHAGYLWYRGTFTGTGQDTSLTPLNIQGGNAFRASVWVNGWFLGSYWPGIGPQTTFSVPAGILKYGNNVLVV 103 (111)
T ss_dssp STCCGCGTSSSCEEEEEEEEETTTEEEEEE-EEECSSTTEEEEEEETTEEEEEEETTTECCEEEEE-BTTBTTCEEEEEE
T ss_pred eeccCccccCCCCEEEEEEEeCCCcceeEEEEeccCCCceEEEEEECCEEeeeecCCCCccEEEEeCceeecCCCEEEEE
Confidence 367778888999999999995422112232 233 36788999999999999976 22 223 44432 122 246899
Q ss_pred EEEeCCC
Q psy12907 582 LVENMGR 588 (949)
Q Consensus 582 LVENmGR 588 (949)
|+.+||.
T Consensus 104 l~~~~g~ 110 (111)
T PF13364_consen 104 LWDNMGH 110 (111)
T ss_dssp EEE-STT
T ss_pred EEeCCCC
Confidence 9999995
No 32
>PF00232 Glyco_hydro_1: Glycosyl hydrolase family 1; InterPro: IPR001360 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 1 GH1 from CAZY comprises enzymes with a number of known activities; beta-glucosidase (3.2.1.21 from EC); beta-galactosidase (3.2.1.23 from EC); 6-phospho-beta-galactosidase (3.2.1.85 from EC); 6-phospho-beta-glucosidase (3.2.1.86 from EC); lactase-phlorizin hydrolase (3.2.1.62 from EC), (3.2.1.108 from EC); beta-mannosidase (3.2.1.25 from EC); myrosinase (3.2.1.147 from EC). ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1QVB_A 3AHY_D 2E9L_A 2ZOX_A 2JFE_X 2E9M_A 3FIZ_A 3FIY_A 3CMJ_A 3FJ0_A ....
Probab=96.10 E-value=0.0078 Score=70.82 Aligned_cols=98 Identities=14% Similarity=0.232 Sum_probs=72.8
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC--CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCcccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS--PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLM 283 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~--~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~ 283 (949)
..|+++++.||++|+|+.++-+.|...+|. +|++|-+|..--+++|+.+.++||..|+- --.-.+|.||.
T Consensus 58 ~~y~eDi~l~~~lg~~~yRfsi~W~Ri~P~g~~g~~n~~~~~~Y~~~i~~l~~~gi~P~vt--------L~H~~~P~~l~ 129 (455)
T PF00232_consen 58 HRYKEDIALMKELGVNAYRFSISWSRIFPDGFEGKVNEEGLDFYRDLIDELLENGIEPIVT--------LYHFDLPLWLE 129 (455)
T ss_dssp HHHHHHHHHHHHHT-SEEEEE--HHHHSTTSSSSSS-HHHHHHHHHHHHHHHHTT-EEEEE--------EESS--BHHHH
T ss_pred hhhhHHHHHHHhhccceeeeecchhheeecccccccCHhHhhhhHHHHHHHHhhccceeee--------eeeccccccee
Confidence 469999999999999999999999999999 69999999888899999999999998765 22457899997
Q ss_pred ccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 284 TIKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 284 ~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
+ .-+- .++...+.-.+|.+.+++++.++
T Consensus 130 ~-~ggw----~~~~~~~~F~~Ya~~~~~~~gd~ 157 (455)
T PF00232_consen 130 D-YGGW----LNRETVDWFARYAEFVFERFGDR 157 (455)
T ss_dssp H-HTGG----GSTHHHHHHHHHHHHHHHHHTTT
T ss_pred e-cccc----cCHHHHHHHHHHHHHHHHHhCCC
Confidence 6 3221 24556666677777777777653
No 33
>PRK09852 cryptic 6-phospho-beta-glucosidase; Provisional
Probab=95.87 E-value=0.0076 Score=71.40 Aligned_cols=98 Identities=12% Similarity=0.136 Sum_probs=71.7
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC--CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCcccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS--PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLM 283 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~--~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~ 283 (949)
..|+++++.||++|+|+.++-+.|...+|. +++++=+|..-.+++|+.+.++||.+++-- -.=.+|.||.
T Consensus 71 hry~eDi~l~~~lG~~~yR~si~WsRi~P~g~~~~~n~~~~~~Y~~~i~~l~~~gi~p~VtL--------~H~~~P~~l~ 142 (474)
T PRK09852 71 HRYKEDIALMAEMGFKVFRTSIAWSRLFPQGDELTPNQQGIAFYRSVFEECKKYGIEPLVTL--------CHFDVPMHLV 142 (474)
T ss_pred hhhHHHHHHHHHcCCCeEEeeceeeeeeeCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEe--------eCCCCCHHHH
Confidence 347999999999999999999999999997 556777777778999999999999987552 1236899997
Q ss_pred ccCCCeeeecCCHhHHHHHHHHHHHHHHHHHh
Q psy12907 284 TIKPNITLRSKDEVYQHYVNKWFAQLFPRITR 315 (949)
Q Consensus 284 ~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~ 315 (949)
.++-+- .++...++-.+|.+.+++++.+
T Consensus 143 ~~~GGW----~~~~~~~~F~~ya~~~~~~fgd 170 (474)
T PRK09852 143 TEYGSW----RNRKMVEFFSRYARTCFEAFDG 170 (474)
T ss_pred HhcCCC----CCHHHHHHHHHHHHHHHHHhcC
Confidence 632322 2344444445555555555543
No 34
>PF14488 DUF4434: Domain of unknown function (DUF4434)
Probab=95.83 E-value=0.17 Score=52.15 Aligned_cols=127 Identities=17% Similarity=0.202 Sum_probs=75.3
Q ss_pred CCCChhhHHHHHHHHHHcCCCEEEE----e-----ecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCccccc
Q psy12907 201 FRSPRYYWQDRLRKLRAAGLNAVST----Y-----VEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICA 271 (949)
Q Consensus 201 ~R~p~~~W~d~L~k~Ka~GlNtV~t----Y-----v~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcA 271 (949)
-.+.++.|+..++.||++|++||=. | -+..+ .++.|.-....-|+.+|++|++.||+|.+.-+ -
T Consensus 15 ~~~~~~~W~~~~~~m~~~GidtlIlq~~~~~~~~~yps~~---~~~~~~~~~~d~l~~~L~~A~~~Gmkv~~Gl~--~-- 87 (166)
T PF14488_consen 15 QNWTPAQWREEFRAMKAIGIDTLILQWTGYGGFAFYPSKL---SPGGFYMPPVDLLEMILDAADKYGMKVFVGLY--F-- 87 (166)
T ss_pred cCCCHHHHHHHHHHHHHcCCcEEEEEEeecCCcccCCccc---cCccccCCcccHHHHHHHHHHHcCCEEEEeCC--C--
Confidence 5899999999999999999999832 1 11222 12223323344699999999999999988742 1
Q ss_pred ccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCc-cchHHHHHHHHHHH
Q psy12907 272 ERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSY-TCDKEHMIWLRDQM 350 (949)
Q Consensus 272 Ew~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y-~~d~~Y~~~L~~~~ 350 (949)
-|.|..+ .|+.... .+-+.++..+....-.+ ...=++=|=.|...+ -...++.+.|.+.+
T Consensus 88 ------~~~~w~~---------~~~~~~~---~~~~~v~~el~~~yg~h-~sf~GWYip~E~~~~~~~~~~~~~~l~~~l 148 (166)
T PF14488_consen 88 ------DPDYWDQ---------GDLDWEA---ERNKQVADELWQRYGHH-PSFYGWYIPYEIDDYNWNAPERFALLGKYL 148 (166)
T ss_pred ------Cchhhhc---------cCHHHHH---HHHHHHHHHHHHHHcCC-CCCceEEEecccCCcccchHHHHHHHHHHH
Confidence 1233321 2222211 11122555555433222 255567777777655 33566677777776
Q ss_pred HHh
Q psy12907 351 KYY 353 (949)
Q Consensus 351 ~~~ 353 (949)
++.
T Consensus 149 k~~ 151 (166)
T PF14488_consen 149 KQI 151 (166)
T ss_pred HHh
Confidence 654
No 35
>PRK15014 6-phospho-beta-glucosidase BglA; Provisional
Probab=95.54 E-value=0.016 Score=68.90 Aligned_cols=97 Identities=10% Similarity=0.199 Sum_probs=73.2
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCC--CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPS--PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~--~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
.|+++++.||++|+|+-++-|.|....|. +|+++=+|..-.+++|+.+.++||..++-- -.=.+|.||..
T Consensus 70 ry~EDI~Lm~elG~~~yRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lid~l~~~GI~P~vTL--------~H~dlP~~L~~ 141 (477)
T PRK15014 70 HYKEDIKLFAEMGFKCFRTSIAWTRIFPKGDEAQPNEEGLKFYDDMFDELLKYNIEPVITL--------SHFEMPLHLVQ 141 (477)
T ss_pred ccHHHHHHHHHcCCCEEEecccceeeccCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEe--------eCCCCCHHHHH
Confidence 48999999999999999999999999997 567788887888999999999999987652 12268999976
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHHh
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRITR 315 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~ 315 (949)
++-+- .++...++-.+|.+.+++++..
T Consensus 142 ~yGGW----~n~~~~~~F~~Ya~~~f~~fgd 168 (477)
T PRK15014 142 QYGSW----TNRKVVDFFVRFAEVVFERYKH 168 (477)
T ss_pred hcCCC----CChHHHHHHHHHHHHHHHHhcC
Confidence 33332 2444455555555556665544
No 36
>PRK00870 haloalkane dehalogenase; Provisional
Probab=95.49 E-value=0.0068 Score=66.20 Aligned_cols=70 Identities=20% Similarity=0.343 Sum_probs=58.4
Q ss_pred cccccCCCCccCCCeEECCCCCCchhhhhhhhhhcccccccccc------------------------------------
Q psy12907 601 WKVYDCGPKHIRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------ 644 (949)
Q Consensus 601 I~~~~~g~~~~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------ 644 (949)
|.|.+.|..+ ..+|+|+||..+.+..|..++-.|+++|||||.
T Consensus 36 i~y~~~G~~~-~~~lvliHG~~~~~~~w~~~~~~L~~~gy~vi~~Dl~G~G~S~~~~~~~~~~~~~~a~~l~~~l~~l~~ 114 (302)
T PRK00870 36 MHYVDEGPAD-GPPVLLLHGEPSWSYLYRKMIPILAAAGHRVIAPDLIGFGRSDKPTRREDYTYARHVEWMRSWFEQLDL 114 (302)
T ss_pred EEEEecCCCC-CCEEEEECCCCCchhhHHHHHHHHHhCCCEEEEECCCCCCCCCCCCCcccCCHHHHHHHHHHHHHHcCC
Confidence 5566666533 347999999999999999999889889999995
Q ss_pred --------cCCcchhhcccccccccCceeeccccccc
Q psy12907 645 --------YLGGFLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 645 --------~lg~~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
|+||.+|+.++...+ +.|.+|+||++.
T Consensus 115 ~~v~lvGhS~Gg~ia~~~a~~~p--~~v~~lvl~~~~ 149 (302)
T PRK00870 115 TDVTLVCQDWGGLIGLRLAAEHP--DRFARLVVANTG 149 (302)
T ss_pred CCEEEEEEChHHHHHHHHHHhCh--hheeEEEEeCCC
Confidence 799999999998776 689999999875
No 37
>PLN02998 beta-glucosidase
Probab=95.47 E-value=0.016 Score=69.16 Aligned_cols=104 Identities=15% Similarity=0.213 Sum_probs=75.2
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC-CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS-PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~-~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
..|+++++.||+||+|+-++-|.|.-.+|+ .|.+|=+|..--+++|+.+.++||..++--= =| .+|.||..
T Consensus 82 hry~EDi~lmk~lG~~~YRfSIsWsRI~P~G~g~vN~~gl~~Y~~lid~L~~~GIeP~VTL~-----H~---dlP~~L~~ 153 (497)
T PLN02998 82 HKYKEDVKLMADMGLEAYRFSISWSRLLPSGRGPINPKGLQYYNNLIDELITHGIQPHVTLH-----HF---DLPQALED 153 (497)
T ss_pred HhhHHHHHHHHHcCCCeEEeeccHHhcCcCCCCCcCHHHHHHHHHHHHHHHHcCCceEEEec-----CC---CCCHHHHH
Confidence 358999999999999999999999999997 6778888888889999999999998765420 14 47999976
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHHhcc
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRITRFL 317 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~ 317 (949)
++-+-.=|..-..|.++++.-++++..+++-+.
T Consensus 154 ~yGGW~n~~~v~~F~~YA~~~~~~fgdrVk~Wi 186 (497)
T PLN02998 154 EYGGWLSQEIVRDFTAYADTCFKEFGDRVSHWT 186 (497)
T ss_pred hhCCcCCchHHHHHHHHHHHHHHHhcCcCCEEE
Confidence 333322233334556655555555444444333
No 38
>PLN02814 beta-glucosidase
Probab=95.24 E-value=0.02 Score=68.35 Aligned_cols=101 Identities=13% Similarity=0.135 Sum_probs=72.7
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC-CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS-PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~-~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
..|+++++.||+||+|+-++-|.|.-.+|. +|.++-+|..--+++|+.+.++|+..++--= =| -+|.||.+
T Consensus 77 hry~EDI~L~k~lG~~ayRfSIsWsRI~P~G~g~~N~~Gl~fY~~lId~l~~~GI~P~VTL~-----H~---dlP~~L~~ 148 (504)
T PLN02814 77 HKYKEDVKLMAEMGLESFRFSISWSRLIPNGRGLINPKGLLFYKNLIKELRSHGIEPHVTLY-----HY---DLPQSLED 148 (504)
T ss_pred HhhHHHHHHHHHcCCCEEEEeccHhhcCcCCCCCCCHHHHHHHHHHHHHHHHcCCceEEEec-----CC---CCCHHHHH
Confidence 458999999999999999999999999996 6788888888889999999999998765420 24 47999986
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHH
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRIT 314 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~ 314 (949)
++-+-.-|..-..|.++++.-++++..+++
T Consensus 149 ~yGGW~n~~~i~~F~~YA~~~f~~fgdrVk 178 (504)
T PLN02814 149 EYGGWINRKIIEDFTAFADVCFREFGEDVK 178 (504)
T ss_pred hcCCcCChhHHHHHHHHHHHHHHHhCCcCC
Confidence 333322223334455555444444444333
No 39
>PRK09593 arb 6-phospho-beta-glucosidase; Reviewed
Probab=94.91 E-value=0.032 Score=66.29 Aligned_cols=103 Identities=12% Similarity=0.111 Sum_probs=73.6
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC--CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCcccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS--PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLM 283 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~--~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~ 283 (949)
..|+++++.||+||+|+-++-|.|.-.+|. +|+++=+|..--+++|+.+.++||..++-- - =| .+|.||.
T Consensus 73 hry~eDi~Lm~~lG~~aYRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lId~L~~~GI~P~VTL---~--H~---dlP~~L~ 144 (478)
T PRK09593 73 HHYKEDIALFAEMGFKTYRMSIAWTRIFPKGDELEPNEAGLQFYEDIFKECHKYGIEPLVTI---T--HF---DCPMHLI 144 (478)
T ss_pred HhhHHHHHHHHHcCCCEEEEecchhhcccCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEe---c--cc---CCCHHHH
Confidence 458999999999999999999999999997 667777787778899999999999876541 0 13 5899997
Q ss_pred ccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 284 TIKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 284 ~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
.++-+-.=|..-..|.++++.-++++..+++-.
T Consensus 145 ~~~GGW~n~~~v~~F~~YA~~~~~~fgdrVk~W 177 (478)
T PRK09593 145 EEYGGWRNRKMVGFYERLCRTLFTRYKGLVKYW 177 (478)
T ss_pred hhcCCCCChHHHHHHHHHHHHHHHHhcCcCCEE
Confidence 633333223333455555555444444444433
No 40
>PRK09589 celA 6-phospho-beta-glucosidase; Reviewed
Probab=94.83 E-value=0.032 Score=66.32 Aligned_cols=103 Identities=11% Similarity=0.180 Sum_probs=73.4
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC--CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCcccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS--PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLM 283 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~--~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~ 283 (949)
..|+++++.||+||+|+-++-|.|.-.+|. +|+++=+|..--+++|+.+.++||..++--= =| -+|.||.
T Consensus 67 hry~eDi~Lm~~lG~~~yRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lid~L~~~GI~P~VTL~-----H~---dlP~~L~ 138 (476)
T PRK09589 67 HRYKEDIALFAEMGFKCFRTSIAWTRIFPQGDELEPNEEGLQFYDDLFDECLKQGIEPVVTLS-----HF---EMPYHLV 138 (476)
T ss_pred HhhHHHHHHHHHcCCCEEEeccchhhcCcCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEec-----CC---CCCHHHH
Confidence 458999999999999999999999999997 5667777877788999999999998765520 13 5899997
Q ss_pred ccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 284 TIKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 284 ~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
.++-+-.-|..-..|.++++.-++++..+++-.
T Consensus 139 ~~yGGW~n~~~i~~F~~YA~~~f~~fgdrVk~W 171 (476)
T PRK09589 139 TEYGGWRNRKLIDFFVRFAEVVFTRYKDKVKYW 171 (476)
T ss_pred HhcCCcCChHHHHHHHHHHHHHHHHhcCCCCEE
Confidence 643333223333455555555444444444433
No 41
>PF07745 Glyco_hydro_53: Glycosyl hydrolase family 53; InterPro: IPR011683 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This domain is found in family 53 of the glycosyl hydrolase classification []. These enzymes are endo-1,4- beta-galactanases (3.2.1.89 from EC). The structure of this domain is known [] and has a TIM barrel fold.; GO: 0015926 glucosidase activity; PDB: 1HJQ_A 1HJS_A 1HJU_B 1FHL_A 1FOB_A 2GFT_A 1UR4_B 1UR0_A 1R8L_B 2CCR_A ....
Probab=94.80 E-value=0.7 Score=52.73 Aligned_cols=103 Identities=24% Similarity=0.347 Sum_probs=63.4
Q ss_pred HHHHHHHHHcCCCEEEEeecCCccCCCC-CeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCC
Q psy12907 209 QDRLRKLRAAGLNAVSTYVEWSFHEPSP-GQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKP 287 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~-G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p 287 (949)
+|.++.+|+.|+|.|+.=| |+ .|.. |..|.+ +..+..+-|+++||.|+|-.- | --.|- +|
T Consensus 27 ~d~~~ilk~~G~N~vRlRv-wv--~P~~~g~~~~~---~~~~~akrak~~Gm~vlldfH-Y---------SD~Wa---DP 87 (332)
T PF07745_consen 27 KDLFQILKDHGVNAVRLRV-WV--NPYDGGYNDLE---DVIALAKRAKAAGMKVLLDFH-Y---------SDFWA---DP 87 (332)
T ss_dssp --HHHHHHHTT--EEEEEE--S--S-TTTTTTSHH---HHHHHHHHHHHTT-EEEEEE--S---------SSS-----BT
T ss_pred CCHHHHHHhcCCCeEEEEe-cc--CCcccccCCHH---HHHHHHHHHHHCCCeEEEeec-c---------cCCCC---CC
Confidence 5789999999999999988 55 4444 544443 456666677889999998742 1 11232 23
Q ss_pred Ceeee-----c-CCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccC
Q psy12907 288 NITLR-----S-KDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMG 334 (949)
Q Consensus 288 ~~~~R-----t-~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG 334 (949)
+-+.. . +-..-.++|..|.+.++..|+. +|=.+=||||-||..
T Consensus 88 g~Q~~P~aW~~~~~~~l~~~v~~yT~~vl~~l~~----~G~~pd~VQVGNEin 136 (332)
T PF07745_consen 88 GKQNKPAAWANLSFDQLAKAVYDYTKDVLQALKA----AGVTPDMVQVGNEIN 136 (332)
T ss_dssp TB-B--TTCTSSSHHHHHHHHHHHHHHHHHHHHH----TT--ESEEEESSSGG
T ss_pred CCCCCCccCCCCCHHHHHHHHHHHHHHHHHHHHH----CCCCccEEEeCcccc
Confidence 22211 1 3456678999999999999975 344566999999974
No 42
>PLN02849 beta-glucosidase
Probab=94.79 E-value=0.031 Score=66.80 Aligned_cols=103 Identities=16% Similarity=0.202 Sum_probs=73.3
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC-CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS-PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~-~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
..|+++++.||+||+|+-++-|.|.-.+|. .|.+|=+|..--+++|+.+.++||.-++--= =| -+|.||.+
T Consensus 79 hrY~eDI~Lm~~lG~~aYRfSIsWsRI~P~G~g~vN~~gl~fY~~lid~l~~~GI~P~VTL~-----H~---dlP~~L~~ 150 (503)
T PLN02849 79 HKYKEDVKLMVETGLDAFRFSISWSRLIPNGRGSVNPKGLQFYKNFIQELVKHGIEPHVTLF-----HY---DHPQYLED 150 (503)
T ss_pred HhHHHHHHHHHHcCCCeEEEeccHHhcCcCCCCCCCHHHHHHHHHHHHHHHHcCCeEEEeec-----CC---CCcHHHHH
Confidence 358999999999999999999999999997 3777878877889999999999998765420 13 47999976
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
++-+-.=|..-..|.++++.-++++..+++-+
T Consensus 151 ~yGGW~nr~~v~~F~~YA~~~f~~fgDrVk~W 182 (503)
T PLN02849 151 DYGGWINRRIIKDFTAYADVCFREFGNHVKFW 182 (503)
T ss_pred hcCCcCCchHHHHHHHHHHHHHHHhcCcCCEE
Confidence 33332223333445555555444444444433
No 43
>COG3623 SgaU Putative L-xylulose-5-phosphate 3-epimerase [Carbohydrate transport and metabolism]
Probab=94.79 E-value=0.51 Score=51.10 Aligned_cols=39 Identities=26% Similarity=0.643 Sum_probs=29.7
Q ss_pred ccCCeeeeecCCCCC-ChHHHHHHhhhcCCCCCeeeeecCC
Q psy12907 374 TVPGVYATVDFGAAT-NVSSAFAAMRTVSPHGPLVNSEYYP 413 (949)
Q Consensus 374 ~~~g~~~t~nfg~~~-~~~~~f~~~~~~~p~~P~~~~Ef~~ 413 (949)
+.||.+-.+.||-++ +-.+.|+.+++.+..+|++ .|.|+
T Consensus 220 ~~~GqFrdvpfGeG~Vdf~~~f~~lk~~ny~gpfL-IEMWs 259 (287)
T COG3623 220 TSPGQFRDVPFGEGCVDFEECFKTLKQLNYRGPFL-IEMWS 259 (287)
T ss_pred cCCCccccCCcCCcchhHHHHHHHHHHhCCCCcee-hhhhh
Confidence 567888888888653 4467899999999999976 36554
No 44
>PRK13511 6-phospho-beta-galactosidase; Provisional
Probab=94.39 E-value=0.046 Score=64.83 Aligned_cols=97 Identities=13% Similarity=0.193 Sum_probs=69.8
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC-CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS-PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~-~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
..|+++++.||++|+|+-++-|.|.-.+|. .|.+|=+|..--+++|+.+.++||..++-- -.=.+|.||.+
T Consensus 54 ~ry~eDi~L~~~lG~~~yRfSIsWsRI~P~G~g~vN~~gl~~Y~~lid~l~~~GI~P~VTL--------~H~dlP~~L~~ 125 (469)
T PRK13511 54 HRYPEDLKLAEEFGVNGIRISIAWSRIFPDGYGEVNPKGVEYYHRLFAECHKRHVEPFVTL--------HHFDTPEALHS 125 (469)
T ss_pred hhhHHHHHHHHHhCCCEEEeeccHhhcCcCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEEe--------cCCCCcHHHHH
Confidence 348999999999999999999999999997 577888888888999999999999876542 12258999986
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHH
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFP 311 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~ 311 (949)
.-+-.=|..-..|.++++.-++++..
T Consensus 126 -~GGW~n~~~v~~F~~YA~~~~~~fgd 151 (469)
T PRK13511 126 -NGDWLNRENIDHFVRYAEFCFEEFPE 151 (469)
T ss_pred -cCCCCCHHHHHHHHHHHHHHHHHhCC
Confidence 32221122223444444444444444
No 45
>TIGR01233 lacG 6-phospho-beta-galactosidase. This enzyme is part of the tagatose-6-phosphate pathway of galactose-6-phosphate degradation.
Probab=94.36 E-value=0.052 Score=64.39 Aligned_cols=96 Identities=15% Similarity=0.168 Sum_probs=70.8
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCC-CCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPS-PGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~-~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
..|+++++.||+||+|+-++-+.|.-.+|. +|+++=+|..--+++|+.+.++||..++--= + | -+|.||.+
T Consensus 53 hry~eDi~L~~~lG~~~yRfSIsWsRI~P~g~~~~N~~gl~~Y~~lid~l~~~GI~P~VTL~---H--~---dlP~~L~~ 124 (467)
T TIGR01233 53 HKYPVDLELAEEYGVNGIRISIAWSRIFPTGYGEVNEKGVEFYHKLFAECHKRHVEPFVTLH---H--F---DTPEALHS 124 (467)
T ss_pred hhHHHHHHHHHHcCCCEEEEecchhhccCCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEecc---C--C---CCcHHHHH
Confidence 348999999999999999999999999996 5777777877888999999999999776521 1 3 48999976
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHH
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLF 310 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~ 310 (949)
..+-.-|..-..|.++++.-++++.
T Consensus 125 -~GGW~n~~~v~~F~~YA~~~f~~fg 149 (467)
T TIGR01233 125 -NGDFLNRENIEHFIDYAAFCFEEFP 149 (467)
T ss_pred -cCCCCCHHHHHHHHHHHHHHHHHhC
Confidence 3332223333445555555444444
No 46
>PF02837 Glyco_hydro_2_N: Glycosyl hydrolases family 2, sugar binding domain; InterPro: IPR006104 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. This domain has a jelly-roll fold [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3FN9_C 2VZU_A 2X09_A 2VZO_A 2X05_A 2VZV_B ....
Probab=94.12 E-value=0.1 Score=52.36 Aligned_cols=72 Identities=22% Similarity=0.302 Sum_probs=50.4
Q ss_pred cCCCceeEEEEEecCCCC--CCCCcccccCCcceEEEEEECCEEEEEEEec-ceeeeecc--CCCC--CeEEEEEEeCC
Q psy12907 516 LDQSFGFVLYETIIPDAR--FPDPALLTISGLRDRGQVFVDEKLVTILYRN-KMLSTPIM--ARPG--QKLSILVENMG 587 (949)
Q Consensus 516 lgQ~~GyvlY~t~i~~~~--~~~~~~L~i~~vrDrA~Vfvdg~~vG~l~r~-~~~~i~~~--~~~~--~~LdILVENmG 587 (949)
.....|...||+++.-+. ....+.|.+.++++.|.|||||+.+|.-.-. ....+.++ .+.+ ++|.|.|-|..
T Consensus 63 ~~~~~~~~wYr~~f~lp~~~~~~~~~L~f~gv~~~a~v~vNG~~vg~~~~~~~~~~~dIt~~l~~g~~N~l~V~v~~~~ 141 (167)
T PF02837_consen 63 LWDYSGYAWYRRTFTLPADWKGKRVFLRFEGVDYAAEVYVNGKLVGSHEGGYTPFEFDITDYLKPGEENTLAVRVDNWP 141 (167)
T ss_dssp TSTCCSEEEEEEEEEESGGGTTSEEEEEESEEESEEEEEETTEEEEEEESTTS-EEEECGGGSSSEEEEEEEEEEESSS
T ss_pred ccccCceEEEEEEEEeCchhcCceEEEEeccceEeeEEEeCCeEEeeeCCCcCCeEEeChhhccCCCCEEEEEEEeecC
Confidence 355678999999874332 3456778999999999999999999996542 33444443 2322 57888888544
No 47
>PF02638 DUF187: Glycosyl hydrolase like GH101; InterPro: IPR003790 This entry describes proteins of unknown function.
Probab=93.74 E-value=0.2 Score=56.44 Aligned_cols=117 Identities=21% Similarity=0.275 Sum_probs=71.7
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEe-------------ecCCccCC-CCCe---eeccCchhHHHHHHHHHHCCceEEecCC
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTY-------------VEWSFHEP-SPGQ---FQFTGDQDLEYFLELAVREDLLVLLRPG 266 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tY-------------v~Wn~HEp-~~G~---fdF~g~~DL~~fl~la~e~GL~VILRpG 266 (949)
.++.-++.|++++++|+|+|-.- .+|.---+ .+|. ||. |+.+|+.|++.||.|..+.-
T Consensus 17 ~~~~~~~~l~~l~~~~~N~V~~qVr~~gda~Y~S~~~p~s~~~~g~~~~~pg~Dp-----L~~~I~eaHkrGlevHAW~~ 91 (311)
T PF02638_consen 17 SKEQIDEMLDDLKSAGFNAVFVQVRPRGDALYPSDIEPWSGYLTGKQGKDPGFDP-----LEFMIEEAHKRGLEVHAWFR 91 (311)
T ss_pred CHHHHHHHHHHHHHcCCCEEEEEEEeCcEEEecccccccccccCCCCCCCCCccH-----HHHHHHHHHHcCCEEEEEEE
Confidence 46788999999999999998433 44542111 1121 333 99999999999999987651
Q ss_pred cccccccCC----CCccccccccCCCeeeec----CCHhH----HHHHHHHHHHHHHHHHhccccCCCceEEecccc
Q psy12907 267 PYICAERDF----GGLPSWLMTIKPNITLRS----KDEVY----QHYVNKWFAQLFPRITRFLYGNGGPIILVQVEN 331 (949)
Q Consensus 267 PYIcAEw~~----GGlP~WL~~~~p~~~~Rt----~dp~y----~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIEN 331 (949)
....+. +. -.-|.|+....|+..... .+..| ..+|++|+..++..|.... +|=++|++-
T Consensus 92 ~~~~~~-~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~lnP~~PeVr~~i~~~v~Eiv~~Y-----dvDGIhlDd 162 (311)
T PF02638_consen 92 VGFNAP-DVSHILKKHPEWFAVNHPGWVRTYEDANGGYYWLNPGHPEVRDYIIDIVKEIVKNY-----DVDGIHLDD 162 (311)
T ss_pred eecCCC-chhhhhhcCchhheecCCCceeecccCCCCceEECCCCHHHHHHHHHHHHHHHhcC-----CCCeEEecc
Confidence 001100 11 124788764455543332 11112 3678888888888776544 355688873
No 48
>COG3867 Arabinogalactan endo-1,4-beta-galactosidase [Carbohydrate transport and metabolism]
Probab=93.53 E-value=2.4 Score=47.63 Aligned_cols=116 Identities=28% Similarity=0.253 Sum_probs=75.7
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHH---HCCceEEecCCcccccccCCCCcccccc
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAV---REDLLVLLRPGPYICAERDFGGLPSWLM 283 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~---e~GL~VILRpGPYIcAEw~~GGlP~WL~ 283 (949)
.=+|.|+-+|+.|+|.|+.=| ||..--..|.=-=.|..|+.+.+++|+ ..||+|++-+= -+ ++=.=|+ .
T Consensus 64 ~~qD~~~iLK~~GvNyvRlRv-wndP~dsngn~yggGnnD~~k~ieiakRAk~~GmKVl~dFH---YS--DfwaDPa--k 135 (403)
T COG3867 64 VRQDALQILKNHGVNYVRLRV-WNDPYDSNGNGYGGGNNDLKKAIEIAKRAKNLGMKVLLDFH---YS--DFWADPA--K 135 (403)
T ss_pred hHHHHHHHHHHcCcCeEEEEE-ecCCccCCCCccCCCcchHHHHHHHHHHHHhcCcEEEeecc---ch--hhccChh--h
Confidence 347899999999999998865 776655555555557899999998875 46999998742 11 1100011 0
Q ss_pred ccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccC
Q psy12907 284 TIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMG 334 (949)
Q Consensus 284 ~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG 334 (949)
.+.|.....-+-..-.+++-.|.+..+..+++- |=-+=||||-||-.
T Consensus 136 Q~kPkaW~~l~fe~lk~avy~yTk~~l~~m~~e----Gi~pdmVQVGNEtn 182 (403)
T COG3867 136 QKKPKAWENLNFEQLKKAVYSYTKYVLTTMKKE----GILPDMVQVGNETN 182 (403)
T ss_pred cCCcHHhhhcCHHHHHHHHHHHHHHHHHHHHHc----CCCccceEeccccC
Confidence 112222222344555677888888888888754 32344999999974
No 49
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=93.03 E-value=0.45 Score=47.38 Aligned_cols=72 Identities=15% Similarity=0.245 Sum_probs=48.5
Q ss_pred HHHHHHHHcCCCEEEEeec----C-----CccCCCCCe-eeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCcc
Q psy12907 210 DRLRKLRAAGLNAVSTYVE----W-----SFHEPSPGQ-FQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLP 279 (949)
Q Consensus 210 d~L~k~Ka~GlNtV~tYv~----W-----n~HEp~~G~-fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP 279 (949)
+-++.+|++|+|+|.++.- | .+|.+.|+- .|+ |.++++.|++.||.|++|...- --|+-.---|
T Consensus 4 ~~~~~lk~~~v~si~i~a~~h~g~ayYPt~~~~~hp~L~~Dl-----lge~v~a~h~~Girv~ay~~~~-~d~~~~~~HP 77 (132)
T PF14871_consen 4 QFVDTLKEAHVNSITIFAKCHGGYAYYPTKVGPRHPGLKRDL-----LGEQVEACHERGIRVPAYFDFS-WDEDAAERHP 77 (132)
T ss_pred HHHHHHHHhCCCEEEEEcccccEEEEccCCCCcCCCCCCcCH-----HHHHHHHHHHCCCEEEEEEeee-cChHHHHhCC
Confidence 3467889999999987532 2 245555543 222 6899999999999999996543 2233333569
Q ss_pred ccccccCCC
Q psy12907 280 SWLMTIKPN 288 (949)
Q Consensus 280 ~WL~~~~p~ 288 (949)
.|+.+ +++
T Consensus 78 eW~~~-~~~ 85 (132)
T PF14871_consen 78 EWFVR-DAD 85 (132)
T ss_pred ceeeE-CCC
Confidence 99986 444
No 50
>PLN02965 Probable pheophorbidase
Probab=92.68 E-value=0.059 Score=57.43 Aligned_cols=58 Identities=16% Similarity=0.190 Sum_probs=51.8
Q ss_pred CeEECCCCCCchhhhhhhhhhcccccccccc--------------------------------------------cCCcc
Q psy12907 614 PLVFLPPVSGTADIFFHLMLDLSRKGYRTLS--------------------------------------------YLGGF 649 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is--------------------------------------------~lg~~ 649 (949)
.|+|+||..+++.+|-.++-.|+++|||||. |+||.
T Consensus 5 ~vvllHG~~~~~~~w~~~~~~L~~~~~~via~Dl~G~G~S~~~~~~~~~~~~~a~dl~~~l~~l~~~~~~~lvGhSmGG~ 84 (255)
T PLN02965 5 HFVFVHGASHGAWCWYKLATLLDAAGFKSTCVDLTGAGISLTDSNTVSSSDQYNRPLFALLSDLPPDHKVILVGHSIGGG 84 (255)
T ss_pred EEEEECCCCCCcCcHHHHHHHHhhCCceEEEecCCcCCCCCCCccccCCHHHHHHHHHHHHHhcCCCCCEEEEecCcchH
Confidence 4889999999999999998889888999995 79999
Q ss_pred hhhcccccccccCceeeccccccc
Q psy12907 650 LAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 650 l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
+++.++...+ +.|++|+|+|+.
T Consensus 85 ia~~~a~~~p--~~v~~lvl~~~~ 106 (255)
T PLN02965 85 SVTEALCKFT--DKISMAIYVAAA 106 (255)
T ss_pred HHHHHHHhCc--hheeEEEEEccc
Confidence 9999999776 789999999986
No 51
>PRK10349 carboxylesterase BioH; Provisional
Probab=92.51 E-value=0.051 Score=57.56 Aligned_cols=57 Identities=26% Similarity=0.268 Sum_probs=49.2
Q ss_pred CeEECCCCCCchhhhhhhhhhcccccccccc-------------------------------------cCCcchhhcccc
Q psy12907 614 PLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-------------------------------------YLGGFLAQKFAE 656 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-------------------------------------~lg~~l~q~f~~ 656 (949)
+|+||||..+++..|-.++-.|+ +.||||. |+||++|+.|+.
T Consensus 15 ~ivllHG~~~~~~~w~~~~~~L~-~~~~vi~~Dl~G~G~S~~~~~~~~~~~~~~l~~~~~~~~~lvGhS~Gg~ia~~~a~ 93 (256)
T PRK10349 15 HLVLLHGWGLNAEVWRCIDEELS-SHFTLHLVDLPGFGRSRGFGALSLADMAEAVLQQAPDKAIWLGWSLGGLVASQIAL 93 (256)
T ss_pred eEEEECCCCCChhHHHHHHHHHh-cCCEEEEecCCCCCCCCCCCCCCHHHHHHHHHhcCCCCeEEEEECHHHHHHHHHHH
Confidence 59999999999998888887776 4599994 799999999998
Q ss_pred cccccCceeeccccccc
Q psy12907 657 HTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 657 ~~~~~~~v~sl~lcn~~ 673 (949)
..+ ..|.+|+|+|+.
T Consensus 94 ~~p--~~v~~lili~~~ 108 (256)
T PRK10349 94 THP--ERVQALVTVASS 108 (256)
T ss_pred hCh--HhhheEEEecCc
Confidence 777 689999999985
No 52
>PLN02211 methyl indole-3-acetate methyltransferase
Probab=92.33 E-value=0.1 Score=56.97 Aligned_cols=63 Identities=11% Similarity=0.201 Sum_probs=50.9
Q ss_pred CCCeEECCCCCCchhhhhhhhhhcccccccccc--------------------------------------------cCC
Q psy12907 612 RSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS--------------------------------------------YLG 647 (949)
Q Consensus 612 ~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is--------------------------------------------~lg 647 (949)
..+|+|+||..+++.+|..++-.|.++|||||. |+|
T Consensus 18 ~p~vvliHG~~~~~~~w~~~~~~L~~~g~~vi~~dl~g~G~s~~~~~~~~~~~~~~~~l~~~i~~l~~~~~v~lvGhS~G 97 (273)
T PLN02211 18 PPHFVLIHGISGGSWCWYKIRCLMENSGYKVTCIDLKSAGIDQSDADSVTTFDEYNKPLIDFLSSLPENEKVILVGHSAG 97 (273)
T ss_pred CCeEEEECCCCCCcCcHHHHHHHHHhCCCEEEEecccCCCCCCCCcccCCCHHHHHHHHHHHHHhcCCCCCEEEEEECch
Confidence 446999999999999998888778888999994 688
Q ss_pred cchhhcccccccccCceeecccccccccc
Q psy12907 648 GFLAQKFAEHTVLRPRVVSLFLCNSFYDT 676 (949)
Q Consensus 648 ~~l~q~f~~~~~~~~~v~sl~lcn~~~dt 676 (949)
|.++..+++..+ +.|++|+++.++...
T Consensus 98 G~v~~~~a~~~p--~~v~~lv~~~~~~~~ 124 (273)
T PLN02211 98 GLSVTQAIHRFP--KKICLAVYVAATMLK 124 (273)
T ss_pred HHHHHHHHHhCh--hheeEEEEeccccCC
Confidence 888888887654 578888888776543
No 53
>COG1649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=91.74 E-value=0.78 Score=53.74 Aligned_cols=141 Identities=21% Similarity=0.228 Sum_probs=91.3
Q ss_pred ChhhHHHHHHHHHHcCCCEEEE-------------eecCCccCCCCCeee-ccCchhHHHHHHHHHHCCceEEecCCccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVST-------------YVEWSFHEPSPGQFQ-FTGDQDLEYFLELAVREDLLVLLRPGPYI 269 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~t-------------Yv~Wn~HEp~~G~fd-F~g~~DL~~fl~la~e~GL~VILRpGPYI 269 (949)
.+..-.+.|.+++++|+|||-. ..+|..-- ||++- =.|..=|...|+.|++.||.|+.+.=||.
T Consensus 62 ~~~el~~~ld~l~~ln~NTv~~qV~~~G~~lypS~~~p~s~~~--~~~~~~~~g~DpLa~~I~~AHkr~l~v~aWf~~~~ 139 (418)
T COG1649 62 QRQELKDILDDLQKLNFNTVYPQVWNDGDALYPSAVLPWSDGL--PGVLGVDPGYDPLAFVIAEAHKRGLEVHAWFNPYR 139 (418)
T ss_pred cHHHHHHHHHHHHHcCCceeEEEEecCccccccccccccccCc--CcccCCCCCCChHHHHHHHHHhcCCeeeechhhcc
Confidence 4677789999999999999842 23455443 44432 12344588899999999999999988887
Q ss_pred ccccCCCC---ccccccccCCCeee-ecCC-------HhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCccc
Q psy12907 270 CAERDFGG---LPSWLMTIKPNITL-RSKD-------EVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSYTC 338 (949)
Q Consensus 270 cAEw~~GG---lP~WL~~~~p~~~~-Rt~d-------p~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y~~ 338 (949)
.|--..-. -|.|+.++.|+... |... .+..-+|+.|+..++-.+...+ .|=++|.+--++ |..
T Consensus 140 ~a~~~s~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ldPg~Pevq~~i~~lv~evV~~Y-----dvDGIQfDd~fy-~~~ 213 (418)
T COG1649 140 MAPPTSPLTKRHPHWLTTKRPGWVYVRHQGWGKRVWLDPGIPEVQDFITSLVVEVVRNY-----DVDGIQFDDYFY-YPI 213 (418)
T ss_pred cCCCCChhHhhCCCCcccCCCCeEEEecCCceeeeEeCCCChHHHHHHHHHHHHHHhCC-----CCCceecceeec-ccC
Confidence 65422211 37777765455333 3321 1345678888888777776655 455689987766 544
Q ss_pred hHHHHHHHHHHHHH
Q psy12907 339 DKEHMIWLRDQMKY 352 (949)
Q Consensus 339 d~~Y~~~L~~~~~~ 352 (949)
+-.|...-...++.
T Consensus 214 ~~gy~~~~~~~y~~ 227 (418)
T COG1649 214 PFGYDPDTVTLYRY 227 (418)
T ss_pred ccccCchHHHHHHh
Confidence 44455555555554
No 54
>PRK10749 lysophospholipase L2; Provisional
Probab=91.66 E-value=0.088 Score=58.87 Aligned_cols=44 Identities=20% Similarity=0.203 Sum_probs=32.5
Q ss_pred cccccCCCCccCCCeEECCCCCCchhhhhhhhhhcccccccccc
Q psy12907 601 WKVYDCGPKHIRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS 644 (949)
Q Consensus 601 I~~~~~g~~~~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is 644 (949)
|.|+..+|....+.|+++||..+++..|...+..|+++||||+.
T Consensus 43 l~~~~~~~~~~~~~vll~HG~~~~~~~y~~~~~~l~~~g~~v~~ 86 (330)
T PRK10749 43 IRFVRFRAPHHDRVVVICPGRIESYVKYAELAYDLFHLGYDVLI 86 (330)
T ss_pred EEEEEccCCCCCcEEEEECCccchHHHHHHHHHHHHHCCCeEEE
Confidence 44444443222446899999999988888888788899999994
No 55
>TIGR02240 PHA_depoly_arom poly(3-hydroxyalkanoate) depolymerase. This family consists of the polyhydroxyalkanoic acid (PHA) depolymerase of Pseudomonas oleovorans, Pseudomonas putida BM01, and related species. This enzyme is part of polyester storage and mobilization system as in many bacteria. However, species containing this enzyme are unusual in their capacity to produce aromatic polyesters when grown on carbon sources such as benzoic acid or phenylacetic acid.
Probab=91.40 E-value=0.07 Score=57.44 Aligned_cols=60 Identities=23% Similarity=0.413 Sum_probs=51.3
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc------------------------------------------cCCcch
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------------YLGGFL 650 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------------~lg~~l 650 (949)
.+|+|+||..++++.|..++-.|+ +.||||. |+||.+
T Consensus 26 ~plvllHG~~~~~~~w~~~~~~L~-~~~~vi~~Dl~G~G~S~~~~~~~~~~~~~~~~~~~i~~l~~~~~~LvG~S~GG~v 104 (276)
T TIGR02240 26 TPLLIFNGIGANLELVFPFIEALD-PDLEVIAFDVPGVGGSSTPRHPYRFPGLAKLAARMLDYLDYGQVNAIGVSWGGAL 104 (276)
T ss_pred CcEEEEeCCCcchHHHHHHHHHhc-cCceEEEECCCCCCCCCCCCCcCcHHHHHHHHHHHHHHhCcCceEEEEECHHHHH
Confidence 489999999999998887777665 5799994 799999
Q ss_pred hhcccccccccCceeeccccccccc
Q psy12907 651 AQKFAEHTVLRPRVVSLFLCNSFYD 675 (949)
Q Consensus 651 ~q~f~~~~~~~~~v~sl~lcn~~~d 675 (949)
|..++...+ +.|++|+|+|+...
T Consensus 105 a~~~a~~~p--~~v~~lvl~~~~~~ 127 (276)
T TIGR02240 105 AQQFAHDYP--ERCKKLILAATAAG 127 (276)
T ss_pred HHHHHHHCH--HHhhheEEeccCCc
Confidence 999999887 57999999998754
No 56
>COG2723 BglB Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase [Carbohydrate transport and metabolism]
Probab=91.36 E-value=0.25 Score=58.26 Aligned_cols=96 Identities=19% Similarity=0.282 Sum_probs=69.7
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCCCCe--eeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQ--FQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~--fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
.++++++.||+||+|+.++-|.|...-|..+. .+=.|-.=-+++++-|.++|+..++--- =| -+|.||.+
T Consensus 60 rYkeDi~L~~emG~~~~R~SI~WsRIfP~g~~~e~N~~gl~fY~~l~del~~~gIep~vTL~-----Hf---d~P~~L~~ 131 (460)
T COG2723 60 RYKEDIALAKEMGLNAFRTSIEWSRIFPNGDGGEVNEKGLRFYDRLFDELKARGIEPFVTLY-----HF---DLPLWLQK 131 (460)
T ss_pred hhHHHHHHHHHcCCCEEEeeeeEEEeecCCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEEec-----cc---CCcHHHhh
Confidence 47899999999999999999999999997654 7777777788999999999999876521 14 36999987
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHH
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRIT 314 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~ 314 (949)
++.+-.=|..-..| .+|-+.++.++.
T Consensus 132 ~ygGW~nR~~i~~F----~~ya~~vf~~f~ 157 (460)
T COG2723 132 PYGGWENRETVDAF----ARYAATVFERFG 157 (460)
T ss_pred ccCCccCHHHHHHH----HHHHHHHHHHhc
Confidence 43333223333344 444455555554
No 57
>PRK10673 acyl-CoA esterase; Provisional
Probab=90.77 E-value=0.13 Score=54.03 Aligned_cols=58 Identities=21% Similarity=0.254 Sum_probs=49.1
Q ss_pred CCCeEECCCCCCchhhhhhhhhhcccccccccc-----------------------------------------cCCcch
Q psy12907 612 RSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-----------------------------------------YLGGFL 650 (949)
Q Consensus 612 ~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-----------------------------------------~lg~~l 650 (949)
..+|+|+||..+++..|...+-.|+ ++||||. |+||.+
T Consensus 16 ~~~iv~lhG~~~~~~~~~~~~~~l~-~~~~vi~~D~~G~G~s~~~~~~~~~~~~~d~~~~l~~l~~~~~~lvGhS~Gg~v 94 (255)
T PRK10673 16 NSPIVLVHGLFGSLDNLGVLARDLV-NDHDIIQVDMRNHGLSPRDPVMNYPAMAQDLLDTLDALQIEKATFIGHSMGGKA 94 (255)
T ss_pred CCCEEEECCCCCchhHHHHHHHHHh-hCCeEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHcCCCceEEEEECHHHHH
Confidence 4589999999999999888877764 6899994 799999
Q ss_pred hhcccccccccCceeecccccc
Q psy12907 651 AQKFAEHTVLRPRVVSLFLCNS 672 (949)
Q Consensus 651 ~q~f~~~~~~~~~v~sl~lcn~ 672 (949)
++.++...+ ..|++++|+++
T Consensus 95 a~~~a~~~~--~~v~~lvli~~ 114 (255)
T PRK10673 95 VMALTALAP--DRIDKLVAIDI 114 (255)
T ss_pred HHHHHHhCH--hhcceEEEEec
Confidence 999998766 57999998874
No 58
>PLN02824 hydrolase, alpha/beta fold family protein
Probab=90.59 E-value=0.12 Score=56.23 Aligned_cols=59 Identities=22% Similarity=0.243 Sum_probs=51.0
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc------------------------------------------------
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------------------ 644 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------------------ 644 (949)
.+|+||||..+.++.|-.++-.|+++ ||||.
T Consensus 30 ~~vlllHG~~~~~~~w~~~~~~L~~~-~~vi~~DlpG~G~S~~~~~~~~~~~~~~~~~~~a~~l~~~l~~l~~~~~~lvG 108 (294)
T PLN02824 30 PALVLVHGFGGNADHWRKNTPVLAKS-HRVYAIDLLGYGYSDKPNPRSAPPNSFYTFETWGEQLNDFCSDVVGDPAFVIC 108 (294)
T ss_pred CeEEEECCCCCChhHHHHHHHHHHhC-CeEEEEcCCCCCCCCCCccccccccccCCHHHHHHHHHHHHHHhcCCCeEEEE
Confidence 37999999999999999999888876 58883
Q ss_pred -cCCcchhhcccccccccCceeecccccccc
Q psy12907 645 -YLGGFLAQKFAEHTVLRPRVVSLFLCNSFY 674 (949)
Q Consensus 645 -~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~ 674 (949)
|+||.++..|+...+ +.|++|+|+|+-.
T Consensus 109 hS~Gg~va~~~a~~~p--~~v~~lili~~~~ 137 (294)
T PLN02824 109 NSVGGVVGLQAAVDAP--ELVRGVMLINISL 137 (294)
T ss_pred eCHHHHHHHHHHHhCh--hheeEEEEECCCc
Confidence 699999999999887 6899999999753
No 59
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=89.90 E-value=4.5 Score=50.38 Aligned_cols=99 Identities=12% Similarity=0.092 Sum_probs=62.6
Q ss_pred hhHHHHHHHHHHcCCCEE---------------EEeecCCccCCCCCeeeccCchhHHH-HHHHHHHCCceEEecCCccc
Q psy12907 206 YYWQDRLRKLRAAGLNAV---------------STYVEWSFHEPSPGQFQFTGDQDLEY-FLELAVREDLLVLLRPGPYI 269 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV---------------~tYv~Wn~HEp~~G~fdF~g~~DL~~-fl~la~e~GL~VILRpGPYI 269 (949)
+.-...|+.+|++|+||| +.|+|| -|= ||+-|. +.+ ...++.+.|+.|-.|-.||-
T Consensus 334 ~~L~~lLdrlk~~G~ntV~lqafadp~gd~~~~s~yfP~-~~l--p~r~d~-----f~~~aw~l~~r~~v~v~AWmp~~~ 405 (671)
T PRK14582 334 RNIDVLIQRVKDMQISTVYLQAFADPDGDGLVKELYFPN-RLL--PMRADL-----FNRVAWQLRTRAGVNVYAWMPVLS 405 (671)
T ss_pred HHHHHHHHHHHHcCCCEEEEEeccCCCCCccccccccCc-ccc--ccccCC-----cCHHHHHHHHhhCCEEEEecccee
Confidence 457788999999999999 567888 443 334332 112 34559999999999999985
Q ss_pred cc---------ccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhcc
Q psy12907 270 CA---------ERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFL 317 (949)
Q Consensus 270 cA---------Ew~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~ 317 (949)
-. +++..+-|+.. +|+-..| =.+|..++++|++.|..-|+.+.
T Consensus 406 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~r--l~P~~pe~r~~i~~i~~dla~~~ 457 (671)
T PRK14582 406 FDLDPTLPRVKRLDTGEGKAQI---HPEQYRR--LSPFDDRVRAQVGMLYEDLAGHA 457 (671)
T ss_pred eccCCCcchhhhccccCCcccc---CCCCCcC--CCCCCHHHHHHHHHHHHHHHHhC
Confidence 31 12111111111 1111112 12467889999999999999864
No 60
>PF12697 Abhydrolase_6: Alpha/beta hydrolase family; PDB: 3LLC_A 3A2N_E 3A2M_A 3A2L_A 3AFI_F 3C5V_A 3C5W_P 3E0X_A 2ZJF_A 3QYJ_A ....
Probab=89.83 E-value=0.13 Score=50.97 Aligned_cols=59 Identities=29% Similarity=0.452 Sum_probs=49.9
Q ss_pred eEECCCCCCchhhhhhhhhhcccccccccc--------------------------------------------cCCcch
Q psy12907 615 LVFLPPVSGTADIFFHLMLDLSRKGYRTLS--------------------------------------------YLGGFL 650 (949)
Q Consensus 615 v~~l~~~~g~a~~~~~~~~~l~~kgyR~is--------------------------------------------~lg~~l 650 (949)
|+|+||..+++..|..++-.| ++||||++ |+||.+
T Consensus 1 vv~~hG~~~~~~~~~~~~~~l-~~~~~v~~~d~~G~G~s~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~~lvG~S~Gg~~ 79 (228)
T PF12697_consen 1 VVFLHGFGGSSESWDPLAEAL-ARGYRVIAFDLPGHGRSDPPPDYSPYSIEDYAEDLAELLDALGIKKVILVGHSMGGMI 79 (228)
T ss_dssp EEEE-STTTTGGGGHHHHHHH-HTTSEEEEEECTTSTTSSSHSSGSGGSHHHHHHHHHHHHHHTTTSSEEEEEETHHHHH
T ss_pred eEEECCCCCCHHHHHHHHHHH-hCCCEEEEEecCCccccccccccCCcchhhhhhhhhhccccccccccccccccccccc
Confidence 679999999999998888888 48999995 699999
Q ss_pred hhcccccccccCceeecccccccccc
Q psy12907 651 AQKFAEHTVLRPRVVSLFLCNSFYDT 676 (949)
Q Consensus 651 ~q~f~~~~~~~~~v~sl~lcn~~~dt 676 (949)
++.++...+ +.|.+++++++....
T Consensus 80 a~~~a~~~p--~~v~~~vl~~~~~~~ 103 (228)
T PF12697_consen 80 ALRLAARYP--DRVKGLVLLSPPPPL 103 (228)
T ss_dssp HHHHHHHSG--GGEEEEEEESESSSH
T ss_pred ccccccccc--cccccceeecccccc
Confidence 999998866 489999999988654
No 61
>PRK03592 haloalkane dehalogenase; Provisional
Probab=89.68 E-value=0.15 Score=55.44 Aligned_cols=59 Identities=22% Similarity=0.355 Sum_probs=51.9
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc------------------------------------------cCCcch
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------------YLGGFL 650 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------------~lg~~l 650 (949)
.+|+|+||.-+++.++-.++-.|++++ |||+ |+||.+
T Consensus 28 ~~vvllHG~~~~~~~w~~~~~~L~~~~-~via~D~~G~G~S~~~~~~~~~~~~a~dl~~ll~~l~~~~~~lvGhS~Gg~i 106 (295)
T PRK03592 28 DPIVFLHGNPTSSYLWRNIIPHLAGLG-RCLAPDLIGMGASDKPDIDYTFADHARYLDAWFDALGLDDVVLVGHDWGSAL 106 (295)
T ss_pred CEEEEECCCCCCHHHHHHHHHHHhhCC-EEEEEcCCCCCCCCCCCCCCCHHHHHHHHHHHHHHhCCCCeEEEEECHHHHH
Confidence 489999999999999988888888775 9984 799999
Q ss_pred hhcccccccccCceeecccccccc
Q psy12907 651 AQKFAEHTVLRPRVVSLFLCNSFY 674 (949)
Q Consensus 651 ~q~f~~~~~~~~~v~sl~lcn~~~ 674 (949)
|++|+..++ +.|++|+|+|++.
T Consensus 107 a~~~a~~~p--~~v~~lil~~~~~ 128 (295)
T PRK03592 107 GFDWAARHP--DRVRGIAFMEAIV 128 (295)
T ss_pred HHHHHHhCh--hheeEEEEECCCC
Confidence 999999988 7899999999853
No 62
>PHA02857 monoglyceride lipase; Provisional
Probab=89.38 E-value=0.2 Score=53.70 Aligned_cols=60 Identities=12% Similarity=0.133 Sum_probs=46.8
Q ss_pred eEECCCCCCchhhhhhhhhhcccccccccc-----------------------------------------------cCC
Q psy12907 615 LVFLPPVSGTADIFFHLMLDLSRKGYRTLS-----------------------------------------------YLG 647 (949)
Q Consensus 615 v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-----------------------------------------------~lg 647 (949)
|+++||..+++..|-.++-.|+++|||||+ |+|
T Consensus 28 v~llHG~~~~~~~~~~~~~~l~~~g~~via~D~~G~G~S~~~~~~~~~~~~~~~d~~~~l~~~~~~~~~~~~~lvG~S~G 107 (276)
T PHA02857 28 VFISHGAGEHSGRYEELAENISSLGILVFSHDHIGHGRSNGEKMMIDDFGVYVRDVVQHVVTIKSTYPGVPVFLLGHSMG 107 (276)
T ss_pred EEEeCCCccccchHHHHHHHHHhCCCEEEEccCCCCCCCCCccCCcCCHHHHHHHHHHHHHHHHhhCCCCCEEEEEcCch
Confidence 566688888888888877778888888884 688
Q ss_pred cchhhcccccccccCceeecccccccccc
Q psy12907 648 GFLAQKFAEHTVLRPRVVSLFLCNSFYDT 676 (949)
Q Consensus 648 ~~l~q~f~~~~~~~~~v~sl~lcn~~~dt 676 (949)
|.+|+.++...+ +.|++++|++++.+.
T Consensus 108 G~ia~~~a~~~p--~~i~~lil~~p~~~~ 134 (276)
T PHA02857 108 ATISILAAYKNP--NLFTAMILMSPLVNA 134 (276)
T ss_pred HHHHHHHHHhCc--cccceEEEecccccc
Confidence 888888887665 457888888887653
No 63
>smart00642 Aamy Alpha-amylase domain.
Probab=89.01 E-value=0.99 Score=46.42 Aligned_cols=65 Identities=14% Similarity=0.123 Sum_probs=44.4
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccC-------CCCCee-----eccCchhHHHHHHHHHHCCceEEecCCcccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHE-------PSPGQF-----QFTGDQDLEYFLELAVREDLLVLLRPGPYIC 270 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HE-------p~~G~f-----dF~g~~DL~~fl~la~e~GL~VILRpGPYIc 270 (949)
+-+.++|.-+|++|+|+|.+-=++...+ -.+..| .|....|+.++++.|+++||.||+-.=|-=+
T Consensus 19 ~gi~~~l~yl~~lG~~~I~l~Pi~~~~~~~~~~~gY~~~d~~~i~~~~Gt~~d~~~lv~~~h~~Gi~vilD~V~NH~ 95 (166)
T smart00642 19 QGIIEKLDYLKDLGVTAIWLSPIFESPQGYPSYHGYDISDYKQIDPRFGTMEDFKELVDAAHARGIKVILDVVINHT 95 (166)
T ss_pred HHHHHHHHHHHHCCCCEEEECcceeCCCCCCCCCCcCccccCCCCcccCCHHHHHHHHHHHHHCCCEEEEEECCCCC
Confidence 3456677779999999998743332221 122222 4556689999999999999999987544333
No 64
>TIGR03611 RutD pyrimidine utilization protein D. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the hydrolase, alpha/beta fold family defined by pfam00067.
Probab=88.95 E-value=0.17 Score=52.12 Aligned_cols=71 Identities=20% Similarity=0.300 Sum_probs=51.9
Q ss_pred cccCCCCc-cCCCeEECCCCCCchhhhhhhhhhcccccccccc-------------------------------------
Q psy12907 603 VYDCGPKH-IRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------- 644 (949)
Q Consensus 603 ~~~~g~~~-~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------- 644 (949)
|+..|+.+ ...+|+|+||..++++.|..++-.| ++|||||.
T Consensus 3 ~~~~~~~~~~~~~iv~lhG~~~~~~~~~~~~~~l-~~~~~vi~~D~~G~G~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 81 (257)
T TIGR03611 3 YELHGPPDADAPVVVLSSGLGGSGSYWAPQLDVL-TQRFHVVTYDHRGTGRSPGELPPGYSIAHMADDVLQLLDALNIER 81 (257)
T ss_pred EEEecCCCCCCCEEEEEcCCCcchhHHHHHHHHH-HhccEEEEEcCCCCCCCCCCCcccCCHHHHHHHHHHHHHHhCCCc
Confidence 34445433 2446899999999998887776545 46899994
Q ss_pred ------cCCcchhhcccccccccCceeecccccccccc
Q psy12907 645 ------YLGGFLAQKFAEHTVLRPRVVSLFLCNSFYDT 676 (949)
Q Consensus 645 ------~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~dt 676 (949)
|+||.+|.+++...+ +.|+++++|+++.+.
T Consensus 82 ~~l~G~S~Gg~~a~~~a~~~~--~~v~~~i~~~~~~~~ 117 (257)
T TIGR03611 82 FHFVGHALGGLIGLQLALRYP--ERLLSLVLINAWSRP 117 (257)
T ss_pred EEEEEechhHHHHHHHHHHCh--HHhHHheeecCCCCC
Confidence 689999999887665 468888888877543
No 65
>PF02837 Glyco_hydro_2_N: Glycosyl hydrolases family 2, sugar binding domain; InterPro: IPR006104 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. This domain has a jelly-roll fold [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3DEC_A 3OB8_A 3OBA_A 3CMG_A 3FN9_C 2VZU_A 2X09_A 2VZO_A 2X05_A 2VZV_B ....
Probab=88.64 E-value=1 Score=45.27 Aligned_cols=69 Identities=19% Similarity=0.247 Sum_probs=50.3
Q ss_pred CCCceEEEEEEEccCCCCCCCCeEEecCC-CcceEEEEcCcccccccCCCCCeeeEEeccCCCCCCCCC-ceEEEE
Q psy12907 846 TKLPVFYTATFTLNAEHPKPLDGYVDMSN-WAKGVVFINEHNLGKYWTTLGPQLTLYLPAPFIKPYPEV-NRITVL 919 (949)
Q Consensus 846 ~~~p~~~~~~~~~~~~~~~~~dt~~~~~~-w~kG~~~vng~nlGryw~~~gpq~tly~P~~~l~~~~g~-n~i~~~ 919 (949)
..+...|+-+|+++... +-...+|.+.+ -....|||||+-+|+-...- ....+=|+. .| ++|. |+|.|.
T Consensus 66 ~~~~~wYr~~f~lp~~~-~~~~~~L~f~gv~~~a~v~vNG~~vg~~~~~~-~~~~~dIt~-~l--~~g~~N~l~V~ 136 (167)
T PF02837_consen 66 YSGYAWYRRTFTLPADW-KGKRVFLRFEGVDYAAEVYVNGKLVGSHEGGY-TPFEFDITD-YL--KPGEENTLAVR 136 (167)
T ss_dssp CCSEEEEEEEEEESGGG-TTSEEEEEESEEESEEEEEETTEEEEEEESTT-S-EEEECGG-GS--SSEEEEEEEEE
T ss_pred cCceEEEEEEEEeCchh-cCceEEEEeccceEeeEEEeCCeEEeeeCCCc-CCeEEeChh-hc--cCCCCEEEEEE
Confidence 45689999999998532 34578999876 56999999999999976422 233444554 78 8888 887763
No 66
>PRK09936 hypothetical protein; Provisional
Probab=88.45 E-value=1.1 Score=50.07 Aligned_cols=58 Identities=24% Similarity=0.391 Sum_probs=46.5
Q ss_pred CCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCc-hhHHHHHHHHHHCCceEEec
Q psy12907 201 FRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGD-QDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 201 ~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~-~DL~~fl~la~e~GL~VILR 264 (949)
.+++++.|+.+++.+|+.||+|+- |=|.-. |.=||.+. ..|.+.++.|++.||.|+|.
T Consensus 33 ~~~~~~qWq~~~~~~~~~G~~tLi--vQWt~y----G~~~fg~~~g~La~~l~~A~~~Gl~v~vG 91 (296)
T PRK09936 33 SQVTDTQWQGLWSQLRLQGFDTLV--VQWTRY----GDADFGGQRGWLAKRLAAAQQAGLKLVVG 91 (296)
T ss_pred CCCCHHHHHHHHHHHHHcCCcEEE--EEeeec----cCCCcccchHHHHHHHHHHHHcCCEEEEc
Confidence 478899999999999999999863 345433 11188764 47999999999999999875
No 67
>PLN02679 hydrolase, alpha/beta fold family protein
Probab=88.42 E-value=0.2 Score=56.84 Aligned_cols=58 Identities=17% Similarity=0.197 Sum_probs=48.2
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc-------------------------------------------cCCcc
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-------------------------------------------YLGGF 649 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-------------------------------------------~lg~~ 649 (949)
.+|+||||..+.+..|..++-.|++ +||||. |+||.
T Consensus 89 p~lvllHG~~~~~~~w~~~~~~L~~-~~~via~Dl~G~G~S~~~~~~~~~~~~~a~~l~~~l~~l~~~~~~lvGhS~Gg~ 167 (360)
T PLN02679 89 PPVLLVHGFGASIPHWRRNIGVLAK-NYTVYAIDLLGFGASDKPPGFSYTMETWAELILDFLEEVVQKPTVLIGNSVGSL 167 (360)
T ss_pred CeEEEECCCCCCHHHHHHHHHHHhc-CCEEEEECCCCCCCCCCCCCccccHHHHHHHHHHHHHHhcCCCeEEEEECHHHH
Confidence 4799999999999999888877764 899995 69999
Q ss_pred hhhcccc-cccccCceeeccccccc
Q psy12907 650 LAQKFAE-HTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 650 l~q~f~~-~~~~~~~v~sl~lcn~~ 673 (949)
+++.++. ..+ +.|++|+|+|+-
T Consensus 168 ia~~~a~~~~P--~rV~~LVLi~~~ 190 (360)
T PLN02679 168 ACVIAASESTR--DLVRGLVLLNCA 190 (360)
T ss_pred HHHHHHHhcCh--hhcCEEEEECCc
Confidence 9988875 344 589999999874
No 68
>TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself.
Probab=88.13 E-value=0.25 Score=52.72 Aligned_cols=58 Identities=22% Similarity=0.323 Sum_probs=44.2
Q ss_pred CCeEECCCCCCchh---hhhhhhhhcccccccccc-------------------------------------------cC
Q psy12907 613 SPLVFLPPVSGTAD---IFFHLMLDLSRKGYRTLS-------------------------------------------YL 646 (949)
Q Consensus 613 ~~v~~l~~~~g~a~---~~~~~~~~l~~kgyR~is-------------------------------------------~l 646 (949)
.+|+||||..+.++ -+++++..|+++|||||. |+
T Consensus 31 ~~ivllHG~~~~~~~~~~~~~~~~~l~~~~~~vi~~D~~G~G~S~~~~~~~~~~~~~~~~l~~~l~~l~~~~~~lvG~S~ 110 (282)
T TIGR03343 31 EAVIMLHGGGPGAGGWSNYYRNIGPFVDAGYRVILKDSPGFNKSDAVVMDEQRGLVNARAVKGLMDALDIEKAHLVGNSM 110 (282)
T ss_pred CeEEEECCCCCchhhHHHHHHHHHHHHhCCCEEEEECCCCCCCCCCCcCcccccchhHHHHHHHHHHcCCCCeeEEEECc
Confidence 36999999866553 456778888888999995 68
Q ss_pred CcchhhcccccccccCceeecccccc
Q psy12907 647 GGFLAQKFAEHTVLRPRVVSLFLCNS 672 (949)
Q Consensus 647 g~~l~q~f~~~~~~~~~v~sl~lcn~ 672 (949)
||++++.++...+ +.|+++++.++
T Consensus 111 Gg~ia~~~a~~~p--~~v~~lvl~~~ 134 (282)
T TIGR03343 111 GGATALNFALEYP--DRIGKLILMGP 134 (282)
T ss_pred hHHHHHHHHHhCh--HhhceEEEECC
Confidence 8888888887665 56777777765
No 69
>PRK11126 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase; Provisional
Probab=88.01 E-value=0.25 Score=51.45 Aligned_cols=57 Identities=25% Similarity=0.259 Sum_probs=48.0
Q ss_pred CeEECCCCCCchhhhhhhhhhcccccccccc-----------------------------------------cCCcchhh
Q psy12907 614 PLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-----------------------------------------YLGGFLAQ 652 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-----------------------------------------~lg~~l~q 652 (949)
+|+|+||..+++..|-.++-.| ++||||+ |+||.+|+
T Consensus 4 ~vvllHG~~~~~~~w~~~~~~l--~~~~vi~~D~~G~G~S~~~~~~~~~~~~~~l~~~l~~~~~~~~~lvG~S~Gg~va~ 81 (242)
T PRK11126 4 WLVFLHGLLGSGQDWQPVGEAL--PDYPRLYIDLPGHGGSAAISVDGFADVSRLLSQTLQSYNILPYWLVGYSLGGRIAM 81 (242)
T ss_pred EEEEECCCCCChHHHHHHHHHc--CCCCEEEecCCCCCCCCCccccCHHHHHHHHHHHHHHcCCCCeEEEEECHHHHHHH
Confidence 5999999999999999888877 3799995 79999999
Q ss_pred cccccccccCceeeccccccc
Q psy12907 653 KFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 653 ~f~~~~~~~~~v~sl~lcn~~ 673 (949)
.++...+- ..|++|+|+++.
T Consensus 82 ~~a~~~~~-~~v~~lvl~~~~ 101 (242)
T PRK11126 82 YYACQGLA-GGLCGLIVEGGN 101 (242)
T ss_pred HHHHhCCc-ccccEEEEeCCC
Confidence 99887642 358999998764
No 70
>TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (PFAM model pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity.
Probab=88.00 E-value=0.33 Score=51.12 Aligned_cols=58 Identities=17% Similarity=0.201 Sum_probs=48.5
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc-------------------------------------------cCCcc
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-------------------------------------------YLGGF 649 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-------------------------------------------~lg~~ 649 (949)
.+|+|+||..++..+|..++-.|+ ++||||. |+||.
T Consensus 29 ~~vv~~hG~~~~~~~~~~~~~~l~-~~~~vi~~D~~G~G~S~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~lvG~S~Gg~ 107 (278)
T TIGR03056 29 PLLLLLHGTGASTHSWRDLMPPLA-RSFRVVAPDLPGHGFTRAPFRFRFTLPSMAEDLSALCAAEGLSPDGVIGHSAGAA 107 (278)
T ss_pred CeEEEEcCCCCCHHHHHHHHHHHh-hCcEEEeecCCCCCCCCCccccCCCHHHHHHHHHHHHHHcCCCCceEEEECccHH
Confidence 369999999999999988877775 4699994 69999
Q ss_pred hhhcccccccccCceeeccccccc
Q psy12907 650 LAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 650 l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
++..+++..+ ++|++++++++.
T Consensus 108 ~a~~~a~~~p--~~v~~~v~~~~~ 129 (278)
T TIGR03056 108 IALRLALDGP--VTPRMVVGINAA 129 (278)
T ss_pred HHHHHHHhCC--cccceEEEEcCc
Confidence 9999998876 578888888775
No 71
>TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative. This family is conserved at better than 40 % identity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.
Probab=87.99 E-value=10 Score=41.50 Aligned_cols=129 Identities=11% Similarity=0.106 Sum_probs=76.3
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE-ecCCcccccccCCCCcccccc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL-LRPGPYICAERDFGGLPSWLM 283 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI-LRpGPYIcAEw~~GGlP~WL~ 283 (949)
...|++.|+.++++|++.|++-+ +..| ..+...+++ ..++.++.+.+++.||.|. +.+++ .+.+|
T Consensus 15 ~~~~~e~l~~~~~~G~~~VEl~~-~~~~-~~~~~~~~~-~~~~~~~~~~l~~~gl~i~~~~~~~-------~~~~~---- 80 (279)
T TIGR00542 15 GECWLERLQLAKTCGFDFVEMSV-DETD-DRLSRLDWS-REQRLALVNAIIETGVRIPSMCLSA-------HRRFP---- 80 (279)
T ss_pred CCCHHHHHHHHHHcCCCEEEEec-CCcc-chhhccCCC-HHHHHHHHHHHHHcCCCceeeecCC-------CccCc----
Confidence 35699999999999999999953 2222 223444554 3578899999999999984 44331 01111
Q ss_pred ccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEecccccc--CCc-----cchHHHHHHHHHHHHHhcCC
Q psy12907 284 TIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEM--GSY-----TCDKEHMIWLRDQMKYYVRD 356 (949)
Q Consensus 284 ~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEy--G~y-----~~d~~Y~~~L~~~~~~~lG~ 356 (949)
+-+.++.-+++..++++..++..+.+ |.++|.+-- .++ +.. ..-.+.++.+.+.++++ |+
T Consensus 81 -------l~~~~~~~r~~~~~~~~~~i~~a~~l----G~~~v~~~~-~~~~~~~~~~~~~~~~~~~l~~l~~~A~~~-Gv 147 (279)
T TIGR00542 81 -------LGSKDKAVRQQGLEIMEKAIQLARDL----GIRTIQLAG-YDVYYEEHDEETRRRFREGLKEAVELAARA-QV 147 (279)
T ss_pred -------CCCcCHHHHHHHHHHHHHHHHHHHHh----CCCEEEecC-cccccCcCCHHHHHHHHHHHHHHHHHHHHc-CC
Confidence 11235555666666777777777665 445654421 111 110 01125556666666665 77
Q ss_pred eeEE
Q psy12907 357 AAVL 360 (949)
Q Consensus 357 ~vpl 360 (949)
.+-|
T Consensus 148 ~l~l 151 (279)
T TIGR00542 148 TLAV 151 (279)
T ss_pred EEEE
Confidence 6444
No 72
>PLN02385 hydrolase; alpha/beta fold family protein
Probab=87.88 E-value=0.28 Score=55.20 Aligned_cols=29 Identities=17% Similarity=0.138 Sum_probs=22.1
Q ss_pred cCCcchhhcccccccccCceeeccccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFYD 675 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~d 675 (949)
|+||.+++.++...+ +.|.+++|+++.+.
T Consensus 170 SmGG~val~~a~~~p--~~v~glVLi~p~~~ 198 (349)
T PLN02385 170 SMGGAVALKVHLKQP--NAWDGAILVAPMCK 198 (349)
T ss_pred ccchHHHHHHHHhCc--chhhheeEeccccc
Confidence 688888888887766 46788888887654
No 73
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=87.46 E-value=0.19 Score=50.81 Aligned_cols=58 Identities=28% Similarity=0.276 Sum_probs=48.4
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc-------------------------------------cCCcchhhccc
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-------------------------------------YLGGFLAQKFA 655 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-------------------------------------~lg~~l~q~f~ 655 (949)
.+|+|+||..++++.|..++-.|+ ++||||. |+||+++..++
T Consensus 5 ~~iv~~HG~~~~~~~~~~~~~~l~-~~~~vi~~d~~G~G~s~~~~~~~~~~~~~~~~~~~~~~~~lvG~S~Gg~~a~~~a 83 (245)
T TIGR01738 5 VHLVLIHGWGMNAEVFRCLDEELS-AHFTLHLVDLPGHGRSRGFGPLSLADAAEAIAAQAPDPAIWLGWSLGGLVALHIA 83 (245)
T ss_pred ceEEEEcCCCCchhhHHHHHHhhc-cCeEEEEecCCcCccCCCCCCcCHHHHHHHHHHhCCCCeEEEEEcHHHHHHHHHH
Confidence 369999999999999887777765 5799994 69999999999
Q ss_pred ccccccCceeeccccccc
Q psy12907 656 EHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 656 ~~~~~~~~v~sl~lcn~~ 673 (949)
...+ +.|.+++|+++.
T Consensus 84 ~~~p--~~v~~~il~~~~ 99 (245)
T TIGR01738 84 ATHP--DRVRALVTVASS 99 (245)
T ss_pred HHCH--HhhheeeEecCC
Confidence 8776 568899998875
No 74
>PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=87.16 E-value=38 Score=36.83 Aligned_cols=169 Identities=13% Similarity=0.146 Sum_probs=94.2
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE-ecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL-LRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI-LRpGPYIcAEw~~GGlP~WL~~ 284 (949)
-.|++.++.++++|+..|++.+. ..|+ .....+|+ ..++.++-++++++||.|. +.++ +.-.+
T Consensus 16 ~~~~e~~~~~~~~G~~~iEl~~~-~~~~-~~~~~~~~-~~~~~~l~~~l~~~Gl~i~~~~~~----------~~~~~--- 79 (284)
T PRK13210 16 LSWEERLVFAKELGFDFVEMSVD-ESDE-RLARLDWS-KEERLSLVKAIYETGVRIPSMCLS----------GHRRF--- 79 (284)
T ss_pred CCHHHHHHHHHHcCCCeEEEecC-Cccc-ccccccCC-HHHHHHHHHHHHHcCCCceEEecc----------cccCc---
Confidence 47999999999999999999643 2221 11122333 3478999999999999984 3322 11000
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCC------ccchHHHHHHHHHHHHHhcCCee
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGS------YTCDKEHMIWLRDQMKYYVRDAA 358 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~------y~~d~~Y~~~L~~~~~~~lG~~v 358 (949)
.+.+.|+.-+++..+.+++++...+.+- .+.|-+.--..+.. +..-.+.++.+.+.++++ |+.+
T Consensus 80 -----~~~~~d~~~r~~~~~~~~~~i~~a~~lG----~~~v~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~a~~~-gv~l 149 (284)
T PRK13210 80 -----PFGSRDPATRERALEIMKKAIRLAQDLG----IRTIQLAGYDVYYEEKSEETRQRFIEGLAWAVEQAAAA-QVML 149 (284)
T ss_pred -----CCCCCCHHHHHHHHHHHHHHHHHHHHhC----CCEEEECCcccccccccHHHHHHHHHHHHHHHHHHHHh-CCEE
Confidence 1223466666666667777777776653 34544321000000 011235677777777766 7764
Q ss_pred EEEeeCCCCccccccccCCeeeeecCCCCCChHHHHHHhhhcCCCCCeeeeecCCCcccccCC
Q psy12907 359 VLYTTDGGGVGYLKCTVPGVYATVDFGAATNVSSAFAAMRTVSPHGPLVNSEYYPGWLTHWGE 421 (949)
Q Consensus 359 plft~Dg~~~~~l~~~~~g~~~t~nfg~~~~~~~~f~~~~~~~p~~P~~~~Ef~~GWfd~WG~ 421 (949)
-+=+..+ ++. ....+....++.+. .|.+-..|=.||...||+
T Consensus 150 ~lE~~~~-----------------~~~--~~~~~~~~l~~~v~--~~~~~~~~D~~h~~~~~~ 191 (284)
T PRK13210 150 AVEIMDT-----------------PFM--NSISKWKKWDKEID--SPWLTVYPDVGNLSAWGN 191 (284)
T ss_pred EEEecCc-----------------ccc--CCHHHHHHHHHHcC--CCceeEEecCChhhhcCC
Confidence 4322111 011 12233444455553 466777777777777753
No 75
>TIGR01250 pro_imino_pep_2 proline-specific peptidases, Bacillus coagulans-type subfamily. This model describes a subfamily of the alpha/beta fold family of hydrolases. Characterized members include prolinases (Pro-Xaa dipeptidase, EC 3.4.13.8), prolyl aminopeptidases (EC 3.4.11.5), and a leucyl aminopeptidase
Probab=86.89 E-value=0.34 Score=50.49 Aligned_cols=33 Identities=15% Similarity=0.320 Sum_probs=26.5
Q ss_pred CCCeEECCCCCCchhhhhhhhhhcccc-cccccc
Q psy12907 612 RSPLVFLPPVSGTADIFFHLMLDLSRK-GYRTLS 644 (949)
Q Consensus 612 ~~~v~~l~~~~g~a~~~~~~~~~l~~k-gyR~is 644 (949)
..+|+|+||.-|.+..++..+..+.++ |||||.
T Consensus 25 ~~~vl~~hG~~g~~~~~~~~~~~~l~~~g~~vi~ 58 (288)
T TIGR01250 25 KIKLLLLHGGPGMSHEYLENLRELLKEEGREVIM 58 (288)
T ss_pred CCeEEEEcCCCCccHHHHHHHHHHHHhcCCEEEE
Confidence 347999999988887787777777665 999995
No 76
>PLN03084 alpha/beta hydrolase fold protein; Provisional
Probab=86.82 E-value=0.3 Score=56.58 Aligned_cols=67 Identities=18% Similarity=0.256 Sum_probs=48.4
Q ss_pred cccCCCCccCCCeEECCCCCCchhhhhhhhhhcccccccccc--------------------------------------
Q psy12907 603 VYDCGPKHIRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-------------------------------------- 644 (949)
Q Consensus 603 ~~~~g~~~~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-------------------------------------- 644 (949)
|.+.|+.+ ..+|+||||..+++..|-.++-.|+ ++||||+
T Consensus 119 y~~~G~~~-~~~ivllHG~~~~~~~w~~~~~~L~-~~~~Via~DlpG~G~S~~p~~~~~~~ys~~~~a~~l~~~i~~l~~ 196 (383)
T PLN03084 119 CVESGSNN-NPPVLLIHGFPSQAYSYRKVLPVLS-KNYHAIAFDWLGFGFSDKPQPGYGFNYTLDEYVSSLESLIDELKS 196 (383)
T ss_pred EEecCCCC-CCeEEEECCCCCCHHHHHHHHHHHh-cCCEEEEECCCCCCCCCCCcccccccCCHHHHHHHHHHHHHHhCC
Confidence 44445433 3479999999999999888877775 5899995
Q ss_pred --------cCCcchhhcccccccccCceeeccccccc
Q psy12907 645 --------YLGGFLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 645 --------~lg~~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
++||.+++.|+...+ +.|++|+|+|+=
T Consensus 197 ~~~~LvG~s~GG~ia~~~a~~~P--~~v~~lILi~~~ 231 (383)
T PLN03084 197 DKVSLVVQGYFSPPVVKYASAHP--DKIKKLILLNPP 231 (383)
T ss_pred CCceEEEECHHHHHHHHHHHhCh--HhhcEEEEECCC
Confidence 466777777777655 467777777753
No 77
>TIGR03695 menH_SHCHC 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase. This protein catalyzes the formation of SHCHC, or (1 R,6 R)-2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate, by elmination of pyruvate from 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate (SEPHCHC). Note that SHCHC synthase activity previously was attributed to MenD, which in fact is SEPHCHC synthase.
Probab=86.57 E-value=0.36 Score=48.66 Aligned_cols=30 Identities=30% Similarity=0.369 Sum_probs=26.5
Q ss_pred CeEECCCCCCchhhhhhhhhhcccccccccc
Q psy12907 614 PLVFLPPVSGTADIFFHLMLDLSRKGYRTLS 644 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is 644 (949)
+|+|+||..|++..|..++-.|+ +||+|+.
T Consensus 3 ~vv~~hG~~~~~~~~~~~~~~L~-~~~~v~~ 32 (251)
T TIGR03695 3 VLVFLHGFLGSGADWQALIELLG-PHFRCLA 32 (251)
T ss_pred EEEEEcCCCCchhhHHHHHHHhc-ccCeEEE
Confidence 68899999999999998888887 8999985
No 78
>PRK08775 homoserine O-acetyltransferase; Provisional
Probab=85.27 E-value=0.27 Score=55.29 Aligned_cols=28 Identities=21% Similarity=0.336 Sum_probs=24.8
Q ss_pred cCCcchhhcccccccccCceeecccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFY 674 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~ 674 (949)
|+||++|+.|+..++ +.|.+|+|+++-.
T Consensus 146 SmGG~vA~~~A~~~P--~~V~~LvLi~s~~ 173 (343)
T PRK08775 146 SYGALVGLQFASRHP--ARVRTLVVVSGAH 173 (343)
T ss_pred CHHHHHHHHHHHHCh--HhhheEEEECccc
Confidence 799999999999887 6899999999864
No 79
>PLN02578 hydrolase
Probab=84.88 E-value=0.4 Score=54.24 Aligned_cols=56 Identities=29% Similarity=0.330 Sum_probs=48.5
Q ss_pred CeEECCCCCCchhhhhhhhhhcccccccccc------------------------------------------cCCcchh
Q psy12907 614 PLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------------YLGGFLA 651 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------------~lg~~l~ 651 (949)
||+|+||..+++..+.+++-.|+ ++|||+. |+||+++
T Consensus 88 ~vvliHG~~~~~~~w~~~~~~l~-~~~~v~~~D~~G~G~S~~~~~~~~~~~~a~~l~~~i~~~~~~~~~lvG~S~Gg~ia 166 (354)
T PLN02578 88 PIVLIHGFGASAFHWRYNIPELA-KKYKVYALDLLGFGWSDKALIEYDAMVWRDQVADFVKEVVKEPAVLVGNSLGGFTA 166 (354)
T ss_pred eEEEECCCCCCHHHHHHHHHHHh-cCCEEEEECCCCCCCCCCcccccCHHHHHHHHHHHHHHhccCCeEEEEECHHHHHH
Confidence 79999999999999988887776 4699995 6999999
Q ss_pred hcccccccccCceeecccccc
Q psy12907 652 QKFAEHTVLRPRVVSLFLCNS 672 (949)
Q Consensus 652 q~f~~~~~~~~~v~sl~lcn~ 672 (949)
+.++...+ +.|++++|+|+
T Consensus 167 ~~~A~~~p--~~v~~lvLv~~ 185 (354)
T PLN02578 167 LSTAVGYP--ELVAGVALLNS 185 (354)
T ss_pred HHHHHhCh--HhcceEEEECC
Confidence 99999876 67999999886
No 80
>TIGR02427 protocat_pcaD 3-oxoadipate enol-lactonase. Members of this family are 3-oxoadipate enol-lactonase. Note that the substrate is known as 3-oxoadipate enol-lactone, 2-oxo-2,3-dihydrofuran-5-acetate, 4,5-Dihydro-5-oxofuran-2-acetate, and 5-oxo-4,5-dihydrofuran-2-acetate. The enzyme the catalyzes the fourth step in the protocatechuate degradation to beta-ketoadipate and then to succinyl-CoA and acetyl-CoA. 4-hydroxybenzoate, 3-hydroxybenzoate, and vanillate all can be converted in one step to protocatechuate. This enzyme also acts in catechol degradation. In genomes that catabolize both catechol and protocatechuate, two forms of this enzyme may be found. All members of the seed alignment for this model were chosen from within protocatechuate degradation operons of at least three genes of the pathway, from genomes with the complete pathway through beta-ketoadipate.
Probab=83.92 E-value=0.45 Score=48.22 Aligned_cols=32 Identities=19% Similarity=0.352 Sum_probs=24.1
Q ss_pred CCCeEECCCCCCchhhhhhhhhhcccccccccc
Q psy12907 612 RSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS 644 (949)
Q Consensus 612 ~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is 644 (949)
...|+|+|+..+++..|...+-.| ++||||++
T Consensus 13 ~~~li~~hg~~~~~~~~~~~~~~l-~~~~~v~~ 44 (251)
T TIGR02427 13 APVLVFINSLGTDLRMWDPVLPAL-TPDFRVLR 44 (251)
T ss_pred CCeEEEEcCcccchhhHHHHHHHh-hcccEEEE
Confidence 345789999888888776666555 46899995
No 81
>PLN02894 hydrolase, alpha/beta fold family protein
Probab=82.77 E-value=0.56 Score=54.42 Aligned_cols=58 Identities=24% Similarity=0.434 Sum_probs=45.6
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc-----------------------------------------------c
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-----------------------------------------------Y 645 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-----------------------------------------------~ 645 (949)
.+|+|+||..+++..|+.++-.|++ +|||+. |
T Consensus 106 p~vvllHG~~~~~~~~~~~~~~L~~-~~~vi~~D~rG~G~S~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~lvGhS 184 (402)
T PLN02894 106 PTLVMVHGYGASQGFFFRNFDALAS-RFRVIAIDQLGWGGSSRPDFTCKSTEETEAWFIDSFEEWRKAKNLSNFILLGHS 184 (402)
T ss_pred CEEEEECCCCcchhHHHHHHHHHHh-CCEEEEECCCCCCCCCCCCcccccHHHHHHHHHHHHHHHHHHcCCCCeEEEEEC
Confidence 3688899988888888888777764 588884 6
Q ss_pred CCcchhhcccccccccCceeeccccccc
Q psy12907 646 LGGFLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 646 lg~~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
+||++|+.++...+ ..|++|+|+++-
T Consensus 185 ~GG~la~~~a~~~p--~~v~~lvl~~p~ 210 (402)
T PLN02894 185 FGGYVAAKYALKHP--EHVQHLILVGPA 210 (402)
T ss_pred HHHHHHHHHHHhCc--hhhcEEEEECCc
Confidence 88999998888765 578888888753
No 82
>PF13200 DUF4015: Putative glycosyl hydrolase domain
Probab=82.43 E-value=3.5 Score=46.87 Aligned_cols=108 Identities=16% Similarity=0.187 Sum_probs=68.0
Q ss_pred hhhHHHHHHHHHHcCCCEEEE-------eecCCccCCCCCeeecc--CchhHHHHHHHHHHCCceEEecCCcccc---cc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVST-------YVEWSFHEPSPGQFQFT--GDQDLEYFLELAVREDLLVLLRPGPYIC---AE 272 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~t-------Yv~Wn~HEp~~G~fdF~--g~~DL~~fl~la~e~GL~VILRpGPYIc---AE 272 (949)
++.-+..|+.+++.|+|+|-+ .|.+..-.|..-+..-. ...|+.++++.++++|+++|.|.=-+-- |+
T Consensus 12 ~~~~~~~~~~i~~t~lNavVIDvKdd~G~i~y~s~~~~~~~~ga~~~~i~D~~~l~~~l~e~gIY~IARIv~FkD~~la~ 91 (316)
T PF13200_consen 12 PERLDKLLDLIKRTELNAVVIDVKDDDGNITYDSQVPLAREIGAVKPYIKDLKALVKKLKEHGIYPIARIVVFKDPVLAE 91 (316)
T ss_pred HHHHHHHHHHHHhcCCceEEEEEecCCceEEecCCCchhhhcccccccccCHHHHHHHHHHCCCEEEEEEEEecChHHhh
Confidence 356778999999999999843 35565544443333221 1379999999999999999999521110 11
Q ss_pred cCCCCccccccccCCCeeeecC-----CHhHHHHHHHHHHHHHHHHHhc
Q psy12907 273 RDFGGLPSWLMTIKPNITLRSK-----DEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 273 w~~GGlP~WL~~~~p~~~~Rt~-----dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
. -|.|-.+...+-..|.+ =.+|.++|.+|.-.|+..++..
T Consensus 92 ~----~pe~av~~~~G~~w~d~~~~~WvnP~~~evw~Y~i~IA~Eaa~~ 136 (316)
T PF13200_consen 92 A----HPEWAVKTKDGSVWRDNEGEAWVNPYSKEVWDYNIDIAKEAAKL 136 (316)
T ss_pred h----ChhhEEECCCCCcccCCCCCccCCCCCHHHHHHHHHHHHHHHHc
Confidence 1 24444321111111211 1257899999999999999865
No 83
>PRK09441 cytoplasmic alpha-amylase; Reviewed
Probab=82.39 E-value=1.9 Score=51.38 Aligned_cols=67 Identities=16% Similarity=0.180 Sum_probs=45.2
Q ss_pred CCCCCC----hhhHH---HHHHHHHHcCCCEEEE-eecCCc-----cCCCCCee--------------eccCchhHHHHH
Q psy12907 199 HYFRSP----RYYWQ---DRLRKLRAAGLNAVST-YVEWSF-----HEPSPGQF--------------QFTGDQDLEYFL 251 (949)
Q Consensus 199 HY~R~p----~~~W~---d~L~k~Ka~GlNtV~t-Yv~Wn~-----HEp~~G~f--------------dF~g~~DL~~fl 251 (949)
|.|.|+ .+.|. +.|.-+|++|+++|-+ +++-+. |--.+--| .|....||.+++
T Consensus 8 q~f~w~~~~~~~~~~~I~~kldyl~~LGvtaIwl~P~~~~~~~~~~hgY~~~D~~~~~~~~~~~~id~~fGt~~dl~~Li 87 (479)
T PRK09441 8 QYFEWYLPNDGKLWNRLAERAPELAEAGITAVWLPPAYKGTSGGYDVGYGVYDLFDLGEFDQKGTVRTKYGTKEELLNAI 87 (479)
T ss_pred EEEEeccCCCccHHHHHHHHHHHHHHcCCCEEEeCCCccCCCCCCCCCCCeecccccccccccCCcCcCcCCHHHHHHHH
Confidence 555555 35575 5677789999999976 343322 22222111 244568999999
Q ss_pred HHHHHCCceEEecC
Q psy12907 252 ELAVREDLLVLLRP 265 (949)
Q Consensus 252 ~la~e~GL~VILRp 265 (949)
+.|++.||+||+-.
T Consensus 88 ~~~H~~Gi~vi~D~ 101 (479)
T PRK09441 88 DALHENGIKVYADV 101 (479)
T ss_pred HHHHHCCCEEEEEE
Confidence 99999999999873
No 84
>PRK06489 hypothetical protein; Provisional
Probab=82.26 E-value=0.53 Score=53.35 Aligned_cols=16 Identities=19% Similarity=0.459 Sum_probs=10.7
Q ss_pred CeEECCCCCCchhhhh
Q psy12907 614 PLVFLPPVSGTADIFF 629 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~ 629 (949)
+|+||||..|.+..++
T Consensus 71 plvllHG~~~~~~~~~ 86 (360)
T PRK06489 71 AVLVLHGTGGSGKSFL 86 (360)
T ss_pred eEEEeCCCCCchhhhc
Confidence 5777777777666654
No 85
>PRK03204 haloalkane dehalogenase; Provisional
Probab=81.64 E-value=0.85 Score=49.91 Aligned_cols=66 Identities=20% Similarity=0.340 Sum_probs=48.7
Q ss_pred ccccCCCCccCCCeEECCCCCCchhhhhhhhhhcccccccccc-------------------------------------
Q psy12907 602 KVYDCGPKHIRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------- 644 (949)
Q Consensus 602 ~~~~~g~~~~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------- 644 (949)
.|.+.|. ..+|+||||....+..|-.++-.| +++||||+
T Consensus 27 ~y~~~G~---~~~iv~lHG~~~~~~~~~~~~~~l-~~~~~vi~~D~~G~G~S~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (286)
T PRK03204 27 HYIDEGT---GPPILLCHGNPTWSFLYRDIIVAL-RDRFRCVAPDYLGFGLSERPSGFGYQIDEHARVIGEFVDHLGLDR 102 (286)
T ss_pred EEEECCC---CCEEEEECCCCccHHHHHHHHHHH-hCCcEEEEECCCCCCCCCCCCccccCHHHHHHHHHHHHHHhCCCC
Confidence 3444553 247999999987777766655555 45799994
Q ss_pred ------cCCcchhhcccccccccCceeeccccccc
Q psy12907 645 ------YLGGFLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 645 ------~lg~~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
|+||.+++.|+...+ +.|++++|.|+.
T Consensus 103 ~~lvG~S~Gg~va~~~a~~~p--~~v~~lvl~~~~ 135 (286)
T PRK03204 103 YLSMGQDWGGPISMAVAVERA--DRVRGVVLGNTW 135 (286)
T ss_pred EEEEEECccHHHHHHHHHhCh--hheeEEEEECcc
Confidence 699999999988776 578898887765
No 86
>PF01261 AP_endonuc_2: Xylose isomerase-like TIM barrel; InterPro: IPR012307 This TIM alpha/beta barrel structure is found in xylose isomerase (P19148 from SWISSPROT) and in endonuclease IV (P12638 from SWISSPROT, 3.1.21.2 from EC). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae []. ; PDB: 3KWS_B 3DX5_A 3CQH_B 3CQI_A 3CQK_A 3CQJ_B 2G0W_B 1DXI_A 2ZDS_D 3TVA_B ....
Probab=80.06 E-value=8.4 Score=39.14 Aligned_cols=126 Identities=15% Similarity=0.072 Sum_probs=76.0
Q ss_pred HHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCCCeee
Q psy12907 212 LRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITL 291 (949)
Q Consensus 212 L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~ 291 (949)
|+.++++|+..|+............ ..+++++.++++++||.+..--.+ .. +. .+....
T Consensus 1 l~~~~~~G~~~vE~~~~~~~~~~~~-------~~~~~~~~~~~~~~gl~i~~~~~~-~~--~~-----------~~~~~~ 59 (213)
T PF01261_consen 1 LEAAAEAGFDGVELRFDDGQPWDEK-------DDEAEELRRLLEDYGLKIASLHPP-TN--FW-----------SPDEEN 59 (213)
T ss_dssp HHHHHHTTHSEEEEEHHHHSHHTHH-------HHHHHHHHHHHHHTTCEEEEEEEE-ES--SS-----------CTGTTS
T ss_pred ChHHHHcCCCEEEEecCCCcccccc-------hHHHHHHHHHHHHcCCeEEEEecc-cc--cc-----------cccccc
Confidence 6789999999999876533222211 346999999999999997522111 00 00 111112
Q ss_pred ecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccc--cccCCc------cchHHHHHHHHHHHHHhcCCeeEEEee
Q psy12907 292 RSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVE--NEMGSY------TCDKEHMIWLRDQMKYYVRDAAVLYTT 363 (949)
Q Consensus 292 Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIE--NEyG~y------~~d~~Y~~~L~~~~~~~lG~~vplft~ 363 (949)
++..++ ++...+.+.+.+...+.+ |.+.|.+..= +..... ..-.+.++.|.+.++++ |+.+.+-+.
T Consensus 60 ~~~~~~-r~~~~~~~~~~i~~a~~l----g~~~i~~~~g~~~~~~~~~~~~~~~~~~~~l~~l~~~a~~~-gv~i~lE~~ 133 (213)
T PF01261_consen 60 GSANDE-REEALEYLKKAIDLAKRL----GAKYIVVHSGRYPSGPEDDTEENWERLAENLRELAEIAEEY-GVRIALENH 133 (213)
T ss_dssp TTSSSH-HHHHHHHHHHHHHHHHHH----TBSEEEEECTTESSSTTSSHHHHHHHHHHHHHHHHHHHHHH-TSEEEEE-S
T ss_pred cCcchh-hHHHHHHHHHHHHHHHHh----CCCceeecCcccccccCCCHHHHHHHHHHHHHHHHhhhhhh-cceEEEecc
Confidence 344444 777777888888888776 4566666643 121111 22356777788888777 877666554
Q ss_pred C
Q psy12907 364 D 364 (949)
Q Consensus 364 D 364 (949)
.
T Consensus 134 ~ 134 (213)
T PF01261_consen 134 P 134 (213)
T ss_dssp S
T ss_pred c
Confidence 3
No 87
>KOG4178|consensus
Probab=80.00 E-value=0.92 Score=51.41 Aligned_cols=60 Identities=22% Similarity=0.276 Sum_probs=53.2
Q ss_pred CeEECCCCCCchhhhhhhhhhcccccccccc--------------------------------------------cCCcc
Q psy12907 614 PLVFLPPVSGTADIFFHLMLDLSRKGYRTLS--------------------------------------------YLGGF 649 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is--------------------------------------------~lg~~ 649 (949)
-|.+|||--.......+|+..|+.+|||||+ +.|+.
T Consensus 46 ~illlHGfPe~wyswr~q~~~la~~~~rviA~DlrGyG~Sd~P~~~~~Yt~~~l~~di~~lld~Lg~~k~~lvgHDwGai 125 (322)
T KOG4178|consen 46 IVLLLHGFPESWYSWRHQIPGLASRGYRVIAPDLRGYGFSDAPPHISEYTIDELVGDIVALLDHLGLKKAFLVGHDWGAI 125 (322)
T ss_pred EEEEEccCCccchhhhhhhhhhhhcceEEEecCCCCCCCCCCCCCcceeeHHHHHHHHHHHHHHhccceeEEEeccchhH
Confidence 3556999999999999999999999999996 58999
Q ss_pred hhhcccccccccCceeeccccccccc
Q psy12907 650 LAQKFAEHTVLRPRVVSLFLCNSFYD 675 (949)
Q Consensus 650 l~q~f~~~~~~~~~v~sl~lcn~~~d 675 (949)
+|++++.-.+ ++|.++++||.=.-
T Consensus 126 vaw~la~~~P--erv~~lv~~nv~~~ 149 (322)
T KOG4178|consen 126 VAWRLALFYP--ERVDGLVTLNVPFP 149 (322)
T ss_pred HHHHHHHhCh--hhcceEEEecCCCC
Confidence 9999999887 79999999997644
No 88
>PLN02652 hydrolase; alpha/beta fold family protein
Probab=79.63 E-value=0.94 Score=52.71 Aligned_cols=63 Identities=14% Similarity=0.281 Sum_probs=48.3
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc-----------------------------------------------c
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS-----------------------------------------------Y 645 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is-----------------------------------------------~ 645 (949)
+.|+|+||..+.+..|...+-.|+++||+|+. |
T Consensus 137 ~~Vl~lHG~~~~~~~~~~~a~~L~~~Gy~V~~~D~rGhG~S~~~~~~~~~~~~~~~Dl~~~l~~l~~~~~~~~i~lvGhS 216 (395)
T PLN02652 137 GILIIIHGLNEHSGRYLHFAKQLTSCGFGVYAMDWIGHGGSDGLHGYVPSLDYVVEDTEAFLEKIRSENPGVPCFLFGHS 216 (395)
T ss_pred eEEEEECCchHHHHHHHHHHHHHHHCCCEEEEeCCCCCCCCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCCCEEEEEEC
Confidence 45899999999998888888889999999994 5
Q ss_pred CCcchhhcccccccccCceeeccccccccc
Q psy12907 646 LGGFLAQKFAEHTVLRPRVVSLFLCNSFYD 675 (949)
Q Consensus 646 lg~~l~q~f~~~~~~~~~v~sl~lcn~~~d 675 (949)
+||.++..++......+.|..++|+++..+
T Consensus 217 mGG~ial~~a~~p~~~~~v~glVL~sP~l~ 246 (395)
T PLN02652 217 TGGAVVLKAASYPSIEDKLEGIVLTSPALR 246 (395)
T ss_pred HHHHHHHHHHhccCcccccceEEEECcccc
Confidence 677777776654333457777888777654
No 89
>PRK14875 acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional
Probab=79.53 E-value=1.1 Score=50.11 Aligned_cols=69 Identities=20% Similarity=0.297 Sum_probs=51.7
Q ss_pred cccccCCCCccCCCeEECCCCCCchhhhhhhhhhcccccccccc------------------------------------
Q psy12907 601 WKVYDCGPKHIRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------ 644 (949)
Q Consensus 601 I~~~~~g~~~~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------ 644 (949)
|.|+..|+.. ..+|+|+||..|+...|+.++-.|++. |||++
T Consensus 121 i~~~~~g~~~-~~~vl~~HG~~~~~~~~~~~~~~l~~~-~~v~~~d~~g~G~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (371)
T PRK14875 121 VRYLRLGEGD-GTPVVLIHGFGGDLNNWLFNHAALAAG-RPVIALDLPGHGASSKAVGAGSLDELAAAVLAFLDALGIER 198 (371)
T ss_pred EEEecccCCC-CCeEEEECCCCCccchHHHHHHHHhcC-CEEEEEcCCCCCCCCCCCCCCCHHHHHHHHHHHHHhcCCcc
Confidence 5566666544 457999999999999999888877654 99995
Q ss_pred ------cCCcchhhcccccccccCceeeccccccc
Q psy12907 645 ------YLGGFLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 645 ------~lg~~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
|+||+++..++...+ ..|..+++.++.
T Consensus 199 ~~lvG~S~Gg~~a~~~a~~~~--~~v~~lv~~~~~ 231 (371)
T PRK14875 199 AHLVGHSMGGAVALRLAARAP--QRVASLTLIAPA 231 (371)
T ss_pred EEEEeechHHHHHHHHHHhCc--hheeEEEEECcC
Confidence 688888888777654 457777776654
No 90
>PF14307 Glyco_tran_WbsX: Glycosyltransferase WbsX
Probab=79.26 E-value=16 Score=41.91 Aligned_cols=55 Identities=31% Similarity=0.476 Sum_probs=40.3
Q ss_pred CCCCcccchhhhcccCCccC-CCcEEEEeeccCCCCCC-CCHHHHHHHHHHHHHhCCCeeEEE
Q psy12907 9 GTGMTLPWTNLQKTRPTTYA-PGEIFPFQVENEMGSYT-CDKEHMIWLRDQMKYYVRDAAVLY 69 (949)
Q Consensus 9 ~~~~~~~~~~~~~iap~qit-GGPIImvQVENEYGsyg-~D~~YM~~L~d~~r~~Gi~~VPL~ 69 (949)
|.|||| |||..+.+|+.-. -=|-+-. + -|-|. .|++.|++=.++++++||+ .-.|
T Consensus 21 g~GftE-Wt~v~~a~p~f~gh~qP~~P~--~--lGyYdl~~p~v~~~Q~~lA~~~GI~-gF~~ 77 (345)
T PF14307_consen 21 GKGFTE-WTNVKKAKPLFPGHYQPRVPL--D--LGYYDLRDPEVMEKQAELAKEYGID-GFCF 77 (345)
T ss_pred cCCCCc-ceEcccccccCCCCCCCCCCC--c--CCcccCCCHHHHHHHHHHHHHhCCC-EEEE
Confidence 789998 9999998887644 2343322 2 34443 6999999999999999998 4433
No 91
>PRK01060 endonuclease IV; Provisional
Probab=79.06 E-value=38 Score=36.97 Aligned_cols=95 Identities=13% Similarity=0.215 Sum_probs=60.3
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceE--EecCCcccccccCCCCccccccc
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLV--LLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~V--ILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
.+++.+++++++|+++|+..+.- -|.-..+.+ ...++.++-+++++.||.+ +.--+||. .
T Consensus 13 ~~~~~l~~~~~~G~d~vEl~~~~-p~~~~~~~~---~~~~~~~lk~~~~~~gl~~~~~~~h~~~~-------------~- 74 (281)
T PRK01060 13 GLEGAVAEAAEIGANAFMIFTGN-PQQWKRKPL---EELNIEAFKAACEKYGISPEDILVHAPYL-------------I- 74 (281)
T ss_pred CHHHHHHHHHHcCCCEEEEECCC-CCCCcCCCC---CHHHHHHHHHHHHHcCCCCCceEEecceE-------------e-
Confidence 38899999999999999996531 121111122 1235888999999999984 11122321 1
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEec
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQ 328 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQ 328 (949)
.+-+.|+..+++..+.+++.+...+.+ |-+.|-+.
T Consensus 75 -----nl~~~d~~~r~~s~~~~~~~i~~A~~l----ga~~vv~h 109 (281)
T PRK01060 75 -----NLGNPNKEILEKSRDFLIQEIERCAAL----GAKLLVFH 109 (281)
T ss_pred -----cCCCCCHHHHHHHHHHHHHHHHHHHHc----CCCEEEEc
Confidence 123457788888888888888777665 33455444
No 92
>PF00128 Alpha-amylase: Alpha amylase, catalytic domain; InterPro: IPR006047 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain, such as alpha-amylase, belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. The maltogenic alpha-amylase is an enzyme which catalyses hydrolysis of (1-4)-alpha-D-glucosidic linkages in polysaccharides so as to remove successive alpha-maltose residues from the non-reducing ends of the chains in the conversion of starch to maltose. Other enzymes include neopullulanase, which hydrolyses pullulan to panose, and cyclomaltodextrinase, which hydrolyses cyclodextrins. This entry represents the catalytic domain found in several protein members of this family. It has a structure consisting of an 8 stranded alpha/beta barrel that contains the active site, interrupted by a ~70 amino acid calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0043169 cation binding, 0005975 carbohydrate metabolic process; PDB: 3FAX_A 3FAW_A 2DH3_B 2DH2_A 1CIU_A 1A47_A 3BMW_A 3BMV_A 2FH8_A 2FH6_A ....
Probab=78.48 E-value=2.5 Score=45.54 Aligned_cols=56 Identities=21% Similarity=0.287 Sum_probs=38.5
Q ss_pred HHHHHHHHHcCCCEEEEeecCCccCCCCC--eee-------ccCchhHHHHHHHHHHCCceEEec
Q psy12907 209 QDRLRKLRAAGLNAVSTYVEWSFHEPSPG--QFQ-------FTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G--~fd-------F~g~~DL~~fl~la~e~GL~VILR 264 (949)
.++|.-+|++|+|+|.+-=++.-....-| .-| |....|+.++++.|++.||+|||-
T Consensus 7 ~~kLdyl~~lGv~~I~l~Pi~~~~~~~~gY~~~d~~~vd~~~Gt~~d~~~Lv~~~h~~gi~VilD 71 (316)
T PF00128_consen 7 IDKLDYLKDLGVNAIWLSPIFESPNGYHGYDPSDYYAVDPRFGTMEDFKELVDAAHKRGIKVILD 71 (316)
T ss_dssp HHTHHHHHHHTESEEEESS-EESSSSTTTTSESEEEEESTTTBHHHHHHHHHHHHHHTTCEEEEE
T ss_pred HHhhHHHHHcCCCceecccccccccccccccceeeeccccccchhhhhhhhhhccccccceEEEe
Confidence 46788899999999987533332211111 112 234579999999999999999975
No 93
>PRK09856 fructoselysine 3-epimerase; Provisional
Probab=77.14 E-value=76 Score=34.43 Aligned_cols=124 Identities=12% Similarity=0.119 Sum_probs=68.4
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEe-cCCcccccccCCCCcccccccc
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLL-RPGPYICAERDFGGLPSWLMTI 285 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VIL-RpGPYIcAEw~~GGlP~WL~~~ 285 (949)
.|++.|+.++++|++.|++.... .|+-.+ +++ ..++.++-++++++||.|.. .|+. +++|..+..
T Consensus 14 ~l~~~l~~~~~~G~~~vEl~~~~-~~~~~~---~~~-~~~~~~l~~~~~~~gl~v~s~~~~~--------~~~~~~~~~- 79 (275)
T PRK09856 14 PIEHAFRDASELGYDGIEIWGGR-PHAFAP---DLK-AGGIKQIKALAQTYQMPIIGYTPET--------NGYPYNMML- 79 (275)
T ss_pred CHHHHHHHHHHcCCCEEEEccCC-cccccc---ccC-chHHHHHHHHHHHcCCeEEEecCcc--------cCcCccccC-
Confidence 59999999999999999983211 011011 122 24688899999999999853 3321 234333321
Q ss_pred CCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccc-cC-C----ccchHHHHHHHHHHHHHhcCCe
Q psy12907 286 KPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENE-MG-S----YTCDKEHMIWLRDQMKYYVRDA 357 (949)
Q Consensus 286 ~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENE-yG-~----y~~d~~Y~~~L~~~~~~~lG~~ 357 (949)
.++.-+++..+.+++.+..-+.+ |.+.|.+-.-.. +. . +..-.+.++.|.+.+.++ |+.
T Consensus 80 --------~~~~~r~~~~~~~~~~i~~a~~l----Ga~~i~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~a~~~-gv~ 144 (275)
T PRK09856 80 --------GDEHMRRESLDMIKLAMDMAKEM----NAGYTLISAAHAGYLTPPNVIWGRLAENLSELCEYAENI-GMD 144 (275)
T ss_pred --------CCHHHHHHHHHHHHHHHHHHHHh----CCCEEEEcCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHc-CCE
Confidence 23444455455555555555544 334443421111 00 0 112235677777777776 765
No 94
>PLN02298 hydrolase, alpha/beta fold family protein
Probab=76.89 E-value=1.5 Score=48.66 Aligned_cols=30 Identities=10% Similarity=0.078 Sum_probs=21.7
Q ss_pred cCCcchhhcccccccccCceeecccccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFYDT 676 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~dt 676 (949)
|+||.+++.|+...+ ..|..|+|+++..+.
T Consensus 142 SmGG~ia~~~a~~~p--~~v~~lvl~~~~~~~ 171 (330)
T PLN02298 142 SMGGAICLLIHLANP--EGFDGAVLVAPMCKI 171 (330)
T ss_pred cchhHHHHHHHhcCc--ccceeEEEecccccC
Confidence 577888887777655 467888888877553
No 95
>PRK10150 beta-D-glucuronidase; Provisional
Probab=76.79 E-value=5.5 Score=48.86 Aligned_cols=68 Identities=15% Similarity=0.172 Sum_probs=48.3
Q ss_pred CceeEEEEEecCCCC--CCCCcccccCCcceEEEEEECCEEEEEEEec-ceeeeeccC--CC--CCeEEEEEEeC
Q psy12907 519 SFGFVLYETIIPDAR--FPDPALLTISGLRDRGQVFVDEKLVTILYRN-KMLSTPIMA--RP--GQKLSILVENM 586 (949)
Q Consensus 519 ~~GyvlY~t~i~~~~--~~~~~~L~i~~vrDrA~Vfvdg~~vG~l~r~-~~~~i~~~~--~~--~~~LdILVENm 586 (949)
..|...||+++.-+. .+..+.|.+.+|.-.|.|+|||+.||.-.-- ....++++. +. .++|.|.|.|.
T Consensus 63 ~~G~~WYrr~f~lp~~~~gk~v~L~Fegv~~~a~V~lNG~~vg~~~~~~~~f~~DIT~~l~~G~~n~L~V~v~n~ 137 (604)
T PRK10150 63 YVGDVWYQREVFIPKGWAGQRIVLRFGSVTHYAKVWVNGQEVMEHKGGYTPFEADITPYVYAGKSVRITVCVNNE 137 (604)
T ss_pred CcccEEEEEEEECCcccCCCEEEEEECcccceEEEEECCEEeeeEcCCccceEEeCchhccCCCceEEEEEEecC
Confidence 357788999874322 3456789999999999999999999975442 334555532 22 24899999885
No 96
>TIGR01249 pro_imino_pep_1 proline iminopeptidase, Neisseria-type subfamily. This model represents one of two related families of proline iminopeptidase in the alpha/beta fold hydrolase family. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members.
Probab=74.21 E-value=1.3 Score=48.95 Aligned_cols=28 Identities=29% Similarity=0.313 Sum_probs=18.8
Q ss_pred cCCcchhhcccccccccCceeecccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFY 674 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~ 674 (949)
|+||+++..++...+ +.|.+++|+|.+.
T Consensus 103 S~GG~ia~~~a~~~p--~~v~~lvl~~~~~ 130 (306)
T TIGR01249 103 SWGSTLALAYAQTHP--EVVTGLVLRGIFL 130 (306)
T ss_pred CHHHHHHHHHHHHCh--Hhhhhheeecccc
Confidence 577777777776655 3566777776653
No 97
>TIGR02402 trehalose_TreZ malto-oligosyltrehalose trehalohydrolase. Members of this family are the trehalose biosynthetic enzyme malto-oligosyltrehalose trehalohydrolase, formally known as 4-alpha-D-{(1-4)-alpha-D-glucano}trehalose trehalohydrolase (EC 3.2.1.141). It is the TreZ protein of the TreYZ pathway for trehalose biosynthesis, and alternative to the OtsAB system.
Probab=73.76 E-value=5.5 Score=48.41 Aligned_cols=53 Identities=26% Similarity=0.488 Sum_probs=39.1
Q ss_pred HHHHHHHHHcCCCEEEE-eec-------CCcc-----CCCCCeeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 209 QDRLRKLRAAGLNAVST-YVE-------WSFH-----EPSPGQFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~t-Yv~-------Wn~H-----Ep~~G~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
.++|.-+|++|+|+|.. .|+ |.+. .+.+ .|.+..|+.++++.|+++||.|||-
T Consensus 114 ~~~l~yl~~LGv~~i~L~Pi~~~~~~~~~GY~~~~~~~~~~---~~G~~~e~k~lV~~aH~~Gi~VilD 179 (542)
T TIGR02402 114 IEKLPYLADLGITAIELMPVAQFPGTRGWGYDGVLPYAPHN---AYGGPDDLKALVDAAHGLGLGVILD 179 (542)
T ss_pred HHhhHHHHHcCCCEEEeCccccCCCCCCCCCCccCcccccc---ccCCHHHHHHHHHHHHHCCCEEEEE
Confidence 45688999999999976 332 4322 2211 3445679999999999999999987
No 98
>TIGR01515 branching_enzym alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase. A sequence from Arabidopsis thaliana, GP|9294564, scores just above trusted, but appears either to contain corrupt sequence or, more likely, to be a pseudogene as some of the conserved catalytic residues common to the alpha amylase family are not conserved here.
Probab=72.13 E-value=7.5 Score=47.99 Aligned_cols=57 Identities=18% Similarity=0.240 Sum_probs=39.7
Q ss_pred HHHH-HHHHHcCCCEEEE-eecCCccCC----CCC-----eeeccCchhHHHHHHHHHHCCceEEecC
Q psy12907 209 QDRL-RKLRAAGLNAVST-YVEWSFHEP----SPG-----QFQFTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 209 ~d~L-~k~Ka~GlNtV~t-Yv~Wn~HEp----~~G-----~fdF~g~~DL~~fl~la~e~GL~VILRp 265 (949)
.++| .-+|++|+|+|+. .|+..-... .+- .-.|....||.++++.|+++||.|||--
T Consensus 159 ~~~l~dyl~~LGvt~i~L~Pi~e~~~~~~wGY~~~~y~~~~~~~Gt~~dlk~lV~~~H~~Gi~VilD~ 226 (613)
T TIGR01515 159 ADQLIPYVKELGFTHIELLPVAEHPFDGSWGYQVTGYYAPTSRFGTPDDFMYFVDACHQAGIGVILDW 226 (613)
T ss_pred HHHHHHHHHHcCCCEEEECCcccCCCCCCCCCCcccCcccccccCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 3454 7789999999998 776432111 000 1134456799999999999999999873
No 99
>TIGR01392 homoserO_Ac_trn homoserine O-acetyltransferase. This family describes homoserine-O-acetyltransferase, an enzyme of methionine biosynthesis. This model has been rebuilt to identify sequences more broadly, including a number of sequences suggested to be homoserine O-acetyltransferase based on proximity to other Met biosynthesis genes.
Probab=72.02 E-value=1.4 Score=49.70 Aligned_cols=27 Identities=22% Similarity=0.303 Sum_probs=17.0
Q ss_pred cCCcchhhcccccccccCceeeccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
|+||.+++.++...+ +.|++|+|+++.
T Consensus 135 S~Gg~ia~~~a~~~p--~~v~~lvl~~~~ 161 (351)
T TIGR01392 135 SMGGMQALEWAIDYP--ERVRAIVVLATS 161 (351)
T ss_pred CHHHHHHHHHHHHCh--HhhheEEEEccC
Confidence 566666666666554 456666666654
No 100
>smart00812 Alpha_L_fucos Alpha-L-fucosidase. O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site PUBMED:. Because the fold of proteins is better conserved than their sequences, some of the families can be grouped in 'clans'. Family 29 encompasses alpha-L-fucosidases, which is a lysosomal enzyme responsible for hydrolyzing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Deficiency of alpha-L-fucosidase results in the lysosomal storage disease fucosidosis.
Probab=71.56 E-value=18 Score=42.24 Aligned_cols=109 Identities=17% Similarity=0.223 Sum_probs=63.0
Q ss_pred ccCCCCCChhhHHHHHHHHHHcCCCEEEE-------eecCCccCCCCCeeeccCchh-HHHHHHHHHHCCceEEecCCcc
Q psy12907 197 SFHYFRSPRYYWQDRLRKLRAAGLNAVST-------YVEWSFHEPSPGQFQFTGDQD-LEYFLELAVREDLLVLLRPGPY 268 (949)
Q Consensus 197 e~HY~R~p~~~W~d~L~k~Ka~GlNtV~t-------Yv~Wn~HEp~~G~fdF~g~~D-L~~fl~la~e~GL~VILRpGPY 268 (949)
.+.+.+..++.|. +.+|++|+..|-. +-.|.-.-..-..-+-...+| |.++.+.|+++||++-+ |
T Consensus 75 ~F~p~~fD~~~Wa---~~~k~AGakY~vlTaKHHDGF~lw~S~~t~~n~~~~~pkrDiv~el~~A~rk~Glk~G~----Y 147 (384)
T smart00812 75 QFTAEKFDPEEWA---DLFKKAGAKYVVLTAKHHDGFCLWDSKYSNWNAVDTGPKRDLVGELADAVRKRGLKFGL----Y 147 (384)
T ss_pred cCCchhCCHHHHH---HHHHHcCCCeEEeeeeecCCccccCCCCCCCcccCCCCCcchHHHHHHHHHHcCCeEEE----E
Confidence 5567778888886 5678888886521 222443322111111111334 56789999999999866 3
Q ss_pred cc-cccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 269 IC-AERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 269 Ic-AEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
.- .+|.. |.|... .+.-..+.+.+.|.+.++.|+.+|.+.|..+
T Consensus 148 ~S~~DW~~---p~y~~~-~~~~~~~~~~~~~~~y~~~~~~Ql~ELit~Y 192 (384)
T smart00812 148 HSLFDWFN---PLYAGP-TSSDEDPDNWPRFQEFVDDWLPQLRELVTRY 192 (384)
T ss_pred cCHHHhCC---Cccccc-cccccccccchhHHHHHHHHHHHHHHHHhcC
Confidence 33 36654 444321 1111123456778888877777777777654
No 101
>PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=71.29 E-value=88 Score=34.13 Aligned_cols=168 Identities=14% Similarity=0.142 Sum_probs=89.8
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE-ecCCcccccccCCCCcccccccc
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL-LRPGPYICAERDFGGLPSWLMTI 285 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI-LRpGPYIcAEw~~GGlP~WL~~~ 285 (949)
.|++.++.++++|+..|+..+. ..++ .....+++ ..+++++.++++++||.|. +.++... .++
T Consensus 22 ~~~e~~~~~~~~G~~~iEl~~~-~~~~-~~~~~~~~-~~~~~~l~~~l~~~gl~i~~~~~~~~~-------~~~------ 85 (283)
T PRK13209 22 CWLEKLAIAKTAGFDFVEMSVD-ESDE-RLARLDWS-REQRLALVNALVETGFRVNSMCLSAHR-------RFP------ 85 (283)
T ss_pred CHHHHHHHHHHcCCCeEEEecC-cccc-chhccCCC-HHHHHHHHHHHHHcCCceeEEeccccc-------ccC------
Confidence 6999999999999999999532 1111 01122333 2468899999999999984 3322100 010
Q ss_pred CCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccc-cCC-----ccchHHHHHHHHHHHHHhcCCeeE
Q psy12907 286 KPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENE-MGS-----YTCDKEHMIWLRDQMKYYVRDAAV 359 (949)
Q Consensus 286 ~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENE-yG~-----y~~d~~Y~~~L~~~~~~~lG~~vp 359 (949)
+-+.++.-++...+.+++.+...+.+ |.++|-+.--.- ++. +..-.+.++.|.+.++++ |+.+-
T Consensus 86 -----~~~~~~~~r~~~~~~~~~~i~~a~~l----G~~~i~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~A~~~-GV~i~ 155 (283)
T PRK13209 86 -----LGSEDDAVRAQALEIMRKAIQLAQDL----GIRVIQLAGYDVYYEQANNETRRRFIDGLKESVELASRA-SVTLA 155 (283)
T ss_pred -----CCCCCHHHHHHHHHHHHHHHHHHHHc----CCCEEEECCccccccccHHHHHHHHHHHHHHHHHHHHHh-CCEEE
Confidence 11235555666666676677666654 455554421000 000 001134566677777766 76543
Q ss_pred EEeeCCCCccccccccCCeeeeecCCCCCChHHHHHHhhhcCCCCCeeeeecCCCcccccCC
Q psy12907 360 LYTTDGGGVGYLKCTVPGVYATVDFGAATNVSSAFAAMRTVSPHGPLVNSEYYPGWLTHWGE 421 (949)
Q Consensus 360 lft~Dg~~~~~l~~~~~g~~~t~nfg~~~~~~~~f~~~~~~~p~~P~~~~Ef~~GWfd~WG~ 421 (949)
+=+..+. +- ....+....++++. .|.+..-|=.|++.+||.
T Consensus 156 iE~~~~~-----------------~~--~~~~~~~~ll~~v~--~~~lgl~~D~~h~~~~~~ 196 (283)
T PRK13209 156 FEIMDTP-----------------FM--NSISKALGYAHYLN--SPWFQLYPDIGNLSAWDN 196 (283)
T ss_pred EeecCCc-----------------cc--CCHHHHHHHHHHhC--CCccceEeccchHHHhcC
Confidence 3222111 00 12344555555553 355555555566666653
No 102
>PLN02447 1,4-alpha-glucan-branching enzyme
Probab=70.39 E-value=9 Score=48.39 Aligned_cols=61 Identities=26% Similarity=0.374 Sum_probs=43.7
Q ss_pred hhhHHHHHHHHHHcCCCEEEE-ee-------cCCccCCCC--CeeeccCchhHHHHHHHHHHCCceEEecC
Q psy12907 205 RYYWQDRLRKLRAAGLNAVST-YV-------EWSFHEPSP--GQFQFTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~t-Yv-------~Wn~HEp~~--G~fdF~g~~DL~~fl~la~e~GL~VILRp 265 (949)
.+.|++.|..+|++|+|+|+. .| .|.++-..- =.-.|....||.++++.|+++||.|||--
T Consensus 250 ~~~~~~~L~ylk~LG~t~I~LmPi~e~~~~~~wGY~~~~~fa~~~~~Gtp~dlk~LVd~aH~~GI~VilDv 320 (758)
T PLN02447 250 REFADDVLPRIKALGYNAVQLMAIQEHAYYGSFGYHVTNFFAVSSRSGTPEDLKYLIDKAHSLGLRVLMDV 320 (758)
T ss_pred HHHHHHHHHHHHHcCCCEEEECCccccCCCCCCCcCcccCcccccccCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 456888999999999999976 22 244331100 01235556899999999999999999873
No 103
>PF05913 DUF871: Bacterial protein of unknown function (DUF871); InterPro: IPR008589 This family consists of several conserved hypothetical proteins from bacteria and archaea. The function of this family is unknown though a number are annotated as outer surface proteins.; PDB: 2P0O_A 1X7F_A.
Probab=70.01 E-value=4.2 Score=46.98 Aligned_cols=67 Identities=21% Similarity=0.171 Sum_probs=44.0
Q ss_pred ccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCccc
Q psy12907 197 SFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYI 269 (949)
Q Consensus 197 e~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYI 269 (949)
|+.+..-+.+.....|++|+++|+..|-| ++|.|+...=+. ...+..+++.|+++||.|++-..|=+
T Consensus 5 SvY~~~~~~~~~~~yi~~a~~~Gf~~iFT----SL~ipe~~~~~~--~~~~~~l~~~a~~~~~~v~~Disp~~ 71 (357)
T PF05913_consen 5 SVYPGQSSFEENKAYIEKAAKYGFKRIFT----SLHIPEDDPEDY--LERLKELLKLAKELGMEVIADISPKV 71 (357)
T ss_dssp EE-CCCS-HHHHHHHHHHHHCTTEEEEEE----EE---------H--HHHHHHHHHHHHHCT-EEEEEE-CCH
T ss_pred EEeCCCCCHHHHHHHHHHHHHCCCCEEEC----CCCcCCCCHHHH--HHHHHHHHHHHHHCCCEEEEECCHHH
Confidence 45555556889999999999999987766 689988543332 24689999999999999999877633
No 104
>TIGR02403 trehalose_treC alpha,alpha-phosphotrehalase. Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
Probab=69.65 E-value=7.1 Score=47.47 Aligned_cols=57 Identities=16% Similarity=0.187 Sum_probs=40.8
Q ss_pred hHHHHHHHHHHcCCCEEEE-eecCCccCCCCCeee----------ccCchhHHHHHHHHHHCCceEEecC
Q psy12907 207 YWQDRLRKLRAAGLNAVST-YVEWSFHEPSPGQFQ----------FTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~t-Yv~Wn~HEp~~G~fd----------F~g~~DL~~fl~la~e~GL~VILRp 265 (949)
-+.++|.-+|++|+|+|.+ .++-+-. ....|+ |....|+.++++.|+++||+|||-.
T Consensus 28 gi~~~l~yl~~lG~~~i~l~Pi~~~~~--~~~gY~~~d~~~id~~~Gt~~~~~~lv~~ah~~gi~vilD~ 95 (543)
T TIGR02403 28 GIIEKLDYLKKLGVDYIWLNPFYVSPQ--KDNGYDVSDYYAINPLFGTMADFEELVSEAKKRNIKIMLDM 95 (543)
T ss_pred HHHHhHHHHHHcCCCEEEECCcccCCC--CCCCCCccccCccCcccCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 4678899999999999976 3332210 001221 4456799999999999999999874
No 105
>TIGR02631 xylA_Arthro xylose isomerase, Arthrobacter type. This model describes a D-xylose isomerase that is also active as a D-glucose isomerase. It is tetrameric and dependent on a divalent cation Mg2+, Co2+ or Mn2+ as characterized in Arthrobacter. Members of this family differ substantially from the D-xylose isomerases of family TIGR02630.
Probab=67.13 E-value=81 Score=36.92 Aligned_cols=93 Identities=15% Similarity=0.108 Sum_probs=55.3
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEe----ecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE-ecCCcccccccCCCCc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTY----VEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL-LRPGPYICAERDFGGL 278 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tY----v~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI-LRpGPYIcAEw~~GGl 278 (949)
|+....+++++++++|+..|+.. ++|..-+. +-..++.++-+++++.||.|. +-++-+-...|..|+
T Consensus 30 ~~~~~~e~i~~la~~GfdgVE~~~~dl~P~~~~~~-------e~~~~~~~lk~~L~~~GL~v~~v~~nl~~~~~~~~g~- 101 (382)
T TIGR02631 30 TALDPVEAVHKLAELGAYGVTFHDDDLIPFGAPPQ-------ERDQIVRRFKKALDETGLKVPMVTTNLFSHPVFKDGG- 101 (382)
T ss_pred CCcCHHHHHHHHHHhCCCEEEecccccCCCCCChh-------HHHHHHHHHHHHHHHhCCeEEEeeccccCCccccCCC-
Confidence 34567899999999999999864 22211110 002457889999999999985 333211111122221
Q ss_pred cccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 279 PSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 279 P~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
+-+.|+..+++.-+++++.+..-+.+
T Consensus 102 ------------las~d~~vR~~ai~~~kraId~A~eL 127 (382)
T TIGR02631 102 ------------FTSNDRSVRRYALRKVLRNMDLGAEL 127 (382)
T ss_pred ------------CCCCCHHHHHHHHHHHHHHHHHHHHh
Confidence 23446766666666666666665554
No 106
>PLN03087 BODYGUARD 1 domain containing hydrolase; Provisional
Probab=67.01 E-value=2.3 Score=50.98 Aligned_cols=59 Identities=22% Similarity=0.290 Sum_probs=44.3
Q ss_pred CCeEECCCCCCchhhhhhhh-hhcc---cccccccc--------------------------------------------
Q psy12907 613 SPLVFLPPVSGTADIFFHLM-LDLS---RKGYRTLS-------------------------------------------- 644 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~-~~l~---~kgyR~is-------------------------------------------- 644 (949)
.+|+||||..+.+..|..++ -.|+ +++||||.
T Consensus 202 ~~VVLlHG~~~s~~~W~~~~~~~L~~~~~~~yrVia~Dl~G~G~S~~p~~~~ytl~~~a~~l~~~ll~~lg~~k~~LVGh 281 (481)
T PLN03087 202 EDVLFIHGFISSSAFWTETLFPNFSDAAKSTYRLFAVDLLGFGRSPKPADSLYTLREHLEMIERSVLERYKVKSFHIVAH 281 (481)
T ss_pred CeEEEECCCCccHHHHHHHHHHHHHHHhhCCCEEEEECCCCCCCCcCCCCCcCCHHHHHHHHHHHHHHHcCCCCEEEEEE
Confidence 47999999998888766443 3333 47899984
Q ss_pred cCCcchhhcccccccccCceeeccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
|+||.+++.|+..++ +.|.+|+|.++.
T Consensus 282 SmGG~iAl~~A~~~P--e~V~~LVLi~~~ 308 (481)
T PLN03087 282 SLGCILALALAVKHP--GAVKSLTLLAPP 308 (481)
T ss_pred CHHHHHHHHHHHhCh--HhccEEEEECCC
Confidence 688888888888776 578888888753
No 107
>TIGR02104 pulA_typeI pullulanase, type I. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family consists of pullulanases related to the subfamilies described in TIGR02102 and TIGR02103 but having a different domain architecture with shorter sequences. Members are called type I pullulanases.
Probab=66.85 E-value=9.2 Score=47.10 Aligned_cols=57 Identities=16% Similarity=0.299 Sum_probs=38.5
Q ss_pred HHHHHHHHHHcCCCEEEE-eec---------------CCccC-----CCCCeee----cc--CchhHHHHHHHHHHCCce
Q psy12907 208 WQDRLRKLRAAGLNAVST-YVE---------------WSFHE-----PSPGQFQ----FT--GDQDLEYFLELAVREDLL 260 (949)
Q Consensus 208 W~d~L~k~Ka~GlNtV~t-Yv~---------------Wn~HE-----p~~G~fd----F~--g~~DL~~fl~la~e~GL~ 260 (949)
-.+.|.-+|++|+|+|+. +|+ |.+.- |++ .|- +- ...++.++++.|+++||.
T Consensus 166 ~~~~LdyL~~LGvt~I~L~Pi~~~~~~~~~~~~~~~~wGY~~~~y~~~~~-~y~~~p~~~~~~~~efk~lV~~~H~~Gi~ 244 (605)
T TIGR02104 166 VSTGLDYLKELGVTHVQLLPVFDFAGVDEEDPNNAYNWGYDPLNYNVPEG-SYSTNPYDPATRIRELKQMIQALHENGIR 244 (605)
T ss_pred chhHHHHHHHcCCCEEEeCCcccccccccccCCCCCCCCCCCccCCCcCh-hhhcCCCccchHHHHHHHHHHHHHHCCCE
Confidence 346789999999999986 333 33321 110 111 11 136899999999999999
Q ss_pred EEecC
Q psy12907 261 VLLRP 265 (949)
Q Consensus 261 VILRp 265 (949)
|||--
T Consensus 245 VilDv 249 (605)
T TIGR02104 245 VIMDV 249 (605)
T ss_pred EEEEE
Confidence 99873
No 108
>KOG4409|consensus
Probab=66.56 E-value=2.4 Score=48.71 Aligned_cols=59 Identities=29% Similarity=0.496 Sum_probs=51.0
Q ss_pred CCCeEECCCCCCchhhhhhhhhhcccccccccc----------------------------------------------c
Q psy12907 612 RSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS----------------------------------------------Y 645 (949)
Q Consensus 612 ~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is----------------------------------------------~ 645 (949)
.-|+|++||-.+..+.||.-+-+|+. ..+|++ |
T Consensus 90 ~~plVliHGyGAg~g~f~~Nf~~La~-~~~vyaiDllG~G~SSRP~F~~d~~~~e~~fvesiE~WR~~~~L~KmilvGHS 168 (365)
T KOG4409|consen 90 KTPLVLIHGYGAGLGLFFRNFDDLAK-IRNVYAIDLLGFGRSSRPKFSIDPTTAEKEFVESIEQWRKKMGLEKMILVGHS 168 (365)
T ss_pred CCcEEEEeccchhHHHHHHhhhhhhh-cCceEEecccCCCCCCCCCCCCCcccchHHHHHHHHHHHHHcCCcceeEeecc
Confidence 45899999999999999999999886 666664 6
Q ss_pred CCcchhhcccccccccCceeeccccccc
Q psy12907 646 LGGFLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 646 lg~~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
+|||||-+||...| +.|.-|+||-+-
T Consensus 169 fGGYLaa~YAlKyP--erV~kLiLvsP~ 194 (365)
T KOG4409|consen 169 FGGYLAAKYALKYP--ERVEKLILVSPW 194 (365)
T ss_pred chHHHHHHHHHhCh--HhhceEEEeccc
Confidence 99999999999999 579999999653
No 109
>TIGR03234 OH-pyruv-isom hydroxypyruvate isomerase. This enzyme interconverts tartronate semi-aldehyde (TSA, aka 2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E. coli enzyme has been characterized and found to be specific for TSA, contain no cofactors, and have a rather high Km for hydroxypyruvate of 12.5 mM. The gene is ofter found in association with glyoxalate carboligase (which produces TSA), but has been shown to have no effect on growth on glyoxalate when knocked out. This is consistent with the fact that the gene for tartronate semialdehyde reductase (glxR) is also associated and may have primary responsibility for the catabolism of TSA.
Probab=65.61 E-value=1.4e+02 Score=31.98 Aligned_cols=42 Identities=19% Similarity=0.218 Sum_probs=35.0
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL 262 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI 262 (949)
.+++.+++++++|++.|+...++ ..++..+.++++++||.|.
T Consensus 15 ~l~e~~~~~~e~G~~~vEl~~~~--------------~~~~~~l~~~l~~~gl~v~ 56 (254)
T TIGR03234 15 PFLERFAAAAQAGFTGVEYLFPY--------------DWDAEALKARLAAAGLEQV 56 (254)
T ss_pred CHHHHHHHHHHcCCCEEEecCCc--------------cCCHHHHHHHHHHcCCeEE
Confidence 48999999999999999985432 1358889999999999986
No 110
>PRK12313 glycogen branching enzyme; Provisional
Probab=65.26 E-value=12 Score=46.44 Aligned_cols=53 Identities=13% Similarity=0.337 Sum_probs=37.2
Q ss_pred HHHHHHcCCCEEEE-eec-------CCccCCC--CCeeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 212 LRKLRAAGLNAVST-YVE-------WSFHEPS--PGQFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 212 L~k~Ka~GlNtV~t-Yv~-------Wn~HEp~--~G~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
|.-+|++|+|+|.. .|+ |.+.-.. .=.-.|.+..||.+|++.|+++||.|||-
T Consensus 177 l~yl~~LGv~~i~L~Pi~~~~~~~~~GY~~~~y~~i~~~~Gt~~d~k~lv~~~H~~Gi~VilD 239 (633)
T PRK12313 177 IPYVKEMGYTHVEFMPLMEHPLDGSWGYQLTGYFAPTSRYGTPEDFMYLVDALHQNGIGVILD 239 (633)
T ss_pred HHHHHHcCCCEEEeCchhcCCCCCCCCCCCcCcCcCCCCCCCHHHHHHHHHHHHHCCCEEEEE
Confidence 58899999999985 442 3211100 00113555689999999999999999987
No 111
>PRK10933 trehalose-6-phosphate hydrolase; Provisional
Probab=64.71 E-value=13 Score=45.50 Aligned_cols=56 Identities=18% Similarity=0.227 Sum_probs=40.6
Q ss_pred hHHHHHHHHHHcCCCEEEE-eecCCccCCCC-Cee----------eccCchhHHHHHHHHHHCCceEEecC
Q psy12907 207 YWQDRLRKLRAAGLNAVST-YVEWSFHEPSP-GQF----------QFTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~t-Yv~Wn~HEp~~-G~f----------dF~g~~DL~~fl~la~e~GL~VILRp 265 (949)
-+.++|.-+|++|+++|-+ .++-. |.. .-| +|....|+.++++.|+++||+|||-.
T Consensus 34 gi~~~ldyl~~lGv~~i~l~P~~~~---~~~~~gY~~~d~~~id~~~Gt~~d~~~lv~~~h~~gi~vilD~ 101 (551)
T PRK10933 34 GVTQRLDYLQKLGVDAIWLTPFYVS---PQVDNGYDVANYTAIDPTYGTLDDFDELVAQAKSRGIRIILDM 101 (551)
T ss_pred HHHHhhHHHHhCCCCEEEECCCCCC---CCCCCCCCcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 3568889999999999976 44321 111 122 23445799999999999999999874
No 112
>PLN02960 alpha-amylase
Probab=64.59 E-value=13 Score=47.63 Aligned_cols=57 Identities=25% Similarity=0.304 Sum_probs=39.6
Q ss_pred HHHHHHHHHcCCCEEEE-eec-------CCccCCC--CCeeeccCchhHHHHHHHHHHCCceEEecC
Q psy12907 209 QDRLRKLRAAGLNAVST-YVE-------WSFHEPS--PGQFQFTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~t-Yv~-------Wn~HEp~--~G~fdF~g~~DL~~fl~la~e~GL~VILRp 265 (949)
++.|.-+|++|+|+|+. .|+ |.+.-.. .=.-.|....||.+|++.|+++||.|||--
T Consensus 420 e~~LdYLk~LGvt~IeLmPv~e~~~~~swGY~~~~yfa~~~~yGtp~dfk~LVd~aH~~GI~VILDv 486 (897)
T PLN02960 420 QKVLPHVKKAGYNAIQLIGVQEHKDYSSVGYKVTNFFAVSSRFGTPDDFKRLVDEAHGLGLLVFLDI 486 (897)
T ss_pred HHHHHHHHHcCCCEEEECCcccCCCCCCCCCCcccCCCcccccCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 35689999999999986 443 4322110 001123345799999999999999999873
No 113
>PLN02511 hydrolase
Probab=64.49 E-value=3.6 Score=47.56 Aligned_cols=31 Identities=29% Similarity=0.656 Sum_probs=22.9
Q ss_pred CeEECCCCCCchhh-hhh-hhhhcccccccccc
Q psy12907 614 PLVFLPPVSGTADI-FFH-LMLDLSRKGYRTLS 644 (949)
Q Consensus 614 ~v~~l~~~~g~a~~-~~~-~~~~l~~kgyR~is 644 (949)
.|++|||..|.+.. |.. .+..|.++|||||.
T Consensus 102 ~vvllHG~~g~s~~~y~~~~~~~~~~~g~~vv~ 134 (388)
T PLN02511 102 VLILLPGLTGGSDDSYVRHMLLRARSKGWRVVV 134 (388)
T ss_pred EEEEECCCCCCCCCHHHHHHHHHHHHCCCEEEE
Confidence 48899999988764 554 34456688999994
No 114
>cd00019 AP2Ec AP endonuclease family 2; These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites; the alignment also contains hexulose-6-phosphate isomerases, enzymes that catalyze the epimerization of D-arabino-6-hexulose 3-phosphate to D-fructose 6-phosphate, via cleaving the phosphoesterbond with the sugar.
Probab=64.36 E-value=1.6e+02 Score=32.22 Aligned_cols=97 Identities=13% Similarity=0.201 Sum_probs=59.5
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHC-CceEEecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVRE-DLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~-GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
..|++.++.+|++|++.|++-+........+ .....+++++.++++++ ++.+.+- +||. +
T Consensus 10 ~~l~~~l~~a~~~G~d~vEl~~~~~~~~~~~----~~~~~~~~~l~~~~~~~~~~~i~~~-~~~~------------~-- 70 (279)
T cd00019 10 FGLENALKRAKEIGFDTVAMFLGNPRSWLSR----PLKKERAEKFKAIAEEGPSICLSVH-APYL------------I-- 70 (279)
T ss_pred ccHHHHHHHHHHcCCCEEEEEcCCCCccCCC----CCCHHHHHHHHHHHHHcCCCcEEEE-cCce------------e--
Confidence 7799999999999999999876532111111 11346899999999999 7666543 2221 0
Q ss_pred cCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccc
Q psy12907 285 IKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVE 330 (949)
Q Consensus 285 ~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIE 330 (949)
.+.+.++.-+++....+++.++..+.+- -+.|.+..-
T Consensus 71 -----~~~~~~~~~r~~~~~~~~~~i~~A~~lG----~~~v~~~~g 107 (279)
T cd00019 71 -----NLASPDKEKREKSIERLKDEIERCEELG----IRLLVFHPG 107 (279)
T ss_pred -----ccCCCCHHHHHHHHHHHHHHHHHHHHcC----CCEEEECCC
Confidence 1122344445555556666666666653 345555544
No 115
>TIGR01531 glyc_debranch glycogen debranching enzymye. glycogen debranching enzyme possesses two different catalytic activities; oligo-1,4--1,4-glucantransferase (EC 2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site directed mutagenesis studies in S. cerevisiae indicate that the transferase and glucosidase activities are independent and located in different regions of the polypeptide chain. Proteins in this model belong to the larger alpha-amylase family. The model covers eukaryotic proteins with a seed composed of human, nematode and yeast sequences. Yeast seed sequence is well characterized. The model is quite rigorous; either query sequence yields large bit score or it fails to hit the model altogether. There doesn't appear to be any middle ground.
Probab=62.24 E-value=28 Score=46.66 Aligned_cols=114 Identities=16% Similarity=0.218 Sum_probs=66.3
Q ss_pred CceEeCCeeeEEecc---ccCCCCCC--hhhHHHHHHHHHHcCCCEEEE-eec-CCc-cCC--CCCeee----c----cC
Q psy12907 182 DTFVKDGKPFRYVSG---SFHYFRSP--RYYWQDRLRKLRAAGLNAVST-YVE-WSF-HEP--SPGQFQ----F----TG 243 (949)
Q Consensus 182 ~~f~~dGkpf~i~sG---e~HY~R~p--~~~W~d~L~k~Ka~GlNtV~t-Yv~-Wn~-HEp--~~G~fd----F----~g 243 (949)
-.+.|+|+.+..+++ .-...++- -+.|+++|+.+|++|.|+|.. .++ =.- ..| -..++. | .+
T Consensus 103 P~L~i~~~~~lPl~~i~iqTvlsK~mG~~~~w~~~L~~ik~lGyN~IhftPI~~~G~SnS~Ysi~Dyl~idP~~~~~~~~ 182 (1464)
T TIGR01531 103 PMLYINADKFLPLDSIALQTVLAKLLGPLSEWEPRLRVAKEKGYNMIHFTPLQELGGSNSCYSLYDQLQLNQHFKSQKDG 182 (1464)
T ss_pred CeeEECCCcccCcCceeeeeehhhhcCCHHHHHHHHHHHHHcCCCEEEeCCCccCCCCCCCccccchhhcChhhcccCCc
Confidence 556666633333332 23444544 267999999999999999964 333 110 000 011111 3 35
Q ss_pred chhHHHHHHHHHHC-CceEEecCCcccccccCCCCc-cccccccCCCeeeecCCHhHHHHH
Q psy12907 244 DQDLEYFLELAVRE-DLLVLLRPGPYICAERDFGGL-PSWLMTIKPNITLRSKDEVYQHYV 302 (949)
Q Consensus 244 ~~DL~~fl~la~e~-GL~VILRpGPYIcAEw~~GGl-P~WL~~~~p~~~~Rt~dp~y~~~v 302 (949)
..|+.++++.|++. ||++|+-- -|+.=+- =.||.+ +|+.-....+.+|++++
T Consensus 183 ~~d~~~lV~~~h~~~Gm~~ilDv------V~NHTa~ds~Wl~e-HPEa~Yn~~~sP~L~~A 236 (1464)
T TIGR01531 183 KNDVQALVEKLHRDWNVLSITDI------VFNHTANNSPWLLE-HPEAAYNCITSPHLRPA 236 (1464)
T ss_pred HHHHHHHHHHHHHhcCCEEEEEe------eecccccCCHHHHh-ChHhhcCCCCCchhhhH
Confidence 67899999999996 99998762 1333222 247766 77754444444455443
No 116
>PRK05402 glycogen branching enzyme; Provisional
Probab=62.05 E-value=15 Score=46.44 Aligned_cols=50 Identities=20% Similarity=0.513 Sum_probs=36.2
Q ss_pred HHHHHHcCCCEEEE-eec-------CCccCCCCCe-----eeccCchhHHHHHHHHHHCCceEEec
Q psy12907 212 LRKLRAAGLNAVST-YVE-------WSFHEPSPGQ-----FQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 212 L~k~Ka~GlNtV~t-Yv~-------Wn~HEp~~G~-----fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
|.-+|++|+|+|.. .|+ |.. .+.. -.|....||.+|++.|+++||.|||-
T Consensus 272 ~~ylk~LGv~~i~L~Pi~e~~~~~~~GY---~~~~y~ai~~~~Gt~~dfk~lV~~~H~~Gi~VilD 334 (726)
T PRK05402 272 IPYVKEMGFTHVELLPIAEHPFDGSWGY---QPTGYYAPTSRFGTPDDFRYFVDACHQAGIGVILD 334 (726)
T ss_pred HHHHHHcCCCEEEECCcccCCCCCCCCC---CcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEE
Confidence 36679999999986 443 221 1111 12445689999999999999999987
No 117
>PRK09989 hypothetical protein; Provisional
Probab=61.96 E-value=1.3e+02 Score=32.48 Aligned_cols=42 Identities=19% Similarity=0.267 Sum_probs=34.0
Q ss_pred HHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEe
Q psy12907 208 WQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLL 263 (949)
Q Consensus 208 W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VIL 263 (949)
.+++|++++++|+..|++-.+|. .+.+++.++++++||.|..
T Consensus 17 l~~~l~~~~~~Gfd~VEl~~~~~--------------~~~~~~~~~l~~~Gl~v~~ 58 (258)
T PRK09989 17 FIERFAAARKAGFDAVEFLFPYD--------------YSTLQIQKQLEQNHLTLAL 58 (258)
T ss_pred HHHHHHHHHHcCCCEEEECCccc--------------CCHHHHHHHHHHcCCcEEE
Confidence 78999999999999999854332 2367788889999999863
No 118
>COG3934 Endo-beta-mannanase [Carbohydrate transport and metabolism]
Probab=61.53 E-value=7.3 Score=46.38 Aligned_cols=155 Identities=15% Similarity=0.136 Sum_probs=99.9
Q ss_pred eEeCCeeeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCcc-CC---CCCeeec-cCchhHHHHHHHHHHCC
Q psy12907 184 FVKDGKPFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFH-EP---SPGQFQF-TGDQDLEYFLELAVRED 258 (949)
Q Consensus 184 f~~dGkpf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~H-Ep---~~G~fdF-~g~~DL~~fl~la~e~G 258 (949)
|.++++.+..++..--+.++-.++-+++|+.++-+|+++++.. -+- |+ ++|.-+- ++..-++.|++.|.+++
T Consensus 4 F~Lg~n~wprIanikmw~~~~~~ei~~dle~a~~vg~k~lR~f---iLDgEdc~d~~G~~na~s~~~y~~~fla~a~~l~ 80 (587)
T COG3934 4 FALGLNRWPRIANIKMWPAIGNREIKADLEPAGFVGVKDLRLF---ILDGEDCRDKEGYRNAGSNVWYAAWFLAPAGYLD 80 (587)
T ss_pred EEeccccchhhhhhhHHHHhhhhhhhcccccccCccceeEEEE---EecCcchhhhhceecccccHHHHHHHhhhcccCc
Confidence 6778888888877777777777788899999999999999986 334 55 2332221 23356889999999999
Q ss_pred ceEEecCCcccccccCCCCc---cccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCC
Q psy12907 259 LLVLLRPGPYICAERDFGGL---PSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGS 335 (949)
Q Consensus 259 L~VILRpGPYIcAEw~~GGl---P~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~ 335 (949)
|+|+++ .|..==.+||- -.|.-...|+=.. -||.++..-++|...|++-. .....|.+|-+-||-=+
T Consensus 81 lkvlit---livg~~hmgg~Nw~Ipwag~~~pdn~i--yD~k~~~~~kkyvedlVk~y-----k~~ptI~gw~l~Ne~lv 150 (587)
T COG3934 81 LKVLIT---LIVGLKHMGGTNWRIPWAGEQSPDNVI--YDPKFRGPGKKYVEDLVKPY-----KLDPTIAGWALRNEPLV 150 (587)
T ss_pred ceEEEE---EeecccccCcceeEeecCCCCCccccc--cchhhcccHHHHHHHHhhhh-----ccChHHHHHHhcCCccc
Confidence 999876 23322234553 2244222333111 25555555566666666522 23346777888888221
Q ss_pred --ccchHHHHHHHHHHHH
Q psy12907 336 --YTCDKEHMIWLRDQMK 351 (949)
Q Consensus 336 --y~~d~~Y~~~L~~~~~ 351 (949)
-.++..++.|++.++-
T Consensus 151 ~~p~s~N~f~~w~~emy~ 168 (587)
T COG3934 151 EAPISVNNFWDWSGEMYA 168 (587)
T ss_pred cccCChhHHHHHHHHHHH
Confidence 1457899999999863
No 119
>PF02679 ComA: (2R)-phospho-3-sulfolactate synthase (ComA); InterPro: IPR003830 Methanogenic archaea produce methane via the anaerobic reduction of acetate or single carbon compounds []. Coenzyme M (CoM; 2-mercaptoethanesulphonic acid) serves as the terminal methyl carrier for this process. Previously thought to be unique to methanogenic archaea, CoM has also been found in methylotrophic bacteria. Biosynthesis of CoM begins with the Michael addition of sulphite to phosphoenolpyruvate, forming 2-phospho-3-sulpholactate (PSL). This reaction is catalyzed by members of this family, PSL synthase (ComA) []. Subsequently, PSL is dephosphorylated by phosphosulpholactate phosphatase (ComB) to form 3-sulpholactate [], which is then converted to 3-sulphopyruvate by L-sulpholactate dehydrogenase (ComC; 1.1.1.272 from EC) []. Sulphopyruvate decarboxylase (ComDE; 4.1.1.79 from EC) converts 3-sulphopyruvate to sulphoacetaldehyde []. Reductive thiolation of sulphoacetaldehyde is the final step.; GO: 0019295 coenzyme M biosynthetic process; PDB: 1U83_A 1QWG_A.
Probab=61.46 E-value=22 Score=39.18 Aligned_cols=52 Identities=19% Similarity=0.341 Sum_probs=38.8
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCC
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPG 266 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpG 266 (949)
+...++-|+.+|++||++|++ ..|..+.+ ..+..++|+.|+++|+.|+--.|
T Consensus 83 q~~~~~yl~~~k~lGf~~IEi---------SdGti~l~-~~~r~~~I~~~~~~Gf~v~~EvG 134 (244)
T PF02679_consen 83 QGKFDEYLEECKELGFDAIEI---------SDGTIDLP-EEERLRLIRKAKEEGFKVLSEVG 134 (244)
T ss_dssp TT-HHHHHHHHHHCT-SEEEE-----------SSS----HHHHHHHHHHHCCTTSEEEEEES
T ss_pred cChHHHHHHHHHHcCCCEEEe---------cCCceeCC-HHHHHHHHHHHHHCCCEEeeccc
Confidence 566789999999999999987 35666664 35788999999999999987766
No 120
>PRK09505 malS alpha-amylase; Reviewed
Probab=61.46 E-value=15 Score=45.98 Aligned_cols=58 Identities=17% Similarity=0.182 Sum_probs=41.3
Q ss_pred HHHHHHHHHHcCCCEEEE-eecCCccC-----------------CCCC-----eeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 208 WQDRLRKLRAAGLNAVST-YVEWSFHE-----------------PSPG-----QFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 208 W~d~L~k~Ka~GlNtV~t-Yv~Wn~HE-----------------p~~G-----~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
+.+.|.-+|+||+|+|-+ .++=+.|. --+- .-.|....|+..+++.|+++||+|||-
T Consensus 232 i~~kLdyl~~LGv~aIwlsPi~~~~~~~~~~g~~g~~~~~~yhgY~~~D~~~id~~~Gt~~dfk~Lv~~aH~~Gi~VilD 311 (683)
T PRK09505 232 LTEKLDYLQQLGVNALWISSPLEQIHGWVGGGTKGDFPHYAYHGYYTLDWTKLDANMGTEADLRTLVDEAHQRGIRILFD 311 (683)
T ss_pred HHHhhHHHHHcCCCEEEeCccccccccccccccccCCCcCCCCCCCccccccCCCCCCCHHHHHHHHHHHHHCCCEEEEE
Confidence 567888999999999975 44433332 1111 123445689999999999999999976
Q ss_pred C
Q psy12907 265 P 265 (949)
Q Consensus 265 p 265 (949)
.
T Consensus 312 ~ 312 (683)
T PRK09505 312 V 312 (683)
T ss_pred E
Confidence 3
No 121
>PF04909 Amidohydro_2: Amidohydrolase; InterPro: IPR006992 These proteins are related to the metal-dependent hydrolase superfamily []. The family includes 2-amino-3-carboxymuconate-6-semialdehyde decarboxylase which converts alpha-amino-beta-carboxymuconate-epsilon- semialdehyde (ACMS) to alpha-aminomuconate semialdehyde (AMS). ACMS can be converted non-enzymatically to quinolate, a potent endogenous excitoxin of neuronal cells which is implicated in the pathogenesis of various neurodegenerative disorders. In the presence of AMCSD, ACMS is converted to AMS, a benign catabolite. 2-amino-3-(3-oxoprop-2-enyl)-but-2-enedioate = 2-aminomuconate semialdehyde + CO2. ; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 2QPX_A 4D8L_A 3K4W_I 3IRS_B 4DZI_B 3S4T_G 2GWG_B 3IJ6_A 2DVX_C 2DVT_C ....
Probab=60.98 E-value=27 Score=36.96 Aligned_cols=146 Identities=16% Similarity=0.139 Sum_probs=72.1
Q ss_pred hhhHHHHHHHHH-HcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCC-cccccccCCCCccc--
Q psy12907 205 RYYWQDRLRKLR-AAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPG-PYICAERDFGGLPS-- 280 (949)
Q Consensus 205 ~~~W~d~L~k~K-a~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpG-PYIcAEw~~GGlP~-- 280 (949)
++.+.+.+++.. +.|+-.|..+-.. +.++........+++++|+|+|+-|++-+| +.-..+-.....|.
T Consensus 83 ~~~~~~~l~~~~~~~g~~Gv~l~~~~-------~~~~~~~~~~~~~~~~~~~~~~~pv~~H~g~~~~~~~~~~~~~~~~~ 155 (273)
T PF04909_consen 83 PEDAVEELERALQELGFRGVKLHPDL-------GGFDPDDPRLDDPIFEAAEELGLPVLIHTGMTGFPDAPSDPADPEEL 155 (273)
T ss_dssp HHHHHHHHHHHHHTTTESEEEEESSE-------TTCCTTSGHCHHHHHHHHHHHT-EEEEEESHTHHHHHHHHHHHHHHH
T ss_pred chhHHHHHHHhccccceeeeEecCCC-------CccccccHHHHHHHHHHHHhhccceeeeccccchhhhhHHHHHHHHH
Confidence 456666666666 8999999876432 233333322235999999999999999976 10000000000010
Q ss_pred -cccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccC-CccchHHH-HHHHHHHHHHhcCCe
Q psy12907 281 -WLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMG-SYTCDKEH-MIWLRDQMKYYVRDA 357 (949)
Q Consensus 281 -WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG-~y~~d~~Y-~~~L~~~~~~~lG~~ 357 (949)
-++.++|++++--.+-.+- ..|+..++..++.+. +| .+.+---++ .+.....+ ...++.+++. .|.+
T Consensus 156 ~~~~~~~P~l~ii~~H~G~~---~~~~~~~~~l~~~~~-----nv-y~d~s~~~~~~~~~~~~~~~~~l~~~~~~-~g~d 225 (273)
T PF04909_consen 156 EELLERFPDLRIILAHLGGP---FPWWEEALRLLDRFP-----NV-YVDLSGIPPFWYFWPPSFDRPFLRRAVDE-FGPD 225 (273)
T ss_dssp TTHHHHSTTSEEEESGGGTT---HHHHHHHHHHHHHHT-----TE-EEECHSHHSSEEEETTHHCHHHHHHHHHH-HTGG
T ss_pred HHHHHHhcCCeEEEecCccc---chhHHHHHHHHHhCC-----cc-cccccccccccccCcccccHHHHHHHHHH-hCCc
Confidence 1233478877665443333 445555555555431 01 111100000 01111122 3444454544 4899
Q ss_pred eEEEeeCCCC
Q psy12907 358 AVLYTTDGGG 367 (949)
Q Consensus 358 vplft~Dg~~ 367 (949)
-.+|.||.+.
T Consensus 226 rilfGSD~P~ 235 (273)
T PF04909_consen 226 RILFGSDYPH 235 (273)
T ss_dssp GEEEE--TTS
T ss_pred eEEecCCCCC
Confidence 9999999764
No 122
>PRK12568 glycogen branching enzyme; Provisional
Probab=60.96 E-value=16 Score=46.10 Aligned_cols=55 Identities=13% Similarity=0.342 Sum_probs=39.4
Q ss_pred HHHHHHHHcCCCEEEE-eec-------CCcc-----CCCCCeeeccCchhHHHHHHHHHHCCceEEecCCc
Q psy12907 210 DRLRKLRAAGLNAVST-YVE-------WSFH-----EPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGP 267 (949)
Q Consensus 210 d~L~k~Ka~GlNtV~t-Yv~-------Wn~H-----Ep~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGP 267 (949)
+.|.-+|++|+|+|+. .|+ |.+. .|.+ .|....|+..|++.|+++||.|||-.=|
T Consensus 274 ~ll~ylk~LGvt~I~LmPi~e~~~~~~wGY~~~~~~a~~~---~~G~~~dfk~lV~~~H~~Gi~VIlD~V~ 341 (730)
T PRK12568 274 QLIPYVQQLGFTHIELLPITEHPFGGSWGYQPLGLYAPTA---RHGSPDGFAQFVDACHRAGIGVILDWVS 341 (730)
T ss_pred HHHHHHHHcCCCEEEECccccCCCCCCCCCCCCcCCccCc---ccCCHHHHHHHHHHHHHCCCEEEEEecc
Confidence 3468889999999986 342 4321 1111 3455679999999999999999987433
No 123
>PRK09997 hydroxypyruvate isomerase; Provisional
Probab=60.87 E-value=1.6e+02 Score=31.90 Aligned_cols=50 Identities=22% Similarity=0.337 Sum_probs=38.3
Q ss_pred ccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEe
Q psy12907 197 SFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLL 263 (949)
Q Consensus 197 e~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VIL 263 (949)
++.|-+.| ++++|++++++|+..|++.. | + ..+++++.++++++||.+..
T Consensus 9 ~~~~~~~~---l~~~l~~~a~~Gf~~VEl~~------~------~--~~~~~~~~~~l~~~gl~~~~ 58 (258)
T PRK09997 9 SMLFGEYD---FLARFEKAAQCGFRGVEFMF------P------Y--DYDIEELKQVLASNKLEHTL 58 (258)
T ss_pred ehhccCCC---HHHHHHHHHHhCCCEEEEcC------C------C--CCCHHHHHHHHHHcCCcEEE
Confidence 34444454 88999999999999999831 1 1 14699999999999999853
No 124
>COG2267 PldB Lysophospholipase [Lipid metabolism]
Probab=60.86 E-value=4 Score=45.77 Aligned_cols=64 Identities=19% Similarity=0.337 Sum_probs=57.9
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc------------------------------------------------
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS------------------------------------------------ 644 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is------------------------------------------------ 644 (949)
+-|+++||..+.+.-|+..+-+|++.||-|+.
T Consensus 35 g~Vvl~HG~~Eh~~ry~~la~~l~~~G~~V~~~D~RGhG~S~r~~rg~~~~f~~~~~dl~~~~~~~~~~~~~~p~~l~gH 114 (298)
T COG2267 35 GVVVLVHGLGEHSGRYEELADDLAARGFDVYALDLRGHGRSPRGQRGHVDSFADYVDDLDAFVETIAEPDPGLPVFLLGH 114 (298)
T ss_pred cEEEEecCchHHHHHHHHHHHHHHhCCCEEEEecCCCCCCCCCCCcCCchhHHHHHHHHHHHHHHHhccCCCCCeEEEEe
Confidence 34889999999999999999999999999994
Q ss_pred cCCcchhhcccccccccCceeecccccccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFYDTSV 678 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~dt~~ 678 (949)
|.||.+|..|++..+ +-|..++||+++..-..
T Consensus 115 SmGg~Ia~~~~~~~~--~~i~~~vLssP~~~l~~ 146 (298)
T COG2267 115 SMGGLIALLYLARYP--PRIDGLVLSSPALGLGG 146 (298)
T ss_pred CcHHHHHHHHHHhCC--ccccEEEEECccccCCh
Confidence 799999999999988 78999999999988664
No 125
>TIGR02456 treS_nterm trehalose synthase. Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PubMed:15378530).
Probab=59.71 E-value=13 Score=45.02 Aligned_cols=57 Identities=18% Similarity=0.197 Sum_probs=40.9
Q ss_pred hhHHHHHHHHHHcCCCEEEE-eecCCccCCC-CCee----------eccCchhHHHHHHHHHHCCceEEecC
Q psy12907 206 YYWQDRLRKLRAAGLNAVST-YVEWSFHEPS-PGQF----------QFTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~t-Yv~Wn~HEp~-~G~f----------dF~g~~DL~~fl~la~e~GL~VILRp 265 (949)
.-+.++|.-+|++|+|+|-+ .|+ .. +. ...| .|....|+.++++.|++.||+|||-.
T Consensus 28 ~gi~~~Ldyl~~LGv~~i~L~Pi~-~~--~~~~~gY~~~dy~~vd~~~Gt~~df~~Lv~~ah~~Gi~vilD~ 96 (539)
T TIGR02456 28 PGLTSKLDYLKWLGVDALWLLPFF-QS--PLRDDGYDVSDYRAILPEFGTIDDFKDFVDEAHARGMRVIIDL 96 (539)
T ss_pred HHHHHhHHHHHHCCCCEEEECCCc-CC--CCCCCCCCcccccccChhhCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 45778899999999999976 333 10 10 1112 24456899999999999999999863
No 126
>PRK10785 maltodextrin glucosidase; Provisional
Probab=58.82 E-value=19 Score=44.48 Aligned_cols=57 Identities=18% Similarity=0.267 Sum_probs=39.8
Q ss_pred HHHHHHHHHHcCCCEEEE-eecCC--ccCCCCCee-----eccCchhHHHHHHHHHHCCceEEec
Q psy12907 208 WQDRLRKLRAAGLNAVST-YVEWS--FHEPSPGQF-----QFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 208 W~d~L~k~Ka~GlNtV~t-Yv~Wn--~HEp~~G~f-----dF~g~~DL~~fl~la~e~GL~VILR 264 (949)
-.++|.-+|++|+|+|-+ +|+=+ .|--...-| .|.+..|+.++++.|++.||+|||-
T Consensus 181 I~~kLdYL~~LGv~~I~L~Pif~s~s~hgYd~~Dy~~iDp~~Gt~~df~~Lv~~aH~rGikVilD 245 (598)
T PRK10785 181 ISEKLPYLKKLGVTALYLNPIFTAPSVHKYDTEDYRHVDPQLGGDAALLRLRHATQQRGMRLVLD 245 (598)
T ss_pred HHHHHHHHHHcCCCEEEeCCcccCCCCCCcCcccccccCcccCCHHHHHHHHHHHHHCCCEEEEE
Confidence 357788889999999976 55522 121111111 2445689999999999999999975
No 127
>PF02065 Melibiase: Melibiase; InterPro: IPR000111 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycosyl hydrolase family 27, family 31 and family 36 alpha-galactosidases form the glycosyl hydrolase clan GH-D (acc_GH from CAZY), a superfamily of alpha-galactosidases, alpha-N-acetylgalactosaminidases, and isomaltodextranases which are likely to share a common catalytic mechanism and structural topology. Alpha-galactosidase (3.2.1.22 from EC) (melibiase) [] catalyzes the hydrolysis of melibiose into galactose and glucose. In man, the deficiency of this enzyme is the cause of Fabry's disease (X-linked sphingolipidosis). Alpha-galactosidase is present in a variety of organisms. There is a considerable degree of similarity in the sequence of alpha-galactosidase from various eukaryotic species. Escherichia coli alpha-galactosidase (gene melA), which requires NAD and magnesium as cofactors, is not structurally related to the eukaryotic enzymes; by contrast, an Escherichia coli plasmid encoded alpha-galactosidase (gene rafA P16551 from SWISSPROT) [] contains a region of about 50 amino acids which is similar to a domain of the eukaryotic alpha-galactosidases. Alpha-N-acetylgalactosaminidase (3.2.1.49 from EC) [] catalyzes the hydrolysis of terminal non-reducing N-acetyl-D-galactosamine residues in N-acetyl-alpha-D- galactosaminides. In man, the deficiency of this enzyme is the cause of Schindler and Kanzaki diseases. The sequence of this enzyme is highly related to that of the eukaryotic alpha-galactosidases.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1KTC_A 1KTB_A 1UAS_A 3H55_A 3H53_A 3IGU_B 3H54_A 3LRM_A 3LRL_A 3LRK_A ....
Probab=58.71 E-value=83 Score=37.08 Aligned_cols=120 Identities=13% Similarity=0.164 Sum_probs=70.1
Q ss_pred eeeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCC----CeeeccC---chhHHHHHHHHHHCCceE
Q psy12907 189 KPFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSP----GQFQFTG---DQDLEYFLELAVREDLLV 261 (949)
Q Consensus 189 kpf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~----G~fdF~g---~~DL~~fl~la~e~GL~V 261 (949)
.|..+-+=+-.|+.+..+.-.+.+++++++|++.+.+=--|....... |.+.-+- -.-|..+.+.+++.||+.
T Consensus 41 ~pv~~nsW~~~~~d~~e~~i~~~a~~~~~~G~e~fviDDGW~~~r~~d~~~~GdW~~~~~kFP~Gl~~l~~~i~~~Gmk~ 120 (394)
T PF02065_consen 41 PPVGWNSWEAYYFDITEEKILELADAAAELGYEYFVIDDGWFGGRDDDNAGLGDWEPDPKKFPNGLKPLADYIHSLGMKF 120 (394)
T ss_dssp --EEEESHHHHTTG--HHHHHHHHHHHHHHT-SEEEE-SSSBCTESTTTSTTSBECBBTTTSTTHHHHHHHHHHHTT-EE
T ss_pred CceEEEcccccCcCCCHHHHHHHHHHHHHhCCEEEEEcCccccccCCCcccCCceeEChhhhCCcHHHHHHHHHHCCCeE
Confidence 344443444567888889999999999999999877666686542211 2222110 023999999999999999
Q ss_pred EecCCcccccccC--CCCccccccccCCCee---ee------cCCHhHHHHHHHHHHHH
Q psy12907 262 LLRPGPYICAERD--FGGLPSWLMTIKPNIT---LR------SKDEVYQHYVNKWFAQL 309 (949)
Q Consensus 262 ILRpGPYIcAEw~--~GGlP~WL~~~~p~~~---~R------t~dp~y~~~v~r~~~~l 309 (949)
=|+..|-++++=. .-..|.|++. .++.. -| .++|...+++...+.++
T Consensus 121 GlW~ePe~v~~~S~l~~~hPdw~l~-~~~~~~~~~r~~~vLD~~~pev~~~l~~~i~~l 178 (394)
T PF02065_consen 121 GLWFEPEMVSPDSDLYREHPDWVLR-DPGRPPTLGRNQYVLDLSNPEVRDYLFEVIDRL 178 (394)
T ss_dssp EEEEETTEEESSSCHCCSSBGGBTC-CTTSE-ECBTTBEEB-TTSHHHHHHHHHHHHHH
T ss_pred EEEeccccccchhHHHHhCccceee-cCCCCCcCcccceEEcCCCHHHHHHHHHHHHHH
Confidence 8888886653211 2247999987 44321 12 34665555544433333
No 128
>TIGR03101 hydr2_PEP hydrolase, ortholog 2, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily.
Probab=57.70 E-value=4 Score=45.15 Aligned_cols=29 Identities=24% Similarity=0.273 Sum_probs=20.8
Q ss_pred cCCcchhhcccccccccCceeeccccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFYD 675 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~d 675 (949)
|+||.++..++...+ +.|.+++|+++..+
T Consensus 107 SmGG~vAl~~A~~~p--~~v~~lVL~~P~~~ 135 (266)
T TIGR03101 107 RLGALLALDAANPLA--AKCNRLVLWQPVVS 135 (266)
T ss_pred CHHHHHHHHHHHhCc--cccceEEEeccccc
Confidence 678888887776654 46777888877744
No 129
>PF01229 Glyco_hydro_39: Glycosyl hydrolases family 39; InterPro: IPR000514 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 39 GH39 from CAZY comprises enzymes with several known activities; alpha-L-iduronidase (3.2.1.76 from EC); beta-xylosidase (3.2.1.37 from EC). The most highly conserved regions in these enzymes are located in their N-terminal sections. These contain a glutamic acid residue which, on the basis of similarities with other families of glycosyl hydrolases [], probably acts as the proton donor in their catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BS9_D 2BFG_E 1W91_B 1UHV_D 1PX8_A.
Probab=55.57 E-value=18 Score=43.35 Aligned_cols=125 Identities=18% Similarity=0.143 Sum_probs=64.7
Q ss_pred eccccCCCCCChhhHHHHHHHHH-HcCCCEEEEe-ec---CCcc-C-CCCC--eeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 194 VSGSFHYFRSPRYYWQDRLRKLR-AAGLNAVSTY-VE---WSFH-E-PSPG--QFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 194 ~sGe~HY~R~p~~~W~d~L~k~K-a~GlNtV~tY-v~---Wn~H-E-p~~G--~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
+-|+-|..-..++.|+..|+.++ ++||..|++. ++ .... | ..+| .|||+. |+.+++...++||+..+.
T Consensus 27 ~~~~g~a~~~l~~~~q~~l~~~~~~~gf~yvR~h~l~~ddm~~~~~~~~~~~~~Ynf~~---lD~i~D~l~~~g~~P~ve 103 (486)
T PF01229_consen 27 CVGSGRANLLLRADWQEQLRELQEELGFRYVRFHGLFSDDMMVYSESDEDGIPPYNFTY---LDQILDFLLENGLKPFVE 103 (486)
T ss_dssp EEEES-GGGGGBHHHHHHHHHHHCCS--SEEEES-TTSTTTT-EEEEETTEEEEE--HH---HHHHHHHHHHCT-EEEEE
T ss_pred hcCCCchHHHhhHHHHHHHHHHHhccCceEEEEEeeccCchhhccccccCCCCcCChHH---HHHHHHHHHHcCCEEEEE
Confidence 34555565667889999999997 6899998763 11 1111 1 1233 399975 999999999999998766
Q ss_pred CCcccccccCCCCccccccccCCCeeee----cCCHhHHHHHHHHHHHHHHHHHhccccCCCceE--EeccccccC
Q psy12907 265 PGPYICAERDFGGLPSWLMTIKPNITLR----SKDEVYQHYVNKWFAQLFPRITRFLYGNGGPII--LVQVENEMG 334 (949)
Q Consensus 265 pGPYIcAEw~~GGlP~WL~~~~p~~~~R----t~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII--~vQIENEyG 334 (949)
-|- .|.++.. .+..... .+-|.-.+++..+++++++++..++-.+ -|- -+.|=||..
T Consensus 104 l~f----------~p~~~~~-~~~~~~~~~~~~~pp~~~~~W~~lv~~~~~h~~~RYG~~--ev~~W~fEiWNEPd 166 (486)
T PF01229_consen 104 LGF----------MPMALAS-GYQTVFWYKGNISPPKDYEKWRDLVRAFARHYIDRYGIE--EVSTWYFEIWNEPD 166 (486)
T ss_dssp E-S----------B-GGGBS-S--EETTTTEE-S-BS-HHHHHHHHHHHHHHHHHHHHHH--HHTTSEEEESS-TT
T ss_pred EEe----------chhhhcC-CCCccccccCCcCCcccHHHHHHHHHHHHHHHHhhcCCc--cccceeEEeCcCCC
Confidence 442 3344433 1111111 1223345666666666666665542100 011 356778864
No 130
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=55.51 E-value=80 Score=34.84 Aligned_cols=51 Identities=22% Similarity=0.312 Sum_probs=42.9
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCC
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPG 266 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpG 266 (949)
...++-++.+|++||++|++ ..|..+++ ..+..++|+.++++||.|+--.|
T Consensus 71 ~~~~~Yl~~~k~lGf~~IEi---------S~G~~~i~-~~~~~rlI~~~~~~g~~v~~EvG 121 (237)
T TIGR03849 71 GKFDEYLNECDELGFEAVEI---------SDGSMEIS-LEERCNLIERAKDNGFMVLSEVG 121 (237)
T ss_pred hhHHHHHHHHHHcCCCEEEE---------cCCccCCC-HHHHHHHHHHHHhCCCeEecccc
Confidence 56788889999999999986 46777775 45788999999999999986655
No 131
>PRK14706 glycogen branching enzyme; Provisional
Probab=55.43 E-value=21 Score=44.45 Aligned_cols=53 Identities=15% Similarity=0.215 Sum_probs=35.6
Q ss_pred HHHHHHcCCCEEEE-eec-------CCccCC--CCCeeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 212 LRKLRAAGLNAVST-YVE-------WSFHEP--SPGQFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 212 L~k~Ka~GlNtV~t-Yv~-------Wn~HEp--~~G~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
+.-+|+||+|+|+. .|. |...-- ..=.-.|....|+.+|++.|+++||.|||-
T Consensus 174 ~~ylk~lG~t~velmPv~e~~~~~~wGY~~~~~~~~~~~~g~~~~~~~lv~~~H~~gi~VilD 236 (639)
T PRK14706 174 GEYVTYMGYTHVELLGVMEHPFDGSWGYQVTGYYAPTSRLGTPEDFKYLVNHLHGLGIGVILD 236 (639)
T ss_pred HHHHHHcCCCEEEccchhcCCCCCCCCcCcccccccccccCCHHHHHHHHHHHHHCCCEEEEE
Confidence 36789999999985 331 332110 000112334579999999999999999976
No 132
>PRK05855 short chain dehydrogenase; Validated
Probab=54.80 E-value=6.5 Score=46.64 Aligned_cols=31 Identities=10% Similarity=0.215 Sum_probs=26.5
Q ss_pred CCeEECCCCCCchhhhhhhhhhcccccccccc
Q psy12907 613 SPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS 644 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is 644 (949)
.+|+|+||..+++..|..++-.| ++|||||.
T Consensus 26 ~~ivllHG~~~~~~~w~~~~~~L-~~~~~Vi~ 56 (582)
T PRK05855 26 PTVVLVHGYPDNHEVWDGVAPLL-ADRFRVVA 56 (582)
T ss_pred CeEEEEcCCCchHHHHHHHHHHh-hcceEEEE
Confidence 47999999999999888877777 78999995
No 133
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=54.56 E-value=37 Score=38.04 Aligned_cols=69 Identities=19% Similarity=0.152 Sum_probs=49.8
Q ss_pred CChhhHHHHHHHHHHcCCCEEEEeecCCccCCC-CCeeeccCc--hhHHHHHHHHHHCCceEEecCCccccc
Q psy12907 203 SPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPS-PGQFQFTGD--QDLEYFLELAVREDLLVLLRPGPYICA 271 (949)
Q Consensus 203 ~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~-~G~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYIcA 271 (949)
...+..++.++++|+.|+.+=...+-...+... -+.|.|+-. -|..++++..++.|++|++..=|+|+.
T Consensus 21 ~~~~~v~~~~~~~~~~~iP~d~~~lD~~w~~~~~~~~f~~d~~~FPd~~~~i~~l~~~G~~~~~~~~P~i~~ 92 (308)
T cd06593 21 YDEEEVNEFADGMRERNLPCDVIHLDCFWMKEFQWCDFEFDPDRFPDPEGMLSRLKEKGFKVCLWINPYIAQ 92 (308)
T ss_pred CCHHHHHHHHHHHHHcCCCeeEEEEecccccCCcceeeEECcccCCCHHHHHHHHHHCCCeEEEEecCCCCC
Confidence 446788999999999997765444443333322 135555422 389999999999999999988888864
No 134
>PRK14705 glycogen branching enzyme; Provisional
Probab=54.45 E-value=23 Score=47.22 Aligned_cols=54 Identities=15% Similarity=0.277 Sum_probs=37.9
Q ss_pred HHHHHHHcCCCEEEE-eec-------CCccC--CCCCeeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 211 RLRKLRAAGLNAVST-YVE-------WSFHE--PSPGQFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 211 ~L~k~Ka~GlNtV~t-Yv~-------Wn~HE--p~~G~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
.|.-+|+||+|+|+. .|+ |.+.- ...=.-.|....|+..|++.|+++||.|||-
T Consensus 771 lldYlk~LGvt~IeLmPv~e~p~~~swGY~~~~y~ap~~ryGt~~dfk~lVd~~H~~GI~VILD 834 (1224)
T PRK14705 771 LVDYVKWLGFTHVEFMPVAEHPFGGSWGYQVTSYFAPTSRFGHPDEFRFLVDSLHQAGIGVLLD 834 (1224)
T ss_pred HHHHHHHhCCCEEEECccccCCCCCCCCCCccccCCcCcccCCHHHHHHHHHHHHHCCCEEEEE
Confidence 368899999999986 442 43210 0001113445689999999999999999986
No 135
>smart00518 AP2Ec AP endonuclease family 2. These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites
Probab=54.28 E-value=2.6e+02 Score=30.30 Aligned_cols=92 Identities=15% Similarity=0.224 Sum_probs=59.1
Q ss_pred HHHHHHHHHHcCCCEEEEeecCCccCCCCCee-eccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccccC
Q psy12907 208 WQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQF-QFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIK 286 (949)
Q Consensus 208 W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~f-dF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~ 286 (949)
-++.++++.++|++.|+... .+|..-.- +++ ..+++++-+++++.||.+.+- +||.
T Consensus 12 ~~~~~~~~~~~G~~~vel~~----~~~~~~~~~~~~-~~~~~~l~~~~~~~gl~ls~h-~p~~----------------- 68 (273)
T smart00518 12 LYKAFIEAVDIGARSFQLFL----GNPRSWKGVRLS-EETAEKFKEALKENNIDVSVH-APYL----------------- 68 (273)
T ss_pred HhHHHHHHHHcCCCEEEEEC----CCCCCCCCCCCC-HHHHHHHHHHHHHcCCCEEEE-CCce-----------------
Confidence 45789999999999999843 33322110 222 246888999999999987542 2321
Q ss_pred CCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEec
Q psy12907 287 PNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQ 328 (949)
Q Consensus 287 p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQ 328 (949)
+.+.+.++..+++..+++++.+...+.+ |.++|.+.
T Consensus 69 --~nl~s~d~~~r~~~~~~l~~~i~~A~~l----Ga~~vv~h 104 (273)
T smart00518 69 --INLASPDKEKVEKSIERLIDEIKRCEEL----GIKALVFH 104 (273)
T ss_pred --ecCCCCCHHHHHHHHHHHHHHHHHHHHc----CCCEEEEc
Confidence 1134557777777777777777766654 34555544
No 136
>PF06832 BiPBP_C: Penicillin-Binding Protein C-terminus Family; InterPro: IPR009647 This conserved region of approximately 90 residues is found in a sub-group of bacterial Penicillin-Binding Proteins (PBPs). A variable length loop region separates this region from the transpeptidase unit (IPR001460 from INTERPRO). It is predicted to be a beta fold.
Probab=54.26 E-value=23 Score=32.40 Aligned_cols=68 Identities=22% Similarity=0.110 Sum_probs=43.3
Q ss_pred CceeEEEEEecCCCCCCCCcccccCCcceEEEEEECCEEEEEEEecceeeeeccCCCCCeEEEEEEeCCCC
Q psy12907 519 SFGFVLYETIIPDARFPDPALLTISGLRDRGQVFVDEKLVTILYRNKMLSTPIMARPGQKLSILVENMGRI 589 (949)
Q Consensus 519 ~~GyvlY~t~i~~~~~~~~~~L~i~~vrDrA~Vfvdg~~vG~l~r~~~~~i~~~~~~~~~LdILVENmGRv 589 (949)
.-|-+++...-. + ...++.|++.+-+++.+-||||+++|+........+........+|.+ |...||.
T Consensus 17 ~~g~~~~~~~~~-~-~~~~l~l~a~~~~~~~~W~vdg~~~g~~~~~~~~~~~~~~~G~h~l~v-vD~~G~~ 84 (89)
T PF06832_consen 17 PDGAVLALDPGI-P-ERQPLVLKAAGGRGPVYWFVDGEPLGTTQPGHQLFWQPDRPGEHTLTV-VDAQGRS 84 (89)
T ss_pred CCCCEEEeCCCC-C-ccceEEEEEeCCCCcEEEEECCEEcccCCCCCeEEeCCCCCeeEEEEE-EcCCCCE
Confidence 346666654321 1 234566666666779999999999988777655555542222256765 7888875
No 137
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of
Probab=53.28 E-value=1.2e+02 Score=34.80 Aligned_cols=67 Identities=15% Similarity=0.166 Sum_probs=51.6
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCc--hhH--HHHHHHHHHCCceEEecCCcccccc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGD--QDL--EYFLELAVREDLLVLLRPGPYICAE 272 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~--~DL--~~fl~la~e~GL~VILRpGPYIcAE 272 (949)
.+..++.++++++.|+.+=.+.+=+..+.. -+.|.|+.. -|. .++++..++.|++|++..=|+|+.+
T Consensus 23 ~~~v~~~~~~~r~~~iP~d~i~lD~~~~~~-~~~f~~d~~~FPdp~~~~mi~~L~~~G~k~~~~i~P~v~~~ 93 (339)
T cd06602 23 VDEVKEVVENMRAAGIPLDVQWNDIDYMDR-RRDFTLDPVRFPGLKMPEFVDELHANGQHYVPILDPAISAN 93 (339)
T ss_pred HHHHHHHHHHHHHhCCCcceEEECcccccC-ccceecccccCCCccHHHHHHHHHHCCCEEEEEEeCccccC
Confidence 677899999999999987555544444432 466766653 377 9999999999999999988999864
No 138
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=52.83 E-value=26 Score=43.53 Aligned_cols=56 Identities=18% Similarity=0.335 Sum_probs=42.0
Q ss_pred hhhHHHHHHHHHHcCCCEEEE-eec-------CCccCCCCCee------eccCchhHHHHHHHHHHCCceEEec
Q psy12907 205 RYYWQDRLRKLRAAGLNAVST-YVE-------WSFHEPSPGQF------QFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~t-Yv~-------Wn~HEp~~G~f------dF~g~~DL~~fl~la~e~GL~VILR 264 (949)
.+.=.+.|.-+|+||+++||. .|. |.+- |.. .|..-.|+.+||+.|+++||-|||-
T Consensus 164 ~e~a~~llpYl~elG~T~IELMPv~e~p~~~sWGYq----~~g~yAp~sryGtPedfk~fVD~aH~~GIgViLD 233 (628)
T COG0296 164 FELAIELLPYLKELGITHIELMPVAEHPGDRSWGYQ----GTGYYAPTSRYGTPEDFKALVDAAHQAGIGVILD 233 (628)
T ss_pred HHHHHHHhHHHHHhCCCEEEEcccccCCCCCCCCCC----cceeccccccCCCHHHHHHHHHHHHHcCCEEEEE
Confidence 455677889999999999996 332 5432 122 2333479999999999999999987
No 139
>cd06565 GH20_GcnA-like Glycosyl hydrolase family 20 (GH20) catalytic domain of N-acetyl-beta-D-glucosaminidase (GcnA, also known as BhsA) and related proteins. GcnA is an exoglucosidase which cleaves N-acetyl-beta-D-galactosamine (NAG) and N-acetyl-beta-D-galactosamine residues from 4-methylumbelliferylated (4MU) substrates, as well as cleaving NAG from chito-oligosaccharides (i.e. NAG polymers). In contrast, sulfated forms of the substrate are unable to be cleaved and act instead as mild competitive inhibitors. Additionally, the enzyme is known to be poisoned by several first-row transition metals as well as by mercury. GcnA forms a homodimer with subunits comprised of three domains, an N-terminal zincin-like domain, this central catalytic GH20 domain, and a C-terminal alpha helical domain. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=52.70 E-value=2.2e+02 Score=32.14 Aligned_cols=56 Identities=21% Similarity=0.322 Sum_probs=41.8
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeec----CCccCCC----CCeeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVE----WSFHEPS----PGQFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~----Wn~HEp~----~G~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
.++-++.++.|...|+|.+..|+- +.-+ |+ +|.|. ..|+.++++.|++.|+.||--
T Consensus 16 ~~~lk~~id~ma~~k~N~l~lhl~D~f~~~~~-p~~~~~~~~yT---~~ei~ei~~yA~~~gI~vIPe 79 (301)
T cd06565 16 VSYLKKLLRLLALLGANGLLLYYEDTFPYEGE-PEVGRMRGAYT---KEEIREIDDYAAELGIEVIPL 79 (301)
T ss_pred HHHHHHHHHHHHHcCCCEEEEEEecceecCCC-cccccCCCCcC---HHHHHHHHHHHHHcCCEEEec
Confidence 577889999999999999988753 3221 22 22221 368999999999999999854
No 140
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=48.05 E-value=45 Score=37.51 Aligned_cols=69 Identities=14% Similarity=0.173 Sum_probs=52.5
Q ss_pred CCCCChhhHHHHHHHHHHcCCC--EEEEeecCCccCCCCCeeeccC--chhHHHHHHHHHHCCceEEecCCccccc
Q psy12907 200 YFRSPRYYWQDRLRKLRAAGLN--AVSTYVEWSFHEPSPGQFQFTG--DQDLEYFLELAVREDLLVLLRPGPYICA 271 (949)
Q Consensus 200 Y~R~p~~~W~d~L~k~Ka~GlN--tV~tYv~Wn~HEp~~G~fdF~g--~~DL~~fl~la~e~GL~VILRpGPYIcA 271 (949)
+.....+.-++.++++++.|+. +|.+=..|- ..-|.|.|+- .-|..++++..++.|+++++..=|+|+.
T Consensus 24 ~~~~s~~~v~~~~~~~~~~~iP~d~i~iD~~w~---~~~g~f~~d~~~FPdp~~mi~~l~~~G~k~~l~i~P~i~~ 96 (303)
T cd06592 24 KADINQETVLNYAQEIIDNGFPNGQIEIDDNWE---TCYGDFDFDPTKFPDPKGMIDQLHDLGFRVTLWVHPFINT 96 (303)
T ss_pred ccCcCHHHHHHHHHHHHHcCCCCCeEEeCCCcc---ccCCccccChhhCCCHHHHHHHHHHCCCeEEEEECCeeCC
Confidence 4567788899999999999975 444444452 3456666643 3489999999999999999888788864
No 141
>TIGR02100 glgX_debranch glycogen debranching enzyme GlgX. This family consists of the GlgX protein from the E. coli glycogen operon and probable equivalogs from other prokaryotic species. GlgX is not required for glycogen biosynthesis, but instead acts as a debranching enzyme for glycogen catabolism. This model distinguishes GlgX from pullanases and other related proteins that also operate on alpha-1,6-glycosidic linkages. In the wide band between the trusted and noise cutoffs are functionally similar enzymes, mostly from plants, that act similarly but usually are termed isoamylase.
Probab=47.60 E-value=27 Score=43.91 Aligned_cols=55 Identities=15% Similarity=0.208 Sum_probs=36.3
Q ss_pred HHHHHHHcCCCEEEE-eecCCccC---CCCC-----ee---e-------c---cCchhHHHHHHHHHHCCceEEecC
Q psy12907 211 RLRKLRAAGLNAVST-YVEWSFHE---PSPG-----QF---Q-------F---TGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 211 ~L~k~Ka~GlNtV~t-Yv~Wn~HE---p~~G-----~f---d-------F---~g~~DL~~fl~la~e~GL~VILRp 265 (949)
.|.-+|++|+|+|+. .|+=...+ .+.| -| | | ....++.++++.|+++||.|||--
T Consensus 189 ~LdyLk~LGvtaI~L~Pi~~~~~~~~~~~~~~~~ywGYd~~~y~a~d~~y~~~g~~~efk~LV~~~H~~GI~VIlDv 265 (688)
T TIGR02100 189 MIDYLKKLGVTAVELLPVHAFIDDRHLLEKGLRNYWGYNTLGFFAPEPRYLASGQVAEFKTMVRALHDAGIEVILDV 265 (688)
T ss_pred hhHHHHHcCCCEEEECCcccCCccccccccCCCCccCcCcccccccChhhcCCCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 477899999999986 44311111 1111 01 1 1 124689999999999999999873
No 142
>cd04908 ACT_Bt0572_1 N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains. Included in this CD is the N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains as seen in the uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related ACT domains. These tandem ACT domain proteins belong to the superfamily of ACT regulatory domains.
Probab=47.09 E-value=58 Score=27.85 Aligned_cols=55 Identities=16% Similarity=0.041 Sum_probs=43.3
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEe
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLL 263 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VIL 263 (949)
|..-.+.++.+.+.|+|..++|++= ++. ++.+.+.. .|.++..+..+++|..|.|
T Consensus 12 pG~La~v~~~l~~~~inI~~i~~~~--~~~-~~~~rl~~-~~~~~~~~~L~~~G~~v~~ 66 (66)
T cd04908 12 PGRLAAVTEILSEAGINIRALSIAD--TSE-FGILRLIV-SDPDKAKEALKEAGFAVKL 66 (66)
T ss_pred CChHHHHHHHHHHCCCCEEEEEEEe--cCC-CCEEEEEE-CCHHHHHHHHHHCCCEEEC
Confidence 4456788999999999999999732 333 58888866 5678999999999988754
No 143
>PLN02361 alpha-amylase
Probab=47.01 E-value=40 Score=39.76 Aligned_cols=56 Identities=9% Similarity=0.136 Sum_probs=38.7
Q ss_pred HHHHHHHHHcCCCEEEEeecCCc---cCCCCCe-e----eccCchhHHHHHHHHHHCCceEEec
Q psy12907 209 QDRLRKLRAAGLNAVSTYVEWSF---HEPSPGQ-F----QFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~tYv~Wn~---HEp~~G~-f----dF~g~~DL~~fl~la~e~GL~VILR 264 (949)
.+.|.-++++|+++|-+.=+..- |--.+.. | .|....+|.++++.|+++||+||+-
T Consensus 32 ~~kl~~l~~lG~t~iwl~P~~~~~~~~GY~~~d~y~~~~~~Gt~~el~~li~~~h~~gi~vi~D 95 (401)
T PLN02361 32 EGKVPDLAKSGFTSAWLPPPSQSLAPEGYLPQNLYSLNSAYGSEHLLKSLLRKMKQYNVRAMAD 95 (401)
T ss_pred HHHHHHHHHcCCCEEEeCCCCcCCCCCCCCcccccccCcccCCHHHHHHHHHHHHHcCCEEEEE
Confidence 45677789999999977543321 2112221 2 2444579999999999999999975
No 144
>PRK14510 putative bifunctional 4-alpha-glucanotransferase/glycogen debranching enzyme; Provisional
Probab=46.90 E-value=28 Score=46.58 Aligned_cols=55 Identities=18% Similarity=0.322 Sum_probs=38.2
Q ss_pred HHHHHHHHcCCCEEEE-eecCCccCCC---CC-----eee----------cc--CchhHHHHHHHHHHCCceEEec
Q psy12907 210 DRLRKLRAAGLNAVST-YVEWSFHEPS---PG-----QFQ----------FT--GDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 210 d~L~k~Ka~GlNtV~t-Yv~Wn~HEp~---~G-----~fd----------F~--g~~DL~~fl~la~e~GL~VILR 264 (949)
+.|.-+|++|+|+|+. .|+=...|.. .| -|+ |. ...++.++++.|+++||.|||-
T Consensus 191 ~~i~yLk~LGvt~I~L~Pi~~~~~~~~~~~~g~~~yWGY~~~~yfa~dp~yg~~~~~efk~lV~~~H~~GI~VILD 266 (1221)
T PRK14510 191 EAISYLKKLGVSIVELNPIFASVDEHHLPQLGLSNYWGYNTVAFLAPDPRLAPGGEEEFAQAIKEAQSAGIAVILD 266 (1221)
T ss_pred hhHHHHHHcCCCEEEeCCccccCcccccccccCcCcCCCCCCCCCCcChhhccCcHHHHHHHHHHHHHCCCEEEEE
Confidence 4677899999999986 4542222211 11 021 22 5678999999999999999986
No 145
>cd06589 GH31 The enzymes of glycosyl hydrolase family 31 (GH31) occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes
Probab=46.17 E-value=37 Score=37.27 Aligned_cols=65 Identities=11% Similarity=0.192 Sum_probs=49.9
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCee--eccC--chhHHHHHHHHHHCCceEEecCCccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQF--QFTG--DQDLEYFLELAVREDLLVLLRPGPYI 269 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~f--dF~g--~~DL~~fl~la~e~GL~VILRpGPYI 269 (949)
..+...+.++.+++.|+-+=.+.+-+..++. -+.| +|+- .-|..++++..++.|++|++..=|+|
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~~~lD~~~~~~-~~~f~~~~d~~~Fpdp~~~i~~l~~~g~~~~~~~~P~v 90 (265)
T cd06589 22 DQDKVLEVIDGMRENDIPLDGFVLDDDYTDG-YGDFTFDWDAGKFPNPKSMIDELHDNGVKLVLWIDPYI 90 (265)
T ss_pred CHHHHHHHHHHHHHcCCCccEEEECcccccC-CceeeeecChhhCCCHHHHHHHHHHCCCEEEEEeChhH
Confidence 4677899999999999987666665555543 4666 5532 34899999999999999999887777
No 146
>TIGR02401 trehalose_TreY malto-oligosyltrehalose synthase. This enzyme, formally named (1-4)-alpha-D-glucan 1-alpha-D-glucosylmutase, is the TreY enzyme of the TreYZ pathway of trehalose biosynthesis, an alternative to the OtsAB pathway. Trehalose may be incorporated into more complex compounds but is best known as compatible solute. It is one of the most effective osmoprotectants, and unlike the various betaines does not require nitrogen for its synthesis.
Probab=46.04 E-value=42 Score=43.04 Aligned_cols=62 Identities=18% Similarity=0.095 Sum_probs=43.8
Q ss_pred hhHHHHHHHHHHcCCCEEEE-eecC----CccCCC-----CCeeeccCchhHHHHHHHHHHCCceEEecCCc
Q psy12907 206 YYWQDRLRKLRAAGLNAVST-YVEW----SFHEPS-----PGQFQFTGDQDLEYFLELAVREDLLVLLRPGP 267 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~t-Yv~W----n~HEp~-----~G~fdF~g~~DL~~fl~la~e~GL~VILRpGP 267 (949)
+.|.+.|.-++++|+++|.+ .++= +.|--. .=.-.|.+..|+.+|++.|+++||.|||-.=|
T Consensus 16 ~~~~~~L~YL~~LGv~~V~lsPi~~a~~gs~hGYdv~D~~~idp~lGt~edf~~Lv~aah~~Gm~vIlDiVp 87 (825)
T TIGR02401 16 DDAAALLPYLKSLGVSHLYLSPILTAVPGSTHGYDVVDHSEINPELGGEEGLRRLSEAARARGLGLIVDIVP 87 (825)
T ss_pred HHHHHhhHHHHHcCCCEEEeCcCccCCCCCCCCCCCCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEEEecc
Confidence 56899999999999999965 3321 111100 00123456789999999999999999987544
No 147
>PF13199 Glyco_hydro_66: Glycosyl hydrolase family 66; PDB: 3VMO_A 3VMN_A 3VMP_A.
Probab=45.72 E-value=45 Score=41.00 Aligned_cols=83 Identities=14% Similarity=0.271 Sum_probs=51.4
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEe-ecCCccCCCCCee-----ec---cCc----hhHHHHHHHHHHCCceEEecCCcccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTY-VEWSFHEPSPGQF-----QF---TGD----QDLEYFLELAVREDLLVLLRPGPYIC 270 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tY-v~Wn~HEp~~G~f-----dF---~g~----~DL~~fl~la~e~GL~VILRpGPYIc 270 (949)
.++.=++.|..|+...||.|+.| ..|.+|.|-|+.= .| .++ .-+...|+.|++.|+.++.=--=|-+
T Consensus 116 ~~~~~~~~i~~L~~yHIN~~QFYDW~~rH~~Pl~~~~~~~~~~w~D~~~r~i~~~~Vk~yI~~ah~~Gmkam~Ynmiyaa 195 (559)
T PF13199_consen 116 SAEDIEAEIDQLNRYHINGLQFYDWMYRHHKPLPGTNGQPDQTWTDWANRQISTSTVKDYINAAHKYGMKAMAYNMIYAA 195 (559)
T ss_dssp GHHHHHHHHHHHHHTT--EEEETS--SBTTB-S-SSS-EEE-TT-TTT--EEEHHHHHHHHHHHHHTT-EEEEEEESSEE
T ss_pred CchhHHHHHHHHHhhCcCeEEEEeeccccCCcCCCCCCchhhhhhhhcCCEehHHHHHHHHHHHHHcCcceehhHhhhcc
Confidence 45678889999999999999999 6699999987644 22 221 35789999999999999743222322
Q ss_pred ccc--CCCCccccccccC
Q psy12907 271 AER--DFGGLPSWLMTIK 286 (949)
Q Consensus 271 AEw--~~GGlP~WL~~~~ 286 (949)
-+. ..|=.|.|.+-++
T Consensus 196 ~~~~~~~gv~~eW~ly~d 213 (559)
T PF13199_consen 196 NNNYEEDGVSPEWGLYKD 213 (559)
T ss_dssp ETT--S--SS-GGBEEES
T ss_pred ccCcccccCCchhhhhhc
Confidence 222 3566788987533
No 148
>KOG0626|consensus
Probab=45.33 E-value=41 Score=40.88 Aligned_cols=114 Identities=14% Similarity=0.227 Sum_probs=81.2
Q ss_pred hHHHHHHHHHHcCCCEEEEeecCCccCCCC---CeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCcccccc
Q psy12907 207 YWQDRLRKLRAAGLNAVSTYVEWSFHEPSP---GQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLM 283 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~---G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~ 283 (949)
.++++++.||++|++.-+.-|.|...=|.- +..+-+|..=-..+|+...++|+...+-- | =| .+|.||-
T Consensus 92 ~ykeDv~Lmk~lgv~afRFSIsWSRIlP~G~~~~gVN~~Gi~fY~~LI~eL~~nGI~P~VTL--f---Hw---DlPq~Le 163 (524)
T KOG0626|consen 92 RYKEDVKLMKELGVDAFRFSISWSRILPNGRLTGGVNEAGIQFYNNLIDELLANGIEPFVTL--F---HW---DLPQALE 163 (524)
T ss_pred hhHHHHHHHHHcCCCeEEEEeehHhhCCCCCcCCCcCHHHHHHHHHHHHHHHHcCCeEEEEE--e---cC---CCCHHHH
Confidence 488999999999999999999999887753 45777776667778888999999876542 0 24 4788887
Q ss_pred ccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEec
Q psy12907 284 TIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQ 328 (949)
Q Consensus 284 ~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQ 328 (949)
+.+-+-.-+..-..|++.++--|+++..+++-+.-=|...|..++
T Consensus 164 DeYgGwLn~~ivedF~~yA~~CF~~fGDrVK~WiT~NEP~v~s~~ 208 (524)
T KOG0626|consen 164 DEYGGWLNPEIVEDFRDYADLCFQEFGDRVKHWITFNEPNVFSIG 208 (524)
T ss_pred HHhccccCHHHHHHHHHHHHHHHHHhcccceeeEEecccceeeee
Confidence 644443333445567777777777777777765544555554443
No 149
>COG1647 Esterase/lipase [General function prediction only]
Probab=45.03 E-value=12 Score=40.73 Aligned_cols=57 Identities=25% Similarity=0.356 Sum_probs=45.1
Q ss_pred CeEECCCCCCchhhhhhhhhhcccccccccc---------------------------------------------cCCc
Q psy12907 614 PLVFLPPVSGTADIFFHLMLDLSRKGYRTLS---------------------------------------------YLGG 648 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is---------------------------------------------~lg~ 648 (949)
.|.||||-.||.+-.-...-.|.++||-|.+ |+||
T Consensus 17 AVLllHGFTGt~~Dvr~Lgr~L~e~GyTv~aP~ypGHG~~~e~fl~t~~~DW~~~v~d~Y~~L~~~gy~eI~v~GlSmGG 96 (243)
T COG1647 17 AVLLLHGFTGTPRDVRMLGRYLNENGYTVYAPRYPGHGTLPEDFLKTTPRDWWEDVEDGYRDLKEAGYDEIAVVGLSMGG 96 (243)
T ss_pred EEEEEeccCCCcHHHHHHHHHHHHCCceEecCCCCCCCCCHHHHhcCCHHHHHHHHHHHHHHHHHcCCCeEEEEeecchh
Confidence 4677888888888777777777888888775 7999
Q ss_pred chhhcccccccccCceeeccccccc
Q psy12907 649 FLAQKFAEHTVLRPRVVSLFLCNSF 673 (949)
Q Consensus 649 ~l~q~f~~~~~~~~~v~sl~lcn~~ 673 (949)
.++.|-+.+.+.+.- ..||.++
T Consensus 97 v~alkla~~~p~K~i---v~m~a~~ 118 (243)
T COG1647 97 VFALKLAYHYPPKKI---VPMCAPV 118 (243)
T ss_pred HHHHHHHhhCCccce---eeecCCc
Confidence 999999999886544 5677766
No 150
>PRK13398 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=44.90 E-value=83 Score=35.09 Aligned_cols=74 Identities=16% Similarity=0.215 Sum_probs=56.8
Q ss_pred eEeCCeeeEEecc--ccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeecc--CchhHHHHHHHHHHCCc
Q psy12907 184 FVKDGKPFRYVSG--SFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFT--GDQDLEYFLELAVREDL 259 (949)
Q Consensus 184 f~~dGkpf~i~sG--e~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~--g~~DL~~fl~la~e~GL 259 (949)
+.+++.++.++.| ++| ..+.-.+..+++|++|+..+..|.+=+... -+.|. |...+..+-+.|++.||
T Consensus 21 ~~~g~~~~~~iaGPCsie----~~~~~~~~A~~lk~~g~~~~r~~~~kpRTs----~~s~~G~g~~gl~~l~~~~~~~Gl 92 (266)
T PRK13398 21 VVIGGEEKIIIAGPCAVE----SEEQMVKVAEKLKELGVHMLRGGAFKPRTS----PYSFQGLGEEGLKILKEVGDKYNL 92 (266)
T ss_pred EEEcCCCEEEEEeCCcCC----CHHHHHHHHHHHHHcCCCEEEEeeecCCCC----CCccCCcHHHHHHHHHHHHHHcCC
Confidence 6667778888887 344 367788899999999999999997653332 33565 46779999999999999
Q ss_pred eEEecC
Q psy12907 260 LVLLRP 265 (949)
Q Consensus 260 ~VILRp 265 (949)
.++-.|
T Consensus 93 ~~~te~ 98 (266)
T PRK13398 93 PVVTEV 98 (266)
T ss_pred CEEEee
Confidence 998664
No 151
>PLN02980 2-oxoglutarate decarboxylase/ hydro-lyase/ magnesium ion binding / thiamin pyrophosphate binding
Probab=44.57 E-value=11 Score=51.81 Aligned_cols=41 Identities=27% Similarity=0.366 Sum_probs=29.4
Q ss_pred cccCCCCccCCCeEECCCCCCchhhhhhhhhhcccccccccc
Q psy12907 603 VYDCGPKHIRSPLVFLPPVSGTADIFFHLMLDLSRKGYRTLS 644 (949)
Q Consensus 603 ~~~~g~~~~~~~v~~l~~~~g~a~~~~~~~~~l~~kgyR~is 644 (949)
|++.|......+|+||||..|++..|..++-.|+ +.||||.
T Consensus 1362 ~~~~G~~~~~~~vVllHG~~~s~~~w~~~~~~L~-~~~rVi~ 1402 (1655)
T PLN02980 1362 VHEVGQNAEGSVVLFLHGFLGTGEDWIPIMKAIS-GSARCIS 1402 (1655)
T ss_pred EEecCCCCCCCeEEEECCCCCCHHHHHHHHHHHh-CCCEEEE
Confidence 3444432223479999999999999888877775 4588885
No 152
>PRK11071 esterase YqiA; Provisional
Probab=43.88 E-value=8.9 Score=39.89 Aligned_cols=55 Identities=18% Similarity=0.310 Sum_probs=40.4
Q ss_pred CeEECCCCCCchhhhhhhhh-h-ccc--ccccccc-------------------------------cCCcchhhcccccc
Q psy12907 614 PLVFLPPVSGTADIFFHLML-D-LSR--KGYRTLS-------------------------------YLGGFLAQKFAEHT 658 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~-~-l~~--kgyR~is-------------------------------~lg~~l~q~f~~~~ 658 (949)
+|+||||..+++..+..+.+ . |++ ++|||+. |+||+++++++...
T Consensus 3 ~illlHGf~ss~~~~~~~~~~~~l~~~~~~~~v~~~dl~g~~~~~~~~l~~l~~~~~~~~~~lvG~S~Gg~~a~~~a~~~ 82 (190)
T PRK11071 3 TLLYLHGFNSSPRSAKATLLKNWLAQHHPDIEMIVPQLPPYPADAAELLESLVLEHGGDPLGLVGSSLGGYYATWLSQCF 82 (190)
T ss_pred eEEEECCCCCCcchHHHHHHHHHHHHhCCCCeEEeCCCCCCHHHHHHHHHHHHHHcCCCCeEEEEECHHHHHHHHHHHHc
Confidence 68899999999998885432 3 233 3788884 79999999999877
Q ss_pred cccCceeeccccccc
Q psy12907 659 VLRPRVVSLFLCNSF 673 (949)
Q Consensus 659 ~~~~~v~sl~lcn~~ 673 (949)
+. .++|.|+-
T Consensus 83 ~~-----~~vl~~~~ 92 (190)
T PRK11071 83 ML-----PAVVVNPA 92 (190)
T ss_pred CC-----CEEEECCC
Confidence 62 24666664
No 153
>TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily.
Probab=43.59 E-value=10 Score=41.49 Aligned_cols=29 Identities=28% Similarity=0.222 Sum_probs=23.1
Q ss_pred cCCcchhhcccccccccCceeecccccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFYDT 676 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~dt 676 (949)
|+||.++.+++... +.|++++|.|++..+
T Consensus 108 S~Gg~~a~~~a~~~---~~v~~lil~~p~~~~ 136 (274)
T TIGR03100 108 CDAASAALLYAPAD---LRVAGLVLLNPWVRT 136 (274)
T ss_pred CHHHHHHHHHhhhC---CCccEEEEECCccCC
Confidence 78999999998643 478999999998543
No 154
>PF03659 Glyco_hydro_71: Glycosyl hydrolase family 71 ; InterPro: IPR005197 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is a family of alpha-1,3-glucanases belonging to glycoside hydrolase family 71 (GH71 from CAZY).
Probab=43.40 E-value=1e+02 Score=36.20 Aligned_cols=53 Identities=17% Similarity=0.280 Sum_probs=41.4
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecC
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRp 265 (949)
..+.|+++++.+|++|++....-+- ....+. ...|...++.|++.|+++.+-+
T Consensus 15 t~~dw~~di~~A~~~GIDgFaLNig------~~d~~~---~~~l~~a~~AA~~~gFKlf~Sf 67 (386)
T PF03659_consen 15 TQEDWEADIRLAQAAGIDGFALNIG------SSDSWQ---PDQLADAYQAAEAVGFKLFFSF 67 (386)
T ss_pred CHHHHHHHHHHHHHcCCCEEEEecc------cCCccc---HHHHHHHHHHHHhcCCEEEEEe
Confidence 6899999999999999999877654 111111 2458889999999999998875
No 155
>PRK14507 putative bifunctional 4-alpha-glucanotransferase/malto-oligosyltrehalose synthase; Provisional
Probab=43.35 E-value=45 Score=45.90 Aligned_cols=59 Identities=24% Similarity=0.265 Sum_probs=43.9
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCe---e----------eccCchhHHHHHHHHHHCCceEEecCCc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQ---F----------QFTGDQDLEYFLELAVREDLLVLLRPGP 267 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~---f----------dF~g~~DL~~fl~la~e~GL~VILRpGP 267 (949)
+.|.+.|.-+|++|+|+|-+-=.+ +..+|. | .|.+..|+.++++.|+++||.|||-.=|
T Consensus 758 ~~~~~~l~Yl~~LGv~~i~lsPi~---~a~~gs~hGYdv~D~~~idp~lG~~edf~~Lv~~ah~~Gi~vilDiV~ 829 (1693)
T PRK14507 758 ADAEAILPYLAALGISHVYASPIL---KARPGSTHGYDIVDHSQINPEIGGEEGFERFCAALKAHGLGQLLDIVP 829 (1693)
T ss_pred HHHHHHhHHHHHcCCCEEEECCCc---CCCCCCCCCCCCCCCCccCcccCCHHHHHHHHHHHHHCCCEEEEEecc
Confidence 679999999999999999663222 222221 1 2446789999999999999999987544
No 156
>TIGR02102 pullulan_Gpos pullulanase, extracellular, Gram-positive. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. In contrast, a glycogen debranching enzyme such GlgX, homologous to this family, can release glucose at alpha,1-6 linkages from glycogen first subjected to limit degradation by phosphorylase. Characterized members of this family include a surface-located pullulanase from Streptococcus pneumoniae (PubMed:11083842) and an extracellular bifunctional amylase/pullulanase with C-terminal pullulanase activity (PubMed:8798645).
Probab=42.84 E-value=44 Score=44.28 Aligned_cols=21 Identities=10% Similarity=0.226 Sum_probs=19.3
Q ss_pred hhHHHHHHHHHHCCceEEecC
Q psy12907 245 QDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 245 ~DL~~fl~la~e~GL~VILRp 265 (949)
.++.++++.|+++||.|||--
T Consensus 555 ~EfK~LV~alH~~GI~VILDV 575 (1111)
T TIGR02102 555 AEFKNLINEIHKRGMGVILDV 575 (1111)
T ss_pred HHHHHHHHHHHHCCCEEEEec
Confidence 689999999999999999873
No 157
>PRK12677 xylose isomerase; Provisional
Probab=42.44 E-value=1.7e+02 Score=34.34 Aligned_cols=92 Identities=15% Similarity=0.116 Sum_probs=56.5
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeecc---CchhHHHHHHHHHHCCceEE-ecCCcccccccCCCCcccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFT---GDQDLEYFLELAVREDLLVL-LRPGPYICAERDFGGLPSW 281 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~---g~~DL~~fl~la~e~GL~VI-LRpGPYIcAEw~~GGlP~W 281 (949)
-.+.+++++++++|+..|+.. .+..--|+.+ -...+.++.+++++.||.|. +.|.-|-+..+..|+
T Consensus 31 ~~~~E~v~~~a~~Gf~gVElh------~~~l~p~~~~~~~~~~~~~~lk~~l~~~GL~v~~v~~n~f~~p~~~~g~---- 100 (384)
T PRK12677 31 LDPVEAVHKLAELGAYGVTFH------DDDLVPFGATDAERDRIIKRFKKALDETGLVVPMVTTNLFTHPVFKDGA---- 100 (384)
T ss_pred CCHHHHHHHHHHhCCCEEEec------ccccCCCCCChhhhHHHHHHHHHHHHHcCCeeEEEecCCCCCccccCCc----
Confidence 358899999999999999883 1111112111 11358899999999999986 544322111222222
Q ss_pred ccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 282 LMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 282 L~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
+-+.|+.-++...+.+++.+..-+.+
T Consensus 101 ---------lts~d~~~R~~Ai~~~~r~IdlA~eL 126 (384)
T PRK12677 101 ---------FTSNDRDVRRYALRKVLRNIDLAAEL 126 (384)
T ss_pred ---------CCCCCHHHHHHHHHHHHHHHHHHHHh
Confidence 33446666776666666666666654
No 158
>PRK07581 hypothetical protein; Validated
Probab=42.16 E-value=7.6 Score=43.36 Aligned_cols=28 Identities=11% Similarity=-0.092 Sum_probs=23.6
Q ss_pred cCCcchhhcccccccccCceeecccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFY 674 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~ 674 (949)
|+||++|+.++..+| +.|.+|+|+++..
T Consensus 132 S~GG~va~~~a~~~P--~~V~~Lvli~~~~ 159 (339)
T PRK07581 132 SMGAQQTYHWAVRYP--DMVERAAPIAGTA 159 (339)
T ss_pred CHHHHHHHHHHHHCH--HHHhhheeeecCC
Confidence 899999999999888 6889999887653
No 159
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=42.07 E-value=83 Score=41.51 Aligned_cols=67 Identities=12% Similarity=0.120 Sum_probs=46.1
Q ss_pred eeEEEEEecCCCC--CCCCcccccCCcceEEEEEECCEEEEEEEec-ceeeeecc--CCC-CCeEEEEEEeCC
Q psy12907 521 GFVLYETIIPDAR--FPDPALLTISGLRDRGQVFVDEKLVTILYRN-KMLSTPIM--ARP-GQKLSILVENMG 587 (949)
Q Consensus 521 GyvlY~t~i~~~~--~~~~~~L~i~~vrDrA~Vfvdg~~vG~l~r~-~~~~i~~~--~~~-~~~LdILVENmG 587 (949)
|--.||+++.-+. .+..+.|.+.+|.-.|+|+|||++||.-.-. ....+.++ .+. .++|.|.|.+..
T Consensus 109 ~~g~Yrr~F~lp~~~~gkrv~L~FeGV~s~a~VwvNG~~VG~~~g~~~pfefDIT~~l~~G~N~LaV~V~~~~ 181 (1021)
T PRK10340 109 PTGAYQRTFTLSDGWQGKQTIIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWA 181 (1021)
T ss_pred CeEEEEEEEEeCcccccCcEEEEECccceEEEEEECCEEeccccCCCccEEEEcchhhCCCccEEEEEEEecC
Confidence 5669999874332 3445778999999999999999999974332 23344443 222 378999998543
No 160
>PLN00196 alpha-amylase; Provisional
Probab=42.04 E-value=55 Score=38.91 Aligned_cols=57 Identities=16% Similarity=0.243 Sum_probs=39.5
Q ss_pred HHHHHHHHHcCCCEEEEeecCCc---cCCCCCe-ee-----ccCchhHHHHHHHHHHCCceEEecC
Q psy12907 209 QDRLRKLRAAGLNAVSTYVEWSF---HEPSPGQ-FQ-----FTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~tYv~Wn~---HEp~~G~-fd-----F~g~~DL~~fl~la~e~GL~VILRp 265 (949)
.+.|.-+|++|+++|-+.=+..- |--.+.. |+ |....||.++++.|++.||+||+-.
T Consensus 47 ~~kldyL~~LGvtaIWL~P~~~s~s~hGY~~~D~y~ld~~~fGt~~elk~Lv~~aH~~GIkVilDv 112 (428)
T PLN00196 47 MGKVDDIAAAGITHVWLPPPSHSVSEQGYMPGRLYDLDASKYGNEAQLKSLIEAFHGKGVQVIADI 112 (428)
T ss_pred HHHHHHHHHcCCCEEEeCCCCCCCCCCCCCccccCCCCcccCCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 47788889999999976433221 2212211 22 3345799999999999999999864
No 161
>PRK14511 maltooligosyl trehalose synthase; Provisional
Probab=41.56 E-value=54 Score=42.33 Aligned_cols=61 Identities=11% Similarity=0.107 Sum_probs=44.3
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCC------ee-------eccCchhHHHHHHHHHHCCceEEecCCcc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPG------QF-------QFTGDQDLEYFLELAVREDLLVLLRPGPY 268 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G------~f-------dF~g~~DL~~fl~la~e~GL~VILRpGPY 268 (949)
-+.|.+.|.-++++|+|+|-.-=.+ +..+| .. .|.+..++.+|++.|+++||.|||-.=|-
T Consensus 19 f~~~~~~l~YL~~LGis~IyLsPi~---~a~~gs~hGYdv~D~~~idp~lGt~e~f~~Lv~aah~~Gi~VIlDiV~N 92 (879)
T PRK14511 19 FDDAAELVPYFADLGVSHLYLSPIL---AARPGSTHGYDVVDHTRINPELGGEEGLRRLAAALRAHGMGLILDIVPN 92 (879)
T ss_pred HHHHHHHhHHHHHcCCCEEEECcCc---cCCCCCCCCCCcCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEEEeccc
Confidence 3569999999999999999653211 11112 11 23466899999999999999999886553
No 162
>PF01055 Glyco_hydro_31: Glycosyl hydrolases family 31 ; InterPro: IPR000322 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 31 GH31 from CAZY comprises enzymes with several known activities; alpha-glucosidase (3.2.1.20 from EC), alpha-galactosidase (3.2.1.22 from EC); glucoamylase (3.2.1.3 from EC), sucrase-isomaltase (3.2.1.48 from EC); isomaltase (3.2.1.10 from EC); alpha-xylosidase (3.2.1 from EC); alpha-glucan lyase (4.2.2.13 from EC). Glycoside hydrolase family 31 groups a number of glycosyl hydrolases on the basis of sequence similarities [, , ] An aspartic acid has been implicated [] in the catalytic activity of sucrase, isomaltase, and lysosomal alpha-glucosidase.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3L4U_A 3L4X_A 3L4W_A 3L4V_A 3CTT_A 2QMJ_A 2QLY_A 3L4Z_A 3L4Y_A 3L4T_A ....
Probab=41.37 E-value=64 Score=37.85 Aligned_cols=69 Identities=12% Similarity=0.323 Sum_probs=46.6
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCc--hhHHHHHHHHHHCCceEEecCCcccccccC
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGD--QDLEYFLELAVREDLLVLLRPGPYICAERD 274 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYIcAEw~ 274 (949)
.+...+.++.+|+.|+-.=...+-..... ..+.|.|+.. -|..++++.+++.|+++++..-|+|+-+-.
T Consensus 42 ~~~v~~~i~~~~~~~iP~d~~~iD~~~~~-~~~~f~~d~~~FPd~~~~~~~l~~~G~~~~~~~~P~v~~~~~ 112 (441)
T PF01055_consen 42 QDEVREVIDRYRSNGIPLDVIWIDDDYQD-GYGDFTWDPERFPDPKQMIDELHDQGIKVVLWVHPFVSNDSP 112 (441)
T ss_dssp HHHHHHHHHHHHHTT--EEEEEE-GGGSB-TTBTT-B-TTTTTTHHHHHHHHHHTT-EEEEEEESEEETTTT
T ss_pred HHHHHHHHHHHHHcCCCccceeccccccc-cccccccccccccchHHHHHhHhhCCcEEEEEeecccCCCCC
Confidence 66788999999999988755555433333 4445555432 389999999999999999988888875543
No 163
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=41.34 E-value=79 Score=36.43 Aligned_cols=108 Identities=19% Similarity=0.180 Sum_probs=53.2
Q ss_pred HHHHHHHHHHHHHhCCCeeEEEeeCCCC--CCc--cc-cCCCCceeeccCC----------CCCCHHHHHHHhhhcCCCC
Q psy12907 48 KEHMIWLRDQMKYYVRDAAVLYTTDGGG--VGY--LK-CTVPGVYATVDFG----------AATNVSSAFAAMRTVSPHG 112 (949)
Q Consensus 48 ~~YM~~L~d~~r~~Gi~~VPL~T~Dg~~--~~~--~~-G~lpgv~~tvnfg----------~~~~~~~~~~~l~~~~P~~ 112 (949)
.+|+.++++.+|+..-+ .|+.|+-.+. ... .. ...-++.+ .|.. ......-..+..+...+++
T Consensus 212 ~~~~~~~~~~ir~~~p~-~~vt~n~~~~~~~~~d~~~~a~~~D~~~-~d~Y~~~~~~~~~~~~~~~a~~~dl~R~~~~~k 289 (374)
T PF02449_consen 212 AEFFRWQADIIREYDPD-HPVTTNFMGSWFNGIDYFKWAKYLDVVS-WDSYPDGSFDFYDDDPYSLAFNHDLMRSLAKGK 289 (374)
T ss_dssp HHHHHHHHHHHHHHSTT--EEE-EE-TT---SS-HHHHGGGSSSEE-EEE-HHHHHTTTT--TTHHHHHHHHHHHHTTT-
T ss_pred HHHHHHHHHHHHHhCCC-ceEEeCccccccCcCCHHHHHhhCCcce-eccccCcccCCCCCCHHHHHHHHHHHHhhcCCC
Confidence 58999999999999755 6777764321 100 00 00011111 1111 0111222334456667899
Q ss_pred CceeeeeCCccccccCCCCccCChHHHHHHHHHHHHcccc-eeeecc
Q psy12907 113 PLVNSEYYPGWLTHWGEKIATVDTDPVVKTLQIMLEMKAN-VNFYIP 158 (949)
Q Consensus 113 P~m~~Ef~~GWFD~WG~~~~~~~~~~~~~~l~~~L~~GaS-~NlYM~ 158 (949)
|++++|.++| -..|+.......+..+....-..++.|+. +.++=+
T Consensus 290 pf~v~E~~~g-~~~~~~~~~~~~pg~~~~~~~~~~A~Ga~~i~~~~w 335 (374)
T PF02449_consen 290 PFWVMEQQPG-PVNWRPYNRPPRPGELRLWSWQAIAHGADGILFWQW 335 (374)
T ss_dssp -EEEEEE--S---SSSSS-----TTHHHHHHHHHHHTT-S-EEEC-S
T ss_pred ceEeecCCCC-CCCCccCCCCCCCCHHHHHHHHHHHHhCCeeEeeec
Confidence 9999999999 66677665555667777777778899984 555444
No 164
>PRK03705 glycogen debranching enzyme; Provisional
Probab=40.42 E-value=45 Score=41.79 Aligned_cols=55 Identities=16% Similarity=0.254 Sum_probs=35.9
Q ss_pred HHHHHHHcCCCEEEE-eecCCccCCCC---C-----eee----------ccC-----chhHHHHHHHHHHCCceEEecC
Q psy12907 211 RLRKLRAAGLNAVST-YVEWSFHEPSP---G-----QFQ----------FTG-----DQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 211 ~L~k~Ka~GlNtV~t-Yv~Wn~HEp~~---G-----~fd----------F~g-----~~DL~~fl~la~e~GL~VILRp 265 (949)
.|.-+|++|+|+|+. +|+=...++.. | -|| |.. ..++.++++.|+++||.|||--
T Consensus 184 ~LdYLk~LGvt~I~L~Pv~~~~~~~~~~~~g~~~ywGYd~~~yfa~d~~ygt~~~~~~~efk~LV~~~H~~GI~VIlDv 262 (658)
T PRK03705 184 MIAYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPLAMFALDPAYASGPETALDEFRDAVKALHKAGIEVILDV 262 (658)
T ss_pred chHHHHHcCCCEEEecCcccCCCcccccccccccccCcccccccccccccCCCCcchHHHHHHHHHHHHHCCCEEEEEE
Confidence 488899999999986 34311111110 0 011 211 2589999999999999999873
No 165
>PF14307 Glyco_tran_WbsX: Glycosyltransferase WbsX
Probab=40.01 E-value=3.8e+02 Score=30.81 Aligned_cols=142 Identities=18% Similarity=0.244 Sum_probs=85.1
Q ss_pred cCCCCC-ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHH---HCCceEEecCCccccccc
Q psy12907 198 FHYFRS-PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAV---REDLLVLLRPGPYICAER 273 (949)
Q Consensus 198 ~HY~R~-p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~---e~GL~VILRpGPYIcAEw 273 (949)
..|.-+ .++..+.-++.+|+.|++.-..|-.| |+|.+=|++-++..- +.++..-|+ |
T Consensus 49 lGyYdl~~p~v~~~Q~~lA~~~GI~gF~~~~Yw-----------f~gk~lLe~p~~~~l~~~~~d~pFcl~--------W 109 (345)
T PF14307_consen 49 LGYYDLRDPEVMEKQAELAKEYGIDGFCFYHYW-----------FNGKRLLEKPLENLLASKEPDFPFCLC--------W 109 (345)
T ss_pred CCcccCCCHHHHHHHHHHHHHhCCCEEEEEeee-----------cCCchHHHHHHHHHHhcCCCCCcEEEE--------E
Confidence 443333 37889999999999999999999877 445555666665443 345555444 4
Q ss_pred CCCCc-cccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHh--ccccCCCceEEeccccccCCccchHHHHHHHHHHH
Q psy12907 274 DFGGL-PSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITR--FLYGNGGPIILVQVENEMGSYTCDKEHMIWLRDQM 350 (949)
Q Consensus 274 ~~GGl-P~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~--~~~~~GGpII~vQIENEyG~y~~d~~Y~~~L~~~~ 350 (949)
.+..- =.|-.. ..++.+--..+. .+..++.++.|++.++. ++--+|-||+++=--.+. .+-++.++.+++.+
T Consensus 110 AN~~w~~~w~g~-~~~~l~~q~y~~-~~d~~~~~~~l~~~F~D~rYikVdGKPv~~Iy~p~~~---pd~~~~~~~wr~~a 184 (345)
T PF14307_consen 110 ANENWTRRWDGR-NNEILIEQKYSG-EDDWKEHFRYLLPYFKDPRYIKVDGKPVFLIYRPGDI---PDIKEMIERWREEA 184 (345)
T ss_pred CCChhhhccCCC-CccccccccCCc-hhHHHHHHHHHHHHhCCCCceeECCEEEEEEECcccc---cCHHHHHHHHHHHH
Confidence 33211 112222 122222111110 02235566778888874 333477899987554333 45689999999999
Q ss_pred HHhcCCeeEEEeeC
Q psy12907 351 KYYVRDAAVLYTTD 364 (949)
Q Consensus 351 ~~~lG~~vplft~D 364 (949)
+++ |+..+.+..+
T Consensus 185 ~~~-G~~giyii~~ 197 (345)
T PF14307_consen 185 KEA-GLPGIYIIAV 197 (345)
T ss_pred HHc-CCCceEEEEE
Confidence 987 8886655443
No 166
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=39.92 E-value=57 Score=37.00 Aligned_cols=66 Identities=12% Similarity=0.221 Sum_probs=49.2
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCC--eeeccCc--hhHHHHHHHHHHCCceEEecCCcccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPG--QFQFTGD--QDLEYFLELAVREDLLVLLRPGPYIC 270 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G--~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYIc 270 (949)
..+.-++.++++++.|+-+=.+.+-|.... ..+ .|+|+-. -|..++++..++.|++|++..=|+|+
T Consensus 22 ~~~ev~~~~~~~~~~~iP~d~i~lD~~~~~-~~~~~~f~~d~~~FPdp~~mi~~L~~~G~kv~~~i~P~v~ 91 (319)
T cd06591 22 TQEELLDVAKEYRKRGIPLDVIVQDWFYWP-KQGWGEWKFDPERFPDPKAMVRELHEMNAELMISIWPTFG 91 (319)
T ss_pred CHHHHHHHHHHHHHhCCCccEEEEechhhc-CCCceeEEEChhhCCCHHHHHHHHHHCCCEEEEEecCCcC
Confidence 456778999999999988755555555443 234 6666543 38999999999999999987667764
No 167
>KOG1454|consensus
Probab=39.84 E-value=17 Score=41.48 Aligned_cols=64 Identities=30% Similarity=0.382 Sum_probs=50.5
Q ss_pred CCCeEECCCCCCchhhhhhhhhhcccc-cccccc--------------------------------------------cC
Q psy12907 612 RSPLVFLPPVSGTADIFFHLMLDLSRK-GYRTLS--------------------------------------------YL 646 (949)
Q Consensus 612 ~~~v~~l~~~~g~a~~~~~~~~~l~~k-gyR~is--------------------------------------------~l 646 (949)
..||++|||-.+.+.-|=+++-.|+++ |+||++ |+
T Consensus 58 ~~pvlllHGF~~~~~~w~~~~~~L~~~~~~~v~aiDl~G~g~~s~~~~~~~y~~~~~v~~i~~~~~~~~~~~~~lvghS~ 137 (326)
T KOG1454|consen 58 KPPVLLLHGFGASSFSWRRVVPLLSKAKGLRVLAIDLPGHGYSSPLPRGPLYTLRELVELIRRFVKEVFVEPVSLVGHSL 137 (326)
T ss_pred CCcEEEeccccCCcccHhhhccccccccceEEEEEecCCCCcCCCCCCCCceehhHHHHHHHHHHHhhcCcceEEEEeCc
Confidence 779999999999777777777777755 599985 69
Q ss_pred CcchhhcccccccccCceeecc----ccccccccc
Q psy12907 647 GGFLAQKFAEHTVLRPRVVSLF----LCNSFYDTS 677 (949)
Q Consensus 647 g~~l~q~f~~~~~~~~~v~sl~----lcn~~~dt~ 677 (949)
||++|-.||...+ +-|.+|+ +|+.++.+.
T Consensus 138 Gg~va~~~Aa~~P--~~V~~lv~~~~~~~~~~~~~ 170 (326)
T KOG1454|consen 138 GGIVALKAAAYYP--ETVDSLVLLDLLGPPVYSTP 170 (326)
T ss_pred HHHHHHHHHHhCc--ccccceeeecccccccccCC
Confidence 9999999999877 7888888 455554433
No 168
>COG1306 Uncharacterized conserved protein [Function unknown]
Probab=39.50 E-value=47 Score=37.77 Aligned_cols=61 Identities=15% Similarity=0.200 Sum_probs=39.5
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecC---CccCCCCCeeec-----cCchhHHHHHHHHHHCCceEEecC
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEW---SFHEPSPGQFQF-----TGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~W---n~HEp~~G~fdF-----~g~~DL~~fl~la~e~GL~VILRp 265 (949)
+..-.++++.+|..|+|++-+=+-= ++.=|....+.= .-..|+..||+.|+|+|||+|.|.
T Consensus 76 kk~~de~fk~ikdn~~Na~ViD~Kdd~G~lty~s~d~~~~~~~sv~~f~Di~~~iKkaKe~giY~IARi 144 (400)
T COG1306 76 KKRLDELFKLIKDNNINAFVIDVKDDYGELTYPSSDEINKYTKSVNKFKDIEPVIKKAKENGIYAIARI 144 (400)
T ss_pred hhHHHHHHHHHHhCCCCEEEEEecCCCccEeccccchhhhhhhccccccccHHHHHHHHhcCeEEEEEE
Confidence 3456789999999999997543211 111111111110 012589999999999999999994
No 169
>COG3589 Uncharacterized conserved protein [Function unknown]
Probab=38.37 E-value=48 Score=38.21 Aligned_cols=68 Identities=22% Similarity=0.234 Sum_probs=51.9
Q ss_pred cccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCe-eeccCchhHHHHHHHHHHCCceEEecCCcccc
Q psy12907 196 GSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQ-FQFTGDQDLEYFLELAVREDLLVLLRPGPYIC 270 (949)
Q Consensus 196 Ge~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~-fdF~g~~DL~~fl~la~e~GL~VILRpGPYIc 270 (949)
=++.+.|.+.+.=..-|++|...|+..|-| ++|.|++.. --|. -+.+.++.|.++|++||+-..|-|-
T Consensus 6 fSifp~~~~~~~~~~Yi~~~~~~Gf~~IFt----sl~~~~~~~~~~~~---~~~ell~~Anklg~~vivDvnPsil 74 (360)
T COG3589 6 FSIFPNRSPKEKDIAYIDRMHKYGFKRIFT----SLLIPEEDAELYFH---RFKELLKEANKLGLRVIVDVNPSIL 74 (360)
T ss_pred EEeccCCCcchhHHHHHHHHHHcCccceee----ecccCCchHHHHHH---HHHHHHHHHHhcCcEEEEEcCHHHH
Confidence 355667888888888899999999988766 567776642 1122 3788999999999999999877554
No 170
>PF01791 DeoC: DeoC/LacD family aldolase; InterPro: IPR002915 This family includes the enzyme deoxyribose-phosphate aldolase, which is involved in nucleotide metabolism. 2-deoxy-D-ribose 5-phosphate = D-glyceraldehyde 3-phosphate + acetaldehyde The family also includes a group of related bacterial proteins of unknown function, see examples Q57843 from SWISSPROT and P76143 from SWISSPROT.; GO: 0016829 lyase activity; PDB: 2A4A_A 1VCV_B 1P1X_A 1KTN_B 1JCL_A 1JCJ_A 1MZH_A 3GKF_D 3GLC_L 3GND_N ....
Probab=37.32 E-value=14 Score=39.89 Aligned_cols=53 Identities=11% Similarity=0.159 Sum_probs=42.3
Q ss_pred HHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 209 QDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
-...+.+.++|.+.|.+.++|..-.+..-.+.. .++.++.+.|++.||.||+.
T Consensus 79 ~~~ve~A~~~GAd~vd~vi~~~~~~~~~~~~~~---~~i~~v~~~~~~~gl~vIlE 131 (236)
T PF01791_consen 79 VAEVEEAIRLGADEVDVVINYGALGSGNEDEVI---EEIAAVVEECHKYGLKVILE 131 (236)
T ss_dssp HHHHHHHHHTT-SEEEEEEEHHHHHTTHHHHHH---HHHHHHHHHHHTSEEEEEEE
T ss_pred HHHHHHHHHcCCceeeeeccccccccccHHHHH---HHHHHHHHHHhcCCcEEEEE
Confidence 556888999999999999999766554433333 57999999999999999998
No 171
>smart00518 AP2Ec AP endonuclease family 2. These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites
Probab=36.22 E-value=4.4e+02 Score=28.53 Aligned_cols=63 Identities=13% Similarity=0.130 Sum_probs=43.6
Q ss_pred CCChhhHHHHHHHHHHcCCCEEEEeec--CCccCCCCCeeeccCchhHHHHHHHHHHCCceEE-ecCC
Q psy12907 202 RSPRYYWQDRLRKLRAAGLNAVSTYVE--WSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL-LRPG 266 (949)
Q Consensus 202 R~p~~~W~d~L~k~Ka~GlNtV~tYv~--Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI-LRpG 266 (949)
+++++..+..-+.+++.|+.. +..-+ .|+..|.|....-+ ...+.+-+++|++.|-.+| +-||
T Consensus 41 ~~~~~~~~~l~~~~~~~gl~l-s~h~p~~~nl~s~d~~~r~~~-~~~l~~~i~~A~~lGa~~vv~h~g 106 (273)
T smart00518 41 RLSEETAEKFKEALKENNIDV-SVHAPYLINLASPDKEKVEKS-IERLIDEIKRCEELGIKALVFHPG 106 (273)
T ss_pred CCCHHHHHHHHHHHHHcCCCE-EEECCceecCCCCCHHHHHHH-HHHHHHHHHHHHHcCCCEEEEccc
Confidence 577888888888899999974 33222 36666655544443 2458889999999999764 4444
No 172
>PF01120 Alpha_L_fucos: Alpha-L-fucosidase; InterPro: IPR000933 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Family 29 (GH29 from CAZY) encompasses alpha-L-fucosidases (3.2.1.51 from EC) [], which is a lysosomal enzyme responsible for hydrolyzing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Alpha-L-fucosidase is responsible for hydrolysing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Fucosylated glycoconjugates are involved in numerous biological events, making alpha-l-fucosidases, the enzymes responsible for their processing, critically important. Deficiency in alpha-l-fucosidase activity is associated with fucosidosis, a lysosomal storage disorder characterised by rapid neurodegeneration, resulting in severe mental and motor deterioration []. The enzyme is a hexamer and displays a two-domain fold, composed of a catalytic (beta/alpha)(8)-like domain and a C-terminal beta-sandwich domain []. Drosophila melanogaster spermatozoa contains an alpha-l-fucosidase that might be involved in fertilisation by interacting with alpha-l-fucose residues on the micropyle of the eggshell []. In human sperm, membrane-associated alpha-l-fucosidase is stable for extended periods of time, which is made possible by membrane domains and compartmentalisation. These help preserve protein integrity []. ; GO: 0004560 alpha-L-fucosidase activity, 0005975 carbohydrate metabolic process; PDB: 3EYP_B 2ZX6_A 2ZWY_B 2ZX8_B 2WSP_A 2ZXA_A 2ZWZ_B 1ODU_B 1HL9_A 2ZX5_B ....
Probab=36.03 E-value=97 Score=35.62 Aligned_cols=75 Identities=15% Similarity=0.156 Sum_probs=36.4
Q ss_pred cCCCCCChhhHHHHHHHHHHcCCCEEE-E------eecCCccCCCCCeeeccCchh-HHHHHHHHHHCCceEEecCCccc
Q psy12907 198 FHYFRSPRYYWQDRLRKLRAAGLNAVS-T------YVEWSFHEPSPGQFQFTGDQD-LEYFLELAVREDLLVLLRPGPYI 269 (949)
Q Consensus 198 ~HY~R~p~~~W~d~L~k~Ka~GlNtV~-t------Yv~Wn~HEp~~G~fdF~g~~D-L~~fl~la~e~GL~VILRpGPYI 269 (949)
+.+....++.|. +.+|++|+.-|- | +-.|.-.-..-..-+-...+| +.+|.+.|+++||++-+=-.|
T Consensus 86 F~p~~fD~dqW~---~~ak~aGakY~VlTakHHDGF~LW~S~~t~~~v~~~~~krDiv~El~~A~rk~Glk~G~Y~S~-- 160 (346)
T PF01120_consen 86 FNPTKFDADQWA---KLAKDAGAKYVVLTAKHHDGFCLWPSKYTDYNVVNSGPKRDIVGELADACRKYGLKFGLYYSP-- 160 (346)
T ss_dssp ---TT--HHHHH---HHHHHTT-SEEEEEEE-TT--BSS--TT-SSBGGGGGGTS-HHHHHHHHHHHTT-EEEEEEES--
T ss_pred CCcccCCHHHHH---HHHHHcCCCEEEeehhhcCccccCCCCCCcccccCCCCCCCHHHHHHHHHHHcCCeEEEEecc--
Confidence 445577788885 578899998752 1 222443222111111112234 567999999999999763222
Q ss_pred ccccCCCCc
Q psy12907 270 CAERDFGGL 278 (949)
Q Consensus 270 cAEw~~GGl 278 (949)
++|.....
T Consensus 161 -~dw~~~~~ 168 (346)
T PF01120_consen 161 -WDWHHPDY 168 (346)
T ss_dssp -SSCCCTTT
T ss_pred -hHhcCccc
Confidence 35654333
No 173
>COG2159 Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]
Probab=35.82 E-value=1.4e+02 Score=33.67 Aligned_cols=70 Identities=24% Similarity=0.265 Sum_probs=41.1
Q ss_pred eeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCc
Q psy12907 190 PFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGP 267 (949)
Q Consensus 190 pf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGP 267 (949)
|-.+++...-..|-+...=++.-+.+++.|+--|.+ |....|.+-- ...+..+.+.|+++|+-|.+..|+
T Consensus 97 pdrf~~~~~v~p~~~~~a~~E~er~v~~~gf~g~~l------~p~~~~~~~~--~~~~~pi~~~a~~~gvpv~ihtG~ 166 (293)
T COG2159 97 PDRFVGFARVDPRDPEAAAEELERRVRELGFVGVKL------HPVAQGFYPD--DPRLYPIYEAAEELGVPVVIHTGA 166 (293)
T ss_pred CcceeeeeeeCCCchHHHHHHHHHHHHhcCceEEEe------cccccCCCCC--ChHHHHHHHHHHHcCCCEEEEeCC
Confidence 333444444444554333444455556678776665 2222222211 123789999999999999998776
No 174
>PF14587 Glyco_hydr_30_2: O-Glycosyl hydrolase family 30; PDB: 3CLW_B.
Probab=35.75 E-value=3.1e+02 Score=32.47 Aligned_cols=118 Identities=14% Similarity=0.205 Sum_probs=64.3
Q ss_pred CCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCCCe----eeec-CCHhHHHHHHHHHHH
Q psy12907 234 PSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPNI----TLRS-KDEVYQHYVNKWFAQ 308 (949)
Q Consensus 234 p~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~----~~Rt-~dp~y~~~v~r~~~~ 308 (949)
+..|.|||+....=..|++.|++.|...++-.. =..|.|+++ .... ...+ -.+...++-..|+..
T Consensus 93 ~~dg~yDW~~D~gQrwfL~~Ak~rGV~~f~aFS---------NSPP~~MT~-NG~~~g~~~~~~NLk~d~y~~FA~YLa~ 162 (384)
T PF14587_consen 93 PADGSYDWDADAGQRWFLKAAKERGVNIFEAFS---------NSPPWWMTK-NGSASGGDDGSDNLKPDNYDAFADYLAD 162 (384)
T ss_dssp -TTS-B-TTSSHHHHHHHHHHHHTT---EEEE----------SSS-GGGSS-SSSSB-S-SSS-SS-TT-HHHHHHHHHH
T ss_pred CCCCCcCCCCCHHHHHHHHHHHHcCCCeEEEee---------cCCCHHHhc-CCCCCCCCccccccChhHHHHHHHHHHH
Confidence 677999999877788899999999998876532 257888875 2110 0011 123445556667777
Q ss_pred HHHHHHhccccCCCceEEeccccccCCc-------cc------hHHHHHHHHHHHHHhcCCeeEEEeeCCC
Q psy12907 309 LFPRITRFLYGNGGPIILVQVENEMGSY-------TC------DKEHMIWLRDQMKYYVRDAAVLYTTDGG 366 (949)
Q Consensus 309 l~~~l~~~~~~~GGpII~vQIENEyG~y-------~~------d~~Y~~~L~~~~~~~lG~~vplft~Dg~ 366 (949)
+++.++. .|=+|=-+=-=||...- +| ..+.++.|...+++. |++..+-.+|..
T Consensus 163 Vv~~~~~----~GI~f~~IsP~NEP~~~W~~~~QEG~~~~~~e~a~vI~~L~~~L~~~-GL~t~I~~~Ea~ 228 (384)
T PF14587_consen 163 VVKHYKK----WGINFDYISPFNEPQWNWAGGSQEGCHFTNEEQADVIRALDKALKKR-GLSTKISACEAG 228 (384)
T ss_dssp HHHHHHC----TT--EEEEE--S-TTS-GG--SS-B----HHHHHHHHHHHHHHHHHH-T-S-EEEEEEES
T ss_pred HHHHHHh----cCCccceeCCcCCCCCCCCCCCcCCCCCCHHHHHHHHHHHHHHHHhc-CCCceEEecchh
Confidence 6666633 22123223333776421 22 368889999999887 998877777654
No 175
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=35.49 E-value=77 Score=35.90 Aligned_cols=67 Identities=9% Similarity=0.199 Sum_probs=48.7
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEeecCCccCC-----CCCeeeccCc--hhHHHHHHHHHHCCceEEecCCcccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEP-----SPGQFQFTGD--QDLEYFLELAVREDLLVLLRPGPYIC 270 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp-----~~G~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYIc 270 (949)
..+...+.++++|+.|+-+=.+.+-+..+.. .-|.|+|+-. -|..++++..++.|++|++..=|+|+
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~i~lD~~w~~~~~~~~~~~~f~wd~~~FPdp~~mi~~L~~~G~k~~~~v~P~v~ 95 (317)
T cd06598 22 NWQEVDDTIKTLREKDFPLDAAILDLYWFGKDIDKGHMGNLDWDRKAFPDPAGMIADLAKKGVKTIVITEPFVL 95 (317)
T ss_pred CHHHHHHHHHHHHHhCCCceEEEEechhhcCcccCCceeeeEeccccCCCHHHHHHHHHHcCCcEEEEEcCccc
Confidence 3677899999999999876555544333332 2345666532 48999999999999999988777775
No 176
>cd06599 GH31_glycosidase_Aec37 Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=34.87 E-value=92 Score=35.27 Aligned_cols=67 Identities=12% Similarity=0.069 Sum_probs=48.1
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCC---CCCeeeccCc--hhHHHHHHHHHHCCceEEecCCccccc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEP---SPGQFQFTGD--QDLEYFLELAVREDLLVLLRPGPYICA 271 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp---~~G~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYIcA 271 (949)
.+.-.+.++++++.++-+=.+.+-+....- ....|+|.-. -|..++++..++.|++|++..=|+|+.
T Consensus 28 q~~v~~~~~~~r~~~iP~d~i~ld~~~~~~~~~~~~~f~~d~~~FPdp~~mi~~L~~~g~k~~~~i~P~i~~ 99 (317)
T cd06599 28 QEALLEFIDKCREHDIPCDSFHLSSGYTSIEGGKRYVFNWNKDRFPDPAAFVAKFHERGIRLAPNIKPGLLQ 99 (317)
T ss_pred HHHHHHHHHHHHHcCCCeeEEEEeccccccCCCceeeeecCcccCCCHHHHHHHHHHCCCEEEEEeCCcccC
Confidence 456788999999999987555554333222 1234555422 489999999999999999988888864
No 177
>PRK00042 tpiA triosephosphate isomerase; Provisional
Probab=34.41 E-value=1.7e+02 Score=32.36 Aligned_cols=90 Identities=18% Similarity=0.275 Sum_probs=58.9
Q ss_pred HHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCC---------------
Q psy12907 212 LRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFG--------------- 276 (949)
Q Consensus 212 L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~G--------------- 276 (949)
.+++|++|++.|-+. |..++-.|.= .+..+.+=++.|.++||.+|++.|=.. .|-+.|
T Consensus 79 ~~mLkd~G~~~viiG-----HSERR~~f~E-td~~v~~K~~~a~~~gl~pIvCiGEt~-~~r~~g~~~~v~~~Ql~~~l~ 151 (250)
T PRK00042 79 AEMLKDLGVKYVIIG-----HSERRQYFGE-TDELVNKKVKAALKAGLTPILCVGETL-EEREAGKTEEVVARQLEAALA 151 (250)
T ss_pred HHHHHHCCCCEEEeC-----cccccCccCc-CHHHHHHHHHHHHHCCCEEEEEcCCcH-HHHHcCChHHHHHHHHHHHHc
Confidence 358999999988875 5545544432 345677777789999999999988421 011111
Q ss_pred ----------Cc---cccccccCCCeeeecCCHhHHHHHHHHHHHHHHHH
Q psy12907 277 ----------GL---PSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRI 313 (949)
Q Consensus 277 ----------Gl---P~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l 313 (949)
-+ |.|.-- - =++.+|...+.+.++++..+..+
T Consensus 152 ~~~~~~~~~~vIAYEPvWAIG-t----G~~as~~~~~~v~~~Ir~~l~~~ 196 (250)
T PRK00042 152 GLSAEQFANLVIAYEPVWAIG-T----GKTATPEQAQEVHAFIRAVLAEL 196 (250)
T ss_pred cCCHHHhCCEEEEECCHHHhC-C----CCCCCHHHHHHHHHHHHHHHHHh
Confidence 12 666643 1 14568888888888888766543
No 178
>PRK10076 pyruvate formate lyase II activase; Provisional
Probab=34.28 E-value=2.2e+02 Score=30.63 Aligned_cols=125 Identities=16% Similarity=0.100 Sum_probs=69.1
Q ss_pred hhHHHHHHHHHHcCCCE-EEE--eecCCccCC---CCC--eeeccC-------------chhHHHHHHHHHHCCceEEec
Q psy12907 206 YYWQDRLRKLRAAGLNA-VST--YVEWSFHEP---SPG--QFQFTG-------------DQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNt-V~t--Yv~Wn~HEp---~~G--~fdF~g-------------~~DL~~fl~la~e~GL~VILR 264 (949)
+.-.+.++++|+.|+.| ++| |++|...+. .-. -+|..+ ...+.+.++.+.+.|..+.+|
T Consensus 54 ~fl~~l~~~~k~~gi~~~leTnG~~~~~~~~~l~~~~D~~l~DiK~~d~~~~~~~tG~~~~~il~nl~~l~~~g~~v~iR 133 (213)
T PRK10076 54 EFATRFLQRLRLWGVSCAIETAGDAPASKLLPLAKLCDEVLFDLKIMDATQARDVVKMNLPRVLENLRLLVSEGVNVIPR 133 (213)
T ss_pred HHHHHHHHHHHHcCCCEEEECCCCCCHHHHHHHHHhcCEEEEeeccCCHHHHHHHHCCCHHHHHHHHHHHHhCCCcEEEE
Confidence 66789999999999976 555 555532221 111 233322 123445667777888777777
Q ss_pred CCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEecc-----------cccc
Q psy12907 265 PGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQV-----------ENEM 333 (949)
Q Consensus 265 pGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQI-----------ENEy 333 (949)
. | ..|++ ++++.-++++.+|++.+. +.++. ++-.+- +.++
T Consensus 134 ~-~-----------------vIPg~---nd~~e~i~~ia~~l~~l~--~~~~~------llpyh~~g~~Ky~~lg~~y~~ 184 (213)
T PRK10076 134 L-P-----------------LIPGF---TLSRENMQQALDVLIPLG--IKQIH------LLPFHQYGEPKYRLLGKTWSM 184 (213)
T ss_pred E-E-----------------EECCC---CCCHHHHHHHHHHHHHcC--CceEE------EecCCccchhHHHHcCCcCcc
Confidence 3 1 12222 345666666666665431 11110 011111 1122
Q ss_pred CCc-cchHHHHHHHHHHHHHhcCCeeEE
Q psy12907 334 GSY-TCDKEHMIWLRDQMKYYVRDAAVL 360 (949)
Q Consensus 334 G~y-~~d~~Y~~~L~~~~~~~lG~~vpl 360 (949)
... ..+.+.|+.+++.++++ |+.+.+
T Consensus 185 ~~~~~~~~~~l~~~~~~~~~~-gl~~~i 211 (213)
T PRK10076 185 KEVPAPSSADVATMREMAERA-GFQVTV 211 (213)
T ss_pred CCCCCcCHHHHHHHHHHHHHc-CCeEEe
Confidence 112 46789999999999987 887643
No 179
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=33.93 E-value=84 Score=41.50 Aligned_cols=65 Identities=9% Similarity=0.103 Sum_probs=44.0
Q ss_pred ceeEEEEEecCCCC--CCC-CcccccCCcceEEEEEECCEEEEEEEec-ceeeeecc--CCC-CCeEEEEEE
Q psy12907 520 FGFVLYETIIPDAR--FPD-PALLTISGLRDRGQVFVDEKLVTILYRN-KMLSTPIM--ARP-GQKLSILVE 584 (949)
Q Consensus 520 ~GyvlY~t~i~~~~--~~~-~~~L~i~~vrDrA~Vfvdg~~vG~l~r~-~~~~i~~~--~~~-~~~LdILVE 584 (949)
-|-..||+++.-+. .+. .+.|.+.+|.-.|.|+|||++||.-.-. ...++.++ .+. .++|.|.|-
T Consensus 119 n~~gwYrr~F~vp~~w~~~~rv~L~FeGV~~~a~VwvNG~~VG~~~g~~~pfefDIT~~l~~G~N~L~V~V~ 190 (1027)
T PRK09525 119 NPTGCYSLTFTVDESWLQSGQTRIIFDGVNSAFHLWCNGRWVGYSQDSRLPAEFDLSPFLRAGENRLAVMVL 190 (1027)
T ss_pred CCeEEEEEEEEeChhhcCCCeEEEEECeeccEEEEEECCEEEEeecCCCceEEEEChhhhcCCccEEEEEEE
Confidence 46779999874332 122 5678999999999999999999974322 23344443 122 367888884
No 180
>PF02055 Glyco_hydro_30: O-Glycosyl hydrolase family 30; InterPro: IPR001139 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 30 GH30 from CAZY comprises enzymes with only one known activity; glucosylceramidase (3.2.1.45 from EC). Family 30 encompasses the mammalian glucosylceramidases. Human acid beta-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase), cleaves the glucosidic bonds of glucosylceramide and synthetic beta-glucosides []. Any one of over 50 different mutations in the gene of glucocerebrosidase have been found to affect activity of this hydrolase, producing variants of Gaucher disease, the most prevalent lysosomal storage disease [, ].; GO: 0004348 glucosylceramidase activity, 0006665 sphingolipid metabolic process, 0007040 lysosome organization, 0005764 lysosome; PDB: 2VT0_B 1NOF_A 2Y24_A 2WCG_B 2J25_A 3GXM_D 1Y7V_B 2NT0_C 3GXF_C 3GXD_A ....
Probab=33.80 E-value=1.5e+02 Score=36.01 Aligned_cols=152 Identities=16% Similarity=0.251 Sum_probs=85.1
Q ss_pred HHHHHHHHHHC--CceEEecCCcccccccCCCCccccccccCCCe----eee-cCCHhHHHHHHHHHHHHHHHHHhcccc
Q psy12907 247 LEYFLELAVRE--DLLVLLRPGPYICAERDFGGLPSWLMTIKPNI----TLR-SKDEVYQHYVNKWFAQLFPRITRFLYG 319 (949)
Q Consensus 247 L~~fl~la~e~--GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~----~~R-t~dp~y~~~v~r~~~~l~~~l~~~~~~ 319 (949)
+..+|+.|++. +|+++.-| |. .|+|++. ..++ .++ ...+.|.++..+||.+.++..++
T Consensus 155 ~ip~ik~a~~~~~~lki~aSp-------WS---pP~WMKt-n~~~~g~g~l~g~~~~~y~~~yA~Y~vkfi~aY~~---- 219 (496)
T PF02055_consen 155 KIPLIKEALAINPNLKIFASP-------WS---PPAWMKT-NGSMNGGGSLKGSLGDEYYQAYADYFVKFIQAYKK---- 219 (496)
T ss_dssp HHHHHHHHHHHHTT-EEEEEE-------S------GGGBT-TSSSCSS-BBSCGTTSHHHHHHHHHHHHHHHHHHC----
T ss_pred HHHHHHHHHHhCCCcEEEEec-------CC---CCHHHcc-CCcCcCCCccCCCCCchhHHHHHHHHHHHHHHHHH----
Confidence 34677766664 57777766 64 7999987 3222 344 23457888878887777777664
Q ss_pred CCCceEEeccccccCC-------c-c------chHHHHHH-HHHHHHHhcCC--eeEEEeeCCCCc---cccccccC--C
Q psy12907 320 NGGPIILVQVENEMGS-------Y-T------CDKEHMIW-LRDQMKYYVRD--AAVLYTTDGGGV---GYLKCTVP--G 377 (949)
Q Consensus 320 ~GGpII~vQIENEyG~-------y-~------~d~~Y~~~-L~~~~~~~lG~--~vplft~Dg~~~---~~l~~~~~--g 377 (949)
+|=||=++-+-||... | . ..++|+.. |.-+++++ |. ++-|+..|.... .+...-+. .
T Consensus 220 ~GI~i~aiT~QNEP~~~~~~~~~~~s~~~t~~~~~~Fi~~~LgP~l~~~-~~g~d~kI~~~D~n~~~~~~~~~~il~d~~ 298 (496)
T PF02055_consen 220 EGIPIWAITPQNEPDNGSDPNYPWPSMGWTPEEQADFIKNYLGPALRKA-GLGKDVKILIYDHNRDNLPDYADTILNDPE 298 (496)
T ss_dssp TT--ESEEESSSSCCGGGSTT-SSC--B--HHHHHHHHHHTHHHHHHTS-TT-TTSEEEEEEEEGGGTTHHHHHHHTSHH
T ss_pred CCCCeEEEeccCCCCCCCCCCCCCCcCCCCHHHHHHHHHHHHHHHHHhc-CCCCceEEEEEecCCcccchhhhhhhcChh
Confidence 4558889999999752 2 1 13677653 77778776 54 788877663211 11111111 1
Q ss_pred ---eee--eec-CCCCCChHHHHHHhhhcCCCCCeeeeecCCCc
Q psy12907 378 ---VYA--TVD-FGAATNVSSAFAAMRTVSPHGPLVNSEYYPGW 415 (949)
Q Consensus 378 ---~~~--t~n-fg~~~~~~~~f~~~~~~~p~~P~~~~Ef~~GW 415 (949)
.+. ..+ ++ +...........+..|++.++.+|-..|.
T Consensus 299 A~~yv~GiA~HwY~-g~~~~~~l~~~h~~~P~k~l~~TE~~~g~ 341 (496)
T PF02055_consen 299 AAKYVDGIAFHWYG-GDPSPQALDQVHNKFPDKFLLFTEACCGS 341 (496)
T ss_dssp HHTTEEEEEEEETT-CS-HCHHHHHHHHHSTTSEEEEEEEESS-
T ss_pred hHhheeEEEEECCC-CCchhhHHHHHHHHCCCcEEEeeccccCC
Confidence 111 112 22 11122345555667899999999987663
No 181
>cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself.
Probab=33.65 E-value=2.5e+02 Score=31.69 Aligned_cols=59 Identities=14% Similarity=0.184 Sum_probs=45.4
Q ss_pred CChhhHHHHHHHHHHcCCCEEEEeec----CCccCCC------CC----------eeeccCchhHHHHHHHHHHCCceEE
Q psy12907 203 SPRYYWQDRLRKLRAAGLNAVSTYVE----WSFHEPS------PG----------QFQFTGDQDLEYFLELAVREDLLVL 262 (949)
Q Consensus 203 ~p~~~W~d~L~k~Ka~GlNtV~tYv~----Wn~HEp~------~G----------~fdF~g~~DL~~fl~la~e~GL~VI 262 (949)
.+.+..++.++.|...++|++..++- |.+--+. +| .|. ..|+.++++.|++.|+.||
T Consensus 13 ~~~~~lk~~id~ma~~K~N~lhlHl~D~~~~~le~~~~p~l~~~g~~~~~~~~~~~yT---~~di~elv~yA~~rgI~vi 89 (303)
T cd02742 13 LSVESIKRTIDVLARYKINTFHWHLTDDQAWRIESKKFPELAEKGGQINPRSPGGFYT---YAQLKDIIEYAAARGIEVI 89 (303)
T ss_pred cCHHHHHHHHHHHHHhCCcEEEEeeecCCCceEeeCccchhhhhcccccCCCCCCeEC---HHHHHHHHHHHHHcCCEEE
Confidence 35678899999999999999999977 7554221 12 222 3689999999999999998
Q ss_pred ec
Q psy12907 263 LR 264 (949)
Q Consensus 263 LR 264 (949)
--
T Consensus 90 PE 91 (303)
T cd02742 90 PE 91 (303)
T ss_pred Ee
Confidence 44
No 182
>cd06603 GH31_GANC_GANAB_alpha This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
Probab=33.57 E-value=80 Score=36.09 Aligned_cols=67 Identities=9% Similarity=0.037 Sum_probs=50.2
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCc--hhHHHHHHHHHHCCceEEecCCcccccc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGD--QDLEYFLELAVREDLLVLLRPGPYICAE 272 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYIcAE 272 (949)
.+.-++.++++++.|+.+=.+.+-+... ...+.|+|+-. -|..++++..++.|++|++..=|+|+.+
T Consensus 23 ~~ev~~~~~~~~~~~iP~d~i~lD~~~~-~~~~~f~~d~~~FPdp~~mi~~L~~~G~k~~~~~~P~v~~~ 91 (339)
T cd06603 23 QEDVKEVDAGFDEHDIPYDVIWLDIEHT-DGKRYFTWDKKKFPDPEKMQEKLASKGRKLVTIVDPHIKRD 91 (339)
T ss_pred HHHHHHHHHHHHHcCCCceEEEEChHHh-CCCCceEeCcccCCCHHHHHHHHHHCCCEEEEEecCceecC
Confidence 5667889999999998875554443322 34556666542 3899999999999999999988999854
No 183
>PRK10985 putative hydrolase; Provisional
Probab=33.41 E-value=23 Score=39.68 Aligned_cols=31 Identities=16% Similarity=0.364 Sum_probs=20.3
Q ss_pred CeEECCCCCCchhh-hh-hhhhhcccccccccc
Q psy12907 614 PLVFLPPVSGTADI-FF-HLMLDLSRKGYRTLS 644 (949)
Q Consensus 614 ~v~~l~~~~g~a~~-~~-~~~~~l~~kgyR~is 644 (949)
.|+++||..|++.. |. ..+-.|+++||||+.
T Consensus 60 ~vll~HG~~g~~~~~~~~~~~~~l~~~G~~v~~ 92 (324)
T PRK10985 60 RLVLFHGLEGSFNSPYAHGLLEAAQKRGWLGVV 92 (324)
T ss_pred EEEEeCCCCCCCcCHHHHHHHHHHHHCCCEEEE
Confidence 47778888877554 32 244456677888885
No 184
>COG0366 AmyA Glycosidases [Carbohydrate transport and metabolism]
Probab=32.78 E-value=58 Score=38.15 Aligned_cols=55 Identities=20% Similarity=0.338 Sum_probs=39.0
Q ss_pred HHHHHHHHcCCCEEEE-eec---CCccCCCCCee-----eccCchhHHHHHHHHHHCCceEEec
Q psy12907 210 DRLRKLRAAGLNAVST-YVE---WSFHEPSPGQF-----QFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 210 d~L~k~Ka~GlNtV~t-Yv~---Wn~HEp~~G~f-----dF~g~~DL~~fl~la~e~GL~VILR 264 (949)
+.|.-+|.+|+++|-+ .++ -..|--..-.| .|....|+.++++.|++.||+||+-
T Consensus 33 ~~LdYl~~LGv~aiwl~Pi~~s~~~~~gY~~~Dy~~id~~~Gt~~d~~~li~~~H~~gi~vi~D 96 (505)
T COG0366 33 EKLDYLKELGVDAIWLSPIFESPQADHGYDVSDYTKVDPHFGTEEDFKELVEEAHKRGIKVILD 96 (505)
T ss_pred HhhhHHHHhCCCEEEeCCCCCCCccCCCccccchhhcCcccCCHHHHHHHHHHHHHCCCEEEEE
Confidence 7888999999999943 222 11221111000 5777899999999999999999865
No 185
>PF00975 Thioesterase: Thioesterase domain; InterPro: IPR001031 Thioesterase domains often occur integrated in or associated with peptide synthetases which are involved in the non-ribosomal synthesis of peptide antibiotics []. Thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates.; GO: 0016788 hydrolase activity, acting on ester bonds, 0009058 biosynthetic process; PDB: 2RON_A 2K2Q_B 3LCR_B 2HFJ_B 1MNQ_A 1MN6_B 1MNA_B 2HFK_B 2H7Y_B 2H7X_A ....
Probab=32.77 E-value=17 Score=37.74 Aligned_cols=30 Identities=27% Similarity=0.434 Sum_probs=22.6
Q ss_pred CeEECCCCCCchhhhhhhhhhccccccccc
Q psy12907 614 PLVFLPPVSGTADIFFHLMLDLSRKGYRTL 643 (949)
Q Consensus 614 ~v~~l~~~~g~a~~~~~~~~~l~~kgyR~i 643 (949)
+|+|+|++.|++.+|...+-.|...++.|+
T Consensus 2 ~lf~~p~~gG~~~~y~~la~~l~~~~~~v~ 31 (229)
T PF00975_consen 2 PLFCFPPAGGSASSYRPLARALPDDVIGVY 31 (229)
T ss_dssp EEEEESSTTCSGGGGHHHHHHHTTTEEEEE
T ss_pred eEEEEcCCccCHHHHHHHHHhCCCCeEEEE
Confidence 588999999999888777766665345555
No 186
>KOG1689|consensus
Probab=32.69 E-value=30 Score=36.15 Aligned_cols=41 Identities=17% Similarity=0.543 Sum_probs=31.4
Q ss_pred EEcCccccccc-CCCCCeeeEEeccCCCCCCCCCceEEEEEee
Q psy12907 881 FINEHNLGKYW-TTLGPQLTLYLPAPFIKPYPEVNRITVLELQ 922 (949)
Q Consensus 881 ~vng~nlGryw-~~~gpq~tly~P~~~l~~~~g~n~i~~~e~~ 922 (949)
|.=|-++|-.| |.--|-+.=|+|+++-|||+- -++.++.|.
T Consensus 131 wtv~ecig~WWRPNFe~~~YPyiP~hitkPKeh-~kL~lV~L~ 172 (221)
T KOG1689|consen 131 WTVGECIGNWWRPNFETPMYPYIPPHITKPKEH-TKLFLVQLP 172 (221)
T ss_pred ccHhhhhhcccCCCCCCcccCCCCcccCCchhc-cEEEEEEcc
Confidence 45567899999 456777888999999988764 667776664
No 187
>PRK12858 tagatose 1,6-diphosphate aldolase; Reviewed
Probab=32.07 E-value=41 Score=38.81 Aligned_cols=64 Identities=17% Similarity=0.123 Sum_probs=44.8
Q ss_pred cCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEec
Q psy12907 198 FHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 198 ~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILR 264 (949)
.++ |.|...=.-..+.++++|.++|.+.++|.-.++. +-+-.-..+|.++.+.|+++||-+++-
T Consensus 99 ~~g-r~~~~~~~~sve~a~~~GAdAVk~lv~~~~d~~~--~~~~~~~~~l~rv~~ec~~~giPlllE 162 (340)
T PRK12858 99 APG-RLPDLLDNWSVRRIKEAGADAVKLLLYYRPDEDD--AINDRKHAFVERVGAECRANDIPFFLE 162 (340)
T ss_pred CCC-CCccccccccHHHHHHcCCCEEEEEEEeCCCcch--HHHHHHHHHHHHHHHHHHHcCCceEEE
Confidence 345 7664433334677999999999999999954331 001112346999999999999999875
No 188
>cd06600 GH31_MGAM-like This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal do
Probab=31.93 E-value=88 Score=35.47 Aligned_cols=67 Identities=9% Similarity=0.161 Sum_probs=49.5
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCc--hhHHHHHHHHHHCCceEEecCCccccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGD--QDLEYFLELAVREDLLVLLRPGPYICA 271 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYIcA 271 (949)
..+..++.++++++.++-.=.+.+-+.... .-+.|+|+-. -|..++++..++.|++|++..=|+|+.
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~i~lD~~~~~-~~~~f~~d~~~FPdp~~~i~~l~~~g~k~~~~~~P~i~~ 90 (317)
T cd06600 22 PQDKVVEVVDIMQKEGFPYDVVFLDIHYMD-SYRLFTWDPYRFPEPKKLIDELHKRNVKLVTIVDPGIRV 90 (317)
T ss_pred CHHHHHHHHHHHHHcCCCcceEEEChhhhC-CCCceeechhcCCCHHHHHHHHHHCCCEEEEEeeccccC
Confidence 366789999999999987644444332222 3356666542 489999999999999999888888874
No 189
>PF01261 AP_endonuc_2: Xylose isomerase-like TIM barrel; InterPro: IPR012307 This TIM alpha/beta barrel structure is found in xylose isomerase (P19148 from SWISSPROT) and in endonuclease IV (P12638 from SWISSPROT, 3.1.21.2 from EC). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae []. ; PDB: 3KWS_B 3DX5_A 3CQH_B 3CQI_A 3CQK_A 3CQJ_B 2G0W_B 1DXI_A 2ZDS_D 3TVA_B ....
Probab=31.69 E-value=1.9e+02 Score=29.14 Aligned_cols=105 Identities=16% Similarity=0.125 Sum_probs=63.1
Q ss_pred hhhHHHHHHHHHHcCCCEEEEe--ecCCccCCC----CCeeeccCchhHHHHHHHHHHCCceEE-ecCCcccccccCCCC
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTY--VEWSFHEPS----PGQFQFTGDQDLEYFLELAVREDLLVL-LRPGPYICAERDFGG 277 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tY--v~Wn~HEp~----~G~fdF~g~~DL~~fl~la~e~GL~VI-LRpGPYIcAEw~~GG 277 (949)
....++..+.+++.|+..+... ..|...... +... -.....+.+.+++|++.|...+ +.+|.
T Consensus 26 ~~~~~~~~~~~~~~gl~i~~~~~~~~~~~~~~~~~~~~~~r-~~~~~~~~~~i~~a~~lg~~~i~~~~g~---------- 94 (213)
T PF01261_consen 26 DDEAEELRRLLEDYGLKIASLHPPTNFWSPDEENGSANDER-EEALEYLKKAIDLAKRLGAKYIVVHSGR---------- 94 (213)
T ss_dssp HHHHHHHHHHHHHTTCEEEEEEEEESSSCTGTTSTTSSSHH-HHHHHHHHHHHHHHHHHTBSEEEEECTT----------
T ss_pred hHHHHHHHHHHHHcCCeEEEEecccccccccccccCcchhh-HHHHHHHHHHHHHHHHhCCCceeecCcc----------
Confidence 3456677888899999966544 444443211 1111 1123469999999999998875 44442
Q ss_pred ccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCc
Q psy12907 278 LPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSY 336 (949)
Q Consensus 278 lP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y 336 (949)
.+.+- ......-.+.+.+.++++.++.+++- +.+-+||..+..
T Consensus 95 ~~~~~---------~~~~~~~~~~~~~~l~~l~~~a~~~g-------v~i~lE~~~~~~ 137 (213)
T PF01261_consen 95 YPSGP---------EDDTEENWERLAENLRELAEIAEEYG-------VRIALENHPGPF 137 (213)
T ss_dssp ESSST---------TSSHHHHHHHHHHHHHHHHHHHHHHT-------SEEEEE-SSSSS
T ss_pred ccccc---------CCCHHHHHHHHHHHHHHHHhhhhhhc-------ceEEEecccCcc
Confidence 00000 11233456677777888888887653 457889888765
No 190
>KOG2230|consensus
Probab=31.57 E-value=2.1e+02 Score=35.37 Aligned_cols=113 Identities=18% Similarity=0.210 Sum_probs=73.8
Q ss_pred CCceEeCCeeeEEeccccCC-----CCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHH
Q psy12907 181 RDTFVKDGKPFRYVSGSFHY-----FRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAV 255 (949)
Q Consensus 181 ~~~f~~dGkpf~i~sGe~HY-----~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~ 255 (949)
+-.|.+++.|.++.++.--+ .|...+.-+-.|+-++++|+|++++. . - |...-+.|-++|.
T Consensus 327 nfyfkin~~pvflkg~nwip~s~f~dr~t~~~~~~LL~Sv~e~~MN~lRVW---G-----G------GvYEsd~FY~lad 392 (867)
T KOG2230|consen 327 NFYFKINDEPVFLKGTNWIPVSMFRDRENIAKTEFLLDSVAEVGMNMLRVW---G-----G------GVYESDYFYQLAD 392 (867)
T ss_pred eeEEEEcCcEEEeecCCccChHHHHhhHHHHHHHHHHHHHHHhCcceEEEe---c-----C------ccccchhHHHHhh
Confidence 34678899999888776432 24445667778999999999999873 2 1 2334568999999
Q ss_pred HCCceEEecCCcccccccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccc
Q psy12907 256 REDLLVLLRPGPYICAERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENE 332 (949)
Q Consensus 256 e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENE 332 (949)
+.||.|---. =+.| .+-..+..|++.|+.=.+.=+.+|+.|- .||.+-=.||
T Consensus 393 ~lGilVWQD~-MFAC-------------------AlYPt~~eFl~sv~eEV~yn~~Rls~Hp-----SviIfsgNNE 444 (867)
T KOG2230|consen 393 SLGILVWQDM-MFAC-------------------ALYPTNDEFLSSVREEVRYNAMRLSHHP-----SVIIFSGNNE 444 (867)
T ss_pred hccceehhhh-HHHh-------------------hcccCcHHHHHHHHHHHHHHHHhhccCC-----eEEEEeCCCc
Confidence 9998872110 0122 2334578899888876666556665442 4555544343
No 191
>PF11324 DUF3126: Protein of unknown function (DUF3126); InterPro: IPR021473 This family of proteins with unknown function appear to be restricted to Alphaproteobacteria.
Probab=31.53 E-value=54 Score=29.10 Aligned_cols=21 Identities=19% Similarity=0.689 Sum_probs=19.1
Q ss_pred CcceEEEEEECCEEEEEEEec
Q psy12907 544 GLRDRGQVFVDEKLVTILYRN 564 (949)
Q Consensus 544 ~vrDrA~Vfvdg~~vG~l~r~ 564 (949)
+-.|.|-||++++++|+++|+
T Consensus 25 k~~dsaEV~~g~EfiGvi~~D 45 (63)
T PF11324_consen 25 KKDDSAEVYIGDEFIGVIYRD 45 (63)
T ss_pred CCCCceEEEeCCEEEEEEEee
Confidence 457999999999999999996
No 192
>cd06601 GH31_lyase_GLase GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 (GH31) enzymes that degrade alpha-1,4-glucans and maltooligosaccharides via a nonhydrolytic pathway to yield 1,5-D-anhydrofructose from the nonreducing end. GLases cleave the bond between C1 and O1 of the nonreducing sugar residue of alpha-glucans to generate a monosaccharide product with a double bond between C1 and C2. This family corresponds to subgroup 2 in the Ernst et al classification of GH31 enzymes.
Probab=30.41 E-value=2.4e+02 Score=32.54 Aligned_cols=66 Identities=14% Similarity=0.156 Sum_probs=52.0
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCc--hhHHHHHHHHHHCCceEEecCCcccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGD--QDLEYFLELAVREDLLVLLRPGPYIC 270 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYIc 270 (949)
..+..++.++++++.++-+=.+++-|..+. .-+.|.|+.. -|..++++..++.|+++++..=|+|.
T Consensus 22 ~~~ev~~v~~~~r~~~IP~D~i~lDidy~~-~~~~Ft~d~~~FPdp~~mv~~L~~~G~klv~~i~P~i~ 89 (332)
T cd06601 22 NRSDLEEVVEGYRDNNIPLDGLHVDVDFQD-NYRTFTTNGGGFPNPKEMFDNLHNKGLKCSTNITPVIS 89 (332)
T ss_pred CHHHHHHHHHHHHHcCCCCceEEEcCchhc-CCCceeecCCCCCCHHHHHHHHHHCCCeEEEEecCcee
Confidence 367889999999999987766666665554 3466777543 48899999999999999888778887
No 193
>cd06563 GH20_chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. This GH20 domain family includes an N-acetylglucosamidase (GlcNAcase A) from Pseudoalteromonas piscicida and an N-acetylhexosaminidase (SpHex) from Streptomyces plicatus. SpHex lacks the C-terminal PKD (polycystic kidney disease I)-like domain found in the chitobiases. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=29.10 E-value=6e+02 Score=29.35 Aligned_cols=59 Identities=12% Similarity=0.172 Sum_probs=44.3
Q ss_pred CChhhHHHHHHHHHHcCCCEEEEeec----CCccCC----------------------------CCCeeeccCchhHHHH
Q psy12907 203 SPRYYWQDRLRKLRAAGLNAVSTYVE----WSFHEP----------------------------SPGQFQFTGDQDLEYF 250 (949)
Q Consensus 203 ~p~~~W~d~L~k~Ka~GlNtV~tYv~----Wn~HEp----------------------------~~G~fdF~g~~DL~~f 250 (949)
+|.+..++.+..|...++|+.+.++- |.+--+ ..|.|. ..|+.++
T Consensus 15 ~~~~~ik~~Id~ma~~K~N~lhlHltDdq~~rle~~~~P~Lt~~ga~~~~~~~~~~~~~~~~~~~~~~YT---~~di~ei 91 (357)
T cd06563 15 FPVDEVKRFIDLMALYKLNVFHWHLTDDQGWRIEIKKYPKLTEVGAWRGPTEIGLPQGGGDGTPYGGFYT---QEEIREI 91 (357)
T ss_pred cCHHHHHHHHHHHHHhccceEEEeeecCCCceecccCcchhhhcccccCcccccccccccCCCccCceEC---HHHHHHH
Confidence 36888999999999999999988863 432211 112232 4689999
Q ss_pred HHHHHHCCceEEec
Q psy12907 251 LELAVREDLLVLLR 264 (949)
Q Consensus 251 l~la~e~GL~VILR 264 (949)
++.|++.|+.||--
T Consensus 92 v~yA~~rgI~VIPE 105 (357)
T cd06563 92 VAYAAERGITVIPE 105 (357)
T ss_pred HHHHHHcCCEEEEe
Confidence 99999999999854
No 194
>TIGR01838 PHA_synth_I poly(R)-hydroxyalkanoic acid synthase, class I. This model represents the class I subfamily of poly(R)-hydroxyalkanoate synthases, which polymerizes hydroxyacyl-CoAs with three to five carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of cell.
Probab=28.24 E-value=39 Score=41.27 Aligned_cols=66 Identities=24% Similarity=0.303 Sum_probs=44.6
Q ss_pred CCCeEECCCCCCchhhhhh-----hhhhcccccccccc------------------------------------------
Q psy12907 612 RSPLVFLPPVSGTADIFFH-----LMLDLSRKGYRTLS------------------------------------------ 644 (949)
Q Consensus 612 ~~~v~~l~~~~g~a~~~~~-----~~~~l~~kgyR~is------------------------------------------ 644 (949)
..||+|+|+....+-|+-. .+-.|.++||||+.
T Consensus 188 ~~PlLiVp~~i~k~yilDL~p~~Slv~~L~~qGf~V~~iDwrgpg~s~~~~~~ddY~~~~i~~al~~v~~~~g~~kv~lv 267 (532)
T TIGR01838 188 KTPLLIVPPWINKYYILDLRPQNSLVRWLVEQGHTVFVISWRNPDASQADKTFDDYIRDGVIAALEVVEAITGEKQVNCV 267 (532)
T ss_pred CCcEEEECcccccceeeecccchHHHHHHHHCCcEEEEEECCCCCcccccCChhhhHHHHHHHHHHHHHHhcCCCCeEEE
Confidence 4578888888877777642 33446678888862
Q ss_pred --cCCcchhh----cccccccccCceeecccccccccccc
Q psy12907 645 --YLGGFLAQ----KFAEHTVLRPRVVSLFLCNSFYDTSV 678 (949)
Q Consensus 645 --~lg~~l~q----~f~~~~~~~~~v~sl~lcn~~~dt~~ 678 (949)
++||.++. .++... ..+.|+|++++++-+|.+.
T Consensus 268 G~cmGGtl~a~ala~~aa~~-~~~rv~slvll~t~~Df~~ 306 (532)
T TIGR01838 268 GYCIGGTLLSTALAYLAARG-DDKRIKSATFFTTLLDFSD 306 (532)
T ss_pred EECcCcHHHHHHHHHHHHhC-CCCccceEEEEecCcCCCC
Confidence 68998752 233332 1257999999999888654
No 195
>COG2890 HemK Methylase of polypeptide chain release factors [Translation, ribosomal structure and biogenesis]
Probab=28.19 E-value=3.9e+02 Score=29.98 Aligned_cols=125 Identities=16% Similarity=0.118 Sum_probs=79.0
Q ss_pred CCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCcccc
Q psy12907 202 RSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSW 281 (949)
Q Consensus 202 R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~W 281 (949)
-+.++-++=.-+.++..|+.-+..-.. .+.++-+|+||. |+.=.|||-+| +....|.+
T Consensus 141 Dis~~Al~~A~~Na~~~~l~~~~~~~~-dlf~~~~~~fDl--------------------IVsNPPYip~~-~~~~~~~~ 198 (280)
T COG2890 141 DISPDALALARENAERNGLVRVLVVQS-DLFEPLRGKFDL--------------------IVSNPPYIPAE-DPELLPEV 198 (280)
T ss_pred ECCHHHHHHHHHHHHHcCCccEEEEee-ecccccCCceeE--------------------EEeCCCCCCCc-ccccChhh
Confidence 345667777777888889722223333 778888886665 66666999999 77778887
Q ss_pred ccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCccchHHHHHHHHHHHHHhcC-CeeEE
Q psy12907 282 LMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSYTCDKEHMIWLRDQMKYYVR-DAAVL 360 (949)
Q Consensus 282 L~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y~~d~~Y~~~L~~~~~~~lG-~~vpl 360 (949)
+.. .|...+-. .+.-++..+++++.+-..+++ |-++++.+ +....+.+++++.+. | ....-
T Consensus 199 ~~~-EP~~Al~~-g~dGl~~~~~i~~~a~~~l~~------~g~l~le~---------g~~q~~~v~~~~~~~-~~~~~v~ 260 (280)
T COG2890 199 VRY-EPLLALVG-GGDGLEVYRRILGEAPDILKP------GGVLILEI---------GLTQGEAVKALFEDT-GFFEIVE 260 (280)
T ss_pred hcc-CHHHHHcc-CccHHHHHHHHHHhhHHHcCC------CcEEEEEE---------CCCcHHHHHHHHHhc-CCceEEE
Confidence 766 55433332 335677777766666655555 44666665 223367888888876 6 44444
Q ss_pred EeeCCC
Q psy12907 361 YTTDGG 366 (949)
Q Consensus 361 ft~Dg~ 366 (949)
+..|..
T Consensus 261 ~~~d~~ 266 (280)
T COG2890 261 TLKDLF 266 (280)
T ss_pred EEecCC
Confidence 555543
No 196
>TIGR01836 PHA_synth_III_C poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit. This model represents the PhaC subunit of a heterodimeric form of polyhydroxyalkanoic acid (PHA) synthase. Excepting the PhaC of Bacillus megaterium (which needs PhaR), all members require PhaE (TIGR01834) for activity and are designated class III. This enzyme builds ester polymers for carbon and energy storage that accumulate in inclusions, and both this enzyme and the depolymerase associate with the inclusions. Class III enzymes polymerize short-chain-length hydroxyalkanoates.
Probab=27.88 E-value=31 Score=38.97 Aligned_cols=31 Identities=16% Similarity=0.140 Sum_probs=22.2
Q ss_pred cCCcchhhcccccccccCceeeccccccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFYDTS 677 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~dt~ 677 (949)
|.||.++..|+...+ +.|++++++++..|..
T Consensus 144 S~GG~i~~~~~~~~~--~~v~~lv~~~~p~~~~ 174 (350)
T TIGR01836 144 CQGGTFSLCYAALYP--DKIKNLVTMVTPVDFE 174 (350)
T ss_pred CHHHHHHHHHHHhCc--hheeeEEEeccccccC
Confidence 688888888776554 4578888888776643
No 197
>PRK09856 fructoselysine 3-epimerase; Provisional
Probab=27.86 E-value=83 Score=34.12 Aligned_cols=60 Identities=13% Similarity=0.078 Sum_probs=40.6
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCC----CeeeccCchhHHHHHHHHHHCCceEEecC-Ccc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSP----GQFQFTGDQDLEYFLELAVREDLLVLLRP-GPY 268 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~----G~fdF~g~~DL~~fl~la~e~GL~VILRp-GPY 268 (949)
.+.+++.++.++++|..+|.+- ..+.... -.+... ...|.++.++|++.|+.+.+-| +|+
T Consensus 89 ~~~~~~~i~~a~~lGa~~i~~~---~~~~~~~~~~~~~~~~~-~~~l~~l~~~a~~~gv~l~iE~~~~~ 153 (275)
T PRK09856 89 LDMIKLAMDMAKEMNAGYTLIS---AAHAGYLTPPNVIWGRL-AENLSELCEYAENIGMDLILEPLTPY 153 (275)
T ss_pred HHHHHHHHHHHHHhCCCEEEEc---CCCCCCCCCHHHHHHHH-HHHHHHHHHHHHHcCCEEEEecCCCC
Confidence 3577788999999999998552 2232111 112221 2468999999999999998886 343
No 198
>PLN02877 alpha-amylase/limit dextrinase
Probab=27.47 E-value=1.1e+02 Score=40.05 Aligned_cols=20 Identities=20% Similarity=0.242 Sum_probs=18.4
Q ss_pred hhHHHHHHHHHHCCceEEec
Q psy12907 245 QDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 245 ~DL~~fl~la~e~GL~VILR 264 (949)
+++.++++.|+++||.|||-
T Consensus 466 ~efk~mV~~lH~~GI~VImD 485 (970)
T PLN02877 466 IEFRKMVQALNRIGLRVVLD 485 (970)
T ss_pred HHHHHHHHHHHHCCCEEEEE
Confidence 46999999999999999987
No 199
>cd06595 GH31_xylosidase_XylS-like This family represents an uncharacterized glycosyl hydrolase family 31 (GH31) enzyme found in bacteria and eukaryotes that is related to the XylS xylosidase of Sulfolobus solfataricus. Alpha-xylosidases catalyze the release of an alpha-xylose residue from the non-reducing end of alpha-xyloside substrates. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=27.43 E-value=1.5e+02 Score=33.28 Aligned_cols=66 Identities=12% Similarity=0.124 Sum_probs=47.4
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEeecCCccCC--------CCCeeeccCc--hhHHHHHHHHHHCCceEEecCCccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEP--------SPGQFQFTGD--QDLEYFLELAVREDLLVLLRPGPYI 269 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp--------~~G~fdF~g~--~DL~~fl~la~e~GL~VILRpGPYI 269 (949)
..+.-++.++++|+.|+-+=.+++=...|.- .-+.|+|+-. -|..++++..++.|++|++..=|+|
T Consensus 23 s~~ev~~v~~~~r~~~iP~D~i~lD~dw~~~~~~~~~~~~~~~ft~d~~~FPdp~~mi~~Lh~~G~k~v~~v~P~~ 98 (292)
T cd06595 23 SDEEYLALMDRFKKHNIPLDVLVIDMDWHVTDIPSKYGSGWTGYSWNRKLFPDPEKLLQDLHDRGLKVTLNLHPAD 98 (292)
T ss_pred CHHHHHHHHHHHHHhCCCccEEEEecccccccccccccCCcceeEEChhcCCCHHHHHHHHHHCCCEEEEEeCCCc
Confidence 4677889999999999887555554333321 2246666532 4899999999999999988765554
No 200
>smart00854 PGA_cap Bacterial capsule synthesis protein PGA_cap. This protein is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein.
Probab=27.30 E-value=8e+02 Score=26.36 Aligned_cols=51 Identities=25% Similarity=0.366 Sum_probs=36.0
Q ss_pred CCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE
Q psy12907 199 HYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL 262 (949)
Q Consensus 199 HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI 262 (949)
..|+.|++ .++.|+++|++.+.+- |=| .||+ |..-|.+.++.+++.|+..+
T Consensus 57 ~~f~~~~~----~~~~l~~~G~d~~~la---NNH-----~fD~-G~~gl~~t~~~l~~a~i~~~ 107 (239)
T smart00854 57 PNFRAPPE----NAAALKAAGFDVVSLA---NNH-----SLDY-GEEGLLDTLAALDAAGIAHV 107 (239)
T ss_pred eEecCCHH----HHHHHHHhCCCEEEec---cCc-----cccc-chHHHHHHHHHHHHCCCCEe
Confidence 34566654 5789999999988774 234 4666 33458888888888888765
No 201
>KOG4039|consensus
Probab=27.22 E-value=95 Score=33.22 Aligned_cols=84 Identities=18% Similarity=0.283 Sum_probs=45.3
Q ss_pred ccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCee-eccCc--hhHHHHHHHHHHCCceEEecCCccccc
Q psy12907 195 SGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQF-QFTGD--QDLEYFLELAVREDLLVLLRPGPYICA 271 (949)
Q Consensus 195 sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~f-dF~g~--~DL~~fl~la~e~GL~VILRpGPYIcA 271 (949)
+|.-...|+..+|--...+.+|+-||.++-.--.=..|-...=-| .-.|+ +|+.+ + +-.-++|+||||..|
T Consensus 98 aGadgfykvDhDyvl~~A~~AKe~Gck~fvLvSS~GAd~sSrFlY~k~KGEvE~~v~e---L--~F~~~~i~RPG~ll~- 171 (238)
T KOG4039|consen 98 AGADGFYKVDHDYVLQLAQAAKEKGCKTFVLVSSAGADPSSRFLYMKMKGEVERDVIE---L--DFKHIIILRPGPLLG- 171 (238)
T ss_pred cccCceEeechHHHHHHHHHHHhCCCeEEEEEeccCCCcccceeeeeccchhhhhhhh---c--cccEEEEecCcceec-
Confidence 444455689999999999999999998864322222221111000 11221 22211 0 123468999999877
Q ss_pred ccCCCCccccccc
Q psy12907 272 ERDFGGLPSWLMT 284 (949)
Q Consensus 272 Ew~~GGlP~WL~~ 284 (949)
|...--.-.||.+
T Consensus 172 ~R~esr~geflg~ 184 (238)
T KOG4039|consen 172 ERTESRQGEFLGN 184 (238)
T ss_pred ccccccccchhhh
Confidence 3332233345554
No 202
>PRK01060 endonuclease IV; Provisional
Probab=27.03 E-value=6.1e+02 Score=27.58 Aligned_cols=63 Identities=11% Similarity=0.063 Sum_probs=39.2
Q ss_pred CChhhHHHHHHHHHHcCCCE----EEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceE-EecCC
Q psy12907 203 SPRYYWQDRLRKLRAAGLNA----VSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLV-LLRPG 266 (949)
Q Consensus 203 ~p~~~W~d~L~k~Ka~GlNt----V~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~V-ILRpG 266 (949)
..++.-+..-+++++.|+.. +......|+-.|.|...+.+ ...+.+.+++|++.|-.. ++.||
T Consensus 44 ~~~~~~~~lk~~~~~~gl~~~~~~~h~~~~~nl~~~d~~~r~~s-~~~~~~~i~~A~~lga~~vv~h~G 111 (281)
T PRK01060 44 LEELNIEAFKAACEKYGISPEDILVHAPYLINLGNPNKEILEKS-RDFLIQEIERCAALGAKLLVFHPG 111 (281)
T ss_pred CCHHHHHHHHHHHHHcCCCCCceEEecceEecCCCCCHHHHHHH-HHHHHHHHHHHHHcCCCEEEEcCC
Confidence 33444344444566788873 22222356666666555554 356889999999999974 56665
No 203
>PRK08673 3-deoxy-7-phosphoheptulonate synthase; Reviewed
Probab=26.89 E-value=1.7e+02 Score=33.86 Aligned_cols=75 Identities=15% Similarity=0.175 Sum_probs=55.3
Q ss_pred eEeCCeeeEEeccccCCCCC-ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccC--chhHHHHHHHHHHCCce
Q psy12907 184 FVKDGKPFRYVSGSFHYFRS-PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTG--DQDLEYFLELAVREDLL 260 (949)
Q Consensus 184 f~~dGkpf~i~sGe~HY~R~-p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g--~~DL~~fl~la~e~GL~ 260 (949)
..++|.++.++.| +=-+ .++.-.+..+.+|++|.+.+..|++= |+---|.|.| ..-|..+.+.|++.||.
T Consensus 87 ~~ig~~~~~~IAG---PCsiEs~e~~~~~A~~lk~~ga~~~r~~~fK----pRTsp~sf~G~g~~gL~~L~~~~~~~Gl~ 159 (335)
T PRK08673 87 VEIGGGKPVVIAG---PCSVESEEQILEIARAVKEAGAQILRGGAFK----PRTSPYSFQGLGEEGLKLLAEAREETGLP 159 (335)
T ss_pred EEECCCceEEEEe---cCccCCHHHHHHHHHHHHHhchhhccCcEec----CCCCCcccccccHHHHHHHHHHHHHcCCc
Confidence 4567778888888 1112 25677788889999999999998884 4444467765 45677788889999999
Q ss_pred EEecC
Q psy12907 261 VLLRP 265 (949)
Q Consensus 261 VILRp 265 (949)
++-.|
T Consensus 160 v~tev 164 (335)
T PRK08673 160 IVTEV 164 (335)
T ss_pred EEEee
Confidence 98764
No 204
>cd06604 GH31_glucosidase_II_MalA Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.
Probab=26.19 E-value=1.3e+02 Score=34.32 Aligned_cols=67 Identities=10% Similarity=0.155 Sum_probs=48.4
Q ss_pred ChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccC--chhHHHHHHHHHHCCceEEecCCccccc
Q psy12907 204 PRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTG--DQDLEYFLELAVREDLLVLLRPGPYICA 271 (949)
Q Consensus 204 p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g--~~DL~~fl~la~e~GL~VILRpGPYIcA 271 (949)
..+..++.++++++.|+-+=.+.+-+.... .-+.|+|+- .-|..++++..++.|++|++..=|+|+.
T Consensus 22 ~~~~v~~~~~~~~~~~iP~d~i~lD~~~~~-~~~~f~~d~~~fPdp~~m~~~l~~~g~~~~~~~~P~v~~ 90 (339)
T cd06604 22 PEEEVREIADEFRERDIPCDAIYLDIDYMD-GYRVFTWDKERFPDPKELIKELHEQGFKVVTIIDPGVKV 90 (339)
T ss_pred CHHHHHHHHHHHHHhCCCcceEEECchhhC-CCCceeeccccCCCHHHHHHHHHHCCCEEEEEEeCceeC
Confidence 366788999999999987644443333332 334566653 2478999999999999999887788864
No 205
>TIGR02103 pullul_strch alpha-1,6-glucosidases, pullulanase-type. Members of this protein family include secreted (or membrane-anchored) pullulanases of Gram-negative bacteria and pullulanase-type starch debranching enzymes of plants. Both enzymes hydrolyze alpha-1,6 glycosidic linkages. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family is closely homologous to, but architecturally different from, the Gram-positive pullulanases of Gram-positive bacteria (TIGR02102).
Probab=25.95 E-value=1.2e+02 Score=39.59 Aligned_cols=20 Identities=10% Similarity=0.265 Sum_probs=18.5
Q ss_pred hhHHHHHHHHHHCCceEEec
Q psy12907 245 QDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 245 ~DL~~fl~la~e~GL~VILR 264 (949)
.++.++++.|+++||.|||-
T Consensus 404 ~Efk~mV~alH~~Gi~VIlD 423 (898)
T TIGR02103 404 KEFREMVQALNKTGLNVVMD 423 (898)
T ss_pred HHHHHHHHHHHHCCCEEEEE
Confidence 57999999999999999987
No 206
>cd06564 GH20_DspB_LnbB-like Glycosyl hydrolase family 20 (GH20) catalytic domain of dispersin B (DspB), lacto-N-biosidase (LnbB) and related proteins. Dispersin B is a soluble beta-N-acetylglucosamidase found in bacteria that hydrolyzes the beta-1,6-linkages of PGA (poly-beta-(1,6)-N-acetylglucosamine), a major component of the extracellular polysaccharide matrix. Lacto-N-biosidase hydrolyzes lacto-N-biose (LNB) type I oligosaccharides at the nonreducing terminus to produce lacto-N-biose as part of the GNB/LNB (galacto-N-biose/lacto-N-biose I) degradation pathway. The lacto-N-biosidase from Bifidobacterium bifidum has this GH20 domain, a carbohydrate binding module 32, and a bacterial immunoglobulin-like domain 2, as well as a YSIRK signal peptide and a G5 membrane anchor at the N and C termini, respectively. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=25.74 E-value=5.8e+02 Score=29.02 Aligned_cols=140 Identities=12% Similarity=0.181 Sum_probs=78.8
Q ss_pred ccCCCCCChhhHHHHHHHHHHcCCCEEEEeec--CCcc---CCC------------------------CCeeeccCchhH
Q psy12907 197 SFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVE--WSFH---EPS------------------------PGQFQFTGDQDL 247 (949)
Q Consensus 197 e~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~--Wn~H---Ep~------------------------~G~fdF~g~~DL 247 (949)
+=||+ +.+..++.++.|...++|++..++- |.+- .|. .|.| -..|+
T Consensus 10 aR~~~--~~~~ik~~id~ma~~K~N~lhlHltD~~~~~~~~~p~l~~~~~~~~~~~~~~~~~~~~~~~~~~Y---T~~di 84 (326)
T cd06564 10 GRKYY--SMDFLKDIIKTMSWYKMNDLQLHLNDNLIFNLDDMSTTVNNATYASDDVKSGNNYYNLTANDGYY---TKEEF 84 (326)
T ss_pred cCCCC--CHHHHHHHHHHHHHcCCceEEEeecCCcccccCCCchhhhhhhhhccccccccccCCCCCCCCcc---cHHHH
Confidence 33554 6889999999999999999987643 2221 111 1111 13689
Q ss_pred HHHHHHHHHCCceEEecCCcccccccCCCCc-cccccccCCCeeee------------cCCHhHHHHHHHHHHHHHHHHH
Q psy12907 248 EYFLELAVREDLLVLLRPGPYICAERDFGGL-PSWLMTIKPNITLR------------SKDEVYQHYVNKWFAQLFPRIT 314 (949)
Q Consensus 248 ~~fl~la~e~GL~VILRpGPYIcAEw~~GGl-P~WL~~~~p~~~~R------------t~dp~y~~~v~r~~~~l~~~l~ 314 (949)
.++++.|++.|+.||-- -+.-|- =+|+.. +|+...+ -++| ++-++++.|+..+.
T Consensus 85 ~eiv~yA~~rgI~vIPE--------ID~PGH~~a~~~~-~pel~~~~~~~~~~~~~l~~~~~----~t~~f~~~l~~E~~ 151 (326)
T cd06564 85 KELIAYAKDRGVNIIPE--------IDSPGHSLAFTKA-MPELGLKNPFSKYDKDTLDISNP----EAVKFVKALFDEYL 151 (326)
T ss_pred HHHHHHHHHcCCeEecc--------CCCcHHHHHHHHh-hHHhcCCCcccCCCcccccCCCH----HHHHHHHHHHHHHH
Confidence 99999999999999743 232221 223332 3332221 1333 44455555666666
Q ss_pred hccccCCCceEEeccc-cccCC----ccchHHHHHHHHHHHHHhcCCee
Q psy12907 315 RFLYGNGGPIILVQVE-NEMGS----YTCDKEHMIWLRDQMKYYVRDAA 358 (949)
Q Consensus 315 ~~~~~~GGpII~vQIE-NEyG~----y~~d~~Y~~~L~~~~~~~lG~~v 358 (949)
+..-. .++. ++|- -|+-. ...-..|++.+.+.+++. |..+
T Consensus 152 ~~f~~-~~~~--~HiGgDE~~~~~~~~~~~~~f~~~~~~~v~~~-gk~~ 196 (326)
T cd06564 152 DGFNP-KSDT--VHIGADEYAGDAGYAEAFRAYVNDLAKYVKDK-GKTP 196 (326)
T ss_pred HhcCC-CCCE--EEeccccccccCccHHHHHHHHHHHHHHHHHc-CCeE
Confidence 54421 1222 2221 22211 122357888888888876 7653
No 207
>cd07381 MPP_CapA CapA and related proteins, metallophosphatase domain. CapA is one of three membrane-associated enzymes in Bacillus anthracis that is required for synthesis of gamma-polyglutamic acid (PGA), a major component of the bacterial capsule. The YwtB and PgsA proteins of Bacillus subtilis are closely related to CapA and are also included in this alignment model. CapA belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal
Probab=25.46 E-value=7.1e+02 Score=26.62 Aligned_cols=131 Identities=13% Similarity=0.056 Sum_probs=0.0
Q ss_pred HHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecC------CcccccccCCCCccccc
Q psy12907 209 QDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRP------GPYICAERDFGGLPSWL 282 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRp------GPYIcAEw~~GGlP~WL 282 (949)
++.++.|+++|++.+.+- |=| .||| |..-|.+.++..++.|+..+-.- .||.--|.+ |...=+
T Consensus 67 ~~~~~~L~~~G~d~~tla---NNH-----~fD~-G~~gl~~t~~~l~~~~i~~~g~~~~~~~~~~~~i~~~~--g~kVg~ 135 (239)
T cd07381 67 PEVADALKAAGFDVVSLA---NNH-----TLDY-GEEGLLDTLDALDEAGIAHAGAGRNLEEARRPAILEVN--GIKVAF 135 (239)
T ss_pred HHHHHHHHHhCCCEEEcc---ccc-----cccc-chHHHHHHHHHHHHcCCceeECCCCHHHhcCcEEEEEC--CEEEEE
Q ss_pred cccCCCeeeecCCHhH---------HHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCccchHHHHHHHHHHHHHh
Q psy12907 283 MTIKPNITLRSKDEVY---------QHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSYTCDKEHMIWLRDQMKYY 353 (949)
Q Consensus 283 ~~~~p~~~~Rt~dp~y---------~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y~~d~~Y~~~L~~~~~~~ 353 (949)
.........-...+.. .++++++++++-+. +++. |++++.-.||. ..-..+.+.+.+.+.+.
T Consensus 136 ig~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~lr~~-~D~v------Iv~~H~G~e~~--~~p~~~~~~la~~l~~~ 206 (239)
T cd07381 136 LAYTYGTNGIPLAAGARPGGVNPLDLERIAADIAEAKKK-ADIV------IVSLHWGVEYS--YYPTPEQRELARALIDA 206 (239)
T ss_pred EEEECCCCCCcCcccCCccccCccCHHHHHHHHHHHhhc-CCEE------EEEecCcccCC--CCCCHHHHHHHHHHHHC
Q ss_pred cCCeeEE
Q psy12907 354 VRDAAVL 360 (949)
Q Consensus 354 lG~~vpl 360 (949)
|+|+++
T Consensus 207 -G~D~Ii 212 (239)
T cd07381 207 -GADLVI 212 (239)
T ss_pred -CCCEEE
No 208
>COG1735 Php Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only]
Probab=25.45 E-value=2.4e+02 Score=32.34 Aligned_cols=121 Identities=17% Similarity=0.135 Sum_probs=73.0
Q ss_pred HHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccccCCC
Q psy12907 209 QDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTIKPN 288 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~~p~ 288 (949)
...+...++.|.+||-.-..= .--||..+..+++++.||.+|...|+|.-+.|+ .|+..
T Consensus 51 ~~e~~~~~a~Gg~TIVD~T~~------------~~GRdv~~m~~vs~atglnIV~~TGfy~~~~~p-----~~~~~---- 109 (316)
T COG1735 51 IAELKRLMARGGQTIVDATNI------------GIGRDVLKMRRVAEATGLNIVAATGFYKAAFHP-----EYFAL---- 109 (316)
T ss_pred HHHHHHHHHcCCCeEeeCCcc------------ccCcCHHHHHHHHHHhCCcEEEeccccccccch-----hHHhh----
Confidence 345667777899997533211 113799999999999999999999999998886 67654
Q ss_pred eeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccccCCc-cchHHHHHHHHHHHHHhcCCeeEEEeeC
Q psy12907 289 ITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENEMGSY-TCDKEHMIWLRDQMKYYVRDAAVLYTTD 364 (949)
Q Consensus 289 ~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENEyG~y-~~d~~Y~~~L~~~~~~~lG~~vplft~D 364 (949)
|+ ++.+...+...+..=. .|+-|..=+=-|-|++ +=...=.+.|+..++.+.-..+|+-|..
T Consensus 110 ---~~--------i~~~ae~~v~ei~~Gi---~gT~ikAGiIk~~~~~~~iTp~Eek~lrAaA~A~~~Tg~Pi~tHt 172 (316)
T COG1735 110 ---RP--------IEELAEFVVKEIEEGI---AGTGIKAGIIKEAGGSPAITPLEEKSLRAAARAHKETGAPISTHT 172 (316)
T ss_pred ---CC--------HHHHHHHHHHHHHhcc---cCCccccceeeeccCcccCCHHHHHHHHHHHHHhhhcCCCeEEec
Confidence 22 3344444444444211 1333322222455555 2233335666777776645556666553
No 209
>PLN02784 alpha-amylase
Probab=25.32 E-value=1.1e+02 Score=39.74 Aligned_cols=56 Identities=14% Similarity=0.164 Sum_probs=38.5
Q ss_pred HHHHHHHHHcCCCEEEEeecCCccCC---CCCe-ee----ccCchhHHHHHHHHHHCCceEEec
Q psy12907 209 QDRLRKLRAAGLNAVSTYVEWSFHEP---SPGQ-FQ----FTGDQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 209 ~d~L~k~Ka~GlNtV~tYv~Wn~HEp---~~G~-fd----F~g~~DL~~fl~la~e~GL~VILR 264 (949)
.+++.-++++|+++|-+.=+..-..+ .+.. |+ |....||..+++.|+++||.||+-
T Consensus 524 ~ekldyL~~LG~taIWLpP~~~s~s~~GY~p~D~y~lds~yGT~~ELk~LI~a~H~~GIkVIlD 587 (894)
T PLN02784 524 GEKAAELSSLGFTVVWLPPPTESVSPEGYMPKDLYNLNSRYGTIDELKDLVKSFHEVGIKVLGD 587 (894)
T ss_pred HHHHHHHHHhCCCEEEeCCCCCCCCCCCcCcccccccCcCcCCHHHHHHHHHHHHHCCCEEEEE
Confidence 46778889999999977543221111 1111 11 333579999999999999999976
No 210
>PRK09267 flavodoxin FldA; Validated
Probab=25.14 E-value=4.2e+02 Score=26.74 Aligned_cols=73 Identities=7% Similarity=0.086 Sum_probs=46.7
Q ss_pred CCeeeEEeccccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE
Q psy12907 187 DGKPFRYVSGSFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL 262 (949)
Q Consensus 187 dGkpf~i~sGe~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI 262 (949)
......+++...|....++..|.+-+.+++...++...+.+| .......-.-.| ..-+..+-+++++.|..++
T Consensus 45 ~~~d~vi~g~pt~~~G~~~~~~~~fl~~~~~~~l~~k~vaif-g~g~~~~~~~~~--~~~~~~l~~~l~~~g~~~v 117 (169)
T PRK09267 45 EAYDLLILGIPTWGYGELQCDWDDFLPELEEIDFSGKKVALF-GLGDQEDYAEYF--CDAMGTLYDIVEPRGATIV 117 (169)
T ss_pred hhCCEEEEEecCcCCCCCCHHHHHHHHHHhcCCCCCCEEEEE-ecCCCCcchHHH--HHHHHHHHHHHHHCCCEEE
Confidence 445567788888877777889999999888777776666666 222111111112 1236667778888896664
No 211
>TIGR00677 fadh2_euk methylenetetrahydrofolate reductase, eukaryotic type. This protein is an FAD-containing flavoprotein.
Probab=24.74 E-value=2e+02 Score=32.28 Aligned_cols=108 Identities=13% Similarity=0.121 Sum_probs=65.0
Q ss_pred eccccCCCCCC---hhhH-HHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCccc
Q psy12907 194 VSGSFHYFRSP---RYYW-QDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYI 269 (949)
Q Consensus 194 ~sGe~HY~R~p---~~~W-~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYI 269 (949)
+++..||+.=| .... -++|++=.++|.+.+-|-.++ |. ..+.+|++.|++.|+.+=+-||..-
T Consensus 132 igva~~Pe~Hp~~~~~~~d~~~L~~Ki~aGA~f~iTQ~~F----------d~---~~~~~f~~~~~~~gi~~PIi~GI~p 198 (281)
T TIGR00677 132 IGVAGYPEGHPEAESVELDLKYLKEKVDAGADFIITQLFY----------DV---DNFLKFVNDCRAIGIDCPIVPGIMP 198 (281)
T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHcCCCEeecccee----------cH---HHHHHHHHHHHHcCCCCCEEeeccc
Confidence 45555554422 2222 235554446999988875443 22 3578999999999776644555544
Q ss_pred c---------cccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 270 C---------AERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 270 c---------AEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
+ +||..--+|.|+.++.- ....+++...+.--++..+++..+.+.
T Consensus 199 i~s~~~~~~~~~~~Gi~vP~~l~~~l~--~~~~~~~~~~~~gi~~a~~~~~~l~~~ 252 (281)
T TIGR00677 199 INNYASFLRRAKWSKTKIPQEIMSRLE--PIKDDDEAVRDYGIELIVEMCQKLLAS 252 (281)
T ss_pred cCCHHHHHHHHhcCCCCCCHHHHHHHH--hccCCHHHHHHHHHHHHHHHHHHHHHC
Confidence 4 57887778999987210 012233445556666677777777654
No 212
>cd06562 GH20_HexA_HexB-like Beta-N-acetylhexosaminidases catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. The hexA and hexB genes encode the alpha- and beta-subunits of the two major beta-N-acetylhexosaminidase isoenzymes, N-acetyl-beta-D-hexosaminidase A (HexA) and beta-N-acetylhexosaminidase B (HexB). Both the alpha and the beta catalytic subunits have a TIM-barrel fold and belong to the glycosyl hydrolase family 20 (GH20). The HexA enzyme is a heterodimer containing one alpha and one beta subunit while the HexB enzyme is a homodimer containing two beta-subunits. Hexosaminidase mutations cause an inability to properly hydrolyze certain sphingolipids which accumulate in lysosomes within the brain, resulting in the lipid storage disorders Tay-Sachs and Sandhoff. Mutations in the alpha subunit cause in a deficiency in the HexA enzyme and result in
Probab=24.50 E-value=9.7e+02 Score=27.63 Aligned_cols=62 Identities=15% Similarity=0.210 Sum_probs=46.3
Q ss_pred CChhhHHHHHHHHHHcCCCEEEEeec----CCccCCC------CCeeeccC---chhHHHHHHHHHHCCceEEec
Q psy12907 203 SPRYYWQDRLRKLRAAGLNAVSTYVE----WSFHEPS------PGQFQFTG---DQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 203 ~p~~~W~d~L~k~Ka~GlNtV~tYv~----Wn~HEp~------~G~fdF~g---~~DL~~fl~la~e~GL~VILR 264 (949)
+|.+..++.++.|....+|+...++- |.+--+. .|.|.=.| ..|+.++++.|++.|+.||--
T Consensus 15 ~~~~~ik~~Id~ma~~KlN~lh~HltDd~~~rle~~~~P~Lt~~ga~~~~~~YT~~di~eiv~yA~~rgI~vIPE 89 (348)
T cd06562 15 LSVDSIKRTIDAMAYNKLNVLHWHITDSQSFPLESPSYPELSKKGAYSPSEVYTPEDVKEIVEYARLRGIRVIPE 89 (348)
T ss_pred CCHHHHHHHHHHHHHhCCcEEEEeEEcCCCceEeeCCCchhhhccCcCCCceECHHHHHHHHHHHHHcCCEEEEe
Confidence 46788999999999999999998863 6554321 23332221 368999999999999999854
No 213
>TIGR00587 nfo apurinic endonuclease (APN1). All proteins in this family for which functions are known are 5' AP endonculeases that are used in base excision repair and the repair of abasic sites in DNA.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=24.24 E-value=4e+02 Score=29.40 Aligned_cols=86 Identities=16% Similarity=0.207 Sum_probs=53.6
Q ss_pred HHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceE--EecCCcccccccCCCCcccccccc
Q psy12907 208 WQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLV--LLRPGPYICAERDFGGLPSWLMTI 285 (949)
Q Consensus 208 W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~V--ILRpGPYIcAEw~~GGlP~WL~~~ 285 (949)
-.+.++.++++|+++|++++-. |+--..+.-...+.++|-+.++++++.+ +.-=+||. .+
T Consensus 13 ~~~a~~~~~~~G~~~~qif~~~----P~~w~~~~~~~~~~~~~~~~~~~~~~~~~~i~~Hapy~-------------iN- 74 (274)
T TIGR00587 13 LQAAYNRAAEIGATAFMFFLKS----PRWWRRPMLEEEVIDWFKAALETNKNLSQIVLVHAPYL-------------IN- 74 (274)
T ss_pred HHHHHHHHHHhCCCEEEEEecC----ccccCCCCCCHHHHHHHHHHHHHcCCCCcceeccCCee-------------ee-
Confidence 3568999999999999997642 1111111112346788888899998863 33334542 11
Q ss_pred CCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 286 KPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 286 ~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
+=+.|+.-+++..+.+++.+..-+.+
T Consensus 75 -----las~~~~~r~~sv~~~~~~i~~A~~l 100 (274)
T TIGR00587 75 -----LASPDEEKEEKSLDVLDEELKRCELL 100 (274)
T ss_pred -----cCCCCHHHHHHHHHHHHHHHHHHHHc
Confidence 22446667777777777766666655
No 214
>PRK07868 acyl-CoA synthetase; Validated
Probab=23.85 E-value=39 Score=44.07 Aligned_cols=63 Identities=22% Similarity=0.269 Sum_probs=43.6
Q ss_pred CCeEECCCCCCchhhhhhh-----hhhcccccccccc-------------------------------------------
Q psy12907 613 SPLVFLPPVSGTADIFFHL-----MLDLSRKGYRTLS------------------------------------------- 644 (949)
Q Consensus 613 ~~v~~l~~~~g~a~~~~~~-----~~~l~~kgyR~is------------------------------------------- 644 (949)
.||+|+||...++.+|=.+ +-.|++.||||+.
T Consensus 68 ~plllvhg~~~~~~~~d~~~~~s~v~~L~~~g~~v~~~d~G~~~~~~~~~~~~l~~~i~~l~~~l~~v~~~~~~~v~lvG 147 (994)
T PRK07868 68 PPVLMVHPMMMSADMWDVTRDDGAVGILHRAGLDPWVIDFGSPDKVEGGMERNLADHVVALSEAIDTVKDVTGRDVHLVG 147 (994)
T ss_pred CcEEEECCCCCCccceecCCcccHHHHHHHCCCEEEEEcCCCCChhHcCccCCHHHHHHHHHHHHHHHHHhhCCceEEEE
Confidence 3677777777766666544 3445566777763
Q ss_pred -cCCcchhhcccccccccCceeecccccccccc
Q psy12907 645 -YLGGFLAQKFAEHTVLRPRVVSLFLCNSFYDT 676 (949)
Q Consensus 645 -~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~dt 676 (949)
++||.+++.|+...+ .+.|+||++.++=.|.
T Consensus 148 ~s~GG~~a~~~aa~~~-~~~v~~lvl~~~~~d~ 179 (994)
T PRK07868 148 YSQGGMFCYQAAAYRR-SKDIASIVTFGSPVDT 179 (994)
T ss_pred EChhHHHHHHHHHhcC-CCccceEEEEeccccc
Confidence 689999999986432 2479999998887664
No 215
>PRK12677 xylose isomerase; Provisional
Probab=23.82 E-value=2.8e+02 Score=32.62 Aligned_cols=59 Identities=14% Similarity=0.090 Sum_probs=36.5
Q ss_pred hHHHHHHHHHHcCCCEEE--EeecC-------CccCCCCCeeeccCchhHHHHHHHHHHCCce-EEecCC
Q psy12907 207 YWQDRLRKLRAAGLNAVS--TYVEW-------SFHEPSPGQFQFTGDQDLEYFLELAVREDLL-VLLRPG 266 (949)
Q Consensus 207 ~W~d~L~k~Ka~GlNtV~--tYv~W-------n~HEp~~G~fdF~g~~DL~~fl~la~e~GL~-VILRpG 266 (949)
.+++..+.+++.|+..+. +-.+. ++-.|.+.+-++. ...+.+.|++|++.|-. |++++|
T Consensus 68 ~~~~lk~~l~~~GL~v~~v~~n~f~~p~~~~g~lts~d~~~R~~A-i~~~~r~IdlA~eLGa~~Vvv~~G 136 (384)
T PRK12677 68 IIKRFKKALDETGLVVPMVTTNLFTHPVFKDGAFTSNDRDVRRYA-LRKVLRNIDLAAELGAKTYVMWGG 136 (384)
T ss_pred HHHHHHHHHHHcCCeeEEEecCCCCCccccCCcCCCCCHHHHHHH-HHHHHHHHHHHHHhCCCEEEEeeC
Confidence 577888888999998443 33222 2222222222221 23477888999999997 567776
No 216
>PLN02763 hydrolase, hydrolyzing O-glycosyl compounds
Probab=23.66 E-value=3.2e+02 Score=36.10 Aligned_cols=67 Identities=13% Similarity=0.079 Sum_probs=50.3
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccC--chhHHHHHHHHHHCCceEEecCCcccccc
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTG--DQDLEYFLELAVREDLLVLLRPGPYICAE 272 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g--~~DL~~fl~la~e~GL~VILRpGPYIcAE 272 (949)
.+.-++.++++|+.++-+=.+++-|.++.. -+.|.|+- .-|..++++..++.|+++++-.-|+|.+|
T Consensus 200 q~eV~eva~~fre~~IP~DvIwlDidYm~g-~~~FTwD~~rFPdP~~mv~~Lh~~G~kvv~iidPgI~~d 268 (978)
T PLN02763 200 AKRVAEIARTFREKKIPCDVVWMDIDYMDG-FRCFTFDKERFPDPKGLADDLHSIGFKAIWMLDPGIKAE 268 (978)
T ss_pred HHHHHHHHHHHHHcCCCceEEEEehhhhcC-CCceeECcccCCCHHHHHHHHHHCCCEEEEEEcCCCccC
Confidence 566788999999999988666666665542 23455643 24889999999999999987667778764
No 217
>PRK05077 frsA fermentation/respiration switch protein; Reviewed
Probab=22.97 E-value=45 Score=39.16 Aligned_cols=28 Identities=18% Similarity=0.208 Sum_probs=20.4
Q ss_pred cCCcchhhcccccccccCceeecccccccc
Q psy12907 645 YLGGFLAQKFAEHTVLRPRVVSLFLCNSFY 674 (949)
Q Consensus 645 ~lg~~l~q~f~~~~~~~~~v~sl~lcn~~~ 674 (949)
|+||+++...+...+ +.++.+++..+..
T Consensus 273 S~GG~~Al~~A~~~p--~ri~a~V~~~~~~ 300 (414)
T PRK05077 273 RFGANVAVRLAYLEP--PRLKAVACLGPVV 300 (414)
T ss_pred ChHHHHHHHHHHhCC--cCceEEEEECCcc
Confidence 899999998886544 4677777666654
No 218
>cd00311 TIM Triosephosphate isomerase (TIM) is a glycolytic enzyme that catalyzes the interconversion of dihydroxyacetone phosphate and D-glyceraldehyde-3-phosphate. The reaction is very efficient and requires neither cofactors nor metal ions. TIM, usually homodimeric, but in some organisms tetrameric, is ubiqitous and conserved in function across eukaryotes, bacteria and archaea.
Probab=22.94 E-value=1.5e+02 Score=32.63 Aligned_cols=48 Identities=25% Similarity=0.252 Sum_probs=37.5
Q ss_pred HHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCC
Q psy12907 213 RKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPG 266 (949)
Q Consensus 213 ~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpG 266 (949)
+++|++|++.|-+. |..++--|.- .+.++.+-++.|.++||.+|++.|
T Consensus 78 ~mL~d~G~~~viiG-----HSERR~~f~E-t~~~i~~Kv~~a~~~gl~pIvCiG 125 (242)
T cd00311 78 EMLKDAGAKYVIIG-----HSERRQYFGE-TDEDVAKKVKAALEAGLTPILCVG 125 (242)
T ss_pred HHHHHcCCCEEEeC-----cccccCcCCC-CcHHHHHHHHHHHHCCCEEEEEeC
Confidence 57999999988775 4444433433 357899999999999999999987
No 219
>PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=22.84 E-value=1.2e+02 Score=33.06 Aligned_cols=60 Identities=10% Similarity=0.047 Sum_probs=38.7
Q ss_pred hhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeecc-CchhHHHHHHHHHHCCceEEecC
Q psy12907 205 RYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFT-GDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~-g~~DL~~fl~la~e~GL~VILRp 265 (949)
.+.+++.++.++++|++.|.+.-.-...++.. .-.++ -...+.++.++|+++|+.+.+-+
T Consensus 93 ~~~~~~~i~~a~~lG~~~v~~~~~~~~~~~~~-~~~~~~~~~~l~~l~~~a~~~gv~l~lE~ 153 (284)
T PRK13210 93 LEIMKKAIRLAQDLGIRTIQLAGYDVYYEEKS-EETRQRFIEGLAWAVEQAAAAQVMLAVEI 153 (284)
T ss_pred HHHHHHHHHHHHHhCCCEEEECCccccccccc-HHHHHHHHHHHHHHHHHHHHhCCEEEEEe
Confidence 35688999999999999997531100011111 00111 12468889999999999998876
No 220
>PTZ00372 endonuclease 4-like protein; Provisional
Probab=22.69 E-value=1.1e+03 Score=28.43 Aligned_cols=82 Identities=11% Similarity=0.165 Sum_probs=59.6
Q ss_pred ceEeCCeeeEEeccccCCCCCC---hhhHHHHHHHHHHcCCCE--E--EE-eecCCccCCCCCeeeccCchhHHHHHHHH
Q psy12907 183 TFVKDGKPFRYVSGSFHYFRSP---RYYWQDRLRKLRAAGLNA--V--ST-YVEWSFHEPSPGQFQFTGDQDLEYFLELA 254 (949)
Q Consensus 183 ~f~~dGkpf~i~sGe~HY~R~p---~~~W~d~L~k~Ka~GlNt--V--~t-Yv~Wn~HEp~~G~fdF~g~~DL~~fl~la 254 (949)
...+++.-|+++.+.-+-++.+ ++.-+.--+.+++.|++. | .. |. -|+-.|.+...+++ ..-+.+-|+.|
T Consensus 150 a~~~g~~afqiF~~npr~w~~~~~~~~~~~~f~~~~~~~gi~~~~i~~HapYl-INLASpd~e~rekS-v~~~~~eL~rA 227 (413)
T PTZ00372 150 AYNIAGQAFALFLKNQRTWNSPPLSDETIDKFKENCKKYNYDPKFILPHGSYL-INLANPDKEKREKS-YDAFLDDLQRC 227 (413)
T ss_pred HHHcCCCEEEEEcCCCccCCCCCCCHHHHHHHHHHHHHcCCCcceEEeecCce-ecCCCCCHHHHHHH-HHHHHHHHHHH
Confidence 3456889999999998877655 345555566777887752 3 22 33 78888888887776 34577888999
Q ss_pred HHCCceE-EecCC
Q psy12907 255 VREDLLV-LLRPG 266 (949)
Q Consensus 255 ~e~GL~V-ILRpG 266 (949)
.+.|... ++-||
T Consensus 228 ~~LGa~~VV~HPG 240 (413)
T PTZ00372 228 EQLGIKLYNFHPG 240 (413)
T ss_pred HHcCCCEEEECCC
Confidence 9999985 57787
No 221
>cd00537 MTHFR Methylenetetrahydrofolate reductase (MTHFR). 5,10-Methylenetetrahydrofolate is reduced to 5-methyltetrahydrofolate by methylenetetrahydrofolate reductase, a cytoplasmic, NAD(P)-dependent enzyme. 5-methyltetrahydrofolate is utilized by methionine synthase to convert homocysteine to methionine. The enzymatic mechanism is a ping-pong bi-bi mechanism, in which NAD(P)+ release precedes the binding of methylenetetrahydrofolate and the acceptor is free FAD. The family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from prokaryotes and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The bacterial enzyme is a homotetramer and NADH is the preferred reductant while the eukaryotic enzyme is a homodimer and NADPH is the preferred reductant. In humans, there are several clinically significant mutations in MTHFR that result in hyperhomocysteinemia, which is a risk factor for the development of cardiovascular disease.
Probab=22.34 E-value=2.9e+02 Score=30.46 Aligned_cols=104 Identities=15% Similarity=0.102 Sum_probs=63.6
Q ss_pred ccCCCCCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCC--ceEEecCCcccc----
Q psy12907 197 SFHYFRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVRED--LLVLLRPGPYIC---- 270 (949)
Q Consensus 197 e~HY~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~G--L~VILRpGPYIc---- 270 (949)
+.|+..-+.+.=.++|++=.++|.+.+-|-.+.+. ..+.+|++.|++.| +.||+..-|-..
T Consensus 138 e~hp~~~~~~~~~~~L~~Ki~aGA~f~iTQ~~fd~-------------~~~~~~~~~~~~~gi~vPIi~GI~p~~s~~~l 204 (274)
T cd00537 138 EGHPEAPSLEEDIKRLKRKVDAGADFIITQLFFDN-------------DAFLRFVDRCRAAGITVPIIPGIMPLTSYKQA 204 (274)
T ss_pred CcCCCCCCHHHHHHHHHHHHHCCCCEEeecccccH-------------HHHHHHHHHHHHcCCCCCEEeeccccCCHHHH
Confidence 66776655444445555555679999888655433 35889999999998 555666555433
Q ss_pred ---cccCCCCccccccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHh
Q psy12907 271 ---AERDFGGLPSWLMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITR 315 (949)
Q Consensus 271 ---AEw~~GGlP~WL~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~ 315 (949)
+++..-++|.|+.++... ...+.....+.-.++..+++..+.+
T Consensus 205 ~~~~~~~Gv~vP~~~~~~l~~--~~~~~~~~~~~g~~~~~~l~~~l~~ 250 (274)
T cd00537 205 KRFAKLCGVEIPDWLLERLEK--LKDDAEAVRAEGIEIAAELCDELLE 250 (274)
T ss_pred HHHHHhhCCCCCHHHHHHHHh--cCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 456666789998862100 1122233445556666666666655
No 222
>KOG2024|consensus
Probab=22.31 E-value=1e+02 Score=34.57 Aligned_cols=50 Identities=24% Similarity=0.389 Sum_probs=38.0
Q ss_pred CCCCcccccCCC------ceeEEEEEecCCCC-----CCCCcccccCCcceEEEEEECCEE
Q psy12907 508 AFPLSFEALDQS------FGFVLYETIIPDAR-----FPDPALLTISGLRDRGQVFVDEKL 557 (949)
Q Consensus 508 ~~P~tmE~lgQ~------~GyvlY~t~i~~~~-----~~~~~~L~i~~vrDrA~Vfvdg~~ 557 (949)
..|-++-+++|. +|.+.|+.++.-++ ..++..|++..+|-+|.|.+||..
T Consensus 69 pvpss~nDi~~d~~lrdfv~~~wyer~v~vpe~w~~~~~~r~vlr~~s~H~~Aivwvng~~ 129 (297)
T KOG2024|consen 69 PVPSSFNDIGQDWRLRDFVGLVWYERTVTVPESWTQDLGKRVVLRIGSAHSYAIVWVNGVD 129 (297)
T ss_pred ccccchhccccCCccccceeeeEEEEEEEcchhhhhhcCCeEEEEeecccceeEEEEccee
Confidence 456667777775 79999998874433 234567899999999999999864
No 223
>PRK09875 putative hydrolase; Provisional
Probab=22.25 E-value=2.5e+02 Score=31.81 Aligned_cols=62 Identities=16% Similarity=0.240 Sum_probs=46.4
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMT 284 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~ 284 (949)
+.=.+.|+.+|++|.+||---.+.. -.||...+.+++++-|+.||...|-|.-. -.|.|+..
T Consensus 34 ~~~~~el~~~~~~Gg~tiVd~T~~g------------~GRd~~~l~~is~~tgv~Iv~~TG~y~~~-----~~p~~~~~ 95 (292)
T PRK09875 34 AFICQEMNDLMTRGVRNVIEMTNRY------------MGRNAQFMLDVMRETGINVVACTGYYQDA-----FFPEHVAT 95 (292)
T ss_pred HHHHHHHHHHHHhCCCeEEecCCCc------------cCcCHHHHHHHHHHhCCcEEEcCcCCCCc-----cCCHHHhc
Confidence 4445678889999999973222221 24899999999999999999999988642 25788865
No 224
>COG1523 PulA Type II secretory pathway, pullulanase PulA and related glycosidases [Carbohydrate transport and metabolism]
Probab=21.72 E-value=1.3e+02 Score=38.09 Aligned_cols=53 Identities=19% Similarity=0.282 Sum_probs=39.2
Q ss_pred HHHHHHcCCCEEEE-eecCCccCCCC--------------------CeeeccC-----chhHHHHHHHHHHCCceEEec
Q psy12907 212 LRKLRAAGLNAVST-YVEWSFHEPSP--------------------GQFQFTG-----DQDLEYFLELAVREDLLVLLR 264 (949)
Q Consensus 212 L~k~Ka~GlNtV~t-Yv~Wn~HEp~~--------------------G~fdF~g-----~~DL~~fl~la~e~GL~VILR 264 (949)
|.-+|.+|+++|+. +|+.-..|+.. |.|-=+. .+.+..+++.++++||-|||-
T Consensus 206 i~yLk~LGvtaVeLLPV~~~~~~~~l~~~gl~n~WGYdP~~fFAp~~~Yss~p~p~~~i~EfK~mV~~lHkaGI~VILD 284 (697)
T COG1523 206 IDYLKDLGVTAVELLPVFDFYDEPHLDKSGLNNNWGYDPLNFFAPEGRYASNPEPATRIKEFKDMVKALHKAGIEVILD 284 (697)
T ss_pred HHHHHHhCCceEEEecceEEeccccccccccccccCCCcccccCCCccccCCCCcchHHHHHHHHHHHHHHcCCEEEEE
Confidence 88999999999986 66655555443 2222222 257888999999999999986
No 225
>cd06597 GH31_transferase_CtsY CtsY (cyclic tetrasaccharide-synthesizing enzyme Y) is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=21.56 E-value=2e+02 Score=33.04 Aligned_cols=74 Identities=11% Similarity=0.068 Sum_probs=50.3
Q ss_pred cCCCCCC---hhhHHHHHHHHHHcCCCEEEEee----------cCCccCCC---------CCeeeccC---chhHHHHHH
Q psy12907 198 FHYFRSP---RYYWQDRLRKLRAAGLNAVSTYV----------EWSFHEPS---------PGQFQFTG---DQDLEYFLE 252 (949)
Q Consensus 198 ~HY~R~p---~~~W~d~L~k~Ka~GlNtV~tYv----------~Wn~HEp~---------~G~fdF~g---~~DL~~fl~ 252 (949)
+|..|.- .+.-++.++++++.|+.+=.+++ .|+...-. =+.++|.. .-|..++++
T Consensus 13 ~~~sr~~Y~~~~ev~~v~~~~~~~~iP~d~i~lD~W~~~~~~~~w~d~~y~~~~~~~~~~~~~~~f~~~~~FPdp~~mi~ 92 (340)
T cd06597 13 LWMSANEWDTQAEVMRQMDAHEEHGIPVTVVVIEQWSDEATFYVFNDAQYTPKDGGAPLSYDDFSFPVEGRWPNPKGMID 92 (340)
T ss_pred hhhhccCCCCHHHHHHHHHHHHHcCCCeeEEEEecccCcceeeeeccchhcccccCCcceecccccCccccCCCHHHHHH
Confidence 4555533 56778999999999998755443 35432211 13344432 137999999
Q ss_pred HHHHCCceEEecCCccccc
Q psy12907 253 LAVREDLLVLLRPGPYICA 271 (949)
Q Consensus 253 la~e~GL~VILRpGPYIcA 271 (949)
..++.|++|+|..=|+|..
T Consensus 93 ~Lh~~G~kv~l~v~P~i~~ 111 (340)
T cd06597 93 ELHEQGVKVLLWQIPIIKL 111 (340)
T ss_pred HHHHCCCEEEEEecCcccc
Confidence 9999999999988888863
No 226
>TIGR02635 RhaI_grampos L-rhamnose isomerase, Streptomyces subtype. This clade of sequences is closely related to the L-rhamnose isomerases found in Pseudomonas stutzeri and in a number of the Rhizobiales (TIGR02629). The genes of the family represented here are found in similar genomic contexts which contain genes apparently involved in rhamnose catabolism such as rhamnulose-1-phosphate aldolase (TIGR02632), sugar kinases, and sugar transporters.
Probab=21.26 E-value=1.4e+03 Score=27.03 Aligned_cols=87 Identities=11% Similarity=0.101 Sum_probs=54.7
Q ss_pred hhhHHHHHHHHHHc-CC-CEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE-ecCCcccccccCCCCcccc
Q psy12907 205 RYYWQDRLRKLRAA-GL-NAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL-LRPGPYICAERDFGGLPSW 281 (949)
Q Consensus 205 ~~~W~d~L~k~Ka~-Gl-NtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI-LRpGPYIcAEw~~GGlP~W 281 (949)
.+...+++.+++.+ ++ -.|+..++| +.. .|+.++.++++++||.|. +-|+=+-. |.
T Consensus 39 ~~e~~~d~~~v~~L~~~~~~v~lH~~~------d~~------~d~~~~~~~l~~~GL~v~~i~p~~f~~--------~~- 97 (378)
T TIGR02635 39 VFEKIEDAALVHRLTGICPTVALHIPW------DRV------EDYEELARYAEELGLKIGAINPNLFQD--------DD- 97 (378)
T ss_pred HHHHHHHHHHHHhhcCCCCceeeccCC------ccc------cCHHHHHHHHHHcCCceeeeeCCccCC--------cc-
Confidence 34555566666655 33 356676666 222 358889999999999996 77651111 11
Q ss_pred ccccCCCeeeecCCHhHHHHHHHHHHHHHHHHHhc
Q psy12907 282 LMTIKPNITLRSKDEVYQHYVNKWFAQLFPRITRF 316 (949)
Q Consensus 282 L~~~~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~ 316 (949)
++...+-+.||..++..-++.++.+..-+.+
T Consensus 98 ----~~~GSLt~pD~~vR~~AIe~~k~~idiA~eL 128 (378)
T TIGR02635 98 ----YKFGSLTHPDKRIRRKAIDHLLECVDIAKKT 128 (378)
T ss_pred ----cCCCCCCCCCHHHHHHHHHHHHHHHHHHHHh
Confidence 1122455678888888888777777766654
No 227
>PF13380 CoA_binding_2: CoA binding domain; PDB: 3FF4_A 2D5A_A 2D59_A 2E6U_X 1IUL_A 1IUK_A 1Y81_A 2DUW_A.
Probab=20.78 E-value=1.9e+02 Score=27.95 Aligned_cols=45 Identities=22% Similarity=0.208 Sum_probs=31.6
Q ss_pred CCChhhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEE
Q psy12907 202 RSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVL 262 (949)
Q Consensus 202 R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VI 262 (949)
-+|++...+.++.++++|+..|=..- | ..-++.+++|+++||.++
T Consensus 62 ~~~~~~~~~~v~~~~~~g~~~v~~~~---------g-------~~~~~~~~~a~~~gi~vi 106 (116)
T PF13380_consen 62 CVPPDKVPEIVDEAAALGVKAVWLQP---------G-------AESEELIEAAREAGIRVI 106 (116)
T ss_dssp -S-HHHHHHHHHHHHHHT-SEEEE-T---------T-------S--HHHHHHHHHTT-EEE
T ss_pred EcCHHHHHHHHHHHHHcCCCEEEEEc---------c-------hHHHHHHHHHHHcCCEEE
Confidence 45689999999999999988765431 1 346788999999999975
No 228
>PRK08227 autoinducer 2 aldolase; Validated
Probab=20.57 E-value=2.6e+02 Score=31.41 Aligned_cols=47 Identities=11% Similarity=0.138 Sum_probs=37.2
Q ss_pred HHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEe
Q psy12907 211 RLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLL 263 (949)
Q Consensus 211 ~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VIL 263 (949)
..+.+-++|.++|.++++|.- .+.-+-..|+.+..+.|++.||-+|+
T Consensus 99 sVeeAvrlGAdAV~~~v~~Gs------~~E~~~l~~l~~v~~ea~~~G~Plla 145 (264)
T PRK08227 99 DMEDAVRLNACAVAAQVFIGS------EYEHQSIKNIIQLVDAGLRYGMPVMA 145 (264)
T ss_pred cHHHHHHCCCCEEEEEEecCC------HHHHHHHHHHHHHHHHHHHhCCcEEE
Confidence 366678899999999999981 22223346899999999999999886
No 229
>PRK09997 hydroxypyruvate isomerase; Provisional
Probab=20.39 E-value=1.3e+02 Score=32.65 Aligned_cols=60 Identities=25% Similarity=0.275 Sum_probs=39.2
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecC
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRp 265 (949)
+..++.++.++++|..+|.+...+.-....+.+..-.-...|.++.++|++.|+.+.+-|
T Consensus 85 ~~~~~~i~~a~~lga~~i~~~~g~~~~~~~~~~~~~~~~~~l~~l~~~a~~~Gv~l~lE~ 144 (258)
T PRK09997 85 DGVAAAIRYARALGNKKINCLVGKTPAGFSSEQIHATLVENLRYAANMLMKEDILLLIEP 144 (258)
T ss_pred HHHHHHHHHHHHhCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHcCCEEEEEe
Confidence 567889999999999998764333211111111111112457888899999999998886
No 230
>COG3684 LacD Tagatose-1,6-bisphosphate aldolase [Carbohydrate transport and metabolism]
Probab=20.14 E-value=74 Score=35.63 Aligned_cols=60 Identities=22% Similarity=0.318 Sum_probs=47.3
Q ss_pred CCCChhhHHHHHHHHHHcCCCEEEEeecCCccCC--CCCeeeccCchhHHHHHHHHHHCCceEEecC
Q psy12907 201 FRSPRYYWQDRLRKLRAAGLNAVSTYVEWSFHEP--SPGQFQFTGDQDLEYFLELAVREDLLVLLRP 265 (949)
Q Consensus 201 ~R~p~~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp--~~G~fdF~g~~DL~~fl~la~e~GL~VILRp 265 (949)
.|+|.=.=.=..+++|+.|-+.|-+-|.|..-|+ .+-+..+ +++|...|..+||..+|-|
T Consensus 106 ~rlp~l~~~isa~riK~~G~~avK~Lvy~~~D~~e~neqk~a~-----ierigsec~aedi~f~lE~ 167 (306)
T COG3684 106 VRLPDLLRKISAKRIKEDGGDAVKFLVYYRSDEDEINEQKLAY-----IERIGSECHAEDLPFFLEP 167 (306)
T ss_pred ccchhhhhhhCHHHHHHhcccceEEEEEEcCCchHHhHHHHHH-----HHHHHHHhhhcCCceeEee
Confidence 4666322222467899999999999999999999 4445555 9999999999999998886
No 231
>COG5309 Exo-beta-1,3-glucanase [Carbohydrate transport and metabolism]
Probab=20.09 E-value=1.3e+03 Score=26.33 Aligned_cols=190 Identities=18% Similarity=0.179 Sum_probs=0.0
Q ss_pred hhHHHHHHHHHHcCCCEEEEeecCCccCCCCCeeeccCchhHHHHHHHHHHCCceEEecCCcccccccCCCCcccccccc
Q psy12907 206 YYWQDRLRKLRAAGLNAVSTYVEWSFHEPSPGQFQFTGDQDLEYFLELAVREDLLVLLRPGPYICAERDFGGLPSWLMTI 285 (949)
Q Consensus 206 ~~W~d~L~k~Ka~GlNtV~tYv~Wn~HEp~~G~fdF~g~~DL~~fl~la~e~GL~VILRpGPYIcAEw~~GGlP~WL~~~ 285 (949)
+.-+-+|+.++.-+. .|++| | +.-+-|+..+.+|.+.|++|+|. .|+..
T Consensus 63 ~~~~sDLe~l~~~t~-~IR~Y----------~----sDCn~le~v~pAa~~~g~kv~lG---------------iw~td- 111 (305)
T COG5309 63 DQVASDLELLASYTH-SIRTY----------G----SDCNTLENVLPAAEASGFKVFLG---------------IWPTD- 111 (305)
T ss_pred HHHHhHHHHhccCCc-eEEEe----------e----ccchhhhhhHHHHHhcCceEEEE---------------Eeecc-
Q ss_pred CCCeeeecCCHhHHHHHHHHHHHHHHHHHhccccCCCceEEeccccc--cCCc---cchHHHHHHHHHHHHHhcCCeeEE
Q psy12907 286 KPNITLRSKDEVYQHYVNKWFAQLFPRITRFLYGNGGPIILVQVENE--MGSY---TCDKEHMIWLRDQMKYYVRDAAVL 360 (949)
Q Consensus 286 ~p~~~~Rt~dp~y~~~v~r~~~~l~~~l~~~~~~~GGpII~vQIENE--yG~y---~~d~~Y~~~L~~~~~~~lG~~vpl 360 (949)
+....-=+.++..+.+.---. .|..+=|-|| |+.. ..-.+|+...|.++++. |.++|.
T Consensus 112 --------------d~~~~~~~til~ay~~~~~~d--~v~~v~VGnEal~r~~~tasql~~~I~~vrsav~~a-gy~gpV 174 (305)
T COG5309 112 --------------DIHDAVEKTILSAYLPYNGWD--DVTTVTVGNEALNRNDLTASQLIEYIDDVRSAVKEA-GYDGPV 174 (305)
T ss_pred --------------chhhhHHHHHHHHHhccCCCC--ceEEEEechhhhhcCCCCHHHHHHHHHHHHHHHHhc-CCCCce
Q ss_pred EeeCCCCccccccccCCeeeeecCCCCC----------------ChHHHHHHhhhcCCC-CCeeeeecCCCccc---ccC
Q psy12907 361 YTTDGGGVGYLKCTVPGVYATVDFGAAT----------------NVSSAFAAMRTVSPH-GPLVNSEYYPGWLT---HWG 420 (949)
Q Consensus 361 ft~Dg~~~~~l~~~~~g~~~t~nfg~~~----------------~~~~~f~~~~~~~p~-~P~~~~Ef~~GWfd---~WG 420 (949)
-|.|.+ ..-..-|.+....||=..+ -+....+.++..-+. +|++++| +||.. ..|
T Consensus 175 ~T~dsw---~~~~~np~l~~~SDfia~N~~aYwd~~~~a~~~~~f~~~q~e~vqsa~g~~k~~~v~E--tGWPS~G~~~G 249 (305)
T COG5309 175 TTVDSW---NVVINNPELCQASDFIAANAHAYWDGQTVANAAGTFLLEQLERVQSACGTKKTVWVTE--TGWPSDGRTYG 249 (305)
T ss_pred eecccc---eeeeCChHHhhhhhhhhcccchhccccchhhhhhHHHHHHHHHHHHhcCCCccEEEee--ccCCCCCCccC
Q ss_pred CCCCCCChHH--HHHHHHHHHHcCceEEEe
Q psy12907 421 EKIATVDTDP--VVKTLQIMLEMKANVNFY 448 (949)
Q Consensus 421 ~~~~~~~~~~--~~~~l~~~l~~gaSvN~Y 448 (949)
+..+..+.+. +...+-.|-+.|.++=++
T Consensus 250 ~a~pS~anq~~~~~~i~~~~~~~G~d~fvf 279 (305)
T COG5309 250 SAVPSVANQKIAVQEILNALRSCGYDVFVF 279 (305)
T ss_pred CcCCChhHHHHHHHHHHhhhhccCccEEEe
Done!