Query 024821
Match_columns 262
No_of_seqs 253 out of 1429
Neff 6.5
Searched_HMMs 46136
Date Fri Mar 29 07:40:10 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/024821.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/024821hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0626 Beta-glucosidase, lact 100.0 2.5E-81 5.3E-86 600.7 22.3 234 24-257 32-271 (524)
2 PLN02998 beta-glucosidase 100.0 8.2E-78 1.8E-82 584.7 22.4 239 20-258 22-262 (497)
3 PLN02849 beta-glucosidase 100.0 3E-77 6.5E-82 581.5 23.2 237 21-258 22-258 (503)
4 PLN02814 beta-glucosidase 100.0 2.4E-76 5.1E-81 575.3 21.7 233 24-258 23-256 (504)
5 COG2723 BglB Beta-glucosidase/ 100.0 2.7E-75 5.9E-80 554.8 20.0 224 27-261 2-232 (460)
6 PRK13511 6-phospho-beta-galact 100.0 5.7E-74 1.2E-78 555.4 20.7 216 27-258 3-218 (469)
7 TIGR01233 lacG 6-phospho-beta- 100.0 1.6E-73 3.5E-78 551.8 20.9 216 27-258 2-217 (467)
8 PRK09593 arb 6-phospho-beta-gl 100.0 4.7E-73 1E-77 549.7 20.8 219 26-258 3-241 (478)
9 PRK09589 celA 6-phospho-beta-g 100.0 8.4E-73 1.8E-77 547.8 20.8 222 28-260 3-246 (476)
10 PRK15014 6-phospho-beta-glucos 100.0 5E-72 1.1E-76 542.3 21.7 220 25-258 2-242 (477)
11 PF00232 Glyco_hydro_1: Glycos 100.0 5E-72 1.1E-76 540.1 16.1 216 27-257 3-222 (455)
12 PRK09852 cryptic 6-phospho-bet 100.0 2.1E-70 4.6E-75 530.4 20.8 217 28-258 3-239 (474)
13 TIGR03356 BGL beta-galactosida 100.0 1.9E-68 4.1E-73 511.7 20.4 214 29-257 1-217 (427)
14 PF00150 Cellulase: Cellulase 99.4 5.4E-13 1.2E-17 118.9 8.8 109 80-191 22-134 (281)
15 PF02449 Glyco_hydro_42: Beta- 99.4 2.7E-13 5.9E-18 128.0 7.2 108 79-190 10-140 (374)
16 smart00633 Glyco_10 Glycosyl h 98.9 2.7E-09 5.7E-14 96.0 8.0 83 101-191 2-86 (254)
17 COG1874 LacA Beta-galactosidas 98.5 1.3E-07 2.8E-12 95.5 7.1 113 80-196 31-170 (673)
18 COG2730 BglC Endoglucanase [Ca 98.4 1E-06 2.2E-11 84.6 8.6 109 82-190 76-193 (407)
19 PF01229 Glyco_hydro_39: Glyco 98.3 2E-06 4.3E-11 84.4 8.5 107 80-191 40-168 (486)
20 PF00331 Glyco_hydro_10: Glyco 98.0 2E-05 4.4E-10 73.4 8.9 125 29-191 6-137 (320)
21 PF01301 Glyco_hydro_35: Glyco 97.7 9.2E-05 2E-09 69.1 7.2 109 80-189 25-151 (319)
22 PLN02803 beta-amylase 97.3 0.00058 1.3E-08 67.2 6.8 108 78-190 106-252 (548)
23 PLN02161 beta-amylase 97.3 0.0007 1.5E-08 66.3 7.1 111 75-190 113-262 (531)
24 PF13204 DUF4038: Protein of u 97.2 0.0025 5.4E-08 58.7 10.0 99 85-188 36-156 (289)
25 PF07745 Glyco_hydro_53: Glyco 97.2 0.0051 1.1E-07 57.9 12.1 103 82-191 27-138 (332)
26 PF01373 Glyco_hydro_14: Glyco 97.2 0.00059 1.3E-08 65.4 5.7 106 78-189 15-151 (402)
27 PLN00197 beta-amylase; Provisi 97.2 0.0011 2.3E-08 65.6 7.5 108 78-190 126-272 (573)
28 PLN03059 beta-galactosidase; P 97.1 0.0028 6E-08 65.8 10.2 109 79-189 59-188 (840)
29 COG3693 XynA Beta-1,4-xylanase 97.1 0.0013 2.7E-08 61.4 6.8 96 89-190 56-153 (345)
30 PLN02801 beta-amylase 97.0 0.0034 7.4E-08 61.6 8.6 100 77-179 35-173 (517)
31 PLN02905 beta-amylase 96.9 0.0034 7.5E-08 62.9 8.2 100 76-178 283-421 (702)
32 PLN02705 beta-amylase 96.9 0.0033 7.1E-08 62.9 8.0 98 78-178 267-403 (681)
33 PF02836 Glyco_hydro_2_C: Glyc 96.6 0.0093 2E-07 54.6 8.6 95 77-190 34-134 (298)
34 PF14587 Glyco_hydr_30_2: O-Gl 96.6 0.0049 1.1E-07 58.9 6.7 80 110-190 95-185 (384)
35 PRK10150 beta-D-glucuronidase; 95.8 0.031 6.8E-07 56.3 8.1 93 80-189 314-419 (604)
36 PF14488 DUF4434: Domain of un 95.8 0.069 1.5E-06 45.4 8.8 104 79-190 20-132 (166)
37 KOG0496 Beta-galactosidase [Ca 95.3 0.075 1.6E-06 53.7 8.3 109 79-189 49-176 (649)
38 PRK09525 lacZ beta-D-galactosi 93.6 0.23 5.1E-06 53.3 8.1 90 78-189 370-464 (1027)
39 PF07488 Glyco_hydro_67M: Glyc 93.3 0.56 1.2E-05 43.7 8.9 88 78-178 56-151 (328)
40 PRK10340 ebgA cryptic beta-D-g 92.7 0.42 9.1E-06 51.4 8.3 90 78-189 354-451 (1021)
41 COG3250 LacZ Beta-galactosidas 92.3 0.75 1.6E-05 48.3 9.3 90 75-189 317-408 (808)
42 COG3934 Endo-beta-mannanase [C 91.6 0.064 1.4E-06 52.6 0.6 109 81-190 28-150 (587)
43 COG3664 XynB Beta-xylosidase [ 91.3 0.43 9.3E-06 46.1 5.8 100 88-192 14-118 (428)
44 COG3867 Arabinogalactan endo-1 91.2 5.8 0.00013 37.3 12.7 137 25-189 31-182 (403)
45 smart00642 Aamy Alpha-amylase 90.9 0.54 1.2E-05 39.7 5.5 64 77-140 17-91 (166)
46 KOG2233 Alpha-N-acetylglucosam 87.7 5 0.00011 39.9 10.1 112 78-189 77-249 (666)
47 PF03198 Glyco_hydro_72: Gluca 85.8 3 6.6E-05 39.0 7.2 89 80-187 54-144 (314)
48 PLN02361 alpha-amylase 81.9 2.4 5.2E-05 41.0 5.1 64 76-139 26-96 (401)
49 PF00128 Alpha-amylase: Alpha 81.6 1 2.3E-05 39.8 2.3 60 81-140 6-73 (316)
50 cd06593 GH31_xylosidase_YicI Y 81.5 8.6 0.00019 35.3 8.4 106 81-189 26-161 (308)
51 PF02638 DUF187: Glycosyl hydr 81.4 6.3 0.00014 36.7 7.5 98 79-176 19-154 (311)
52 PLN00196 alpha-amylase; Provis 81.0 2.6 5.6E-05 41.1 5.0 63 77-139 42-112 (428)
53 PF10566 Glyco_hydro_97: Glyco 80.4 7.2 0.00016 35.9 7.4 119 55-174 9-149 (273)
54 PF14871 GHL6: Hypothetical gl 78.9 6.2 0.00013 32.2 5.9 53 84-140 5-65 (132)
55 cd06601 GH31_lyase_GLase GLase 77.6 9.1 0.0002 36.0 7.4 97 93-193 40-139 (332)
56 PRK05799 coproporphyrinogen II 77.0 6.7 0.00015 37.0 6.4 97 81-192 98-196 (374)
57 TIGR01210 conserved hypothetic 76.6 6.8 0.00015 36.4 6.2 111 82-203 117-229 (313)
58 PRK05402 glycogen branching en 76.5 13 0.00029 38.5 8.9 99 78-176 264-397 (726)
59 PF12876 Cellulase-like: Sugar 76.4 1.3 2.8E-05 33.2 1.1 19 171-189 1-22 (88)
60 cd07945 DRE_TIM_CMS Leptospira 76.2 6.4 0.00014 36.1 5.8 109 80-191 75-205 (280)
61 cd06592 GH31_glucosidase_KIAA1 75.2 15 0.00032 33.9 8.0 106 81-189 32-167 (303)
62 PRK14041 oxaloacetate decarbox 75.2 8.5 0.00018 38.1 6.7 56 77-145 88-148 (467)
63 cd03174 DRE_TIM_metallolyase D 75.0 7.4 0.00016 34.4 5.8 107 82-191 77-204 (265)
64 PRK14705 glycogen branching en 74.8 17 0.00036 40.2 9.3 95 81-176 767-897 (1224)
65 cd06602 GH31_MGAM_SI_GAA This 74.5 12 0.00026 35.1 7.3 107 82-191 27-169 (339)
66 PRK14040 oxaloacetate decarbox 74.2 9.1 0.0002 38.9 6.8 98 77-191 90-212 (593)
67 cd06600 GH31_MGAM-like This fa 73.8 24 0.00053 32.7 9.1 106 83-191 28-164 (317)
68 PRK05692 hydroxymethylglutaryl 73.2 10 0.00022 34.9 6.3 108 80-190 80-212 (287)
69 PRK09441 cytoplasmic alpha-amy 73.2 4.4 9.5E-05 39.8 4.2 64 76-139 19-101 (479)
70 cd06603 GH31_GANC_GANAB_alpha 73.2 18 0.0004 33.8 8.2 110 81-192 26-167 (339)
71 cd06598 GH31_transferase_CtsZ 72.8 25 0.00054 32.6 8.9 108 82-191 27-168 (317)
72 PRK12858 tagatose 1,6-diphosph 71.9 14 0.0003 35.1 7.0 53 85-140 112-164 (340)
73 PRK12399 tagatose 1,6-diphosph 70.6 18 0.00039 34.1 7.3 58 85-145 111-168 (324)
74 PRK04161 tagatose 1,6-diphosph 70.6 18 0.00039 34.1 7.3 59 84-145 112-170 (329)
75 PRK12568 glycogen branching en 70.5 22 0.00047 37.2 8.6 98 78-176 268-401 (730)
76 TIGR02402 trehalose_TreZ malto 70.5 21 0.00046 35.8 8.3 98 77-176 109-237 (542)
77 PLN02746 hydroxymethylglutaryl 70.2 13 0.00028 35.4 6.4 107 81-190 123-254 (347)
78 PRK12313 glycogen branching en 70.1 24 0.00053 36.0 8.9 99 78-176 169-302 (633)
79 TIGR01515 branching_enzym alph 69.5 31 0.00066 35.2 9.4 99 78-176 155-288 (613)
80 cd07939 DRE_TIM_NifV Streptomy 69.3 5.5 0.00012 35.8 3.6 107 81-191 71-196 (259)
81 TIGR02090 LEU1_arch isopropylm 69.3 11 0.00024 35.7 5.9 106 81-191 73-198 (363)
82 cd07937 DRE_TIM_PC_TC_5S Pyruv 69.2 22 0.00047 32.4 7.5 94 81-191 93-206 (275)
83 COG1501 Alpha-glucosidases, fa 68.7 14 0.0003 38.8 6.9 101 90-193 293-421 (772)
84 TIGR02403 trehalose_treC alpha 68.6 6.8 0.00015 39.3 4.4 64 77-140 25-96 (543)
85 cd06591 GH31_xylosidase_XylS X 68.4 32 0.0007 31.9 8.6 108 81-191 26-163 (319)
86 PLN02784 alpha-amylase 67.9 9.5 0.00021 40.5 5.4 64 76-139 518-588 (894)
87 PRK12581 oxaloacetate decarbox 67.6 15 0.00032 36.4 6.4 56 77-145 98-158 (468)
88 PRK07379 coproporphyrinogen II 66.7 14 0.00031 35.4 6.0 104 81-200 114-220 (400)
89 cd06543 GH18_PF-ChiA-like PF-C 66.4 26 0.00057 32.4 7.5 59 86-144 19-79 (294)
90 cd07948 DRE_TIM_HCS Saccharomy 65.9 7.7 0.00017 35.2 3.9 65 81-145 73-139 (262)
91 cd07944 DRE_TIM_HOA_like 4-hyd 65.7 20 0.00043 32.5 6.5 94 82-191 85-197 (266)
92 cd06599 GH31_glycosidase_Aec37 65.3 39 0.00084 31.3 8.5 108 82-190 32-171 (317)
93 TIGR00433 bioB biotin syntheta 65.2 14 0.00029 33.5 5.3 55 81-138 122-177 (296)
94 PRK03705 glycogen debranching 64.6 11 0.00025 38.7 5.2 91 85-176 185-329 (658)
95 cd07943 DRE_TIM_HOA 4-hydroxy- 64.5 39 0.00084 30.3 8.2 93 82-191 88-199 (263)
96 PRK09058 coproporphyrinogen II 64.3 22 0.00048 34.7 6.9 106 81-202 162-270 (449)
97 PRK12331 oxaloacetate decarbox 64.0 23 0.0005 34.8 7.0 93 82-191 99-211 (448)
98 TIGR00612 ispG_gcpE 1-hydroxy- 63.5 54 0.0012 31.2 9.0 85 73-171 76-160 (346)
99 PF05089 NAGLU: Alpha-N-acetyl 63.1 39 0.00084 32.0 8.0 110 78-189 18-185 (333)
100 TIGR00539 hemN_rel putative ox 62.7 19 0.00041 33.9 6.0 92 81-189 99-194 (360)
101 TIGR03217 4OH_2_O_val_ald 4-hy 62.6 58 0.0013 30.6 9.2 94 81-190 89-201 (333)
102 PRK05628 coproporphyrinogen II 62.4 22 0.00047 33.6 6.4 94 81-192 107-205 (375)
103 TIGR01108 oadA oxaloacetate de 62.2 22 0.00049 36.1 6.7 93 82-191 94-206 (582)
104 PRK10785 maltodextrin glucosid 62.0 11 0.00024 38.2 4.6 58 81-140 181-247 (598)
105 PRK06294 coproporphyrinogen II 61.3 26 0.00056 33.3 6.6 94 81-191 102-199 (370)
106 PRK14510 putative bifunctional 61.0 11 0.00025 41.5 4.7 57 83-139 191-267 (1221)
107 KOG1065 Maltase glucoamylase a 60.6 37 0.0008 35.8 8.0 103 85-193 317-454 (805)
108 COG1523 PulA Type II secretory 60.1 15 0.00033 38.1 5.1 62 78-139 197-285 (697)
109 TIGR01232 lacD tagatose 1,6-di 60.0 30 0.00066 32.6 6.7 59 84-145 111-169 (325)
110 PRK10933 trehalose-6-phosphate 59.5 21 0.00045 35.9 5.9 64 77-140 31-102 (551)
111 COG2100 Predicted Fe-S oxidore 58.9 33 0.00071 32.7 6.6 80 77-172 199-284 (414)
112 COG3589 Uncharacterized conser 58.6 33 0.00071 32.7 6.6 69 85-167 22-90 (360)
113 PRK08446 coproporphyrinogen II 58.6 27 0.00059 32.8 6.3 92 82-189 98-192 (350)
114 cd07938 DRE_TIM_HMGL 3-hydroxy 58.4 32 0.00069 31.3 6.5 108 81-191 75-207 (274)
115 TIGR02660 nifV_homocitr homoci 57.9 24 0.00052 33.5 5.8 107 81-191 74-199 (365)
116 PRK14511 maltooligosyl trehalo 57.3 32 0.00069 36.7 7.0 63 78-140 19-90 (879)
117 TIGR03581 EF_0839 conserved hy 57.1 27 0.00058 31.3 5.4 74 78-163 134-229 (236)
118 PLN02447 1,4-alpha-glucan-bran 57.0 15 0.00032 38.6 4.4 100 77-176 248-383 (758)
119 PF03659 Glyco_hydro_71: Glyco 56.8 46 0.00099 32.1 7.5 52 79-140 17-68 (386)
120 TIGR03234 OH-pyruv-isom hydrox 56.7 25 0.00053 30.9 5.4 65 78-145 83-150 (254)
121 PRK08599 coproporphyrinogen II 56.4 31 0.00067 32.6 6.3 95 81-192 99-197 (377)
122 PRK08195 4-hyroxy-2-oxovalerat 56.4 75 0.0016 29.9 8.8 93 82-190 91-202 (337)
123 PF12891 Glyco_hydro_44: Glyco 56.0 33 0.00071 31.0 6.0 106 119-248 24-174 (239)
124 PF10566 Glyco_hydro_97: Glyco 55.1 17 0.00038 33.4 4.2 70 81-161 108-177 (273)
125 PRK09505 malS alpha-amylase; R 55.0 25 0.00054 36.5 5.8 60 81-140 232-313 (683)
126 PRK11858 aksA trans-homoaconit 54.8 30 0.00066 33.0 6.0 106 82-191 78-202 (378)
127 TIGR02456 treS_nterm trehalose 54.6 27 0.00058 34.9 5.8 62 78-139 27-96 (539)
128 cd06545 GH18_3CO4_chitinase Th 54.5 32 0.00068 30.6 5.7 78 99-177 27-130 (253)
129 PF01055 Glyco_hydro_31: Glyco 53.5 49 0.0011 31.8 7.2 108 81-191 45-184 (441)
130 TIGR02100 glgX_debranch glycog 53.3 25 0.00054 36.5 5.4 56 85-140 190-266 (688)
131 PRK00366 ispG 4-hydroxy-3-meth 52.6 73 0.0016 30.5 7.9 72 89-171 98-169 (360)
132 TIGR02629 L_rham_iso_rhiz L-rh 51.6 78 0.0017 30.9 8.1 89 81-183 72-171 (412)
133 PRK14706 glycogen branching en 51.3 96 0.0021 31.9 9.2 90 86-176 175-299 (639)
134 PRK08208 coproporphyrinogen II 51.3 30 0.00066 33.5 5.4 61 81-145 140-203 (430)
135 cd06595 GH31_xylosidase_XylS-l 50.9 1.1E+02 0.0024 27.9 8.8 109 82-191 28-163 (292)
136 TIGR01212 radical SAM protein, 50.8 34 0.00073 31.6 5.4 107 82-202 123-233 (302)
137 cd07940 DRE_TIM_IPMS 2-isoprop 49.4 38 0.00083 30.4 5.5 107 82-191 72-203 (268)
138 COG0821 gcpE 1-hydroxy-2-methy 49.0 1.3E+02 0.0029 28.7 8.9 72 88-171 91-162 (361)
139 TIGR03471 HpnJ hopanoid biosyn 49.0 45 0.00098 32.5 6.3 56 82-140 287-344 (472)
140 cd06604 GH31_glucosidase_II_Ma 48.5 1E+02 0.0022 28.7 8.4 106 82-191 27-163 (339)
141 COG0366 AmyA Glycosidases [Car 48.3 25 0.00054 33.7 4.3 57 83-139 33-97 (505)
142 TIGR02401 trehalose_TreY malto 47.3 36 0.00078 36.1 5.5 66 78-143 15-91 (825)
143 PRK13347 coproporphyrinogen II 47.1 36 0.00078 33.2 5.2 78 81-173 151-230 (453)
144 PRK14507 putative bifunctional 46.5 54 0.0012 37.6 7.0 64 80-143 759-833 (1693)
145 cd06589 GH31 The enzymes of gl 46.3 2.2E+02 0.0047 25.5 10.9 91 81-191 26-120 (265)
146 cd07947 DRE_TIM_Re_CS Clostrid 44.9 39 0.00085 31.0 4.9 59 81-139 76-135 (279)
147 PF01120 Alpha_L_fucos: Alpha- 44.6 67 0.0014 30.2 6.5 59 85-143 97-162 (346)
148 cd02874 GH18_CFLE_spore_hydrol 44.0 71 0.0015 29.1 6.5 87 85-176 16-134 (313)
149 PRK12330 oxaloacetate decarbox 43.8 71 0.0015 32.0 6.7 96 82-191 100-214 (499)
150 PRK12677 xylose isomerase; Pro 43.5 1.4E+02 0.0031 28.6 8.6 72 80-158 32-104 (384)
151 PRK09936 hypothetical protein; 43.0 1.9E+02 0.0041 27.0 8.9 98 81-189 40-145 (296)
152 cd06542 GH18_EndoS-like Endo-b 42.7 68 0.0015 28.3 5.9 18 160-177 126-143 (255)
153 cd07941 DRE_TIM_LeuA3 Desulfob 42.6 31 0.00068 31.2 3.8 107 82-191 81-209 (273)
154 PLN02960 alpha-amylase 41.5 34 0.00074 36.5 4.3 99 77-176 414-549 (897)
155 PLN02389 biotin synthase 41.5 62 0.0013 31.1 5.8 57 80-139 176-233 (379)
156 PRK10426 alpha-glucosidase; Pr 41.5 2.2E+02 0.0048 29.3 10.1 105 81-188 223-364 (635)
157 PF00682 HMGL-like: HMGL-like 41.4 61 0.0013 28.2 5.4 106 82-190 66-194 (237)
158 PRK09249 coproporphyrinogen II 41.0 50 0.0011 32.2 5.2 76 81-173 150-229 (453)
159 cd03130 GATase1_CobB Type 1 gl 40.8 67 0.0015 27.6 5.4 70 72-149 6-89 (198)
160 PRK05660 HemN family oxidoredu 40.4 65 0.0014 30.6 5.8 94 81-190 106-202 (378)
161 PF04914 DltD_C: DltD C-termin 39.9 76 0.0017 25.8 5.3 57 117-177 34-90 (130)
162 PF03511 Fanconi_A: Fanconi an 39.4 26 0.00055 25.2 2.1 38 103-142 19-56 (64)
163 PRK09282 pyruvate carboxylase 39.3 96 0.0021 31.7 7.0 94 81-191 98-211 (592)
164 smart00729 Elp3 Elongator prot 39.1 1.6E+02 0.0034 23.9 7.3 58 79-139 97-157 (216)
165 PF11790 Glyco_hydro_cc: Glyco 38.4 25 0.00054 31.3 2.4 24 166-190 54-77 (239)
166 TIGR00538 hemN oxygen-independ 38.4 27 0.00059 34.0 2.9 50 81-134 150-202 (455)
167 PRK05904 coproporphyrinogen II 38.0 81 0.0017 29.8 5.9 92 81-190 102-198 (353)
168 PF13812 PPR_3: Pentatricopept 37.9 27 0.00058 20.2 1.8 15 121-135 20-34 (34)
169 TIGR00542 hxl6Piso_put hexulos 37.7 43 0.00093 29.9 3.9 59 79-139 94-153 (279)
170 PRK09856 fructoselysine 3-epim 37.6 36 0.00078 30.2 3.4 60 78-138 89-148 (275)
171 PRK10658 putative alpha-glucos 37.6 2E+02 0.0043 29.8 9.1 103 85-189 289-420 (665)
172 TIGR01211 ELP3 histone acetylt 36.9 62 0.0013 32.5 5.2 104 81-200 205-313 (522)
173 TIGR02104 pulA_typeI pullulana 36.3 76 0.0016 32.2 5.8 59 118-176 228-312 (605)
174 COG3661 AguA Alpha-glucuronida 35.9 1.7E+02 0.0037 29.2 7.7 90 79-177 183-279 (684)
175 PRK13398 3-deoxy-7-phosphohept 35.8 1E+02 0.0022 28.1 6.0 73 74-149 36-108 (266)
176 PRK01060 endonuclease IV; Prov 35.1 1.2E+02 0.0026 26.9 6.3 52 79-135 12-63 (281)
177 TIGR02631 xylA_Arthro xylose i 34.9 2.9E+02 0.0062 26.5 9.2 57 77-137 30-86 (382)
178 TIGR02635 RhaI_grampos L-rhamn 34.7 1.9E+02 0.0042 27.8 8.0 87 81-182 42-136 (378)
179 cd06525 GH25_Lyc-like Lyc mura 34.0 2.8E+02 0.006 23.2 8.2 30 154-183 95-124 (184)
180 PRK11572 copper homeostasis pr 33.6 94 0.002 28.3 5.3 41 79-128 73-113 (248)
181 cd02803 OYE_like_FMN_family Ol 33.3 73 0.0016 29.2 4.8 39 104-142 62-100 (327)
182 PRK05692 hydroxymethylglutaryl 32.6 3.1E+02 0.0066 25.1 8.7 52 80-136 121-172 (287)
183 TIGR03551 F420_cofH 7,8-dideme 32.3 1.6E+02 0.0034 27.5 6.9 90 81-186 140-234 (343)
184 PRK13209 L-xylulose 5-phosphat 32.3 3.6E+02 0.0077 23.8 9.1 53 80-136 22-74 (283)
185 PRK08207 coproporphyrinogen II 32.2 44 0.00096 33.2 3.3 60 81-145 268-331 (488)
186 PF01261 AP_endonuc_2: Xylose 32.1 38 0.00082 28.0 2.4 61 78-138 70-131 (213)
187 cd08576 GDPD_like_SMaseD_PLD G 31.9 3.5E+02 0.0076 24.8 8.8 103 82-190 10-126 (265)
188 PRK07094 biotin synthase; Prov 31.9 74 0.0016 29.2 4.5 57 80-139 127-185 (323)
189 TIGR00423 radical SAM domain p 31.8 1.6E+02 0.0036 26.9 6.8 54 81-140 106-166 (309)
190 PRK06256 biotin synthase; Vali 31.3 67 0.0014 29.7 4.2 57 80-139 150-207 (336)
191 cd00019 AP2Ec AP endonuclease 31.1 2.1E+02 0.0046 25.4 7.3 25 79-103 10-34 (279)
192 TIGR02159 PA_CoA_Oxy4 phenylac 31.0 57 0.0012 27.1 3.3 56 73-137 36-92 (146)
193 PF04055 Radical_SAM: Radical 30.8 56 0.0012 25.4 3.1 53 82-136 90-144 (166)
194 PF09713 A_thal_3526: Plant pr 30.7 63 0.0014 22.5 2.9 35 122-167 16-51 (54)
195 cd07938 DRE_TIM_HMGL 3-hydroxy 30.7 3.7E+02 0.0081 24.3 8.9 51 81-136 116-166 (274)
196 TIGR03699 mena_SCO4550 menaqui 30.2 2.3E+02 0.005 26.2 7.6 53 81-139 142-201 (340)
197 PRK03892 ribonuclease P protei 29.8 1.6E+02 0.0034 26.3 5.9 87 87-174 122-215 (216)
198 TIGR01589 A_thal_3526 uncharac 29.5 85 0.0019 22.1 3.4 36 122-168 19-55 (57)
199 PTZ00445 p36-lilke protein; Pr 29.3 86 0.0019 28.0 4.2 56 85-140 35-99 (219)
200 PF02057 Glyco_hydro_59: Glyco 29.3 1.1E+02 0.0025 31.7 5.6 65 123-190 116-185 (669)
201 smart00052 EAL Putative diguan 29.0 90 0.002 26.4 4.4 62 72-138 148-209 (241)
202 PF04646 DUF604: Protein of un 29.0 26 0.00057 31.9 1.0 72 127-201 76-147 (255)
203 cd01335 Radical_SAM Radical SA 28.9 71 0.0015 25.5 3.5 59 81-140 87-146 (204)
204 PRK09057 coproporphyrinogen II 28.2 1.5E+02 0.0033 28.1 6.1 96 81-192 103-200 (380)
205 PF04028 DUF374: Domain of unk 27.9 1.3E+02 0.0028 22.1 4.3 40 86-139 27-66 (74)
206 cd06594 GH31_glucosidase_YihQ 27.8 4.4E+02 0.0095 24.4 9.0 34 155-189 135-168 (317)
207 TIGR00542 hxl6Piso_put hexulos 27.7 4.3E+02 0.0093 23.4 9.1 54 79-136 16-69 (279)
208 TIGR00587 nfo apurinic endonuc 27.5 1.7E+02 0.0037 26.3 6.0 50 81-135 13-62 (274)
209 TIGR02102 pullulan_Gpos pullul 27.2 2.7E+02 0.0058 30.9 8.3 100 77-176 478-635 (1111)
210 PRK09997 hydroxypyruvate isome 27.0 1.4E+02 0.003 26.3 5.3 62 81-145 87-151 (258)
211 PF02065 Melibiase: Melibiase; 26.6 2.4E+02 0.0052 27.3 7.1 140 81-247 60-228 (394)
212 PRK13210 putative L-xylulose 5 26.0 81 0.0018 27.9 3.6 59 79-139 94-153 (284)
213 PRK09389 (R)-citramalate synth 25.9 80 0.0017 31.4 3.8 107 81-191 75-200 (488)
214 COG5520 O-Glycosyl hydrolase [ 25.8 1.8E+02 0.0039 28.2 5.9 94 90-191 77-181 (433)
215 PTZ00445 p36-lilke protein; Pr 25.4 1.2E+02 0.0025 27.2 4.3 67 119-190 29-104 (219)
216 PRK06582 coproporphyrinogen II 25.4 2.8E+02 0.0061 26.5 7.4 94 82-191 111-206 (390)
217 PLN02925 4-hydroxy-3-methylbut 25.4 2.2E+02 0.0047 29.9 6.8 52 120-172 211-262 (733)
218 PF13200 DUF4015: Putative gly 25.2 2.3E+02 0.0049 26.7 6.5 95 79-174 13-136 (316)
219 cd00927 Cyt_c_Oxidase_VIc Cyto 25.2 31 0.00068 25.3 0.6 19 75-93 46-66 (70)
220 PRK09967 putative outer membra 25.0 2.5E+02 0.0053 23.5 6.2 64 80-143 26-93 (160)
221 TIGR00674 dapA dihydrodipicoli 24.8 3.7E+02 0.0081 24.2 7.8 67 110-190 13-79 (285)
222 cd01948 EAL EAL domain. This d 24.6 1E+02 0.0022 26.1 3.8 63 72-139 147-209 (240)
223 PF04007 DUF354: Protein of un 24.5 88 0.0019 29.5 3.7 39 114-152 9-47 (335)
224 TIGR00676 fadh2 5,10-methylene 24.1 2.8E+02 0.0061 25.0 6.9 74 118-191 169-264 (272)
225 PRK13210 putative L-xylulose 5 24.0 2.8E+02 0.0061 24.4 6.8 54 80-137 17-70 (284)
226 PRK08508 biotin synthase; Prov 23.8 1.4E+02 0.0031 27.0 4.9 56 81-139 101-157 (279)
227 KOG0470 1,4-alpha-glucan branc 23.8 1E+02 0.0022 32.3 4.2 64 78-141 253-333 (757)
228 PRK12755 phospho-2-dehydro-3-d 23.6 2.1E+02 0.0046 27.4 6.0 73 78-150 67-156 (353)
229 COG0138 PurH AICAR transformyl 23.2 93 0.002 31.1 3.6 50 73-138 459-508 (515)
230 PF02156 Glyco_hydro_26: Glyco 22.7 1.6E+02 0.0035 27.5 5.0 52 118-177 136-192 (311)
231 COG3684 LacD Tagatose-1,6-bisp 22.5 1.2E+02 0.0025 28.2 3.8 56 84-143 116-171 (306)
232 cd06565 GH20_GcnA-like Glycosy 22.4 2.8E+02 0.006 25.5 6.5 56 81-140 19-81 (301)
233 PRK13209 L-xylulose 5-phosphat 22.1 98 0.0021 27.5 3.4 56 79-139 99-158 (283)
234 COG5016 Pyruvate/oxaloacetate 22.1 3E+02 0.0065 27.2 6.7 51 77-140 91-146 (472)
235 COG0635 HemN Coproporphyrinoge 22.0 2.9E+02 0.0062 26.8 6.8 101 82-199 137-241 (416)
236 PRK07106 5-aminoimidazole-4-ca 22.0 95 0.0021 30.1 3.3 50 73-138 334-383 (390)
237 PRK09432 metF 5,10-methylenete 21.6 3E+02 0.0064 25.4 6.5 74 119-192 189-284 (296)
238 TIGR02026 BchE magnesium-proto 21.5 1.4E+02 0.0031 29.4 4.7 61 81-145 286-348 (497)
239 TIGR02351 thiH thiazole biosyn 21.4 5E+02 0.011 24.5 8.2 101 77-191 157-262 (366)
240 PF02055 Glyco_hydro_30: O-Gly 21.1 1.1E+02 0.0024 30.6 3.7 101 89-191 110-236 (496)
241 COG1453 Predicted oxidoreducta 21.0 2.3E+02 0.0049 27.5 5.6 87 88-185 13-117 (391)
242 TIGR01856 hisJ_fam histidinol 21.0 2.6E+02 0.0056 24.8 5.9 61 119-181 15-78 (253)
243 PF03932 CutC: CutC family; I 20.8 1.4E+02 0.003 26.2 4.0 41 80-129 73-113 (201)
244 cd00950 DHDPS Dihydrodipicolin 20.8 4.7E+02 0.01 23.3 7.6 67 110-190 15-81 (284)
245 PF06415 iPGM_N: BPG-independe 20.8 2.2E+02 0.0049 25.4 5.3 66 69-140 37-102 (223)
246 cd01990 Alpha_ANH_like_I This 20.6 2.9E+02 0.0064 23.2 5.9 57 87-143 47-106 (202)
247 PF14417 MEDS: MEDS: MEthanoge 20.6 2.2E+02 0.0048 24.1 5.1 58 78-144 104-163 (191)
248 PF06777 DUF1227: Protein of u 20.5 2E+02 0.0044 24.0 4.7 68 119-188 16-86 (146)
249 cd04501 SGNH_hydrolase_like_4 20.3 2.6E+02 0.0056 22.7 5.4 59 115-176 79-138 (183)
250 cd06522 GH25_AtlA-like AtlA is 20.3 1.8E+02 0.0039 24.7 4.5 15 80-94 43-57 (192)
251 PLN03153 hypothetical protein; 20.3 1.1E+02 0.0025 30.8 3.6 38 128-168 327-365 (537)
252 cd04733 OYE_like_2_FMN Old yel 20.2 2.7E+02 0.0058 25.9 6.0 40 103-142 63-105 (338)
253 cd06562 GH20_HexA_HexB-like Be 20.2 2E+02 0.0043 27.1 5.1 63 81-146 20-94 (348)
254 PRK09856 fructoselysine 3-epim 20.1 3.1E+02 0.0067 24.1 6.2 52 80-137 14-65 (275)
255 PF03129 HGTP_anticodon: Antic 20.1 1.2E+02 0.0025 22.2 2.9 27 114-140 11-37 (94)
No 1
>KOG0626 consensus Beta-glucosidase, lactase phlorizinhydrolase, and related proteins [Carbohydrate transport and metabolism]
Probab=100.00 E-value=2.5e-81 Score=600.72 Aligned_cols=234 Identities=62% Similarity=1.103 Sum_probs=224.6
Q ss_pred cccCCCCCCCeeeeecccccccCCCCCCCCCCccccccccc-C-CCC--CCCCcCCccccccHHHHHHHHHcCCCcceec
Q 024821 24 YTKNDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHA-G-NVP--GTGDVACDEYHKYKEDVKLMADTGLDAYRFS 99 (262)
Q Consensus 24 ~~~~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~-~-~~~--~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~s 99 (262)
+++..||++|+||+||||||+|||+++|||++|+||.|++. + ++. .++++|||+||||+|||+|||+||+++||||
T Consensus 32 ~~r~~FP~~F~FGtAtSAyQ~EGA~~e~gRg~svWD~f~~~~p~~~~~~~ngdva~D~Yh~ykeDv~Lmk~lgv~afRFS 111 (524)
T KOG0626|consen 32 FSRADFPKGFLFGTATSAYQVEGAANEDGRGPSVWDTFTHKYPGKICDGSNGDVAVDFYHRYKEDVKLMKELGVDAFRFS 111 (524)
T ss_pred ccccCCCCCceeeccchHHHhhhhhccCCCCCchhhhhhccCCcccccCCCCCeechhhhhhHHHHHHHHHcCCCeEEEE
Confidence 56888999999999999999999999999999999999987 3 322 6789999999999999999999999999999
Q ss_pred cccCccccCCC--CCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCC
Q 024821 100 ISWSRLIPNGR--GPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGD 177 (262)
Q Consensus 100 i~WsRI~P~g~--g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd 177 (262)
||||||+|.|+ +.+|++|++||+++|++|+++||+|+|||+|||+|++|+++||||+|++++++|.+||+.||++|||
T Consensus 112 IsWSRIlP~G~~~~gVN~~Gi~fY~~LI~eL~~nGI~P~VTLfHwDlPq~LeDeYgGwLn~~ivedF~~yA~~CF~~fGD 191 (524)
T KOG0626|consen 112 ISWSRILPNGRLTGGVNEAGIQFYNNLIDELLANGIEPFVTLFHWDLPQALEDEYGGWLNPEIVEDFRDYADLCFQEFGD 191 (524)
T ss_pred eehHhhCCCCCcCCCcCHHHHHHHHHHHHHHHHcCCeEEEEEecCCCCHHHHHHhccccCHHHHHHHHHHHHHHHHHhcc
Confidence 99999999996 6799999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCceEEeccCCchhccccccccCCCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccceEeeeec
Q 024821 178 RVSYWTTVNEPNGFAMVGYDFGIAPPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKKISFMCFS 257 (262)
Q Consensus 178 ~V~~w~t~NEP~~~~~~gy~~g~~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G~ig~~~~ 257 (262)
+||+|+|||||++++..||..|..|||+|+....+|+.++|++++|.+.|||++|||+||++||+.++..|+|+|||+++
T Consensus 192 rVK~WiT~NEP~v~s~~gY~~G~~aPGrCs~~~~~c~~g~s~~epYiv~HNllLAHA~Av~~yr~kyk~~Q~G~IGi~~~ 271 (524)
T KOG0626|consen 192 RVKHWITFNEPNVFSIGGYDTGTKAPGRCSKYVGNCSAGNSGTEPYIVAHNLLLAHAAAVDLYRKKYKKKQGGKIGIALS 271 (524)
T ss_pred cceeeEEecccceeeeehhccCCCCCCCCCcccccCCCCCCCCCcchHHHHHHHHHHHHHHHHHHhhhhhcCCeEeEEEe
Confidence 99999999999999999999999999999987678999999999999999999999999999999999999999999986
No 2
>PLN02998 beta-glucosidase
Probab=100.00 E-value=8.2e-78 Score=584.70 Aligned_cols=239 Identities=72% Similarity=1.262 Sum_probs=215.7
Q ss_pred cccccccCCCCCCCeeeeecccccccCCCCCCCCCCcccccccccCCCC-CCCCcCCccccccHHHHHHHHHcCCCccee
Q 024821 20 TAVEYTKNDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHAGNVP-GTGDVACDEYHKYKEDVKLMADTGLDAYRF 98 (262)
Q Consensus 20 ~~~~~~~~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~~~~~-~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~ 98 (262)
.+..+.+.+||++|+||+||||||+||++++||||+|+||.+++.+... .++++||||||||+|||+|||+||+++|||
T Consensus 22 ~~~~~~~~~FP~~FlwG~AtSA~QvEGa~~~~Gkg~siwD~~~~~~~~~~~~~~~a~D~Yhry~EDi~lmk~lG~~~YRf 101 (497)
T PLN02998 22 SSLKYSRNDFPPGFVFGSGTSAYQVEGAADEDGRTPSIWDVFAHAGHSGVAAGNVACDQYHKYKEDVKLMADMGLEAYRF 101 (497)
T ss_pred ccccCccccCCCCCEEeeechHHHhCCCcCCCCCccchhhcccccCcCCCCCCcccccHHHhhHHHHHHHHHcCCCeEEe
Confidence 3444777889999999999999999999999999999999998754221 367889999999999999999999999999
Q ss_pred ccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCC
Q 024821 99 SISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDR 178 (262)
Q Consensus 99 si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~ 178 (262)
||+||||+|+|+|.+|++||+||+++|++|+++||+|+|||+|||+|+||+++||||.|+++++.|++||++|+++|||+
T Consensus 102 SIsWsRI~P~G~g~vN~~gl~~Y~~lid~L~~~GIeP~VTL~H~dlP~~L~~~yGGW~n~~~v~~F~~YA~~~~~~fgdr 181 (497)
T PLN02998 102 SISWSRLLPSGRGPINPKGLQYYNNLIDELITHGIQPHVTLHHFDLPQALEDEYGGWLSQEIVRDFTAYADTCFKEFGDR 181 (497)
T ss_pred eccHHhcCcCCCCCcCHHHHHHHHHHHHHHHHcCCceEEEecCCCCCHHHHHhhCCcCCchHHHHHHHHHHHHHHHhcCc
Confidence 99999999998889999999999999999999999999999999999999998999999999999999999999999999
Q ss_pred CceEEeccCCchhccccccccCCCCCCCCCC-CCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccceEeeeec
Q 024821 179 VSYWTTVNEPNGFAMVGYDFGIAPPKRCSPP-LNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKKISFMCFS 257 (262)
Q Consensus 179 V~~w~t~NEP~~~~~~gy~~g~~~p~~~~~~-~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G~ig~~~~ 257 (262)
|++|+|||||++++..||..|.+|||.+... +..|..+++.++.+|++||+++|||+|++++|+.++..+.|+||++++
T Consensus 182 Vk~WiT~NEP~~~~~~gy~~G~~~Pg~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~A~~~~~~~~~~~~~g~IGi~~~ 261 (497)
T PLN02998 182 VSHWTTINEVNVFALGGYDQGITPPARCSPPFGLNCTKGNSSIEPYIAVHNMLLAHASATILYKQQYKYKQHGSVGISVY 261 (497)
T ss_pred CCEEEEccCcchhhhcchhhcccCCCccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcEEEEEe
Confidence 9999999999999999999999999964321 123554455577899999999999999999999876566789999887
Q ss_pred c
Q 024821 258 I 258 (262)
Q Consensus 258 ~ 258 (262)
.
T Consensus 262 ~ 262 (497)
T PLN02998 262 T 262 (497)
T ss_pred C
Confidence 5
No 3
>PLN02849 beta-glucosidase
Probab=100.00 E-value=3e-77 Score=581.47 Aligned_cols=237 Identities=66% Similarity=1.184 Sum_probs=212.8
Q ss_pred ccccccCCCCCCCeeeeecccccccCCCCCCCCCCcccccccccCCCCCCCCcCCccccccHHHHHHHHHcCCCcceecc
Q 024821 21 AVEYTKNDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHAGNVPGTGDVACDEYHKYKEDVKLMADTGLDAYRFSI 100 (262)
Q Consensus 21 ~~~~~~~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~~~~~~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si 100 (262)
+..+.+.+||+||+||+||||||+||++++||||+|+||.+++.+.. .++++||||||||+|||+|||+||+++|||||
T Consensus 22 ~~~~~~~~FP~dFlwG~AtsA~QiEGa~~~~Gkg~SiwD~~~~~~~~-~~~~~a~D~YhrY~eDI~Lm~~lG~~aYRfSI 100 (503)
T PLN02849 22 SSDYSRSDFPEGFVFGAGTSAYQWEGAFDEDGRKPSVWDTFLHSRNM-SNGDIACDGYHKYKEDVKLMVETGLDAFRFSI 100 (503)
T ss_pred cCCCccccCCCCCEEEeechhhhhcCCcCCCCCcCcceeeeeccCCC-CCCCccccHHHhHHHHHHHHHHcCCCeEEEec
Confidence 35577788999999999999999999999999999999999875322 47788999999999999999999999999999
Q ss_pred ccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCc
Q 024821 101 SWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVS 180 (262)
Q Consensus 101 ~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~ 180 (262)
+||||+|+|.|++|++|++||+++||+|+++||+|+|||+|||+|+||+++||||.|++++++|++||+.|+++|||+|+
T Consensus 101 sWsRI~P~G~g~vN~~gl~fY~~lid~l~~~GI~P~VTL~H~dlP~~L~~~yGGW~nr~~v~~F~~YA~~~f~~fgDrVk 180 (503)
T PLN02849 101 SWSRLIPNGRGSVNPKGLQFYKNFIQELVKHGIEPHVTLFHYDHPQYLEDDYGGWINRRIIKDFTAYADVCFREFGNHVK 180 (503)
T ss_pred cHHhcCcCCCCCCCHHHHHHHHHHHHHHHHcCCeEEEeecCCCCcHHHHHhcCCcCCchHHHHHHHHHHHHHHHhcCcCC
Confidence 99999999888999999999999999999999999999999999999999899999999999999999999999999999
Q ss_pred eEEeccCCchhccccccccCCCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccceEeeeecc
Q 024821 181 YWTTVNEPNGFAMVGYDFGIAPPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKKISFMCFSI 258 (262)
Q Consensus 181 ~w~t~NEP~~~~~~gy~~g~~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G~ig~~~~~ 258 (262)
+|+|||||++++..||..|.+|||.+......|..+++.++.+|+.||+++|||+||+++|++++..+.|+||++++.
T Consensus 181 ~WiT~NEP~~~~~~gy~~G~~~Pg~~~~~~~~~~~~~~~~~~~~a~hn~llAHa~A~~~~~~~~~~~~~~~IGi~~~~ 258 (503)
T PLN02849 181 FWTTINEANIFTIGGYNDGITPPGRCSSPGRNCSSGNSSTEPYIVGHNLLLAHASVSRLYKQKYKDMQGGSIGFSLFA 258 (503)
T ss_pred EEEEecchhhhhhchhhhccCCCCccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCEEEEEEEC
Confidence 999999999999999999999999643110124334445678999999999999999999997654456788887753
No 4
>PLN02814 beta-glucosidase
Probab=100.00 E-value=2.4e-76 Score=575.32 Aligned_cols=233 Identities=63% Similarity=1.074 Sum_probs=209.7
Q ss_pred cccCCCCCCCeeeeecccccccCCCCCCCCCCcccccccccCCCCCCCCcCCccccccHHHHHHHHHcCCCcceeccccC
Q 024821 24 YTKNDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHAGNVPGTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWS 103 (262)
Q Consensus 24 ~~~~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~~~~~~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~Ws 103 (262)
+.+.+||++|+||+||||||+||++++||||+|+||.+++... ..++++||||||||+|||+|||+||+++|||||+||
T Consensus 23 ~~~~~fP~~FlwG~AtaA~QiEGa~~~~gkg~siwD~~~~~~~-~~~~~~a~D~Yhry~EDI~L~k~lG~~ayRfSIsWs 101 (504)
T PLN02814 23 FTRNDFPEDFLFGAATSAYQWEGAVDEDGRTPSVWDTTSHCYN-GGNGDIASDGYHKYKEDVKLMAEMGLESFRFSISWS 101 (504)
T ss_pred cccccCCCCCEEeeechhhhhcCCcCCCCCccchhheeeeccC-CCCCCccccHHHhhHHHHHHHHHcCCCEEEEeccHh
Confidence 6677899999999999999999999999999999999876311 147889999999999999999999999999999999
Q ss_pred ccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCceEE
Q 024821 104 RLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVSYWT 183 (262)
Q Consensus 104 RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~ 183 (262)
||+|+|+|++|++|++||+++||+|+++||+|+|||+|||+|+||+++||||.|+++++.|++||+.|+++|||+|++|+
T Consensus 102 RI~P~G~g~~N~~Gl~fY~~lId~l~~~GI~P~VTL~H~dlP~~L~~~yGGW~n~~~i~~F~~YA~~~f~~fgdrVk~Wi 181 (504)
T PLN02814 102 RLIPNGRGLINPKGLLFYKNLIKELRSHGIEPHVTLYHYDLPQSLEDEYGGWINRKIIEDFTAFADVCFREFGEDVKLWT 181 (504)
T ss_pred hcCcCCCCCCCHHHHHHHHHHHHHHHHcCCceEEEecCCCCCHHHHHhcCCcCChhHHHHHHHHHHHHHHHhCCcCCEEE
Confidence 99999989999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eccCCchhccccccccCCCCCCCCCC-CCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccceEeeeecc
Q 024821 184 TVNEPNGFAMVGYDFGIAPPKRCSPP-LNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKKISFMCFSI 258 (262)
Q Consensus 184 t~NEP~~~~~~gy~~g~~~p~~~~~~-~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G~ig~~~~~ 258 (262)
|||||++++..||..|.. ||.++.. ...|..+++.++.+|++||+++|||+||+++|+++...+.|+||+++++
T Consensus 182 T~NEP~~~~~~gy~~G~~-pg~~~~~~~~~~~~~~~~~~~~~a~hn~llAHa~Av~~~~~~~~~~~~g~IGi~~~~ 256 (504)
T PLN02814 182 TINEATIFAIGSYGQGIR-YGHCSPNKFINCSTGNSCTETYIAGHNMLLAHASASNLYKLKYKSKQRGSIGLSIFA 256 (504)
T ss_pred eccccchhhhcccccCcC-CCCCCcccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCeEEEEEeC
Confidence 999999999999998874 7754421 1234334455678999999999999999999998655567888888764
No 5
>COG2723 BglB Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase [Carbohydrate transport and metabolism]
Probab=100.00 E-value=2.7e-75 Score=554.77 Aligned_cols=224 Identities=45% Similarity=0.796 Sum_probs=208.0
Q ss_pred CCCCCCCeeeeecccccccCCCCCCCCCCcccccccccC---CCC--CCCCcCCccccccHHHHHHHHHcCCCcceeccc
Q 024821 27 NDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHAG---NVP--GTGDVACDEYHKYKEDVKLMADTGLDAYRFSIS 101 (262)
Q Consensus 27 ~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~~---~~~--~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~ 101 (262)
.+||++|+||+||||+|+||++++||||+|+||+|++.. +.. .+++.|+||||||+|||+|||+||+|+|||||+
T Consensus 2 ~~FPkdFlWG~AtAa~Q~EGa~~~dGkg~s~wD~~~~~~~~~~~~~~~~~~~a~d~YhrYkeDi~L~~emG~~~~R~SI~ 81 (460)
T COG2723 2 LKFPKDFLWGGATAAFQVEGAWNEDGKGPSDWDVWVHDEIPGRLVSGDPPEEASDFYHRYKEDIALAKEMGLNAFRTSIE 81 (460)
T ss_pred CCCCCCCeeecccccccccCCcCCCCCCCeeeeeeeccccCCcccCCCCCccccchhhhhHHHHHHHHHcCCCEEEeeee
Confidence 469999999999999999999999999999999999842 222 578899999999999999999999999999999
Q ss_pred cCccccCCCC-CCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCc
Q 024821 102 WSRLIPNGRG-PVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVS 180 (262)
Q Consensus 102 WsRI~P~g~g-~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~ 180 (262)
||||+|+|.+ ++|++||+||+++||+|+++||+|+|||+|||+|+||+++||||.|++++++|++||++||+||||+|+
T Consensus 82 WsRIfP~g~~~e~N~~gl~fY~~l~del~~~gIep~vTL~Hfd~P~~L~~~ygGW~nR~~i~~F~~ya~~vf~~f~dkVk 161 (460)
T COG2723 82 WSRIFPNGDGGEVNEKGLRFYDRLFDELKARGIEPFVTLYHFDLPLWLQKPYGGWENRETVDAFARYAATVFERFGDKVK 161 (460)
T ss_pred EEEeecCCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEEecccCCcHHHhhccCCccCHHHHHHHHHHHHHHHHHhcCcce
Confidence 9999999865 899999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eEEeccCCchhccccccccCCCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccc-eEeeeeccC
Q 024821 181 YWTTVNEPNGFAMVGYDFGIAPPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKK-ISFMCFSIP 259 (262)
Q Consensus 181 ~w~t~NEP~~~~~~gy~~g~~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G-~ig~~~~~p 259 (262)
+|+||||||+++..||+.|.+||+..+ .+.++|++||+++|||+|++++|+..|+.++| +++++..||
T Consensus 162 ~W~TFNE~n~~~~~~y~~~~~~p~~~~-----------~~~~~qa~hh~~lA~A~avk~~~~~~~~~kIG~~~~~~p~YP 230 (460)
T COG2723 162 YWFTFNEPNVVVELGYLYGGHPPGIVD-----------PKAAYQVAHHMLLAHALAVKAIKKINPKGKVGIILNLTPAYP 230 (460)
T ss_pred EEEEecchhhhhcccccccccCCCccC-----------HHHHHHHHHHHHHHHHHHHHHHHhhCCcCceEEEeccCcCCC
Confidence 999999999999999999999998654 36789999999999999999999998864555 557777888
Q ss_pred CC
Q 024821 260 YR 261 (262)
Q Consensus 260 ~~ 261 (262)
++
T Consensus 231 ~s 232 (460)
T COG2723 231 LS 232 (460)
T ss_pred CC
Confidence 75
No 6
>PRK13511 6-phospho-beta-galactosidase; Provisional
Probab=100.00 E-value=5.7e-74 Score=555.42 Aligned_cols=216 Identities=40% Similarity=0.736 Sum_probs=199.5
Q ss_pred CCCCCCCeeeeecccccccCCCCCCCCCCcccccccccCCCCCCCCcCCccccccHHHHHHHHHcCCCcceeccccCccc
Q 024821 27 NDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHAGNVPGTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWSRLI 106 (262)
Q Consensus 27 ~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~~~~~~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~ 106 (262)
.+||++|+||+||||||+||++++||||+|+||.+++.... .++++||||||||+|||+|||+||+++|||||+||||+
T Consensus 3 ~~fP~~FlwG~Atsa~QiEG~~~~~Gkg~siwD~~~~~~~~-~~~~~a~d~Y~ry~eDi~L~~~lG~~~yRfSIsWsRI~ 81 (469)
T PRK13511 3 KTLPKDFIFGGATAAYQAEGATKTDGKGPVAWDKYLEENYW-FTPDPASDFYHRYPEDLKLAEEFGVNGIRISIAWSRIF 81 (469)
T ss_pred CCCCCCCEEEeechHhhhcCCcCCCCCccchhhcccccCCC-CCCCcccchhhhhHHHHHHHHHhCCCEEEeeccHhhcC
Confidence 45999999999999999999999999999999999874221 37889999999999999999999999999999999999
Q ss_pred cCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCceEEecc
Q 024821 107 PNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVN 186 (262)
Q Consensus 107 P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~N 186 (262)
|+|+|.+|++||+||+++|++|+++||+|+|||+|||+|+||+++ |||.|+++++.|++||+.||++||| |++|+|||
T Consensus 82 P~G~g~vN~~gl~~Y~~lid~l~~~GI~P~VTL~H~dlP~~L~~~-GGW~n~~~v~~F~~YA~~~~~~fgd-Vk~W~T~N 159 (469)
T PRK13511 82 PDGYGEVNPKGVEYYHRLFAECHKRHVEPFVTLHHFDTPEALHSN-GDWLNRENIDHFVRYAEFCFEEFPE-VKYWTTFN 159 (469)
T ss_pred cCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEEecCCCCcHHHHHc-CCCCCHHHHHHHHHHHHHHHHHhCC-CCEEEEcc
Confidence 998889999999999999999999999999999999999999976 9999999999999999999999999 99999999
Q ss_pred CCchhccccccccCCCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccceEeeeecc
Q 024821 187 EPNGFAMVGYDFGIAPPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKKISFMCFSI 258 (262)
Q Consensus 187 EP~~~~~~gy~~g~~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G~ig~~~~~ 258 (262)
||++++..||..|.+|||++.. .+..+|++||+++|||+||+++|+.. +.|+||++++.
T Consensus 160 EP~~~~~~gy~~G~~~Pg~~~~----------~~~~~~~~hn~llAHa~A~~~~~~~~---~~g~IGi~~~~ 218 (469)
T PRK13511 160 EIGPIGDGQYLVGKFPPGIKYD----------LAKVFQSHHNMMVAHARAVKLFKDKG---YKGEIGVVHAL 218 (469)
T ss_pred chhhhhhcchhhcccCCCCCcc----------HHHHHHHHHHHHHHHHHHHHHHHHhC---CCCeEEEEecC
Confidence 9999999999999999996421 14689999999999999999999975 45778887763
No 7
>TIGR01233 lacG 6-phospho-beta-galactosidase. This enzyme is part of the tagatose-6-phosphate pathway of galactose-6-phosphate degradation.
Probab=100.00 E-value=1.6e-73 Score=551.76 Aligned_cols=216 Identities=38% Similarity=0.692 Sum_probs=198.1
Q ss_pred CCCCCCCeeeeecccccccCCCCCCCCCCcccccccccCCCCCCCCcCCccccccHHHHHHHHHcCCCcceeccccCccc
Q 024821 27 NDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHAGNVPGTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWSRLI 106 (262)
Q Consensus 27 ~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~~~~~~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~ 106 (262)
.+||+||+||+|||||||||+++++|||+|+||.+.+.... .++++||||||||+|||+|||+||+++|||||+||||+
T Consensus 2 ~~fP~~FlwG~AtsA~QvEG~~~~~Gkg~siwD~~~~~~~~-~~~~~a~d~yhry~eDi~L~~~lG~~~yRfSIsWsRI~ 80 (467)
T TIGR01233 2 KTLPKDFIFGGATAAYQAEGATHTDGKGPVAWDKYLEDNYW-YTAEPASDFYHKYPVDLELAEEYGVNGIRISIAWSRIF 80 (467)
T ss_pred CCCCCCCEEeeechhhhcCCCcCCCCCcCchhhccccCCCC-CCCCccCchhhhHHHHHHHHHHcCCCEEEEecchhhcc
Confidence 35999999999999999999999999999999998764221 36788999999999999999999999999999999999
Q ss_pred cCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCceEEecc
Q 024821 107 PNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVN 186 (262)
Q Consensus 107 P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~N 186 (262)
|+|+|++|++|++||+++|++|+++||+|+|||+|||+|+||+++ |||.|+++++.|++||+.||++||| |++|+|||
T Consensus 81 P~g~~~~N~~gl~~Y~~lid~l~~~GI~P~VTL~H~dlP~~L~~~-GGW~n~~~v~~F~~YA~~~f~~fgd-Vk~WiT~N 158 (467)
T TIGR01233 81 PTGYGEVNEKGVEFYHKLFAECHKRHVEPFVTLHHFDTPEALHSN-GDFLNRENIEHFIDYAAFCFEEFPE-VNYWTTFN 158 (467)
T ss_pred CCCCCCcCHHHHHHHHHHHHHHHHcCCEEEEeccCCCCcHHHHHc-CCCCCHHHHHHHHHHHHHHHHHhCC-CCEEEEec
Confidence 998889999999999999999999999999999999999999976 9999999999999999999999998 99999999
Q ss_pred CCchhccccccccCCCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccceEeeeecc
Q 024821 187 EPNGFAMVGYDFGIAPPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKKISFMCFSI 258 (262)
Q Consensus 187 EP~~~~~~gy~~g~~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G~ig~~~~~ 258 (262)
|||+++..||..|.+|||.+.. .++.+|++||+++|||+||+++|++.+ .|+||++++.
T Consensus 159 EP~~~~~~gy~~G~~~Pg~~~~----------~~~~~~a~hn~l~AHa~A~~~~~~~~~---~~~IGi~~~~ 217 (467)
T TIGR01233 159 EIGPIGDGQYLVGKFPPGIKYD----------LAKVFQSHHNMMVSHARAVKLYKDKGY---KGEIGVVHAL 217 (467)
T ss_pred chhhhhhccchhcccCCCccch----------hHHHHHHHHHHHHHHHHHHHHHHHhCC---CCeEEEEecC
Confidence 9999999999999999995321 136899999999999999999999754 4677777654
No 8
>PRK09593 arb 6-phospho-beta-glucosidase; Reviewed
Probab=100.00 E-value=4.7e-73 Score=549.73 Aligned_cols=219 Identities=35% Similarity=0.584 Sum_probs=195.8
Q ss_pred cCCCCCCCeeeeecccccccCCCCCCCCCCccccccccc-CCC---C-------------CCCCcCCccccccHHHHHHH
Q 024821 26 KNDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHA-GNV---P-------------GTGDVACDEYHKYKEDVKLM 88 (262)
Q Consensus 26 ~~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~-~~~---~-------------~~~~~a~d~y~~~~eDi~l~ 88 (262)
..+||++|+||+||||||+||++++||||+|+||++++. +++ . .++++||||||||+|||+||
T Consensus 3 ~~~fP~~FlwG~AtsA~QiEGa~~~~Gkg~siwD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~Yhry~eDi~Lm 82 (478)
T PRK09593 3 KMPFPKGFLWGGATAANQCEGAYNVDGRGLANVDVVPIGEDRFPIITGEKKMFDFEEGYFYPAKEAIDMYHHYKEDIALF 82 (478)
T ss_pred cccCCCCCEEeeechHHHhCCCcCCCCCccchhhccccCcCcccccccccccccccccccCCCCcccchHHhhHHHHHHH
Confidence 457999999999999999999999999999999998873 222 0 14688999999999999999
Q ss_pred HHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHH
Q 024821 89 ADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAY 167 (262)
Q Consensus 89 k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~y 167 (262)
|+||+++|||||+||||+|+| .+.+|++||+||+++||+|+++||+|+|||+|||+|+||+++||||.|+++++.|++|
T Consensus 83 ~~lG~~aYRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lId~L~~~GI~P~VTL~H~dlP~~L~~~~GGW~n~~~v~~F~~Y 162 (478)
T PRK09593 83 AEMGFKTYRMSIAWTRIFPKGDELEPNEAGLQFYEDIFKECHKYGIEPLVTITHFDCPMHLIEEYGGWRNRKMVGFYERL 162 (478)
T ss_pred HHcCCCEEEEecchhhcccCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEecccCCCHHHHhhcCCCCChHHHHHHHHH
Confidence 999999999999999999998 4579999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhCCCCceEEeccCCchhcccccc-ccC-CCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcC
Q 024821 168 ADVCFREFGDRVSYWTTVNEPNGFAMVGYD-FGI-APPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQ 245 (262)
Q Consensus 168 a~~v~~~~gd~V~~w~t~NEP~~~~~~gy~-~g~-~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~ 245 (262)
|+.|+++|||+|++|+|||||++++..||. .|. +|||.. +.+++++++||+++|||+|++++|+..+
T Consensus 163 A~~~~~~fgdrVk~WiT~NEP~~~~~~~~~~~g~~~~~g~~-----------~~~~~~~a~h~~llAHa~A~~~~~~~~~ 231 (478)
T PRK09593 163 CRTLFTRYKGLVKYWLTFNEINMILHAPFMGAGLYFEEGEN-----------KEQVKYQAAHHELVASAIATKIAHEVDP 231 (478)
T ss_pred HHHHHHHhcCcCCEEEeecchhhhhcccccccCcccCCCCc-----------hhhhHHHHHHHHHHHHHHHHHHHHHhCC
Confidence 999999999999999999999999888876 443 356532 2256899999999999999999998654
Q ss_pred CCccceEeeeecc
Q 024821 246 VISKKISFMCFSI 258 (262)
Q Consensus 246 ~~~~G~ig~~~~~ 258 (262)
.|+||++++.
T Consensus 232 ---~g~VGi~~~~ 241 (478)
T PRK09593 232 ---ENKVGCMLAA 241 (478)
T ss_pred ---CCeEEEEEeC
Confidence 4678887764
No 9
>PRK09589 celA 6-phospho-beta-glucosidase; Reviewed
Probab=100.00 E-value=8.4e-73 Score=547.76 Aligned_cols=222 Identities=36% Similarity=0.637 Sum_probs=191.0
Q ss_pred CCCCCCeeeeecccccccCCCCCCCCCCccccccc---cc--CCCC--------CCCCcCCccccccHHHHHHHHHcCCC
Q 024821 28 DFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFA---HA--GNVP--------GTGDVACDEYHKYKEDVKLMADTGLD 94 (262)
Q Consensus 28 ~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~---~~--~~~~--------~~~~~a~d~y~~~~eDi~l~k~lG~~ 94 (262)
+||++|+||+||||||+||++++||||+|+||.++ +. +++. .++++||||||||+|||+|||+||+|
T Consensus 3 ~fP~~FlwG~AtsA~QiEGa~~~~gkg~siwD~~~~~~~~~~~~~~~~~~~~~~~~~~~a~D~Yhry~eDi~Lm~~lG~~ 82 (476)
T PRK09589 3 GFKKGFLWGGAVAAHQLEGGWNEGGKGISVADVMTAGAHGVPREITEGVIEGKNYPNHEAIDFYHRYKEDIALFAEMGFK 82 (476)
T ss_pred CCCCCCEEeeechHhhhcCCcCCCCCCCchhcccccccccCccccccCccCCCcCCCcccccHHHhhHHHHHHHHHcCCC
Confidence 59999999999999999999999999999999988 31 2221 14688999999999999999999999
Q ss_pred cceeccccCccccCCC-CCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHH
Q 024821 95 AYRFSISWSRLIPNGR-GPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFR 173 (262)
Q Consensus 95 ~~R~si~WsRI~P~g~-g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~ 173 (262)
+|||||+||||+|+|. +.+|++||+||+++|++|+++||+|+|||+|||+|+||+++||||.|+++++.|++||+.||+
T Consensus 83 ~yRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lid~L~~~GI~P~VTL~H~dlP~~L~~~yGGW~n~~~i~~F~~YA~~~f~ 162 (476)
T PRK09589 83 CFRTSIAWTRIFPQGDELEPNEEGLQFYDDLFDECLKQGIEPVVTLSHFEMPYHLVTEYGGWRNRKLIDFFVRFAEVVFT 162 (476)
T ss_pred EEEeccchhhcCcCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEecCCCCCHHHHHhcCCcCChHHHHHHHHHHHHHHH
Confidence 9999999999999984 568999999999999999999999999999999999999999999999999999999999999
Q ss_pred HhCCCCceEEeccCCchhccc-----ccc-ccC-CCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCC
Q 024821 174 EFGDRVSYWTTVNEPNGFAMV-----GYD-FGI-APPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQV 246 (262)
Q Consensus 174 ~~gd~V~~w~t~NEP~~~~~~-----gy~-~g~-~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~ 246 (262)
+|||+|++|+|||||++++.. ||. .|. +|||.. ..+..+|++||+++|||+|++++|++.|+
T Consensus 163 ~fgdrVk~WiT~NEp~~~~~~~~~~~~~~~~g~~~~pg~~-----------~~~~~~~~~h~~llAha~A~~~~~~~~~~ 231 (476)
T PRK09589 163 RYKDKVKYWMTFNEINNQANFSEDFAPFTNSGILYSPGED-----------REQIMYQAAHYELVASALAVKTGHEINPD 231 (476)
T ss_pred HhcCCCCEEEEecchhhhhccccccCCccccccccCCCCc-----------hhHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 999999999999999998766 343 333 255421 12457999999999999999999998765
Q ss_pred Cccc-eEeeeeccCC
Q 024821 247 ISKK-ISFMCFSIPY 260 (262)
Q Consensus 247 ~~~G-~ig~~~~~p~ 260 (262)
.++| ++.....+|+
T Consensus 232 ~~iG~~~~~~~~~P~ 246 (476)
T PRK09589 232 FQIGCMIAMCPIYPL 246 (476)
T ss_pred CcEEEEEeCCeeeeC
Confidence 4433 2233334564
No 10
>PRK15014 6-phospho-beta-glucosidase BglA; Provisional
Probab=100.00 E-value=5e-72 Score=542.26 Aligned_cols=220 Identities=34% Similarity=0.676 Sum_probs=194.7
Q ss_pred ccCCCCCCCeeeeecccccccCCCCCCCCCCccccccc---cc--CCC-----C---CCCCcCCccccccHHHHHHHHHc
Q 024821 25 TKNDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFA---HA--GNV-----P---GTGDVACDEYHKYKEDVKLMADT 91 (262)
Q Consensus 25 ~~~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~---~~--~~~-----~---~~~~~a~d~y~~~~eDi~l~k~l 91 (262)
++.+||++|+||+||||||+||++++||||+|+||+++ +. +++ . .++++||||||||+|||+|||+|
T Consensus 2 ~~~~FP~~FlwG~AtsA~QiEGa~~e~Gkg~siwD~~~~~~~~~~~~~~~~~~~~~~~~~~~A~D~Yhry~EDI~Lm~el 81 (477)
T PRK15014 2 KKLTLPKDFLWGGAVAAHQVEGGWNKGGKGPSICDVLTGGAHGVPREITKEVVPGKYYPNHEAVDFYGHYKEDIKLFAEM 81 (477)
T ss_pred CcCCCCCCCEEeeecHHHHhCCCcCCCCCcccHhhccccccccCccccccccccCCcCCCCcccCcccccHHHHHHHHHc
Confidence 45679999999999999999999999999999999988 31 211 1 15688999999999999999999
Q ss_pred CCCcceeccccCccccCCC-CCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHH
Q 024821 92 GLDAYRFSISWSRLIPNGR-GPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADV 170 (262)
Q Consensus 92 G~~~~R~si~WsRI~P~g~-g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~ 170 (262)
|+|+|||||+||||+|+|. +.+|++|++||+++|++|+++||+|+|||+|||+|+||+++||||.|+++++.|++||+.
T Consensus 82 G~~~yRfSIsWsRI~P~G~~~~~N~~gl~~Y~~lid~l~~~GI~P~vTL~H~dlP~~L~~~yGGW~n~~~~~~F~~Ya~~ 161 (477)
T PRK15014 82 GFKCFRTSIAWTRIFPKGDEAQPNEEGLKFYDDMFDELLKYNIEPVITLSHFEMPLHLVQQYGSWTNRKVVDFFVRFAEV 161 (477)
T ss_pred CCCEEEecccceeeccCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEeeCCCCCHHHHHhcCCCCChHHHHHHHHHHHH
Confidence 9999999999999999984 569999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhCCCCceEEeccCCchh-----ccccccc-cCC-CCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHh
Q 024821 171 CFREFGDRVSYWTTVNEPNGF-----AMVGYDF-GIA-PPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKN 243 (262)
Q Consensus 171 v~~~~gd~V~~w~t~NEP~~~-----~~~gy~~-g~~-~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~ 243 (262)
||++|||+|++|+|||||+++ +..||.. |.+ ||+.. +.++.+|++||+++|||+|++++|+.
T Consensus 162 ~f~~fgdrVk~WiT~NEp~~~~~~~~~~~gy~~~g~~~~~~~~-----------~~~~~~~~~h~~llAHa~A~~~~~~~ 230 (477)
T PRK15014 162 VFERYKHKVKYWMTFNEINNQRNWRAPLFGYCCSGVVYTEHEN-----------PEETMYQVLHHQFVASALAVKAARRI 230 (477)
T ss_pred HHHHhcCcCCEEEEecCcccccccccccccccccccccCCCCc-----------hhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 999999999999999999987 6678874 654 44321 12458999999999999999999998
Q ss_pred cCCCccceEeeeecc
Q 024821 244 YQVISKKISFMCFSI 258 (262)
Q Consensus 244 ~~~~~~G~ig~~~~~ 258 (262)
.++ |+||++++.
T Consensus 231 ~~~---~~IGi~~~~ 242 (477)
T PRK15014 231 NPE---MKVGCMLAM 242 (477)
T ss_pred CCC---CeEEEEEeC
Confidence 754 677776663
No 11
>PF00232 Glyco_hydro_1: Glycosyl hydrolase family 1; InterPro: IPR001360 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 1 GH1 from CAZY comprises enzymes with a number of known activities; beta-glucosidase (3.2.1.21 from EC); beta-galactosidase (3.2.1.23 from EC); 6-phospho-beta-galactosidase (3.2.1.85 from EC); 6-phospho-beta-glucosidase (3.2.1.86 from EC); lactase-phlorizin hydrolase (3.2.1.62 from EC), (3.2.1.108 from EC); beta-mannosidase (3.2.1.25 from EC); myrosinase (3.2.1.147 from EC). ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1QVB_A 3AHY_D 2E9L_A 2ZOX_A 2JFE_X 2E9M_A 3FIZ_A 3FIY_A 3CMJ_A 3FJ0_A ....
Probab=100.00 E-value=5e-72 Score=540.11 Aligned_cols=216 Identities=56% Similarity=1.005 Sum_probs=190.2
Q ss_pred CCCCCCCeeeeecccccccCCCCCCCCCCccccccccc-CCCC--CCCCcCCccccccHHHHHHHHHcCCCcceeccccC
Q 024821 27 NDFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHA-GNVP--GTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWS 103 (262)
Q Consensus 27 ~~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~-~~~~--~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~Ws 103 (262)
.+||++|+||+|||||||||++++||||+|+||.|++. +++. .+++.||||||||+|||+|||+||+++|||||+|+
T Consensus 3 ~~fp~~F~wG~atsa~Q~EG~~~~dGkg~s~wd~~~~~~~~~~~~~~~~~a~d~y~~y~eDi~l~~~lg~~~yRfsi~W~ 82 (455)
T PF00232_consen 3 KKFPEDFLWGVATSAYQIEGAWNEDGKGPSIWDTFCHEPGKVEDGSTGDVACDHYHRYKEDIALMKELGVNAYRFSISWS 82 (455)
T ss_dssp GGS-TT-EEEEE--HHHHSSSTTSTTSTTBHHHHHHHSTTSSTTSSSSSSTTGHHHHHHHHHHHHHHHT-SEEEEE--HH
T ss_pred CCCCCCCeEEEeceeccccceecCCCCCcccccccccccceeeccccCcccccchhhhhHHHHHHHhhccceeeeecchh
Confidence 46999999999999999999999999999999999887 4332 56889999999999999999999999999999999
Q ss_pred ccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCceE
Q 024821 104 RLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVSYW 182 (262)
Q Consensus 104 RI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w 182 (262)
||+|+| +|.+|++++++|+++|++|+++||+|+|||+|||+|+||++ +|||.|+++++.|++||+.|+++|||+|++|
T Consensus 83 Ri~P~g~~g~~n~~~~~~Y~~~i~~l~~~gi~P~vtL~H~~~P~~l~~-~ggw~~~~~~~~F~~Ya~~~~~~~gd~V~~w 161 (455)
T PF00232_consen 83 RIFPDGFEGKVNEEGLDFYRDLIDELLENGIEPIVTLYHFDLPLWLED-YGGWLNRETVDWFARYAEFVFERFGDRVKYW 161 (455)
T ss_dssp HHSTTSSSSSS-HHHHHHHHHHHHHHHHTT-EEEEEEESS--BHHHHH-HTGGGSTHHHHHHHHHHHHHHHHHTTTBSEE
T ss_pred heeecccccccCHhHhhhhHHHHHHHHhhccceeeeeeecccccceee-cccccCHHHHHHHHHHHHHHHHHhCCCcceE
Confidence 999998 89999999999999999999999999999999999999997 7999999999999999999999999999999
Q ss_pred EeccCCchhccccccccCCCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccceEeeeec
Q 024821 183 TTVNEPNGFAMVGYDFGIAPPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKKISFMCFS 257 (262)
Q Consensus 183 ~t~NEP~~~~~~gy~~g~~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G~ig~~~~ 257 (262)
+|||||++.+..||+.|.+|||..+ .++.++++||+++|||+|++++|+++++ |+||++++
T Consensus 162 ~T~NEp~~~~~~~y~~g~~~p~~~~-----------~~~~~~~~h~~l~AHa~A~~~~~~~~~~---~~IGi~~~ 222 (455)
T PF00232_consen 162 ITFNEPNVFALLGYLYGGFPPGRDS-----------LKAFYQAAHNLLLAHAKAVKAIKEKYPD---GKIGIALN 222 (455)
T ss_dssp EEEETHHHHHHHHHTSSSSTTCSST-----------HHHHHHHHHHHHHHHHHHHHHHHHHTCT---SEEEEEEE
T ss_pred Eeccccceeeccccccccccccccc-----------cchhhHHHhhHHHHHHHHHHHHhhcccc---eEEecccc
Confidence 9999999999999999999999443 3678999999999999999999999865 45555544
No 12
>PRK09852 cryptic 6-phospho-beta-glucosidase; Provisional
Probab=100.00 E-value=2.1e-70 Score=530.40 Aligned_cols=217 Identities=36% Similarity=0.612 Sum_probs=195.6
Q ss_pred CCCCCCeeeeecccccccCCCCCCCCCCccccccccc-CCC-------------C---CCCCcCCccccccHHHHHHHHH
Q 024821 28 DFPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHA-GNV-------------P---GTGDVACDEYHKYKEDVKLMAD 90 (262)
Q Consensus 28 ~fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~-~~~-------------~---~~~~~a~d~y~~~~eDi~l~k~ 90 (262)
+||++|+||+||||||+||++++||||+|+||++++. +++ . .++++||||||||+|||+||++
T Consensus 3 ~FP~~FlwG~AtsA~QiEGa~~~~Gkg~siwD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~D~Yhry~eDi~l~~~ 82 (474)
T PRK09852 3 VFPEGFLWGGALAANQSEGAFREGGKGLTTVDMIPHGEHRMAVKLGLEKRFQLRDDEFYPSHEAIDFYHRYKEDIALMAE 82 (474)
T ss_pred CCCCCCEEeccchHhhcCCCcCCCCCCCchhhccccCCCcccccccccccccccccCcCCCCccCchhhhhHHHHHHHHH
Confidence 5999999999999999999999999999999998873 222 0 1467899999999999999999
Q ss_pred cCCCcceeccccCccccCCC-CCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHH
Q 024821 91 TGLDAYRFSISWSRLIPNGR-GPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYAD 169 (262)
Q Consensus 91 lG~~~~R~si~WsRI~P~g~-g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~ 169 (262)
||+|+|||||+||||+|+|. +.+|++++++|+++|++|+++||+|+|||+|||+|+||+++||||.|+++++.|++||+
T Consensus 83 lG~~~yR~si~WsRi~P~g~~~~~n~~~~~~Y~~~i~~l~~~gi~p~VtL~H~~~P~~l~~~~GGW~~~~~~~~F~~ya~ 162 (474)
T PRK09852 83 MGFKVFRTSIAWSRLFPQGDELTPNQQGIAFYRSVFEECKKYGIEPLVTLCHFDVPMHLVTEYGSWRNRKMVEFFSRYAR 162 (474)
T ss_pred cCCCeEEeeceeeeeeeCCCCCCCCHHHHHHHHHHHHHHHHcCCEEEEEeeCCCCCHHHHHhcCCCCCHHHHHHHHHHHH
Confidence 99999999999999999984 56899999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhCCCCceEEeccCCchhcccccc-ccC-CCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCC
Q 024821 170 VCFREFGDRVSYWTTVNEPNGFAMVGYD-FGI-APPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVI 247 (262)
Q Consensus 170 ~v~~~~gd~V~~w~t~NEP~~~~~~gy~-~g~-~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~ 247 (262)
.|+++|||+|++|+||||||+++..||. .|. +||+... .+..+|++||+++|||+|++++|+..++
T Consensus 163 ~~~~~fgd~Vk~WiTfNEPn~~~~~gy~~~g~~~~p~~~~-----------~~~~~~~~hn~llAHa~A~~~~~~~~~~- 230 (474)
T PRK09852 163 TCFEAFDGLVKYWLTFNEINIMLHSPFSGAGLVFEEGENQ-----------DQVKYQAAHHELVASALATKIAHEVNPQ- 230 (474)
T ss_pred HHHHHhcCcCCeEEeecchhhhhccCccccCcccCCCCCc-----------hHhHHHHHHHHHHHHHHHHHHHHHhCCC-
Confidence 9999999999999999999999999996 564 4776421 2457999999999999999999998654
Q ss_pred ccceEeeeecc
Q 024821 248 SKKISFMCFSI 258 (262)
Q Consensus 248 ~~G~ig~~~~~ 258 (262)
|+||++++.
T Consensus 231 --~~IGi~~~~ 239 (474)
T PRK09852 231 --NQVGCMLAG 239 (474)
T ss_pred --CeEEEEEeC
Confidence 677777664
No 13
>TIGR03356 BGL beta-galactosidase.
Probab=100.00 E-value=1.9e-68 Score=511.65 Aligned_cols=214 Identities=52% Similarity=0.947 Sum_probs=197.3
Q ss_pred CCCCCeeeeecccccccCCCCCCCCCCccccccccc-CCCC--CCCCcCCccccccHHHHHHHHHcCCCcceeccccCcc
Q 024821 29 FPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHA-GNVP--GTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWSRL 105 (262)
Q Consensus 29 fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~-~~~~--~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI 105 (262)
||++|+||+||||+|+||+++++|||+|+||.+++. +++. .++++||||||+|+|||++||+||+++|||||+||||
T Consensus 1 fp~~FlwG~atsa~Q~EG~~~~~gkg~s~wd~~~~~~~~~~~~~~~~~a~d~y~~y~eDi~l~~~~G~~~~R~si~Wsri 80 (427)
T TIGR03356 1 FPKDFLWGVATASYQIEGAVNEDGRGPSIWDTFSHTPGKVKDGDTGDVACDHYHRYEEDVALMKELGVDAYRFSIAWPRI 80 (427)
T ss_pred CCCCCEEeeechHHhhCCCcCCCCCccchhheeccCCCcccCCCCCCccccHHHhHHHHHHHHHHcCCCeEEcccchhhc
Confidence 899999999999999999999999999999998874 3332 3678899999999999999999999999999999999
Q ss_pred ccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCceEEec
Q 024821 106 IPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVSYWTTV 185 (262)
Q Consensus 106 ~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~ 185 (262)
+|+|+|++|++++++|+++|++|+++||+|+|||+|||+|+||+++ |||.|+++++.|++||+.|+++|||+|++|+||
T Consensus 81 ~p~g~~~~n~~~~~~y~~~i~~l~~~gi~pivtL~Hfd~P~~l~~~-gGw~~~~~~~~f~~ya~~~~~~~~d~v~~w~t~ 159 (427)
T TIGR03356 81 FPEGTGPVNPKGLDFYDRLVDELLEAGIEPFVTLYHWDLPQALEDR-GGWLNRDTAEWFAEYAAVVAERLGDRVKHWITL 159 (427)
T ss_pred ccCCCCCcCHHHHHHHHHHHHHHHHcCCeeEEeeccCCccHHHHhc-CCCCChHHHHHHHHHHHHHHHHhCCcCCEEEEe
Confidence 9998889999999999999999999999999999999999999988 999999999999999999999999999999999
Q ss_pred cCCchhccccccccCCCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCCccceEeeeec
Q 024821 186 NEPNGFAMVGYDFGIAPPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVISKKISFMCFS 257 (262)
Q Consensus 186 NEP~~~~~~gy~~g~~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~~~G~ig~~~~ 257 (262)
|||++.+..||..|.+||+.++ .+..+|++||+++|||+|++++|++.|+ |+||++++
T Consensus 160 NEp~~~~~~~y~~G~~~P~~~~-----------~~~~~~~~hnll~Aha~A~~~~~~~~~~---~~IGi~~~ 217 (427)
T TIGR03356 160 NEPWCSAFLGYGLGVHAPGLRD-----------LRAALQAAHHLLLAHGLAVQALRANGPG---AQVGIVLN 217 (427)
T ss_pred cCcceecccchhhccCCCCCcc-----------HHHHHHHHHHHHHHHHHHHHHHHHhCCC---CeEEEEEe
Confidence 9999999999999988998432 1357999999999999999999998765 56666654
No 14
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 5 GH5 from CAZY comprises enzymes with several known activities; endoglucanase (3.2.1.4 from EC); beta-mannanase (3.2.1.78 from EC); exo-1,3-glucanase (3.2.1.58 from EC); endo-1,6-glucanase (3.2.1.75 from EC); xylanase (3.2.1.8 from EC); endoglycoceramidase (3.2.1.123 from EC). The microbial degradation of cellulose and xylans requires several types of enzymes. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family A [] or as the glycosyl hydrolases family 5 []. One of the conserved regions in this family contains a conserved glutamic acid residue which is potentially involved [] in the catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3NDY_A 3NDZ_B 1LF1_A 1TVP_B 1TVN_A 3AYR_A 3AYS_A 1QI0_A 1W3K_A 1OCQ_A ....
Probab=99.42 E-value=5.4e-13 Score=118.88 Aligned_cols=109 Identities=22% Similarity=0.367 Sum_probs=90.3
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccc-cCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCC-CC
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLI-PNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGW-IN 157 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~-P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw-~~ 157 (262)
..++|++.++++|+|++|+.+.|..++ |...+.++.+.++.++++|+.+.++||.+++++|+. |.|.... ++. ..
T Consensus 22 ~~~~~~~~~~~~G~n~VRi~v~~~~~~~~~~~~~~~~~~~~~ld~~v~~a~~~gi~vild~h~~--~~w~~~~-~~~~~~ 98 (281)
T PF00150_consen 22 ITEADFDQLKALGFNTVRIPVGWEAYQEPNPGYNYDETYLARLDRIVDAAQAYGIYVILDLHNA--PGWANGG-DGYGNN 98 (281)
T ss_dssp SHHHHHHHHHHTTESEEEEEEESTSTSTTSTTTSBTHHHHHHHHHHHHHHHHTT-EEEEEEEES--TTCSSST-STTTTH
T ss_pred CHHHHHHHHHHCCCCEEEeCCCHHHhcCCCCCccccHHHHHHHHHHHHHHHhCCCeEEEEeccC--ccccccc-cccccc
Confidence 678999999999999999999998888 453356999999999999999999999999999863 7774322 223 33
Q ss_pred hhhHHHHHHHHHHHHHHhCC--CCceEEeccCCchh
Q 024821 158 RMIVKDFTAYADVCFREFGD--RVSYWTTVNEPNGF 191 (262)
Q Consensus 158 ~~~~~~F~~ya~~v~~~~gd--~V~~w~t~NEP~~~ 191 (262)
....+.|.++++.++++|++ .|..|.++|||+..
T Consensus 99 ~~~~~~~~~~~~~la~~y~~~~~v~~~el~NEP~~~ 134 (281)
T PF00150_consen 99 DTAQAWFKSFWRALAKRYKDNPPVVGWELWNEPNGG 134 (281)
T ss_dssp HHHHHHHHHHHHHHHHHHTTTTTTEEEESSSSGCST
T ss_pred hhhHHHHHhhhhhhccccCCCCcEEEEEecCCcccc
Confidence 44678899999999999954 58899999999975
No 15
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=99.42 E-value=2.7e-13 Score=128.03 Aligned_cols=108 Identities=24% Similarity=0.438 Sum_probs=87.5
Q ss_pred cccHHHHHHHHHcCCCccee-ccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhh-----
Q 024821 79 HKYKEDVKLMADTGLDAYRF-SISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEY----- 152 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~-si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~----- 152 (262)
..+++|+++|+++|+|++|+ .++|++|+|+ +|++| +..+|++|+.+.++||++++.+.....|.|+.+++
T Consensus 10 e~~~~d~~~m~~~G~n~vri~~~~W~~lEP~-eG~yd---F~~lD~~l~~a~~~Gi~viL~~~~~~~P~Wl~~~~Pe~~~ 85 (374)
T PF02449_consen 10 EEWEEDLRLMKEAGFNTVRIGEFSWSWLEPE-EGQYD---FSWLDRVLDLAAKHGIKVILGTPTAAPPAWLYDKYPEILP 85 (374)
T ss_dssp CHHHHHHHHHHHHT-SEEEE-CCEHHHH-SB-TTB------HHHHHHHHHHHCTT-EEEEEECTTTS-HHHHCCSGCCC-
T ss_pred HHHHHHHHHHHHcCCCEEEEEEechhhccCC-CCeee---cHHHHHHHHHHHhccCeEEEEecccccccchhhhcccccc
Confidence 46899999999999999996 6799999999 89999 77899999999999999999998889999998653
Q ss_pred ----------CC-----CCChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCch
Q 024821 153 ----------GG-----WINRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPNG 190 (262)
Q Consensus 153 ----------gG-----w~~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~~ 190 (262)
|+ ..+|...+.+.++++.++++|++. |-.|.+.|||..
T Consensus 86 ~~~~g~~~~~g~~~~~~~~~p~yr~~~~~~~~~l~~~y~~~p~vi~~~i~NE~~~ 140 (374)
T PF02449_consen 86 VDADGRRRGFGSRQHYCPNSPAYREYARRFIRALAERYGDHPAVIGWQIDNEPGY 140 (374)
T ss_dssp B-TTTSBEECCCSTT-HCCHHHHHHHHHHHHHHHHHHHTTTTTEEEEEECCSTTC
T ss_pred cCCCCCcCccCCccccchhHHHHHHHHHHHHHHHHhhccccceEEEEEeccccCc
Confidence 11 124567788889999999999985 778999999976
No 16
>smart00633 Glyco_10 Glycosyl hydrolase family 10.
Probab=98.93 E-value=2.7e-09 Score=96.00 Aligned_cols=83 Identities=18% Similarity=0.326 Sum_probs=71.7
Q ss_pred ccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcc--eeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCC
Q 024821 101 SWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQP--HVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDR 178 (262)
Q Consensus 101 ~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p--~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~ 178 (262)
.|++++|+ +|.+| ++..|++++.++++||++ .+.+.|...|.|+... + .++..+.+.+|++.+++||+++
T Consensus 2 kW~~~ep~-~G~~n---~~~~D~~~~~a~~~gi~v~gH~l~W~~~~P~W~~~~-~---~~~~~~~~~~~i~~v~~ry~g~ 73 (254)
T smart00633 2 KWDSTEPS-RGQFN---FSGADAIVNFAKENGIKVRGHTLVWHSQTPDWVFNL-S---KETLLARLENHIKTVVGRYKGK 73 (254)
T ss_pred CcccccCC-CCccC---hHHHHHHHHHHHHCCCEEEEEEEeecccCCHhhhcC-C---HHHHHHHHHHHHHHHHHHhCCc
Confidence 59999999 89999 677899999999999995 4456777899998642 2 5677899999999999999999
Q ss_pred CceEEeccCCchh
Q 024821 179 VSYWTTVNEPNGF 191 (262)
Q Consensus 179 V~~w~t~NEP~~~ 191 (262)
|..|.++|||...
T Consensus 74 i~~wdV~NE~~~~ 86 (254)
T smart00633 74 IYAWDVVNEALHD 86 (254)
T ss_pred ceEEEEeeecccC
Confidence 9999999999863
No 17
>COG1874 LacA Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=98.55 E-value=1.3e-07 Score=95.46 Aligned_cols=113 Identities=21% Similarity=0.382 Sum_probs=88.5
Q ss_pred ccHHHHHHHHHcCCCccee-ccccCccccCCCCCCChhhhhHHHHH-HHHHHHcCCcceeec-cCCCCcHhHHHhh----
Q 024821 80 KYKEDVKLMADTGLDAYRF-SISWSRLIPNGRGPVNPKGLQYYNNL-INELISYGIQPHVTL-HHLDLPQALEDEY---- 152 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~-si~WsRI~P~g~g~~n~~~l~~y~~~-id~l~~~GI~p~vtL-~H~~~P~~l~~~~---- 152 (262)
-+++|++.||++|+|++|+ -++|++++|+ +|++|. .+.|.. ++.+.+.||.+++.- .....|.|+..+|
T Consensus 31 ~w~ddl~~mk~~G~N~V~ig~faW~~~eP~-eG~fdf---~~~D~~~l~~a~~~Gl~vil~t~P~g~~P~Wl~~~~PeiL 106 (673)
T COG1874 31 TWMDDLRKMKALGLNTVRIGYFAWNLHEPE-EGKFDF---TWLDEIFLERAYKAGLYVILRTGPTGAPPAWLAKKYPEIL 106 (673)
T ss_pred HHHHHHHHHHHhCCCeeEeeeEEeeccCcc-ccccCc---ccchHHHHHHHHhcCceEEEecCCCCCCchHHhcCChhhe
Confidence 3788999999999999999 5699999999 899994 477777 999999999999987 6678899988764
Q ss_pred -----------CCCCChhhH-HHHHHHHHH----HHHH-hCCC--CceEEeccCCch-hccccc
Q 024821 153 -----------GGWINRMIV-KDFTAYADV----CFRE-FGDR--VSYWTTVNEPNG-FAMVGY 196 (262)
Q Consensus 153 -----------gGw~~~~~~-~~F~~ya~~----v~~~-~gd~--V~~w~t~NEP~~-~~~~gy 196 (262)
|+|.+-+.. ..|.+|++. +.+| ||+. |--|.+-||-.. .+...|
T Consensus 107 ~~~~~~~~~~~g~r~~~~~~~~~Yr~~~~~i~~~irer~~~~~~~v~~w~~dneY~~~~~~~~~ 170 (673)
T COG1874 107 AVDENGRVRSDGARENICPVSPVYREYLDRILQQIRERLYGNGPAVITWQNDNEYGGHPCYCDY 170 (673)
T ss_pred EecCCCcccCCCcccccccccHHHHHHHHHHHHHHHHHHhccCCceeEEEccCccCCccccccc
Confidence 456443322 246666666 7888 8774 778999999776 444443
No 18
>COG2730 BglC Endoglucanase [Carbohydrate transport and metabolism]
Probab=98.39 E-value=1e-06 Score=84.61 Aligned_cols=109 Identities=21% Similarity=0.257 Sum_probs=81.2
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCC---CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHh--h-CCC
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNG---RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDE--Y-GGW 155 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g---~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~--~-gGw 155 (262)
++|+..||+.|+|++|+.+.|..+.+.+ +...+...+.+.+++|+..++.||.+++++|++.-+.-=.+. + +.+
T Consensus 76 ~~~~~~ik~~G~n~VRiPi~~~~~~~~~~~~p~~~~~~~~~~ld~~I~~a~~~gi~V~iD~H~~~~~~~~~~~s~~~~~~ 155 (407)
T COG2730 76 EEDFDQIKSAGFNAVRIPIGYWALQATDGDNPYLIGLTQLKILDEAINWAKKLGIYVLIDLHGYPGGNNGHEHSGYTSDY 155 (407)
T ss_pred hhHHHHHHHcCCcEEEcccchhhhhccCCCCCCeecchHHHHHHHHHHHHHhcCeeEEEEecccCCCCCCcCcccccccc
Confidence 8999999999999999999866655542 222224445699999999999999999999986522211111 0 112
Q ss_pred C-ChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCch
Q 024821 156 I-NRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPNG 190 (262)
Q Consensus 156 ~-~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~~ 190 (262)
. ..+.++.+.+-++.++.||++. |-...++|||+.
T Consensus 156 ~~~~~~~~~~~~~w~~ia~~f~~~~~VIg~~~~NEP~~ 193 (407)
T COG2730 156 KEENENVEATIDIWKFIANRFKNYDTVIGFELINEPNG 193 (407)
T ss_pred cccchhHHHHHHHHHHHHHhccCCCceeeeeeecCCcc
Confidence 2 3557799999999999999984 555689999995
No 19
>PF01229 Glyco_hydro_39: Glycosyl hydrolases family 39; InterPro: IPR000514 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 39 GH39 from CAZY comprises enzymes with several known activities; alpha-L-iduronidase (3.2.1.76 from EC); beta-xylosidase (3.2.1.37 from EC). The most highly conserved regions in these enzymes are located in their N-terminal sections. These contain a glutamic acid residue which, on the basis of similarities with other families of glycosyl hydrolases [], probably acts as the proton donor in their catalytic mechanism.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BS9_D 2BFG_E 1W91_B 1UHV_D 1PX8_A.
Probab=98.30 E-value=2e-06 Score=84.38 Aligned_cols=107 Identities=24% Similarity=0.433 Sum_probs=65.1
Q ss_pred ccHHHHHHHH-HcCCCcceec--c--ccCcccc-CCCCC--CChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHh
Q 024821 80 KYKEDVKLMA-DTGLDAYRFS--I--SWSRLIP-NGRGP--VNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDE 151 (262)
Q Consensus 80 ~~~eDi~l~k-~lG~~~~R~s--i--~WsRI~P-~g~g~--~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~ 151 (262)
.+++.+..++ ++||+.+||- + +..-..+ ++.|. +| +...|+++|.|+++||+|+|.|.. +|.++...
T Consensus 40 ~~q~~l~~~~~~~gf~yvR~h~l~~ddm~~~~~~~~~~~~~Yn---f~~lD~i~D~l~~~g~~P~vel~f--~p~~~~~~ 114 (486)
T PF01229_consen 40 DWQEQLRELQEELGFRYVRFHGLFSDDMMVYSESDEDGIPPYN---FTYLDQILDFLLENGLKPFVELGF--MPMALASG 114 (486)
T ss_dssp HHHHHHHHHHCCS--SEEEES-TTSTTTT-EEEEETTEEEEE-----HHHHHHHHHHHHCT-EEEEEE-S--B-GGGBSS
T ss_pred HHHHHHHHHHhccCceEEEEEeeccCchhhccccccCCCCcCC---hHHHHHHHHHHHHcCCEEEEEEEe--chhhhcCC
Confidence 3666666665 9999999986 2 2222222 22222 57 788999999999999999999975 78776532
Q ss_pred ------hCCCC-ChhhHHHHHHHHHHHHHHhCC-----CCc--eEEeccCCchh
Q 024821 152 ------YGGWI-NRMIVKDFTAYADVCFREFGD-----RVS--YWTTVNEPNGF 191 (262)
Q Consensus 152 ------~gGw~-~~~~~~~F~~ya~~v~~~~gd-----~V~--~w~t~NEP~~~ 191 (262)
+.|+. .|+..+.|.++++.+++|+-+ .|. +|++||||++.
T Consensus 115 ~~~~~~~~~~~~pp~~~~~W~~lv~~~~~h~~~RYG~~ev~~W~fEiWNEPd~~ 168 (486)
T PF01229_consen 115 YQTVFWYKGNISPPKDYEKWRDLVRAFARHYIDRYGIEEVSTWYFEIWNEPDLK 168 (486)
T ss_dssp --EETTTTEE-S-BS-HHHHHHHHHHHHHHHHHHHHHHHHTTSEEEESS-TTST
T ss_pred CCccccccCCcCCcccHHHHHHHHHHHHHHHHhhcCCccccceeEEeCcCCCcc
Confidence 12222 356678888888777665543 355 56899999974
No 20
>PF00331 Glyco_hydro_10: Glycosyl hydrolase family 10; InterPro: IPR001000 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 10 GH10 from CAZY comprises enzymes with a number of known activities; xylanase (3.2.1.8 from EC); endo-1,3-beta-xylanase (3.2.1.32 from EC); cellobiohydrolase (3.2.1.91 from EC). These enzymes were formerly known as cellulase family F. The microbial degradation of cellulose and xylans requires several types of enzymes such as endoglucanases (3.2.1.4 from EC), cellobiohydrolases (3.2.1.91 from EC) (exoglucanases), or xylanases (3.2.1.8 from EC) [, ]. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family F [] or as the glycosyl hydrolases family 10 []. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1UQZ_A 1UQY_A 1UR2_A 1UR1_A 2CNC_A 1OD8_A 1E0W_A 1E0V_A 1V0M_A 1E0X_B ....
Probab=98.04 E-value=2e-05 Score=73.42 Aligned_cols=125 Identities=17% Similarity=0.240 Sum_probs=87.2
Q ss_pred CCCCCeeeeecccccccCCCCCCCCCCcccccccccCCCCCCCCcCCccccccHHHHHHHHHcCCCcceeccccCccccC
Q 024821 29 FPPGFIFGSGTSAYQVEGAANEDGRTPSIWDTFAHAGNVPGTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPN 108 (262)
Q Consensus 29 fp~~FlwG~Atsa~Q~EG~~~~~gk~~s~wd~~~~~~~~~~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~ 108 (262)
.+.+|.+|+|.++.++++.. .|++-+ .+..+.-+..-..-|..++|.
T Consensus 6 ~~~~f~~G~av~~~~~~~~~-------------------------------~~~~~~--~~~Fn~~t~eN~~Kw~~~e~~ 52 (320)
T PF00331_consen 6 AKHKFPFGAAVNAQQLEDDP-------------------------------RYRELF--AKHFNSVTPENEMKWGSIEPE 52 (320)
T ss_dssp HCTTTEEEEEEBGGGHTHHH-------------------------------HHHHHH--HHH-SEEEESSTTSHHHHESB
T ss_pred HhccCCEEEEechhHcCCcH-------------------------------HHHHHH--HHhCCeeeeccccchhhhcCC
Confidence 45789999999999888730 011111 122222233334789999999
Q ss_pred CCCCCChhhhhHHHHHHHHHHHcCCccee--eccCCCCcHhHHHhhCCCCChh---hHHHHHHHHHHHHHHhC--CCCce
Q 024821 109 GRGPVNPKGLQYYNNLINELISYGIQPHV--TLHHLDLPQALEDEYGGWINRM---IVKDFTAYADVCFREFG--DRVSY 181 (262)
Q Consensus 109 g~g~~n~~~l~~y~~~id~l~~~GI~p~v--tL~H~~~P~~l~~~~gGw~~~~---~~~~F~~ya~~v~~~~g--d~V~~ 181 (262)
+|.+| ++..|++++.++++||++-- -+.|--.|.|+... .-+...+ ..+...+|.+.+++||+ ++|..
T Consensus 53 -~g~~~---~~~~D~~~~~a~~~g~~vrGH~LvW~~~~P~w~~~~-~~~~~~~~~~~~~~l~~~I~~v~~~y~~~g~i~~ 127 (320)
T PF00331_consen 53 -PGRFN---FESADAILDWARENGIKVRGHTLVWHSQTPDWVFNL-ANGSPDEKEELRARLENHIKTVVTRYKDKGRIYA 127 (320)
T ss_dssp -TTBEE----HHHHHHHHHHHHTT-EEEEEEEEESSSS-HHHHTS-TTSSBHHHHHHHHHHHHHHHHHHHHTTTTTTESE
T ss_pred -CCccC---ccchhHHHHHHHhcCcceeeeeEEEcccccceeeec-cCCCcccHHHHHHHHHHHHHHHHhHhccccceEE
Confidence 79999 66789999999999999873 34566789999753 1233333 77889999999999999 48999
Q ss_pred EEeccCCchh
Q 024821 182 WTTVNEPNGF 191 (262)
Q Consensus 182 w~t~NEP~~~ 191 (262)
|-+.|||-.-
T Consensus 128 WDVvNE~i~~ 137 (320)
T PF00331_consen 128 WDVVNEAIDD 137 (320)
T ss_dssp EEEEES-B-T
T ss_pred EEEeeecccC
Confidence 9999999764
No 21
>PF01301 Glyco_hydro_35: Glycosyl hydrolases family 35; InterPro: IPR001944 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 35 GH35 from CAZY comprises enzymes with only one known activity; beta-galactosidase (3.2.1.23 from EC). Mammalian beta-galactosidase is a lysosomal enzyme (gene GLB1) which cleaves the terminal galactose from gangliosides, glycoproteins, and glycosaminoglycans and whose deficiency is the cause of the genetic disease Gm(1) gangliosidosis (Morquio disease type B).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3OGS_A 3OGV_A 3OGR_A 3OG2_A 1TG7_A 1XC6_A 3THC_C 3THD_D 3D3A_A 4E8D_B ....
Probab=97.71 E-value=9.2e-05 Score=69.12 Aligned_cols=109 Identities=12% Similarity=0.145 Sum_probs=75.8
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc--------CCCCcHhHHHh
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH--------HLDLPQALEDE 151 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~--------H~~~P~~l~~~ 151 (262)
.|++-++.||++|+|++-+-+.|.--+|. +|++|.++..-.+++|+.++++|+.+++-.- .-.+|.||..+
T Consensus 25 ~W~~~l~k~ka~G~n~v~~yv~W~~he~~-~g~~df~g~~dl~~f~~~a~~~gl~vilrpGpyi~aE~~~gG~P~Wl~~~ 103 (319)
T PF01301_consen 25 YWRDRLQKMKAAGLNTVSTYVPWNLHEPE-EGQFDFTGNRDLDRFLDLAQENGLYVILRPGPYICAEWDNGGLPAWLLRK 103 (319)
T ss_dssp GHHHHHHHHHHTT-SEEEEE--HHHHSSB-TTB---SGGG-HHHHHHHHHHTT-EEEEEEES---TTBGGGG--GGGGGS
T ss_pred HHHHHHHHHHhCCcceEEEeccccccCCC-CCcccccchhhHHHHHHHHHHcCcEEEecccceecccccchhhhhhhhcc
Confidence 47888999999999999999999999999 8999999999999999999999999776432 12479999866
Q ss_pred hCCC---CChhhHHHHHHHHHHHHHHhCC-------CCceEEeccCCc
Q 024821 152 YGGW---INRMIVKDFTAYADVCFREFGD-------RVSYWTTVNEPN 189 (262)
Q Consensus 152 ~gGw---~~~~~~~~F~~ya~~v~~~~gd-------~V~~w~t~NEP~ 189 (262)
.+.. .++...+.-.+|.+.+++...+ -|-.-.+=||..
T Consensus 104 ~~~~~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~GGpII~vQvENEyg 151 (319)
T PF01301_consen 104 PDIRLRTNDPPFLEAVERWYRALAKIIKPLQYTNGGPIIMVQVENEYG 151 (319)
T ss_dssp TTS-SSSS-HHHHHHHHHHHHHHHHHHGGGBGGGTSSEEEEEESSSGG
T ss_pred ccccccccchhHHHHHHHHHHHHHHHHHhhhhcCCCceehhhhhhhhC
Confidence 3332 2455667777777777777655 255577888866
No 22
>PLN02803 beta-amylase
Probab=97.28 E-value=0.00058 Score=67.18 Aligned_cols=108 Identities=19% Similarity=0.332 Sum_probs=82.3
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc-C-----------CCCc
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH-H-----------LDLP 145 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~-H-----------~~~P 145 (262)
-.-.+..++.+|++||+.+-+.+-|--+|+++++++| +..|+++++.+++.|++..+.|. | .-+|
T Consensus 106 ~~~l~~~L~~LK~~GVdGVmvDVWWGiVE~~~p~~Yd---WsgY~~l~~mvr~~GLKlq~vmSFHqCGGNVGD~~~IpLP 182 (548)
T PLN02803 106 PRAMNASLMALRSAGVEGVMVDAWWGLVEKDGPMKYN---WEGYAELVQMVQKHGLKLQVVMSFHQCGGNVGDSCSIPLP 182 (548)
T ss_pred HHHHHHHHHHHHHcCCCEEEEEeeeeeeccCCCCcCC---cHHHHHHHHHHHHcCCeEEEEEEecccCCCCCCcccccCC
Confidence 3447789999999999999999999999999899999 55699999999999999888774 5 2589
Q ss_pred HhHHHh--------h---CCCCC----------------hhhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 146 QALEDE--------Y---GGWIN----------------RMIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 146 ~~l~~~--------~---gGw~~----------------~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
.|+.+. | .|-.| +.-++.|.+|-+-.-++|.+... -|+.|..+
T Consensus 183 ~WV~e~~~~~pDi~ftDr~G~rn~EyLSlg~D~~pvl~GRTplq~Y~Dfm~SFr~~F~~~l~--~~I~eI~V 252 (548)
T PLN02803 183 PWVLEEMSKNPDLVYTDRSGRRNPEYISLGCDSLPVLRGRTPIQVYSDYMRSFRERFKDYLG--GVIAEIQV 252 (548)
T ss_pred HHHHHhhhcCCCceEecCCCCcccceeccccccchhccCCCHHHHHHHHHHHHHHHHHHHhc--CceEEEEe
Confidence 997752 0 12222 22447788888888777777553 35555443
No 23
>PLN02161 beta-amylase
Probab=97.27 E-value=0.0007 Score=66.31 Aligned_cols=111 Identities=16% Similarity=0.255 Sum_probs=84.9
Q ss_pred CccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc-CC-----------
Q 024821 75 CDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH-HL----------- 142 (262)
Q Consensus 75 ~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~-H~----------- 142 (262)
.......+..++.+|.+||+.+-+.+-|--+|+++++++| +..|+++++.+++.|++..+.|. |-
T Consensus 113 v~~~~al~~~L~~LK~~GVdGVmvDVWWGiVE~~~p~~Yd---WsgY~~l~~mvr~~GLKlq~vmSFHqCGGNvGd~~~I 189 (531)
T PLN02161 113 IKRLKALTVSLKALKLAGVHGIAVEVWWGIVERFSPLEFK---WSLYEELFRLISEAGLKLHVALCFHSNMHLFGGKGGI 189 (531)
T ss_pred cCCHHHHHHHHHHHHHcCCCEEEEEeeeeeeecCCCCcCC---cHHHHHHHHHHHHcCCeEEEEEEecccCCCCCCccCc
Confidence 4556668889999999999999999999999998899999 56699999999999999888774 53
Q ss_pred CCcHhHHHh-----------hCCCCC----------------hhhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 143 DLPQALEDE-----------YGGWIN----------------RMIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 143 ~~P~~l~~~-----------~gGw~~----------------~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
-+|.|+.+. ..|..| +.-++.|.+|-+-..++|.+... -|+.|..+
T Consensus 190 pLP~WV~~~g~~~pDi~ftDr~G~rn~EyLSlg~D~~pvl~GRTplq~Y~Dfm~SFr~~F~~~~~--~~I~eI~V 262 (531)
T PLN02161 190 SLPLWIREIGDVNKDIYYRDKNGFSNNDYLTLGVDQLPLFGGRTAVQCYEDFMLSFSTKFEPYIG--NVIEEISI 262 (531)
T ss_pred cCCHHHHhhhccCCCceEEcCCCCcccceeeeecccchhcCCCCHHHHHHHHHHHHHHHHHHHhc--CceEEEEe
Confidence 489997752 022222 22447888888888888877553 35555443
No 24
>PF13204 DUF4038: Protein of unknown function (DUF4038); PDB: 3KZS_D.
Probab=97.22 E-value=0.0025 Score=58.66 Aligned_cols=99 Identities=15% Similarity=0.305 Sum_probs=61.8
Q ss_pred HHHHHHcCCCcceecc--ccCcc--------cc--CCC------CCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcH
Q 024821 85 VKLMADTGLDAYRFSI--SWSRL--------IP--NGR------GPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQ 146 (262)
Q Consensus 85 i~l~k~lG~~~~R~si--~WsRI--------~P--~g~------g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~ 146 (262)
++..|+-|||.+|+.+ .|.+. .| ..+ ..+|++.+++.+++|+.|.++||+|.+.+.| +.|.
T Consensus 36 L~~r~~qgFN~iq~~~l~~~~~~~~~n~~~~~~~~~~~~~~~d~~~~N~~YF~~~d~~i~~a~~~Gi~~~lv~~w-g~~~ 114 (289)
T PF13204_consen 36 LDTRKEQGFNVIQMNVLPQWDGYNTPNRYGFAPFPDEDPGQFDFTRPNPAYFDHLDRRIEKANELGIEAALVPFW-GCPY 114 (289)
T ss_dssp HHHHHHTT--EEEEES-SSSS-B----TTS-BS-SSTT------TT----HHHHHHHHHHHHHHTT-EEEEESS--HHHH
T ss_pred HHHHHHCCCCEEEEEeCCCcccccccccCCCcCCCCCCccccCCCCCCHHHHHHHHHHHHHHHHCCCeEEEEEEE-CCcc
Confidence 6777999999999998 44433 11 101 1379999999999999999999999876664 2221
Q ss_pred hHHHhhCCCCC---hhhHHHHHHHHHHHHHHhCCC-CceEEeccCC
Q 024821 147 ALEDEYGGWIN---RMIVKDFTAYADVCFREFGDR-VSYWTTVNEP 188 (262)
Q Consensus 147 ~l~~~~gGw~~---~~~~~~F~~ya~~v~~~~gd~-V~~w~t~NEP 188 (262)
.+ +.|.. .-..+.-.+|.+.|++||+.. =..|++=||-
T Consensus 115 ---~~-~~Wg~~~~~m~~e~~~~Y~~yv~~Ry~~~~NviW~l~gd~ 156 (289)
T PF13204_consen 115 ---VP-GTWGFGPNIMPPENAERYGRYVVARYGAYPNVIWILGGDY 156 (289)
T ss_dssp ---H--------TTSS-HHHHHHHHHHHHHHHTT-SSEEEEEESSS
T ss_pred ---cc-ccccccccCCCHHHHHHHHHHHHHHHhcCCCCEEEecCcc
Confidence 11 44532 224677888999999999998 4779999997
No 25
>PF07745 Glyco_hydro_53: Glycosyl hydrolase family 53; InterPro: IPR011683 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This domain is found in family 53 of the glycosyl hydrolase classification []. These enzymes are endo-1,4- beta-galactanases (3.2.1.89 from EC). The structure of this domain is known [] and has a TIM barrel fold.; GO: 0015926 glucosidase activity; PDB: 1HJQ_A 1HJS_A 1HJU_B 1FHL_A 1FOB_A 2GFT_A 1UR4_B 1UR0_A 1R8L_B 2CCR_A ....
Probab=97.21 E-value=0.0051 Score=57.87 Aligned_cols=103 Identities=20% Similarity=0.326 Sum_probs=69.6
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC---CcHhHHHhhCCCCC-
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD---LPQALEDEYGGWIN- 157 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~---~P~~l~~~~gGw~~- 157 (262)
++=+++||+.|+|++|+-+ | +.|...|..| ++.-.++..+.+++||+.++++|--| =|.-- ..-..|.+
T Consensus 27 ~d~~~ilk~~G~N~vRlRv-w--v~P~~~g~~~---~~~~~~~akrak~~Gm~vlldfHYSD~WaDPg~Q-~~P~aW~~~ 99 (332)
T PF07745_consen 27 KDLFQILKDHGVNAVRLRV-W--VNPYDGGYND---LEDVIALAKRAKAAGMKVLLDFHYSDFWADPGKQ-NKPAAWANL 99 (332)
T ss_dssp --HHHHHHHTT--EEEEEE----SS-TTTTTTS---HHHHHHHHHHHHHTT-EEEEEE-SSSS--BTTB--B--TTCTSS
T ss_pred CCHHHHHHhcCCCeEEEEe-c--cCCcccccCC---HHHHHHHHHHHHHCCCeEEEeecccCCCCCCCCC-CCCccCCCC
Confidence 4457999999999999987 4 4555226666 67788999999999999999997222 23211 11146876
Q ss_pred --hhhHHHHHHHHHHHHHHhCC---CCceEEeccCCchh
Q 024821 158 --RMIVKDFTAYADVCFREFGD---RVSYWTTVNEPNGF 191 (262)
Q Consensus 158 --~~~~~~F~~ya~~v~~~~gd---~V~~w~t~NEP~~~ 191 (262)
.+..+.-.+|.+.+.+.+++ .++.+.+=||.+.-
T Consensus 100 ~~~~l~~~v~~yT~~vl~~l~~~G~~pd~VQVGNEin~G 138 (332)
T PF07745_consen 100 SFDQLAKAVYDYTKDVLQALKAAGVTPDMVQVGNEINNG 138 (332)
T ss_dssp SHHHHHHHHHHHHHHHHHHHHHTT--ESEEEESSSGGGE
T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCccEEEeCcccccc
Confidence 56778888999988888765 57888999998853
No 26
>PF01373 Glyco_hydro_14: Glycosyl hydrolase family 14; InterPro: IPR001554 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 14 GH14 from CAZY comprises enzymes with only one known activity; beta-amylase (3.2.1.2 from EC). A Glu residue has been proposed as a catalytic residue, but it is not known if it is the nucleophile or the proton donor. Beta-amylase [, ] is an enzyme that hydrolyses 1,4-alpha-glucosidic linkages in starch-type polysaccharide substrates so as to remove successive maltose units from the non-reducing ends of the chains. Beta-amylase is present in certain bacteria as well as in plants. Three highly conserved sequence regions are found in all known beta-amylases. The first of these regions is located in the N-terminal section of the enzymes and contains an aspartate which is known [] to be involved in the catalytic mechanism. The second, located in a more central location, is centred around a glutamate which is also involved [] in the catalytic mechanism. The 3D structure of a complex of soybean beta-amylase with an inhibitor (alpha-cyclodextrin) has been determined to 3.0A resolution by X-ray diffraction []. The enzyme folds into large and small domains: the large domain has a (beta alpha)8 super-secondary structural core, while the smaller is formed from two long loops extending from the beta-3 and beta-4 strands of the (beta alpha)8 fold []. The interface of the two domains, together with shorter loops from the (beta alpha)8 core, form a deep cleft, in which the inhibitor binds []. Two maltose molecules also bind in the cleft, one sharing a binding site with alpha-cyclodextrin, and the other sitting more deeply in the cleft [].; GO: 0016161 beta-amylase activity, 0000272 polysaccharide catabolic process; PDB: 1FA2_A 2DQX_A 1WDP_A 1UKP_C 1BYC_A 1BYA_A 1Q6C_A 1V3I_A 1BTC_A 1BYB_A ....
Probab=97.19 E-value=0.00059 Score=65.36 Aligned_cols=106 Identities=15% Similarity=0.343 Sum_probs=78.5
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec-cC-----------CCCc
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL-HH-----------LDLP 145 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL-~H-----------~~~P 145 (262)
+.-.+.+++.+|++|++.+-+.+-|.-+++.+++++| +..|+++++.+++.|++..+.| +| .-+|
T Consensus 15 ~~~~~~~L~~LK~~GV~GVmvdvWWGiVE~~~p~~yd---Ws~Y~~l~~~vr~~GLk~~~vmsfH~cGgNvgD~~~IpLP 91 (402)
T PF01373_consen 15 WNALEAQLRALKSAGVDGVMVDVWWGIVEGEGPQQYD---WSGYRELFEMVRDAGLKLQVVMSFHQCGGNVGDDCNIPLP 91 (402)
T ss_dssp CHHHHHHHHHHHHTTEEEEEEEEEHHHHTGSSTTB------HHHHHHHHHHHHTT-EEEEEEE-S-BSSSTTSSSEB-S-
T ss_pred HHHHHHHHHHHHHcCCcEEEEEeEeeeeccCCCCccC---cHHHHHHHHHHHHcCCeEEEEEeeecCCCCCCCccCCcCC
Confidence 4478899999999999999999999999999899999 5669999999999999988876 24 3579
Q ss_pred HhHHHh-----------hCC--------CCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCc
Q 024821 146 QALEDE-----------YGG--------WINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 146 ~~l~~~-----------~gG--------w~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
.|+.+. .|. |....+++.|.+|-+...++|.+.. -|+-|..
T Consensus 92 ~Wv~~~~~~~di~ytd~~G~rn~E~lSp~~~grt~~~Y~dfm~sF~~~f~~~~---~~I~~I~ 151 (402)
T PF01373_consen 92 SWVWEIGKKDDIFYTDRSGNRNKEYLSPVLDGRTLQCYSDFMRSFRDNFSDYL---STITEIQ 151 (402)
T ss_dssp HHHHHHHHHSGGEEE-TTS-EEEEEE-CTBTTBCHHHHHHHHHHHHHHCHHHH---TGEEEEE
T ss_pred HHHHhccccCCcEEECCCCCcCcceeecccCCchHHHHHHHHHHHHHHHHHHH---hhheEEE
Confidence 998642 122 3333458999999999888887754 4444443
No 27
>PLN00197 beta-amylase; Provisional
Probab=97.19 E-value=0.0011 Score=65.62 Aligned_cols=108 Identities=19% Similarity=0.307 Sum_probs=82.9
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc-C-----------CCCc
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH-H-----------LDLP 145 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~-H-----------~~~P 145 (262)
-.-.+..++.+|.+||+.+-+.+-|--+++++++++| +..|+++++.+++.|++..+.|. | .-+|
T Consensus 126 ~~~l~~~L~~LK~~GVdGVmvDvWWGiVE~~~p~~Yd---WsgY~~L~~mvr~~GLKlq~VmSFHqCGGNVGD~~~IpLP 202 (573)
T PLN00197 126 RKAMKASLQALKSAGVEGIMMDVWWGLVERESPGVYN---WGGYNELLEMAKRHGLKVQAVMSFHQCGGNVGDSCTIPLP 202 (573)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeeeccCCCCcCC---cHHHHHHHHHHHHcCCeEEEEEEecccCCCCCCcccccCC
Confidence 3347889999999999999999999999999899999 55699999999999999888774 5 2589
Q ss_pred HhHHHh-----------hCCCCC----------------hhhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 146 QALEDE-----------YGGWIN----------------RMIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 146 ~~l~~~-----------~gGw~~----------------~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
.|+.+. ..|..| +.-++.|.+|-+-.-++|.+... -|+.|..+
T Consensus 203 ~WV~~~g~~dpDifftDr~G~rn~EyLSlg~D~~pvl~GRTpiq~Y~DFM~SFr~~F~~~l~--~~I~eI~V 272 (573)
T PLN00197 203 KWVVEEVDKDPDLAYTDQWGRRNYEYVSLGCDTLPVLKGRTPVQCYADFMRAFRDNFKHLLG--DTIVEIQV 272 (573)
T ss_pred HHHHHhhccCCCceeecCCCCcccceeccccccccccCCCCHHHHHHHHHHHHHHHHHHHhc--CceeEEEe
Confidence 997752 022222 22368888888888888777544 35555444
No 28
>PLN03059 beta-galactosidase; Provisional
Probab=97.13 E-value=0.0028 Score=65.84 Aligned_cols=109 Identities=15% Similarity=0.156 Sum_probs=85.5
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc--------CCCCcHhHHH
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH--------HLDLPQALED 150 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~--------H~~~P~~l~~ 150 (262)
..|++=++.||++|+|++-+=+-|.--+|. +|++|.+|..=..++|+.+.+.|+-+++-.- .-.+|.||.+
T Consensus 59 ~~W~d~L~k~Ka~GlNtV~tYV~Wn~HEp~-~G~~dF~G~~DL~~Fl~la~e~GLyvilRpGPYIcAEw~~GGlP~WL~~ 137 (840)
T PLN03059 59 EMWPDLIQKAKDGGLDVIQTYVFWNGHEPS-PGNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKY 137 (840)
T ss_pred HHHHHHHHHHHHcCCCeEEEEecccccCCC-CCeeeccchHHHHHHHHHHHHcCCEEEecCCcceeeeecCCCCchhhhc
Confidence 347788999999999999999999999999 8999999999999999999999988777542 3467999975
Q ss_pred hhCCC----CChhhHHHHHHHHHHHHHHhC---------CCCceEEeccCCc
Q 024821 151 EYGGW----INRMIVKDFTAYADVCFREFG---------DRVSYWTTVNEPN 189 (262)
Q Consensus 151 ~~gGw----~~~~~~~~F~~ya~~v~~~~g---------d~V~~w~t~NEP~ 189 (262)
. .|- .++...++-.+|.+.+++.++ +-|-...+=||-.
T Consensus 138 ~-~~i~~Rs~d~~fl~~v~~~~~~l~~~l~~~~l~~~~GGPIImvQIENEYG 188 (840)
T PLN03059 138 V-PGIEFRTDNGPFKAAMQKFTEKIVDMMKSEKLFEPQGGPIILSQIENEYG 188 (840)
T ss_pred C-CCcccccCCHHHHHHHHHHHHHHHHHHhhcceeecCCCcEEEEEeccccc
Confidence 4 442 255666777777777777774 2245556777754
No 29
>COG3693 XynA Beta-1,4-xylanase [Carbohydrate transport and metabolism]
Probab=97.11 E-value=0.0013 Score=61.36 Aligned_cols=96 Identities=19% Similarity=0.265 Sum_probs=74.6
Q ss_pred HHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCccee-ec-cCCCCcHhHHHhhCCCCChhhHHHHHH
Q 024821 89 ADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHV-TL-HHLDLPQALEDEYGGWINRMIVKDFTA 166 (262)
Q Consensus 89 k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~v-tL-~H~~~P~~l~~~~gGw~~~~~~~~F~~ 166 (262)
++.+.=+-=--.-|.-|+|+ +|.+| ++.-|.+++-++++|+..-- || .|--.|.|+.. ..+..+...+...+
T Consensus 56 re~n~iTpenemKwe~i~p~-~G~f~---Fe~AD~ia~FAr~h~m~lhGHtLvW~~q~P~W~~~--~e~~~~~~~~~~e~ 129 (345)
T COG3693 56 RECNQITPENEMKWEAIEPE-RGRFN---FEAADAIANFARKHNMPLHGHTLVWHSQVPDWLFG--DELSKEALAKMVEE 129 (345)
T ss_pred hhhcccccccccccccccCC-CCccC---ccchHHHHHHHHHcCCeeccceeeecccCCchhhc--cccChHHHHHHHHH
Confidence 34444333344578999998 89999 55579999999999997432 22 35567889853 23677889999999
Q ss_pred HHHHHHHHhCCCCceEEeccCCch
Q 024821 167 YADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 167 ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
+...|++||++.|..|-+.|||.-
T Consensus 130 hI~tV~~rYkg~~~sWDVVNE~vd 153 (345)
T COG3693 130 HIKTVVGRYKGSVASWDVVNEAVD 153 (345)
T ss_pred HHHHHHHhccCceeEEEecccccC
Confidence 999999999999999999999976
No 30
>PLN02801 beta-amylase
Probab=96.96 E-value=0.0034 Score=61.59 Aligned_cols=100 Identities=19% Similarity=0.383 Sum_probs=79.0
Q ss_pred cccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc-C-----------CCC
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH-H-----------LDL 144 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~-H-----------~~~ 144 (262)
.-...+..++.+|++|++.+-+.+-|--+|+++++++| +..|+++++.+++.|++..+.|. | .-+
T Consensus 35 ~~~~l~~~L~~LK~~GVdGVmvDVWWGiVE~~~P~~Yd---WsgY~~l~~mvr~~GLKlq~vmSFHqCGGNVGD~~~IpL 111 (517)
T PLN02801 35 DEEGLEKQLKRLKEAGVDGVMVDVWWGIVESKGPKQYD---WSAYRSLFELVQSFGLKIQAIMSFHQCGGNVGDAVNIPI 111 (517)
T ss_pred CHHHHHHHHHHHHHcCCCEEEEeeeeeeeccCCCCccC---cHHHHHHHHHHHHcCCeEEEEEEecccCCCCCCcccccC
Confidence 34457889999999999999999999999998899999 55699999999999999877764 4 358
Q ss_pred cHhHHHh-----------hCCCCC----------------hhhHHHHHHHHHHHHHHhCCCC
Q 024821 145 PQALEDE-----------YGGWIN----------------RMIVKDFTAYADVCFREFGDRV 179 (262)
Q Consensus 145 P~~l~~~-----------~gGw~~----------------~~~~~~F~~ya~~v~~~~gd~V 179 (262)
|.|+.+. ..|-.| +.-++.|.+|-+-.-++|.+..
T Consensus 112 P~WV~~~g~~~pDi~ftDr~G~rn~EyLSlg~D~~pvl~GRTplq~Y~Dfm~SFr~~F~~~l 173 (517)
T PLN02801 112 PQWVRDVGDSDPDIFYTNRSGNRNKEYLSIGVDNLPLFHGRTAVEMYSDYMKSFRENMADFL 173 (517)
T ss_pred CHHHHHhhccCCCceeecCCCCcCcceeeeccCcccccCCCCHHHHHHHHHHHHHHHHHHhc
Confidence 9998752 022222 2246888888888888887754
No 31
>PLN02905 beta-amylase
Probab=96.90 E-value=0.0034 Score=62.88 Aligned_cols=100 Identities=15% Similarity=0.280 Sum_probs=78.3
Q ss_pred ccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc-CC-----------C
Q 024821 76 DEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH-HL-----------D 143 (262)
Q Consensus 76 d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~-H~-----------~ 143 (262)
....-.+..++.+|.+||+.+-+.+-|--+++++++++| +..|+++++.+++.|++..+.|. |- -
T Consensus 283 ~~~~al~a~L~aLK~aGVdGVmvDVWWGiVE~~gP~~Yd---WsgY~~L~~mvr~~GLKlqvVMSFHqCGGNVGD~~~IP 359 (702)
T PLN02905 283 ADPDGLLKQLRILKSINVDGVKVDCWWGIVEAHAPQEYN---WNGYKRLFQMVRELKLKLQVVMSFHECGGNVGDDVCIP 359 (702)
T ss_pred cCHHHHHHHHHHHHHcCCCEEEEeeeeeeeecCCCCcCC---cHHHHHHHHHHHHcCCeEEEEEEecccCCCCCCccccc
Confidence 455567889999999999999999999999999899999 55699999999999999888774 52 5
Q ss_pred CcHhHHHh--------h---CCCCC----------------hhhHHHHHHHHHHHHHHhCCC
Q 024821 144 LPQALEDE--------Y---GGWIN----------------RMIVKDFTAYADVCFREFGDR 178 (262)
Q Consensus 144 ~P~~l~~~--------~---gGw~~----------------~~~~~~F~~ya~~v~~~~gd~ 178 (262)
+|.|+.+. | .|..| +.-++.|.+|-+-.-++|.+.
T Consensus 360 LP~WV~e~g~~nPDifftDrsG~rn~EyLSlg~D~~pvl~GRTplq~Y~DFM~SFr~~F~~f 421 (702)
T PLN02905 360 LPHWVAEIGRSNPDIFFTDREGRRNPECLSWGIDKERILRGRTALEVYFDYMRSFRVEFDEF 421 (702)
T ss_pred CCHHHHHhhhcCCCceEecCCCCccCceeeeecccccccCCCCHHHHHHHHHHHHHHHHHHH
Confidence 89997752 0 22222 234577888887777777664
No 32
>PLN02705 beta-amylase
Probab=96.90 E-value=0.0033 Score=62.86 Aligned_cols=98 Identities=17% Similarity=0.217 Sum_probs=76.9
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc-CC-----------CCc
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH-HL-----------DLP 145 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~-H~-----------~~P 145 (262)
-.-.+..++.+|.+||+.+-+.+-|--+|+++++++| +..|+++++.+++.|++..+.|. |- -+|
T Consensus 267 ~~al~a~L~aLK~aGVdGVmvDVWWGiVE~~~P~~Yd---WsgY~~L~~mvr~~GLKlqvVmSFHqCGGNVGD~~~IPLP 343 (681)
T PLN02705 267 PEGVRQELSHMKSLNVDGVVVDCWWGIVEGWNPQKYV---WSGYRELFNIIREFKLKLQVVMAFHEYGGNASGNVMISLP 343 (681)
T ss_pred HHHHHHHHHHHHHcCCCEEEEeeeeeEeecCCCCcCC---cHHHHHHHHHHHHcCCeEEEEEEeeccCCCCCCcccccCC
Confidence 3457888999999999999999999999998899999 55699999999999999887763 42 589
Q ss_pred HhHHHh-----------hCCCCC----------------hhhHHHHHHHHHHHHHHhCCC
Q 024821 146 QALEDE-----------YGGWIN----------------RMIVKDFTAYADVCFREFGDR 178 (262)
Q Consensus 146 ~~l~~~-----------~gGw~~----------------~~~~~~F~~ya~~v~~~~gd~ 178 (262)
.|+.+. ..|..| +.-++.|.+|.+-.-++|.+.
T Consensus 344 ~WV~e~g~~nPDifftDr~G~rn~EyLSlg~D~~pvl~GRTplq~Y~DFM~SFr~~F~~f 403 (681)
T PLN02705 344 QWVLEIGKDNQDIFFTDREGRRNTECLSWSIDKERVLKGRTGIEVYFDFMRSFRSEFDDL 403 (681)
T ss_pred HHHHHhcccCCCceeecCCCCcccceeeeecCcccccCCCCHHHHHHHHHHHHHHHHHHh
Confidence 998752 012222 224578888888777777664
No 33
>PF02836 Glyco_hydro_2_C: Glycosyl hydrolases family 2, TIM barrel domain; InterPro: IPR006103 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 2 GH2 from CAZY comprises enzymes with several known activities; beta-galactosidase (3.2.1.23 from EC); beta-mannosidase (3.2.1.25 from EC); beta-glucuronidase (3.2.1.31 from EC). These enzymes contain a conserved glutamic acid residue which has been shown [], in Escherichia coli lacZ (P00722 from SWISSPROT), to be the general acid/base catalyst in the active site of the enzyme. Beta-galactosidase from E. coli has a TIM-barrel-like core surrounded by four other largely beta domains [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3CMG_A 3FN9_C 1YQ2_A 3K4D_B 3LPG_B 3LPF_A 3K4A_B 3K46_B 3GM8_A 3DEC_A ....
Probab=96.65 E-value=0.0093 Score=54.57 Aligned_cols=95 Identities=15% Similarity=0.155 Sum_probs=62.4
Q ss_pred cccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhC---
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYG--- 153 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~g--- 153 (262)
....++.|+.+||++|+|++|++- .|. + .++++.|-+.||-++.-+.....-.|- ..+
T Consensus 34 ~~~~~~~d~~l~k~~G~N~iR~~h-----~p~-----~-------~~~~~~cD~~GilV~~e~~~~~~~~~~--~~~~~~ 94 (298)
T PF02836_consen 34 PDEAMERDLELMKEMGFNAIRTHH-----YPP-----S-------PRFYDLCDELGILVWQEIPLEGHGSWQ--DFGNCN 94 (298)
T ss_dssp -HHHHHHHHHHHHHTT-SEEEETT-----S-------S-------HHHHHHHHHHT-EEEEE-S-BSCTSSS--STSCTS
T ss_pred CHHHHHHHHHHHHhcCcceEEccc-----ccC-----c-------HHHHHHHhhcCCEEEEeccccccCccc--cCCccc
Confidence 346789999999999999999943 232 1 456778888999888665421111111 001
Q ss_pred -CCCChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCch
Q 024821 154 -GWINRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPNG 190 (262)
Q Consensus 154 -Gw~~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~~ 190 (262)
--.+++..+.+.+-++..++++.+. |-.|.+.||+..
T Consensus 95 ~~~~~~~~~~~~~~~~~~~v~~~~NHPSIi~W~~gNE~~~ 134 (298)
T PF02836_consen 95 YDADDPEFRENAEQELREMVRRDRNHPSIIMWSLGNESDY 134 (298)
T ss_dssp CTTTSGGHHHHHHHHHHHHHHHHTT-TTEEEEEEEESSHH
T ss_pred cCCCCHHHHHHHHHHHHHHHHcCcCcCchheeecCccCcc
Confidence 0135677888888889999999885 888999999953
No 34
>PF14587 Glyco_hydr_30_2: O-Glycosyl hydrolase family 30; PDB: 3CLW_B.
Probab=96.63 E-value=0.0049 Score=58.90 Aligned_cols=80 Identities=14% Similarity=0.179 Sum_probs=47.3
Q ss_pred CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHh---hCC-----CCChhhHHHHHHHHHHHHHHhCC---C
Q 024821 110 RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDE---YGG-----WINRMIVKDFTAYADVCFREFGD---R 178 (262)
Q Consensus 110 ~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~---~gG-----w~~~~~~~~F~~ya~~v~~~~gd---~ 178 (262)
+|.+|..+=.-=+.++++++++|+..++.. -+.-|.|+... +|+ -+.++..+.|++|-..|+++|.. .
T Consensus 95 dg~yDW~~D~gQrwfL~~Ak~rGV~~f~aF-SNSPP~~MT~NG~~~g~~~~~~NLk~d~y~~FA~YLa~Vv~~~~~~GI~ 173 (384)
T PF14587_consen 95 DGSYDWDADAGQRWFLKAAKERGVNIFEAF-SNSPPWWMTKNGSASGGDDGSDNLKPDNYDAFADYLADVVKHYKKWGIN 173 (384)
T ss_dssp TS-B-TTSSHHHHHHHHHHHHTT---EEEE--SSS-GGGSSSSSSB-S-SSS-SS-TT-HHHHHHHHHHHHHHHHCTT--
T ss_pred CCCcCCCCCHHHHHHHHHHHHcCCCeEEEe-ecCCCHHHhcCCCCCCCCccccccChhHHHHHHHHHHHHHHHHHhcCCc
Confidence 566665443445668999999999988754 45666665422 111 24567889999999999999943 5
Q ss_pred CceEEeccCCch
Q 024821 179 VSYWTTVNEPNG 190 (262)
Q Consensus 179 V~~w~t~NEP~~ 190 (262)
+++-.++|||+.
T Consensus 174 f~~IsP~NEP~~ 185 (384)
T PF14587_consen 174 FDYISPFNEPQW 185 (384)
T ss_dssp EEEEE--S-TTS
T ss_pred cceeCCcCCCCC
Confidence 888999999994
No 35
>PRK10150 beta-D-glucuronidase; Provisional
Probab=95.82 E-value=0.031 Score=56.28 Aligned_cols=93 Identities=17% Similarity=0.169 Sum_probs=64.4
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHH-------hh
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALED-------EY 152 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~-------~~ 152 (262)
.+..|+++||++|+|++|+|- .|. + .++++.|=+.||-++.-+.-+....|... ..
T Consensus 314 ~~~~d~~l~K~~G~N~vR~sh-----~p~-----~-------~~~~~~cD~~GllV~~E~p~~~~~~~~~~~~~~~~~~~ 376 (604)
T PRK10150 314 LNVHDHNLMKWIGANSFRTSH-----YPY-----S-------EEMLDLADRHGIVVIDETPAVGLNLSFGAGLEAGNKPK 376 (604)
T ss_pred HHHHHHHHHHHCCCCEEEecc-----CCC-----C-------HHHHHHHHhcCcEEEEeccccccccccccccccccccc
Confidence 478899999999999999962 342 1 35678888999977765432222222210 00
Q ss_pred CCCC----ChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCc
Q 024821 153 GGWI----NRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPN 189 (262)
Q Consensus 153 gGw~----~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~ 189 (262)
..|. +++..+.+.+-++.+++|+++. |-.|.+-||+.
T Consensus 377 ~~~~~~~~~~~~~~~~~~~~~~mv~r~~NHPSIi~Ws~gNE~~ 419 (604)
T PRK10150 377 ETYSEEAVNGETQQAHLQAIRELIARDKNHPSVVMWSIANEPA 419 (604)
T ss_pred ccccccccchhHHHHHHHHHHHHHHhccCCceEEEEeeccCCC
Confidence 1222 3567788888899999999985 77999999984
No 36
>PF14488 DUF4434: Domain of unknown function (DUF4434)
Probab=95.77 E-value=0.069 Score=45.38 Aligned_cols=104 Identities=17% Similarity=0.356 Sum_probs=68.8
Q ss_pred cccHHHHHHHHHcCCCcceeccccCcc-----ccCC--CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHh
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRL-----IPNG--RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDE 151 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI-----~P~g--~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~ 151 (262)
.+|+++++.|+++|++.+=+. |+.. .|.. ++.+.....+..+.+++++.+.||++++.|+. -|.|....
T Consensus 20 ~~W~~~~~~m~~~GidtlIlq--~~~~~~~~~yps~~~~~~~~~~~~d~l~~~L~~A~~~Gmkv~~Gl~~--~~~~w~~~ 95 (166)
T PF14488_consen 20 AQWREEFRAMKAIGIDTLILQ--WTGYGGFAFYPSKLSPGGFYMPPVDLLEMILDAADKYGMKVFVGLYF--DPDYWDQG 95 (166)
T ss_pred HHHHHHHHHHHHcCCcEEEEE--EeecCCcccCCccccCccccCCcccHHHHHHHHHHHcCCEEEEeCCC--Cchhhhcc
Confidence 358999999999999988533 4443 2321 11223344578999999999999999999984 45665421
Q ss_pred hCCCCChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCch
Q 024821 152 YGGWINRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPNG 190 (262)
Q Consensus 152 ~gGw~~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~~ 190 (262)
-..| -++.=..-++.+.++||.. +..|-+-+|+.-
T Consensus 96 ~~~~----~~~~~~~v~~el~~~yg~h~sf~GWYip~E~~~ 132 (166)
T PF14488_consen 96 DLDW----EAERNKQVADELWQRYGHHPSFYGWYIPYEIDD 132 (166)
T ss_pred CHHH----HHHHHHHHHHHHHHHHcCCCCCceEEEecccCC
Confidence 0012 2233334667788888874 666777777653
No 37
>KOG0496 consensus Beta-galactosidase [Carbohydrate transport and metabolism]
Probab=95.26 E-value=0.075 Score=53.73 Aligned_cols=109 Identities=15% Similarity=0.152 Sum_probs=85.2
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc--------CCCCcHhHHH
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH--------HLDLPQALED 150 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~--------H~~~P~~l~~ 150 (262)
..|++=|+.+|++|+|++-.=+-|.-.+|. +|+.|.+|.-=..++|..+.+.|+-+++-+- +=.+|.||..
T Consensus 49 e~W~~~i~k~k~~Gln~IqtYVfWn~Hep~-~g~y~FsG~~DlvkFikl~~~~GLyv~LRiGPyIcaEw~~GG~P~wL~~ 127 (649)
T KOG0496|consen 49 EMWPDLIKKAKAGGLNVIQTYVFWNLHEPS-PGKYDFSGRYDLVKFIKLIHKAGLYVILRIGPYICAEWNFGGLPWWLRN 127 (649)
T ss_pred hhhHHHHHHHHhcCCceeeeeeecccccCC-CCcccccchhHHHHHHHHHHHCCeEEEecCCCeEEecccCCCcchhhhh
Confidence 447788999999999999999999999999 8999999988888999999999987655432 5567988876
Q ss_pred hhCCC----CChhhHHHHHHHHHHHHHHhCC-------CCceEEeccCCc
Q 024821 151 EYGGW----INRMIVKDFTAYADVCFREFGD-------RVSYWTTVNEPN 189 (262)
Q Consensus 151 ~~gGw----~~~~~~~~F~~ya~~v~~~~gd-------~V~~w~t~NEP~ 189 (262)
. .|- .|+.+..+..+|.+.++...+. =|-.-.+=||-.
T Consensus 128 ~-pg~~~Rt~nepfk~~~~~~~~~iv~~mk~L~~~qGGPIIl~QIENEYG 176 (649)
T KOG0496|consen 128 V-PGIVFRTDNEPFKAEMERWTTKIVPMMKKLFASQGGPIILVQIENEYG 176 (649)
T ss_pred C-CceEEecCChHHHHHHHHHHHHHHHHHHHHHhhcCCCEEEEEeechhh
Confidence 5 442 3677888899999998885541 244445677755
No 38
>PRK09525 lacZ beta-D-galactosidase; Reviewed
Probab=93.65 E-value=0.23 Score=53.34 Aligned_cols=90 Identities=17% Similarity=0.166 Sum_probs=62.9
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc---CCCCcHhHHHhhCC
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH---HLDLPQALEDEYGG 154 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~---H~~~P~~l~~~~gG 154 (262)
...+++||++||++|+|++|+| -.|. + .++.+.|=+.||-++--.. |--.|. .. .
T Consensus 370 ~e~~~~di~lmK~~g~NaVR~s-----HyP~-----~-------p~fydlcDe~GilV~dE~~~e~hg~~~~---~~--~ 427 (1027)
T PRK09525 370 EETMVQDILLMKQHNFNAVRCS-----HYPN-----H-------PLWYELCDRYGLYVVDEANIETHGMVPM---NR--L 427 (1027)
T ss_pred HHHHHHHHHHHHHCCCCEEEec-----CCCC-----C-------HHHHHHHHHcCCEEEEecCccccCCccc---cC--C
Confidence 4567899999999999999996 2443 1 3456788889997776542 111110 00 1
Q ss_pred CCChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCc
Q 024821 155 WINRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPN 189 (262)
Q Consensus 155 w~~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~ 189 (262)
..+++..+.+.+=++..++|.++. |-.|..-||+.
T Consensus 428 ~~dp~~~~~~~~~~~~mV~RdrNHPSIi~WSlgNE~~ 464 (1027)
T PRK09525 428 SDDPRWLPAMSERVTRMVQRDRNHPSIIIWSLGNESG 464 (1027)
T ss_pred CCCHHHHHHHHHHHHHHHHhCCCCCEEEEEeCccCCC
Confidence 134566677777788899999985 78899999975
No 39
>PF07488 Glyco_hydro_67M: Glycosyl hydrolase family 67 middle domain; InterPro: IPR011100 Alpha-glucuronidases, components of an ensemble of enzymes central to the recycling of photosynthetic biomass, remove the alpha-1,2 linked 4-O-methyl glucuronic acid from xylans. This family represents the central catalytic domain of alpha-glucuronidase [].; GO: 0046559 alpha-glucuronidase activity, 0045493 xylan catabolic process, 0005576 extracellular region; PDB: 1MQP_A 1K9E_A 1MQQ_A 1L8N_A 1K9D_A 1MQR_A 1K9F_A 1GQL_A 1GQI_B 1GQJ_B ....
Probab=93.31 E-value=0.56 Score=43.71 Aligned_cols=88 Identities=20% Similarity=0.359 Sum_probs=63.3
Q ss_pred ccccHHHHHHHHHcCCCcceec---cccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCC
Q 024821 78 YHKYKEDVKLMADTGLDAYRFS---ISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGG 154 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~s---i~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gG 154 (262)
..||.+=.++++++|+|++-+. ..-..| ..+.++.+.++-+.++.+||++.+++. |.-|.-+ ||
T Consensus 56 ~~R~~~YARllASiGINgvvlNNVNa~~~~L--------t~~~l~~v~~lAdvfRpYGIkv~LSvn-FasP~~l----gg 122 (328)
T PF07488_consen 56 LTRYRDYARLLASIGINGVVLNNVNANPKLL--------TPEYLDKVARLADVFRPYGIKVYLSVN-FASPIEL----GG 122 (328)
T ss_dssp -HHHHHHHHHHHHTT--EEE-S-SS--CGGG--------STTTHHHHHHHHHHHHHTT-EEEEEE--TTHHHHT----TS
T ss_pred hhHHHHHHHHHhhcCCceEEecccccChhhc--------CHHHHHHHHHHHHHHhhcCCEEEEEee-ccCCccc----CC
Confidence 4688888999999999987653 122222 234578889999999999999999995 7788654 55
Q ss_pred -----CCChhhHHHHHHHHHHHHHHhCCC
Q 024821 155 -----WINRMIVKDFTAYADVCFREFGDR 178 (262)
Q Consensus 155 -----w~~~~~~~~F~~ya~~v~~~~gd~ 178 (262)
-++++++..|.+=++.+.++.-|-
T Consensus 123 L~TaDPld~~V~~WW~~k~~eIY~~IPDf 151 (328)
T PF07488_consen 123 LPTADPLDPEVRQWWKDKADEIYSAIPDF 151 (328)
T ss_dssp -S---TTSHHHHHHHHHHHHHHHHH-TT-
T ss_pred cCcCCCCCHHHHHHHHHHHHHHHHhCCCc
Confidence 357899999999999999998773
No 40
>PRK10340 ebgA cryptic beta-D-galactosidase subunit alpha; Reviewed
Probab=92.66 E-value=0.42 Score=51.40 Aligned_cols=90 Identities=14% Similarity=0.204 Sum_probs=62.4
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc---C-CCCcHhHHHhhC
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH---H-LDLPQALEDEYG 153 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~---H-~~~P~~l~~~~g 153 (262)
...+++|+++||++|+|++|+| ..|.. ..+.+.|=+.||-++--.. | |.... .+
T Consensus 354 ~e~~~~dl~lmK~~g~NavR~s-----HyP~~------------~~fydlcDe~GllV~dE~~~e~~g~~~~~----~~- 411 (1021)
T PRK10340 354 MDRVEKDIQLMKQHNINSVRTA-----HYPND------------PRFYELCDIYGLFVMAETDVESHGFANVG----DI- 411 (1021)
T ss_pred HHHHHHHHHHHHHCCCCEEEec-----CCCCC------------HHHHHHHHHCCCEEEECCcccccCccccc----cc-
Confidence 4578899999999999999996 25541 3567788889997776431 1 11100 00
Q ss_pred CC--CChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCc
Q 024821 154 GW--INRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPN 189 (262)
Q Consensus 154 Gw--~~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~ 189 (262)
.+ .+++..+.|.+=++.+++|.++. |-.|..-||..
T Consensus 412 ~~~~~~p~~~~~~~~~~~~mV~RdrNHPSIi~WslGNE~~ 451 (1021)
T PRK10340 412 SRITDDPQWEKVYVDRIVRHIHAQKNHPSIIIWSLGNESG 451 (1021)
T ss_pred ccccCCHHHHHHHHHHHHHHHHhCCCCCEEEEEECccCcc
Confidence 01 23445566777788899999985 78899999973
No 41
>COG3250 LacZ Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]
Probab=92.30 E-value=0.75 Score=48.31 Aligned_cols=90 Identities=16% Similarity=0.149 Sum_probs=65.0
Q ss_pred CccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCC
Q 024821 75 CDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGG 154 (262)
Q Consensus 75 ~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gG 154 (262)
+-.+..+.+|+++||++|+|++|.| =.|+. .++.+.|-+.||-++=..... .+++
T Consensus 317 ~~~~~~~~~dl~lmk~~n~N~vRts-----HyP~~------------~~~ydLcDelGllV~~Ea~~~--------~~~~ 371 (808)
T COG3250 317 VTDEDAMERDLKLMKEANMNSVRTS-----HYPNS------------EEFYDLCDELGLLVIDEAMIE--------THGM 371 (808)
T ss_pred ccCHHHHHHHHHHHHHcCCCEEEec-----CCCCC------------HHHHHHHHHhCcEEEEecchh--------hcCC
Confidence 3445569999999999999999998 44541 456677888999877654321 1233
Q ss_pred CCChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCc
Q 024821 155 WINRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPN 189 (262)
Q Consensus 155 w~~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~ 189 (262)
..+++..+...+=++..++|-++. |-.|..=||..
T Consensus 372 ~~~~~~~k~~~~~i~~mver~knHPSIiiWs~gNE~~ 408 (808)
T COG3250 372 PDDPEWRKEVSEEVRRMVERDRNHPSIIIWSLGNESG 408 (808)
T ss_pred CCCcchhHHHHHHHHHHHHhccCCCcEEEEecccccc
Confidence 455555566666678889998874 88899999955
No 42
>COG3934 Endo-beta-mannanase [Carbohydrate transport and metabolism]
Probab=91.61 E-value=0.064 Score=52.64 Aligned_cols=109 Identities=16% Similarity=0.110 Sum_probs=79.5
Q ss_pred cHHHHHHHHHcCCCcceecccc-CccccCCCCCCChhh-hhHHHHHHHHHHHcCCcceeecc----CCCCcHhHHHhhCC
Q 024821 81 YKEDVKLMADTGLDAYRFSISW-SRLIPNGRGPVNPKG-LQYYNNLINELISYGIQPHVTLH----HLDLPQALEDEYGG 154 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~W-sRI~P~g~g~~n~~~-l~~y~~~id~l~~~GI~p~vtL~----H~~~P~~l~~~~gG 154 (262)
.+.|++.++.+|++..|++|.= .. +-+..|..|.+. +.+.+.+++.+...+|+.++||. |+.--.|-.-=.||
T Consensus 28 i~~dle~a~~vg~k~lR~fiLDgEd-c~d~~G~~na~s~~~y~~~fla~a~~l~lkvlitlivg~~hmgg~Nw~Ipwag~ 106 (587)
T COG3934 28 IKADLEPAGFVGVKDLRLFILDGED-CRDKEGYRNAGSNVWYAAWFLAPAGYLDLKVLITLIVGLKHMGGTNWRIPWAGE 106 (587)
T ss_pred hhcccccccCccceeEEEEEecCcc-hhhhhceecccccHHHHHHHhhhcccCcceEEEEEeecccccCcceeEeecCCC
Confidence 4578899999999999999643 33 223257778765 99999999999999999999986 22211111000011
Q ss_pred ------CCChhhHHHHHHHHHHHHHHhCCC--CceEEeccCCch
Q 024821 155 ------WINRMIVKDFTAYADVCFREFGDR--VSYWTTVNEPNG 190 (262)
Q Consensus 155 ------w~~~~~~~~F~~ya~~v~~~~gd~--V~~w~t~NEP~~ 190 (262)
...++....|.+|++.+++.|+.. +--|+.=|||-+
T Consensus 107 ~~pdn~iyD~k~~~~~kkyvedlVk~yk~~ptI~gw~l~Ne~lv 150 (587)
T COG3934 107 QSPDNVIYDPKFRGPGKKYVEDLVKPYKLDPTIAGWALRNEPLV 150 (587)
T ss_pred CCccccccchhhcccHHHHHHHHhhhhccChHHHHHHhcCCccc
Confidence 235667788999999999998875 667999999776
No 43
>COG3664 XynB Beta-xylosidase [Carbohydrate transport and metabolism]
Probab=91.30 E-value=0.43 Score=46.05 Aligned_cols=100 Identities=16% Similarity=0.197 Sum_probs=70.6
Q ss_pred HHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCC--ChhhHHHHH
Q 024821 88 MADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWI--NRMIVKDFT 165 (262)
Q Consensus 88 ~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~--~~~~~~~F~ 165 (262)
-+|+||+-+|.---|.-++.+ --++ ..++++++|.+.+.|+.-+.+-.||+.+.-....|.+-. .....+.++
T Consensus 14 ~~Ei~v~yi~~~~v~h~~~q~--~~~~---~t~~d~i~d~~~~~~~~~ie~~l~~~~l~~~~~~wq~n~~~~~~~~dl~~ 88 (428)
T COG3664 14 DDEIQVNYIRRHGVWHVNAQK--LFYP---FTYIDEIIDTLLDLGLDLIELFLIWNNLNTKEHQWQLNVDDPKSVFDLIA 88 (428)
T ss_pred hhhhceeeehhcceeeeeecc--ccCC---hHHHHHHHHHHHHhccHHHHHhhcccchhhhhhhcccccCCcHhHHHHHH
Confidence 368899999988888722222 2344 678999999999999544445567777754443333322 234789999
Q ss_pred HHHHHHHHHhCCC-Cc--eEEeccCCchhc
Q 024821 166 AYADVCFREFGDR-VS--YWTTVNEPNGFA 192 (262)
Q Consensus 166 ~ya~~v~~~~gd~-V~--~w~t~NEP~~~~ 192 (262)
.++..|+.++|-+ |. ....+||||..+
T Consensus 89 ~fl~h~~~~vg~e~v~kw~f~~~~~pn~~a 118 (428)
T COG3664 89 AFLKHVIRRVGVEFVRKWPFYSPNEPNLLA 118 (428)
T ss_pred HHHHHHHHHhChhheeecceeecCCCCccc
Confidence 9999999999963 33 456999999874
No 44
>COG3867 Arabinogalactan endo-1,4-beta-galactosidase [Carbohydrate transport and metabolism]
Probab=91.18 E-value=5.8 Score=37.26 Aligned_cols=137 Identities=18% Similarity=0.346 Sum_probs=78.8
Q ss_pred ccCCCCCCCeeeeeccc-ccccCCCCCCCCCCcccccccccCCCCCCCCcCCccccccHHHHHHHHHcCCCcceeccccC
Q 024821 25 TKNDFPPGFIFGSGTSA-YQVEGAANEDGRTPSIWDTFAHAGNVPGTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWS 103 (262)
Q Consensus 25 ~~~~fp~~FlwG~Atsa-~Q~EG~~~~~gk~~s~wd~~~~~~~~~~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~Ws 103 (262)
+....|+||.-|+-.|. .|+|-. ++| |.. .++. -++-++.+|+.|+|.+|+-| |-
T Consensus 31 ~v~~~~~dFikGaDis~l~~lE~~---Gvk-------f~d-----~ng~--------~qD~~~iLK~~GvNyvRlRv-wn 86 (403)
T COG3867 31 PVENSPNDFIKGADISSLIELENS---GVK-------FFD-----TNGV--------RQDALQILKNHGVNYVRLRV-WN 86 (403)
T ss_pred eccCChHHhhccccHHHHHHHHHc---Cce-------EEc-----cCCh--------HHHHHHHHHHcCcCeEEEEE-ec
Confidence 44458999999887654 456531 111 111 1111 13446999999999999976 22
Q ss_pred cccc-CCC----CCCChhhhhHHHHHHHHHHHcCCcceeecc---CCCCcHhHHHhhCCCCC---hhhHHHHHHHHHHHH
Q 024821 104 RLIP-NGR----GPVNPKGLQYYNNLINELISYGIQPHVTLH---HLDLPQALEDEYGGWIN---RMIVKDFTAYADVCF 172 (262)
Q Consensus 104 RI~P-~g~----g~~n~~~l~~y~~~id~l~~~GI~p~vtL~---H~~~P~~l~~~~gGw~~---~~~~~~F~~ya~~v~ 172 (262)
.=.. +|. |.-| ++.--++-..+++.||++++++| ||.-|..- ++-..|.+ ++...+--+|.+.+.
T Consensus 87 dP~dsngn~yggGnnD---~~k~ieiakRAk~~GmKVl~dFHYSDfwaDPakQ-~kPkaW~~l~fe~lk~avy~yTk~~l 162 (403)
T COG3867 87 DPYDSNGNGYGGGNND---LKKAIEIAKRAKNLGMKVLLDFHYSDFWADPAKQ-KKPKAWENLNFEQLKKAVYSYTKYVL 162 (403)
T ss_pred CCccCCCCccCCCcch---HHHHHHHHHHHHhcCcEEEeeccchhhccChhhc-CCcHHhhhcCHHHHHHHHHHHHHHHH
Confidence 1111 121 2223 44556677788999999999997 55555432 12234643 223344445555555
Q ss_pred HHhCC---CCceEEeccCCc
Q 024821 173 REFGD---RVSYWTTVNEPN 189 (262)
Q Consensus 173 ~~~gd---~V~~w~t~NEP~ 189 (262)
..+.+ ....-.+=||-+
T Consensus 163 ~~m~~eGi~pdmVQVGNEtn 182 (403)
T COG3867 163 TTMKKEGILPDMVQVGNETN 182 (403)
T ss_pred HHHHHcCCCccceEeccccC
Confidence 55543 566667889966
No 45
>smart00642 Aamy Alpha-amylase domain.
Probab=90.85 E-value=0.54 Score=39.75 Aligned_cols=64 Identities=14% Similarity=0.258 Sum_probs=45.1
Q ss_pred cccccHHHHHHHHHcCCCcceeccccCccc---------cCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeecc
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWSRLI---------PNGRGPVNP--KGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~WsRI~---------P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
.+....+-++.++++|++++-++--+.... |..--.+|+ -..+-++++|++|+++||++++++.
T Consensus 17 ~~~gi~~~l~yl~~lG~~~I~l~Pi~~~~~~~~~~~gY~~~d~~~i~~~~Gt~~d~~~lv~~~h~~Gi~vilD~V 91 (166)
T smart00642 17 DLQGIIEKLDYLKDLGVTAIWLSPIFESPQGYPSYHGYDISDYKQIDPRFGTMEDFKELVDAAHARGIKVILDVV 91 (166)
T ss_pred CHHHHHHHHHHHHHCCCCEEEECcceeCCCCCCCCCCcCccccCCCCcccCCHHHHHHHHHHHHHCCCEEEEEEC
Confidence 355567778899999999998876544442 211112332 2456789999999999999999864
No 46
>KOG2233 consensus Alpha-N-acetylglucosaminidase [Intracellular trafficking, secretion, and vesicular transport]
Probab=87.74 E-value=5 Score=39.90 Aligned_cols=112 Identities=20% Similarity=0.424 Sum_probs=68.9
Q ss_pred ccccHHHHHHHHHcCCCcceec----cccCccccC-C---------------------------CCCCCh----hhhhHH
Q 024821 78 YHKYKEDVKLMADTGLDAYRFS----ISWSRLIPN-G---------------------------RGPVNP----KGLQYY 121 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~s----i~WsRI~P~-g---------------------------~g~~n~----~~l~~y 121 (262)
+.+|+..|+.|+-.|+|..=.. +-|.+|+-. | .|...+ .-+-.=
T Consensus 77 w~qWeR~iDWmALnGinl~la~~gQEaIWqkVf~~lgl~~eeldeyftgpAflAW~RMGNl~awgGpLs~aw~~~ql~Lq 156 (666)
T KOG2233|consen 77 WEQWEREIDWMALNGINLVLAPLGQEAIWQKVFMGLGLQREELDEYFTGPAFLAWHRMGNLHAWGGPLSPAWMLNQLLLQ 156 (666)
T ss_pred hHHHHhHhhHHHHcCcceeeccchhHHHHHHHHHHcCCCHHHHHHhcccHHHHHHHHhcCccccCCCCCHHHHHHHHHHH
Confidence 5789999999999999965332 234444322 0 122211 113344
Q ss_pred HHHHHHHHHcCCcceeeccCCCCcHhHHHhh--------CCCC------------C---hhhHHHHHHHHHHHHHHhCCC
Q 024821 122 NNLINELISYGIQPHVTLHHLDLPQALEDEY--------GGWI------------N---RMIVKDFTAYADVCFREFGDR 178 (262)
Q Consensus 122 ~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~--------gGw~------------~---~~~~~~F~~ya~~v~~~~gd~ 178 (262)
+++|+.+++-||+|++--+---.|..|..-+ +-|. + |-+++-=..|.+...++||+-
T Consensus 157 krIidrm~~lGmTpvLPaFaG~VP~al~rlfPesnf~rl~rWn~f~s~~~C~l~v~P~dplF~eIgs~Flr~~~kefG~~ 236 (666)
T KOG2233|consen 157 KRIIDRMLELGMTPVLPAFAGHVPDALERLFPESNFTRLPRWNNFTSRYSCMLLVSPFDPLFQEIGSTFLRHQIKEFGGV 236 (666)
T ss_pred HHHHHHHHHcCCCccchhhccccHHHHHHhCchhceeeccccCCCCcceeeeEEccCCcchHHHHHHHHHHHHHHHhCCc
Confidence 7899999999999998776555788776432 2232 1 223344445667788999972
Q ss_pred --CceEEeccCCc
Q 024821 179 --VSYWTTVNEPN 189 (262)
Q Consensus 179 --V~~w~t~NEP~ 189 (262)
+-.=-||||..
T Consensus 237 tniy~~DpFNE~~ 249 (666)
T KOG2233|consen 237 TNIYSADPFNEIL 249 (666)
T ss_pred ccccccCcccccC
Confidence 22223999943
No 47
>PF03198 Glyco_hydro_72: Glucanosyltransferase; InterPro: IPR004886 This family is a group of yeast glycolipid proteins anchored to the membrane. It includes Candida albicans (Yeast) pH-regulated protein, which is required for apical growth and plays a role in morphogenesis and Saccharomyces cerevisiae glycolipid anchored surface protein.; PDB: 2W61_A 2W62_A 2W63_A.
Probab=85.77 E-value=3 Score=39.03 Aligned_cols=89 Identities=18% Similarity=0.235 Sum_probs=49.9
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChh
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRM 159 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~ 159 (262)
-.+.||.+||+||+|++|+= -|-|+ .| .|...+.|.+.||-++++|. .|.--.++..-|..-
T Consensus 54 ~C~rDi~~l~~LgiNtIRVY----~vdp~----~n------Hd~CM~~~~~aGIYvi~Dl~---~p~~sI~r~~P~~sw- 115 (314)
T PF03198_consen 54 ACKRDIPLLKELGINTIRVY----SVDPS----KN------HDECMSAFADAGIYVILDLN---TPNGSINRSDPAPSW- 115 (314)
T ss_dssp HHHHHHHHHHHHT-SEEEES-------TT----S--------HHHHHHHHHTT-EEEEES----BTTBS--TTS------
T ss_pred HHHHhHHHHHHcCCCEEEEE----EeCCC----CC------HHHHHHHHHhCCCEEEEecC---CCCccccCCCCcCCC-
Confidence 57899999999999999973 23443 23 78899999999999999997 442211221111111
Q ss_pred hHHHHHHHHHHHHHHhCC--CCceEEeccC
Q 024821 160 IVKDFTAYADVCFREFGD--RVSYWTTVNE 187 (262)
Q Consensus 160 ~~~~F~~ya~~v~~~~gd--~V~~w~t~NE 187 (262)
..+.|.+|.+. ++.|.. .+-....=||
T Consensus 116 ~~~l~~~~~~v-id~fa~Y~N~LgFf~GNE 144 (314)
T PF03198_consen 116 NTDLLDRYFAV-IDAFAKYDNTLGFFAGNE 144 (314)
T ss_dssp -HHHHHHHHHH-HHHHTT-TTEEEEEEEES
T ss_pred CHHHHHHHHHH-HHHhccCCceEEEEecce
Confidence 23556666554 455543 2444555555
No 48
>PLN02361 alpha-amylase
Probab=81.95 E-value=2.4 Score=41.03 Aligned_cols=64 Identities=16% Similarity=0.304 Sum_probs=47.5
Q ss_pred ccccccHHHHHHHHHcCCCcceeccccCccccCCCC-----CCChh--hhhHHHHHHHHHHHcCCcceeec
Q 024821 76 DEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRG-----PVNPK--GLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 76 d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g-----~~n~~--~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.+|....+.++.+++||++++=++-.....-+.|-. .+|+. ..+-++++|++|.++||++++++
T Consensus 26 ~~w~~i~~kl~~l~~lG~t~iwl~P~~~~~~~~GY~~~d~y~~~~~~Gt~~el~~li~~~h~~gi~vi~D~ 96 (401)
T PLN02361 26 DWWRNLEGKVPDLAKSGFTSAWLPPPSQSLAPEGYLPQNLYSLNSAYGSEHLLKSLLRKMKQYNVRAMADI 96 (401)
T ss_pred HHHHHHHHHHHHHHHcCCCEEEeCCCCcCCCCCCCCcccccccCcccCCHHHHHHHHHHHHHcCCEEEEEE
Confidence 478899999999999999999887655443333311 22322 24568999999999999999975
No 49
>PF00128 Alpha-amylase: Alpha amylase, catalytic domain; InterPro: IPR006047 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain, such as alpha-amylase, belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. The maltogenic alpha-amylase is an enzyme which catalyses hydrolysis of (1-4)-alpha-D-glucosidic linkages in polysaccharides so as to remove successive alpha-maltose residues from the non-reducing ends of the chains in the conversion of starch to maltose. Other enzymes include neopullulanase, which hydrolyses pullulan to panose, and cyclomaltodextrinase, which hydrolyses cyclodextrins. This entry represents the catalytic domain found in several protein members of this family. It has a structure consisting of an 8 stranded alpha/beta barrel that contains the active site, interrupted by a ~70 amino acid calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0043169 cation binding, 0005975 carbohydrate metabolic process; PDB: 3FAX_A 3FAW_A 2DH3_B 2DH2_A 1CIU_A 1A47_A 3BMW_A 3BMV_A 2FH8_A 2FH6_A ....
Probab=81.56 E-value=1 Score=39.83 Aligned_cols=60 Identities=17% Similarity=0.301 Sum_probs=41.4
Q ss_pred cHHHHHHHHHcCCCcceeccccCccc-cCCC-----CCCChh--hhhHHHHHHHHHHHcCCcceeecc
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLI-PNGR-----GPVNPK--GLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~-P~g~-----g~~n~~--~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
..+-++.+|+|||+++-++=-+..-. ..|- -.+|++ ..+=++++|++|.++||++|+++.
T Consensus 6 i~~kLdyl~~lGv~~I~l~Pi~~~~~~~~gY~~~d~~~vd~~~Gt~~d~~~Lv~~~h~~gi~VilD~V 73 (316)
T PF00128_consen 6 IIDKLDYLKDLGVNAIWLSPIFESPNGYHGYDPSDYYAVDPRFGTMEDFKELVDAAHKRGIKVILDVV 73 (316)
T ss_dssp HHHTHHHHHHHTESEEEESS-EESSSSTTTTSESEEEEESTTTBHHHHHHHHHHHHHHTTCEEEEEEE
T ss_pred HHHhhHHHHHcCCCceecccccccccccccccceeeeccccccchhhhhhhhhhccccccceEEEeee
Confidence 35668899999999999885444110 0110 123332 466789999999999999999874
No 50
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=81.55 E-value=8.6 Score=35.30 Aligned_cols=106 Identities=15% Similarity=0.146 Sum_probs=69.1
Q ss_pred cHHHHHHHHHcC--CCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC---CCcHhHHHh----
Q 024821 81 YKEDVKLMADTG--LDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL---DLPQALEDE---- 151 (262)
Q Consensus 81 ~~eDi~l~k~lG--~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~---~~P~~l~~~---- 151 (262)
..+-++.+++.| ++++=+++.|.+-.-.++-.+|++.+--.+.+|++|+++|+++++.+.-+ +.|..-+.+
T Consensus 26 v~~~~~~~~~~~iP~d~~~lD~~w~~~~~~~~f~~d~~~FPd~~~~i~~l~~~G~~~~~~~~P~i~~~~~~~~e~~~~g~ 105 (308)
T cd06593 26 VNEFADGMRERNLPCDVIHLDCFWMKEFQWCDFEFDPDRFPDPEGMLSRLKEKGFKVCLWINPYIAQKSPLFKEAAEKGY 105 (308)
T ss_pred HHHHHHHHHHcCCCeeEEEEecccccCCcceeeEECcccCCCHHHHHHHHHHCCCeEEEEecCCCCCCchhHHHHHHCCe
Confidence 567788889999 55677778888533212345677777778999999999999988866522 223221110
Q ss_pred -----------------hCC---CCChhhHHHHHHHHHHHHHHhCCCCc-eEEeccCCc
Q 024821 152 -----------------YGG---WINRMIVKDFTAYADVCFREFGDRVS-YWTTVNEPN 189 (262)
Q Consensus 152 -----------------~gG---w~~~~~~~~F~~ya~~v~~~~gd~V~-~w~t~NEP~ 189 (262)
.++ +.||++.+.|.+..+.+.+ +| |+ +|.=+|||.
T Consensus 106 ~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~-~G--id~~~~D~~e~~ 161 (308)
T cd06593 106 LVKKPDGSVWQWDLWQPGMGIIDFTNPDACKWYKDKLKPLLD-MG--VDCFKTDFGERI 161 (308)
T ss_pred EEECCCCCeeeecccCCCcccccCCCHHHHHHHHHHHHHHHH-hC--CcEEecCCCCCC
Confidence 011 5688888888777776544 43 44 466688873
No 51
>PF02638 DUF187: Glycosyl hydrolase like GH101; InterPro: IPR003790 This entry describes proteins of unknown function.
Probab=81.42 E-value=6.3 Score=36.66 Aligned_cols=98 Identities=16% Similarity=0.306 Sum_probs=64.9
Q ss_pred cccHHHHHHHHHcCCCcceecccc-------CccccCC---CCCC-ChhhhhHHHHHHHHHHHcCCcceeec-cCC----
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISW-------SRLIPNG---RGPV-NPKGLQYYNNLINELISYGIQPHVTL-HHL---- 142 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~W-------sRI~P~g---~g~~-n~~~l~~y~~~id~l~~~GI~p~vtL-~H~---- 142 (262)
...++-++.++++|+|++=+.+.+ |.++|.. .|.. ...|.+.+..+|++++++||++..-+ ...
T Consensus 19 ~~~~~~l~~l~~~~~N~V~~qVr~~gda~Y~S~~~p~s~~~~g~~~~~pg~DpL~~~I~eaHkrGlevHAW~~~~~~~~~ 98 (311)
T PF02638_consen 19 EQIDEMLDDLKSAGFNAVFVQVRPRGDALYPSDIEPWSGYLTGKQGKDPGFDPLEFMIEEAHKRGLEVHAWFRVGFNAPD 98 (311)
T ss_pred HHHHHHHHHHHHcCCCEEEEEEEeCcEEEecccccccccccCCCCCCCCCccHHHHHHHHHHHcCCEEEEEEEeecCCCc
Confidence 446777899999999987666543 3334421 1111 11368899999999999999987543 100
Q ss_pred ------CCcHhHHHh-------h----CC--CC---ChhhHHHHHHHHHHHHHHhC
Q 024821 143 ------DLPQALEDE-------Y----GG--WI---NRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 143 ------~~P~~l~~~-------~----gG--w~---~~~~~~~F~~ya~~v~~~~g 176 (262)
..|.|+..+ + ++ |. +|++.+...+-++.|+++|.
T Consensus 99 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~lnP~~PeVr~~i~~~v~Eiv~~Yd 154 (311)
T PF02638_consen 99 VSHILKKHPEWFAVNHPGWVRTYEDANGGYYWLNPGHPEVRDYIIDIVKEIVKNYD 154 (311)
T ss_pred hhhhhhcCchhheecCCCceeecccCCCCceEECCCCHHHHHHHHHHHHHHHhcCC
Confidence 235554311 1 22 44 47899999999999999995
No 52
>PLN00196 alpha-amylase; Provisional
Probab=80.96 E-value=2.6 Score=41.09 Aligned_cols=63 Identities=14% Similarity=0.144 Sum_probs=45.1
Q ss_pred cccccHHHHHHHHHcCCCcceeccccCccccCCCC-----CCCh---hhhhHHHHHHHHHHHcCCcceeec
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRG-----PVNP---KGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g-----~~n~---~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.|....+.++.+++|||+++=++-.....-+.|-. .+|. -.-+=++++|+++.++||+++++.
T Consensus 42 ~~~~i~~kldyL~~LGvtaIWL~P~~~s~s~hGY~~~D~y~ld~~~fGt~~elk~Lv~~aH~~GIkVilDv 112 (428)
T PLN00196 42 WYNFLMGKVDDIAAAGITHVWLPPPSHSVSEQGYMPGRLYDLDASKYGNEAQLKSLIEAFHGKGVQVIADI 112 (428)
T ss_pred CHHHHHHHHHHHHHcCCCEEEeCCCCCCCCCCCCCccccCCCCcccCCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 46667888999999999999888655443333311 2331 123458999999999999999975
No 53
>PF10566 Glyco_hydro_97: Glycoside hydrolase 97 ; InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=80.36 E-value=7.2 Score=35.87 Aligned_cols=119 Identities=13% Similarity=0.168 Sum_probs=72.1
Q ss_pred CcccccccccCCCCCCCCcCCccccccHHHHHHHHHcCCCcceeccccCccccCCCCC-CChhhhhHHHHHHHHHHHcCC
Q 024821 55 PSIWDTFAHAGNVPGTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGP-VNPKGLQYYNNLINELISYGI 133 (262)
Q Consensus 55 ~s~wd~~~~~~~~~~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~-~n~~~l~~y~~~id~l~~~GI 133 (262)
.+.|+.|...... ..+..+.-.+.++++=|+..+++|+..+=+.-.|+.-.+..... .....-....++++-.+++|+
T Consensus 9 k~~W~Ww~~~~~~-~~~~~~g~~t~~~k~yIDfAa~~G~eYvlvD~GW~~~~~~~~~d~~~~~~~~dl~elv~Ya~~KgV 87 (273)
T PF10566_consen 9 KAAWSWWSMHNGK-GVGFKHGATTETQKRYIDFAAEMGIEYVLVDAGWYGWEKDDDFDFTKPIPDFDLPELVDYAKEKGV 87 (273)
T ss_dssp EEEECTCCCCTTS-SBSS-BSSSHHHHHHHHHHHHHTT-SEEEEBTTCCGS--TTT--TT-B-TT--HHHHHHHHHHTT-
T ss_pred eEEEeecccCCCC-CCCCcCCCCHHHHHHHHHHHHHcCCCEEEeccccccccccccccccccCCccCHHHHHHHHHHcCC
Confidence 3667766543211 12223444678899999999999999999999999733221111 111112457899999999999
Q ss_pred cceeeccCCC------CcHhHHH---hh---C---------CCCChhhHHHHHHHHHHHHHH
Q 024821 134 QPHVTLHHLD------LPQALED---EY---G---------GWINRMIVKDFTAYADVCFRE 174 (262)
Q Consensus 134 ~p~vtL~H~~------~P~~l~~---~~---g---------Gw~~~~~~~~F~~ya~~v~~~ 174 (262)
.+++-.+|-+ +=..+.+ .| | +-.+.+.++.+.+.++.++++
T Consensus 88 gi~lw~~~~~~~~~~~~~~~~~~~f~~~~~~Gv~GvKidF~~~d~Q~~v~~y~~i~~~AA~~ 149 (273)
T PF10566_consen 88 GIWLWYHSETGGNVANLEKQLDEAFKLYAKWGVKGVKIDFMDRDDQEMVNWYEDILEDAAEY 149 (273)
T ss_dssp EEEEEEECCHTTBHHHHHCCHHHHHHHHHHCTEEEEEEE--SSTSHHHHHHHHHHHHHHHHT
T ss_pred CEEEEEeCCcchhhHhHHHHHHHHHHHHHHcCCCEEeeCcCCCCCHHHHHHHHHHHHHHHHc
Confidence 9999887655 2111111 11 1 224567889999999988775
No 54
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=78.86 E-value=6.2 Score=32.21 Aligned_cols=53 Identities=9% Similarity=0.146 Sum_probs=38.3
Q ss_pred HHHHHHHcCCCcceeccc------c--CccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 84 DVKLMADTGLDAYRFSIS------W--SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 84 Di~l~k~lG~~~~R~si~------W--sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
=++.+|++|+|++-+... | +++.+. .+..+ -+.+.++|++|+++||++++=+.
T Consensus 5 ~~~~lk~~~v~si~i~a~~h~g~ayYPt~~~~~---hp~L~-~Dllge~v~a~h~~Girv~ay~~ 65 (132)
T PF14871_consen 5 FVDTLKEAHVNSITIFAKCHGGYAYYPTKVGPR---HPGLK-RDLLGEQVEACHERGIRVPAYFD 65 (132)
T ss_pred HHHHHHHhCCCEEEEEcccccEEEEccCCCCcC---CCCCC-cCHHHHHHHHHHHCCCEEEEEEe
Confidence 368899999999998332 2 344332 22223 58899999999999999998654
No 55
>cd06601 GH31_lyase_GLase GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 (GH31) enzymes that degrade alpha-1,4-glucans and maltooligosaccharides via a nonhydrolytic pathway to yield 1,5-D-anhydrofructose from the nonreducing end. GLases cleave the bond between C1 and O1 of the nonreducing sugar residue of alpha-glucans to generate a monosaccharide product with a double bond between C1 and C2. This family corresponds to subgroup 2 in the Ernst et al classification of GH31 enzymes.
Probab=77.57 E-value=9.1 Score=36.01 Aligned_cols=97 Identities=12% Similarity=0.166 Sum_probs=56.9
Q ss_pred CCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC-CCcHhHH--HhhCCCCChhhHHHHHHHHH
Q 024821 93 LDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL-DLPQALE--DEYGGWINRMIVKDFTAYAD 169 (262)
Q Consensus 93 ~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~-~~P~~l~--~~~gGw~~~~~~~~F~~ya~ 169 (262)
++++=++|.|..- .+.-.+|++.+---++++++|++.|++.++.+.-+ ..-.... ..+..|.||+..+.|.+..+
T Consensus 40 ~D~i~lDidy~~~--~~~Ft~d~~~FPdp~~mv~~L~~~G~klv~~i~P~i~~g~~~~~~~~~pDftnp~ar~wW~~~~~ 117 (332)
T cd06601 40 LDGLHVDVDFQDN--YRTFTTNGGGFPNPKEMFDNLHNKGLKCSTNITPVISYGGGLGSPGLYPDLGRPDVREWWGNQYK 117 (332)
T ss_pred CceEEEcCchhcC--CCceeecCCCCCCHHHHHHHHHHCCCeEEEEecCceecCccCCCCceeeCCCCHHHHHHHHHHHH
Confidence 3455555555421 12344555555556889999999999877765311 1000000 01234678888888776655
Q ss_pred HHHHHhCCCCceEEeccCCchhcc
Q 024821 170 VCFREFGDRVSYWTTVNEPNGFAM 193 (262)
Q Consensus 170 ~v~~~~gd~V~~w~t~NEP~~~~~ 193 (262)
.+.+ .| -.-.|.=+|||.++..
T Consensus 118 ~l~~-~G-v~~~W~DmnEp~~~~~ 139 (332)
T cd06601 118 YLFD-IG-LEFVWQDMTTPAIMPS 139 (332)
T ss_pred HHHh-CC-CceeecCCCCcccccC
Confidence 4433 23 2347999999998754
No 56
>PRK05799 coproporphyrinogen III oxidase; Provisional
Probab=77.05 E-value=6.7 Score=37.00 Aligned_cols=97 Identities=14% Similarity=0.187 Sum_probs=58.8
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCCh-hhhhHHHHHHHHHHHcCCcce-eeccCCCCcHhHHHhhCCCCCh
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNP-KGLQYYNNLINELISYGIQPH-VTLHHLDLPQALEDEYGGWINR 158 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~-~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~l~~~~gGw~~~ 158 (262)
-+|.++.++++|++ |+||.-..+-++--..++. ...+-+.+.|+.+++.|+..+ +++. +++|.
T Consensus 98 t~e~l~~l~~~G~~--rvsiGvqS~~d~~L~~l~R~~~~~~~~~ai~~l~~~g~~~v~~dli-~GlPg------------ 162 (374)
T PRK05799 98 TEEKLKILKSMGVN--RLSIGLQAWQNSLLKYLGRIHTFEEFLENYKLARKLGFNNINVDLM-FGLPN------------ 162 (374)
T ss_pred CHHHHHHHHHcCCC--EEEEECccCCHHHHHHcCCCCCHHHHHHHHHHHHHcCCCcEEEEee-cCCCC------------
Confidence 46789999999999 5666555554431111221 125567889999999999744 5553 45552
Q ss_pred hhHHHHHHHHHHHHHHhCCCCceEEeccCCchhc
Q 024821 159 MIVKDFTAYADVCFREFGDRVSYWTTVNEPNGFA 192 (262)
Q Consensus 159 ~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~ 192 (262)
++.+.|.+-.+.+.+.=-+.|..+...-+|+...
T Consensus 163 qt~e~~~~~l~~~~~l~~~~is~y~l~~~pgT~l 196 (374)
T PRK05799 163 QTLEDWKETLEKVVELNPEHISCYSLIIEEGTPF 196 (374)
T ss_pred CCHHHHHHHHHHHHhcCCCEEEEeccEecCCCHH
Confidence 2346666666665543225555555445777543
No 57
>TIGR01210 conserved hypothetical protein TIGR01210. This family of exclusively archaeal proteins has no characterized close homologs. Several rounds of PSI-BLAST with a stringent cutoff of 1e-8 shows apparent similarity of the central region of this family to the central regions of the oxygen-independent coproporphyrinogen III dehydrogenase HemN and to other enzymes.
Probab=76.63 E-value=6.8 Score=36.41 Aligned_cols=111 Identities=14% Similarity=0.188 Sum_probs=67.4
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCCC-CCCChh-hhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChh
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNGR-GPVNPK-GLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRM 159 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g~-g~~n~~-~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~ 159 (262)
+|.+++|+++|++ +|++|.-.-.-++-- ..+|.. ..+-+.+.++.++++||.+.+.+. +.+| +....+
T Consensus 117 ~e~L~~l~~aG~~-~~v~iG~ES~~d~~L~~~inKg~t~~~~~~ai~~~~~~Gi~v~~~~i-~G~P--------~~se~e 186 (313)
T TIGR01210 117 EEKLEELRKIGVN-VEVAVGLETANDRIREKSINKGSTFEDFIRAAELARKYGAGVKAYLL-FKPP--------FLSEKE 186 (313)
T ss_pred HHHHHHHHHcCCC-EEEEEecCcCCHHHHHHhhCCCCCHHHHHHHHHHHHHcCCcEEEEEE-ecCC--------CCChhh
Confidence 7889999999987 244443333222100 012211 256788999999999999776654 2344 112235
Q ss_pred hHHHHHHHHHHHHHHhCCCCceEEeccCCchhccccccccCCCC
Q 024821 160 IVKDFTAYADVCFREFGDRVSYWTTVNEPNGFAMVGYDFGIAPP 203 (262)
Q Consensus 160 ~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~~~gy~~g~~~p 203 (262)
.++.+.+.++.+.+ +++.|....+.=+|+.....-|..|.+.|
T Consensus 187 a~ed~~~ti~~~~~-l~~~vs~~~l~v~~gT~l~~~~~~G~~~p 229 (313)
T TIGR01210 187 AIADMISSIRKCIP-VTDTVSINPTNVQKGTLVEFLWNRGLYRP 229 (313)
T ss_pred hHHHHHHHHHHHHh-cCCcEEEECCEEeCCCHHHHHHHcCCCCC
Confidence 77888888887765 45777777666677654444455565533
No 58
>PRK05402 glycogen branching enzyme; Provisional
Probab=76.50 E-value=13 Score=38.55 Aligned_cols=99 Identities=12% Similarity=0.108 Sum_probs=59.9
Q ss_pred ccccHHHH-HHHHHcCCCcceeccccC-------ccccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeec--cCCCCc
Q 024821 78 YHKYKEDV-KLMADTGLDAYRFSISWS-------RLIPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTL--HHLDLP 145 (262)
Q Consensus 78 y~~~~eDi-~l~k~lG~~~~R~si~Ws-------RI~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL--~H~~~P 145 (262)
|.-..+.+ +.+|+||++++=+.=-.. .-.|..--.+++ -..+-++++|++|.++||++++++ .|....
T Consensus 264 ~~~i~~~l~~ylk~LGv~~i~L~Pi~e~~~~~~~GY~~~~y~ai~~~~Gt~~dfk~lV~~~H~~Gi~VilD~V~NH~~~~ 343 (726)
T PRK05402 264 YRELADQLIPYVKEMGFTHVELLPIAEHPFDGSWGYQPTGYYAPTSRFGTPDDFRYFVDACHQAGIGVILDWVPAHFPKD 343 (726)
T ss_pred HHHHHHHHHHHHHHcCCCEEEECCcccCCCCCCCCCCcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCC
Confidence 33444564 888999999986653211 111111011221 134568999999999999999985 465321
Q ss_pred -----------HhHHH-----hhCC-------CCChhhHHHHHHHHHHHHHHhC
Q 024821 146 -----------QALED-----EYGG-------WINRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 146 -----------~~l~~-----~~gG-------w~~~~~~~~F~~ya~~v~~~~g 176 (262)
.+... .+.. +.++++.+.+.+-++.-+++|+
T Consensus 344 ~~~~~~~~~~~~y~~~~~~~~~~~~w~~~~~n~~~~~v~~~l~~~~~~W~~e~~ 397 (726)
T PRK05402 344 AHGLARFDGTALYEHADPREGEHPDWGTLIFNYGRNEVRNFLVANALYWLEEFH 397 (726)
T ss_pred ccchhccCCCcceeccCCcCCccCCCCCccccCCCHHHHHHHHHHHHHHHHHhC
Confidence 11000 0112 3467888888888888888886
No 59
>PF12876 Cellulase-like: Sugar-binding cellulase-like; InterPro: IPR024778 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This entry represents a family of putative cellulase enzymes.; PDB: 3GYC_B.
Probab=76.39 E-value=1.3 Score=33.22 Aligned_cols=19 Identities=42% Similarity=0.747 Sum_probs=14.0
Q ss_pred HHHHhCC--CCceEEeccC-Cc
Q 024821 171 CFREFGD--RVSYWTTVNE-PN 189 (262)
Q Consensus 171 v~~~~gd--~V~~w~t~NE-P~ 189 (262)
|+++||+ +|.+|.++|| |+
T Consensus 1 iv~~~~~~~~Il~Wdl~NE~p~ 22 (88)
T PF12876_consen 1 IVTRFGYDPRILAWDLWNEPPN 22 (88)
T ss_dssp -HHHTT-GGGEEEEESSTTTT-
T ss_pred CchhhcCCCCEEEEEeecCCCC
Confidence 4677776 7999999999 76
No 60
>cd07945 DRE_TIM_CMS Leptospira interrogans citramalate synthase (CMS) and related proteins, N-terminal catalytic TIM barrel domain. Citramalate synthase (CMS) catalyzes the conversion of pyruvate and acetyl-CoA to (R)-citramalate in the first dedicated step of the citramalate pathway. Citramalate is only found in Leptospira interrogans and a few other microorganisms. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. The domain has a catalytic center con
Probab=76.20 E-value=6.4 Score=36.08 Aligned_cols=109 Identities=10% Similarity=0.030 Sum_probs=71.8
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc-----HhHHHh--
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP-----QALEDE-- 151 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P-----~~l~~~-- 151 (262)
+-+.|+++.++.|++.+++.++=|...-.. -+.--++.++...++++.+++.|+++.+++-+|..| ..+.+-
T Consensus 75 ~~~~~~~~A~~~g~~~i~i~~~~S~~h~~~~~~~t~~e~l~~~~~~i~~a~~~G~~v~~~~~d~~~~~r~~~~~~~~~~~ 154 (280)
T cd07945 75 DGDKSVDWIKSAGAKVLNLLTKGSLKHCTEQLRKTPEEHFADIREVIEYAIKNGIEVNIYLEDWSNGMRDSPDYVFQLVD 154 (280)
T ss_pred CcHHHHHHHHHCCCCEEEEEEeCCHHHHHHHHCcCHHHHHHHHHHHHHHHHhCCCEEEEEEEeCCCCCcCCHHHHHHHHH
Confidence 346799999999999999998666544321 133346788999999999999999999999887643 222210
Q ss_pred ----h----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 ----Y----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ----~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
. -|..+| ....++.+.+.+++.+..-..-.-|-..+.
T Consensus 155 ~~~~~G~~~i~l~DT~G~~~P---~~v~~l~~~l~~~~~~~~i~~H~Hnd~Gla 205 (280)
T cd07945 155 FLSDLPIKRIMLPDTLGILSP---FETYTYISDMVKRYPNLHFDFHAHNDYDLA 205 (280)
T ss_pred HHHHcCCCEEEecCCCCCCCH---HHHHHHHHHHHhhCCCCeEEEEeCCCCCHH
Confidence 1 344555 345555666666675432234566665543
No 61
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=75.24 E-value=15 Score=33.90 Aligned_cols=106 Identities=17% Similarity=0.235 Sum_probs=72.7
Q ss_pred cHHHHHHHHHcCC--CcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC---CcHhHH------
Q 024821 81 YKEDVKLMADTGL--DAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD---LPQALE------ 149 (262)
Q Consensus 81 ~~eDi~l~k~lG~--~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~---~P~~l~------ 149 (262)
..+-++.+++.|+ +.+=+++.|..-. |+-.+|++.+--..+++++|+++|+++++.+.=+- .+..-+
T Consensus 32 v~~~~~~~~~~~iP~d~i~iD~~w~~~~--g~f~~d~~~FPdp~~mi~~l~~~G~k~~l~i~P~i~~~s~~~~e~~~~g~ 109 (303)
T cd06592 32 VLNYAQEIIDNGFPNGQIEIDDNWETCY--GDFDFDPTKFPDPKGMIDQLHDLGFRVTLWVHPFINTDSENFREAVEKGY 109 (303)
T ss_pred HHHHHHHHHHcCCCCCeEEeCCCccccC--CccccChhhCCCHHHHHHHHHHCCCeEEEEECCeeCCCCHHHHhhhhCCe
Confidence 4566788888885 4666777785432 34467777777789999999999999888765221 111111
Q ss_pred ---HhhC----------C------CCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCc
Q 024821 150 ---DEYG----------G------WINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 150 ---~~~g----------G------w~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
+..| | +.||+..+.|.+..+.+...+|= --.|.=+|||.
T Consensus 110 ~vk~~~g~~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~~~Gv-dg~w~D~~E~~ 167 (303)
T cd06592 110 LVSEPSGDIPALTRWWNGTAAVLDFTNPEAVDWFLSRLKSLQEKYGI-DSFKFDAGEAS 167 (303)
T ss_pred EEECCCCCCCcccceecCCcceEeCCCHHHHHHHHHHHHHHHHHhCC-cEEEeCCCCcc
Confidence 0001 1 56899999999988888877752 34588899997
No 62
>PRK14041 oxaloacetate decarboxylase; Provisional
Probab=75.18 E-value=8.5 Score=38.06 Aligned_cols=56 Identities=23% Similarity=0.309 Sum_probs=41.0
Q ss_pred cccccHHH-----HHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc
Q 024821 77 EYHKYKED-----VKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP 145 (262)
Q Consensus 77 ~y~~~~eD-----i~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P 145 (262)
.|..|.+| ++...+.|++.+|+..+-+.+ +-..+.++.+++.|+.+..++.+-..|
T Consensus 88 G~~~~~dDvv~~fv~~A~~~Gvd~irif~~lnd~-------------~n~~~~i~~ak~~G~~v~~~i~~t~~p 148 (467)
T PRK14041 88 GYRHYADDVVELFVKKVAEYGLDIIRIFDALNDI-------------RNLEKSIEVAKKHGAHVQGAISYTVSP 148 (467)
T ss_pred CcccccchhhHHHHHHHHHCCcCEEEEEEeCCHH-------------HHHHHHHHHHHHCCCEEEEEEEeccCC
Confidence 46678888 899999999999999866542 345667778888888777666543334
No 63
>cd03174 DRE_TIM_metallolyase DRE-TIM metallolyase superfamily. The DRE-TIM metallolyase superfamily includes 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. The domain has a catalytic center containing a divalent cation-binding site formed by a cluster of invariant residues that cap the core of the barrel. In addition, the catalytic site includes three invariant residues - an aspartate (D), an arginine (R), and a glutamate (E) - which is the basis for the domain name "DRE-TIM".
Probab=74.99 E-value=7.4 Score=34.36 Aligned_cols=107 Identities=14% Similarity=0.061 Sum_probs=69.4
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC----CcHhHHH------
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD----LPQALED------ 150 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~----~P~~l~~------ 150 (262)
++|++.+++.|++.+|++++-+.+.-.- .+.=.+..++...+.++.+++.|+++.+.+.... .|..+.+
T Consensus 77 ~~~i~~a~~~g~~~i~i~~~~s~~~~~~~~~~~~~~~~~~~~~~i~~a~~~G~~v~~~~~~~~~~~~~~~~l~~~~~~~~ 156 (265)
T cd03174 77 EKGIERALEAGVDEVRIFDSASETHSRKNLNKSREEDLENAEEAIEAAKEAGLEVEGSLEDAFGCKTDPEYVLEVAKALE 156 (265)
T ss_pred hhhHHHHHhCCcCEEEEEEecCHHHHHHHhCCCHHHHHHHHHHHHHHHHHCCCeEEEEEEeecCCCCCHHHHHHHHHHHH
Confidence 8999999999999999999877422110 0111234577889999999999999999884322 2222211
Q ss_pred hh----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 151 EY----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 151 ~~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+. .|-.. ++.+.++.+.+.+++++..-.+-.-|-..+.
T Consensus 157 ~~g~~~i~l~Dt~G~~~---P~~v~~li~~l~~~~~~~~~~~H~Hn~~gla 204 (265)
T cd03174 157 EAGADEISLKDTVGLAT---PEEVAELVKALREALPDVPLGLHTHNTLGLA 204 (265)
T ss_pred HcCCCEEEechhcCCcC---HHHHHHHHHHHHHhCCCCeEEEEeCCCCChH
Confidence 11 23333 4666677777777887633335577776654
No 64
>PRK14705 glycogen branching enzyme; Provisional
Probab=74.75 E-value=17 Score=40.18 Aligned_cols=95 Identities=15% Similarity=0.120 Sum_probs=58.3
Q ss_pred cHHH-HHHHHHcCCCcceec--------cccCccccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeecc--CCCCcHh
Q 024821 81 YKED-VKLMADTGLDAYRFS--------ISWSRLIPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTLH--HLDLPQA 147 (262)
Q Consensus 81 ~~eD-i~l~k~lG~~~~R~s--------i~WsRI~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL~--H~~~P~~ 147 (262)
..+. |+.+|+||++++=+. -+|- -.|.+--.++. -..+=++++|++|.++||.+|+++. |+..=.|
T Consensus 767 l~~~lldYlk~LGvt~IeLmPv~e~p~~~swG-Y~~~~y~ap~~ryGt~~dfk~lVd~~H~~GI~VILD~V~nH~~~d~~ 845 (1224)
T PRK14705 767 LAKELVDYVKWLGFTHVEFMPVAEHPFGGSWG-YQVTSYFAPTSRFGHPDEFRFLVDSLHQAGIGVLLDWVPAHFPKDSW 845 (1224)
T ss_pred HHHHHHHHHHHhCCCEEEECccccCCCCCCCC-CCccccCCcCcccCCHHHHHHHHHHHHHCCCEEEEEeccccCCcchh
Confidence 3345 689999999998654 2341 22221111111 1244589999999999999999863 6521112
Q ss_pred HHHhh----------------CC-------CCChhhHHHHHHHHHHHHHHhC
Q 024821 148 LEDEY----------------GG-------WINRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 148 l~~~~----------------gG-------w~~~~~~~~F~~ya~~v~~~~g 176 (262)
....+ .. +.++++.+.+.+=+..-+++|+
T Consensus 846 ~l~~fdg~~~y~~~d~~~g~~~~Wg~~~fn~~~~eVr~fli~~a~~Wl~eyh 897 (1224)
T PRK14705 846 ALAQFDGQPLYEHADPALGEHPDWGTLIFDFGRTEVRNFLVANALYWLDEFH 897 (1224)
T ss_pred hhhhcCCCcccccCCcccCCCCCCCCceecCCCHHHHHHHHHHHHHHHHHhC
Confidence 11111 11 2357788888888888888887
No 65
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of
Probab=74.55 E-value=12 Score=35.13 Aligned_cols=107 Identities=18% Similarity=0.187 Sum_probs=64.2
Q ss_pred HHHHHHHHHcCCC--cceeccccCccccCCCCCCChhhhhHH--HHHHHHHHHcCCcceeeccCCCCc--------HhHH
Q 024821 82 KEDVKLMADTGLD--AYRFSISWSRLIPNGRGPVNPKGLQYY--NNLINELISYGIQPHVTLHHLDLP--------QALE 149 (262)
Q Consensus 82 ~eDi~l~k~lG~~--~~R~si~WsRI~P~g~g~~n~~~l~~y--~~~id~l~~~GI~p~vtL~H~~~P--------~~l~ 149 (262)
.+-++.+++.|+. ++=+++.|..- .+.-.+|++.+--- +++|++|+++|++.++.+.-+-.+ .+-+
T Consensus 27 ~~~~~~~r~~~iP~d~i~lD~~~~~~--~~~f~~d~~~FPdp~~~~mi~~L~~~G~k~~~~i~P~v~~~~~~~~~~~~~e 104 (339)
T cd06602 27 KEVVENMRAAGIPLDVQWNDIDYMDR--RRDFTLDPVRFPGLKMPEFVDELHANGQHYVPILDPAISANEPTGSYPPYDR 104 (339)
T ss_pred HHHHHHHHHhCCCcceEEECcccccC--ccceecccccCCCccHHHHHHHHHHCCCEEEEEEeCccccCcCCCCCHHHHH
Confidence 4445556655544 44455555432 12334555555555 899999999999988877533222 1111
Q ss_pred H---h----------h-----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 150 D---E----------Y-----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 150 ~---~----------~-----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
. . + -.+.||+..+.|.+..+.+.+.+|- .-+|.=+|||..+
T Consensus 105 ~~~~g~~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~ww~~~~~~~~~~~Gv-dg~w~D~~Ep~~~ 169 (339)
T cd06602 105 GLEMDVFIKNDDGSPYIGKVWPGYTVFPDFLNPNTQEWWTDEIKDFHDQVPF-DGLWIDMNEPSNF 169 (339)
T ss_pred HHHCCeEEECCCCCEEEEEeCCCCCcCcCCCCHHHHHHHHHHHHHHHhcCCC-cEEEecCCCCchH
Confidence 0 0 0 1256888888888777766665542 4568899999743
No 66
>PRK14040 oxaloacetate decarboxylase; Provisional
Probab=74.16 E-value=9.1 Score=38.95 Aligned_cols=98 Identities=15% Similarity=0.158 Sum_probs=54.9
Q ss_pred cccccHHH-----HHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc----Hh
Q 024821 77 EYHKYKED-----VKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP----QA 147 (262)
Q Consensus 77 ~y~~~~eD-----i~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P----~~ 147 (262)
-|.+|.+| ++..++.|++.+|+..+.+.+ +.....|+.+++.|.....++..=+.| ..
T Consensus 90 g~~~ypddvv~~~v~~a~~~Gid~~rifd~lnd~-------------~~~~~ai~~ak~~G~~~~~~i~yt~~p~~~~~~ 156 (593)
T PRK14040 90 GYRHYADDVVERFVERAVKNGMDVFRVFDAMNDP-------------RNLETALKAVRKVGAHAQGTLSYTTSPVHTLQT 156 (593)
T ss_pred ccccCcHHHHHHHHHHHHhcCCCEEEEeeeCCcH-------------HHHHHHHHHHHHcCCeEEEEEEEeeCCccCHHH
Confidence 46677666 999999999999999655433 234555666666666544444321223 11
Q ss_pred HHHh----------------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 148 LEDE----------------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 148 l~~~----------------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+.+. -.|-..| ....+.++.+-+++ +.--..-+-|-..+.
T Consensus 157 ~~~~a~~l~~~Gad~i~i~Dt~G~l~P---~~~~~lv~~lk~~~-~~pi~~H~Hnt~GlA 212 (593)
T PRK14040 157 WVDLAKQLEDMGVDSLCIKDMAGLLKP---YAAYELVSRIKKRV-DVPLHLHCHATTGLS 212 (593)
T ss_pred HHHHHHHHHHcCCCEEEECCCCCCcCH---HHHHHHHHHHHHhc-CCeEEEEECCCCchH
Confidence 1110 1344555 44445555555666 322345577776654
No 67
>cd06600 GH31_MGAM-like This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal do
Probab=73.83 E-value=24 Score=32.69 Aligned_cols=106 Identities=18% Similarity=0.199 Sum_probs=63.9
Q ss_pred HHHHHHHHcCCC--cceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC-----CcHhHHHh----
Q 024821 83 EDVKLMADTGLD--AYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD-----LPQALEDE---- 151 (262)
Q Consensus 83 eDi~l~k~lG~~--~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~-----~P~~l~~~---- 151 (262)
+-++.+++.++. ++=+++.|..- .+.-.+|++-+--..++|++|+++|++.++.++-+- .|...+..
T Consensus 28 ~~~~~~~~~~iP~d~i~lD~~~~~~--~~~f~~d~~~FPdp~~~i~~l~~~g~k~~~~~~P~i~~~~~~~~~~~~~~~~~ 105 (317)
T cd06600 28 EVVDIMQKEGFPYDVVFLDIHYMDS--YRLFTWDPYRFPEPKKLIDELHKRNVKLVTIVDPGIRVDQNYSPFLSGMDKGK 105 (317)
T ss_pred HHHHHHHHcCCCcceEEEChhhhCC--CCceeechhcCCCHHHHHHHHHHCCCEEEEEeeccccCCCCChHHHHHHHCCE
Confidence 334555555543 44555556431 223356666666678999999999999877664221 12221110
Q ss_pred --------------------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 --------------------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 --------------------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+-.|.||+..+.|.+..+.+....| -.-.|.=+|||..+
T Consensus 106 ~v~~~~g~~~~~~~w~G~~~~~Dftnp~a~~ww~~~~~~~~~~~g-vdg~w~D~~Ep~~~ 164 (317)
T cd06600 106 FCEIESGELFVGKMWPGTTVYPDFTNPDTREWWAGLFSEWLNSQG-VDGIWLDMNEPSDF 164 (317)
T ss_pred EEECCCCCeEEEeecCCCccccCCCChHHHHHHHHHHHHHhhcCC-CceEEeeCCCCccH
Confidence 1136789999999887777665544 24468899999643
No 68
>PRK05692 hydroxymethylglutaryl-CoA lyase; Provisional
Probab=73.23 E-value=10 Score=34.92 Aligned_cols=108 Identities=13% Similarity=0.050 Sum_probs=70.3
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccC-CCC-------cHhHHH
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHH-LDL-------PQALED 150 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H-~~~-------P~~l~~ 150 (262)
+-.+|+++..+.|++.+|+.++=|...-.. -+.--++.++...+.|+.++++|+++..++.. |.. |..+.+
T Consensus 80 ~~~~~ie~A~~~g~~~v~i~~~~s~~~~~~n~~~~~~e~l~~~~~~v~~ak~~g~~v~~~i~~~~~~~~~~~~~~~~~~~ 159 (287)
T PRK05692 80 PNLKGLEAALAAGADEVAVFASASEAFSQKNINCSIAESLERFEPVAEAAKQAGVRVRGYVSCVLGCPYEGEVPPEAVAD 159 (287)
T ss_pred cCHHHHHHHHHcCCCEEEEEEecCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCEEEEEEEEEecCCCCCCCCHHHHHH
Confidence 468999999999999999998766543211 23334567889999999999999998765542 322 322221
Q ss_pred h----------------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 151 E----------------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 151 ~----------------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
. -.|..+| ....+.++.+.+++++.--..-.-|-..+
T Consensus 160 ~~~~~~~~G~d~i~l~DT~G~~~P---~~v~~lv~~l~~~~~~~~i~~H~Hn~~Gl 212 (287)
T PRK05692 160 VAERLFALGCYEISLGDTIGVGTP---GQVRAVLEAVLAEFPAERLAGHFHDTYGQ 212 (287)
T ss_pred HHHHHHHcCCcEEEeccccCccCH---HHHHHHHHHHHHhCCCCeEEEEecCCCCc
Confidence 1 1344444 55666777777788753223556666665
No 69
>PRK09441 cytoplasmic alpha-amylase; Reviewed
Probab=73.21 E-value=4.4 Score=39.79 Aligned_cols=64 Identities=19% Similarity=0.226 Sum_probs=44.4
Q ss_pred ccccccHHHHHHHHHcCCCcceeccccCc--------cccCCC---------CCCChh--hhhHHHHHHHHHHHcCCcce
Q 024821 76 DEYHKYKEDVKLMADTGLDAYRFSISWSR--------LIPNGR---------GPVNPK--GLQYYNNLINELISYGIQPH 136 (262)
Q Consensus 76 d~y~~~~eDi~l~k~lG~~~~R~si~WsR--------I~P~g~---------g~~n~~--~l~~y~~~id~l~~~GI~p~ 136 (262)
+.|....+-++.+++||++++=++-...- -.|.-- |.+|+. ..+=++++|++|.++||++|
T Consensus 19 ~~~~~I~~kldyl~~LGvtaIwl~P~~~~~~~~~~hgY~~~D~~~~~~~~~~~~id~~fGt~~dl~~Li~~~H~~Gi~vi 98 (479)
T PRK09441 19 KLWNRLAERAPELAEAGITAVWLPPAYKGTSGGYDVGYGVYDLFDLGEFDQKGTVRTKYGTKEELLNAIDALHENGIKVY 98 (479)
T ss_pred cHHHHHHHHHHHHHHcCCCEEEeCCCccCCCCCCCCCCCeecccccccccccCCcCcCcCCHHHHHHHHHHHHHCCCEEE
Confidence 45666778899999999999987753332 222100 123332 34568999999999999999
Q ss_pred eec
Q 024821 137 VTL 139 (262)
Q Consensus 137 vtL 139 (262)
+++
T Consensus 99 ~D~ 101 (479)
T PRK09441 99 ADV 101 (479)
T ss_pred EEE
Confidence 986
No 70
>cd06603 GH31_GANC_GANAB_alpha This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
Probab=73.16 E-value=18 Score=33.76 Aligned_cols=110 Identities=13% Similarity=0.080 Sum_probs=65.6
Q ss_pred cHHHHHHHHHcCCC--cceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC-----CcHhHHHh--
Q 024821 81 YKEDVKLMADTGLD--AYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD-----LPQALEDE-- 151 (262)
Q Consensus 81 ~~eDi~l~k~lG~~--~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~-----~P~~l~~~-- 151 (262)
..+-++.+++.|+. ++=+++.|..- .+.-.+|++-+---+++|++|+++|++.++.++-+- .|..-+.+
T Consensus 26 v~~~~~~~~~~~iP~d~i~lD~~~~~~--~~~f~~d~~~FPdp~~mi~~L~~~G~k~~~~~~P~v~~~~~~~~y~e~~~~ 103 (339)
T cd06603 26 VKEVDAGFDEHDIPYDVIWLDIEHTDG--KRYFTWDKKKFPDPEKMQEKLASKGRKLVTIVDPHIKRDDGYYVYKEAKDK 103 (339)
T ss_pred HHHHHHHHHHcCCCceEEEEChHHhCC--CCceEeCcccCCCHHHHHHHHHHCCCEEEEEecCceecCCCCHHHHHHHHC
Confidence 34445555555544 33344444321 112335555555568899999999999888776432 12221110
Q ss_pred ----------------------hCCCCChhhHHHHHHHHHHHHHHhC-CCCceEEeccCCchhc
Q 024821 152 ----------------------YGGWINRMIVKDFTAYADVCFREFG-DRVSYWTTVNEPNGFA 192 (262)
Q Consensus 152 ----------------------~gGw~~~~~~~~F~~ya~~v~~~~g-d~V~~w~t~NEP~~~~ 192 (262)
+-.+.||+..+.|.+..+......+ +-+-.|.=+|||.++.
T Consensus 104 g~~vk~~~g~~~~~~~w~g~~~~~Dftnp~a~~ww~~~~~~~~~~~~~g~~g~w~D~~Ep~~f~ 167 (339)
T cd06603 104 GYLVKNSDGGDFEGWCWPGSSSWPDFLNPEVRDWWASLFSYDKYKGSTENLYIWNDMNEPSVFN 167 (339)
T ss_pred CeEEECCCCCEEEEEECCCCcCCccCCChhHHHHHHHHHHHHhhcccCCCceEEeccCCccccC
Confidence 0135689999999888887665432 2356789999998753
No 71
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=72.80 E-value=25 Score=32.62 Aligned_cols=108 Identities=16% Similarity=0.235 Sum_probs=66.1
Q ss_pred HHHHHHHHHcCC--CcceeccccCcccc-C---CCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC---CCcHhHHHh-
Q 024821 82 KEDVKLMADTGL--DAYRFSISWSRLIP-N---GRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL---DLPQALEDE- 151 (262)
Q Consensus 82 ~eDi~l~k~lG~--~~~R~si~WsRI~P-~---g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~---~~P~~l~~~- 151 (262)
.+-++-+++.|+ +++=+++.|..... . +.-.+|++-+---+++|++|+++|++.++.++-+ +.|..-+.+
T Consensus 27 ~~~~~~~~~~~iP~d~i~lD~~w~~~~~~~~~~~~f~wd~~~FPdp~~mi~~L~~~G~k~~~~v~P~v~~~~~~y~e~~~ 106 (317)
T cd06598 27 DDTIKTLREKDFPLDAAILDLYWFGKDIDKGHMGNLDWDRKAFPDPAGMIADLAKKGVKTIVITEPFVLKNSKNWGEAVK 106 (317)
T ss_pred HHHHHHHHHhCCCceEEEEechhhcCcccCCceeeeEeccccCCCHHHHHHHHHHcCCcEEEEEcCcccCCchhHHHHHh
Confidence 444556666554 46666667754332 1 1234566666667899999999999988877533 333321100
Q ss_pred --h-------------------C---CCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 --Y-------------------G---GWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 --~-------------------g---Gw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+ + .+.||+..+.|.+..+.. ...| ---+|.=+|||...
T Consensus 107 ~g~l~~~~~~~~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~-~~~G-vdg~w~D~~Ep~~~ 168 (317)
T cd06598 107 AGALLKKDQGGVPTLFDFWFGNTGLIDWFDPAAQAWFHDNYKKL-IDQG-VTGWWGDLGEPEVH 168 (317)
T ss_pred CCCEEEECCCCCEeeeeccCCCccccCCCCHHHHHHHHHHHHHh-hhCC-ccEEEecCCCcccc
Confidence 0 1 245888888887777664 2332 23458899999754
No 72
>PRK12858 tagatose 1,6-diphosphate aldolase; Reviewed
Probab=71.91 E-value=14 Score=35.08 Aligned_cols=53 Identities=15% Similarity=0.266 Sum_probs=44.4
Q ss_pred HHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 85 VKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 85 i~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
++.++++|.+++-+-+-|. |+.+..+|.+-+++..++.++|.+.||..++-+.
T Consensus 112 ve~a~~~GAdAVk~lv~~~---~d~~~~~~~~~~~~l~rv~~ec~~~giPlllE~l 164 (340)
T PRK12858 112 VRRIKEAGADAVKLLLYYR---PDEDDAINDRKHAFVERVGAECRANDIPFFLEPL 164 (340)
T ss_pred HHHHHHcCCCEEEEEEEeC---CCcchHHHHHHHHHHHHHHHHHHHcCCceEEEEe
Confidence 5778999999999999887 5533446788899999999999999999888643
No 73
>PRK12399 tagatose 1,6-diphosphate aldolase; Reviewed
Probab=70.59 E-value=18 Score=34.11 Aligned_cols=58 Identities=17% Similarity=0.276 Sum_probs=50.0
Q ss_pred HHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc
Q 024821 85 VKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP 145 (262)
Q Consensus 85 i~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P 145 (262)
.+.+|++|.+++.|=+-|. |+++.++|..-.++.+++.++|++.+|--++-+..||.+
T Consensus 111 ~~rike~GadavK~Llyy~---pD~~~~in~~k~a~vervg~eC~a~dipf~lE~ltY~~~ 168 (324)
T PRK12399 111 AKRIKEEGADAVKFLLYYD---VDEPDEINEQKKAYIERIGSECVAEDIPFFLEILTYDEK 168 (324)
T ss_pred HHHHHHhCCCeEEEEEEEC---CCCCHHHHHHHHHHHHHHHHHHHHCCCCeEEEEeeccCc
Confidence 6889999999999999887 665667899889999999999999999988887766543
No 74
>PRK04161 tagatose 1,6-diphosphate aldolase; Reviewed
Probab=70.57 E-value=18 Score=34.14 Aligned_cols=59 Identities=22% Similarity=0.325 Sum_probs=51.0
Q ss_pred HHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc
Q 024821 84 DVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP 145 (262)
Q Consensus 84 Di~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P 145 (262)
+.+.+|++|.+++.|=+-|. |+++.++|..-.++.+++.++|++.+|--++-+..||.+
T Consensus 112 s~~rike~GadavK~Llyy~---pD~~~ein~~k~a~vervg~eC~a~dipf~lE~l~Yd~~ 170 (329)
T PRK04161 112 SVKRLKEAGADAVKFLLYYD---VDGDEEINDQKQAYIERIGSECTAEDIPFFLELLTYDER 170 (329)
T ss_pred hHHHHHHhCCCeEEEEEEEC---CCCCHHHHHHHHHHHHHHHHHHHHCCCCeEEEEeccCCc
Confidence 57889999999999999887 665667898889999999999999999999988776543
No 75
>PRK12568 glycogen branching enzyme; Provisional
Probab=70.55 E-value=22 Score=37.23 Aligned_cols=98 Identities=12% Similarity=0.151 Sum_probs=59.4
Q ss_pred ccccHHH-HHHHHHcCCCcceec--------cccCccccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeecc--CCCC
Q 024821 78 YHKYKED-VKLMADTGLDAYRFS--------ISWSRLIPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTLH--HLDL 144 (262)
Q Consensus 78 y~~~~eD-i~l~k~lG~~~~R~s--------i~WsRI~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL~--H~~~ 144 (262)
|....+. |+.+|+||++++=+. -+|- -.|.+.-.++. -..+-++++|++|.++||++++++. |+.-
T Consensus 268 ~~~la~~ll~ylk~LGvt~I~LmPi~e~~~~~~wG-Y~~~~~~a~~~~~G~~~dfk~lV~~~H~~Gi~VIlD~V~nH~~~ 346 (730)
T PRK12568 268 WPTLAEQLIPYVQQLGFTHIELLPITEHPFGGSWG-YQPLGLYAPTARHGSPDGFAQFVDACHRAGIGVILDWVSAHFPD 346 (730)
T ss_pred HHHHHHHHHHHHHHcCCCEEEECccccCCCCCCCC-CCCCcCCccCcccCCHHHHHHHHHHHHHCCCEEEEEeccccCCc
Confidence 4444455 688999999998544 2341 12211111111 1245689999999999999999864 5432
Q ss_pred c-----------HhHH-H-h---hCCC-------CChhhHHHHHHHHHHHHHHhC
Q 024821 145 P-----------QALE-D-E---YGGW-------INRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 145 P-----------~~l~-~-~---~gGw-------~~~~~~~~F~~ya~~v~~~~g 176 (262)
- .+-. + . +..| .++++.+.+.+=++.-+++|+
T Consensus 347 d~~~l~~fdg~~~Ye~~d~~~g~~~~W~~~~~N~~~peVr~~li~~a~~Wl~eyh 401 (730)
T PRK12568 347 DAHGLAQFDGAALYEHADPREGMHRDWNTLIYNYGRPEVTAYLLGSALEWIEHYH 401 (730)
T ss_pred cccccccCCCccccccCCCcCCccCCCCCeecccCCHHHHHHHHHHHHHHHHHhC
Confidence 1 0100 0 0 1123 467888888888888888886
No 76
>TIGR02402 trehalose_TreZ malto-oligosyltrehalose trehalohydrolase. Members of this family are the trehalose biosynthetic enzyme malto-oligosyltrehalose trehalohydrolase, formally known as 4-alpha-D-{(1-4)-alpha-D-glucano}trehalose trehalohydrolase (EC 3.2.1.141). It is the TreZ protein of the TreYZ pathway for trehalose biosynthesis, and alternative to the OtsAB system.
Probab=70.50 E-value=21 Score=35.82 Aligned_cols=98 Identities=16% Similarity=0.237 Sum_probs=59.6
Q ss_pred cccccHHHHHHHHHcCCCcceecc--------ccCccccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeec--cCCCC
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSI--------SWSRLIPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTL--HHLDL 144 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si--------~WsRI~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL--~H~~~ 144 (262)
.+.-..+-++.+|+||++++=+.= .|- -.|..--.++. ...+-++++|++|.++||++++++ .|...
T Consensus 109 ~~~gi~~~l~yl~~LGv~~i~L~Pi~~~~~~~~~G-Y~~~~~~~~~~~~G~~~e~k~lV~~aH~~Gi~VilD~V~NH~~~ 187 (542)
T TIGR02402 109 TFDAAIEKLPYLADLGITAIELMPVAQFPGTRGWG-YDGVLPYAPHNAYGGPDDLKALVDAAHGLGLGVILDVVYNHFGP 187 (542)
T ss_pred CHHHHHHhhHHHHHcCCCEEEeCccccCCCCCCCC-CCccCccccccccCCHHHHHHHHHHHHHCCCEEEEEEccCCCCC
Confidence 345566778999999999987652 121 00000001111 134568999999999999999985 36431
Q ss_pred ---------cHhHHHh-hCCC------CCh---hhHHHHHHHHHHHHHHhC
Q 024821 145 ---------PQALEDE-YGGW------INR---MIVKDFTAYADVCFREFG 176 (262)
Q Consensus 145 ---------P~~l~~~-~gGw------~~~---~~~~~F~~ya~~v~~~~g 176 (262)
| |+... ..+| .++ ++.+.+.+-++.-+++||
T Consensus 188 ~~~~~~~~~~-y~~~~~~~~wg~~~n~~~~~~~~vr~~i~~~~~~W~~e~~ 237 (542)
T TIGR02402 188 EGNYLPRYAP-YFTDRYSTPWGAAINFDGPGSDEVRRYILDNALYWLREYH 237 (542)
T ss_pred ccccccccCc-cccCCCCCCCCCccccCCCcHHHHHHHHHHHHHHHHHHhC
Confidence 2 22111 1233 245 777788887777777776
No 77
>PLN02746 hydroxymethylglutaryl-CoA lyase
Probab=70.24 E-value=13 Score=35.40 Aligned_cols=107 Identities=7% Similarity=-0.086 Sum_probs=70.2
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccC-CCC-------cHhHHHh
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHH-LDL-------PQALEDE 151 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H-~~~-------P~~l~~~ 151 (262)
-.+|++...++|++.+++.++=|...-.. -+.--++.++.+.++|+.++++|+++.+++.. |.. |..+.+.
T Consensus 123 n~~die~A~~~g~~~v~i~~s~Sd~h~~~n~~~t~~e~l~~~~~~v~~Ak~~Gl~v~~~is~~fg~p~~~r~~~~~l~~~ 202 (347)
T PLN02746 123 NLKGFEAAIAAGAKEVAVFASASESFSKSNINCSIEESLVRYREVALAAKKHSIPVRGYVSCVVGCPIEGPVPPSKVAYV 202 (347)
T ss_pred CHHHHHHHHHcCcCEEEEEEecCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCeEEEEEEeeecCCccCCCCHHHHHHH
Confidence 67999999999999999998766654331 23334678999999999999999999766632 332 2222211
Q ss_pred ------h----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 152 ------Y----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 152 ------~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
. -|-.+| ....++.+.+.+++++..-..-.-|-..+
T Consensus 203 ~~~~~~~Gad~I~l~DT~G~a~P---~~v~~lv~~l~~~~~~~~i~~H~Hnd~Gl 254 (347)
T PLN02746 203 AKELYDMGCYEISLGDTIGVGTP---GTVVPMLEAVMAVVPVDKLAVHFHDTYGQ 254 (347)
T ss_pred HHHHHHcCCCEEEecCCcCCcCH---HHHHHHHHHHHHhCCCCeEEEEECCCCCh
Confidence 1 244454 55566666777777642123456666655
No 78
>PRK12313 glycogen branching enzyme; Provisional
Probab=70.14 E-value=24 Score=35.96 Aligned_cols=99 Identities=13% Similarity=0.187 Sum_probs=60.1
Q ss_pred ccccHHH-HHHHHHcCCCcceeccccCc-------cccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeec--cCCCCc
Q 024821 78 YHKYKED-VKLMADTGLDAYRFSISWSR-------LIPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTL--HHLDLP 145 (262)
Q Consensus 78 y~~~~eD-i~l~k~lG~~~~R~si~WsR-------I~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL--~H~~~P 145 (262)
|....+. ++.+|+||++++=+.=-... -.+..--.+++ -..+=++++|++|.++||++++++ .|....
T Consensus 169 ~~~~~~~ll~yl~~LGv~~i~L~Pi~~~~~~~~~GY~~~~y~~i~~~~Gt~~d~k~lv~~~H~~Gi~VilD~V~nH~~~~ 248 (633)
T PRK12313 169 YRELADELIPYVKEMGYTHVEFMPLMEHPLDGSWGYQLTGYFAPTSRYGTPEDFMYLVDALHQNGIGVILDWVPGHFPKD 248 (633)
T ss_pred HHHHHHHHHHHHHHcCCCEEEeCchhcCCCCCCCCCCCcCcCcCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCC
Confidence 4445567 49999999999875532111 01110011111 124558999999999999999985 465321
Q ss_pred H----hHH--------H---h-hCCC-------CChhhHHHHHHHHHHHHHHhC
Q 024821 146 Q----ALE--------D---E-YGGW-------INRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 146 ~----~l~--------~---~-~gGw-------~~~~~~~~F~~ya~~v~~~~g 176 (262)
. ++. + . +.+| .++++.+.+.+-++.-+++|+
T Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~n~~~~~vr~~l~~~~~~W~~~~~ 302 (633)
T PRK12313 249 DDGLAYFDGTPLYEYQDPRRAENPDWGALNFDLGKNEVRSFLISSALFWLDEYH 302 (633)
T ss_pred cccccccCCCcceeecCCCCCcCCCCCCcccCCCCHHHHHHHHHHHHHHHHHhC
Confidence 1 110 0 0 0123 368888888888888888876
No 79
>TIGR01515 branching_enzym alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase. A sequence from Arabidopsis thaliana, GP|9294564, scores just above trusted, but appears either to contain corrupt sequence or, more likely, to be a pseudogene as some of the conserved catalytic residues common to the alpha amylase family are not conserved here.
Probab=69.47 E-value=31 Score=35.18 Aligned_cols=99 Identities=11% Similarity=0.102 Sum_probs=61.0
Q ss_pred ccccHHHH-HHHHHcCCCcceec-cccCcc------ccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeecc--CCCC-
Q 024821 78 YHKYKEDV-KLMADTGLDAYRFS-ISWSRL------IPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTLH--HLDL- 144 (262)
Q Consensus 78 y~~~~eDi-~l~k~lG~~~~R~s-i~WsRI------~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL~--H~~~- 144 (262)
|....+.+ +.+|+||++++=+. |..+.- .|..--.++. -..+=++++|++|.++||++|+++. |...
T Consensus 155 ~~~i~~~l~dyl~~LGvt~i~L~Pi~e~~~~~~wGY~~~~y~~~~~~~Gt~~dlk~lV~~~H~~Gi~VilD~V~NH~~~~ 234 (613)
T TIGR01515 155 YRELADQLIPYVKELGFTHIELLPVAEHPFDGSWGYQVTGYYAPTSRFGTPDDFMYFVDACHQAGIGVILDWVPGHFPKD 234 (613)
T ss_pred HHHHHHHHHHHHHHcCCCEEEECCcccCCCCCCCCCCcccCcccccccCCHHHHHHHHHHHHHCCCEEEEEecccCcCCc
Confidence 34445564 89999999999873 433211 1110011221 1234589999999999999999864 5431
Q ss_pred ----------cHhHHH-----hhCCC-------CChhhHHHHHHHHHHHHHHhC
Q 024821 145 ----------PQALED-----EYGGW-------INRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 145 ----------P~~l~~-----~~gGw-------~~~~~~~~F~~ya~~v~~~~g 176 (262)
|.+... .+..| .++++.+.+.+-++.-+++|+
T Consensus 235 ~~~~~~~~~~~~y~~~~~~~~~~~~w~~~~~~~~~~~Vr~~l~~~~~~W~~ey~ 288 (613)
T TIGR01515 235 DHGLAEFDGTPLYEHKDPRDGEHWDWGTLIFDYGRPEVRNFLVANALYWAEFYH 288 (613)
T ss_pred cchhhccCCCcceeccCCccCcCCCCCCceecCCCHHHHHHHHHHHHHHHHHhC
Confidence 111110 00112 467888999999999999887
No 80
>cd07939 DRE_TIM_NifV Streptomyces rubellomurinus FrbC and related proteins, catalytic TIM barrel domain. FrbC (NifV) of Streptomyces rubellomurinus catalyzes the condensation of acetyl-CoA and alpha-ketoglutarate to form homocitrate and CoA, a reaction similar to one catalyzed by homocitrate synthase. The gene encoding FrbC is one of several genes required for the biosynthesis of FR900098, a potent antimalarial antibiotic. This protein is also required for assembly of the nitrogenase MoFe complex but its exact role is unknown. This family also includes the NifV proteins of Heliobacterium chlorum and Gluconacetobacter diazotrophicus, which appear to be orthologous to FrbC. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarbox
Probab=69.32 E-value=5.5 Score=35.78 Aligned_cols=107 Identities=17% Similarity=0.061 Sum_probs=68.2
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC--CcHhHHH------h
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD--LPQALED------E 151 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~--~P~~l~~------~ 151 (262)
-.+|++...+.|++.+|+.++.|.+.=.. -+.-.++.++...+.++.+++.|+++.+++-... .|..+.+ +
T Consensus 71 ~~~~v~~a~~~g~~~i~i~~~~s~~~~~~~~~~~~~~~~~~~~~~i~~a~~~G~~v~~~~~~~~~~~~~~~~~~~~~~~~ 150 (259)
T cd07939 71 VKEDIEAALRCGVTAVHISIPVSDIHLAHKLGKDRAWVLDQLRRLVGRAKDRGLFVSVGAEDASRADPDFLIEFAEVAQE 150 (259)
T ss_pred CHHHHHHHHhCCcCEEEEEEecCHHHHHHHhCCCHHHHHHHHHHHHHHHHHCCCeEEEeeccCCCCCHHHHHHHHHHHHH
Confidence 37899999999999999999888764221 2333356788999999999999998877664221 1322221 1
Q ss_pred h----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 Y----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
. -|...| +...++++.+.++++ .--.+-.-|-..+.
T Consensus 151 ~G~~~i~l~DT~G~~~P---~~v~~lv~~l~~~~~-~~l~~H~Hn~~Gla 196 (259)
T cd07939 151 AGADRLRFADTVGILDP---FTTYELIRRLRAATD-LPLEFHAHNDLGLA 196 (259)
T ss_pred CCCCEEEeCCCCCCCCH---HHHHHHHHHHHHhcC-CeEEEEecCCCChH
Confidence 1 244444 455566666667775 21234466666543
No 81
>TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases. Methanogens, then should and aparrently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea which lack a threonine dehydratase (mainly Euryarchaeota) should contain both a CimA and a LeuA gene. This is true of, for example, archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in TIGR00973 and one in TIGRT00977 which may fulfill these roles. Other species which have only one hit to this model and lack threonine dehydratase are very likely LeuA enzymes.
Probab=69.31 E-value=11 Score=35.72 Aligned_cols=106 Identities=12% Similarity=0.002 Sum_probs=69.6
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCC--CCcHhHHH------h
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHL--DLPQALED------E 151 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~--~~P~~l~~------~ 151 (262)
.++|++.+.+.|++.+|++++-|.+.-.. -+.-.++.++...+.++.+++.|+++.+++-.. .-|..+.+ +
T Consensus 73 ~~~di~~a~~~g~~~i~i~~~~Sd~~~~~~~~~~~~~~~~~~~~~i~~ak~~G~~v~~~~eda~r~~~~~l~~~~~~~~~ 152 (363)
T TIGR02090 73 LKKDIDKAIDCGVDSIHTFIATSPIHLKYKLKKSRDEVLEKAVEAVEYAKEHGLIVEFSAEDATRTDIDFLIKVFKRAEE 152 (363)
T ss_pred CHHHHHHHHHcCcCEEEEEEcCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCEEEEEEeecCCCCHHHHHHHHHHHHh
Confidence 58999999999999999998877664321 122235578889999999999999988887432 11332221 1
Q ss_pred h----------CCCCChhhHHHHHHHHHHHHHHhCCCCc-eEEeccCCchh
Q 024821 152 Y----------GGWINRMIVKDFTAYADVCFREFGDRVS-YWTTVNEPNGF 191 (262)
Q Consensus 152 ~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~-~w~t~NEP~~~ 191 (262)
. -|-.. ++.+.++.+.+.++++ +. .+-+-|-..+.
T Consensus 153 ~g~~~i~l~DT~G~~~---P~~v~~li~~l~~~~~--~~l~~H~Hnd~GlA 198 (363)
T TIGR02090 153 AGADRINIADTVGVLT---PQKMEELIKKLKENVK--LPISVHCHNDFGLA 198 (363)
T ss_pred CCCCEEEEeCCCCccC---HHHHHHHHHHHhcccC--ceEEEEecCCCChH
Confidence 1 23333 3556666666666665 33 24567776654
No 82
>cd07937 DRE_TIM_PC_TC_5S Pyruvate carboxylase and Transcarboxylase 5S, carboxyltransferase domain. This family includes the carboxyltransferase domains of pyruvate carboxylase (PC) and the transcarboxylase (TC) 5S subunit. Transcarboxylase 5S is a cobalt-dependent metalloenzyme subunit of the biotin-dependent transcarboxylase multienzyme complex. Transcarboxylase 5S transfers carbon dioxide from the 1.3S biotin to pyruvate in the second of two carboxylation reactions catalyzed by TC. The first reaction involves the transfer of carbon dioxide from methylmalonyl-CoA to the 1.3S biotin, and is catalyzed by the 12S subunit. These two steps allow a carboxylate group to be transferred from oxaloacetate to propionyl-CoA to yield pyruvate and methylmalonyl-CoA. The catalytic domain of transcarboxylase 5S has a canonical TIM-barrel fold with a large C-terminal extension that forms a funnel leading to the active site. Transcarboxylase 5S forms a homodimer and there are six dimers per complex
Probab=69.21 E-value=22 Score=32.37 Aligned_cols=94 Identities=12% Similarity=0.067 Sum_probs=58.9
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc----CCCCcHhHHH------
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH----HLDLPQALED------ 150 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~----H~~~P~~l~~------ 150 (262)
-++|+++..+.|++.+|+++.-+. ++...+.++.++++|+++.+.+. ....|..+.+
T Consensus 93 ~~~di~~~~~~g~~~iri~~~~~~-------------~~~~~~~i~~ak~~G~~v~~~i~~~~~~~~~~~~~~~~~~~~~ 159 (275)
T cd07937 93 VELFVEKAAKNGIDIFRIFDALND-------------VRNLEVAIKAVKKAGKHVEGAICYTGSPVHTLEYYVKLAKELE 159 (275)
T ss_pred HHHHHHHHHHcCCCEEEEeecCCh-------------HHHHHHHHHHHHHCCCeEEEEEEecCCCCCCHHHHHHHHHHHH
Confidence 578999999999999999875544 45567788888888888776552 1112222221
Q ss_pred hh----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 151 EY----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 151 ~~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+. -|-..| +...+..+.+-++++ ..-.+-+-|-..+.
T Consensus 160 ~~Ga~~i~l~DT~G~~~P---~~v~~lv~~l~~~~~-~~l~~H~Hnd~GlA 206 (275)
T cd07937 160 DMGADSICIKDMAGLLTP---YAAYELVKALKKEVG-LPIHLHTHDTSGLA 206 (275)
T ss_pred HcCCCEEEEcCCCCCCCH---HHHHHHHHHHHHhCC-CeEEEEecCCCChH
Confidence 11 233443 556666777777776 22235567766653
No 83
>COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism]
Probab=68.74 E-value=14 Score=38.83 Aligned_cols=101 Identities=23% Similarity=0.372 Sum_probs=66.9
Q ss_pred HcCCCcceeccc-cCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccC---CCCcHhHH--Hh------------
Q 024821 90 DTGLDAYRFSIS-WSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHH---LDLPQALE--DE------------ 151 (262)
Q Consensus 90 ~lG~~~~R~si~-WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H---~~~P~~l~--~~------------ 151 (262)
++=++++++++. |.+ .-+.-.+|+.-+---+.+|+.|++.||+.++.+.. -|.|..-+ ++
T Consensus 293 ~IP~d~~~lD~~~~~~--~~~~F~wd~~~FP~pk~mi~~l~~~Gikl~~~i~P~i~~d~~~~~e~~~~Gy~~k~~~g~~~ 370 (772)
T COG1501 293 DIPLDVFVLDIDFWMD--NWGDFTWDPDRFPDPKQMIAELHEKGIKLIVIINPYIKQDSPLFKEAIEKGYFVKDPDGEIY 370 (772)
T ss_pred cCcceEEEEeehhhhc--cccceEECcccCCCHHHHHHHHHhcCceEEEEeccccccCCchHHHHHHCCeEEECCCCCEe
Confidence 356779999995 887 22245677766667789999999999999887752 23343211 11
Q ss_pred ----------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchhcc
Q 024821 152 ----------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGFAM 193 (262)
Q Consensus 152 ----------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~~ 193 (262)
+-.+.||+.++.|.+....-...+| -.-+|.=+|||.+...
T Consensus 371 ~~~~w~~~~a~~DFtnp~~r~Ww~~~~~~~l~d~G-v~g~W~D~nEp~~~~~ 421 (772)
T COG1501 371 QADFWPGNSAFPDFTNPDAREWWASDKKKNLLDLG-VDGFWNDMNEPEPFDG 421 (772)
T ss_pred eecccCCcccccCCCCHHHHHHHHHHHHhHHHhcC-ccEEEccCCCCccccc
Confidence 1126789999998873322233333 2567999999998754
No 84
>TIGR02403 trehalose_treC alpha,alpha-phosphotrehalase. Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
Probab=68.62 E-value=6.8 Score=39.25 Aligned_cols=64 Identities=16% Similarity=0.240 Sum_probs=42.3
Q ss_pred cccccHHHHHHHHHcCCCcceeccccCcccc-CCC-----CCCChh--hhhHHHHHHHHHHHcCCcceeecc
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWSRLIP-NGR-----GPVNPK--GLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P-~g~-----g~~n~~--~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
.+.-..+-++.+++||++++=++--...-.- .|- -.+|+. ..+-++++|+++.++||++|+++.
T Consensus 25 ~~~gi~~~l~yl~~lG~~~i~l~Pi~~~~~~~~gY~~~d~~~id~~~Gt~~~~~~lv~~ah~~gi~vilD~v 96 (543)
T TIGR02403 25 DLRGIIEKLDYLKKLGVDYIWLNPFYVSPQKDNGYDVSDYYAINPLFGTMADFEELVSEAKKRNIKIMLDMV 96 (543)
T ss_pred CHHHHHHhHHHHHHcCCCEEEECCcccCCCCCCCCCccccCccCcccCCHHHHHHHHHHHHHCCCEEEEEEC
Confidence 3445667789999999999876533221100 010 122322 356689999999999999999863
No 85
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=68.37 E-value=32 Score=31.89 Aligned_cols=108 Identities=11% Similarity=0.115 Sum_probs=62.0
Q ss_pred cHHHHHHHHHcCCC--cceeccccCccccCC--CCCCChhhhhHHHHHHHHHHHcCCcceeeccCC---CCcHhHHHh--
Q 024821 81 YKEDVKLMADTGLD--AYRFSISWSRLIPNG--RGPVNPKGLQYYNNLINELISYGIQPHVTLHHL---DLPQALEDE-- 151 (262)
Q Consensus 81 ~~eDi~l~k~lG~~--~~R~si~WsRI~P~g--~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~---~~P~~l~~~-- 151 (262)
..+-++.+++.|+. ++=+++.|.. ..+ .-.+|++-+--.+++|++|+++|+++++.++-+ +.+.+-+.+
T Consensus 26 v~~~~~~~~~~~iP~d~i~lD~~~~~--~~~~~~f~~d~~~FPdp~~mi~~L~~~G~kv~~~i~P~v~~~~~~y~e~~~~ 103 (319)
T cd06591 26 LLDVAKEYRKRGIPLDVIVQDWFYWP--KQGWGEWKFDPERFPDPKAMVRELHEMNAELMISIWPTFGPETENYKEMDEK 103 (319)
T ss_pred HHHHHHHHHHhCCCccEEEEechhhc--CCCceeEEEChhhCCCHHHHHHHHHHCCCEEEEEecCCcCCCChhHHHHHHC
Confidence 34445566655444 4444443321 112 334566666677899999999999988766422 122211100
Q ss_pred ---------------------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 ---------------------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ---------------------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+-.+.||+..+.|.+..+..+...| ---+|.=+|||...
T Consensus 104 g~~v~~~~g~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~~~G-vdg~w~D~~Ep~~~ 163 (319)
T cd06591 104 GYLIKTDRGPRVTMQFGGNTRFYDATNPEAREYYWKQLKKNYYDKG-VDAWWLDAAEPEYS 163 (319)
T ss_pred CEEEEcCCCCeeeeeCCCCccccCCCCHHHHHHHHHHHHHHhhcCC-CcEEEecCCCCCcc
Confidence 0125678887877665554444443 24568999999865
No 86
>PLN02784 alpha-amylase
Probab=67.89 E-value=9.5 Score=40.48 Aligned_cols=64 Identities=14% Similarity=0.211 Sum_probs=47.8
Q ss_pred ccccccHHHHHHHHHcCCCcceeccccCccccCCCC-----CCChh--hhhHHHHHHHHHHHcCCcceeec
Q 024821 76 DEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRG-----PVNPK--GLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 76 d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g-----~~n~~--~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.+|....+.++.+++||++++=++=.....-+.|-. .+|.. ..+-++++|++|.++||++++++
T Consensus 518 ~w~~~I~ekldyL~~LG~taIWLpP~~~s~s~~GY~p~D~y~lds~yGT~~ELk~LI~a~H~~GIkVIlDi 588 (894)
T PLN02784 518 RWYMELGEKAAELSSLGFTVVWLPPPTESVSPEGYMPKDLYNLNSRYGTIDELKDLVKSFHEVGIKVLGDA 588 (894)
T ss_pred chHHHHHHHHHHHHHhCCCEEEeCCCCCCCCCCCcCcccccccCcCcCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 478899999999999999999877654444333311 12322 34568999999999999999984
No 87
>PRK12581 oxaloacetate decarboxylase; Provisional
Probab=67.56 E-value=15 Score=36.40 Aligned_cols=56 Identities=20% Similarity=0.393 Sum_probs=39.9
Q ss_pred cccccHHH-----HHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc
Q 024821 77 EYHKYKED-----VKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP 145 (262)
Q Consensus 77 ~y~~~~eD-----i~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P 145 (262)
-|..|.+| +++.++.|++.+|..-.. |. ++-....|+.+++.|....+++.+=+.|
T Consensus 98 gy~~ypddvv~~fv~~a~~~Gidi~Rifd~l-----------nd--~~n~~~ai~~ak~~G~~~~~~i~yt~sp 158 (468)
T PRK12581 98 GYRHYADDIVDKFISLSAQNGIDVFRIFDAL-----------ND--PRNIQQALRAVKKTGKEAQLCIAYTTSP 158 (468)
T ss_pred CccCCcchHHHHHHHHHHHCCCCEEEEcccC-----------CC--HHHHHHHHHHHHHcCCEEEEEEEEEeCC
Confidence 47888999 899999999999976422 21 3445666777777777776666654445
No 88
>PRK07379 coproporphyrinogen III oxidase; Provisional
Probab=66.65 E-value=14 Score=35.43 Aligned_cols=104 Identities=16% Similarity=0.206 Sum_probs=61.1
Q ss_pred cHHHHHHHHHcCCCcceecc-ccC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCc-ceeeccCCCCcHhHHHhhCCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSI-SWS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQ-PHVTLHHLDLPQALEDEYGGWIN 157 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si-~Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~-p~vtL~H~~~P~~l~~~~gGw~~ 157 (262)
-++.++.|+++|+|.+-+++ +-+ ++... -|+.. ..+-..+.++.+++.|+. +-++|. +++|.
T Consensus 114 t~e~l~~l~~~GvnrislGvQS~~d~~L~~-l~R~~--~~~~~~~ai~~l~~~G~~~v~~dlI-~GlPg----------- 178 (400)
T PRK07379 114 DLEQLQGYRSLGVNRVSLGVQAFQDELLAL-CGRSH--RVKDIFAAVDLIHQAGIENFSLDLI-SGLPH----------- 178 (400)
T ss_pred CHHHHHHHHHCCCCEEEEEcccCCHHHHHH-hCCCC--CHHHHHHHHHHHHHcCCCeEEEEee-cCCCC-----------
Confidence 46889999999999555554 221 22221 12211 234567888999999998 556665 46662
Q ss_pred hhhHHHHHHHHHHHHHHhCCCCceEEeccCCchhccccccccC
Q 024821 158 RMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGFAMVGYDFGI 200 (262)
Q Consensus 158 ~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~~~gy~~g~ 200 (262)
++.+.+.+=++.+.+-=-+.|..+...-||......-+..|.
T Consensus 179 -qt~e~~~~tl~~~~~l~p~~is~y~L~~~pgT~l~~~~~~g~ 220 (400)
T PRK07379 179 -QTLEDWQASLEAAIALNPTHLSCYDLVLEPGTAFGKQYQPGK 220 (400)
T ss_pred -CCHHHHHHHHHHHHcCCCCEEEEecceecCCchhHHHhhcCC
Confidence 233445544554444333567777777788765444444443
No 89
>cd06543 GH18_PF-ChiA-like PF-ChiA is an uncharacterized chitinase found in the hyperthermophilic archaeon Pyrococcus furiosus with a glycosyl hydrolase family 18 (GH18) catalytic domain as well as a cellulose-binding domain. Members of this domain family are found not only in archaea but also in eukaryotes and prokaryotes. PF-ChiA exhibits hydrolytic activity toward both colloidal and crystalline (beta/alpha) chitins at high temperature.
Probab=66.41 E-value=26 Score=32.39 Aligned_cols=59 Identities=17% Similarity=0.100 Sum_probs=35.9
Q ss_pred HHHHHcCCCcceeccc--cCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCC
Q 024821 86 KLMADTGLDAYRFSIS--WSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDL 144 (262)
Q Consensus 86 ~l~k~lG~~~~R~si~--WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~ 144 (262)
+.+++.|++.+-+++- -..-.|.-.|.............|..|+++|++++|.+-.+.-
T Consensus 19 ~~~~~~g~~~v~lAFi~~~~~~~~~w~g~~~~~~~~~~~~~i~~lk~~G~kViiS~GG~~g 79 (294)
T cd06543 19 TYAAATGVKAFTLAFIVASGGCKPAWGGSYPLDQGGWIKSDIAALRAAGGDVIVSFGGASG 79 (294)
T ss_pred HHHHHcCCCEEEEEEEEcCCCCcccCCCCCCcccchhHHHHHHHHHHcCCeEEEEecCCCC
Confidence 5677889998887753 2222222011100011234577899999999999999876543
No 90
>cd07948 DRE_TIM_HCS Saccharomyces cerevisiae homocitrate synthase and related proteins, catalytic TIM barrel domain. Homocitrate synthase (HCS) catalyzes the condensation of acetyl-CoA and alpha-ketoglutarate to form homocitrate, the first step in the lysine biosynthesis pathway. This family includes the Yarrowia lipolytica LYS1 protein as well as the Saccharomyces cerevisiae LYS20 and LYS21 proteins. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. Th
Probab=65.86 E-value=7.7 Score=35.21 Aligned_cols=65 Identities=15% Similarity=0.224 Sum_probs=48.7
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccC-CCCc
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHH-LDLP 145 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H-~~~P 145 (262)
-.+|++...+.|++.+|+.++=|...-.. .+.--++.++...++++.++++|+++.+++-. +..|
T Consensus 73 ~~~di~~a~~~g~~~i~i~~~~S~~~~~~~~~~~~~e~~~~~~~~i~~a~~~G~~v~~~~eda~r~~ 139 (262)
T cd07948 73 HMDDARIAVETGVDGVDLVFGTSPFLREASHGKSITEIIESAVEVIEFVKSKGIEVRFSSEDSFRSD 139 (262)
T ss_pred CHHHHHHHHHcCcCEEEEEEecCHHHHHHHhCCCHHHHHHHHHHHHHHHHHCCCeEEEEEEeeCCCC
Confidence 46799999999999999998665533221 12223567899999999999999999998842 3444
No 91
>cd07944 DRE_TIM_HOA_like 4-hydroxy-2-oxovalerate aldolase-like, N-terminal catalytic TIM barrel domain. This family of bacterial enzymes is sequence-similar to 4-hydroxy-2-oxovalerate aldolase (HOA) but its exact function is unknown. This family includes the Bacteroides vulgatus Bvu_2661 protein and belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. The domain has a catalytic center containing a divalent cation-binding site formed by a cluster of invariant residues t
Probab=65.72 E-value=20 Score=32.55 Aligned_cols=94 Identities=11% Similarity=0.056 Sum_probs=62.6
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccC-CCCc-HhHHHh------h-
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHH-LDLP-QALEDE------Y- 152 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H-~~~P-~~l~~~------~- 152 (262)
.+|++...+.|++.+|+++..+ .++-..+.++.++++|+++.+.+.+ +..+ ..+.+. .
T Consensus 85 ~~~l~~a~~~gv~~iri~~~~~-------------~~~~~~~~i~~ak~~G~~v~~~~~~a~~~~~~~~~~~~~~~~~~g 151 (266)
T cd07944 85 IDLLEPASGSVVDMIRVAFHKH-------------EFDEALPLIKAIKEKGYEVFFNLMAISGYSDEELLELLELVNEIK 151 (266)
T ss_pred HHHHHHHhcCCcCEEEEecccc-------------cHHHHHHHHHHHHHCCCeEEEEEEeecCCCHHHHHHHHHHHHhCC
Confidence 5789999999999999987332 3677889999999999999988764 2233 322211 1
Q ss_pred ---------CCCCChhhHHHHHHHHHHHHHHhCCCCc-eEEeccCCchh
Q 024821 153 ---------GGWINRMIVKDFTAYADVCFREFGDRVS-YWTTVNEPNGF 191 (262)
Q Consensus 153 ---------gGw~~~~~~~~F~~ya~~v~~~~gd~V~-~w~t~NEP~~~ 191 (262)
.|-.+| +...++++.+.+++++.+. ..-.-|-..+.
T Consensus 152 ~~~i~l~DT~G~~~P---~~v~~lv~~l~~~~~~~~~i~~H~Hn~~Gla 197 (266)
T cd07944 152 PDVFYIVDSFGSMYP---EDIKRIISLLRSNLDKDIKLGFHAHNNLQLA 197 (266)
T ss_pred CCEEEEecCCCCCCH---HHHHHHHHHHHHhcCCCceEEEEeCCCccHH
Confidence 344444 5666667777777764233 34466666654
No 92
>cd06599 GH31_glycosidase_Aec37 Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=65.34 E-value=39 Score=31.33 Aligned_cols=108 Identities=12% Similarity=0.047 Sum_probs=61.8
Q ss_pred HHHHHHHHHcCCC--cceeccccCccccC--CCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC---CcHhHHHh---
Q 024821 82 KEDVKLMADTGLD--AYRFSISWSRLIPN--GRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD---LPQALEDE--- 151 (262)
Q Consensus 82 ~eDi~l~k~lG~~--~~R~si~WsRI~P~--g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~---~P~~l~~~--- 151 (262)
.+-++.+++.|+. ++=+.+.|....-. ..-.+|++.+--.+++|++|+++|++.++.++-+- .|..-+.+
T Consensus 32 ~~~~~~~r~~~iP~d~i~ld~~~~~~~~~~~~~f~~d~~~FPdp~~mi~~L~~~g~k~~~~i~P~i~~~~~~y~e~~~~g 111 (317)
T cd06599 32 LEFIDKCREHDIPCDSFHLSSGYTSIEGGKRYVFNWNKDRFPDPAAFVAKFHERGIRLAPNIKPGLLQDHPRYKELKEAG 111 (317)
T ss_pred HHHHHHHHHcCCCeeEEEEeccccccCCCceeeeecCcccCCCHHHHHHHHHHCCCEEEEEeCCcccCCCHHHHHHHHCC
Confidence 4445566666653 44444455543111 01245555555578999999999999888665332 22211100
Q ss_pred h----------------------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 152 Y----------------------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 152 ~----------------------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
| -.+.||+..+.|.+..+..+...| -.-.|+=+|||.+
T Consensus 112 ~~v~~~~g~~~~~~~~w~g~~~~~Dftnp~a~~ww~~~~~~~~~~~G-vdg~w~D~~E~~~ 171 (317)
T cd06599 112 AFIKPPDGREPSIGQFWGGVGSFVDFTNPEGREWWKEGVKEALLDLG-IDSTWNDNNEYEI 171 (317)
T ss_pred cEEEcCCCCCcceecccCCCeEeecCCChHHHHHHHHHHHHHHhcCC-CcEEEecCCCCcc
Confidence 0 113678888888776655554443 2346888999974
No 93
>TIGR00433 bioB biotin synthetase. Catalyzes the last step of the biotin biosynthesis pathway.
Probab=65.17 E-value=14 Score=33.46 Aligned_cols=55 Identities=18% Similarity=0.201 Sum_probs=39.7
Q ss_pred cHHHHHHHHHcCCCcceeccccC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCcceee
Q 024821 81 YKEDVKLMADTGLDAYRFSISWS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVT 138 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vt 138 (262)
-+|.++.|+++|++.+-++++-+ ++.+.-.+. ..++.+.+.++.+++.||.+...
T Consensus 122 ~~e~l~~Lk~aG~~~v~i~~E~~~~~~~~i~~~---~s~~~~~~ai~~l~~~Gi~v~~~ 177 (296)
T TIGR00433 122 DPEQAKRLKDAGLDYYNHNLDTSQEFYSNIIST---HTYDDRVDTLENAKKAGLKVCSG 177 (296)
T ss_pred CHHHHHHHHHcCCCEEEEcccCCHHHHhhccCC---CCHHHHHHHHHHHHHcCCEEEEe
Confidence 48999999999999999998822 133321122 23667888999999999986544
No 94
>PRK03705 glycogen debranching enzyme; Provisional
Probab=64.56 E-value=11 Score=38.72 Aligned_cols=91 Identities=15% Similarity=0.239 Sum_probs=55.2
Q ss_pred HHHHHHcCCCcceeccc-----------------c-------CccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 85 VKLMADTGLDAYRFSIS-----------------W-------SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 85 i~l~k~lG~~~~R~si~-----------------W-------sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
|+.+|+||++++=+.=- | -.+.|. -|.-....++-++++|++|.++||++|+++.
T Consensus 185 LdYLk~LGvt~I~L~Pv~~~~~~~~~~~~g~~~ywGYd~~~yfa~d~~-ygt~~~~~~~efk~LV~~~H~~GI~VIlDvV 263 (658)
T PRK03705 185 IAYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPLAMFALDPA-YASGPETALDEFRDAVKALHKAGIEVILDVV 263 (658)
T ss_pred hHHHHHcCCCEEEecCcccCCCcccccccccccccCcccccccccccc-cCCCCcchHHHHHHHHHHHHHCCCEEEEEEc
Confidence 89999999999865421 1 122222 1111112356799999999999999999853
Q ss_pred --CCC-----CcH----------h--HHH--hh---CC------CCChhhHHHHHHHHHHHHHHhC
Q 024821 141 --HLD-----LPQ----------A--LED--EY---GG------WINRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 141 --H~~-----~P~----------~--l~~--~~---gG------w~~~~~~~~F~~ya~~v~~~~g 176 (262)
|-. -|. + +.. .| .| +.++.+.+.+.+-++.-+++||
T Consensus 264 ~NHt~~~~~~~~~~~~~~~d~~~yy~~~~~g~~~~~~g~g~~ln~~~p~Vr~~iid~l~~W~~e~g 329 (658)
T PRK03705 264 FNHSAELDLDGPTLSLRGIDNRSYYWIREDGDYHNWTGCGNTLNLSHPAVVDWAIDCLRYWVETCH 329 (658)
T ss_pred ccCccCcCCCCcchhcccCCCccceEECCCCCcCCCCCccCcccCCCHHHHHHHHHHHHHHHHHhC
Confidence 432 121 1 000 01 11 2367777888887777777776
No 95
>cd07943 DRE_TIM_HOA 4-hydroxy-2-oxovalerate aldolase, N-terminal catalytic TIM barrel domain. 4-hydroxy 2-ketovalerate aldolase (Also known as 4-hydroxy-2-ketovalerate aldolase and 4-hydroxy-2-oxopentanoate aldolase (HOA)) converts 4-hydroxy-2-oxopentanoate to acetaldehyde and pyruvate, the penultimate step in the meta-cleavage pathway for the degradation of phenols, cresols and catechol. This family includes the Escherichia coli MhpE aldolase, the Pseudomonas DmpG aldolase, and the Burkholderia xenovorans BphI pyruvate aldolase. In Pseudomonas, the DmpG aldolase tightly associates with a dehydrogenase (DmpF ) and is inactive without it. HOA has a canonical TIM-barrel fold with a C-terminal extension that forms a funnel leading to the active site. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate
Probab=64.55 E-value=39 Score=30.31 Aligned_cols=93 Identities=18% Similarity=0.233 Sum_probs=63.2
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc--CCCCcHhHHHh------h-
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH--HLDLPQALEDE------Y- 152 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~--H~~~P~~l~~~------~- 152 (262)
.+|++..++.|++.+|+.++-+.+ .-..+.++.+++.|+++.+++. +...|..+.+. .
T Consensus 88 ~~~i~~a~~~g~~~iri~~~~s~~-------------~~~~~~i~~ak~~G~~v~~~~~~~~~~~~~~~~~~~~~~~~~G 154 (263)
T cd07943 88 VDDLKMAADLGVDVVRVATHCTEA-------------DVSEQHIGAARKLGMDVVGFLMMSHMASPEELAEQAKLMESYG 154 (263)
T ss_pred HHHHHHHHHcCCCEEEEEechhhH-------------HHHHHHHHHHHHCCCeEEEEEEeccCCCHHHHHHHHHHHHHcC
Confidence 699999999999999998866643 2357788899999999888873 22334433221 1
Q ss_pred ---------CCCCChhhHHHHHHHHHHHHHHhCCCCc-eEEeccCCchh
Q 024821 153 ---------GGWINRMIVKDFTAYADVCFREFGDRVS-YWTTVNEPNGF 191 (262)
Q Consensus 153 ---------gGw~~~~~~~~F~~ya~~v~~~~gd~V~-~w~t~NEP~~~ 191 (262)
-|-..| +...++++.+-++++. ++ .+-+-|-..+.
T Consensus 155 ~d~i~l~DT~G~~~P---~~v~~lv~~l~~~~~~-~~l~~H~Hn~~GlA 199 (263)
T cd07943 155 ADCVYVTDSAGAMLP---DDVRERVRALREALDP-TPVGFHGHNNLGLA 199 (263)
T ss_pred CCEEEEcCCCCCcCH---HHHHHHHHHHHHhCCC-ceEEEEecCCcchH
Confidence 244444 6677778888888876 33 24577777654
No 96
>PRK09058 coproporphyrinogen III oxidase; Provisional
Probab=64.31 E-value=22 Score=34.70 Aligned_cols=106 Identities=14% Similarity=0.122 Sum_probs=65.4
Q ss_pred cHHHHHHHHHcCCCcceecc-ccCc-cccCCCCCCChhhhhHHHHHHHHHHHcC-CcceeeccCCCCcHhHHHhhCCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSI-SWSR-LIPNGRGPVNPKGLQYYNNLINELISYG-IQPHVTLHHLDLPQALEDEYGGWIN 157 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si-~WsR-I~P~g~g~~n~~~l~~y~~~id~l~~~G-I~p~vtL~H~~~P~~l~~~~gGw~~ 157 (262)
-+|.+++|+++|+|.+.+++ |-+. +... -|... ..+-..+.|+.+++.| +.+.++|. +++|.
T Consensus 162 t~e~l~~l~~aGvnRiSiGVQSf~d~vLk~-lgR~~--~~~~~~~~i~~l~~~g~~~v~~DlI-~GlPg----------- 226 (449)
T PRK09058 162 DDEKADAALDAGANRFSIGVQSFNTQVRRR-AGRKD--DREEVLARLEELVARDRAAVVCDLI-FGLPG----------- 226 (449)
T ss_pred CHHHHHHHHHcCCCEEEecCCcCCHHHHHH-hCCCC--CHHHHHHHHHHHHhCCCCcEEEEEE-eeCCC-----------
Confidence 46889999999999777776 3432 2222 23322 1345678899999999 66666765 45662
Q ss_pred hhhHHHHHHHHHHHHHHhCCCCceEEeccCCchhccccccccCCC
Q 024821 158 RMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGFAMVGYDFGIAP 202 (262)
Q Consensus 158 ~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~~~gy~~g~~~ 202 (262)
++.+.|.+=.+.+.+-=-+.|..+...-||.......+..|..+
T Consensus 227 -qT~e~~~~~l~~~~~l~~~~is~y~L~~~pgT~l~~~~~~g~l~ 270 (449)
T PRK09058 227 -QTPEIWQQDLAIVRDLGLDGVDLYALNLLPGTPLAKAVEKGKLP 270 (449)
T ss_pred -CCHHHHHHHHHHHHhcCCCEEEEeccccCCCCHHHHHHHcCCCC
Confidence 23345555555555433367888888888887544333344443
No 97
>PRK12331 oxaloacetate decarboxylase; Provisional
Probab=64.03 E-value=23 Score=34.79 Aligned_cols=93 Identities=11% Similarity=0.015 Sum_probs=58.6
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc----HhHHH---h---
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP----QALED---E--- 151 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P----~~l~~---~--- 151 (262)
++|++...+.|++.+|+.++-+.+. ...+.|+.+++.|+.+.+++..-+-| ..+.+ +
T Consensus 99 ~~~v~~A~~~Gvd~irif~~lnd~~-------------n~~~~v~~ak~~G~~v~~~i~~t~~p~~~~~~~~~~a~~l~~ 165 (448)
T PRK12331 99 ESFVQKSVENGIDIIRIFDALNDVR-------------NLETAVKATKKAGGHAQVAISYTTSPVHTIDYFVKLAKEMQE 165 (448)
T ss_pred HHHHHHHHHCCCCEEEEEEecCcHH-------------HHHHHHHHHHHcCCeEEEEEEeecCCCCCHHHHHHHHHHHHH
Confidence 5567999999999999998665541 25668889999998887777654445 21111 1
Q ss_pred ----------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 ----------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ----------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
-.|..+|. ...+.++.+-++++ ..-..-+-|-..+.
T Consensus 166 ~Gad~I~i~Dt~G~l~P~---~v~~lv~alk~~~~-~pi~~H~Hnt~GlA 211 (448)
T PRK12331 166 MGADSICIKDMAGILTPY---VAYELVKRIKEAVT-VPLEVHTHATSGIA 211 (448)
T ss_pred cCCCEEEEcCCCCCCCHH---HHHHHHHHHHHhcC-CeEEEEecCCCCcH
Confidence 14566664 34445555666664 22234566666553
No 98
>TIGR00612 ispG_gcpE 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase. Chlamydial members of the family have a long insert. The family is largely restricted to Bacteria, where it is widely but not universally distributed. No homology can be detected between the GcpE family and other proteins.
Probab=63.54 E-value=54 Score=31.22 Aligned_cols=85 Identities=18% Similarity=0.259 Sum_probs=60.2
Q ss_pred cCCccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhh
Q 024821 73 VACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEY 152 (262)
Q Consensus 73 ~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~ 152 (262)
+|.=||+ |+-.+..+ +.|++.+|+. +|.+-. -+..+.+++.++++|+..=+...|-.++.-+.++|
T Consensus 76 VADIHFd-~~lAl~a~-~~g~dkiRIN----------PGNig~--~e~v~~vv~~ak~~~ipIRIGVN~GSL~~~~~~ky 141 (346)
T TIGR00612 76 VADIHFD-YRLAALAM-AKGVAKVRIN----------PGNIGF--RERVRDVVEKARDHGKAMRIGVNHGSLERRLLEKY 141 (346)
T ss_pred EEeeCCC-cHHHHHHH-HhccCeEEEC----------CCCCCC--HHHHHHHHHHHHHCCCCEEEecCCCCCcHHHHHHc
Confidence 3444554 55555444 4599999863 344432 36789999999999999999999999999999998
Q ss_pred CCCCChhhHHHHHHHHHHH
Q 024821 153 GGWINRMIVKDFTAYADVC 171 (262)
Q Consensus 153 gGw~~~~~~~~F~~ya~~v 171 (262)
|+-..+-.++--.++++.+
T Consensus 142 g~~t~eamveSAl~~v~~l 160 (346)
T TIGR00612 142 GDATAEAMVQSALEEAAIL 160 (346)
T ss_pred CCCCHHHHHHHHHHHHHHH
Confidence 7544444666666666653
No 99
>PF05089 NAGLU: Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain; InterPro: IPR024733 Alpha-N-acetylglucosaminidase is a lysosomal enzyme that is required for the stepwise degradation of heparan sulphate []. Mutations on the alpha-N-acetylglucosaminidase gene can lead to mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B), characterised by neurological dysfunction but relatively mild somatic manifestations []. Alpha-N-acetylglucosaminidase is composed of three domains. This entry represents the central domain, which has a tim barrel fold [].; PDB: 4A4A_A 2VC9_A 2VCC_A 2VCB_A 2VCA_A.
Probab=63.14 E-value=39 Score=32.05 Aligned_cols=110 Identities=18% Similarity=0.471 Sum_probs=58.2
Q ss_pred ccccHHHHHHHHHcCCCcce---------------ec---------------cccCccccC-C-CCCCC----hhhhhHH
Q 024821 78 YHKYKEDVKLMADTGLDAYR---------------FS---------------ISWSRLIPN-G-RGPVN----PKGLQYY 121 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R---------------~s---------------i~WsRI~P~-g-~g~~n----~~~l~~y 121 (262)
|.||++.|+.|+--|||..= ++ +.|.|..-- | .|.+. ++-.+.=
T Consensus 18 W~rWEreIDWMALnGiNl~La~~GqEavw~~v~~~~G~t~~ei~~ff~GPA~laW~rMgNl~gwgGPLp~~w~~~q~~Lq 97 (333)
T PF05089_consen 18 WERWEREIDWMALNGINLPLAIVGQEAVWQRVLRELGLTDEEIREFFTGPAFLAWWRMGNLQGWGGPLPQSWIDQQAELQ 97 (333)
T ss_dssp HHHHHHHHHHHHHTT--EEE--TTHHHHHHHHHGGGT--HHHHHHHS--TT-HHHHHTTS--STT----TTHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCchhhhhhHHHHHHHHHHHHcCCCHHHHHHHcCCHHHHHHHHhCCcccCCCCCCHHHHHHHHHHH
Confidence 56888999999888888321 11 234444221 1 12221 2335566
Q ss_pred HHHHHHHHHcCCcceeeccCCCCcHhHHHhh--------CCC--------CChhhHHHHHHHHHHH----HHHhCCCCce
Q 024821 122 NNLINELISYGIQPHVTLHHLDLPQALEDEY--------GGW--------INRMIVKDFTAYADVC----FREFGDRVSY 181 (262)
Q Consensus 122 ~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~--------gGw--------~~~~~~~~F~~ya~~v----~~~~gd~V~~ 181 (262)
+++++.+++.||+|++--+---.|..+.+++ +.| ++|.. +.|.+.++.. -+.|| .-.+
T Consensus 98 ~kIl~RmreLGm~PVLPaF~G~VP~~~~~~~P~a~i~~~~~W~~f~~~~~L~P~d-plF~~i~~~F~~~q~~~yG-~~~~ 175 (333)
T PF05089_consen 98 KKILDRMRELGMTPVLPAFAGHVPRAFKRKYPNANITRQGNWNGFCRPYFLDPTD-PLFAEIAKLFYEEQIKLYG-TDHI 175 (333)
T ss_dssp HHHHHHHHHHT-EEEEE--S-EE-TTHHHHSTT--EE---EETTEE--EEE-SS---HHHHHHHHHHHHHHHHH----SE
T ss_pred HHHHHHHHHcCCcccCCCcCCCCChHHHhcCCCCEEeeCCCcCCCCCCceeCCCC-chHHHHHHHHHHHHHHhcC-CCce
Confidence 8999999999999999766444788877765 223 23322 5666666544 56688 4555
Q ss_pred EE--eccCCc
Q 024821 182 WT--TVNEPN 189 (262)
Q Consensus 182 w~--t~NEP~ 189 (262)
+. +|||-.
T Consensus 176 Y~~D~FnE~~ 185 (333)
T PF05089_consen 176 YAADPFNEGG 185 (333)
T ss_dssp EE--TTTTS-
T ss_pred eCCCccCCCC
Confidence 54 899943
No 100
>TIGR00539 hemN_rel putative oxygen-independent coproporphyrinogen III oxidase. Experimentally determined examples of oxygen-independent coproporphyrinogen III oxidase, an enzyme that replaces HemF function under anaerobic conditions, belong to a family of proteins described by the model hemN. This model, hemN_rel, models a closely related protein, shorter at the amino end and lacking the region containing the motif PYRT[SC]YP found in members of the hemN family. Several species, including E. coli, Helicobacter pylori, Aquifex aeolicus, and Chlamydia trachomatis, have members of both this family and the E. coli hemN family. The member of this family from Bacillus subtilis was shown to complement an hemF/hemN double mutant of Salmonella typimurium and to prevent accumulation of coproporphyrinogen III under anaerobic conditions, but the exact role of this protein is still uncertain. It is found in a number of species that do not synthesize heme de novo.
Probab=62.70 E-value=19 Score=33.89 Aligned_cols=92 Identities=15% Similarity=0.219 Sum_probs=53.2
Q ss_pred cHHHHHHHHHcCCCcceeccc-cC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCcHhHHHhhCCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSIS-WS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLPQALEDEYGGWIN 157 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~-Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~l~~~~gGw~~ 157 (262)
-++.++.|+++|++.+-++++ -+ ++... -|.. ...+-+.+.|+.+++.|+.++ ++|. +++|.
T Consensus 99 t~e~l~~l~~~Gv~risiGvqS~~~~~l~~-lgR~--~~~~~~~~ai~~l~~~G~~~v~~dli-~GlPg----------- 163 (360)
T TIGR00539 99 TAEWCKGLKGAGINRLSLGVQSFRDDKLLF-LGRQ--HSAKNIAPAIETALKSGIENISLDLM-YGLPL----------- 163 (360)
T ss_pred CHHHHHHHHHcCCCEEEEecccCChHHHHH-hCCC--CCHHHHHHHHHHHHHcCCCeEEEecc-CCCCC-----------
Confidence 468899999999985555552 32 23322 1321 125668889999999999754 4443 45552
Q ss_pred hhhHHHHHHHHHHHHHHhC-CCCceEEeccCCc
Q 024821 158 RMIVKDFTAYADVCFREFG-DRVSYWTTVNEPN 189 (262)
Q Consensus 158 ~~~~~~F~~ya~~v~~~~g-d~V~~w~t~NEP~ 189 (262)
++.+.|.+-.+.+.+ ++ +.+..+...=||+
T Consensus 164 -qt~~~~~~~l~~~~~-l~~~~is~y~l~~~~g 194 (360)
T TIGR00539 164 -QTLNSLKEELKLAKE-LPINHLSAYALSVEPN 194 (360)
T ss_pred -CCHHHHHHHHHHHHc-cCCCEEEeecceEcCC
Confidence 234555555555443 33 3444444333444
No 101
>TIGR03217 4OH_2_O_val_ald 4-hydroxy-2-oxovalerate aldolase. Members of this protein family are 4-hydroxy-2-oxovalerate aldolase, also called 4-hydroxy-2-ketovalerate aldolase and 2-oxo-4-hydroxypentanoate aldolase. This enzyme, part of the pathway for the meta-cleavage of catechol, produces pyruvate and acetaldehyde. Acetaldehyde is then converted by acetaldehyde dehydrogenase (acylating) (DmpF; EC 1.2.1.10) to acetyl-CoA. The two enzymes are tightly associated.
Probab=62.56 E-value=58 Score=30.65 Aligned_cols=94 Identities=15% Similarity=0.170 Sum_probs=63.7
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec--cCCCCcHhHHHh------h
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL--HHLDLPQALEDE------Y 152 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL--~H~~~P~~l~~~------~ 152 (262)
..+|++...+.|++.+|+....+.. +-..+.|+.+++.|+++.+.+ .|...|..+.+. +
T Consensus 89 ~~~dl~~a~~~gvd~iri~~~~~e~-------------d~~~~~i~~ak~~G~~v~~~l~~s~~~~~e~l~~~a~~~~~~ 155 (333)
T TIGR03217 89 TVHDLKAAYDAGARTVRVATHCTEA-------------DVSEQHIGMARELGMDTVGFLMMSHMTPPEKLAEQAKLMESY 155 (333)
T ss_pred CHHHHHHHHHCCCCEEEEEeccchH-------------HHHHHHHHHHHHcCCeEEEEEEcccCCCHHHHHHHHHHHHhc
Confidence 3689999999999999988754433 235789999999999988776 454455544321 2
Q ss_pred ----------CCCCChhhHHHHHHHHHHHHHHhCCCCce-EEeccCCch
Q 024821 153 ----------GGWINRMIVKDFTAYADVCFREFGDRVSY-WTTVNEPNG 190 (262)
Q Consensus 153 ----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~-w~t~NEP~~ 190 (262)
.|-..| +...++++.+-+++++.+.. +-.-|-..+
T Consensus 156 Ga~~i~i~DT~G~~~P---~~v~~~v~~l~~~l~~~i~ig~H~HnnlGl 201 (333)
T TIGR03217 156 GADCVYIVDSAGAMLP---DDVRDRVRALKAVLKPETQVGFHAHHNLSL 201 (333)
T ss_pred CCCEEEEccCCCCCCH---HHHHHHHHHHHHhCCCCceEEEEeCCCCch
Confidence 244444 66777777777888744432 446666554
No 102
>PRK05628 coproporphyrinogen III oxidase; Validated
Probab=62.36 E-value=22 Score=33.64 Aligned_cols=94 Identities=16% Similarity=0.141 Sum_probs=55.7
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC---CCCCChhhhhHHHHHHHHHHHcCCc-ceeeccCCCCcHhHHHhhCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG---RGPVNPKGLQYYNNLINELISYGIQ-PHVTLHHLDLPQALEDEYGGWI 156 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g---~g~~n~~~l~~y~~~id~l~~~GI~-p~vtL~H~~~P~~l~~~~gGw~ 156 (262)
-+|.+++|+++|++.+.+++ ...-++- -+.. ...+-..+.++.+++.|+. +.++|. +++|.
T Consensus 107 ~~e~l~~l~~~G~~rvslGv--QS~~~~~L~~l~R~--~s~~~~~~a~~~l~~~g~~~v~~dli-~GlPg---------- 171 (375)
T PRK05628 107 SPEFFAALRAAGFTRVSLGM--QSAAPHVLAVLDRT--HTPGRAVAAAREARAAGFEHVNLDLI-YGTPG---------- 171 (375)
T ss_pred CHHHHHHHHHcCCCEEEEec--ccCCHHHHHHcCCC--CCHHHHHHHHHHHHHcCCCcEEEEEe-ccCCC----------
Confidence 46889999999998555554 3333221 1221 1245677899999999998 556654 34552
Q ss_pred ChhhHHHHHHHHHHHHHHhC-CCCceEEeccCCchhc
Q 024821 157 NRMIVKDFTAYADVCFREFG-DRVSYWTTVNEPNGFA 192 (262)
Q Consensus 157 ~~~~~~~F~~ya~~v~~~~g-d~V~~w~t~NEP~~~~ 192 (262)
++.+.|.+=.+.+.+ ++ +.|..+...-||+...
T Consensus 172 --qt~~~~~~tl~~~~~-l~~~~i~~y~l~~~~gT~l 205 (375)
T PRK05628 172 --ESDDDWRASLDAALE-AGVDHVSAYALIVEDGTAL 205 (375)
T ss_pred --CCHHHHHHHHHHHHh-cCCCEEEeeeeecCCCChH
Confidence 234555555555443 44 4555555445666543
No 103
>TIGR01108 oadA oxaloacetate decarboxylase alpha subunit. This model describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea. The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.
Probab=62.18 E-value=22 Score=36.08 Aligned_cols=93 Identities=16% Similarity=0.161 Sum_probs=58.7
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc----HhHHH------h
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP----QALED------E 151 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P----~~l~~------~ 151 (262)
++|++...+.|++.+|+..+.+.+ +-....++.++++|+.+.+++..-+.| ..+.+ +
T Consensus 94 ~~~v~~a~~~Gvd~irif~~lnd~-------------~n~~~~i~~ak~~G~~v~~~i~~t~~p~~~~~~~~~~~~~~~~ 160 (582)
T TIGR01108 94 ERFVKKAVENGMDVFRIFDALNDP-------------RNLQAAIQAAKKHGAHAQGTISYTTSPVHTLETYLDLAEELLE 160 (582)
T ss_pred HHHHHHHHHCCCCEEEEEEecCcH-------------HHHHHHHHHHHHcCCEEEEEEEeccCCCCCHHHHHHHHHHHHH
Confidence 445899999999999999766653 235677788888888888776554455 21111 0
Q ss_pred ----------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 ----------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ----------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
-.|...| ....+.++.+-++++ ..-..-+-|-..+.
T Consensus 161 ~Gad~I~i~Dt~G~~~P---~~v~~lv~~lk~~~~-~pi~~H~Hnt~Gla 206 (582)
T TIGR01108 161 MGVDSICIKDMAGILTP---KAAYELVSALKKRFG-LPVHLHSHATTGMA 206 (582)
T ss_pred cCCCEEEECCCCCCcCH---HHHHHHHHHHHHhCC-CceEEEecCCCCcH
Confidence 1355555 444555566666665 22345677776654
No 104
>PRK10785 maltodextrin glucosidase; Provisional
Probab=62.04 E-value=11 Score=38.19 Aligned_cols=58 Identities=16% Similarity=0.197 Sum_probs=39.6
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCC-------CCCChh--hhhHHHHHHHHHHHcCCcceeecc
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGR-------GPVNPK--GLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~-------g~~n~~--~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
..+-++.+++|||+++=++=-... |..- -.+|+. ..+=++++|+++.++||++|+++.
T Consensus 181 I~~kLdYL~~LGv~~I~L~Pif~s--~s~hgYd~~Dy~~iDp~~Gt~~df~~Lv~~aH~rGikVilD~V 247 (598)
T PRK10785 181 ISEKLPYLKKLGVTALYLNPIFTA--PSVHKYDTEDYRHVDPQLGGDAALLRLRHATQQRGMRLVLDGV 247 (598)
T ss_pred HHHHHHHHHHcCCCEEEeCCcccC--CCCCCcCcccccccCcccCCHHHHHHHHHHHHHCCCEEEEEEC
Confidence 456689999999999877632211 1100 122322 345689999999999999999863
No 105
>PRK06294 coproporphyrinogen III oxidase; Provisional
Probab=61.27 E-value=26 Score=33.25 Aligned_cols=94 Identities=17% Similarity=0.176 Sum_probs=60.5
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC---CCCCChhhhhHHHHHHHHHHHcCCc-ceeeccCCCCcHhHHHhhCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG---RGPVNPKGLQYYNNLINELISYGIQ-PHVTLHHLDLPQALEDEYGGWI 156 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g---~g~~n~~~l~~y~~~id~l~~~GI~-p~vtL~H~~~P~~l~~~~gGw~ 156 (262)
-++.++.++++|+| |+||.-..+.++. -|... ..+-..+.++.+++.|+. .-++|. +++|.
T Consensus 102 ~~~~l~~l~~~G~n--rislGvQS~~~~~L~~l~R~~--~~~~~~~ai~~~~~~g~~~v~~Dli-~GlPg---------- 166 (370)
T PRK06294 102 SESYIRALALTGIN--RISIGVQTFDDPLLKLLGRTH--SSSKAIDAVQECSEHGFSNLSIDLI-YGLPT---------- 166 (370)
T ss_pred CHHHHHHHHHCCCC--EEEEccccCCHHHHHHcCCCC--CHHHHHHHHHHHHHcCCCeEEEEee-cCCCC----------
Confidence 36889999999998 5555444443331 12211 244567788999999997 445554 45552
Q ss_pred ChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 157 NRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 157 ~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
++.+.|.+-.+.+.+-=-+.|..+...=||...
T Consensus 167 --qt~~~~~~~l~~~~~l~~~~is~y~l~~~~gT~ 199 (370)
T PRK06294 167 --QSLSDFIVDLHQAITLPITHISLYNLTIDPHTS 199 (370)
T ss_pred --CCHHHHHHHHHHHHccCCCeEEEeeeEecCCCh
Confidence 345667777777665333678888888888754
No 106
>PRK14510 putative bifunctional 4-alpha-glucanotransferase/glycogen debranching enzyme; Provisional
Probab=60.98 E-value=11 Score=41.48 Aligned_cols=57 Identities=16% Similarity=0.246 Sum_probs=37.5
Q ss_pred HHHHHHHHcCCCcceeccccCcc----------------ccCCCCCCChh----hhhHHHHHHHHHHHcCCcceeec
Q 024821 83 EDVKLMADTGLDAYRFSISWSRL----------------IPNGRGPVNPK----GLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 83 eDi~l~k~lG~~~~R~si~WsRI----------------~P~g~g~~n~~----~l~~y~~~id~l~~~GI~p~vtL 139 (262)
+.|+.+|+||++++=+.=-.... .|..--.+++. ..+=++++|++|.++||++|+++
T Consensus 191 ~~i~yLk~LGvt~I~L~Pi~~~~~~~~~~~~g~~~yWGY~~~~yfa~dp~yg~~~~~efk~lV~~~H~~GI~VILDv 267 (1221)
T PRK14510 191 EAISYLKKLGVSIVELNPIFASVDEHHLPQLGLSNYWGYNTVAFLAPDPRLAPGGEEEFAQAIKEAQSAGIAVILDV 267 (1221)
T ss_pred hhHHHHHHcCCCEEEeCCccccCcccccccccCcCcCCCCCCCCCCcChhhccCcHHHHHHHHHHHHHCCCEEEEEE
Confidence 56889999999998765322111 00000012221 45668999999999999999985
No 107
>KOG1065 consensus Maltase glucoamylase and related hydrolases, glycosyl hydrolase family 31 [Carbohydrate transport and metabolism]
Probab=60.56 E-value=37 Score=35.81 Aligned_cols=103 Identities=17% Similarity=0.326 Sum_probs=66.0
Q ss_pred HHHHHHcCCC--cceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc---CCCCcH-----------h-
Q 024821 85 VKLMADTGLD--AYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH---HLDLPQ-----------A- 147 (262)
Q Consensus 85 i~l~k~lG~~--~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~---H~~~P~-----------~- 147 (262)
++.+.++|+. ..=..|.|-.=. ++-.+|+..+....++++.|.++|++.++.+. +-+... +
T Consensus 317 v~~~~~agiPld~~~~DiDyMd~y--kDFTvd~~~fp~~~~fv~~Lh~~G~kyvliidP~is~~~~y~~y~~g~~~~v~I 394 (805)
T KOG1065|consen 317 VENYRAAGIPLDVIVIDIDYMDGY--KDFTVDKVWFPDLKDFVDDLHARGFKYVLIIDPFISTNSSYGPYDRGVAKDVLI 394 (805)
T ss_pred HHHHHHcCCCcceeeeehhhhhcc--cceeeccccCcchHHHHHHHHhCCCeEEEEeCCccccCccchhhhhhhhhceee
Confidence 4556666665 554555553222 13567777777899999999999999999886 323320 1
Q ss_pred ---------HHHhhCC------CCChhhHHHHHHHHHHHHHHhCCCCc---eEEeccCCchhcc
Q 024821 148 ---------LEDEYGG------WINRMIVKDFTAYADVCFREFGDRVS---YWTTVNEPNGFAM 193 (262)
Q Consensus 148 ---------l~~~~gG------w~~~~~~~~F~~ya~~v~~~~gd~V~---~w~t~NEP~~~~~ 193 (262)
+.+-..| ++|+++++.|. ..+++|.+.|. +|+-+|||.-++.
T Consensus 395 ~~~~g~~~~lg~vwP~~~~fpDftnp~~~~Ww~----~~~~~fh~~vp~dg~wiDmnE~snf~~ 454 (805)
T KOG1065|consen 395 KNREGSPKMLGEVWPGSTAFPDFTNPAVVEWWL----DELKRFHDEVPFDGFWIDMNEPSNFPS 454 (805)
T ss_pred ecccCchhhhcccCCCcccccccCCchHHHHHH----HHHHhhcccCCccceEEECCCcccCCC
Confidence 1111122 56676655554 45668888775 6999999987654
No 108
>COG1523 PulA Type II secretory pathway, pullulanase PulA and related glycosidases [Carbohydrate transport and metabolism]
Probab=60.14 E-value=15 Score=38.14 Aligned_cols=62 Identities=19% Similarity=0.390 Sum_probs=42.1
Q ss_pred ccccHHH--HHHHHHcCCCccee----ccccCccccC-CC---------------C--CCCh---hhhhHHHHHHHHHHH
Q 024821 78 YHKYKED--VKLMADTGLDAYRF----SISWSRLIPN-GR---------------G--PVNP---KGLQYYNNLINELIS 130 (262)
Q Consensus 78 y~~~~eD--i~l~k~lG~~~~R~----si~WsRI~P~-g~---------------g--~~n~---~~l~~y~~~id~l~~ 130 (262)
|--..+. |+-+|+||++++.+ ++...+-.++ |. + ..|+ ..+.=++++|++|.+
T Consensus 197 y~gl~~~~~i~yLk~LGvtaVeLLPV~~~~~~~~l~~~gl~n~WGYdP~~fFAp~~~Yss~p~p~~~i~EfK~mV~~lHk 276 (697)
T COG1523 197 YLGLAEPVIIDYLKDLGVTAVELLPVFDFYDEPHLDKSGLNNNWGYDPLNFFAPEGRYASNPEPATRIKEFKDMVKALHK 276 (697)
T ss_pred eehhccccHHHHHHHhCCceEEEecceEEeccccccccccccccCCCcccccCCCccccCCCCcchHHHHHHHHHHHHHH
Confidence 3334444 99999999999984 4444443321 10 0 0122 257789999999999
Q ss_pred cCCcceeec
Q 024821 131 YGIQPHVTL 139 (262)
Q Consensus 131 ~GI~p~vtL 139 (262)
+||++|+++
T Consensus 277 aGI~VILDV 285 (697)
T COG1523 277 AGIEVILDV 285 (697)
T ss_pred cCCEEEEEE
Confidence 999999986
No 109
>TIGR01232 lacD tagatose 1,6-diphosphate aldolase. This family consists of Gram-positive proteins. Tagatose 1,6-diphosphate aldolase is part of the tagatose-6-phosphate pathway of galactose-6-phosphate degradation.
Probab=59.97 E-value=30 Score=32.61 Aligned_cols=59 Identities=15% Similarity=0.238 Sum_probs=50.2
Q ss_pred HHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc
Q 024821 84 DVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP 145 (262)
Q Consensus 84 Di~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P 145 (262)
+.+.+|++|.+++.|=+-|. |+++.++|..-.++.+++.++|++.+|--++-+..||.+
T Consensus 111 s~~rike~GadavK~Llyy~---pD~~~ein~~k~a~vervg~ec~a~dipf~lE~ltYd~~ 169 (325)
T TIGR01232 111 SAKRLKEQGANAVKFLLYYD---VDDAEEINIQKKAYIERIGSECVAEDIPFFLEVLTYDDN 169 (325)
T ss_pred cHHHHHHhCCCeEEEEEEeC---CCCChHHHHHHHHHHHHHHHHHHHCCCCeEEEEeccCCC
Confidence 47889999999999988775 454467888889999999999999999999988877654
No 110
>PRK10933 trehalose-6-phosphate hydrolase; Provisional
Probab=59.46 E-value=21 Score=35.95 Aligned_cols=64 Identities=17% Similarity=0.322 Sum_probs=43.0
Q ss_pred cccccHHHHHHHHHcCCCcceeccccCccc-cCCC-----CCCChh--hhhHHHHHHHHHHHcCCcceeecc
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWSRLI-PNGR-----GPVNPK--GLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~WsRI~-P~g~-----g~~n~~--~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
-+.-..+.++.+++||++++=++--+..-. ..|- -.+|+. ..+-++++|+++.++||++++++.
T Consensus 31 dl~gi~~~ldyl~~lGv~~i~l~P~~~~~~~~~gY~~~d~~~id~~~Gt~~d~~~lv~~~h~~gi~vilD~V 102 (551)
T PRK10933 31 DLRGVTQRLDYLQKLGVDAIWLTPFYVSPQVDNGYDVANYTAIDPTYGTLDDFDELVAQAKSRGIRIILDMV 102 (551)
T ss_pred CHHHHHHhhHHHHhCCCCEEEECCCCCCCCCCCCCCcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEEEC
Confidence 455567889999999999997764332100 0110 112321 345689999999999999999864
No 111
>COG2100 Predicted Fe-S oxidoreductase [General function prediction only]
Probab=58.86 E-value=33 Score=32.75 Aligned_cols=80 Identities=21% Similarity=0.253 Sum_probs=56.6
Q ss_pred cccccHHHHHHHHHcCCCcceeccccCccccC------CCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHH
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWSRLIPN------GRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALED 150 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~------g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~ 150 (262)
-+-.-++=++.+++.|++-+-+|++ .+-|. |...+| +++-.++.+.+.+.||.+++ .|.|+
T Consensus 199 g~~L~~~lv~eLeeAGLdRiNlSv~--aLDpk~Ak~L~G~~dYd---v~kvle~aE~i~~a~idvlI------aPv~l-- 265 (414)
T COG2100 199 GVLLSKKLVDELEEAGLDRINLSVD--ALDPKLAKMLAGRKDYD---VKKVLEVAEYIANAGIDVLI------APVWL-- 265 (414)
T ss_pred ceeccHHHHHHHHHhCCceEEeecc--cCCHHHHHHhcCccccC---HHHHHHHHHHHHhCCCCEEE------eeeec--
Confidence 3445567789999999997777663 44443 222455 88889999999999999887 47787
Q ss_pred hhCCCCChhhHHHHHHHHHHHH
Q 024821 151 EYGGWINRMIVKDFTAYADVCF 172 (262)
Q Consensus 151 ~~gGw~~~~~~~~F~~ya~~v~ 172 (262)
.| .|.+-...+.+||+.+-
T Consensus 266 --PG-~ND~E~~~iIe~A~~iG 284 (414)
T COG2100 266 --PG-VNDDEMPKIIEWAREIG 284 (414)
T ss_pred --CC-cChHHHHHHHHHHHHhC
Confidence 34 34444577888887653
No 112
>COG3589 Uncharacterized conserved protein [Function unknown]
Probab=58.63 E-value=33 Score=32.67 Aligned_cols=69 Identities=19% Similarity=0.260 Sum_probs=47.3
Q ss_pred HHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHH
Q 024821 85 VKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDF 164 (262)
Q Consensus 85 i~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F 164 (262)
|++|.+.|++-+=+|+ +.|++ -+...++.++++++.+.+.|++++|+.. |.-|.. -||. .+.++.|
T Consensus 22 i~~~~~~Gf~~IFtsl----~~~~~---~~~~~~~~~~ell~~Anklg~~vivDvn----Psil~~--l~~S-~~~l~~f 87 (360)
T COG3589 22 IDRMHKYGFKRIFTSL----LIPEE---DAELYFHRFKELLKEANKLGLRVIVDVN----PSILKE--LNIS-LDNLSRF 87 (360)
T ss_pred HHHHHHcCccceeeec----ccCCc---hHHHHHHHHHHHHHHHHhcCcEEEEEcC----HHHHhh--cCCC-hHHHHHH
Confidence 6777888887766654 34442 2345799999999999999999999995 776643 2333 2234444
Q ss_pred HHH
Q 024821 165 TAY 167 (262)
Q Consensus 165 ~~y 167 (262)
.+.
T Consensus 88 ~e~ 90 (360)
T COG3589 88 QEL 90 (360)
T ss_pred HHh
Confidence 444
No 113
>PRK08446 coproporphyrinogen III oxidase; Provisional
Probab=58.56 E-value=27 Score=32.78 Aligned_cols=92 Identities=17% Similarity=0.152 Sum_probs=54.0
Q ss_pred HHHHHHHHHcCCCcceecc-ccC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCcHhHHHhhCCCCCh
Q 024821 82 KEDVKLMADTGLDAYRFSI-SWS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLPQALEDEYGGWINR 158 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si-~Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~l~~~~gGw~~~ 158 (262)
++.++.++++|+|.+-+++ +-+ .+... -|... ..+-..+.++.+++.|+..+ ++|. +++|.
T Consensus 98 ~e~l~~l~~~GvnRiSiGvQS~~~~~L~~-lgR~~--~~~~~~~ai~~lr~~g~~~v~iDli-~GlPg------------ 161 (350)
T PRK08446 98 KAWLKGMKNLGVNRISFGVQSFNEDKLKF-LGRIH--SQKQIIKAIENAKKAGFENISIDLI-YDTPL------------ 161 (350)
T ss_pred HHHHHHHHHcCCCEEEEecccCCHHHHHH-cCCCC--CHHHHHHHHHHHHHcCCCEEEEEee-cCCCC------------
Confidence 6889999999999666665 332 23322 23322 24567889999999999855 6664 45562
Q ss_pred hhHHHHHHHHHHHHHHhCCCCceEEeccCCc
Q 024821 159 MIVKDFTAYADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 159 ~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
++.+.|.+-.+.+.+-=-+.|..+...=||.
T Consensus 162 qt~~~~~~~l~~~~~l~~~~is~y~L~~~~g 192 (350)
T PRK08446 162 DNKKLLKEELKLAKELPINHLSAYSLTIEEN 192 (350)
T ss_pred CCHHHHHHHHHHHHhcCCCEEEeccceecCC
Confidence 2345555555554432223344444333444
No 114
>cd07938 DRE_TIM_HMGL 3-hydroxy-3-methylglutaryl-CoA lyase, catalytic TIM barrel domain. 3-hydroxy-3-methylglutaryl-CoA lyase (HMGL) catalyzes the cleavage of HMG-CoA to acetyl-CoA and acetoacetate, one of the terminal steps in ketone body generation and leucine degradation, and is a key enzyme in the pathway that supplies metabolic fuel to extrahepatic tissues. Mutations in HMGL cause a human autosomal recessive disorder called primary metabolic aciduria that affects ketogenesis and leucine catabolism and can be fatal due to an inability to tolerate hypoglycemia. HMGL has a TIM barrel domain with a catalytic center containing a divalent cation-binding site formed by a cluster of invariant residues that cap the core of the barrel. The cleavage of HMG-CoA requires the presence of a divalent cation like Mg2+ or Mn2+, and the reaction is thought to involve general acid/base catalysis. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropy
Probab=58.40 E-value=32 Score=31.33 Aligned_cols=108 Identities=9% Similarity=0.082 Sum_probs=69.2
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeecc-CCCCc-------HhHHH-
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLH-HLDLP-------QALED- 150 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~-H~~~P-------~~l~~- 150 (262)
-.+|++...+.|++.+++.++=|.+.-.- -+.--++.++...+.++.++++|+++.+++. .|+.| ..+.+
T Consensus 75 ~~~dv~~A~~~g~~~i~i~~~~Sd~~~~~~~~~s~~~~~~~~~~~v~~ak~~G~~v~~~i~~~f~~~~~~~~~~~~~~~~ 154 (274)
T cd07938 75 NLRGAERALAAGVDEVAVFVSASETFSQKNINCSIAESLERFEPVAELAKAAGLRVRGYVSTAFGCPYEGEVPPERVAEV 154 (274)
T ss_pred CHHHHHHHHHcCcCEEEEEEecCHHHHHHHcCCCHHHHHHHHHHHHHHHHHCCCeEEEEEEeEecCCCCCCCCHHHHHHH
Confidence 47899999999999999998766542211 1222356788899999999999999987664 23322 22111
Q ss_pred -----hh----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 151 -----EY----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 151 -----~~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+. .|...| ....++.+.+.+++++.--.+-+-|-..+.
T Consensus 155 ~~~~~~~Ga~~i~l~DT~G~~~P---~~v~~lv~~l~~~~~~~~i~~H~Hnd~GlA 207 (274)
T cd07938 155 AERLLDLGCDEISLGDTIGVATP---AQVRRLLEAVLERFPDEKLALHFHDTRGQA 207 (274)
T ss_pred HHHHHHcCCCEEEECCCCCccCH---HHHHHHHHHHHHHCCCCeEEEEECCCCChH
Confidence 00 234444 556666777777776533345577776653
No 115
>TIGR02660 nifV_homocitr homocitrate synthase NifV. This family consists of the NifV clade of homocitrate synthases, most of which are found in operons for nitrogen fixation. Members are closely homologous to enzymes that include 2-isopropylmalate synthase, (R)-citramalate synthase, and homocitrate synthases associated with other processes. The homocitrate made by this enzyme becomes a part of the iron-molybdenum cofactor of nitrogenase.
Probab=57.87 E-value=24 Score=33.45 Aligned_cols=107 Identities=17% Similarity=0.042 Sum_probs=68.2
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC--CcHhHHH------h
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD--LPQALED------E 151 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~--~P~~l~~------~ 151 (262)
-++|++.+.+.|++.+|+.++-|.+.-.. -+.-.++.++...+.|+.+++.|+++.++.-... -|..+.+ +
T Consensus 74 ~~~di~~a~~~g~~~i~i~~~~Sd~~~~~~~~~s~~e~l~~~~~~i~~ak~~g~~v~~~~ed~~r~~~~~l~~~~~~~~~ 153 (365)
T TIGR02660 74 RDADIEAAARCGVDAVHISIPVSDLQIEAKLRKDRAWVLERLARLVSFARDRGLFVSVGGEDASRADPDFLVELAEVAAE 153 (365)
T ss_pred CHHHHHHHHcCCcCEEEEEEccCHHHHHHHhCcCHHHHHHHHHHHHHHHHhCCCEEEEeecCCCCCCHHHHHHHHHHHHH
Confidence 47999999999999999999877643221 1222356788899999999999999887765321 1222211 1
Q ss_pred h----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 Y----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
. -|-.. +..+.++.+.+.++++-. --+-+-|-..+.
T Consensus 154 ~Ga~~i~l~DT~G~~~---P~~v~~lv~~l~~~~~v~-l~~H~HNd~GlA 199 (365)
T TIGR02660 154 AGADRFRFADTVGILD---PFSTYELVRALRQAVDLP-LEMHAHNDLGMA 199 (365)
T ss_pred cCcCEEEEcccCCCCC---HHHHHHHHHHHHHhcCCe-EEEEecCCCChH
Confidence 1 23333 356666667776776422 235567766653
No 116
>PRK14511 maltooligosyl trehalose synthase; Provisional
Probab=57.30 E-value=32 Score=36.75 Aligned_cols=63 Identities=19% Similarity=0.188 Sum_probs=42.3
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCcc-------ccCCCCCCChh--hhhHHHHHHHHHHHcCCcceeecc
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRL-------IPNGRGPVNPK--GLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI-------~P~g~g~~n~~--~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
+....+-+..+++||++++=+|=-.... .+.....+|++ +.+-+++++++++++||.+|+++.
T Consensus 19 f~~~~~~l~YL~~LGis~IyLsPi~~a~~gs~hGYdv~D~~~idp~lGt~e~f~~Lv~aah~~Gi~VIlDiV 90 (879)
T PRK14511 19 FDDAAELVPYFADLGVSHLYLSPILAARPGSTHGYDVVDHTRINPELGGEEGLRRLAAALRAHGMGLILDIV 90 (879)
T ss_pred HHHHHHHhHHHHHcCCCEEEECcCccCCCCCCCCCCcCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEEEec
Confidence 4456788899999999988666433211 00001123332 446689999999999999999863
No 117
>TIGR03581 EF_0839 conserved hypothetical protein EF_0839/AHA_3917. Members of this family of relatively uncommon proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown.
Probab=57.10 E-value=27 Score=31.33 Aligned_cols=74 Identities=18% Similarity=0.371 Sum_probs=49.7
Q ss_pred ccccHHHHHHHHHcCCCccee----------------------ccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcc
Q 024821 78 YHKYKEDVKLMADTGLDAYRF----------------------SISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQP 135 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~----------------------si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p 135 (262)
--.-+.-|+|+++||.+++.| ++ | ++|. |-+| ++.+..++..+++.|++-
T Consensus 134 iV~vetAiaml~dmG~~SiKffPM~Gl~~leE~~avA~aca~~g~-~--lEPT--GGId---l~Nf~~I~~i~ldaGv~k 205 (236)
T TIGR03581 134 IVPIETAIAMLKDMGGSSVKFFPMGGLKHLEEYAAVAKACAKHGF-Y--LEPT--GGID---LDNFEEIVQIALDAGVEK 205 (236)
T ss_pred eeeHHHHHHHHHHcCCCeeeEeecCCcccHHHHHHHHHHHHHcCC-c--cCCC--CCcc---HHhHHHHHHHHHHcCCCe
Confidence 345677799999999998774 33 2 5776 6777 778899999999999986
Q ss_pred eeeccCCCCcHhHHHhhCCCCChhhHHH
Q 024821 136 HVTLHHLDLPQALEDEYGGWINRMIVKD 163 (262)
Q Consensus 136 ~vtL~H~~~P~~l~~~~gGw~~~~~~~~ 163 (262)
++ .|- -..+-++-.|-++++-+..
T Consensus 206 vi--PHI--YssiIDk~tG~TrpedV~~ 229 (236)
T TIGR03581 206 VI--PHV--YSSIIDKETGNTRVEDVKQ 229 (236)
T ss_pred ec--ccc--ceeccccccCCCCHHHHHH
Confidence 63 221 1112233356677665443
No 118
>PLN02447 1,4-alpha-glucan-branching enzyme
Probab=56.98 E-value=15 Score=38.60 Aligned_cols=100 Identities=13% Similarity=0.165 Sum_probs=58.0
Q ss_pred cccccHH-HHHHHHHcCCCcceeccccCcc-------ccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeecc--CCCC
Q 024821 77 EYHKYKE-DVKLMADTGLDAYRFSISWSRL-------IPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTLH--HLDL 144 (262)
Q Consensus 77 ~y~~~~e-Di~l~k~lG~~~~R~si~WsRI-------~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL~--H~~~ 144 (262)
.|..+.+ -++.+|+||+|++=+.=-...- .|..--.++. ...+-++++|++|.++||.+++++. |..-
T Consensus 248 ty~~~~~~~L~ylk~LG~t~I~LmPi~e~~~~~~wGY~~~~~fa~~~~~Gtp~dlk~LVd~aH~~GI~VilDvV~nH~~~ 327 (758)
T PLN02447 248 SYREFADDVLPRIKALGYNAVQLMAIQEHAYYGSFGYHVTNFFAVSSRSGTPEDLKYLIDKAHSLGLRVLMDVVHSHASK 327 (758)
T ss_pred CHHHHHHHHHHHHHHcCCCEEEECCccccCCCCCCCcCcccCcccccccCCHHHHHHHHHHHHHCCCEEEEEeccccccc
Confidence 3444333 4899999999999765322211 1110001111 1234589999999999999999875 4321
Q ss_pred -------------cHhHHHhhCC----C-------CChhhHHHHHHHHHHHHHHhC
Q 024821 145 -------------PQALEDEYGG----W-------INRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 145 -------------P~~l~~~~gG----w-------~~~~~~~~F~~ya~~v~~~~g 176 (262)
+.|+.....| | .++++.+.+.+=++.-+++|+
T Consensus 328 ~~~~gl~~fDg~~~~Yf~~~~~g~~~~w~~~~~N~~~~eVr~fLl~~~~~Wl~ey~ 383 (758)
T PLN02447 328 NTLDGLNGFDGTDGSYFHSGPRGYHWLWDSRLFNYGNWEVLRFLLSNLRWWLEEYK 383 (758)
T ss_pred cccccccccCCCCccccccCCCCCcCcCCCceecCCCHHHHHHHHHHHHHHHHHhC
Confidence 1222110011 2 246777777777787788775
No 119
>PF03659 Glyco_hydro_71: Glycosyl hydrolase family 71 ; InterPro: IPR005197 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is a family of alpha-1,3-glucanases belonging to glycoside hydrolase family 71 (GH71 from CAZY).
Probab=56.78 E-value=46 Score=32.06 Aligned_cols=52 Identities=12% Similarity=0.323 Sum_probs=40.4
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
..|++||++.+++|++.|=+.|. . ....+ .+....+++.+.+.|.+.++.+-
T Consensus 17 ~dw~~di~~A~~~GIDgFaLNig------~-~d~~~---~~~l~~a~~AA~~~gFKlf~SfD 68 (386)
T PF03659_consen 17 EDWEADIRLAQAAGIDGFALNIG------S-SDSWQ---PDQLADAYQAAEAVGFKLFFSFD 68 (386)
T ss_pred HHHHHHHHHHHHcCCCEEEEecc------c-CCccc---HHHHHHHHHHHHhcCCEEEEEec
Confidence 35899999999999999999886 1 12333 35678888999999988887664
No 120
>TIGR03234 OH-pyruv-isom hydroxypyruvate isomerase. This enzyme interconverts tartronate semi-aldehyde (TSA, aka 2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E. coli enzyme has been characterized and found to be specific for TSA, contain no cofactors, and have a rather high Km for hydroxypyruvate of 12.5 mM. The gene is ofter found in association with glyoxalate carboligase (which produces TSA), but has been shown to have no effect on growth on glyoxalate when knocked out. This is consistent with the fact that the gene for tartronate semialdehyde reductase (glxR) is also associated and may have primary responsibility for the catabolism of TSA.
Probab=56.69 E-value=25 Score=30.94 Aligned_cols=65 Identities=11% Similarity=0.140 Sum_probs=41.6
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCC--CCCCChhhhhHHHHHHHHHHHcCCcceee-ccCCCCc
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNG--RGPVNPKGLQYYNNLINELISYGIQPHVT-LHHLDLP 145 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g--~g~~n~~~l~~y~~~id~l~~~GI~p~vt-L~H~~~P 145 (262)
-..+++=|++++++|.+.+|+..... |.. ..+.....++..+++.+.+.+.||...+= +.+++.|
T Consensus 83 ~~~~~~~i~~a~~lg~~~i~~~~g~~---~~~~~~~~~~~~~~~~l~~l~~~A~~~gi~l~lE~~~~~~~~ 150 (254)
T TIGR03234 83 REGVALAIAYARALGCPQVNCLAGKR---PAGVSPEEARATLVENLRYAADALDRIGLTLLIEPINSFDMP 150 (254)
T ss_pred HHHHHHHHHHHHHhCCCEEEECcCCC---CCCCCHHHHHHHHHHHHHHHHHHHHhcCCEEEEEECCcccCC
Confidence 34566778899999999998644321 211 11222344567888888899999987774 3444444
No 121
>PRK08599 coproporphyrinogen III oxidase; Provisional
Probab=56.36 E-value=31 Score=32.58 Aligned_cols=95 Identities=17% Similarity=0.317 Sum_probs=54.8
Q ss_pred cHHHHHHHHHcCCCcceecc-cc-CccccCCCCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCcHhHHHhhCCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSI-SW-SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLPQALEDEYGGWIN 157 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si-~W-sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~l~~~~gGw~~ 157 (262)
-+|.+++|+++|++.+-+++ +- .++... -+... ..+-..+.++.+++.|+..+ +++. +++|.
T Consensus 99 ~~e~l~~l~~~G~~rvsiGvqS~~~~~l~~-l~r~~--~~~~~~~~i~~l~~~g~~~v~~dli-~GlPg----------- 163 (377)
T PRK08599 99 TKEKLQVLKDSGVNRISLGVQTFNDELLKK-IGRTH--NEEDVYEAIANAKKAGFDNISIDLI-YALPG----------- 163 (377)
T ss_pred CHHHHHHHHHcCCCEEEEecccCCHHHHHH-cCCCC--CHHHHHHHHHHHHHcCCCcEEEeee-cCCCC-----------
Confidence 46889999999999666655 22 233322 12211 25567889999999999743 3342 45552
Q ss_pred hhhHHHHHHHHHHHHHHhC-CCCceEEeccCCchhc
Q 024821 158 RMIVKDFTAYADVCFREFG-DRVSYWTTVNEPNGFA 192 (262)
Q Consensus 158 ~~~~~~F~~ya~~v~~~~g-d~V~~w~t~NEP~~~~ 192 (262)
++.+.|.+=.+.+.+ ++ +.|..+...-+|....
T Consensus 164 -qt~~~~~~~l~~~~~-l~~~~i~~y~l~~~pgT~~ 197 (377)
T PRK08599 164 -QTIEDFKESLAKALA-LDIPHYSAYSLILEPKTVF 197 (377)
T ss_pred -CCHHHHHHHHHHHHc-cCCCEEeeeceeecCCChh
Confidence 233455555555433 33 3444444445776543
No 122
>PRK08195 4-hyroxy-2-oxovalerate/4-hydroxy-2-oxopentanoic acid aldolase,; Validated
Probab=56.35 E-value=75 Score=29.89 Aligned_cols=93 Identities=16% Similarity=0.211 Sum_probs=63.4
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc--CCCCcHhHHHh------h-
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH--HLDLPQALEDE------Y- 152 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~--H~~~P~~l~~~------~- 152 (262)
.+|++...+.|++.+|+....++. +--.+.|+.+++.|+++.+++. |-..|..+.+. +
T Consensus 91 ~~dl~~a~~~gvd~iri~~~~~e~-------------~~~~~~i~~ak~~G~~v~~~l~~a~~~~~e~l~~~a~~~~~~G 157 (337)
T PRK08195 91 VDDLKMAYDAGVRVVRVATHCTEA-------------DVSEQHIGLARELGMDTVGFLMMSHMAPPEKLAEQAKLMESYG 157 (337)
T ss_pred HHHHHHHHHcCCCEEEEEEecchH-------------HHHHHHHHHHHHCCCeEEEEEEeccCCCHHHHHHHHHHHHhCC
Confidence 589999999999999998865553 2258889999999999888775 33334443321 1
Q ss_pred ---------CCCCChhhHHHHHHHHHHHHHHhCCCCce-EEeccCCch
Q 024821 153 ---------GGWINRMIVKDFTAYADVCFREFGDRVSY-WTTVNEPNG 190 (262)
Q Consensus 153 ---------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~-w~t~NEP~~ 190 (262)
.|...| +...++.+.+-+++++.+.. +-.-|-..+
T Consensus 158 a~~i~i~DT~G~~~P---~~v~~~v~~l~~~l~~~i~ig~H~HnnlGl 202 (337)
T PRK08195 158 AQCVYVVDSAGALLP---EDVRDRVRALRAALKPDTQVGFHGHNNLGL 202 (337)
T ss_pred CCEEEeCCCCCCCCH---HHHHHHHHHHHHhcCCCCeEEEEeCCCcch
Confidence 344444 66677777777888644443 446666664
No 123
>PF12891 Glyco_hydro_44: Glycoside hydrolase family 44; InterPro: IPR024745 This is a family of putative bacterial glycoside hydrolases.; PDB: 3IK2_A 3ZQ9_A 2YJQ_B 2YKK_A 2YIH_A 2EEX_A 2EQD_A 2E0P_A 2E4T_A 2EO7_A ....
Probab=55.96 E-value=33 Score=31.04 Aligned_cols=106 Identities=13% Similarity=0.203 Sum_probs=54.9
Q ss_pred hHHHHHHHHHHHcCCcceeeccCC--------------CCcHh--HH--------------HhhCC----CCChh-----
Q 024821 119 QYYNNLINELISYGIQPHVTLHHL--------------DLPQA--LE--------------DEYGG----WINRM----- 159 (262)
Q Consensus 119 ~~y~~~id~l~~~GI~p~vtL~H~--------------~~P~~--l~--------------~~~gG----w~~~~----- 159 (262)
+.++.+|+.-++.|..+|+||-=- ..|.+ -. .+.+| -.+|+
T Consensus 24 ~~~~~f~~~~~~~ga~~m~T~pm~G~Vakd~~~~~~~~~fp~~~y~~Q~~~d~~~~~~Gng~~~~~~~~~~~~P~~~~~~ 103 (239)
T PF12891_consen 24 DVADTFIDQNLAAGAYSMMTLPMIGYVAKDANSVSESESFPSWRYGPQQWFDPWNPDCGNGVKPDKTALTSNDPDTPDNP 103 (239)
T ss_dssp HHHHHHHHHHHHTT-EEEEEE--SSEEES-BSEGBGGGTSSSTTEEEBS-EETTEEEEE-SEESTSSS--SSSGGSSSSE
T ss_pred HHHHHHHHHhhhcCcceeEeecccceEecCCCCcccccCCChhhcccccccCcCcCCCCccccCCCCCCCCCCCCCCccH
Confidence 678999999999999999998511 11111 00 00011 11333
Q ss_pred -hHHHHHHHHHHHHHHhCCC-----CceEEeccCCchhccccccccCCCCCCCCCCCCCCCCCCCCchhHHHHHHHHHHH
Q 024821 160 -IVKDFTAYADVCFREFGDR-----VSYWTTVNEPNGFAMVGYDFGIAPPKRCSPPLNNCSRGNSSTEPYMAVHHLLLAH 233 (262)
Q Consensus 160 -~~~~F~~ya~~v~~~~gd~-----V~~w~t~NEP~~~~~~gy~~g~~~p~~~~~~~~~~~~~~~~~~~~~~~hn~llAH 233 (262)
.++.| +..+..+||.. |++|..=|||.+...- --..+|- ...+.-+....++.
T Consensus 104 ~y~~ew---V~~l~~~~g~a~~~~gvk~y~lDNEP~LW~~T--H~dVHP~----------------~~t~~El~~r~i~~ 162 (239)
T PF12891_consen 104 VYMDEW---VNYLVNKYGNASTNGGVKYYSLDNEPDLWHST--HRDVHPE----------------PVTYDELRDRSIEY 162 (239)
T ss_dssp EEHHHH---HHHHHHHH--TTSTTS--EEEESS-GGGHHHH--TTTT--S-------------------HHHHHHHHHHH
T ss_pred hHHHHH---HHHHHHHHhccccCCCceEEEecCchHhhccc--ccccCCC----------------CCCHHHHHHHHHHH
Confidence 44555 66677777764 9999999999976320 0011221 12355566666777
Q ss_pred HHHHHHHHHhcCCCc
Q 024821 234 ASVARLYKKNYQVIS 248 (262)
Q Consensus 234 a~a~~~~~~~~~~~~ 248 (262)
|+|+| +..|.++
T Consensus 163 AkaiK---~~DP~a~ 174 (239)
T PF12891_consen 163 AKAIK---AADPDAK 174 (239)
T ss_dssp HHHHH---HH-TTSE
T ss_pred HHHHH---hhCCCCe
Confidence 77765 4677764
No 124
>PF10566 Glyco_hydro_97: Glycoside hydrolase 97 ; InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=55.08 E-value=17 Score=33.37 Aligned_cols=70 Identities=20% Similarity=0.388 Sum_probs=48.9
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhh
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMI 160 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~ 160 (262)
.++-+++++++|++.+.+++= +.=+++.+++|+++++.+.++++ +|.+|--..|.-+...|....++|-
T Consensus 108 ~~~~f~~~~~~Gv~GvKidF~---------~~d~Q~~v~~y~~i~~~AA~~~L--mvnfHg~~kPtG~~RTyPN~mT~Eg 176 (273)
T PF10566_consen 108 LDEAFKLYAKWGVKGVKIDFM---------DRDDQEMVNWYEDILEDAAEYKL--MVNFHGATKPTGLRRTYPNLMTREG 176 (273)
T ss_dssp HHHHHHHHHHCTEEEEEEE-----------SSTSHHHHHHHHHHHHHHHHTT---EEEETTS---TTHHHCSTTEEEE--
T ss_pred HHHHHHHHHHcCCCEEeeCcC---------CCCCHHHHHHHHHHHHHHHHcCc--EEEecCCcCCCcccccCccHHHHHH
Confidence 366789999999999998862 12357889999999999999986 6777766678777666665555554
Q ss_pred H
Q 024821 161 V 161 (262)
Q Consensus 161 ~ 161 (262)
+
T Consensus 177 V 177 (273)
T PF10566_consen 177 V 177 (273)
T ss_dssp S
T ss_pred h
Confidence 3
No 125
>PRK09505 malS alpha-amylase; Reviewed
Probab=55.02 E-value=25 Score=36.46 Aligned_cols=60 Identities=17% Similarity=0.279 Sum_probs=42.0
Q ss_pred cHHHHHHHHHcCCCcceeccccCccc-----------c----CC-----CCCCChh--hhhHHHHHHHHHHHcCCcceee
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLI-----------P----NG-----RGPVNPK--GLQYYNNLINELISYGIQPHVT 138 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~-----------P----~g-----~g~~n~~--~l~~y~~~id~l~~~GI~p~vt 138 (262)
..+-++.+++||++++=++--...+. | .| --.+|+. ..+-++++|+++.++||++|++
T Consensus 232 i~~kLdyl~~LGv~aIwlsPi~~~~~~~~~~g~~g~~~~~~yhgY~~~D~~~id~~~Gt~~dfk~Lv~~aH~~Gi~VilD 311 (683)
T PRK09505 232 LTEKLDYLQQLGVNALWISSPLEQIHGWVGGGTKGDFPHYAYHGYYTLDWTKLDANMGTEADLRTLVDEAHQRGIRILFD 311 (683)
T ss_pred HHHhhHHHHHcCCCEEEeCccccccccccccccccCCCcCCCCCCCccccccCCCCCCCHHHHHHHHHHHHHCCCEEEEE
Confidence 55668899999999998875444331 1 01 0123332 4567999999999999999998
Q ss_pred cc
Q 024821 139 LH 140 (262)
Q Consensus 139 L~ 140 (262)
+.
T Consensus 312 ~V 313 (683)
T PRK09505 312 VV 313 (683)
T ss_pred EC
Confidence 53
No 126
>PRK11858 aksA trans-homoaconitate synthase; Reviewed
Probab=54.81 E-value=30 Score=32.97 Aligned_cols=106 Identities=14% Similarity=0.031 Sum_probs=68.3
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCC--CCcHhHHH------hh
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHL--DLPQALED------EY 152 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~--~~P~~l~~------~~ 152 (262)
++|++.+.+.|++.+|++++-|.+.=.. -+.--++.++...+.++.+++.|+++.++.-.. .-|..+.+ +.
T Consensus 78 ~~di~~a~~~g~~~i~i~~~~Sd~h~~~~~~~s~~~~l~~~~~~v~~a~~~G~~v~~~~ed~~r~~~~~l~~~~~~~~~~ 157 (378)
T PRK11858 78 KSDIDASIDCGVDAVHIFIATSDIHIKHKLKKTREEVLERMVEAVEYAKDHGLYVSFSAEDASRTDLDFLIEFAKAAEEA 157 (378)
T ss_pred HHHHHHHHhCCcCEEEEEEcCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHCCCeEEEEeccCCCCCHHHHHHHHHHHHhC
Confidence 7899999999999999999877753221 133335678899999999999999988875321 11222221 11
Q ss_pred ----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 153 ----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 153 ----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
.|-..| ..+.++.+.+.+++ +..--+-+-|-..+.
T Consensus 158 Ga~~I~l~DT~G~~~P---~~v~~lv~~l~~~~-~~~l~~H~Hnd~GlA 202 (378)
T PRK11858 158 GADRVRFCDTVGILDP---FTMYELVKELVEAV-DIPIEVHCHNDFGMA 202 (378)
T ss_pred CCCEEEEeccCCCCCH---HHHHHHHHHHHHhc-CCeEEEEecCCcCHH
Confidence 234444 55666666666666 322235566666653
No 127
>TIGR02456 treS_nterm trehalose synthase. Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PubMed:15378530).
Probab=54.57 E-value=27 Score=34.87 Aligned_cols=62 Identities=13% Similarity=0.310 Sum_probs=40.8
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccc-cCC-----CCCCChh--hhhHHHHHHHHHHHcCCcceeec
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLI-PNG-----RGPVNPK--GLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~-P~g-----~g~~n~~--~l~~y~~~id~l~~~GI~p~vtL 139 (262)
..-..+-++.+++||++++=++=-.+.-. ..| --.+|++ ..+-++++|+++.++||++|+++
T Consensus 27 l~gi~~~Ldyl~~LGv~~i~L~Pi~~~~~~~~gY~~~dy~~vd~~~Gt~~df~~Lv~~ah~~Gi~vilD~ 96 (539)
T TIGR02456 27 FPGLTSKLDYLKWLGVDALWLLPFFQSPLRDDGYDVSDYRAILPEFGTIDDFKDFVDEAHARGMRVIIDL 96 (539)
T ss_pred HHHHHHhHHHHHHCCCCEEEECCCcCCCCCCCCCCcccccccChhhCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 34456678999999999987653221100 001 0123332 34668999999999999999975
No 128
>cd06545 GH18_3CO4_chitinase The Bacteroides thetaiotaomicron protein represented by pdb structure 3CO4 is an uncharacterized bacterial member of the family 18 glycosyl hydrolases with homologs found in Flavobacterium, Stigmatella, and Pseudomonas.
Probab=54.47 E-value=32 Score=30.59 Aligned_cols=78 Identities=14% Similarity=0.215 Sum_probs=41.0
Q ss_pred ccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhH-------------------HHhh--CC---
Q 024821 99 SISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQAL-------------------EDEY--GG--- 154 (262)
Q Consensus 99 si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l-------------------~~~~--gG--- 154 (262)
.+.|..+.++|.-.+.. .......+++.++++|+++++.+..+...... ..+| .|
T Consensus 27 ~~~f~~i~~~G~l~~~~-~~~~~~~~~~~~~~~~~kvl~sigg~~~~~~~~~~~~~~~r~~fi~~lv~~~~~~~~DGIdi 105 (253)
T cd06545 27 NLAFANPDANGTLNANP-VRSELNSVVNAAHAHNVKILISLAGGSPPEFTAALNDPAKRKALVDKIINYVVSYNLDGIDV 105 (253)
T ss_pred EEEEEEECCCCeEEecC-cHHHHHHHHHHHHhCCCEEEEEEcCCCCCcchhhhcCHHHHHHHHHHHHHHHHHhCCCceeE
Confidence 34455555543211111 12345667777777788888777655433110 0012 23
Q ss_pred -CCChhh-HHHHHHHHHHHHHHhCC
Q 024821 155 -WINRMI-VKDFTAYADVCFREFGD 177 (262)
Q Consensus 155 -w~~~~~-~~~F~~ya~~v~~~~gd 177 (262)
|+.+.. .+.|..+++.+-++++.
T Consensus 106 DwE~~~~~~~~~~~fv~~Lr~~l~~ 130 (253)
T cd06545 106 DLEGPDVTFGDYLVFIRALYAALKK 130 (253)
T ss_pred EeeccCccHhHHHHHHHHHHHHHhh
Confidence 444322 46788888888777753
No 129
>PF01055 Glyco_hydro_31: Glycosyl hydrolases family 31 ; InterPro: IPR000322 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 31 GH31 from CAZY comprises enzymes with several known activities; alpha-glucosidase (3.2.1.20 from EC), alpha-galactosidase (3.2.1.22 from EC); glucoamylase (3.2.1.3 from EC), sucrase-isomaltase (3.2.1.48 from EC); isomaltase (3.2.1.10 from EC); alpha-xylosidase (3.2.1 from EC); alpha-glucan lyase (4.2.2.13 from EC). Glycoside hydrolase family 31 groups a number of glycosyl hydrolases on the basis of sequence similarities [, , ] An aspartic acid has been implicated [] in the catalytic activity of sucrase, isomaltase, and lysosomal alpha-glucosidase.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3L4U_A 3L4X_A 3L4W_A 3L4V_A 3CTT_A 2QMJ_A 2QLY_A 3L4Z_A 3L4Y_A 3L4T_A ....
Probab=53.49 E-value=49 Score=31.79 Aligned_cols=108 Identities=18% Similarity=0.262 Sum_probs=65.9
Q ss_pred cHHHHHHHHHcCCC--cceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccC---CCCc---HhHHHh-
Q 024821 81 YKEDVKLMADTGLD--AYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHH---LDLP---QALEDE- 151 (262)
Q Consensus 81 ~~eDi~l~k~lG~~--~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H---~~~P---~~l~~~- 151 (262)
..+-++.+++.|+. ++=++..|..-.. .-.+|++.+.-.+++++.|+++|++.++.++- -+.+ ..-..+
T Consensus 45 v~~~i~~~~~~~iP~d~~~iD~~~~~~~~--~f~~d~~~FPd~~~~~~~l~~~G~~~~~~~~P~v~~~~~~~~~~~~~~~ 122 (441)
T PF01055_consen 45 VREVIDRYRSNGIPLDVIWIDDDYQDGYG--DFTWDPERFPDPKQMIDELHDQGIKVVLWVHPFVSNDSPDYENYDEAKE 122 (441)
T ss_dssp HHHHHHHHHHTT--EEEEEE-GGGSBTTB--TT-B-TTTTTTHHHHHHHHHHTT-EEEEEEESEEETTTTB-HHHHHHHH
T ss_pred HHHHHHHHHHcCCCccceecccccccccc--ccccccccccchHHHHHhHhhCCcEEEEEeecccCCCCCcchhhhhHhh
Confidence 45666777776654 4445555655322 35677777777899999999999997776642 1222 211100
Q ss_pred -------hCC----------------CCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 -------YGG----------------WINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 -------~gG----------------w~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
-.| +.|++..+.|.+..+.+++.+| ---+|.=+|||..+
T Consensus 123 ~~~~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~w~~~~~~~~~~~~G-vdg~w~D~~E~~~~ 184 (441)
T PF01055_consen 123 KGYLVKNPDGSPYIGRVWPGKGGFIDFTNPEARDWWKEQLKELLDDYG-VDGWWLDFGEPSSF 184 (441)
T ss_dssp TT-BEBCTTSSB-EEEETTEEEEEB-TTSHHHHHHHHHHHHHHHTTST--SEEEEESTTTBSS
T ss_pred cCceeecccCCcccccccCCcccccCCCChhHHHHHHHHHHHHHhccC-CceEEeecCCcccc
Confidence 012 6788888888887777776654 24568899999975
No 130
>TIGR02100 glgX_debranch glycogen debranching enzyme GlgX. This family consists of the GlgX protein from the E. coli glycogen operon and probable equivalogs from other prokaryotic species. GlgX is not required for glycogen biosynthesis, but instead acts as a debranching enzyme for glycogen catabolism. This model distinguishes GlgX from pullanases and other related proteins that also operate on alpha-1,6-glycosidic linkages. In the wide band between the trusted and noise cutoffs are functionally similar enzymes, mostly from plants, that act similarly but usually are termed isoamylase.
Probab=53.27 E-value=25 Score=36.45 Aligned_cols=56 Identities=13% Similarity=0.189 Sum_probs=36.6
Q ss_pred HHHHHHcCCCcceeccccCc----------------cccCCCCCCChh-----hhhHHHHHHHHHHHcCCcceeecc
Q 024821 85 VKLMADTGLDAYRFSISWSR----------------LIPNGRGPVNPK-----GLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 85 i~l~k~lG~~~~R~si~WsR----------------I~P~g~g~~n~~-----~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
|+.+|+||++++=+.=--.- -.|..--.++.. .++=++++|++|.++||++|+++.
T Consensus 190 LdyLk~LGvtaI~L~Pi~~~~~~~~~~~~~~~~ywGYd~~~y~a~d~~y~~~g~~~efk~LV~~~H~~GI~VIlDvV 266 (688)
T TIGR02100 190 IDYLKKLGVTAVELLPVHAFIDDRHLLEKGLRNYWGYNTLGFFAPEPRYLASGQVAEFKTMVRALHDAGIEVILDVV 266 (688)
T ss_pred hHHHHHcCCCEEEECCcccCCccccccccCCCCccCcCcccccccChhhcCCCCHHHHHHHHHHHHHCCCEEEEEEC
Confidence 88999999999876531111 111100112221 255699999999999999999863
No 131
>PRK00366 ispG 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; Reviewed
Probab=52.57 E-value=73 Score=30.53 Aligned_cols=72 Identities=22% Similarity=0.371 Sum_probs=52.8
Q ss_pred HHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHH
Q 024821 89 ADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYA 168 (262)
Q Consensus 89 k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya 168 (262)
.+.|++.+| |-| |.+-.. -+..+.+++.++++|+..=+...|-.++.-+.++||+-..+-.++--.+++
T Consensus 98 ~~~G~~~iR-------INP---GNig~~-~~~v~~vv~~ak~~~ipIRIGvN~GSL~~~~~~~yg~~t~eamveSAl~~~ 166 (360)
T PRK00366 98 AEAGADALR-------INP---GNIGKR-DERVREVVEAAKDYGIPIRIGVNAGSLEKDLLEKYGEPTPEALVESALRHA 166 (360)
T ss_pred HHhCCCEEE-------ECC---CCCCch-HHHHHHHHHHHHHCCCCEEEecCCccChHHHHHHcCCCCHHHHHHHHHHHH
Confidence 366899885 544 444210 256899999999999999999999999999999986533344666666666
Q ss_pred HHH
Q 024821 169 DVC 171 (262)
Q Consensus 169 ~~v 171 (262)
+.+
T Consensus 167 ~~l 169 (360)
T PRK00366 167 KIL 169 (360)
T ss_pred HHH
Confidence 653
No 132
>TIGR02629 L_rham_iso_rhiz L-rhamnose catabolism isomerase, Pseudomonas stutzeri subtype. Members of this family are isomerases in the pathway of L-rhamnose catabolism as found in Pseudomonas stutzeri and in a number of the Rhizobiales. This family differs from the L-rhamnose isomerases of Escherichia coli (see TIGR01748). This enzyme catalyzes the isomerization step in rhamnose catabolism. Genetic evidence in Rhizobium leguminosarum bv. trifolii suggests phosphorylation occurs first, then isomerization of the the phosphorylated sugar, but characterization of the recombinant enzyme from Pseudomonas stutzeri does show L-rhamnose isomerase activity. The name given is deliberately vague because the relative order of phosphorylation and isomerization is unclear.
Probab=51.61 E-value=78 Score=30.91 Aligned_cols=89 Identities=13% Similarity=0.285 Sum_probs=54.2
Q ss_pred cHHHHHHHHHcCCCcceecc--ccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCcHh--HHHhhCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSI--SWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLPQA--LEDEYGGW 155 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si--~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~--l~~~~gGw 155 (262)
-.+|++.+.++.--.-|+++ .|.. +|.+.++ +.++++||..- +...-|.-|+. -.-++|..
T Consensus 72 ~i~D~~~v~~Lt~~~~~v~LH~~wd~--------vD~~elk------~~A~e~GL~lda~Npn~Fs~~~~q~~~yk~GSL 137 (412)
T TIGR02629 72 KLEDCAVIQQLTRATPNVSLHIPWDK--------ADPKELK------ARGSALGLGFDAMNSNTFSDAPGQAHSYKFGSL 137 (412)
T ss_pred HHHHHHHHHhhcCCCCCccccCCCCc--------CCHHHHH------HHHHHcCCccceeccccccCccccccccccccc
Confidence 35788888877766666664 7722 3544444 88999999987 66555655522 01123556
Q ss_pred CCh--hhHHHHHHHHHHH---HHHhCCC-CceEE
Q 024821 156 INR--MIVKDFTAYADVC---FREFGDR-VSYWT 183 (262)
Q Consensus 156 ~~~--~~~~~F~~ya~~v---~~~~gd~-V~~w~ 183 (262)
.|| ++.+...+-+..| .+++|.+ |+.|+
T Consensus 138 tnPD~~VR~~AIeh~~~~i~Ig~elGs~~v~IW~ 171 (412)
T TIGR02629 138 SHTDAATRRQAVEHNLECIEIGKALGSKALTVWI 171 (412)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHhCCCeeEEEC
Confidence 654 4666666665554 5666664 55565
No 133
>PRK14706 glycogen branching enzyme; Provisional
Probab=51.33 E-value=96 Score=31.94 Aligned_cols=90 Identities=18% Similarity=0.220 Sum_probs=53.9
Q ss_pred HHHHHcCCCcceecc--------ccCccccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeecc--CCCC---------
Q 024821 86 KLMADTGLDAYRFSI--------SWSRLIPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTLH--HLDL--------- 144 (262)
Q Consensus 86 ~l~k~lG~~~~R~si--------~WsRI~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL~--H~~~--------- 144 (262)
+.+|+||++++=+.= +|- -.|.+--.++. ...+=++++|++|.++||++++++. |+.-
T Consensus 175 ~ylk~lG~t~velmPv~e~~~~~~wG-Y~~~~~~~~~~~~g~~~~~~~lv~~~H~~gi~VilD~v~nH~~~~~~~l~~~d 253 (639)
T PRK14706 175 EYVTYMGYTHVELLGVMEHPFDGSWG-YQVTGYYAPTSRLGTPEDFKYLVNHLHGLGIGVILDWVPGHFPTDESGLAHFD 253 (639)
T ss_pred HHHHHcCCCEEEccchhcCCCCCCCC-cCcccccccccccCCHHHHHHHHHHHHHCCCEEEEEecccccCcchhhhhccC
Confidence 678999999976542 120 01110001111 1245589999999999999999853 5421
Q ss_pred --cHh-HHHhhC----CC-------CChhhHHHHHHHHHHHHHHhC
Q 024821 145 --PQA-LEDEYG----GW-------INRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 145 --P~~-l~~~~g----Gw-------~~~~~~~~F~~ya~~v~~~~g 176 (262)
|.+ ..+... .| .++++.+.+.+=++.-+++|+
T Consensus 254 g~~~y~~~~~~~g~~~~w~~~~~~~~~~eVr~~l~~~~~~W~~e~~ 299 (639)
T PRK14706 254 GGPLYEYADPRKGYHYDWNTYIFDYGRNEVVMFLIGSALKWLQDFH 299 (639)
T ss_pred CCcceeccCCcCCcCCCCCCcccCCCCHHHHHHHHHHHHHHHHHhC
Confidence 111 000001 12 267888888899999899887
No 134
>PRK08208 coproporphyrinogen III oxidase; Validated
Probab=51.31 E-value=30 Score=33.48 Aligned_cols=61 Identities=13% Similarity=0.193 Sum_probs=39.1
Q ss_pred cHHHHHHHHHcCCCcceecc-ccC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCc
Q 024821 81 YKEDVKLMADTGLDAYRFSI-SWS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLP 145 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si-~Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P 145 (262)
-++.+++|+++|++.+.+++ +-+ ++... -+.-. ..+-..+.|+.+++.|+..+ ++|. +++|
T Consensus 140 t~e~l~~l~~~G~~rvslGvQS~~~~~L~~-l~R~~--~~~~~~~ai~~l~~~g~~~i~~dlI-~GlP 203 (430)
T PRK08208 140 TAEKLALLAARGVNRLSIGVQSFHDSELHA-LHRPQ--KRADVHQALEWIRAAGFPILNIDLI-YGIP 203 (430)
T ss_pred CHHHHHHHHHcCCCEEEEecccCCHHHHHH-hCCCC--CHHHHHHHHHHHHHcCCCeEEEEee-cCCC
Confidence 46889999999998555555 231 22222 12221 24567889999999999864 5553 4555
No 135
>cd06595 GH31_xylosidase_XylS-like This family represents an uncharacterized glycosyl hydrolase family 31 (GH31) enzyme found in bacteria and eukaryotes that is related to the XylS xylosidase of Sulfolobus solfataricus. Alpha-xylosidases catalyze the release of an alpha-xylose residue from the non-reducing end of alpha-xyloside substrates. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=50.86 E-value=1.1e+02 Score=27.93 Aligned_cols=109 Identities=14% Similarity=0.183 Sum_probs=59.8
Q ss_pred HHHHHHHHHcCC--CcceeccccCccc--c---CC--CCCCChhhhhHHHHHHHHHHHcCCcceeeccCC-CC-c---Hh
Q 024821 82 KEDVKLMADTGL--DAYRFSISWSRLI--P---NG--RGPVNPKGLQYYNNLINELISYGIQPHVTLHHL-DL-P---QA 147 (262)
Q Consensus 82 ~eDi~l~k~lG~--~~~R~si~WsRI~--P---~g--~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~-~~-P---~~ 147 (262)
++=++.+++.|+ +++=+.+.|..-. + ++ .-.+|++-+--..++|++|+++|++.++.++-. .. | ..
T Consensus 28 ~~v~~~~r~~~iP~D~i~lD~dw~~~~~~~~~~~~~~~ft~d~~~FPdp~~mi~~Lh~~G~k~v~~v~P~~~~~~~~~~y 107 (292)
T cd06595 28 LALMDRFKKHNIPLDVLVIDMDWHVTDIPSKYGSGWTGYSWNRKLFPDPEKLLQDLHDRGLKVTLNLHPADGIRAHEDQY 107 (292)
T ss_pred HHHHHHHHHhCCCccEEEEecccccccccccccCCcceeEEChhcCCCHHHHHHHHHHCCCEEEEEeCCCcccCCCcHHH
Confidence 334455555444 4666677775421 1 11 235666667778999999999999988876522 11 1 11
Q ss_pred --HHHhh-----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 148 --LEDEY-----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 148 --l~~~~-----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+.... ..+.||+..+.|-+-....+...| -.-.|.=+|||...
T Consensus 108 ~~~~~~~~~~~~~~~~~~~D~tnp~a~~~w~~~~~~~~~~~G-idg~W~D~~E~~~~ 163 (292)
T cd06595 108 PEMAKALGVDPATEGPILFDLTNPKFMDAYFDNVHRPLEKQG-VDFWWLDWQQGNRT 163 (292)
T ss_pred HHHHHhcCCCcccCCeEEecCCCHHHHHHHHHHHHHHHHhcC-CcEEEecCCCCccc
Confidence 11111 124566666554333333333333 23468889998654
No 136
>TIGR01212 radical SAM protein, TIGR01212 family. This uncharacterized protein family shows significant similarity to TIGR01211, a longer protein that is a histone acetyltransferase at its C-terminus and is a subunit of RNA polymerase II (in yeast). This family lacks the GNAT acetyltransferase domain.
Probab=50.77 E-value=34 Score=31.57 Aligned_cols=107 Identities=14% Similarity=0.107 Sum_probs=61.5
Q ss_pred HHHHHHHHH---cCCCcceeccccCccccCCCCCCCh-hhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCC
Q 024821 82 KEDVKLMAD---TGLDAYRFSISWSRLIPNGRGPVNP-KGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWIN 157 (262)
Q Consensus 82 ~eDi~l~k~---lG~~~~R~si~WsRI~P~g~g~~n~-~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~ 157 (262)
++.+++|++ .|++ .|+++.-...-++--..+|. ..++-+.+.++.++++||++.+++. +++|.
T Consensus 123 ~e~l~~L~~l~~~G~~-~~i~lGlQS~~d~~L~~i~Rg~t~~~~~~ai~~l~~~gi~v~~~lI-~GlPg----------- 189 (302)
T TIGR01212 123 DEVLDLLAEYVERGYE-VWVELGLQTAHDKTLKKINRGHDFACYVDAVKRARKRGIKVCSHVI-LGLPG----------- 189 (302)
T ss_pred HHHHHHHHHhhhCCce-EEEEEccCcCCHHHHHHHcCcChHHHHHHHHHHHHHcCCEEEEeEE-ECCCC-----------
Confidence 344566654 4774 34444333332221011111 1245678899999999999777664 34551
Q ss_pred hhhHHHHHHHHHHHHHHhCCCCceEEeccCCchhccccccccCCC
Q 024821 158 RMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGFAMVGYDFGIAP 202 (262)
Q Consensus 158 ~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~~~gy~~g~~~ 202 (262)
++.+.+.+=++.+.+.=-+.|+.....-+|+.....-|..|.+.
T Consensus 190 -et~e~~~~t~~~l~~l~~d~i~i~~l~~~pgT~L~~~~~~g~~~ 233 (302)
T TIGR01212 190 -EDREEMMETAKIVSLLDVDGIKIHPLHVVKGTKMAKMYEKGELK 233 (302)
T ss_pred -CCHHHHHHHHHHHHhcCCCEEEEEEEEecCCCHHHHHHHcCCCC
Confidence 23466666666655543466888888888886655555555543
No 137
>cd07940 DRE_TIM_IPMS 2-isopropylmalate synthase (IPMS), N-terminal catalytic TIM barrel domain. 2-isopropylmalate synthase (IPMS) catalyzes an aldol-type condensation of acetyl-CoA and 2-oxoisovalerate yielding 2-isopropylmalate and CoA, the first committed step in leucine biosynthesis. This family includes the Arabidopsis thaliana IPMS1 and IPMS2 proteins, the Glycine max GmN56 protein, and the Brassica insularis BatIMS protein. This family also includes a group of archeal IPMS-like proteins represented by the Methanocaldococcus jannaschii AksA protein. AksA catalyzes the condensation of alpha-ketoglutarate and acetyl-CoA to form trans-homoaconitate, one of 13 steps in the conversion of alpha-ketoglutarate and acetylCoA to alpha-ketosuberate, a precursor to coenzyme B and biotin. AksA also catalyzes the condensation of alpha-ketoadipate or alpha-ketopimelate with acetylCoA to form, respectively, the (R)-homocitrate homologs (R)-2-hydroxy-1,2,5-pentanetricarboxylic acid and (R)-2-h
Probab=49.38 E-value=38 Score=30.44 Aligned_cols=107 Identities=14% Similarity=0.065 Sum_probs=64.7
Q ss_pred HHHHHHHHHcC----CCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC--CcHhHHH----
Q 024821 82 KEDVKLMADTG----LDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD--LPQALED---- 150 (262)
Q Consensus 82 ~eDi~l~k~lG----~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~--~P~~l~~---- 150 (262)
.+|+++..+.| ++.+|+.++-|.+.-.. -+.-.++.++...+.++.+++.|+++.++..+-. .|..+.+
T Consensus 72 ~~~v~~a~~~~~~~~~~~i~i~~~~s~~~~~~~~~~~~~~~~~~~~~~i~~a~~~G~~v~~~~~~~~~~~~~~~~~~~~~ 151 (268)
T cd07940 72 KKDIDAAAEALKPAKVDRIHTFIATSDIHLKYKLKKTREEVLERAVEAVEYAKSHGLDVEFSAEDATRTDLDFLIEVVEA 151 (268)
T ss_pred HhhHHHHHHhCCCCCCCEEEEEecCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCeEEEeeecCCCCCHHHHHHHHHH
Confidence 78999999999 99999987665543210 1222234678888999999999998776654311 1222211
Q ss_pred --hh----------CCCCChhhHHHHHHHHHHHHHHhCC-CCc-eEEeccCCchh
Q 024821 151 --EY----------GGWINRMIVKDFTAYADVCFREFGD-RVS-YWTTVNEPNGF 191 (262)
Q Consensus 151 --~~----------gGw~~~~~~~~F~~ya~~v~~~~gd-~V~-~w~t~NEP~~~ 191 (262)
+. .|-.. ++...++.+.+-+++++ .++ .+-.-|-..+.
T Consensus 152 ~~~~G~~~i~l~DT~G~~~---P~~v~~lv~~l~~~~~~~~i~l~~H~Hn~~GlA 203 (268)
T cd07940 152 AIEAGATTINIPDTVGYLT---PEEFGELIKKLKENVPNIKVPISVHCHNDLGLA 203 (268)
T ss_pred HHHcCCCEEEECCCCCCCC---HHHHHHHHHHHHHhCCCCceeEEEEecCCcchH
Confidence 11 22233 45666667777777765 133 34466666543
No 138
>COG0821 gcpE 1-hydroxy-2-methyl-2-(e)-butenyl 4-diphosphate synthase [Lipid metabolism]
Probab=49.00 E-value=1.3e+02 Score=28.70 Aligned_cols=72 Identities=22% Similarity=0.347 Sum_probs=53.2
Q ss_pred HHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHH
Q 024821 88 MADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAY 167 (262)
Q Consensus 88 ~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~y 167 (262)
..+.|+..+|+ -| |.+-.+ +....+++.++++||..=+...|-.+..-+.++|++-..+..++--.++
T Consensus 91 ~~~~g~~k~RI-------NP---GNig~~--~~v~~vVe~Ak~~g~piRIGVN~GSLek~~~~ky~~pt~ealveSAl~~ 158 (361)
T COG0821 91 AAECGVDKVRI-------NP---GNIGFK--DRVREVVEAAKDKGIPIRIGVNAGSLEKRLLEKYGGPTPEALVESALEH 158 (361)
T ss_pred hhhcCcceEEE-------CC---cccCcH--HHHHHHHHHHHHcCCCEEEecccCchhHHHHHHhcCCCHHHHHHHHHHH
Confidence 35677888875 34 443322 3789999999999999999999999999999999866555555555556
Q ss_pred HHHH
Q 024821 168 ADVC 171 (262)
Q Consensus 168 a~~v 171 (262)
++.+
T Consensus 159 a~~~ 162 (361)
T COG0821 159 AELL 162 (361)
T ss_pred HHHH
Confidence 6543
No 139
>TIGR03471 HpnJ hopanoid biosynthesis associated radical SAM protein HpnJ. One of the well-described hopanoid intermediates is bacteriohopanetetrol. In the conversion from hopene several reactions must occur in the side chain for which a radical mechanism might be reasonable. These include the four (presumably anaerobic) hydroxylations and a methyl shift.
Probab=48.95 E-value=45 Score=32.53 Aligned_cols=56 Identities=20% Similarity=0.277 Sum_probs=38.9
Q ss_pred HHHHHHHHHcCCCcceecc-ccCc-cccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 82 KEDVKLMADTGLDAYRFSI-SWSR-LIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si-~WsR-I~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
+|.+++|+++|++.+-+++ +-+. +...-....+ ++.+.+.++.++++||.+.+++.
T Consensus 287 ~e~l~~l~~aG~~~v~iGiES~s~~~L~~~~K~~~---~~~~~~~i~~~~~~Gi~v~~~~I 344 (472)
T TIGR03471 287 YETLKVMKENGLRLLLVGYESGDQQILKNIKKGLT---VEIARRFTRDCHKLGIKVHGTFI 344 (472)
T ss_pred HHHHHHHHHcCCCEEEEcCCCCCHHHHHHhcCCCC---HHHHHHHHHHHHHCCCeEEEEEE
Confidence 6778999999999877777 3332 2222011123 55688999999999999887764
No 140
>cd06604 GH31_glucosidase_II_MalA Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.
Probab=48.54 E-value=1e+02 Score=28.71 Aligned_cols=106 Identities=15% Similarity=0.170 Sum_probs=59.8
Q ss_pred HHHHHHHHHcCCC--cceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC-----CCcHhHHHh---
Q 024821 82 KEDVKLMADTGLD--AYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL-----DLPQALEDE--- 151 (262)
Q Consensus 82 ~eDi~l~k~lG~~--~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~-----~~P~~l~~~--- 151 (262)
.+.++.+++.|+. ++=+++.|..-. +.-.+|++-+---++++++|+++|++.++.+.-+ +.|..-+..
T Consensus 27 ~~~~~~~~~~~iP~d~i~lD~~~~~~~--~~f~~d~~~fPdp~~m~~~l~~~g~~~~~~~~P~v~~~~~~~~~~e~~~~g 104 (339)
T cd06604 27 REIADEFRERDIPCDAIYLDIDYMDGY--RVFTWDKERFPDPKELIKELHEQGFKVVTIIDPGVKVDPGYDVYEEGLEND 104 (339)
T ss_pred HHHHHHHHHhCCCcceEEECchhhCCC--CceeeccccCCCHHHHHHHHHHCCCEEEEEEeCceeCCCCChHHHHHHHCC
Confidence 3445555555543 444444554321 1233454444446789999999999987654322 122221110
Q ss_pred ---------------------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 ---------------------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ---------------------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+-.|.||+..+.|.+.-+.+. ..| ---.|+=+|||..+
T Consensus 105 ~~v~~~~g~~~~~~~w~g~~~~~Dftnp~a~~ww~~~~~~~~-~~G-vdg~w~D~~Ep~~~ 163 (339)
T cd06604 105 YFVKDPDGELYIGRVWPGLSAFPDFTNPKVREWWGSLYKKFV-DLG-VDGIWNDMNEPAVF 163 (339)
T ss_pred eEEECCCCCEEEEEecCCCccccCCCChHHHHHHHHHHHHHh-hCC-CceEeecCCCcccc
Confidence 013678888888877666554 232 23458899999865
No 141
>COG0366 AmyA Glycosidases [Carbohydrate transport and metabolism]
Probab=48.31 E-value=25 Score=33.74 Aligned_cols=57 Identities=21% Similarity=0.270 Sum_probs=38.8
Q ss_pred HHHHHHHHcCCCcceecc------ccCccccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeec
Q 024821 83 EDVKLMADTGLDAYRFSI------SWSRLIPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 83 eDi~l~k~lG~~~~R~si------~WsRI~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
+-++.+++||++++=++= ++.+-...---.+|+ -.++-.+++++++.++||+.++++
T Consensus 33 ~~LdYl~~LGv~aiwl~Pi~~s~~~~~gY~~~Dy~~id~~~Gt~~d~~~li~~~H~~gi~vi~D~ 97 (505)
T COG0366 33 EKLDYLKELGVDAIWLSPIFESPQADHGYDVSDYTKVDPHFGTEEDFKELVEEAHKRGIKVILDL 97 (505)
T ss_pred HhhhHHHHhCCCEEEeCCCCCCCccCCCccccchhhcCcccCCHHHHHHHHHHHHHCCCEEEEEe
Confidence 667899999999985442 111111110122332 357789999999999999999987
No 142
>TIGR02401 trehalose_TreY malto-oligosyltrehalose synthase. This enzyme, formally named (1-4)-alpha-D-glucan 1-alpha-D-glucosylmutase, is the TreY enzyme of the TreYZ pathway of trehalose biosynthesis, an alternative to the OtsAB pathway. Trehalose may be incorporated into more complex compounds but is best known as compatible solute. It is one of the most effective osmoprotectants, and unlike the various betaines does not require nitrogen for its synthesis.
Probab=47.26 E-value=36 Score=36.10 Aligned_cols=66 Identities=15% Similarity=0.257 Sum_probs=44.8
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccC--C-----CCCCChh--hhhHHHHHHHHHHHcCCcceeec--cCCC
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPN--G-----RGPVNPK--GLQYYNNLINELISYGIQPHVTL--HHLD 143 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~--g-----~g~~n~~--~l~~y~~~id~l~~~GI~p~vtL--~H~~ 143 (262)
+....+-++.+++||++++=+|=-+.-.-.+ | ...+|++ +.+-+++++++++++||.+|+++ .|-.
T Consensus 15 f~~~~~~L~YL~~LGv~~V~lsPi~~a~~gs~hGYdv~D~~~idp~lGt~edf~~Lv~aah~~Gm~vIlDiVpNH~a 91 (825)
T TIGR02401 15 FDDAAALLPYLKSLGVSHLYLSPILTAVPGSTHGYDVVDHSEINPELGGEEGLRRLSEAARARGLGLIVDIVPNHMA 91 (825)
T ss_pred HHHHHHhhHHHHHcCCCEEEeCcCccCCCCCCCCCCCCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEEEecccccc
Confidence 3446788899999999998776544321000 1 1233332 45668999999999999999986 4643
No 143
>PRK13347 coproporphyrinogen III oxidase; Provisional
Probab=47.07 E-value=36 Score=33.21 Aligned_cols=78 Identities=18% Similarity=0.273 Sum_probs=46.7
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCCh-hhhhHHHHHHHHHHHcCCc-ceeeccCCCCcHhHHHhhCCCCCh
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNP-KGLQYYNNLINELISYGIQ-PHVTLHHLDLPQALEDEYGGWINR 158 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~-~~l~~y~~~id~l~~~GI~-p~vtL~H~~~P~~l~~~~gGw~~~ 158 (262)
-+|.+++|+++|++.+-+++ ...-|+--..++. ...+-..+.++.+++.|+. +-++|. +++|.
T Consensus 151 t~e~l~~L~~~G~~rvsiGv--QS~~~~vl~~l~R~~~~~~~~~ai~~lr~~G~~~v~~dli-~GlPg------------ 215 (453)
T PRK13347 151 TAEMLQALAALGFNRASFGV--QDFDPQVQKAINRIQPEEMVARAVELLRAAGFESINFDLI-YGLPH------------ 215 (453)
T ss_pred CHHHHHHHHHcCCCEEEECC--CCCCHHHHHHhCCCCCHHHHHHHHHHHHhcCCCcEEEeEE-EeCCC------------
Confidence 47899999999999544444 4443321111111 1355678899999999997 445553 34552
Q ss_pred hhHHHHHHHHHHHHH
Q 024821 159 MIVKDFTAYADVCFR 173 (262)
Q Consensus 159 ~~~~~F~~ya~~v~~ 173 (262)
++.+.|.+-.+.+.+
T Consensus 216 qt~e~~~~tl~~~~~ 230 (453)
T PRK13347 216 QTVESFRETLDKVIA 230 (453)
T ss_pred CCHHHHHHHHHHHHh
Confidence 234556555555543
No 144
>PRK14507 putative bifunctional 4-alpha-glucanotransferase/malto-oligosyltrehalose synthase; Provisional
Probab=46.52 E-value=54 Score=37.56 Aligned_cols=64 Identities=16% Similarity=0.266 Sum_probs=43.6
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccc--cCC-----CCCCChh--hhhHHHHHHHHHHHcCCcceeec--cCCC
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLI--PNG-----RGPVNPK--GLQYYNNLINELISYGIQPHVTL--HHLD 143 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~--P~g-----~g~~n~~--~l~~y~~~id~l~~~GI~p~vtL--~H~~ 143 (262)
...+-++.+++||++++=+|=-+.-.- ..| ...+|++ +.+-+++++++++++||.+|+++ .|..
T Consensus 759 ~~~~~l~Yl~~LGv~~i~lsPi~~a~~gs~hGYdv~D~~~idp~lG~~edf~~Lv~~ah~~Gi~vilDiV~NH~~ 833 (1693)
T PRK14507 759 DAEAILPYLAALGISHVYASPILKARPGSTHGYDIVDHSQINPEIGGEEGFERFCAALKAHGLGQLLDIVPNHMG 833 (1693)
T ss_pred HHHHHhHHHHHcCCCEEEECCCcCCCCCCCCCCCCCCCCccCcccCCHHHHHHHHHHHHHCCCEEEEEecccccC
Confidence 456678999999999997765443110 001 0123333 45668999999999999999986 4653
No 145
>cd06589 GH31 The enzymes of glycosyl hydrolase family 31 (GH31) occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes
Probab=46.33 E-value=2.2e+02 Score=25.46 Aligned_cols=91 Identities=14% Similarity=0.225 Sum_probs=57.7
Q ss_pred cHHHHHHHHHcCCC--cceeccccCccccCCCC--CCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCC
Q 024821 81 YKEDVKLMADTGLD--AYRFSISWSRLIPNGRG--PVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWI 156 (262)
Q Consensus 81 ~~eDi~l~k~lG~~--~~R~si~WsRI~P~g~g--~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~ 156 (262)
..+-++.+++.|+. ++=+++.|..-. +.. .+|++-+.-.+++|+.|+++|++.++.+. |.-
T Consensus 26 v~~~~~~~~~~~iP~d~~~lD~~~~~~~--~~f~~~~d~~~Fpdp~~~i~~l~~~g~~~~~~~~----P~v--------- 90 (265)
T cd06589 26 VLEVIDGMRENDIPLDGFVLDDDYTDGY--GDFTFDWDAGKFPNPKSMIDELHDNGVKLVLWID----PYI--------- 90 (265)
T ss_pred HHHHHHHHHHcCCCccEEEECcccccCC--ceeeeecChhhCCCHHHHHHHHHHCCCEEEEEeC----hhH---------
Confidence 44556666665544 666666665432 234 56776677789999999999999998774 321
Q ss_pred ChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 157 NRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 157 ~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
.+.|.+..+......| ---.|+=+|||...
T Consensus 91 ----~~w~~~~~~~~~~~~G-vdg~w~D~~E~~~~ 120 (265)
T cd06589 91 ----REWWAEVVKKLLVSLG-VDGFWTDMGEPSPG 120 (265)
T ss_pred ----HHHHHHHHHHhhccCC-CCEEeccCCCCCcC
Confidence 4555554444322232 23458899999755
No 146
>cd07947 DRE_TIM_Re_CS Clostridium kluyveri Re-citrate synthase and related proteins, catalytic TIM barrel domain. Re-citrate synthase (Re-CS) is a Clostridium kluyveri enzyme that converts acetyl-CoA and oxaloacetate to citrate. In most organisms, this reaction is catalyzed by Si-citrate synthase which is Si-face stereospecific with respect to C-2 of oxaloacetate, and phylogenetically unrelated to Re-citrate synthase. Re-citrate synthase is also found in a few other strictly anaerobic organisms. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with
Probab=44.92 E-value=39 Score=30.97 Aligned_cols=59 Identities=25% Similarity=0.356 Sum_probs=47.1
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
-.+|++...++|++.+-+.++=|...-.. -+.--++.++.+.++++.++++|+++-+++
T Consensus 76 ~~~die~A~~~g~~~v~i~~s~S~~~~~~~~~~t~~e~l~~~~~~v~~a~~~g~~v~~~~ 135 (279)
T cd07947 76 NKEDLKLVKEMGLKETGILMSVSDYHIFKKLKMTREEAMEKYLEIVEEALDHGIKPRCHL 135 (279)
T ss_pred CHHHHHHHHHcCcCEEEEEEcCCHHHHHHHhCcCHHHHHHHHHHHHHHHHHCCCeEEEEE
Confidence 57999999999999998888666554321 233346789999999999999999988887
No 147
>PF01120 Alpha_L_fucos: Alpha-L-fucosidase; InterPro: IPR000933 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Family 29 (GH29 from CAZY) encompasses alpha-L-fucosidases (3.2.1.51 from EC) [], which is a lysosomal enzyme responsible for hydrolyzing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Alpha-L-fucosidase is responsible for hydrolysing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Fucosylated glycoconjugates are involved in numerous biological events, making alpha-l-fucosidases, the enzymes responsible for their processing, critically important. Deficiency in alpha-l-fucosidase activity is associated with fucosidosis, a lysosomal storage disorder characterised by rapid neurodegeneration, resulting in severe mental and motor deterioration []. The enzyme is a hexamer and displays a two-domain fold, composed of a catalytic (beta/alpha)(8)-like domain and a C-terminal beta-sandwich domain []. Drosophila melanogaster spermatozoa contains an alpha-l-fucosidase that might be involved in fertilisation by interacting with alpha-l-fucose residues on the micropyle of the eggshell []. In human sperm, membrane-associated alpha-l-fucosidase is stable for extended periods of time, which is made possible by membrane domains and compartmentalisation. These help preserve protein integrity []. ; GO: 0004560 alpha-L-fucosidase activity, 0005975 carbohydrate metabolic process; PDB: 3EYP_B 2ZX6_A 2ZWY_B 2ZX8_B 2WSP_A 2ZXA_A 2ZWZ_B 1ODU_B 1HL9_A 2ZX5_B ....
Probab=44.58 E-value=67 Score=30.20 Aligned_cols=59 Identities=19% Similarity=0.198 Sum_probs=36.3
Q ss_pred HHHHHHcCCCcceeccccCc---cccCCCCCCCh----hhhhHHHHHHHHHHHcCCcceeeccCCC
Q 024821 85 VKLMADTGLDAYRFSISWSR---LIPNGRGPVNP----KGLQYYNNLINELISYGIQPHVTLHHLD 143 (262)
Q Consensus 85 i~l~k~lG~~~~R~si~WsR---I~P~g~g~~n~----~~l~~y~~~id~l~~~GI~p~vtL~H~~ 143 (262)
++++|++|++-+=+.-.-.. +.|+....+|. -.-+...++.++|+++||++-+-+.++|
T Consensus 97 ~~~ak~aGakY~VlTakHHDGF~LW~S~~t~~~v~~~~~krDiv~El~~A~rk~Glk~G~Y~S~~d 162 (346)
T PF01120_consen 97 AKLAKDAGAKYVVLTAKHHDGFCLWPSKYTDYNVVNSGPKRDIVGELADACRKYGLKFGLYYSPWD 162 (346)
T ss_dssp HHHHHHTT-SEEEEEEE-TT--BSS--TT-SSBGGGGGGTS-HHHHHHHHHHHTT-EEEEEEESSS
T ss_pred HHHHHHcCCCEEEeehhhcCccccCCCCCCcccccCCCCCCCHHHHHHHHHHHcCCeEEEEecchH
Confidence 78999999997765432211 23332222332 2457889999999999999999887665
No 148
>cd02874 GH18_CFLE_spore_hydrolase Cortical fragment-lytic enzyme (CFLE) is a peptidoglycan hydrolase involved in bacterial endospore germination. CFLE is expressed as an inactive preprotein (called SleB) in the forespore compartment of sporulating cells. SleB translocates across the forespore inner membrane and is deposited as a mature enzyme in the cortex layer of the spore. As part of a sensory mechanism capable of initiating germination, CFLE degrades a spore-specific peptidoglycan constituent called muramic-acid delta-lactam that comprises the outer cortex. CFLE has a C-terminal glycosyl hydrolase family 18 (GH18) catalytic domain as well as two N-terminal LysM peptidoglycan-binding domains. In addition to SleB, this family includes YaaH, YdhD, and YvbX from Bacillus subtilis.
Probab=43.96 E-value=71 Score=29.14 Aligned_cols=87 Identities=15% Similarity=0.222 Sum_probs=59.6
Q ss_pred HHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC-----C---------cHh---
Q 024821 85 VKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD-----L---------PQA--- 147 (262)
Q Consensus 85 i~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~-----~---------P~~--- 147 (262)
.+.+++.+-+.=-++..|-.|.|+| .+... ...++++.++++|+++++++..++ - |..
T Consensus 16 ~~~~~~~~~~lt~v~p~w~~~~~~g--~~~~~---~~~~~~~~a~~~~~kv~~~i~~~~~~~~~~~~~~~~l~~~~~r~~ 90 (313)
T cd02874 16 YESLRANAPYLTYIAPFWYGVDADG--TLTGL---PDERLIEAAKRRGVKPLLVITNLTNGNFDSELAHAVLSNPEARQR 90 (313)
T ss_pred HHHHHHhcCCCCEEEEEEEEEcCCC--CCCCC---CCHHHHHHHHHCCCeEEEEEecCCCCCCCHHHHHHHhcCHHHHHH
Confidence 5667776666667888999998875 33322 236899999999999999998654 1 110
Q ss_pred -------HHHhh--CC----CC--ChhhHHHHHHHHHHHHHHhC
Q 024821 148 -------LEDEY--GG----WI--NRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 148 -------l~~~~--gG----w~--~~~~~~~F~~ya~~v~~~~g 176 (262)
+..+| .| |+ .++..+.|..+.+.+-++++
T Consensus 91 fi~~iv~~l~~~~~DGidiDwE~~~~~d~~~~~~fl~~lr~~l~ 134 (313)
T cd02874 91 LINNILALAKKYGYDGVNIDFENVPPEDREAYTQFLRELSDRLH 134 (313)
T ss_pred HHHHHHHHHHHhCCCcEEEecccCCHHHHHHHHHHHHHHHHHhh
Confidence 01122 34 44 44667889999999999886
No 149
>PRK12330 oxaloacetate decarboxylase; Provisional
Probab=43.84 E-value=71 Score=31.97 Aligned_cols=96 Identities=13% Similarity=0.140 Sum_probs=51.6
Q ss_pred HHHHHHHHHcCCCcceeccccCcccc----------CCC---CCC----C-hhhhhHHHHHHHHHHHcCCcceeeccCCC
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIP----------NGR---GPV----N-PKGLQYYNNLINELISYGIQPHVTLHHLD 143 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P----------~g~---g~~----n-~~~l~~y~~~id~l~~~GI~p~vtL~H~~ 143 (262)
+.|++..++.|++.+|+....+.+.- .|. +.+ + ...+++|.++.+++.+.|+.-+ ++-
T Consensus 100 ~~fv~~a~~~Gidi~RIfd~lndv~nl~~ai~~vk~ag~~~~~~i~yt~sp~~t~e~~~~~a~~l~~~Gad~I-~Ik--- 175 (499)
T PRK12330 100 DRFVEKSAENGMDVFRVFDALNDPRNLEHAMKAVKKVGKHAQGTICYTVSPIHTVEGFVEQAKRLLDMGADSI-CIK--- 175 (499)
T ss_pred HHHHHHHHHcCCCEEEEEecCChHHHHHHHHHHHHHhCCeEEEEEEEecCCCCCHHHHHHHHHHHHHcCCCEE-EeC---
Confidence 34789999999999999876665410 000 000 0 0124444444444444444321 221
Q ss_pred CcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhC-CCCceEEeccCCchh
Q 024821 144 LPQALEDEYGGWINRMIVKDFTAYADVCFREFG-DRVSYWTTVNEPNGF 191 (262)
Q Consensus 144 ~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~g-d~V~~w~t~NEP~~~ 191 (262)
|- -|-.+| ....+.++.+-++++ +..-..-+-|-..+.
T Consensus 176 ------Dt-aGll~P---~~~~~LV~~Lk~~~~~~ipI~~H~Hnt~GlA 214 (499)
T PRK12330 176 ------DM-AALLKP---QPAYDIVKGIKEACGEDTRINLHCHSTTGVT 214 (499)
T ss_pred ------CC-ccCCCH---HHHHHHHHHHHHhCCCCCeEEEEeCCCCCcH
Confidence 11 456666 455566667777786 333345577776654
No 150
>PRK12677 xylose isomerase; Provisional
Probab=43.52 E-value=1.4e+02 Score=28.63 Aligned_cols=72 Identities=19% Similarity=0.235 Sum_probs=44.9
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCcHhHHHhhCCCCCh
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLPQALEDEYGGWINR 158 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~l~~~~gGw~~~ 158 (262)
..+|-++.++++|++.+=+. -..+.|-+. ...+--+..+++-+.+.++||++. ++...|..|.+ +.|++.++
T Consensus 32 ~~~E~v~~~a~~Gf~gVElh--~~~l~p~~~--~~~~~~~~~~~lk~~l~~~GL~v~~v~~n~f~~p~~---~~g~lts~ 104 (384)
T PRK12677 32 DPVEAVHKLAELGAYGVTFH--DDDLVPFGA--TDAERDRIIKRFKKALDETGLVVPMVTTNLFTHPVF---KDGAFTSN 104 (384)
T ss_pred CHHHHHHHHHHhCCCEEEec--ccccCCCCC--ChhhhHHHHHHHHHHHHHcCCeeEEEecCCCCCccc---cCCcCCCC
Confidence 46788999999999988552 233555321 111111357788888999999966 55544444533 23677764
No 151
>PRK09936 hypothetical protein; Provisional
Probab=43.01 E-value=1.9e+02 Score=27.01 Aligned_cols=98 Identities=15% Similarity=0.247 Sum_probs=61.4
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhh
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMI 160 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~ 160 (262)
|++=++.++.+|++++ =+.|++. |+..+-.. =...-+.++...+.||+..|.|+ +| |.|....- .++..
T Consensus 40 Wq~~~~~~~~~G~~tL--ivQWt~y---G~~~fg~~-~g~La~~l~~A~~~Gl~v~vGL~-~D-p~y~q~~~---~d~~~ 108 (296)
T PRK09936 40 WQGLWSQLRLQGFDTL--VVQWTRY---GDADFGGQ-RGWLAKRLAAAQQAGLKLVVGLY-AD-PEFFMHQK---QDGAA 108 (296)
T ss_pred HHHHHHHHHHcCCcEE--EEEeeec---cCCCcccc-hHHHHHHHHHHHHcCCEEEEccc-CC-hHHHHHHh---cCchh
Confidence 5667888999999975 3589998 22122111 14688999999999999999996 55 66654331 12211
Q ss_pred HH-HHHH-------HHHHHHHHhCCCCceEEeccCCc
Q 024821 161 VK-DFTA-------YADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 161 ~~-~F~~-------ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
.+ .+.+ .++...++.+-.|+.|-+==|+.
T Consensus 109 ~~~yl~~~l~~~~~qa~~~~~~~~~~v~GWYiP~ElD 145 (296)
T PRK09936 109 LESYLNRQLGASLQQARLWSAAWGVPVDGWYLPAELD 145 (296)
T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCCCeEEeeeccc
Confidence 11 2222 34445566666677776544444
No 152
>cd06542 GH18_EndoS-like Endo-beta-N-acetylglucosaminidases are bacterial chitinases that hydrolyze the chitin core of various asparagine (N)-linked glycans and glycoproteins. The endo-beta-N-acetylglucosaminidases have a glycosyl hydrolase family 18 (GH18) catalytic domain. Some members also have an additional C-terminal glycosyl hydrolase family 20 (GH20) domain while others have an N-terminal domain of unknown function (pfam08522). Members of this family include endo-beta-N-acetylglucosaminidase S (EndoS) from Streptococcus pyogenes, EndoF1, EndoF2, EndoF3, and EndoH from Flavobacterium meningosepticum, and EndoE from Enterococcus faecalis. EndoS is a secreted endoglycosidase from Streptococcus pyogenes that specifically hydrolyzes the glycan on human IgG between two core N-acetylglucosamine residues. EndoE is a secreted endoglycosidase, encoded by the ndoE gene in Enterococcus faecalis, that hydrolyzes the glycan on human RNase B.
Probab=42.66 E-value=68 Score=28.30 Aligned_cols=18 Identities=11% Similarity=0.074 Sum_probs=12.2
Q ss_pred hHHHHHHHHHHHHHHhCC
Q 024821 160 IVKDFTAYADVCFREFGD 177 (262)
Q Consensus 160 ~~~~F~~ya~~v~~~~gd 177 (262)
..+.|..+++.+-++++.
T Consensus 126 ~~~~~~~lv~~Lr~~~~~ 143 (255)
T cd06542 126 SNEAFVRLIKELRKYMGP 143 (255)
T ss_pred hHHHHHHHHHHHHHHhCc
Confidence 456677777777777763
No 153
>cd07941 DRE_TIM_LeuA3 Desulfobacterium autotrophicum LeuA3 and related proteins, N-terminal catalytic TIM barrel domain. Desulfobacterium autotrophicum LeuA3 is sequence-similar to alpha-isopropylmalate synthase (LeuA) but its exact function is unknown. Members of this family have an N-terminal TIM barrel domain that belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. The domain has a catalytic center containing a divalent cation-binding site formed by a cluster of in
Probab=42.59 E-value=31 Score=31.24 Aligned_cols=107 Identities=11% Similarity=0.014 Sum_probs=68.3
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCC-CC----cHhHHHh----
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHL-DL----PQALEDE---- 151 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~-~~----P~~l~~~---- 151 (262)
+.+++++++.|++.+|+.++=|...-.. .+.-.++.++...+.++.+++.|+++.++.-+| |- |..+.+.
T Consensus 81 ~~~~~~a~~~g~~~i~i~~~~sd~~~~~~~~~~~~~~~~~~~~~i~~ak~~G~~v~~~~~~~~d~~~~~~~~~~~~~~~~ 160 (273)
T cd07941 81 DPNLQALLEAGTPVVTIFGKSWDLHVTEALGTTLEENLAMIRDSVAYLKSHGREVIFDAEHFFDGYKANPEYALATLKAA 160 (273)
T ss_pred hHHHHHHHhCCCCEEEEEEcCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCeEEEeEEeccccCCCCHHHHHHHHHHH
Confidence 3689999999999999988655443210 233335678899999999999999998865554 32 3332211
Q ss_pred --h----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 --Y----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 --~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
. -|-..| +...++++.+-+++++..--.-.-|-..+.
T Consensus 161 ~~~g~~~i~l~DT~G~~~P---~~v~~lv~~l~~~~~~~~l~~H~Hnd~Gla 209 (273)
T cd07941 161 AEAGADWLVLCDTNGGTLP---HEIAEIVKEVRERLPGVPLGIHAHNDSGLA 209 (273)
T ss_pred HhCCCCEEEEecCCCCCCH---HHHHHHHHHHHHhCCCCeeEEEecCCCCcH
Confidence 0 233444 556666777777776532234466666543
No 154
>PLN02960 alpha-amylase
Probab=41.53 E-value=34 Score=36.52 Aligned_cols=99 Identities=10% Similarity=0.159 Sum_probs=59.4
Q ss_pred cccccHHH-HHHHHHcCCCcceeccc--------cCccccCCCCCCCh--hhhhHHHHHHHHHHHcCCcceeec--cCCC
Q 024821 77 EYHKYKED-VKLMADTGLDAYRFSIS--------WSRLIPNGRGPVNP--KGLQYYNNLINELISYGIQPHVTL--HHLD 143 (262)
Q Consensus 77 ~y~~~~eD-i~l~k~lG~~~~R~si~--------WsRI~P~g~g~~n~--~~l~~y~~~id~l~~~GI~p~vtL--~H~~ 143 (262)
.|....+. ++.+|+||++++=+.=- |- -.+.+--.++. ...+=++.+|++|.++||.+++++ .|+.
T Consensus 414 tf~~~~e~~LdYLk~LGvt~IeLmPv~e~~~~~swG-Y~~~~yfa~~~~yGtp~dfk~LVd~aH~~GI~VILDvV~NH~~ 492 (897)
T PLN02960 414 SFKEFTQKVLPHVKKAGYNAIQLIGVQEHKDYSSVG-YKVTNFFAVSSRFGTPDDFKRLVDEAHGLGLLVFLDIVHSYAA 492 (897)
T ss_pred CHHHHHHHHHHHHHHcCCCEEEECCcccCCCCCCCC-CCcccCCCcccccCCHHHHHHHHHHHHHCCCEEEEEecccccC
Confidence 45555544 89999999999876521 21 00110001111 123458999999999999999987 3542
Q ss_pred C--cHhHH--Hh-------------hCCC-------CChhhHHHHHHHHHHHHHHhC
Q 024821 144 L--PQALE--DE-------------YGGW-------INRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 144 ~--P~~l~--~~-------------~gGw-------~~~~~~~~F~~ya~~v~~~~g 176 (262)
. +.-+. +. +..| .++++.+.+.+=++.-+++|+
T Consensus 493 ~d~~~~L~~FDG~~~~Yf~~~~~g~~~~WG~~~fNy~~~eVr~fLlsna~yWl~Eyh 549 (897)
T PLN02960 493 ADEMVGLSLFDGSNDCYFHSGKRGHHKRWGTRMFKYGDHEVLHFLLSNLNWWVTEYR 549 (897)
T ss_pred CccccchhhcCCCccceeecCCCCccCCCCCcccCCCCHHHHHHHHHHHHHHHHHHC
Confidence 1 10010 00 0112 356788888888888888886
No 155
>PLN02389 biotin synthase
Probab=41.50 E-value=62 Score=31.07 Aligned_cols=57 Identities=21% Similarity=0.222 Sum_probs=42.0
Q ss_pred ccHHHHHHHHHcCCCcceeccccCc-cccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSR-LIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsR-I~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
..+|.++.||+.|++.|-.+++=++ ++|.-... ..++..-+.++.+++.||++..++
T Consensus 176 l~~E~l~~LkeAGld~~~~~LeTs~~~y~~i~~~---~s~e~rl~ti~~a~~~Gi~v~sg~ 233 (379)
T PLN02389 176 LEKEQAAQLKEAGLTAYNHNLDTSREYYPNVITT---RSYDDRLETLEAVREAGISVCSGG 233 (379)
T ss_pred CCHHHHHHHHHcCCCEEEeeecCChHHhCCcCCC---CCHHHHHHHHHHHHHcCCeEeEEE
Confidence 5789999999999999988875223 44441111 136778899999999999876654
No 156
>PRK10426 alpha-glucosidase; Provisional
Probab=41.49 E-value=2.2e+02 Score=29.27 Aligned_cols=105 Identities=18% Similarity=0.187 Sum_probs=64.5
Q ss_pred cHHHHHHHHHcCCCc--ceeccccCccccC--CC-----CCCChhhhhHHHHHHHHHHHcCCcceeeccCC---CCcHhH
Q 024821 81 YKEDVKLMADTGLDA--YRFSISWSRLIPN--GR-----GPVNPKGLQYYNNLINELISYGIQPHVTLHHL---DLPQAL 148 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~--~R~si~WsRI~P~--g~-----g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~---~~P~~l 148 (262)
..+-++.+++.|+.. +=+. .|+..... |. -.+|++-+--.+++|++|++.|++.++.+.=+ +.|..-
T Consensus 223 v~~v~~~~r~~~IP~d~i~ld-dw~~~~~~~~g~~~~~~~~~d~~~FPdp~~mi~~L~~~G~k~v~~i~P~v~~~~~~y~ 301 (635)
T PRK10426 223 VQKKLDTMRNAGVKVNGIWAQ-DWSGIRMTSFGKRLMWNWKWDSERYPQLDSRIKQLNEEGIQFLGYINPYLASDGDLCE 301 (635)
T ss_pred HHHHHHHHHHcCCCeeEEEEe-cccccccccccccccccceEChhhCCCHHHHHHHHHHCCCEEEEEEcCccCCCCHHHH
Confidence 334467777877654 3334 57654221 11 14466666667899999999999988776522 333332
Q ss_pred HHh---h------C---------------CCCChhhHHHHHHHHHHHHHHhCCCCce-EEeccCC
Q 024821 149 EDE---Y------G---------------GWINRMIVKDFTAYADVCFREFGDRVSY-WTTVNEP 188 (262)
Q Consensus 149 ~~~---~------g---------------Gw~~~~~~~~F~~ya~~v~~~~gd~V~~-w~t~NEP 188 (262)
+.+ | | .+.||+..+.|.+..+.-+...| |+. |.=+||+
T Consensus 302 e~~~~gy~vk~~~g~~~~~~~~~~~~~~~Dftnp~ar~Ww~~~~~~~~~~~G--vdg~w~D~~E~ 364 (635)
T PRK10426 302 EAAEKGYLAKDADGGDYLVEFGEFYAGVVDLTNPEAYEWFKEVIKKNMIGLG--CSGWMADFGEY 364 (635)
T ss_pred HHHHCCcEEECCCCCEEEeEecCCCceeecCCCHHHHHHHHHHHHHHHhhcC--CCEEeeeCCCC
Confidence 211 0 1 15789999999877665555555 555 5788994
No 157
>PF00682 HMGL-like: HMGL-like of this family is not conserved in other members. are a sub-families of this Pfam.; InterPro: IPR000891 Pyruvate carboxylase (6.4.1.1 from EC) (PC), a member of the biotin-dependent enzyme family, is involved in the gluconeogenesis by mediating the carboxylation of pyruvate to oxaloacetate. Biotin-dependent carboxylase enzymes perform a two step reaction. Enzyme-bound biotin is first carboxylated by bicarbonate and ATP and the carboxyl group temporarily bound to biotin is subsequently transferred to an acceptor substrate such as pyruvate []. PC has three functional domains: a biotin carboxylase (BC) domain, a carboxyltransferase (CT) domain which perform the second part of the reaction and a biotinyl domain [, ]. The mechanism by which the carboxyl group is transferred from the carboxybiotin to the pyruvate is not well understood. The pyruvate carboxyltransferase domain is also found in other pyruvate binding enzymes and acetyl-CoA dependent enzymes suggesting that this domain can be associated with different enzymatic activities. This domain is found towards the N-terminal region of various aldolase enzymes. This N-terminal TIM barrel domain [] interacts with the C-terminal domain. The C-terminal DmpG_comm domain (IPR012425 from INTERPRO) is thought to promote heterodimerisation with members of IPR003361 from INTERPRO to form a bifunctional aldolase-dehydrogenase []. ; GO: 0003824 catalytic activity; PDB: 3MP5_E 3MP3_E 2CW6_E 3MP4_D 3HBL_A 3HB9_C 3HO8_A 3BG5_C 1YDN_B 3RMJ_A ....
Probab=41.45 E-value=61 Score=28.23 Aligned_cols=106 Identities=16% Similarity=0.136 Sum_probs=63.0
Q ss_pred HHHHHH----HHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC--CcHhHHHh---
Q 024821 82 KEDVKL----MADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD--LPQALEDE--- 151 (262)
Q Consensus 82 ~eDi~l----~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~--~P~~l~~~--- 151 (262)
++|++. +++.|++.+|+.++=|...... -+.--++.++...+.++.+++.|+++.+++-+.. .|..+.+-
T Consensus 66 ~~~i~~~~~~~~~~g~~~i~i~~~~s~~~~~~~~~~~~~~~~~~~~~~v~~ak~~g~~v~~~~~~~~~~~~~~~~~~~~~ 145 (237)
T PF00682_consen 66 EEDIERAVEAAKEAGIDIIRIFISVSDLHIRKNLNKSREEALERIEEAVKYAKELGYEVAFGCEDASRTDPEELLELAEA 145 (237)
T ss_dssp HHHHHHHHHHHHHTTSSEEEEEEETSHHHHHHHTCSHHHHHHHHHHHHHHHHHHTTSEEEEEETTTGGSSHHHHHHHHHH
T ss_pred HHHHHHHHHhhHhccCCEEEecCcccHHHHHHhhcCCHHHHHHHHHHHHHHHHhcCCceEeCccccccccHHHHHHHHHH
Confidence 455555 4569999999998776644331 2333356788999999999999999988876532 12221110
Q ss_pred ---h----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 152 ---Y----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 152 ---~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
. -|...| +...++.+.+-+++++..-.+-.-|-..+
T Consensus 146 ~~~~g~~~i~l~Dt~G~~~P---~~v~~lv~~~~~~~~~~~l~~H~Hnd~Gl 194 (237)
T PF00682_consen 146 LAEAGADIIYLADTVGIMTP---EDVAELVRALREALPDIPLGFHAHNDLGL 194 (237)
T ss_dssp HHHHT-SEEEEEETTS-S-H---HHHHHHHHHHHHHSTTSEEEEEEBBTTS-
T ss_pred HHHcCCeEEEeeCccCCcCH---HHHHHHHHHHHHhccCCeEEEEecCCccc
Confidence 0 233333 44555566666666653334555665544
No 158
>PRK09249 coproporphyrinogen III oxidase; Provisional
Probab=40.99 E-value=50 Score=32.18 Aligned_cols=76 Identities=12% Similarity=0.255 Sum_probs=46.3
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC---CCCCChhhhhHHHHHHHHHHHcCCc-ceeeccCCCCcHhHHHhhCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG---RGPVNPKGLQYYNNLINELISYGIQ-PHVTLHHLDLPQALEDEYGGWI 156 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g---~g~~n~~~l~~y~~~id~l~~~GI~-p~vtL~H~~~P~~l~~~~gGw~ 156 (262)
-+|.+++|+++|++.+.+++ ...-++- -+.. ...+-..+.++.+++.|+. +.++|. +++|.
T Consensus 150 t~e~l~~l~~aG~~risiGv--qS~~~~~L~~l~r~--~~~~~~~~ai~~l~~~G~~~v~~dli-~GlPg---------- 214 (453)
T PRK09249 150 DLEMLDALRELGFNRLSLGV--QDFDPEVQKAVNRI--QPFEFTFALVEAARELGFTSINIDLI-YGLPK---------- 214 (453)
T ss_pred CHHHHHHHHHcCCCEEEECC--CCCCHHHHHHhCCC--CCHHHHHHHHHHHHHcCCCcEEEEEE-ccCCC----------
Confidence 47889999999998666555 3332221 1221 1355677899999999994 445553 45552
Q ss_pred ChhhHHHHHHHHHHHHH
Q 024821 157 NRMIVKDFTAYADVCFR 173 (262)
Q Consensus 157 ~~~~~~~F~~ya~~v~~ 173 (262)
++.+.|.+-.+.+.+
T Consensus 215 --qt~e~~~~~l~~~~~ 229 (453)
T PRK09249 215 --QTPESFARTLEKVLE 229 (453)
T ss_pred --CCHHHHHHHHHHHHh
Confidence 234555555555544
No 159
>cd03130 GATase1_CobB Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. CobB plays a role in cobalamin biosythesis catalyzing the conversion of cobyrinic acid to cobyrinic acid a,c-diamide. CobB belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobB.
Probab=40.79 E-value=67 Score=27.57 Aligned_cols=70 Identities=20% Similarity=0.208 Sum_probs=42.0
Q ss_pred CcCCccccccHHHHHHHHHcCCCcceeccc----cCc----cccCCCCCCCh---hhhhH---HHHHHHHHHHcCCccee
Q 024821 72 DVACDEYHKYKEDVKLMADTGLDAYRFSIS----WSR----LIPNGRGPVNP---KGLQY---YNNLINELISYGIQPHV 137 (262)
Q Consensus 72 ~~a~d~y~~~~eDi~l~k~lG~~~~R~si~----WsR----I~P~g~g~~n~---~~l~~---y~~~id~l~~~GI~p~v 137 (262)
|.|=.|| |+|.++.++++|++...+|.. -+. |+|- -.+. +.|.. ..+.|.+..++| .|++
T Consensus 6 d~aF~f~--y~e~~~~l~~~G~~v~~~s~~~~~~l~~~D~lilPG---G~~~~~~~~L~~~~~~~~~i~~~~~~g-~pil 79 (198)
T cd03130 6 DEAFNFY--YPENLELLEAAGAELVPFSPLKDEELPDADGLYLGG---GYPELFAEELSANQSMRESIRAFAESG-GPIY 79 (198)
T ss_pred cCccccc--cHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEEECC---CchHHHHHHHHhhHHHHHHHHHHHHcC-CCEE
Confidence 3455677 999999999999988877652 111 2333 2232 23322 344455555667 4899
Q ss_pred eccCCCCcHhHH
Q 024821 138 TLHHLDLPQALE 149 (262)
Q Consensus 138 tL~H~~~P~~l~ 149 (262)
.+++ =++-|.
T Consensus 80 gICg--G~qlL~ 89 (198)
T cd03130 80 AECG--GLMYLG 89 (198)
T ss_pred EEcc--cHHHHH
Confidence 8885 344443
No 160
>PRK05660 HemN family oxidoreductase; Provisional
Probab=40.44 E-value=65 Score=30.60 Aligned_cols=94 Identities=14% Similarity=0.190 Sum_probs=57.4
Q ss_pred cHHHHHHHHHcCCCcceecc-ccC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCcHhHHHhhCCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSI-SWS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLPQALEDEYGGWIN 157 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si-~Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~l~~~~gGw~~ 157 (262)
-++.++.|+++|++.+-+++ +-+ ++... -+... ..+-..+.++.+++.|+.++ ++|. +++|.
T Consensus 106 ~~e~l~~Lk~~Gv~risiGvqS~~~~~L~~-l~r~~--~~~~~~~ai~~~~~~G~~~v~~dli-~Glpg----------- 170 (378)
T PRK05660 106 EADRFVGYQRAGVNRISIGVQSFSEEKLKR-LGRIH--GPDEAKRAAKLAQGLGLRSFNLDLM-HGLPD----------- 170 (378)
T ss_pred CHHHHHHHHHcCCCEEEeccCcCCHHHHHH-hCCCC--CHHHHHHHHHHHHHcCCCeEEEEee-cCCCC-----------
Confidence 45899999999999655555 222 22221 12221 24556778999999999875 5553 45662
Q ss_pred hhhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 158 RMIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 158 ~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
++.+.|.+-.+.+.+.=-+.+..+...=||+.
T Consensus 171 -qt~~~~~~~l~~~~~l~p~~is~y~l~~~~gT 202 (378)
T PRK05660 171 -QSLEEALDDLRQAIALNPPHLSWYQLTIEPNT 202 (378)
T ss_pred -CCHHHHHHHHHHHHhcCCCeEEeeccEeccCC
Confidence 23455666666655533356666665556653
No 161
>PF04914 DltD_C: DltD C-terminal region; InterPro: IPR006998 The dlt operon (dltA to dltD) of Lactobacillus rhamnosus 7469 encodes four proteins responsible for the esterification of lipoteichoic acid (LTA) by D-alanine. These esters play an important role in controlling the net anionic charge of the poly (GroP) moiety of LTA. DltA and DltC encode the D-alanine-D-alanyl carrier protein ligase (Dcl) and D-alanyl carrier protein (Dcp), respectively. Whereas the functions of DltA and DltC are defined, the functions of DltB and DltD are unknown. In vitro assays showed that DltD bound Dcp for ligation with D-alanine by Dcl in the presence of ATP. In contrast, the homologue of Dcp, the Escherichia coli acyl carrier protein (ACP), involved in fatty acid biosynthesis, was not bound to DltD and thus was not ligated with D-alanine. DltD also catalyzed the hydrolysis of the mischarged D-alanyl-ACP. The hydrophobic N-terminal sequence of DltD was required for anchoring the protein in the membrane. It is hypothesized that this membrane-associated DltD facilitates the binding of Dcp and Dcl for ligation of Dcp with D-alanine and that the resulting D-alanyl-Dcp is translocated to the primary site of D-alanylation []. These sequences contain the C-terminal region of DltD.; PDB: 3BMA_C.
Probab=39.91 E-value=76 Score=25.85 Aligned_cols=57 Identities=11% Similarity=0.156 Sum_probs=37.1
Q ss_pred hhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCC
Q 024821 117 GLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGD 177 (262)
Q Consensus 117 ~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd 177 (262)
..+=+.-+++.|++.|++|++.+.= -.+.|.. |-| .+++..+.|.+=.+..++++|=
T Consensus 34 Ey~Dl~l~L~~~k~~g~~~lfVi~P-vNg~wyd--ytG-~~~~~r~~~y~kI~~~~~~~gf 90 (130)
T PF04914_consen 34 EYDDLQLLLDVCKELGIDVLFVIQP-VNGKWYD--YTG-LSKEMRQEYYKKIKYQLKSQGF 90 (130)
T ss_dssp HHHHHHHHHHHHHHTT-EEEEEE-----HHHHH--HTT---HHHHHHHHHHHHHHHHTTT-
T ss_pred cHHHHHHHHHHHHHcCCceEEEecC-CcHHHHH--HhC-CCHHHHHHHHHHHHHHHHHCCC
Confidence 4566789999999999999998841 1123542 445 3556667777777778888875
No 162
>PF03511 Fanconi_A: Fanconi anaemia group A protein; InterPro: IPR003516 Fanconi anaemia (FA) [, , ] is a recessive inherited disease characterised by defective DNA repair. FA cells are sensitive to DNA cross-linking agents that cause chromosomal instability and cell death. The disease is manifested clinically by progressive pancytopenia, variable physical anomalies, and predisposition to malignancy []. Four complementation groups have been identified, designated A to D. The FA group A gene (FAA) has been cloned [], but its function remains to be elucidated.
Probab=39.39 E-value=26 Score=25.16 Aligned_cols=38 Identities=26% Similarity=0.343 Sum_probs=31.0
Q ss_pred CccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC
Q 024821 103 SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL 142 (262)
Q Consensus 103 sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~ 142 (262)
+++.|+ ++.--+++++..-+++..|.++|| +.+.|++-
T Consensus 19 s~l~p~-~~~d~~kaldiCaeIL~cLE~R~i-sWl~LFql 56 (64)
T PF03511_consen 19 SYLAPK-EGADSLKALDICAEILGCLEKRKI-SWLVLFQL 56 (64)
T ss_pred HhcCcc-cccccHHHHHHHHHHHHHHHhCCC-cHHHhhhc
Confidence 678888 455557789999999999999999 88777753
No 163
>PRK09282 pyruvate carboxylase subunit B; Validated
Probab=39.27 E-value=96 Score=31.67 Aligned_cols=94 Identities=13% Similarity=0.085 Sum_probs=54.6
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc----HhHHH------
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP----QALED------ 150 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P----~~l~~------ 150 (262)
.++|+++.++.|++.+|+..+-+.+ +-..+.++..+++|....+++..-+.| ..+.+
T Consensus 98 v~~~v~~A~~~Gvd~irif~~lnd~-------------~n~~~~i~~ak~~G~~v~~~i~~t~~p~~t~~~~~~~a~~l~ 164 (592)
T PRK09282 98 VEKFVEKAAENGIDIFRIFDALNDV-------------RNMEVAIKAAKKAGAHVQGTISYTTSPVHTIEKYVELAKELE 164 (592)
T ss_pred hHHHHHHHHHCCCCEEEEEEecChH-------------HHHHHHHHHHHHcCCEEEEEEEeccCCCCCHHHHHHHHHHHH
Confidence 3456789999999999998766554 224555666677777666555322223 11110
Q ss_pred h----------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 151 E----------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 151 ~----------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+ -.|-..| ....+.++.+-++++ ..-..-+-|-..+.
T Consensus 165 ~~Gad~I~i~Dt~G~~~P---~~~~~lv~~lk~~~~-~pi~~H~Hnt~Gla 211 (592)
T PRK09282 165 EMGCDSICIKDMAGLLTP---YAAYELVKALKEEVD-LPVQLHSHCTSGLA 211 (592)
T ss_pred HcCCCEEEECCcCCCcCH---HHHHHHHHHHHHhCC-CeEEEEEcCCCCcH
Confidence 0 1345555 445556666667774 22345577776653
No 164
>smart00729 Elp3 Elongator protein 3, MiaB family, Radical SAM. This superfamily contains MoaA, NifB, PqqE, coproporphyrinogen III oxidase, biotin synthase and MiaB families, and includes a representative in the eukaryotic elongator subunit, Elp-3. Some members of the family are methyltransferases.
Probab=39.14 E-value=1.6e+02 Score=23.92 Aligned_cols=58 Identities=12% Similarity=0.157 Sum_probs=39.1
Q ss_pred cccHHHHHHHHHcCCCcceeccc-cCc-cccCCCCCCChhhhhHHHHHHHHHHHcC-Ccceeec
Q 024821 79 HKYKEDVKLMADTGLDAYRFSIS-WSR-LIPNGRGPVNPKGLQYYNNLINELISYG-IQPHVTL 139 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~-WsR-I~P~g~g~~n~~~l~~y~~~id~l~~~G-I~p~vtL 139 (262)
..-++.++.|++.|++.+.+|+. ++. +...-....+ .+.+.+.++.+++.| +.+-+.+
T Consensus 97 ~~~~~~~~~l~~~~~~~i~isl~~~~~~~~~~~~~~~~---~~~~~~~i~~~~~~g~~~v~~~~ 157 (216)
T smart00729 97 TLTEELLEALKEAGVNRVSLGVQSGSDEVLKAINRGHT---VEDVLEAVEKLREAGPIKVSTDL 157 (216)
T ss_pred cCCHHHHHHHHHcCCCeEEEecccCCHHHHHHhcCCCC---HHHHHHHHHHHHHhCCcceEEeE
Confidence 34478899999999998888885 543 2211011122 578899999999999 5544433
No 165
>PF11790 Glyco_hydro_cc: Glycosyl hydrolase catalytic core; InterPro: IPR024655 This entry represents the glycosyl hydrolase catalytic core of a group of uncharacterised proteins.
Probab=38.43 E-value=25 Score=31.26 Aligned_cols=24 Identities=13% Similarity=0.022 Sum_probs=0.0
Q ss_pred HHHHHHHHHhCCCCceEEeccCCch
Q 024821 166 AYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 166 ~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
+..+.+.+. +..+++.+.||||+.
T Consensus 54 ~~~~~v~~~-~~~~~~ll~fNEPD~ 77 (239)
T PF11790_consen 54 DWLANVQNA-HPGSKHLLGFNEPDL 77 (239)
T ss_pred HHHHHHHhh-ccCccceeeecCCCC
No 166
>TIGR00538 hemN oxygen-independent coproporphyrinogen III oxidase. This model represents HemN, the oxygen-independent coproporphyrinogen III oxidase that replaces HemF function under anaerobic conditions. Several species, including E. coli, Helicobacter pylori, and Aquifex aeolicus, have both a member of this family and a member of another, closely related family for which there is no evidence of coproporphyrinogen III oxidase activity. Members of this family have a perfectly conserved motif PYRT[SC]YP in a region N-terminal to the region of homology with the related uncharacterized protein.
Probab=38.36 E-value=27 Score=34.01 Aligned_cols=50 Identities=14% Similarity=0.236 Sum_probs=32.9
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC---CCCCChhhhhHHHHHHHHHHHcCCc
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG---RGPVNPKGLQYYNNLINELISYGIQ 134 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g---~g~~n~~~l~~y~~~id~l~~~GI~ 134 (262)
-++.+++|+++|++.+-+++ ...-++. -+... ..+.+.+.++.+++.|++
T Consensus 150 ~~e~l~~lk~~G~~risiGv--qS~~~~~l~~l~r~~--~~~~~~~ai~~l~~~G~~ 202 (455)
T TIGR00538 150 TKDVIDALRDEGFNRLSFGV--QDFNKEVQQAVNRIQ--PEEMIFELMNHAREAGFT 202 (455)
T ss_pred CHHHHHHHHHcCCCEEEEcC--CCCCHHHHHHhCCCC--CHHHHHHHHHHHHhcCCC
Confidence 46889999999998555444 4433321 12211 245678889999999986
No 167
>PRK05904 coproporphyrinogen III oxidase; Provisional
Probab=38.05 E-value=81 Score=29.79 Aligned_cols=92 Identities=14% Similarity=0.222 Sum_probs=53.9
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC---CCCCChhhhhHHHHHHHHHHHcCCc-ceeeccCCCCcHhHHHhhCCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG---RGPVNPKGLQYYNNLINELISYGIQ-PHVTLHHLDLPQALEDEYGGWI 156 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g---~g~~n~~~l~~y~~~id~l~~~GI~-p~vtL~H~~~P~~l~~~~gGw~ 156 (262)
-+|.+++|+++|++.+-+++ ...-++- -++.. ..+-+.+.++.+++.|+. +-+++. +++|.
T Consensus 102 t~e~l~~lk~~G~nrisiGv--QS~~d~vL~~l~R~~--~~~~~~~ai~~lr~~G~~~v~~dlI-~GlPg---------- 166 (353)
T PRK05904 102 TQSQINLLKKNKVNRISLGV--QSMNNNILKQLNRTH--TIQDSKEAINLLHKNGIYNISCDFL-YCLPI---------- 166 (353)
T ss_pred CHHHHHHHHHcCCCEEEEec--ccCCHHHHHHcCCCC--CHHHHHHHHHHHHHcCCCcEEEEEe-ecCCC----------
Confidence 36889999999998554444 3333221 12211 245678899999999997 445553 45552
Q ss_pred ChhhHHHHHHHHHHHHHHhC-CCCceEEeccCCch
Q 024821 157 NRMIVKDFTAYADVCFREFG-DRVSYWTTVNEPNG 190 (262)
Q Consensus 157 ~~~~~~~F~~ya~~v~~~~g-d~V~~w~t~NEP~~ 190 (262)
++.+.|.+=.+.+. +++ +.|..+...=||..
T Consensus 167 --qt~e~~~~tl~~~~-~l~p~~is~y~L~~~~gT 198 (353)
T PRK05904 167 --LKLKDLDEVFNFIL-KHKINHISFYSLEIKEGS 198 (353)
T ss_pred --CCHHHHHHHHHHHH-hcCCCEEEEEeeEecCCC
Confidence 33455666566543 343 44544444445553
No 168
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=37.93 E-value=27 Score=20.18 Aligned_cols=15 Identities=27% Similarity=0.541 Sum_probs=12.2
Q ss_pred HHHHHHHHHHcCCcc
Q 024821 121 YNNLINELISYGIQP 135 (262)
Q Consensus 121 y~~~id~l~~~GI~p 135 (262)
-.++++.+++.||+|
T Consensus 20 a~~~~~~M~~~gv~P 34 (34)
T PF13812_consen 20 ALQLFDEMKEQGVKP 34 (34)
T ss_pred HHHHHHHHHHhCCCC
Confidence 467788888899987
No 169
>TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative. This family is conserved at better than 40 % identity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.
Probab=37.72 E-value=43 Score=29.91 Aligned_cols=59 Identities=7% Similarity=0.054 Sum_probs=38.8
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.++++-+++++++|.+.++++-. +..+.. ..+.-...++.++++.+.+.++||...+=.
T Consensus 94 ~~~~~~i~~a~~lG~~~v~~~~~--~~~~~~~~~~~~~~~~~~l~~l~~~A~~~Gv~l~lE~ 153 (279)
T TIGR00542 94 EIMEKAIQLARDLGIRTIQLAGY--DVYYEEHDEETRRRFREGLKEAVELAARAQVTLAVEI 153 (279)
T ss_pred HHHHHHHHHHHHhCCCEEEecCc--ccccCcCCHHHHHHHHHHHHHHHHHHHHcCCEEEEee
Confidence 45677789999999999998521 111110 111112356777888899999999877743
No 170
>PRK09856 fructoselysine 3-epimerase; Provisional
Probab=37.65 E-value=36 Score=30.17 Aligned_cols=60 Identities=7% Similarity=0.011 Sum_probs=39.0
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceee
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVT 138 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vt 138 (262)
...++.-|++.+.+|.+.+++........+. ..+.-+..++.++++.+.+.++||+..+=
T Consensus 89 ~~~~~~~i~~a~~lGa~~i~~~~~~~~~~~~-~~~~~~~~~~~l~~l~~~a~~~gv~l~iE 148 (275)
T PRK09856 89 LDMIKLAMDMAKEMNAGYTLISAAHAGYLTP-PNVIWGRLAENLSELCEYAENIGMDLILE 148 (275)
T ss_pred HHHHHHHHHHHHHhCCCEEEEcCCCCCCCCC-HHHHHHHHHHHHHHHHHHHHHcCCEEEEe
Confidence 4456667899999999999986432211111 11111345677888899999999876543
No 171
>PRK10658 putative alpha-glucosidase; Provisional
Probab=37.59 E-value=2e+02 Score=29.84 Aligned_cols=103 Identities=16% Similarity=0.184 Sum_probs=63.2
Q ss_pred HHHHHHcCC--CcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC---CCcHhHHHh--------
Q 024821 85 VKLMADTGL--DAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL---DLPQALEDE-------- 151 (262)
Q Consensus 85 i~l~k~lG~--~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~---~~P~~l~~~-------- 151 (262)
++.+++.|+ +++-+++.|.+-.--+.-.+|++.+---++||++|+++|++.++.+.-+ +.|.+-+..
T Consensus 289 ~~~~r~~~iP~d~i~lD~~w~~~~~~~~f~wd~~~FPdp~~mi~~L~~~G~k~~~~i~P~i~~~s~~f~e~~~~gy~vk~ 368 (665)
T PRK10658 289 IDGMAERDLPLHVFHFDCFWMKEFQWCDFEWDPRTFPDPEGMLKRLKAKGLKICVWINPYIAQKSPLFKEGKEKGYLLKR 368 (665)
T ss_pred HHHHHHcCCCceEEEEchhhhcCCceeeeEEChhhCCCHHHHHHHHHHCCCEEEEeccCCcCCCchHHHHHHHCCeEEEC
Confidence 455666665 4667777775532111345565555556889999999999988776522 222221110
Q ss_pred ----------------hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCc
Q 024821 152 ----------------YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 152 ----------------~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
+-.+.||+..+.|.+..+.+.+ .| ---+|.=+||+.
T Consensus 369 ~~G~~~~~~~W~g~~~~~Dftnp~ar~W~~~~~~~l~d-~G-vdgfw~D~gE~~ 420 (665)
T PRK10658 369 PDGSVWQWDKWQPGMAIVDFTNPDACKWYADKLKGLLD-MG-VDCFKTDFGERI 420 (665)
T ss_pred CCCCEeeeeecCCCceeecCCCHHHHHHHHHHHHHHHh-cC-CcEEEecCCcee
Confidence 0125789999999888877554 44 223567788863
No 172
>TIGR01211 ELP3 histone acetyltransferase, ELP3 family. The Saccharomyces cerevisiae member YPL086C has been characterized in vitro as an N-terminal acetyltransferase for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases.
Probab=36.89 E-value=62 Score=32.52 Aligned_cols=104 Identities=12% Similarity=0.107 Sum_probs=59.9
Q ss_pred cHHHHHHHHHcCCCcceeccc--cCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCCh
Q 024821 81 YKEDVKLMADTGLDAYRFSIS--WSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINR 158 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~--WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~ 158 (262)
-++.+++|+++|++.+-++++ -.+++-. -+.- -..+-..+.++.+++.|+++.+.|. +++|.
T Consensus 205 ~~e~L~~L~~~G~~rVslGVQS~~d~VL~~-inRg--ht~~~v~~Ai~~lr~~G~~v~~~LM-~GLPg------------ 268 (522)
T TIGR01211 205 REEHIDRMLKLGATRVELGVQTIYNDILER-TKRG--HTVRDVVEATRLLRDAGLKVVYHIM-PGLPG------------ 268 (522)
T ss_pred CHHHHHHHHHcCCCEEEEECccCCHHHHHH-hCCC--CCHHHHHHHHHHHHHcCCeEEEEee-cCCCC------------
Confidence 468899999999986666652 2223322 1111 1245567888999999998666664 45552
Q ss_pred hhHHHHHHHHHHHHH--HhC-CCCceEEeccCCchhccccccccC
Q 024821 159 MIVKDFTAYADVCFR--EFG-DRVSYWTTVNEPNGFAMVGYDFGI 200 (262)
Q Consensus 159 ~~~~~F~~ya~~v~~--~~g-d~V~~w~t~NEP~~~~~~gy~~g~ 200 (262)
++.+.+.+=++.+++ .++ |.|+.+-+.=+|+.....-|..|.
T Consensus 269 qt~e~~~~t~~~l~~~~~l~pD~Ikiypl~V~~gT~L~~~~~~G~ 313 (522)
T TIGR01211 269 SSFERDLEMFREIFEDPRFKPDMLKIYPTLVTRGTELYELWKRGE 313 (522)
T ss_pred CCHHHHHHHHHHHHhccCCCcCEEEEecceeeCCCHHHHHHHcCC
Confidence 223455555555654 344 556666655555544333333343
No 173
>TIGR02104 pulA_typeI pullulanase, type I. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family consists of pullulanases related to the subfamilies described in TIGR02102 and TIGR02103 but having a different domain architecture with shorter sequences. Members are called type I pullulanases.
Probab=36.26 E-value=76 Score=32.23 Aligned_cols=59 Identities=19% Similarity=0.215 Sum_probs=37.9
Q ss_pred hhHHHHHHHHHHHcCCcceeec--cCCC----------CcHhHHH--h------hCCC------CChhhHHHHHHHHHHH
Q 024821 118 LQYYNNLINELISYGIQPHVTL--HHLD----------LPQALED--E------YGGW------INRMIVKDFTAYADVC 171 (262)
Q Consensus 118 l~~y~~~id~l~~~GI~p~vtL--~H~~----------~P~~l~~--~------~gGw------~~~~~~~~F~~ya~~v 171 (262)
.+=++++|++|.++||++|+++ .|.. .|.|... . +.|+ .++++.+.+.+-++.-
T Consensus 228 ~~efk~lV~~~H~~Gi~VilDvV~NH~~~~~~~~f~~~~~~~~~~~~~~g~~~~~~g~~~~~~~~~~~v~~~i~~~~~~W 307 (605)
T TIGR02104 228 IRELKQMIQALHENGIRVIMDVVYNHTYSREESPFEKTVPGYYYRYNEDGTLSNGTGVGNDTASEREMMRKFIVDSVLYW 307 (605)
T ss_pred HHHHHHHHHHHHHCCCEEEEEEEcCCccCCCCCcccCCCCCeeEEECCCCCccCCCcccCCcccCCHHHHHHHHHHHHHH
Confidence 4569999999999999999985 3642 1222210 0 1122 2566777777777777
Q ss_pred HHHhC
Q 024821 172 FREFG 176 (262)
Q Consensus 172 ~~~~g 176 (262)
+++||
T Consensus 308 ~~e~~ 312 (605)
T TIGR02104 308 VKEYN 312 (605)
T ss_pred HHHcC
Confidence 77765
No 174
>COG3661 AguA Alpha-glucuronidase [Carbohydrate transport and metabolism]
Probab=35.89 E-value=1.7e+02 Score=29.25 Aligned_cols=90 Identities=17% Similarity=0.274 Sum_probs=59.7
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCCC--CCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCC-
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNGR--GPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGW- 155 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~--g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw- 155 (262)
.|.++-.++++++|+|..-+.= +--... --++..-+....++-|..+.+||++++++. |..|..| ||-
T Consensus 183 qR~kDYAR~laSiGINg~v~NN----VNvk~~e~~lit~~fl~k~aklAdiFR~YGIK~yLsin-faSP~~l----GgL~ 253 (684)
T COG3661 183 QRMKDYARALASIGINGTVLNN----VNVKKAESYLITAPFLAKAAKLADIFRPYGIKVYLSIN-FASPMEL----GGLK 253 (684)
T ss_pred HHHHHHHHHHhhcCcceEEecc----cccchhhhheechHhHHHHHHHHHHhhhccceEEEEec-cCCcccc----CCcC
Confidence 5677778899999999876531 110100 113444566778888999999999999984 7888765 664
Q ss_pred ----CChhhHHHHHHHHHHHHHHhCC
Q 024821 156 ----INRMIVKDFTAYADVCFREFGD 177 (262)
Q Consensus 156 ----~~~~~~~~F~~ya~~v~~~~gd 177 (262)
+++.+...|.+=|+.+.+-.-|
T Consensus 254 TADPLDe~VrawWkeka~~IY~yIPD 279 (684)
T COG3661 254 TADPLDEAVRAWWKEKADEIYKYIPD 279 (684)
T ss_pred cCCcccHHHHHHHHHHHHHHHHhccc
Confidence 3455666666666666554433
No 175
>PRK13398 3-deoxy-7-phosphoheptulonate synthase; Provisional
Probab=35.82 E-value=1e+02 Score=28.10 Aligned_cols=73 Identities=10% Similarity=0.008 Sum_probs=51.5
Q ss_pred CCccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHH
Q 024821 74 ACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALE 149 (262)
Q Consensus 74 a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~ 149 (262)
..+......+=.+.+|++|+..+|-++.=+|--|.+--.+-++ .++.+-+.+++.||..+.+.++-..+..+.
T Consensus 36 sie~~~~~~~~A~~lk~~g~~~~r~~~~kpRTs~~s~~G~g~~---gl~~l~~~~~~~Gl~~~te~~d~~~~~~l~ 108 (266)
T PRK13398 36 AVESEEQMVKVAEKLKELGVHMLRGGAFKPRTSPYSFQGLGEE---GLKILKEVGDKYNLPVVTEVMDTRDVEEVA 108 (266)
T ss_pred cCCCHHHHHHHHHHHHHcCCCEEEEeeecCCCCCCccCCcHHH---HHHHHHHHHHHcCCCEEEeeCChhhHHHHH
Confidence 3445666677788899999999999976577777531112244 456666778999999999988665555543
No 176
>PRK01060 endonuclease IV; Provisional
Probab=35.08 E-value=1.2e+02 Score=26.94 Aligned_cols=52 Identities=17% Similarity=0.234 Sum_probs=36.4
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcc
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQP 135 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p 135 (262)
..+++-++.++++|++.+=+.+.-++.... +..+++. .+++-+.+.++|++.
T Consensus 12 ~~~~~~l~~~~~~G~d~vEl~~~~p~~~~~--~~~~~~~---~~~lk~~~~~~gl~~ 63 (281)
T PRK01060 12 GGLEGAVAEAAEIGANAFMIFTGNPQQWKR--KPLEELN---IEAFKAACEKYGISP 63 (281)
T ss_pred CCHHHHHHHHHHcCCCEEEEECCCCCCCcC--CCCCHHH---HHHHHHHHHHcCCCC
Confidence 347888999999999999987765554322 2345433 455556677999984
No 177
>TIGR02631 xylA_Arthro xylose isomerase, Arthrobacter type. This model describes a D-xylose isomerase that is also active as a D-glucose isomerase. It is tetrameric and dependent on a divalent cation Mg2+, Co2+ or Mn2+ as characterized in Arthrobacter. Members of this family differ substantially from the D-xylose isomerases of family TIGR02630.
Probab=34.88 E-value=2.9e+02 Score=26.52 Aligned_cols=57 Identities=21% Similarity=0.233 Sum_probs=37.5
Q ss_pred cccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCccee
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHV 137 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~v 137 (262)
..-...+-++.++++|++.+= +....+.|-+ -...+.. ...+++-+.|.++||++..
T Consensus 30 ~~~~~~e~i~~la~~GfdgVE--~~~~dl~P~~-~~~~e~~-~~~~~lk~~L~~~GL~v~~ 86 (382)
T TIGR02631 30 TALDPVEAVHKLAELGAYGVT--FHDDDLIPFG-APPQERD-QIVRRFKKALDETGLKVPM 86 (382)
T ss_pred CCcCHHHHHHHHHHhCCCEEE--ecccccCCCC-CChhHHH-HHHHHHHHHHHHhCCeEEE
Confidence 334578889999999999884 3334566652 1111111 3367788889999999654
No 178
>TIGR02635 RhaI_grampos L-rhamnose isomerase, Streptomyces subtype. This clade of sequences is closely related to the L-rhamnose isomerases found in Pseudomonas stutzeri and in a number of the Rhizobiales (TIGR02629). The genes of the family represented here are found in similar genomic contexts which contain genes apparently involved in rhamnose catabolism such as rhamnulose-1-phosphate aldolase (TIGR02632), sugar kinases, and sugar transporters.
Probab=34.72 E-value=1.9e+02 Score=27.82 Aligned_cols=87 Identities=17% Similarity=0.278 Sum_probs=54.5
Q ss_pred cHHHHHHHHHc-CCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCcHhHHHhhCCCCCh
Q 024821 81 YKEDVKLMADT-GLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLPQALEDEYGGWINR 158 (262)
Q Consensus 81 ~~eDi~l~k~l-G~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~l~~~~gGw~~~ 158 (262)
--+|++.++++ +++ .++++- .|. +.. ..+.++.+.++++||+.. ++...|..|.+ ++|...|+
T Consensus 42 ~~~d~~~v~~L~~~~-~~v~lH----~~~-d~~------~d~~~~~~~l~~~GL~v~~i~p~~f~~~~~---~~GSLt~p 106 (378)
T TIGR02635 42 KIEDAALVHRLTGIC-PTVALH----IPW-DRV------EDYEELARYAEELGLKIGAINPNLFQDDDY---KFGSLTHP 106 (378)
T ss_pred HHHHHHHHHhhcCCC-Cceeec----cCC-ccc------cCHHHHHHHHHHcCCceeeeeCCccCCccc---CCCCCCCC
Confidence 45777777777 555 666652 221 111 237888888999999987 77766655543 34666655
Q ss_pred --hhHHHHHHHHHHH---HHHhCCC-CceE
Q 024821 159 --MIVKDFTAYADVC---FREFGDR-VSYW 182 (262)
Q Consensus 159 --~~~~~F~~ya~~v---~~~~gd~-V~~w 182 (262)
++.+...++.+.| ++.+|.. |..|
T Consensus 107 D~~vR~~AIe~~k~~idiA~eLGa~~I~iW 136 (378)
T TIGR02635 107 DKRIRRKAIDHLLECVDIAKKTGSKDISLW 136 (378)
T ss_pred CHHHHHHHHHHHHHHHHHHHHhCCCeEEEe
Confidence 4666777777655 5677774 4444
No 179
>cd06525 GH25_Lyc-like Lyc muramidase is an autolytic lysozyme (autolysin) from Clostridium acetobutylicum encoded by the lyc gene. Lyc has a glycosyl hydrolase family 25 (GH25) catalytic domain. Endo-N-acetylmuramidases are lysozymes (also referred to as peptidoglycan hydrolases) that degrade bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues.
Probab=34.02 E-value=2.8e+02 Score=23.22 Aligned_cols=30 Identities=13% Similarity=0.131 Sum_probs=19.4
Q ss_pred CCCChhhHHHHHHHHHHHHHHhCCCCceEE
Q 024821 154 GWINRMIVKDFTAYADVCFREFGDRVSYWT 183 (262)
Q Consensus 154 Gw~~~~~~~~F~~ya~~v~~~~gd~V~~w~ 183 (262)
+....+..+....|++.+-++.|-++-.+.
T Consensus 95 ~~~~~~~~~~~~~f~~~v~~~~G~~~~iY~ 124 (184)
T cd06525 95 GLSKDELNDYVLRFIEEFEKLSGLKVGIYT 124 (184)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCeEEEe
Confidence 333344557778888888777777665544
No 180
>PRK11572 copper homeostasis protein CutC; Provisional
Probab=33.60 E-value=94 Score=28.25 Aligned_cols=41 Identities=12% Similarity=0.165 Sum_probs=29.6
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHH
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINEL 128 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l 128 (262)
....+||+.++++|++.+=|++. -| +|.+|.+. ..++++.+
T Consensus 73 ~~M~~di~~~~~~GadGvV~G~L----~~--dg~vD~~~---~~~Li~~a 113 (248)
T PRK11572 73 AAMLEDIATVRELGFPGLVTGVL----DV--DGHVDMPR---MRKIMAAA 113 (248)
T ss_pred HHHHHHHHHHHHcCCCEEEEeeE----CC--CCCcCHHH---HHHHHHHh
Confidence 34678999999999999999874 22 47888543 45555554
No 181
>cd02803 OYE_like_FMN_family Old yellow enzyme (OYE)-like FMN binding domain. OYE was the first flavin-dependent enzyme identified, however its true physiological role remains elusive to this day. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase.
Probab=33.26 E-value=73 Score=29.18 Aligned_cols=39 Identities=15% Similarity=0.192 Sum_probs=32.5
Q ss_pred ccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC
Q 024821 104 RLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL 142 (262)
Q Consensus 104 RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~ 142 (262)
+..|...+-++++-+.-++++++.++++|-..++=|.|-
T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~vh~~g~~~~~Ql~h~ 100 (327)
T cd02803 62 KGYPGQLGIYDDEQIPGLRKLTEAVHAHGAKIFAQLAHA 100 (327)
T ss_pred cCCCCCcCcCCHHHHHHHHHHHHHHHhCCCHhhHHhhCC
Confidence 344543567888999999999999999999999999884
No 182
>PRK05692 hydroxymethylglutaryl-CoA lyase; Provisional
Probab=32.61 E-value=3.1e+02 Score=25.15 Aligned_cols=52 Identities=15% Similarity=0.188 Sum_probs=37.6
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcce
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPH 136 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~ 136 (262)
+.++-++..++.|+. ++.+|+-+--.|. .+..+ .+++.++++.+.+.|..-+
T Consensus 121 ~~~~~v~~ak~~g~~-v~~~i~~~~~~~~-~~~~~---~~~~~~~~~~~~~~G~d~i 172 (287)
T PRK05692 121 RFEPVAEAAKQAGVR-VRGYVSCVLGCPY-EGEVP---PEAVADVAERLFALGCYEI 172 (287)
T ss_pred HHHHHHHHHHHcCCE-EEEEEEEEecCCC-CCCCC---HHHHHHHHHHHHHcCCcEE
Confidence 455667888999974 6766665544465 45666 6789999999999998743
No 183
>TIGR03551 F420_cofH 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit. This enzyme, together with CofG, complete the biosynthesis of 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, the chromophore of coenzyme F420. The chromophore is also used in cyanobacteria DNA photolyases.
Probab=32.33 E-value=1.6e+02 Score=27.49 Aligned_cols=90 Identities=14% Similarity=0.112 Sum_probs=51.3
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccC-----CCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPN-----GRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGW 155 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~-----g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw 155 (262)
.+|.++.||++|++.+= +.+.+-+-|+ .+++.. .+-..+.++.+++.||++-.++. +++|
T Consensus 140 ~~e~l~~LkeAGl~~i~-~~~~E~~~~~v~~~i~~~~~~---~~~~~~~i~~a~~~Gi~v~s~~i-~G~~---------- 204 (343)
T TIGR03551 140 VEEALKRLKEAGLDSMP-GTAAEILDDEVRKVICPDKLS---TAEWIEIIKTAHKLGIPTTATIM-YGHV---------- 204 (343)
T ss_pred HHHHHHHHHHhCccccc-CcchhhcCHHHHHhcCCCCCC---HHHHHHHHHHHHHcCCcccceEE-EecC----------
Confidence 47789999999999885 2222211111 012222 44457899999999998766543 1222
Q ss_pred CChhhHHHHHHHHHHHHHHhCCCCceEEecc
Q 024821 156 INRMIVKDFTAYADVCFREFGDRVSYWTTVN 186 (262)
Q Consensus 156 ~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~N 186 (262)
.+.+.......+.+.+-..++. +.-.+++|
T Consensus 205 Et~ed~~~~l~~lr~l~~~~~~-~~~~iP~~ 234 (343)
T TIGR03551 205 ETPEHWVDHLLILREIQEETGG-FTEFVPLP 234 (343)
T ss_pred CCHHHHHHHHHHHHHhhHHhCC-eeEEEecc
Confidence 2333445555555555555543 34455655
No 184
>PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=32.31 E-value=3.6e+02 Score=23.84 Aligned_cols=53 Identities=19% Similarity=0.198 Sum_probs=34.7
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcce
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPH 136 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~ 136 (262)
.+++-++.++++|++.+=+++.=....+. ...++. ...+.+-+.+.++||++.
T Consensus 22 ~~~e~~~~~~~~G~~~iEl~~~~~~~~~~-~~~~~~---~~~~~l~~~l~~~gl~i~ 74 (283)
T PRK13209 22 CWLEKLAIAKTAGFDFVEMSVDESDERLA-RLDWSR---EQRLALVNALVETGFRVN 74 (283)
T ss_pred CHHHHHHHHHHcCCCeEEEecCccccchh-ccCCCH---HHHHHHHHHHHHcCCcee
Confidence 47899999999999998775421111111 111232 346778888899999874
No 185
>PRK08207 coproporphyrinogen III oxidase; Provisional
Probab=32.16 E-value=44 Score=33.16 Aligned_cols=60 Identities=18% Similarity=0.262 Sum_probs=39.0
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC---CCCCChhhhhHHHHHHHHHHHcCCc-ceeeccCCCCc
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG---RGPVNPKGLQYYNNLINELISYGIQ-PHVTLHHLDLP 145 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g---~g~~n~~~l~~y~~~id~l~~~GI~-p~vtL~H~~~P 145 (262)
-+|-++.|+++|++ |+||.-...-++- -|.. -..+-..+.++.+++.|+. +-++|. +++|
T Consensus 268 t~e~L~~Lk~~Gv~--RISIGvQS~~d~vLk~igR~--ht~e~v~~ai~~ar~~Gf~~In~DLI-~GLP 331 (488)
T PRK08207 268 TEEKLEVLKKYGVD--RISINPQTMNDETLKAIGRH--HTVEDIIEKFHLAREMGFDNINMDLI-IGLP 331 (488)
T ss_pred CHHHHHHHHhcCCC--eEEEcCCcCCHHHHHHhCCC--CCHHHHHHHHHHHHhCCCCeEEEEEE-eCCC
Confidence 36789999999998 6666554443321 1221 1356678889999999994 445553 4555
No 186
>PF01261 AP_endonuc_2: Xylose isomerase-like TIM barrel; InterPro: IPR012307 This TIM alpha/beta barrel structure is found in xylose isomerase (P19148 from SWISSPROT) and in endonuclease IV (P12638 from SWISSPROT, 3.1.21.2 from EC). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae []. ; PDB: 3KWS_B 3DX5_A 3CQH_B 3CQI_A 3CQK_A 3CQJ_B 2G0W_B 1DXI_A 2ZDS_D 3TVA_B ....
Probab=32.13 E-value=38 Score=27.99 Aligned_cols=61 Identities=11% Similarity=0.058 Sum_probs=40.8
Q ss_pred ccccHHHHHHHHHcCCCcceeccccCccccCCC-CCCChhhhhHHHHHHHHHHHcCCcceee
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSISWSRLIPNGR-GPVNPKGLQYYNNLINELISYGIQPHVT 138 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~-g~~n~~~l~~y~~~id~l~~~GI~p~vt 138 (262)
....++-+++++.+|++.+++...+-...+... .+--....+.++++.+.+.++|+++.+=
T Consensus 70 ~~~~~~~i~~a~~lg~~~i~~~~g~~~~~~~~~~~~~~~~~~~~l~~l~~~a~~~gv~i~lE 131 (213)
T PF01261_consen 70 LEYLKKAIDLAKRLGAKYIVVHSGRYPSGPEDDTEENWERLAENLRELAEIAEEYGVRIALE 131 (213)
T ss_dssp HHHHHHHHHHHHHHTBSEEEEECTTESSSTTSSHHHHHHHHHHHHHHHHHHHHHHTSEEEEE
T ss_pred HHHHHHHHHHHHHhCCCceeecCcccccccCCCHHHHHHHHHHHHHHHHhhhhhhcceEEEe
Confidence 456788899999999999999876411111100 1111235677888888899999876653
No 187
>cd08576 GDPD_like_SMaseD_PLD Glycerophosphodiester phosphodiesterase-like domain of spider venom sphingomyelinases D, bacterial phospholipase D, and similar proteins. This subfamily corresponds to the glycerophosphodiester phosphodiesterase-like domain (GDPD-like) present in sphingomyelinases D (SMases D) (sphingomyelin phosphodiesterase D, EC 3.1.4.4) from spider venom, the Corynebacterium pseudotuberculosis Phospholipase D (PLD)-like protein from pathogenic bacteria, and the Ajellomyces capsulatus H143 PLD-like protein from ascomycetes. Spider SMases D and bacterial PLD proteins catalyze the Mg2+-dependent hydrolysis of sphingomyelin producing choline and ceramide 1-phosphate (C1P), which possess a number of biological functions, such as regulating cell proliferation and apoptosis, participating in inflammatory responses, and playing a key role in phagocytosis. In the presence of Mg2+, SMases D can function as lysophospholipase D and hydrolyze lysophosphatidylcholine (LPC) to choline
Probab=31.94 E-value=3.5e+02 Score=24.81 Aligned_cols=103 Identities=11% Similarity=0.078 Sum_probs=68.4
Q ss_pred HHHHHHHHHcCCCcceeccc-cCccccCC---CCCC----ChhhhhHHHHHHHHHHHcC---CcceeeccCCCCcHhHHH
Q 024821 82 KEDVKLMADTGLDAYRFSIS-WSRLIPNG---RGPV----NPKGLQYYNNLINELISYG---IQPHVTLHHLDLPQALED 150 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~-WsRI~P~g---~g~~----n~~~l~~y~~~id~l~~~G---I~p~vtL~H~~~P~~l~~ 150 (262)
.+.|+-.-++|.|++=+.|. |+-=.|.- .... +....+..+++++.+++.- -.--++|. ||.-
T Consensus 10 ~~~v~~~l~~GANaiE~Dv~f~~~~~~~~~~Hg~pcdc~r~c~~~~~f~~~l~~~r~~ttpg~~~~l~lv------~lDl 83 (265)
T cd08576 10 LEGVDDALDHGANAIEIDVTFWSNGTGWWADHDVPCDCFRGCTAREMFDEILDYRRNGTTPGFRENLIFV------WLDL 83 (265)
T ss_pred HHHHHHHHHcCCCceeEEEEEccCCcEEEeeCCCccccccCCcHHHHHHHHHHHHHhcCCCCccceeEEE------EEEc
Confidence 46778888999999999995 33222210 0111 3344566777777776652 11223332 2222
Q ss_pred hhCCCCChhhHHHHHHHHHHHHHHhCCC---CceEEeccCCch
Q 024821 151 EYGGWINRMIVKDFTAYADVCFREFGDR---VSYWTTVNEPNG 190 (262)
Q Consensus 151 ~~gGw~~~~~~~~F~~ya~~v~~~~gd~---V~~w~t~NEP~~ 190 (262)
|-++...+...+...+.|+.+.++|+.. +..++.+..|.+
T Consensus 84 K~~~~~~~~~~~ag~~la~~ll~~~w~~~~~~ra~~~~s~~~~ 126 (265)
T cd08576 84 KNPDLCGECSINAGRDLARKLLEPYWNGGSGARALYGFSIPSI 126 (265)
T ss_pred CCCCcCHHHHHHHHHHHHHHHHHHhhcCCCCeEEEEEEEeccc
Confidence 4466777788999999999999999955 888999999985
No 188
>PRK07094 biotin synthase; Provisional
Probab=31.86 E-value=74 Score=29.19 Aligned_cols=57 Identities=14% Similarity=0.163 Sum_probs=39.7
Q ss_pred ccHHHHHHHHHcCCCcceeccc-c-CccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 80 KYKEDVKLMADTGLDAYRFSIS-W-SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~-W-sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.-+|+++.|+++|++.+-++++ - +++...-.... ..+-+.+.++.+++.||.+..++
T Consensus 127 ~~~e~l~~Lk~aG~~~v~~glEs~~~~~~~~i~~~~---s~~~~~~~i~~l~~~Gi~v~~~~ 185 (323)
T PRK07094 127 RSYEEYKAWKEAGADRYLLRHETADKELYAKLHPGM---SFENRIACLKDLKELGYEVGSGF 185 (323)
T ss_pred CCHHHHHHHHHcCCCEEEeccccCCHHHHHHhCCCC---CHHHHHHHHHHHHHcCCeecceE
Confidence 4578999999999999888874 3 23443311112 25668889999999999765443
No 189
>TIGR00423 radical SAM domain protein, CofH subfamily. This protein family includes the CofH protein of coenzyme F(420) biosynthesis from Methanocaldococcus jannaschii, but appears to hit genomes more broadly than just the subset that make coenzyme F(420), so that narrower group is being built as a separate family.
Probab=31.78 E-value=1.6e+02 Score=26.91 Aligned_cols=54 Identities=20% Similarity=0.327 Sum_probs=37.7
Q ss_pred cHHHHHHHHHcCCCcce-eccc-c-----CccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 81 YKEDVKLMADTGLDAYR-FSIS-W-----SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R-~si~-W-----sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
.+|.++.||++|++.+- .+.+ - .++.|+ ... .+.+.+.++.+++.||++..++.
T Consensus 106 ~~e~l~~LkeAGl~~i~~~g~E~l~~~~~~~i~~~---~~t---~~~~l~~i~~a~~~Gi~~~s~~i 166 (309)
T TIGR00423 106 IEEVLKRLKKAGLDSMPGTGAEILDDSVRRKICPN---KLS---SDEWLEVIKTAHRLGIPTTATMM 166 (309)
T ss_pred HHHHHHHHHHcCCCcCCCCcchhcCHHHHHhhCCC---CCC---HHHHHHHHHHHHHcCCCceeeEE
Confidence 47889999999999885 2321 1 122232 223 45678999999999999987764
No 190
>PRK06256 biotin synthase; Validated
Probab=31.31 E-value=67 Score=29.69 Aligned_cols=57 Identities=16% Similarity=0.223 Sum_probs=38.6
Q ss_pred ccHHHHHHHHHcCCCcceecc-ccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 80 KYKEDVKLMADTGLDAYRFSI-SWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si-~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.-+|.++.|+++|++.+-+++ +=.++.+.-... ..++.+.+.|+.+++.||++..++
T Consensus 150 l~~e~l~~LkeaG~~~v~~~lEts~~~~~~i~~~---~t~~~~i~~i~~a~~~Gi~v~~~~ 207 (336)
T PRK06256 150 LTEEQAERLKEAGVDRYNHNLETSRSYFPNVVTT---HTYEDRIDTCEMVKAAGIEPCSGG 207 (336)
T ss_pred CCHHHHHHHHHhCCCEEecCCccCHHHHhhcCCC---CCHHHHHHHHHHHHHcCCeeccCe
Confidence 457889999999999888766 312244431111 135667788999999999765443
No 191
>cd00019 AP2Ec AP endonuclease family 2; These endonucleases play a role in DNA repair. Cleave phosphodiester bonds at apurinic or apyrimidinic sites; the alignment also contains hexulose-6-phosphate isomerases, enzymes that catalyze the epimerization of D-arabino-6-hexulose 3-phosphate to D-fructose 6-phosphate, via cleaving the phosphoesterbond with the sugar.
Probab=31.10 E-value=2.1e+02 Score=25.36 Aligned_cols=25 Identities=12% Similarity=0.207 Sum_probs=14.9
Q ss_pred cccHHHHHHHHHcCCCcceeccccC
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWS 103 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~Ws 103 (262)
..+++.+++++++|++.+=+.+...
T Consensus 10 ~~l~~~l~~a~~~G~d~vEl~~~~~ 34 (279)
T cd00019 10 FGLENALKRAKEIGFDTVAMFLGNP 34 (279)
T ss_pred ccHHHHHHHHHHcCCCEEEEEcCCC
Confidence 4456666666666666665555443
No 192
>TIGR02159 PA_CoA_Oxy4 phenylacetate-CoA oxygenase, PaaJ subunit. Phenylacetate-CoA oxygenase is comprised of a five gene complex responsible for the hydroxylation of phenylacetate-CoA (PA-CoA) as the second catabolic step in phenylacetic acid (PA) degradation. Although the exact function of this enzyme has not been determined, it has been shown to be required for phenylacetic acid degradation and has been proposed to function in a multicomponent oxygenase acting on phenylacetate-CoA.
Probab=30.96 E-value=57 Score=27.10 Aligned_cols=56 Identities=18% Similarity=0.318 Sum_probs=39.3
Q ss_pred cCCccccccHHHHHH-HHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCccee
Q 024821 73 VACDEYHKYKEDVKL-MADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHV 137 (262)
Q Consensus 73 ~a~d~y~~~~eDi~l-~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~v 137 (262)
.+|......++||+. ++++|++.+++.+.|+--.- .+.+.++| -+.|+++||.|=.
T Consensus 36 ~gcpa~e~L~~~I~~aL~~~Gv~~V~V~i~~~p~Wt--~d~it~~g-------r~~l~~~giapp~ 92 (146)
T TIGR02159 36 SGCPALEVIRQDIRDAVRALGVEVVEVSTSLDPPWT--TDWITEDA-------REKLREYGIAPPA 92 (146)
T ss_pred CCCchHHHHHHHHHHHHHhcCCCeEEEeEeeCCCCC--hHHCCHHH-------HHHHHhcCccCCC
Confidence 466777778888754 67789999988887765432 34555444 4678999998843
No 193
>PF04055 Radical_SAM: Radical SAM superfamily; InterPro: IPR007197 Radical SAM proteins catalyze diverse reactions, including unusual methylations, isomerization, sulphur insertion, ring formation, anaerobic oxidation and protein radical formation. Evidence exists that these proteins generate a radical species by reductive cleavage of S:-adenosylmethionine (SAM) through an unusual Fe-S centre [, ].; GO: 0003824 catalytic activity, 0051536 iron-sulfur cluster binding; PDB: 2A5H_D 3T7V_A 3C8F_A 3CB8_A 2FB2_A 2FB3_A 3CIX_A 3IIX_A 3IIZ_A 3CIW_A ....
Probab=30.77 E-value=56 Score=25.45 Aligned_cols=53 Identities=17% Similarity=0.177 Sum_probs=37.4
Q ss_pred HHHHHHHHHcCCCcceeccc-cCcc-ccCCCCCCChhhhhHHHHHHHHHHHcCCcce
Q 024821 82 KEDVKLMADTGLDAYRFSIS-WSRL-IPNGRGPVNPKGLQYYNNLINELISYGIQPH 136 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~-WsRI-~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~ 136 (262)
++.++.+++.|++.+++|++ -+.- ..+-.+ ....++..-+.++.|+++|+.++
T Consensus 90 ~~~l~~l~~~~~~~i~~~l~s~~~~~~~~~~~--~~~~~~~~~~~l~~l~~~g~~~~ 144 (166)
T PF04055_consen 90 EELLDELKKLGVDRIRISLESLDEESVLRIIN--RGKSFERVLEALERLKEAGIPRV 144 (166)
T ss_dssp HHHHHHHHHTTCSEEEEEEBSSSHHHHHHHHS--STSHHHHHHHHHHHHHHTTSETE
T ss_pred HHHHHHHHhcCccEEecccccCCHHHhhhhhc--CCCCHHHHHHHHHHHHHcCCCcE
Confidence 89999999999999999984 3221 111001 12346777889999999999863
No 194
>PF09713 A_thal_3526: Plant protein 1589 of unknown function (A_thal_3526); InterPro: IPR006476 This plant-specific family of proteins are defined by an uncharacterised region 57 residues in length. It is found toward the N terminus of most proteins that contain it. Examples include at least several proteins from Arabidopsis thaliana (Mouse-ear cress) and Oryza sativa (Rice). The function of the proteins are unknown.
Probab=30.73 E-value=63 Score=22.46 Aligned_cols=35 Identities=20% Similarity=0.230 Sum_probs=23.5
Q ss_pred HHHHHHHH-HcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHH
Q 024821 122 NNLINELI-SYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAY 167 (262)
Q Consensus 122 ~~~id~l~-~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~y 167 (262)
+++++.|. ..||+|.+|.. .|-. +.+++++.|..|
T Consensus 16 ~E~v~~L~~~a~I~P~~T~~-----VW~~------Le~eN~eFF~aY 51 (54)
T PF09713_consen 16 EECVRALQKQANIEPVFTST-----VWQK------LEKENPEFFKAY 51 (54)
T ss_pred HHHHHHHHHHcCCChHHHHH-----HHHH------HHHHCHHHHHHh
Confidence 67788885 56999999885 4432 223456777766
No 195
>cd07938 DRE_TIM_HMGL 3-hydroxy-3-methylglutaryl-CoA lyase, catalytic TIM barrel domain. 3-hydroxy-3-methylglutaryl-CoA lyase (HMGL) catalyzes the cleavage of HMG-CoA to acetyl-CoA and acetoacetate, one of the terminal steps in ketone body generation and leucine degradation, and is a key enzyme in the pathway that supplies metabolic fuel to extrahepatic tissues. Mutations in HMGL cause a human autosomal recessive disorder called primary metabolic aciduria that affects ketogenesis and leucine catabolism and can be fatal due to an inability to tolerate hypoglycemia. HMGL has a TIM barrel domain with a catalytic center containing a divalent cation-binding site formed by a cluster of invariant residues that cap the core of the barrel. The cleavage of HMG-CoA requires the presence of a divalent cation like Mg2+ or Mn2+, and the reaction is thought to involve general acid/base catalysis. This family belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropy
Probab=30.66 E-value=3.7e+02 Score=24.33 Aligned_cols=51 Identities=20% Similarity=0.224 Sum_probs=37.7
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcce
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPH 136 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~ 136 (262)
..+-++..|+.|.. ++.+++.+--.|. .+..+ .+++.++++.+.+.|..-+
T Consensus 116 ~~~~v~~ak~~G~~-v~~~i~~~f~~~~-~~~~~---~~~~~~~~~~~~~~Ga~~i 166 (274)
T cd07938 116 FEPVAELAKAAGLR-VRGYVSTAFGCPY-EGEVP---PERVAEVAERLLDLGCDEI 166 (274)
T ss_pred HHHHHHHHHHCCCe-EEEEEEeEecCCC-CCCCC---HHHHHHHHHHHHHcCCCEE
Confidence 44557888999984 7777776655566 45666 6788999999999997643
No 196
>TIGR03699 mena_SCO4550 menaquinone biosynthesis protein, SCO4550 family. members of this protein family are involved in menaquinone biosynthesis by an alternate pathway via futalosine.
Probab=30.16 E-value=2.3e+02 Score=26.20 Aligned_cols=53 Identities=15% Similarity=0.283 Sum_probs=34.8
Q ss_pred cHHHHHHHHHcCCCcceec----ccc---CccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 81 YKEDVKLMADTGLDAYRFS----ISW---SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~s----i~W---sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.+|.++.||++|++.+-.. ++. .++.|. .. ..+-+.+.++.+++.||++..++
T Consensus 142 ~~e~l~~Lk~aG~~~~~~~g~E~~~~~~~~~~~~~---~~---s~~~~l~~i~~a~~~Gi~v~~~~ 201 (340)
T TIGR03699 142 LREVLERLKEAGLDSIPGGGAEILSDRVRKIISPK---KI---SSEEWLEVMETAHKLGLPTTATM 201 (340)
T ss_pred HHHHHHHHHHcCCCcCCCCcccccCHHHHHhhCCC---CC---CHHHHHHHHHHHHHcCCCcccee
Confidence 3788999999999977421 110 111222 12 25567899999999999876554
No 197
>PRK03892 ribonuclease P protein component 3; Provisional
Probab=29.78 E-value=1.6e+02 Score=26.30 Aligned_cols=87 Identities=13% Similarity=0.127 Sum_probs=66.3
Q ss_pred HHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc---CCCC--cHhHHH--hhCCCCChh
Q 024821 87 LMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH---HLDL--PQALED--EYGGWINRM 159 (262)
Q Consensus 87 l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~---H~~~--P~~l~~--~~gGw~~~~ 159 (262)
+.|.+-=|-+++.++-+.++-. .|.--...|++|++.+...++++.-.++|-. ||++ |.-+.. ..=|...++
T Consensus 122 LAKlAa~n~VAIe~~L~plL~~-~G~~Rar~L~~~r~~l~L~rKYd~P~VISS~A~s~~~lRsPRdl~aL~~~iGme~~e 200 (216)
T PRK03892 122 LARMAAKRGVAIGFSLSPLLRA-NPYERANILRFMMKAWQLVNKYKVPRFITSSAESKWEVRGPRDLMSLGINIGMEIPQ 200 (216)
T ss_pred HHHHHHHcCeEEEEecHHHHhh-CchhHHHHHHHHHHHHHHHHHcCCCEEEecCcchhccCCCHHHHHHHHHHhCCCHHH
Confidence 3444444788999999999876 6777778899999999999999998888754 6666 544332 124677788
Q ss_pred hHHHHHHHHHHHHHH
Q 024821 160 IVKDFTAYADVCFRE 174 (262)
Q Consensus 160 ~~~~F~~ya~~v~~~ 174 (262)
..+.+..|.+.+.+|
T Consensus 201 a~~~Ls~~p~~i~~~ 215 (216)
T PRK03892 201 AKASLSFYPRIILKR 215 (216)
T ss_pred HHHHHHHhHHHHhhc
Confidence 888888888887765
No 198
>TIGR01589 A_thal_3526 uncharacterized plant-specific domain TIGR01589. This model represents an uncharacterized plant-specific domain 57 residues in length. It is found toward the N-terminus of most proteins that contain it. Examples include at least 10 proteins from Arabidopsis thaliana and at least one from Oryza sativa.
Probab=29.51 E-value=85 Score=22.09 Aligned_cols=36 Identities=19% Similarity=0.255 Sum_probs=23.9
Q ss_pred HHHHHHHHH-cCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHH
Q 024821 122 NNLINELIS-YGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYA 168 (262)
Q Consensus 122 ~~~id~l~~-~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya 168 (262)
+++++.|.+ .||+|.+|.. .|-. +.+++.+.|..|-
T Consensus 19 ~E~v~~L~~~a~I~P~~T~~-----VW~~------LekeN~eFF~aY~ 55 (57)
T TIGR01589 19 EETVSFLFENAGISPKFTRF-----VWYL------LEKENADFFRCYK 55 (57)
T ss_pred HHHHHHHHHHcCCCchhHHH-----HHHH------HHHHHHHHHHHHh
Confidence 567766664 8999998875 4532 2245667787773
No 199
>PTZ00445 p36-lilke protein; Provisional
Probab=29.33 E-value=86 Score=28.00 Aligned_cols=56 Identities=16% Similarity=0.245 Sum_probs=38.8
Q ss_pred HHHHHHcCCCcceeccccCccccCCCCCCChh---------hhhHHHHHHHHHHHcCCcceeecc
Q 024821 85 VKLMADTGLDAYRFSISWSRLIPNGRGPVNPK---------GLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 85 i~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~---------~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
++++++.|++.+=+.++=.-|---.-|-.++. +=.-...++.+|++.||...|..+
T Consensus 35 v~~L~~~GIk~Va~D~DnTlI~~HsgG~~~~~~~~~~~~~~~tpefk~~~~~l~~~~I~v~VVTf 99 (219)
T PTZ00445 35 VDLLNECGIKVIASDFDLTMITKHSGGYIDPDNDDIRVLTSVTPDFKILGKRLKNSNIKISVVTF 99 (219)
T ss_pred HHHHHHcCCeEEEecchhhhhhhhcccccCCCcchhhhhccCCHHHHHHHHHHHHCCCeEEEEEc
Confidence 68899999999988877664432101222221 234478899999999999888765
No 200
>PF02057 Glyco_hydro_59: Glycosyl hydrolase family 59; InterPro: IPR001286 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 59 GH59 from CAZY comprises enzymes with only one known activity; galactocerebrosidase (3.2.1.46 from EC). Globoid cell leukodystrophy (Krabbe disease) is a severe, autosomal recessive disorder that results from deficiency of galactocerebrosidase (GALC) activity [, , ]. GALC is responsible for the lysosomal catabolism of certain galactolipids, including galactosylceramide and psychosine [].; GO: 0004336 galactosylceramidase activity, 0006683 galactosylceramide catabolic process; PDB: 3ZR6_A 3ZR5_A.
Probab=29.30 E-value=1.1e+02 Score=31.66 Aligned_cols=65 Identities=14% Similarity=0.063 Sum_probs=36.5
Q ss_pred HHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCC----h-hhHHHHHHHHHHHHHHhCCCCceEEeccCCch
Q 024821 123 NLINELISYGIQPHVTLHHLDLPQALEDEYGGWIN----R-MIVKDFTAYADVCFREFGDRVSYWTTVNEPNG 190 (262)
Q Consensus 123 ~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~----~-~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~ 190 (262)
.++.++++++=..-+....|..|.|+. +||.. + .++....++..-+.+.+|=.++|--++||=..
T Consensus 116 ~L~~eAKkrNP~ikl~~L~W~~PgW~~---~g~~~~~~~~~~~a~Y~~~wl~ga~~~~gl~idYvg~~NEr~~ 185 (669)
T PF02057_consen 116 WLMAEAKKRNPNIKLYGLPWGFPGWVG---NGWNWPYDNPQLTAYYVVSWLLGAKKTHGLDIDYVGIWNERGF 185 (669)
T ss_dssp HHHHHHHHH-TT-EEEEEES-B-GGGG---TTSS-TTSSHHHHHHHHHHHHHHHHHHH-----EE-S-TTS--
T ss_pred hhHHHHHhhCCCCeEEEeccCCCcccc---CCCCCcccchhhhhHHHHHHHHHHHHHhCCCceEechhhccCC
Confidence 578888888866666777899999995 44433 2 23444456666777888888888889999654
No 201
>smart00052 EAL Putative diguanylate phosphodiesterase. Putative diguanylate phosphodiesterase, present in a variety of bacteria.
Probab=29.01 E-value=90 Score=26.40 Aligned_cols=62 Identities=15% Similarity=0.242 Sum_probs=43.4
Q ss_pred CcCCccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceee
Q 024821 72 DVACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVT 138 (262)
Q Consensus 72 ~~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vt 138 (262)
..+.|.+-.-..++..+.++.++.++++.++-+-... |...-...+.+++.++..||+++++
T Consensus 148 ~ialddfg~~~~~~~~l~~l~~d~iKld~~~~~~~~~-----~~~~~~~l~~l~~~~~~~~~~via~ 209 (241)
T smart00052 148 RIALDDFGTGYSSLSYLKRLPVDLLKIDKSFVRDLQT-----DPEDEAIVQSIIELAQKLGLQVVAE 209 (241)
T ss_pred EEEEeCCCCcHHHHHHHHhCCCCeEEECHHHHhhhcc-----ChhHHHHHHHHHHHHHHCCCeEEEe
Confidence 3455555555567778888888888888776554332 2233566888999999999998875
No 202
>PF04646 DUF604: Protein of unknown function, DUF604; InterPro: IPR006740 This family includes a conserved region found in several uncharacterised plant proteins.
Probab=28.96 E-value=26 Score=31.90 Aligned_cols=72 Identities=15% Similarity=0.102 Sum_probs=39.7
Q ss_pred HHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchhccccccccCC
Q 024821 127 ELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGFAMVGYDFGIA 201 (262)
Q Consensus 127 ~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~~~gy~~g~~ 201 (262)
.+.++.+.|+++||||+.=.. -+.+....+.++.+.+-++.--.++-.+-.-|---....+.+..||..-.+
T Consensus 76 ~~~a~~~~pl~SlHH~~~~~P---ifP~~~~~~al~~L~~a~~~d~~~~lqqsicyd~~~~wsvsVSwGYsVqvy 147 (255)
T PF04646_consen 76 FLEAHPLAPLVSLHHWDSVDP---IFPNMSRLQALRHLLKAAKVDPARILQQSICYDRRRNWSVSVSWGYSVQVY 147 (255)
T ss_pred eeecCCCCceeeeeehhhccc---cCCCCCHHHHHHHHHHHHhhChHhhhheeeeccCceEEEEEEEccEEEEEE
Confidence 345566899999999985222 245556666777777755544333332211122222334445567765444
No 203
>cd01335 Radical_SAM Radical SAM superfamily. Enzymes of this family generate radicals by combining a 4Fe-4S cluster and S-adenosylmethionine (SAM) in close proximity. They are characterized by a conserved CxxxCxxC motif, which coordinates the conserved iron-sulfur cluster. Mechanistically, they share the transfer of a single electron from the iron-sulfur cluster to SAM, which leads to its reductive cleavage to methionine and a 5'-deoxyadenosyl radical, which, in turn, abstracts a hydrogen from the appropriately positioned carbon atom. Depending on the enzyme, SAM is consumed during this process or it is restored and reused. Radical SAM enzymes catalyze steps in metabolism, DNA repair, the biosynthesis of vitamins and coenzymes, and the biosynthesis of many antibiotics. Examples are biotin synthase (BioB), lipoyl synthase (LipA), pyruvate formate-lyase (PFL), coproporphyrinogen oxidase (HemN), lysine 2,3-aminomutase (LAM), anaerobic ribonucleotide reductase (ARR), and MoaA, an enzyme o
Probab=28.87 E-value=71 Score=25.54 Aligned_cols=59 Identities=19% Similarity=0.175 Sum_probs=39.8
Q ss_pred cHHHHHHHHHcCCCcceeccccC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 81 YKEDVKLMADTGLDAYRFSISWS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
-+++++.+++.|+..+.+|++-. .-.-.... -....++.+-+.|+.+.+.|+...+++.
T Consensus 87 ~~~~~~~l~~~g~~~i~i~le~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~i 146 (204)
T cd01335 87 TEELLKELKELGLDGVGVSLDSGDEEVADKIR-GSGESFKERLEALKELREAGLGLSTTLL 146 (204)
T ss_pred CHHHHHHHHhCCCceEEEEcccCCHHHHHHHh-cCCcCHHHHHHHHHHHHHcCCCceEEEE
Confidence 47899999999999999998543 22111000 0112366778888889998888776653
No 204
>PRK09057 coproporphyrinogen III oxidase; Provisional
Probab=28.22 E-value=1.5e+02 Score=28.10 Aligned_cols=96 Identities=9% Similarity=0.017 Sum_probs=57.5
Q ss_pred cHHHHHHHHHcCCCcceecc-ccC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCCh
Q 024821 81 YKEDVKLMADTGLDAYRFSI-SWS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINR 158 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si-~Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~ 158 (262)
-.+.++.|+++|+|.+.+++ |-+ ++... -|... ..+-..+.++.+++.++.+-++|. +++|.
T Consensus 103 ~~e~L~~l~~~GvnrislGvQS~~d~vL~~-l~R~~--~~~~~~~ai~~~~~~~~~v~~dli-~GlPg------------ 166 (380)
T PRK09057 103 EAGRFRGYRAAGVNRVSLGVQALNDADLRF-LGRLH--SVAEALAAIDLAREIFPRVSFDLI-YARPG------------ 166 (380)
T ss_pred CHHHHHHHHHcCCCEEEEecccCCHHHHHH-cCCCC--CHHHHHHHHHHHHHhCccEEEEee-cCCCC------------
Confidence 34999999999999666665 332 23322 22221 234456778888888888888875 45662
Q ss_pred hhHHHHHHHHHHHHHHhCCCCceEEeccCCchhc
Q 024821 159 MIVKDFTAYADVCFREFGDRVSYWTTVNEPNGFA 192 (262)
Q Consensus 159 ~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~ 192 (262)
++.+.+.+=.+.+.+.=-+.|..+...-||+...
T Consensus 167 qt~~~~~~~l~~~~~l~p~~is~y~L~~~~gT~l 200 (380)
T PRK09057 167 QTLAAWRAELKEALSLAADHLSLYQLTIEEGTAF 200 (380)
T ss_pred CCHHHHHHHHHHHHhcCCCeEEeecceecCCChH
Confidence 1233444434444433224577777777777543
No 205
>PF04028 DUF374: Domain of unknown function (DUF374); InterPro: IPR007172 This is a bacterial domain of unknown function.
Probab=27.87 E-value=1.3e+02 Score=22.08 Aligned_cols=40 Identities=18% Similarity=0.264 Sum_probs=31.8
Q ss_pred HHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 86 KLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 86 ~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
+.++.+|+.++|-| . +..+.+-+.++++.|+ .|..+.+|.
T Consensus 27 ~~~~~~G~~~iRGS--------s-----~rgg~~Alr~~~~~lk-~G~~~~itp 66 (74)
T PF04028_consen 27 RVLERFGFRTIRGS--------S-----SRGGARALREMLRALK-EGYSIAITP 66 (74)
T ss_pred HHHHHcCCCeEEeC--------C-----CCcHHHHHHHHHHHHH-CCCeEEEeC
Confidence 67889999999999 2 3445677899999999 777777765
No 206
>cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.
Probab=27.82 E-value=4.4e+02 Score=24.36 Aligned_cols=34 Identities=12% Similarity=0.062 Sum_probs=23.4
Q ss_pred CCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCc
Q 024821 155 WINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 155 w~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
+.||+..+.|.+-.+.....+| ---.|.=+||+.
T Consensus 135 ftnp~a~~ww~~~~~~~~~~~G-vdg~w~D~~E~~ 168 (317)
T cd06594 135 LTNPAARDWFKQVIKEMLLDLG-LSGWMADFGEYL 168 (317)
T ss_pred cCCHHHHHHHHHHHHHHhhhcC-CcEEEecCCCCC
Confidence 5688898988777776654443 223578899964
No 207
>TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative. This family is conserved at better than 40 % identity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.
Probab=27.74 E-value=4.3e+02 Score=23.37 Aligned_cols=54 Identities=19% Similarity=0.184 Sum_probs=37.4
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcce
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPH 136 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~ 136 (262)
..++|-+++++++|++.+=+++.-....|. .-..+ -.....+-+.+.++||++.
T Consensus 16 ~~~~e~l~~~~~~G~~~VEl~~~~~~~~~~-~~~~~---~~~~~~~~~~l~~~gl~i~ 69 (279)
T TIGR00542 16 ECWLERLQLAKTCGFDFVEMSVDETDDRLS-RLDWS---REQRLALVNAIIETGVRIP 69 (279)
T ss_pred CCHHHHHHHHHHcCCCEEEEecCCccchhh-ccCCC---HHHHHHHHHHHHHcCCCce
Confidence 348899999999999999887654322222 11222 2446778888999999875
No 208
>TIGR00587 nfo apurinic endonuclease (APN1). All proteins in this family for which functions are known are 5' AP endonculeases that are used in base excision repair and the repair of abasic sites in DNA.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=27.51 E-value=1.7e+02 Score=26.29 Aligned_cols=50 Identities=12% Similarity=0.088 Sum_probs=34.5
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcc
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQP 135 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p 135 (262)
..+-++.++++|.+++-+-+..+|.... ..+++... +.+-+.+.++++..
T Consensus 13 ~~~a~~~~~~~G~~~~qif~~~P~~w~~--~~~~~~~~---~~~~~~~~~~~~~~ 62 (274)
T TIGR00587 13 LQAAYNRAAEIGATAFMFFLKSPRWWRR--PMLEEEVI---DWFKAALETNKNLS 62 (274)
T ss_pred HHHHHHHHHHhCCCEEEEEecCccccCC--CCCCHHHH---HHHHHHHHHcCCCC
Confidence 4567899999999999998888887644 33443333 33334477787763
No 209
>TIGR02102 pullulan_Gpos pullulanase, extracellular, Gram-positive. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. In contrast, a glycogen debranching enzyme such GlgX, homologous to this family, can release glucose at alpha,1-6 linkages from glycogen first subjected to limit degradation by phosphorylase. Characterized members of this family include a surface-located pullulanase from Streptococcus pneumoniae (PubMed:11083842) and an extracellular bifunctional amylase/pullulanase with C-terminal pullulanase activity (PubMed:8798645).
Probab=27.16 E-value=2.7e+02 Score=30.86 Aligned_cols=100 Identities=19% Similarity=0.291 Sum_probs=58.8
Q ss_pred cccccHHHHHHHHHcCCCcceeccccC--cc-----------------------------ccCCC-CC-C-Ch-hhhhHH
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISWS--RL-----------------------------IPNGR-GP-V-NP-KGLQYY 121 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~Ws--RI-----------------------------~P~g~-g~-~-n~-~~l~~y 121 (262)
.|.-..+.|+.+|+||++++-+.=-.+ .+ .|++. +. . +. ..++=+
T Consensus 478 tf~gl~ekLdYLkeLGVT~I~LmPv~d~~~~~e~~~~~~~~~~~~~~~~ynWGYdp~~yfape~~Ygtdp~dp~~ri~Ef 557 (1111)
T TIGR02102 478 TFAAFVEKLDYLQDLGVTHIQLLPVLSYFFVNEFKNKERMLDYASSNTNYNWGYDPQNYFALSGMYSEDPKDPELRIAEF 557 (1111)
T ss_pred CHHHHHHhHHHHHHcCCCEEEEcCccccccccccccccccccccccccccccCCCcCcCcccccccccCCcCccccHHHH
Confidence 355566779999999999987653221 00 01110 00 0 11 124668
Q ss_pred HHHHHHHHHcCCcceeecc--CCC--------CcHhHHH---------hhCC----CCChhhHHHHHHHHHHHHHHhC
Q 024821 122 NNLINELISYGIQPHVTLH--HLD--------LPQALED---------EYGG----WINRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 122 ~~~id~l~~~GI~p~vtL~--H~~--------~P~~l~~---------~~gG----w~~~~~~~~F~~ya~~v~~~~g 176 (262)
++||++|.++||++|+++. |.. .|.|... .++| -.++.+.+.+.+-++.-+++|+
T Consensus 558 K~LV~alH~~GI~VILDVVyNHt~~~~~f~~~~p~Yy~~~~~~G~~~~~~~g~~l~~e~~~vrk~iiDsl~yWv~ey~ 635 (1111)
T TIGR02102 558 KNLINEIHKRGMGVILDVVYNHTAKVYIFEDLEPNYYHFMDADGTPRTSFGGGRLGTTHEMSRRILVDSIKYLVDEFK 635 (1111)
T ss_pred HHHHHHHHHCCCEEEEecccccccccccccccCCCceEeeCCCCCcccccCCCCCCcCCHHHHHHHHHHHHHHHHhcC
Confidence 9999999999999999853 421 1322110 0111 1345666777777777788776
No 210
>PRK09997 hydroxypyruvate isomerase; Provisional
Probab=26.99 E-value=1.4e+02 Score=26.26 Aligned_cols=62 Identities=15% Similarity=0.235 Sum_probs=37.1
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCCC--CCCChhhhhHHHHHHHHHHHcCCcceee-ccCCCCc
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNGR--GPVNPKGLQYYNNLINELISYGIQPHVT-LHHLDLP 145 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g~--g~~n~~~l~~y~~~id~l~~~GI~p~vt-L~H~~~P 145 (262)
.++=|++++++|.+.+++...+ . |.+. .+......+..+++.+.+.++|+...+= +.|++.|
T Consensus 87 ~~~~i~~a~~lga~~i~~~~g~--~-~~~~~~~~~~~~~~~~l~~l~~~a~~~Gv~l~lE~~n~~~~~ 151 (258)
T PRK09997 87 VAAAIRYARALGNKKINCLVGK--T-PAGFSSEQIHATLVENLRYAANMLMKEDILLLIEPINHFDIP 151 (258)
T ss_pred HHHHHHHHHHhCCCEEEECCCC--C-CCCCCHHHHHHHHHHHHHHHHHHHHHcCCEEEEEeCCCcCCC
Confidence 5666889999999998864332 1 2211 1111223455677777788999886663 3454443
No 211
>PF02065 Melibiase: Melibiase; InterPro: IPR000111 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycosyl hydrolase family 27, family 31 and family 36 alpha-galactosidases form the glycosyl hydrolase clan GH-D (acc_GH from CAZY), a superfamily of alpha-galactosidases, alpha-N-acetylgalactosaminidases, and isomaltodextranases which are likely to share a common catalytic mechanism and structural topology. Alpha-galactosidase (3.2.1.22 from EC) (melibiase) [] catalyzes the hydrolysis of melibiose into galactose and glucose. In man, the deficiency of this enzyme is the cause of Fabry's disease (X-linked sphingolipidosis). Alpha-galactosidase is present in a variety of organisms. There is a considerable degree of similarity in the sequence of alpha-galactosidase from various eukaryotic species. Escherichia coli alpha-galactosidase (gene melA), which requires NAD and magnesium as cofactors, is not structurally related to the eukaryotic enzymes; by contrast, an Escherichia coli plasmid encoded alpha-galactosidase (gene rafA P16551 from SWISSPROT) [] contains a region of about 50 amino acids which is similar to a domain of the eukaryotic alpha-galactosidases. Alpha-N-acetylgalactosaminidase (3.2.1.49 from EC) [] catalyzes the hydrolysis of terminal non-reducing N-acetyl-D-galactosamine residues in N-acetyl-alpha-D- galactosaminides. In man, the deficiency of this enzyme is the cause of Schindler and Kanzaki diseases. The sequence of this enzyme is highly related to that of the eukaryotic alpha-galactosidases.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1KTC_A 1KTB_A 1UAS_A 3H55_A 3H53_A 3IGU_B 3H54_A 3LRM_A 3LRL_A 3LRK_A ....
Probab=26.60 E-value=2.4e+02 Score=27.27 Aligned_cols=140 Identities=16% Similarity=0.171 Sum_probs=81.0
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccC---CCC--CCChhhhh-HHHHHHHHHHHcCCcceeecc----------CCCC
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPN---GRG--PVNPKGLQ-YYNNLINELISYGIQPHVTLH----------HLDL 144 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~---g~g--~~n~~~l~-~y~~~id~l~~~GI~p~vtL~----------H~~~ 144 (262)
..+.++.++++|++.|=+.=-|..---+ +-| .+|++-+- =...+++.+++.|+++=+=+- .-..
T Consensus 60 i~~~a~~~~~~G~e~fviDDGW~~~r~~d~~~~GdW~~~~~kFP~Gl~~l~~~i~~~Gmk~GlW~ePe~v~~~S~l~~~h 139 (394)
T PF02065_consen 60 ILELADAAAELGYEYFVIDDGWFGGRDDDNAGLGDWEPDPKKFPNGLKPLADYIHSLGMKFGLWFEPEMVSPDSDLYREH 139 (394)
T ss_dssp HHHHHHHHHHHT-SEEEE-SSSBCTESTTTSTTSBECBBTTTSTTHHHHHHHHHHHTT-EEEEEEETTEEESSSCHCCSS
T ss_pred HHHHHHHHHHhCCEEEEEcCccccccCCCcccCCceeEChhhhCCcHHHHHHHHHHCCCeEEEEeccccccchhHHHHhC
Confidence 4556788899999998888899654211 012 23433221 278999999999998654220 1135
Q ss_pred cHhHHHhh-----CC-------CCChhhHHHHHHHHHHHHHHhC-CCCceEEeccCCchhccccccccCCCCCCCCCCCC
Q 024821 145 PQALEDEY-----GG-------WINRMIVKDFTAYADVCFREFG-DRVSYWTTVNEPNGFAMVGYDFGIAPPKRCSPPLN 211 (262)
Q Consensus 145 P~~l~~~~-----gG-------w~~~~~~~~F~~ya~~v~~~~g-d~V~~w~t~NEP~~~~~~gy~~g~~~p~~~~~~~~ 211 (262)
|.|+...- .| ..+|++.+...+-...+++.+| |.+| .=+|..-. .++...
T Consensus 140 Pdw~l~~~~~~~~~~r~~~vLD~~~pev~~~l~~~i~~ll~~~gidYiK--~D~n~~~~-----------~~~~~~---- 202 (394)
T PF02065_consen 140 PDWVLRDPGRPPTLGRNQYVLDLSNPEVRDYLFEVIDRLLREWGIDYIK--WDFNRDIT-----------EAGSPS---- 202 (394)
T ss_dssp BGGBTCCTTSE-ECBTTBEEB-TTSHHHHHHHHHHHHHHHHHTT-SEEE--EE-TS-TT-----------S-SSTT----
T ss_pred ccceeecCCCCCcCcccceEEcCCCHHHHHHHHHHHHHHHHhcCCCEEE--eccccCCC-----------CCCCCC----
Confidence 87864310 11 2478888888888888888887 4444 23444211 011000
Q ss_pred CCCCCCCCchhHHHHHHHHHHHHHHHHHHHHhcCCC
Q 024821 212 NCSRGNSSTEPYMAVHHLLLAHASVARLYKKNYQVI 247 (262)
Q Consensus 212 ~~~~~~~~~~~~~~~hn~llAHa~a~~~~~~~~~~~ 247 (262)
.. ...|...++--+..+.+++.+|+.
T Consensus 203 --------~~--~~~~~~~~~~y~l~~~L~~~~P~v 228 (394)
T PF02065_consen 203 --------LP--EGYHRYVLGLYRLLDRLRARFPDV 228 (394)
T ss_dssp --------S---GHHHHHHHHHHHHHHHHHHHTTTS
T ss_pred --------ch--HHHHHHHHHHHHHHHHHHHhCCCc
Confidence 00 455666666666788888888874
No 212
>PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=25.98 E-value=81 Score=27.89 Aligned_cols=59 Identities=8% Similarity=0.154 Sum_probs=37.1
Q ss_pred cccHHHHHHHHHcCCCcceeccccCcc-ccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRL-IPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI-~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.+.++=|+..++||++.+|+. ..... .|. ....-...++..+++.+.+.++||+..+=.
T Consensus 94 ~~~~~~i~~a~~lG~~~v~~~-~~~~~~~~~-~~~~~~~~~~~l~~l~~~a~~~gv~l~lE~ 153 (284)
T PRK13210 94 EIMKKAIRLAQDLGIRTIQLA-GYDVYYEEK-SEETRQRFIEGLAWAVEQAAAAQVMLAVEI 153 (284)
T ss_pred HHHHHHHHHHHHhCCCEEEEC-Ccccccccc-cHHHHHHHHHHHHHHHHHHHHhCCEEEEEe
Confidence 335566788999999999974 11111 111 111112345678888888999999776643
No 213
>PRK09389 (R)-citramalate synthase; Provisional
Probab=25.95 E-value=80 Score=31.37 Aligned_cols=107 Identities=10% Similarity=-0.042 Sum_probs=68.5
Q ss_pred cHHHHHHHHHcCCCcceeccccCccccCC-CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC--CcHhHHH------h
Q 024821 81 YKEDVKLMADTGLDAYRFSISWSRLIPNG-RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD--LPQALED------E 151 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~WsRI~P~g-~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~--~P~~l~~------~ 151 (262)
-++|++.+.+.|++.+++.++-|.+.=.. -+.--++.++...+.++.+++.|+++.+++-... -|..+.+ +
T Consensus 75 ~~~di~~a~~~g~~~v~i~~~~Sd~h~~~~l~~s~~e~l~~~~~~v~~ak~~g~~v~~~~ed~~r~~~~~l~~~~~~~~~ 154 (488)
T PRK09389 75 VKVDIDAALECDVDSVHLVVPTSDLHIEYKLKKTREEVLETAVEAVEYAKDHGLIVELSGEDASRADLDFLKELYKAGIE 154 (488)
T ss_pred CHHHHHHHHhCCcCEEEEEEccCHHHHHHHhCCCHHHHHHHHHHHHHHHHHCCCEEEEEEeeCCCCCHHHHHHHHHHHHh
Confidence 48999999999999999999877664321 1222356788899999999999999888775322 1333221 1
Q ss_pred h----------CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 Y----------GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ~----------gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
. -|...| ..+.++.+.+.++++ ..--.-.-|-..+.
T Consensus 155 ~Ga~~i~l~DTvG~~~P---~~~~~lv~~l~~~~~-v~l~~H~HND~GlA 200 (488)
T PRK09389 155 AGADRICFCDTVGILTP---EKTYELFKRLSELVK-GPVSIHCHNDFGLA 200 (488)
T ss_pred CCCCEEEEecCCCCcCH---HHHHHHHHHHHhhcC-CeEEEEecCCccHH
Confidence 1 233444 555566666666654 11234567776654
No 214
>COG5520 O-Glycosyl hydrolase [Cell envelope biogenesis, outer membrane]
Probab=25.82 E-value=1.8e+02 Score=28.21 Aligned_cols=94 Identities=18% Similarity=0.312 Sum_probs=52.0
Q ss_pred HcCCCcceeccccCccccCCCCCCChhhhhHHHHHH--HHHHHcCCcceeeccCCCCcHhHHHh---hCC---CCChhhH
Q 024821 90 DTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLI--NELISYGIQPHVTLHHLDLPQALEDE---YGG---WINRMIV 161 (262)
Q Consensus 90 ~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~i--d~l~~~GI~p~vtL~H~~~P~~l~~~---~gG---w~~~~~~ 161 (262)
++|++..|+.|.=.+.--. |..|. +|+++= ...+..|+++|.+- |..|.|.-.. -|| -+.++--
T Consensus 77 ~lg~si~Rv~I~~ndfsl~--g~~d~----w~kels~Ak~~in~g~ivfASP--WspPa~Mktt~~~ngg~~g~Lk~e~Y 148 (433)
T COG5520 77 QLGFSILRVPIDSNDFSLG--GSADN----WYKELSTAKSAINPGMIVFASP--WSPPASMKTTNNRNGGNAGRLKYEKY 148 (433)
T ss_pred ccCceEEEEEecccccccC--CCcch----hhhhcccchhhcCCCcEEEecC--CCCchhhhhccCcCCccccccchhHh
Confidence 4788888887765554322 44442 233322 22556777777665 6788887543 133 3444333
Q ss_pred HHHHHHHHHHHHHhCC---CCceEEeccCCchh
Q 024821 162 KDFTAYADVCFREFGD---RVSYWTTVNEPNGF 191 (262)
Q Consensus 162 ~~F~~ya~~v~~~~gd---~V~~w~t~NEP~~~ 191 (262)
..|++|-...+..+++ -+.+-.+=|||...
T Consensus 149 a~yA~~l~~fv~~m~~nGvnlyalSVQNEPd~~ 181 (433)
T COG5520 149 ADYADYLNDFVLEMKNNGVNLYALSVQNEPDYA 181 (433)
T ss_pred HHHHHHHHHHHHHHHhCCCceeEEeeccCCccc
Confidence 3333333333444444 45666689999864
No 215
>PTZ00445 p36-lilke protein; Provisional
Probab=25.43 E-value=1.2e+02 Score=27.21 Aligned_cols=67 Identities=16% Similarity=0.217 Sum_probs=39.6
Q ss_pred hHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChh---------hHHHHHHHHHHHHHHhCCCCceEEeccCCc
Q 024821 119 QYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRM---------IVKDFTAYADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 119 ~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~---------~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
+.-+.+++.|++.||+.+++= +|.-.-- .--|||.++. ....|....+.+-+ .|=+| .-+||-...
T Consensus 29 ~~~~~~v~~L~~~GIk~Va~D--~DnTlI~-~HsgG~~~~~~~~~~~~~~~tpefk~~~~~l~~-~~I~v-~VVTfSd~~ 103 (219)
T PTZ00445 29 ESADKFVDLLNECGIKVIASD--FDLTMIT-KHSGGYIDPDNDDIRVLTSVTPDFKILGKRLKN-SNIKI-SVVTFSDKE 103 (219)
T ss_pred HHHHHHHHHHHHcCCeEEEec--chhhhhh-hhcccccCCCcchhhhhccCCHHHHHHHHHHHH-CCCeE-EEEEccchh
Confidence 446888999999999998752 2322111 1138999987 45667776655443 22122 244665553
Q ss_pred h
Q 024821 190 G 190 (262)
Q Consensus 190 ~ 190 (262)
.
T Consensus 104 ~ 104 (219)
T PTZ00445 104 L 104 (219)
T ss_pred h
Confidence 3
No 216
>PRK06582 coproporphyrinogen III oxidase; Provisional
Probab=25.41 E-value=2.8e+02 Score=26.53 Aligned_cols=94 Identities=12% Similarity=0.056 Sum_probs=56.8
Q ss_pred HHHHHHHHHcCCCcceecc-ccC-ccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChh
Q 024821 82 KEDVKLMADTGLDAYRFSI-SWS-RLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRM 159 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si-~Ws-RI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~ 159 (262)
++.++.|+++|+|.+-+++ |-. ++... -|... ..+-..+.++.+++.++.+-++|. +++|. +
T Consensus 111 ~e~l~~l~~~GvnRiSiGvQS~~d~~L~~-lgR~h--~~~~~~~ai~~~~~~~~~v~~DlI-~GlPg------------q 174 (390)
T PRK06582 111 TEKFKAFKLAGINRVSIGVQSLKEDDLKK-LGRTH--DCMQAIKTIEAANTIFPRVSFDLI-YARSG------------Q 174 (390)
T ss_pred HHHHHHHHHCCCCEEEEECCcCCHHHHHH-cCCCC--CHHHHHHHHHHHHHhCCcEEEEee-cCCCC------------C
Confidence 5899999999998666665 332 23322 23322 134456667778887777888876 56662 2
Q ss_pred hHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 160 IVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 160 ~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
+.+.+.+=++.+.+-=-+.|..+...=||...
T Consensus 175 t~e~~~~~l~~~~~l~p~his~y~L~i~~gT~ 206 (390)
T PRK06582 175 TLKDWQEELKQAMQLATSHISLYQLTIEKGTP 206 (390)
T ss_pred CHHHHHHHHHHHHhcCCCEEEEecCEEccCCh
Confidence 34455555555554333566666666666644
No 217
>PLN02925 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Probab=25.39 E-value=2.2e+02 Score=29.90 Aligned_cols=52 Identities=21% Similarity=0.394 Sum_probs=40.0
Q ss_pred HHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHH
Q 024821 120 YYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCF 172 (262)
Q Consensus 120 ~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~ 172 (262)
.+..+|+.|+++|+..=+...|-.++.-+..+||. ...-.++--.+|++.|-
T Consensus 211 ~f~~~v~~ak~~~~~iRIGvN~GSLs~ri~~~yGd-tp~gmVeSAle~~~i~e 262 (733)
T PLN02925 211 VFTPLVEKCKKYGRAMRIGTNHGSLSDRIMSYYGD-SPRGMVESAFEFARICR 262 (733)
T ss_pred HHHHHHHHHHHCCCCEEEecCCcCchHHHHHHhCC-ChHHHHHHHHHHHHHHH
Confidence 34459999999999999999999999998888874 33335666666666553
No 218
>PF13200 DUF4015: Putative glycosyl hydrolase domain
Probab=25.16 E-value=2.3e+02 Score=26.69 Aligned_cols=95 Identities=15% Similarity=0.176 Sum_probs=58.7
Q ss_pred cccHHHHHHHHHcCCCcceecc--ccCcccc-CC------CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC------
Q 024821 79 HKYKEDVKLMADTGLDAYRFSI--SWSRLIP-NG------RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD------ 143 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si--~WsRI~P-~g------~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~------ 143 (262)
.+.++=++++++.|+|++=+.+ ++-.|.= .. .|. ....+.-.+++++.|+++||.|+.=+.-|-
T Consensus 13 ~~~~~~~~~i~~t~lNavVIDvKdd~G~i~y~s~~~~~~~~ga-~~~~i~D~~~l~~~l~e~gIY~IARIv~FkD~~la~ 91 (316)
T PF13200_consen 13 ERLDKLLDLIKRTELNAVVIDVKDDDGNITYDSQVPLAREIGA-VKPYIKDLKALVKKLKEHGIYPIARIVVFKDPVLAE 91 (316)
T ss_pred HHHHHHHHHHHhcCCceEEEEEecCCceEEecCCCchhhhccc-ccccccCHHHHHHHHHHCCCEEEEEEEEecChHHhh
Confidence 4577889999999999999988 4544421 10 011 111235578999999999999987655332
Q ss_pred -CcHhHHHhhC----------CCCCh---hhHHHHHHHHHHHHHH
Q 024821 144 -LPQALEDEYG----------GWINR---MIVKDFTAYADVCFRE 174 (262)
Q Consensus 144 -~P~~l~~~~g----------Gw~~~---~~~~~F~~ya~~v~~~ 174 (262)
.|.|...+-+ .|.|| ++.+.-.+-|+.+++.
T Consensus 92 ~~pe~av~~~~G~~w~d~~~~~WvnP~~~evw~Y~i~IA~Eaa~~ 136 (316)
T PF13200_consen 92 AHPEWAVKTKDGSVWRDNEGEAWVNPYSKEVWDYNIDIAKEAAKL 136 (316)
T ss_pred hChhhEEECCCCCcccCCCCCccCCCCCHHHHHHHHHHHHHHHHc
Confidence 2434321111 27654 5666666666666543
No 219
>cd00927 Cyt_c_Oxidase_VIc Cytochrome c oxidase subunit VIc. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. The VIc subunit is found only in eukaryotes and its specific function remains unclear. It has been reported that the relative concentrations of some nuclear encoded CcO subunits, including subunit VIc, compared to those of the mitochondrial encoded subunits, are altered significantly during the progression of prostate cancer.
Probab=25.16 E-value=31 Score=25.29 Aligned_cols=19 Identities=37% Similarity=0.676 Sum_probs=15.7
Q ss_pred CccccccH--HHHHHHHHcCC
Q 024821 75 CDEYHKYK--EDVKLMADTGL 93 (262)
Q Consensus 75 ~d~y~~~~--eDi~l~k~lG~ 93 (262)
.|||..|+ +|.+.|++.|+
T Consensus 46 adFYknYD~~kdFerM~~~G~ 66 (70)
T cd00927 46 ADFYKTYDAMKDFERMRKAGL 66 (70)
T ss_pred HHHHHccChHHHHHHHHHcCC
Confidence 57888775 78999999996
No 220
>PRK09967 putative outer membrane lipoprotein; Provisional
Probab=25.01 E-value=2.5e+02 Score=23.47 Aligned_cols=64 Identities=20% Similarity=0.216 Sum_probs=46.1
Q ss_pred ccHHHHHHHHHcCCC----cceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC
Q 024821 80 KYKEDVKLMADTGLD----AYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD 143 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~----~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~ 143 (262)
.+++.++.++..|+. .+.+++.=+-.+..+..++++++..-.+++.+.|++.+...+...-|-|
T Consensus 26 ~~~~~~~~~~~~g~~~~~~~~~i~l~~~v~F~~~sa~L~~~~~~~L~~ia~~l~~~~~~~v~I~GhTD 93 (160)
T PRK09967 26 FTPEQVAAMQSYGFTESAGDWSLGLSDAILFAKNDYKLLPESQQQIQTMAAKLASTGLTHARMDGHTD 93 (160)
T ss_pred CCHHHHHHHHhcCceecCCceEEEcCCceeeCCCccccCHHHHHHHHHHHHHHHhCCCceEEEEEEcC
Confidence 367889999999986 3445554334455556788999999999999999998765444444544
No 221
>TIGR00674 dapA dihydrodipicolinate synthase. Dihydrodipicolinate synthase is a homotetrameric enzyme of lysine biosynthesis. E. coli has several paralogs closely related to dihydrodipicoline synthase (DapA), as well as the more distant N-acetylneuraminate lyase. In Pyrococcus horikoshii, the bidirectional best hit with E. coli is to an uncharacterized paralog of DapA, not DapA itself, and it is omitted from the seed. The putative members from the Chlamydias (pathogens with a parasitic metabolism) are easily the most divergent members of the multiple alignment.
Probab=24.81 E-value=3.7e+02 Score=24.17 Aligned_cols=67 Identities=15% Similarity=0.155 Sum_probs=47.4
Q ss_pred CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCc
Q 024821 110 RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 110 ~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
+|++|.++ +++.++.+.+.|+..++.+-+- |-+ ..-+.+...+..+.+.+.-++++....-....+
T Consensus 13 ~g~iD~~~---~~~~i~~l~~~Gv~Gi~~~Gst----------GE~-~~Ls~~Er~~~~~~~~~~~~~~~~vi~gv~~~s 78 (285)
T TIGR00674 13 DGSVDFAA---LEKLIDFQIENGTDAIVVVGTT----------GES-PTLSHEEHKKVIEFVVDLVNGRVPVIAGTGSNA 78 (285)
T ss_pred CCCcCHHH---HHHHHHHHHHcCCCEEEECccC----------ccc-ccCCHHHHHHHHHHHHHHhCCCCeEEEeCCCcc
Confidence 58999554 7899999999999999877531 111 112346667777888888888888777666555
Q ss_pred h
Q 024821 190 G 190 (262)
Q Consensus 190 ~ 190 (262)
.
T Consensus 79 ~ 79 (285)
T TIGR00674 79 T 79 (285)
T ss_pred H
Confidence 4
No 222
>cd01948 EAL EAL domain. This domain is found in diverse bacterial signaling proteins. It is called EAL after its conserved residues and is also known as domain of unknown function 2 (DUF2). The EAL domain has been shown to stimulate degradation of a second messenger, cyclic di-GMP, and is a good candidate for a diguanylate phosphodiesterase function. Together with the GGDEF domain, EAL might be involved in regulating cell surface adhesiveness in bacteria.
Probab=24.55 E-value=1e+02 Score=26.11 Aligned_cols=63 Identities=16% Similarity=0.204 Sum_probs=45.5
Q ss_pred CcCCccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 72 DVACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 72 ~~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
..+.|.+-.-..+++.+.++.++.++++-++.+=... |...-...+.++..++..|++++++=
T Consensus 147 ~l~ld~~g~~~~~~~~l~~~~~d~iKld~~~~~~~~~-----~~~~~~~l~~l~~~~~~~~~~via~g 209 (240)
T cd01948 147 RIALDDFGTGYSSLSYLKRLPVDYLKIDRSFVRDIET-----DPEDRAIVRAIIALAHSLGLKVVAEG 209 (240)
T ss_pred eEEEeCCCCcHhhHHHHHhCCCCEEEECHHHHHhHhc-----ChhhHHHHHHHHHHHHHCCCeEEEEe
Confidence 4566777777788888888888888888766443322 23345678888899999999888753
No 223
>PF04007 DUF354: Protein of unknown function (DUF354); InterPro: IPR007152 Members of this family are around 350 amino acids in length. They are found in archaea and some bacteria and have no known function.
Probab=24.48 E-value=88 Score=29.52 Aligned_cols=39 Identities=28% Similarity=0.474 Sum_probs=31.6
Q ss_pred ChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhh
Q 024821 114 NPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEY 152 (262)
Q Consensus 114 n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~ 152 (262)
|+.-+.+++.+|.+|.++|.++++|.-.++.=.-|-+.|
T Consensus 9 ~p~hvhfFk~~I~eL~~~GheV~it~R~~~~~~~LL~~y 47 (335)
T PF04007_consen 9 HPAHVHFFKNIIRELEKRGHEVLITARDKDETEELLDLY 47 (335)
T ss_pred CchHHHHHHHHHHHHHhCCCEEEEEEeccchHHHHHHHc
Confidence 566789999999999999999999998777644454443
No 224
>TIGR00676 fadh2 5,10-methylenetetrahydrofolate reductase, prokaryotic form. This protein is an FAD-containing flavoprotein.
Probab=24.14 E-value=2.8e+02 Score=24.99 Aligned_cols=74 Identities=14% Similarity=0.128 Sum_probs=46.9
Q ss_pred hhHHHHHHHHHHHcCCcc-ee-e---------------ccCCCCcHhHHHhhCCCCCh--hhHHHHHHHHHHHHHHhCCC
Q 024821 118 LQYYNNLINELISYGIQP-HV-T---------------LHHLDLPQALEDEYGGWINR--MIVKDFTAYADVCFREFGDR 178 (262)
Q Consensus 118 l~~y~~~id~l~~~GI~p-~v-t---------------L~H~~~P~~l~~~~gGw~~~--~~~~~F~~ya~~v~~~~gd~ 178 (262)
.+.+.++++.|++.||++ ++ + +..-++|.|+.++.....+. +..+.-.+++...++++.+.
T Consensus 169 ~~~~~~~~~~~~~~gi~~PIi~Gi~p~~s~k~~~~~~~~~Gv~vP~~~~~~l~~~~~~~~~~~~~gi~~~~~~~~~l~~~ 248 (272)
T TIGR00676 169 NDDYYRFVDRCRAAGIDVPIIPGIMPITNFKQLLRFAERCGAEIPAWLVKRLEKYDDDPEEVRAVGIEYATDQCEDLIAE 248 (272)
T ss_pred HHHHHHHHHHHHHcCCCCCEecccCCcCCHHHHHHHHhccCCCCCHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHC
Confidence 456888999999997662 21 1 22557788887765443332 23456677777777776543
Q ss_pred -Cc--eEEeccCCchh
Q 024821 179 -VS--YWTTVNEPNGF 191 (262)
Q Consensus 179 -V~--~w~t~NEP~~~ 191 (262)
++ |-.|+|=+...
T Consensus 249 g~~GiHl~t~n~~~~~ 264 (272)
T TIGR00676 249 GVPGIHFYTLNRADAT 264 (272)
T ss_pred CCCEEEEcCCCCHHHH
Confidence 44 66678877654
No 225
>PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=24.04 E-value=2.8e+02 Score=24.36 Aligned_cols=54 Identities=15% Similarity=0.161 Sum_probs=36.3
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCccee
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHV 137 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~v 137 (262)
.+++-++.++++|++.+=+++.=.+..+. ...++. ...+.+-+.+.++||+...
T Consensus 17 ~~~e~~~~~~~~G~~~iEl~~~~~~~~~~-~~~~~~---~~~~~l~~~l~~~Gl~i~~ 70 (284)
T PRK13210 17 SWEERLVFAKELGFDFVEMSVDESDERLA-RLDWSK---EERLSLVKAIYETGVRIPS 70 (284)
T ss_pred CHHHHHHHHHHcCCCeEEEecCCcccccc-cccCCH---HHHHHHHHHHHHcCCCceE
Confidence 47889999999999999887541111111 123332 3467788899999998753
No 226
>PRK08508 biotin synthase; Provisional
Probab=23.80 E-value=1.4e+02 Score=26.99 Aligned_cols=56 Identities=11% Similarity=0.069 Sum_probs=38.8
Q ss_pred cHHHHHHHHHcCCCcceecccc-CccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeec
Q 024821 81 YKEDVKLMADTGLDAYRFSISW-SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~W-sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL 139 (262)
.+|.++.|++.|++++-..++= ++++|.-... ..++..-+.++.+++.||++--++
T Consensus 101 ~~e~l~~Lk~aGld~~~~~lEt~~~~~~~i~~~---~~~~~~l~~i~~a~~~Gi~v~sg~ 157 (279)
T PRK08508 101 SVEQLKELKKAGIFSYNHNLETSKEFFPKICTT---HTWEERFQTCENAKEAGLGLCSGG 157 (279)
T ss_pred CHHHHHHHHHcCCCEEcccccchHHHhcCCCCC---CCHHHHHHHHHHHHHcCCeeccee
Confidence 5899999999999999887632 3466652111 224445567778999999765444
No 227
>KOG0470 consensus 1,4-alpha-glucan branching enzyme/starch branching enzyme II [Carbohydrate transport and metabolism]
Probab=23.78 E-value=1e+02 Score=32.31 Aligned_cols=64 Identities=20% Similarity=0.430 Sum_probs=42.9
Q ss_pred ccccHHH-HHHHHHcCCCcceec-c-cc-Ccc-----------ccCC-CCCCC-hhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 78 YHKYKED-VKLMADTGLDAYRFS-I-SW-SRL-----------IPNG-RGPVN-PKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 78 y~~~~eD-i~l~k~lG~~~~R~s-i-~W-sRI-----------~P~g-~g~~n-~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
|..+.|+ +..+|+||.|++-+= | +- +.. -|.+ -|..+ +.-+.-.+.||+++.+.||+++++..
T Consensus 253 Y~~FteKvlphlK~LG~NaiqLmpi~Ef~~~~~s~GY~~~nFFapssrYgt~~s~~ri~efK~lVd~aHs~GI~VlLDVV 332 (757)
T KOG0470|consen 253 YLGFTEKVLPHLKKLGYNAIQLMPIFEFGHYYASWGYQVTNFFAPSSRYGTPESPCRINEFKELVDKAHSLGIEVLLDVV 332 (757)
T ss_pred hhhhhhhhhhHHHHhCccceEEeehhhhhhhhhccCcceeEeecccccccCCCcccchHHHHHHHHHHhhCCcEEehhhh
Confidence 8888887 999999999987631 1 11 111 2221 12222 11255689999999999999999876
Q ss_pred C
Q 024821 141 H 141 (262)
Q Consensus 141 H 141 (262)
|
T Consensus 333 ~ 333 (757)
T KOG0470|consen 333 H 333 (757)
T ss_pred h
Confidence 4
No 228
>PRK12755 phospho-2-dehydro-3-deoxyheptonate aldolase; Provisional
Probab=23.57 E-value=2.1e+02 Score=27.38 Aligned_cols=73 Identities=22% Similarity=0.330 Sum_probs=49.8
Q ss_pred ccccHHHHHHHHHcCCC-------cc-------eeccccCccc--cCCCCCCC-hhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 78 YHKYKEDVKLMADTGLD-------AY-------RFSISWSRLI--PNGRGPVN-PKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~-------~~-------R~si~WsRI~--P~g~g~~n-~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
...-.|=.+.+|+++-+ .+ |.+.+|.-+. |.-+|..| ++||...++++-++.+.|+...--+.
T Consensus 67 ~~~aleyA~~Lk~l~~~~~d~l~ivmR~y~eKPRT~~gwkGli~DP~ldgs~~i~~GL~~~R~ll~~~~e~Glp~atE~l 146 (353)
T PRK12755 67 PEAALEYARRLKALADELSDRLLIVMRVYFEKPRTTVGWKGLINDPHLDGSFDIEEGLRIARKLLLDLVELGLPLATEAL 146 (353)
T ss_pred HHHHHHHHHHHHHHHhhhhcceEEEEEeccccCCCCcCCcCCCCCccccccccHHHHHHHHHHHHHHHHHhCCCEEEEec
Confidence 33344445556666543 44 5888898874 54457777 68999999999889999987666565
Q ss_pred CCCCcHhHHH
Q 024821 141 HLDLPQALED 150 (262)
Q Consensus 141 H~~~P~~l~~ 150 (262)
.-..|+.+.+
T Consensus 147 d~~~~~y~~D 156 (353)
T PRK12755 147 DPISPQYLGD 156 (353)
T ss_pred CcccHHHHHh
Confidence 5556766654
No 229
>COG0138 PurH AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) [Nucleotide transport and metabolism]
Probab=23.18 E-value=93 Score=31.12 Aligned_cols=50 Identities=22% Similarity=0.433 Sum_probs=42.2
Q ss_pred cCCccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceee
Q 024821 73 VACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVT 138 (262)
Q Consensus 73 ~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vt 138 (262)
.|+|-+--+++-|+.+++.|++++ |+|. |.++. +++|+.+-++||.-+.|
T Consensus 459 ~ASDAFFPF~D~i~~aA~aGi~aI--------IqPG--GSirD------~eVI~aAde~giaMvfT 508 (515)
T COG0138 459 LASDAFFPFPDGIDAAAKAGIKAI--------IQPG--GSIRD------QEVIAAADEHGIAMVFT 508 (515)
T ss_pred EeecccCCCcchHHHHHHcCCeEE--------ECCC--Ccccc------HHHHHHHHhcCcEEEEc
Confidence 478888889999999999999987 8997 45542 67899999999998876
No 230
>PF02156 Glyco_hydro_26: Glycosyl hydrolase family 26; InterPro: IPR022790 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 26 GH26 from CAZY encompasses mainly mannan endo-1,4-beta-mannosidases (3.2.1.78 from EC). Mannan endo-1,4-beta-mannosidase hydrolyses mannan and galactomannan, but displays little activity towards other plant cell wall polysaccharides []. The enzyme randomly hydrolyses 1,4-beta-D-linkages in mannans, galacto-mannans, glucomannans and galactoglucomannans. This entry also incoporates the enzyme Endogluconase H 3.2.1.4 from EC catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans. ; GO: 0008810 cellulase activity, 0016985 mannan endo-1,4-beta-mannosidase activity, 0006080 substituted mannan metabolic process; PDB: 2QHA_A 3CBW_A 2WHK_A 2VI0_A 2BVD_A 2BV9_A 2CIT_A 2V3G_A 2CIP_A 2X2Y_B ....
Probab=22.70 E-value=1.6e+02 Score=27.47 Aligned_cols=52 Identities=15% Similarity=0.294 Sum_probs=34.2
Q ss_pred hhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCC--CCh---hhHHHHHHHHHHHHHHhCC
Q 024821 118 LQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGW--INR---MIVKDFTAYADVCFREFGD 177 (262)
Q Consensus 118 l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw--~~~---~~~~~F~~ya~~v~~~~gd 177 (262)
++..-+.+..|++.||.+++-..| +--|+| ... ..++.|.+..+.+++++.+
T Consensus 136 ld~iA~~l~~l~~~~vPVl~Rp~H--------E~nG~WfwWg~~~~~~~~~y~~lwr~~~~~l~~ 192 (311)
T PF02156_consen 136 LDRIADFLKQLKDAGVPVLFRPFH--------EMNGGWFWWGAKGHCTPEQYKALWRHMVDYLRN 192 (311)
T ss_dssp HHHHHHHHHHHHCTTS-EEEEEST--------STTSSSSTTSTTSTCHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHhhcCCCeEEEeehh--------hcCCCccccCCCCCCCHHHHHHHHHHHHHHHHh
Confidence 333444556677889888887777 222554 232 3589999999999998864
No 231
>COG3684 LacD Tagatose-1,6-bisphosphate aldolase [Carbohydrate transport and metabolism]
Probab=22.48 E-value=1.2e+02 Score=28.19 Aligned_cols=56 Identities=18% Similarity=0.220 Sum_probs=47.4
Q ss_pred HHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC
Q 024821 84 DVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD 143 (262)
Q Consensus 84 Di~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~ 143 (262)
+.+.+|+.|-+++.|=+-|.-=. + ++|+.-+++.+++..+|.+.+|--++-...+|
T Consensus 116 sa~riK~~G~~avK~Lvy~~~D~---~-e~neqk~a~ierigsec~aedi~f~lE~ltyd 171 (306)
T COG3684 116 SAKRIKEDGGDAVKFLVYYRSDE---D-EINEQKLAYIERIGSECHAEDLPFFLEPLTYD 171 (306)
T ss_pred CHHHHHHhcccceEEEEEEcCCc---h-HHhHHHHHHHHHHHHHhhhcCCceeEeeeecC
Confidence 45889999999999999886432 3 78999999999999999999998888776654
No 232
>cd06565 GH20_GcnA-like Glycosyl hydrolase family 20 (GH20) catalytic domain of N-acetyl-beta-D-glucosaminidase (GcnA, also known as BhsA) and related proteins. GcnA is an exoglucosidase which cleaves N-acetyl-beta-D-galactosamine (NAG) and N-acetyl-beta-D-galactosamine residues from 4-methylumbelliferylated (4MU) substrates, as well as cleaving NAG from chito-oligosaccharides (i.e. NAG polymers). In contrast, sulfated forms of the substrate are unable to be cleaved and act instead as mild competitive inhibitors. Additionally, the enzyme is known to be poisoned by several first-row transition metals as well as by mercury. GcnA forms a homodimer with subunits comprised of three domains, an N-terminal zincin-like domain, this central catalytic GH20 domain, and a C-terminal alpha helical domain. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself.
Probab=22.39 E-value=2.8e+02 Score=25.52 Aligned_cols=56 Identities=18% Similarity=0.272 Sum_probs=39.9
Q ss_pred cHHHHHHHHHcCCCcceecc----ccCccccC-C--CCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 81 YKEDVKLMADTGLDAYRFSI----SWSRLIPN-G--RGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si----~WsRI~P~-g--~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
.++=|++|+.+|+|.+-+=+ .++. .|. + .|.+.++ =++++++.++++||+++.-+-
T Consensus 19 lk~~id~ma~~k~N~l~lhl~D~f~~~~-~p~~~~~~~~yT~~---ei~ei~~yA~~~gI~vIPeid 81 (301)
T cd06565 19 LKKLLRLLALLGANGLLLYYEDTFPYEG-EPEVGRMRGAYTKE---EIREIDDYAAELGIEVIPLIQ 81 (301)
T ss_pred HHHHHHHHHHcCCCEEEEEEecceecCC-CcccccCCCCcCHH---HHHHHHHHHHHcCCEEEecCC
Confidence 67789999999999887644 2221 222 1 3455544 479999999999999998763
No 233
>PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed
Probab=22.09 E-value=98 Score=27.51 Aligned_cols=56 Identities=7% Similarity=0.089 Sum_probs=35.9
Q ss_pred cccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhh----hhHHHHHHHHHHHcCCcceeec
Q 024821 79 HKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKG----LQYYNNLINELISYGIQPHVTL 139 (262)
Q Consensus 79 ~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~----l~~y~~~id~l~~~GI~p~vtL 139 (262)
.+++.-|++.+++|.+.+++.- .-.+. +.-+++. ++..+++.+.+.++||...+=.
T Consensus 99 ~~~~~~i~~a~~lG~~~i~~~~---~~~~~--~~~~~~~~~~~~~~l~~l~~~A~~~GV~i~iE~ 158 (283)
T PRK13209 99 EIMRKAIQLAQDLGIRVIQLAG---YDVYY--EQANNETRRRFIDGLKESVELASRASVTLAFEI 158 (283)
T ss_pred HHHHHHHHHHHHcCCCEEEECC---ccccc--cccHHHHHHHHHHHHHHHHHHHHHhCCEEEEee
Confidence 3466778999999999998741 11111 1122333 4566778888888998666543
No 234
>COG5016 Pyruvate/oxaloacetate carboxyltransferase [Energy production and conversion]
Probab=22.08 E-value=3e+02 Score=27.16 Aligned_cols=51 Identities=24% Similarity=0.458 Sum_probs=37.3
Q ss_pred cccccHHHH-----HHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 77 EYHKYKEDV-----KLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 77 ~y~~~~eDi-----~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
.|.+|.+|+ ++..+-|++.||+ +-. .|. ++-.+.-|+..++.|-....++.
T Consensus 91 GYrhyaDDvVe~Fv~ka~~nGidvfRi-------FDA----lND--~RNl~~ai~a~kk~G~h~q~~i~ 146 (472)
T COG5016 91 GYRHYADDVVEKFVEKAAENGIDVFRI-------FDA----LND--VRNLKTAIKAAKKHGAHVQGTIS 146 (472)
T ss_pred cccCCchHHHHHHHHHHHhcCCcEEEe-------chh----ccc--hhHHHHHHHHHHhcCceeEEEEE
Confidence 577888884 7778899999985 322 221 44567788888899988888776
No 235
>COG0635 HemN Coproporphyrinogen III oxidase and related Fe-S oxidoreductases [Coenzyme metabolism]
Probab=22.03 E-value=2.9e+02 Score=26.83 Aligned_cols=101 Identities=15% Similarity=0.166 Sum_probs=65.3
Q ss_pred HHHHHHHHHcCCCcceeccccCccccCC---CCCCChhhhhHHHHHHHHHHHcCCcce-eeccCCCCcHhHHHhhCCCCC
Q 024821 82 KEDVKLMADTGLDAYRFSISWSRLIPNG---RGPVNPKGLQYYNNLINELISYGIQPH-VTLHHLDLPQALEDEYGGWIN 157 (262)
Q Consensus 82 ~eDi~l~k~lG~~~~R~si~WsRI~P~g---~g~~n~~~l~~y~~~id~l~~~GI~p~-vtL~H~~~P~~l~~~~gGw~~ 157 (262)
.|-++.+++.|+| |+|+-=..+-++- -|.... .+--...++.+++.|+.-+ ++|. |.+|.
T Consensus 137 ~e~~~~l~~~GvN--RiSlGVQsf~~~~lk~lgR~h~--~~~~~~a~~~~~~~g~~~in~DLI-yglP~----------- 200 (416)
T COG0635 137 AEKFKALKEAGVN--RISLGVQSFNDEVLKALGRIHD--EEEAKEAVELARKAGFTSINIDLI-YGLPG----------- 200 (416)
T ss_pred HHHHHHHHHcCCC--EEEeccccCCHHHHHHhcCCCC--HHHHHHHHHHHHHcCCCcEEEEee-cCCCC-----------
Confidence 4668999999999 8887543333321 133322 2345678888999998754 4554 35552
Q ss_pred hhhHHHHHHHHHHHHHHhCCCCceEEeccCCchhcccccccc
Q 024821 158 RMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGFAMVGYDFG 199 (262)
Q Consensus 158 ~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~~~~gy~~g 199 (262)
++.+.|.+=.+.+.+-=-|+|..+-..-||+.....-...|
T Consensus 201 -QT~~~~~~~l~~a~~l~pdhis~y~L~~~p~t~~~~~~~~~ 241 (416)
T COG0635 201 -QTLESLKEDLEQALELGPDHLSLYSLAIEPGTKFAQRKIKG 241 (416)
T ss_pred -CCHHHHHHHHHHHHhCCCCEEEEeeeecCCCchhhhhcccC
Confidence 34556666666666666688999999999998755444333
No 236
>PRK07106 5-aminoimidazole-4-carboxamide ribonucleotide transformylase; Provisional
Probab=21.98 E-value=95 Score=30.07 Aligned_cols=50 Identities=20% Similarity=0.478 Sum_probs=42.1
Q ss_pred cCCccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceee
Q 024821 73 VACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVT 138 (262)
Q Consensus 73 ~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vt 138 (262)
.|+|-+--+++-|+..++.|++++ |+|- |.++. +++|+.|.++||.-+.|
T Consensus 334 lASDAFFPF~D~Id~Aa~~GV~aI--------iQPG--GSiRD------~evI~aane~giaMvfT 383 (390)
T PRK07106 334 LGSDAFFPFGDNIERAAKSGVKYI--------AQPG--GSIRD------DNVIETCNKYGMTMAFT 383 (390)
T ss_pred EEecccCCCCchHHHHHHcCCEEE--------ECCC--CCCCc------HHHHHHHHHhCCEEEEC
Confidence 478888889999999999999986 8997 44542 67899999999988776
No 237
>PRK09432 metF 5,10-methylenetetrahydrofolate reductase; Provisional
Probab=21.64 E-value=3e+02 Score=25.42 Aligned_cols=74 Identities=15% Similarity=0.124 Sum_probs=44.1
Q ss_pred hHHHHHHHHHHHcCCc-cee----------------eccCCCCcHhHHHhhCCCCCh-h-hHHHHHHHHHHHHHHhCCC-
Q 024821 119 QYYNNLINELISYGIQ-PHV----------------TLHHLDLPQALEDEYGGWINR-M-IVKDFTAYADVCFREFGDR- 178 (262)
Q Consensus 119 ~~y~~~id~l~~~GI~-p~v----------------tL~H~~~P~~l~~~~gGw~~~-~-~~~~F~~ya~~v~~~~gd~- 178 (262)
+.|.++++.+++.||+ |++ .+.+-++|.|+.++.....+. + ..+.=.+||....+++.+.
T Consensus 189 ~~~~~f~~~~~~~Gi~vPIi~GI~pi~s~~~~~~~~~~~Gv~vP~~l~~~l~~~~d~~~~~~~~Gi~~a~e~i~~L~~~g 268 (296)
T PRK09432 189 ESYLRFRDRCVSAGIDVEIVPGILPVSNFKQLKKFADMTNVRIPAWMAKMFDGLDDDAETRKLVGASIAMDMVKILSREG 268 (296)
T ss_pred HHHHHHHHHHHHcCCCCCEEeeccccCCHHHHHHHHHccCCCCCHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHCC
Confidence 4567889999999865 221 234778899998876665432 2 1222234454444444332
Q ss_pred C--ceEEeccCCchhc
Q 024821 179 V--SYWTTVNEPNGFA 192 (262)
Q Consensus 179 V--~~w~t~NEP~~~~ 192 (262)
| -|..|+|-+....
T Consensus 269 v~GvH~yt~n~~~~~~ 284 (296)
T PRK09432 269 VKDFHFYTLNRAELTY 284 (296)
T ss_pred CCEEEEecCCChHHHH
Confidence 3 2455899888653
No 238
>TIGR02026 BchE magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase. This model respresents the cobalamin-dependent oxidative cyclase responsible for forming the distinctive E-ring of the chlorin ring system under anaerobic conditions. This step is essential in the biosynthesis of both bacteriochlorophyll and chlorophyll under anaerobic conditions (a separate enzyme, AcsF, acts under aerobic conditions). This model identifies two clades of sequences, one from photosynthetic, non-cyanobacterial bacteria and another including Synechocystis and several non-photosynthetic bacteria. The function of the Synechocystis gene is supported by gene clustering with other photosynthetic genes, so the purpose of the gene in the non-photosynthetic bacteria is uncertain. Note that homologs of this gene are not found in plants which rely solely on the aerobic cyclase.
Probab=21.49 E-value=1.4e+02 Score=29.41 Aligned_cols=61 Identities=11% Similarity=0.035 Sum_probs=39.0
Q ss_pred cHHHHHHHHHcCCCcceeccc-c-CccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCc
Q 024821 81 YKEDVKLMADTGLDAYRFSIS-W-SRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLP 145 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~-W-sRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P 145 (262)
-++-+++|++.|++.+-++++ = .++...-....+ .+-..+.|+.++++||.+.+.+. +++|
T Consensus 286 d~ell~~l~~aG~~~v~iGiES~~~~~L~~~~K~~t---~~~~~~ai~~l~~~Gi~~~~~~I-~G~P 348 (497)
T TIGR02026 286 DADILHLYRRAGLVHISLGTEAAAQATLDHFRKGTT---TSTNKEAIRLLRQHNILSEAQFI-TGFE 348 (497)
T ss_pred CHHHHHHHHHhCCcEEEEccccCCHHHHHHhcCCCC---HHHHHHHHHHHHHCCCcEEEEEE-EECC
Confidence 355678999999987777662 2 223222112233 45578899999999999877654 2444
No 239
>TIGR02351 thiH thiazole biosynthesis protein ThiH. Members this protein family are the ThiH protein of thiamine biosynthesis, a homolog of the BioB protein of biotin biosynthesis. Genes for the this protein generally are found in operons with other thiamin biosynthesis genes.
Probab=21.37 E-value=5e+02 Score=24.48 Aligned_cols=101 Identities=14% Similarity=0.059 Sum_probs=56.0
Q ss_pred cccccHHHHHHHHHcCCCcceecccc--CccccCC-C-CCCChhhhhHHHHHHHHHHHcCCcc-eeeccCCCCcHhHHHh
Q 024821 77 EYHKYKEDVKLMADTGLDAYRFSISW--SRLIPNG-R-GPVNPKGLQYYNNLINELISYGIQP-HVTLHHLDLPQALEDE 151 (262)
Q Consensus 77 ~y~~~~eDi~l~k~lG~~~~R~si~W--sRI~P~g-~-g~~n~~~l~~y~~~id~l~~~GI~p-~vtL~H~~~P~~l~~~ 151 (262)
-...-.++++.++++|++.+.++++= ...++.- + |. ...++.--+.++.+++.||+. -+.+. +++|.|..+.
T Consensus 157 i~~lt~e~~~~Lk~aGv~r~~i~lET~~~~~y~~i~~~g~--~h~~~~rl~~i~~a~~aG~~~v~~g~i-~Gl~e~~~d~ 233 (366)
T TIGR02351 157 VQPLNEEEYKKLVEAGLDGVTVYQETYNEKKYKKHHLAGK--KKDFRYRLNTPERAAKAGMRKIGIGAL-LGLDDWRTDA 233 (366)
T ss_pred cccCCHHHHHHHHHcCCCEEEEEeecCCHHHHHhcCcCCC--CCCHHHHHHHHHHHHHcCCCeeceeEE-EeCchhHHHH
Confidence 33467899999999999977777632 2233320 0 11 112444556788999999983 23222 2455433221
Q ss_pred hCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCchh
Q 024821 152 YGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPNGF 191 (262)
Q Consensus 152 ~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~~~ 191 (262)
-..+.-.+.+-.+|.+ ....+.+|+-+-.
T Consensus 234 ----------~~~a~~l~~L~~~~~~-~~~sv~~~~l~P~ 262 (366)
T TIGR02351 234 ----------FFTAYHLRYLQKKYWK-TEISISVPRLRPC 262 (366)
T ss_pred ----------HHHHHHHHHHHHHcCC-CCccccccccccC
Confidence 2233344444455543 3345788875533
No 240
>PF02055 Glyco_hydro_30: O-Glycosyl hydrolase family 30; InterPro: IPR001139 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 30 GH30 from CAZY comprises enzymes with only one known activity; glucosylceramidase (3.2.1.45 from EC). Family 30 encompasses the mammalian glucosylceramidases. Human acid beta-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase), cleaves the glucosidic bonds of glucosylceramide and synthetic beta-glucosides []. Any one of over 50 different mutations in the gene of glucocerebrosidase have been found to affect activity of this hydrolase, producing variants of Gaucher disease, the most prevalent lysosomal storage disease [, ].; GO: 0004348 glucosylceramidase activity, 0006665 sphingolipid metabolic process, 0007040 lysosome organization, 0005764 lysosome; PDB: 2VT0_B 1NOF_A 2Y24_A 2WCG_B 2J25_A 3GXM_D 1Y7V_B 2NT0_C 3GXF_C 3GXD_A ....
Probab=21.10 E-value=1.1e+02 Score=30.57 Aligned_cols=101 Identities=18% Similarity=0.318 Sum_probs=48.2
Q ss_pred HHcCCCcceeccccCcccc------CCCCCCChhh-------hhHHHHHHHHHHHcC--CcceeeccCCCCcHhHHHh--
Q 024821 89 ADTGLDAYRFSISWSRLIP------NGRGPVNPKG-------LQYYNNLINELISYG--IQPHVTLHHLDLPQALEDE-- 151 (262)
Q Consensus 89 k~lG~~~~R~si~WsRI~P------~g~g~~n~~~-------l~~y~~~id~l~~~G--I~p~vtL~H~~~P~~l~~~-- 151 (262)
+.+|++..|..|.=+..-. +.++.++.+. .++--.+|.++++.+ |+.+.+ =|.-|.|+-..
T Consensus 110 ~G~g~s~~R~pIgssDfs~~~Yty~d~~~D~~l~~Fs~~~~d~~~~ip~ik~a~~~~~~lki~aS--pWSpP~WMKtn~~ 187 (496)
T PF02055_consen 110 DGIGYSLLRVPIGSSDFSTRPYTYDDVPGDFNLSNFSIAREDKKYKIPLIKEALAINPNLKIFAS--PWSPPAWMKTNGS 187 (496)
T ss_dssp TTT---EEEEEES--SSSSS---ST-STTHTTTTT---HHHHHTTHHHHHHHHHHHHTT-EEEEE--ES---GGGBTTSS
T ss_pred CCceEEEEEeeccCcCCcCCcccccCCCCCCccccCCccccchhhHHHHHHHHHHhCCCcEEEEe--cCCCCHHHccCCc
Confidence 4589999998884322211 1122222221 222224666666543 444433 47899998432
Q ss_pred h--CCCCC----hhhHHHHHHHHHHHHHHhCCC-CceEE--eccCCchh
Q 024821 152 Y--GGWIN----RMIVKDFTAYADVCFREFGDR-VSYWT--TVNEPNGF 191 (262)
Q Consensus 152 ~--gGw~~----~~~~~~F~~ya~~v~~~~gd~-V~~w~--t~NEP~~~ 191 (262)
. +|... ++..+.|++|--+.++.|+.. |..|- +-|||...
T Consensus 188 ~~g~g~l~g~~~~~y~~~yA~Y~vkfi~aY~~~GI~i~aiT~QNEP~~~ 236 (496)
T PF02055_consen 188 MNGGGSLKGSLGDEYYQAYADYFVKFIQAYKKEGIPIWAITPQNEPDNG 236 (496)
T ss_dssp SCSS-BBSCGTTSHHHHHHHHHHHHHHHHHHCTT--ESEEESSSSCCGG
T ss_pred CcCCCccCCCCCchhHHHHHHHHHHHHHHHHHCCCCeEEEeccCCCCCC
Confidence 1 24443 345566777777777777754 55554 88999864
No 241
>COG1453 Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]
Probab=21.04 E-value=2.3e+02 Score=27.49 Aligned_cols=87 Identities=20% Similarity=0.293 Sum_probs=59.9
Q ss_pred HHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC--CCcH-----hHHH----------
Q 024821 88 MADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL--DLPQ-----ALED---------- 150 (262)
Q Consensus 88 ~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~--~~P~-----~l~~---------- 150 (262)
|.=+|+-+.|+.+.|. +.+|+ +...++|+...++||.=+=|-+-| ..-. .|.+
T Consensus 13 ~s~lgfG~MRlp~~~~-------~~id~---~~~~~~i~~aie~GiNyidTA~~Yh~g~sE~~lgkaL~~~~Rekv~LaT 82 (391)
T COG1453 13 LSILGFGCMRLPLKEQ-------GSIDE---ENANETIDYAIEHGINYIDTAWPYHGGESEEFLGKALKDGYREKVKLAT 82 (391)
T ss_pred cceeccceeecccccC-------CCccH---HHHHHHHHHHHHcCCceEeecccccCCCchHHHHHHhhhcccceEEEEe
Confidence 3448999999999987 56775 456899999999999877776433 2211 2222
Q ss_pred hhCCCCChhhHHHHHHHHHHHHHHhC-CCCceEEec
Q 024821 151 EYGGWINRMIVKDFTAYADVCFREFG-DRVSYWTTV 185 (262)
Q Consensus 151 ~~gGw~~~~~~~~F~~ya~~v~~~~g-d~V~~w~t~ 185 (262)
+..-|.- +..+.|.++-..=.++++ |.++++.+-
T Consensus 83 Klp~~~~-~~~edm~r~fneqLekl~~Dy~D~yliH 117 (391)
T COG1453 83 KLPSWPV-KDREDMERIFNEQLEKLGTDYIDYYLIH 117 (391)
T ss_pred ecCCccc-cCHHHHHHHHHHHHHHhCCchhhhhhhc
Confidence 1223332 345888898888899998 578888743
No 242
>TIGR01856 hisJ_fam histidinol phosphate phosphatase HisJ family. This model represents the histidinol phosphate phosphatase HisJ of Bacillus subtilis, and related proteins from a number of species within a larger family of phosphatases in the PHP hydrolase family. HisJ catalyzes the penultimate step of histidine biosynthesis but shows no homology to the functionally equivalent sequence in E. coli, a domain of the bifunctional HisB protein. Note, however, that many species have two members and that Clostridium perfringens, predicted not to make histidine, has five members of this family; this family is designated subfamily rather than equivalog to indicate that members may not all act as HisJ.
Probab=20.98 E-value=2.6e+02 Score=24.79 Aligned_cols=61 Identities=13% Similarity=0.225 Sum_probs=37.9
Q ss_pred hHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCC---CChhhHHHHHHHHHHHHHHhCCCCce
Q 024821 119 QYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGW---INRMIVKDFTAYADVCFREFGDRVSY 181 (262)
Q Consensus 119 ~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw---~~~~~~~~F~~ya~~v~~~~gd~V~~ 181 (262)
...+++++.+.++|++-+.-.-|..+|..... ..| .+.+..+.|.+-++.+-++|+++++.
T Consensus 15 ~~~ee~v~~A~~~Gl~~i~~TdH~p~~~~~~~--~~~~~~~~~~~~~~Y~~~i~~l~~~y~~~i~I 78 (253)
T TIGR01856 15 DTLEEVVQEAIQLGFEEICFTEHAPLPFEYPE--ETALDKMAFSSLPEYFKEINRLKKEYADKLKI 78 (253)
T ss_pred CCHHHHHHHHHHcCCCEEEecCCCCcccCCCc--cccccchhHHHHHHHHHHHHHHHHHhhCCCeE
Confidence 34689999999999999998889665422211 112 11223344444455567788876644
No 243
>PF03932 CutC: CutC family; InterPro: IPR005627 Copper transport in Escherichia coli is mediated by the products of at least six genes, cutA, cutB, cutC, cutD, cutE, and cutF. A mutation in one or more of these genes results in an increased copper sensitivity. Members of this family are between 200 and 300 amino acids in length and are found in both eukaryotes and bacteria.; GO: 0005507 copper ion binding, 0055070 copper ion homeostasis; PDB: 2BDQ_A 3IWP_I 1X8C_B 1X7I_A 1TWD_B.
Probab=20.81 E-value=1.4e+02 Score=26.15 Aligned_cols=41 Identities=22% Similarity=0.437 Sum_probs=27.0
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHH
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELI 129 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~ 129 (262)
...+||+.++++|++.+=|+.. -| +|.+|. +..+++++.+.
T Consensus 73 ~M~~dI~~~~~~GadG~VfG~L----~~--dg~iD~---~~~~~Li~~a~ 113 (201)
T PF03932_consen 73 IMKEDIRMLRELGADGFVFGAL----TE--DGEIDE---EALEELIEAAG 113 (201)
T ss_dssp HHHHHHHHHHHTT-SEEEE--B----ET--TSSB-H---HHHHHHHHHHT
T ss_pred HHHHHHHHHHHcCCCeeEEEeE----CC--CCCcCH---HHHHHHHHhcC
Confidence 4678999999999999988862 22 478994 44566666654
No 244
>cd00950 DHDPS Dihydrodipicolinate synthase (DHDPS) is a key enzyme in lysine biosynthesis. It catalyzes the aldol condensation of L-aspartate-beta- semialdehyde and pyruvate to dihydropicolinic acid via a Schiff base formation between pyruvate and a lysine residue. The functional enzyme is a homotetramer consisting of a dimer of dimers. DHDPS is member of dihydrodipicolinate synthase family that comprises several pyruvate-dependent class I aldolases that use the same catalytic step to catalyze different reactions in different pathways.
Probab=20.77 E-value=4.7e+02 Score=23.33 Aligned_cols=67 Identities=18% Similarity=0.173 Sum_probs=46.7
Q ss_pred CCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCCc
Q 024821 110 RGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEPN 189 (262)
Q Consensus 110 ~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP~ 189 (262)
+|++|.++ +++.++.+.+.|+..++.+-+- |-. ..-+.+...+..+.+.++-++++....-...++
T Consensus 15 dg~iD~~~---~~~~i~~l~~~Gv~gl~v~Gst----------GE~-~~lt~~Er~~l~~~~~~~~~~~~~vi~gv~~~~ 80 (284)
T cd00950 15 DGSVDFDA---LERLIEFQIENGTDGLVVCGTT----------GES-PTLSDEEHEAVIEAVVEAVNGRVPVIAGTGSNN 80 (284)
T ss_pred CCCcCHHH---HHHHHHHHHHcCCCEEEECCCC----------cch-hhCCHHHHHHHHHHHHHHhCCCCcEEeccCCcc
Confidence 58899655 7889999999999998877531 111 112346677777778888888887777666554
Q ss_pred h
Q 024821 190 G 190 (262)
Q Consensus 190 ~ 190 (262)
.
T Consensus 81 ~ 81 (284)
T cd00950 81 T 81 (284)
T ss_pred H
Confidence 3
No 245
>PF06415 iPGM_N: BPG-independent PGAM N-terminus (iPGM_N); InterPro: IPR011258 This family represents the N-terminal region of the 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (or phosphoglyceromutase or BPG-independent PGAM) protein (5.4.2.1 from EC). The family is found in conjunction with Metalloenzyme (located in the C-terminal region of the protein). ; GO: 0004619 phosphoglycerate mutase activity, 0030145 manganese ion binding, 0006007 glucose catabolic process, 0005737 cytoplasm; PDB: 1EQJ_A 1EJJ_A 1O99_A 1O98_A 3IGZ_B 3IGY_B 3NVL_A 2IFY_A.
Probab=20.76 E-value=2.2e+02 Score=25.38 Aligned_cols=66 Identities=20% Similarity=0.306 Sum_probs=47.5
Q ss_pred CCCCcCCccccccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 69 GTGDVACDEYHKYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 69 ~~~~~a~d~y~~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
++|.+.+ |..+...=++++++.|++-+.+- ++-+|+...-.+++.+.+++.+.|.+.|+-.+.|+.
T Consensus 37 SdGGVHS-h~~Hl~al~~~a~~~gv~~V~vH-----~f~DGRDt~P~S~~~yl~~l~~~l~~~~~g~IAsv~ 102 (223)
T PF06415_consen 37 SDGGVHS-HIDHLFALIKLAKKQGVKKVYVH-----AFTDGRDTPPKSALKYLEELEEKLAEIGIGRIASVS 102 (223)
T ss_dssp SS-SSS---HHHHHHHHHHHHHTT-SEEEEE-----EEE-SSSS-TTTHHHHHHHHHHHHHHHTCTEEEEEE
T ss_pred cCCCccc-cHHHHHHHHHHHHHcCCCEEEEE-----EecCCCCCCcchHHHHHHHHHHHHHhhCCceEEEEe
Confidence 5666544 56667778899999999877653 344666566668999999999999999998888886
No 246
>cd01990 Alpha_ANH_like_I This is a subfamily of Adenine nucleotide alpha hydrolases superfamily. Adenine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus.
Probab=20.62 E-value=2.9e+02 Score=23.17 Aligned_cols=57 Identities=12% Similarity=0.059 Sum_probs=36.5
Q ss_pred HHHHcCCCcceeccc---cCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCC
Q 024821 87 LMADTGLDAYRFSIS---WSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLD 143 (262)
Q Consensus 87 l~k~lG~~~~R~si~---WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~ 143 (262)
+.+++|++.+.+.+. .+.+.-..+.......-..|+.+.+.+.+.|+..++|=+|.|
T Consensus 47 ~a~~lgi~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~l~~~a~~~g~~~I~~G~~~d 106 (202)
T cd01990 47 LAKEIGIRHEVIETDELDDPEFAKNPPDRCYLCKKALYEALKEIAEELGLDVVLDGTNAD 106 (202)
T ss_pred HHHHcCCcEEEEeCCccccHHHhcCCCCccchhHHHHHHHHHHHHHHCCCCEEEEcCccc
Confidence 337899999888876 222221111111222344688888889999999999888754
No 247
>PF14417 MEDS: MEDS: MEthanogen/methylotroph, DcmR Sensory domain
Probab=20.58 E-value=2.2e+02 Score=24.07 Aligned_cols=58 Identities=21% Similarity=0.324 Sum_probs=45.7
Q ss_pred ccccHHHHHHHHHcCCCcceecc--ccCccccCCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCCCC
Q 024821 78 YHKYKEDVKLMADTGLDAYRFSI--SWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHLDL 144 (262)
Q Consensus 78 y~~~~eDi~l~k~lG~~~~R~si--~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~~~ 144 (262)
...+.+-++...+.|.+.+|..- .|.. .. +.+.+..|...+|.+.+.. |+..|+.||.
T Consensus 104 i~~~~~~~~~a~~~G~~~lRv~ge~~w~~-----~~--~~~~l~~yE~~ln~~~~~~--~~~~lC~Yd~ 163 (191)
T PF14417_consen 104 IAFWRAALEQALAEGYRGLRVIGEMTWAL-----RS--GWEELLRYEALLNRLFAEH--PFTALCAYDR 163 (191)
T ss_pred HHHHHHHHHHHHhCCCCcEEEEEechhhc-----cc--cHHHHHHHHHHHHHHhcCC--CEEEEeccch
Confidence 44566667788888999999986 4542 22 6778999999999999887 9998988774
No 248
>PF06777 DUF1227: Protein of unknown function (DUF1227); InterPro: IPR010643 This domain represents a conserved region within a number of eukaryotic DNA repair helicases.; GO: 0005634 nucleus
Probab=20.46 E-value=2e+02 Score=24.00 Aligned_cols=68 Identities=21% Similarity=0.259 Sum_probs=46.7
Q ss_pred hHHHHHHHHHHHcCCcce--eeccCCCCcHhHHHhh-CCCCChhhHHHHHHHHHHHHHHhCCCCceEEeccCC
Q 024821 119 QYYNNLINELISYGIQPH--VTLHHLDLPQALEDEY-GGWINRMIVKDFTAYADVCFREFGDRVSYWTTVNEP 188 (262)
Q Consensus 119 ~~y~~~id~l~~~GI~p~--vtL~H~~~P~~l~~~~-gGw~~~~~~~~F~~ya~~v~~~~gd~V~~w~t~NEP 188 (262)
+=|+++++.|+..++..- ..+.+--+|.-+.++. .| |=...++|..|-+.+++-++.+++--.+..|.
T Consensus 16 ~EY~rLV~GL~~~~~~~~~d~~~~npvLp~dil~eaVPG--nIR~AeHFv~flkR~veylk~rlrv~~v~~e~ 86 (146)
T PF06777_consen 16 DEYDRLVEGLREAEIARETDEILANPVLPDDILKEAVPG--NIRRAEHFVAFLKRFVEYLKTRLRVQHVISES 86 (146)
T ss_pred HHHHHHHHHHHHhccccccchhhcCCCCchhhhhhcCCc--hHHhHHHHHHHHHHHHHHHHHHhhhcceeecC
Confidence 459999999999986532 2234445666554331 22 22345899999999999999988777777773
No 249
>cd04501 SGNH_hydrolase_like_4 Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad from other serine hydrolases, but may lack the carboxlic acid.
Probab=20.33 E-value=2.6e+02 Score=22.66 Aligned_cols=59 Identities=14% Similarity=0.014 Sum_probs=38.0
Q ss_pred hhhhhHHHHHHHHHHHcCCcceeeccCCCCcHhHHH-hhCCCCChhhHHHHHHHHHHHHHHhC
Q 024821 115 PKGLQYYNNLINELISYGIQPHVTLHHLDLPQALED-EYGGWINRMIVKDFTAYADVCFREFG 176 (262)
Q Consensus 115 ~~~l~~y~~~id~l~~~GI~p~vtL~H~~~P~~l~~-~~gGw~~~~~~~~F~~ya~~v~~~~g 176 (262)
++-.+.++++|+.+++.|.++++... .|.--.. ........+.+..|.+..+.++++.+
T Consensus 79 ~~~~~~~~~li~~~~~~~~~~il~~~---~p~~~~~~~~~~~~~~~~~~~~n~~~~~~a~~~~ 138 (183)
T cd04501 79 EMIKDNIRSMVELAEANGIKVILASP---LPVDDYPWKPQWLRPANKLKSLNRWLKDYARENG 138 (183)
T ss_pred HHHHHHHHHHHHHHHHCCCcEEEEeC---CCcCccccchhhcchHHHHHHHHHHHHHHHHHcC
Confidence 45678899999999999999887642 2211000 00002234567788888888888864
No 250
>cd06522 GH25_AtlA-like AtlA is an autolysin found in Gram-positive lactic acid bacteria that degrades bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues. This family includes the AtlA and Aml autolysins from Streptococcus mutans which have a C-terminal glycosyl hydrolase family 25 (GH25) catalytic domain as well as six tandem N-terminal repeats of the GBS (group B Streptococcus) Bsp-like peptidoglycan-binding domain. Other members of this family have one or more C-terminal peptidoglycan-binding domain(s) (SH3 or LysM) in addition to the GH25 domain.
Probab=20.31 E-value=1.8e+02 Score=24.72 Aligned_cols=15 Identities=13% Similarity=0.144 Sum_probs=8.8
Q ss_pred ccHHHHHHHHHcCCC
Q 024821 80 KYKEDVKLMADTGLD 94 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~ 94 (262)
.++..++.+++.|+.
T Consensus 43 ~~~~n~~~A~~aGl~ 57 (192)
T cd06522 43 YAASQIANAKAAGLK 57 (192)
T ss_pred HHHHHHHHHHHCCCe
Confidence 455566666666653
No 251
>PLN03153 hypothetical protein; Provisional
Probab=20.29 E-value=1.1e+02 Score=30.78 Aligned_cols=38 Identities=32% Similarity=0.402 Sum_probs=22.8
Q ss_pred HHHcC-CcceeeccCCCCcHhHHHhhCCCCChhhHHHHHHHH
Q 024821 128 LISYG-IQPHVTLHHLDLPQALEDEYGGWINRMIVKDFTAYA 168 (262)
Q Consensus 128 l~~~G-I~p~vtL~H~~~P~~l~~~~gGw~~~~~~~~F~~ya 168 (262)
+.+.| +.|+|+||||+.=. --|.+-...+.++.+..-|
T Consensus 327 ~les~p~~P~vSlHH~~~~~---p~fP~~~~~~~~~~l~~a~ 365 (537)
T PLN03153 327 LLSSHPIAPFVSIHHVEAVD---PFYPGLSSLDSLKLFTRAM 365 (537)
T ss_pred HhhcCCCCCceeeeeccccc---cccCCcchHHHHHHHHHHh
Confidence 44454 99999999998611 1123334445666666544
No 252
>cd04733 OYE_like_2_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 2. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase.
Probab=20.22 E-value=2.7e+02 Score=25.92 Aligned_cols=40 Identities=13% Similarity=0.051 Sum_probs=33.1
Q ss_pred Ccccc---CCCCCCChhhhhHHHHHHHHHHHcCCcceeeccCC
Q 024821 103 SRLIP---NGRGPVNPKGLQYYNNLINELISYGIQPHVTLHHL 142 (262)
Q Consensus 103 sRI~P---~g~g~~n~~~l~~y~~~id~l~~~GI~p~vtL~H~ 142 (262)
++..| ...+-+|.+-+..++++.+.++++|-..++=|+|-
T Consensus 63 ~~~~~~~~~~~~~~~d~~i~~~~~l~~~vh~~G~~~~~Ql~h~ 105 (338)
T cd04733 63 HLEEPGIIGNVVLESGEDLEAFREWAAAAKANGALIWAQLNHP 105 (338)
T ss_pred cccCCCcCCCcccCCHHHHHHHHHHHHHHHhcCCEEEEEccCC
Confidence 45566 33466788899999999999999999999999983
No 253
>cd06562 GH20_HexA_HexB-like Beta-N-acetylhexosaminidases catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. The hexA and hexB genes encode the alpha- and beta-subunits of the two major beta-N-acetylhexosaminidase isoenzymes, N-acetyl-beta-D-hexosaminidase A (HexA) and beta-N-acetylhexosaminidase B (HexB). Both the alpha and the beta catalytic subunits have a TIM-barrel fold and belong to the glycosyl hydrolase family 20 (GH20). The HexA enzyme is a heterodimer containing one alpha and one beta subunit while the HexB enzyme is a homodimer containing two beta-subunits. Hexosaminidase mutations cause an inability to properly hydrolyze certain sphingolipids which accumulate in lysosomes within the brain, resulting in the lipid storage disorders Tay-Sachs and Sandhoff. Mutations in the alpha subunit cause in a deficiency in the HexA enzyme and result in
Probab=20.20 E-value=2e+02 Score=27.08 Aligned_cols=63 Identities=22% Similarity=0.291 Sum_probs=40.8
Q ss_pred cHHHHHHHHHcCCCcceeccccC---ccc----cCC--CCCCChh---hhhHHHHHHHHHHHcCCcceeeccCCCCcH
Q 024821 81 YKEDVKLMADTGLDAYRFSISWS---RLI----PNG--RGPVNPK---GLQYYNNLINELISYGIQPHVTLHHLDLPQ 146 (262)
Q Consensus 81 ~~eDi~l~k~lG~~~~R~si~Ws---RI~----P~g--~g~~n~~---~l~~y~~~id~l~~~GI~p~vtL~H~~~P~ 146 (262)
.++-|+.|+..++|.+-+-++=. ||+ |+- .|..+.. .-+=++++|+-++++||+++.-+ |+|.
T Consensus 20 ik~~Id~ma~~KlN~lh~HltDd~~~rle~~~~P~Lt~~ga~~~~~~YT~~di~eiv~yA~~rgI~vIPEI---D~PG 94 (348)
T cd06562 20 IKRTIDAMAYNKLNVLHWHITDSQSFPLESPSYPELSKKGAYSPSEVYTPEDVKEIVEYARLRGIRVIPEI---DTPG 94 (348)
T ss_pred HHHHHHHHHHhCCcEEEEeEEcCCCceEeeCCCchhhhccCcCCCceECHHHHHHHHHHHHHcCCEEEEec---cCch
Confidence 55668999999999887655311 222 210 1211111 13457999999999999999877 5665
No 254
>PRK09856 fructoselysine 3-epimerase; Provisional
Probab=20.12 E-value=3.1e+02 Score=24.09 Aligned_cols=52 Identities=6% Similarity=-0.002 Sum_probs=34.3
Q ss_pred ccHHHHHHHHHcCCCcceeccccCccccCCCCCCChhhhhHHHHHHHHHHHcCCccee
Q 024821 80 KYKEDVKLMADTGLDAYRFSISWSRLIPNGRGPVNPKGLQYYNNLINELISYGIQPHV 137 (262)
Q Consensus 80 ~~~eDi~l~k~lG~~~~R~si~WsRI~P~g~g~~n~~~l~~y~~~id~l~~~GI~p~v 137 (262)
..++.+++++++|++.+=+........+. .++. ..-+++-+.+.++||++..
T Consensus 14 ~l~~~l~~~~~~G~~~vEl~~~~~~~~~~---~~~~---~~~~~l~~~~~~~gl~v~s 65 (275)
T PRK09856 14 PIEHAFRDASELGYDGIEIWGGRPHAFAP---DLKA---GGIKQIKALAQTYQMPIIG 65 (275)
T ss_pred CHHHHHHHHHHcCCCEEEEccCCcccccc---ccCc---hHHHHHHHHHHHcCCeEEE
Confidence 47899999999999998874332222211 1222 2346677788899998653
No 255
>PF03129 HGTP_anticodon: Anticodon binding domain; InterPro: IPR004154 tRNA synthetases, or tRNA ligases are involved in protein synthesis. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases [] it is probably the anticodon binding domain [].; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding; PDB: 1KOG_B 1EVL_D 1EVK_B 1QF6_A 1FYF_B 2I4O_A 2I4M_A 2I4N_A 2I4L_A 1HC7_D ....
Probab=20.09 E-value=1.2e+02 Score=22.17 Aligned_cols=27 Identities=22% Similarity=0.324 Sum_probs=23.8
Q ss_pred ChhhhhHHHHHHHHHHHcCCcceeecc
Q 024821 114 NPKGLQYYNNLINELISYGIQPHVTLH 140 (262)
Q Consensus 114 n~~~l~~y~~~id~l~~~GI~p~vtL~ 140 (262)
+++..++..++.+.|+++||...+...
T Consensus 11 ~~~~~~~a~~l~~~L~~~gi~v~~d~~ 37 (94)
T PF03129_consen 11 DEEIIEYAQELANKLRKAGIRVELDDS 37 (94)
T ss_dssp HHHHHHHHHHHHHHHHHTTSEEEEESS
T ss_pred cHHHHHHHHHHHHHHHHCCCEEEEECC
Confidence 667889999999999999998888763
Done!